The AI discussion thread

Page 67

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
51,670
7,288
136
1) anything that's beginner friendly and/or step-by-step friendly.
2) should I register kaido.io and put this entire thread on it?

Twitter has the best gif summaries tbh!

Here are the basics:

1. All databases are just big spreadsheets.

2. A query is when you ask the database (spreadsheet) for something. In Excel, you can hit CTRL + F to do a find query, type in a formula to do a math query, or draw a bar chart to do a data query.

3. The interface is what you use (command line, GUI, etc.)
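
To make those three parts concrete, here's a minimal Python sketch. It assumes a made-up three-row "spreadsheet" of cookie recipes; the names `ROWS`, `find`, `total`, and `ask` are hypothetical and only there for illustration, not taken from any real tool.

```python
# Data: a tiny "spreadsheet", one dict per row (made-up sample values).
ROWS = [
    {"recipe": "chocolate-chip cookies", "flour_g": 280, "gluten_free": False},
    {"recipe": "oatmeal raisin cookies", "flour_g": 190, "gluten_free": False},
    {"recipe": "almond-flour cookies", "flour_g": 240, "gluten_free": True},
]


def find(rows, text):
    """Find query: like CTRL + F, return rows whose recipe name contains text."""
    return [row for row in rows if text.lower() in row["recipe"].lower()]


def total(rows, column):
    """Math query: like a SUM formula over one column."""
    return sum(row[column] for row in rows)


def ask(command):
    """Interface: a bare-bones command line sitting on top of the two queries."""
    verb, _, arg = command.partition(" ")
    if verb == "find":
        return [row["recipe"] for row in find(ROWS, arg)]
    if verb == "sum":
        return total(ROWS, arg)
    raise ValueError(f"unknown command: {command!r}")


if __name__ == "__main__":
    print(ask("find raisin"))
    print(ask("sum flour_g"))
```

Swapping `ask` for a GUI or a voice front end would change the interface without touching the data or the queries, which is the whole point of splitting things up this way.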

As far as AI goes:

1. They vacuum up allll of the data on the Internet to create a JUMBO spreadsheet (big ol' database)

2. They have fancy queries called "machine learning" (ML). When you train an ML model on giant piles of text, you get a Large Language Model (LLM). A Generative Pre-trained Transformer (GPT) is one kind of LLM: it's pre-trained on stuff like human text, then generates new text from what it learned. So basically, a fancy query! Or rather, modern Microsoft Clippy, haha!

3. The interface can have different modes. Text input is a mode (as well as PDF upload - still text! or programming code text!), as is speaking (speech to text), using an image, using a video, getting data from a database, or getting live data from the Internet or devices. Plus you can now output in each of those modes! So that's called "multi-modal input/output".
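
Here's a rough Python sketch of that idea, assuming each input mode gets turned into text before the "query" step and the answer gets rendered into whatever output mode you picked. Every function here (`speech_to_text`, `image_to_text`, `model`, `run`) is a hypothetical stand-in, not a real library call, and plenty of real systems handle images or audio directly instead of converting everything to text first.

```python
def speech_to_text(audio):
    # Stand-in for a real speech recognizer: the "audio" already carries a transcript.
    return audio["transcript"]


def image_to_text(image):
    # Stand-in for a real image model: the "image" already carries a caption.
    return image["caption"]


# Input modes: each one turns its payload into plain text.
INPUT_MODES = {
    "text": lambda payload: payload,
    "speech": speech_to_text,
    "image": image_to_text,
}

# Output modes: each one renders the text answer in a different form.
OUTPUT_MODES = {
    "text": lambda answer: answer,
    "speech": lambda answer: {"spoken": answer},
}


def model(prompt):
    # Placeholder for the "fancy query" step; a real system would call an LLM here.
    return f"You asked about: {prompt}"


def run(mode_in, payload, mode_out):
    """Input mode -> text -> model -> chosen output mode."""
    prompt = INPUT_MODES[mode_in](payload)
    answer = model(prompt)
    return OUTPUT_MODES[mode_out](answer)


if __name__ == "__main__":
    print(run("text", "chocolate-chip cookies", "text"))
    print(run("speech", {"transcript": "raisin cookies"}, "speech"))
```

Adding a new mode is just one more entry in `INPUT_MODES` or `OUTPUT_MODES`; the middle step stays the same.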

Examples:

Take a sample database. Add speech-to-text input, then text-to-speech output. Apply a voice model, set to conversational mode, and use a datacenter for real-time operation. Say hi to Maya on your phone or browser! Ask her how to make chocolate-chip cookies, then try interrupting her to change it to raisins with gluten-free flour:


What if your input mode was a drawing and your output mode was a video?


Or what if your input mode was a drawing and your output mode was a whole computer program?


If you're willing to see past the curtain, there are only 3 parts to any AI system, that's it! Data, query, interface! You don't need to know how to code to program in ChatGPT, or how to do web connectors to work in n8n...that's the beauty of it! It's like how artists can use game engines to create beautiful games without needing to master programming...you can start building stuff right now, today, for free!!

 

[DHT]Osiris

Lifer
Dec 15, 2015
17,389
16,670
146
I do not understand how this is supposed to work. You need cooling for compute, and cooling is very very hard in space. It worked on the ISS because the ISS is an extremely large structure with elements designed for it. I don't see how it could ever be profitable to deploy very large structures for computing resources you can never manually service. Literally putting them in the ocean would be more reasonable.
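
For a rough sense of scale, here's a back-of-the-envelope Python sketch using the Stefan-Boltzmann law, since radiating heat away is the only way to dump it in vacuum. The numbers are assumptions picked for illustration, not from any actual proposal: a 1 MW heat load, a flat radiator at 300 K with emissivity 0.9, only one side radiating, and no sunlight or Earth heat being absorbed (which would make things worse).

```python
SIGMA = 5.670374419e-8  # Stefan-Boltzmann constant, W / (m^2 K^4)


def radiator_area_m2(heat_w, temp_k, emissivity=0.9):
    """Area needed to radiate heat_w watts from one side of a flat panel at temp_k."""
    flux = emissivity * SIGMA * temp_k ** 4  # watts radiated per square metre
    return heat_w / flux


if __name__ == "__main__":
    # Assumed example: 1 MW of compute heat, radiator held at 300 K.
    area = radiator_area_m2(1_000_000, 300.0)
    print(f"{area:.0f} m^2")
```

Under those assumptions it comes out to roughly 2,400 square metres of radiator per megawatt. Running the radiator hotter helps a lot, since the area scales with 1/T^4, but the chips then have to tolerate that temperature.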
 

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
51,670
7,288
136
2-minute anime short made with Sora:

OP answers questions in their comments

15-second clips stitched together in Adobe Premiere. Sora image input for character and environment consistency.

This project took about 4-8 hours a day generating & editing 20-35s clips. After 4 days, I put them all together in about 2 hours since most of the work was already done. It was all done through the Sora website so absolutely zero budget!


I'll be dropping a Discord later this week with workflows, pipelines, and prompt templates. I'll also share the full prompts for the project.

Absolutely game-changing! This will democratize film animation creation for the next generation!!

 

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
51,670
7,288
136
Nano Banana (photo) + Wan 2.2 Replace (video) to de-age:



19 years ago, they did this with a team of people in X-Men:

17 years ago, with Brad Pitt in Benjamin Button:


Brad Pitt today:

[Attached image: 1761670425046.png]

De-aged 20 years in Nano Banana:

[Attached image: 1761670402477.png]