The AI discussion thread

Page 65 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
51,535
7,230
136
What the field needs right now is digital signatures for non AI work and browser plugins that automatically check for those.

They have a pretty good text-detector:


It is far from perfect, however:


They already watermark the images & video with invisible watermarks:


The logo watermarks are easy to defeat...with AI lol:

 

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
51,535
7,230
136
Introducing RTFM (Real-Time Frame Model): a highly efficient World Model that generates video frames in real time as you interact with it, powered by a single H100 GPU.

lol...a single H100 GPU costs $30,000 :p But seriously, that's some crazy technology & crazy efficiency:

RTFM can be seen as a learned renderer: it is an autoregressive diffusion transformer trained end-to-end on large-scale video data, and it learns to model 3D geometry, reflections, shadows and more just by observing them in its training set.

So they can build smarter & smarter custom-rendered 3D-navigatable environments & simply stream them to your device!


 

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
51,535
7,230
136
Introducing RTFM (Real-Time Frame Model): a highly efficient World Model that generates video frames in real time as you interact with it, powered by a single H100 GPU.

lol...a single H100 GPU costs $30,000 :p But seriously, that's some crazy technology & crazy efficiency:



So they can build smarter & smarter custom-rendered 3D-navigatable environments & simply stream them to your device!

All you need is a few photos to recreate a real environment!

 

Chaotic42

Lifer
Jun 15, 2001
34,783
1,962
126
I pay for OpenAI, so I'm hopeful I'll get a Sora 2 key at some point. Sora 1's videos aren't that interesting and it easily goes off the rails.
 
  • Like
Reactions: Kaido

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
51,535
7,230
136
I pay for OpenAI, so I'm hopeful I'll get a Sora 2 key at some point. Sora 1's videos aren't that interesting and it easily goes off the rails.

I have Freepik, which is a multi-model workflow subscription website:


You buy a subscription to get access & can load additional credits as needed:


It's basically a way to:

1. Get access to a lot of different photo & video models
2. Have workflow integration (video generator, video editor, etc.)

You can run stuff like LM Studio & Msty for local, offline GPT's. I have some IT customers who require closed-environment solutions & those are great for that! But you are hardware-limited as far as performance goes. If you like to tinker, Hugging Face is an online playground with over 2 million AI models available.

I have a 1080Ti from like 2017, so my local horsepower is not all that great, plus I'm more interested in using AI to DO stuff with. Some models use your paid credits & others are unlimited. Their photo editor gives you unlimited generations with Nano Banana & it has Ideogram for illustration...I haven't touched my Adobe suite in like a month lol.

The video generator has all the latest stuff, including the more advanced Sora 2 Pro & WAN 2.5, plus I get unlimited free generations of WAN 2.2 under the subscription. They also have ElevenLabs for voice & whatnot. Overall, my current favorite tools are:

1. Freepik (multimedia suite)
2. ChatGPT Plus (search, PDF Q&A, voice chat)
3. NotebookLM (studying & research)
4. Midjourney (incredible UI ^ image generation)
5. Suno (music)

1760662081364.png
 
  • Like
Reactions: Chaotic42

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
51,535
7,230
136
I pay for OpenAI, so I'm hopeful I'll get a Sora 2 key at some point. Sora 1's videos aren't that interesting and it easily goes off the rails.

The tools today are crazy tho...here's some free marketing @Gizmo j

Nano Banana image generation via text prompting:
1760665822059.png

Nano Banana image-to-image scene editing:

1760667077072.png

WAN 2.2 video:

bar_10mb.gif
 
Last edited:

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
51,535
7,230
136
I have not been impressed with Microsoft's Copilot until today: Copilot Actions. Basically:

1. It creates a non-admin account
2. It opens a remote desktop screen
3. You explain what you want it to do in conversational language & it logs every step as it works

@RossMAN here you go!!


DANG MAN!

Microsoft states that it is securing Copilot AI agents through four security and privacy principles: using distinct agent accounts, limiting agent privileges on files and folders, ensuring operational trust through digitally signed agents, and ensuring that agents are governed by the Microsoft Privacy Statement and Responsible AI Standard.

Each AI agent will run under its own distinct "standard" Windows account, which means it does not have administrative privileges. As each agent uses its own account, it means Windows can restrict agents based on applications and file system access rules.

At launch, agents will only have access to the standard Windows data folders, such as Documents, Downloads, Desktop, and Pictures, and other "resources" available to all accounts. Access to other file locations can be configured using the Windows access control lists (ACLs).

BleepingComputer asked Microsoft if they would be adding easier ways to manage file system access for agents and was told that more granular security controls would be coming at a later date.

Microsoft further told BleepingComputer that each Agent Workspace, where Copilot Actions performs its tasks, is implemented as a Windows Remote Desktop child session, rather than as a virtual machine or within a Windows Sandbox.

A Windows Remote Desktop child session is a distinct, isolated desktop environment tied to a user's existing session, preventing the agent from directly viewing or interacting with the user's desktop.

"Each agentic app will manage their own agent workspace. For isolation purposes, there will not be crossover of workspaces across apps," Microsoft told BleepingComputer.

While the AI agent can't access a user's desktop, Microsoft plans to implement a way for users to authorize, monitor, and take control of agent actions in the workspace.

 
  • Like
Reactions: RossMAN

sdifox

No Lifer
Sep 30, 2005
100,228
17,894
126
I have not been impressed with Microsoft's Copilot until today: Copilot Actions. Basically:

1. It creates a non-admin account
2. It opens a remote desktop screen
3. You explain what you want it to do in conversational language & it logs every step as it works

@RossMAN here you go!!


DANG MAN!



lol I am not trusting my local file to any ai
 

Red Squirrel

No Lifer
May 24, 2003
70,565
13,802
126
www.anyf.ca
Bet all that gets sent to the MS servers too, I doubt it's local. So this could be a huge privacy issue especially if dealing with sensitive data.


I'm a test subject at work for Windows 11 as I got deployed a new PC that has it installed and every now and then copilot pops up, it's really random, like if I'm copying text or something. Sending random bits of text to it could be an issue too especially if dealing with customer info. Just glad I don't run windows at home anymore it's become too invasive imo.
 

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
51,535
7,230
136
Yeah, I'm slowly preparing to just go full Linux except for my gaming PC which will be as isolated as possible from everything else.

 

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
51,535
7,230
136
Bet all that gets sent to the MS servers too, I doubt it's local. So this could be a huge privacy issue especially if dealing with sensitive data.


I'm a test subject at work for Windows 11 as I got deployed a new PC that has it installed and every now and then copilot pops up, it's really random, like if I'm copying text or something. Sending random bits of text to it could be an issue too especially if dealing with customer info. Just glad I don't run windows at home anymore it's become too invasive imo.



:(