> All examples are already correlated because they are generated in the same way.
All examples of “document information extraction” would be correlated no matter where they come from because they all would be “document information extraction” examples…
The real question is whether or not the examples are representative of the broad “document information extraction” use-case.
The problem is the methodology they use to hold them out. For a truly independent validation set, they need to hold out the material before augmentation, not after.
If you hold out after augmentation, the validation set already carries the biases of the augmentation regimen, so you artificially boost your model's measured performance. That is not sufficient to demonstrate the model is generalizing properly.
By analogy: instead of taking leaves from different trees, they are taking leaves from different branches of the same tree.
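Concretely, the fix is just reordering two steps: split first, then augment only the training side. Here's a minimal sketch in Python; the `augment` callable and the document list are hypothetical stand-ins for whatever pipeline they actually use:

```python
import random

def split_then_augment(documents, augment, n_copies=3, holdout_frac=0.2, seed=0):
    # Hold out raw documents BEFORE augmentation.
    rng = random.Random(seed)
    docs = list(documents)
    rng.shuffle(docs)
    n_holdout = int(len(docs) * holdout_frac)
    val_set = docs[:n_holdout]        # raw originals only, never augmented
    train_docs = docs[n_holdout:]
    # Augmentation happens strictly after the split, so no augmented
    # variant of a validation document can leak into training.
    train_set = []
    for d in train_docs:
        train_set.append(d)
        train_set.extend(augment(d) for _ in range(n_copies))
    return train_set, val_set
```

The leaky version does these steps in the opposite order: augment everything, then sample the validation set from the augmented pool, so the "held-out" examples are near-duplicates of training examples.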
That would definitely make the evaluation more robust. My fear is that, with LLMs at hand, people have become allergic to preparing good human-labelled evaluation sets and will always, to some degree, use an LLM as a crutch.
I’m wondering how they really prevent uploads of other people’s faces if someone takes a clip from a video of another person. I’m sure Apple didn’t open up 3D Face ID scanning to them for verification.
No doubt they can create Hollywood-quality clips if the tools are good enough to keep objects consistent (for example, returning to the same scene with the same decor) and to keep actors emotionally consistent.
I think this is not nearly as important as most people think it is.
In Hollywood movies, everyone already knows about "continuity errors", like when the water level in a glass goes up over time because shots were spliced together. Sometimes a shot with a continuity error is explicitly chosen by the editor because that take had the most emotional resonance for the scene.
These types of things rarely affect our human subjective enjoyment of a video.
In terms of physics errors: current human-made CGI has physics errors too. People just accept it and move on. We know Superman can't lift an airplane, because the fuselage wouldn't hold all that weight at a single point, but, like, whatever.
Location consistency is important. Even something as simple and subtle as breaking the 180-degree rule [1] feels super uncanny to most audiences. Let alone changing the set the actor occupies, their wardrobe, props, etc.
There are lots of tools being built to address this, but they're still immature.
Well put. Honestly, the actor part is mostly solved by now; the tricky part is depicting any kind of believable, persistent space across different shots. Based on amateur outputs from places like https://www.reddit.com/r/aivideo/, at least!
This release is clearly capable of generating mind-blowingly realistic short clips, but I don't see any evidence that longer, multi-shot videos can be automated yet. With a professional's time and existing editing techniques, however...
I wonder if this stuff is trained on enough Hallmark movies that even AI actors will buy a hot coffee at a cafe and then proceed to flail the empty cup around like the humans do. Really takes me out of the scene every time - they can't even put water in the cup!?
No way man, this is why I loved Mr. Robot: they actually paid a real expert and worked the story around realism, instead of made-up gobbledygook that shuts my brain off entirely with its nonsense.
This is what happens when you let the AI run for 30 minutes. Ain’t no way you’ll read the code with much critique if it’s a one-hour-plus read. You have to generate compartmentalized code so you don’t need to check as much.
Maybe inside a social network made specifically for AI, but a concerning number of people don't realize images and videos are AI, even when it's bad AI. As it gets better, and starts integrating the poster's image (like Sora 2), that's going to get even worse.