site banner

Small-Scale Question Sunday for December 28, 2025

Do you have a dumb question that you're kind of embarrassed to ask in the main thread? Is there something you're just not sure about?

This is your opportunity to ask questions. No question too simple or too silly.

Culture war topics are accepted, and proposals for a better intro post are appreciated.

1
Jump in the discussion.

No email address required.

the SFX quality is arguably a step above modern CGI in many cases (Avatar movies notwithstanding).

If nothing else (and that's an if that won't hold); AI is to CGI as CGI was to stop-motion (and many other practical effects). CGI is soon to be over as the state of the art way to produce special effects. It will be reduced tremendously in it's purpose

I think that is a correct analogy.

My guess is that there might be an opening where very low-fidelity renderings are used to map out the action on screen, but AI is doing the work of dozens of other animators in texturing, lighting, simulating and 'rendering' the actual image on screen, with a human just nudging it along and rejecting outputs as they go.

The missing step seems to be fine-grained control over the details, but creators like Gossip Goblin have been able to keep an extremely consistent style, so either that's a solved problem or they've got their prompts refined to a point that they aren't having to toss out much.

The quality available at what has to be a fraction of the cost of traditional FX is going to lead to rapid uptake.

Something like SCAIL and LoRA abuses can probably do that today and is probably already getting used in that sense today, but the current version of the technology goes a little nuts for segments longer than 9 seconds, and it's painful to do even short segments using the existing workflows, on top of being egregiously slow on consumer hardware. I've seen people take it into a couple minutes by doing really aggressive generation of prompts to make a flipshow to start with, but anything longer than that tends to either end up needing to compromise on weird physics or ugly scene changes.

And the current implementations have some limits; pose info can't do talking heads well, going beyond three characters with pose info gets rough, and some particular pose changes can go full-on Exorcist. SCAIL's lipsync capabilities are worse than WAN animate, and while it's possible to combine them, it's even more finicky.

But compared to the cost and unpleasantness of traditional mocap, or even makeup? If you can possibly use this tech, there's a lot of good arguments in its favor.