This is the next frontier for AI art as it will let you build a series, graphic novel, or even video with consistent objects.
There’s techniques like textual inversion that let you associate a label with an object, but they rely on having multiple images of that object already, so it won’t work for an image you just generated. To get around that, some people have tried using tools to generate multiple images of a synthetic object, eg Deep Nostalgia that can animate a static portrait photo.
So in theory you select one photo with the AI image generator, create variants of it with separate image tools, then build a fine-tuned model based on some cherry-picked variants.
I think this will get easier as AI image tools focus more on depth and 3D modelling.
The “aiactors” subreddit has some interesting experiments along these lines.
So in theory you select one photo with the AI image generator, create variants of it with separate image tools, then build a fine-tuned model based on some cherry-picked variants.
I think this will get easier as AI image tools focus more on depth and 3D modelling.
The “aiactors” subreddit has some interesting experiments along these lines.