Midjourney debuts characteristic for producing fixed characters throughout multiple gen AI photos

Digital Author March 12, 2024

0 1 5 minutes read

March 11, 2024 7: 10 PM

Example of an older muscular bald man with beard in various different AI images.

Credit rating: VentureBeat made with Midjourney V6

Join leaders in Boston on March 27 for a typical night time of networking, insights, and dialog. Quiz an invite right here.

The fashionable AI image producing carrier Midjourney has deployed indubitably one of its most oft-requested aspects: the flexibility to recreate characters continuously throughout new photos.

This has been a major hurdle for AI image generators to-date, by their very nature.

That’s attributable to most AI image generators depend upon “diffusion units,” tools equivalent to or in step with Stability AI’s Stable Diffusion commence-source image generation algorithm, which work roughly by taking text inputted by an person and attempting to share collectively a image pixel-by-pixel that suits that description, as learned from same imagery and text tags in their large (and controversial) coaching recordsdata field of millions of human created photos.

Why fixed characters are so mighty — and elusive — for generative AI imagery

But, as is the case with text-based fine language units (LLMs) a lot like OpenAI’s ChatGPT or Cohere’s new Characterize-R, the disaster with all generative AI purposes is in their inconsistency of responses: the AI generates something new for each urged entered into it, even though the urged is repeated or one of the most the same key phrases are extinct.

VB Match

The AI Affect Tour – Boston

We’re infected for the following stop on the AI Affect Tour in Boston on March 27th. This habitual, invite-simplest occasion, in partnership with Microsoft, will characteristic discussions on finest practices for recordsdata integrity in 2024 and previous. Set is dinky, so demand an invite at the new time.

Quiz an invite

Here’s nice for producing total new pieces of snort material — within the case of Midjourney, photos. But what whenever you’re storyboarding a film, a fresh, a graphic fresh or comic e book, or any other visible medium where you wish the the same persona or characters to go by it and seem in utterly different scenes, settings, with utterly different facial expressions and props?

This staunch scenario, which is in general main for story continuity, has been very complex to attain with generative AI — to this point. But Midjourney is now taking a crack at it, introducing a brand new label, “–cref” (rapid for “persona reference”) that customers can add to the end of their text prompts within the Midjourney Discord and may presumably additionally peaceful are attempting to check the persona’s facial aspects, body variety, and even dresses from a URL that the person pastes in following mentioned label.

As the characteristic progresses and is refined, it may perchance presumably additionally use Midjourney farther from being a groovy toy or ideation source into more of a real tool.

How one can use the new Midjourney fixed persona characteristic

The label works finest with previously generated Midjourney photos. So, as an instance, the workflow for an person would be to first generate or retrieve the URL of a previously generated persona.

Let’s originate from scratch and notify we’re producing a brand new persona with this urged: “a muscular bald man with a bead and be conscious patch.”

We’ll upscale the image that we love finest, then preserve an eye on-click it within the Midjourney Discord server to search out the “reproduction link” probability.

Then, we’re going to variety a brand new urged in “wearing a white tuxedo standing in a villa –cref [URL]” and paste within the URL of the image we true generated, and Midjourney will are attempting to generate that same persona from sooner than in our newly typed setting.

As you’ll glance, the outcomes are removed from staunch to the fresh persona (and even our fashioned urged), however no doubt encouraging.

In addition, the person can preserve an eye on to a degree the “weight” of how intently the new image reproduces the fresh persona by making use of the label “–cw” followed by a number 1 by 100 to the end of their new urged (after the “–cref [URL]” string, so love this: “–cref [URL] –cw 100.” The decrease the “cw” number, the more variance the following image can grasp. The easier the “cw” number, the more intently the following new image will educate the fresh reference.

As you may presumably glance in our example, inputting a extraordinarily low “cw 8” truly returns what we wished: the white tuxedo. Though now it has eliminated our persona’s distinctive eyepatch.

Oh effectively, nothing a dinky of “range quandary” can’t repair — swish?

Good ample, so the eyepatch is on the scandalous be conscious…however we’re getting there!

You may presumably presumably additionally mix multiple characters into one utilizing two “–cref” tags aspect by aspect with their respective URLs.

The characteristic true went stay earlier this evening, however already artists and creators are attempting out it now. Strive it on your self whenever you grasp Midjourney. And read founder David Holz’s beefy showcase about it below:

Hiya @all individuals @right here we’re attempting out a brand new “Character Reference” characteristic at the new time Here’s equivalent to the “Vogue Reference” characteristic, except in desire to matching a reference kind it tries to make the persona match a “Character Reference” image.

The plan it genuinely works

Variety --cref URL after your urged with a URL to a image of a persona
You may presumably presumably use --cw to alter reference ‘energy’ from 100 to 0
energy 100 (--cw 100) is default and makes use of the face, hair, and dresses
At energy 0 (--cw 0) it’ll true focal point on face (moral for changing outfits / hair and tons others)

What it’s meant for

This characteristic works finest when utilizing characters made of Midjourney photos. It’s now now not designed for staunch individuals / photos (and may presumably additionally peaceful most likely distort them as typical image prompts attain)
Cref works within the same kind to typical image prompts except it ‘focuses’ on the persona traits
The precision of this approach is dinky, it received’t reproduction staunch dimples / freckles / or tshirt trademarks.
Cref works for every Niji and fashioned MJ units and additionally may presumably additionally additionally be blended with --sref

Evolved Points

You may presumably presumably use better than one URL to mix the solutions /characters from multiple photos love this --cref URL1 URL2 (this is equivalent to multiple image or kind prompts)

How does it work on the rep alpha?

Slide or paste a image into the take into consideration bar, it now has three icons. selecting these sets whether or now now not it’s a image urged, a kind reference, or a persona reference. Shift+have interaction an probability to use a image for multiple categories

Be conscious, while MJ V6 is in alpha this and other aspects may presumably additionally commerce all correct now, however V6 unswerving beta is coming soon. We’d love all individuals’s solutions in ⁠solutions-and-aspects We hope you revel on this early free up and hope it helps you play with constructing tales and worlds

VentureBeat’s mission is to be a digital city sq. for technical decision-makers to attain recordsdata about transformative endeavor technology and transact. Explore our Briefings.