People are already saying Stable Diffusion 3 trounces Midjourney and DALL-E

An AI image generated in Stable Diffusion 3

(Image credit: Stability AI)

Another day, another massive leap forward in AI image generation. With Midjourney and OpenAI's DALL-E both getting big upgrades late last year, Stability AI's open-source Stable Diffusion needed to pull off something special to remain relevant. And it looks like it's done just that.

Stable Diffusion 3, which is still only available as a preview via a waiting list, uses a whole new diffusion transformer architecture and flow matching. Public access is very limited, but based on sample images and the few results shared by those who have access, it looks like another big advance in several areas. And it seems we're going to have to update our guide to the best AI art generators.

One of the big improvements in Midjourney V6, DALL-E 3 and Google Imagen 2 was a much more consistent handling of text. They can recognise when we're including instructions for text in prompts and can render it correctly... sometimes. Basically, Stable Diffusion was left trailing way behind as the only major AI image generator that still couldn't spell. That appears to have been fixed in the upgrade to Stable Diffusion 3.

Latest Videos From Creative Bloq

But there's more. Image quality also seems to have improved. But what looks most impressive of all so far is the new model's adherence to complex text prompts; that is, prompts that ask it for several specific elements rather than just a 'cat wearing a hat'. This could make Stable Diffusion 3 a more viable option for realising more specific creative visions. It should also make inpainting – the editing of sections of the initial image to swap out elements – more reliable.

YouTuber MattVidPro AI described Stable Diffusion 3 as "easily the most capable AI image generator we have seen to date". He says it beats DALL-3 in prompt understanding. His comparisons aren't based on his own use, however. He compares the sample images provided by Stability AI with his own test results from DALL-E 3 and Midjourney. Naturally, we presume that Stability AI has shared the best images it could produce, possibly after many many attempts.

Thank you for reading 5 articles this month* Join now for unlimited access

Enjoy your first month for just £1 / $1 / €1

*Read 5 free articles per month without a subscription

Join now for unlimited access

Try first month for just £1 / $1 / €1

TOPICS

Joe is a regular freelance journalist and editor at Creative Bloq. He writes news, features and buying guides and keeps track of the best equipment and software for creatives, from video editing programs to monitors and accessories. A veteran news writer and photographer, he now works as a project manager at the London and Buenos Aires-based design, production and branding agency Hermana Creatives. There he manages a team of designers, photographers and video editors who specialise in producing visual content and design assets for the hospitality sector. He also dances Argentine tango.

Get the Creative Bloq Newsletter