7 Text-To-Image AI Models: Tested
I look at the seven main players in AI image generation.
Hey, remember when I demoed six text-to-video sites?
Cat genitals, psychedelics, creepy chimeras, and other shenanigans? Ring a bell?
Today, I want to do the same for AI images. (Hopefully with 100% fewer cat penises.)
By my latest count, we now have seven primary public text-to-image models1:
DALL-E 3 (OpenAI)
Emu (Meta)
Firefly Image 2 (Adobe)
Ideogram (Ideogram)
Imagen (Google)
Midjourney 5.2 (Midjourney)
SDXL (Stability AI)2
Let’s check out the images they generate and learn more about the models.
The process
This won’t be a deep-dive showdown like my SDXL 1.0 vs. Midjourney 5.2 post.
Instead, I’ll briefly introduce each model and showcase the visuals it generates. To keep things consistent and comparable, I’ll be using the same 6 prompts for each model:
Tulips in a meadow, golden hour, watercolor painting
Parrot on a branch, wildlife photography, National Geographic
P…

