Text-to-Image Model Showdown: GPT-4o vs. Ideogram 3.0 vs. Reve 1.0
A steampunk platypus, a cyberpunk goose, and a dieselpunk duck walk into a club...
Last week was crazy, y’all!
After months of relative calm on the text-to-image scene, three top-tier image models suddenly rolled out within days of each other: GPT-4o, Ideogram 3.0, and Reve 1.0.
As if that wasn’t enough, Midjourney is also gearing up to release the long-awaited V7.
But while we wait for that, I wanted to put last week’s three “best” models through their paces.
It’s hard to grasp how quickly we went from diffusion models that could barely string a dozen words together in my spelling test…
…to native image models that can effortlessly write entire pages of text inside an image.
Not so long ago, I wrote about the somewhat lengthy back-and-forth process of working on AI-generated cartoons for
But now, based on nothing but this short, vague prompt…
Make a hilarious four-panel comic about our relationship with modern technology, relying on relatable tropes.
…GPT-4o comes up with the concept, lays out the four panels,…




