<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Why Try AI: Hot Takes]]></title><description><![CDATA[Ad hoc tests and commentary on emerging AI models, tools, and so on.]]></description><link>https://www.whytryai.com/s/hot-takes</link><image><url>https://substackcdn.com/image/fetch/$s_!raEn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c4d0362-24d4-4046-9ccd-cb331c34edc4_1024x1024.png</url><title>Why Try AI: Hot Takes</title><link>https://www.whytryai.com/s/hot-takes</link></image><generator>Substack</generator><lastBuildDate>Sun, 21 Jun 2026 03:21:30 GMT</lastBuildDate><atom:link href="https://www.whytryai.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Daniel Nest]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[whytryai@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[whytryai@substack.com]]></itunes:email><itunes:name><![CDATA[Daniel Nest]]></itunes:name></itunes:owner><itunes:author><![CDATA[Daniel Nest]]></itunes:author><googleplay:owner><![CDATA[whytryai@substack.com]]></googleplay:owner><googleplay:email><![CDATA[whytryai@substack.com]]></googleplay:email><googleplay:author><![CDATA[Daniel Nest]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Do You Even Need Claude Fable 5?]]></title><description><![CDATA[Anthropic's new top model solves problems most of us don't have.]]></description><link>https://www.whytryai.com/p/claude-fable-5</link><guid isPermaLink="false">https://www.whytryai.com/p/claude-fable-5</guid><dc:creator><![CDATA[Daniel Nest]]></dc:creator><pubDate>Thu, 11 Jun 2026 
08:21:51 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/2c97d3bd-fa2a-44f1-93d7-26d4bfd132f2_1731x909.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>TL;DR</h2><p>Anthropic&#8217;s new top-tier model (Claude Fable 5/Mythos 5) is way too pricey and comes with access restrictions, but most of us don&#8217;t need it anyway.</p><h2>What is it?</h2><p>Fable 5/Mythos 5 is <a href="https://www.anthropic.com/news/claude-fable-5-mythos-5">Anthropic&#8217;s latest model</a> and a step change for frontier LLMs.</p><div id="youtube2-Y9Wz2PV404E" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;Y9Wz2PV404E&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/Y9Wz2PV404E?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>Before we move on, let&#8217;s clear up the whole Fable vs. Mythos thing, so I don&#8217;t have to keep writing them in tandem:</p><ul><li><p><strong>Fable 5</strong>: Is <em>the same underlying model</em> as Mythos 5 but with additional safeguards in place that automatically reroute certain sensitive queries (e.g. cybersecurity and biology) to Opus 4.8 instead. Available to everyone.</p></li><li><p><strong>Mythos 5</strong>: The &#8220;full&#8221; version without any safeguards. 
Available only to a vetted group of &#8220;cyberdefenders and infrastructure providers.&#8221;</p></li></ul><p>Claude Fable 5 leads the pack on virtually every benchmark self-reported by Anthropic:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2QA8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F717516c4-d0db-4739-a21a-a0fd8a437948_2600x2870.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2QA8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F717516c4-d0db-4739-a21a-a0fd8a437948_2600x2870.webp 424w, https://substackcdn.com/image/fetch/$s_!2QA8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F717516c4-d0db-4739-a21a-a0fd8a437948_2600x2870.webp 848w, https://substackcdn.com/image/fetch/$s_!2QA8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F717516c4-d0db-4739-a21a-a0fd8a437948_2600x2870.webp 1272w, https://substackcdn.com/image/fetch/$s_!2QA8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F717516c4-d0db-4739-a21a-a0fd8a437948_2600x2870.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2QA8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F717516c4-d0db-4739-a21a-a0fd8a437948_2600x2870.webp" width="1456" height="1607" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/717516c4-d0db-4739-a21a-a0fd8a437948_2600x2870.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1607,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Benchmark table showing Claude Fable and Mythos compared to other leading models&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Benchmark table showing Claude Fable and Mythos compared to other leading models" title="Benchmark table showing Claude Fable and Mythos compared to other leading models" srcset="https://substackcdn.com/image/fetch/$s_!2QA8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F717516c4-d0db-4739-a21a-a0fd8a437948_2600x2870.webp 424w, https://substackcdn.com/image/fetch/$s_!2QA8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F717516c4-d0db-4739-a21a-a0fd8a437948_2600x2870.webp 848w, https://substackcdn.com/image/fetch/$s_!2QA8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F717516c4-d0db-4739-a21a-a0fd8a437948_2600x2870.webp 1272w, https://substackcdn.com/image/fetch/$s_!2QA8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F717516c4-d0db-4739-a21a-a0fd8a437948_2600x2870.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft 
pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <strong><a href="https://www.anthropic.com/news/claude-fable-5-mythos-5">Anthropic</a></strong>.</figcaption></figure></div><p>But you don&#8217;t have to take Anthropic&#8217;s word for it.</p><p>The emerging consensus is that Fable 5 truly <em>is</em> a leap forward across the board.</p><p>Opinions aren&#8217;t nearly as divided as they&#8217;ve been for <a href="https://www.whytryai.com/p/what-the-claude-is-going-on-with">Anthropic&#8217;s latest releases</a>, and even the notoriously critical Reddit crowd <a href="https://www.reddit.com/r/claudexplorers/comments/1u1bg4m/fable_5_is_out_megathread/">seems broadly positive</a>.</p><p>You can find real-world tests and examples below in &#8220;Further reading &amp; watching,&#8221; but 
the short of it is: Fable 5 is a state-of-the-art model that excels at the hardest, most complex long-horizon tasks and agentic coding work.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><h2>How do you use it?</h2><p>Do you have a <a href="https://claude.com/pricing">paid Claude</a> subscription?</p><p>Congratulations: You already have access to Claude Fable 5.</p><p>Just open your <a href="https://claude.com/download">Claude app</a> or <a href="https://claude.ai/">claude.ai</a>, and you&#8217;ll see Fable 5 in the model picker:<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ELdb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99631258-7257-4eb1-a95c-fb6fe96f5c66_285x342.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ELdb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99631258-7257-4eb1-a95c-fb6fe96f5c66_285x342.png 424w, https://substackcdn.com/image/fetch/$s_!ELdb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99631258-7257-4eb1-a95c-fb6fe96f5c66_285x342.png 848w, 
https://substackcdn.com/image/fetch/$s_!ELdb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99631258-7257-4eb1-a95c-fb6fe96f5c66_285x342.png 1272w, https://substackcdn.com/image/fetch/$s_!ELdb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99631258-7257-4eb1-a95c-fb6fe96f5c66_285x342.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ELdb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99631258-7257-4eb1-a95c-fb6fe96f5c66_285x342.png" width="285" height="342" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/99631258-7257-4eb1-a95c-fb6fe96f5c66_285x342.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:342,&quot;width&quot;:285,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:29100,&quot;alt&quot;:&quot;Claude app model picker showing Fable 5 selected with an \&quot;Included until June 22\&quot; badge, above Opus 4.8, Sonnet 4.6, and Haiku 4.5 options.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/201350994?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99631258-7257-4eb1-a95c-fb6fe96f5c66_285x342.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Claude app model picker showing Fable 5 selected with an &quot;Included until June 22&quot; badge, above Opus 4.8, Sonnet 4.6, and Haiku 4.5 options." 
title="Claude app model picker showing Fable 5 selected with an &quot;Included until June 22&quot; badge, above Opus 4.8, Sonnet 4.6, and Haiku 4.5 options." srcset="https://substackcdn.com/image/fetch/$s_!ELdb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99631258-7257-4eb1-a95c-fb6fe96f5c66_285x342.png 424w, https://substackcdn.com/image/fetch/$s_!ELdb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99631258-7257-4eb1-a95c-fb6fe96f5c66_285x342.png 848w, https://substackcdn.com/image/fetch/$s_!ELdb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99631258-7257-4eb1-a95c-fb6fe96f5c66_285x342.png 1272w, https://substackcdn.com/image/fetch/$s_!ELdb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99631258-7257-4eb1-a95c-fb6fe96f5c66_285x342.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft 
icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Go crazy!</p><p>Ah, but you must hurry, fellow explorer: Regular Claude subscribers only have <strong>until June 22</strong> to test this mythical model. For from June 23 onward, Fable 5 shall only reveal its powers to those willing to <a href="https://support.claude.com/en/articles/12429409-manage-usage-credits-for-paid-claude-plans">pay its legendary per-usage fees</a>.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a></p><p>Even before the June 23 switcheroo, Fable 5 will burn through your daily/weekly rate limits at twice the speed of the already expensive Opus 4.X model family.</p><p>To summarize, Fable 5 is:</p><ul><li><p>&#8220;Mythos 5 Lite&#8221; with safety guardrails that limit its range of use cases.</p></li><li><p>Only available in existing paid Claude plans until June 22.</p></li><li><p>Anthropic&#8217;s most expensive model (double the cost of Opus 4.8).</p></li></ul><p>Looks like the &#8220;<a href="https://www.thealgorithmicbridge.com/p/the-ai-rich-and-the-ai-poor">AI rich vs. 
AI poor</a>&#8221; future <span class="mention-wrap" data-attrs="{&quot;name&quot;:&quot;Alberto Romero&quot;,&quot;id&quot;:91075008,&quot;type&quot;:&quot;user&quot;,&quot;url&quot;:null,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6cc40fb4-3e5b-43e0-8e5e-820ba35f4e02_1153x1152.jpeg&quot;,&quot;uuid&quot;:&quot;847412fd-e99a-421e-ac37-a940b3f41654&quot;}" data-component-name="MentionToDOM">Alberto Romero</span> predicted is finally here.</p><p>But wait&#8230;how much does this <em>really</em> matter to you?</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><h2>Why should you (not) care?</h2><p>Okay, real talk.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><p>For the overwhelming majority of us, Fable 5 is pure overkill.</p><p>Unless you do frontier research or work on complex software engineering projects <a href="https://www.anthropic.com/news/claude-fable-5-mythos-5#:~:text=life%20sciences%20research.-,Software%20engineering,-.%20During%20early">like Stripe&#8217;s reported codebase migration</a>, you simply don&#8217;t need Fable 5&#8217;s wizard skills.</p><p>Take a look at these ELO scores for <a href="https://artificialanalysis.ai/#gdpval">GDPval-AA</a>, which &#8220;evaluates AI models on real-world, economically valuable tasks across a wide range of occupations&#8221;:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" 
href="https://substackcdn.com/image/fetch/$s_!Imht!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78c5b5de-d92a-495b-a3f2-86db9be69263_3256x1688.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Imht!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78c5b5de-d92a-495b-a3f2-86db9be69263_3256x1688.png 424w, https://substackcdn.com/image/fetch/$s_!Imht!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78c5b5de-d92a-495b-a3f2-86db9be69263_3256x1688.png 848w, https://substackcdn.com/image/fetch/$s_!Imht!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78c5b5de-d92a-495b-a3f2-86db9be69263_3256x1688.png 1272w, https://substackcdn.com/image/fetch/$s_!Imht!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78c5b5de-d92a-495b-a3f2-86db9be69263_3256x1688.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Imht!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78c5b5de-d92a-495b-a3f2-86db9be69263_3256x1688.png" width="3256" height="1688" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/78c5b5de-d92a-495b-a3f2-86db9be69263_3256x1688.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1688,&quot;width&quot;:3256,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:551684,&quot;alt&quot;:&quot;Artificial Analysis GDPval-AA leaderboard bar chart: Claude Fable 5 leads at 1932 ELO, just 42 points above Claude Opus 4.8 at 
1890, with GPT-5.5 third at 1769.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/201350994?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3aeaf49d-3d71-4f50-83cd-687d2cec91f8_3264x1804.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Artificial Analysis GDPval-AA leaderboard bar chart: Claude Fable 5 leads at 1932 ELO, just 42 points above Claude Opus 4.8 at 1890, with GPT-5.5 third at 1769." title="Artificial Analysis GDPval-AA leaderboard bar chart: Claude Fable 5 leads at 1932 ELO, just 42 points above Claude Opus 4.8 at 1890, with GPT-5.5 third at 1769." srcset="https://substackcdn.com/image/fetch/$s_!Imht!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78c5b5de-d92a-495b-a3f2-86db9be69263_3256x1688.png 424w, https://substackcdn.com/image/fetch/$s_!Imht!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78c5b5de-d92a-495b-a3f2-86db9be69263_3256x1688.png 848w, https://substackcdn.com/image/fetch/$s_!Imht!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78c5b5de-d92a-495b-a3f2-86db9be69263_3256x1688.png 1272w, https://substackcdn.com/image/fetch/$s_!Imht!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78c5b5de-d92a-495b-a3f2-86db9be69263_3256x1688.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" 
height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <strong><a href="https://artificialanalysis.ai/#gdpval">Artificial Analysis</a></strong></figcaption></figure></div><p>Now be honest: Do Fable&#8217;s extra 42 ELO points over Opus 4.8 really matter for your daily tasks? How would you even measure it? Are those marginal gains worth the 2x price hike?</p><p>I&#8217;ll take a wild swing here and say: No, they aren&#8217;t. Not for the average person.</p><p>For instance, I consider myself to be at least an intermediate AI user. 
I work with and test AI tools daily to keep up with all the madness for this newsletter.</p><p>I run <a href="https://www.whytryai.com/p/claude-code-codex-shared-brain">Claude Code together with Codex</a>, hooked up to my <a href="https://www.whytryai.com/p/obsidian-claude-code-control-center">Obsidian vault as their command center</a>.</p><p>I&#8217;m in the weeds with this stuff, is what I&#8217;m saying.</p><p>But for most of my use cases, I find that the good old Opus 4.6<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> is more than plenty. Hell, even Sonnet 4.6 works well for much of what I need done.</p><p>(I do ground-breaking scientific research only occasionally and have to date cured at most two or three cancers.)</p><p>And it&#8217;s not just me, either.</p><p>In their recent Fable 5 &#8220;vibe check,&#8221; Dan Shipper and the crew at <a href="https://every.to/">Every</a> wrote:</p><blockquote><p><em>&#8230;we found that users who were highly adept with AI&#8212;at Level 7 or 8 on our AI adoption ladder&#8212;found [Fable 5] paradigm-shifting for their hardest tasks. 
Users who were lower down on the curve, however, struggled to find something to use it for.</em></p></blockquote><p>For reference, here are the 8 levels they&#8217;re talking about:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!X7ns!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7479a534-95fc-4a64-af4b-bc3192824cec_726x593.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!X7ns!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7479a534-95fc-4a64-af4b-bc3192824cec_726x593.png 424w, https://substackcdn.com/image/fetch/$s_!X7ns!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7479a534-95fc-4a64-af4b-bc3192824cec_726x593.png 848w, https://substackcdn.com/image/fetch/$s_!X7ns!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7479a534-95fc-4a64-af4b-bc3192824cec_726x593.png 1272w, https://substackcdn.com/image/fetch/$s_!X7ns!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7479a534-95fc-4a64-af4b-bc3192824cec_726x593.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!X7ns!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7479a534-95fc-4a64-af4b-bc3192824cec_726x593.png" width="726" height="593" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7479a534-95fc-4a64-af4b-bc3192824cec_726x593.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:593,&quot;width&quot;:726,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:92164,&quot;alt&quot;:&quot;Every's eight levels of AI adoption table, from Level 1 Chatbot through Copilot, Agent, Autopilot, Workflows, Assistant, and Multi-agent to Level 8 Orchestrator.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/201350994?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7479a534-95fc-4a64-af4b-bc3192824cec_726x593.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Every's eight levels of AI adoption table, from Level 1 Chatbot through Copilot, Agent, Autopilot, Workflows, Assistant, and Multi-agent to Level 8 Orchestrator." title="Every's eight levels of AI adoption table, from Level 1 Chatbot through Copilot, Agent, Autopilot, Workflows, Assistant, and Multi-agent to Level 8 Orchestrator." 
srcset="https://substackcdn.com/image/fetch/$s_!X7ns!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7479a534-95fc-4a64-af4b-bc3192824cec_726x593.png 424w, https://substackcdn.com/image/fetch/$s_!X7ns!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7479a534-95fc-4a64-af4b-bc3192824cec_726x593.png 848w, https://substackcdn.com/image/fetch/$s_!X7ns!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7479a534-95fc-4a64-af4b-bc3192824cec_726x593.png 1272w, https://substackcdn.com/image/fetch/$s_!X7ns!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7479a534-95fc-4a64-af4b-bc3192824cec_726x593.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" 
stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <strong><a href="https://every.to/guides/the-eight-levels-of-ai-adoption">Every</a></strong></figcaption></figure></div><p>So if you&#8217;re not &#8220;managing multiple long-running agents at the same time,&#8221; it&#8217;s perfectly fine to give Fable 5 a pass.</p><p>You don&#8217;t need Fable 5 to draft a quick work email or suggest entertainment ideas for your pet iguana&#8217;s birthday party.</p><p>Fable 5 is also way overqualified for most of your everyday knowledge work.</p><p>It&#8217;s like grabbing a bazooka to take down a mosquito.</p><p>It&#8217;s like taking a Formula 1 car to drive your kids to school.</p><p>It&#8217;s like asking Fable 5 to come up with a third beat for this three-part analogy.</p><p>There&#8217;s also <a href="https://www.youtube.com/watch?v=IREnr4I89Ho">early anecdotal evidence</a> that on certain white-collar tasks like writing and front-end design, Fable 5 fares worse than cheaper models due to its dense engineer-style lingo and verbosity.</p><p>&#8220;But Daniel, I saw that Fable 5 can <a href="https://www.oneusefulthing.org/p/what-it-feels-like-to-work-with-mythos">flawlessly code entire games</a> and build advanced tools from a single prompt! Isn&#8217;t that worth something?&#8221; you ask, conveniently setting up my upcoming response.</p><p>And yes, that appears to be a legitimately magical use of this tech. But:</p><p>a) Let&#8217;s face it: Most of us aren&#8217;t sitting on a backlog of brilliant ideas we&#8217;re itching to will into existence. 
(Although now that the option is available, this may gradually change, which <em>is</em> a net positive for all of us.)</p><p>b) One-shotting complex software is bound to push your token spend into the stratosphere, which brings us full circle to square one.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a></p><p>As it stands, Fable 5 is likely not for me and you&#8230;and that&#8217;s okay.</p><p>Take Fable 5 for a spin on your regular Claude plan until June 22. Poke at it, see what it can do and how it feels compared to what you&#8217;re used to.</p><p>Then ask yourself if you&#8217;re willing to pay double the price&#8212;and wait longer for Fable to painstakingly think through every request&#8212;instead of sticking to Sonnet or Opus.</p><p>My guess?</p><p>We won&#8217;t be too heartbroken when it&#8217;s time to say goodbye to Fable 5 in two weeks.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Why Try AI is a reader-supported publication. 
To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>Further reading &amp; watching</h2><ul><li><p>&#8220;<strong><a href="https://karozieminski.substack.com/p/claude-fable-5-routing-trap">Anthropic Just Split the Frontier in Two</a></strong>&#8221; - <em>Product with Attitude</em></p></li><li><p>&#8220;<strong><a href="https://www.anthropic.com/news/claude-fable-5-mythos-5">Claude Fable 5 and Claude Mythos 5</a></strong>&#8221; - <em>Anthropic</em></p></li><li><p>&#8220;<strong><a href="https://youtu.be/IREnr4I89Ho">Claude Fable 5 &#8211; is this Mythos model worth the wait?</a></strong>&#8221; [VIDEO] - <em>How I AI</em></p></li><li><p>&#8220;<strong><a href="https://www.thealgorithmicbridge.com/p/nine-things-about-claude-mythos-5">Nine Things About Claude Mythos 5 That Matter If You&#8217;re Not an Enterprise Customer</a></strong>&#8221; - <em>The Algorithmic Bridge</em></p></li><li><p>&#8220;<strong><a href="https://every.to/vibe-check/anthropic-mythos-our-fable-vibe-check">Vibe Check: Fable 5 Is the Best Coding Model in the World</a></strong>&#8221; - <em>Every</em></p></li><li><p>&#8220;<strong><a href="https://www.oneusefulthing.org/p/what-it-feels-like-to-work-with-mythos">What it feels like to work with Mythos</a></strong>&#8221; - <em>One Useful Thing</em></p></li></ul><h2>&#129781; Over to you&#8230;</h2><p>Have you tried Fable 5 yet? If so, how much of a difference did you notice on your tasks and projects? Is my hot take way off? Is it way on? 
Am I rambling?</p><p>Leave a comment or drop me a line at <a href="mailto:whytryai@substack.com">whytryai@substack.com</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/p/claude-fable-5/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/p/claude-fable-5/comments"><span>Leave a comment</span></a></p><div><hr></div><h2>Thanks for reading!</h2><p>If you enjoy my writing, here&#8217;s how you can help:</p><ul><li><p>&#10084;&#65039;<strong>Like</strong> this post if it resonates with you.</p></li><li><p>&#128260;<strong>Share</strong> it to help others discover this newsletter.</p></li><li><p>&#128483;&#65039;<strong>Comment</strong> below&#8212;I love hearing your opinions.</p></li></ul><p><strong>Why Try AI</strong> is a passion project, and I&#8217;m grateful to those who help keep it going. 
If you&#8217;d like to support my work and <strong><a href="https://www.whytryai.com/p/paid-subscriber-bonuses">unlock cool perks</a></strong>, consider a paid subscription:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>You can also access it via the <a href="https://platform.claude.com/docs/en/about-claude/models/introducing-claude-fable-5-and-claude-mythos-5">Claude API</a> and cloud providers like <a href="https://aws.amazon.com/blogs/aws/anthropic-claude-fable-5-on-aws-mythos-class-capabilities-with-built-in-safeguards-now-available/">Amazon Bedrock</a>, <a href="https://cloud.google.com/blog/products/ai-machine-learning/cloud-fable-5-on-google-cloud">Google Vertex</a>, and <a href="https://azure.microsoft.com/en-us/blog/claude-fable-5-is-now-available-in-microsoft-foundry-powering-the-next-era-of-autonomous-agents/">Microsoft Foundry</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Anthropic plans to eventually bring Fable 5 back to regular subscription plans, but nobody knows when that will happen.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>I was always looking for an excuse to use this phrase at least 
once.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>I&#8217;m still on the fence about Opus 4.7 and 4.8 based on all the bad press and token burn.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>It may take a sec to triangulate that joke.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Gemini 3: Google’s Silent Knockout Punch]]></title><description><![CDATA[Google proves that a "quiet" launch is enough, as long as you bring receipts.]]></description><link>https://www.whytryai.com/p/gemini-3</link><guid isPermaLink="false">https://www.whytryai.com/p/gemini-3</guid><dc:creator><![CDATA[Daniel Nest]]></dc:creator><pubDate>Thu, 20 Nov 2025 12:21:14 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/e6112eb2-455d-4de1-812d-47b1aeec5e7a_1456x1048.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>TL;DR</h2><p>Google launched the world&#8217;s best model with all the fanfare of a firmware update for a toaster, but the consensus speaks for itself.</p><h2>What is it?</h2><p>Simply put, <a href="https://blog.google/products/gemini/gemini-3/">Gemini 3 Pro</a> is the best language model by virtually every measure. This isn&#8217;t a subjective value judgement. 
<a href="https://blog.google/products/gemini/gemini-3/">Here are the benchmarks</a>:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!toXD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e12839-61f2-409f-a7e1-b682f5bd9976_2420x2212.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!toXD!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e12839-61f2-409f-a7e1-b682f5bd9976_2420x2212.gif 424w, https://substackcdn.com/image/fetch/$s_!toXD!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e12839-61f2-409f-a7e1-b682f5bd9976_2420x2212.gif 848w, https://substackcdn.com/image/fetch/$s_!toXD!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e12839-61f2-409f-a7e1-b682f5bd9976_2420x2212.gif 1272w, https://substackcdn.com/image/fetch/$s_!toXD!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e12839-61f2-409f-a7e1-b682f5bd9976_2420x2212.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!toXD!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e12839-61f2-409f-a7e1-b682f5bd9976_2420x2212.gif" width="1456" height="1331" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/09e12839-61f2-409f-a7e1-b682f5bd9976_2420x2212.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1331,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Table of AI benchmark results comparing Gemini 3 Pro, Gemini 2.5 Pro, Claude Sonnet 4.5, and GPT-5.1 across academic reasoning, math, multimodal understanding, coding, OCR, long-horizon tasks, multilingual Q&amp;A, commonsense, and performance tests, with Gemini 3 Pro leading most categories.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Table of AI benchmark results comparing Gemini 3 Pro, Gemini 2.5 Pro, Claude Sonnet 4.5, and GPT-5.1 across academic reasoning, math, multimodal understanding, coding, OCR, long-horizon tasks, multilingual Q&amp;A, commonsense, and performance tests, with Gemini 3 Pro leading most categories." title="Table of AI benchmark results comparing Gemini 3 Pro, Gemini 2.5 Pro, Claude Sonnet 4.5, and GPT-5.1 across academic reasoning, math, multimodal understanding, coding, OCR, long-horizon tasks, multilingual Q&amp;A, commonsense, and performance tests, with Gemini 3 Pro leading most categories." 
srcset="https://substackcdn.com/image/fetch/$s_!toXD!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e12839-61f2-409f-a7e1-b682f5bd9976_2420x2212.gif 424w, https://substackcdn.com/image/fetch/$s_!toXD!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e12839-61f2-409f-a7e1-b682f5bd9976_2420x2212.gif 848w, https://substackcdn.com/image/fetch/$s_!toXD!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e12839-61f2-409f-a7e1-b682f5bd9976_2420x2212.gif 1272w, https://substackcdn.com/image/fetch/$s_!toXD!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e12839-61f2-409f-a7e1-b682f5bd9976_2420x2212.gif 1456w" sizes="100vw" fetchpriority="high"></picture></div></a><figcaption class="image-caption">Oof! Letting Claude Sonnet 4.5 get away with a 1% lead on SWE-Bench Verified? How embarrassing!</figcaption></figure></div><p>Artificial Analysis Intelligence Index tells the same story:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AMQq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ceb8f7f-b445-433c-b1b8-6f7a74f93574_491x344.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AMQq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ceb8f7f-b445-433c-b1b8-6f7a74f93574_491x344.png 424w, https://substackcdn.com/image/fetch/$s_!AMQq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ceb8f7f-b445-433c-b1b8-6f7a74f93574_491x344.png 848w, https://substackcdn.com/image/fetch/$s_!AMQq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ceb8f7f-b445-433c-b1b8-6f7a74f93574_491x344.png 1272w, https://substackcdn.com/image/fetch/$s_!AMQq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ceb8f7f-b445-433c-b1b8-6f7a74f93574_491x344.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!AMQq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ceb8f7f-b445-433c-b1b8-6f7a74f93574_491x344.png" width="491" height="344" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5ceb8f7f-b445-433c-b1b8-6f7a74f93574_491x344.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:344,&quot;width&quot;:491,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:39920,&quot;alt&quot;:&quot;Bar chart from Artificial Analysis showing the Intelligence Index scores of major AI models, with Gemini 3 Pro leading at 73, followed by GPT-5.1 at 70, GPT-4.1 at 67, and other models like Claude 4.5, QwQ 2.5, and DeepSeek V2 trailing lower.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/179435587?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ceb8f7f-b445-433c-b1b8-6f7a74f93574_491x344.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Bar chart from Artificial Analysis showing the Intelligence Index scores of major AI models, with Gemini 3 Pro leading at 73, followed by GPT-5.1 at 70, GPT-4.1 at 67, and other models like Claude 4.5, QwQ 2.5, and DeepSeek V2 trailing lower." title="Bar chart from Artificial Analysis showing the Intelligence Index scores of major AI models, with Gemini 3 Pro leading at 73, followed by GPT-5.1 at 70, GPT-4.1 at 67, and other models like Claude 4.5, QwQ 2.5, and DeepSeek V2 trailing lower." 
srcset="https://substackcdn.com/image/fetch/$s_!AMQq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ceb8f7f-b445-433c-b1b8-6f7a74f93574_491x344.png 424w, https://substackcdn.com/image/fetch/$s_!AMQq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ceb8f7f-b445-433c-b1b8-6f7a74f93574_491x344.png 848w, https://substackcdn.com/image/fetch/$s_!AMQq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ceb8f7f-b445-433c-b1b8-6f7a74f93574_491x344.png 1272w, https://substackcdn.com/image/fetch/$s_!AMQq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ceb8f7f-b445-433c-b1b8-6f7a74f93574_491x344.png 1456w" sizes="100vw"></picture></div></a><figcaption class="image-caption">Source: <strong><a href="https://artificialanalysis.ai/?intelligence=artificial-analysis-intelligence-index">Artificial Analysis</a></strong></figcaption></figure></div><p>So do the votes from real users on LMArena:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aa3R!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d660ccb-b282-4341-8ac7-d84d166c58df_1223x713.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aa3R!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d660ccb-b282-4341-8ac7-d84d166c58df_1223x713.png 424w, https://substackcdn.com/image/fetch/$s_!aa3R!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d660ccb-b282-4341-8ac7-d84d166c58df_1223x713.png 848w, https://substackcdn.com/image/fetch/$s_!aa3R!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d660ccb-b282-4341-8ac7-d84d166c58df_1223x713.png 1272w, https://substackcdn.com/image/fetch/$s_!aa3R!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d660ccb-b282-4341-8ac7-d84d166c58df_1223x713.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!aa3R!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d660ccb-b282-4341-8ac7-d84d166c58df_1223x713.png" width="1200" height="699.5911692559281" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d660ccb-b282-4341-8ac7-d84d166c58df_1223x713.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:713,&quot;width&quot;:1223,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:109994,&quot;alt&quot;:&quot;Two LMArena leaderboard tables comparing AI model performance: the left table ranks Text models with Gemini-3-Pro at #1, followed by Grok-4.1-Thinking, Grok-4.1, GPT-5.1-High, and Gemini-2.5-Pro; the right table ranks WebDev models, also led by Gemini-3-Pro, with GPT-5-Medium, Claude Opus, and Claude Sonnet models following.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/179435587?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d660ccb-b282-4341-8ac7-d84d166c58df_1223x713.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="Two LMArena leaderboard tables comparing AI model performance: the left table ranks Text models with Gemini-3-Pro at #1, followed by Grok-4.1-Thinking, Grok-4.1, GPT-5.1-High, and Gemini-2.5-Pro; the right table ranks WebDev models, also led by Gemini-3-Pro, with GPT-5-Medium, Claude Opus, and Claude Sonnet models following." 
title="Two LMArena leaderboard tables comparing AI model performance: the left table ranks Text models with Gemini-3-Pro at #1, followed by Grok-4.1-Thinking, Grok-4.1, GPT-5.1-High, and Gemini-2.5-Pro; the right table ranks WebDev models, also led by Gemini-3-Pro, with GPT-5-Medium, Claude Opus, and Claude Sonnet models following." srcset="https://substackcdn.com/image/fetch/$s_!aa3R!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d660ccb-b282-4341-8ac7-d84d166c58df_1223x713.png 424w, https://substackcdn.com/image/fetch/$s_!aa3R!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d660ccb-b282-4341-8ac7-d84d166c58df_1223x713.png 848w, https://substackcdn.com/image/fetch/$s_!aa3R!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d660ccb-b282-4341-8ac7-d84d166c58df_1223x713.png 1272w, https://substackcdn.com/image/fetch/$s_!aa3R!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d660ccb-b282-4341-8ac7-d84d166c58df_1223x713.png 1456w" sizes="100vw"></picture></div></a><figcaption class="image-caption">Source: <strong><a href="https://lmarena.ai/leaderboard/">LMArena Leaderboards</a></strong></figcaption></figure></div><p>Even the normally hype-averse, high-signal-to-noise channel&nbsp;<em>AI Explained</em>&nbsp;opens with, &#8220;For me, [Gemini 3] genuinely marks a new chapter in the race to true artificial intelligence.&#8221;:</p><div id="youtube2-chr2I7CZTfk" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;chr2I7CZTfk&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/chr2I7CZTfk?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>Reddit, typically the first place to surface real-world critical takes, is <a href="https://www.reddit.com/search/?q=gemini+3&amp;type=posts&amp;cId=6bde26a6-c783-4976-9d75-9e9d66f3e0ad&amp;iId=04e0f096-884e-4f18-af1f-cbc8ab6da152&amp;captcha=1">also largely positive</a>:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" 
href="https://www.reddit.com/search/?q=gemini+3&amp;type=posts&amp;cId=6bde26a6-c783-4976-9d75-9e9d66f3e0ad&amp;iId=04e0f096-884e-4f18-af1f-cbc8ab6da152&amp;captcha=1" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Kf-o!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ac4cad5-9fbf-4349-84d5-6a6c43784769_742x800.png 424w, https://substackcdn.com/image/fetch/$s_!Kf-o!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ac4cad5-9fbf-4349-84d5-6a6c43784769_742x800.png 848w, https://substackcdn.com/image/fetch/$s_!Kf-o!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ac4cad5-9fbf-4349-84d5-6a6c43784769_742x800.png 1272w, https://substackcdn.com/image/fetch/$s_!Kf-o!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ac4cad5-9fbf-4349-84d5-6a6c43784769_742x800.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Kf-o!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ac4cad5-9fbf-4349-84d5-6a6c43784769_742x800.png" width="742" height="800" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3ac4cad5-9fbf-4349-84d5-6a6c43784769_742x800.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:742,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:100024,&quot;alt&quot;:&quot;Reddit search results page showing top all-time posts about Gemini 3, including headlines like &#8220;Gemini 3 is what gpt 5 should have been,&#8221; &#8220;Gemini 3 Pro first 
impressions,&#8221; &#8220;I want 2.5 back,&#8221; and discussions about Gemini 3&#8217;s business potential and rapid rise in popularity.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://www.reddit.com/search/?q=gemini+3&amp;type=posts&amp;cId=6bde26a6-c783-4976-9d75-9e9d66f3e0ad&amp;iId=04e0f096-884e-4f18-af1f-cbc8ab6da152&amp;captcha=1&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/179435587?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ac4cad5-9fbf-4349-84d5-6a6c43784769_742x800.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Reddit search results page showing top all-time posts about Gemini 3, including headlines like &#8220;Gemini 3 is what gpt 5 should have been,&#8221; &#8220;Gemini 3 Pro first impressions,&#8221; &#8220;I want 2.5 back,&#8221; and discussions about Gemini 3&#8217;s business potential and rapid rise in popularity." title="Reddit search results page showing top all-time posts about Gemini 3, including headlines like &#8220;Gemini 3 is what gpt 5 should have been,&#8221; &#8220;Gemini 3 Pro first impressions,&#8221; &#8220;I want 2.5 back,&#8221; and discussions about Gemini 3&#8217;s business potential and rapid rise in popularity." 
srcset="https://substackcdn.com/image/fetch/$s_!Kf-o!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ac4cad5-9fbf-4349-84d5-6a6c43784769_742x800.png 424w, https://substackcdn.com/image/fetch/$s_!Kf-o!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ac4cad5-9fbf-4349-84d5-6a6c43784769_742x800.png 848w, https://substackcdn.com/image/fetch/$s_!Kf-o!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ac4cad5-9fbf-4349-84d5-6a6c43784769_742x800.png 1272w, https://substackcdn.com/image/fetch/$s_!Kf-o!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ac4cad5-9fbf-4349-84d5-6a6c43784769_742x800.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">The &#8220;I want 2.5 back&#8221; is actually a satirical post with the opposite message.</figcaption></figure></div><p>I could keep going, but I&#8217;m sure I&#8217;ve made my point: Gemini 3 is genuinely, provably impressive across the board.</p><p>No, it isn&#8217;t flawless. Yes, it <a href="https://storage.googleapis.com/deepmind-media/Model-Cards/Gemini-3-Pro-Model-Card.pdf">still hallucinates</a> and makes silly errors. But it&#8217;s hard to argue with the fact that Gemini 3 Pro is currently the best model available.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?coupon=9e8acfc3&amp;utm_content=179435587&quot;,&quot;text&quot;:&quot;Get 30% off for 1 year&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?coupon=9e8acfc3&amp;utm_content=179435587"><span>Get 30% off for 1 year</span></a></p><h2>How do you use it?</h2><p>Good news: Gemini 3 Pro has already rolled out to all Gemini app users.</p><p>So just head to <a href="https://gemini.google.com/app">gemini.google.com</a> and log in with your Google account:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8O4K!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a89a7d9-ee3c-4426-9616-5c6595ba02d0_993x435.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!8O4K!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a89a7d9-ee3c-4426-9616-5c6595ba02d0_993x435.png 424w, https://substackcdn.com/image/fetch/$s_!8O4K!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a89a7d9-ee3c-4426-9616-5c6595ba02d0_993x435.png 848w, https://substackcdn.com/image/fetch/$s_!8O4K!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a89a7d9-ee3c-4426-9616-5c6595ba02d0_993x435.png 1272w, https://substackcdn.com/image/fetch/$s_!8O4K!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a89a7d9-ee3c-4426-9616-5c6595ba02d0_993x435.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8O4K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a89a7d9-ee3c-4426-9616-5c6595ba02d0_993x435.png" width="993" height="435" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6a89a7d9-ee3c-4426-9616-5c6595ba02d0_993x435.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:435,&quot;width&quot;:993,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:60688,&quot;alt&quot;:&quot;Google Gemini interface showing the model selector expanded, with an orange arrow pointing to &#8220;Thinking with 3 Pro&#8221; as the chosen model 
option.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/179435587?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0626d3f3-af20-4d00-9d53-9584f40a0afe_993x435.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Google Gemini interface showing the model selector expanded, with an orange arrow pointing to &#8220;Thinking with 3 Pro&#8221; as the chosen model option." title="Google Gemini interface showing the model selector expanded, with an orange arrow pointing to &#8220;Thinking with 3 Pro&#8221; as the chosen model option." srcset="https://substackcdn.com/image/fetch/$s_!8O4K!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a89a7d9-ee3c-4426-9616-5c6595ba02d0_993x435.png 424w, https://substackcdn.com/image/fetch/$s_!8O4K!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a89a7d9-ee3c-4426-9616-5c6595ba02d0_993x435.png 848w, https://substackcdn.com/image/fetch/$s_!8O4K!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a89a7d9-ee3c-4426-9616-5c6595ba02d0_993x435.png 1272w, https://substackcdn.com/image/fetch/$s_!8O4K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a89a7d9-ee3c-4426-9616-5c6595ba02d0_993x435.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" 
stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Remember to pick &#8220;Thinking with 3 Pro&#8221; to trigger Gemini 3 Pro</figcaption></figure></div><p>If you hit a message limit or simply prefer using Google AI Studio, you can also try Gemini 3 over at <a href="https://aistudio.google.com/">aistudio.google.com</a>:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!LE3E!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb59a4644-c99c-494c-a543-4b774cfa0f9e_960x240.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!LE3E!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb59a4644-c99c-494c-a543-4b774cfa0f9e_960x240.png 424w, https://substackcdn.com/image/fetch/$s_!LE3E!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb59a4644-c99c-494c-a543-4b774cfa0f9e_960x240.png 848w, https://substackcdn.com/image/fetch/$s_!LE3E!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb59a4644-c99c-494c-a543-4b774cfa0f9e_960x240.png 1272w, https://substackcdn.com/image/fetch/$s_!LE3E!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb59a4644-c99c-494c-a543-4b774cfa0f9e_960x240.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LE3E!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb59a4644-c99c-494c-a543-4b774cfa0f9e_960x240.png" width="960" height="240" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b59a4644-c99c-494c-a543-4b774cfa0f9e_960x240.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:240,&quot;width&quot;:960,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:33933,&quot;alt&quot;:&quot;Google Gemini &#8220;Build your ideas&#8221; interface with an orange arrow pointing to the selected model label reading &#8220;Gemini 3 Pro Preview.&#8221; - Google AI 
Studio&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/179435587?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4aba1fa4-9841-45a0-a32e-d309bff35621_960x240.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Google Gemini &#8220;Build your ideas&#8221; interface with an orange arrow pointing to the selected model label reading &#8220;Gemini 3 Pro Preview.&#8221; - Google AI Studio" title="Google Gemini &#8220;Build your ideas&#8221; interface with an orange arrow pointing to the selected model label reading &#8220;Gemini 3 Pro Preview.&#8221; - Google AI Studio" srcset="https://substackcdn.com/image/fetch/$s_!LE3E!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb59a4644-c99c-494c-a543-4b774cfa0f9e_960x240.png 424w, https://substackcdn.com/image/fetch/$s_!LE3E!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb59a4644-c99c-494c-a543-4b774cfa0f9e_960x240.png 848w, https://substackcdn.com/image/fetch/$s_!LE3E!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb59a4644-c99c-494c-a543-4b774cfa0f9e_960x240.png 1272w, https://substackcdn.com/image/fetch/$s_!LE3E!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb59a4644-c99c-494c-a543-4b774cfa0f9e_960x240.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Go ahead: Ask Gemini 3 to vibe code an app, perform research, review your work, analyze images, or however you prefer to put models through their paces.</p><p 
class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?coupon=9e8acfc3&amp;utm_content=179435587&quot;,&quot;text&quot;:&quot;Get 30% off for 1 year&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?coupon=9e8acfc3&amp;utm_content=179435587"><span>Get 30% off for 1 year</span></a></p><h2>Why should you care?</h2><p>When OpenAI was about to announce GPT-5, the hype couldn&#8217;t get any louder.<br><br>Sam Altman was teasing the launch for days, culminating with <a href="https://x.com/sama/status/1953264193890861114">this (in)famous and confusing tweet</a> one day before the livestream:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/sama/status/1953264193890861114" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MpSz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04fdfae9-41b1-4ef4-9cf1-d3e0ccb69994_653x448.png 424w, https://substackcdn.com/image/fetch/$s_!MpSz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04fdfae9-41b1-4ef4-9cf1-d3e0ccb69994_653x448.png 848w, https://substackcdn.com/image/fetch/$s_!MpSz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04fdfae9-41b1-4ef4-9cf1-d3e0ccb69994_653x448.png 1272w, https://substackcdn.com/image/fetch/$s_!MpSz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04fdfae9-41b1-4ef4-9cf1-d3e0ccb69994_653x448.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!MpSz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04fdfae9-41b1-4ef4-9cf1-d3e0ccb69994_653x448.png" width="653" height="448" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/04fdfae9-41b1-4ef4-9cf1-d3e0ccb69994_653x448.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:448,&quot;width&quot;:653,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:169354,&quot;alt&quot;:&quot;Tweet from Sam Altman showing Earth&#8217;s horizon with a large, shadowed Death Star emerging above the clouds.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/sama/status/1953264193890861114&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/179435587?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04fdfae9-41b1-4ef4-9cf1-d3e0ccb69994_653x448.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Tweet from Sam Altman showing Earth&#8217;s horizon with a large, shadowed Death Star emerging above the clouds." title="Tweet from Sam Altman showing Earth&#8217;s horizon with a large, shadowed Death Star emerging above the clouds." 
srcset="https://substackcdn.com/image/fetch/$s_!MpSz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04fdfae9-41b1-4ef4-9cf1-d3e0ccb69994_653x448.png 424w, https://substackcdn.com/image/fetch/$s_!MpSz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04fdfae9-41b1-4ef4-9cf1-d3e0ccb69994_653x448.png 848w, https://substackcdn.com/image/fetch/$s_!MpSz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04fdfae9-41b1-4ef4-9cf1-d3e0ccb69994_653x448.png 1272w, https://substackcdn.com/image/fetch/$s_!MpSz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04fdfae9-41b1-4ef4-9cf1-d3e0ccb69994_653x448.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" 
stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Ah, the Death Star. Exactly what you want people to associate with your AI model.</figcaption></figure></div><p>The livestream itself lasted <em>almost</em> <em>1.5 hours</em>, with the team showcasing their new model&#8217;s capabilities like proud parents parading their child at a beauty pageant:</p><div id="youtube2-0Uu_VJeVVfo" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;0Uu_VJeVVfo&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/0Uu_VJeVVfo?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>Many have since speculated that this level of pre-launch publicity and showmanship was a big reason for what ultimately ended up being an <a href="https://www.whytryai.com/i/170795882/the-big-one-gpt-saga">underwhelming rollout</a>.</p><p>When Gemini 3 launched, Google&#8230;<a href="https://blog.google/products/gemini/gemini-3/">quietly published</a> a few <a href="https://cloud.google.com/blog/products/ai-machine-learning/gemini-3-is-available-for-enterprise">text-heavy blog posts</a>.</p><p>That&#8217;s it.</p><p>No flashy livestreams. No presentations. 
No pre-launch hype cycle.</p><p>Not even a single premature Death Star tweet by Sundar Pichai.</p><p>As you know, I&#8217;ve been <a href="https://www.whytryai.com/p/nano-banana">railing against excessive hype</a> for as long as this newsletter has existed.</p><p>So to me, it&#8217;s nice to see a company letting a model&#8217;s real-world performance do the heavy lifting without having to hype it up. Google&#8217;s approach proves that this path works, as long as your model delivers the goods.</p><p>I really wish this were the default in the GenAI space.</p><p>Who knows, maybe the launch of Gemini 3 marks the moment we all collectively take a chill pill and start talking about major launches in a calm and measured way.</p><p>Right?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FaL7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a3ab0c-684d-416c-8b7f-896ad631d86f_682x688.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FaL7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a3ab0c-684d-416c-8b7f-896ad631d86f_682x688.png 424w, https://substackcdn.com/image/fetch/$s_!FaL7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a3ab0c-684d-416c-8b7f-896ad631d86f_682x688.png 848w, https://substackcdn.com/image/fetch/$s_!FaL7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a3ab0c-684d-416c-8b7f-896ad631d86f_682x688.png 1272w, 
https://substackcdn.com/image/fetch/$s_!FaL7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a3ab0c-684d-416c-8b7f-896ad631d86f_682x688.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FaL7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a3ab0c-684d-416c-8b7f-896ad631d86f_682x688.png" width="682" height="688" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/23a3ab0c-684d-416c-8b7f-896ad631d86f_682x688.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:688,&quot;width&quot;:682,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:163926,&quot;alt&quot;:&quot;Google search video results showing YouTube thumbnails and titles about Gemini 3.0, with creators claiming it &#8220;killed&#8221; or &#8220;destroyed&#8221; AI coding tools, outperformed other AI models, and enabled rapid app and voice-agent builds.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/179435587?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a3ab0c-684d-416c-8b7f-896ad631d86f_682x688.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Google search video results showing YouTube thumbnails and titles about Gemini 3.0, with creators claiming it &#8220;killed&#8221; or &#8220;destroyed&#8221; AI coding tools, outperformed other AI models, and enabled rapid app and voice-agent builds." 
title="Google search video results showing YouTube thumbnails and titles about Gemini 3.0, with creators claiming it &#8220;killed&#8221; or &#8220;destroyed&#8221; AI coding tools, outperformed other AI models, and enabled rapid app and voice-agent builds." srcset="https://substackcdn.com/image/fetch/$s_!FaL7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a3ab0c-684d-416c-8b7f-896ad631d86f_682x688.png 424w, https://substackcdn.com/image/fetch/$s_!FaL7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a3ab0c-684d-416c-8b7f-896ad631d86f_682x688.png 848w, https://substackcdn.com/image/fetch/$s_!FaL7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a3ab0c-684d-416c-8b7f-896ad631d86f_682x688.png 1272w, https://substackcdn.com/image/fetch/$s_!FaL7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a3ab0c-684d-416c-8b7f-896ad631d86f_682x688.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 
13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Damn it!</p><h2>Further reading &amp; watching</h2><ul><li><p>&#8220;<a href="https://www.ignorance.ai/p/gemini-3-and-googles-antigravity">Gemini 3 and Google&#8217;s Antigravity Trajectory</a>&#8221; - <em>AI Ignorance</em></p></li><li><p>&#8220;<a href="https://www.youtube.com/watch?v=sQt3wuG0Kik">Google Gemini 3 Is a Powerhouse</a>&#8221; [YouTube] - <em>Theoretically Media</em></p></li><li><p>&#8220;<a href="https://www.thealgorithmicbridge.com/p/google-gemini-3-just-killed-every">Google Gemini 3 Is the Best Model Ever. 
One Score Stands Out Above the Rest</a>&#8221; - <em>The Algorithmic Bridge</em></p></li><li><p>&#8220;<a href="https://www.youtube.com/watch?v=UH2_Sgeu4lc">Gemini 3 just crushed everything</a>&#8221; [YouTube] - <em>AI Search</em></p></li><li><p>&#8220;<a href="https://www.oneusefulthing.org/p/three-years-from-gpt-3-to-gemini">Three Years from GPT-3 to Gemini 3</a>&#8221; - <em>One Useful Thing</em></p></li><li><p>&#8220;<a href="https://every.to/vibe-check/vibe-check-gemini-3-pro-a-reliable-workhorse-with-surprising-flair">Vibe Check: Gemini 3 Pro, A Reliable Workhorse With Surprising Flair</a>&#8221; - <em>Every</em></p></li></ul><h2>&#129781; Over to you&#8230;</h2><p>What are your first impressions of Gemini 3 Pro? How does it compare to your current go-to model? 
Do you have any practical use cases to share?</p><p>Leave a comment or drop me a line at <a href="mailto:whytryai@substack.com">whytryai@substack.com</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/p/gemini-3/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/p/gemini-3/comments"><span>Leave a comment</span></a></p><div><hr></div><h2>Thanks for reading!</h2><p>If you enjoy my writing, here&#8217;s how you can help:</p><ul><li><p>&#10084;&#65039;<strong>Like</strong> this post if it resonates with you.</p></li><li><p>&#128260;<strong>Share</strong> it to help others discover this newsletter.</p></li><li><p>&#128483;&#65039;<strong>Comment</strong> below&#8212;I love hearing your opinions.</p></li></ul><p><strong>Why Try AI</strong> is a passion project, and I&#8217;m grateful to those who help keep it going. If you&#8217;d like to support my work and <strong><a href="https://www.whytryai.com/p/paid-subscriber-bonuses">unlock cool perks</a></strong>, consider a paid subscription:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?coupon=9e8acfc3&amp;utm_content=179435587&quot;,&quot;text&quot;:&quot;Get 30% off for 1 year&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?coupon=9e8acfc3&amp;utm_content=179435587"><span>Get 30% off for 1 year</span></a></p>]]></content:encoded></item><item><title><![CDATA[Sora 2: Amazing Model. 
Dubious Rollout.]]></title><description><![CDATA[OpenAI chose to release its top-tier video model inside a gimmicky social app.]]></description><link>https://www.whytryai.com/p/sora-2</link><guid isPermaLink="false">https://www.whytryai.com/p/sora-2</guid><dc:creator><![CDATA[Daniel Nest]]></dc:creator><pubDate>Thu, 02 Oct 2025 11:50:46 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/927b53eb-25de-4503-9268-8a4b4b169715_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>TL;DR</h2><p>Sora 2 is a genuinely impressive video model, but it&#8217;s baked into a social media app built for ultra-short meme clips, which undersells its creative potential.</p><h2>What is it?</h2><p>When OpenAI first teased its original Sora model back in February 2024, <a href="https://www.whytryai.com/i/141610093/openais-sora-ushers-in-a-new-era-of-text-to-video">people went nuts</a>. But Sora wouldn&#8217;t launch to the public until December, and by that time, it was well behind many <a href="https://www.whytryai.com/p/free-ai-image-to-video-tools-tested">top-tier video models</a>.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>With Sora 2, OpenAI is convincingly back in the game:</p><div id="youtube2-gzneGhpXwjU" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;gzneGhpXwjU&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/gzneGhpXwjU?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>There&#8217;s <a href="https://openai.com/index/sora-2/">a lot to like about Sora 2</a>.</p><p>It can handle complex physics, including intricate scenes 
like gymnastics and fight sequences. It can create videos in many different styles. And, like Veo 3, Sora 2 natively generates its own audio effects and dialogue.</p><p>While there are no third-party benchmarks or leaderboard rankings yet, Sora 2 feels roughly on par with Veo 3 (depending on the use case).</p><p>Here are three quick comparisons of my own:</p><blockquote><p><strong>Prompt #1</strong><em>: Over-excited blonde influencer is holding a smartphone with the Substack feed on it. She near-screams: &#8220;You guys have to check out Why Try AI. It&#8217;s so, so good. But what do I know? I don&#8217;t even exist!&#8221;</em></p></blockquote><p><strong>Veo 3:</strong></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;b57adf0a-2900-41f2-8169-0bc090958ce5&quot;,&quot;duration&quot;:null}"></div><p><strong>Sora 2:</strong></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;576c01ed-cdc9-42c8-87d0-efcc2daba459&quot;,&quot;duration&quot;:null}"></div><p>I don&#8217;t know why both models decided to throw in a semi-psychotic giggle at the end, entirely unprompted, but here we are.</p><blockquote><p><strong>Prompt #2</strong><em>: A drunken 1800s pirate tries to use a modern laptop but can&#8217;t figure it out. 
In frustration, he bangs the keys and says &#8220;Blast this shiny chest o&#8217; letters&#8212;won&#8217;t open no matter how I pound it!&#8221;</em></p></blockquote><p><strong>Veo 3:</strong></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;54697c45-a2f1-47f8-9cbb-279496f772e8&quot;,&quot;duration&quot;:null}"></div><p><strong>Sora 2:</strong></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;8efd4de4-adff-4b18-af0f-bfe52e908998&quot;,&quot;duration&quot;:null}"></div><p>I love the bonus details, from Veo 3&#8217;s exploding laptop to Sora 2&#8217;s off-script method acting improv.</p><blockquote><p><strong>Prompt #3</strong><em>: A horse wearing a top hat tap dances to music on its hind legs</em></p></blockquote><p><strong>Veo 3:</strong></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;e152e1ab-8538-4a42-854c-ce07a51a544e&quot;,&quot;duration&quot;:null}"></div><p><strong>Sora 2:</strong></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;b5661328-7827-44ee-bd1c-84d752262cf4&quot;,&quot;duration&quot;:null}"></div><p>Veo 3 looks more realistic, but I like how Sora 2 came up with an entire jingle. 
No true dancing on hind legs from either model, however.</p><p>The bottom line is that both are impressive and not too far apart.</p><h2>How do you use it?</h2><p>First, a whole bunch of caveats:</p><ul><li><p>The Sora app is only available for iOS for now. We Android users can go fish.</p></li><li><p>It&#8217;s only out in the US and Canada. (<a href="https://www.whytryai.com/i/136465476/nordvpn-my-personal-go-to-vpn">But a VPN works</a>.)</p></li><li><p>Access is <em>invite-only</em>, so you&#8217;ll have to wait or actively hunt for codes.</p></li></ul><p>Here&#8217;s the general process:</p><ol><li><p>Download <a href="https://apps.apple.com/us/app/sora-by-openai/id6744034028">the iOS app</a> or visit <a href="https://sora.com/">sora.com</a>.</p></li><li><p>Sign in with your ChatGPT account.</p></li><li><p>If you don&#8217;t have an invite code, see below.</p></li><li><p>Once you have an invite code, enter it to gain access:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3ggP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F849e5c4b-5aaa-43f1-9419-1938f344da1f_489x343.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3ggP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F849e5c4b-5aaa-43f1-9419-1938f344da1f_489x343.png 
424w, https://substackcdn.com/image/fetch/$s_!3ggP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F849e5c4b-5aaa-43f1-9419-1938f344da1f_489x343.png 848w, https://substackcdn.com/image/fetch/$s_!3ggP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F849e5c4b-5aaa-43f1-9419-1938f344da1f_489x343.png 1272w, https://substackcdn.com/image/fetch/$s_!3ggP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F849e5c4b-5aaa-43f1-9419-1938f344da1f_489x343.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3ggP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F849e5c4b-5aaa-43f1-9419-1938f344da1f_489x343.png" width="439" height="307.9284253578732" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/849e5c4b-5aaa-43f1-9419-1938f344da1f_489x343.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:343,&quot;width&quot;:489,&quot;resizeWidth&quot;:439,&quot;bytes&quot;:152223,&quot;alt&quot;:&quot;Meet the new Sora invite code screen&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/175009850?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F849e5c4b-5aaa-43f1-9419-1938f344da1f_489x343.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Meet the new Sora invite code screen" title="Meet the new Sora invite code screen" 
srcset="https://substackcdn.com/image/fetch/$s_!3ggP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F849e5c4b-5aaa-43f1-9419-1938f344da1f_489x343.png 424w, https://substackcdn.com/image/fetch/$s_!3ggP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F849e5c4b-5aaa-43f1-9419-1938f344da1f_489x343.png 848w, https://substackcdn.com/image/fetch/$s_!3ggP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F849e5c4b-5aaa-43f1-9419-1938f344da1f_489x343.png 1272w, https://substackcdn.com/image/fetch/$s_!3ggP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F849e5c4b-5aaa-43f1-9419-1938f344da1f_489x343.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" 
stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p>Now you get the familiar prompt box to generate videos. All clips default to 9 seconds. The only thing you can tweak is the orientation (portrait or landscape):</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lRvW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b007f48-c2a4-4dcc-bcc0-43362a334e84_783x117.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lRvW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b007f48-c2a4-4dcc-bcc0-43362a334e84_783x117.png 424w, https://substackcdn.com/image/fetch/$s_!lRvW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b007f48-c2a4-4dcc-bcc0-43362a334e84_783x117.png 848w, https://substackcdn.com/image/fetch/$s_!lRvW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b007f48-c2a4-4dcc-bcc0-43362a334e84_783x117.png 1272w, https://substackcdn.com/image/fetch/$s_!lRvW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b007f48-c2a4-4dcc-bcc0-43362a334e84_783x117.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!lRvW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b007f48-c2a4-4dcc-bcc0-43362a334e84_783x117.png" width="783" height="117" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1b007f48-c2a4-4dcc-bcc0-43362a334e84_783x117.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:117,&quot;width&quot;:783,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:31085,&quot;alt&quot;:&quot;Prompt: \&quot;Crying man complains about having to sell his soul for a Sora 2 invite code\&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/175009850?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b007f48-c2a4-4dcc-bcc0-43362a334e84_783x117.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Prompt: &quot;Crying man complains about having to sell his soul for a Sora 2 invite code&quot;" title="Prompt: &quot;Crying man complains about having to sell his soul for a Sora 2 invite code&quot;" srcset="https://substackcdn.com/image/fetch/$s_!lRvW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b007f48-c2a4-4dcc-bcc0-43362a334e84_783x117.png 424w, https://substackcdn.com/image/fetch/$s_!lRvW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b007f48-c2a4-4dcc-bcc0-43362a334e84_783x117.png 848w, 
https://substackcdn.com/image/fetch/$s_!lRvW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b007f48-c2a4-4dcc-bcc0-43362a334e84_783x117.png 1272w, https://substackcdn.com/image/fetch/$s_!lRvW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b007f48-c2a4-4dcc-bcc0-43362a334e84_783x117.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><ol start="6"><li><p>Enjoy your Sora videos:</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;c580d5e4-55dc-4602-a677-0947f6c97457&quot;,&quot;duration&quot;:null}"></div></li></ol><p><strong>So, how do you get an invite code?</strong></p><p>You&#8217;ve got a few options:</p><ol><li><p>Wait patiently until OpenAI rolls out Sora 2 more broadly.</p></li><li><p>Hunt for new codes on forums like <a href="https://www.reddit.com/r/OpenAI/comments/1nukmm2/open_ai_sora_2_invite_codes_megathread/">this Reddit megathread</a>.</p></li><li><p>Try obsessively refreshing <a href="https://sora-invite.vercel.app">sora-invite.vercel.app</a> (that&#8217;s how I got my code).</p></li><li><p>Ask a friend who has invite codes to share.</p></li></ol><p>On that note, I was given 4 invite codes when I signed up. Send me a message if you want one. 
My paid subscribers get priority, but if none of them want a code, I&#8217;ll happily give it to any of my readers.</p><div class="directMessage button" data-attrs="{&quot;userId&quot;:103658370,&quot;userName&quot;:&quot;Daniel Nest&quot;,&quot;canDm&quot;:null,&quot;dmUpgradeOptions&quot;:null,&quot;isEditorNode&quot;:true}" data-component-name="DirectMessageToDOM"></div><h2>Why should you care?</h2><p>As much as I like Sora 2 itself, two things bug me about this launch.</p><p>First, the invite-only rollout.</p><p>This predictably annoyed existing paying customers, especially those on a $200 / month Pro account:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vUP_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e06a42-ed02-4c40-9894-82fb76917a41_742x170.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vUP_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e06a42-ed02-4c40-9894-82fb76917a41_742x170.png 424w, https://substackcdn.com/image/fetch/$s_!vUP_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e06a42-ed02-4c40-9894-82fb76917a41_742x170.png 848w, https://substackcdn.com/image/fetch/$s_!vUP_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e06a42-ed02-4c40-9894-82fb76917a41_742x170.png 1272w, https://substackcdn.com/image/fetch/$s_!vUP_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e06a42-ed02-4c40-9894-82fb76917a41_742x170.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!vUP_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e06a42-ed02-4c40-9894-82fb76917a41_742x170.png" width="742" height="170" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/15e06a42-ed02-4c40-9894-82fb76917a41_742x170.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:170,&quot;width&quot;:742,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:25350,&quot;alt&quot;:&quot;WHAT IS THE POINT of $200 / month if we don't get access to models like Sora 2???? Discussion Not sure why I am paying $200 / month for pro when I need to beg for access codes to new models.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/175009850?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e06a42-ed02-4c40-9894-82fb76917a41_742x170.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="WHAT IS THE POINT of $200 / month if we don't get access to models like Sora 2???? Discussion Not sure why I am paying $200 / month for pro when I need to beg for access codes to new models." title="WHAT IS THE POINT of $200 / month if we don't get access to models like Sora 2???? Discussion Not sure why I am paying $200 / month for pro when I need to beg for access codes to new models." 
srcset="https://substackcdn.com/image/fetch/$s_!vUP_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e06a42-ed02-4c40-9894-82fb76917a41_742x170.png 424w, https://substackcdn.com/image/fetch/$s_!vUP_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e06a42-ed02-4c40-9894-82fb76917a41_742x170.png 848w, https://substackcdn.com/image/fetch/$s_!vUP_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e06a42-ed02-4c40-9894-82fb76917a41_742x170.png 1272w, https://substackcdn.com/image/fetch/$s_!vUP_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e06a42-ed02-4c40-9894-82fb76917a41_742x170.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Here&#8217;s the <strong><a href="https://www.reddit.com/r/OpenAI/comments/1nuqdmj/what_is_the_point_of_200_month_if_we_dont_get/">Reddit thread</a></strong>.</figcaption></figure></div><p>But beyond that, it triggered a proper frenzy with Reddit code-sharing megathreads and shady &#8220;codes for money&#8221; schemes.</p><p>Not exactly a feel-good start.</p><p>Secondly, and more importantly, OpenAI has chosen to lean <em>heavily</em> into the &#8220;social media app&#8221; aspect with this rollout.</p><p>On the <a href="https://openai.com/index/sora-2/">Sora 2 announcement page</a>, OpenAI has some lofty words about general-purpose world models and how life-changing they will be:</p><blockquote><p><em>Since then, the Sora team has been focused on training models with more advanced world simulation capabilities. 
We believe such systems will be critical for training AI models that deeply understand the physical world.</em></p><p><em>[&#8230;]</em></p><p><em>General-purpose world simulators and robotic agents will fundamentally reshape society and accelerate the arc of human progress. Sora 2 represents significant progress towards that goal. In keeping with OpenAI&#8217;s mission, it is important that humanity benefits from these models as they are developed.</em></p></blockquote><p>But here&#8217;s how they position the mobile app:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ouh_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F321e16fe-6887-452a-a608-5c404c048508_995x559.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ouh_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F321e16fe-6887-452a-a608-5c404c048508_995x559.png 424w, https://substackcdn.com/image/fetch/$s_!Ouh_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F321e16fe-6887-452a-a608-5c404c048508_995x559.png 848w, https://substackcdn.com/image/fetch/$s_!Ouh_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F321e16fe-6887-452a-a608-5c404c048508_995x559.png 1272w, https://substackcdn.com/image/fetch/$s_!Ouh_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F321e16fe-6887-452a-a608-5c404c048508_995x559.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!Ouh_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F321e16fe-6887-452a-a608-5c404c048508_995x559.png" width="995" height="559" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/321e16fe-6887-452a-a608-5c404c048508_995x559.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:559,&quot;width&quot;:995,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:491402,&quot;alt&quot;:&quot;iPhone Screenshots for Sora 2 iOS app&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/175009850?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F321e16fe-6887-452a-a608-5c404c048508_995x559.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="iPhone Screenshots for Sora 2 iOS app" title="iPhone Screenshots for Sora 2 iOS app" srcset="https://substackcdn.com/image/fetch/$s_!Ouh_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F321e16fe-6887-452a-a608-5c404c048508_995x559.png 424w, https://substackcdn.com/image/fetch/$s_!Ouh_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F321e16fe-6887-452a-a608-5c404c048508_995x559.png 848w, https://substackcdn.com/image/fetch/$s_!Ouh_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F321e16fe-6887-452a-a608-5c404c048508_995x559.png 1272w, 
https://substackcdn.com/image/fetch/$s_!Ouh_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F321e16fe-6887-452a-a608-5c404c048508_995x559.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <strong><a href="https://apps.apple.com/us/app/sora-by-openai/id6744034028">Sora 2 iOS app page</a></strong>.</figcaption></figure></div><p>&#8220;Accelerate the arc of human progress,&#8221; indeed.</p><p>The team spends <a href="https://www.youtube.com/watch?v=gzneGhpXwjU">most of the Sora 2 livestream</a> talking about the social app 
and its many gimmicks, from video remixing to &#8220;Cameos&#8221; that let you insert yourself and friends into short video clips.</p><p>Look, I like silly shenanigans as much as anyone.</p><p>My own Sora 2 tests feature drunk 1800s pirates and tap-dancing horses.</p><p>I&#8217;m also bullish on AI&#8217;s potential to unlock our inner creators and help us tell stories we otherwise couldn&#8217;t. Here&#8217;s my first-ever post on this very Substack:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;d671b28f-bff1-4a17-9cee-0673cbb1f203&quot;,&quot;caption&quot;:&quot;Let&#8217;s get one thing out of the way: I suck at drawing.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;showDescription&quot;:true,&quot;showImage&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Turn Your Doodles Into Art Using AI (With Examples)&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:103658370,&quot;name&quot;:&quot;Daniel Nest&quot;,&quot;bio&quot;:&quot;I write about generative AI for the average person. 
I love experimenting with all GenAI, including AI images, video, music, chatbots, and more.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3cf75e3-f197-48b0-999b-d73cbb1a8ad5_1321x1321.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2022-09-13T12:16:41.134Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/h_600,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0cd748ef-fbe2-4885-a904-b9fd95f09ffe_768x512.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.whytryai.com/p/turn-your-doodles-into-art-with-ai&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:72382058,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:2,&quot;comment_count&quot;:0,&quot;publication_id&quot;:1077462,&quot;publication_name&quot;:&quot;Why Try AI&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!raEn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c4d0362-24d4-4046-9ccd-cb331c34edc4_1024x1024.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p>But I&#8217;m not sure we need yet another social media app with an endless feed of hyper-short video clips.</p><p>It&#8217;s the <a href="https://www.whytryai.com/i/160256477/sunday-bonus-use-cases-for-gpt-o-image-generation-swipe-file">&#8220;Studio Ghibli version of my face&#8221; moment</a> all over again, except this time OpenAI is actively encouraging it.</p><p>We have so many short-form dopamine machines already. Do we need one built exclusively around AI slop?</p><p>Is &#8220;Remix 
this viral meme but with me in it&#8221; the pinnacle of our creative potential and the best proof-of-concept use case for a video model as powerful as Sora 2?</p><p>To their credit, the OpenAI team appears to have given serious thought to what a responsible social app rollout might look like, at least on paper.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> They talk about steps to prevent doomscrolling, encourage creation vs. consumption, and so on.</p><p>Then again, I&#8217;m sure the Jurassic Park staff also had seemingly robust safety procedures in place. Also, here&#8217;s something that applies equally to both:</p><div id="youtube2-g3j9muCo4o0" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;g3j9muCo4o0&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/g3j9muCo4o0?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>Google has always positioned Veo 3 as a tool for serious filmmakers with its <a href="https://labs.google/flow/about">Flow platform</a>.</p><p>OpenAI, instead, is betting on mainstream appeal and shareability with a meme <s>slop</s> slot machine.</p><p>As far as winning the attention game goes, OpenAI&#8217;s play is clearly the right move.</p><p>But as someone who tries to see AI as a force for good, I can&#8217;t help but feel this is a step in the wrong direction.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div 
class="preamble"><p class="cta-caption">Why Try AI is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>&#129781; Over to you&#8230;</h2><p>What&#8217;s your take on OpenAI&#8217;s social media play? Does it discredit Sora&#8217;s promise by turning it into a viral gimmick, or is it just a fun way to reach more users? Do we need more endlessly scrolling social apps in our lives? Is the &#8220;Cameo&#8221; feature awesome or dystopian? Am I just an old man yelling at clouds?</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rHdD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c6f07a-c1e5-4962-bed4-d68828579af7_200x152.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rHdD!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c6f07a-c1e5-4962-bed4-d68828579af7_200x152.gif 424w, https://substackcdn.com/image/fetch/$s_!rHdD!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c6f07a-c1e5-4962-bed4-d68828579af7_200x152.gif 848w, https://substackcdn.com/image/fetch/$s_!rHdD!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c6f07a-c1e5-4962-bed4-d68828579af7_200x152.gif 1272w, 
https://substackcdn.com/image/fetch/$s_!rHdD!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c6f07a-c1e5-4962-bed4-d68828579af7_200x152.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rHdD!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c6f07a-c1e5-4962-bed4-d68828579af7_200x152.gif" width="320" height="243.20000000000002" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/27c6f07a-c1e5-4962-bed4-d68828579af7_200x152.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:152,&quot;width&quot;:200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Old Man Yells At Clouds GIFs - Find &amp; Share on GIPHY&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Old Man Yells At Clouds GIFs - Find &amp; Share on GIPHY" title="Old Man Yells At Clouds GIFs - Find &amp; Share on GIPHY" srcset="https://substackcdn.com/image/fetch/$s_!rHdD!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c6f07a-c1e5-4962-bed4-d68828579af7_200x152.gif 424w, https://substackcdn.com/image/fetch/$s_!rHdD!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c6f07a-c1e5-4962-bed4-d68828579af7_200x152.gif 848w, https://substackcdn.com/image/fetch/$s_!rHdD!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c6f07a-c1e5-4962-bed4-d68828579af7_200x152.gif 1272w, 
https://substackcdn.com/image/fetch/$s_!rHdD!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c6f07a-c1e5-4962-bed4-d68828579af7_200x152.gif 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Leave a comment or drop me a line at <a href="mailto:whytryai@substack.com">whytryai@substack.com</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/p/sora-2/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/p/sora-2/comments"><span>Leave a comment</span></a></p><div><hr></div><h2>Thanks for reading!</h2><p>If you enjoy my writing, here&#8217;s how you can help:</p><ul><li><p>&#10084;&#65039;<strong>Like</strong> this post if it resonates with you.</p></li><li><p>&#128260;<strong>Share</strong> it to help others discover this newsletter.</p></li><li><p>&#128483;&#65039;<strong>Comment</strong> below&#8212;I love hearing your opinions.</p></li></ul><p><strong>Why Try AI</strong> is a passion project, and I&#8217;m grateful to those who help keep it going. 
If you&#8217;d like to support my work and <strong><a href="https://www.whytryai.com/p/paid-subscriber-bonuses">unlock cool perks</a></strong>, consider a paid subscription:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>You&#8217;ll have to scroll to #16 to spot Sora <a href="https://artificialanalysis.ai/text-to-video/arena?tab=leaderboard-text">on this leaderboard</a> (at the time of writing).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Further reading on the topic:</p><ul><li><p><a href="https://openai.com/index/sora-2/#:~:text=of%20Sora%202.-,Launching%20responsibly,-Concerns%20about%20doomscrolling">&#8220;Launching responsibly&#8221; section</a> of the Sora 2 launch page.</p></li><li><p>&#8220;<a href="https://openai.com/index/sora-feed-philosophy/">The Sora feed philosophy</a>.&#8221;</p></li><li><p><a href="https://blog.samaltman.com/sora-2">Sam Altman&#8217;s blog post</a> with his thoughts on the social media app.</p></li></ul></div></div>]]></content:encoded></item><item><title><![CDATA[Seedream 4.0: Nano Banana Killer?]]></title><description><![CDATA[AI image wars continue, with bonus rumors of an upcoming OpenAI image update.]]></description><link>https://www.whytryai.com/p/seedream-4</link><guid isPermaLink="false">https://www.whytryai.com/p/seedream-4</guid><dc:creator><![CDATA[Daniel 
Nest]]></dc:creator><pubDate>Thu, 11 Sep 2025 08:43:31 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/1dd9361b-aed6-4feb-8e2a-b3fbecd5b083_1248x832.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>TL;DR</h2><p>ByteDance&#8217;s Seedream 4.0 gives Nano Banana a run for its money with top-tier image generation and editing.</p><h2>What is it?</h2><p>Just when you thought we were done, folks!</p><p>It&#8217;s only been two weeks since Google&#8217;s <a href="https://www.whytryai.com/p/nano-banana">Nano Banana (Gemini 2.5 Flash Image)</a> &#8220;killed&#8221; Photoshop, but we already have a shiny new kid on the block: <a href="https://seed.bytedance.com/en/seedream4_0">Seedream 4.0</a>.</p><p>The latest model from ByteDance goes toe-to-toe with Nano Banana on image editing tasks, while also being on par with <a href="https://www.whytryai.com/p/openai-4o-native-image-generation">GPT image generation</a> at making <em>new</em> images.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_eEL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6483cad1-f4c3-4fd8-aece-a2ac1e84a2c8_1601x799.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_eEL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6483cad1-f4c3-4fd8-aece-a2ac1e84a2c8_1601x799.png 424w, https://substackcdn.com/image/fetch/$s_!_eEL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6483cad1-f4c3-4fd8-aece-a2ac1e84a2c8_1601x799.png 848w, 
https://substackcdn.com/image/fetch/$s_!_eEL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6483cad1-f4c3-4fd8-aece-a2ac1e84a2c8_1601x799.png 1272w, https://substackcdn.com/image/fetch/$s_!_eEL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6483cad1-f4c3-4fd8-aece-a2ac1e84a2c8_1601x799.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_eEL!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6483cad1-f4c3-4fd8-aece-a2ac1e84a2c8_1601x799.png" width="1200" height="599.1758241758242" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6483cad1-f4c3-4fd8-aece-a2ac1e84a2c8_1601x799.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:727,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:386029,&quot;alt&quot;:&quot;MagicBench: Multi-Dimensional Evaluation In comparison with other models, Seedream 4.0 performed well across core dimensions including prompt adherence, alignment, and aesthetics. Text-to-Image Radar Chart  Achieved high scores in text-to-image tasks for prompt following, aesthetics, and text-rendering. Single-Image Editing Radar Chart  Achieved a good balance between prompt following and alignment with the source image in single-image editing tasks. 
Also reached the first place in the internal Elo evaluation.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/173256017?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6483cad1-f4c3-4fd8-aece-a2ac1e84a2c8_1601x799.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="MagicBench: Multi-Dimensional Evaluation In comparison with other models, Seedream 4.0 performed well across core dimensions including prompt adherence, alignment, and aesthetics. Text-to-Image Radar Chart  Achieved high scores in text-to-image tasks for prompt following, aesthetics, and text-rendering. Single-Image Editing Radar Chart  Achieved a good balance between prompt following and alignment with the source image in single-image editing tasks. Also reached the first place in the internal Elo evaluation." title="MagicBench: Multi-Dimensional Evaluation In comparison with other models, Seedream 4.0 performed well across core dimensions including prompt adherence, alignment, and aesthetics. Text-to-Image Radar Chart  Achieved high scores in text-to-image tasks for prompt following, aesthetics, and text-rendering. Single-Image Editing Radar Chart  Achieved a good balance between prompt following and alignment with the source image in single-image editing tasks. Also reached the first place in the internal Elo evaluation." 
srcset="https://substackcdn.com/image/fetch/$s_!_eEL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6483cad1-f4c3-4fd8-aece-a2ac1e84a2c8_1601x799.png 424w, https://substackcdn.com/image/fetch/$s_!_eEL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6483cad1-f4c3-4fd8-aece-a2ac1e84a2c8_1601x799.png 848w, https://substackcdn.com/image/fetch/$s_!_eEL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6483cad1-f4c3-4fd8-aece-a2ac1e84a2c8_1601x799.png 1272w, https://substackcdn.com/image/fetch/$s_!_eEL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6483cad1-f4c3-4fd8-aece-a2ac1e84a2c8_1601x799.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" 
stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <strong><a href="https://seed.bytedance.com/en/seedream4_0#:~:text=MagicBench%3A%20Multi%2DDimensional%20Evaluation">ByteDance</a></strong></figcaption></figure></div><p>This video has plenty of helpful side-by-side image editing tests:</p><div id="youtube2-EdEn3aWHpO8" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;EdEn3aWHpO8&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/EdEn3aWHpO8?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>As you can see, Seedream 4.0 handles many niche challenges better than either GPT image generation or Nano Banana (often both).</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><h2>How do you use it?</h2><p>Right now, there&#8217;s no official <em>free</em> way to try Seedream 4.0.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><blockquote><p><strong>Update 15-09-2025</strong>: You can now use Seedream 4.0 for free on <a href="https://lmarena.ai/">LMArena</a>:</p><p>Simply follow <a 
href="https://www.whytryai.com/i/172070970/option-lm-arena">this guide I wrote for Nano Banana</a> but select seedream-4 or seedream-4-high-res from the dropdown:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!D9xS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f05f5d1-a99b-42ec-826e-9ec8bc110bd9_581x240.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!D9xS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f05f5d1-a99b-42ec-826e-9ec8bc110bd9_581x240.png 424w, https://substackcdn.com/image/fetch/$s_!D9xS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f05f5d1-a99b-42ec-826e-9ec8bc110bd9_581x240.png 848w, https://substackcdn.com/image/fetch/$s_!D9xS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f05f5d1-a99b-42ec-826e-9ec8bc110bd9_581x240.png 1272w, https://substackcdn.com/image/fetch/$s_!D9xS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f05f5d1-a99b-42ec-826e-9ec8bc110bd9_581x240.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!D9xS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f05f5d1-a99b-42ec-826e-9ec8bc110bd9_581x240.png" width="581" height="240" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5f05f5d1-a99b-42ec-826e-9ec8bc110bd9_581x240.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:240,&quot;width&quot;:581,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:23599,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/173256017?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f05f5d1-a99b-42ec-826e-9ec8bc110bd9_581x240.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!D9xS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f05f5d1-a99b-42ec-826e-9ec8bc110bd9_581x240.png 424w, https://substackcdn.com/image/fetch/$s_!D9xS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f05f5d1-a99b-42ec-826e-9ec8bc110bd9_581x240.png 848w, https://substackcdn.com/image/fetch/$s_!D9xS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f05f5d1-a99b-42ec-826e-9ec8bc110bd9_581x240.png 1272w, https://substackcdn.com/image/fetch/$s_!D9xS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f05f5d1-a99b-42ec-826e-9ec8bc110bd9_581x240.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div></blockquote><p>Several shady third-party sites claim to offer free Seedream 4.0 generations, but the ones I tested all use outdated models under the hood. 
ByteDance&#8217;s China-facing platform <a href="https://jimeng.jianying.com/ai-tool/home">jimeng.ai</a> does offer free daily Seedream 4.0 credits, but I couldn&#8217;t make it work without a Chinese phone number.</p><p>I expect ByteDance might soon add Seedream 4.0 to its <a href="https://dreamina.capcut.com/ai-tool/home">Dreamina site</a>.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a></p><p>For now, if you&#8217;re outside China, your best option is a paid subscription to one of the following:</p><ul><li><p><a href="https://www.krea.ai/">krea.ai</a></p></li><li><p><a href="https://www.freepik.com/">freepik.com</a></p></li></ul><p>If you don&#8217;t want to commit to a subscription, <a href="https://fal.ai/">fal.ai</a> lets you try Seedream 4.0 on a pay-per-use basis (3 cents per image):</p><ol><li><p>Navigate to <a href="https://fal.ai/">fal.ai</a></p></li><li><p>Sign up (or sign in with e.g. your Google or GitHub account)</p></li><li><p>Purchase credits over at <a href="https://fal.ai/dashboard/usage-billing/credits">https://fal.ai/dashboard/usage-billing/credits</a></p></li><li><p>Pick the Seedream 4.0 model you want to use:</p><ol><li><p>For image generation:<br><a href="https://fal.ai/models/fal-ai/bytedance/seedream/v4/text-to-image">https://fal.ai/models/fal-ai/bytedance/seedream/v4/text-to-image</a></p></li><li><p>For image editing:<br><a href="https://fal.ai/models/fal-ai/bytedance/seedream/v4/edit">https://fal.ai/models/fal-ai/bytedance/seedream/v4/edit</a></p></li></ol></li><li><p>Type in your prompt and click &#8220;<strong>Run</strong>&#8221; to generate or edit the image</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" 
href="https://substackcdn.com/image/fetch/$s_!gDk5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffa0298b9-23b9-4a10-aabf-69ba4b2bdf07_743x404.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gDk5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffa0298b9-23b9-4a10-aabf-69ba4b2bdf07_743x404.png 424w, https://substackcdn.com/image/fetch/$s_!gDk5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffa0298b9-23b9-4a10-aabf-69ba4b2bdf07_743x404.png 848w, https://substackcdn.com/image/fetch/$s_!gDk5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffa0298b9-23b9-4a10-aabf-69ba4b2bdf07_743x404.png 1272w, https://substackcdn.com/image/fetch/$s_!gDk5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffa0298b9-23b9-4a10-aabf-69ba4b2bdf07_743x404.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gDk5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffa0298b9-23b9-4a10-aabf-69ba4b2bdf07_743x404.png" width="743" height="404" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fa0298b9-23b9-4a10-aabf-69ba4b2bdf07_743x404.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:404,&quot;width&quot;:743,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:48551,&quot;alt&quot;:&quot;Fal.ai Seedream 4.0 prompt 
screen&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/173256017?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffa0298b9-23b9-4a10-aabf-69ba4b2bdf07_743x404.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Fal.ai Seedream 4.0 prompt screen" title="Fal.ai Seedream 4.0 prompt screen" srcset="https://substackcdn.com/image/fetch/$s_!gDk5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffa0298b9-23b9-4a10-aabf-69ba4b2bdf07_743x404.png 424w, https://substackcdn.com/image/fetch/$s_!gDk5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffa0298b9-23b9-4a10-aabf-69ba4b2bdf07_743x404.png 848w, https://substackcdn.com/image/fetch/$s_!gDk5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffa0298b9-23b9-4a10-aabf-69ba4b2bdf07_743x404.png 1272w, https://substackcdn.com/image/fetch/$s_!gDk5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffa0298b9-23b9-4a10-aabf-69ba4b2bdf07_743x404.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 
17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li></ol><p>I used the long test prompt from <a href="https://www.whytryai.com/p/text-to-image-comparison-gpt-4o-vs-ideogram-3-vs-reve-1">my GPT-4o showdown article</a>:</p><blockquote><p><em>Candid photo of a purple steampunk platypus with wings on stage in a comedy club. To the left of the platypus is a cyberpunk goose playing a saxophone. Behind them is a show banner with the words &#8220;Top Billed.&#8221; In front of the stage, in the audience, is a dieselpunk duck. The duck is holding a handwritten show program that says &#8220;Welcome to Top Billed! No beaks were harmed in the making of this lineup. 
Prepare for laughs, feathers, and unpaid sax solos.&#8221;</em></p></blockquote><p>Here&#8217;s the best of four attempts by Seedream 4.0:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ouwb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F819513f5-8afa-4fe5-93b9-51c16e414043_740x739.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ouwb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F819513f5-8afa-4fe5-93b9-51c16e414043_740x739.png 424w, https://substackcdn.com/image/fetch/$s_!ouwb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F819513f5-8afa-4fe5-93b9-51c16e414043_740x739.png 848w, https://substackcdn.com/image/fetch/$s_!ouwb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F819513f5-8afa-4fe5-93b9-51c16e414043_740x739.png 1272w, https://substackcdn.com/image/fetch/$s_!ouwb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F819513f5-8afa-4fe5-93b9-51c16e414043_740x739.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ouwb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F819513f5-8afa-4fe5-93b9-51c16e414043_740x739.png" width="401" height="400.4581081081081" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/819513f5-8afa-4fe5-93b9-51c16e414043_740x739.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:739,&quot;width&quot;:740,&quot;resizeWidth&quot;:401,&quot;bytes&quot;:984928,&quot;alt&quot;:&quot;Candid photo of a purple steampunk platypus with wings on stage in a comedy club. To the left of the platypus is a cyberpunk goose playing a saxophone. Behind them is a show banner with the words &#8220;Top Billed.&#8221; In front of the stage, in the audience, is a dieselpunk duck. The duck is holding a handwritten show program that says &#8220;Welcome to Top Billed! No beaks were harmed in the making of this lineup. Prepare for laughs, feathers, and unpaid sax solos.&#8221; - by Seedream 4.0&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/173256017?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F819513f5-8afa-4fe5-93b9-51c16e414043_740x739.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Candid photo of a purple steampunk platypus with wings on stage in a comedy club. To the left of the platypus is a cyberpunk goose playing a saxophone. Behind them is a show banner with the words &#8220;Top Billed.&#8221; In front of the stage, in the audience, is a dieselpunk duck. The duck is holding a handwritten show program that says &#8220;Welcome to Top Billed! No beaks were harmed in the making of this lineup. Prepare for laughs, feathers, and unpaid sax solos.&#8221; - by Seedream 4.0" title="Candid photo of a purple steampunk platypus with wings on stage in a comedy club. To the left of the platypus is a cyberpunk goose playing a saxophone. 
Behind them is a show banner with the words &#8220;Top Billed.&#8221; In front of the stage, in the audience, is a dieselpunk duck. The duck is holding a handwritten show program that says &#8220;Welcome to Top Billed! No beaks were harmed in the making of this lineup. Prepare for laughs, feathers, and unpaid sax solos.&#8221; - by Seedream 4.0" srcset="https://substackcdn.com/image/fetch/$s_!ouwb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F819513f5-8afa-4fe5-93b9-51c16e414043_740x739.png 424w, https://substackcdn.com/image/fetch/$s_!ouwb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F819513f5-8afa-4fe5-93b9-51c16e414043_740x739.png 848w, https://substackcdn.com/image/fetch/$s_!ouwb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F819513f5-8afa-4fe5-93b9-51c16e414043_740x739.png 1272w, https://substackcdn.com/image/fetch/$s_!ouwb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F819513f5-8afa-4fe5-93b9-51c16e414043_740x739.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 
12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>It sticks to the prompt and almost nails the text (apart from the last word). Here&#8217;s a side-by-side view of Seedream 4.0, Nano Banana, and GPT-4o&#8212;click any image to open it in full resolution:</p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f7169d4f-b154-4777-8ef1-ca4ed9bddad3_2048x2048.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5fd3e2c2-4df4-4afd-8e67-68823cd69e41_1755x1755.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/87ce067f-fbc3-4631-8e91-b4c1c86a3451_1024x1024.png&quot;}],&quot;caption&quot;:&quot;Left to right: Seedream 4.0, Nano Banana, GPT-4o Image&quot;,&quot;alt&quot;:&quot;Steampunk platypus by Gemini, Seedream 4.0, and GPT-4o image&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/805b8311-8e09-4f2a-a953-7ece487f972a_1456x474.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p>Seedream 4.0 is also the only model trying to honor the 
&#8220;candid photo&#8221; look.</p><p>Take Seedream 4.0 for a spin with your own prompts, especially those you felt were too tricky for other image models. See if it&#8217;s an improvement.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><h2>Why should you care?</h2><p>ByteDance has just joined the ranks of <a href="https://www.whytryai.com/p/openai-4o-native-image-generation">OpenAI</a> and <a href="https://www.whytryai.com/p/gemini-2-0-flash-native-image-generation">Google</a> with a context-aware image model that has built-in reasoning abilities. This means Seedream 4.0 doesn&#8217;t just blindly follow instructions but truly understands the request and has the world knowledge to act on it.</p><p>So I can ask it for something as broadly defined as this:</p><blockquote><p><em>Four-panel cartoon that explains how coffee makes it to the shop</em></p></blockquote><p>And it might return something like this:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FtV6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f4ec0a8-36ab-4e51-85f9-c16e788d4f0b_913x667.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FtV6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f4ec0a8-36ab-4e51-85f9-c16e788d4f0b_913x667.png 424w, 
https://substackcdn.com/image/fetch/$s_!FtV6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f4ec0a8-36ab-4e51-85f9-c16e788d4f0b_913x667.png 848w, https://substackcdn.com/image/fetch/$s_!FtV6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f4ec0a8-36ab-4e51-85f9-c16e788d4f0b_913x667.png 1272w, https://substackcdn.com/image/fetch/$s_!FtV6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f4ec0a8-36ab-4e51-85f9-c16e788d4f0b_913x667.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FtV6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f4ec0a8-36ab-4e51-85f9-c16e788d4f0b_913x667.png" width="913" height="667" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5f4ec0a8-36ab-4e51-85f9-c16e788d4f0b_913x667.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:667,&quot;width&quot;:913,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1085176,&quot;alt&quot;:&quot;Four-step cartoon explaining how coffee is grown, harvested, dried, shipped, and roasted&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/173256017?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f4ec0a8-36ab-4e51-85f9-c16e788d4f0b_913x667.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Four-step cartoon explaining how coffee is grown, harvested, dried, shipped, and roasted" title="Four-step cartoon explaining how coffee is 
grown, harvested, dried, shipped, and roasted" srcset="https://substackcdn.com/image/fetch/$s_!FtV6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f4ec0a8-36ab-4e51-85f9-c16e788d4f0b_913x667.png 424w, https://substackcdn.com/image/fetch/$s_!FtV6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f4ec0a8-36ab-4e51-85f9-c16e788d4f0b_913x667.png 848w, https://substackcdn.com/image/fetch/$s_!FtV6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f4ec0a8-36ab-4e51-85f9-c16e788d4f0b_913x667.png 1272w, https://substackcdn.com/image/fetch/$s_!FtV6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f4ec0a8-36ab-4e51-85f9-c16e788d4f0b_913x667.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" 
fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>With Seedream 4.0, China now has a top-tier contender in yet another GenAI field, which reminds me of when Kling and Hailuo AI came seemingly out of nowhere to <a href="https://www.whytryai.com/p/free-ai-image-to-video-tools-tested">dominate AI video</a> at the end of 2024. (Several months later, reasoning models had their <a href="https://www.whytryai.com/p/deepseek-r1-free-openai-o1-alternative">DeepSeek R1 moment</a>.)</p><p>Every time I feel we&#8217;re near the peak of what&#8217;s possible with AI image generation, a new model comes out to raise the bar.</p><p>And it looks like we&#8217;re not quite done for this year, because OpenAI is allegedly gearing up to update its own image model, fueling wild speculation:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/Angaisb_/status/1965104688187232556" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!V0xg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6bf4025-0a16-4d09-9fd2-f3a175553938_593x545.png 424w, https://substackcdn.com/image/fetch/$s_!V0xg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6bf4025-0a16-4d09-9fd2-f3a175553938_593x545.png 848w, 
https://substackcdn.com/image/fetch/$s_!V0xg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6bf4025-0a16-4d09-9fd2-f3a175553938_593x545.png 1272w, https://substackcdn.com/image/fetch/$s_!V0xg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6bf4025-0a16-4d09-9fd2-f3a175553938_593x545.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!V0xg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6bf4025-0a16-4d09-9fd2-f3a175553938_593x545.png" width="593" height="545" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b6bf4025-0a16-4d09-9fd2-f3a175553938_593x545.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:545,&quot;width&quot;:593,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:196231,&quot;alt&quot;:&quot;GPT-Image-0721-mini-alpha spotted  What do you expect from it? I think this will be OpenAI's response to nano-banana: perfect edits, fast and cheap, but probably much smarter  Can we please call it mini-strawberry or nano-strawberry?&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/Angaisb_/status/1965104688187232556&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/173256017?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6bf4025-0a16-4d09-9fd2-f3a175553938_593x545.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="GPT-Image-0721-mini-alpha spotted  What do you expect from it? 
I think this will be OpenAI's response to nano-banana: perfect edits, fast and cheap, but probably much smarter  Can we please call it mini-strawberry or nano-strawberry?" title="GPT-Image-0721-mini-alpha spotted  What do you expect from it? I think this will be OpenAI's response to nano-banana: perfect edits, fast and cheap, but probably much smarter  Can we please call it mini-strawberry or nano-strawberry?" srcset="https://substackcdn.com/image/fetch/$s_!V0xg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6bf4025-0a16-4d09-9fd2-f3a175553938_593x545.png 424w, https://substackcdn.com/image/fetch/$s_!V0xg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6bf4025-0a16-4d09-9fd2-f3a175553938_593x545.png 848w, https://substackcdn.com/image/fetch/$s_!V0xg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6bf4025-0a16-4d09-9fd2-f3a175553938_593x545.png 1272w, https://substackcdn.com/image/fetch/$s_!V0xg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6bf4025-0a16-4d09-9fd2-f3a175553938_593x545.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 
13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <strong><a href="https://x.com/Angaisb_/status/1965104688187232556">X</a></strong></figcaption></figure></div><p>As someone who originally started <strong>Why Try AI</strong> <a href="https://www.whytryai.com/p/why-try-ai-one-year-anniversary">because of AI image models</a>, I&#8217;m excited to see where it&#8217;s all heading.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Why Try AI is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>&#129781; Over to you&#8230;</h2><p>Have you tried Seedream 4.0 yet? 
Are you following the AI image field closely? What is your favorite image model or platform?</p><p>Leave a comment or drop me a line at <a href="mailto:whytryai@substack.com">whytryai@substack.com</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/p/seedream-4/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/p/seedream-4/comments"><span>Leave a comment</span></a></p><div><hr></div><h2>Thanks for reading!</h2><p>If you enjoy my writing, here&#8217;s how you can help:</p><ul><li><p>&#10084;&#65039;<strong>Like</strong> this post if it resonates with you.</p></li><li><p>&#128260;<strong>Share</strong> it to help others discover this newsletter.</p></li><li><p>&#128483;&#65039;<strong>Comment</strong> below&#8212;I love hearing your opinions.</p></li></ul><p><strong>Why Try AI</strong> is a passion project, and I&#8217;m grateful to those who help keep it going. 
If you&#8217;d like to support my work and <strong><a href="https://www.whytryai.com/p/paid-subscriber-bonuses">unlock cool perks</a></strong>, consider a paid subscription:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Let me know if you manage to make it work for you.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Dreamina lets you use Seedream 3.1, so it&#8217;s likely that the new version will also be added.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Nano Banana Is Fantastic, But Don't Bury Photoshop Just Yet]]></title><description><![CDATA[As awesome as Nano Banana is at editing images, it won't replace professionals.]]></description><link>https://www.whytryai.com/p/nano-banana</link><guid isPermaLink="false">https://www.whytryai.com/p/nano-banana</guid><dc:creator><![CDATA[Daniel Nest]]></dc:creator><pubDate>Thu, 28 Aug 2025 09:26:15 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/240b32dd-45f4-4986-b859-92eb13331135_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote><p><em>It&#8217;s time for another Thursday &#8220;<a href="https://www.whytryai.com/s/hot-takes">Hot Take</a>.&#8221;</em></p></blockquote><h2>TL;DR</h2><p>Gemini&#8239;2.5 Flash Image (aka &#8220;Nano Banana&#8221;) is now the <a 
href="https://lmarena.ai/leaderboard/text-to-image">world&#8217;s best image model</a> that also <a href="https://lmarena.ai/leaderboard/image-edit">excels at detail-preserving edits</a>, but let&#8217;s not equate &#8220;good enough for the average Joe&#8221; with &#8220;Photoshop extinction event.&#8221;</p><h2>What is it?</h2><p>Gemini 2.5 Flash Image is Google's latest image model, which made <a href="https://www.whytryai.com/p/sunday-rundown-108-china-strikes-back#:~:text=the%20requested%20changes.-,%E2%80%9CNano%20Banana%E2%80%9D,-is%20a%20new">quite a splash last week</a> under its pre-release nickname, "Nano Banana."<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> Two days ago, Google <a href="https://blog.google/intl/en-mena/product-updates/explore-get-answers/nano-banana-image-editing-in-gemini-just-got-a-major-upgrade/">officially claimed ownership of the model</a>.</p><p>While Nano Banana is topping leaderboards for text-to-image generation, what truly makes it special is its ability to make precise edits to existing images while keeping characters consistent and preserving details.</p><p>Here's a quick taste:</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;888b6c9e-28cd-4bd6-94eb-8f3cd15f97b7&quot;,&quot;duration&quot;:null}"></div><p>It can blend separate characters and insert them into novel settings:</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;85bd61ca-27b0-40b4-b285-e48461462553&quot;,&quot;duration&quot;:null}"></div><p>And it can handle multi-turn edits without losing key details of the original image:</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;52525cdd-8d36-42de-b753-df8f2e4b03ae&quot;,&quot;duration&quot;:null}"></div><p>In short, Nano Banana is 
pretty damn neat.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><h2>How do you use it?</h2><p>If you want to try Nano Banana for yourself, you've got three options...and all of them are free!</p><h3>Option 1: Gemini app</h3><ol><li><p>Go to <a href="https://gemini.google.com/">gemini.google.com</a> or open the Gemini app on your phone.</p></li><li><p>Start a new chat.</p></li><li><p>Upload the image(s) you want to edit.</p></li><li><p>Request your changes by simply describing them.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!q-cl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49b45eb8-a2fc-4df8-9987-f51552f411d4_809x289.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!q-cl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49b45eb8-a2fc-4df8-9987-f51552f411d4_809x289.png 424w, https://substackcdn.com/image/fetch/$s_!q-cl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49b45eb8-a2fc-4df8-9987-f51552f411d4_809x289.png 848w, https://substackcdn.com/image/fetch/$s_!q-cl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49b45eb8-a2fc-4df8-9987-f51552f411d4_809x289.png 1272w, 
https://substackcdn.com/image/fetch/$s_!q-cl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49b45eb8-a2fc-4df8-9987-f51552f411d4_809x289.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!q-cl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49b45eb8-a2fc-4df8-9987-f51552f411d4_809x289.png" width="809" height="289" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/49b45eb8-a2fc-4df8-9987-f51552f411d4_809x289.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:289,&quot;width&quot;:809,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:36829,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/172070970?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49b45eb8-a2fc-4df8-9987-f51552f411d4_809x289.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!q-cl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49b45eb8-a2fc-4df8-9987-f51552f411d4_809x289.png 424w, https://substackcdn.com/image/fetch/$s_!q-cl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49b45eb8-a2fc-4df8-9987-f51552f411d4_809x289.png 848w, https://substackcdn.com/image/fetch/$s_!q-cl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49b45eb8-a2fc-4df8-9987-f51552f411d4_809x289.png 1272w, 
https://substackcdn.com/image/fetch/$s_!q-cl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49b45eb8-a2fc-4df8-9987-f51552f411d4_809x289.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>That's it!</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Q51q!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d35d783-1980-43f5-84ab-17fb1411eac7_764x759.png" 
data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Q51q!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d35d783-1980-43f5-84ab-17fb1411eac7_764x759.png 424w, https://substackcdn.com/image/fetch/$s_!Q51q!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d35d783-1980-43f5-84ab-17fb1411eac7_764x759.png 848w, https://substackcdn.com/image/fetch/$s_!Q51q!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d35d783-1980-43f5-84ab-17fb1411eac7_764x759.png 1272w, https://substackcdn.com/image/fetch/$s_!Q51q!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d35d783-1980-43f5-84ab-17fb1411eac7_764x759.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Q51q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d35d783-1980-43f5-84ab-17fb1411eac7_764x759.png" width="451" height="448.04842931937173" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9d35d783-1980-43f5-84ab-17fb1411eac7_764x759.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:759,&quot;width&quot;:764,&quot;resizeWidth&quot;:451,&quot;bytes&quot;:918915,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/172070970?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d35d783-1980-43f5-84ab-17fb1411eac7_764x759.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Q51q!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d35d783-1980-43f5-84ab-17fb1411eac7_764x759.png 424w, https://substackcdn.com/image/fetch/$s_!Q51q!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d35d783-1980-43f5-84ab-17fb1411eac7_764x759.png 848w, https://substackcdn.com/image/fetch/$s_!Q51q!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d35d783-1980-43f5-84ab-17fb1411eac7_764x759.png 1272w, https://substackcdn.com/image/fetch/$s_!Q51q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d35d783-1980-43f5-84ab-17fb1411eac7_764x759.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 
0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Ladies&#8230;</figcaption></figure></div><p>If you just want to create new images, select "Create images" in the <strong>Tools</strong> menu, then describe the image you want:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!S8E4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff743894b-231e-4979-ab45-17d2cbddd15e_810x403.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!S8E4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff743894b-231e-4979-ab45-17d2cbddd15e_810x403.png 424w, 
https://substackcdn.com/image/fetch/$s_!S8E4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff743894b-231e-4979-ab45-17d2cbddd15e_810x403.png 848w, https://substackcdn.com/image/fetch/$s_!S8E4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff743894b-231e-4979-ab45-17d2cbddd15e_810x403.png 1272w, https://substackcdn.com/image/fetch/$s_!S8E4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff743894b-231e-4979-ab45-17d2cbddd15e_810x403.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!S8E4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff743894b-231e-4979-ab45-17d2cbddd15e_810x403.png" width="810" height="403" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f743894b-231e-4979-ab45-17d2cbddd15e_810x403.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:403,&quot;width&quot;:810,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:37041,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/172070970?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff743894b-231e-4979-ab45-17d2cbddd15e_810x403.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!S8E4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff743894b-231e-4979-ab45-17d2cbddd15e_810x403.png 424w, 
https://substackcdn.com/image/fetch/$s_!S8E4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff743894b-231e-4979-ab45-17d2cbddd15e_810x403.png 848w, https://substackcdn.com/image/fetch/$s_!S8E4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff743894b-231e-4979-ab45-17d2cbddd15e_810x403.png 1272w, https://substackcdn.com/image/fetch/$s_!S8E4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff743894b-231e-4979-ab45-17d2cbddd15e_810x403.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line 
x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Like so:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XqgK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5357109-7f03-49a4-a403-cf2c647cc66f_766x764.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XqgK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5357109-7f03-49a4-a403-cf2c647cc66f_766x764.png 424w, https://substackcdn.com/image/fetch/$s_!XqgK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5357109-7f03-49a4-a403-cf2c647cc66f_766x764.png 848w, https://substackcdn.com/image/fetch/$s_!XqgK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5357109-7f03-49a4-a403-cf2c647cc66f_766x764.png 1272w, https://substackcdn.com/image/fetch/$s_!XqgK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5357109-7f03-49a4-a403-cf2c647cc66f_766x764.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XqgK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5357109-7f03-49a4-a403-cf2c647cc66f_766x764.png" width="452" height="450.81984334203656" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c5357109-7f03-49a4-a403-cf2c647cc66f_766x764.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:764,&quot;width&quot;:766,&quot;resizeWidth&quot;:452,&quot;bytes&quot;:1062186,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/172070970?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5357109-7f03-49a4-a403-cf2c647cc66f_766x764.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!XqgK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5357109-7f03-49a4-a403-cf2c647cc66f_766x764.png 424w, https://substackcdn.com/image/fetch/$s_!XqgK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5357109-7f03-49a4-a403-cf2c647cc66f_766x764.png 848w, https://substackcdn.com/image/fetch/$s_!XqgK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5357109-7f03-49a4-a403-cf2c647cc66f_766x764.png 1272w, https://substackcdn.com/image/fetch/$s_!XqgK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5357109-7f03-49a4-a403-cf2c647cc66f_766x764.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 
0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">You heard him!</figcaption></figure></div><h3>Option 2: Google AI Studio</h3><ol><li><p>Go to <a href="https://aistudio.google.com/">aistudio.google.com</a> and sign in with your Google account.</p></li><li><p>Select <strong>Chat</strong> in the left-hand column.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!c3NF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b33b83b-2ce4-4838-af39-ed17dc6f7b8e_1100x402.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!c3NF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b33b83b-2ce4-4838-af39-ed17dc6f7b8e_1100x402.png 424w, https://substackcdn.com/image/fetch/$s_!c3NF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b33b83b-2ce4-4838-af39-ed17dc6f7b8e_1100x402.png 848w, https://substackcdn.com/image/fetch/$s_!c3NF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b33b83b-2ce4-4838-af39-ed17dc6f7b8e_1100x402.png 1272w, https://substackcdn.com/image/fetch/$s_!c3NF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b33b83b-2ce4-4838-af39-ed17dc6f7b8e_1100x402.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!c3NF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b33b83b-2ce4-4838-af39-ed17dc6f7b8e_1100x402.png" width="1100" height="402" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7b33b83b-2ce4-4838-af39-ed17dc6f7b8e_1100x402.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:402,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:46858,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/172070970?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9871c3ab-fb40-443c-a732-82865051f833_1100x402.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" 
alt="" srcset="https://substackcdn.com/image/fetch/$s_!c3NF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b33b83b-2ce4-4838-af39-ed17dc6f7b8e_1100x402.png 424w, https://substackcdn.com/image/fetch/$s_!c3NF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b33b83b-2ce4-4838-af39-ed17dc6f7b8e_1100x402.png 848w, https://substackcdn.com/image/fetch/$s_!c3NF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b33b83b-2ce4-4838-af39-ed17dc6f7b8e_1100x402.png 1272w, https://substackcdn.com/image/fetch/$s_!c3NF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b33b83b-2ce4-4838-af39-ed17dc6f7b8e_1100x402.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" 
stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p>Select "<strong>Gemini 2.5 Flash Image Preview</strong>" from the right-hand model picker</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Dy2b!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc13b91e4-c0e0-472f-b583-fc358c86fa37_1159x475.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Dy2b!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc13b91e4-c0e0-472f-b583-fc358c86fa37_1159x475.png 424w, https://substackcdn.com/image/fetch/$s_!Dy2b!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc13b91e4-c0e0-472f-b583-fc358c86fa37_1159x475.png 848w, https://substackcdn.com/image/fetch/$s_!Dy2b!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc13b91e4-c0e0-472f-b583-fc358c86fa37_1159x475.png 1272w, https://substackcdn.com/image/fetch/$s_!Dy2b!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc13b91e4-c0e0-472f-b583-fc358c86fa37_1159x475.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!Dy2b!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc13b91e4-c0e0-472f-b583-fc358c86fa37_1159x475.png" width="1159" height="475" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c13b91e4-c0e0-472f-b583-fc358c86fa37_1159x475.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:475,&quot;width&quot;:1159,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:112208,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/172070970?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00bfefd3-dee8-4df2-8b7c-100a1b621210_1159x475.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Dy2b!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc13b91e4-c0e0-472f-b583-fc358c86fa37_1159x475.png 424w, https://substackcdn.com/image/fetch/$s_!Dy2b!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc13b91e4-c0e0-472f-b583-fc358c86fa37_1159x475.png 848w, https://substackcdn.com/image/fetch/$s_!Dy2b!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc13b91e4-c0e0-472f-b583-fc358c86fa37_1159x475.png 1272w, https://substackcdn.com/image/fetch/$s_!Dy2b!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc13b91e4-c0e0-472f-b583-fc358c86fa37_1159x475.png 1456w" sizes="100vw" 
loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p>Now you can upload images to edit or request new ones as above</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wChs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9773777f-7f4b-463f-a7fb-a0e8e9ecd07a_813x143.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!wChs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9773777f-7f4b-463f-a7fb-a0e8e9ecd07a_813x143.png 424w, https://substackcdn.com/image/fetch/$s_!wChs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9773777f-7f4b-463f-a7fb-a0e8e9ecd07a_813x143.png 848w, https://substackcdn.com/image/fetch/$s_!wChs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9773777f-7f4b-463f-a7fb-a0e8e9ecd07a_813x143.png 1272w, https://substackcdn.com/image/fetch/$s_!wChs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9773777f-7f4b-463f-a7fb-a0e8e9ecd07a_813x143.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wChs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9773777f-7f4b-463f-a7fb-a0e8e9ecd07a_813x143.png" width="813" height="143" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9773777f-7f4b-463f-a7fb-a0e8e9ecd07a_813x143.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:143,&quot;width&quot;:813,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:19569,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/172070970?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9773777f-7f4b-463f-a7fb-a0e8e9ecd07a_813x143.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!wChs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9773777f-7f4b-463f-a7fb-a0e8e9ecd07a_813x143.png 424w, https://substackcdn.com/image/fetch/$s_!wChs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9773777f-7f4b-463f-a7fb-a0e8e9ecd07a_813x143.png 848w, https://substackcdn.com/image/fetch/$s_!wChs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9773777f-7f4b-463f-a7fb-a0e8e9ecd07a_813x143.png 1272w, https://substackcdn.com/image/fetch/$s_!wChs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9773777f-7f4b-463f-a7fb-a0e8e9ecd07a_813x143.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div></li></ol><p>Like so:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gcXX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d9888d3-c82c-49e6-9673-e20f7ae53ba0_935x939.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gcXX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d9888d3-c82c-49e6-9673-e20f7ae53ba0_935x939.png 424w, https://substackcdn.com/image/fetch/$s_!gcXX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d9888d3-c82c-49e6-9673-e20f7ae53ba0_935x939.png 848w, 
https://substackcdn.com/image/fetch/$s_!gcXX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d9888d3-c82c-49e6-9673-e20f7ae53ba0_935x939.png 1272w, https://substackcdn.com/image/fetch/$s_!gcXX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d9888d3-c82c-49e6-9673-e20f7ae53ba0_935x939.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gcXX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d9888d3-c82c-49e6-9673-e20f7ae53ba0_935x939.png" width="450" height="451.9251336898396" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8d9888d3-c82c-49e6-9673-e20f7ae53ba0_935x939.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:939,&quot;width&quot;:935,&quot;resizeWidth&quot;:450,&quot;bytes&quot;:1258547,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/172070970?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d9888d3-c82c-49e6-9673-e20f7ae53ba0_935x939.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!gcXX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d9888d3-c82c-49e6-9673-e20f7ae53ba0_935x939.png 424w, 
https://substackcdn.com/image/fetch/$s_!gcXX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d9888d3-c82c-49e6-9673-e20f7ae53ba0_935x939.png 848w, https://substackcdn.com/image/fetch/$s_!gcXX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d9888d3-c82c-49e6-9673-e20f7ae53ba0_935x939.png 1272w, https://substackcdn.com/image/fetch/$s_!gcXX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d9888d3-c82c-49e6-9673-e20f7ae53ba0_935x939.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line 
x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Option 3: LM Arena</h3><ol><li><p>Head to <a href="https://lmarena.ai/">lmarena.ai</a></p></li><li><p>Select "<strong>Direct Chat</strong>" in the top dropdown</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4tZa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcef4770-5c0f-4b23-8642-9d12ce519a8a_263x226.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4tZa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcef4770-5c0f-4b23-8642-9d12ce519a8a_263x226.png 424w, https://substackcdn.com/image/fetch/$s_!4tZa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcef4770-5c0f-4b23-8642-9d12ce519a8a_263x226.png 848w, https://substackcdn.com/image/fetch/$s_!4tZa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcef4770-5c0f-4b23-8642-9d12ce519a8a_263x226.png 1272w, https://substackcdn.com/image/fetch/$s_!4tZa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcef4770-5c0f-4b23-8642-9d12ce519a8a_263x226.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4tZa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcef4770-5c0f-4b23-8642-9d12ce519a8a_263x226.png" width="263" height="226" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dcef4770-5c0f-4b23-8642-9d12ce519a8a_263x226.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:226,&quot;width&quot;:263,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:22793,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/172070970?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F286edadd-bdbf-40dc-bd7a-d709fc929d25_263x226.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4tZa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcef4770-5c0f-4b23-8642-9d12ce519a8a_263x226.png 424w, https://substackcdn.com/image/fetch/$s_!4tZa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcef4770-5c0f-4b23-8642-9d12ce519a8a_263x226.png 848w, https://substackcdn.com/image/fetch/$s_!4tZa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcef4770-5c0f-4b23-8642-9d12ce519a8a_263x226.png 1272w, https://substackcdn.com/image/fetch/$s_!4tZa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcef4770-5c0f-4b23-8642-9d12ce519a8a_263x226.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div></li><li><p>Click on "<strong>Generate Images</strong>" under the prompt box</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" 
href="https://substackcdn.com/image/fetch/$s_!PP3u!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c3ee147-1255-4f09-b3d8-636fe021659c_812x339.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PP3u!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c3ee147-1255-4f09-b3d8-636fe021659c_812x339.png 424w, https://substackcdn.com/image/fetch/$s_!PP3u!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c3ee147-1255-4f09-b3d8-636fe021659c_812x339.png 848w, https://substackcdn.com/image/fetch/$s_!PP3u!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c3ee147-1255-4f09-b3d8-636fe021659c_812x339.png 1272w, https://substackcdn.com/image/fetch/$s_!PP3u!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c3ee147-1255-4f09-b3d8-636fe021659c_812x339.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PP3u!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c3ee147-1255-4f09-b3d8-636fe021659c_812x339.png" width="812" height="339" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0c3ee147-1255-4f09-b3d8-636fe021659c_812x339.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:339,&quot;width&quot;:812,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:53329,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/172070970?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F257fe821-af0a-4737-91f9-7498c65fe94d_812x339.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!PP3u!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c3ee147-1255-4f09-b3d8-636fe021659c_812x339.png 424w, https://substackcdn.com/image/fetch/$s_!PP3u!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c3ee147-1255-4f09-b3d8-636fe021659c_812x339.png 848w, https://substackcdn.com/image/fetch/$s_!PP3u!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c3ee147-1255-4f09-b3d8-636fe021659c_812x339.png 1272w, https://substackcdn.com/image/fetch/$s_!PP3u!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c3ee147-1255-4f09-b3d8-636fe021659c_812x339.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 
0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p>Select "<strong>gemini-2.5-flash-image-preview</strong>" in the top dropdown</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Vo5N!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F976e53a0-dcc5-46d4-9608-ba3d496c315c_579x266.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Vo5N!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F976e53a0-dcc5-46d4-9608-ba3d496c315c_579x266.png 424w, 
https://substackcdn.com/image/fetch/$s_!Vo5N!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F976e53a0-dcc5-46d4-9608-ba3d496c315c_579x266.png 848w, https://substackcdn.com/image/fetch/$s_!Vo5N!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F976e53a0-dcc5-46d4-9608-ba3d496c315c_579x266.png 1272w, https://substackcdn.com/image/fetch/$s_!Vo5N!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F976e53a0-dcc5-46d4-9608-ba3d496c315c_579x266.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Vo5N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F976e53a0-dcc5-46d4-9608-ba3d496c315c_579x266.png" width="579" height="266" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/976e53a0-dcc5-46d4-9608-ba3d496c315c_579x266.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:266,&quot;width&quot;:579,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:30744,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/172070970?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F976e53a0-dcc5-46d4-9608-ba3d496c315c_579x266.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Vo5N!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F976e53a0-dcc5-46d4-9608-ba3d496c315c_579x266.png 424w, 
https://substackcdn.com/image/fetch/$s_!Vo5N!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F976e53a0-dcc5-46d4-9608-ba3d496c315c_579x266.png 848w, https://substackcdn.com/image/fetch/$s_!Vo5N!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F976e53a0-dcc5-46d4-9608-ba3d496c315c_579x266.png 1272w, https://substackcdn.com/image/fetch/$s_!Vo5N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F976e53a0-dcc5-46d4-9608-ba3d496c315c_579x266.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line 
x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p>Now you can upload images to edit or request new ones as above.</p></li></ol><blockquote><p><strong>Important note:</strong> <em>Unlike Gemini and Google AI Studio, LM Arena reserves the right to make your prompts public for research purposes.</em></p></blockquote><h2>Why should you care?</h2><p>Character consistency has long been the holy grail of AI image and video generation. It allows for long-form storytelling, precise image remixing, different shots of the same setting, and more.</p><p>Right now, no other AI model is as good at keeping characters and image details consistent as Nano Banana.</p><p>The previous best image model, <a href="https://www.whytryai.com/p/openai-4o-native-image-generation">GPT-4o image generation</a>, struggled with this. In fact, it was so bad at preserving details and characters over multiple turns that it spawned a short-lived-but-fun trend: People would <a href="https://www.reddit.com/r/ChatGPT/comments/1kbwo6a/i_did_the_create_a_replica_of_this_image_dont/">upload an image and repeatedly ask ChatGPT to</a> "Create a replica of this image. Don&#8217;t change anything." 
The results were disturbing:</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;66cc462e-3cc7-493d-b4a0-78acb606a3e0&quot;,&quot;duration&quot;:null}"></div><p>So yes, Nano Banana is the closest the average user can get to making nuanced image edits in natural language.</p><p>But in its typical fashion, the Internet jumped from that to &#8220;Photoshop is dead&#8221; in a split second.</p><p>Seriously, go type &#8220;nano banana photoshop&#8221; into your search bar.</p><p>I&#8217;ll wait.</p><p>No? Fine, I&#8217;ll do it for you:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9Wqn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8d99bb7-2bc8-44dc-9ad7-682dc9920a76_915x652.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9Wqn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8d99bb7-2bc8-44dc-9ad7-682dc9920a76_915x652.png 424w, https://substackcdn.com/image/fetch/$s_!9Wqn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8d99bb7-2bc8-44dc-9ad7-682dc9920a76_915x652.png 848w, https://substackcdn.com/image/fetch/$s_!9Wqn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8d99bb7-2bc8-44dc-9ad7-682dc9920a76_915x652.png 1272w, https://substackcdn.com/image/fetch/$s_!9Wqn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8d99bb7-2bc8-44dc-9ad7-682dc9920a76_915x652.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!9Wqn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8d99bb7-2bc8-44dc-9ad7-682dc9920a76_915x652.png" width="915" height="652" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b8d99bb7-2bc8-44dc-9ad7-682dc9920a76_915x652.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:652,&quot;width&quot;:915,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:177969,&quot;alt&quot;:&quot;Google search results for \&quot;nano banana photoshop\&quot; - spelling the death of Photoshop&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/172070970?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8d99bb7-2bc8-44dc-9ad7-682dc9920a76_915x652.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Google search results for &quot;nano banana photoshop&quot; - spelling the death of Photoshop" title="Google search results for &quot;nano banana photoshop&quot; - spelling the death of Photoshop" srcset="https://substackcdn.com/image/fetch/$s_!9Wqn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8d99bb7-2bc8-44dc-9ad7-682dc9920a76_915x652.png 424w, https://substackcdn.com/image/fetch/$s_!9Wqn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8d99bb7-2bc8-44dc-9ad7-682dc9920a76_915x652.png 848w, 
https://substackcdn.com/image/fetch/$s_!9Wqn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8d99bb7-2bc8-44dc-9ad7-682dc9920a76_915x652.png 1272w, https://substackcdn.com/image/fetch/$s_!9Wqn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8d99bb7-2bc8-44dc-9ad7-682dc9920a76_915x652.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>Why must we constantly pair the arrival of one awesome tool with the death of something else? 
Is it the drama?</p><p>We already killed Google back in early 2023:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;396ca884-5693-43e6-a38f-5f6769be4b98&quot;,&quot;caption&quot;:&quot;Have y&#8217;all heard? Google&#8217;s done!&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;showDescription&quot;:true,&quot;showImage&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Just How \&quot;Dead\&quot; Is Google, Anyway?&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:103658370,&quot;name&quot;:&quot;Daniel Nest&quot;,&quot;bio&quot;:&quot;I write about generative AI for the average person. I love experimenting with all GenAI, including AI images, video, music, chatbots, and more.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3cf75e3-f197-48b0-999b-d73cbb1a8ad5_1321x1321.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-05-11T19:12:42.247Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ac225780-91c7-4bbb-925b-3a002163ac38_1344x896.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.whytryai.com/p/how-dead-is-google&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:120455120,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:8,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;Why Try 
AI&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!raEn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c4d0362-24d4-4046-9ccd-cb331c34edc4_1024x1024.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p>Now, Google is back from the grave to murder Adobe?</p><p>Will this cycle of violence ever end?!</p><p>Jokes aside, there&#8217;s quite a gap between &#8220;This AI tool lets people make quick and precise image edits&#8221; and &#8220;Professional Photoshop workflows are now obsolete.&#8221;</p><p>For one, Nano Banana still has many limitations. <a href="https://deepmind.google/models/gemini/image/">Google itself highlights</a> that the model struggles &#8220;with small faces, accurate spelling, and fine details in images.&#8221;</p><p>When I threw my &#8220;long text&#8221; prompt <a href="https://www.whytryai.com/i/160325050/level-long-text">from this image model showdown</a> at Nano Banana, it couldn&#8217;t nail the show program text after multiple tries&#8212;something that GPT-4o image generation did quite consistently:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hIAX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fe1ca30-d2d3-42f7-95ca-a9ca8e9a1780_938x931.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hIAX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fe1ca30-d2d3-42f7-95ca-a9ca8e9a1780_938x931.png 424w, 
https://substackcdn.com/image/fetch/$s_!hIAX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fe1ca30-d2d3-42f7-95ca-a9ca8e9a1780_938x931.png 848w, https://substackcdn.com/image/fetch/$s_!hIAX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fe1ca30-d2d3-42f7-95ca-a9ca8e9a1780_938x931.png 1272w, https://substackcdn.com/image/fetch/$s_!hIAX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fe1ca30-d2d3-42f7-95ca-a9ca8e9a1780_938x931.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hIAX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fe1ca30-d2d3-42f7-95ca-a9ca8e9a1780_938x931.png" width="450" height="446.64179104477614" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7fe1ca30-d2d3-42f7-95ca-a9ca8e9a1780_938x931.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:931,&quot;width&quot;:938,&quot;resizeWidth&quot;:450,&quot;bytes&quot;:1377913,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/172070970?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fe1ca30-d2d3-42f7-95ca-a9ca8e9a1780_938x931.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!hIAX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fe1ca30-d2d3-42f7-95ca-a9ca8e9a1780_938x931.png 424w, https://substackcdn.com/image/fetch/$s_!hIAX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fe1ca30-d2d3-42f7-95ca-a9ca8e9a1780_938x931.png 848w, https://substackcdn.com/image/fetch/$s_!hIAX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fe1ca30-d2d3-42f7-95ca-a9ca8e9a1780_938x931.png 1272w, https://substackcdn.com/image/fetch/$s_!hIAX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fe1ca30-d2d3-42f7-95ca-a9ca8e9a1780_938x931.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>This <em>AI Search</em> video, while very positive about Nano Banana, has a whole section dedicated to tests where other models do better:</p><div id="youtube2-2qYjhHtKxB8" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;2qYjhHtKxB8&quot;,&quot;startTime&quot;:&quot;956&quot;,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/2qYjhHtKxB8?start=956&amp;rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>But also&#8212;and I know this is a very controversial statement&#8212;there&#8217;s probably a tiny difference between making quick AI-powered edits and doing professional-grade visual design work.</p><p>How about we enjoy playing with this new, super cool tool without instantly proclaiming the death of all professional design?</p><p>Have fun out there!</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Why Try AI is a reader-supported publication. 
To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>&#129781; Over to you&#8230;</h2><p>Have you tried editing images with Nano Banana? What&#8217;s been your experience? Have you run into other obvious limitations? I&#8217;d love to hear your thoughts!</p><p>Leave a comment or drop me a line at <a href="mailto:whytryai@substack.com">whytryai@substack.com</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/p/nano-banana/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/p/nano-banana/comments"><span>Leave a comment</span></a></p><div><hr></div><h2>Thanks for reading!</h2><p>If you enjoy my writing, here&#8217;s how you can help:</p><ul><li><p>&#10084;&#65039;<strong>Like</strong> this post if it resonates with you.</p></li><li><p>&#128260;<strong>Share</strong> it to help others discover this newsletter.</p></li><li><p>&#128483;&#65039;<strong>Comment</strong> below&#8212;I love hearing your opinions.</p></li></ul><p><strong>Why Try AI</strong> is a passion project, and I&#8217;m grateful to those who help keep it going. 
If you&#8217;d like to support my work and <strong><a href="https://www.whytryai.com/p/paid-subscriber-bonuses">unlock cool perks</a></strong>, consider a paid subscription:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>I&#8217;ll mostly stick to the reader-friendly &#8220;Nano Banana&#8221; name in this post.</p></div></div>]]></content:encoded></item><item><title><![CDATA[HeyGen Avatar IV: Deepfakes Are Point-and-Click Now.]]></title><description><![CDATA[All you need is just one image. Is this a problem?]]></description><link>https://www.whytryai.com/p/heygen-avatar-iv-deepfakes</link><guid isPermaLink="false">https://www.whytryai.com/p/heygen-avatar-iv-deepfakes</guid><dc:creator><![CDATA[Daniel Nest]]></dc:creator><pubDate>Thu, 08 May 2025 16:21:07 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/56ff9f43-d0c7-4b3e-9cde-37f90e7c92eb_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>TL;DR</h2><p>HeyGen&#8217;s Avatar IV creates lifelike, expressive talking avatars from a single image that can lip-sync to any audio or script&#8212;are deepfakes too easy now?</p><h2>What is it?</h2><p>Avatar IV is a new AI avatar model from HeyGen:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/joshua_xu_/status/1919765489775231401" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!YGxy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63a374bd-c909-41c4-8f2f-f174f38ea78a_580x506.png 424w, https://substackcdn.com/image/fetch/$s_!YGxy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63a374bd-c909-41c4-8f2f-f174f38ea78a_580x506.png 848w, https://substackcdn.com/image/fetch/$s_!YGxy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63a374bd-c909-41c4-8f2f-f174f38ea78a_580x506.png 1272w, https://substackcdn.com/image/fetch/$s_!YGxy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63a374bd-c909-41c4-8f2f-f174f38ea78a_580x506.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YGxy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63a374bd-c909-41c4-8f2f-f174f38ea78a_580x506.png" width="520" height="453.6551724137931" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/63a374bd-c909-41c4-8f2f-f174f38ea78a_580x506.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:506,&quot;width&quot;:580,&quot;resizeWidth&quot;:520,&quot;bytes&quot;:45967,&quot;alt&quot;:&quot; NEW: HeyGen Avatar IV is here.  Our most advanced AI avatar model yet.  &#128248; One photo. &#128221; One script. &#127911; Just your voice.  Most avatars sync to your words. Avatar IV interprets them.  Built on a diffusion-inspired audio-to-expression engine, it analyzes your vocal tone, rhythm, and emotion &#8212; then synthesizes photoreal facial motion with temporal realism.  &#127917; Head tilts. Pauses. Cadences. Micro-expressions. 
 &#10145;&#65039; A single image &#8594; a video that feels real, not rendered.  Rolling out to all users now.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/joshua_xu_/status/1919765489775231401&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/155321899?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63a374bd-c909-41c4-8f2f-f174f38ea78a_580x506.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt=" NEW: HeyGen Avatar IV is here.  Our most advanced AI avatar model yet.  &#128248; One photo. &#128221; One script. &#127911; Just your voice.  Most avatars sync to your words. Avatar IV interprets them.  Built on a diffusion-inspired audio-to-expression engine, it analyzes your vocal tone, rhythm, and emotion &#8212; then synthesizes photoreal facial motion with temporal realism.  &#127917; Head tilts. Pauses. Cadences. Micro-expressions.  &#10145;&#65039; A single image &#8594; a video that feels real, not rendered.  Rolling out to all users now." title=" NEW: HeyGen Avatar IV is here.  Our most advanced AI avatar model yet.  &#128248; One photo. &#128221; One script. &#127911; Just your voice.  Most avatars sync to your words. Avatar IV interprets them.  Built on a diffusion-inspired audio-to-expression engine, it analyzes your vocal tone, rhythm, and emotion &#8212; then synthesizes photoreal facial motion with temporal realism.  &#127917; Head tilts. Pauses. Cadences. Micro-expressions.  &#10145;&#65039; A single image &#8594; a video that feels real, not rendered.  Rolling out to all users now." 
srcset="https://substackcdn.com/image/fetch/$s_!YGxy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63a374bd-c909-41c4-8f2f-f174f38ea78a_580x506.png 424w, https://substackcdn.com/image/fetch/$s_!YGxy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63a374bd-c909-41c4-8f2f-f174f38ea78a_580x506.png 848w, https://substackcdn.com/image/fetch/$s_!YGxy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63a374bd-c909-41c4-8f2f-f174f38ea78a_580x506.png 1272w, https://substackcdn.com/image/fetch/$s_!YGxy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63a374bd-c909-41c4-8f2f-f174f38ea78a_580x506.png 1456w" sizes="100vw" fetchpriority="high"></picture></div></a><figcaption class="image-caption">Source: <strong><a href="https://x.com/joshua_xu_/status/1919765489775231401">X</a></strong></figcaption></figure></div><p>Not so long ago, creating a custom avatar with HeyGen or Synthesia required you to record a training video and submit it for professional processing.</p><p>Now, it takes one image, one script (written or recorded), and a few minutes. That&#8217;s it:</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;00e922bf-3cc3-4efb-af00-1a49dd6b6057&quot;,&quot;duration&quot;:null}"></div><p>As you can see, the avatars don&#8217;t just sync their lips to the voice track&#8212;their movements and microexpressions accurately match the tone, etc.</p><p>It&#8217;s quite uncanny.</p><h2>How do you use it?</h2><p>The process is disarmingly simple.</p><p>On the main dashboard, click the <strong>Photo to Video with Avatar IV</strong> option:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fDfn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21c15b04-2b86-4d96-8711-3bfb7fc90ae9_1919x823.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fDfn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21c15b04-2b86-4d96-8711-3bfb7fc90ae9_1919x823.png 424w, 
https://substackcdn.com/image/fetch/$s_!fDfn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21c15b04-2b86-4d96-8711-3bfb7fc90ae9_1919x823.png 848w, https://substackcdn.com/image/fetch/$s_!fDfn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21c15b04-2b86-4d96-8711-3bfb7fc90ae9_1919x823.png 1272w, https://substackcdn.com/image/fetch/$s_!fDfn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21c15b04-2b86-4d96-8711-3bfb7fc90ae9_1919x823.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fDfn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21c15b04-2b86-4d96-8711-3bfb7fc90ae9_1919x823.png" width="1919" height="823" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/21c15b04-2b86-4d96-8711-3bfb7fc90ae9_1919x823.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:823,&quot;width&quot;:1919,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:566773,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/155321899?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39c09715-d1ad-4324-b9f3-e3270d27b6b4_1919x823.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fDfn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21c15b04-2b86-4d96-8711-3bfb7fc90ae9_1919x823.png 
424w, https://substackcdn.com/image/fetch/$s_!fDfn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21c15b04-2b86-4d96-8711-3bfb7fc90ae9_1919x823.png 848w, https://substackcdn.com/image/fetch/$s_!fDfn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21c15b04-2b86-4d96-8711-3bfb7fc90ae9_1919x823.png 1272w, https://substackcdn.com/image/fetch/$s_!fDfn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21c15b04-2b86-4d96-8711-3bfb7fc90ae9_1919x823.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>This brings up a pop-up where you can complete the entire procedure in one go:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!uNoN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa62276f1-5f90-4cc7-a6e4-396fb3e203d5_1384x679.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!uNoN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa62276f1-5f90-4cc7-a6e4-396fb3e203d5_1384x679.png 424w, https://substackcdn.com/image/fetch/$s_!uNoN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa62276f1-5f90-4cc7-a6e4-396fb3e203d5_1384x679.png 848w, https://substackcdn.com/image/fetch/$s_!uNoN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa62276f1-5f90-4cc7-a6e4-396fb3e203d5_1384x679.png 1272w, https://substackcdn.com/image/fetch/$s_!uNoN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa62276f1-5f90-4cc7-a6e4-396fb3e203d5_1384x679.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!uNoN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa62276f1-5f90-4cc7-a6e4-396fb3e203d5_1384x679.png" width="1384" height="679" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a62276f1-5f90-4cc7-a6e4-396fb3e203d5_1384x679.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:679,&quot;width&quot;:1384,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:211595,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/155321899?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa62276f1-5f90-4cc7-a6e4-396fb3e203d5_1384x679.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!uNoN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa62276f1-5f90-4cc7-a6e4-396fb3e203d5_1384x679.png 424w, https://substackcdn.com/image/fetch/$s_!uNoN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa62276f1-5f90-4cc7-a6e4-396fb3e203d5_1384x679.png 848w, https://substackcdn.com/image/fetch/$s_!uNoN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa62276f1-5f90-4cc7-a6e4-396fb3e203d5_1384x679.png 1272w, https://substackcdn.com/image/fetch/$s_!uNoN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa62276f1-5f90-4cc7-a6e4-396fb3e203d5_1384x679.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>From here, you just:</p><ol><li><p>Upload a single image.</p></li><li><p>Type out your script (or upload a pre-recorded voice clip).</p></li><li><p>Select a template voice (if not using the pre-recorded audio above).</p></li></ol><p>Here&#8217;s how that might look:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!c42_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9812de90-a99a-499e-bdca-6914679370c4_1115x550.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!c42_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9812de90-a99a-499e-bdca-6914679370c4_1115x550.png 424w, https://substackcdn.com/image/fetch/$s_!c42_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9812de90-a99a-499e-bdca-6914679370c4_1115x550.png 848w, https://substackcdn.com/image/fetch/$s_!c42_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9812de90-a99a-499e-bdca-6914679370c4_1115x550.png 1272w, https://substackcdn.com/image/fetch/$s_!c42_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9812de90-a99a-499e-bdca-6914679370c4_1115x550.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!c42_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9812de90-a99a-499e-bdca-6914679370c4_1115x550.png" width="1115" height="550" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9812de90-a99a-499e-bdca-6914679370c4_1115x550.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:550,&quot;width&quot;:1115,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;This is Daniel, and I did NOT say ANY of this!  This isn't even my voice, dude.  Come on!  Stop making me say stuff, you creep.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="This is Daniel, and I did NOT say ANY of this!  
This isn't even my voice, dude.  Come on!  Stop making me say stuff, you creep." title="This is Daniel, and I did NOT say ANY of this!  This isn't even my voice, dude.  Come on!  Stop making me say stuff, you creep." srcset="https://substackcdn.com/image/fetch/$s_!c42_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9812de90-a99a-499e-bdca-6914679370c4_1115x550.png 424w, https://substackcdn.com/image/fetch/$s_!c42_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9812de90-a99a-499e-bdca-6914679370c4_1115x550.png 848w, https://substackcdn.com/image/fetch/$s_!c42_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9812de90-a99a-499e-bdca-6914679370c4_1115x550.png 1272w, https://substackcdn.com/image/fetch/$s_!c42_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9812de90-a99a-499e-bdca-6914679370c4_1115x550.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button 
tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Now you just click &#8220;<strong>Generate video</strong>&#8221; and wait a minute or so. Then you get your video:</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;ddd7674a-4766-49d9-9ec1-19d954962b39&quot;,&quot;duration&quot;:null}"></div><p>Ignore the frozen background and the voice mismatch for a second. </p><p>This is pretty solid for close to zero effort on my part.</p><p>Anyone can create up to <a href="https://help.heygen.com/en/articles/11269603-new-feature-alert-heygen-avatar-iv-is-here#:~:text=Usage%20Limits%20by%20Subscription%20Type">three 10-second clips a month</a> for free, while paid accounts can make 30-second videos.</p><h2>Why should you care?</h2><p>Because deepfakes are quickly heading into &#8220;So easy, your grandma&#8217;s goldfish can make one&#8221; territory.</p><p>To be sure, talking avatars are far from a brand-new concept.</p><p>D-ID has let people <a href="https://www.d-id.com/personal-avatars">create custom talking avatars for years</a>. 
So has <a href="https://www.synthesia.io/features/custom-avatar">Synthesia</a>.</p><p>Even the single-image-to-talking-avatar tech isn&#8217;t <em>that</em> new.</p><p>Last year alone, I wrote about three such models:</p><ul><li><p>March 3, 2024: <a href="https://www.whytryai.com/i/141965418/alibabas-emote-portrait-alive-is-uncannily-good">Alibaba&#8217;s Emote Portrait Alive (EMO)</a></p></li><li><p>March 17, 2024: <a href="https://www.whytryai.com/i/142513248/vlogger-image-to-animated-avatar-research">Google&#8217;s VLOGGER</a></p></li><li><p>April 21, 2024: <a href="https://www.whytryai.com/i/143602808/vasa-can-animate-images-using-audio-clips">Microsoft&#8217;s VASA-1</a></p></li></ul><p>But even then, the feature struck me as borderline creepy.</p><p>I could see why those models were research papers rather than consumer products.</p><p>To wit, here&#8217;s exactly what I said about VASA-1 (emphasis added):</p><blockquote><p><em>&#8220;VASA-1 is outright scary. Given just a single image of a person paired with an audio clip, VASA-1 makes a realistic talking head of that person. Optional controls let you adjust the speaker&#8217;s emotion, camera view, and more. 
<strong>Understandably, Microsoft is not planning to make the model available at this time</strong>.&#8221;</em></p></blockquote><p>Just one year later, thanks to Avatar IV, this technology is now mainstream.</p><p>HeyGen&#8217;s version stands out in several ways:</p><ul><li><p>It&#8217;s highly accessible through a clean point-and-click interface.</p></li><li><p>It can feel quite realistic thanks to lifelike microexpressions.</p></li><li><p>It exists inside a popular product with <a href="https://www.forbes.com/sites/charliefink/2024/06/24/heygen-ai-video-scores-60-million-plus-more-cinematic-ai-shorts/">at least 3 million monthly active users</a> (and the potential to <a href="https://lu.ma/ihp9l8wv">reach 130 million more</a> as a native app inside Canva).</p></li></ul><p>As such, Avatar IV reaches the average user in a way the above research demos and previews never could.</p><p>But why take my word for it?</p><p>Everything I say above is the literal sales pitch, straight from the horse&#8217;s mouth:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!o1hf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ab1c9a-fe94-4216-94ec-34d561156263_661x373.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!o1hf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ab1c9a-fe94-4216-94ec-34d561156263_661x373.png 424w, https://substackcdn.com/image/fetch/$s_!o1hf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ab1c9a-fe94-4216-94ec-34d561156263_661x373.png 848w, 
https://substackcdn.com/image/fetch/$s_!o1hf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ab1c9a-fe94-4216-94ec-34d561156263_661x373.png 1272w, https://substackcdn.com/image/fetch/$s_!o1hf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ab1c9a-fe94-4216-94ec-34d561156263_661x373.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!o1hf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ab1c9a-fe94-4216-94ec-34d561156263_661x373.png" width="661" height="373" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/02ab1c9a-fe94-4216-94ec-34d561156263_661x373.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:373,&quot;width&quot;:661,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:79158,&quot;alt&quot;:&quot;   What Makes Avatar IV Different?  Human-Centric Video Generation: Unlike traditional avatars that simply sync to your words, this new-to-the-world Avatar model actually interprets them. Built on a diffusion-inspired audio-to-expression engine, it analyzes your vocal tone, rhythm, and emotion to generate photorealistic facial movements with true-to-life timing. Think head tilts, natural pauses, subtle cadences, and micro-expressions&#8212;all from a single image. The result? A video that feels real, not rendered. &#8203; &#8203;Speed + Simplicity: No complex script writing, scene setup, or editing in the Studio. Just open Avatar IV, fill out the required fields, and your video is ready in seconds. It&#8217;s designed for instant, on-the-fly communication.     No Learning Curve: Anyone can use it! 
No training, no creative brief, no editing timeline.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/155321899?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ab1c9a-fe94-4216-94ec-34d561156263_661x373.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="   What Makes Avatar IV Different?  Human-Centric Video Generation: Unlike traditional avatars that simply sync to your words, this new-to-the-world Avatar model actually interprets them. Built on a diffusion-inspired audio-to-expression engine, it analyzes your vocal tone, rhythm, and emotion to generate photorealistic facial movements with true-to-life timing. Think head tilts, natural pauses, subtle cadences, and micro-expressions&#8212;all from a single image. The result? A video that feels real, not rendered. &#8203; &#8203;Speed + Simplicity: No complex script writing, scene setup, or editing in the Studio. Just open Avatar IV, fill out the required fields, and your video is ready in seconds. It&#8217;s designed for instant, on-the-fly communication.     No Learning Curve: Anyone can use it! No training, no creative brief, no editing timeline." title="   What Makes Avatar IV Different?  Human-Centric Video Generation: Unlike traditional avatars that simply sync to your words, this new-to-the-world Avatar model actually interprets them. Built on a diffusion-inspired audio-to-expression engine, it analyzes your vocal tone, rhythm, and emotion to generate photorealistic facial movements with true-to-life timing. Think head tilts, natural pauses, subtle cadences, and micro-expressions&#8212;all from a single image. The result? A video that feels real, not rendered. 
&#8203; &#8203;Speed + Simplicity: No complex script writing, scene setup, or editing in the Studio. Just open Avatar IV, fill out the required fields, and your video is ready in seconds. It&#8217;s designed for instant, on-the-fly communication.     No Learning Curve: Anyone can use it! No training, no creative brief, no editing timeline." srcset="https://substackcdn.com/image/fetch/$s_!o1hf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ab1c9a-fe94-4216-94ec-34d561156263_661x373.png 424w, https://substackcdn.com/image/fetch/$s_!o1hf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ab1c9a-fe94-4216-94ec-34d561156263_661x373.png 848w, https://substackcdn.com/image/fetch/$s_!o1hf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ab1c9a-fe94-4216-94ec-34d561156263_661x373.png 1272w, https://substackcdn.com/image/fetch/$s_!o1hf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ab1c9a-fe94-4216-94ec-34d561156263_661x373.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 
12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <strong><a href="https://help.heygen.com/en/articles/11269603-new-feature-alert-heygen-avatar-iv-is-here">HeyGen</a></strong></figcaption></figure></div><p>&#8220;A video that feels <em>real</em>, not rendered.&#8221;</p><p>Indeed.</p><p>Look, I&#8217;m a big-time AI enthusiast.</p><p>I even have a newsletter about AI; you might&#8217;ve heard of it.</p><p>And while I tend to be cautiously optimistic about tech, sometimes it feels like we&#8217;re making increasingly high-impact tools available too broadly, too quickly.</p><p>Sure, my test video above isn&#8217;t going to fool anyone. I picked an image with a busy background that doesn&#8217;t get properly animated and a random, generic AI voice.</p><p>But what happens when you pair HeyGen&#8217;s Avatar IV with a better starting image and recent voice-cloning tech? 
Here are just a few that I covered:</p><ul><li><p><a href="https://elevenlabs.io/voice-cloning">ElevenLabs Voice Cloning</a></p></li><li><p><a href="https://www.whytryai.com/p/sunday-rundown-96-llms#:~:text=Play%20AI%20now%20has%20a%20Voice%20Changer%20that%20can%20clone%20a%20voice%20in%2010%20seconds%20while%20preserving%20tone%20and%20emotion.">PlayHT Voice Changer</a></p></li><li><p><a href="https://www.whytryai.com/p/sunday-rundown-88-grok-3#:~:text=Zyphra%20open%2Dsourced%20Zonos%2Dv0.1%2C%20a%20text%2Dto%2Dspeech%20model%20capable%20of%20high%2Dfidelity%20real%2Dtime%20voice%20cloning.%20(Try%20it%20for%20free.)">Zyphra Zonos-v0.1</a></p></li></ul><p>Many of them can clone a voice from only 10 seconds of input audio.</p><p>It doesn&#8217;t take a huge mental leap to imagine all sorts of unpleasant shenanigans:</p><ul><li><p>Fake UGC influencers</p></li><li><p>Political or celebrity deepfakes</p></li><li><p>Scams featuring friends and family</p></li></ul><p>Last February, a Zoom deepfake <a href="https://www.cnn.com/2024/02/04/asia/deepfake-cfo-scam-hong-kong-intl-hnk">convinced a finance worker to pay $25 million</a> to scammers. This year, a would-be scammer needs just one image.</p><p>Even if you don&#8217;t think HeyGen&#8217;s avatars are <em>that</em> convincing, how long will it take for them to evolve? 
At <a href="https://www.whytryai.com/p/ai-progress-revisited">the current pace of AI developments</a>, next Tuesday is a safe bet.</p><p>So while I appreciate all the positive use cases like guided tutorials or personalized messages, our guardrails had better catch up with this new reality soon.</p><p>What do you think?</p><h2>&#129781; Over to you&#8230;</h2><p>Have you tried Avatar IV yet? If so, what did you think of the output?</p><p>What&#8217;s your general take on this tech? Are you worried about it enabling deepfakes and scams? Or do you trust that we&#8217;ll figure it out? 
Share your hopes and your fears!</p><p>Leave a comment or drop me a line at <a href="mailto:whytryai@substack.com">whytryai@substack.com</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/p/heygen-avatar-iv-deepfakes/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/p/heygen-avatar-iv-deepfakes/comments"><span>Leave a comment</span></a></p><div><hr></div><h2>Thanks for reading!</h2><p>If you enjoy my writing, here&#8217;s how you can help:</p><ul><li><p>&#10084;&#65039;<strong>Like</strong> this post if it resonates with you.</p></li><li><p>&#128260;<strong>Share</strong> it to help others discover my newsletter.</p></li><li><p>&#128483;&#65039;<strong>Comment</strong> below&#8212;I love hearing your opinions.</p></li></ul><p><strong>Why Try AI</strong> is a passion project, and I&#8217;m grateful to those who help keep it going. 
If you&#8217;d like to support my work and <strong><a href="https://www.whytryai.com/p/paid-subscriber-bonuses">unlock cool perks</a></strong>, consider a paid subscription:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[Don’t Sleep On Genspark’s Super Agent]]></title><description><![CDATA[Genspark is better than the much-hyped Manus, yet it has sailed under the radar.]]></description><link>https://www.whytryai.com/p/genspark-super-agent</link><guid isPermaLink="false">https://www.whytryai.com/p/genspark-super-agent</guid><dc:creator><![CDATA[Daniel Nest]]></dc:creator><pubDate>Thu, 10 Apr 2025 11:41:24 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/303b0c70-da98-470b-8936-a43034393faf_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote><p><em>Yet another Thursday post in the &#8220;<a href="https://www.whytryai.com/s/hot-takes">Hot Takes</a>&#8221; format.</em></p></blockquote><h2>TL;DR</h2><p>Genspark&#8217;s Super Agent is a genuinely competent general agent that reasons and performs complex tasks, but it&#8217;s been overlooked in the avalanche of agent hype.</p><h2>What is it?</h2><p>Last week, <a href="https://mainfunc.ai/blog/genspark_super_agent">Genspark announced</a> its &#8220;fast and reliable&#8221; Super Agent:</p><div id="youtube2-mXJkGF37rAE" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;mXJkGF37rAE&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe 
src="https://www.youtube-nocookie.com/embed/mXJkGF37rAE?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>That video currently sits at a criminally low 32K views <a href="https://www.youtube.com/watch?v=K27diMbCsuw">compared to Manus AI&#8217;s 700K+</a>, certainly not helped by the monotonous presentation accompanied by a generic 1990s TV infomercial soundtrack.</p><p>But under the hood, Super Agent is surprisingly effective at orchestrating dozens of moving parts to complete complex tasks.</p><p>Super Agent appears to be powered by a reasoning model&#8212;perhaps o1 or o3-mini&#8212;which processes your request, then calls on Genspark&#8217;s specialized agents, LLMs, image models, video models, and other tools based on the task.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CfX5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60e2c5d6-e68b-4ddd-b49d-9dc445871185_2880x1620.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CfX5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60e2c5d6-e68b-4ddd-b49d-9dc445871185_2880x1620.png 424w, https://substackcdn.com/image/fetch/$s_!CfX5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60e2c5d6-e68b-4ddd-b49d-9dc445871185_2880x1620.png 848w, 
https://substackcdn.com/image/fetch/$s_!CfX5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60e2c5d6-e68b-4ddd-b49d-9dc445871185_2880x1620.png 1272w, https://substackcdn.com/image/fetch/$s_!CfX5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60e2c5d6-e68b-4ddd-b49d-9dc445871185_2880x1620.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CfX5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60e2c5d6-e68b-4ddd-b49d-9dc445871185_2880x1620.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/60e2c5d6-e68b-4ddd-b49d-9dc445871185_2880x1620.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Genspark Super Agent under the hood&quot;,&quot;title&quot;:&quot;Genspark Super Agent under the hood&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Genspark Super Agent under the hood" title="Genspark Super Agent under the hood" srcset="https://substackcdn.com/image/fetch/$s_!CfX5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60e2c5d6-e68b-4ddd-b49d-9dc445871185_2880x1620.png 424w, https://substackcdn.com/image/fetch/$s_!CfX5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60e2c5d6-e68b-4ddd-b49d-9dc445871185_2880x1620.png 848w, 
https://substackcdn.com/image/fetch/$s_!CfX5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60e2c5d6-e68b-4ddd-b49d-9dc445871185_2880x1620.png 1272w, https://substackcdn.com/image/fetch/$s_!CfX5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60e2c5d6-e68b-4ddd-b49d-9dc445871185_2880x1620.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <strong><a 
href="https://mainfunc.ai/blog/genspark_super_agent">MainFunc</a></strong></figcaption></figure></div><p>Among other things, Super Agent can use:</p><ul><li><p>Image models to make visuals</p></li><li><p>Video models to create short clips</p></li><li><p>Text-to-speech models to generate voices</p></li><li><p>Coding skills to write functioning code</p></li><li><p>Deep research agents for, uh, deep research</p></li><li><p>&#8230;and lots of other stuff.</p></li></ul><p>It also has a &#8220;Call for me&#8221; option that uses a voice agent to call up and talk to real people and businesses on your behalf, but I haven&#8217;t tested this one myself.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>Super Agent beats both Manus AI and OpenAI Deep Research on the <a href="https://arxiv.org/abs/2311.12983">GAIA Benchmark</a> for general assistants:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!byL1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10ff393a-95d4-4b3a-82de-636707d05eb8_2880x1620.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!byL1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10ff393a-95d4-4b3a-82de-636707d05eb8_2880x1620.png 424w, https://substackcdn.com/image/fetch/$s_!byL1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10ff393a-95d4-4b3a-82de-636707d05eb8_2880x1620.png 848w, 
https://substackcdn.com/image/fetch/$s_!byL1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10ff393a-95d4-4b3a-82de-636707d05eb8_2880x1620.png 1272w, https://substackcdn.com/image/fetch/$s_!byL1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10ff393a-95d4-4b3a-82de-636707d05eb8_2880x1620.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!byL1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10ff393a-95d4-4b3a-82de-636707d05eb8_2880x1620.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/10ff393a-95d4-4b3a-82de-636707d05eb8_2880x1620.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Genspark Super Agent GAIA benchmark&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Genspark Super Agent GAIA benchmark" title="Genspark Super Agent GAIA benchmark" srcset="https://substackcdn.com/image/fetch/$s_!byL1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10ff393a-95d4-4b3a-82de-636707d05eb8_2880x1620.png 424w, https://substackcdn.com/image/fetch/$s_!byL1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10ff393a-95d4-4b3a-82de-636707d05eb8_2880x1620.png 848w, 
https://substackcdn.com/image/fetch/$s_!byL1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10ff393a-95d4-4b3a-82de-636707d05eb8_2880x1620.png 1272w, https://substackcdn.com/image/fetch/$s_!byL1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10ff393a-95d4-4b3a-82de-636707d05eb8_2880x1620.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Source: <strong><a 
href="https://mainfunc.ai/blog/genspark_super_agent">MainFunc</a></strong></figcaption></figure></div><p>Let&#8217;s be honest, though: We&#8217;ve been inundated with such benchmarks lately. </p><p>These days, my eyes usually glaze over when I see another comparison chart, which helps explain why the announcement got <a href="https://www.whytryai.com/p/sunday-rundown-94-agents-video#:~:text=Genspark%20now%20has%20a%20%E2%80%9CSuper%20Agent%E2%80%9D">only a bland mention</a> in my last roundup.</p><p>But having now used Super Agent for several real-world tasks, I can say that this is the first time I&#8217;ve been impressed by a general AI agent.</p><p>In the past week, Genspark Super Agent:</p><ul><li><p>Helped me research, consolidate, and code <a href="https://www.whytryai.com/i/160256477/sunday-bonus-use-cases-for-gpt-o-image-generation-swipe-file">the GPT-4o swipe file</a> for my paid subscribers.</p></li><li><p>Found a fast charger for my phone in a Danish online store. It nailed this task, arriving at the same recommendation I did after longer manual research. (<a href="https://www.genspark.ai/agents?id=fd8f86b3-42ad-4c33-a92a-ba41a6555597">Read the chat</a>.)</p></li><li><p>One-shot coded <a href="https://page.genspark.site/page/toolu_01PAmmaS1hFyjrieGbstNXQo/whytryai_landing_page.html">a landing page</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> based on a simple request, done as a top-level test of its capabilities. 
(<a href="https://www.genspark.ai/agents?id=8f28745c-16f6-4672-90bf-c1358cc19b43">Read the chat</a>.)</p></li><li><p>&#8230;and did equally well in several other minor test tasks.</p></li></ul><p>In short, it just works!</p><p>Genspark lists <a href="https://mainfunc.ai/blog/genspark_super_agent#:~:text=your%20everyday%20tasks.-,Examples%3A,-Plan%20Travel%20to">11 real-world examples</a>, and I&#8217;d say they&#8217;re representative of Super Agent&#8217;s actual capabilities.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><h2>How do you use it?</h2><p>It&#8217;s stupid simple:</p><ol><li><p>Go to <a href="https://www.genspark.ai/">genspark.ai</a> (sign up for a free account if needed).</p></li><li><p>Type your task or request into the big box on the front page:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YWV5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F758cd726-1552-4554-b127-31af9b000973_706x377.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YWV5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F758cd726-1552-4554-b127-31af9b000973_706x377.png 424w, https://substackcdn.com/image/fetch/$s_!YWV5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F758cd726-1552-4554-b127-31af9b000973_706x377.png 848w, 
https://substackcdn.com/image/fetch/$s_!YWV5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F758cd726-1552-4554-b127-31af9b000973_706x377.png 1272w, https://substackcdn.com/image/fetch/$s_!YWV5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F758cd726-1552-4554-b127-31af9b000973_706x377.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YWV5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F758cd726-1552-4554-b127-31af9b000973_706x377.png" width="706" height="377" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/758cd726-1552-4554-b127-31af9b000973_706x377.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:377,&quot;width&quot;:706,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:36994,&quot;alt&quot;:&quot;I'm writing about the capabilities of Genspark Super Agent (that's you).  Please use every tool at your disposal (image generation, video generation, coding, text to speech, etc.) in a creative way that demos what it can do quickly and efficiently.  Then, code a nice, visually appealing page that brings all of these together in an engaging way that users can explore and experience.  
Make me proud!&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/160946883?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F758cd726-1552-4554-b127-31af9b000973_706x377.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="I'm writing about the capabilities of Genspark Super Agent (that's you).  Please use every tool at your disposal (image generation, video generation, coding, text to speech, etc.) in a creative way that demos what it can do quickly and efficiently.  Then, code a nice, visually appealing page that brings all of these together in an engaging way that users can explore and experience.  Make me proud!" title="I'm writing about the capabilities of Genspark Super Agent (that's you).  Please use every tool at your disposal (image generation, video generation, coding, text to speech, etc.) in a creative way that demos what it can do quickly and efficiently.  Then, code a nice, visually appealing page that brings all of these together in an engaging way that users can explore and experience.  Make me proud!" 
srcset="https://substackcdn.com/image/fetch/$s_!YWV5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F758cd726-1552-4554-b127-31af9b000973_706x377.png 424w, https://substackcdn.com/image/fetch/$s_!YWV5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F758cd726-1552-4554-b127-31af9b000973_706x377.png 848w, https://substackcdn.com/image/fetch/$s_!YWV5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F758cd726-1552-4554-b127-31af9b000973_706x377.png 1272w, https://substackcdn.com/image/fetch/$s_!YWV5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F758cd726-1552-4554-b127-31af9b000973_706x377.png 1456w" sizes="100vw" loading="lazy"></picture>
</div></a></figure></div></li><li><p>Watch Super Agent break down its thinking as it works through your task.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!idLp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb745bed-705b-4475-aeae-e4c254071b9e_703x544.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!idLp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb745bed-705b-4475-aeae-e4c254071b9e_703x544.png 424w, https://substackcdn.com/image/fetch/$s_!idLp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb745bed-705b-4475-aeae-e4c254071b9e_703x544.png 848w, https://substackcdn.com/image/fetch/$s_!idLp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb745bed-705b-4475-aeae-e4c254071b9e_703x544.png 1272w, https://substackcdn.com/image/fetch/$s_!idLp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb745bed-705b-4475-aeae-e4c254071b9e_703x544.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!idLp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb745bed-705b-4475-aeae-e4c254071b9e_703x544.png" 
width="703" height="544" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eb745bed-705b-4475-aeae-e4c254071b9e_703x544.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:544,&quot;width&quot;:703,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:119414,&quot;alt&quot;:&quot;I'll showcase my capabilities as Genspark Super Agent using the most efficient models while still demonstrating what I can do. Let's begin!  1. First, let's gather some information about Genspark capabilities Using Tool | Search Genspark AI assistant View Using Tool | Search Genspark Super Agent View Let's explore the specific capabilities I can showcase:  2. Image Generation Demonstration Using Tool | Image Generation A futuristic digital assistant robot with glowing interfaces, showing multiple capabilities li&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/160946883?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb745bed-705b-4475-aeae-e4c254071b9e_703x544.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="I'll showcase my capabilities as Genspark Super Agent using the most efficient models while still demonstrating what I can do. Let's begin!  1. First, let's gather some information about Genspark capabilities Using Tool | Search Genspark AI assistant View Using Tool | Search Genspark Super Agent View Let's explore the specific capabilities I can showcase:  2. 
Image Generation Demonstration Using Tool | Image Generation A futuristic digital assistant robot with glowing interfaces, showing multiple capabilities li" title="I'll showcase my capabilities as Genspark Super Agent using the most efficient models while still demonstrating what I can do. Let's begin!  1. First, let's gather some information about Genspark capabilities Using Tool | Search Genspark AI assistant View Using Tool | Search Genspark Super Agent View Let's explore the specific capabilities I can showcase:  2. Image Generation Demonstration Using Tool | Image Generation A futuristic digital assistant robot with glowing interfaces, showing multiple capabilities li" srcset="https://substackcdn.com/image/fetch/$s_!idLp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb745bed-705b-4475-aeae-e4c254071b9e_703x544.png 424w, https://substackcdn.com/image/fetch/$s_!idLp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb745bed-705b-4475-aeae-e4c254071b9e_703x544.png 848w, https://substackcdn.com/image/fetch/$s_!idLp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb745bed-705b-4475-aeae-e4c254071b9e_703x544.png 1272w, https://substackcdn.com/image/fetch/$s_!idLp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb745bed-705b-4475-aeae-e4c254071b9e_703x544.png 1456w" sizes="100vw" loading="lazy"></picture>
</div></a></figure></div></li></ol><p>At each step, you can click the &#8220;View&#8221; button to expand the details and see the agent&#8217;s thinking.</p><p>Once the initial task is finished, you can request changes via chat.</p><p>Here&#8217;s Super Agent&#8217;s one-take output for the above request:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://genspark.genspark.site/slides?project_id=d63887b8-6458-4487-8fb9-afa5b68bd5d2&amp;slide_id=toolu_016ewZLTsYHG74w4nZeoRLVg&quot;,&quot;text&quot;:&quot;Genspark's page about Genspark&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://genspark.genspark.site/slides?project_id=d63887b8-6458-4487-8fb9-afa5b68bd5d2&amp;slide_id=toolu_016ewZLTsYHG74w4nZeoRLVg"><span>Genspark's page about Genspark</span></a></p><p>Genspark gives you 200 daily credits for free, which is decent for testing text-based and coding tasks but probably won&#8217;t be enough for 
stuff that requires more advanced third-party models.</p><h2>Why should you care?</h2><p>AI agents have been hyped relentlessly since 2023, when <a href="https://babyagi.org/">BabyAGI</a> and <a href="https://github.com/Significant-Gravitas/AutoGPT">AutoGPT</a> were making a splash.</p><p>But the gap between flashy hypothetical demos and agents that actually work has been hard to close.</p><p>In October 2024, we started getting our first glimpses of functioning general agents<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> when Anthropic launched &#8220;<a href="https://www.anthropic.com/news/3-5-models-and-computer-use">Computer Use</a>.&#8221; </p><p>Soon, we had OpenAI&#8217;s &#8220;<a href="https://openai.com/index/introducing-operator/">Operator</a>&#8221; in January 2025, Convergence AI&#8217;s &#8220;<a href="https://x.com/convergence_ai_/status/1892129466610073931">Proxy</a>&#8221; in February, and finally &#8220;<a href="https://manus.im/">Manus</a>&#8221; in March.</p><p>But they all came with caveats like waitlists (Manus), invite-only or expensive research previews (OpenAI), the need to install and run clunky local scripts (Anthropic), and so on. Proxy is excellent, but it&#8217;s primarily confined to web browsing tasks.</p><p>To me, Genspark&#8217;s Super Agent is the first general agent that runs entirely inside a virtual environment, doesn&#8217;t require access to your computer, can reason and take many diverse actions, and is readily accessible to everyone.</p><p>It&#8217;s one of those rare moments in AI where reality <em>exceeds</em> the marketing hype.</p><p>To be sure, Super Agent is not immune to the usual AI blind spots like hallucinations, <a href="https://www.whytryai.com/p/are-we-even-ready-for-ai-search">imperfect web browsing</a>, and so on. 
At the same time, I find that it can usually fix things based on simple feedback without messing up something else in the process.</p><p>The best part? You don&#8217;t have to take my word for any of this. </p><p>Just sign up for a free account and take Genspark&#8217;s Super Agent for a quick spin.</p><p>You might be as positively surprised as I was.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Why Try AI is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>&#129781; Over to you&#8230;</h2><p>Have you tried Genspark Super Agent? What were your impressions? 
Have you had the chance to compare it to Manus or other AI agents?<br><br>I&#8217;d love to hear your thoughts about it.</p><p>Leave a comment or drop me a line at <a href="mailto:whytryai@substack.com">whytryai@substack.com</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/p/genspark-super-agent/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/p/genspark-super-agent/comments"><span>Leave a comment</span></a></p><div><hr></div><h2>Thanks for reading!</h2><p>If you enjoy my writing, here&#8217;s how you can help:</p><ul><li><p>&#10084;&#65039;<strong>Like</strong> this post if it resonates with you.</p></li><li><p>&#128260;<strong>Share</strong> it to help others discover this newsletter.</p></li><li><p>&#128483;&#65039;<strong>Comment</strong> below&#8212;I love hearing your opinions.</p></li></ul><p><strong>Why Try AI</strong> is a passion project, and I&#8217;m grateful to those who help keep it going. 
If you&#8217;d like to support my work and <strong><a href="https://www.whytryai.com/p/paid-subscriber-bonuses">unlock cool perks</a></strong>, consider a paid subscription:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>For now, &#8220;Call for me&#8221; is only available in the US and Japan.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>You can nitpick over the crappy brand image made by Flux or minor link issues, but all of this is easily fixable with a single round of feedback.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>We also had more specialized &#8220;<a href="https://www.whytryai.com/p/openai-deep-research">Deep Research</a>&#8221; agents from several competitors.</p></div></div>]]></content:encoded></item><item><title><![CDATA[OpenAI Launches 4o Image Generation, Ruins Reve’s Rad Reveal.]]></title><description><![CDATA[Reve AI picked the absolute worst day to announce its best-in-class image model.]]></description><link>https://www.whytryai.com/p/openai-4o-native-image-generation</link><guid isPermaLink="false">https://www.whytryai.com/p/openai-4o-native-image-generation</guid><dc:creator><![CDATA[Daniel 
Nest]]></dc:creator><pubDate>Wed, 26 Mar 2025 12:27:15 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/d2c65244-9075-4887-b98d-3471a1f1654f_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>For inspiration:</strong> <em>Grab <a href="https://www.whytryai.com/i/160256477/sunday-bonus-use-cases-for-gpt-o-image-generation-swipe-file">my swipe file</a> with 90+ use cases for GPT-4o image generation.</em></p><div><hr></div><h2>TL;DR</h2><p>ChatGPT can now see, reason about, create, and edit images right inside your chat&#8212;stealing the thunder from traditional image models.</p><h2>What is it?</h2><p>It&#8217;s OpenAI&#8217;s answer to Gemini 2.0 Flash with native image generation, which I covered just two weeks ago:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;9f5ec7a9-5246-4921-8e3b-bb5ee23c8230&quot;,&quot;caption&quot;:&quot;Today&#8217;s post is also a developing story, so the &#8220;Hot Take&#8221; format fits nicely.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;showDescription&quot;:true,&quot;showImage&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Gemini 2.0 Flash Makes Mediocre Images...But That's Not The Point!&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:103658370,&quot;name&quot;:&quot;Daniel Nest&quot;,&quot;bio&quot;:&quot;I write about generative AI for the average person. 
I love experimenting with all GenAI, including AI images, video, music, chatbots, and more.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3cf75e3-f197-48b0-999b-d73cbb1a8ad5_1321x1321.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2025-03-13T13:13:09.649Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d89c5141-8340-4377-acb6-77c31a6aec00_1408x768.jpeg&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.whytryai.com/p/gemini-2-0-flash-native-image-generation&quot;,&quot;section_name&quot;:&quot;Hot Takes&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:158977694,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:14,&quot;comment_count&quot;:7,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;Why Try AI&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F280198ae-022f-470f-80e0-e029815a33ca_850x850.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p>In that post, I dropped this pretty spot-on prediction:</p><blockquote><p>If I were a betting man, I&#8217;d say we&#8217;re about to see OpenAI follow suit. 
We already know that GPT-4o can do the same stuff:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RY67!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RY67!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 424w, https://substackcdn.com/image/fetch/$s_!RY67!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 848w, https://substackcdn.com/image/fetch/$s_!RY67!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 1272w, https://substackcdn.com/image/fetch/$s_!RY67!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RY67!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png" width="571" height="761" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:761,&quot;width&quot;:571,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:245119,&quot;alt&quot;:&quot;Input A first person view of a robot typewriting the following journal entries:  1. yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?  the text is large, legible and clear. the robot's hands type on the typewriter.  2 Output Robot on typewriter 3 Input The robot wrote the second entry. The page is now taller. The page has moved up. There are two entries on the sheet:  yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?  sound update just dropped, and it's wild. everything's got a vibe now, every sound's like a new secret. makes you think, what else am i missing?&quot;,&quot;title&quot;:&quot;Input A first person view of a robot typewriting the following journal entries:  1. yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?  the text is large, legible and clear. the robot's hands type on the typewriter.  2 Output Robot on typewriter 3 Input The robot wrote the second entry. The page is now taller. The page has moved up. There are two entries on the sheet:  yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?  sound update just dropped, and it's wild. everything's got a vibe now, every sound's like a new secret. 
makes you think, what else am i missing?&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Input A first person view of a robot typewriting the following journal entries:  1. yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?  the text is large, legible and clear. the robot's hands type on the typewriter.  2 Output Robot on typewriter 3 Input The robot wrote the second entry. The page is now taller. The page has moved up. There are two entries on the sheet:  yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?  sound update just dropped, and it's wild. everything's got a vibe now, every sound's like a new secret. makes you think, what else am i missing?" title="Input A first person view of a robot typewriting the following journal entries:  1. yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?  the text is large, legible and clear. the robot's hands type on the typewriter.  2 Output Robot on typewriter 3 Input The robot wrote the second entry. The page is now taller. The page has moved up. There are two entries on the sheet:  yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?  sound update just dropped, and it's wild. everything's got a vibe now, every sound's like a new secret. makes you think, what else am i missing?" 
srcset="https://substackcdn.com/image/fetch/$s_!RY67!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 424w, https://substackcdn.com/image/fetch/$s_!RY67!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 848w, https://substackcdn.com/image/fetch/$s_!RY67!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 1272w, https://substackcdn.com/image/fetch/$s_!RY67!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 1456w" sizes="100vw" loading="lazy" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" 
stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <strong>GPT-4o <a href="https://openai.com/index/hello-gpt-4o/#:~:text=Explorations%20of%20capabilities">announcement post</a></strong>.</figcaption></figure></div><p>After all, the &#8220;o&#8221; in GPT-4o stands for &#8220;omni&#8221; or &#8220;omnimodal.&#8221;</p><p>It&#8217;s just that most of us weren&#8217;t given access to all of the modalities yet.</p><p>In a <a href="https://www.reddit.com/r/OpenAI/comments/1ieonxv/comment/ma9udu9/?utm_source=share&amp;utm_medium=web3x&amp;utm_name=web3xcss&amp;utm_term=1&amp;utm_content=share_button">recent Reddit AMA</a>, OpenAI&#8217;s Chief Product Officer Kevin Weil confirmed that multimodal image generation was coming:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OLWM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OLWM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 424w, https://substackcdn.com/image/fetch/$s_!OLWM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 848w, 
https://substackcdn.com/image/fetch/$s_!OLWM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 1272w, https://substackcdn.com/image/fetch/$s_!OLWM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OLWM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png" width="512" height="384" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/324ee040-7c63-4324-8582-c75822f02068_512x384.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:384,&quot;width&quot;:512,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:73834,&quot;alt&quot;:&quot;Are you still planning to roll out the 4o image generator?  Comment Image   Upvote 383  Downvote  Award  Share Share  u/kevinweil avatar kevinweil CO-HOST &#8226; 1mo ago OpenAI CPO Kevin Weil  emoji:OpenAIWhite:  | Verified  emoji:checkmark: Yes! We're working on it. And I think it's going to be worth the wait.&quot;,&quot;title&quot;:&quot;Are you still planning to roll out the 4o image generator?  Comment Image   Upvote 383  Downvote  Award  Share Share  u/kevinweil avatar kevinweil CO-HOST &#8226; 1mo ago OpenAI CPO Kevin Weil  emoji:OpenAIWhite:  | Verified  emoji:checkmark: Yes! We're working on it. 
And I think it's going to be worth the wait.&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Are you still planning to roll out the 4o image generator?  Comment Image   Upvote 383  Downvote  Award  Share Share  u/kevinweil avatar kevinweil CO-HOST &#8226; 1mo ago OpenAI CPO Kevin Weil  emoji:OpenAIWhite:  | Verified  emoji:checkmark: Yes! We're working on it. And I think it's going to be worth the wait." title="Are you still planning to roll out the 4o image generator?  Comment Image   Upvote 383  Downvote  Award  Share Share  u/kevinweil avatar kevinweil CO-HOST &#8226; 1mo ago OpenAI CPO Kevin Weil  emoji:OpenAIWhite:  | Verified  emoji:checkmark: Yes! We're working on it. And I think it's going to be worth the wait." 
srcset="https://substackcdn.com/image/fetch/$s_!OLWM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 424w, https://substackcdn.com/image/fetch/$s_!OLWM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 848w, https://substackcdn.com/image/fetch/$s_!OLWM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 1272w, https://substackcdn.com/image/fetch/$s_!OLWM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 1456w" sizes="100vw" loading="lazy" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" 
stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <strong><a href="https://www.reddit.com/r/OpenAI/comments/1ieonxv/comment/ma9udu9/?utm_source=share&amp;utm_medium=web3x&amp;utm_name=web3xcss&amp;utm_term=1&amp;utm_content=share_button">Reddit</a></strong>.</figcaption></figure></div><p>Now that Google&#8217;s version is out, the pressure is on OpenAI to catch up.</p><p>The landscape is changing fast.</p><p>We may soon wave goodbye to the era of separate features stitched into unholy amalgams. Instead, we&#8217;ll have truly omnimodal models handling everything on their own.</p></blockquote><p>Yesterday, my prediction came true with <a href="https://openai.com/index/introducing-4o-image-generation/">this announcement post</a> and a live stream:</p><div id="youtube2-2f3K43FHRKo" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;2f3K43FHRKo&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/2f3K43FHRKo?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>But compared to Gemini 2.0&#8217;s output, images created by 4o are of really high quality, with an even more precise level of detail and text accuracy.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe 
now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><h2>How do you use it?</h2><p>There are two ways.</p><h3>1. Directly in ChatGPT</h3><p>This is the most intuitive option for most ChatGPT users. Native image generation is currently rolling out to all accounts.</p><p>Simply go to ChatGPT and ask the default GPT-4o model to create an image.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hNF8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe07cba2-552d-4314-8378-3091f533998d_734x689.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hNF8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe07cba2-552d-4314-8378-3091f533998d_734x689.png 424w, https://substackcdn.com/image/fetch/$s_!hNF8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe07cba2-552d-4314-8378-3091f533998d_734x689.png 848w, https://substackcdn.com/image/fetch/$s_!hNF8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe07cba2-552d-4314-8378-3091f533998d_734x689.png 1272w, https://substackcdn.com/image/fetch/$s_!hNF8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe07cba2-552d-4314-8378-3091f533998d_734x689.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!hNF8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe07cba2-552d-4314-8378-3091f533998d_734x689.png" width="451" height="423.350136239782" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fe07cba2-552d-4314-8378-3091f533998d_734x689.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:689,&quot;width&quot;:734,&quot;resizeWidth&quot;:451,&quot;bytes&quot;:367124,&quot;alt&quot;:&quot;Create an image of a dog - ChatGPT prompt&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/159895653?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe07cba2-552d-4314-8378-3091f533998d_734x689.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Create an image of a dog - ChatGPT prompt" title="Create an image of a dog - ChatGPT prompt" srcset="https://substackcdn.com/image/fetch/$s_!hNF8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe07cba2-552d-4314-8378-3091f533998d_734x689.png 424w, https://substackcdn.com/image/fetch/$s_!hNF8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe07cba2-552d-4314-8378-3091f533998d_734x689.png 848w, https://substackcdn.com/image/fetch/$s_!hNF8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe07cba2-552d-4314-8378-3091f533998d_734x689.png 1272w, 
https://substackcdn.com/image/fetch/$s_!hNF8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe07cba2-552d-4314-8378-3091f533998d_734x689.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Here&#8217;s how you&#8217;ll know whether native image generation has kicked in for you:</p><ol><li><p>You&#8217;ll see the image appearing line by line, instead of DALL-E 3 working on it behind the scenes as before:</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" 
href="https://substackcdn.com/image/fetch/$s_!z5e0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe37efe6c-abcb-4618-81ba-9afc38ed0060_240x240.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!z5e0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe37efe6c-abcb-4618-81ba-9afc38ed0060_240x240.gif 424w, https://substackcdn.com/image/fetch/$s_!z5e0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe37efe6c-abcb-4618-81ba-9afc38ed0060_240x240.gif 848w, https://substackcdn.com/image/fetch/$s_!z5e0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe37efe6c-abcb-4618-81ba-9afc38ed0060_240x240.gif 1272w, https://substackcdn.com/image/fetch/$s_!z5e0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe37efe6c-abcb-4618-81ba-9afc38ed0060_240x240.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!z5e0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe37efe6c-abcb-4618-81ba-9afc38ed0060_240x240.gif" width="502" height="502" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e37efe6c-abcb-4618-81ba-9afc38ed0060_240x240.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:240,&quot;width&quot;:240,&quot;resizeWidth&quot;:502,&quot;bytes&quot;:2188318,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/159895653?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe37efe6c-abcb-4618-81ba-9afc38ed0060_240x240.gif&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!z5e0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe37efe6c-abcb-4618-81ba-9afc38ed0060_240x240.gif 424w, https://substackcdn.com/image/fetch/$s_!z5e0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe37efe6c-abcb-4618-81ba-9afc38ed0060_240x240.gif 848w, https://substackcdn.com/image/fetch/$s_!z5e0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe37efe6c-abcb-4618-81ba-9afc38ed0060_240x240.gif 1272w, https://substackcdn.com/image/fetch/$s_!z5e0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe37efe6c-abcb-4618-81ba-9afc38ed0060_240x240.gif 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><ol start="2"><li><p>You&#8217;ll see a different mix of buttons when clicking on the finished image. 
In the past, you were able to see the prompt that ChatGPT fed to DALL-E 3:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!n-3B!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4835fe-9574-4954-ae16-3724d4cf0de0_1182x299.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!n-3B!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4835fe-9574-4954-ae16-3724d4cf0de0_1182x299.png 424w, https://substackcdn.com/image/fetch/$s_!n-3B!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4835fe-9574-4954-ae16-3724d4cf0de0_1182x299.png 848w, https://substackcdn.com/image/fetch/$s_!n-3B!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4835fe-9574-4954-ae16-3724d4cf0de0_1182x299.png 1272w, https://substackcdn.com/image/fetch/$s_!n-3B!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4835fe-9574-4954-ae16-3724d4cf0de0_1182x299.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!n-3B!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4835fe-9574-4954-ae16-3724d4cf0de0_1182x299.png" width="1182" height="299" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/df4835fe-9574-4954-ae16-3724d4cf0de0_1182x299.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:299,&quot;width&quot;:1182,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:293738,&quot;alt&quot;:&quot;Prompt box in DALL-E 3 based version of ChatGPT&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/159895653?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4835fe-9574-4954-ae16-3724d4cf0de0_1182x299.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Prompt box in DALL-E 3 based version of ChatGPT" title="Prompt box in DALL-E 3 based version of ChatGPT" srcset="https://substackcdn.com/image/fetch/$s_!n-3B!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4835fe-9574-4954-ae16-3724d4cf0de0_1182x299.png 424w, https://substackcdn.com/image/fetch/$s_!n-3B!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4835fe-9574-4954-ae16-3724d4cf0de0_1182x299.png 848w, https://substackcdn.com/image/fetch/$s_!n-3B!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4835fe-9574-4954-ae16-3724d4cf0de0_1182x299.png 1272w, https://substackcdn.com/image/fetch/$s_!n-3B!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4835fe-9574-4954-ae16-3724d4cf0de0_1182x299.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex 
pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Now, you&#8217;ll see the rating and download options and the &#8220;Select&#8221; tool that lets you pick specific areas of an image to edit. 
</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!P_ND!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2fe869f8-9513-4312-845d-f8200908cabc_740x245.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!P_ND!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2fe869f8-9513-4312-845d-f8200908cabc_740x245.png 424w, https://substackcdn.com/image/fetch/$s_!P_ND!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2fe869f8-9513-4312-845d-f8200908cabc_740x245.png 848w, https://substackcdn.com/image/fetch/$s_!P_ND!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2fe869f8-9513-4312-845d-f8200908cabc_740x245.png 1272w, https://substackcdn.com/image/fetch/$s_!P_ND!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2fe869f8-9513-4312-845d-f8200908cabc_740x245.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!P_ND!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2fe869f8-9513-4312-845d-f8200908cabc_740x245.png" width="740" height="245" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2fe869f8-9513-4312-845d-f8200908cabc_740x245.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:245,&quot;width&quot;:740,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:128587,&quot;alt&quot;:&quot;New options in ChatGPT for image 
editing&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/159895653?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2fe869f8-9513-4312-845d-f8200908cabc_740x245.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="New options in ChatGPT for image editing" title="New options in ChatGPT for image editing" srcset="https://substackcdn.com/image/fetch/$s_!P_ND!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2fe869f8-9513-4312-845d-f8200908cabc_740x245.png 424w, https://substackcdn.com/image/fetch/$s_!P_ND!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2fe869f8-9513-4312-845d-f8200908cabc_740x245.png 848w, https://substackcdn.com/image/fetch/$s_!P_ND!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2fe869f8-9513-4312-845d-f8200908cabc_740x245.png 1272w, https://substackcdn.com/image/fetch/$s_!P_ND!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2fe869f8-9513-4312-845d-f8200908cabc_740x245.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 
15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>There is no DALL-E 3 prompt: GPT-4o creates the image all by itself based on your request.</p></li><li><p>You&#8217;ll be able to request complex, precise changes in natural language and GPT-4o will handle these like a pro:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qarD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6aa37644-6082-49a8-b8b8-67b50d62711a_680x681.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qarD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6aa37644-6082-49a8-b8b8-67b50d62711a_680x681.png 424w, https://substackcdn.com/image/fetch/$s_!qarD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6aa37644-6082-49a8-b8b8-67b50d62711a_680x681.png 848w, 
https://substackcdn.com/image/fetch/$s_!qarD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6aa37644-6082-49a8-b8b8-67b50d62711a_680x681.png 1272w, https://substackcdn.com/image/fetch/$s_!qarD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6aa37644-6082-49a8-b8b8-67b50d62711a_680x681.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qarD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6aa37644-6082-49a8-b8b8-67b50d62711a_680x681.png" width="680" height="681" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6aa37644-6082-49a8-b8b8-67b50d62711a_680x681.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:681,&quot;width&quot;:680,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:361681,&quot;alt&quot;:&quot;Turn this into a cartoon, make the dog blue, give it a bright green baseball cap. Keep the scene composition exactly the same.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/159895653?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6aa37644-6082-49a8-b8b8-67b50d62711a_680x681.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Turn this into a cartoon, make the dog blue, give it a bright green baseball cap. Keep the scene composition exactly the same." title="Turn this into a cartoon, make the dog blue, give it a bright green baseball cap. Keep the scene composition exactly the same." 
srcset="https://substackcdn.com/image/fetch/$s_!qarD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6aa37644-6082-49a8-b8b8-67b50d62711a_680x681.png 424w, https://substackcdn.com/image/fetch/$s_!qarD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6aa37644-6082-49a8-b8b8-67b50d62711a_680x681.png 848w, https://substackcdn.com/image/fetch/$s_!qarD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6aa37644-6082-49a8-b8b8-67b50d62711a_680x681.png 1272w, https://substackcdn.com/image/fetch/$s_!qarD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6aa37644-6082-49a8-b8b8-67b50d62711a_680x681.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" 
stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#8230;and then&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!PzUI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484ca71c-c1df-4c74-b577-efef8cb0a457_691x704.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PzUI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484ca71c-c1df-4c74-b577-efef8cb0a457_691x704.png 424w, https://substackcdn.com/image/fetch/$s_!PzUI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484ca71c-c1df-4c74-b577-efef8cb0a457_691x704.png 848w, https://substackcdn.com/image/fetch/$s_!PzUI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484ca71c-c1df-4c74-b577-efef8cb0a457_691x704.png 1272w, https://substackcdn.com/image/fetch/$s_!PzUI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484ca71c-c1df-4c74-b577-efef8cb0a457_691x704.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PzUI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484ca71c-c1df-4c74-b577-efef8cb0a457_691x704.png" width="691" height="704" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/484ca71c-c1df-4c74-b577-efef8cb0a457_691x704.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:704,&quot;width&quot;:691,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:404913,&quot;alt&quot;:&quot;Now make it a cat and add text that says \&quot;These aren't the pets you're looking for!\&quot; at the bottom on a yellow background with fancy cursive font in red letters.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/159895653?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484ca71c-c1df-4c74-b577-efef8cb0a457_691x704.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Now make it a cat and add text that says &quot;These aren't the pets you're looking for!&quot; at the bottom on a yellow background with fancy cursive font in red letters." title="Now make it a cat and add text that says &quot;These aren't the pets you're looking for!&quot; at the bottom on a yellow background with fancy cursive font in red letters." 
srcset="https://substackcdn.com/image/fetch/$s_!PzUI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484ca71c-c1df-4c74-b577-efef8cb0a457_691x704.png 424w, https://substackcdn.com/image/fetch/$s_!PzUI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484ca71c-c1df-4c74-b577-efef8cb0a457_691x704.png 848w, https://substackcdn.com/image/fetch/$s_!PzUI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484ca71c-c1df-4c74-b577-efef8cb0a457_691x704.png 1272w, https://substackcdn.com/image/fetch/$s_!PzUI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484ca71c-c1df-4c74-b577-efef8cb0a457_691x704.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" 
stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li></ol><p>Go ahead and give it a try!</p><h3>2. On Sora.com</h3><p>If you pay for ChatGPT, you can also use <a href="https://sora.com/">sora.com</a>.</p><p>Simply navigate to the site, then switch the format from &#8220;Video&#8221; to &#8220;Image&#8221;:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vs5j!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe743ed0e-aef9-4c3e-98ec-28c1b8e18fa2_560x240.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vs5j!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe743ed0e-aef9-4c3e-98ec-28c1b8e18fa2_560x240.gif 424w, https://substackcdn.com/image/fetch/$s_!vs5j!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe743ed0e-aef9-4c3e-98ec-28c1b8e18fa2_560x240.gif 848w, https://substackcdn.com/image/fetch/$s_!vs5j!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe743ed0e-aef9-4c3e-98ec-28c1b8e18fa2_560x240.gif 1272w, https://substackcdn.com/image/fetch/$s_!vs5j!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe743ed0e-aef9-4c3e-98ec-28c1b8e18fa2_560x240.gif 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!vs5j!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe743ed0e-aef9-4c3e-98ec-28c1b8e18fa2_560x240.gif" width="560" height="240" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e743ed0e-aef9-4c3e-98ec-28c1b8e18fa2_560x240.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:240,&quot;width&quot;:560,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:5710501,&quot;alt&quot;:&quot;Switching from \&quot;Video\&quot; to \&quot;Image\&quot; in Sora&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/159895653?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe743ed0e-aef9-4c3e-98ec-28c1b8e18fa2_560x240.gif&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Switching from &quot;Video&quot; to &quot;Image&quot; in Sora" title="Switching from &quot;Video&quot; to &quot;Image&quot; in Sora" srcset="https://substackcdn.com/image/fetch/$s_!vs5j!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe743ed0e-aef9-4c3e-98ec-28c1b8e18fa2_560x240.gif 424w, https://substackcdn.com/image/fetch/$s_!vs5j!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe743ed0e-aef9-4c3e-98ec-28c1b8e18fa2_560x240.gif 848w, https://substackcdn.com/image/fetch/$s_!vs5j!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe743ed0e-aef9-4c3e-98ec-28c1b8e18fa2_560x240.gif 1272w, 
https://substackcdn.com/image/fetch/$s_!vs5j!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe743ed0e-aef9-4c3e-98ec-28c1b8e18fa2_560x240.gif 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>You can even work with a reference image. I uploaded this lil&#8217; guy made in Midjourney:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Mluw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48eccf6a-3a64-4708-ba29-6bd995d49ea9_896x1344.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Mluw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48eccf6a-3a64-4708-ba29-6bd995d49ea9_896x1344.png 424w, https://substackcdn.com/image/fetch/$s_!Mluw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48eccf6a-3a64-4708-ba29-6bd995d49ea9_896x1344.png 848w, https://substackcdn.com/image/fetch/$s_!Mluw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48eccf6a-3a64-4708-ba29-6bd995d49ea9_896x1344.png 1272w, https://substackcdn.com/image/fetch/$s_!Mluw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48eccf6a-3a64-4708-ba29-6bd995d49ea9_896x1344.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Mluw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48eccf6a-3a64-4708-ba29-6bd995d49ea9_896x1344.png" width="204" height="306" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/48eccf6a-3a64-4708-ba29-6bd995d49ea9_896x1344.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1344,&quot;width&quot;:896,&quot;resizeWidth&quot;:204,&quot;bytes&quot;:1552215,&quot;alt&quot;:&quot;Cute blue robot made in Midjourney&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/159895653?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48eccf6a-3a64-4708-ba29-6bd995d49ea9_896x1344.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Cute blue robot made in Midjourney" title="Cute blue robot made in Midjourney" srcset="https://substackcdn.com/image/fetch/$s_!Mluw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48eccf6a-3a64-4708-ba29-6bd995d49ea9_896x1344.png 424w, https://substackcdn.com/image/fetch/$s_!Mluw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48eccf6a-3a64-4708-ba29-6bd995d49ea9_896x1344.png 848w, https://substackcdn.com/image/fetch/$s_!Mluw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48eccf6a-3a64-4708-ba29-6bd995d49ea9_896x1344.png 1272w, https://substackcdn.com/image/fetch/$s_!Mluw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48eccf6a-3a64-4708-ba29-6bd995d49ea9_896x1344.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" 
type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#8230;asked Sora to throw him into an epic sci-fi battle&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FdkR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed63c3d-4bd9-4258-8503-54517dc012f0_778x265.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FdkR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed63c3d-4bd9-4258-8503-54517dc012f0_778x265.png 424w, 
https://substackcdn.com/image/fetch/$s_!FdkR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed63c3d-4bd9-4258-8503-54517dc012f0_778x265.png 848w, https://substackcdn.com/image/fetch/$s_!FdkR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed63c3d-4bd9-4258-8503-54517dc012f0_778x265.png 1272w, https://substackcdn.com/image/fetch/$s_!FdkR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed63c3d-4bd9-4258-8503-54517dc012f0_778x265.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FdkR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed63c3d-4bd9-4258-8503-54517dc012f0_778x265.png" width="778" height="265" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/aed63c3d-4bd9-4258-8503-54517dc012f0_778x265.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:265,&quot;width&quot;:778,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:142095,&quot;alt&quot;:&quot;\&quot;Place this guy into an epic sci-fi battle\&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/159895653?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed63c3d-4bd9-4258-8503-54517dc012f0_778x265.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="&quot;Place this guy into an epic sci-fi battle&quot;" title="&quot;Place this guy into an epic sci-fi battle&quot;" 
srcset="https://substackcdn.com/image/fetch/$s_!FdkR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed63c3d-4bd9-4258-8503-54517dc012f0_778x265.png 424w, https://substackcdn.com/image/fetch/$s_!FdkR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed63c3d-4bd9-4258-8503-54517dc012f0_778x265.png 848w, https://substackcdn.com/image/fetch/$s_!FdkR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed63c3d-4bd9-4258-8503-54517dc012f0_778x265.png 1272w, https://substackcdn.com/image/fetch/$s_!FdkR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed63c3d-4bd9-4258-8503-54517dc012f0_778x265.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" 
stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#8230;and got this:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!n3oX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e800cb5-2c92-49af-b65f-654d0da83c63_1536x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!n3oX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e800cb5-2c92-49af-b65f-654d0da83c63_1536x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!n3oX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e800cb5-2c92-49af-b65f-654d0da83c63_1536x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!n3oX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e800cb5-2c92-49af-b65f-654d0da83c63_1536x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!n3oX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e800cb5-2c92-49af-b65f-654d0da83c63_1536x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!n3oX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e800cb5-2c92-49af-b65f-654d0da83c63_1536x1024.webp" width="1456" height="971" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4e800cb5-2c92-49af-b65f-654d0da83c63_1536x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Epic sci-fi battle featuring a reference character, made in Sora.com using new GPT-4o image generation&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Epic sci-fi battle featuring a reference character, made in Sora.com using new GPT-4o image generation" title="Epic sci-fi battle featuring a reference character, made in Sora.com using new GPT-4o image generation" srcset="https://substackcdn.com/image/fetch/$s_!n3oX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e800cb5-2c92-49af-b65f-654d0da83c63_1536x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!n3oX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e800cb5-2c92-49af-b65f-654d0da83c63_1536x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!n3oX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e800cb5-2c92-49af-b65f-654d0da83c63_1536x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!n3oX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e800cb5-2c92-49af-b65f-654d0da83c63_1536x1024.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 
pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Not too shabby at all.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><h2>Why should you care?</h2><p>First off, native image generation is an entirely new paradigm and a huge deal in its own right, as I explained in <a href="https://www.whytryai.com/p/gemini-2-0-flash-native-image-generation">the Gemini 2.0 Flash piece</a>.</p><p>But the implications become even greater when you pair it with output quality 
that can suddenly compete with many <a href="https://www.whytryai.com/p/text-to-image-ai-models">text-to-image diffusion models</a>.</p><p>Would you rather work with an old-school image model, which:</p><ul><li><p>Doesn&#8217;t follow directions as precisely</p></li><li><p>Struggles to <a href="https://www.whytryai.com/p/ai-image-model-spelling-text">spell stuff accurately</a></p></li><li><p>Can&#8217;t actually &#8220;see&#8221; what it generates and make seamless tweaks</p></li><li><p>Has limited understanding of the context of your request and can&#8217;t discuss it back and forth</p></li></ul><p>Or would you prefer a language model that handles all of those things better, via natural-language instructions within a familiar chat interface?</p><p>It&#8217;s a rhetorical question, but you&#8217;re welcome to shout the answer out loud at your screen!</p><p>I don&#8217;t see how regular image tools can compete with just how easy it is to request pictures and make changes via chat.</p><p>Many people already treat classic diffusion models like unapproachable black boxes.</p><p>Hell, I wrote about the uselessness of &#8220;splatterprompting&#8221; <em>over two years ago</em>:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;d8b4bfe2-5355-4841-928f-3e5bf0bf537a&quot;,&quot;caption&quot;:&quot;Eons ago, in the distant era of September 2022, Stable Diffusion was just entering the AI art scene.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;showDescription&quot;:true,&quot;showImage&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Bye, Splatterprompting. We Hardly Knew You.&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:103658370,&quot;name&quot;:&quot;Daniel Nest&quot;,&quot;bio&quot;:&quot;I write about generative AI for the average person. 
I love experimenting with all GenAI, including AI images, video, music, chatbots, and more.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3cf75e3-f197-48b0-999b-d73cbb1a8ad5_1321x1321.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2023-01-20T10:15:29.120Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cfb55736-7f8a-46a3-a298-022edbd5b086_1536x1024.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.whytryai.com/p/splatterprompting&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:97474603,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:13,&quot;comment_count&quot;:11,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;Why Try AI&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F280198ae-022f-470f-80e0-e029815a33ca_850x850.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p>I debunked Midjourney photography prompts last December:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;5e417d68-e964-40e8-9ec4-cbc7f119ea38&quot;,&quot;caption&quot;:&quot;If you know one thing about me, it&#8217;s that I haunt your inbox every Thursday and Sunday.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;showDescription&quot;:true,&quot;showImage&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Ditch These Pointless Midjourney Photography Terms&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:103658370,&quot;name&quot;:&quot;Daniel 
Nest&quot;,&quot;bio&quot;:&quot;I write about generative AI for the average person. I love experimenting with all GenAI, including AI images, video, music, chatbots, and more.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3cf75e3-f197-48b0-999b-d73cbb1a8ad5_1321x1321.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2024-12-19T19:31:36.435Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fd638be1-e151-4e4c-90d8-9f76acb5b6c9_1344x896.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.whytryai.com/p/midjourney-photography-terms&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:153198033,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:18,&quot;comment_count&quot;:13,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;Why Try AI&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F280198ae-022f-470f-80e0-e029815a33ca_850x850.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p>&#8230;yet I still keep seeing people use both of these constantly, simply copy-pasting random-descriptor-filled prompts at scale without reflection.</p><p>When you give users a model that intuitively knows what they want and can accurately create it on demand, they&#8217;ll flock to this model at the expense of alternatives.</p><p>Perhaps nothing illustrates this better than the launch of a new, state-of-the-art image model, which by cruel fate happened just before the OpenAI announcement.<a class="footnote-anchor" 
data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a></p><p>I&#8217;m talking about <a href="https://reveai.org/">Reve AI</a>.</p><p>Only a day earlier, Reve AI made a splash by releasing Reve Image: the &#8220;best image model in the world&#8221;:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/reveimage/status/1904211082870456824" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0gv5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45d043ad-c221-48d4-848d-69ce19541063_594x480.png 424w, https://substackcdn.com/image/fetch/$s_!0gv5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45d043ad-c221-48d4-848d-69ce19541063_594x480.png 848w, https://substackcdn.com/image/fetch/$s_!0gv5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45d043ad-c221-48d4-848d-69ce19541063_594x480.png 1272w, https://substackcdn.com/image/fetch/$s_!0gv5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45d043ad-c221-48d4-848d-69ce19541063_594x480.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0gv5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45d043ad-c221-48d4-848d-69ce19541063_594x480.png" width="594" height="480" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/45d043ad-c221-48d4-848d-69ce19541063_594x480.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:480,&quot;width&quot;:594,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:199427,&quot;alt&quot;:&quot;Reve announcement on Twitter&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/reveimage/status/1904211082870456824&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/159895653?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45d043ad-c221-48d4-848d-69ce19541063_594x480.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Reve announcement on Twitter" title="Reve announcement on Twitter" srcset="https://substackcdn.com/image/fetch/$s_!0gv5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45d043ad-c221-48d4-848d-69ce19541063_594x480.png 424w, https://substackcdn.com/image/fetch/$s_!0gv5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45d043ad-c221-48d4-848d-69ce19541063_594x480.png 848w, https://substackcdn.com/image/fetch/$s_!0gv5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45d043ad-c221-48d4-848d-69ce19541063_594x480.png 1272w, https://substackcdn.com/image/fetch/$s_!0gv5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45d043ad-c221-48d4-848d-69ce19541063_594x480.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 
pc-reset"></div></div></div></a><figcaption class="image-caption">Source: <strong><a href="https://x.com/reveimage/status/1904211082870456824">Twitter / X</a></strong></figcaption></figure></div><p>Reve Image currently tops text-to-image leaderboards from both <a href="https://huggingface.co/spaces/ArtificialAnalysis/Text-to-Image-Leaderboard">Artificial Analysis</a> and <a href="https://imgsys.org/rankings">imgsys</a>. 
</p><p>By all accounts, it&#8217;s an outstanding model.</p><p>I ran a few tests and am impressed by Reve&#8217;s quality, instruction following, and text rendering.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><p>Reve is also free to use, so go ahead and try it over at <strong><a href="https://preview.reve.art/">preview.reve.art</a></strong>.</p><p>But here&#8217;s the thing: While <a href="https://www.whytryai.com/i/149916535/ai-images">text-to-image nerds</a> like myself will happily try new sites and geek out about marginal improvements in diffusion models, most regular users will want something that &#8220;just works&#8221; inside a tool they&#8217;re already using.</p><p>And that&#8217;s exactly what the new 4o image creation in ChatGPT does.</p><p>My guess?</p><p>ChatGPT&#8217;s newfound drawing skills will open the floodgates for chat-based image generation by mainstream audiences.</p><p>I doubt it will wipe out existing text-to-image prompting methods overnight, but it&#8217;ll certainly shift the conversation toward a more intuitive way of doing things.</p><p>In fact, I won&#8217;t be surprised if we eventually look back at text-to-image prompt input boxes as relics of a bygone era.</p><p>Am I wrong?</p><h2>&#129781; Over to you&#8230;</h2><p>Am I too quick to dismiss current image models and their interfaces? 
Can chat-based image creation and text-to-image prompt boxes coexist? Do they serve different purposes and appeal to different groups of people? </p><p>Let&#8217;s talk!</p><p>Leave a comment or drop me a line at <a href="mailto:whytryai@substack.com">whytryai@substack.com</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/p/openai-4o-native-image-generation/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/p/openai-4o-native-image-generation/comments"><span>Leave a comment</span></a></p><div><hr></div><h2>Thanks for reading!</h2><p>If you enjoy my writing, here&#8217;s how you can help:</p><ul><li><p>&#10084;&#65039;<strong>Like</strong> this post if it resonates with you.</p></li><li><p>&#128279;<strong>Share</strong> it to help others discover this newsletter.</p></li><li><p>&#128489; <strong>Comment</strong> below&#8212;I love hearing your opinions.</p></li></ul><p><strong>Why Try AI</strong> is a passion project, and I&#8217;m grateful to those who help keep it going. 
If you want to support my work and <strong><a href="https://www.whytryai.com/p/paid-subscriber-bonuses">unlock cool perks</a></strong>, consider a paid subscription:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><div><hr></div><blockquote><p><em><strong>Hot Takes</strong> are occasional timely posts that focus on fast-moving news and releases, in addition to my regular Thursday and Sunday columns.</em></p><p><em>If <strong>Hot Takes </strong>aren&#8217;t your cup of tea, simply go to your account at <strong><a href="https://www.whytryai.com/account">www.whytryai.com/account</a> </strong>and toggle the &#8220;Notifications&#8221; settings accordingly:</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rr-K!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rr-K!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 424w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 848w, 
https://substackcdn.com/image/fetch/$s_!rr-K!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1272w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rr-K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png" width="745" height="268" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:268,&quot;width&quot;:745,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:20141,&quot;alt&quot;:&quot;Managing Notification settings in Substack - Why Try AI section toggles&quot;,&quot;title&quot;:&quot;Managing Notification settings in Substack - Why Try AI section toggles&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Managing Notification settings in Substack - Why Try AI section toggles" title="Managing Notification settings in Substack - Why Try AI section toggles" srcset="https://substackcdn.com/image/fetch/$s_!rr-K!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 424w, 
https://substackcdn.com/image/fetch/$s_!rr-K!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 848w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1272w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line 
x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></blockquote><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Although I fully expect Google to integrate the impressive quality of its Imagen 3 image model with the native image generation feature.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>OpenAI is freakishly good at stealing the show from other major launches. Remember how <a href="https://www.whytryai.com/p/10x-ai-39-open-ai-sora-google-gemini-1-5">Sora pulled the rug out</a> from under Gemini 1.5?</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>But I still prefer <a href="https://www.whytryai.com/t/midjourney">Midjourney&#8217;s aesthetic</a> for many of my test images.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Gemini 2.0 Flash Makes Mediocre Images...But That's Not The Point!]]></title><description><![CDATA[Image quality is a red herring. 
We're finally witnessing true multimodality.]]></description><link>https://www.whytryai.com/p/gemini-2-0-flash-native-image-generation</link><guid isPermaLink="false">https://www.whytryai.com/p/gemini-2-0-flash-native-image-generation</guid><dc:creator><![CDATA[Daniel Nest]]></dc:creator><pubDate>Thu, 13 Mar 2025 13:13:09 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/d89c5141-8340-4377-acb6-77c31a6aec00_1408x768.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Today&#8217;s post is also a developing story, so the &#8220;<strong><a href="https://www.whytryai.com/s/hot-takes">Hot Take</a></strong>&#8221; format fits nicely.</em></p><h2>TL;DR</h2><p>Gemini 2.0 Flash Experimental can create and edit images <em>natively</em>.</p><h2>What is it?</h2><p>Yesterday, Google&#8217;s Logan Kilpatrick announced the release of Gemini 2.0 Flash with native image generation:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/OfficialLoganK/status/1899853465922175427" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HSl5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90b781c5-f4ac-4cc7-8dc2-ac0b37adbe10_583x654.png 424w, https://substackcdn.com/image/fetch/$s_!HSl5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90b781c5-f4ac-4cc7-8dc2-ac0b37adbe10_583x654.png 848w, https://substackcdn.com/image/fetch/$s_!HSl5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90b781c5-f4ac-4cc7-8dc2-ac0b37adbe10_583x654.png 1272w, 
https://substackcdn.com/image/fetch/$s_!HSl5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90b781c5-f4ac-4cc7-8dc2-ac0b37adbe10_583x654.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HSl5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90b781c5-f4ac-4cc7-8dc2-ac0b37adbe10_583x654.png" width="583" height="654" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/90b781c5-f4ac-4cc7-8dc2-ac0b37adbe10_583x654.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:654,&quot;width&quot;:583,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:79967,&quot;alt&quot;:&quot; Logan Kilpatrick @OfficialLoganK Native image generation with Gemini 2.0 Flash is now available to all developers via an experimental release in the Gemini API and Google AI Studio!!  The chat based image editing and creation is so much fun to play with &quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/OfficialLoganK/status/1899853465922175427&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90b781c5-f4ac-4cc7-8dc2-ac0b37adbe10_583x654.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt=" Logan Kilpatrick @OfficialLoganK Native image generation with Gemini 2.0 Flash is now available to all developers via an experimental release in the Gemini API and Google AI Studio!!  
The chat based image editing and creation is so much fun to play with " title=" Logan Kilpatrick @OfficialLoganK Native image generation with Gemini 2.0 Flash is now available to all developers via an experimental release in the Gemini API and Google AI Studio!!  The chat based image editing and creation is so much fun to play with " srcset="https://substackcdn.com/image/fetch/$s_!HSl5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90b781c5-f4ac-4cc7-8dc2-ac0b37adbe10_583x654.png 424w, https://substackcdn.com/image/fetch/$s_!HSl5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90b781c5-f4ac-4cc7-8dc2-ac0b37adbe10_583x654.png 848w, https://substackcdn.com/image/fetch/$s_!HSl5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90b781c5-f4ac-4cc7-8dc2-ac0b37adbe10_583x654.png 1272w, https://substackcdn.com/image/fetch/$s_!HSl5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90b781c5-f4ac-4cc7-8dc2-ac0b37adbe10_583x654.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 
12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <strong><a href="https://x.com/OfficialLoganK/status/1899853465922175427">X</a></strong></figcaption></figure></div><p>Gemini can now create multi-step illustrated stories from a single prompt, edit existing images directly, rework uploaded images, and more.</p><p>The best part?</p><p>It&#8217;s 100% free to try.</p><h2>How do you use it?</h2><p>The easiest way to try the new model is via <a href="https://aistudio.google.com/">Google AI Studio</a>.</p><p>Here&#8217;s the step-by-step process:</p><ol><li><p>Go to <a href="https://aistudio.google.com/">aistudio.google.com</a> and log in with your Google account.</p></li><li><p>Select &#8220;Gemini 2.0 Flash Experimental&#8221; from the model picker. Note: You want the <strong>gemini-2.0-flash-exp </strong>model, not the default Gemini 2.0 Flash. 
(I know, I know.)</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FvoU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe81a0620-8ea7-4b86-888c-800a616b10d0_771x526.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FvoU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe81a0620-8ea7-4b86-888c-800a616b10d0_771x526.png 424w, https://substackcdn.com/image/fetch/$s_!FvoU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe81a0620-8ea7-4b86-888c-800a616b10d0_771x526.png 848w, https://substackcdn.com/image/fetch/$s_!FvoU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe81a0620-8ea7-4b86-888c-800a616b10d0_771x526.png 1272w, https://substackcdn.com/image/fetch/$s_!FvoU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe81a0620-8ea7-4b86-888c-800a616b10d0_771x526.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FvoU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe81a0620-8ea7-4b86-888c-800a616b10d0_771x526.png" width="771" height="526" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e81a0620-8ea7-4b86-888c-800a616b10d0_771x526.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:526,&quot;width&quot;:771,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:109546,&quot;alt&quot;:&quot;Selecting Gemini 
2.0 Flash Experimental in Google AI Studio model picker&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0517e73-cbfe-429b-ae0d-8d5a34358104_771x526.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Selecting Gemini 2.0 Flash Experimental in Google AI Studio model picker" title="Selecting Gemini 2.0 Flash Experimental in Google AI Studio model picker" srcset="https://substackcdn.com/image/fetch/$s_!FvoU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe81a0620-8ea7-4b86-888c-800a616b10d0_771x526.png 424w, https://substackcdn.com/image/fetch/$s_!FvoU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe81a0620-8ea7-4b86-888c-800a616b10d0_771x526.png 848w, https://substackcdn.com/image/fetch/$s_!FvoU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe81a0620-8ea7-4b86-888c-800a616b10d0_771x526.png 1272w, https://substackcdn.com/image/fetch/$s_!FvoU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe81a0620-8ea7-4b86-888c-800a616b10d0_771x526.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p>Type your request into the prompt field at the bottom.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ir9A!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ee7faad-c39b-4ba4-953a-aacd4abcf3b6_555x55.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ir9A!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ee7faad-c39b-4ba4-953a-aacd4abcf3b6_555x55.png 424w, https://substackcdn.com/image/fetch/$s_!ir9A!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ee7faad-c39b-4ba4-953a-aacd4abcf3b6_555x55.png 848w, 
https://substackcdn.com/image/fetch/$s_!ir9A!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ee7faad-c39b-4ba4-953a-aacd4abcf3b6_555x55.png 1272w, https://substackcdn.com/image/fetch/$s_!ir9A!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ee7faad-c39b-4ba4-953a-aacd4abcf3b6_555x55.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ir9A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ee7faad-c39b-4ba4-953a-aacd4abcf3b6_555x55.png" width="555" height="55" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6ee7faad-c39b-4ba4-953a-aacd4abcf3b6_555x55.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:55,&quot;width&quot;:555,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:8790,&quot;alt&quot;:&quot;Gemini prompt: \&quot;Draw a cinematic movie still of a monkey on rollerskates\&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ee7faad-c39b-4ba4-953a-aacd4abcf3b6_555x55.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Gemini prompt: &quot;Draw a cinematic movie still of a monkey on rollerskates&quot;" title="Gemini prompt: &quot;Draw a cinematic movie still of a monkey on rollerskates&quot;" 
srcset="https://substackcdn.com/image/fetch/$s_!ir9A!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ee7faad-c39b-4ba4-953a-aacd4abcf3b6_555x55.png 424w, https://substackcdn.com/image/fetch/$s_!ir9A!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ee7faad-c39b-4ba4-953a-aacd4abcf3b6_555x55.png 848w, https://substackcdn.com/image/fetch/$s_!ir9A!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ee7faad-c39b-4ba4-953a-aacd4abcf3b6_555x55.png 1272w, https://substackcdn.com/image/fetch/$s_!ir9A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ee7faad-c39b-4ba4-953a-aacd4abcf3b6_555x55.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div></li><li><p>Enjoy your results.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!73eL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F622621ed-ddf2-4b0c-be8c-4cb265835f75_1297x413.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!73eL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F622621ed-ddf2-4b0c-be8c-4cb265835f75_1297x413.png 424w, https://substackcdn.com/image/fetch/$s_!73eL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F622621ed-ddf2-4b0c-be8c-4cb265835f75_1297x413.png 848w, 
https://substackcdn.com/image/fetch/$s_!73eL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F622621ed-ddf2-4b0c-be8c-4cb265835f75_1297x413.png 1272w, https://substackcdn.com/image/fetch/$s_!73eL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F622621ed-ddf2-4b0c-be8c-4cb265835f75_1297x413.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!73eL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F622621ed-ddf2-4b0c-be8c-4cb265835f75_1297x413.png" width="1297" height="413" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/622621ed-ddf2-4b0c-be8c-4cb265835f75_1297x413.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:413,&quot;width&quot;:1297,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:265926,&quot;alt&quot;:&quot;Gemini image result for \&quot;monkey on rollerskates\&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F622621ed-ddf2-4b0c-be8c-4cb265835f75_1297x413.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Gemini image result for &quot;monkey on rollerskates&quot;" title="Gemini image result for &quot;monkey on rollerskates&quot;" srcset="https://substackcdn.com/image/fetch/$s_!73eL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F622621ed-ddf2-4b0c-be8c-4cb265835f75_1297x413.png 424w, 
https://substackcdn.com/image/fetch/$s_!73eL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F622621ed-ddf2-4b0c-be8c-4cb265835f75_1297x413.png 848w, https://substackcdn.com/image/fetch/$s_!73eL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F622621ed-ddf2-4b0c-be8c-4cb265835f75_1297x413.png 1272w, https://substackcdn.com/image/fetch/$s_!73eL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F622621ed-ddf2-4b0c-be8c-4cb265835f75_1297x413.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line 
x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li></ol><p>Now, if you look closely at the image, you&#8217;ll notice that the output quality is underwhelming, to say the least.</p><p>You&#8217;re not alone:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!paDy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13410a56-71c0-426a-92be-fda2c1ea2924_392x106.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!paDy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13410a56-71c0-426a-92be-fda2c1ea2924_392x106.png 424w, https://substackcdn.com/image/fetch/$s_!paDy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13410a56-71c0-426a-92be-fda2c1ea2924_392x106.png 848w, https://substackcdn.com/image/fetch/$s_!paDy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13410a56-71c0-426a-92be-fda2c1ea2924_392x106.png 1272w, https://substackcdn.com/image/fetch/$s_!paDy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13410a56-71c0-426a-92be-fda2c1ea2924_392x106.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!paDy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13410a56-71c0-426a-92be-fda2c1ea2924_392x106.png" width="392" height="106" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/13410a56-71c0-426a-92be-fda2c1ea2924_392x106.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:106,&quot;width&quot;:392,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:10296,&quot;alt&quot;:&quot;PollinosisQc &#8226; 13h ago It's really bad lol&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13410a56-71c0-426a-92be-fda2c1ea2924_392x106.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="PollinosisQc &#8226; 13h ago It's really bad lol" title="PollinosisQc &#8226; 13h ago It's really bad lol" srcset="https://substackcdn.com/image/fetch/$s_!paDy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13410a56-71c0-426a-92be-fda2c1ea2924_392x106.png 424w, https://substackcdn.com/image/fetch/$s_!paDy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13410a56-71c0-426a-92be-fda2c1ea2924_392x106.png 848w, https://substackcdn.com/image/fetch/$s_!paDy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13410a56-71c0-426a-92be-fda2c1ea2924_392x106.png 1272w, https://substackcdn.com/image/fetch/$s_!paDy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13410a56-71c0-426a-92be-fda2c1ea2924_392x106.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Source: <strong><a 
href="https://www.reddit.com/r/singularity/comments/1j9npxd/comment/mhh8no4/?utm_source=share&amp;utm_medium=web3x&amp;utm_name=web3xcss&amp;utm_term=1&amp;utm_content=share_button">Reddit</a></strong></figcaption></figure></div><p>Far from it:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!L3kH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef46cd2-f7f1-48ca-9c02-f5ff509bb0d4_707x159.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!L3kH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef46cd2-f7f1-48ca-9c02-f5ff509bb0d4_707x159.png 424w, https://substackcdn.com/image/fetch/$s_!L3kH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef46cd2-f7f1-48ca-9c02-f5ff509bb0d4_707x159.png 848w, https://substackcdn.com/image/fetch/$s_!L3kH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef46cd2-f7f1-48ca-9c02-f5ff509bb0d4_707x159.png 1272w, https://substackcdn.com/image/fetch/$s_!L3kH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef46cd2-f7f1-48ca-9c02-f5ff509bb0d4_707x159.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!L3kH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef46cd2-f7f1-48ca-9c02-f5ff509bb0d4_707x159.png" width="707" height="159" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eef46cd2-f7f1-48ca-9c02-f5ff509bb0d4_707x159.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:159,&quot;width&quot;:707,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:20890,&quot;alt&quot;:&quot;UltraBabyVegeta &#8226; 21h ago It&#8217;s ok cool but it seems to be an extremely tiny model as it&#8217;s extremely fast and nowhere near as good at quality as Imagen is.  I kind of thought it would just be Imagen with the ability to edit its own output&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef46cd2-f7f1-48ca-9c02-f5ff509bb0d4_707x159.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="UltraBabyVegeta &#8226; 21h ago It&#8217;s ok cool but it seems to be an extremely tiny model as it&#8217;s extremely fast and nowhere near as good at quality as Imagen is.  I kind of thought it would just be Imagen with the ability to edit its own output" title="UltraBabyVegeta &#8226; 21h ago It&#8217;s ok cool but it seems to be an extremely tiny model as it&#8217;s extremely fast and nowhere near as good at quality as Imagen is.  
I kind of thought it would just be Imagen with the ability to edit its own output" srcset="https://substackcdn.com/image/fetch/$s_!L3kH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef46cd2-f7f1-48ca-9c02-f5ff509bb0d4_707x159.png 424w, https://substackcdn.com/image/fetch/$s_!L3kH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef46cd2-f7f1-48ca-9c02-f5ff509bb0d4_707x159.png 848w, https://substackcdn.com/image/fetch/$s_!L3kH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef46cd2-f7f1-48ca-9c02-f5ff509bb0d4_707x159.png 1272w, https://substackcdn.com/image/fetch/$s_!L3kH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef46cd2-f7f1-48ca-9c02-f5ff509bb0d4_707x159.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Source: <strong><a href="https://www.reddit.com/r/Bard/comments/1j9mihd/comment/mhejt62/?utm_source=share&amp;utm_medium=web3x&amp;utm_name=web3xcss&amp;utm_term=1&amp;utm_content=share_button">Reddit</a></strong></figcaption></figure></div><p>In a world of <a href="https://www.whytryai.com/p/ai-image-model-spelling-text">so many impressive image models</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>, Gemini 2.0 Flash&#8217;s image quality is <em>way</em> behind the curve.</p><p>But focusing on that misses the real game-changer: It&#8217;s <em>the same model</em> handling everything under the hood, covering text, image understanding, and image generation.</p><p>Let&#8217;s unpack why that&#8217;s a big deal.</p><p class="button-wrapper" 
data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><h2>Why should you care?</h2><p>Because you can now <em>finally hide the elephant!</em></p><p>Bear with me, it&#8217;ll all make sense in a moment.</p><p>You see, for years now, publicly available AI models have been mostly siloed.</p><p>You&#8217;d have one model for text generation, a separate model to create images, and a third one for converting speech into text and back again.</p><p>When you ask for an image in, say, ChatGPT, here&#8217;s what happens behind the scenes<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>:</p><ol><li><p>The language model (e.g. GPT-4o) turns your request into a text-to-image prompt.</p></li><li><p>GPT-4o sends this prompt to OpenAI&#8217;s image model: <a href="https://www.whytryai.com/p/dall-e-3-better-captions-research-paper-summary">DALL-E 3</a>.</p></li><li><p>DALL-E 3 generates the image based on the prompt from GPT-4o.</p></li><li><p>GPT-4o replies to your request in the chat and attaches the DALL-E 3 image.</p></li></ol><p>This disconnect is the real reason behind the hilarious <a href="https://garymarcus.substack.com/p/wheres-waldo-the-elephant-in-the">&#8220;hide the elephant&#8221; exchange</a> mocked by <span class="mention-wrap" data-attrs="{&quot;name&quot;:&quot;Gary 
Marcus&quot;,&quot;id&quot;:14807526,&quot;type&quot;:&quot;user&quot;,&quot;url&quot;:null,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F8fb2e48c-be2a-4db7-b68c-90300f00fd1e_1668x1456.jpeg&quot;,&quot;uuid&quot;:&quot;c475a866-02bd-4e48-8dc9-11ebc1d84bfd&quot;}" data-component-name="MentionToDOM"></span>:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jZfo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d2519d1-d806-41b7-a9df-7e67f3753d99_1169x1509.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jZfo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d2519d1-d806-41b7-a9df-7e67f3753d99_1169x1509.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jZfo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d2519d1-d806-41b7-a9df-7e67f3753d99_1169x1509.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jZfo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d2519d1-d806-41b7-a9df-7e67f3753d99_1169x1509.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jZfo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d2519d1-d806-41b7-a9df-7e67f3753d99_1169x1509.jpeg 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!jZfo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d2519d1-d806-41b7-a9df-7e67f3753d99_1169x1509.jpeg" width="500" height="645.4234388366125" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1d2519d1-d806-41b7-a9df-7e67f3753d99_1169x1509.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1509,&quot;width&quot;:1169,&quot;resizeWidth&quot;:500,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Colin Fraser on Twitter: \&quot;Generate an image of a scene at a beach.\&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Colin Fraser on Twitter: &quot;Generate an image of a scene at a beach.&quot;" title="Colin Fraser on Twitter: &quot;Generate an image of a scene at a beach.&quot;" srcset="https://substackcdn.com/image/fetch/$s_!jZfo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d2519d1-d806-41b7-a9df-7e67f3753d99_1169x1509.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jZfo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d2519d1-d806-41b7-a9df-7e67f3753d99_1169x1509.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jZfo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d2519d1-d806-41b7-a9df-7e67f3753d99_1169x1509.jpeg 1272w, 
https://substackcdn.com/image/fetch/$s_!jZfo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d2519d1-d806-41b7-a9df-7e67f3753d99_1169x1509.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#8230;and then&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VUa3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74b2de2c-e498-4a08-99f2-78b5e34578c5_941x855.jpeg" 
data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VUa3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74b2de2c-e498-4a08-99f2-78b5e34578c5_941x855.jpeg 424w, https://substackcdn.com/image/fetch/$s_!VUa3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74b2de2c-e498-4a08-99f2-78b5e34578c5_941x855.jpeg 848w, https://substackcdn.com/image/fetch/$s_!VUa3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74b2de2c-e498-4a08-99f2-78b5e34578c5_941x855.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!VUa3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74b2de2c-e498-4a08-99f2-78b5e34578c5_941x855.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VUa3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74b2de2c-e498-4a08-99f2-78b5e34578c5_941x855.jpeg" width="500" height="454.3039319872476" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/74b2de2c-e498-4a08-99f2-78b5e34578c5_941x855.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:855,&quot;width&quot;:941,&quot;resizeWidth&quot;:500,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;\&quot;Can you make the elephant even more hidden\&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="&quot;Can you 
make the elephant even more hidden&quot;" title="&quot;Can you make the elephant even more hidden&quot;" srcset="https://substackcdn.com/image/fetch/$s_!VUa3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74b2de2c-e498-4a08-99f2-78b5e34578c5_941x855.jpeg 424w, https://substackcdn.com/image/fetch/$s_!VUa3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74b2de2c-e498-4a08-99f2-78b5e34578c5_941x855.jpeg 848w, https://substackcdn.com/image/fetch/$s_!VUa3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74b2de2c-e498-4a08-99f2-78b5e34578c5_941x855.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!VUa3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74b2de2c-e498-4a08-99f2-78b5e34578c5_941x855.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg 
xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The problem here isn&#8217;t that GPT-4o doesn&#8217;t know what the user wants.</p><p>It&#8217;s that when GPT-4o explicitly tells DALL-E 3 to hide the elephant, DALL-E 3 hears &#8220;elephant&#8221; and <em>adds it to the image instead</em>. Image models don&#8217;t do well with negative instructions, which is why <a href="https://www.whytryai.com/p/midjourney-negative-prompt">special &#8220;negative prompt&#8221; fields</a> exist in the first place.</p><p>Now watch this:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lZZN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F027e19ad-6c5a-40f7-b73b-62d2d681a895_358x807.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lZZN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F027e19ad-6c5a-40f7-b73b-62d2d681a895_358x807.png 424w, https://substackcdn.com/image/fetch/$s_!lZZN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F027e19ad-6c5a-40f7-b73b-62d2d681a895_358x807.png 848w, 
https://substackcdn.com/image/fetch/$s_!lZZN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F027e19ad-6c5a-40f7-b73b-62d2d681a895_358x807.png 1272w, https://substackcdn.com/image/fetch/$s_!lZZN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F027e19ad-6c5a-40f7-b73b-62d2d681a895_358x807.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lZZN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F027e19ad-6c5a-40f7-b73b-62d2d681a895_358x807.png" width="358" height="807" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/027e19ad-6c5a-40f7-b73b-62d2d681a895_358x807.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:807,&quot;width&quot;:358,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:469731,&quot;alt&quot;:&quot;\&quot;Create a child's drawing of a zoo with a lion, elephant, and a giraffe\&quot; \&quot;Now remove the elephant\&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F027e19ad-6c5a-40f7-b73b-62d2d681a895_358x807.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="&quot;Create a child's drawing of a zoo with a lion, elephant, and a giraffe&quot; &quot;Now remove the elephant&quot;" title="&quot;Create a child's drawing of a zoo with a lion, elephant, and a giraffe&quot; &quot;Now remove the elephant&quot;" 
srcset="https://substackcdn.com/image/fetch/$s_!lZZN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F027e19ad-6c5a-40f7-b73b-62d2d681a895_358x807.png 424w, https://substackcdn.com/image/fetch/$s_!lZZN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F027e19ad-6c5a-40f7-b73b-62d2d681a895_358x807.png 848w, https://substackcdn.com/image/fetch/$s_!lZZN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F027e19ad-6c5a-40f7-b73b-62d2d681a895_358x807.png 1272w, https://substackcdn.com/image/fetch/$s_!lZZN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F027e19ad-6c5a-40f7-b73b-62d2d681a895_358x807.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" 
stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">POOF!</figcaption></figure></div><p>Gemini 2.0 Flash handles the task like a champ&#8212;precisely because it combines text understanding, image understanding, and image generation under one umbrella.</p><p>Thanks to this, Gemini is also able to keep the rest of the image intact <em>exactly as is</em>!</p><p>For comparison, requesting even minor changes in ChatGPT will generate a new, somewhat similar image<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!K-6_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f888f4a-6b22-4137-b719-7da6fac0cd3a_496x725.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!K-6_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f888f4a-6b22-4137-b719-7da6fac0cd3a_496x725.png 424w, https://substackcdn.com/image/fetch/$s_!K-6_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f888f4a-6b22-4137-b719-7da6fac0cd3a_496x725.png 848w, https://substackcdn.com/image/fetch/$s_!K-6_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f888f4a-6b22-4137-b719-7da6fac0cd3a_496x725.png 1272w, 
https://substackcdn.com/image/fetch/$s_!K-6_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f888f4a-6b22-4137-b719-7da6fac0cd3a_496x725.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!K-6_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f888f4a-6b22-4137-b719-7da6fac0cd3a_496x725.png" width="496" height="725" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8f888f4a-6b22-4137-b719-7da6fac0cd3a_496x725.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:725,&quot;width&quot;:496,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:288491,&quot;alt&quot;:&quot;Draw a cute cat ChatGPT said:  Here's a cute, fluffy kitten for you! Let me know if you&#8217;d like any modifications. &#128049;&#128149;  You said: Keep the image the same but give the kitten bright blue eyes  ChatGPT said:  Here&#8217;s your adorable kitten with bright blue eyes! Let me know if you want any further tweaks. &#128049;&#128153;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f888f4a-6b22-4137-b719-7da6fac0cd3a_496x725.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Draw a cute cat ChatGPT said:  Here's a cute, fluffy kitten for you! Let me know if you&#8217;d like any modifications. &#128049;&#128149;  You said: Keep the image the same but give the kitten bright blue eyes  ChatGPT said:  Here&#8217;s your adorable kitten with bright blue eyes! Let me know if you want any further tweaks. 
&#128049;&#128153;" title="Draw a cute cat ChatGPT said:  Here's a cute, fluffy kitten for you! Let me know if you&#8217;d like any modifications. &#128049;&#128149;  You said: Keep the image the same but give the kitten bright blue eyes  ChatGPT said:  Here&#8217;s your adorable kitten with bright blue eyes! Let me know if you want any further tweaks. &#128049;&#128153;" srcset="https://substackcdn.com/image/fetch/$s_!K-6_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f888f4a-6b22-4137-b719-7da6fac0cd3a_496x725.png 424w, https://substackcdn.com/image/fetch/$s_!K-6_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f888f4a-6b22-4137-b719-7da6fac0cd3a_496x725.png 848w, https://substackcdn.com/image/fetch/$s_!K-6_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f888f4a-6b22-4137-b719-7da6fac0cd3a_496x725.png 1272w, https://substackcdn.com/image/fetch/$s_!K-6_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f888f4a-6b22-4137-b719-7da6fac0cd3a_496x725.png 1456w" sizes="100vw" loading="lazy"></picture>
</div></a><figcaption class="image-caption">Liar! You thought you could just swap the kitten without me noticing?!</figcaption></figure></div><p>This true multimodality opens up a whole range of possibilities, such as combining objects across images&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zSgl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20b3225-3b4c-4258-94c5-89ae67eed167_588x838.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zSgl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20b3225-3b4c-4258-94c5-89ae67eed167_588x838.png 424w, https://substackcdn.com/image/fetch/$s_!zSgl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20b3225-3b4c-4258-94c5-89ae67eed167_588x838.png 848w,
https://substackcdn.com/image/fetch/$s_!zSgl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20b3225-3b4c-4258-94c5-89ae67eed167_588x838.png 1272w, https://substackcdn.com/image/fetch/$s_!zSgl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20b3225-3b4c-4258-94c5-89ae67eed167_588x838.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zSgl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20b3225-3b4c-4258-94c5-89ae67eed167_588x838.png" width="588" height="838" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a20b3225-3b4c-4258-94c5-89ae67eed167_588x838.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:838,&quot;width&quot;:588,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:554454,&quot;alt&quot;:&quot;Add a cartoon version of this monkey to the other image between the lion and the giraffe. Keep the other image the same.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20b3225-3b4c-4258-94c5-89ae67eed167_588x838.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Add a cartoon version of this monkey to the other image between the lion and the giraffe. Keep the other image the same." title="Add a cartoon version of this monkey to the other image between the lion and the giraffe. Keep the other image the same." 
srcset="https://substackcdn.com/image/fetch/$s_!zSgl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20b3225-3b4c-4258-94c5-89ae67eed167_588x838.png 424w, https://substackcdn.com/image/fetch/$s_!zSgl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20b3225-3b4c-4258-94c5-89ae67eed167_588x838.png 848w, https://substackcdn.com/image/fetch/$s_!zSgl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20b3225-3b4c-4258-94c5-89ae67eed167_588x838.png 1272w, https://substackcdn.com/image/fetch/$s_!zSgl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20b3225-3b4c-4258-94c5-89ae67eed167_588x838.png 1456w" sizes="100vw" loading="lazy"></picture>
</div></a></figure></div><p>&#8230;adding custom text into precisely defined locations&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8nsE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52eac3dd-05ae-4562-a18e-04349aae7a13_1259x412.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8nsE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52eac3dd-05ae-4562-a18e-04349aae7a13_1259x412.png 424w, https://substackcdn.com/image/fetch/$s_!8nsE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52eac3dd-05ae-4562-a18e-04349aae7a13_1259x412.png 848w, https://substackcdn.com/image/fetch/$s_!8nsE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52eac3dd-05ae-4562-a18e-04349aae7a13_1259x412.png 1272w, https://substackcdn.com/image/fetch/$s_!8nsE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52eac3dd-05ae-4562-a18e-04349aae7a13_1259x412.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8nsE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52eac3dd-05ae-4562-a18e-04349aae7a13_1259x412.png" width="1259"
height="412" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/52eac3dd-05ae-4562-a18e-04349aae7a13_1259x412.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:412,&quot;width&quot;:1259,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:269636,&quot;alt&quot;:&quot;Add an awkwardly scribbled purple text in the bottom-right that says \&quot;For Mommy!\&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52eac3dd-05ae-4562-a18e-04349aae7a13_1259x412.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Add an awkwardly scribbled purple text in the bottom-right that says &quot;For Mommy!&quot;" title="Add an awkwardly scribbled purple text in the bottom-right that says &quot;For Mommy!&quot;" srcset="https://substackcdn.com/image/fetch/$s_!8nsE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52eac3dd-05ae-4562-a18e-04349aae7a13_1259x412.png 424w, https://substackcdn.com/image/fetch/$s_!8nsE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52eac3dd-05ae-4562-a18e-04349aae7a13_1259x412.png 848w, https://substackcdn.com/image/fetch/$s_!8nsE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52eac3dd-05ae-4562-a18e-04349aae7a13_1259x412.png 1272w, 
https://substackcdn.com/image/fetch/$s_!8nsE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52eac3dd-05ae-4562-a18e-04349aae7a13_1259x412.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>&#8230;manipulating characters in an image&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank"
href="https://substackcdn.com/image/fetch/$s_!TUjn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49ca7d45-e80a-401c-be1e-19f527cbf4ca_1260x386.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TUjn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49ca7d45-e80a-401c-be1e-19f527cbf4ca_1260x386.png 424w, https://substackcdn.com/image/fetch/$s_!TUjn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49ca7d45-e80a-401c-be1e-19f527cbf4ca_1260x386.png 848w, https://substackcdn.com/image/fetch/$s_!TUjn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49ca7d45-e80a-401c-be1e-19f527cbf4ca_1260x386.png 1272w, https://substackcdn.com/image/fetch/$s_!TUjn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49ca7d45-e80a-401c-be1e-19f527cbf4ca_1260x386.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TUjn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49ca7d45-e80a-401c-be1e-19f527cbf4ca_1260x386.png" width="1260" height="386" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/49ca7d45-e80a-401c-be1e-19f527cbf4ca_1260x386.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:386,&quot;width&quot;:1260,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:261011,&quot;alt&quot;:&quot;Turn the giraffe and make it look into the camera. 
The monkey should lift its arms up into the air.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49ca7d45-e80a-401c-be1e-19f527cbf4ca_1260x386.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Turn the giraffe and make it look into the camera. The monkey should lift its arms up into the air." title="Turn the giraffe and make it look into the camera. The monkey should lift its arms up into the air." srcset="https://substackcdn.com/image/fetch/$s_!TUjn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49ca7d45-e80a-401c-be1e-19f527cbf4ca_1260x386.png 424w, https://substackcdn.com/image/fetch/$s_!TUjn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49ca7d45-e80a-401c-be1e-19f527cbf4ca_1260x386.png 848w, https://substackcdn.com/image/fetch/$s_!TUjn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49ca7d45-e80a-401c-be1e-19f527cbf4ca_1260x386.png 1272w, https://substackcdn.com/image/fetch/$s_!TUjn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49ca7d45-e80a-401c-be1e-19f527cbf4ca_1260x386.png 1456w" sizes="100vw" loading="lazy"></picture>
</div></a></figure></div><p>&#8230;and more.</p><p>Go ahead: Take Gemini 2.0 Flash for a spin and explore what it&#8217;s capable of!</p><h2>Are we entering a new multimodal era?</h2><p>Want to hear the crazy part?</p><p>On paper, the Gemini family has been natively multimodal since it was first announced <em>one-and-a-half years ago</em>.</p><p>Here&#8217;s a quote from <a href="https://www.whytryai.com/i/139385812/google-gemini-is-finally-herein-spirit">my December 2023 round-up</a>:</p><blockquote><p><em>&#8230;Gemini is natively multimodal. This means that unlike GPT-4, which is trained purely on text and gets its multimodality from add-on modules, Gemini is trained on different modalities from the start.
This should make it far more capable of switching effortlessly between many types of input and output.</em></p></blockquote><p>As such, Gemini was likely capable of these feats all along.</p><p>However, AI labs were initially hesitant to unlock full multimodality for general audiences.</p><p>Things started to change last year when OpenAI <a href="https://www.whytryai.com/p/sunday-rundown-62-realtime-voice?open=false#%C2%A7ai-releases:~:text=The%20long%2Dawaited%20Advanced%20Voice%20Mode%20is%20rolling%20out%20to%20select%20ChatGPT%20Plus%20users.%20(Check%20out%20these%20hands%2Don%20tests%20by%20Ethan%20Mollick.)">rolled out &#8220;Advanced Voice Mode&#8221;</a> to ChatGPT users. This mode doesn&#8217;t rely on text-to-speech / speech-to-text conversion to enable voice conversations. It natively understands what you&#8217;re saying and can respond in kind.</p><p>Now, Google is giving us multimodal image generation, too.</p><p>If I were a betting man, I&#8217;d say we&#8217;re about to see OpenAI follow suit.
We already know that GPT-4o can do the same stuff:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RY67!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RY67!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 424w, https://substackcdn.com/image/fetch/$s_!RY67!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 848w, https://substackcdn.com/image/fetch/$s_!RY67!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 1272w, https://substackcdn.com/image/fetch/$s_!RY67!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RY67!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png" width="571" height="761" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:761,&quot;width&quot;:571,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:245119,&quot;alt&quot;:&quot;Input A first person view of a robot typewriting the following journal entries:  1. yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?  the text is large, legible and clear. the robot's hands type on the typewriter.  2 Output Robot on typewriter 3 Input The robot wrote the second entry. The page is now taller. The page has moved up. There are two entries on the sheet:  yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?  sound update just dropped, and it's wild. everything's got a vibe now, every sound's like a new secret. makes you think, what else am i missing?&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Input A first person view of a robot typewriting the following journal entries:  1. yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?  the text is large, legible and clear. the robot's hands type on the typewriter.  2 Output Robot on typewriter 3 Input The robot wrote the second entry. The page is now taller. The page has moved up. 
There are two entries on the sheet:  yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?  sound update just dropped, and it's wild. everything's got a vibe now, every sound's like a new secret. makes you think, what else am i missing?" title="Input A first person view of a robot typewriting the following journal entries:  1. yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?  the text is large, legible and clear. the robot's hands type on the typewriter.  2 Output Robot on typewriter 3 Input The robot wrote the second entry. The page is now taller. The page has moved up. There are two entries on the sheet:  yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?  sound update just dropped, and it's wild. everything's got a vibe now, every sound's like a new secret. makes you think, what else am i missing?" 
srcset="https://substackcdn.com/image/fetch/$s_!RY67!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 424w, https://substackcdn.com/image/fetch/$s_!RY67!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 848w, https://substackcdn.com/image/fetch/$s_!RY67!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 1272w, https://substackcdn.com/image/fetch/$s_!RY67!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d649573-82d2-4d21-9b29-792d714f52e1_571x761.png 1456w" sizes="100vw" loading="lazy"></picture>
</div></a><figcaption class="image-caption">Source: <strong>GPT-4o <a href="https://openai.com/index/hello-gpt-4o/#:~:text=Explorations%20of%20capabilities">announcement post</a></strong>.</figcaption></figure></div><p>After all, the &#8220;o&#8221; in GPT-4o stands for &#8220;omni&#8221; or &#8220;omnimodal.&#8221;</p><p>It&#8217;s just that most of us weren&#8217;t given access to all of the modalities yet.</p><p>In a <a href="https://www.reddit.com/r/OpenAI/comments/1ieonxv/comment/ma9udu9/?utm_source=share&amp;utm_medium=web3x&amp;utm_name=web3xcss&amp;utm_term=1&amp;utm_content=share_button">recent Reddit AMA</a>, OpenAI&#8217;s Chief Product Officer Kevin Weil confirmed that multimodal image generation was coming:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OLWM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OLWM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 424w, https://substackcdn.com/image/fetch/$s_!OLWM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 848w,
https://substackcdn.com/image/fetch/$s_!OLWM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 1272w, https://substackcdn.com/image/fetch/$s_!OLWM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OLWM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png" width="512" height="384" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/324ee040-7c63-4324-8582-c75822f02068_512x384.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:384,&quot;width&quot;:512,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:73834,&quot;alt&quot;:&quot;Are you still planning to roll out the 4o image generator?  Comment Image   Upvote 383  Downvote  Award  Share Share  u/kevinweil avatar kevinweil CO-HOST &#8226; 1mo ago OpenAI CPO Kevin Weil  emoji:OpenAIWhite:  | Verified  emoji:checkmark: Yes! We're working on it. And I think it's going to be worth the wait.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/158977694?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Are you still planning to roll out the 4o image generator?  
Comment Image   Upvote 383  Downvote  Award  Share Share  u/kevinweil avatar kevinweil CO-HOST &#8226; 1mo ago OpenAI CPO Kevin Weil  emoji:OpenAIWhite:  | Verified  emoji:checkmark: Yes! We're working on it. And I think it's going to be worth the wait." title="Are you still planning to roll out the 4o image generator?  Comment Image   Upvote 383  Downvote  Award  Share Share  u/kevinweil avatar kevinweil CO-HOST &#8226; 1mo ago OpenAI CPO Kevin Weil  emoji:OpenAIWhite:  | Verified  emoji:checkmark: Yes! We're working on it. And I think it's going to be worth the wait." srcset="https://substackcdn.com/image/fetch/$s_!OLWM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 424w, https://substackcdn.com/image/fetch/$s_!OLWM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 848w, https://substackcdn.com/image/fetch/$s_!OLWM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 1272w, https://substackcdn.com/image/fetch/$s_!OLWM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324ee040-7c63-4324-8582-c75822f02068_512x384.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Source: <strong><a href="https://www.reddit.com/r/OpenAI/comments/1ieonxv/comment/ma9udu9/?utm_source=share&amp;utm_medium=web3x&amp;utm_name=web3xcss&amp;utm_term=1&amp;utm_content=share_button">Reddit</a></strong>.</figcaption></figure></div><p>Now that Google&#8217;s version is out, the pressure is on OpenAI to catch up.</p><p>The landscape is changing fast.</p><p>We may soon wave goodbye to the era of separate features stitched into unholy amalgams. 
Instead, we&#8217;ll have truly omnimodal models handling everything on their own.</p><p>So yes: You can choose to focus on how Gemini&#8217;s current image quality is nothing to write home about.</p><p>But if you do, you&#8217;ll miss the much bigger shift unfolding right under our noses.</p><h2>&#129781; Over to you&#8230;</h2><p>Have you already tried Gemini 2.0 Flash for image generation? Did you discover any awesome use cases that I haven&#8217;t covered above? 
I&#8217;d love to hear what you think!</p><p>Leave a comment or drop me a line at <a href="mailto:whytryai@substack.com">whytryai@substack.com</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/p/gemini-2-0-flash-native-image-generation/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/p/gemini-2-0-flash-native-image-generation/comments"><span>Leave a comment</span></a></p><div><hr></div><h2>Thanks for reading!</h2><p>If you enjoy my writing, here&#8217;s how you can help:</p><ul><li><p>&#10084;&#65039;<strong>Like</strong> this post if it resonates with you.</p></li><li><p>&#128279;<strong>Share</strong> it to help others discover this newsletter.</p></li><li><p>&#128489; <strong>Comment</strong> below&#8212;I love hearing your opinions.</p></li></ul><p><strong>Why Try AI</strong> is a passion project, and I&#8217;m grateful to those who help keep it going. 
If you&#8217;d like to support my work and <strong><a href="https://www.whytryai.com/p/paid-subscriber-bonuses">unlock cool perks</a></strong>, consider a paid subscription:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Including Google&#8217;s own, <a href="https://www.whytryai.com/p/my-go-to-ai-tools#:~:text=Imagen%203%20(via%20Google%20Labs)">excellent Imagen 3</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>I explored this in more detail in the <a href="https://www.whytryai.com/p/text-in-ai-images-workshop">&#8220;Text In AI Images&#8221; workshop</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Although I&#8217;ve shown how you can <a href="https://www.whytryai.com/p/10x-ai-27-openai-devday-zapier-ai-actions?open=false#%C2%A7make-adjustments-to-the-same-image-in-dall-e">work around this</a>.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Claude 3.7 Sonnet: Fantastic Model Held Back by Lack of Native Internet Access]]></title><description><![CDATA[The new hybrid model from Anthropic could really benefit from web 
browsing.]]></description><link>https://www.whytryai.com/p/claude-3-7-sonnet-internet-access</link><guid isPermaLink="false">https://www.whytryai.com/p/claude-3-7-sonnet-internet-access</guid><dc:creator><![CDATA[Daniel Nest]]></dc:creator><pubDate>Tue, 25 Feb 2025 09:48:36 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/ce8d34cb-65c5-49cb-bd98-d98f90be25ba_1280x896.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>TL;DR</h2><p>Anthropic just launched the impressive Claude 3.7 Sonnet, but it&#8217;s still locked inside the old interface without built-in access to the web.</p><h2>What is it?</h2><p>Claude 3.7 Sonnet is a <a href="https://www.anthropic.com/news/claude-3-7-sonnet">new state-of-the-art model</a> that incorporates a traditional LLM and a reasoning model in one.</p><p>It goes toe-to-toe with or outperforms frontier models like Grok 3, DeepSeek-R1, and OpenAI&#8217;s o-family on most benchmarks:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!D2Cl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ea5703a-01fc-4eeb-98f1-5d0dbe2fe937_2600x2360.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!D2Cl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ea5703a-01fc-4eeb-98f1-5d0dbe2fe937_2600x2360.webp 424w, https://substackcdn.com/image/fetch/$s_!D2Cl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ea5703a-01fc-4eeb-98f1-5d0dbe2fe937_2600x2360.webp 848w, 
https://substackcdn.com/image/fetch/$s_!D2Cl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ea5703a-01fc-4eeb-98f1-5d0dbe2fe937_2600x2360.webp 1272w, https://substackcdn.com/image/fetch/$s_!D2Cl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ea5703a-01fc-4eeb-98f1-5d0dbe2fe937_2600x2360.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!D2Cl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ea5703a-01fc-4eeb-98f1-5d0dbe2fe937_2600x2360.webp" width="1456" height="1322" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8ea5703a-01fc-4eeb-98f1-5d0dbe2fe937_2600x2360.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1322,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Benchmark table comparing frontier reasoning models against Claude 3.7 Sonnet&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Benchmark table comparing frontier reasoning models against Claude 3.7 Sonnet" title="Benchmark table comparing frontier reasoning models against Claude 3.7 Sonnet" srcset="https://substackcdn.com/image/fetch/$s_!D2Cl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ea5703a-01fc-4eeb-98f1-5d0dbe2fe937_2600x2360.webp 424w, 
https://substackcdn.com/image/fetch/$s_!D2Cl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ea5703a-01fc-4eeb-98f1-5d0dbe2fe937_2600x2360.webp 848w, https://substackcdn.com/image/fetch/$s_!D2Cl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ea5703a-01fc-4eeb-98f1-5d0dbe2fe937_2600x2360.webp 1272w, https://substackcdn.com/image/fetch/$s_!D2Cl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ea5703a-01fc-4eeb-98f1-5d0dbe2fe937_2600x2360.webp 1456w" sizes="100vw" fetchpriority="high"></picture></div></a><figcaption class="image-caption">Source: <strong><a href="https://www.anthropic.com/news/claude-3-7-sonnet">Anthropic</a></strong></figcaption></figure></div><h2>How do you use it?</h2><p>Simple: Just go to the usual <a href="https://claude.ai/">claude.ai</a> website.</p><p>If you&#8217;ve never used Claude before, you&#8217;ll need to create an account.</p><p>If you have, Claude 3.7 Sonnet will now be the default model:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!c7F6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b09c847-ca91-492c-9058-86727a87fabf_711x419.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!c7F6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b09c847-ca91-492c-9058-86727a87fabf_711x419.png 424w, https://substackcdn.com/image/fetch/$s_!c7F6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b09c847-ca91-492c-9058-86727a87fabf_711x419.png 848w, https://substackcdn.com/image/fetch/$s_!c7F6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b09c847-ca91-492c-9058-86727a87fabf_711x419.png 1272w, https://substackcdn.com/image/fetch/$s_!c7F6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b09c847-ca91-492c-9058-86727a87fabf_711x419.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!c7F6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b09c847-ca91-492c-9058-86727a87fabf_711x419.png" width="711" height="419" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0b09c847-ca91-492c-9058-86727a87fabf_711x419.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:419,&quot;width&quot;:711,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:54251,&quot;alt&quot;:&quot;Claude 3.7 Sonnet default new chat screen&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/157868822?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b09c847-ca91-492c-9058-86727a87fabf_711x419.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Claude 3.7 Sonnet default new chat screen" title="Claude 3.7 Sonnet default new chat screen" srcset="https://substackcdn.com/image/fetch/$s_!c7F6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b09c847-ca91-492c-9058-86727a87fabf_711x419.png 424w, https://substackcdn.com/image/fetch/$s_!c7F6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b09c847-ca91-492c-9058-86727a87fabf_711x419.png 848w, https://substackcdn.com/image/fetch/$s_!c7F6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b09c847-ca91-492c-9058-86727a87fabf_711x419.png 1272w, 
https://substackcdn.com/image/fetch/$s_!c7F6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b09c847-ca91-492c-9058-86727a87fabf_711x419.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>Now you&#8217;ll know just what to do: You&#8217;ve seen plenty of these chat interfaces before.</p><p>Anthropic calls Claude 3.7 Sonnet &#8220;the first hybrid reasoning model on the market.&#8221;</p><p>If I&#8217;m honest, I&#8217;m somewhat puzzled by the way this is currently handled.</p><p>To me, the word &#8220;hybrid&#8221; entails a single model that 
<em>independently</em> switches between standard and reasoning modes without the user&#8217;s involvement.</p><p>But Anthropic <em>still</em> presents us with a manual thinking mode selection in its interface:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hfVj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4fbb983-6a74-43ab-a7bd-4e2428173fcc_688x417.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hfVj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4fbb983-6a74-43ab-a7bd-4e2428173fcc_688x417.png 424w, https://substackcdn.com/image/fetch/$s_!hfVj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4fbb983-6a74-43ab-a7bd-4e2428173fcc_688x417.png 848w, https://substackcdn.com/image/fetch/$s_!hfVj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4fbb983-6a74-43ab-a7bd-4e2428173fcc_688x417.png 1272w, https://substackcdn.com/image/fetch/$s_!hfVj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4fbb983-6a74-43ab-a7bd-4e2428173fcc_688x417.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hfVj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4fbb983-6a74-43ab-a7bd-4e2428173fcc_688x417.png" width="688" height="417" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c4fbb983-6a74-43ab-a7bd-4e2428173fcc_688x417.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:417,&quot;width&quot;:688,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:61473,&quot;alt&quot;:&quot;Thinking modes - Normal and Extended - in Claude 3.7 Sonnet&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/157868822?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ea996db-373e-4a7c-b37f-bc64f79c574d_688x417.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Thinking modes - Normal and Extended - in Claude 3.7 Sonnet" title="Thinking modes - Normal and Extended - in Claude 3.7 Sonnet" srcset="https://substackcdn.com/image/fetch/$s_!hfVj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4fbb983-6a74-43ab-a7bd-4e2428173fcc_688x417.png 424w, https://substackcdn.com/image/fetch/$s_!hfVj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4fbb983-6a74-43ab-a7bd-4e2428173fcc_688x417.png 848w, https://substackcdn.com/image/fetch/$s_!hfVj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4fbb983-6a74-43ab-a7bd-4e2428173fcc_688x417.png 1272w, https://substackcdn.com/image/fetch/$s_!hfVj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4fbb983-6a74-43ab-a7bd-4e2428173fcc_688x417.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>Grok has a &#8220;Think&#8221; option that does the same:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XLnb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95e7151-3e0a-4af3-97da-11cdeb548304_766x210.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!XLnb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95e7151-3e0a-4af3-97da-11cdeb548304_766x210.png 424w, https://substackcdn.com/image/fetch/$s_!XLnb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95e7151-3e0a-4af3-97da-11cdeb548304_766x210.png 848w, https://substackcdn.com/image/fetch/$s_!XLnb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95e7151-3e0a-4af3-97da-11cdeb548304_766x210.png 1272w, https://substackcdn.com/image/fetch/$s_!XLnb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95e7151-3e0a-4af3-97da-11cdeb548304_766x210.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XLnb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95e7151-3e0a-4af3-97da-11cdeb548304_766x210.png" width="766" height="210" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c95e7151-3e0a-4af3-97da-11cdeb548304_766x210.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:210,&quot;width&quot;:766,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:12631,&quot;alt&quot;:&quot;Grok \&quot;Think\&quot; 
option&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/157868822?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95e7151-3e0a-4af3-97da-11cdeb548304_766x210.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Grok &quot;Think&quot; option" title="Grok &quot;Think&quot; option" srcset="https://substackcdn.com/image/fetch/$s_!XLnb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95e7151-3e0a-4af3-97da-11cdeb548304_766x210.png 424w, https://substackcdn.com/image/fetch/$s_!XLnb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95e7151-3e0a-4af3-97da-11cdeb548304_766x210.png 848w, https://substackcdn.com/image/fetch/$s_!XLnb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95e7151-3e0a-4af3-97da-11cdeb548304_766x210.png 1272w, https://substackcdn.com/image/fetch/$s_!XLnb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95e7151-3e0a-4af3-97da-11cdeb548304_766x210.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>And here&#8217;s OpenAI&#8217;s version that currently lets free users <a href="https://www.whytryai.com/p/sunday-rundown-86-deep-thinkers?open=false#%C2%A7sunday-bonus-stupid-simple-way-to-make-and-share-apps-with-o-mini">tap into o3-mini</a>:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" 
href="https://substackcdn.com/image/fetch/$s_!Y_QG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3953ada9-a717-4a99-84a8-d3d30c724690_773x122.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Y_QG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3953ada9-a717-4a99-84a8-d3d30c724690_773x122.png 424w, https://substackcdn.com/image/fetch/$s_!Y_QG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3953ada9-a717-4a99-84a8-d3d30c724690_773x122.png 848w, https://substackcdn.com/image/fetch/$s_!Y_QG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3953ada9-a717-4a99-84a8-d3d30c724690_773x122.png 1272w, https://substackcdn.com/image/fetch/$s_!Y_QG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3953ada9-a717-4a99-84a8-d3d30c724690_773x122.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Y_QG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3953ada9-a717-4a99-84a8-d3d30c724690_773x122.png" width="773" height="122" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3953ada9-a717-4a99-84a8-d3d30c724690_773x122.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:122,&quot;width&quot;:773,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;The \&quot;Reason\&quot; lightbulb in ChatGPT for free 
users&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="The &quot;Reason&quot; lightbulb in ChatGPT for free users" title="The &quot;Reason&quot; lightbulb in ChatGPT for free users" srcset="https://substackcdn.com/image/fetch/$s_!Y_QG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3953ada9-a717-4a99-84a8-d3d30c724690_773x122.png 424w, https://substackcdn.com/image/fetch/$s_!Y_QG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3953ada9-a717-4a99-84a8-d3d30c724690_773x122.png 848w, https://substackcdn.com/image/fetch/$s_!Y_QG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3953ada9-a717-4a99-84a8-d3d30c724690_773x122.png 1272w, https://substackcdn.com/image/fetch/$s_!Y_QG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3953ada9-a717-4a99-84a8-d3d30c724690_773x122.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>The way I interpret this is that Claude 3.7 Sonnet has a &#8220;basic&#8221; thinking mode for free users and a &#8220;super&#8221; thinking mode reserved for Pro accounts.</p><p>So this dropdown decision is driven by marketing considerations.</p><p>Nevertheless, I find it unnecessarily confusing to have dropdown selections for an ostensibly hybrid model.</p><p>Anthropic did kind of beat OpenAI to the punch here, as <a href="https://x.com/sama/status/1889755723078443244">Sam Altman recently announced</a> that they&#8217;re moving in a similar direction of a unified system without model 
pickers.</p><p>By early accounts, Claude 3.7 Sonnet <a href="https://www.oneusefulthing.org/p/a-new-generation-of-ais-claude-37">is fantastic</a>, especially <a href="https://x.com/search?q=claude%203.7%20sonnet&amp;src=typed_query">when it comes to coding</a>.</p><p>Yet I can&#8217;t help but feel that Anthropic is shooting itself in the foot by not giving it Internet access.</p><h2>Why should you care?</h2><p>Having a cutting-edge model that can&#8217;t browse the web is a bit like taking a genius scientist to a world-class research lab and then shackling them to a corner table next to a bunch of dusty encyclopedias.</p><p>Claude can&#8217;t look up information about new coding tools and features or visit troubleshooting forums. It can&#8217;t apply its reasoning skills to real-time developments or read newly released scientific papers.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-8J_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f120285-65a8-42d1-8f4d-ebd327111606_1280x896.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-8J_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f120285-65a8-42d1-8f4d-ebd327111606_1280x896.jpeg 424w, https://substackcdn.com/image/fetch/$s_!-8J_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f120285-65a8-42d1-8f4d-ebd327111606_1280x896.jpeg 848w, https://substackcdn.com/image/fetch/$s_!-8J_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f120285-65a8-42d1-8f4d-ebd327111606_1280x896.jpeg 1272w, 
https://substackcdn.com/image/fetch/$s_!-8J_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f120285-65a8-42d1-8f4d-ebd327111606_1280x896.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-8J_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f120285-65a8-42d1-8f4d-ebd327111606_1280x896.jpeg" width="1280" height="896" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1f120285-65a8-42d1-8f4d-ebd327111606_1280x896.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:896,&quot;width&quot;:1280,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1566225,&quot;alt&quot;:&quot;Cute robot sitting on an arm chair smoking a pipe, holding a newspaper, asking \&quot;What year is this?!\&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.whytryai.com/i/157868822?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f120285-65a8-42d1-8f4d-ebd327111606_1280x896.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Cute robot sitting on an arm chair smoking a pipe, holding a newspaper, asking &quot;What year is this?!&quot;" title="Cute robot sitting on an arm chair smoking a pipe, holding a newspaper, asking &quot;What year is this?!&quot;" srcset="https://substackcdn.com/image/fetch/$s_!-8J_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f120285-65a8-42d1-8f4d-ebd327111606_1280x896.jpeg 424w, 
https://substackcdn.com/image/fetch/$s_!-8J_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f120285-65a8-42d1-8f4d-ebd327111606_1280x896.jpeg 848w, https://substackcdn.com/image/fetch/$s_!-8J_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f120285-65a8-42d1-8f4d-ebd327111606_1280x896.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!-8J_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f120285-65a8-42d1-8f4d-ebd327111606_1280x896.jpeg 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div>
<p>All of this might make Claude 3.7 Sonnet less competitive against otherwise weaker models that have direct Internet access.</p><p>Of course, this is mostly an issue for regular users, since developers can always use the API to incorporate Claude into existing, web-enabled applications. There are also ways to give Claude indirect web access via <a href="https://news.ycombinator.com/item?id=39946592">tool use</a> or <a href="https://www.linkup.so/">third-party solutions</a>.</p><p>Still, Anthropic could turn Claude 3.7 Sonnet into an even more appealing package by granting it out-of-the-box Internet access. Hell, I might even be persuaded to switch from my ChatGPT Plus account if that happens.</p><p>To be sure, I&#8217;ve been quite <a href="https://www.whytryai.com/p/are-we-even-ready-for-ai-search">skeptical about AI search before</a>.</p><p>But we&#8217;re starting to see impressive products like <a href="https://www.whytryai.com/p/openai-deep-research">OpenAI&#8217;s &#8220;Deep Research,&#8221;</a> which make up for the inherent issues of AI web browsing with their ability to reason through problems and identify quality sources.</p><p>Now, I&#8217;m not saying that we need yet another Deep Research product on the market.</p><p>But I&#8217;m also not <em>not</em> saying that.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Why Try AI is a reader-supported publication. 
To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>&#129781; Over to you&#8230;</h2><p>Have you had the chance to try Claude 3.7 Sonnet for yourself? What have been your early impressions? I&#8217;d love to hear if there are specific use cases in your life that Claude 3.7 Sonnet has been especially well-suited for.</p><p>Also: Do you consider the lack of web access as much of a drawback as I do? Or is it pretty negligible, as far as you&#8217;re concerned?</p><p>Leave a comment or drop me a line at <a href="mailto:whytryai@substack.com">whytryai@substack.com</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/p/claude-3-7-sonnet-internet-access/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/p/claude-3-7-sonnet-internet-access/comments"><span>Leave a comment</span></a></p><div><hr></div><blockquote><p><em><strong>Hot Takes</strong> are occasional timely posts that focus on fast-moving news and releases, in addition to my regular Thursday and Sunday columns.</em></p><p><em>If <strong>Hot Takes </strong>aren&#8217;t your cup of tea, simply go to your account at <strong><a href="https://www.whytryai.com/account">www.whytryai.com/account</a> </strong>and toggle the &#8220;Notifications&#8221; settings accordingly:</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" 
href="https://substackcdn.com/image/fetch/$s_!rr-K!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rr-K!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 424w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 848w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1272w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rr-K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png" width="745" height="268" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:268,&quot;width&quot;:745,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:20141,&quot;alt&quot;:&quot;Managing Notification settings in Substack - Why Try AI section toggles&quot;,&quot;title&quot;:&quot;Managing Notification settings in Substack - 
Why Try AI section toggles&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Managing Notification settings in Substack - Why Try AI section toggles" title="Managing Notification settings in Substack - Why Try AI section toggles" srcset="https://substackcdn.com/image/fetch/$s_!rr-K!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 424w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 848w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1272w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1456w" sizes="100vw" loading="lazy"></picture>
</div></a></figure></div></blockquote><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[OpenAI Joins the “Deep Research” Trio]]></title><description><![CDATA[A quick look at three somewhat similar agents from Google, Genspark, and OpenAI.]]></description><link>https://www.whytryai.com/p/openai-deep-research</link><guid isPermaLink="false">https://www.whytryai.com/p/openai-deep-research</guid><dc:creator><![CDATA[Daniel Nest]]></dc:creator><pubDate>Mon, 03 Feb 2025 20:26:56 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/c340f6d5-6ea5-48e0-86d5-a8e4530952a7_1280x896.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>TL;DR</h2><p>OpenAI just released a <a href="https://openai.com/index/introducing-deep-research/">&#8220;Deep Research&#8221; agent</a> that can autonomously research and reason about complex topics. 
It&#8217;s the third &#8220;Deep Research&#8221; tool in less than two months, but it&#8217;s arguably the most capable.</p><h2>What is it?</h2><p>OpenAI describes it as &#8220;an agent that uses reasoning to synthesize large amounts of online information and complete multi-step research tasks.&#8221;</p><p>Basically, you ask Deep Research a question or give it a research task, and it sets off on its own to create a plan, read through relevant literature, explore different research avenues based on its findings, and seek additional information to produce the final report.</p><p>By early accounts, it&#8217;s quite impressive, offering a &#8220;<a href="https://www.oneusefulthing.org/p/the-end-of-search-the-beginning-of">near PhD-level analysis</a>&#8221; according to Ethan Mollick.</p><p>It&#8217;s also, confusingly, the <em>third</em> such agent with <em>exactly</em> the same name.</p><p>Here&#8217;s the timeline:</p><ul><li><p>December 11, 2024: Google released a &#8220;<a href="https://blog.google/products/gemini/google-gemini-deep-research/">Deep Research</a>&#8221; assistant for paying Gemini Advanced users.</p></li><li><p>January 27, 2025: Genspark launched a &#8220;<a href="https://mainfunc.ai/blog/genspark_autopilot_agent_deep_research">Deep Research</a>&#8221; agent with similar capabilities.</p></li><li><p>February 2, 2025: OpenAI announced its own &#8220;<a href="https://openai.com/index/introducing-deep-research/">Deep Research</a>&#8221; agent, because naming stuff is hard, you guys!</p></li></ul><p>But while the three tools are conceptually similar, they aren&#8217;t equally capable or equally accessible.</p><p>Let&#8217;s take a quick look at each of them and your options.</p><h2>How do you use it?</h2><p>In order from most to least expensive&#8230;</p><h3>1. OpenAI&#8217;s deep research</h3><p>Right now, OpenAI&#8217;s &#8220;deep research&#8221; is available exclusively to people paying <strong>$200 per month</strong> for a ChatGPT Pro account. 
Plus and Teams accounts might have <a href="https://openai.com/index/introducing-deep-research/#:~:text=usage%20and%20time.-,Access,-Deep%20research%20in">to wait a month</a> before trying it.</p><p>If you&#8217;re one of the fancy people with a Pro account, you&#8217;ll want to toggle the &#8220;Deep research&#8221; button when entering your query:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FtWy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F590d9df7-b141-4707-aa5e-b00e020aff05_646x88.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FtWy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F590d9df7-b141-4707-aa5e-b00e020aff05_646x88.png 424w, https://substackcdn.com/image/fetch/$s_!FtWy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F590d9df7-b141-4707-aa5e-b00e020aff05_646x88.png 848w, https://substackcdn.com/image/fetch/$s_!FtWy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F590d9df7-b141-4707-aa5e-b00e020aff05_646x88.png 1272w, https://substackcdn.com/image/fetch/$s_!FtWy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F590d9df7-b141-4707-aa5e-b00e020aff05_646x88.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FtWy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F590d9df7-b141-4707-aa5e-b00e020aff05_646x88.png" width="646" height="88" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/590d9df7-b141-4707-aa5e-b00e020aff05_646x88.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:88,&quot;width&quot;:646,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:14791,&quot;alt&quot;:&quot;OpenAI \&quot;Deep Research\&quot; button&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="OpenAI &quot;Deep Research&quot; button" title="OpenAI &quot;Deep Research&quot; button" srcset="https://substackcdn.com/image/fetch/$s_!FtWy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F590d9df7-b141-4707-aa5e-b00e020aff05_646x88.png 424w, https://substackcdn.com/image/fetch/$s_!FtWy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F590d9df7-b141-4707-aa5e-b00e020aff05_646x88.png 848w, https://substackcdn.com/image/fetch/$s_!FtWy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F590d9df7-b141-4707-aa5e-b00e020aff05_646x88.png 1272w, https://substackcdn.com/image/fetch/$s_!FtWy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F590d9df7-b141-4707-aa5e-b00e020aff05_646x88.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Ethan Mollick says that OpenAI&#8217;s deep research agent truly appears to process and reason about the information, pursue supporting studies, and dig deeper when necessary on its own. 
It uses a version of OpenAI&#8217;s upcoming, most powerful o3 reasoning model to do this.</p><p>If you can afford it, it&#8217;s the best independent AI research agent currently available.</p><h3>2. Google&#8217;s deep research</h3><p>Google&#8217;s version has been out the longest and costs <strong>$19.99 per month</strong>, but you can try it for free with a <a href="https://gemini.google/advanced/">one-month trial</a> of Gemini Advanced.</p><p>To use it, select &#8220;1.5 Pro with Deep Research&#8221; from the model dropdown:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-W9r!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde3a35bb-96b1-4e3c-b0ab-accfe1590aba_726x409.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-W9r!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde3a35bb-96b1-4e3c-b0ab-accfe1590aba_726x409.png 424w, https://substackcdn.com/image/fetch/$s_!-W9r!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde3a35bb-96b1-4e3c-b0ab-accfe1590aba_726x409.png 848w, https://substackcdn.com/image/fetch/$s_!-W9r!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde3a35bb-96b1-4e3c-b0ab-accfe1590aba_726x409.png 1272w, https://substackcdn.com/image/fetch/$s_!-W9r!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde3a35bb-96b1-4e3c-b0ab-accfe1590aba_726x409.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!-W9r!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde3a35bb-96b1-4e3c-b0ab-accfe1590aba_726x409.png" width="726" height="409" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/de3a35bb-96b1-4e3c-b0ab-accfe1590aba_726x409.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:409,&quot;width&quot;:726,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:32637,&quot;alt&quot;:&quot;Google's Deep Research with Gemini 1.5 Pro&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Google's Deep Research with Gemini 1.5 Pro" title="Google's Deep Research with Gemini 1.5 Pro" srcset="https://substackcdn.com/image/fetch/$s_!-W9r!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde3a35bb-96b1-4e3c-b0ab-accfe1590aba_726x409.png 424w, https://substackcdn.com/image/fetch/$s_!-W9r!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde3a35bb-96b1-4e3c-b0ab-accfe1590aba_726x409.png 848w, https://substackcdn.com/image/fetch/$s_!-W9r!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde3a35bb-96b1-4e3c-b0ab-accfe1590aba_726x409.png 1272w, https://substackcdn.com/image/fetch/$s_!-W9r!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde3a35bb-96b1-4e3c-b0ab-accfe1590aba_726x409.png 1456w" sizes="100vw" loading="lazy"></picture>
</div></a><figcaption class="image-caption">Source: <strong><a href="https://blog.google/products/gemini/google-gemini-deep-research/">Google</a></strong></figcaption></figure></div><p>Google&#8217;s &#8220;Deep Research&#8221; is great at quickly scanning lots of sites but appears to have less innate intelligence.</p><p>Here&#8217;s the summary from <a href="https://www.oneusefulthing.org/p/the-end-of-search-the-beginning-of">Ethan Mollick&#8217;s article</a>:</p><blockquote><p><em>Google surfaces far more citations, but they are often a mix of websites of varying quality (the lack of access to paywalled information and books hurts all of these agents). 
It appears to gather documents all at once, as opposed to the curiosity-driven discovery of OpenAI&#8217;s researcher agent. And, because (as of now) this is powered by the non-reasoning, older Gemini 1.5 model, the overall summary is much more surface-level, though still solid and apparently error-free. It is like a very good undergraduate product.</em></p></blockquote><p>And then there&#8217;s the free version for the rest of us&#8230;</p><h3>3. Genspark&#8217;s deep research</h3><p>Two weeks ago, I looked at Genspark&#8217;s <a href="https://www.whytryai.com/i/154748642/genspark-lets-you-test-image-prompts-at-scale">image generation agent</a>.</p><p>You find the new &#8220;Deep Research&#8221; agent on the same <a href="https://www.genspark.ai/agents">Genspark Agents page</a>:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gcRL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8afd2e9-c9a6-4cff-b065-50db49d9a8b5_1266x607.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gcRL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8afd2e9-c9a6-4cff-b065-50db49d9a8b5_1266x607.png 424w, https://substackcdn.com/image/fetch/$s_!gcRL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8afd2e9-c9a6-4cff-b065-50db49d9a8b5_1266x607.png 848w, https://substackcdn.com/image/fetch/$s_!gcRL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8afd2e9-c9a6-4cff-b065-50db49d9a8b5_1266x607.png 1272w, 
https://substackcdn.com/image/fetch/$s_!gcRL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8afd2e9-c9a6-4cff-b065-50db49d9a8b5_1266x607.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gcRL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8afd2e9-c9a6-4cff-b065-50db49d9a8b5_1266x607.png" width="1266" height="607" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c8afd2e9-c9a6-4cff-b065-50db49d9a8b5_1266x607.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:607,&quot;width&quot;:1266,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:76117,&quot;alt&quot;:&quot;Deep Research agent from Genspark&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Deep Research agent from Genspark" title="Deep Research agent from Genspark" srcset="https://substackcdn.com/image/fetch/$s_!gcRL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8afd2e9-c9a6-4cff-b065-50db49d9a8b5_1266x607.png 424w, https://substackcdn.com/image/fetch/$s_!gcRL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8afd2e9-c9a6-4cff-b065-50db49d9a8b5_1266x607.png 848w, https://substackcdn.com/image/fetch/$s_!gcRL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8afd2e9-c9a6-4cff-b065-50db49d9a8b5_1266x607.png 1272w, 
https://substackcdn.com/image/fetch/$s_!gcRL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8afd2e9-c9a6-4cff-b065-50db49d9a8b5_1266x607.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>After you give &#8220;Deep Research&#8221; a task, it engages GPT-4o, Claude 3.5 Sonnet, and Gemini 1.5 Pro to create a research plan:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" 
href="https://substackcdn.com/image/fetch/$s_!Kv3k!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087f4030-d19e-494d-9d70-82a5d0bbf58d_775x794.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Kv3k!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087f4030-d19e-494d-9d70-82a5d0bbf58d_775x794.png 424w, https://substackcdn.com/image/fetch/$s_!Kv3k!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087f4030-d19e-494d-9d70-82a5d0bbf58d_775x794.png 848w, https://substackcdn.com/image/fetch/$s_!Kv3k!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087f4030-d19e-494d-9d70-82a5d0bbf58d_775x794.png 1272w, https://substackcdn.com/image/fetch/$s_!Kv3k!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087f4030-d19e-494d-9d70-82a5d0bbf58d_775x794.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Kv3k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087f4030-d19e-494d-9d70-82a5d0bbf58d_775x794.png" width="775" height="794" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/087f4030-d19e-494d-9d70-82a5d0bbf58d_775x794.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:794,&quot;width&quot;:775,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:77758,&quot;alt&quot;:&quot;I want to compare all currently available AI agents called \&quot;Deep Research.\&quot; Research Plan Mixture-of-Agents  GPT-4o  Claude 3.5 Sonnet  
Gemini 1.5 Pro To compare all currently available AI agents called 'Deep Research', we need to approach this systematically. First, we'll need to understand what 'Deep Research' is and which companies or organizations have developed such AI agents. Then, we'll gather detailed information about each of these agents, including their capabilities, limitations, and unique features. We'll also look for any comparative studies or reviews that have already been done. This process will involve searching for official announcements, tech news articles, research papers, and user reviews. Let's break this down into steps and see how we can get a comprehensive comparison.  Search for existing comparisons or reviews of 'Deep Research' AI agents. This will give us a starting point and might provide a list of agents to compare. Identify the main providers of 'Deep Research' AI agents. From our initial search, it seems OpenAI has launched one, but we need to find if there are others. Gather detailed information about OpenAI's Deep Research agent, as it seems to be a prominent one in the market. Research other Deep Research AI agents we identified in step 2, gathering similar&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="I want to compare all currently available AI agents called &quot;Deep Research.&quot; Research Plan Mixture-of-Agents  GPT-4o  Claude 3.5 Sonnet  Gemini 1.5 Pro To compare all currently available AI agents called 'Deep Research', we need to approach this systematically. First, we'll need to understand what 'Deep Research' is and which companies or organizations have developed such AI agents. Then, we'll gather detailed information about each of these agents, including their capabilities, limitations, and unique features. 
We'll also look for any comparative studies or reviews that have already been done. This process will involve searching for official announcements, tech news articles, research papers, and user reviews. Let's break this down into steps and see how we can get a comprehensive comparison.  Search for existing comparisons or reviews of 'Deep Research' AI agents. This will give us a starting point and might provide a list of agents to compare. Identify the main providers of 'Deep Research' AI agents. From our initial search, it seems OpenAI has launched one, but we need to find if there are others. Gather detailed information about OpenAI's Deep Research agent, as it seems to be a prominent one in the market. Research other Deep Research AI agents we identified in step 2, gathering similar" title="I want to compare all currently available AI agents called &quot;Deep Research.&quot; Research Plan Mixture-of-Agents  GPT-4o  Claude 3.5 Sonnet  Gemini 1.5 Pro To compare all currently available AI agents called 'Deep Research', we need to approach this systematically. First, we'll need to understand what 'Deep Research' is and which companies or organizations have developed such AI agents. Then, we'll gather detailed information about each of these agents, including their capabilities, limitations, and unique features. We'll also look for any comparative studies or reviews that have already been done. This process will involve searching for official announcements, tech news articles, research papers, and user reviews. Let's break this down into steps and see how we can get a comprehensive comparison.  Search for existing comparisons or reviews of 'Deep Research' AI agents. This will give us a starting point and might provide a list of agents to compare. Identify the main providers of 'Deep Research' AI agents. From our initial search, it seems OpenAI has launched one, but we need to find if there are others. 
Gather detailed information about OpenAI's Deep Research agent, as it seems to be a prominent one in the market. Research other Deep Research AI agents we identified in step 2, gathering similar" srcset="https://substackcdn.com/image/fetch/$s_!Kv3k!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087f4030-d19e-494d-9d70-82a5d0bbf58d_775x794.png 424w, https://substackcdn.com/image/fetch/$s_!Kv3k!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087f4030-d19e-494d-9d70-82a5d0bbf58d_775x794.png 848w, https://substackcdn.com/image/fetch/$s_!Kv3k!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087f4030-d19e-494d-9d70-82a5d0bbf58d_775x794.png 1272w, https://substackcdn.com/image/fetch/$s_!Kv3k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087f4030-d19e-494d-9d70-82a5d0bbf58d_775x794.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">How meta!</figcaption></figure></div>
<p>You can then edit or approve the plan, at which point the agent independently crawls and summarizes hundreds of sources and sends a report to your inbox. This takes around 20 minutes.</p><p>Genspark reports contain mind maps and comparison tables for an easy overview:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!g_0-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F809691c9-75c6-47f3-b13d-69be2b7854c5_726x393.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!g_0-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F809691c9-75c6-47f3-b13d-69be2b7854c5_726x393.png 424w, https://substackcdn.com/image/fetch/$s_!g_0-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F809691c9-75c6-47f3-b13d-69be2b7854c5_726x393.png 848w, https://substackcdn.com/image/fetch/$s_!g_0-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F809691c9-75c6-47f3-b13d-69be2b7854c5_726x393.png 1272w, 
https://substackcdn.com/image/fetch/$s_!g_0-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F809691c9-75c6-47f3-b13d-69be2b7854c5_726x393.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!g_0-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F809691c9-75c6-47f3-b13d-69be2b7854c5_726x393.png" width="726" height="393" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/809691c9-75c6-47f3-b13d-69be2b7854c5_726x393.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:393,&quot;width&quot;:726,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:50627,&quot;alt&quot;:&quot;Mind map by Genspark's deep research agent&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Mind map by Genspark's deep research agent" title="Mind map by Genspark's deep research agent" srcset="https://substackcdn.com/image/fetch/$s_!g_0-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F809691c9-75c6-47f3-b13d-69be2b7854c5_726x393.png 424w, https://substackcdn.com/image/fetch/$s_!g_0-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F809691c9-75c6-47f3-b13d-69be2b7854c5_726x393.png 848w, https://substackcdn.com/image/fetch/$s_!g_0-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F809691c9-75c6-47f3-b13d-69be2b7854c5_726x393.png 1272w, 
https://substackcdn.com/image/fetch/$s_!g_0-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F809691c9-75c6-47f3-b13d-69be2b7854c5_726x393.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>If I&#8217;m honest, the research quality here is a mixed bag.</p><p>In my example, Genspark started strong with the OpenAI and Google examples but then cast a wider net to include DeepSeek, Claude, and other entries that didn&#8217;t have &#8220;Deep Research&#8221; in their name. 
You can <a href="https://www.genspark.ai/spark?id=44916286-7832-4e98-bbc1-e3bcdb8f9217">view the final report</a> for yourself.</p><p>But as it stands, Genspark&#8217;s deep research agent is the only <strong>free</strong> version on the market. Everyone gets a few uses for free, and you can also use my invite link for <a href="https://www.genspark.ai/invite?invite_code=MGM2NTJkYWVMMWNkOUxiMWI0TGQzNGJMZThlMTM0ODgzYzM0">a free month of Genspark Plus</a>.</p><h2>Why should you care?</h2><p>We appear to be entering an era of narrow yet actually useful agents.</p><p>Unlike the hypothetical &#8220;do anything&#8221; browser-using AI agents, which remain clunky and error-prone and often require the user&#8217;s active involvement,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> deep research agents are capable of completing the task on their own.</p><p>Crawling and summarizing information on the Internet lends itself quite well to this kind of fire-and-forget agentic approach.</p><p>Note that &#8220;Deep Research&#8221; agents are <a href="https://www.whytryai.com/p/are-we-even-ready-for-ai-search">not immune to hallucinations</a>. You must still take a &#8220;trust but verify&#8221; approach to any information they gather and conclusions they reach. 
But even in their current state, they can meaningfully condense hours or even days of research into a much shorter timeframe.</p><p>As more powerful reasoning models become mainstream, deep research agents are only going to get better.</p><p>So I strongly suggest you test-drive whichever existing version you can afford.</p><p>This way, you&#8217;ll be better prepared for what&#8217;s coming soon.</p><h2>&#129781; Over to you&#8230;</h2><p>Have you already used any of these Deep Research agents? What&#8217;s been your impression? I&#8217;d love to hear from all of you Mr. and Mrs. 
Money Bags who have tried the expensive OpenAI version.</p><p>Leave a comment or drop me a line at <a href="mailto:whytryai@substack.com">whytryai@substack.com</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/p/openai-deep-research/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/p/openai-deep-research/comments"><span>Leave a comment</span></a></p><div><hr></div><blockquote><p><em><strong>Hot Takes</strong> like the above are occasional timely posts that focus on fast-moving news and releases, in addition to my regular Thursday and Sunday columns.</em></p><p><em>If <strong>Hot Takes </strong>aren&#8217;t your cup of tea, simply go to your account at <strong><a href="https://www.whytryai.com/account">www.whytryai.com/account</a> </strong>and toggle the &#8220;Notifications&#8221; settings accordingly:</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rr-K!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rr-K!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 424w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 848w, 
https://substackcdn.com/image/fetch/$s_!rr-K!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1272w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rr-K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png" width="745" height="268" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:268,&quot;width&quot;:745,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:20141,&quot;alt&quot;:&quot;Managing Notification settings in Substack - Why Try AI section toggles&quot;,&quot;title&quot;:&quot;Managing Notification settings in Substack - Why Try AI section toggles&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Managing Notification settings in Substack - Why Try AI section toggles" title="Managing Notification settings in Substack - Why Try AI section toggles" srcset="https://substackcdn.com/image/fetch/$s_!rr-K!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 424w, 
https://substackcdn.com/image/fetch/$s_!rr-K!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 848w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1272w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div></blockquote><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>See Anthropic&#8217;s &#8220;<a href="https://www.anthropic.com/news/3-5-models-and-computer-use">Computer Use</a>&#8221; and OpenAI&#8217;s &#8220;<a href="https://openai.com/index/introducing-operator/">Operator</a>.&#8221;</p></div></div>]]></content:encoded></item><item><title><![CDATA[No, DeepSeek’s Janus-Pro-7B Doesn’t Make Better Images Than DALL-E 3.]]></title><description><![CDATA[DeepSeek is an incredible AI lab, but let's not elevate it to godlike status just yet.]]></description><link>https://www.whytryai.com/p/deepseek-janus-pro-7b-is-not-better-than-dalle-e3</link><guid isPermaLink="false">https://www.whytryai.com/p/deepseek-janus-pro-7b-is-not-better-than-dalle-e3</guid><dc:creator><![CDATA[Daniel Nest]]></dc:creator><pubDate>Tue, 28 Jan 2025 11:26:36 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/fd5fdeba-491d-41ca-84a5-1b9db5ef52d3_1278x406.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe now</span></a></p><h2>TL;DR</h2><p>DeepSeek released an image model called Janus-Pro-7B, which some claim makes better images than DALL-E 3 and Stable Diffusion, but it&#8217;s nowhere near in my tests.</p><h2>What is it?</h2><p>DeepSeek <a href="https://huggingface.co/deepseek-ai/Janus-Pro-7B">describes Janus-Pro-7B</a> as &#8220;a novel autoregressive framework 
that unifies multimodal understanding and generation.&#8221;</p><p>In short, this means you can use the same model to process image inputs <em>and</em> generate new images. This makes Janus-Pro-7B quite flexible, combining the capabilities of task-specific models in a single one.</p><p>That&#8212;and the fact that it&#8217;s yet another open-source model&#8212;is worthy of praise.</p><p>But DeepSeek also shared a few benchmarks that show Janus-Pro-7B outperforming OpenAI&#8217;s DALL-E 3 and Stable Diffusion 3 Medium:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6deH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a9a6ba-b44e-46c1-9fb7-608b6d01ce8c_602x525.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6deH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a9a6ba-b44e-46c1-9fb7-608b6d01ce8c_602x525.png 424w, https://substackcdn.com/image/fetch/$s_!6deH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a9a6ba-b44e-46c1-9fb7-608b6d01ce8c_602x525.png 848w, https://substackcdn.com/image/fetch/$s_!6deH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a9a6ba-b44e-46c1-9fb7-608b6d01ce8c_602x525.png 1272w, https://substackcdn.com/image/fetch/$s_!6deH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a9a6ba-b44e-46c1-9fb7-608b6d01ce8c_602x525.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!6deH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a9a6ba-b44e-46c1-9fb7-608b6d01ce8c_602x525.png" width="450" height="392.4418604651163" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/19a9a6ba-b44e-46c1-9fb7-608b6d01ce8c_602x525.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:525,&quot;width&quot;:602,&quot;resizeWidth&quot;:450,&quot;bytes&quot;:106987,&quot;alt&quot;:&quot;Janus-Pro-7B GenEval and DPG-Bench scores&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Janus-Pro-7B GenEval and DPG-Bench scores" title="Janus-Pro-7B GenEval and DPG-Bench scores" srcset="https://substackcdn.com/image/fetch/$s_!6deH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a9a6ba-b44e-46c1-9fb7-608b6d01ce8c_602x525.png 424w, https://substackcdn.com/image/fetch/$s_!6deH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a9a6ba-b44e-46c1-9fb7-608b6d01ce8c_602x525.png 848w, https://substackcdn.com/image/fetch/$s_!6deH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a9a6ba-b44e-46c1-9fb7-608b6d01ce8c_602x525.png 1272w, https://substackcdn.com/image/fetch/$s_!6deH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19a9a6ba-b44e-46c1-9fb7-608b6d01ce8c_602x525.png 1456w" sizes="100vw" 
loading="lazy"></picture></div></a><figcaption class="image-caption">Source: <strong><a href="https://huggingface.co/deepseek-ai/Janus-Pro-7B">DeepSeek</a></strong> on Hugging Face</figcaption></figure></div><p>Here&#8217;s the thing: <a href="https://github.com/djghosh13/geneval">GenEval</a> and <a href="https://github.com/TencentQQGYLab/ELLA?tab=readme-ov-file#-dpg-bench">DPG-Bench</a> are benchmarks that measure <em>prompt adherence</em>&#8212;how well a model follows directions in the text prompt.</p><p>They say absolutely nothing about the <em>aesthetic quality</em> of the images a model produces.</p><p>Still, this didn&#8217;t stop the Internet from instantly 
proclaiming Janus-Pro-7B the new king of images:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!scVI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba96d7a6-a95f-4613-a9a7-15898cfc3586_749x116.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!scVI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba96d7a6-a95f-4613-a9a7-15898cfc3586_749x116.png 424w, https://substackcdn.com/image/fetch/$s_!scVI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba96d7a6-a95f-4613-a9a7-15898cfc3586_749x116.png 848w, https://substackcdn.com/image/fetch/$s_!scVI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba96d7a6-a95f-4613-a9a7-15898cfc3586_749x116.png 1272w, https://substackcdn.com/image/fetch/$s_!scVI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba96d7a6-a95f-4613-a9a7-15898cfc3586_749x116.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!scVI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba96d7a6-a95f-4613-a9a7-15898cfc3586_749x116.png" width="749" height="116" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ba96d7a6-a95f-4613-a9a7-15898cfc3586_749x116.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:116,&quot;width&quot;:749,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:28466,&quot;alt&quot;:&quot;Spaces using deepseek-ai/Janus-Pro-7B 19 &#127757; deepseek-ai/Janus-Pro-7B &#127757; AP123/Janus-Pro-7b &#129408; blanchon/JanusPro &#127757; NeuroSenko/Janus-Pro-7b &#127757; mkozak/Janus-Pro-7b &#128640; Bils/DeepseekJanusPro-Image &#128640; LLMhacker/DeepseekJanusPro-Image &#128187; shakuur/meme &#127757; unography/Janus-Pro-7b &#127757; zx2323/xxxkk &#127757; LLMhacker/Multimodal_Understanding &#127757; omninexus/deepseek-vision + 7 Spaces&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Spaces using deepseek-ai/Janus-Pro-7B 19 &#127757; deepseek-ai/Janus-Pro-7B &#127757; AP123/Janus-Pro-7b &#129408; blanchon/JanusPro &#127757; NeuroSenko/Janus-Pro-7b &#127757; mkozak/Janus-Pro-7b &#128640; Bils/DeepseekJanusPro-Image &#128640; LLMhacker/DeepseekJanusPro-Image &#128187; shakuur/meme &#127757; unography/Janus-Pro-7b &#127757; zx2323/xxxkk &#127757; LLMhacker/Multimodal_Understanding &#127757; omninexus/deepseek-vision + 7 Spaces" title="Spaces using deepseek-ai/Janus-Pro-7B 19 &#127757; deepseek-ai/Janus-Pro-7B &#127757; AP123/Janus-Pro-7b &#129408; blanchon/JanusPro &#127757; NeuroSenko/Janus-Pro-7b &#127757; mkozak/Janus-Pro-7b &#128640; Bils/DeepseekJanusPro-Image &#128640; LLMhacker/DeepseekJanusPro-Image &#128187; shakuur/meme &#127757; unography/Janus-Pro-7b &#127757; zx2323/xxxkk &#127757; LLMhacker/Multimodal_Understanding &#127757; omninexus/deepseek-vision + 7 
Spaces" srcset="https://substackcdn.com/image/fetch/$s_!scVI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba96d7a6-a95f-4613-a9a7-15898cfc3586_749x116.png 424w, https://substackcdn.com/image/fetch/$s_!scVI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba96d7a6-a95f-4613-a9a7-15898cfc3586_749x116.png 848w, https://substackcdn.com/image/fetch/$s_!scVI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba96d7a6-a95f-4613-a9a7-15898cfc3586_749x116.png 1272w, https://substackcdn.com/image/fetch/$s_!scVI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba96d7a6-a95f-4613-a9a7-15898cfc3586_749x116.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Source: <strong><a href="https://www.businessinsider.com/deepseek-janus-pro-7b-ai-model-openai-dall-e3-2025-1">Business Insider</a></strong></figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jFnb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab2a7ba-4697-4201-a0a5-8752486885d0_745x165.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jFnb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab2a7ba-4697-4201-a0a5-8752486885d0_745x165.png 424w, 
https://substackcdn.com/image/fetch/$s_!jFnb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab2a7ba-4697-4201-a0a5-8752486885d0_745x165.png 848w, https://substackcdn.com/image/fetch/$s_!jFnb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab2a7ba-4697-4201-a0a5-8752486885d0_745x165.png 1272w, https://substackcdn.com/image/fetch/$s_!jFnb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab2a7ba-4697-4201-a0a5-8752486885d0_745x165.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jFnb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab2a7ba-4697-4201-a0a5-8752486885d0_745x165.png" width="745" height="165" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1ab2a7ba-4697-4201-a0a5-8752486885d0_745x165.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:165,&quot;width&quot;:745,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:28165,&quot;alt&quot;:&quot;DeepSeek-AI Releases Janus-Pro 7B: An Open-Source multimodal AI that Beats DALL-E 3 and Stable Diffusion&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="DeepSeek-AI Releases Janus-Pro 7B: An Open-Source multimodal AI that Beats DALL-E 3 and Stable Diffusion" title="DeepSeek-AI Releases Janus-Pro 7B: An Open-Source multimodal AI that Beats DALL-E 3 and Stable Diffusion" 
srcset="https://substackcdn.com/image/fetch/$s_!jFnb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab2a7ba-4697-4201-a0a5-8752486885d0_745x165.png 424w, https://substackcdn.com/image/fetch/$s_!jFnb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab2a7ba-4697-4201-a0a5-8752486885d0_745x165.png 848w, https://substackcdn.com/image/fetch/$s_!jFnb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab2a7ba-4697-4201-a0a5-8752486885d0_745x165.png 1272w, https://substackcdn.com/image/fetch/$s_!jFnb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab2a7ba-4697-4201-a0a5-8752486885d0_745x165.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Source: <strong><a href="https://www.marktechpost.com/2025/01/27/deepseek-ai-releases-janus-pro-7b-an-open-source-multimodal-ai-that-beats-dall-e-3-and-stable-diffusion/">Marktechpost</a></strong></figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/minchoi/status/1883967833270636662" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XzY5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96a8fa7-f6fb-4afd-9f09-c70ecd10182c_583x721.png 424w, https://substackcdn.com/image/fetch/$s_!XzY5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96a8fa7-f6fb-4afd-9f09-c70ecd10182c_583x721.png 848w, 
https://substackcdn.com/image/fetch/$s_!XzY5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96a8fa7-f6fb-4afd-9f09-c70ecd10182c_583x721.png 1272w, https://substackcdn.com/image/fetch/$s_!XzY5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96a8fa7-f6fb-4afd-9f09-c70ecd10182c_583x721.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XzY5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96a8fa7-f6fb-4afd-9f09-c70ecd10182c_583x721.png" width="583" height="721" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f96a8fa7-f6fb-4afd-9f09-c70ecd10182c_583x721.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:721,&quot;width&quot;:583,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:365446,&quot;alt&quot;:&quot;Wow.  DeepSeek just dropped Janus-Pro-7B, an open-source multimodal AI that beats DALL-E 3 and Stable Diffusion.  The &#128011; is on fire. &#128064;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/minchoi/status/1883967833270636662&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Wow.  DeepSeek just dropped Janus-Pro-7B, an open-source multimodal AI that beats DALL-E 3 and Stable Diffusion.  The &#128011; is on fire. &#128064;" title="Wow.  DeepSeek just dropped Janus-Pro-7B, an open-source multimodal AI that beats DALL-E 3 and Stable Diffusion.  The &#128011; is on fire. 
&#128064;" srcset="https://substackcdn.com/image/fetch/$s_!XzY5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96a8fa7-f6fb-4afd-9f09-c70ecd10182c_583x721.png 424w, https://substackcdn.com/image/fetch/$s_!XzY5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96a8fa7-f6fb-4afd-9f09-c70ecd10182c_583x721.png 848w, https://substackcdn.com/image/fetch/$s_!XzY5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96a8fa7-f6fb-4afd-9f09-c70ecd10182c_583x721.png 1272w, https://substackcdn.com/image/fetch/$s_!XzY5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96a8fa7-f6fb-4afd-9f09-c70ecd10182c_583x721.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Source: <strong><a href="https://x.com/minchoi/status/1883967833270636662">Min Choi on X</a></strong></figcaption></figure></div><p>And while <a href="https://www.whytryai.com/p/dall-e-3-better-captions-research-paper-summary">better prompt adherence</a> is nothing to sneeze at, you also want your image model to create something that looks good.</p><p>Unfortunately for Janus, most of what it generates is hot garbage.</p><p>Let me show you.</p><h2>How do you use it?</h2><p>To test Janus-Pro-7B for yourself, head on over to its <a href="https://huggingface.co/deepseek-ai/Janus-Pro-7B">Hugging Face</a> page and select any of the &#8220;Spaces&#8221; using the model on the right-hand side:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cngX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08871633-c36f-4265-8c83-b1cad01dabb5_586x223.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cngX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08871633-c36f-4265-8c83-b1cad01dabb5_586x223.png 424w, https://substackcdn.com/image/fetch/$s_!cngX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08871633-c36f-4265-8c83-b1cad01dabb5_586x223.png 848w, 
https://substackcdn.com/image/fetch/$s_!cngX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08871633-c36f-4265-8c83-b1cad01dabb5_586x223.png 1272w, https://substackcdn.com/image/fetch/$s_!cngX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08871633-c36f-4265-8c83-b1cad01dabb5_586x223.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cngX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08871633-c36f-4265-8c83-b1cad01dabb5_586x223.png" width="586" height="223" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/08871633-c36f-4265-8c83-b1cad01dabb5_586x223.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:223,&quot;width&quot;:586,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:37515,&quot;alt&quot;:&quot;Spaces using deepseek-ai/Janus-Pro-7B 19 &#127757; deepseek-ai/Janus-Pro-7B &#127757; AP123/Janus-Pro-7b &#129408; blanchon/JanusPro &#127757; NeuroSenko/Janus-Pro-7b &#127757; mkozak/Janus-Pro-7b &#128640; Bils/DeepseekJanusPro-Image &#128640; LLMhacker/DeepseekJanusPro-Image &#128187; shakuur/meme &#127757; unography/Janus-Pro-7b &#127757; zx2323/xxxkk &#127757; LLMhacker/Multimodal_Understanding &#127757; omninexus/deepseek-vision + 7 Spaces&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Spaces using deepseek-ai/Janus-Pro-7B 19 &#127757; deepseek-ai/Janus-Pro-7B &#127757; AP123/Janus-Pro-7b &#129408; blanchon/JanusPro &#127757; NeuroSenko/Janus-Pro-7b &#127757; 
mkozak/Janus-Pro-7b &#128640; Bils/DeepseekJanusPro-Image &#128640; LLMhacker/DeepseekJanusPro-Image &#128187; shakuur/meme &#127757; unography/Janus-Pro-7b &#127757; zx2323/xxxkk &#127757; LLMhacker/Multimodal_Understanding &#127757; omninexus/deepseek-vision + 7 Spaces" title="Spaces using deepseek-ai/Janus-Pro-7B 19 &#127757; deepseek-ai/Janus-Pro-7B &#127757; AP123/Janus-Pro-7b &#129408; blanchon/JanusPro &#127757; NeuroSenko/Janus-Pro-7b &#127757; mkozak/Janus-Pro-7b &#128640; Bils/DeepseekJanusPro-Image &#128640; LLMhacker/DeepseekJanusPro-Image &#128187; shakuur/meme &#127757; unography/Janus-Pro-7b &#127757; zx2323/xxxkk &#127757; LLMhacker/Multimodal_Understanding &#127757; omninexus/deepseek-vision + 7 Spaces" srcset="https://substackcdn.com/image/fetch/$s_!cngX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08871633-c36f-4265-8c83-b1cad01dabb5_586x223.png 424w, https://substackcdn.com/image/fetch/$s_!cngX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08871633-c36f-4265-8c83-b1cad01dabb5_586x223.png 848w, https://substackcdn.com/image/fetch/$s_!cngX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08871633-c36f-4265-8c83-b1cad01dabb5_586x223.png 1272w, https://substackcdn.com/image/fetch/$s_!cngX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08871633-c36f-4265-8c83-b1cad01dabb5_586x223.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>I&#8217;ll go with the official <a href="https://huggingface.co/spaces/deepseek-ai/Janus-Pro-7B">deepseek-ai/Janus-Pro-7B</a>. 
Here&#8217;s my 2-minute walkthrough:</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;f8a4b104-54ed-4d52-aa8d-5fcf60bf92e0&quot;,&quot;duration&quot;:null}"></div><p>If you saw the video, you&#8217;ll know that Janus-Pro-7B:</p><ol><li><p>Hallucinates nonexistent details in uploaded images.</p></li><li><p>Creates new images that look like Lovecraftian horrors.</p></li></ol><p>Here are three sample prompts and side-by-side comparisons against DALL-E 3 and Stable Diffusion 3, the models DeepSeek chose to benchmark against:</p><blockquote><p><strong>Prompt:</strong> <em>&#8220;photo of a samurai in a traditional outfit holding a sci-fi blaster, futuristic skyscrapers with neon signs in the background&#8221;</em></p></blockquote><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/webp&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c2706819-2c83-4039-a50f-e86a50338842_768x768.webp&quot;},{&quot;type&quot;:&quot;image/webp&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f2a857c9-6111-46f8-a1fc-030e79a1a370_1024x1024.webp&quot;},{&quot;type&quot;:&quot;image/webp&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a8c59927-921d-4a83-8eb3-2570450d0443_1024x1024.webp&quot;}],&quot;caption&quot;:&quot;Left to right: Janus, SD3 Medium, DALL-E 3 (Click an image to view the full-size version.)&quot;,&quot;alt&quot;:&quot;Samurai photos by Janus, SD3 Medium, DALL-E 3&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e183514c-a276-4db9-9860-81402adaed47_1456x474.png&quot;}},&quot;isEditorNode&quot;:true}"></div><blockquote><p><strong>Prompt:</strong> <em>&#8220;cartoon illustration of a blue cat and a green dog wearing party 
hats, sitting on a park bench and looking up at Saturn&#8221;</em></p></blockquote><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/webp&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/58e282ab-c82a-4568-84ff-d1026bd698f7_768x768.webp&quot;},{&quot;type&quot;:&quot;image/webp&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ec64d7c9-8518-43da-8b60-48dfc83cd78f_1024x1024.webp&quot;},{&quot;type&quot;:&quot;image/webp&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8ccbacca-57da-4d4a-904c-4e200fb0c82d_1024x1024.webp&quot;}],&quot;caption&quot;:&quot;Left to right: Janus, SD3 Medium, DALL-E 3 (Click an image to view the full-size version.)&quot;,&quot;alt&quot;:&quot;Cartoon cat and dog by Janus, SD3 Medium, DALL-E 3&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/37adf57f-8f59-4735-b0e1-85f355e3fe12_1456x474.png&quot;}},&quot;isEditorNode&quot;:true}"></div><blockquote><p><strong>Prompt:</strong> <em>&#8220;a cute purple robot holding up a cardboard sign that reads &#8220;I can spell better than you!&#8221;</em></p></blockquote><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/webp&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a69ad5f3-f129-4c1b-a4d2-3123380698f3_768x768.webp&quot;},{&quot;type&quot;:&quot;image/webp&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eddf42c9-50d6-4f00-aceb-0f9314f4d025_1024x1024.webp&quot;},{&quot;type&quot;:&quot;image/webp&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6951f425-0ad7-4027-9943-2f6fa7823a1a_1024x1024.webp&quot;}],&quot;caption&quot;:&quot;Left to 
right: Janus, SD3 Medium, DALL-E 3 (Click an image to view the full-size version.)&quot;,&quot;alt&quot;:&quot;Purple robots with signs by Janus, SD3 Medium, DALL-E 3&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7060044d-108b-4444-8b5a-60e8568f6e17_1456x474.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p>Janus&#8212;how do I put this mildly&#8212;sucks ass.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>If I&#8217;m being generous, I&#8217;d put Janus <a href="https://www.whytryai.com/i/107628584/how-does-v-output-compare-to-the-previous-versions">on par with Midjourney V2</a>. (Reminder: We currently have <a href="https://www.whytryai.com/p/midjourney-version-6">Midjourney V6.1</a>, with Midjourney V7 set to come out soon.)</p><p>Curiously, DeepSeek chose to benchmark against the outdated DALL-E 3 and Stable Diffusion 3 Medium instead of the <a href="https://www.whytryai.com/i/147092340/sunday-bonus-flux-vs-every-other-text-to-image-model">newer, better image models</a>, many of which are also way better at prompt adherence and spelling than DALL-E 3 (<a href="https://www.whytryai.com/p/ai-image-model-spelling-text">see my recent test</a>).</p><h2>Why should you care?</h2><p>DeepSeek released the <a href="https://www.whytryai.com/p/deepseek-r1-free-openai-o1-alternative">truly impressive DeepSeek-R1</a> reasoning model just a week ago. 
Since then, this Chinese AI lab has been the talk of the town, getting much-deserved praise for successfully competing with large, high-profile US labs on a &#8220;shoestring budget.&#8221;</p><p>Not only that, but DeepSeek is also seen as more open and transparent:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4GK1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26fbd608-811f-4921-b0f1-daaa8c0f2c05_590x197.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4GK1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26fbd608-811f-4921-b0f1-daaa8c0f2c05_590x197.png 424w, https://substackcdn.com/image/fetch/$s_!4GK1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26fbd608-811f-4921-b0f1-daaa8c0f2c05_590x197.png 848w, https://substackcdn.com/image/fetch/$s_!4GK1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26fbd608-811f-4921-b0f1-daaa8c0f2c05_590x197.png 1272w, https://substackcdn.com/image/fetch/$s_!4GK1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26fbd608-811f-4921-b0f1-daaa8c0f2c05_590x197.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4GK1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26fbd608-811f-4921-b0f1-daaa8c0f2c05_590x197.png" width="590" height="197" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/26fbd608-811f-4921-b0f1-daaa8c0f2c05_590x197.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:197,&quot;width&quot;:590,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:19084,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4GK1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26fbd608-811f-4921-b0f1-daaa8c0f2c05_590x197.png 424w, https://substackcdn.com/image/fetch/$s_!4GK1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26fbd608-811f-4921-b0f1-daaa8c0f2c05_590x197.png 848w, https://substackcdn.com/image/fetch/$s_!4GK1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26fbd608-811f-4921-b0f1-daaa8c0f2c05_590x197.png 1272w, https://substackcdn.com/image/fetch/$s_!4GK1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F26fbd608-811f-4921-b0f1-daaa8c0f2c05_590x197.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Source: <a href="https://x.com/armankhon/status/1883981149434954169">arman on X</a>.</figcaption></figure></div><p>DeepSeek got so much attention that it skyrocketed to the <a href="https://apps.apple.com/us/charts/iphone/top-free-apps/36">#1 free app spot on the App Store</a>:</p><div class="captioned-image-container"><figure><a class="image-link 
image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4D5n!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc72e46ec-74a7-4b9e-986c-93814cd03933_1017x447.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4D5n!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc72e46ec-74a7-4b9e-986c-93814cd03933_1017x447.png 424w, https://substackcdn.com/image/fetch/$s_!4D5n!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc72e46ec-74a7-4b9e-986c-93814cd03933_1017x447.png 848w, https://substackcdn.com/image/fetch/$s_!4D5n!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc72e46ec-74a7-4b9e-986c-93814cd03933_1017x447.png 1272w, https://substackcdn.com/image/fetch/$s_!4D5n!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc72e46ec-74a7-4b9e-986c-93814cd03933_1017x447.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4D5n!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc72e46ec-74a7-4b9e-986c-93814cd03933_1017x447.png" width="1017" height="447" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c72e46ec-74a7-4b9e-986c-93814cd03933_1017x447.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:447,&quot;width&quot;:1017,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:116989,&quot;alt&quot;:&quot;Top Charts iPhone iPad Top Free Apps DeepSeek - AI Assistant 1  DeepSeek - AI Assistant 
&#26477;&#24030;&#28145;&#24230;&#27714;&#32034;&#20154;&#24037;&#26234;&#33021;&#22522;&#30784;&#25216;&#26415;&#30740;&#31350;&#26377;&#38480;&#20844;&#21496; ChatGPT 2  ChatGPT OpenAI Threads 3  Threads Instagram, Inc. Google Gemini 4  Google Gemini Google TurboTax: File Your Tax Return 5  TurboTax: File Your Tax Return Intuit Inc. Google 6  Google Google&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Top Charts iPhone iPad Top Free Apps DeepSeek - AI Assistant 1  DeepSeek - AI Assistant &#26477;&#24030;&#28145;&#24230;&#27714;&#32034;&#20154;&#24037;&#26234;&#33021;&#22522;&#30784;&#25216;&#26415;&#30740;&#31350;&#26377;&#38480;&#20844;&#21496; ChatGPT 2  ChatGPT OpenAI Threads 3  Threads Instagram, Inc. Google Gemini 4  Google Gemini Google TurboTax: File Your Tax Return 5  TurboTax: File Your Tax Return Intuit Inc. Google 6  Google Google" title="Top Charts iPhone iPad Top Free Apps DeepSeek - AI Assistant 1  DeepSeek - AI Assistant &#26477;&#24030;&#28145;&#24230;&#27714;&#32034;&#20154;&#24037;&#26234;&#33021;&#22522;&#30784;&#25216;&#26415;&#30740;&#31350;&#26377;&#38480;&#20844;&#21496; ChatGPT 2  ChatGPT OpenAI Threads 3  Threads Instagram, Inc. Google Gemini 4  Google Gemini Google TurboTax: File Your Tax Return 5  TurboTax: File Your Tax Return Intuit Inc. 
Google 6  Google Google" srcset="https://substackcdn.com/image/fetch/$s_!4D5n!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc72e46ec-74a7-4b9e-986c-93814cd03933_1017x447.png 424w, https://substackcdn.com/image/fetch/$s_!4D5n!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc72e46ec-74a7-4b9e-986c-93814cd03933_1017x447.png 848w, https://substackcdn.com/image/fetch/$s_!4D5n!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc72e46ec-74a7-4b9e-986c-93814cd03933_1017x447.png 1272w, https://substackcdn.com/image/fetch/$s_!4D5n!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc72e46ec-74a7-4b9e-986c-93814cd03933_1017x447.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">I just tested this. It&#8217;s still #1 at the time of writing.</figcaption></figure></div><p>Everybody loves DeepSeek.</p><p>So much so that we&#8217;re now doing that classic thing where we put a company on a pedestal and automatically give it a free pass.</p><p>Let&#8217;s not do that.</p><p>With DeepSeek feeding so much discourse, we&#8217;ll likely see even more hype around it. Some well-earned, some less so.</p><p>As always, take it all with a grain of salt and test things yourself where possible.</p><p>I certainly can&#8217;t rule out that DeepSeek will eventually release competitive image and even video models. If anything, last week has shown it to be an astoundingly capable AI lab.</p><p>But let&#8217;s not rush to find a new AI darling we can blindly idolize.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption"><strong>Why Try AI</strong> is a reader-supported publication. 
To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>&#129781; Over to you&#8230;</h2><p>What&#8217;s your take? Am I being unreasonably nitpicky? Have you tested Janus and found its multimodality especially helpful for certain use cases?</p><p>Leave a comment or drop me a line at <a href="mailto:whytryai@substack.com">whytryai@substack.com</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/p/deepseek-janus-pro-7b-is-not-better-than-dalle-e3/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/p/deepseek-janus-pro-7b-is-not-better-than-dalle-e3/comments"><span>Leave a comment</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>This <a href="https://www.reddit.com/r/singularity/comments/1ibhd6v/januspro7b_first_tests/">Reddit thread</a> agrees with me.</p></div></div>]]></content:encoded></item><item><title><![CDATA[DeepSeek-R1: The Free o1 Alternative]]></title><description><![CDATA[How to use the new DeepSeek-R1 model and how it compares to o1.]]></description><link>https://www.whytryai.com/p/deepseek-r1-free-openai-o1-alternative</link><guid isPermaLink="false">https://www.whytryai.com/p/deepseek-r1-free-openai-o1-alternative</guid><dc:creator><![CDATA[Daniel Nest]]></dc:creator><pubDate>Tue, 21 Jan 2025 12:03:50 GMT</pubDate><enclosure 
url="https://substack-post-media.s3.amazonaws.com/public/images/75496041-a132-4ef2-af47-2f3de147e409_1344x896.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote><p><em><strong>Hot Takes</strong> are occasional timely posts that focus on fast-moving news and releases, in addition to my regular Thursday and Sunday columns.</em></p><p><em>If <strong>Hot Takes </strong>aren&#8217;t your cup of tea, simply go to your account at <strong><a href="https://www.whytryai.com/account">www.whytryai.com/account</a> </strong>and toggle the &#8220;Notifications&#8221; settings accordingly:</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rr-K!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rr-K!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 424w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 848w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1272w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!rr-K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png" width="745" height="268" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:268,&quot;width&quot;:745,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:20141,&quot;alt&quot;:&quot;Managing Notification settings in Substack - Why Try AI section toggles&quot;,&quot;title&quot;:&quot;Managing Notification settings in Substack - Why Try AI section toggles&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Managing Notification settings in Substack - Why Try AI section toggles" title="Managing Notification settings in Substack - Why Try AI section toggles" srcset="https://substackcdn.com/image/fetch/$s_!rr-K!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 424w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 848w, https://substackcdn.com/image/fetch/$s_!rr-K!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1272w, 
https://substackcdn.com/image/fetch/$s_!rr-K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44f82283-80c1-4310-bb47-4792fa43f9d6_745x268.png 1456w" sizes="100vw" loading="lazy" fetchpriority="high"></picture></div></a></figure></div></blockquote><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/subscribe?"><span>Subscribe 
now</span></a></p><div><hr></div><h2>TL;DR</h2><p>Chinese AI lab DeepSeek just <a href="https://api-docs.deepseek.com/news/news250120">released its newest reasoning model</a>: <strong>DeepSeek-R1.</strong></p><h2>What is it?</h2><p>DeepSeek-R1 is a free, fully open-sourced reasoning model that performs on par with OpenAI&#8217;s o1 across many benchmarks:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Y6PN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0aca0da-03fc-4d78-96ab-2730565ecdf5_1080x789.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Y6PN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0aca0da-03fc-4d78-96ab-2730565ecdf5_1080x789.png 424w, https://substackcdn.com/image/fetch/$s_!Y6PN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0aca0da-03fc-4d78-96ab-2730565ecdf5_1080x789.png 848w, https://substackcdn.com/image/fetch/$s_!Y6PN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0aca0da-03fc-4d78-96ab-2730565ecdf5_1080x789.png 1272w, https://substackcdn.com/image/fetch/$s_!Y6PN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0aca0da-03fc-4d78-96ab-2730565ecdf5_1080x789.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Y6PN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0aca0da-03fc-4d78-96ab-2730565ecdf5_1080x789.png" width="1080" height="789" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c0aca0da-03fc-4d78-96ab-2730565ecdf5_1080x789.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:789,&quot;width&quot;:1080,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;DeepSeek R1 vs. OpenAI's o1 on benchmarks like AIME 2024, MATH-500, and more&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="DeepSeek R1 vs. OpenAI's o1 on benchmarks like AIME 2024, MATH-500, and more" title="DeepSeek R1 vs. OpenAI's o1 on benchmarks like AIME 2024, MATH-500, and more" srcset="https://substackcdn.com/image/fetch/$s_!Y6PN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0aca0da-03fc-4d78-96ab-2730565ecdf5_1080x789.png 424w, https://substackcdn.com/image/fetch/$s_!Y6PN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0aca0da-03fc-4d78-96ab-2730565ecdf5_1080x789.png 848w, https://substackcdn.com/image/fetch/$s_!Y6PN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0aca0da-03fc-4d78-96ab-2730565ecdf5_1080x789.png 1272w, https://substackcdn.com/image/fetch/$s_!Y6PN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0aca0da-03fc-4d78-96ab-2730565ecdf5_1080x789.png 1456w" sizes="100vw"></picture></div></a><figcaption class="image-caption">Source: <strong><a href="https://api-docs.deepseek.com/news/news250120">DeepSeek</a></strong></figcaption></figure></div><h2>How do you use it?</h2><p>If you know what you&#8217;re doing and want to use DeepSeek-R1 for building apps, fine-tuning, etc., you can access it via the <a href="https://platform.deepseek.com/">DeepSeek API</a> or <a href="https://huggingface.co/deepseek-ai/DeepSeek-R1">grab the model on Hugging Face</a>.</p><p>If you&#8217;re a regular user like me, you can chat with DeepSeek-R1 for free at <strong><a href="https://chat.deepseek.com/">chat.deepseek.com</a></strong>.</p><p>Here&#8217;s my quick video walkthrough and a showcase of its capabilities:</p><div class="native-video-embed" data-component-name="VideoPlaceholder" 
data-attrs="{&quot;mediaUploadId&quot;:&quot;e8a5117b-9a1a-4dac-b168-87eda099c7e9&quot;,&quot;duration&quot;:null}"></div><h2>Why should you care?</h2><p>As I see it, the launch of DeepSeek-R1 is a big deal for the average user and also has implications for the industry at large.</p><h3>User-level implications</h3><p>With DeepSeek-R1, everyone now has access to a reasoning model that:</p><ol><li><p><strong>Performs on par with o1 (almost)</strong>: DeepSeek-R1 has similar scores on many benchmarks, although observers like <em>AI Explained</em> suggest <a href="https://youtu.be/59Etzj5gvsE?si=hiR0TEwi3uDtLjCB&amp;t=603">it has its blind spots</a>.</p></li><li><p><strong>Is free to use and cheap to build with</strong>: DeepSeek-R1 is free for regular chats, and its API price per 1M output tokens is <em>roughly 27 times lower</em><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> than o1&#8217;s.</p></li><li><p><strong>Provides better insight into its thinking process</strong>: DeepSeek-R1 offers a more detailed and transparent picture of its inner monologue than the distilled summary from o1 (see the video above). This gives users a better chance to trace the reasoning behind the answer, identify where it goes off the rails, and perhaps steer the model better in subsequent requests.</p></li><li><p><strong>Can render the resulting code</strong>: As I&#8217;ve shown, DeepSeek-R1 lets you test the apps it creates directly in the chat interface (&#224; la <a href="https://www.whytryai.com/p/claude-makes-useful-apps">Claude Artifacts</a>). 
This is helpful for quick back-and-forth iterations.</p></li><li><p><strong>Is fully open-sourced</strong>: Developers can download the model weights, fine-tune them, and distill the model into smaller ones.</p></li></ol><h3>Broader implications</h3><p>DeepSeek has shown that it&#8217;s possible to quickly develop and open-source an o1-level reasoning model<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> while making it less expensive and more transparent than OpenAI&#8217;s proprietary models.</p><p>We&#8217;re only three weeks into January and already have a capable open-source competitor to OpenAI&#8217;s o1. I&#8217;m starting to think <a href="https://www.whytryai.com/i/154384487/reasoning-models-converge-in-performance">this two-week-old prediction of mine</a> might&#8217;ve been too conservative:</p><blockquote><p><em>By the end of 2025, reasoning models from at least three other players will perform on par with or better than OpenAI&#8217;s o3.</em></p></blockquote><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Why Try AI is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>&#129781; Over to you&#8230;</h2><p>What&#8217;s your take? Are we about to see an avalanche of reasoning models from other firms? 
Have I overlooked some important implications?</p><p>If you&#8217;ve had the chance to test DeepSeek-R1 and compare it to o1 for your tasks, I&#8217;d love to hear what you think!</p><p>Leave a comment or drop me a line at <a href="mailto:whytryai@substack.com">whytryai@substack.com</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.whytryai.com/p/deepseek-r1-free-openai-o1-alternative/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.whytryai.com/p/deepseek-r1-free-openai-o1-alternative/comments"><span>Leave a comment</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>DeepSeek-R1 costs $2.19 per million output tokens (<a href="https://api-docs.deepseek.com/quick_start/pricing">here</a>). OpenAI o1 costs $60 per million output tokens (<a href="https://openai.com/api/pricing/#:~:text=a%20new%20window)-,OpenAI%20o1,-o1%20is%20our">here</a>). The ratio is $60 / $2.19 &#8776; 27.4.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>On the other hand, I have yet to see any serious discussion of what these condensed timelines mean for safety testing, alignment, etc.</p></div></div>]]></content:encoded></item></channel></rss>