Sunday Rundown #97: Open Source & Meta Torrance
Sunday Bonus #57: Recording of my live Q&A about AI agents.
Happy Sunday, friends!
Welcome back to the weekly look at generative AI that covers the following:
Sunday Rundown (free): this week’s AI news + a fun AI fail.
Sunday Bonus (paid): an exclusive segment for my paid subscribers.
Let’s get to it.
🗞️ AI news
Here are this week’s AI developments.
👩‍💻 AI releases
New stuff you can try right now:
Alibaba open-sourced Qwen3, a family of hybrid thinking models competitive with other top reasoning models on the market.
Anthropic launched Integrations that let Claude connect to third-party tools via remote MCP servers (in beta for Max, Team, and Enterprise).
Freepik open-sourced F-Lite, a 10B parameter image model trained on fully licensed data and ready for commercial use. (Try it here.)
Google news:
AI Mode now comes with visual cards and saved threads. It’s moving out of the Labs environment and into early access for a subset of regular US users.
NotebookLM can now generate its popular “Audio Overview” podcasts in over 50 languages.
Gemini native image generation is rolling out to gemini.google.com, letting you upload and edit images using chat commands.
Little Language Lessons is a new Google Labs experiment that can create situation-specific language lessons on the fly at your request. (Try it here.)
Higgsfield AI launched Iconic Scenes that let you insert yourself into legendary movie moments.
Ideogram upgraded 3.0 with better realism, improved prompt following, and editing tools like Magic Fill and Extend.
Kling AI introduced Instant Film Effect, which can turn your photos into animated memories.
KREA AI news:
The upgraded Enhancer tool can sharpen and upscale images up to 22K resolution (in partnership with Topaz Labs).
The new GPT Paint feature lets you visually prompt GPT-4o to make edits.
Meta launched a standalone Meta AI app powered by Llama 4. (Get it here.)
Microsoft released the Phi-4 family of small reasoning models, which do well against competitors from DeepSeek and OpenAI and can run locally on Windows.
Midjourney news:
The company upgraded the V7 model, introduced a lightbox editor, and added a new --exp parameter to adjust “details and creativity.”
Omni-Reference is a new way to drop characters or objects into your images in combination with text prompts.
NVIDIA released a 3D Guided Generative AI Blueprint that turns Blender scenes into AI images, powered by the FLUX.1-dev model from Black Forest Labs.
OpenAI upgraded ChatGPT Search with a better shopping experience, live queries in WhatsApp, and improved citations.
Runway rolled out Gen-4 References that let you use reference images to create consistent characters and settings. (Only for paid accounts.)
Suno dropped v4.5 for Pro & Premier users with richer vocals, better prompt adherence, and many other upgrades.
Vercept launched Vy, an AI model that can navigate your screen and run apps on your computer, beating Google, OpenAI, and Anthropic on UI benchmarks.
🔬 AI research
Cool stuff you might get to try one day:
Apple is said to be working on an AI coding platform in partnership with Anthropic.
xAI is expected to roll out Grok 3.5 next week, which can reportedly answer complex questions by reasoning from first principles.
📖 AI resources
Helpful AI tools and stuff that teaches you about AI:
“Anthropic Economic Index: AI’s Impact on Software Development” [STUDY] - research by Anthropic based on 500K Claude interactions.
🔀 AI random
Other notable AI stories of the week:
OpenAI rolled back a GPT-4o update that made ChatGPT over-the-top sycophantic.
📝 Suddenly, a surprise survey spawns…
Please help make Why Try AI better. Let me know what works and what doesn’t:
🤦‍♂️ AI fail of the week
All count and no play make Llama go cuckoo.
💰 Sunday Bonus #57: Proxy & Genspark Super Agent—live Q&A
In this recording (+transcript) of Thursday’s live session, I demo two web agents: Proxy and Genspark Super Agent—walking through real-world use cases and testing their capabilities.
What’s inside:
🔍 Key similarities and differences between Proxy and Genspark
🧩 What each of them is best-suited for (with examples)
⚠️ Their practical limitations—and a few workarounds
🙋‍♀️ Live audience Q&A
If you’re curious about what these agents can and can’t do, this one’s for you.