Why Try AI

Sunday Rundown #100: Conference Craze & Tralalero Tralala

Sunday Bonus #60: Swipe file with 40+ Claude 4 use cases.

Daniel Nest
Jun 01, 2025
∙ Paid

Happy Sunday, friends!

Welcome back to the weekly look at generative AI that covers the following:

  • Sunday Rundown (free): this week’s AI news + a fun AI fail.

  • Sunday Bonus (paid): an exclusive segment for my paid subscribers.

Every Sunday Bonus in one place

Let’s get to it.

🗞️ AI news

Here are the biggest developments of the past two weeks.

We had three major conferences last week:

  • Google I/O 2025

  • Microsoft Build 2025

  • Computex 2025

So we’ve got a lot of stuff to catch up on!

👩‍💻 AI releases

New stuff you can try right now:

  1. Amazon launched AI-powered search in Amazon Music to improve artist and song discovery. (In beta for Amazon Music Unlimited subscribers in the US.)

  2. Anthropic news:

    1. The new Claude 4 family comes with long-term memory, tool use, and better coding abilities. (Try Claude Sonnet 4 for free.)

    2. Voice mode is out in beta, letting you finally talk to Claude in real time.

  3. Black Forest Labs launched FLUX.1 Kontext, a multimodal model capable of targeted, in-context image editing based on text or image input.

  4. DeepSeek released an updated R1-0528 model with better reasoning, fewer hallucinations, JSON function calling, and more. (Try for free.)

  5. ElevenLabs launched Multimodal Conversational AI that processes speech and text inputs at the same time, allowing for more flexible and efficient interactions.

  6. Genspark now gives unlimited access to AI Chat for Plus and Pro plans, so users can chat with nine top-tier models on one platform.

  7. GitHub launched a new agent for GitHub Copilot that can autonomously fix bugs, add features, refactor code, and more.

  8. Google announced major updates during the Google I/O 2025 Conference:

    1. Flow is a new film-making platform powered by Google’s state-of-the-art Veo 3 video model (see below).

    2. Gemini 2.5 Flash got better, and Gemini 2.5 Pro got a “Deep Think” mode for complex reasoning. Both models now also come with native audio output.

    3. Gemma 3n is a mobile-first multimodal language model that can run locally on just 2–3GB RAM.

    4. Imagen 4 is a top-tier text-to-image model with high-quality visuals, sharp details, and better-spelled text. (Try it for free on gemini.google.com.)

    5. Jules is an autonomous coding agent that fixes bugs, writes tests, and builds features directly in your GitHub workflow.

    6. Lyria 2 (music model) is now available in more places to more creators, including as a Lyria RealTime version for live jamming in Google AI Studio.

    7. NotebookLM is now available as a mobile app that brings many of its best features directly to people’s phones.

    8. Photos now features a redesigned editor that uses AI to suggest and make tweaks to your pictures.

    9. Search is getting AI-powered improvements across Google products, including deeper answers, live search, smart shopping, and more.

    10. Veo 3 is a paradigm-shifting AI video model that natively incorporates sound, speech, and music into video clips from simple text prompts.

    11. Workspace is also getting AI-powered upgrades like smart email replies, automatic speech translation in Google Meet, and more.

  9. Hume AI rolled out EVI 3, a voice model that outperforms GPT-4o in empathy, expressiveness, response speed, and other parameters in blind testing.

  10. Kling launched an upgraded 2.1 family of video models with 1080p output and better prompt adherence.

  11. Manus now has a Slides tool that builds entire presentations from a simple prompt and lets you edit them directly on the fly.

  12. Microsoft also had many announcements during Microsoft Build 2025:

    1. Copilot Wave 2 brings smarter search, specialized agents, a new Copilot Create experience, and much more.

    2. Notepad, Paint, and Snipping Tool are getting AI-powered enhancements like custom stickers, smart screenshots, and more.

    3. Windows is getting dozens of developer-focused AI improvements, including tools for local inference, model fine-tuning, agent integration, and more.

  13. NVIDIA released Llama Nemotron Nano 4B, an open reasoning model for edge devices. See also: CEO Jensen Huang’s keynote at Computex 2025.

  14. Perplexity introduced a new Pro feature called Labs that helps you build reports, dashboards, and mini apps in a single workspace.

  15. Salesforce launched Agentforce, bringing agentic AI teammates to applications like Slack to handle support, onboarding, CRM updates, and more.

  16. Stability AI upgraded Stable Video 4D to version 2.0, which can generate sharp multi-angle 4D video from a single input video.

  17. Tencent open-sourced HunyuanVideo-Avatar, which can animate photos from speech or audio input. (Much like HeyGen Avatar IV.)

  18. xAI gave Grok the ability to create charts from live data on the fly.


🔬 AI research

Cool stuff you might get to try one day:

  1. Google also has a lot of stuff in the pipeline:

    1. Gemini Diffusion is an experimental model that generates text from noise significantly faster than the fastest conventional models. (Waitlist here.)

    2. Google Beam is an AI-powered platform that transforms regular 2D video streams into lifelike 3D visual experiences.

    3. LightLab is a diffusion-based tool that lets users retroactively adjust lighting sources and conditions in existing images with realistic results.

    4. NotebookLM will be getting a Video Overviews feature that transforms information into visual slide decks with voiceover.

    5. SignGemma is a multilingual model that can translate live sign language into English text.

  2. Opera teased Neon, an AI-powered browser built for the "agentic web" that can independently browse and take action on your behalf. (Sign up for the waitlist.)


🔀 AI random

Other notable AI stories of the week:

  1. Google launched a SynthID Detector portal that can effectively identify content made with Google’s generative AI across text, audio, images, and video.

🤦‍♂️ AI fail of the week

The two separate character references I gave to Google Whisk…

[Image: two characters in Google Whisk, a T-rex and a blue fish]

…the Italian brainrot Whisk cobbled together:

[Image: a blue fish fused with a T-rex, wearing cowboy boots in a black-and-white cow pattern, running on the ocean floor]

Send me your AI fail for a chance to be featured in an upcoming Sunday Rundown.



💰 Sunday Bonus #60: 40+ use cases for Claude 4 (swipe file)

My two previous swipe files were quite popular:

  • 90+ use cases for GPT-4o image generation

  • 45+ use cases for OpenAI o3

Now that the new Claude 4 is topping the WebDev arena leaderboard and other coding benchmarks, I figured it’d help to make a swipe file of use cases for its best-in-class coding abilities. (Remember, the smaller Claude Sonnet 4 is free for everyone.)

I’m not a pro coder myself, so I partnered with OpenAI o3 to identify the use cases, categorize them, create examples, and put the swipe file together.

We ended up with over 40 use cases in total:

Claude 4 Swipe File

You can filter by category, search by keyword, and one-click copy starter prompts to test them with Claude 4.

I hope this gives you a bit of inspiration for your own needs!
