Why Try AI

Why Try AI

Hot Takes

HeyGen Avatar IV: Deepfakes Are Point-and-Click Now.

All you need is one image. Is this a problem?

Daniel Nest's avatar
Daniel Nest
May 08, 2025
∙ Paid

TL;DR

HeyGen’s Avatar IV creates lifelike, expressive talking avatars from a single image that can lip-sync to any audio or script—are deepfakes too easy now?

What is it?

Avatar IV is a new AI avatar model from HeyGen:

 NEW: HeyGen Avatar IV is here.  Our most advanced AI avatar model yet.  📸 One photo. 📝 One script. 🎧 Just your voice.  Most avatars sync to your words. Avatar IV interprets them.  Built on a diffusion-inspired audio-to-expression engine, it analyzes your vocal tone, rhythm, and emotion — then synthesizes photoreal facial motion with temporal realism.  🎭 Head tilts. Pauses. Cadences. Micro-expressions.  ➡️ A single image → a video that feels real, not rendered.  Rolling out to all users now.
Source: X

Not so long ago, creating a custom avatar with HeyGen or Synthesia required you to record a training video and submit it for professional processing.

Now, it takes one image, one script (written or recorded), and a few minutes. That’s it:

This post is for paid subscribers

Already a paid subscriber? Sign in
© 2025 Daniel Nest
Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture