Feb 11, 2024

PLUS: Stable Video Diffusion 1.1, Hugging Face assistants, MetaVoice-1B, Smaug-72B, BUD-E voice assistant, and background removal by Bria AI.

17 Comments

Andrew Smith

Feb 11, 2024

I just now upgraded to Gemini Advanced! The "dumb" version was already pretty useful. Really looking forward to this.

Where did you guys travel?

Daniel Nest

Feb 11, 2024 (Edited)

Would love to hear your thoughts!

We're in the Czech Republic. Nathan (my oldest) has a hockey tournament here, and then we're sticking around for the winter holidays with my wife's family.

I think I'd really enjoy visiting Prague. I'll let you know when I'm on my way! Thanks for the invite.

Don't mention it!

Can't believe you're going to bring me that autographed Eminem CD!

100%. I never did tell you who autographed it, did I?

I counted on that joke!

It was Vanilla Ice.

Bud-E is definitely interesting - I have seen many attempts at this, so with the massive interest, something like it is bound to happen.

I think Bud-E looks like one of the more "legit" ones. Excellent find, Daniel.

I'd be interested to see a side-by-side comparison of Microsoft's Copilot and Midjourney or DALL-E: take your squirrel example and run it through each. While the results will be interesting, I'd like to know which one works faster/easier with your concept of the Minimum Viable Prompt (great concept btw).

I wonder what people think of using Google's Gemini long term. Unfortunately, hearing that they "killed" Bard and are now doing Gemini, this looks on par with Google's business practices, where they pull the rug out from under large infrastructure-type projects.

You're in luck, I did a side-by-side deep dive into image models last December, here:

https://www.whytryai.com/p/text-to-image-ai-models

(Microsoft Copilot uses DALL-E 3, so it's the same as ChatGPT Plus)

Of course, Imagen 2 and Midjourney V6 came out after that.

The short answer is that DALL-E 3 and Midjourney V6 are the best for the "minimum viable prompt" approach, as their prompt understanding and adherence are especially strong. But most models are solid these days.

Also, the change from Bard to Gemini is purely a branding exercise. They simply renamed the search chatbot from Bard to Gemini. Bard already used Gemini Pro under the hood since late December, and I guess it just made sense for them to keep the "Gemini" umbrella as their guiding star for the future.

Microsoft TV ads used to always be comically terrible, especially compared to Apple's.

That Super Bowl ad is pretty good.

Yeah it has a sufficiently epic feel to it!

Good examples of what it can help with as well.

Indeed. Lots of info in under a minute.

Comment deleted

Yeah digital avatars have been around for a while (Synthesia, HeyGen, etc.) - but like you, I'm not sure how well they handle realtime interactions with low latency.

For full immersion, I'd like to see a voice with more inflection and maybe even some filler words like uhm, etc. The video demo voice sounds a bit too monotonous.

Why Try AI

10X AI (Issue #38): Gemini Ultra, Redesigned…