Discussion about this post

User's avatar
Andrew Smith's avatar

Great summary. I remember how bad voice-to-text was around 2010 and how useless I felt the tech was back then. Fast forward to today and I will frequently use Jippity Voice as a sounding board for thinking out loud, or for taking notes, or for creating an ultra-fast outline for something I want to write. It's silly useful for all those use cases, and prior to 2024 or so, I had a much harder time getting what was in my brain out and into the wider world. Voice helps so much.

Lately, Jippity will allow you to see the text and images on the screen while using Voice from your phone. I have gotten so used to just using it while I walk or wash dishes or whatever, that I haven't really taken advantage of this new form of computing yet.

Andrew Sniderman 🕷️'s avatar

My wife is peak pragmatist - super efficient - you won’t catch her playing around with tech for kicks; she don’t care. So when she commented that Siri was all of a sudden sounding like a real person and now she’s talking to her phone I thought ahha, progress.

The glue for a lot of this - and the breakthrough that still amazes me - is Natural Language Processing which has pretty much been subsumed by LLMs now. There’s so much nuance to language and even more so spoken language that taking the basics like speech-to-text and actually *understanding* the intent is magic along the same lines as neural nets.

Awhile back I wrote a bit about the evolution of voice recognition and just dipped my toe into the NLP waters, perhaps interesting if you want a little glimpse of how we got here https://newsletter.wirepine.com/p/talk-to-the-wizard

7 more comments...

No posts

Ready for more?