648: VALL-E: Uncannily Realistic Voice Imitation from a 3-Second Clip
Super Data Science: ML & AI Podcast with Jon Krohn

2023-01-27

Text-to-speech gets a groundbreaking update with Microsoft’s VALL-E. On this Five-Minute Friday, Jon Krohn investigates how the Microsoft team modeled their tool to replicate natural human speech using just three seconds of a person’s voice.

Additional materials: www.superdatascience.com/648


Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
