Meta's Voicebox AI is a Dall-E for text-to-speech
Today, we are one step closer to the immortal celebrity future we have long been promised. Meta has unveiled Voicebox, its generative text-to-speech model that promises to do for the spoken word what ChatGPT and Dall-E, respectively, did for text and image generation.
Essentially, it's a text-to-output generator just like GPT or Dall-E, except that instead of creating prose or pretty pictures, it spits out audio clips. Meta defines the system as “a non-autoregressive flow-matching model trained to infill speech, given audio context and text.” It’s been trained on more than 50,000 hours of unfiltered audio.
That diverse data set allows the system to generate more conversational-sounding speech, regardless of the languages spoken by each party, according to the researchers. “Our results show that speech recognition models trained on Voicebox-generated synthetic speech perform almost as well as models trained on real speech.” What’s more, the computer-generated speech performed with just a 1 percent error rate degradation, compared to the 45 to 70 percent drop-off seen with existing TTS models.
The system was first taught to predict speech segments based on the segments around them as well as the passage’s transcript. “Having learned to infill speech from context, the model can then apply this across speech generation tasks, including generating portions in the middle of an audio recording without having to recreate the entire input,” the Meta researchers explained.