Technology
AI speech generator 'reaches human parity' — but it's too dangerous to release, scientists say
Microsoft has developed a new artificial intelligence (AI) speech generator that is apparently so convincing it cannot be released to the public.
VALL-E 2 is a text-to-speech (TTS) generator that can reproduce the voice of a human speaker using just a few seconds of audio.
Microsoft researchers said VALL-E 2 was capable of generating "accurate, natural speech in the exact voice of the original speaker, comparable to human performance," in a paper that appeared June 17 on the pre-print server arXiv. In other words, the new AI voice generator is convincing enough to be mistaken for a real person — at least, according to its creators.
"VALL-E 2 is the latest advancement in neural codec language models that marks a milestone in zero-shot text-to-speech synthesis (TTS), achieving human parity for the first time," the researchers wrote in the paper. "Moreover, VALL-E 2 consistently synthesizes high-quality speech, even for sentences that are traditionally challenging due to their complexity or repetitive phrases."
Related: New AI algorithm flags deepfakes with 98% accuracy — better than any other tool out there right now
Human parity in this context means that speech generated by VALL-E 2 matched or exceeded the quality of human speech in benchmarks used by Microsoft.
The AI engine is capable of this given the inclusion of two key features: "Repetition Aware Sampling" and "Grouped Code Modeling."
-
Technology20m ago
Here’s What AT&T Customers Impacted By the Major Data Security Breach Should Do Now
-
Technology2h ago
Meta to roll out ‘favorites’ feature for seamless WhatsApp filtering
-
Technology6h ago
AI mass surveillance at Paris Olympics – a legal scholar on the security boon and privacy nightmare
-
Technology6h ago
The Large Hadron Collider gets reset and refreshed each year – a CERN physicist explains how the team uses subatomic splashes to restart the experiments
-
Technology8h ago
SpaceX rocket failure leads to 20 Starlink satellites falling from sky
-
Technology8h ago
Trump supports TikTok as US ban looms over ByteDance divestment
-
Technology11h ago
What We Know About the New U.K. Government’s Approach to AI
-
Technology12h ago
3Investigating the Impact of Hollywood’s Most Successful Actors on the Music Industry