Microsoft can simulate your voice

@ 2023/01/11
All it needs is a three-second audio sample

Software King of the World Microsoft has announced a new text-to-speech AI model called VALL-E that can closely simulate a person’s voice when given a three-second audio sample.

Once it has heard you speak for three seconds, VALL-E can synthesise audio of you saying anything and mimic your emotional tone.

Volish Boffins claim that VALL-E could be used for high-quality text-to-speech applications, speech editing where a recording of a person could be edited and changed from a text transcript (making them say something they never did), and audio content creation when combined with other generative AI models like GPT-3.

While the tech behind the idea is interesting, it does not seem that anyone has thought – this is an idiotic idea which could be used to do no good.

No comments available.