Microsoft can simulate your voice

@ 2023/01/11

All it needs is a three-second audio sample

Software King of the World Microsoft has announced a new text-to-speech AI model called VALL-E that can closely simulate a person’s voice when given a three-second audio sample.

Once it has heard you speak for three seconds, VALL-E can synthesise audio of you saying anything and mimic your emotional tone.

Volish boffins claim that VALL-E could be used for high-quality text-to-speech applications, for speech editing, where a recording of a person is altered by editing its text transcript (making them say something they never did), and for audio content creation when combined with other generative AI models such as GPT-3.

While the tech behind the idea is interesting, it does not seem that anyone stopped to think that this is an idiotic idea which could be used to do no good.

All information and graphics contained in Madshrimps are the sole property of the Madshrimps crew and may not be reproduced or copied in any manner without written permission from us.