Site icon TechRound

OpenAI’s Voice Engine Will Be Able To Mimic Human Voices

OpenAI’s Voice Engine is the AI company’s venture in synthetic voice technology, capable of using just a 15-second audio sample to create voices that sound almost indistinguishably like the original speaker.

“It is notable that a small model with a single 15-second sample can create emotive and realistic voices,” OpenAI said on a blog post, speaking to the model’s abilities to produce different natural-sounding and emotionally resonant voices.

The development of OpenAI’s text-to-speech applications, including ChatGPT Voice and Read Aloud, are taken to the next level with this tool. Its offering users a whole amount of voices that can cater to diverse needs and preferences.

 

Voice Engine’s Key Features

 

“We hope to start a dialogue on the responsible deployment of synthetic voices, and how society can adapt to these new capabilities,” OpenAI expressed.

 

 

Who Is Using Voice Engine?

 

Voice Engine’s ability to work across different industry is shown through its various uses, from educational aids to healthcare support.

 

Education

 

Age of Learning uses Voice Engine to generate natural-sounding, emotive voices, which really enhances the academic success of children and non-readers.

The technology facilitates the creation of pre-scripted voice-over content and enables real-time, personalised interactions with students, which in turn diversifies the scope of educational content and makes learning more accessible.

 

Healthcare

 

In healthcare, applications like those by Dimagi use Voice Engine to assist community health workers by providing essential services in the local languages of remote areas.

Similarly, Livox uses this tech to aid non-verbal individuals, offering them distinctive and non-robotic voices across multiple languages, which improves their ability to communicate.

 

How Is Voice Engine Kept Safe?

 

There are some dangers that could come with unsafe use of synthetic voice tech, so OpenAI has come up with a set of safety measures. These include usage policies that make sure there is no impersonation and the requirement for informed consent from the individuals whose voices are being replicated.

Also, this tech actually incorporates watermarking and proactive monitoring strategies to make sure there is ethical use as well as to be able to trace the origins of AI-generated audio content.

 

OpenAI’s Safety Protocols For Voice Engine

 

Exit mobile version