OpenAI’s Voice Engine is the AI company’s venture in synthetic voice technology, capable of using just a 15-second audio sample to create voices that sound almost indistinguishably like the original speaker.
“It is notable that a small model with a single 15-second sample can create emotive and realistic voices,” OpenAI said on a blog post, speaking to the model’s abilities to produce different natural-sounding and emotionally resonant voices.
The development of OpenAI’s text-to-speech applications, including ChatGPT Voice and Read Aloud, are taken to the next level with this tool. Its offering users a whole amount of voices that can cater to diverse needs and preferences.
Voice Engine’s Key Features
- The Voice Engine generates natural-sounding speech from just a 15-second audio sample, showcasing the model’s efficiency.
- It is employed in OpenAI’s text-to-speech API, enhancing ChatGPT Voice and Read Aloud with a variety of preset voices, thus expanding the accessibility and usability of these applications.
- OpenAI’s caution towards the full release of Voice Engine is the company’s approach to ethical AI development.
“We hope to start a dialogue on the responsible deployment of synthetic voices, and how society can adapt to these new capabilities,” OpenAI expressed.
More from News
- Trump Lifts Sanctions in Syria: What Does This Mean For Syrian Businesses?
- Retail Cyber Attacks: Cartier And North Face Are The Next Retailers Affected
- A Look At The Different Technologies Volvo Is Bringing To Its Cars
- Klarna Launches Debit Card To Diversify Away From BNPL
- T-Mobile Now Has Fibre Internet Plans Available For Homes
- Bitdefender Finds 84% of Attacks Use Built In Windows Tools, Here’s How
- Japan Starts Clinical Trials For Artificial Blood Which Is Compatible With All Blood Types
- UK Unicorn Monzo Breaks £1 Billion in Revenue
Who Is Using Voice Engine?
Voice Engine’s ability to work across different industry is shown through its various uses, from educational aids to healthcare support.
Education
Age of Learning uses Voice Engine to generate natural-sounding, emotive voices, which really enhances the academic success of children and non-readers.
The technology facilitates the creation of pre-scripted voice-over content and enables real-time, personalised interactions with students, which in turn diversifies the scope of educational content and makes learning more accessible.
Healthcare
In healthcare, applications like those by Dimagi use Voice Engine to assist community health workers by providing essential services in the local languages of remote areas.
Similarly, Livox uses this tech to aid non-verbal individuals, offering them distinctive and non-robotic voices across multiple languages, which improves their ability to communicate.
How Is Voice Engine Kept Safe?
There are some dangers that could come with unsafe use of synthetic voice tech, so OpenAI has come up with a set of safety measures. These include usage policies that make sure there is no impersonation and the requirement for informed consent from the individuals whose voices are being replicated.
Also, this tech actually incorporates watermarking and proactive monitoring strategies to make sure there is ethical use as well as to be able to trace the origins of AI-generated audio content.
OpenAI’s Safety Protocols For Voice Engine
- The prohibition of impersonating another individual or organisation without explicit consent, ensuring ethical use of the technology.
- A requirement for explicit and informed consent from the original speaker before their voice is replicated, safeguarding personal rights and autonomy.
- The implementation of watermarking and proactive monitoring to trace and control the usage of synthetic voices, enhancing the security and integrity of audio content generated by Voice Engine.
- OpenAI’s engagement with a broad spectrum of stakeholders, from governmental agencies to civil societies could mean that the company is taking ethical AI development seriously. This is so that there is feedback and guidance for a safe and socially acceptable tool.