Thanks to Google and artificial intelligence (AI) research company, DeepMind, your phone will no longer sound like a robot when reading out or dictating requested information. Google Assistant is using an improved version of DeepMind’s WaveNet, a deep neural network that can synthesize realistic human speech.
WaveNet uses an improved system of speech synthesis or text-to-speech (TTS). TTS uses two techniques, concatenative and parametric TTS. In order to closely mimic human speech, concatenative TTS juxtaposes different parts of a voice actor’s recordings to construct the desired sentence. Upgrading concatenative TTS is cumbersome as it involves replacing the audio libraries. Parametric TTS generates computer-generated speech that tends to sound robotic and artificial.
How Does WaveNet Work?
Unlike these two TTS systems, WaveNet uses a system developed from a convolutional neural network to produce waveforms from scratch. Speech samples are used to train the platform to synthesize voices. The system determines which waveforms sound like people and which do not. This provides the speech synthesizer with the ability to mimic human intonations such as lip smacks. The system is even capable of coming up with its own accent based on the given samples.
In earlier years, amount of computing power needed to generate the audio was a severe limitation for WaveNet. It used to take at least one second to produce .02 seconds of audio. DeepMind’s engineers fixed the problem, and the system was able to produce a one-second-long waveform in 50 milliseconds. The sample’s resolution has doubled from 8 to 16 bits. This directly translates into audio that score much higher in human listening tests.
The improvements enable system integration into Google Assistant and other consumer products. As of today, Google Assistant can produce Japanese and U.S. English voices. Eventually, Google can use WaveNet to synthesize speeches for other dialects and languages. Eventually, computer-generated speech will sound more like humans, getting it correct right down to the peculiar regional accent.
Achieve IoT Success
We understand the challenge of transforming an organization to embrace the Internet of Things. Let us help you increase your probability of success.
Contact Amyx+ for a free initial consultation.
About Amyx+ IoT Business Transformation | Strategy | Innovation | Product | Data Analytics
- Voted Top IoT Influencer by Skyhook
- Voted Top IoT Rockstar by HP Enterprise
- Voted Top IoT Influencer by Inc. Magazine
- Voted Top in the Business of IoT by Relayr
- Voted Top Global IoT Expert by Postscapes
- Voted Top IoT Authority by the Internet of Things Institute
- Featured as a Top Internet of Things Company by Postscapes
- Voted Most Influential in Smart Cities and IIoT by Right Relevance
- Winner of the Cloud & DevOps World Award for Most Innovative Vendor
Amyx+ is an award-winning IoT business transformation firm specializing in IoT strategy, innovation & product development. As a thought leader in the Internet of Things, Amyx+ has the creative horsepower and the development prowess to execute even the most complex client engagements. Amyx+ is working with international and multinational enterprises to help 1) understand the impact of IoT disruptions, 2) formulate and sharpen their IoT strategy, 3) quantify the business case, 4) experiment, learn, validate, 5) develop game changing technologies, and 6) launch innovative IoT products and services worldwide. We employ a flexible methodology and approach to fit the client and needs & objectives while adapting to changing IoT environments. We have presence in San Francisco, NYC, and throughout Europe.