03. January 2018

speech to text language translation software free download for windows 7 ultimate edition 32

Software Translates Your Voice into Another Language

Research software from Microsoft synthesizes speech in a foreign language, but in a voice that sounds like yours.

by Tom Simonite
March 9, 2012

Researchers at Microsoft have made software that can learn the sound of your voice, and then use it to speak a language that you donвЂ™t. The system could be used to make language tutoring software more personal, or to make tools for travelers.

In a demonstration at MicrosoftвЂ™s Redmond, Washington, campus on Tuesday, Microsoft research scientist Frank Soong showed how his software could read out text in Spanish using the voice of his boss, Rick Rashid, who leads MicrosoftвЂ™s research efforts. In a second demonstration, Soong used his software to grant Craig Mundie, MicrosoftвЂ™s chief research and strategy officer, the ability to speak Mandarin.

Hear Rick RashidвЂ™s voice in his native language and then translated into several other languages:

In English, a synthetic version of MundieвЂ™s voice welcomed the audience to an open day held by Microsoft Research, concluding, вЂњWith the help of this system, now I can speak Mandarin.вЂќ The phrase was repeated in Mandarin Chinese, in what was still recognizably MundieвЂ™s voice.

вЂњWe will be able to do quite a few scenario applications,вЂќ said Soong, who created the system with colleagues at Microsoft Research Asia, the companyвЂ™s second-largest research lab, in Beijing, China.

вЂњFor a monolingual speaker traveling in a foreign country, weвЂ™ll do speech recognition followed by translation, followed by the final text to speech output [in] a different language, but still in his own voice,вЂќ said Soong.

The new technique could also be used to help students learn a language, said Soong. Providing sample foreign phrases in a personвЂ™s own voice could be encouraging, or easier to imitate. Soong also showed how his new system could improve a navigational directions phone app, allowing a stock synthetic English voice to seamlessly read out text written on Chinese road signs as it relayed instructions for a route in Beijing.

The system needs around an hour of training to develop a model able to read out any text in a personвЂ™s own voice. That model is converted into one able to read out text in another language by comparing it with a stock text-to-speech model for the target language. Individual sounds used by the first model to build up words using a personвЂ™s voice in his or her own language are carefully tweaked to give the new text-to-speech model a full ability to sound out phrases in the second language.

Soong says that this approach can convert between any pair of 26 languages, including Mandarin Chinese, Spanish, and Italian.

Preserving a personвЂ™s voice when synthesizing speech for them in another language would likely be reassuring to a user, and could make interactions reliant on translation software more meaningful, says Shrikanth Narayanan, a professor at the University of Southern California, in Los Angeles, leads a research group working on systems to translate speech in situations such as doctor-patient consultations.

вЂњThe word is just one part of what a person is saying,вЂќ he says, and to truly convey all the information in a personвЂ™s speech, translation systems will need to be able to preserve voices and much more. вЂњPreserving voice, preserving intonation, those things matter, and this project clearly knows that,вЂќ says Narayanan. вЂњOur systems need to capture the expression a person is trying to convey, who they are, and how theyвЂ™re saying it.вЂќ

His research group is investigating how features such as emphasis, intonation, and the way people use pauses or hesitation affects the effectiveness and perceived quality of a word-for-word translation. вЂњWeвЂ™re asking if you can build systems that can mediate between people as well as just replacing the words,вЂќ he says. вЂњI view this [Microsoft research] as a part of how you make this happen.вЂќ

Become an MIT Technology Review Insider for in-depth analysis and unparalleled perspective.

Tagged

Tom Simonite San Francisco Bureau Chief

IвЂ™m MIT Technology ReviewвЂ™s San Francisco bureau chief and enjoy a diverse diet of algorithms, Internet, and human-computer interaction with chips on the side. I lead our coverage of new ideas from Silicon Valley, whether they spring from tech … More giants, new startups, or academic labs.

speech to text language translation software free download for windows 7 ultimate edition 32

Software Translates Your Voice into Another Language

Research software from Microsoft synthesizes speech in a foreign language, but in a voice that sounds like yours.

Tagged

Related Video

Love at First Bite Bakery