Advertisement
Promo

Mobile devices Toolkit

IBM working on a hands-free translator

Lisa M Bowman CNET News

Published: 25 Apr 2003 08:58 BST

  • Email
  • Trackback
  • Clip Link
  • Print friendly
  • Post Comment

Imagine you're in a foreign country where you don't speak the language, and you need to decipher a confusing train schedule in a hurry. Wouldn't it be handy to be able to talk into a device, asking questions about departures and ticket prices, and have your queries translated into spoken word in the native language of train officials?

IBM is working on software that would bridge the spoken language gap for weary travellers and others who might need a personal translator in their pocket.

Researchers at IBM are developing and testing translation software that would enable two people speaking different languages to communicate without either having to type.

"When you go to a new country and you want to deal with all of the issues, this will be very handy," said David Nahamoo, department group manager for human language technologies at IBM Research.

Although several companies, including IBM, produce software that provides text-to-speech translation, so-called speech-to-speech translation products remain on the horizon.

The prototype of the IBM software, dubbed Multilingual Automatic Speech-to-Speech Technology, or "MASTOR," actually does have a text component. Two people speak into microphones connected to a computing gadget. The first person might, for instance, say, "Hi, my name is David" in English. The gadget then converts the speech to text, displays it in English, translates it, displays the translation alongside the original English text, and then speaks the translated version.

If, for example, "David" were trying to chat with a native of Mexico City, the computer would display the text versions and say, "Hola, me llamo David." The other person could then reply in Spanish and have her response translated into English.

The research builds on products IBM has already introduced to the market. Last September, the company unveiled ViaVoice Translator, software that lets people type in a phrase in one language and hear it in another.

Speech-to-speech technology is a particularly tricky endeavour for technologists and linguists, partly because it incorporates so many complex functions. For example, MASTOR includes speech-recognition software to capture the original spoken phrase, translation software to transform it into Spanish, and text-to-speech software so that the computer can say the words aloud.

"The technology needs work in all of these areas," Nahamoo said.

Nahamoo thinks speech-to-speech research projects could drive improvement in the areas of speech recognition and translation software because a glitch in any component of a speech-to-speech system would make other parts virtually impossible to use. What's more, even the best translation and speech software is susceptible to amusing and embarrassing errors as it tries to account for the different speakers' accents, slang and cadence.

Another feature of MASTOR will be its use of the notion of "meaning" in order to translate. Under the theory, different phrases that mean generally the same thing would be translated the same way. For example, a person could say "I'm injured and I need a doctor" or "Can you find me a doctor?" and both would be translated into an identical spoken phrase that would convey the need for medical help.

Using meaning in the translation process is less database intensive than translating more precisely, researchers said. That means the technology could be used for handhelds and other small portable devices that don't have as much memory as a desktop computer.

The researchers said the most immediate application of the software would be for personal or business travel or in health care settings like emergency rooms, where people who don't speak the local language might need to communicate information about their injury or medical history. However, the scientists declined to speculate about when the technology would appear in actual products.

Researchers also envision the speech technology being used to translate newscasts, enabling a media outlet to provide, for example, nearly real-time reports in multiple languages from an event such as the US Open. Researchers also say companies could use the technology for conference calls or meetings, although it still needs a lot of refinement before it's ready for such uses.


See the Software News Section for the latest headlines on everything from peer to peer clients to Office software and beyond.

Let the editors know what you think in the Mailroom.

  • Email
  • Trackback
  • Clip Link
  • Print friendlyPrint with EPSON

Did you find this article useful?
29 out of 91 people found this useful


Full Talkback thread

0 comments

Company/Topic Alerts

Create a new alert from the list below:












Video icon

Video

Enterprise Smartphones Special Report Special Report

Nokia E63

Nokia E63

Review Although it's missing some features (chiefly HSDPA and GPS), Nokia's E63 is a well-thought-out, ergonomic and affordable smartphone.

More Special Reports

On The Road Blog

On the Saving Edge: New Tech in Disast...

By Matthew Cordell A new report commissioned by the UN Foundation and Vodafone Foundation has found the intersection between two incredible trends -- the significant uptick in disasters... More

Post a comment

Tinsel on the TARDIS

There were shepherds on the hill, and the Doctor popped his head out of the TARDIS and said "you might want to see this" and they were astounded. WHY do we pay for a TV licence?... More

Post a comment

Linux is shipped on a third of all net...

A third of netbooks shipped in 2009 came with GNU/Linux rather than Windows preinstalled, according to analysis from ABI Research. The firm's figures strongly contradict Microsoft's... More

Post a comment

Discussions

sjh777 sjh777

Copper tax?

Thursday 10 December 2009, 1:16 PM

1 comment
lucadematteis lucadematteis

3 reasons I won’t give up my iPhone

Thursday 10 December 2009, 12:03 PM

5 comments
1000088037 1000088037

Another 'THE SKY IS FALLING!'...

Thursday 10 December 2009, 11:56 AM

1 comment
dres dres

o_O

Thursday 10 December 2009, 11:35 AM

1 comment

Skip Sub Navigation Links to CNET Brand Links

Help

Become part of the ZDNet community.

Newsletters