Advertisement
Promo

Online business Toolkit

W3C examines the next generation of speech technology

Dawn Kawamoto CNET News

Published: 07 Dec 2005 16:05 GMT

  • Email
  • Trackback
  • Clip Link
  • Print friendly
  • Post Comment

The W3C on Tuesday said the next generation of VoiceXML will include specifications for speaker verification.

The W3C said it will draft specifications for VoiceXML 3.0, a technology enabling voice identification verification for users transacting business by phone or using voice on computers.

VoiceXML technologies are usually used to enable commands to be issued by voice, rather than with keystrokes. A number of businesses rely on the technology to improve profits by automating processes and reducing employees.

But users and businesses are becoming increasingly concerned about the security of those transactions, given the influx of cases where security has been breached.

"Speaker verification and identification is not only the best biometric for securing telephone transactions and communications, it can work seamlessly with speech recognition and speech synthesis in VoiceXML deployments," Ken Rehor, newly elected chairman of the VoiceXML Forum, said in a statement.

The W3C has now completed its desired requirements for VoiceXML 3.0 and expects to have a working draft of the specifications by the end of the first quarter, said James Larson, co-chair of the W3C Voice Browser Working Group.

In addition to the speaker identification requirements for VoiceXML 3.0, the W3C addressed the issue of extending its Speech Synthesis Markup Language (SSML) functionality to certain languages including Mandarin, Japanese and Korean.

SSML is designed to allow developers to control various aspects of speech from pitch to volume to pronunciation.

"The Chinese tags will include the right tone so the right meaning is conveyed, and boundaries so users know where words start and end," Larson said.

In Mandarin, for example, the word "mai" would mean either "cat" or "pretty", depending on the tone the speaker used.

Tags in SSML already help speech synthesizers to elicit correct word pronunciations, Larson said. For example, aluminium is pronounced one way in the United States, but the same word has a different pronunciation in Canada and the UK.

Although SSML is already a standard, the W3C group working on the language extensions hopes to hold its first formal meeting in March to develop a document on extension requirements, Larson noted.

  • Email
  • Trackback
  • Clip Link
  • Print friendlyPrint with EPSON

Did you find this article useful?
70 out of 122 people found this useful


Full Talkback thread

0 comments

Company/Topic Alerts

Create a new alert from the list below:





Win a BlackBerry with Vlingo voice recognition

Win a BlackBerry with Vlingo voice recognition

What is ZDNet UK's usual tagline?

Competition closes - 14 Jan 2010

Video icon

Video

Google Chrome

Roundup: Full coverage of Google Chrome

The search giant has launched a beta of its own open-source browser, sending a clear challenge to Microsoft in the way it lets users work with applications More

Blog: Google Chrome has Microsoft's code inside, says MS manager

And furthermore, he says, that's a good thing... More

Blog: Google Chrome — nine things we've found since launch

Google must be very happy with the coverage Chrome has gathered. But it's not all good news... More


Skip Sub Navigation Links to CNET Brand Links

Help

Become part of the ZDNet community.

Newsletters