Advertisement
Promo

Office applications Toolkit

Microsoft seeks to build more useful thesaurus

Ina Fried CNET News

Published: 24 Feb 2009 11:25 GMT

  • Email
  • Trackback
  • Clip Link
  • Print friendly
  • Post Comment

The Next Generation Writing Assistance project within Microsoft's research unit is aiming to build a more useful thesaurus, by tapping techniques used to translate languages.

Although thesauri are good at finding synonyms, they require the user to pick the right one, since they can't understand context. That's where machine language-translation techniques come in.

"We've taken the actual translation tables... and said: 'If a word in Chinese maps to two different English words, maybe those two words are synonyms, with some probability'," said Chris Brockett, a computational linguist and one of the Microsoft researchers leading the project.

This approach offers two key benefits over using a standard thesaurus: it can handle phrases, as opposed to single words, and it can draw on the context in which a phrase is used.

Brockett plans to show off a prototype of the tool next week at TechFest, Microsoft's annual internal science fair. It's one of dozens of projects that will be shown as part of an effort to expose Microsoft's business units to the work being done in Microsoft's research labs.

As is the case with most of the projects that will be displayed at TechFest, the thesaurus effort is still in its infancy.

"We're still working on the algorithms and how much work we give to the language pairs," Brockett said. "We have to get the quality up. There are usability issues that have to be looked into."

Over time, Brockett hopes the technique could be used to effectively rewrite whole sentences. However, would-be plagiarists should beware. Although the technology could one day translate a whole Wikipedia article for you, it would probably translate the article the same way for everyone else as well. Plagiarism-detection software is also evolving.

The thesaurus technology would be naturally suited to inclusion in Word, which already has a built-in traditional thesaurus.

The technology could also help Microsoft in another key area: search. While search engines are good at finding names, for example, that have just one form, they have more difficulty finding expressions that can be phrased in multiple ways.

Credit: Microsoft aims to build a better thesaurus from CNET News

  • Email
  • Trackback
  • Clip Link
  • Print friendly
  • Post Comment

Did you find this article useful?


Full Talkback thread

0 comments

Company/Topic Alerts

Create a new alert from the list below:





Video icon

Video

Discussions

Fat Pop Do Wop Fat Pop Do Wop

Windows? Security? Oxymoron?

Tuesday 9 February 2010, 11:32 PM

1 comment
Fat Pop Do Wop Fat Pop Do Wop

Why does everybody hate me?

Tuesday 9 February 2010, 11:17 PM

4 comments
blackholesun blackholesun

The economics of computing

Tuesday 9 February 2010, 9:18 PM

3 comments
knakworst knakworst

Linux Mint 8 KDE Community Edition

Tuesday 9 February 2010, 9:12 PM

2 comments

Small Business Blog

Google Buzz sparks Gmail social networ...

Google has announced on Tuesday that it is immediately rolling out Google Buzz, a location-aware social networking tool highly integrated with its Gmail client, Google Maps and a new... More

Post a comment

Google's People Hopper 'morphs' Orkut...

Just came across a rather strange little project in Google Labs called People Hopper. Launched in late January, it's a gadget that runs in Orkut, Google's massively successful (if... More

Post a comment

Intel wishes virtualisation didn't exi...

Here's an interesting tidbit about vPro vendors Intel: they wish the technology they now enthusiastically tout had never been invented. At a cloud conference I attended today, Intel's... More

Post a comment


Skip Sub Navigation Links to CNET Brand Links

Help

Become part of the ZDNet community.

Newsletters