Advertisement
Promo

Office applications Toolkit

Microsoft seeks to build more useful thesaurus

Ina Fried CNET News

Published: 24 Feb 2009 11:25 GMT

  • Email
  • Trackback
  • Clip Link
  • Print friendly
  • Post Comment

The Next Generation Writing Assistance project within Microsoft's research unit is aiming to build a more useful thesaurus, by tapping techniques used to translate languages.

Although thesauri are good at finding synonyms, they require the user to pick the right one, since they can't understand context. That's where machine language-translation techniques come in.

"We've taken the actual translation tables... and said: 'If a word in Chinese maps to two different English words, maybe those two words are synonyms, with some probability'," said Chris Brockett, a computational linguist and one of the Microsoft researchers leading the project.

This approach offers two key benefits over using a standard thesaurus: it can handle phrases, as opposed to single words, and it can draw on the context in which a phrase is used.

Brockett plans to show off a prototype of the tool next week at TechFest, Microsoft's annual internal science fair. It's one of dozens of projects that will be shown as part of an effort to expose Microsoft's business units to the work being done in Microsoft's research labs.

As is the case with most of the projects that will be displayed at TechFest, the thesaurus effort is still in its infancy.

"We're still working on the algorithms and how much work we give to the language pairs," Brockett said. "We have to get the quality up. There are usability issues that have to be looked into."

Over time, Brockett hopes the technique could be used to effectively rewrite whole sentences. However, would-be plagiarists should beware. Although the technology could one day translate a whole Wikipedia article for you, it would probably translate the article the same way for everyone else as well. Plagiarism-detection software is also evolving.

The thesaurus technology would be naturally suited to inclusion in Word, which already has a built-in traditional thesaurus.

The technology could also help Microsoft in another key area: search. While search engines are good at finding names, for example, that have just one form, they have more difficulty finding expressions that can be phrased in multiple ways.

Credit: Microsoft aims to build a better thesaurus from CNET News

  • Email
  • Trackback
  • Clip Link
  • Print friendlyPrint with EPSON

Did you find this article useful?


Full Talkback thread

0 comments

Company/Topic Alerts

Create a new alert from the list below:





Video icon

Video

Discussions

CA CA

How to estabish your company on the wo...

Wednesday 25 November 2009, 11:39 PM

1 comment
DCoker DCoker

Does 10x faster development dumb down...

Wednesday 25 November 2009, 11:26 PM

8 comments
CA CA

Does 10x faster development dumb down...

Wednesday 25 November 2009, 11:02 PM

8 comments
Tezzer Tezzer

Not good enough

Wednesday 25 November 2009, 10:08 PM

1 comment

Vista Upgrade Blog

This Crap Site

How utterly stupid - I am ranked #40 in the top 100 - as a member of this site..... I mean HOW utterly stupid.... I have done sweet FA, I have only rejoined this site after a 3 or... More

2 comments

Microsoft Security Update: November Pa...

Apologies for this late update to our core Patch Tuesday update. Here is a summary of the update .... The November Patch Tuesday update from Microsoft follows the largest patch and... More

Post a comment

Windows 7 pricing all over the shop..a...

I really think Microsoft have made a mess of Windows 7 pricing. They got the product right, yet there initial pricing of at around £44.95 for the full version of Windows 7 Home Premium... More

7 comments


Skip Sub Navigation Links to CNET Brand Links

Help

Become part of the ZDNet community.

Newsletters