ZDNet UK



Search engines make some noise

Stefanie Olsen CNET News.com

Published: 28 May 2004 11:35 BST


StreamSage has flown under the radar during its last four years of operation while it has invested heavily in research and development. Its chief scientist, Tim Sibley, is known for his work in computational linguistics. StreamSage has received funding from research grants, including the National Institute of Standards and Technology's Advanced Technology Program. Harvard University uses StreamSage's technology to allow medical school students to search past lectures on related subjects. AOL is using the technology to provide closed captions for streaming video and audio on AOL Broadband.

NPR is using StreamSage to transcribe its audio programmes as they're broadcast, thereby helping them to get listed faster. NPR does commission transcripts for many of its programmes, but the traditionally manual process of transcription would be too slow for a search related to timely news. Using speech recognition technology, StreamSage can create text from audio much more quickly, and then feed those transcripts to Google and Yahoo.

NPR's Thomas said her outfit eventually replaces the transcripts from StreamSage with those from humans because the human-rendered records much more accurately reflect the audio and video content. StreamSage's results can be garbled.

NPR also licenses technology from Singingfish to meticulously label its audio files with relevant information, or metadata.

In its own first step toward offering multimedia search, Google registers NPR audio files on Google News, its specialty news aggregation service. For example, a search for a headline topic that is discussed on audio-only NPR programmes, such as Talk of the Nation, will uncover a link to the audio programme and the specific segment covered.

A Google representative confirmed a relationship with NPR but declined to comment further on the technology. Up until now, Google has not listed multimedia files because the company has sought to avoid the legal uncertainties of indexing and linking to copyrighted works that owners may want protected, company executives have said in the past. Beyond those reasons, audio and video file searching can be a much more difficult technical task to solve than cataloguing the Web.

StreamSage's Murray said he's not worried about potential copyright issues because his company is not housing the information. Rather, StreamSage points people to audio and video around the Web, just as Google or Yahoo does.

Exactly how far search engines can go in linking to multimedia files has yet to be worked out definitively in the courts. The recording industry last year quietly settled, without any money changing hands, a long-running dispute with MP3Board.com over allegedly illegal links to music files, said MP3Board's attorney, Ira Rothken.

Yahoo also announced a relationship with NPR, in February, when it outlined its "content acquisition" program, a systematic effort to include more hard-to-get information in its searchable database.

An NPR affiliate in Boston, WBUR.org, is using similar technology from Hewlett-Packard, called Speechbot. Robin Lubbock, director of new media for WBUR, said the broadcaster is using Speechbot to convert audio into text so employees and visitors can search for content on its own Web site.

Virage, which is now owned by Autonomy, has technology that analyses in-stream audio and video and lets people zip to the part of the stream they want. Yet it can be an expensive enterprise solution.

Jay Webster, chief technology officer of interactive agency Fathom Online, said that for most audio or video broadcasters to get ranked in search engine results, they would have to employ some manual indexing of their own first.

"Where it gets cool is if you could search on any keyword and find it within audio and that audio would come up in search results," Webster said. "But I don't think we're there yet."
