ZDNet UK


Skip to Main Content

ZDNet.co.uk - Winner of Best Business Website 2007
  1. Home
  2. News
  3. Blogs
  4. Reviews
  5. Prices
  6. Resources
  7. Community
  8. My ZDNet

 

ZDNet UK RSS Feeds


IT Jobs

Application development Toolkit

Google could be five times faster

Winston Chai GameSpot Europe

Published: 27 May 2003 08:48 BST

  • Email
  • Trackback
  • Clip Link
  • Print friendly
  • Post Comment

Users of the Google search engine like it because it's fast, but a team at Stanford University has come up with ways to make it up to five times faster.

With the extra speed, Google could tailored for each user, according to the team. For example, a sports-loving Google user looking for "tiger" will see pages only on golfer Tiger Woods, not large felines from Asia.

At present, Google's ranking system relies on a method called PageRank, an invention of co-founder Larry Page which calculates the popularity and relevance of Web sites based on how many other sites link to it.

"Computing PageRank for a billion Web pages can take several days. Google currently ranks and searches three billion Web pages and each personalised or topic-sensitive ranking would also require a separate multi-day computation," the university said in a statement.

To speed up PageRank, Stanford researchers have developed a trio of techniques based on a branch of mathematics called numerical linear algebra. These methods are described in three papers.

The first method from the Stanford team, BlockRank, offers the most significant gain, speeding up PageRank by three times, they claim.

The researchers make use of their discovery that on most sites, up to 80 percent of links point to other pages on the same site -- each site looks like a thick block of links.

PageRank processes each link individually, but with their more efficient BlockRank method, these same-site links are processed as a unit, before moving on to links outside the site.

The second method involves the use of extrapolation. Before scanning the Web, certain assumptions about a site's importance are drawn up.

As the scanning continues, these assumptions are either proven or disproved, with the accuracy increasing as more links are processed. A site's rank is extrapolated -- guessed at -- when a reasonable amount of evidence is acquired. Compared with PageRank, which only knows a site's rank after exhaustively trawling the Web, extrapolation works 50 percent faster, say the researchers.

The third method, called Adaptive PageRank, relies on the fact that lower-ranking sites tend to be computed faster than higher-ranking ones. By dropping further processing of such quickly-computed sites, a speed boost of up to 50 percent can be won, they said.

While these methods have their individual merits, the Stanford team believes they can offer even greater returns when combined.

"Further speed-ups are possible when we use all these methods," said Sepandar Kamvar, one of the members of this project. "Our preliminary experiments show that combining the methods will make the computation of PageRank up to a factor of five faster.

"However, there are still several issues to be solved. We're closer to a topic-based PageRank than to a personalised ranking," he added.

The Stanford team's theories will remain theories for now -- they don't appear to have any official ties to Google itself.

"Google appreciates any contributions that further the study of hyperlink analysis on the web," was a spokesman's reply to CNETAsia when asked whether Google will consider using the team's methods, or if the privately-held company was involved in the university team's efforts.

The Stanford team presented its paper on these Google enhancements at the Twelfth Annual Word Wide Web Conference in Budapest, Hungary, last week.


For everything Internet-related, from the latest legal and policy-related news, to domain name updates, see ZDNet UK's Internet News Section.

Let the editors know what you think in the Mailroom.

  • Email
  • Trackback
  • Clip Link
  • Print friendly Print with HP

Did you find this article useful?
32 out of 74 people found this useful


Full Talkback thread

0 comments

Company/Topic Alerts

Create a new alert from the list below:











Related Jobs

Senior Metals Researchers - London

Global metals research boutique is looking for senior metals researchers/ analysts to join their rapidly growing London office.

Junior HTML CSS Photoshop - Web Designer role - Berkshire

The position that is available is one that is heavily involved within the E-Commerce domain and will involve the Web Designer to design and build web ...

Exciting marketing company seek web designer for (x)html, CSS role

Your day to day responsibilities will include designing HTML e-mails and landing pages for the clients who include British Airways. The role in terms ...

Discussions

davidparry davidparry

Rugged or Heavy Duty?

Sunday 11 May 2008, 9:50 AM

1 comment
dotancohen dotancohen

Just install Ubuntu!

Saturday 10 May 2008, 6:57 PM

1 comment

Featured Talkback

The fact is: Software developers today are really designers and not coders. The reason that business anlaysts exist today to model solutions is because they understand the value of designing software before writing it. All too often developers create code that has little value because they do not understand that business classes interact with other classes within the confines of a working model or pattern.

By: 1000165269

Read full story:
Making sense of agile modelling