ZDNet UK


Skip to Main Content

ZDNet.co.uk - Winner of Best Business Website 2007
  1. Home
  2. News
  3. Blogs
  4. Reviews
  5. Prices
  6. Resources
  7. Community
  8. My ZDNet

 

ZDNet UK RSS Feeds


IT Jobs

Online business Toolkit

British Library plans to archive whole UK Web

Ingrid Marson ZDNet.co.uk

Published: 24 Jun 2004 15:00 BST

  • Email
  • Trackback
  • Clip Link
  • Print friendly
  • Post Comment

A trial project to archive 6,000 UK Web sites was announced on Tuesday by the UK Web Archiving Consortium. The consortium, led by the British Library, includes the Wellcome Trust, the National Archives and the Scottish and Welsh national libraries.

Each member of the consortium will choose content relevant to its subject. All types of Web content will be included, from government documents to blogs.

Richard Boulderstone, director of e-strategy at the British Library, said that all types of material will be collected including "informal material" such as discussion forums. "Letters and other informal works tell us how society is actually operating," he said.

The British Library will not censor the material because it does not want to restrict what people can find out about in the future.

"We would like to take a snapshot of every year, as a sample of what the Web looked like", said Boulderstone, suggesting that in the future people could look back to 2004 and see the swear words that Web users were using.

Only a limited number of Web sites will be archived initially but "ultimately, we would like to archive the whole UK Web," said Boulderstone.

One of the problems faced by the consortium is that, due to UK copyright law, permission is needed before a site can be archived. The British Library is working with the government to extend the law to allow them blanket access to all Web sites because "there are 4 million sites that we would like to capture -- we cannot ask everyone for permission," said Boulderstone.

The UK Web Archiving Consortium is not the first to archive the Web. The Wayback Machine, run by US-based Internet Archive, is a service that allows people to visit archived versions of Web sites.

According to Boulderstone, the British Library's approach differs from that of the Internet Archive because his organisation seeks permission from Web sites. In the future, the British Library hopes to improve on Wayback by archiving more frequently and with more depth, and through providing metadata so that information can be found more easily.

  • Email
  • Trackback
  • Clip Link
  • Print friendly Print with Dell

Did you find this article useful?
58 out of 115 people found this useful


Company/Topic Alerts

Create a new alert from the list below:





Related Jobs

QTP Tester Law Sector London 40K 45K

A leading company providing software and information services to the legal sector require an QTP specialist to work on testing their web based ...

Quality Lead - Unilever - Level C-00055185

The Quality and Process Improvement programme (QPI), Sarbanes Oxley (SOX) Compliance and Security are highly visible subject matter on this ...

SAP Project Manager - British National

For Security reasons you MUST be a British National. Our exclusive client has an urgent requirement for a SAP Project Manager for an initial 6 month ...

Sentry Posts Blog

Mobile Security Expert: Your Camera Ph...

Mobile Security Expert: Your Camera Phone Got Hacked Author: Eric Everson, Founder MyMobiSafe.com Have you ever heard someone say “I’d like to be a fly on the wall in that room.”?... More

Post a comment

Skype - The Roach Motel

Here is an interesting article from The National Business Review, pointing out once again that you can never delete a Skype account. Never. Period. This is something I am familiar... More

Post a comment

The vPhone: Why Visa Should Go Mobile

The vPhone: Why Visa Should Go Mobile Author: Eric Everson, Founder MyMobiSafe.com With all of the success of Apple’s iPhone, there is a growing case to support a company like Visa... More

Post a comment

Featured Talkback

I wonder, who needs .asia domain? I cannot imagine, what would be useful for Microsoft.asia? Toyota.asia? Then let's register .europe (if .eu is too short). Or perhaps Microsoft.southamerica, Dell.australiaandnewzealand, Coca-Cola.africa... Sound funny? Then why not just use the global and country domains? Or perhaps it is time to drop the domains at all?

By: LadyRoot

Read full story:
Businesses advised to register .asia domains