Advertisement
Promo

Industry watch Toolkit

HP, MIT delve deep with digital library

Michael Kanellos, CNET News.com CNET News

Published: 05 Nov 2002 09:28 GMT

  • Email
  • Trackback
  • Clip Link
  • Print friendly
  • Post Comment

The Massachusetts Institute of Technology and Hewlett-Packard on Monday unveiled a system for electronically archiving books, lecture notes and scientific data that potentially will serve as a model for academic libraries in the future.

Called DSpace, the new system is essentially a centralised, electronic repository for the massive amounts of intellectual property created by research institutions, said Mackenzie Smith, associate director of MIT Libraries and the DSpace project director.

Preserving data in an accessible manner is increasingly becoming a problem for a number of universities and government agencies. MIT itself produces an estimated 10,000 pieces of digital content a year, a figure that includes conference papers and technical reports.

Some of the data is also quite large and difficult to access. One faculty member has generated ocean floor maps that take up 30 terabytes of data.

"We began this to get some kind of territorial control over all of this research," Smith said. "If you're lucky, you can get some of it on Google, but most of the stuff we are talking about is not indexed in any way you can get it."

Potentially, DSpace will lead to the creation of a virtual library that meshes the collections of several research universities. MIT is already discussing using the system to link to the libraries of Cambridge and Cornell, she said. Corporations and government agencies have also been in contact with MIT.

The heart of the DSpace system is an open-source storage and retrieval system. Each academic department has been assigned a customised portal for submitting materials, Smith said. Professors and researchers can then deposit information directly into the system through a portal, or after a peer review, depending on the departmental regulations.

To retrieve documents, researchers can consult an index. Author and text searches will come in later versions, she said.

"Part of the reason for doing this is that the faculty says, 'My stuff is too hard to find,'" she said.

Using open-source software also cut costs, Smith added. The MIT system, which currently can hold two terabytes (2 trillion bytes) of data, can be replicated for $100,000 to $500,000 (£64,000 to £320,000), with most of the expense deriving from hardware. The software will be licensed freely under the Berkeley licence.

The system can also be expanded. Eventually, MIT's system will wield more than a petabyte, or a quadrillion bytes of data.

Over time, academics and librarians will then have to go through the arduous process of determining what to keep and what to eliminate. The system will ideally let universities cut costs associated with housing documents and research findings, but electronic storage isn't free so culling is inevitable, she said.

The project started about 18 months ago and was jointly developed by MIT and HP. The company and the university have collaborated on a number of projects. Recently, the two digitised all of the output of MIT Press, including out-of-print textbooks, and put it into a searchable database.


Everybody needs storage. And almost every week some company manages to squeeze more storage into less space for a lower price. For the latest news, reviews and price checks on everything from USB flash cards and PCCard hard disks to storage area networks, see ZDNet UK's Storage News Section.

Have your say instantly, and see what others have said. Go to the ZDNet news forum.

Let the editors know what you think in the Mailroom.

  • Email
  • Trackback
  • Clip Link
  • Print friendlyPrint with EPSON

Did you find this article useful?
60 out of 125 people found this useful


Full Talkback thread

0 comments

Company/Topic Alerts

Create a new alert from the list below:







Discussions

juicecultus juicecultus

The link provided is not working

Sunday 6 December 2009, 5:13 PM

1 comment
lezlow lezlow

when it comes with power supply you,ll...

Saturday 5 December 2009, 9:42 PM

3 comments
lezlow lezlow

yer

Saturday 5 December 2009, 9:40 PM

1 comment
lezlow lezlow

HP workers set dates for strikes

Saturday 5 December 2009, 9:39 PM

2 comments
Video icon

Video

Featured Talkback

In association with Network Liberation Movement
When all is said, if Microsoft produce the best product people will buy it and thats a good thing. If people have to buy their product because no one else can produce an alternative, only because interoperability protocols are kept secret, then thats a bad thing.

By: pround

Read full story:
EU court crushes Microsoft's antitrust appeal


Skip Sub Navigation Links to CNET Brand Links

Help

Become part of the ZDNet community.

Newsletters