Arikah Map

Digital library

A digital library is a library in which a significant proportion of the resources are available in machine-readable format (as opposed to print or microform), accessible by means of computers. The digital content may be locally held or accessed remotely via computer networks. In libraries, the process of digitization began with the catalog, moved to periodical indexes and abstracting services, then to periodicals and large reference works, and finally to book publishing. Some of the largest digital libraries are purely digital having few if any physical holdings.


Contents

Types of digital libraries

The term digital library is diffuse enough to be applicable to a wide range of digital entities. Divisions can be made between libraries that have some physical presence, where patrons are able to access physical holdings as well as digital holdings and libraries where collections are almost completely digital. Project Gutenberg, ibiblio, and the Internet Archive can serve as examples of this later case.

Advantages

Traditional libraries are limited by storage space; digital libraries have the potential to store much more information, simply because digital information requires very little physical space to contain it. As such, the cost of maintaining a digital library is much lower than that of a traditional library. A traditional library must spend large sums of money paying for staff, book maintenance, rent, and additional books. Digital libraries do away with these fees.

Digital libraries can immediately adopt innovations in technology providing users with improvements in electronic and audio book technology as well as presenting new forms of communication such as wikis and blogs.

Problems

Some people have criticized that digital libraries are hampered by copyright law, because works cannot be shared over different periods of time in the manner of a traditional library. The content is, in many cases, public domain or self-generated content only. Some digital libraries, such as Project Gutenberg, work to digitize out-of-copyright works and make them freely available to the public. An estimate of the number of distinct books still existent in library catalogues from 2000B.C. to 1960, has been made [1].

Other digital libraries (more specifically, digitial collections, which may be acquired by libraries) accommodate copyright concerns by licensing content and distributing it on a commercial basis, which allows for better management of the content's reproduction and the payment (if required) of royalties.

Digital libraries cannot reproduce the environment of a traditional library. Many people also find reading printed material to be easier than reading material on a computer screen although this depends heavily on presentation as well as personal preferences[2]. Also, due to technological developments, a digital library can see some of its content become out-of-date and its data may become unaccessible.

Access to digital libraries and their collections is dependent upon a stable information technology infrastructure (power, computers, communications links etc.). Hence, despite the egalitarian potential of the digitial library, many of those who could most benefit from its global reach (for instance in the Third World) are not able to do so.

Academic repositories

Many academic libraries are actively involved in building institutional repositories of the institution's books, papers, theses, and other works which can be digitized. Many of these repositories are made available to the academic community or the general public. Insitutional repositories are often referred to as digital libraries.

Digital archives

Archives differ from libraries in several ways. Traditionally, archives were defined as:

  1. Containing primary sources of information (typically letters and papers directly produced by an individual or organization) rather than the secondary sources found in a library (books, etc);
  2. Having their contents organized in groups rather than individual items. Whereas books in a library are cataloged individually, items in an archive are typically grouped by provenance (the individual or organization who created them) and original order (the order in which the materials were kept by the creator);
  3. Having unique contents. Whereas a book may be found at many different libraries, depending on its rarity, the records in an archive are usually one-of-a-kind, and cannot be found or consulted at any other location except at the archive that holds them.

The technology used to create digital libraries has been even more revolutionary for archives since it breaks down the second and third of these general rules. The use of search engines, Optical Character Recognition and metadata allow digital copies of individual items (i.e. letters) to be cataloged, and the ability to remotely access digital copies has removed the necessity of physically going to a particular archive to find a particular set of records.

Cornell University and the Wisconsin Historical Society are considered leaders in the field of digital archive creation and management.

Searching

Most digital libraries provide a search interface which allows resources to be found. These resources are typically deep web (or invisible web) resources since they frequently cannot be located by search engine crawlers. Some digital libraries create special pages or sitemaps to allow search engines to find all their resources. Digital libraries frequently use the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) to expose their metadata to other digital libraries, and search engines like Google can also use OAI-PMH to find these deep web resources.

There are two general strategies for searching a federation of digital libraries:

  1. distributed searching, and
  2. searching previously harvested metadata.

Distributed searching typically involves a client sending multiple search requests in parallel to a number of servers in the federation. The results are gathered, duplicates or eliminated or clustered, and the remaining items are sorted and presented back to the client. Scalability and performance issues tend to plague distributed searching for large federations of digital libraries. Protocols like Z39.50 are frequently used in distributed searching.

Searching over previously harvested metadata requires the pooling of metadata collected from every digital library in the federation. This solution scales better than distributed search, but it introduces the problem of data freshness; digital libraries need to be re-harvested on a periodic basis to discover new and updated resources. OAI-PMH is frequently used by digital libraries for harvesting metadata.

Framework

The Greenstone Digital Library Software is free and open source digital library building software that is sometimes employed for building digital libraries.

The future

Large scale digitization projects are underway at Google, the Million Book Project, MSN, and Yahoo!. With continued improvements in book handling and presentation technologies such as optical character recognition and ebooks, and development of alternative depositories and business models, digital libraries are rapidly growing in popularity as demonstrated by Google, Yahoo!, and MSN's efforts. Just as libraries have ventured into audio and video collections, so have digital libraries such as the Internet Archive.

See also

Categories


Digital libraries | Futurology | Library and information science | Types of library

Find

Find

Find