Metadata Encoding and Transmission Standard

Last updated
Metadata Encoding and Transmission Standard
Version1.12.1 Revised 2019

The Metadata Encoding and Transmission Standard (METS) is a metadata standard for encoding descriptive, administrative, and structural metadata regarding objects within a digital library, expressed using the XML schema language of the World Wide Web Consortium (W3C). The standard is maintained as part of the MARC standards of the Library of Congress, and is being developed as an initiative of the Digital Library Federation (DLF).

Contents

Overview

METS is an XML Schema designed for the purpose of:

Depending on its use, a METS document could be used in the role of Submission Information Package (SIP), Archival Information Package (AIP), or Dissemination Information Package (DIP) within the Open Archival Information System (OAIS) Reference Model. [1]

Digital libraries vs traditional libraries

Maintaining a library of digital objects requires maintaining metadata about those objects. The metadata necessary for successful management and use of digital objects is both more extensive than and different from the metadata used for managing collections of printed works and other physical materials. METS is intended to promote the preservation of, and interoperability between digital libraries. [2]

Flexibility and interoperability

The open flexibility of METS means that there is not a prescribed vocabulary which allows many different types of institutions, with many different document types, to utilize METS. The customization of METS makes it highly functional internally, but creates limitations for interoperability. Interoperability becomes difficult when the exporting and importing institutions have used vocabularies. As a workaround for this problem the creation of institutional profiles has become popular. These profiles document the implementation of METS specific to that institution helping to map content in order for exchanged METS documents to be more usable across institutions. [3] [4]

History

As early as 1996 the University of California, Berkeley began working toward the development of a system that combined encoding for an outline of a digital object's structure with metadata for that object. In 1998 this work was expanded upon by the Making of America II project (MoAII). An important objective of this project was the creation of a standard for digital objects that would include defined metadata for the descriptive, administrative, and structural aspects of a digital object. A type of structural and metadata encoding system using an XML Document Type Definition (DTD) was the result of these efforts. The MoAII DTD was limited in that it did not provide flexibility in which metadata terms could be used for the elements in the descriptive, administrative, and structural metadata portions of the object. [5] In 2001, a new version of the DTD was developed that used namespaces separate from the system rather than the vocabulary of the previous DTD. This revision was the foundation for the current METS schema, officially named in April of that year. [6]

The 7 sections of a METS document

Sections of a METS document PARTSangles.jpg
Sections of a METS document

See also

Related Research Articles

<span class="mw-page-title-main">Dublin Core</span> Standardized set of metadata elements

The Dublin Core, also known as the Dublin Core Metadata Element Set (DCMES), is a set of fifteen main metadata items for describing digital or physical resources. The Dublin Core Metadata Initiative (DCMI) is responsible for formulating the Dublin Core; DCMI is a project of the Association for Information Science and Technology (ASIS&T), a non-profit organization.

<span class="mw-page-title-main">XML</span> Markup language by the W3C for encoding of data

Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing arbitrary data. It defines a set of rules for encoding documents in a format that is both human-readable and machine-readable. The World Wide Web Consortium's XML 1.0 Specification of 1998 and several other related specifications—all of them free open standards—define XML.

<span class="mw-page-title-main">Geography Markup Language</span> XML grammar for geographical features

The Geography Markup Language (GML) is the XML grammar defined by the Open Geospatial Consortium (OGC) to express geographical features. GML serves as a modeling language for geographic systems as well as an open interchange format for geographic transactions on the Internet. Key to GML's utility is its ability to integrate all forms of geographic information, including not only conventional "vector" or discrete objects, but coverages and sensor data.

MARC is a standard set of digital formats for the machine-readable description of items catalogued by libraries, such as books, DVDs, and digital resources. Computerized library catalogs and library management software need to structure their catalog records as per an industry-wide standard, which is MARC, so that bibliographic information can be shared freely between computers. The structure of bibliographic records almost universally follows the MARC standard. Other standards work in conjunction with MARC, for example, Anglo-American Cataloguing Rules (AACR)/Resource Description and Access (RDA) provide guidelines on formulating bibliographic data into the MARC record structure, while the International Standard Bibliographic Description (ISBD) provides guidelines for displaying MARC records in a standard, human-readable form.

The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) is a protocol developed for harvesting metadata descriptions of records in an archive so that services can be built using metadata from many archives. An implementation of OAI-PMH must support representing metadata in Dublin Core, but may also support additional representations.

Encoded Archival Description (EAD) is a standard for encoding descriptive information regarding archival records.

RDFa or Resource Description Framework in Attributes is a W3C Recommendation that adds a set of attribute-level extensions to HTML, XHTML and various XML-based document types for embedding rich metadata within Web documents. The Resource Description Framework (RDF) data-model mapping enables its use for embedding RDF subject-predicate-object expressions within XHTML documents. It also enables the extraction of RDF model triples by compliant user agents.

The AgMES initiative was developed by the Food and Agriculture Organization (FAO) of the United Nations and aims to encompass issues of semantic standards in the domain of agriculture with respect to description, resource discovery, interoperability, and data exchange for different types of information resources.

Geospatial metadata is a type of metadata applicable to geographic data and information. Such objects may be stored in a geographic information system (GIS) or may simply be documents, data-sets, images or other objects, services, or related items that exist in some other native environment but whose features may be appropriate to describe in a (geographic) metadata catalog.

PREservation Metadata: Implementation Strategies (PREMIS) is the de facto digital preservation metadata standard.

<span class="mw-page-title-main">Metadata</span> Data about data

Metadata is "data that provides information about other data", but not the content of the data itself, such as the text of a message or the image itself. There are many distinct types of metadata, including:

A metadata standard is a requirement which is intended to establish a common understanding of the meaning or semantics of the data, to ensure correct and proper use and interpretation of the data by its owners and users. To achieve this common understanding, a number of characteristics, or attributes of the data have to be defined, also known as metadata.

The Publishing Requirements for Industry Standard Metadata (PRISM) for the Internet, computing, and computer science, is a specification that defines a set of XML metadata vocabularies for syndicating, aggregating, post-processing and multi-purposing content.

Analyzed Layout and Text Object (ALTO) is an open XML Schema developed by the EU-funded project called METAe.

In computing, a data definition specification (DDS) is a guideline to ensure comprehensive and consistent data definition. It represents the attributes required to quantify data definition. A comprehensive data definition specification encompasses enterprise data, the hierarchy of data management, prescribed guidance enforcement and criteria to determine compliance.

<span class="mw-page-title-main">Journal Article Tag Suite</span>

The Journal Article Tag Suite (JATS) is an XML format used to describe scientific literature published online. It is a technical standard developed by the National Information Standards Organization (NISO) and approved by the American National Standards Institute with the code Z39.96-2012.

Lightweight Information Describing Objects (LIDO) is an XML schema for describing museum or collection objects. Memory institutions use LIDO for “exposing, sharing and connecting data on the web”. It can be applied to all kind of disciplines in cultural heritage, e.g. art, natural history, technology, etc. LIDO is a specific application of CIDOC CRM.

<span class="mw-page-title-main">Audiovisual archive</span>

In archives, the term "audiovisual" is frequently used generically to denote materials other than written documents. Films, videos, audio recordings, pictures, and other audio and visual media are collected in audiovisual archives. A vast amount of knowledge is included in audiovisual records, which are considered cultural treasures and must be preserved for future use. Print materials would not have the same reach across various audiences as audiovisual resources.

References

  1. Consultative Committee for Space Data Systems (2012). Space Data and Information Transfer System -- Open Archival Information System -- Reference Model (ISO 14721:2012). Geneva: International Organization for Standardization.
  2. McDonough, Jerome (April 2006). "METS: Standardized Encoding for Digital Library Objects". International Journal on Digital Libraries. 6 (2): 148–158. doi:10.1007/s00799-005-0132-1. hdl: 2142/177 . S2CID   2107048. Available as a PDF file Archived 2014-07-15 at the Wayback Machine via ideals.illinois.edu. PDF p. 1, 20-21. Retrieved 17 July 2018.
  3. Pearce, J.; Pearson, D.; Williams, M.; Yeadon, S. (2008). "The Australian METS Profile – A journey about metadata". D-Lib Magazine. 14 (3/4). doi: 10.1045/march2008-pearce . hdl: 1885/46635 . Retrieved 12 May 2014.
  4. Wagner, Lindsey (2011). "METS: A Survey of Recent Literature and Applications". Library Philosophy and Practice. ISSN 1522-0222. 2011. Archived from the original on 4 March 2016. Retrieved 12 May 2014.
  5. McDonough, Jerome (April 2006). "METS: Standardized Encoding for Digital Library Objects". International Journal on Digital Libraries. 6 (2): 148–158. doi:10.1007/s00799-005-0132-1. hdl: 2142/177 . S2CID   2107048. Available as a PDF file Archived 2014-07-15 at the Wayback Machine via ideals.illinois.edu. PDF p. 2. Retrieved 17 July 2018.
  6. Cundiff, M. V. (2004). An introduction to the metadata encoding and transmission standard (METS). Library Hi Tech, 22 (1), 52-62. doi: 10.1108/07378830410524495