DCMI Kernel Metadata Task Group J. Kunze
  A. Turner
  California Digital Library
  November 15, 2007 - DRAFT V1.0

Kernel Metadata/ERC Application Profile for Dublin Core

 

Introduction

Kernel metadata ([KSPEC]) is a small prescriptive vocabulary designed to support highly uniform but minimal object descriptions for the purpose of orderly collection management. The Kernel vocabulary, based on a subset of the Dublin Core (DC) metadata element set, aims to describe objects of any form or category, but its reach is limited to a small number of fundamental questions such as who, what, when, and where. The Electronic Resource Citation (ERC) is a Kernel metadata-based application profile, specifying requirements for representing Kernel metadata in any given object description.

This document comprises guidelines for representing Kernel metadata and ERC requirements, using Dublin Core Abstract Model [DCAM]-based application profiles.

Functional Requirements and Domain Model

Kernel metadata is designed to assist orderly collection management by supporting the creation of brief but highly uniform object descriptions that can be listed, surveyed, and searched efficiently during normal collection maintenance and trouble-shooting activities. These descriptions serve as object surrogates that are convenient for automated sorting and filtering operations and are also eye-readable without specialized display software. The goal of Kernel metadata is to balance the needs for expressive power, very simple machine processing, and direct human manipulation of metadata records.

Kernel metadata is based on the Dublin Core (DC) metadata element set [RFC5013] maintained by the Dublin Core Metadata Initiative [DCMI]. Kernel elements are descriptors that identify various object properties. In principle they apply to any object in the universe, whether digital, physical, or abstract, following in the tradition of [RFC3986]. This extreme diversity of objects is approached with the hypothesis that highly variable and rich object descriptions can be directly comparable at the level of about four fundamental elements (who, what, when, and where) as applied to the expression of the object. This sequence is a recurring theme in the Kernel. In anticipation of future extensions to "how" and "why", we refer to the first four elements as "the four h's": what all of these question words have in common is an initial aspirated "h" sound, and "h" is also shorter to say than "w".

Kernel-based descriptions make it possible to compare an extremely diverse set of objects. Comparison is possible even when many other elements co-exist with Kernel elements, or when a minor amount of information in other elements overlaps with Kernel element information. Regardless of whether an object is smoked, worn, navigated, or interacted with in any other way, its Kernel-based description ensures the presence of a few predictable points of commonality in the form of easily isolatable Kernel elements. Kernel elements provide a concise intersection of interoperable (or at least comparable) elements across a broad range of object descriptions.
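As an illustration of this kind of comparison, the following minimal Python sketch projects two unrelated descriptions onto the four Kernel elements so that they can be listed and sorted uniformly. The record values, the extra element names ("scale", "fabric"), and the helper kernel_view are hypothetical and are not taken from [KSPEC].

```python
# Hypothetical sketch: project diverse descriptions onto the four
# Kernel elements so they can be compared side by side.
KERNEL_ELEMENTS = ("who", "what", "when", "where")

def kernel_view(record):
    """Return only the Kernel elements of a record, in who/what/when/where order.

    Missing elements are reported as None so every view has the same shape.
    """
    return tuple(record.get(name) for name in KERNEL_ELEMENTS)

# Two very different objects (illustrative values only).
records = [
    {"who": "Smith, Jane", "what": "Harbor chart", "when": "1998",
     "where": "http://example.org/chart/7", "scale": "1:50000"},
    {"who": "Acme Mills", "what": "Wool coat", "when": "1954",
     "where": "http://example.org/coat/12", "fabric": "wool"},
]

# Sort on the Kernel tuple ("who" first), ignoring all non-Kernel elements.
for view in sorted(kernel_view(r) for r in records):
    print(" | ".join(str(v) for v in view))
```

Elements outside the Kernel are simply ignored by the projection, which is what lets otherwise incomparable descriptions share one listing.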

Syntax and Usage Guidelines

Standard encoding methods such as [RDF] and [XML] may be used to format ERCs and Kernel metadata. It is also possible to encode modified forms of Kernel element values using other methods, such as [MARC] or [MODS], although some granularity of information may be lost in the process. For specific syntax and usage guidelines, refer to the Kernel metadata specification [KSPEC] and crosswalk to other encoding schemes [KMAP].
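One possible XML rendering can be sketched in Python as follows. The "record" wrapper element and the flat layout are assumptions made for illustration only; [KSPEC] and [KMAP] remain the authority for actual syntax. The mapping of "who" to creator follows the tables in this document, although Publisher or Contributor may supply "who" when no Creator is present.

```python
import xml.etree.ElementTree as ET

DCTERMS = "http://purl.org/dc/terms/"  # namespace of the DC properties in this profile
ET.register_namespace("dcterms", DCTERMS)

# Kernel element -> DC property local name, per the tables in this document.
KERNEL_TO_DC = {"who": "creator", "what": "title", "when": "date", "where": "identifier"}

def to_xml(kernel_record):
    """Serialize a Kernel record as a flat XML fragment (hypothetical layout)."""
    root = ET.Element("record")  # wrapper element name is an assumption
    for kernel_name, dc_name in KERNEL_TO_DC.items():
        value = kernel_record.get(kernel_name)
        if value is not None:
            child = ET.SubElement(root, "{%s}%s" % (DCTERMS, dc_name))
            child.text = value
    return ET.tostring(root, encoding="unicode")

print(to_xml({"who": "Smith, Jane", "what": "Harbor chart",
              "when": "1998", "where": "http://example.org/chart/7"}))
```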

Description Set Profile

ERC requirements can be roughly mapped to Dublin Core Abstract Model [DCAM]-based application profiles -- such as the Simple Dublin Core application profile [SIMPLEDC] -- as outlined in the tables below. In principle, however, the ERC profile may be expressed through any formal Dublin Core application profile, with the following conditions:

 

Creator


DC Property http://purl.org/dc/terms/creator
Literal? No
DC Definition An entity primarily responsible for making the resource
Kernel Definition A responsible person or party
Kernel Mapping who (h1)
Kernel Property http://www.cdlib.org/inside/diglib/ark/ercspec.html#vocabulary
Comment If no value is provided for Creator, use Publisher, and if no Publisher, use Contributor.

The Kernel assumes that most values, especially personal names given in "who", will be given in a "sort-friendly" manner, for example, "lastname, firstname" for Western names and natural word order for Chinese names.

Value (Non-Literal)
Value URI Constraint: Occurrence disallowed
Vocabulary Encoding Scheme Constraint: Occurrence disallowed
Value String Constraint: Max occurrence 2
Syntax Encoding Scheme Constraint: Occurrence disallowed
Language Constraint: Occurrence optional
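The "sort-friendly" convention for personal names can be sketched in Python as follows. This is a naive illustration, not a rule from [KSPEC]: the function name sort_friendly, the western flag, and the assumption that the last whitespace-separated token is the family name are all hypothetical.

```python
def sort_friendly(name, western=True):
    """Return a sort-friendly form of a personal name.

    Western names become "lastname, firstname"; names in natural word order
    (western=False) and names that already contain a comma are returned as-is.
    Naive assumption: the last whitespace-separated token is the family name.
    """
    name = " ".join(name.split())  # normalize whitespace
    if not western or "," in name or " " not in name:
        return name
    given, family = name.rsplit(" ", 1)
    return "%s, %s" % (family, given)

print(sort_friendly("Jane Q. Smith"))              # Smith, Jane Q.
print(sort_friendly("Smith, Jane"))                # Smith, Jane
print(sort_friendly("Mao Zedong", western=False))  # Mao Zedong
```

Compound family names and name particles would need additional handling that this heuristic does not attempt.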

 

Title


DC Property http://purl.org/dc/terms/title
Literal? Yes
DC Definition A name given to the resource
Kernel Definition A name or other human-oriented identifier
Kernel Mapping what (h2)
Kernel Property http://www.cdlib.org/inside/diglib/ark/ercspec.html#vocabulary

Value (Literal)
Syntax Encoding Scheme: Occurrence disallowed
Language Constraint: Occurrence mandatory

 

Date


DC Property http://purl.org/dc/terms/date
Literal? Yes
DC Definition A point or period of time associated with an event in the lifecycle of the resource
Kernel Definition A date important in the object's lifecycle
Kernel Mapping when (h3)
Kernel Property http://www.cdlib.org/inside/diglib/ark/ercspec.html#vocabulary
Comment The Kernel assumes [TEMPER] format for dates in order to express date ranges, lists, approximate dates, and BC dates (not possible, for example, with [W3CDTF]).

Value (Literal)
Syntax Encoding Scheme: Occurrence disallowed
Language Constraint: Occurrence mandatory

 

Identifier


DC Property http://purl.org/dc/terms/identifier
Literal? Yes
DC Definition An unambiguous reference to the resource within a given context
Kernel Definition A location or system-oriented identifier
Kernel Mapping where (h4)
Kernel Property http://www.cdlib.org/inside/diglib/ark/ercspec.html#vocabulary

Value (Literal)
Syntax Encoding Scheme: Occurrence disallowed
Language Constraint: Occurrence mandatory

 

Subject (Personage)


DC Property http://purl.org/dc/terms/subject
Literal? No
DC Definition The topic of the resource
Kernel Definition A person or party figuring in the information content
Kernel Mapping about-who (h11)
Kernel Property http://www.cdlib.org/inside/diglib/ark/ercspec.html#vocabulary

Value (Non-Literal)
Value URI Constraint: Occurrence disallowed
Vocabulary Encoding Scheme Constraint: Occurrence disallowed
Value String Constraint: Max occurrence 1
Syntax Encoding Scheme Constraint: Occurrence disallowed
Language Constraint: Occurrence optional

 

Subject


DC Property http://purl.org/dc/terms/subject
Literal? No
DC Definition The topic of the resource
Kernel Definition A subject or topic figuring in the information content
Kernel Mapping about-what (h12)
Kernel Property http://www.cdlib.org/inside/diglib/ark/ercspec.html#vocabulary

Value (Non-Literal)
Value URI Constraint: Occurrence disallowed
Vocabulary Encoding Scheme Constraint: Occurrence disallowed
Value String Constraint: Max occurrence 1
Syntax Encoding Scheme Constraint: Occurrence disallowed
Language Constraint: Occurrence optional

 

Coverage (Temporal)


DC Property http://purl.org/dc/terms/temporal
Literal? No
DC Definition Temporal characteristics of the intellectual content of the resource
Kernel Definition A time period covered by the information content
Kernel Mapping about-when (h13)
Kernel Property http://www.cdlib.org/inside/diglib/ark/ercspec.html#vocabulary

Value (Non-Literal)
Value URI Constraint: Occurrence disallowed
Vocabulary Encoding Scheme Constraint: Occurrence disallowed
Value String Constraint: Max occurrence 1
Syntax Encoding Scheme Constraint: Occurrence disallowed
Language Constraint: Occurrence optional

 

Coverage (Spatial)


DC Property http://purl.org/dc/terms/spatial
Literal? No
DC Definition Spatial characteristics of the intellectual content of the resource
Kernel Definition A location or region covered by the information content
Kernel Mapping about-where (h14)
Kernel Property http://www.cdlib.org/inside/diglib/ark/ercspec.html#vocabulary

Value (Non-Literal)
Value URI Constraint: Occurrence disallowed
Vocabulary Encoding Scheme Constraint: Occurrence disallowed
Value String Constraint: Max occurrence 1
Syntax Encoding Scheme Constraint: Occurrence disallowed
Language Constraint: Occurrence optional

 

Description


DC Property http://purl.org/dc/terms/description
Literal? Yes
DC Definition An account of the resource
Kernel Definition A description of the information content
Kernel Mapping about-how (h15)
Kernel Property http://www.cdlib.org/inside/diglib/ark/ercspec.html#vocabulary

Value (Literal)
Syntax Encoding Scheme: Occurrence disallowed
Language Constraint: Occurrence optional

 

Publisher


DC Property http://purl.org/dc/terms/publisher
Literal? No
DC Definition An entity responsible for making the resource available
Kernel Definition A responsible person or party
Kernel Mapping who (h1)
Kernel Property http://www.cdlib.org/inside/diglib/ark/ercspec.html#vocabulary
Comment If no value is provided for Creator, use Publisher, and if no Publisher, use Contributor.

The Kernel assumes that most values, especially personal names given in "who", will be given in a "sort-friendly" manner, for example, "lastname, firstname" for Western names and natural word order for Chinese names.

Value (Non-Literal)
Value URI Constraint: Occurrence disallowed
Vocabulary Encoding Scheme Constraint: Occurrence disallowed
Value String Constraint: Max occurrence 1
Syntax Encoding Scheme Constraint: Occurrence disallowed
Language Constraint: Occurrence optional
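The fallback order stated in the Comment row (Creator, then Publisher, then Contributor) can be sketched as a small Python helper. The dictionary layout and the function name resolve_who are hypothetical; only the ordering comes from this document.

```python
def resolve_who(description):
    """Pick the values for Kernel "who" using the fallback order in the
    Comment rows: Creator first, then Publisher, then Contributor.

    `description` maps lowercase DC property names to lists of value strings
    (an assumed layout). Returns None when none of the three is present.
    """
    for prop in ("creator", "publisher", "contributor"):
        values = description.get(prop)
        if values:  # skip missing keys and empty lists
            return values
    return None

print(resolve_who({"publisher": ["Acme Press"], "contributor": ["Doe, John"]}))
```

Here the Publisher value is chosen because no Creator is given, and the Contributor is consulted only when both of the others are absent.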

 

Contributor


DC Property http://purl.org/dc/terms/contributor
Literal? No
DC Definition An entity primarily responsible for making contributions to the resource
Kernel Definition A responsible person or party
Kernel Mapping who (h1)
Kernel Property http://www.cdlib.org/inside/diglib/ark/ercspec.html#vocabulary

Comment If no value is provided for Creator, use Publisher, and if no Publisher, use Contributor.

The Kernel assumes that most values, especially personal names given in "who", will be given in a "sort-friendly" manner, for example, "lastname, firstname" for Western names and natural word order for Chinese names.

Value (Non-Literal)
Value URI Constraint: Occurrence disallowed
Vocabulary Encoding Scheme Constraint: Occurrence disallowed
Value String Constraint: Max occurrence 1
Syntax Encoding Scheme Constraint: Occurrence disallowed
Language Constraint: Occurrence optional
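A few of the constraints tabulated above can be written down as data and checked mechanically. The following Python sketch covers only a subset of the properties, and its dictionary layout, statement format, and function name check_statement are assumptions made for illustration; it is not a Description Set Profile implementation.

```python
# Subset of the profile, transcribed from the tables above.
# max_strings is None where the tables give no Value String Constraint.
PROFILE = {
    "creator":    {"max_strings": 2,    "language": "optional"},
    "publisher":  {"max_strings": 1,    "language": "optional"},
    "title":      {"max_strings": None, "language": "mandatory"},
    "date":       {"max_strings": None, "language": "mandatory"},
    "identifier": {"max_strings": None, "language": "mandatory"},
}

def check_statement(stmt):
    """Return a list of problems found in one statement (empty if none).

    `stmt` is a dict with "property", "strings" (list of value strings),
    and an optional "lang" tag; this shape is hypothetical.
    """
    rule = PROFILE.get(stmt["property"])
    if rule is None:
        return ["unknown property: %s" % stmt["property"]]
    problems = []
    if rule["language"] == "mandatory" and not stmt.get("lang"):
        problems.append("language tag required")
    limit = rule["max_strings"]
    if limit is not None and len(stmt.get("strings", [])) > limit:
        problems.append("too many value strings (max %d)" % limit)
    return problems

print(check_statement({"property": "title", "strings": ["Harbor chart"]}))
print(check_statement({"property": "creator", "strings": ["Smith, Jane"]}))
```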

 

References

[AACR2] American Library Association, “Anglo-American Cataloguing Rules,” 2007 (HTML).
[DCAM] Dublin Core Metadata Initiative, “DCMI Abstract Model” (HTML).
[DCMI] Dublin Core Metadata Initiative, “DCMI Metadata Terms” (HTML).
[KMAP] Kunze, J. and A. Turner, “Mapping and Crosswalk: Kernel Metadata to Dublin Core, MARC21, MODS, and VRA Core” (HTML).
[KSPEC] Kunze, J. and A. Turner, “Kernel Metadata and Electronic Resource Citations (ERCs)” (HTML).
[MARC] Library of Congress, “Machine Readable Cataloguing,” 2007 (HTML).
[MODS] Library of Congress, “Metadata Object Description Schema,” June 2006 (HTML).
[RDF] W3C, “Resource Description Framework” (HTML).
[SIMPLEDC] Enoksson, F., “Simple Dublin Core: an Application Profile,” September 2007 (HTML).
[TEMPER] Blair, C. and J. Kunze, “Temporal Enumerated Ranges,” August 2007 (PDF).
[W3CDTF] “Date and Time Formats (W3C profile of ISO8601)” (HTML).
[XML] W3C, “Extensible Markup Language (XML) 1.0 (Fourth Edition),” August 2006 (HTML).
[RFC5013] Kunze, J. and T. Baker, “The Dublin Core Metadata Element Set,” RFC 5013, August 2007.
[RFC3986] Berners-Lee, T., Fielding, R., and L. Masinter, “Uniform Resource Identifier (URI): Generic Syntax,” STD 66, RFC 3986, January 2005 (TXT, HTML, XML).

Notes on Draft V1.0