Metadata

Data that provides information about other data [wikipedia].

Metadata is structured (and often standardised) information associated with a (data) resource, that provides information about the resource itself.

Metadata provides context and pragmatics (which may be general or domain-specific) for its resource.

Purpose

Metadata allows you to:

  • Make resources findable, by providing a high-level overview which can be inserted into a search index

  • Make resources reusable, by providing information about how they were generated in the first place.

Thinking about types of metadata allows you to:

  • Decide who should be produce metadata and when that should happen

Types of Metadata

A 'type' of metadata is a broad-brush classification of what a metadata element is for and (correspondingly) who might create it and when. The How To Fair project describes three different types of metadata, Administrative, Description and Structural. Wikipedia defines many more!

How To Fair's Structural metadata description incorporates two aspects, we've differentiated them into "provenance" and "form" to help highlight the different roles and times.

Administrative metadata is relevant for managing data, for example:

  • Project

  • Resource owner

  • Collaborators

  • Funder

  • Organisation

  • License

These can usually be assigned before you collect or create the data resource itself.

The Dublin Core Metadata Initiative

The Dublin Core Metadata Initiative is an organisation dedicated to metadata. They give a set of fifteen widely used metadata elements. If publishing a data resource on the web, these elements serve as a guide for metadata should be included in order to best facilitate information discovery.

See the Dublin Core elements

See the DCMI elements page for full descriptions.

  • Contributor

  • Coverage

  • Creator

  • Date

  • Description

  • Format

  • Identifier

  • Language

  • Publisher

  • Relation

  • Rights

  • Source

  • Subject

  • Title

  • Type

Whilst metadata need not be limited to this (particularly in cases where rich descriptive metadata can be used to query for datasets, as opposed to search based methods), adhering to the Dublin Core should ensure a good search ranking and help you conform to the FAIR principles.

Purpose

The Dublin Core allows you to:

  • Quickly define a set of metadata to make a published data resource findable

Tip

As discussed in Structural (form) metadata, a schema can be an important element of your metadata, for describing the content of the data resource.

It is also possible to use schema in a different way - not to describe the resource itself, but to describe the associated metadata!

This allows you to check that the provided metadata is correct. The Dublin Core Metadata Initiative publish schema for just this purpose.

Useful syntaxes for metadata

We think the following syntaxes will be most useful for definition (but this is by no means an exhaustive list):

Last updated