Minimum Metadata

The SAEON Open Data Portal (ODP) uses the DataCite schema to define its minimum metadata requirements. The form for data submissions to the ODP can be accessed here.

Universal Basic Metadata with DataCite

Data submissions to the Open Data Portal must, at a minimum, follow the DataCite Metadata Schema. The schema is a list of core metadata properties, chosen for accurate and consistent identification of a resource for citation, discovery and re-usability purposes. It can describe any digital object, so it applies to any data family. DataCite has 6 mandatory (M) properties, which must all be completed regardless of data type or conditions; 6 recommended (R) properties, whose completion depends on the conditions of data submission; and 7 optional (O) properties. The DataCite properties are listed and described in Table 1.

Table 1: DataCite Properties

| # (1) | Property | Description | Obligation | Purpose |
|---|---|---|---|---|
| 1 | Identifier | Unique identifier in the provider's environment. | M | Citation |
| 2 | Creator(s) | Usually the author of the work. | M | Citation |
| 3 | Title | The title of the work. | M | Citation |
| 4 | Publisher | The publisher of the work. | M | Citation |
| 5 | Publication Year | The year of publication. | M | Citation |
| 6 | Subject | One or more keywords, which may come from authoritative lists (vocabularies). | R | Discovery |
| 8 | Date | The date coverage of the dataset or digital object - in most cases not the same as the year of publication. | R | Discovery |
| 10 | ResourceType | This assists with re-usability and discovery of resource types appropriate to the intended end use. | M | Re-usability |
| 11 | Alternate Identifier | An alternative identifier, such as a DOI, handle, or ARK. If an identifier is not provided, SAEON will assign a DOI on behalf of the requestor. | O | Re-usability |
| 12 | Related Identifier | Critical for version management - references previous versions of the same digital object, amongst other uses. | R | Re-usability |
| 16 | Rights | This is an important determinant of re-usability. SAEON advocates the use of open licenses. | O | Re-usability |
| 17 | Description | The element can be used for both an abstract and a methodology or lineage description. | R | Re-usability |
| 18 | Geo-Location | Spatial region or named place where the data was gathered or about which the data is focused. This may not be applicable to all datasets, but is very useful for discovery in cases where it does apply. | R | Discovery |
| A | Online Resource | A link to the location of the digital object on the web. | | Discovery |
| B | Original Metadata | A link to the original metadata record on the web. | | Re-usability |
| C | Supplementary Material | A link to supplementary material on the web. This can be a formal, published methodology or protocol. | | Re-usability |
| D | Research Ethics | A link to an ethics protocol or guideline on the web. | | Re-usability |
| E | Download Link | A download link in cases where this is different from (A). | | Re-usability |
| F | Grant Number | If research is grant funded, this attribute is mandatory to prove deposit (this condition applies to NRF funded research). | | Re-usability |
| 7 | Contributor(s) | Additional contributions to the work. | R | Re-usability |
| 9 | Language | In South Africa, the language is almost exclusively English, but the element should be used if it is not. | O | Re-usability |
| 13 | Size | Size of the work - assists with download decisions. | O | Re-usability |
| 14 | Format | Format of the work - assists with discovery and download decisions. | O | Re-usability |
| 15 | Version | The version assigned by the owner of the work. This is not the preferred method of indicating versioning - see Related Identifier above. | O | Re-usability |
| 19 | Funding Reference | Information about financial support (funding) for the resource being registered. It is a best practice to supply funding information when financial support has been received. | O | Re-usability |

(1) Corresponds to the element ID in the DataCite Schema: https://goo.gl/r5jDRo
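
As an illustration of how the obligation levels in Table 1 could be checked before a submission, the following Python sketch reports which mandatory and recommended properties are missing from a metadata record. The key names (such as `identifier` and `resource_type`), the `check_record` function, the `grant_funded` flag and the sample values are assumptions made for this example only; they are not part of the DataCite schema or of any SAEON ODP interface.

```python
# Minimal sketch: check a metadata record against the obligations in Table 1.
# All key names below are hypothetical; they simply mirror the property names
# in the table and are not an official DataCite or SAEON ODP format.

MANDATORY = (
    "identifier", "creators", "title",
    "publisher", "publication_year", "resource_type",
)
RECOMMENDED = (
    "subject", "contributors", "date",
    "related_identifier", "description", "geo_location",
)


def _is_blank(value):
    """Treat None, empty strings and empty lists as 'not supplied'."""
    return value is None or value == "" or value == []


def check_record(record, grant_funded=False):
    """Return (missing_mandatory, missing_recommended) for a record dict.

    If grant_funded is True, the grant number (row F in Table 1) is treated
    as mandatory as well.
    """
    missing_mandatory = [k for k in MANDATORY if _is_blank(record.get(k))]
    missing_recommended = [k for k in RECOMMENDED if _is_blank(record.get(k))]
    if grant_funded and _is_blank(record.get("grant_number")):
        missing_mandatory.append("grant_number")
    return missing_mandatory, missing_recommended


# Made-up sample record, for illustration only.
record = {
    "identifier": "example-id-001",
    "creators": ["Example Author"],
    "title": "Example dataset title",
    "publisher": "Example Publisher",
    "publication_year": 2020,
    "resource_type": "Dataset",
    "subject": ["example keyword"],
    "description": "Example abstract.",
}

missing_m, missing_r = check_record(record)
print("Missing mandatory:", missing_m)
print("Missing recommended:", missing_r)
```

A record with an empty list of missing mandatory properties meets the minimum requirement described above. Missing recommended properties are reported separately so that they can be completed where the conditions of the submission call for them.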