Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
saeon_preservation_policy [2022/07/11 09:55] – lindsay | saeon_preservation_policy [2022/09/16 10:44] (current) – lindsay | ||
---|---|---|---|
Line 14: | Line 14: | ||
%%This policy may be revised if the framework governing the SAEON ODP changes.%% | %%This policy may be revised if the framework governing the SAEON ODP changes.%% | ||
+ | |||
+ | ==== Definitions ==== | ||
+ | |||
+ | **AIP** - Archival Information Package: This is a package containing data and the metadata that describes it. It is created by a data curator from the Submission Information Package (SIP) supplied by the data provider, with the addition of any necessary format migrations or additional information added to the metadata. | ||
+ | |||
+ | **DOI** - Digital Object Identifier: A Digital Object Identifier (DOI) is a unique persistent identifier assigned to an object. This links to the metadata record for the object as well as to a digital location, where details about the object can be found. SAEON makes use of DataCite’s DOI system. | ||
+ | |||
+ | **Data curators**: The uLwazi team members responsible for the management of data throughout its lifecycle, from ingestion to dissemination and long-term archiving. | ||
+ | |||
+ | **Data providers**: | ||
+ | |||
+ | **DSI** - Department of Science and Innovation: A South African government department whose mission is to provide leadership, an enabling environment, | ||
+ | |||
+ | **FAIR** - Findable, Accessible, Interoperable, | ||
+ | |||
+ | **NRF** - National Research Foundation: The mandate of the National Research Foundation (NRF) is to promote and support research through funding, human resource development and the provision of the necessary research facilities in order to facilitate the creation of knowledge, innovation and development in all fields of science and technology, including indigenous knowledge, and thereby contribute to the improvement of the quality of life of all South Africans. | ||
+ | |||
+ | **OAIS** - Open Archival Information System Reference Model: SAEON makes use of the Reference Model for an Open Archival Information System (OAIS) , developed by The Consultative Committee for Space Data Systems (CCSDS), as a best-practice standard to work towards. | ||
+ | |||
+ | **QA** - Quality Assurance: In this context quality assurance is performed by data curators to ensure that the SIPs provided by the data providers contain data that falls within SAEON’s collection policy in an acceptable format and that sufficient metadata has been provided to describe the data. | ||
+ | |||
+ | **QC** - Quality Control: | ||
+ | |||
+ | **SAEON** - South African Environmental Observation Network: The South African Environmental Observation Network (SAEON) is a business unit of the NRF and serves as a national platform for detecting, translating and predicting environmental change through scientifically designed observation systems and research. SAEON also captures and makes long-term datasets freely accessible, and runs an education outreach programme. SAEON has six nodes dispersed geographically across the country. | ||
+ | |||
+ | **SAEON ODP** - SAEON Open Data Platform: The Open Data Platform is SAEON’s system of systems that includes a number of data and metadata infrastructures and community portals that are customised for particular stakeholder communities. | ||
+ | |||
+ | **SLA** - Service Level Agreement: These detail the agreements between SAEON and its stakeholders and define the roles and responsibilities of both parties, the service levels and issue resolution procedures and the duration of the agreements. | ||
+ | |||
+ | **SIP** - Submission Information Package: This is the package of data and metadata that the data provider sends to SAEON for archiving and publishing. | ||
+ | |||
+ | **TRAC** - Trustworthy Repositories Audit & Certification: | ||
+ | |||
+ | **uLwazi**: The uLwazi node is one of the seven nodes of the South African Environmental Observation Network (SAEON). uLwazi means ‘knowledge’ in Nguni languages. The SAEON uLwazi node is made up of four teams, Infrastructure Management, Systems Development, | ||
==== Background Information on the SAEON Open Data Platform (ODP) ==== | ==== Background Information on the SAEON Open Data Platform (ODP) ==== | ||
Line 44: | Line 78: | ||
* File System for managing text files, images, video, audio - for any other digital object or unstructured data. | * File System for managing text files, images, video, audio - for any other digital object or unstructured data. | ||
- | At this stage the SIP may be reassigned to a data curator with the relevant domain expertise. The purpose of this QA step is to check if any information is needed from the data provider prior to publishing the dataset. If no further data management actions are required, an Archival Information Package (AIP) is generated and passed on to anotherdata | + | At this stage the SIP may be reassigned to a data curator with the relevant domain expertise. The purpose of this QA step is to check if any information is needed from the data provider prior to publishing the dataset. If no further data management actions are required, an Archival Information Package (AIP) is generated and passed on to another data curator for Quality Control (QC) and publication. |
The Archival Information Package (AIP) generation is initiated by uploading the data in the SIP into the correct data store. If the data are not in the correct format for long-term preservation, | The Archival Information Package (AIP) generation is initiated by uploading the data in the SIP into the correct data store. If the data are not in the correct format for long-term preservation, | ||
Line 88: | Line 122: | ||
The version control of the data is conducted through metadata using the DataCite schema, which makes use of related identifiers to link versions that are derived from the AIP, or which provide major or minor versions of the original AIP. Changes in datasets will trigger a major version whereas additional details about the dataset will trigger a minor version. | The version control of the data is conducted through metadata using the DataCite schema, which makes use of related identifiers to link versions that are derived from the AIP, or which provide major or minor versions of the original AIP. Changes in datasets will trigger a major version whereas additional details about the dataset will trigger a minor version. | ||
- | === Data migration | + | === Data transformation |
- | All Data providers are required to comply with the SAEON Data Policy which grants SAEON staff the necessary rights to migrate | + | All Data providers are required to comply with the SAEON Data Policy which grants SAEON staff the necessary rights to convert |
=== Data retention === | === Data retention === | ||
Line 98: | Line 132: | ||
=== Data retention checklist === | === Data retention checklist === | ||
- | The checklist in Table 2, adapted from the Natural Environment Research Council (NERC) data value checklist, | + | The checklist in Table 2, adapted from the Natural Environment Research Council (NERC) data value checklist, is intended to guide decision-making on data accessioning and data retention. If any of the legal considerations are applicable then the data must be accessioned and retained, and if any of the criteria in the other sections are applicable then the data should probably be accessioned and retained. |
__Table 2: Data retention checklist adapted from NERC (1)__ | __Table 2: Data retention checklist adapted from NERC (1)__ | ||
Line 122: | Line 156: | ||
(1. NERC. (n.d.). Data value checklist. [online] Available at: [[https:// | (1. NERC. (n.d.). Data value checklist. [online] Available at: [[https:// | ||
- | === Data access | + | === Data access === |
SAEON is committed to the principles of free and open access and, in the interest of keeping the ODP as accessible as possible. However, if datasets are listed under restrictive licenses then registration will be required to access the data and the data user will need to confirm that they are aware of the restrictions on the use of the data. | SAEON is committed to the principles of free and open access and, in the interest of keeping the ODP as accessible as possible. However, if datasets are listed under restrictive licenses then registration will be required to access the data and the data user will need to confirm that they are aware of the restrictions on the use of the data. | ||
Line 132: | Line 166: | ||
SAEON currently receives a portfolio of funding for development and maintenance of its research data infrastructure. Should SAEON fail to receive long-term funding in the future, the National Research Foundation (NRF), its host organisation, | SAEON currently receives a portfolio of funding for development and maintenance of its research data infrastructure. Should SAEON fail to receive long-term funding in the future, the National Research Foundation (NRF), its host organisation, | ||
- | ==== Preservation Policy Review | + | ==== Preservation Policy Review ==== |
This policy is to be reviewed on an annual basis, or as needed, at the discretion of the SAEON Data Committee or Managing Director. | This policy is to be reviewed on an annual basis, or as needed, at the discretion of the SAEON Data Committee or Managing Director. | ||