Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Data Structure Profile

Resources:

This profile is focused on the definition of a physical or logical dataset structure in a way that can be packaged and reused for documenting different datasets that have the same structure, for instance periodically released statistics reported in the same format.

This profile adds DDI-CDI properties for describing a data structure in terms of DataStructureComponents and RepresentedVariables, primary and foreign keys, and mapping of components to their positions in a physical dataset. Value domains for variables are specified in the same way as in the Data Description profile. It has schema for Wide, Long and Dimensional datastructures. The implementation target is an rdf serialization, which is an open world logical model; users are thus free to add additional properties that they find useful for dataset documentation in their community, but these can be ignored by other users without penalty.

Requirements:

TBD a DataStructure class that defines the file format mappings for the data structure components in a physical implemenation; The only things that the InstanceVariables can modify in datasets using the resusable DataStructure are the labels for the variables and the physicalDataType.

See graphical presentation of Data Structure Profile

Implementation

The current recommended implementation uses the schema.org vocabulary, with a few entities and properties from other vocabularies to fill gaps; see Implementation of metadata content items. For background on JSON, JSON-LD and general implementation patters CDIF is using, see Schema.org implementation notes.