DCAT-US 3.0: Dataset

Information about a dataset, including identifiers, contacts, coverage, distributions, and related resources.

A collection of data published or curated by one provider

  • Type: object
  • Additional properties: Any type allowed

Example:

{
    "@type": "Dataset",
    "title": "Daily Climate Observations 2024",
    "description": "Daily temperature, precipitation, and wind measurements from monitoring stations across the United States.",
    "identifier": "https://example.gov/datasets/climate-observations-2024",
    "contactPoint": {
        "fn": "Climate Data Support",
        "hasEmail": "mailto:climate@example.gov"
    },
    "publisher": {
        "name": "National Climate Data Center"
    },
    "keyword": [
        "climate",
        "weather",
        "temperature",
        "precipitation"
    ],
    "issued": "2024-01-15",
    "modified": "2024-06-01",
    "accrualPeriodicity": "daily",
    "accessRights": "public",
    "landingPage": {
        "@id": "https://example.gov/climate-data",
        "@type": "Document",
        "title": "Climate Data Landing Page"
    },
    "describedBy": {
        "@id": "https://example.gov/climate-data/data-dictionary",
        "@type": "Distribution",
        "title": "Data Dictionary",
        "mediaType": "application/pdf"
    },
    "spatial": {
        "@type": "Location",
        "bbox": {
            "type": "Polygon",
            "coordinates": [
                [
                    [
                        -125.0,
                        24.0
                    ],
                    [
                        -66.0,
                        24.0
                    ],
                    [
                        -66.0,
                        50.0
                    ],
                    [
                        -125.0,
                        50.0
                    ],
                    [
                        -125.0,
                        24.0
                    ]
                ]
            ]
        }
    },
    "theme": [
        "Climate Science"
    ],
    "distribution": [
        {
            "title": "Climate Data CSV",
            "downloadURL": "https://example.gov/downloads/climate-2024.csv",
            "mediaType": "text/csv"
        }
    ],
    "rights": [
        "Data is provided as-is without warranty. Please cite the National Climate Data Center when using this data."
    ],
    "temporal": [
        {
            "@type": "PeriodOfTime",
            "startDate": "2024-01-01",
            "endDate": "2024-12-31"
        }
    ]
}
Property Type Requirement Level Title/Description
contactPoint More than one type Mandatory A contact point for questions about the Dataset (single contact or list). Include an email address that is continuously monitored
description string Mandatory Plain-language summary of the dataset
identifier null or Identifier Mandatory The unique identifier for the Dataset, e.g. the URI or other unique identifier in the context of the Catalog
title string Mandatory Human-readable title of the dataset
describedBy null or Distribution Recommended A distribution describing the Data Dictionary for this dataset
distribution null or array of Distribution classes Recommended List of available distributions for the dataset. This can be omitted when no distribution is available yet.
keyword null or array of string Recommended List of keywords or tags describing the dataset
landingPage null or Document Recommended A web page from the original data provider that gives access to the Dataset, its Distributions, and related information
modified null or object Recommended Most recent date when the dataset’s actual data changed, not just metadata
publisher null or Organization Recommended Organization responsible for publishing and making the dataset available
rights null or array of string Recommended Rights statements not already covered by license or accessRights, such as copyright or policy restrictions
spatial More than one type Recommended A geographic region or regions that are covered by the Dataset
temporal null or array of PeriodOfTime classes Recommended Time periods covered by the dataset
theme null or array of Concept classes Recommended List of themes or categories for the dataset
@id string Optional  
@type string Optional  
accessRights null or string Optional Information about whether the dataset is publicly accessible, restricted, or not public
accrualPeriodicity More than one type Optional The frequency at which the Dataset is updated
category null or array of Concept classes Optional List of high-level categories for the dataset
conformsTo null or array of Standard classes Optional List of standards, schemas, or profiles the dataset follows
contributor null or array of Agent classes Optional List of agents contributing to the Dataset
created null or object Optional The date on which the Dataset was first created
creator null or Agent Optional Person or organization responsible for creating the dataset
first null or Dataset Optional the first item of the sequence the dataset belongs to
hasCurrentVersion null or Dataset Optional reference to the current (latest) version of a dataset
hasPart null or array of Dataset classes Optional List of related datasets that are part of the described dataset
hasQualityMeasurement null or array of QualityMeasurement classes Optional List of quality measurements for the dataset (for example, completeness, accuracy, or timeliness) beyond spatial or temporal resolution
hasVersion null or array of Dataset classes Optional List of related Datasets that are a version, edition, or adaptation of the described Dataset
image null or string Optional Thumbnail image illustrating the dataset, especially useful for visual data such as maps, photos, or video
inSeries null or array of DatasetSeries classes Optional Dataset series this dataset belongs to
isReferencedBy null or array of string Optional List of links to related resources, such as publications, that reference, cite, or otherwise point to the Dataset
issued null or object Optional Date when the dataset was first published. If the exact publication date is unknown, use the date it was first referenced in the catalog.
language More than one type Optional ISO 639-1 language code values used in the dataset text or metadata, such as en or es, full list can be seen at https://id.loc.gov/vocabulary/iso639-1.html
liabilityStatement null or string Optional A liability statement about the dataset that may clarify limitations of responsibility, qualifications on the accuracy, reliability, and completeness of the data, or absence of endorsement by the data publisher or provider, among other considerations
metadataDistribution null or array of Distribution classes Optional Distribution of the original metadata document this dataset was derived from
otherIdentifier null or array of Identifier classes Optional Additional identifiers for the dataset besides the main identifier, such as a DOI or other persistent ID
page null or array of Document classes Optional List of pages or documents about this dataset
previousVersion null or Dataset Optional reference to the previous dataset version
provenance null or array of string Optional List of statements about the lineage of a Dataset, including any changes in its ownership or custody since its creation that may be significant for its authenticity, integrity, or interpretation
purpose null or string Optional The purpose of the dataset
qualifiedAttribution null or array of Attribution classes Optional List of agents with specific responsibilities for the dataset
qualifiedRelation null or array of Relationship classes Optional Detailed relationship between the dataset and another resource, including the role of that relationship
relation null or array of string Optional List of links to related resources when the relationship is not otherwise specified
replaces null or array of Dataset classes Optional List of Datasets replaced by this Dataset
rightsHolder null or array of Organization classes Optional List of agents (organizations) holding rights on the Dataset
sample null or array of Distribution classes Optional List of sample distributions for the dataset
scopeNote null or string Optional usage note for the dataset
source null or array of Dataset classes Optional List of related Datasets from which the described Dataset is derived
spatialResolutionInMeters null or string Optional Smallest spatial distance between data points, in meters, represented as a single value
status null or Concept Optional Lifecycle status of the dataset, such as completed, deprecated, under development, or withdrawn
subject null or array of Concept classes Optional List of primary subjects for the dataset, usually narrower than broad theme categories
supportedSchema null or Dataset Optional supported schema for this dataset
temporalResolution null or string Optional Smallest time interval between data points, using xsd:duration format (for example, P1D)
version null or string Optional The version indicator (name or identifier) of a resource
versionNotes null or string Optional Notes describing how this version differs from earlier versions of the dataset
wasAttributedTo null or array of Agent classes Optional List of agents attributed to this dataset
wasGeneratedBy null or array of Activity classes Optional List of activities that generated, or provide the business context for the creation of the dataset
wasUsedBy null or array of Activity classes Optional List of activities that used the Dataset

Dataset > contactPoint #

Requirement: Mandatory

A contact point for questions about the Dataset (single contact or list). Include an email address that is continuously monitored

  • Type: combining
  • Required: Yes
  • Additional properties: Any type allowed

Any of:

Dataset > contactPoint > anyOf > Kind #

inline description of Kind

  • Type: object
  • Additional properties: Any type allowed
  • Same definition as: Kind

Dataset > contactPoint > anyOf > List of contacts #

  • Type: array of Kind classes

Each item of this array must be:

  • Kind: Contact information for an individual or entity

Dataset > description #

Requirement: Mandatory

Plain-language summary of the dataset

  • Type: string
  • Required: Yes

Examples:

"Daily temperature, precipitation, and wind measurements from monitoring stations across the United States."
"Comprehensive daily climate observations collected from monitoring stations across the United States, including temperature, precipitation, humidity, and wind measurements."

Dataset > identifier #

Requirement: Mandatory

The unique identifier for the Dataset, e.g. the URI or other unique identifier in the context of the Catalog

Dataset > title #

Requirement: Mandatory

Human-readable title of the dataset

  • Type: string
  • Required: Yes

Examples:

"Daily Climate Observations 2024"
"National Climate Observations 2024"

Dataset > describedBy #

Title: data dictionary

Requirement: Recommended

A distribution describing the Data Dictionary for this dataset

Dataset > distribution #

Title: dataset distribution

Requirement: Recommended

List of available distributions for the dataset. This can be omitted when no distribution is available yet.

Each item of this array must be:

  • Distribution: A specific representation of a dataset, such as a file, feed, or API response

Dataset > keyword #

Title: keyword/tag

Requirement: Recommended

List of keywords or tags describing the dataset

  • Type: null or array of string

Examples:

[
    "climate",
    "weather",
    "temperature",
    "precipitation"
]
[
    "climate",
    "weather",
    "temperature",
    "precipitation",
    "humidity",
    "wind",
    "meteorology"
]

Each item of this array must be:

Non-empty string #

  • Type: string

Restrictions:

  • Min length: 1

Dataset > landingPage #

Requirement: Recommended

A web page from the original data provider that gives access to the Dataset, its Distributions, and related information

Dataset > modified #

Title: last modified

Requirement: Recommended

Most recent date when the dataset’s actual data changed, not just metadata

  • Type: null or object

Examples:

"2024-06-01"
"2024-01-15T10:30:00Z"
"2024"
"2024-01"

Dataset > publisher #

Requirement: Recommended

Organization responsible for publishing and making the dataset available

Dataset > rights #

Requirement: Recommended

Rights statements not already covered by license or accessRights, such as copyright or policy restrictions

  • Type: null or array of string

Examples:

[
    "Data is provided as-is without warranty. Please cite the National Climate Data Center when using this data."
]
[
    "This data is in the public domain and may be used without restriction."
]

Each item of this array must be:

Array Item #

Full text of a statement of rights

  • Type: string

Dataset > spatial #

Title: spatial/geographic coverage

Requirement: Recommended

A geographic region or regions that are covered by the Dataset

  • Type: combining
  • Additional properties: Any type allowed

Any of:

Dataset > spatial > anyOf > Null allowed when not required #

  • Type: null

Dataset > spatial > anyOf > Location #

inline description of Location

  • Type: object
  • Additional properties: Any type allowed
  • Same definition as: Location

Dataset > spatial > anyOf > List of geographic regions #

Each item of this array must be:

  • Location: A named place or geographic area

Dataset > temporal #

Title: temporal coverage

Requirement: Recommended

Time periods covered by the dataset

Each item of this array must be:

  • PeriodOfTime: Information about a specific time period with a start- and/or end-time

Dataset > theme #

Title: theme/category

Requirement: Recommended

List of themes or categories for the dataset

  • Type: null or array of Concept classes

Each item of this array must be:

  • Concept: A controlled term or label, optionally drawn from a concept scheme

Dataset > @id #

Requirement: Optional

  • Type: string
  • Format: iri

Example:

"https://example.gov/datasets/national-climate-observations-2024"

Dataset > @type #

Requirement: Optional

  • Type: string
  • Default: "Dataset"

Dataset > accessRights #

Requirement: Optional

Information about whether the dataset is publicly accessible, restricted, or not public

  • Type: null or string

Examples:

"public"
"Public access with no restrictions. Data is freely available for download and use."

Dataset > accrualPeriodicity #

Title: frequency

Requirement: Optional

The frequency at which the Dataset is updated

  • Type: combining
  • Additional properties: Any type allowed

Example:

"daily"

Any of:

Dataset > accrualPeriodicity > anyOf > Null allowed when not required #

  • Type: null

Dataset > accrualPeriodicity > anyOf > item 1 #

ISO 19115 Maintenance Frequency code

  • Type: enum (of string)

Must be one of:

  • “continual”
  • “daily”
  • “weekly”
  • “fortnightly”
  • “monthly”
  • “quarterly”
  • “biannually”
  • “annually”
  • “asNeeded”
  • “irregular”
  • “notPlanned”
  • “unknown”

Dataset > accrualPeriodicity > anyOf > item 2 #

ISO-8601 Maintenance Frequency code for recurring values, see https://www.iso.org/standard/70907.html

  • Type: string

Restrictions:

  • Must match regular expression: ^R/P.+$ Test

Dataset > accrualPeriodicity > anyOf > item 3 #

Dublin Core Collection Frequency Vocabulary, see https://www.dublincore.org/specifications/dublin-core/collection-description/frequency/#vocabulary-terms

  • Type: enum (of string)

Must be one of:

  • “continuous”
  • “daily”
  • “weekly”
  • “biweekly”
  • “monthly”
  • “quarterly”
  • “semiannual”
  • “annual”
  • “irregular”
  • “triennial”
  • “biennial”
  • “threeTimesAYear”
  • “bimonthly”
  • “semimonthly”
  • “threeTimesAMonth”
  • “semiweekly”
  • “threeTimesAWeek”

Dataset > category #

Requirement: Optional

List of high-level categories for the dataset

  • Type: null or array of Concept classes

Each item of this array must be:

  • Concept: A controlled term or label, optionally drawn from a concept scheme

Dataset > conformsTo #

Requirement: Optional

List of standards, schemas, or profiles the dataset follows

  • Type: null or array of Standard classes

Each item of this array must be:

  • Standard: A standard or specification that another resource conforms to

Dataset > contributor #

Requirement: Optional

List of agents contributing to the Dataset

  • Type: null or array of Agent classes

Each item of this array must be:

  • Agent: A person, organization, software agent, or other entity involved with a resource

Dataset > created #

Title: creation date

Requirement: Optional

The date on which the Dataset was first created

  • Type: null or object

Examples:

"2024-01-01"
"2024-01-15T10:30:00Z"
"2024"
"2024-01"

Dataset > creator #

Requirement: Optional

Person or organization responsible for creating the dataset

Dataset > first #

Requirement: Optional

the first item of the sequence the dataset belongs to

Dataset > hasCurrentVersion #

Title: current version

Requirement: Optional

reference to the current (latest) version of a dataset

Dataset > hasPart #

Requirement: Optional

List of related datasets that are part of the described dataset

  • Type: null or array of Dataset classes

Each item of this array must be:

  • Dataset: A collection of data published or curated by one provider

Dataset > hasQualityMeasurement #

Title: quality measurement

Requirement: Optional

List of quality measurements for the dataset (for example, completeness, accuracy, or timeliness) beyond spatial or temporal resolution

Each item of this array must be:

Dataset > hasVersion #

Requirement: Optional

List of related Datasets that are a version, edition, or adaptation of the described Dataset

  • Type: null or array of Dataset classes

Each item of this array must be:

  • Dataset: A collection of data published or curated by one provider

Dataset > image #

Requirement: Optional

Thumbnail image illustrating the dataset, especially useful for visual data such as maps, photos, or video

  • Type: null or string

Dataset > inSeries #

Requirement: Optional

Dataset series this dataset belongs to

Each item of this array must be:

  • DatasetSeries: A group of related datasets that are published separately

Dataset > isReferencedBy #

Requirement: Optional

List of links to related resources, such as publications, that reference, cite, or otherwise point to the Dataset

  • Type: null or array of string

Example:

[
    "https://example.gov/publications/climate-trends-2024"
]

Each item of this array must be:

  • Link: reference iri of Resource

reference iri of Resource

  • Type: string
  • Format: iri

Dataset > issued #

Title: release date

Requirement: Optional

Date when the dataset was first published. If the exact publication date is unknown, use the date it was first referenced in the catalog.

  • Type: null or object

Examples:

"2024-01-15"
"2024-01-15T10:30:00Z"
"2024"
"2024-01"

Dataset > language #

Requirement: Optional

ISO 639-1 language code values used in the dataset text or metadata, such as en or es, full list can be seen at https://id.loc.gov/vocabulary/iso639-1.html

  • Type: combining
  • Additional properties: Any type allowed

Example:

[
    "en"
]

Any of:

Dataset > language > anyOf > Null allowed when not required #

  • Type: null

Dataset > language > anyOf > Language code #

  • Type: string

Restrictions:

  • Max length: 2

Dataset > language > anyOf > List of languages #

  • Type: array of string

Each item of this array must be:

Language code #

  • Type: string

Restrictions:

  • Max length: 2

Dataset > liabilityStatement #

Requirement: Optional

A liability statement about the dataset that may clarify limitations of responsibility, qualifications on the accuracy, reliability, and completeness of the data, or absence of endorsement by the data publisher or provider, among other considerations

  • Type: null or string

Example:

"This dataset is provided as-is without warranty of any kind. Users are responsible for determining fitness for their intended use."

Dataset > metadataDistribution #

Requirement: Optional

Distribution of the original metadata document this dataset was derived from

Each item of this array must be:

  • Distribution: A specific representation of a dataset, such as a file, feed, or API response

Dataset > otherIdentifier #

Requirement: Optional

Additional identifiers for the dataset besides the main identifier, such as a DOI or other persistent ID

Each item of this array must be:

  • Identifier: A unique identifier and optionally it’s scheme and other relevant information

Dataset > page #

Title: documentation

Requirement: Optional

List of pages or documents about this dataset

  • Type: null or array of Document classes

Each item of this array must be:

  • Document: A publication or other document related to a resource

Dataset > previousVersion #

Requirement: Optional

reference to the previous dataset version

Dataset > provenance #

Requirement: Optional

List of statements about the lineage of a Dataset, including any changes in its ownership or custody since its creation that may be significant for its authenticity, integrity, or interpretation

  • Type: null or array of string

Example:

[
    "Data collected from automated weather stations deployed at 2,500 locations across the continental United States.",
    "Quality control procedures applied according to WMO guidelines."
]

Each item of this array must be:

Array Item #

Full text of the provenance statement

  • Type: string

Dataset > purpose #

Requirement: Optional

The purpose of the dataset

  • Type: null or string

Example:

"To provide comprehensive, high-quality climate observations for research, planning, and decision-making related to weather and climate."

Dataset > qualifiedAttribution #

Requirement: Optional

List of agents with specific responsibilities for the dataset

Each item of this array must be:

  • Attribution: A responsibility that an agent has for a resource

Dataset > qualifiedRelation #

Requirement: Optional

Detailed relationship between the dataset and another resource, including the role of that relationship

Each item of this array must be:

  • Relationship: Additional information about how one resource is related to another

Dataset > relation #

Title: related resource

Requirement: Optional

List of links to related resources when the relationship is not otherwise specified

  • Type: null or array of string

Example:

[
    "https://example.gov/datasets/historical-climate-averages"
]

Each item of this array must be:

  • Link: reference iri of Resource

reference iri of Resource

  • Type: string
  • Format: iri

Dataset > replaces #

Requirement: Optional

List of Datasets replaced by this Dataset

  • Type: null or array of Dataset classes

Each item of this array must be:

  • Dataset: A collection of data published or curated by one provider

Dataset > rightsHolder #

Requirement: Optional

List of agents (organizations) holding rights on the Dataset

Each item of this array must be:

  • Organization: An organization involved with a resource, including parent or child organizations

Dataset > sample #

Requirement: Optional

List of sample distributions for the dataset

Each item of this array must be:

  • Distribution: A specific representation of a dataset, such as a file, feed, or API response

Dataset > scopeNote #

Title: usage note

Requirement: Optional

usage note for the dataset

  • Type: null or string

Example:

"This dataset contains raw observational data. For derived products such as monthly averages or climate normals, see related datasets."

Dataset > source #

Title: data source

Requirement: Optional

List of related Datasets from which the described Dataset is derived

  • Type: null or array of Dataset classes

Each item of this array must be:

  • Dataset: A collection of data published or curated by one provider

Dataset > spatialResolutionInMeters #

Title: Spatial resolution (meters)

Requirement: Optional

Smallest spatial distance between data points, in meters, represented as a single value

  • Type: null or string

Example:

"1000"

Dataset > status #

Title: lifecycle status

Requirement: Optional

Lifecycle status of the dataset, such as completed, deprecated, under development, or withdrawn

Dataset > subject #

Requirement: Optional

List of primary subjects for the dataset, usually narrower than broad theme categories

  • Type: null or array of Concept classes

Each item of this array must be:

  • Concept: A controlled term or label, optionally drawn from a concept scheme

Dataset > supportedSchema #

Requirement: Optional

supported schema for this dataset

Dataset > temporalResolution #

Requirement: Optional

Smallest time interval between data points, using xsd:duration format (for example, P1D)

  • Type: null or string

Example:

"P1D"

Dataset > version #

Requirement: Optional

The version indicator (name or identifier) of a resource

  • Type: null or string

Example:

"2024.1"

Dataset > versionNotes #

Requirement: Optional

Notes describing how this version differs from earlier versions of the dataset

  • Type: null or string

Example:

"Initial release of 2024 climate observations data."

Dataset > wasAttributedTo #

Title: attribution

Requirement: Optional

List of agents attributed to this dataset

  • Type: null or array of Agent classes

Each item of this array must be:

  • Agent: A person, organization, software agent, or other entity involved with a resource

Dataset > wasGeneratedBy #

Requirement: Optional

List of activities that generated, or provide the business context for the creation of the dataset

  • Type: null or array of Activity classes

Each item of this array must be:

  • Activity: An activity related to creating, changing, or using a resource

Dataset > wasUsedBy #

Title: used by

Requirement: Optional

List of activities that used the Dataset

  • Type: null or array of Activity classes

Each item of this array must be:

  • Activity: An activity related to creating, changing, or using a resource

See Also: (related supporting classes)