DCAT-US Schema v3.0 — Dataset fields
Overview
Field-level reference for the DCAT-US v3.0 Dataset class. The Dataset is the primary unit of a federal data inventory.
Source
Category
Keywords
Details
See an error on this page or have other feedback? Email us at DataGovHelp@gsa.gov
Overview
The Dataset class describes an individual dataset or API. Most agency data inventory work happens at this level.
Required fields in v3.0: title, description, publisher, contactPoint, identifier
All other fields are Optional unless noted. Fields that were required in v1.1 but are no longer schema-required in v3.0 are noted below. Agencies should consult the current OMB policy guidance for any fields that remain required.
—
Transition from DCAT-US v1.1 to v3.0
If your agency is upgrading from v1.1, most of your existing dataset records will carry forward with minimal changes. The required fields are largely the same. The main things to be aware of are: modified no longer accepts repeating intervals; temporal and spatial now use structured objects instead of strings; accessLevel is replaced by accessRights; and license should move to the Distribution level. The table at the bottom of this page summarizes all field-level changes from v1.1.
—
Discovery and context
These fields are new in v3.0 and help users understand what a dataset is for, what it covers, and any limitations on its use. They are all optional but strongly encouraged for publicly visible datasets.
| Field | Required | Type | Description |
|---|---|---|---|
purpose |
Optional | string | The purpose of the dataset — why it was created and what problem it addresses. Example: "To provide comprehensive climate observations for research, planning, and decision-making." |
liabilityStatement |
Optional | string | A statement clarifying limitations of responsibility, qualifications on accuracy or completeness, or absence of endorsement by the publisher. Example: "This dataset is provided as-is without warranty. Users are responsible for determining fitness for their intended use." |
scopeNote |
Optional | string | A usage note clarifying what the dataset includes and excludes, or guidance on how it should be interpreted. Example: "This dataset contains raw observational data. For derived products such as monthly averages see related datasets." |
creator |
Optional | object | The person or organization that created the dataset, if different from the publisher. References the Agent class. Example: {"name": "National Climate Data Center"} |
provenance |
Optional | array of strings | Plain-language statements about the lineage of the dataset — how the data was collected, processed, and quality-controlled. Example: ["Data collected from automated weather stations at 2,500 locations.", "Quality control applied per WMO guidelines."] |
Core fields
| Field | Required | Type | Description |
|---|---|---|---|
title |
Mandatory | string | Human-readable name of the dataset. Should be in plain English with sufficient detail for search and discovery. Avoid acronyms. |
description |
Mandatory | string | Human-readable description with sufficient detail for a user to quickly understand whether the dataset is of interest. |
publisher |
Mandatory | object | The publishing entity and optionally its parent organization(s). References the Organization class. At a minimum, provide name. Optionally add prefLabel (preferred legal name), altLabel (acronym), and subOrganizationOf (array of parent organizations). |
contactPoint |
Mandatory | object or array | Contact person's name and email for the dataset. References the Kind class. Must include fn (full name) and hasEmail (formatted as mailto:name@agency.gov). Can be a single object or an array for multiple contacts. Optionally add tel and organization-name. |
identifier |
Mandatory | string | A unique identifier for the dataset. Must be unique across the agency's catalog and remain fixed over time. Use a persistent URI where possible. |
keyword |
Recommended | array of strings | Tags to help users discover the dataset. Include both technical and non-technical terms. (Required in v1.1) |
modified |
Recommended | string (ISO 8601) | Most recent date the dataset was changed or updated. (Required in v1.1). Note for agencies upgrading from v1.1: If you currently use a repeating interval like R/P1D or R/P1Y in this field, that format is no longer valid in v3.0. Set modified to the actual date the data last changed (e.g., 2024-06-01) and use accrualPeriodicity to express update frequency. |
issued |
Optional | string (ISO 8601) | Date of formal release. |
language |
Optional | array of strings | ISO 639-1 two-letter language codes (e.g., en, es, fr). Note: v1.1 used RFC 5646 tags like en-US — v3.0 simplifies to two-letter codes only. |
landingPage |
Recommended | object | A Document object with a title and accessURL pointing to a human-friendly hub or landing page for all resources tied to the dataset. Example: {"@type": "Document", "title": "Dataset Homepage", "accessURL": "https://agency.gov/dataset"} |
theme |
Recommended | array of Concept objects | Each Concept requires a prefLabel. Example: [{"prefLabel": "Climate Science"}]. Can reference a ConceptScheme for controlled vocabulary linkage. |
accrualPeriodicity |
Optional | string | Accepts plain-language codes (e.g., daily, weekly, monthly, quarterly, annually, irregular), ISO 8601 repeating duration format (e.g., R/P1Y), or Dublin Core frequency terms. Plain-language codes are preferred. |
conformsTo |
Optional | array of Standard objects | Each Standard should include a title and optionally an identifier and issued date. Example: [{"@type": "Standard", "title": "ISO 19115", "identifier": "https://www.iso.org/standard/53798.html"}]. A dataset can conform to multiple standards. |
describedBy |
Recommended | object | A Distribution object describing the data dictionary. Example: {"@type": "Distribution", "title": "Data Dictionary", "downloadURL": "https://agency.gov/dict.pdf", "mediaType": "application/pdf"}. The describedByType field from v1.1 is no longer needed — express the format as mediaType within this Distribution object. |
Distribution and access
| Field | Required | Type | Description |
|---|---|---|---|
distribution |
Conditional | array of objects | A container for one or more Distribution objects describing how the dataset can be accessed or downloaded. Required if the dataset has an accessURL or downloadURL. See Distribution fields. In v3.0, license should be included on each Distribution object rather than at the Dataset level. If all distributions share the same license, add it to each one. |
sample |
Optional | array of objects | Links to sample distributions of the dataset. |
Access and rights
| Field | Required | Type | Description |
|---|---|---|---|
accessRights |
Optional | string | Information about who can access the dataset and under what conditions. In v1.1 this was the accessLevel field with three fixed values (public, restricted public, non-public). In v3.0 accessRights is a free-text string. The value public remains valid. For restricted datasets, write a plain-language explanation of the restriction and how to request access. |
rights |
Recommended | array of strings | Rights statements not already covered by license or accessRights — for example, copyright statements, attribution requirements, or policy restrictions on use. Example: ["Data is in the public domain under 17 USC 105.", "Please cite the National Climate Data Center when using this data."]. In v1.1 this was a single string with a 255-character limit. In v3.0 it is an array with no character limit. |
rightsHolder |
Optional | object | An agent owning or managing rights over the dataset. References the Agent class. |
license |
Optional | string (URL) | The license or public domain dedication status of the dataset. Provide as a URL. See Open Licenses. In v3.0, license is more correctly expressed at the Distribution level per W3C DCAT. Including it here at the dataset level is supported, but agencies are encouraged to move license to each Distribution object. |
Spatial and temporal
| Field | Required | Type | Description |
|---|---|---|---|
spatial |
Recommended | array of Location objects | Spatial coverage of the dataset. References the Location class. (Was a string in v1.1; now uses the Location class.) Minimum upgrade: {"@type": "Location", "prefLabel": "United States"}. For geospatial precision, add a bbox as a WKT string: "POLYGON((-125 24, -66 24, -66 50, -125 50, -125 24))" or as a GeoJSON Polygon object. |
temporal |
Recommended | array of PeriodOfTime objects | Temporal coverage of the dataset. References the PeriodOfTime class. Note for agencies upgrading from v1.1: The v1.1 string format (e.g., 2000-01-15T00:00:00Z/2010-01-15T00:00:00Z) is no longer valid. Use a PeriodOfTime object: {"@type": "PeriodOfTime", "startDate": "2000-01-15", "endDate": "2010-01-15"}. Open-ended periods are valid — you can omit either startDate or endDate. |
spatialResolutionInMeters |
Optional | string | A numeric value expressed as a string (e.g., "1000" for 1 kilometer resolution). |
temporalResolution |
Optional | string (ISO 8601 duration) | Minimum time period resolvable in the dataset. New in v3.0. |
Versioning and series
| Field | Required | Type | Description |
|---|---|---|---|
version |
Optional | string | The version indicator for this dataset. |
versionNotes |
Optional | string | A description of changes between this version and the previous version. |
hasCurrentVersion |
Optional | string (IRI) | A link to the current version of the dataset. |
hasVersion |
Optional | array | Links to known versions of the dataset. |
previousVersion |
Optional | string (IRI) | A link to the previous version of the dataset. |
replaces |
Optional | array | Datasets that this dataset replaces. |
inSeries |
Optional | array of DatasetSeries objects | A DatasetSeries to which this dataset belongs. See DatasetSeries fields. In v1.1, isPartOf was used to group datasets into a collection. In v3.0, use inSeries for temporal series membership, or document collection membership at the parent dataset level using hasPart. |
When to use which versioning field
- Use
inSerieswhen your dataset is one edition in a recurring sequence — for example, the FY2024 edition of an annual survey. Create a DatasetSeries record for the series as a whole and reference it here. See DatasetSeries fields. - Useversionwhen you are publishing a corrected or updated version of the same dataset — for example, version 2.1 of a reference file. Pair it withversionNotesto describe what changed. - UsehasVersionto link a dataset record to all its known versions, andhasCurrentVersionto point users to the latest one. - Usereplaceswhen a dataset supersedes one or more previous datasets entirely — not just an update, but a replacement. These fields can be used together. A dataset can belong to a series viainSeriesand also carry aversionnumber andversionNotes. —Provenance and quality
| Field | Required | Type | Description |
|---|---|---|---|
provenance |
Optional | array of strings | Plain-language statements about the lineage of the dataset — how the data was collected, processed, and quality-controlled. |
source |
Optional | array | Datasets from which this dataset was derived. |
wasGeneratedBy |
Optional | array | Activities that generated this dataset. References the Activity class. |
hasQualityMeasurement |
Optional | array | Quality measurements associated with the dataset. References the QualityMeasurement class. |
Federal government fields
These fields were introduced in DCAT-US v1.1 for federal agency use. They are not defined in the v3.0 core schema but agencies should continue populating them under existing OMB policy (M-13-13 and the OPEN Government Data Act) until updated guidance is issued. The v3.0 schema will not reject records that include them.
| Field | Required | Type | Description |
|---|---|---|---|
bureauCode |
Policy-dependent | array of strings | Agency and bureau code from OMB Circular A-11, Appendix C. Format: 015:11. |
programCode |
Policy-dependent | array of strings | Primary program related to this dataset, from the Federal Program Inventory. Format: 015:001. |
accessLevel |
Policy-dependent | string | The degree to which the dataset could be made publicly available. One of: public, restricted public, non-public. (Required in v1.1) |
primaryITInvestmentUII |
Optional | string | IT Unique Investment Identifier linking the dataset to an IT investment in Exhibit 53. |
systemOfRecords |
Optional | string (URL) | URL to the System of Records Notice (SORN) if this dataset is a Privacy Act system of records. |
dataQuality |
Optional | boolean | Whether the dataset meets the agency's Information Quality Guidelines (true or false). Not in v3.0 schema — use hasQualityMeasurement for structured quality reporting going forward. |
Example
A minimal Dataset record meeting the v3.0 required fields:
{
"@type": "Dataset",
"title": "Agency Travel Data FY2024",
"description": "Records of official travel expenditures by agency employees during fiscal year 2024, including destination, purpose, and cost.",
"identifier": "https://www.agency.gov/data/travel-fy2024",
"publisher": {
"@type": "Organization",
"name": "Office of the Chief Financial Officer",
"subOrganizationOf": [
{
"@type": "Organization",
"name": "Example Federal Agency"
}
]
},
"contactPoint": {
"@type": "Kind",
"fn": "Jane Smith",
"hasEmail": "mailto:jane.smith@agency.gov"
}
}
A more complete record showing commonly used fields:
{
"@type": "Dataset",
"title": "Agency Travel Data FY2024",
"description": "Records of official travel expenditures by agency employees during fiscal year 2024, including destination, purpose, and cost.",
"identifier": "https://www.agency.gov/data/travel-fy2024",
"publisher": {
"@type": "Organization",
"name": "Office of the Chief Financial Officer",
"subOrganizationOf": [
{
"@type": "Organization",
"name": "Example Federal Agency"
}
]
},
"contactPoint": {
"@type": "Kind",
"fn": "Jane Smith",
"hasEmail": "mailto:jane.smith@agency.gov",
"tel": "+1-202-555-0100",
"organization-name": "Office of the Chief Financial Officer"
},
"keyword": ["travel", "expenditure", "fiscal year", "FY2024"],
"modified": "2024-10-15",
"accrualPeriodicity": "annually",
"accessRights": "public",
"purpose": "To provide transparency into federal employee travel expenditures.",
"temporal": [
{
"@type": "PeriodOfTime",
"startDate": "2023-10-01",
"endDate": "2024-09-30"
}
],
"spatial": [
{
"@type": "Location",
"prefLabel": "United States"
}
],
"distribution": [
{
"@type": "Distribution",
"title": "FY2024 Travel Data (CSV)",
"description": "Full travel records as a comma-separated values file.",
"downloadURL": "https://www.agency.gov/data/travel-fy2024.csv",
"mediaType": "text/csv",
"format": "CSV",
"license": "https://creativecommons.org/publicdomain/zero/1.0/"
},
{
"@type": "Distribution",
"title": "Travel Data REST API",
"description": "A fully queryable REST API returning JSON.",
"accessURL": "https://api.agency.gov/travel/",
"format": "API"
}
],
"bureauCode": ["015:11"],
"programCode": ["015:001"],
"accessLevel": "public"
}
Changes from v1.1
| v1.1 Field | Status in v3.0 | Action |
|---|---|---|
accessLevel |
Replaced by accessRights |
Use a free-text string. The value public remains valid. For restricted datasets write a plain-language explanation. |
modified with repeating intervals (e.g., R/P1D) |
Format no longer valid | Set to the actual date the data last changed (e.g., 2024-06-01). Use accrualPeriodicity for update frequency. |
temporal as an ISO 8601 string |
Format no longer valid | Use a PeriodOfTime object: {"@type": "PeriodOfTime", "startDate": "2000-01-15", "endDate": "2010-01-15"} |
spatial as a plain string or ad-hoc GeoJSON |
Format no longer valid | Use a Location object: {"@type": "Location", "prefLabel": "United States"} |
language with RFC 5646 tags (e.g., en-US) |
Format no longer valid | Use two-letter ISO 639-1 codes only (e.g., en, es). |
isPartOf |
Replaced | Use inSeries for temporal series membership or hasPart at the parent dataset level for generic collections. |
describedByType |
Absorbed into describedBy |
Express the format as mediaType within the describedBy Distribution object. No separate field needed. |
bureauCode |
Not in v3.0 core schema | Continue populating per existing OMB policy guidance until updated guidance is issued. |
programCode |
Not in v3.0 core schema | Continue populating per existing OMB policy guidance until updated guidance is issued. |
dataQuality |
Not in v3.0 core schema | Use hasQualityMeasurement for structured quality reporting, or note conformance in scopeNote. |
Source: jsonschema/definitions/Dataset.json · Generated reference: jsonschema/docs/Dataset.md