Skip to content

Latest commit

 

History

History
205 lines (157 loc) · 19.6 KB

metadata-resources.md

File metadata and controls

205 lines (157 loc) · 19.6 KB
layout title permalink filename
default
Metadata Resources
/metadata-resources/
metadata-resources.md

This section provides further background and resources to assist agencies in implementing the common core metadata schema.

Template Data Files

Sample Data Files

Use Cases

A) Starting from a spreadsheet

Some agencies may initially build or maintain data catalogs through large spreadsheets. In order to then generate a data.json file from this, simply import the spreadsheet (in the form of a CSV file) into the Catalog Generator, translate the metadata as needed, and then generate an appropriately formatted JSON file.

B) Translating a data file from another schema

If your agency has a JSON catalog with a different metadata schema and need to convert it to the prescribed schema needed for the data.json file, one can import the file into Catalog Generator, adjust the metadata as needed and export the results again as the appropriately formatted JSON file.

C) Utilizing CKAN

CKAN is a popular, open-source data catalog. Even if an agency hasn't begun a data catalog, adopting CKAN can provide a robust and feature-rich solution with which to begin maintaining an internal or external catalog. In addition to providing an effective means of organizing the agency's data catalog, CKAN provides several opportunities for generating the needed data.json file. CKAN provides a convenient CSV export that can then be imported into the Catalog Generator for conversion into an appropriately formatted JSON file. Alternatively, CKAN provides a JSON API of the data catalog which can then be altered to provide an automatic update for the appropriately formatted JSON file.

D) Manual translating a data file from another system

Agencies may also manage their data catalogs through various database solutions such as Access, an asset management sytem, document management system, geoportal, or assorted proprietary systems. Regardless of which system an agency employs, it should include a fundamental export option which allows the catalog to be exported as a CSV file, which can then be imported into the Catalog Generator for conversation to an appropriately formatted JSON file. Depending on the solutions' varied further functionality, some of them may also provide APIs which can then be configured to maintain the data.json file. If not, they may instead support XML and JSON exports, which can then be converted to the proper format.

E) Convert from CSV or JSON to XML or RDFa lite.

In order to generate appropriately formatted XML or RDFa Lite files, simply import an appropriately formatted CSV or JSON file into the Catalog Generator, ensure that the metadata fields line up, and re-export the file.

"Common Core" Required Fields Equivalents

Field Data.gov CKAN RDFa Lite 1.1 Schema.org
Title Title title dcterms:title sdo:name
Description Description notes dcterms:description sdo:description
Tags Keywords tags dcat:keyword sdo:keywords
Last Update Date updated revision_timestamp dcterms:modified sdo:dateModified
Publisher Agency Name owner_org dcat:publisher sdo:publisher
Contact Name Contact Name maintainer foaf:Person sdo:Person
Contact Email Contact Email Address maintainer_email foaf:mbox n/a
Unique Identifier User Generated ID id dcterms:identifier n/a
Public Access Level n/a n/a n/a n/a

"Common Core" Required if Applicable Fields

Field Data.gov CKAN RDFa Lite 1.1 Schema.org
Data Dictionary Data Dictionary data_dict dcat:dataDictionary n/a
Download URL Access Point res_url dcat:accessURL sdo:contentUrl
Endpoint Access Point res_url dcat:webService * n/a
Format Media Format res_format dcterms:format sdo:encodingFormat
License Dataset license agreement URL license_id dcterms:license n/a
Spatial Geographic scope spatial dcterms:spatial ds:spatialCoverage
Temporal Period of Coverage n/a dcterms:temporal ds:temporalCoverage

Expanded Fields

Field Data.gov CKAN RDFa Lite 1.1 Schema.org
Release Date Date Released n/a dcterms:issued sdo:datePublished
Frequency Frequency n/a dcterms:accrualPeriodicity n/a
Language n/a n/a dcat:language sdo:inLanguage
Granularity Geographic Granularity n/a dcat:granularity n/a
Data Quality Data Quality n/a xsd:boolean n/a
Category Subject Area groups dcat:theme sdo:about
Related Documents Reference for Technical Documentation n/a dcterms:references n/a
Size File Size n/a dcat:size sdo:contentSize
Homepage URL n/a url dcat:landingPage sdo:url
RSS Feed Access Point n/a dcat:feed * n/a
System of Records n/a n/a n/a n/a

*When combined with accessURL, format, and size.

Mapping DCAT to Other Metadata Specifications

Below is a set of mappings that will help users of other common metadata schemas map their content to the defined DCAT fields. The mapping is from the DCAT field to the XPATH(s) in the common metadata schema where the corresponding information may be found.

###FGDC CSDGM

Established by Office of Management and Budget Circular A-16, the Federal Geographic Data Committee (FGDC) promotes the coordinated development, use, sharing, and dissemination of geographic data. The Content Standard for Digital Geospatial Metadata aims to provide a common set of terminology and definitions for the documentation of digital geospatial data.

DCAT Field FGDC CSDGM
Title /metadata/idinfo/citation/citeinfo/title
Description /metadata/idinfo/descript/abstract
Tags /metadata/idinfo/keywords/theme/themekey
/metadata/idinfo/keywords/place/placekey
/metadata/idinfo/keywords/temporal/tempkey
Last Update /metadata/idinfo/citation/citeinfo/pubdate
Publisher /metadata/idinfo/ptcontac/cntinfo/cntorgp/cntorg
Contact Name /metadata/idinfo/ptcontac/cntinfo/cntorgp/cntper
Contact Email /metadata/idinfo/ptcontac/cntinfo/cntemail
Unique Identifier CSDGM does not include a unique identifier in the metadata itself. Common practice is to use a URL to the metadata document as unique identifier.
Public Access Level /metadata/idinfo/accconst
Data Dictionary /metadata/idinfo/keywords/theme/themekt
Download URL /metadata/idinfo/citation/citeinfo/onlink
Endpoint One of a number of places in FGDC may contain the endpoint to a web service. Catalogs typically parse these elements to find a web service endpoint:
/metadata/dataqual/lineage/srcinfo/srccite/citeinfo/onlink
/metadata/idinfo/crossref/citeinfo/onlink
/metadata/idinfo/citation/citeinfo/onlink
/metadata/metainfo/metextns/onlink
/metadata/distinfo/stdorder/digform/digtopt/onlinopt/computer/
networka/networkr
Format /metadata/spdoinfo/direct
License /metadata/distinfo/distliab
Spatial /metadata/idinfo/spdom/bounding
Temporal /metadata/idinfo/timeperd/timeinfo (might be one date or range. If range, separate start/end with comma)

###ISO 19115:2003

ISO 19115:2003 defines the schema required for describing geographic information and services. It provides information about the identification, the extent, the quality, the spatial and temporal schema, spatial reference, and distribution of digital geographic data.

DCAT Field ISO 19115:2003
Title /gmd:MD_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:citation/gmd:CI_Citation/gmd:title/gco:CharacterString
Description /gmd:MD_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:abstract/gco:CharacterString
Tags /gmd:MD_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:topicCategory/gmd:MD_TopicCategoryCode
/gmd:MD_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:descriptiveKeywords/gmd:MD_Keywords/gmd:keyword/gco:CharacterString
Last Update /gmd:MD_Metadata/gmd:dateStamp/gco:Date
/gmd:MD_Metadata/gmd:dateStamp/gco:DateTime
Publisher /gmd:MD_Metadata/gmd:contact/gmd:CI_ResponsibleParty/gmd:organisationName/gco:CharacterString
Contact Name /gmd:MD_Metadata/gmd:contact/gmd:CI_ResponsibleParty/gmd:individualName/gco:CharacterString
Contact Email /gmd:MD_Metadata/gmd:contact/gmd:CI_ResponsibleParty/gmd:contactInfo/gmd:CI_Contact/gmd:address/gmd:CI_Address/gmd:electronicMailAddress/gco:CharacterString
Unique Identifier /gmd:MD_Metadata/gmd:fileIdentifier/gco:CharacterString
Public Access Level /gmd:MD_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:resourceConstraints/gmd:MD_Constraints/gmd:useLimitation/gco:CharacterString
Data Dictionary
Download URL /gmd:MD_Metadata/gmd:distributionInfo/gmd:MD_Distribution/gmd:transferOptions/gmd:MD_DigitalTransferOptions/gmd:onLine/gmd:CI_OnlineResource/gmd:linkage/gmd:URL
Format /gmd:MD_Metadata/gmd:distributionInfo/gmd:MD_Distribution/gmd:distributionFormat/gmd:MD_Format/name/gco:CharacterString
License
Spatial /gmd:MD_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:extent/gmd:EX_Extent/gmd:geographicElement/gmd:EX_GeographicBoundingBox
Temporal /gmd:MD_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml:TimePeriod/gml:begin/gml:TimeInstant/gml:timePosition
/gmd:MD_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml32:TimePeriod/gml32:begin/gml32:TimeInstant/gml32:timePosition
/gmd:MD_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml:TimePeriod/gml:beginPosition
/gmd:MD_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml32:TimePeriod/gml32:beginPosition

/gmd:MD_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml:TimePeriod/gml:end/gml:TimeInstant/gml:timePosition
/gmd:MD_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml32:TimePeriod/gml32:end/gml32:TimeInstant/gml32:timePosition
/gmd:MD_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml:TimePeriod/gml:endPosition
/gmd:MD_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml32:TimePeriod/gml32:endPosition

ISO 19119:2005

ISO 19119:2005 identifies and defines the architecture patterns for service interfaces used for geographic information, defines its relationship to the Open Systems Environment model, presents a geographic services taxonomy and a list of example geographic services placed in the services taxonomy. It also prescribes how to create a platform-neutral service specification, how to derive conformant platform-specific service specifications, and provides guidelines for the selection and specification of geographic services from both platform-neutral and platform-specific perspectives.

DCAT Field ISO 19119:2005
Title /gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/gmd:citation/gmd:CI_Citation/gmd:title/gco:CharacterString
Description /gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/gmd:abstract/gco:CharacterString
Tags /gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/gmd:descriptiveKeywords/gmd:MD_Keywords/gmd:keyword/gco:CharacterString
Last Update /gmd:MD_Metadata/gmd:dateStamp/gco:Date
/gmd:MD_Metadata/gmd:dateStamp/gco:DateTime
Publisher /gmd:MD_Metadata/gmd:contact/gmd:CI_ResponsibleParty/gmd:organisationName/gco:CharacterString
Contact Name /gmd:MD_Metadata/gmd:contact/gmd:CI_ResponsibleParty/gmd:individualName/gco:CharacterString
Contact Email /gmd:MD_Metadata/gmd:contact/gmd:CI_ResponsibleParty/gmd:contactInfo/gmd:CI_Contact/gmd:address/gmd:CI_Address/gmd:electronicMailAddress/gco:CharacterString
Unique Identifier /gmd:MD_Metadata/gmd:fileIdentifier/gco:CharacterString
Public Access Level /gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/gmd:resourceConstraints/gmd:MD_Constraints/gmd:useLimitation/gco:CharacterString
Data Dictionary
Endpoint /gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/srv:containsOperations/srv:SV_OperationMetadata/srv:connectPoint/gmd:CI_OnlineResource/gmd:linkage/gmd:URL
Format /gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/srv:serviceType/gco:LocalName
License
Spatial /gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/srv:extent/gmd:EX_Extent/gmd:geographicElement/gmd:EX_GeographicBoundingBox
Temporal /gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/srv:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml:TimePeriod/gml:begin/gml:TimeInstant/gml:timePosition
/gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/srv:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml32:TimePeriod/gml32:begin/gml32:TimeInstant/gml32:timePosition
/gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/srv:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml:TimePeriod/gml:beginPosition
/gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/srv:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml32:TimePeriod/gml32:beginPosition

/gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/srv:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml:TimePeriod/gml:end/gml:TimeInstant/gml:timePosition
/gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/srv:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml32:TimePeriod/gml32:end/gml32:TimeInstant/gml32:timePosition
/gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/srv:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml:TimePeriod/gml:endPosition
/gmd:MD_Metadata/gmd:identificationInfo/srv:SV_ServiceIdentification/srv:extent/gmd:EX_Extent/gmd:temporalElement/gmd:EX_TemporalExtent/gmd:extent/gml32:TimePeriod/gml32:endPosition"

Namespaces

The above XPATHs assume the following namespaces:

Prefix URI
gmd http://www.isotc211.org/2005/gmd
gco http://www.isotc211.org/2005/gco
gml http://www.opengis.net/gml
gml32 http://www.opengis.net/gml/3.2
srv http://www.isotc211.org/2005/srv