This dataset is included in the NDE Dataset Register
What data are there in this dataset?
The open data consists of the metadata of 4,450,903 historical documents from National Archives, containing 4,172,557 historical personal references. The pie chart shows the number of person entries per source type.
How can the dataset be downloaded?
Harvesting (A2A/XML)
The National Archives historical document metadata dataset is provided as an OAI-PMH metadata feed. This feed is an implementation of an OAI-PMH data provider, based on OAI-PMH 2.0. The metadata is structured according to the Archives 2 All (A2A) model.
The URLs to harvests this dataset (set = ghn):
Download (A2A/XML)
The dataset can be downloaded in A2A / XML format as a compressed file (.tar.gz):
- DownloadThis file with 4,157,825 records is created on December 18, 2024, the file size is 9,410 GB (uncompressed) / 927 GB (compressed).
Download (linked data in 'a2a vocabulaire')
The dataset can be downloaded in N-triples format as a compressed file (.ttl.gz.):
- DownloadThis file was created on January 17, 2024 and has a file size of 4 MB (compressed).
Download (CSV)
The dataset can be downloaded in parts in CSV format as a compressed file (.csv.gz):
- DownloadFile with data of source type json. This file with 5,613 lines was created on January 17, 2024, the file size is 3 MB (uncompressed) / 381 KB (compressed).
- DownloadFile with data of source type json. This file with 1,341 lines was created on January 17, 2024, the file size is 823 KB (uncompressed) / 86 KB (compressed).
License
This dataset is offered by Open Archives under a Creative Commons 0 license (Public Domain Dedication). This means that you can copy, modify, distribute and execute the work, even for commercial purposes, without asking permission. It is appreciated if you contact Open Archives if you use this dataset, maybe we can do even more for each other!
The dataset can contain URLs to images and images of third-party viewers, the CC0-license does not apply to those items! Check the website of the relevant archive which license is linked to that material.
For questions and comments about this dataset, please contact Open Archives.
Datasetbeschrijving in JSON-LD
{
"@context": "https://schema.org/",
"@type": "Dataset",
"@id": "https://www.openarchieven.nl/id/dataset_ghn",
"identifier": "https://www.openarchieven.nl/id/dataset_ghn",
"name": [
{
"@value": "Dataset genealogische metadata Nationaal Archief via Open Archieven",
"@language": "nl"
},
{
"@value": "Dataset genealogical metadata National Archives via Open Archives",
"@language": "en"
}
],
"mainEntityOfPage": "https://www.openarchieven.nl/datasets/ghn",
"thumbnailUrl": "https://www.openarchieven.nl/img/search/ghn-oa-en.png",
"inLanguage": "nl-NL",
"isAccessibleForFree": true,
"spatialCoverage": [
{
"@value": "Nederland",
"@language": "nl"
},
{
"@value": "Netherlands",
"@language": "en"
}
],
"distribution": [
{
"@type": "DataDownload",
"@id": "https://www.openarchieven.nl/.well-known/genid/distribution-ghn-oai-pmh",
"contentUrl": "https://api.openarchieven.nl/oai-pmh/?verb=ListRecords&metadataPrefix=oai_a2a&set=ghn",
"encodingFormat": "text/xml",
"genre": "OAI-PMH-endpoint",
"description": [
{
"@value": "OAI-PMH endpoint met XML gebaseerd op het A2A model",
"@language": "nl"
},
{
"@value": "OAI-PMH endpoint with XML based on the A2A model",
"@language": "en"
}
]
},
{
"@type": "DataDownload",
"@id": "https://www.openarchieven.nl/.well-known/genid/distribution-ghn-nt",
"contentUrl": "https://oa-export.s3.nl-ams.scw.cloud/nt/ghn.nt.gz",
"description": [
{
"@value": "Gecomprimeerd N-triples bestand",
"@language": "nl"
},
{
"@value": "Compressed N-triples file",
"@language": "en"
}
],
"contentSize": "4 MB",
"inLanguage": "nl-NL",
"dateModified": "2024-01-17",
"license": "http://creativecommons.org/publicdomain/zero/1.0/",
"encodingFormat": "application/n-triples+gzip"
}
],
"keywords": [
{
"@value": "Historische persoonsvermeldingen",
"@language": "nl"
},
{
"@value": "Genealogie",
"@language": "nl"
},
{
"@value": "DTB Dopen",
"@language": "nl"
},
{
"@value": "DTB Trouwen",
"@language": "nl"
},
{
"@value": "Historical personal data",
"@language": "en"
},
{
"@value": "Genealogy",
"@language": "en"
},
{
"@value": "Baptisms",
"@language": "en"
},
{
"@value": "Marriages (church)",
"@language": "en"
}
],
"description": [
{
"@value": "De open data bestaat uit de metadata van 4.450.903 akten van Nationaal Archief, met daarop 4.172.557 historische persoonsvermeldingen. De brontypes omvatten dtb dopen, dtb trouwen. Deze dataset kan doorzocht worden via https://www.openarchieven.nl/ghn",
"@language": "nl"
},
{
"@value": "The open data consists of metadata from 4,450,903 records of National Archives, with 4.172.557 historical person observations. The source types included baptisms, marriages (church). This dataset can be searched via https://www.openarchieven.nl/ghn",
"@language": "en"
}
],
"license": "http://creativecommons.org/publicdomain/zero/1.0/",
"publisher": {
"@id": "https://www.openarchieven.nl/",
"@type": "Organization",
"name": [
{
"@language": "nl",
"@value": "Open Archieven"
},
{
"@language": "en",
"@value": "Open Archives"
}
],
"logo": "https://static.openarchieven.nl/img/oa/logo200x200-nl.png",
"contactPoint": {
"@id": "https://www.openarchieven.nl/contact",
"@type": "ContactPoint",
"name": [
{
"@value": "Databeheerder Open Archieven",
"@language": "nl"
},
{
"@value": "Datamanager Open Archives",
"@language": "en"
}
],
"email": "data@openarch.nl"
}
},
"creator": {
"@id": "https://www.nationaalarchief.nl/",
"@type": "Organization",
"name": "National Archives",
"identifier": "NL-HaNA"
},
"includedInDataCatalog": {
"@id": "https://www.openarchieven.nl/datasets/",
"name": "Open Archieven datasets"
},
"isBasedOn": [
"https://www.nationaalarchief.nl/onderzoeken/open-data/open-data-indexen"
],
"dateCreated": "2024-01-17",
"dateModified": "2024-01-17"
}