HSN personal cards - SSD matching subset (hdl:10622/KAKABB)

View:

Part 1: Document Description
Part 2: Study Description
Part 3: Data Files Description
Entire Codebook

Document Description

Citation

Title:

HSN personal cards - SSD matching subset

Identification Number:

hdl:10622/KAKABB

Distributor:

IISH Data Collection

Date of Distribution:

2024-03-13

Version:

5

Bibliographic Citation:

Mourits, Rick J; Mandemakers, Kees, 2024, "HSN personal cards - SSD matching subset", https://hdl.handle.net/10622/KAKABB, IISH Data Collection, V5, UNF:6:JDL0D+QiFKHvV0qTDShREA== [fileUNF]

Study Description

Citation

Title:

HSN personal cards - SSD matching subset

Identification Number:

hdl:10622/KAKABB

Authoring Entity:

Mourits, Rick J (IISG)

Mandemakers, Kees (IISG)

Distributor:

IISH Data Collection

Access Authority:

International Institute of Social History

Depositor:

Mourits, Rick

Date of Deposit:

2024-03-11

Holdings Information:

https://hdl.handle.net/10622/KAKABB

Study Scope

Keywords:

Arts and Humanities, Social Sciences

Abstract:

The Netherlands has two key databases to study of social inequalities across the life course and over generations: the Historical Sample of the Netherlands (HSN) for persons born in the Netherlands from 1812 until 1922, and the System of Social statistical Datasets (SSD) with register data on all current inhabitants. The HSNDB and Statistics Netherlands (CBS) aimed to establish a Proof of Concept linkage of the HSN and SSD, allowing for the historical life trajectories to be linked forward to contemporary outcomes. For the Proof of Concept we tested whether linkage could be established on the basis of the combination of three birth dates (ego, father, mother), date of marriage, and sex. We developed an initial linking strategy which we validated and refined based on non-unique links and deviating information between HSN and SSD. Additionally, we established a link between the written (HSN) and coded (SSD) places of birth, and used this information in the validation process. The revised linking strategy results in linkage of 77% of the linkable HSN records. The stringent validation criteria of the linking steps and evaluation of the linked result appear to indicate that we provide a successful Proof of Concept for the linkage of the HSN and SSD as conducted by Statistics Netherlands. <br> <br> version 1: Original matching sample <br> version 2: Enriched matching sample <br> version 3: Added Id_mother + Id_father to HSN_RP_child_SSD <br> version 4: Minor update in the standardisation of place names <br> version 5: Bugfixes: (a) recovered children lost in version 2, (b) improved retrieval of children with two HSN RP parents, (c) added IDNR_child from version 1

Methodology and Processing

Sources Statement

Data Access

Notes:

I, hereinafter called the researcher, accept the following. In respect of the LINKS dataset based on the WieWasWie index of names in registrations, made available to him/her by the International Institute of Social History (hereinafter called: IISG) by the department of Data & Collection Management (hereinafter called: DCM) and the Centre for Family History (hereinafter called: CBG), that the researcher: 1 will consult the data incorporated in the dataset solely for the purpose of scientific research or statistics, and that these data are required for specific research purposes or specific statistics; 2 will provide the IISG with the title and aim of the research (maximum 50 words) for each dataset provided by the IISG and each research subject that will be undertaken by the researcher; 3 agrees that he/she will be included in the license register that is kept by the IISG with each licensed dataset and each specific research and this information will be shared with CBG; 4 agrees that he/she will keep the IISG informed about each new intended subject of research with each licensed dataset, so that the IISG is able to upkeep the license register; 5 agrees that the IISG will inform all involved researchers in case the same subject is investigated by more than one researcher or research group; 6 guarantees that he or she will strictly observe all the statutory regulations concerning the data contained in the dataset, including, in particular, the provisions contained in or based on General Data Protection Regulation (GDPR; Regulation EU 2016/679) as implemented by the Uitvoeringswet Algemene Verordening Gegevensbescherming (UAVG; Staatsblad 2018, 144); 7 is acquainted with the privacy regulations of the DCM and in using the dataset will strictly observe these regulations, in particular by warranting the anonymity of the data and guaranteeing that the results for which the data are used cannot be traced back to individual natural persons; 8 will not use the data in the dataset in order to search for additional information about individual persons, regardless of whether these persons are included in the dataset; except with written consent by the IISG; 9 recognizes that all the intellectual or industrial property rights in the dataset, the software, the hardware or other (accompanying) materials remain with IISG and or CBG or its licensors, and certifies that he or she will not reproduce or copy (parts of) the dataset, the software or other (accompanying) materials for other than internal use; 10 indemnifies IISG and CBG against all claims made by third parties, arising from violation by him or her of the statutory regulations concerning the data, including violation of the on General Data Protection Regulation (GDPR; Regulation EU 2016/679); 11 undertakes to keep secret the data in the dataset, not to divulge them to, put them at the disposal of or make them available for use by third parties, and to utilize them solely for the purpose for which IISG/CBG has provided them to him or her; 12 will not alter, add to or remove data in the dataset without the written consent of the IISG; 13 will transfer all enrichment to the dataset like coding or linking results to the DCM as soon as possible; 14 will report to DCM all errors, inconsistencies or ambiguities found by him or her as soon as possible; 15 in case of publication of the results in any form (presentations, books, articles etc.) for which he or she has used the dataset in a direct or indirect way, he or she will mention WieWasWie as source and administrator of the data and credit the IISG according to the citation rules of the DCM which will be provided by the DCM in the documentation belonging to each specific dataset; 16 notifying the DCM of all kind of publications (presentations, books, articles etc.) for which he or she has used the dataset and send a copy of each publication to DCM; 17 will pay to the IISG the amount of € nihil to compensate the costs of making the dataset available.

Other Study Description Materials

Related Publications

Citation

Identification Number:

10.5281/zenodo.7875707

Bibliographic Citation:

Van Toor, L., Claij-Swart, J., Van Gaalen, R., Mourits, R.J., & Zijdeman, R.L. (2022). End report 'Historical Sample of the Netherlands (HSN, task 2.3)' within the ODISSEI Roadmap project. Zenodo.

File Description--f33640

File: Adresses.tab

  • Number of cases: 146845

  • No. of variables per record: 9

  • Type of File: text/tab-separated-values

Notes:

UNF:6:IrMFf0Atc2ew1OQC3lgSLg==

File Description--f33638

File: HSN_RP_child_SSD_Id_NAMED.tab

  • Number of cases: 52746

  • No. of variables per record: 49

  • Type of File: text/tab-separated-values

Notes:

UNF:6:vSk0s5y+izBPcKwYOxYsdA==

File Description--f33635

File: HSN_RP_child_SSD_Id_no_days.tab

  • Number of cases: 52746

  • No. of variables per record: 37

  • Type of File: text/tab-separated-values

Notes:

UNF:6:pDe+tJhhI1AtNX63Vj+j1A==

File Description--f33639

File: HSN_RP_child_SSD_Id.tab

  • Number of cases: 52746

  • No. of variables per record: 43

  • Type of File: text/tab-separated-values

Notes:

UNF:6:Amblb3ePil6kFMxsmQSHhQ==

File Description--f33637

File: HSN_RP_SSD_Id_NAMED.tab

  • Number of cases: 20877

  • No. of variables per record: 61

  • Type of File: text/tab-separated-values

Notes:

UNF:6:+l+iWR16WE3jvnTjJ/9otw==

File Description--f33642

File: HSN_RP_SSD_Id_no_days.tab

  • Number of cases: 20877

  • No. of variables per record: 39

  • Type of File: text/tab-separated-values

Notes:

UNF:6:oGfatA0OzU3QNIKKoc2xnw==

File Description--f33634

File: HSN_RP_SSD_Id.tab

  • Number of cases: 20877

  • No. of variables per record: 45

  • Type of File: text/tab-separated-values

Notes:

UNF:6:m8SugoxgyxRhcUlj/gD6dQ==

File Description--f33636

File: Occupations.tab

  • Number of cases: 41918

  • No. of variables per record: 14

  • Type of File: text/tab-separated-values

Notes:

UNF:6:QlQ6r/b3EUq8fReZ/bsxrA==

File Description--f33641

File: Religion.tab

  • Number of cases: 20816

  • No. of variables per record: 3

  • Type of File: text/tab-separated-values

Notes:

UNF:6:XA0sqBJ4MwSDCzY89wXBqQ==