GLOBALISE Ground Truth for Handwritten Text and Layout Recognition (hdl:10622/QJZKZ2)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

GLOBALISE Ground Truth for Handwritten Text and Layout Recognition

Identification Number:

hdl:10622/QJZKZ2

Distributor:

IISH Data Collection

Date of Distribution:

2024-05-02

Version:

1

Bibliographic Citation:

Pepping, Kay; Hids, Maartje; Tosun, Merve; Brink, Femke; Swüste, Marja; GLOBALISE project, 2024, "GLOBALISE Ground Truth for Handwritten Text and Layout Recognition", https://hdl.handle.net/10622/QJZKZ2, IISH Data Collection, V1

Study Description

Citation

Title:

GLOBALISE Ground Truth for Handwritten Text and Layout Recognition

Identification Number:

hdl:10622/QJZKZ2

Authoring Entity:

Pepping, Kay (Huygens Institute)

Hids, Maartje (Huygens Institute)

Tosun, Merve (International Institute for Social History)

Brink, Femke (Huygens Institute)

Swüste, Marja (Huygens Institute)

GLOBALISE project (Huygens Institute)

Producer:

GLOBALISE project

Distributor:

IISH Data Collection

Access Authority:

Petram, Lodewijk

Access Authority:

IISG Data

Depositor:

Petram, Lodewijk

Date of Deposit:

2024-05-02

Holdings Information:

https://hdl.handle.net/10622/QJZKZ2

Study Scope

Keywords:

Arts and Humanities, Computer and Information Science, handwriting recognition, ground truth

Abstract:

This dataset contains Ground Truth PageXML files that were used to finetune the GLOBALISE Handwritten Text Recognition, baseline detection and region detection models (see Related Publications).

Notes:

This collection includes a <a href="https://datasets.iisg.amsterdam/file.xhtml?fileId=33328&version=1.0">datasheet</a> with comprehensive details about the motivation for creating this dataset, the files it comprises, and their potential uses. Additionally, it contains <a href="https://datasets.iisg.amsterdam/file.xhtml?fileId=33327&version=1.0">guidelines for creating text region Ground Truth</a>. The transcription Ground Truth files were created in accordance with the guidelines of the Dutch National Archives.

Methodology and Processing

Sources Statement

Data Access

Other Study Description Materials

Related Publications

Citation

Bibliographic Citation:

<a href="https://hdl.handle.net/10622/X2JZYY">GLOBALISE Loghi Handwritten Text Recognition Model – August 2023</a><br> <a href="https://hdl.handle.net/10622/VMSCBR">GLOBALISE Laypa Baseline Model – August 2023</a><br> <a href="https://hdl.handle.net/10622/DMAS2T">GLOBALISE Laypa Region Model - August 2023</a><br> <a href="https://hdl.handle.net/10622/LVXSBW">VOC transcriptions v2 - GLOBALISE</a>

Other Study-Related Materials

Label:

Datasheet.pdf

Notes:

application/pdf

Other Study-Related Materials

Label:

Guidelines_Text_Region_GT.pdf

Notes:

application/pdf

Other Study-Related Materials

Label:

Training_Baselines_1-1500_B6_V1_03-07-23.zip

Notes:

application/zip

Other Study-Related Materials

Label:

Training_General_Missives_B1_0_5_(17-03-2023).zip

Notes:

application/zip

Other Study-Related Materials

Label:

Training_Limited2_B2_v_1_1_(17-3-2023).zip

Notes:

application/zip

Other Study-Related Materials

Label:

Training_Regions_1001_B5_V1_26-6-23.zip

Notes:

application/zip

Other Study-Related Materials

Label:

Training_Regions_Standard_Layout_B4_V3.zip

Notes:

application/zip

Other Study-Related Materials

Label:

Validation_All_Random_B2_v1_1_(17-3-2023).zip

Notes:

application/zip

Other Study-Related Materials

Label:

Validation_General_Missives_B1_V_0_5_(17-03-2023).zip

Notes:

application/zip