Jedidiah Morse, The American Gazetteer (Boston 1797) [plain text, TEI/XML format] (hdl:10622/L0LJGD)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

Jedidiah Morse, The American Gazetteer (Boston 1797) [plain text, TEI/XML format]

Identification Number:

hdl:10622/L0LJGD

Distributor:

IISH Data Collection

Date of Distribution:

2020-01-31

Version:

1

Bibliographic Citation:

Stapel, Rombert; Ashkpour, Ashkan; Reynaert, Martin, 2020, "Jedidiah Morse, The American Gazetteer (Boston 1797) [plain text, TEI/XML format]", https://hdl.handle.net/10622/L0LJGD, IISH Data Collection, V1

Study Description

Citation

Title:

Jedidiah Morse, The American Gazetteer (Boston 1797) [plain text, TEI/XML format]

Identification Number:

hdl:10622/L0LJGD

Identification Number:

urn:oclc:record:1039515558

Identification Number:

americangazettee00mors

Identification Number:

ark:/13960/t43r11p1n

Authoring Entity:

Stapel, Rombert (International Institute of Social History)

Ashkpour, Ashkan (International Institute of Social History)

Reynaert, Martin (Meertens Instituut)

Date of Production:

1797

Software used in Production:

Transkribus

Distributor:

IISH Data Collection

Access Authority:

Stapel, Rombert

Depositor:

Stapel, Rombert

Date of Deposit:

2020-01-31

Study Scope

Keywords:

Arts and Humanities, Gazetteer, Early Modern, North America, West Indies, South America, Latin America, Oceania

Abstract:

This dataset contains the digitalised text of Jedidiah Morse's <em>The American Gazetteer</em> (Boston 1797). The text was digitalised using scans provided by the John Adams Library at the Boston Public Library (<a href="https://archive.org/details/americangazettee00mors" target="_blank">Internet Archive</a>) and the HTR software <a href="http://www.transkribus.eu/" target="_blank">Transkribus</a>.<br><br> The text is presented in several formats (txt, TEI/XML, <a href="https://hdl.handle.net/10622/2T7EQH">ALTO line</a>, <a href="https://hdl.handle.net/10622/7MCPZB">ALTO word</a>, <a href="https://hdl.handle.net/10622/BTVJ4X">Transkribus PAGE/XML</a>), stored in separate folders. We have also included two extra plain text files, one file ('§') containing only the place name lemmas (and no introduction and appendices), the other one which was manually edited to correct a limited number of incorrect line breaks (e.g. 'New¬York' instead of 'New-York').<br><br> In the dataset <a href="https://hdl.handle.net/10622/UJLP04">'Ground Truth'</a> the transcriptions can be found that were manually created in order to train the Transkribus HTR model.

Time Period:

1700-01-01-1797-12-31

Country:

United States, Canada, Mexico, Argentina, Brazil, Solomon Islands, Colombia, Chile, Venezuela, Bolivarian Republic of, Suriname, Greenland, Falkland Islands (Malvinas), Guyana, Cuba, Bolivia, Plurinational State of, Haiti, Dominican Republic, Nicaragua, Peru, Paraguay

Geographic Bounding Box:

  • West Bounding Longitude: -135
  • East Bounding Longitude: 20
  • South Bounding Latitude: -90
  • North Bounding Latitude: 90

Methodology and Processing

Sources Statement

Data Access

Notes:

CC0 Waiver

Other Study Description Materials

Other Study-Related Materials

Label:

The_American_Gazetteer (cleaned line breaks).txt

Text:

Complete text, manually corrected line breaks

Notes:

text/plain

Other Study-Related Materials

Label:

The_American_Gazetteer_tei.xml

Text:

TEI/XML format

Notes:

text/xml

Other Study-Related Materials

Label:

The_American_Gazetteer (§).txt

Text:

Place name lemmas only (no introduction or appendices)

Notes:

text/plain

Other Study-Related Materials

Label:

The_American_Gazetteer.txt

Text:

Complete text

Notes:

text/plain