DigiHistCH24

20 godparents and 3 wives – studying migrant glassworkers in post-medieval Estonia


Session 3B

Author: Monika Reppo
Affiliation: University of Tartu
Published: September 12, 2024
Modified: November 15, 2024
DOI: 10.5281/zenodo.14171320

Abstract

During my PhD research, ‘Glass and its makers in post-medieval Estonia’, genealogical data for 1,248 German-speaking migrant glassworkers and their family members from the 16th–19th century was collected. The data was compiled using church books and other relevant records indexed, digitised and/or transcribed by the National Archives of Estonia (NAE) and the Digital archive of Estonian newspapers (DEA) managed by the National Library of Estonia. NAE has recently begun employing and training Transkribus, an AI-powered platform for the text recognition and transcription of locally compiled historical documents. DEA uses Optical Character Recognition, which enables users to search, but also to correct, automatically created texts from newspapers. The data collected from these sources was tabulated and published on DataDOI, an Open Access repository managed by the University of Tartu library. This paper reflects on using these tools for large-scale data collection, on the motivation behind publishing the raw data Open Access, and on the effects of this on my research. Using Gephi, an open-source visualisation program, the connections between individuals and locations in this dataset are presented to demonstrate how this data can be used in network analysis. The results offer a look into the lives of migrant glassworkers in the 17th–19th century in Estonia with a particular focus on godparents and godchildren and the connections within the glassworking community.

Keywords

post-medieval archaeology, documentary archaeology, digitised records

For this paper, slides are available on Zenodo (PDF).

Introduction

As part of the author’s PhD project, ‘Glass and its makers in Estonia, c. 1550–1950: an archaeological study,’ genealogical data on 1,248 migrant glassworkers and their family members working in Estonia from the 16th–19th century were collected using archival records and newspapers. The goal was to use information about key life events to trace the life histories of the glassworkers and their families from childhood to old age, and so to gain an understanding of the community and the industry through one of its most important aspects – the workforce. It was hoped that the data would also assist in identifying the locations and names of glassworks during the period under study. In this paper, the author reflects on the process of this documentary archaeology research. The data collection, storage, and visualisation processes are described, followed by the results of the study, which have been included in a doctoral dissertation (Reppo 2024) and a research article (Reppo 2023b).

Data collection

The aim of this part of the PhD project was to collate, visualise, and publish data on the key life events of migrant glassworkers in post-medieval Estonia. Information was obtained on 1,248 individuals, most of whom were of German origin. This list is in no way complete but provides information on workers and their family members connected with the glass industry from the 16th century until the 1840s–1860s. At that point, the reliance on foreign workers started to lessen due to the abolition of serfdom in Estonia, which gave locals access to skilled professions previously closed to them (Reppo 2024, 52).

The data were collected, tabulated, and made Open Access via DataDOI (“DataDOI” 2024) as a raw dataset (Reppo 2023a). The following life events were considered – birth, baptism, marriage(s), and death. Both the date and place were included where possible to identify migration routes to and within Estonia. With baptisms, the number of godparents as well as the names of all the godparents, in the order listed in the church records, were included. In total, the dataset has 1,249 rows and 22 columns. But how does one find, access, and organise data on more than 1,200 individuals?

In addition to previously published sources and some additional archival information, this study mainly used records kept and digitised by the National Archives of Estonia (NAE) and the National Library of Estonia (NLE). During the period under study, the area of modern-day Estonia was under the rule of the Swedish Kingdom (1561–1710) and the Russian Czardom (1710–1918). Due to the political history of the area, official documents, including church records, were kept in German well into the 19th century, but also in Swedish and Russian during the respective periods. As a result, the newspapers considered in this study were also published in German and Russian, but sources compiled in Estonian were used as well. This means the raw data could be in any of these languages.

As the dissertation and most of the articles connected to this thesis were written in English, all collected data was translated into English. For many of the entries in the dataset, the place name in the original source was in German. The currently used name is given first with the German version in brackets, for example, ‘Latvia, Suntaži (Sunzel).’ For Estonian place names, the German version is mostly not given but can be found in the Dictionary of Estonian Place Names (KNR; “Dictionary of Estonian Place Names” 2017). For the workers’ profession, the translated version is given first, followed by the title from the original source in brackets, for example, ‘Hollow glass maker (Hohlgläser).’ Surnames vary somewhat from German to Russian to Estonian and from one church warden to another. The most common variations of a surname are given in brackets – for example, ‘Kilias (Kihlgas).’ This translation is not included for the glassworks, as all used names and other details such as coordinates, operation dates, owners, and so on are given in another dataset (Reppo 2023b).
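The bracket convention described above lends itself to simple machine processing. The following is a minimal sketch in Python, assuming that each cell holds at most one trailing bracketed form; the function and type names are hypothetical and are not part of the published dataset.

```python
import re
from typing import NamedTuple, Optional


class LabelledValue(NamedTuple):
    primary: str            # translated or currently used form
    variant: Optional[str]  # form from the original source, if one is given


# Matches "primary (variant)" where the bracketed part ends the string.
# Assumption: only one bracketed variant per cell.
_PATTERN = re.compile(r"^(?P<primary>.*?)\s*\((?P<variant>[^()]*)\)\s*$")


def split_bracketed(value: str) -> LabelledValue:
    """Split a cell such as 'Latvia, Suntaži (Sunzel)' into its two forms."""
    text = value.strip()
    match = _PATTERN.match(text)
    if match is None:
        # No bracketed form present: keep the value as it is.
        return LabelledValue(text, None)
    return LabelledValue(match.group("primary"), match.group("variant").strip())


if __name__ == "__main__":
    for cell in ("Latvia, Suntaži (Sunzel)",
                 "Hollow glass maker (Hohlgläser)",
                 "Kilias (Kihlgas)"):
        print(split_bracketed(cell))
```

Splitting the two forms in this way would allow searching by either the modern or the historical name without altering the raw data.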

National Archives of Estonia

From the NAE, data were collected by identifying records using the Archival Information System (AIS; Rahvusarhiiv 2024a), the name register for the Lutheran congregations (“Luteri Koguduste Personaalraamatute Nimeregister” 2024), and Saaga (“Saaga” 2024). With AIS and Saaga, it was possible both to find references to records only available as paper copies at the NAE reading rooms in Tartu and Tallinn and to access digitised records, most of which were church books. NAE has estimated that around 34 million images of its physical records have been made available online, which is roughly 5% of its collection (“National Archives of Estonia” 2024). In October 2022, NAE adopted Transkribus, an AI-powered platform developed to recognise and transcribe historical handwritten documents (Rahvusarhiiv 2024b), but only a limited number of records are searchable through this feature at present.

Unfortunately, none of the records related to the glassworkers’ life events under consideration in this study have been added yet. To test the usability of Transkribus for a non-expert user, the author ran a handful of 17th-century documents in Swedish through Transkribus (Transkribus 2024) to identify the location of a glassworks in Pärnu, Estonia. These records did not yield the results hoped for, but using Transkribus did speed up the process, even if the transcribed text needed corrections.

Although records related to the key life events of the glassworkers are not yet available via the Transkribus engine on the NAE homepage, the archive has made use of family name indexes. These were compiled in the 1960s–1980s at what is today the Estonian Ministry of the Interior’s IT and Development Centre, Department of Population Services, and are based on church books which were kept until the 1940s. Although many congregations have preserved church records from as early as the 18th century, and some even from the 17th century, church law mandated the keeping of church books only from 1834 onwards, so the coverage varies across Estonia (Puss 2024). For this study, the focus was on the Kärevere-Laeva region, which housed the largest number of Estonian glassworks from the mid-18th until the 20th century (Reppo 2024, 35). Studying the church books from this area – Kursi and Kolga-Jaani parishes – was therefore expected to be the most productive approach.

The indexes mentioned above are based on these records and list the last name with the relevant church book page numbers. Their digitisation was started in 2005 by the Estonian Association of Genealogists, taking advantage of researchers’ strong interest in this material (Puss 2024). Members and other volunteers digitised these indexes and also added their own indexes to the collection. The NAE complemented these surname indexes with a search engine which allows searching by date, parish, and last name. Over the years, the system has been developed to allow users to add image numbers which direct researchers to the correct image (page) in the digitised church book. Maiden names have also been partially indexed. The archive has now scaled up the use of this external help, occasionally crowdsourcing the indexing for specific thematic projects.

Although the crowdsourced indexes made it possible to identify the records which included the glassworkers, and most of these were indeed digitised, retrieving the information from NAE records still required traditional research methods. Thus, after identifying the parishes with the highest number of glassworks, thousands of pages of church books were combed through to compile the raw dataset. With further help from transcription services, the process of collecting basic data about the key life events of the glassworkers and their family members could be streamlined further. Although some 17th-century records were uploaded to Transkribus for transcription to speed up the collection of very straightforward data for the individuals – dates and locations of key life events – future studies would certainly be facilitated by the built-in Transkribus engine on NAE.

National Library of Estonia

Further information about the glassworkers and their family members was collected from the Digital archive of Estonian newspapers (DEA; “DIGAR Eesti Artiklid” 2024), which is managed by the NLE. The newspapers available via this database were published from 1811 onwards, with some earlier exceptions. Unlike NAE, this collection employs Optical Character Recognition (OCR), and its use significantly sped up the research process. There were, as expected, errors, for example where OCR was unable to detect the layout of the text or where the printing ink had bled. The database allows users to correct such errors. As the author corrected the errors in recognised characters in the sources used for this study, future searches by other researchers should be less error-prone.
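Because OCR errors and spelling variation both affect surname searches, approximate string matching can help to surface candidate mentions. The sketch below is an illustration only and does not describe how the DEA search works or how the author searched; it uses Python's standard `difflib` module, and the function name, the sample sentence, and the similarity cutoff of 0.8 are assumptions chosen for the example.

```python
import difflib
import re


def find_surname_candidates(ocr_text, surname_variants, cutoff=0.8):
    """Return OCR tokens that closely resemble any known surname variant.

    The cutoff is an arbitrary illustrative threshold: higher values give
    fewer false positives but miss more heavily garbled words.
    """
    # Collect the distinct alphabetic tokens in the OCR text
    # (Unicode letters only, so digits and punctuation are ignored).
    tokens = sorted(set(re.findall(r"[^\W\d_]+", ocr_text)))
    hits = {}
    for variant in surname_variants:
        matches = difflib.get_close_matches(variant, tokens, n=5, cutoff=cutoff)
        if matches:
            hits[variant] = sorted(matches)
    return hits


if __name__ == "__main__":
    # Invented sample sentence with one deliberately misrecognised surname.
    sample = "Der Glasmacher Kihlgas und Frau Kiliaz"
    print(find_surname_candidates(sample, ["Kilias", "Kihlgas"]))
```

Any candidate found this way would still need to be checked against the page image before being corrected in the database or entered into the dataset.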

Publication

Publication of raw datasets in Estonian archaeology is a new phenomenon and has been particularly rare for material culture studies, of which this study was a part (Reppo 2024, 38). In addition to adhering to FAIR principles, the publication of this dataset is tied to an unusual situation – the author is the only archaeologist in Estonia studying post-medieval glass. In fact, three large datasets were published as part of this dissertation – one on archaeological finds (Reppo 2023a), another on the workers (Reppo 2023a), and a third on the glassworks themselves (Reppo 2023b) – to avoid a research monopoly and encourage other researchers to study the post-medieval glass industry in Estonia.

The raw dataset was published Open Access under a CC-BY 4.0 licence via DataDOI, a free data repository managed by the University of Tartu library, which provides the dataset with a persistent interoperable identifier. As mentioned above, the dataset of life events is tabulated and has 22 columns and 1,249 rows. It is accompanied by a metadata file which includes details on the project, the references, and other information relevant to the raw data.

Visualisation

One of the goals of this study was to visualise the data to provide easily legible images (charts, models, drawings) which encompass the entirety of the collected data. The data were visualised using Gephi, an open-source visualisation program. The raw data were first extracted using pivot tables in Microsoft Excel and wrangled to remove unnecessary details and columns. This showed that the data can be reshaped and is suitable for network analysis. For Gephi, the data needed to be sorted into nodes and edges, which allows the connections between several points of data to be visualised as lines. After cleaning, the data were converted from a Microsoft Excel table (.XLSX) to a .CSV file to run the model. In the model, the node (point) size is representative of the number of connections to the place or family. Glassworks are differentiated from birth, marriage, and death locations by the ‘GW’ (glassworks) in the name.
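The author carried out this step with Excel pivot tables; as an alternative illustration, the same reshaping into node and edge tables can be sketched in Python. The column headers `Id`, `Label`, `Source`, `Target`, `Type`, and `Weight` follow the names Gephi's spreadsheet import recognises, while the function names, the extra `Degree` column, and the placeholder family and place names are assumptions made for the example. The sketch also assumes that no family shares its name with a place.

```python
import csv
import io
from collections import Counter


def build_network(links):
    """Turn (family, place) pairs, one per life event, into node and edge rows."""
    # Repeated family-place pairs become a single weighted edge.
    edge_weights = Counter()
    for family, place in links:
        edge_weights[(family, place)] += 1

    # Degree = number of distinct connections, the basis for node size.
    degree = Counter()
    for family, place in edge_weights:
        degree[family] += 1
        degree[place] += 1

    nodes = [{"Id": name, "Label": name, "Degree": count}
             for name, count in sorted(degree.items())]
    edges = [{"Source": source, "Target": target,
              "Type": "Undirected", "Weight": weight}
             for (source, target), weight in sorted(edge_weights.items())]
    return nodes, edges


def to_csv(rows, fieldnames):
    """Serialise rows to CSV text that can be saved and imported into Gephi."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    # Placeholder names only; 'GW' marks a glassworks as in the dataset.
    links = [("Family A", "Place X GW"), ("Family A", "Place X GW"),
             ("Family B", "Place X GW"), ("Family A", "Place Y")]
    nodes, edges = build_network(links)
    print(to_csv(nodes, ["Id", "Label", "Degree"]))
    print(to_csv(edges, ["Source", "Target", "Type", "Weight"]))
```

Gephi calculates degree itself once the tables are imported, so the `Degree` column here serves only as a quick check of the counts before loading the files.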

In this model, marriages between families and the connections of those families to places are plotted based on their places of origin, birth, baptism, marriage, and death. With further data wrangling, it would be possible to show the connections of the glassworkers and their family members within the larger community beyond marriages by analysing the connections of those individuals who appear as godparents.
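As a sketch of what that further wrangling might look like, the Python example below derives directed godparent-to-godchild edges from baptism records. The record structure, function names, and placeholder person names are assumptions made for illustration and do not reflect the actual column layout of the published dataset; the `Position` attribute relies only on the fact that godparents were recorded in the order listed in the church records.

```python
from collections import Counter
from typing import NamedTuple, Tuple


class Baptism(NamedTuple):
    child: str
    godparents: Tuple[str, ...]  # names in the order listed in the church record


def godparent_edges(baptisms):
    """Create one directed edge per godparent, pointing at the godchild."""
    edges = []
    for baptism in baptisms:
        for position, godparent in enumerate(baptism.godparents, start=1):
            edges.append({"Source": godparent, "Target": baptism.child,
                          "Type": "Directed", "Position": position})
    return edges


def godparent_counts(baptisms):
    """Count how many times each person appears as a godparent."""
    return Counter(name for baptism in baptisms for name in baptism.godparents)


if __name__ == "__main__":
    # Placeholder individuals, not taken from the dataset.
    records = [Baptism("Child 1", ("Person A", "Person B")),
               Baptism("Child 2", ("Person B", "Person C", "Person A"))]
    for edge in godparent_edges(records):
        print(edge)
    print(godparent_counts(records).most_common())
```

Individuals with high counts would then appear as large nodes in a model of the wider community, alongside the marriage connections already plotted.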

Results

This study explored the network of connections between 1,248 migrant glassworkers and their family members working in Estonia from the 16th–19th century, using Transkribus, OCR, and Gephi as the main tools. A complete list of workers during this period was not the goal of this study. The raw dataset was published via DataDOI, an Open Access repository managed by the University of Tartu library, in accordance with FAIR principles. The data show that key factors in building and maintaining the glass community were godparenting and marriages between the families. In addition to tracing migration to, within, and from Estonia, the data also made it possible to identify the makers of some archaeological glass artefacts as well as the locations and names of glassworks.

Note

The Project “Cooperation between universities to promote doctoral studies” (2021-2027.4.04.24-0003) is co-funded by the European Union.

References

“DataDOI.” 2024. https://datadoi.ee/.
“Dictionary of Estonian Place Names.” 2017. https://arhiiv.eki.ee/dict/knr/.
“DIGAR Eesti Artiklid.” 2024. https://dea.digar.ee/.
“Luteri Koguduste Personaalraamatute Nimeregister.” 2024. https://www.ra.ee/dgs/addon/nimreg/index.php.
“National Archives of Estonia.” 2024. https://www.ra.ee/en/national-archives/about-us/.
Puss, Fred. 2024. “Ülevaade Personaalraamatutest Ja Projektist.” https://www.ra.ee/dgs/addon/nimreg/about.php.
Rahvusarhiiv. 2024a. “Arhiivi Infosüsteem.” https://ais.ra.ee.
———. 2024b. “Otsi Otse Allikast.” https://rahvusarhiiv.transkribus.eu/.
Reppo, Monika. 2023a. “Dataset 1. Archaeological Glass Finds from Estonia.” https://doi.org/10.23673/re-450.
———. 2023a. “Dataset 2. 16th–19th-Century Glassworkers in Estonia.” https://doi.org/10.23673/re-448.
———. 2023b. “Dataset 3. 17th–20th-Century Glassworks in Estonia.” http://dx.doi.org/10.23673/re-449.
———. 2023b. “Moving Skills, Moving Ideas – Migrant Glassworkers in 17th–19th-Century Estonia.” Post-Medieval Archaeology.
———. 2024. “Glass and Its Makers in Estonia, c. 1550–1950: An Archaeological Study.” PhD thesis, Tartu: University of Tartu.
“Saaga.” 2024. https://www.ra.ee/dgs/explorer.php.
Transkribus. 2024. “Unlock the Past with Transkribus.” https://www.transkribus.org/.

Reuse

CC BY-SA 4.0

Citation

BibTeX citation:
@misc{reppo2024,
  author = {Reppo, Monika},
  editor = {Baudry, Jérôme and Burkart, Lucas and Joyeux-Prunel,
    Béatrice and Kurmann, Eliane and Mähr, Moritz and Natale, Enrico and
    Sibille, Christiane and Twente, Moritz},
  title = {20 Godparents and 3 Wives – Studying Migrant Glassworkers in
    Post-Medieval {Estonia}},
  date = {2024-09-12},
  url = {https://digihistch24.github.io/submissions/460/},
  doi = {10.5281/zenodo.14171320},
  langid = {en},
  abstract = {During my PhD research, “Glass and its makers in
    post-medieval Estonia”, genealogical data for 1,248 German-speaking
    migrant glassworkers and their family members from the 16th–19th
    century was collected. The data was compiled using church books and
    other relevant records indexed, digitised and/or transcribed by the
    National Archives of Estonia (NAE) and the Digital archive of
    Estonian newspapers (DEA) managed by the National Library of
    Estonia. NAE has recently begun employing and training Transkribus,
    an AI-powered platform in the text recognition and transcription of
    locally compiled historical documents. DEA uses Optical Character
    Recognition which enables users to search but also correct
    automatically created texts from newspapers. The data collected from
    these sources was tabulated and published on DataDOI, an Open Access
    repository managed by the University of Tartu library. This paper
    reflects on using these tools for large-scale data collection and
    the motivation behind publishing the raw data Open Access and the
    effects of it on my research. Using Gephi, an open-source
    visualisation program, the connections between individuals and
    locations in this dataset are presented to demonstrate how this data
    can be used in network analysis. The results offer a look into the
    lives of migrant glassworkers in the 17th–19th century in Estonia
    with a particular focus on godparents and godchildren and the
    connections within the glassworking community.}
}
For attribution, please cite this work as:
Reppo, Monika. 2024. “20 Godparents and 3 Wives – Studying Migrant Glassworkers in Post-Medieval Estonia.” Edited by Jérôme Baudry, Lucas Burkart, Béatrice Joyeux-Prunel, Eliane Kurmann, Moritz Mähr, Enrico Natale, Christiane Sibille, and Moritz Twente. Digital History Switzerland 2024: Book of Abstracts. https://doi.org/10.5281/zenodo.14171320.
Source Code
---
submission_id: 460
categories: 'Session 3B'
title: 20 godparents and 3 wives – studying migrant glassworkers in post-medieval Estonia
author:
  - name: Monika Reppo
    orcid: 0000-0002-1643-7229
    email: monika.reppo@tlu.ee
    affiliations:
      - University of Tartu
funding:
  - statement: >
      The Project "Cooperation between universities to promote doctoral studies" (2021-2027.4.04.24-0003) is co-funded by the European Union.
keywords:
  - post-medieval archaeology
  - documentary archaeology
  - digitised records
abstract: |
  During my PhD research, ‘Glass and its makers in post-medieval Estonia’, genealogical data for 1,248 German-speaking migrant glassworkers and their family members from the 16th–19th century was collected. The data was compiled using church books and other relevant records indexed, digitised and/or transcribed by the National Archives of Estonia (NAE) and the Digital archive of Estonian newspapers (DEA) managed by the National Library of Estonia. NAE has recently begun employing and training Transkribus, an AI-powered platform in the text recognition and transcription of locally compiled historical documents. DEA uses Optical Character Recognition which enables users to search but also correct automatically created texts from newspapers. The data collected from these sources was tabulated and published on DataDOI, an Open Access repository managed by the University of Tartu library. This paper reflects on using these tools for large-scale data collection and the motivation behind publishing the raw data Open Access and the effects of it on my research. Using Gephi, an open-source visualisation program, the connections between individuals and locations in this dataset are presented to demonstrate how this data can be used in network analysis. The results offer a look into the lives of migrant glassworkers in the 17th–19th century in Estonia with a particular focus on godparents and godchildren and the connections within the glassworking community.
key-points:
  - This study explored the network of connections between 1,248 migrant glassworkers and their family members working in Estonia from the 16th-19th century, using Transkribus, OCR, and Gephi as the main tools.
  - The raw dataset was published via DataDOI, an Open Access repository managed by the University of Tartu library in accordance to FAIR principles.
  - The data shows that a key factor in building and maintaining the glass community was godparenting and marriages between the families.
date: 09-12-2024
date-modified: 11-15-2024
doi: 10.5281/zenodo.14171320
other-links:
  - text: Presentation Slides (PDF)
    href: https://doi.org/10.5281/zenodo.14171320
bibliography: references.bib
---

::: {.callout-note appearance="simple" icon=false}

For this paper, slides are available [on Zenodo (PDF)](https://zenodo.org/records/14171320/files/460_DigiHistCH24_Glassworkers_Slides.pdf).

:::

## Introduction

As part of the author’s PhD project, ‘Glass and its makers in Estonia, c. 1550–1950: an archaeological study,’ the genealogical data about 1,248 migrant glassworkers and their family members working in Estonia from the 16th–19th century were collected using archival records and newspapers. The goal was to use information about key life events to trace the life histories of the glassworkers and their families from childhood to old age to gain an understanding of the community and the industry through one of its most important aspects – the workforce. It was hoped that the data will also assist in identifying the locations and names of glassworks during the period under study. In this paper, the author reflects on the process of this documentary archaeology research. The data collection, storage, and visualisation process are described, followed by the results of the study which have been included in a doctoral dissertation [@mythesis] and a research article [@reppo2023d].

## Data collection

The aim of this part of the PhD project was to collate, visualise, and publish data on the key life events of migrant glassworkers in post-medieval Estonia. Information on 1,248 individuals was obtained who are mostly of German origin. This list is in no way complete but provides information workers and their family members connected with the glass industry from the 16th century until the 1840s–1860s when the reliance on foreign workers started to lessen due to the abolishment of serfdom in Estonia which allowed locals access to skilled professions previously inaccessible to them [@mythesis, pp. 52].

The data were collected, tabulated, and made Open Access via DataDOI [@datadoi] as a raw dataset [@reppo2023b]. The following life events were considered – birth, baptism, marriage(s), and death. Both the date and place were included where possible to identify migration routes to and within Estonia. With baptisms, the number of godparents as well as names of all the godparents in the order listed in the church records were included. In total, the dataset has 1,249 rows and 22 columns. But how to find, access, and organise data about more than 1,200 individuals at this scale?

In addition to previously published sources and some additional archival information, this study mainly used records kept and digitised by the National Archives of Estonia (NAE) and the National Library of Estonia (NLE). During the period under study, the area of modern day Estonia was under the rule of the Swedish Kingdom (1561–1710) and the Russian Czardom (1710–1918). Due to the political history of the area, official business, including church records were kept in German well into the 19th century but also both Swedish and Russian during the respective periods. The newspapers considered in this study were also published in German and Russian as a result but there are also sources compiled in Estonian that were used in this study. This means the raw data could be in any of these languages.

As the dissertation and most of the articles connected to this thesis was written in English, all collected data was translated into English. For many of the entries on the dataset, the place name in the original source was in German. The currently used name is given first with the German version in brackets, for example, ‘Latvia, Suntaži (Sunzel).’ For Estonian place names, the German version is mostly not given but can be found in the Dictionary of Estonian Place Names (KNR; @knr). For the workers’ profession, the translated version is given first with the title from the original source, for example, ‘Hollow glass maker (Hohlgläser).’ For surnames, there is some change from German to Russian to Estonian and from church warden to another. The most common variations of a surname are given in brackets – for example, ‘Kilias (Kihlgas).’ This translation is not included for the glassworks as all used names and other details such as coordinates, operation dates, owners, and so on are given in another dataset [@reppo2023c].

## National Archives of Estonia

From the NAE, data were collected by identifying records using the Archival Information System [@ais], the name register for the Lutheran congregations [@luterikpn], and Saaga [@saaga]. With AIS and Saaga, it was possible to find references to records only available as paper copies at the NAE reading rooms in Tartu and Tallinn but also access digitised records, most of which were church books. NAE has estimated that around 34 million images of their physical records have been made available online which is roughly 5% of their collection [@rahvusarhiiv]. NAE adopted Transkribus, an AI-powered platform developed to transcribe and recognise historical handwritten documents and text in October 2022 [@ratranskribus] but a limited number of records are searchable through this feature at present.

Unfortunately, none of the records related to the glassworkers  life events under consideration in this study have been added yet. To test the employability of Transkribus as a non-expert user, a handful of 17th-century documents in Swedish were run through Transkribus [@transkribus] by the author to identify the location of a glassworks in Pärnu, Estonia. These records did not yield results that were hoped for but using Transkribus did speed up the process, even if the transcribed text needed corrections.

Despite the current lack of records related to the key life events of the glassworkers via the Transkribus engine on the NAE homepage, the archive has used family name indexes compiled in the 1960s–1980s at the present-day Estonian Ministry of the Interior’s IT and Development Centre Department of Population Services based on church books which were kept until the 1940s. Although many congregations have preserved church records already from the 18th century and some even from the 17th century, the church law legislated keeping church books only from 1834 onwards so the coverage varies across Estonia [@puss2024]. For this study, the focus was on the Kärevere-Laeva region which housed the largest number of Estonian glassworks from the mid-18th until the 20th century [@mythesis, pp. 35]. This means studying the church books from this area – Kursi and Kolga-Jaani parishes – was predicted to be the most advantageous exercise.

The indexes mentioned above are based on these records and list the last name with the relevant church book page numbers. Their digitisation was started in 2005 by the Estonian Association of Genealogists, taking advantage of researchers’ strong interest in this material [@puss2024]. Members and other volunteers thus digitised these indexes but also added their own indexes to this collection. The NAE complemented these surname indexes with a search engine which allows searching by date, parish, and last name. Over the years, the system has been developed to allow users to add image numbers which direct researchers to the correct image (page) in the digitised church book. Maiden names have also been partially indexed. The archive has now upscaled the use of this external help, crowdsourcing the indexing for specific thematic projects occasionally.

Although the crowdsourced indexes made it possible to identify the records which included the glassworkers, and most of these were indeed digitised, retrieving the information from the NAE records still required traditional research methods. After the parishes with the highest number of glassworks had been identified, thousands of pages of church books were combed through to compile the raw dataset. With further help from transcription services, the process of collecting basic data about key life events of the glassworkers and their family members could be streamlined. Some 17th-century records were uploaded to Transkribus for transcription to speed up the collection of very straightforward data for the individuals, namely the dates and locations of key life events, but future studies would certainly be facilitated by the built-in Transkribus engine on NAE.

## National Library of Estonia

Further information about the glassworkers and their family members was collected from the Digital archive of Estonian newspapers [@digar], which is managed by the NLE. The newspapers available via this database were published from 1811 onwards, with some earlier exceptions. Unlike NAE, this collection employs Optical Character Recognition (OCR). The use of OCR for these records significantly sped up the research process. There were, of course, errors, for example where OCR was unable to detect the layout of the text or where the printing ink had bled. The database allows users to make corrections. As the author corrected the errors in recognised characters in the sources used for this study, future searches by other researchers should be less error-prone.

## Publication

Publication of raw datasets in Estonian archaeology is a new phenomenon and has been particularly rare for material culture studies, which this study was a part of [@mythesis, p. 38]. In addition to adhering to FAIR principles, the publication of this dataset is tied to an unusual situation: the author is the only archaeologist in Estonia studying post-medieval glass. In fact, three large datasets were published as part of this dissertation, one on archaeological finds [@reppo2023a], another on the workers [@reppo2023b], and a third on the glassworks themselves [@reppo2023c], to avoid a research monopoly and to encourage other researchers to study the post-medieval glass industry in Estonia.

The raw dataset was published Open Access under a CC-BY 4.0 licence via DataDOI, a free data repository managed by the University of Tartu library, which provides the dataset with a persistent interoperable identifier. As mentioned above, the dataset of life events is tabulated and has 22 columns and 1,249 lines. It is accompanied by a metadata file which includes details on the project, the references, and other information relevant to the raw data.

## Visualisation

One of the goals of this study was to visualise the data to provide easily legible images (charts, models, drawings) which encompass the entirety of the collected data. The data were visualised using Gephi, an open-source visualisation program, after extracting the raw data using pivot tables in Microsoft Excel and wrangling the data to remove unnecessary details and columns. This showed that the data are mutable and suitable for network analysis. For Gephi, the data needed to be sorted into nodes and edges, which allows the connections between several points of data to be visualised as lines. After cleaning the data, the format was transformed from a Microsoft Excel table (.XLSX) to a .CSV file to run the model. In the model, the node (point) size represents the number of connections to the place or family. Glassworks are differentiated from birth, marriage, and death locations by the 'GW' (glassworks) in the name.
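The step of sorting tabular records into nodes and edges can be illustrated with a minimal Python sketch. It is not the workflow used in this study, which relied on pivot tables in Microsoft Excel. The sample rows, the field names `family`, `event`, and `place`, and the helper functions `build_graph` and `to_csv` are hypothetical and chosen only for illustration. The column headers `Id`, `Label`, `Source`, `Target`, and `Type` follow the names Gephi conventionally recognises when importing spreadsheets.

```python
import csv
import io
from collections import Counter

# Hypothetical sample rows: field names and values are illustrative
# assumptions and do not come from the published dataset.
RECORDS = [
    {"family": "Family A", "event": "birth", "place": "Place X GW"},
    {"family": "Family A", "event": "marriage", "place": "Parish Y"},
    {"family": "Family B", "event": "death", "place": "Place X GW"},
]


def build_graph(records):
    """Return (nodes, edges).

    nodes maps each node id to its number of connections (degree);
    edges is a sorted list of unique (family, place) pairs.
    """
    edges = sorted({(r["family"], r["place"]) for r in records})
    degree = Counter()
    for family, place in edges:
        degree[family] += 1
        degree[place] += 1
    return dict(degree), edges


def to_csv(header, rows):
    """Serialise a header and rows to CSV text."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buf.getvalue()


nodes, edges = build_graph(RECORDS)
nodes_csv = to_csv(
    ["Id", "Label", "Degree"],
    [(name, name, deg) for name, deg in sorted(nodes.items())],
)
edges_csv = to_csv(
    ["Source", "Target", "Type"],
    [(source, target, "Undirected") for source, target in edges],
)
print(nodes_csv)
print(edges_csv)
```

The `Degree` column is included only to make the link to node size explicit; Gephi can calculate degree itself once the edge table has been imported.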

In this model, marriages between families and the connections of those families to places are plotted based on their places of origin, birth, baptism, marriage, and death. With further data wrangling it would be possible to show the connections of the glassworkers and their family members within the larger community beyond marriages by analysing the connections of those individuals who appear as godparents.
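A godparent-based extension of the network could be sketched as follows, assuming each baptism record lists the family of the child and the families of the godparents. The field names `child_family` and `godparent_families`, the sample rows, and the function `godparent_edges` are hypothetical and do not reflect the structure of the published dataset. The sketch counts how often two families are linked through godparenting, which could serve as an edge weight.

```python
from collections import Counter

# Hypothetical baptism rows: illustrative field names and values only.
BAPTISMS = [
    {"child_family": "Family A", "godparent_families": ["Family B", "Family C"]},
    {"child_family": "Family B", "godparent_families": ["Family A"]},
    {"child_family": "Family A", "godparent_families": ["Family B", "Family A"]},
]


def godparent_edges(baptisms):
    """Count undirected family-to-family ties created by godparenting."""
    weights = Counter()
    for row in baptisms:
        child = row["child_family"]
        for godparent in row["godparent_families"]:
            if godparent == child:
                continue  # ignore ties within the same family
            # Sort the pair so that A-B and B-A count as the same tie.
            weights[tuple(sorted((child, godparent)))] += 1
    return weights


weights = godparent_edges(BAPTISMS)
for (source, target), weight in sorted(weights.items()):
    print(source, target, weight)
```

Treating the ties as undirected is a simplifying assumption. A directed edge from the godparent's family to the child's family would keep the information on who sponsored whom.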

## Results

This study explored the network of connections between 1,248 migrant glassworkers and their family members working in Estonia from the 16th to the 19th century, using Transkribus, OCR, and Gephi as the main tools. A complete list of workers during this period was not the goal of this study. The raw dataset was published via DataDOI, an Open Access repository managed by the University of Tartu library, in accordance with FAIR principles. The data show that godparenting and marriages between the families were key factors in building and maintaining the glass community. In addition to tracing migration to, within, and from Estonia, the data also made it possible to identify the makers of some archaeological glass artefacts as well as the locations and names of glassworks.

::: {.callout-note}
The Project "Cooperation between universities to promote doctoral studies" (2021-2027.4.04.24-0003) is co-funded by the European Union.
:::

## References

::: {#refs}
:::