A New Dataset with Phonological Reconstructions in Lexibank

Authors

  • Frederic Blum Department of Linguistic and Cultural Evolution at the Max Planck Institute for the Science of Human History
  • Carlos Barrientos Department of Linguistic and Cultural Evolution at the Max Planck Institute for the Science of Human History

DOI:

https://doi.org/10.15475/calcip.2023.1.6

Keywords:

dataset, Lexibank, Pano-Tacanan languages

Abstract

Data in historical linguistics is typically presented in non-machine-readable formats, such as text-based supplementary material or even handwritten manuscripts. Many annotations and important facts are given in prose or remain within linguists' heads. Those problems make it difficult for non-experts in the specific field to understand the data, and to reproduce and replicate the results, and also limits the exposure that linguists receive for their hard work. Similar to previous blog posts on retro-standardizing data, we present the digitization of a dataset that includes phonological reconstructions. By representing this kind of data in CLDF, we can apply a variety of computer-assisted methods to assess the quality of the reconstructions.

Downloads

Published

2023-06-21

How to Cite

Blum, F., & Barrientos, C. (2023). A New Dataset with Phonological Reconstructions in Lexibank. Computer-Assisted Language Comparison in Practice, 6(1), 43–51. https://doi.org/10.15475/calcip.2023.1.6