Implementing Fuzzy Spelling Search in Dictionaries of Under-Described Languages Lacking Standard Orthographies

Authors

  • Kellen Parker van Dam Chair of Multilingual Computational Linguistics at the University of Passau

DOI:

https://doi.org/10.15475/calcip.2024.1.5

Abstract

Non-standard orthographies are common in the world of under-described language documentation. Whether they are semi-conventionalised community spellings, orthographies partially adopted from missionary works, or hastily transcribed texts representing as-yet uncertain phonologies, there is a need to be able to work through lexical data in a way which can accommodate and respond to such non-standard transcriptions. Here, a few options are considered, with a solution for fuzzy string matching based on attested variations is presented.

Downloads

Published

2024-05-27

How to Cite

van Dam, K. P. (2024). Implementing Fuzzy Spelling Search in Dictionaries of Under-Described Languages Lacking Standard Orthographies. Computer-Assisted Language Comparison in Practice, 7(1), 35–46. https://doi.org/10.15475/calcip.2024.1.5