Computing Detailed Colexifications with Missing Data Information from the CLICS⁴ Collection

Authors

DOI:

https://doi.org/10.15475/calcip.2026.1.2

Keywords:

CLICS, colexification data, missing data, tutorial

Abstract

CLICS⁴ offers a refined structural representation of cross-linguistic colexification patterns but retains an implicit representation of missing data. This obscures whether the lack of a colexification in a language for paired concepts is due to its true absence in the language, or due to missing data on the concept or word form level. We introduce a straightforward workflow that can be applied to individual datasets from CLICS⁴ to identify cases of colexification via a three-way attestation scheme. Our approach captures the presence or absence of a colexification in CLICS⁴, but it also explicitly encodes the presence or absence of data at the level of the original questionnaire, or the individual language, elicited with the help of the questionnaire.

Downloads

Published

2026-02-23

How to Cite

Computing Detailed Colexifications with Missing Data Information from the CLICS⁴ Collection. (2026). Computer-Assisted Language Comparison in Practice: Tutorials on Computational Approaches to the History and Diversity of Languages, 9(1), 7-18. https://doi.org/10.15475/calcip.2026.1.2