Skip to content

lexibank/lairgyalrong

Repository files navigation

CLDF dataset derived from Lai and List's "Comparison of Rgyalrongic Languages" from 2023

CLDF validation

How to cite

If you use these data please cite

  • the original source

    Lai, Yunfan and List, Johann-Mattis (2023): Lexical Data for the Historical Comparison of Rgyalrongic Languages [Dataset, Version 1.0]. Leipzig: Max Planck Institute for Evolutionary Anthropology.

  • the derived dataset using the DOI of the particular released version you were using

Description

This dataset is licensed under a CC-BY-4.0 license

Available online at http://github.com/lexibank/lairgyalrong

Conceptlists in Concepticon:

Notes

In order to convert the data to the TSV format compatible with EDICTOR (and easy to browse in Excel), you can use pyedictor:

$ pip install pyedictor
$ edictor wordlist --name=lairgyalrong --addon="partial_cognacy:cogids"

This will output the data to the file lairgyalrong.tsv. For convenience, we allow users to access the file via EDICTOR directly, using the following link:

https://digling.org/edictor/?file=lairgyalrong.tsv&preview=500&basics=DOCULECT|CONCEPT|TOKENS|COGIDS&publish=true

With this link, you can browse the data in the EDICTOR tool online (without being able to manipulate the data).

Statistics

CLDF validation Glottolog: 100% Concepticon: 100% Source: 91% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 22 (linked to 17 different Glottocodes)
  • Concepts: 291 (linked to 291 different Concepticon concept sets)
  • Lexemes: 6,321
  • Sources: 16
  • Synonymy: 1.10
  • Invalid lexemes: 0
  • Tokens: 29,381
  • Segments: 414 (0 BIPA errors, 0 CLTS sound class errors, 407 CLTS modified)
  • Inventory size (avg): 72.32

Possible Improvements:

  • Entries missing sources: 567/6321 (8.97%)

Contributors

Name GitHub user Description Role
Yunfan Lai Author
Johann-Mattis List @lingulist maintainer Author, Editor

CLDF Datasets

The following CLDF datasets are available in cldf: