If you use these data, please cite:
- the original source
Lai, Yunfan and List, Johann-Mattis (2023): Lexical Data for the Historical Comparison of Rgyalrongic Languages [Dataset, Version 1.0]. Leipzig: Max Planck Institute for Evolutionary Anthropology.
- the derived dataset using the DOI of the particular released version you were using
This dataset is licensed under a CC-BY-4.0 license.
Available online at http://github.com/lexibank/lairgyalrong
Conceptlists in Concepticon:
In order to convert the data to the TSV format compatible with EDICTOR (and easy to browse in Excel), you can use pyedictor:
$ pip install pyedictor
$ edictor wordlist --name=lairgyalrong --addon="partial_cognacy:cogids"
This will output the data to the file lairgyalrong.tsv. For convenience, we allow users to access the file via EDICTOR directly, using the following link:
https://digling.org/edictor/?file=lairgyalrong.tsv&preview=500&basics=DOCULECT|CONCEPT|TOKENS|COGIDS&publish=true
With this link, you can browse the data in the EDICTOR tool online (without being able to manipulate the data).
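If you prefer to inspect the exported file programmatically rather than in EDICTOR or Excel, the following is a minimal sketch using only the Python standard library. It assumes the TSV has a header row containing the columns named in the link above (DOCULECT, CONCEPT, TOKENS, COGIDS) plus an ID column, and that lines starting with `#` can be skipped; the sample rows are invented for illustration and are not taken from the dataset.

```python
import csv
import io


def read_edictor_tsv(handle):
    """Yield one dict per data row from an EDICTOR-style TSV file handle."""
    # Assumption: blank lines and lines starting with "#" carry no row data.
    lines = (line for line in handle if line.strip() and not line.startswith("#"))
    reader = csv.DictReader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        yield row


def count_by_doculect(rows):
    """Count how many rows (lexemes) each doculect contributes."""
    counts = {}
    for row in rows:
        counts[row["DOCULECT"]] = counts.get(row["DOCULECT"], 0) + 1
    return counts


# Invented sample rows, only to show the expected shape of the file.
SAMPLE = (
    "ID\tDOCULECT\tCONCEPT\tTOKENS\tCOGIDS\n"
    "1\tVarietyA\thand\tl a\t1\n"
    "2\tVarietyB\thand\tl a k\t1 2\n"
    "3\tVarietyA\twater\tt ɕ i\t3\n"
)

counts = count_by_doculect(read_edictor_tsv(io.StringIO(SAMPLE)))
print(counts)
```

To run this on the real export, replace the `io.StringIO(SAMPLE)` handle with `open("lairgyalrong.tsv", encoding="utf-8", newline="")`.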
- Varieties: 22 (linked to 17 different Glottocodes)
- Concepts: 291 (linked to 291 different Concepticon concept sets)
- Lexemes: 6,321
- Sources: 16
- Synonymy: 1.10
- Invalid lexemes: 0
- Tokens: 29,381
- Segments: 414 (0 BIPA errors, 0 CLTS sound class errors, 407 CLTS modified)
- Inventory size (avg): 72.32
- Entries missing sources: 567/6321 (8.97%)
Name | GitHub user | Description | Role |
---|---|---|---|
Yunfan Lai | | | Author |
Johann-Mattis List | @lingulist | maintainer | Author, Editor |
The following CLDF datasets are available in cldf:
- CLDF Wordlist at cldf/cldf-metadata.json
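
As a quick way to see which tables the CLDF Wordlist declares, the metadata file can be read as plain JSON. The sketch below assumes the usual CLDF layout, in which the metadata has a `tables` list whose entries carry a `url` and a `dc:conformsTo` term. The table names in the sample are illustrative assumptions, not a listing of this dataset's actual files. For real work, a dedicated CLDF library such as pycldf is likely the better choice.

```python
import json


def list_cldf_tables(metadata):
    """Return (url, component) pairs for each table declared in CLDF metadata."""
    tables = []
    for table in metadata.get("tables", []):
        conforms = table.get("dc:conformsTo") or ""
        # The component name is the fragment after "#" in the CLDF term URI.
        component = conforms.rsplit("#", 1)[-1] if conforms else None
        tables.append((table.get("url"), component))
    return tables


# Illustrative metadata fragment (assumed shape, not copied from the dataset).
SAMPLE_METADATA = json.loads("""
{
  "dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#Wordlist",
  "tables": [
    {"url": "forms.csv",
     "dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#FormTable"},
    {"url": "languages.csv",
     "dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#LanguageTable"}
  ]
}
""")

for url, component in list_cldf_tables(SAMPLE_METADATA):
    print(url, component)
```

To apply this to the repository, load the real file with `json.load(open("cldf/cldf-metadata.json", encoding="utf-8"))` and pass the result to `list_cldf_tables`.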