CLDF dataset derived from Dellert et al.'s "NorthEuraLex" from 2020

lexibank/northeuralex

CLDF dataset derived from Dellert et al.'s "NorthEuraLex (Version 0.9)" from 2020

How to cite

If you use these data, please cite

Description

This dataset is licensed under a CC-BY-4.0 license.

Available online at http://www.northeuralex.org

Conceptlists in Concepticon:

Notes

This large database covers 107 language varieties of Northern Eurasia. For the conversion to CLDF, we considerably adjusted the IPA transcriptions in the source.

Statistics

CLDF validation

  • Glottolog: 100%
  • Concepticon: 94%
  • Source: 100%
  • BIPA: 100%
  • CLTS SoundClass: 100%

  • Varieties: 107 (linked to 107 different Glottocodes)
  • Concepts: 1,016 (linked to 954 different Concepticon concept sets)
  • Lexemes: 121,611
  • Sources: 1
  • Synonymy: 1.15
  • Invalid lexemes: 0
  • Tokens: 699,892
  • Segments: 678 (0 BIPA errors, 0 CLTS sound class errors, 676 CLTS modified)
  • Inventory size (avg): 52.43
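Some of the figures above can be cross-checked against each other. The following is a minimal sketch, not part of the dataset's tooling: it assumes that the Concepticon percentage roughly corresponds to the ratio of linked concept sets to concepts (the badge itself may be computed slightly differently), and it derives a mean number of tokens per lexeme, which the list does not report. All variable names are illustrative.

```python
# Figures copied from the Statistics list above.
concepts = 1016          # Concepts
concepticon_sets = 954   # distinct Concepticon concept sets linked
lexemes = 121611         # Lexemes
tokens = 699892          # Tokens (segments summed over all lexemes)


def share(part, whole):
    """Return part/whole as a percentage."""
    return 100 * part / whole


# Assumption: the Concepticon badge approximates this ratio.
concepticon_coverage = share(concepticon_sets, concepts)

# Derived figure, not stated in the README: average segmented length of a lexeme.
tokens_per_lexeme = tokens / lexemes

print(f"Concepticon coverage: {concepticon_coverage:.0f}%")
print(f"Mean tokens per lexeme: {tokens_per_lexeme:.2f}")
```

Under this assumption, the coverage rounds to the 94% reported in the validation summary.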

Contributors

| Name | GitHub user | Description | Role |
| --- | --- | --- | --- |
| Tiago Tresoldi | @tresoldi | patron | Other |
| Julius Steuer | @justeuer | orthographic profile | Other |
| Johann-Mattis List | @LinguList | code, integration | Editor |
| Robert Forkel | @xrotwang | code, integration | Editor |
| Johannes Dellert | | editor | DataCurator, DataManager, Author |
| Pavel Sofroniev | @pavelsof | original team CLDF curation | DataCurator, DataManager |

CLDF Datasets

The following CLDF datasets are available in cldf: