MA in Linguistics with a specialization in character-level language modeling.
Highlights
- Pro
Pinned Loading
-
corpus_toolkit
corpus_toolkit PublicPython toolkit for corpus analysis: tokenization, lexical diversity, vocabulary growth prediction, entropy measures, and Zipf/Heaps visualizations.
Python 5
-
-
shannon
shannon PublicThis project uses KenLM to analyze language entropy and redundancy in English and Linear B.
Python
-
suxotin
suxotin PublicPython script that distinguishes vowels from consonants using Suxotin's algorithm.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.