These are implementations of ML models, starting from bigrams and going all the way to transformers, following along with Andrej Karpathy's YouTube series.
There are two major differences:
- In the videos, word prediction is done; here we try to predict entire sentences instead
- Training and prediction are done for the Marathi language instead of English
The dataset used for this task: https://objectstore.e2enetworks.net/ai4b-public-nlu-nlg/indic-corp-frozen-for-the-paper-oct-2022/mr.txt
Notebooks and their essence:
- bigram.ipynb => A simple bigram model, but at the word level. In the video the next character is predicted from a single character of input; here we predict the next word from a single word of input (see the count-based sketch after this list). Note that the vocabulary here consists of words, so it is much larger than the 27-character vocabulary in the videos, and as expected it produces terrible results.
- simple_bigram_network.ipynb => Neural-network implementation of bag_of_words.ipynb. Note that the vocabulary here still consists of words rather than characters, so the vocab size is very large, 50k+, comparable to GPT-2's token vocabulary :p. This one also predicts an entire sentence, one word at a time. As expected, it gives terrible results.
- one_char_context_network.ipynb => Same as simple_bigram_network.ipynb, but remedies the problem of using words as the vocabulary: it uses characters instead, so the vocabulary is now a manageable ~400. In keeping with the theme of this repo, this one also tries to predict entire sentences.
- k_char_context_network.ipynb => Same as one_char_context_network.ipynb, the main difference being that it uses the last k characters as context instead of just the last one (see the MLP sketch after this list).
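
Below is a minimal sketch of the word-level, count-based bigram idea behind bigram.ipynb: count consecutive-word pairs over mr.txt and then sample a sentence one word at a time. The whitespace tokenization, the `<S>` boundary marker, and the use of a dict of Counters (to avoid a dense 50k x 50k count matrix) are illustrative assumptions, not necessarily what the notebook does.

```python
# Sketch of a word-level count-based bigram sentence sampler (assumptions noted above).
import random
from collections import Counter, defaultdict

sentences = open("mr.txt", encoding="utf-8").read().splitlines()

# Count how often each word is followed by each other word.
counts = defaultdict(Counter)
for s in sentences:
    toks = ["<S>"] + s.split() + ["<S>"]   # "<S>" marks both sentence start and end
    for w1, w2 in zip(toks, toks[1:]):
        counts[w1][w2] += 1

# Sample a whole sentence: start at "<S>", stop when "<S>" is drawn again.
random.seed(42)
word, out = "<S>", []
while True:
    nxt = counts[word]
    word = random.choices(list(nxt.keys()), weights=list(nxt.values()))[0]
    if word == "<S>" or len(out) > 30:     # cap length so bad samples terminate
        break
    out.append(word)
print(" ".join(out))
```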
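And here is a rough sketch of the k-character-context network idea (k_char_context_network.ipynb; one_char_context_network.ipynb is essentially the k = 1 case), in the spirit of the makemore MLP: embed the last k characters, concatenate the embeddings, and predict the next character until an end-of-sentence marker is sampled. The hyperparameters, layer sizes, dataset slice, and boundary-marker handling below are assumptions for illustration, not the notebook's exact choices.

```python
# Sketch of a k-character-context MLP for sentence generation (assumptions noted above).
import torch
import torch.nn.functional as F

k = 4                                        # context length in characters
sentences = open("mr.txt", encoding="utf-8").read().splitlines()[:10000]

chars = sorted(set("".join(sentences)))
stoi = {c: i + 1 for i, c in enumerate(chars)}
itos = {i: c for c, i in stoi.items()}
V = len(chars) + 1                           # index 0 reserved as sentence boundary

# Build (k-char context) -> (next char) training pairs; 0 also means "end of sentence".
X, Y = [], []
for s in sentences:
    ctx = [0] * k
    for ch in s:
        X.append(ctx)
        Y.append(stoi[ch])
        ctx = ctx[1:] + [stoi[ch]]
    X.append(ctx)
    Y.append(0)
X, Y = torch.tensor(X), torch.tensor(Y)

# Tiny MLP: embedding -> hidden tanh layer -> logits over the character vocab.
g = torch.Generator().manual_seed(42)
emb_dim, hidden = 16, 128
C  = torch.randn((V, emb_dim), generator=g)
W1 = torch.randn((k * emb_dim, hidden), generator=g) * 0.1
b1 = torch.zeros(hidden)
W2 = torch.randn((hidden, V), generator=g) * 0.01
b2 = torch.zeros(V)
params = [C, W1, b1, W2, b2]
for p in params:
    p.requires_grad_(True)

# Plain SGD on mini-batches.
for step in range(10000):
    ix = torch.randint(0, X.shape[0], (64,), generator=g)
    h = torch.tanh(C[X[ix]].view(-1, k * emb_dim) @ W1 + b1)
    loss = F.cross_entropy(h @ W2 + b2, Y[ix])
    for p in params:
        p.grad = None
    loss.backward()
    for p in params:
        p.data += -0.1 * p.grad

# Generate one sentence, character by character, until the boundary marker is sampled.
ctx, out = [0] * k, []
while True:
    h = torch.tanh(C[torch.tensor([ctx])].view(1, -1) @ W1 + b1)
    probs = F.softmax(h @ W2 + b2, dim=1)
    ix = torch.multinomial(probs, num_samples=1, generator=g).item()
    if ix == 0 or len(out) > 200:
        break
    out.append(itos[ix])
    ctx = ctx[1:] + [ix]
print("".join(out))
```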