Next Word Prediction

This repository contains machine learning models and code for predicting the next word in a sequence based on a variety of algorithms and methods, such as TF-IDF, Cosine Similarity, AdaBoost, and more. The project demonstrates how to use different techniques for text prediction tasks.

Features

Next Word Prediction using:
TF-IDF Multinominal Naive Bayes model / embedding with universal model encoder by Tensor Flow with LSTM model
Training and evaluating multiple models
Interactive Web Application using Flask
Support for different types of word embeddings and vectorizers

Requirements

Before running the code, ensure you have the following Python libraries installed:

Flask
pandas
numpy
scikit-learn
keras
tensorflow
nltk
gensim

You can install the dependencies using the following command:

pip install -r requirements.txt

additional step for universal model encoder from Tensor Flow :

in app.py uncomment this line

 embed = (hub.KerasLayer("https://tfhub.dev/google/universal-sentence-encoder/4"))

make sure to make a rep in your repository named universal_model_encoder_tf
navigate to here you find this rep and copy it in universal_model_encoder_tf

C:\\Users\\name\\AppData\\Local\\tfhub_modules

063d866c06683311b44b4992fd46fsfdsfdsf/
│
├── saved_model.pb                  
├── variables/                        
│   ├── variables.data-00000-of-00001             
│   └──  variables.index

they comment the line to avoide loading the model each time you run the app

Web interface by flask framework

Note that: the corpus used is directed to concepts of AI

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.idea		.idea
models		models
static		static
templates		templates
NWP-USE.keras		NWP-USE.keras
Next_Word_Prediction_using_Universal_Sentence_Encoder.ipynb		Next_Word_Prediction_using_Universal_Sentence_Encoder.ipynb
README.md		README.md
ada_boost_model.pkl		ada_boost_model.pkl
app.py		app.py
corpus.txt		corpus.txt
corpus.txtw_fjxdtw.part		corpus.txtw_fjxdtw.part
corpus2.txt		corpus2.txt
nb_model.joblib		nb_model.joblib
requirements.txt		requirements.txt
tfidf_vectorizer.joblib		tfidf_vectorizer.joblib
vocabulary.npy		vocabulary.npy
vocabulary_fo_rwr2vc.npy		vocabulary_fo_rwr2vc.npy
word2vec_model		word2vec_model

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Next Word Prediction

Features

Requirements

additional step for universal model encoder from Tensor Flow :

Web interface by flask framework

About

Languages

a-alhaouil/next_word_prediction

Folders and files

Latest commit

History

Repository files navigation

Next Word Prediction

Features

Requirements

additional step for universal model encoder from Tensor Flow :

Web interface by flask framework

About

Topics

Resources

Stars

Watchers

Forks

Languages