Skip to content

nltk, regex, tf-idf, count vector, feature engineering, grid search, Random Forest and Gradient Boost models.

Notifications You must be signed in to change notification settings

evgenygrobov/nlp_spam_detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

nlp_spam_detection

Natural Languege Processing

TODO

nltk tf-idf countvector regex feature engineering

picture of body length

picture of body punct%

cross validation grid search machine learning algorithms RF and GB

Enference

model/score Fit time Predict time Precision Recall Accuracy
Random Forest 4.375 1.384 0.984 0.814 0.972
Gradient Boost 781.057 0.376 0.901 0.872 0.969

About

nltk, regex, tf-idf, count vector, feature engineering, grid search, Random Forest and Gradient Boost models.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published