Skip to content

End to end income analysis and model deployment using streamlit

Notifications You must be signed in to change notification settings

Gopalkholade/End_to_end_income_analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Dataset Description

The dataset is credited to Ronny Kohavi and Barry Becker and was drawn from the 1994 United States Census Bureau data and involves using personal details such as education level to predict whether an individual will earn more or less than $50,000 per year.

The dataset provides 14 input variables that are a mixture of categorical, ordinal, and numerical data types. The complete list of variables is as follows:

  • age.
  • workclass.
  • final_weight.
  • education.
  • education_years.
  • marital_status.
  • occupation.
  • relationship.
  • race.
  • sex.
  • capital_gain.
  • capital_loss.
  • hours_per_week.
  • native_country.
  • category

After testing many models Adaboost is the only one with highest Accuracy Score.

At last we got an accuracy score of 86% on the model. which is quite good with this imbalance dataset.

About

End to end income analysis and model deployment using streamlit

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published