Requirements:
- Pandas
- Numpy
- re (python regular expressions module)
This is a full Data Analysis project from cleaning to visualization. Here we analyze New York City High School Data where we try to find the correlation between multiple factors and performance in the SAT or scholastic aptitude test. This test is designed as an entrance exam in the US to colleges and Universities.
We use a lot of data analysis techniques from data cleaning with regular expressions to exploratory data analysis to correlational coefficients and visualization.