
Data Analysis and Visualisation on a Dataset containing U.S. births data for the years 1994 to 2014.

sunroofgod/dataanalysis-births-1994-2014

Data Visualisation & Analysis of the Number of Births in the US

Lian Kah Seng @sunroofgod

The Elephant in the Room

What is this project about? Here's a question for you: ever wondered how 'popular' your birthday is? In this project, I take an in-depth look at a dataset containing U.S. births data for the years 1994 to 2014.

About

  • View the source code here
  • In this project, I wanted to see whether I could find interesting trends and patterns in the day of the month, the day of the week, and the year in which a person was born.
  • This also allowed me to go through the basics of Pandas, Matplotlib and NumPy again (as a refresher).
  • This project also covers my thought process for doing basic data analysis on a given dataset.
    • This includes: data cleaning, how to structure your data using Pandas, indexing, filtering, graphing techniques, observation of trends, and types of graphs and their advantages.
  • The Dataset was obtained from FiveThirtyEight. (TYSM!)
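
To make the kind of grouping described above concrete, here is a minimal sketch in Pandas. It is not taken from the notebook: the helper names are hypothetical, the column names (year, month, date_of_month, day_of_week, births) are an assumption about the FiveThirtyEight CSV layout, and the rows are made-up placeholder values, not real birth counts.

```python
import pandas as pd

# Made-up rows for illustration only; real figures come from the dataset itself.
# Assumed column layout: year, month, date_of_month, day_of_week, births.
sample = pd.DataFrame(
    {
        "year": [1994, 1994, 1994, 1994, 1995, 1995],
        "month": [1, 1, 1, 1, 1, 1],
        "date_of_month": [1, 2, 3, 4, 1, 2],
        "day_of_week": [6, 7, 1, 2, 7, 1],
        "births": [8000, 7800, 10000, 11000, 7500, 10500],
    }
)


def mean_births_by(df, column):
    """Average number of births for each distinct value of `column`."""
    return df.groupby(column)["births"].mean().sort_index()


def total_births_by_year(df):
    """Total number of births recorded in each year."""
    return df.groupby("year")["births"].sum()


by_weekday = mean_births_by(sample, "day_of_week")
by_year = total_births_by_year(sample)

print(by_weekday)
print(by_year)
```

The same `mean_births_by` helper can be pointed at `date_of_month` to compare days of the month, and either result can be passed to Matplotlib through `by_weekday.plot(kind="bar")` if a chart is wanted.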

Setup

git clone https://github.com/sunroofgod/data_visualisation.git

Make sure the following dependencies, listed in requirements.txt, are installed:

python3==3.9.7
pandas==1.4.1
numpy==1.20.3

You can run the Jupyter Notebook main.ipynb on your local machine or Google Colab.
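
Before opening the notebook, it can help to confirm that the installed packages match the pinned versions. The sketch below is one possible way to do that with the standard library; the `PINNED` mapping and both helper functions are hypothetical names introduced here, and only the pandas and numpy pins are copied from the list above.

```python
from importlib.metadata import PackageNotFoundError, version

# Pins copied from the dependency list above (the Python version is not a pip package).
PINNED = {"pandas": "1.4.1", "numpy": "1.20.3"}


def installed_version(package):
    """Return the installed version string, or None if the package is missing."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def check_pins(pins):
    """Map each package name to a (pinned, installed, matches) tuple."""
    report = {}
    for name, pinned in pins.items():
        found = installed_version(name)
        report[name] = (pinned, found, found == pinned)
    return report


for name, (pinned, found, ok) in check_pins(PINNED).items():
    status = "ok" if ok else "differs"
    print(f"{name}: pinned {pinned}, installed {found} ({status})")
```

A mismatch does not necessarily mean the notebook will fail; it only flags that the environment differs from the one listed.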
