A Data Analysis of a Video-Game Industry Dataset
The project has been done in the Jupyter enviroment coded in Python (3.8). All the required packages and libraries have been mentioned in requirement.txt for your referal.
The data file would contain 2 files and one of them will have the 'improved' tag, this dataset consists a modified version of the orignal dataset wherein I have searched for the missing publisher column values.
This project was a part of an evaluation assignment of Udacity's Data Scientist Nanodegree. The students were to select a dataset, come up with some questions and answer them through data analysis of the dataset, and post their findings through a blogpost. The dataset I chose is present on Kaggle, representing gaming industry information from 1984-2020 in reference to the video-game titles that have been released. The code answer's the following questions:
- Which Genre has become most popular?
- Which Platform has been the most popular to play games on?
- How the Sales Trend for games has been for the past 40 years?
- Publishers with most Global Sales along with which country offers the maximum of these sales?
The answers to the above questions and my findings can all be found in my blog post right here. Please do refer to it.
The orignal data set is by Gregory Smith who had done a web scrape of the VG Charts. Recently Rush Kiruni took the same dataset and added more columns to it.
If you do find this repository useful, why not give a star and even let me know about it!
Feel free to express issues and feedback as well, cheers!