Welcome to my data portfolio! Here, I document a summary of my projects in the data field.
Project Link | Completion Date | Tools | Project Description |
---|---|---|---|
π Uber Taxi | May 2023 | Python, GCP (Storage, Compute Engine, BigQuery), Mage, Looker Studio | Developed and implemented an end-to-end ETL pipeline for processinsg NYC Trip Record data. The pipeline encompassed extracting raw data, performing data transformation using Python, applying fact and dimensional data modelling techniques, orchestrating the pipeline on Mage, and ultimately creating a dashboard using Looker Studio. |
πΆ Dog Adoption | Mar 2023 | Python, PostgreSQL, Jupyter Notebook | Designed, created, and deployed a custom data model for a dog adoption data set using Python and PostgreSQL on Jupyter Notebook. |
Project Link | Area of Analysis | Project Description |
---|---|---|
π‘ 8-Week SQL Challenges | Data analysis, data cleaning, data transformation | This repo serves as the solution for the 8 case studies from the #8WeekSQLChallenge. It showcases my ability to tackle various SQL challenges and demonstrates my proficiency in SQL query writing and problem-solving skills. |
π©π»ββοΈ Health Analytics Case Study | Health analysis | I answer business questions related to patients data, such as average and median measurements per user, types of measurements for active users, and median blood pressure values for users. |
π¦ Covid-19 and the Impact on Malaysia Stock Market | Data cleaning, data analysis | A project close to π‘ home. Inspired by Alex Freberg's Data Exploration Project, I analysed global and local Covid-19 cases & the impact on Malaysia stock market from Jan 2020 to Jul 2021 using SQL and Tableau. |
Project Link | Area | Project Description | Libraries |
---|---|---|---|
π©π»βπ» CS50P - Ongoing | Programming | This repo contains the solution to the problem sets in Harvardx CS50P Introduction to Programming with Python. | - |
πΊ TMDb Movie Analysis | Data Wrangling & EDA | I analysed more than 10,000 TMDb movies and getting the answers to - Which actor(s) is associated with higher revenue and profit, Does a higher budget constitute to a higher profit, and Which director produced the highest grossing movie? | pandas, matplotlib |
β½οΈ Fuel Economy | Data Wrangling & EDA | Analysis on vehiclesβ fuel economy in 2008 and 2018 to understand usage of alternative sources of fuel, changes in greenhouse gas and smog ratings over the decade, and vehicle features associated with better fuel economy. | pandas, matplotlib |
π· Wine Quality | Data Wrangling & EDA | A study on red and white wine samples and understanding whether certain types of wine and their qualities (alcohol level, sugar content and acidity level) are associated with higher wine quality. | pandas, matplotlib |
π€ Explore Weather Trends | Time-series analysis | In this time-series analysis, I use moving average method to analyze local and global temperature data and compare the temperature trends where I live to overall global temperature trends. | pandas, matplotlib |
π Super Store Analysis | EDA | Analysis of sales data to find out highest revenue and profit product categories and top customer segments. | pandas, matplotlib, seaborn |
ππ»ββοΈ Bellabeat Fitness Tracking Analysis | EDA | Discovered insights into whether users are using the FitBit app for tracking health habits, their frequency of usage across the week and whether there is correlation between the hours logged, number of steps taken and calories burnt. | pandas, matplotlib, seaborn |
Project Link | Project Description | Dashboard Link |
---|---|---|
π¦ Maven Unicorn Challenge | Cleansed and transformed data on privately-owned companies (start-ups) valued at over $1 billion using Python. Visualised key insights using Tableau, including the timeline of valuations, the top 10 countries and investors with the highest valuations, the most successful unicorns, and the average time it takes to reach a $1 billion valuation. | Dashboard |
π¦ Covid-19 and the Impact on Malaysia Stock Market | A project close to π‘ home. Inspired by Alex Freberg's Data Exploration Project, I analysed global and local Covid-19 cases in Malaysia and the impact on the KLSE stock market from Jan 2020 to Jul 2021 using SQL and Tableau. | Dashboard |
Looking to learn SQL for data analysis but don't know where to start?
Check out my Linkedin post and GitHub guide where I've compiled a comprehensive list of free SQL resources! From YouTube videos to interactive websites, courses, practice sites, and projects, this list has got you covered.
Are you keen on pursuing a career in data analytics, but feeling lost on how to take the first steps?
Explore my comprehensive repo here, which contains all the essential resources you require to develop the technical expertise in SQL, Python, and Tableau!
Are you new to GitHub and wondering how to showcase your coding skills to potential employers or clients? Look no further!
My step-by-step tutorial here will guide you through creating a professional portfolio on GitHub.
In my guide, you will learn:
- How to create your profile on GitHub and add relevant information
- How to customize Markdown files to create a visually appealing portfolio
- How to create a new repository for each project and add project details and code
- Follow these steps and you'll have an impressive portfolio to showcase your coding projects in no time!