Singapore Resale Flat Prices Predicting

Overview

This project allows users to explore Singapore flat resale prices. After retrieving and cleaning the data, insights are gained through exploratory data analysis (EDA). Various regression models are then used to predict resale prices, and the final model is deployed on a user-friendly Render platform.

Render App - (https://singapore-resale-price-500j.onrender.com)

Key Technologies and Skills

Python Scripting
Streamlit
Data Cleaning and analysis
Pandas-Dataframe
Numpy
EDA Analysis
Scikit-Learn
Pickle

Installation

To run this project, please install below python packages as prerequisites.

pip install streamlit
pip install pandas
pip install plotly
pip install numpy
pip install scipy
pip install category_encoders
pip install scikit-learn

Usage

To use this project, Please follow the below steps.

To clone this repository: git clone https://github.com/Gokulakkrizhna/singapore-resale-price.git
Install the required packages: pip install -r requirements.txt
To extract and clean data:python Datacoll_dataclean.py
To perform EDA analysis:python statistical_eda_analysis.py
Run the Streamlit app: streamlit run Dataextract_visual.py
Access the app in your browser at http://localhost:8501

Data Source : (https://beta.data.gov.sg/collections/189/view)

Features

Fetch Singapore Resale data from Excel file
Data Cleaning and pre-processing
Perform EDA analysis
Perform Machine Leanrning analysis
User-friendly interface powered by Streamlit and Render

Approach

Data Collection: Extract Singapore Resale data from your local directory.

Data Cleaning: Perform pre-processing methods like Data handling is applied to the collected data.

Statistical Analysis: Cleaned numerical data has been analysed to get a valuable insights.

EDA Analysis: The cleaned data has been analyzed using various exploratory data analysis (EDA) techniques, revealing insights into current trends.

Setup the Streamlit app: Streamlit is a user-friendly web development tool that simplifies the process of creating intuitive interfaces.

Data Analysis: Cleaned data has been analyzed and visualized in Streamlit through Pandas DataFrame and Plotly.

Machine Learning: Cleaned data has been applied in different machine learning algorithm to predict the resale price.

The provided code utilizes Python scripting along with various libraries to fetch data from the our local directory. Data manipulation techniques are then applied, followed by exploratory data analysis (EDA) on the cleaned dataset to extract meaningful insights. Furthermore, the implementation includes a Streamlit web application to enhance user friendly dashboard and deployed in Render Platform.

Here's a breakdown of what the code does:

Importing all the neccessary libraries includes Streamlit which creates UI to interact with user and display the analysed data, Pandas which helps to display the analysed data in Streamlit web,numpy which will help in mathematical conversion,Plotly is employed to visualize the data and gain insights from it,Scipy aids in conducting exploratory data analysis (EDA) and extracting valuable insights, category_encoders used to convert categorical values to numerical data,pickle is used to load the trained model.

import pandas as pd
import category_encoders as ce
import numpy as np
from scipy.stats import spearmanr
import scipy.stats as stats
import streamlit as st
import pickle
import plotly.express as px
from plotly.subplots import make_subplots
import plotly.graph_objects as go

Datacoll_dataclean file is responsible for fetching data from the local directory and performing necessary data cleaning operations. Note: Replace your file name in df1,df2,df3,df4,df5,df6

df1 = pd.read_csv('please provide the file name here')
df2 = pd.read_csv('please provide the file name here')
df3 = pd.read_csv('please provide the file name here')
df4 = pd.read_csv('please provide the file name here')
df5 = pd.read_csv('please provide the file name here')
df6 = pd.read_csv('please provide the file name here')

analysis file is responsible for exploratory data analysis (EDA) and extracting valuable insights.
Dataextract_visual file is the complete analysis which is used to display the predicted data from ML model and display statistical, EDA and Vissual insights over the streamlit dashboard.
Dataextract_visual_render file used to deploy the predicted ML alone in the Render platform.
Four separate tabs have been implemented in the Streamlit web application to facilitate user interaction and enhance data visualization for insightful analysis.

tab1, tab2, tab3, tab4= st.tabs(["Home", "Statistical Insights","EDA Insights","Visual Insights"])

In Tab1 of the Streamlit web application,Users can input various data to receive the corresponding predicted Resale price.
In Tab2, Cleaned data has undergone Statistical analysis procedure to get a valuable insights.
In Tab3, exploratory data analysis (EDA) was conducted on the dataset to extract valuable insights, which are then presented to the user for further analysis and interpretation.
In Tab4, Cleaned data has been analyzed, and visualized to extract valuable insights.

This Python script streamlines the process of fetching data from a local directory, implementing crucial data cleaning procedures, visualized and conducting exploratory data analysis (EDA) to extract valuable insights. Analysed data was applied on ML to predict Flat resale price.

Contact

📧 Email: gokulakkrizhna@gmail.com

🌐 LinkedIn: linkedin.com/in/gokulakkrizhna-s-241562159

For any further questions or inquiries, feel free to reach out. We are happy to assist you with any queries.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
2.csv		2.csv
3.csv		3.csv
4.csv		4.csv
5.csv		5.csv
Datacoll_dataclean.py		Datacoll_dataclean.py
Dataextract_visual.py		Dataextract_visual.py
Dataextract_visual_render.py		Dataextract_visual_render.py
README.md		README.md
ar.pkl		ar.pkl
large_file_part1.csv		large_file_part1.csv
large_file_part2.csv		large_file_part2.csv
requirements.txt		requirements.txt
statistical_eda_analysis.py		statistical_eda_analysis.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Singapore Resale Flat Prices Predicting

Overview

Table of Contents

Key Technologies and Skills

Installation

Usage

Features

Approach

Contact

About

Releases

Packages

Languages

Gokulakkrizhna/singapore-resale-price

Folders and files

Latest commit

History

Repository files navigation

Singapore Resale Flat Prices Predicting

Overview

Table of Contents

Key Technologies and Skills

Installation

Usage

Features

Approach

Contact

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages