Skip to content

Latest commit

 

History

History
77 lines (55 loc) · 3.54 KB

README.md

File metadata and controls

77 lines (55 loc) · 3.54 KB
The project is currently under ACTIVE development phase with the most recent major update on Jun 4'2018.

About the Project

About Description
Overview This is a Machine Learning application powered by Python.
Application Nature This is a backend application. No frontend is attached to the application yet.
Objective This repository helps us understand analysing Data retrieved from online source by using Machine Learning algorithm based on given requirements. By practicing such application, developers should have a basic level of understanding on how to structure a real life machine learning application.
Target Audience This project targets the beginners of Machine Learning who have basic knowledge on Machine Learning algorithms, and would like to see how to implement such algorithm in real life example.

Primary Technologies/Concepts Used

Topics What is It Links
PhantomJS --- ---
Beautiful Soup --- ---
Sellenium Testing tool ---
Pandas, Numpy Python library for data science ---
Matplotlib Python package for data visualization ---
DOM Document Object Model ---

Know before you Go

  • Structure of a general machine learning application
  • What is User Agent
  • Use of Sellenium
  • Use of DOM

Technical Description

There are four/five steps in this process:

Step Function Description
#1 Sourcing airfare pricing data To get the primary data, we will use Google's Flight Tracker which is called Google Flight Explorer. Once we set an origin and destination, it gives us a series of pricing data within a given time frame which ultimately helps us understand the changes in airfares over a period of time. For example, the following link will give you the airfare prices from Washington DC to the major cities within US for a 3-5 days trip. This is our primary source of data for this project.
#2 Retrieving the fare data Used web scraping technique to retrieve the data from the data source. Python has a number of web scraping libraries such as beautifulsoup, requests, scrap. For this process, we need a browser to retrieve the data.
#3 Parsing the DOM to extract pricing data ---
#4 Sending real-time alerts using IFTTT ----

How to Run

Please follow the following steps to run the application on your local machine

Support & Disclaimer

Support

Found a bug?? Here are the options

  • Please file an issue with detailed description.
  • If you know a possible solution, please create a new brnach, update the code and then submit pull request.
  • If you would like to reach out to me directly with any question, email me at ariful.haque.uc@gmail.com

General Disclaimer

This is my personal project and not an official product of any company. If you would like to use this code, please keep it in your mind that, although I have tried to make it as error-free as possible, there's no warranty of a 100% bug free application.

References