Skip to content
This repository has been archived by the owner on Jun 18, 2024. It is now read-only.

josephedward/exoplanet_data_ML

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Description

Comparison of Models using NASA Kepler data

Target

'koi_disposition' - The disposition in the literature towards this exoplanet candidate. One of CANDIDATE, FALSE POSITIVE, NOT DISPOSITIONED or CONFIRMED (i.e., likelihood that a given exoplanet is a true planet)

Resources

Preprocessing

  • Preprocess the dataset prior to fitting the model.
  • Perform feature selection and remove unnecessary features.
  • Use MinMaxScaler to scale the numerical data.
  • Separates the data into training and testing data with TrainTestSplit.

Currently Supported Models

  • Decision Tree - uses GridSearch to tune model parameters. [K-Nearest Neighbors, Support Vector Machine, Recurrent Neural Network - Keras/Tensorflow]

Denouement

1 . Decision tree ensemble (random forest) points to solar mass centroid offset of exoplanet mass as strongest predictor of planetary viability.