This repository contains the code for the paper "Data Lifecycle Management in Evolving Input Distributions for Learning-based Aerospace Applications" by Somrita Banerjee, Apoorva Sharma, Edward Schmerling, Max Spolaor, Michael Nemerouf, and Marco Pavone, presented at the AI4Space Workshop, ECCV 2022.
- ExoRomper -- A subset of ExoRomper images and ground truth labels is available in this repository in `datasets/exoromper`. The results of the paper can be reproduced using this subset. For the full ExoRomper dataset, please contact the authors at somrita@stanford.edu.
- SPEED -- The dataset can be downloaded from https://kelvins.esa.int/satellite-pose-estimation-challenge/data/.
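To poke at the ExoRomper subset directly, something like the sketch below should work. The `labels.json` filename and its fields are assumptions made here for illustration; check `datasets/exoromper` for the actual layout.

```python
import json
from pathlib import Path

from PIL import Image

# Hypothetical layout: images plus a JSON label file inside datasets/exoromper.
# The "labels.json" name and its fields are assumptions; inspect the directory
# for the actual structure before relying on this.
root = Path("datasets/exoromper")
with open(root / "labels.json") as f:
    labels = json.load(f)

sample = labels[0]
img = Image.open(root / sample["filename"])
print(img.size, list(sample.keys()))
```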
- `visualize_images.ipynb`: a Jupyter notebook that plots images from the datasets, both in their original form and modified with added degraded pixels.
- `visualize_pose.ipynb`: a Jupyter notebook for inspecting the datasets: it plots example images and visualizes the pose labels by projecting the body axes onto each image.
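The axis overlay follows the standard pinhole-camera projection: rotate the body-frame axes by the pose rotation, translate, and apply the camera intrinsics. A minimal self-contained sketch, assuming a `(w, x, y, z)` quaternion convention and a known intrinsics matrix `K` (both are assumptions; the notebook's actual conventions may differ):

```python
import numpy as np

def project_axes(q, t, K, scale=0.1):
    """Project the body-frame x/y/z axes into the image for pose (q, t).

    q: unit quaternion (w, x, y, z) rotating the body frame into the camera frame.
    t: 3-vector position of the body origin in the camera frame.
    K: 3x3 camera intrinsics matrix.
    Returns a (4, 2) array: projected origin, then the three axis tips.
    """
    w, x, y, z = q
    # Rotation matrix from the unit quaternion.
    R = np.array([
        [1 - 2*(y*y + z*z), 2*(x*y - w*z),     2*(x*z + w*y)],
        [2*(x*y + w*z),     1 - 2*(x*x + z*z), 2*(y*z - w*x)],
        [2*(x*z - w*y),     2*(y*z + w*x),     1 - 2*(x*x + y*y)],
    ])
    pts_body = np.vstack([np.zeros(3), scale * np.eye(3)])  # origin + axis tips
    pts_cam = pts_body @ R.T + t     # into the camera frame
    uvw = pts_cam @ K.T              # apply intrinsics
    return uvw[:, :2] / uvw[:, 2:3]  # perspective divide
```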
Run the Jupyter notebook `train.ipynb`.

- Training functions are in `scripts/train_functions.py`.
- Utility functions are in `scripts/utils.py`.
- Model checkpoints, the best model, and the losses are stored in `checkpoint/`, `best_model/`, and `losses/`, respectively; a rough sketch of this loop follows.
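The sketch below shows the general shape of such a training loop and where the three output directories come in. The model, data, and helper names are placeholders, not the API of `scripts/train_functions.py`.

```python
from pathlib import Path

import torch
from torch import nn, optim

# Illustrative stand-ins; the real model, data, and training helpers come
# from train.ipynb and scripts/train_functions.py.
model = nn.Linear(10, 7)                           # placeholder pose regressor
optimizer = optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()
x, y = torch.randn(32, 10), torch.randn(32, 7)     # fake batch

for d in ("checkpoint", "best_model", "losses"):
    Path(d).mkdir(exist_ok=True)

best_loss, losses = float("inf"), []
for epoch in range(5):
    optimizer.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()
    optimizer.step()
    losses.append(loss.item())

    # Latest checkpoint goes to checkpoint/; best model so far to best_model/.
    torch.save({"epoch": epoch, "model": model.state_dict()}, "checkpoint/ckpt.pt")
    if loss.item() < best_loss:
        best_loss = loss.item()
        torch.save(model.state_dict(), "best_model/model.pt")

# Loss history goes to losses/.
torch.save(losses, "losses/losses.pt")
```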
Run the Jupyter notebook `apply_scod.ipynb`.

- SCOD is the Sketching Curvature for Out-of-Distribution Detection algorithm developed in Sharma, Apoorva, Navid Azizan, and Marco Pavone. "Sketching curvature for efficient out-of-distribution detection for deep neural networks." Uncertainty in Artificial Intelligence. PMLR, 2021.
- The SCOD code is in `scod/`; it is also available on GitHub.
- Applies a SCOD wrapper to the last parameters of the trained model.
- Plots the mean squared error (MSE) of the model predictions (predicted - true).
- Plots the SCOD uncertainty estimate.
- Shows that high SCOD uncertainty correlates with high prediction MSE.
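In outline, the notebook wraps the trained network and queries it for per-input uncertainty. The sketch below is modeled on the public scod-module interface and is only an approximation; the constructor arguments and return values of the copy in `scod/` may differ, so treat `apply_scod.ipynb` as the authoritative usage.

```python
from scod import SCOD  # local copy in scod/

# `model` and the datasets are assumed to come from the training step above.
# The args dict here is an assumed example, not the repository's settings.
unc_model = SCOD(model, args={"num_eigs": 10})

# Offline phase: sketch the loss curvature over the training data
# (restricted, per the notebook, to the last parameters of the model).
unc_model.process_dataset(train_dataset)

# Online phase: a forward pass returns predictions together with a
# per-input uncertainty score; high uncertainty should track high MSE.
preds, unc = unc_model(test_inputs)
```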
Run the Jupyter notebook `evaluation_benchmark.ipynb`.

- Randomly selects a benchmark set of images from a configurable mix of categories.
- Loads the trained model and applies the SCOD wrapper.
- Compares the data lifecycle performance of multiple flagging algorithms (including DS-SCOD).
- Benchmark functions are in `scripts/benchmark_functions.py`; interface details are below.
- Output costs and accuracies are stored in `saved_data/`.
Note on the benchmark interface.

Inputs:

- `flaggers_to_test`: List of flagging algorithms to compare. Each algorithm should be defined as a lambda function that takes in a sequence `x` and returns a list of boolean flags with length equal to that of `x` (see the sketch after the examples below). Examples:
  - Naive_false -- flags none of the images, always returns false: `lambda x: [False for i in range(len(x))]`
  - Naive_true -- flags all of the images, always returns true: `lambda x: [True for i in range(len(x))]`
  - Random with a budget of 10, i.e. select 10 images from each batch randomly: `lambda x: random_flagger(x, flag_limit=10, seed=49)`
  - SCOD with a budget of 10, i.e. select the 10 images from each batch with the greatest uncertainty: `lambda x: scod_flagger(x, unc_model, flag_limit=10)`
  - DS-SCOD with a budget of 10, i.e. select up to 10 images from each batch that maximize information gain: `lambda x: ds_scod_flagger(x, unc_model, flag_limit=10)`
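Any callable with the same signature can be added to the list. For instance, a hypothetical budget-limited random flagger (an illustrative reimplementation; the repository's own flaggers presumably live in `scripts/benchmark_functions.py`) might look like:

```python
import random

def budget_random_flagger(x, flag_limit=10, seed=49):
    """Flag up to flag_limit randomly chosen items of the sequence x.

    Matches the flagger interface above: sequence in, list of booleans out,
    one flag per element of x.
    """
    rng = random.Random(seed)
    chosen = set(rng.sample(range(len(x)), min(flag_limit, len(x))))
    return [i in chosen for i in range(len(x))]

flaggers_to_test = [
    lambda x: [False for i in range(len(x))],           # Naive_false
    lambda x: budget_random_flagger(x, flag_limit=10),  # custom flagger
]
```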
- `load_model_path`: Path to the saved model
- `test_seq`: Test images (batched or sequential)
- `labels`: Ground truth labels

Outputs:

- `costs`: Cumulative cost of labeling (1.0 for each image labeled)
- `accs`: Rolling prediction accuracy (currently evaluated as negative MSE loss), averaged over each batch
- `accs_std`: Standard deviation of the prediction accuracy
Run the Jupyter notebook `visualize_benchmark_results.ipynb`.

- Compares the different algorithms by plotting average model performance vs. cumulative labeling cost.
- Shows time-series performance on breakouts such as only space images, only earth images, etc.
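For a quick offline look at the saved outputs, a minimal matplotlib sketch of the cost-vs-performance plot. The curves here are arbitrary placeholders, not results from the paper, and the actual file format in `saved_data/` is not assumed:

```python
import matplotlib.pyplot as plt
import numpy as np

# Arbitrary placeholder curves standing in for the costs/accs arrays
# saved to saved_data/ by evaluation_benchmark.ipynb.
rng = np.random.default_rng(0)
results = {
    name: (np.cumsum(rng.uniform(0.0, 10.0, 20)),   # cumulative labeling cost
           np.cumsum(rng.normal(0.01, 0.02, 20)))   # rolling accuracy
    for name in ["Random_10", "SCOD_10", "DS-SCOD_10"]
}

for name, (costs, accs) in results.items():
    # Average performance achieved vs. cumulative labeling cost incurred.
    plt.plot(costs, accs, label=name)

plt.xlabel("Cumulative labeling cost")
plt.ylabel("Rolling accuracy (negative MSE loss)")
plt.legend()
plt.show()
```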