This repo holds the notebooks and source files needed to recreate our results and, if desired, train the three different networks. The dataset we used is NOT provided, as it is confidential.
Instead, we provide 10 manually annotated images in the `data` directory that mimic the format of the given dataset. Although that directory contains both a `test.npy` and a `train.npy`, these are the exact same array and should be viewed as placeholders; training on `train.npy` will not yield comparable results. When evaluating the models with `evaluation.ipynb`, performance is measured on these 10 test images (as opposed to the 30 images used in the report).
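If you want to inspect the placeholder arrays outside the notebooks, they can be loaded directly with NumPy. This is only a convenience sketch; the exact array layout (images vs. image/mask pairs) is defined by the notebooks, so the comment below is an assumption.

```python
import numpy as np

# Both files currently contain the same placeholder array (see note above).
test_data = np.load("data/test.npy", allow_pickle=True)
train_data = np.load("data/train.npy", allow_pickle=True)

# Assumed layout: one entry per manually annotated image (10 in total).
print(type(test_data), getattr(test_data, "shape", None))
```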
The structure of the repo is as follows:
```
.
├── data         # holds all data (10 manually generated images)
├── hpc          # holds the same structure as used during training with HPC
├── images       # holds images for the readme
├── notebooks    # holds all the Jupyter notebooks (as remote)
└── src          # holds all python source files (as remote)
```
To reproduce our results, perform the following steps:

- Make sure all libraries listed in `requirements.txt` are installed.
- Clone the repo.
- Download the trained models by running `download_extract_models.py` from the root directory:

  ```
  python3 src/download_extract_models.py
  ```

  Note: the trained models require 965 MB of free disk space. This first downloads the trained models and then extracts them, creating the folder `trained_models` where they will all be located. Important! This step must be performed before the evaluation notebook can be run.
- Finally, run the notebook `evaluation.ipynb`. Its final cell lets you evaluate each model's performance, as well as view its prediction on one of the images from the test set.
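If you would rather experiment outside `evaluation.ipynb`, the downloaded checkpoints can in principle be loaded directly. The sketch below assumes the files in `trained_models` are ordinary PyTorch checkpoints, and the filename `unet.pt` is purely hypothetical; list the folder to find the actual names.

```python
import torch

# Hypothetical checkpoint name -- list trained_models/ to see the actual files.
checkpoint_path = "trained_models/unet.pt"

# Assumption: the checkpoints are regular PyTorch saves (a state dict or a full model).
checkpoint = torch.load(checkpoint_path, map_location="cpu")
print(type(checkpoint))
```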
For this project, three models have been implemented and trained on the provided dataset: U-Net, U-Net with ResNeSt as encoder, and a third model built from two nested U-structures (described below).
Our implementation of the U-Net model follows the original architecture, as described in this paper. It consists of a symmetric encoder-decoder structure, giving the U shape seen above. The network uses skip connections so that it can progressively extract features along the downsampling path while preserving as much spatial information as possible in the decoder.
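As an illustration of this encoder-decoder-with-skips idea (not our exact implementation, which follows the paper's channel counts and depth), a minimal PyTorch sketch could look as follows:

```python
import torch
import torch.nn as nn

def double_conv(in_ch, out_ch):
    # Two 3x3 convolutions with ReLU: the basic U-Net building block.
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True),
        nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True),
    )

class TinyUNet(nn.Module):
    # Simplified two-level U-Net: encoder, bottleneck, decoder with skip connections.
    def __init__(self, in_ch=3, num_classes=2):
        super().__init__()
        self.enc1 = double_conv(in_ch, 32)
        self.enc2 = double_conv(32, 64)
        self.pool = nn.MaxPool2d(2)
        self.bottleneck = double_conv(64, 128)
        self.up2 = nn.ConvTranspose2d(128, 64, 2, stride=2)
        self.dec2 = double_conv(128, 64)   # 64 (skip) + 64 (upsampled)
        self.up1 = nn.ConvTranspose2d(64, 32, 2, stride=2)
        self.dec1 = double_conv(64, 32)    # 32 (skip) + 32 (upsampled)
        self.head = nn.Conv2d(32, num_classes, 1)

    def forward(self, x):
        e1 = self.enc1(x)                                      # full resolution
        e2 = self.enc2(self.pool(e1))                          # 1/2 resolution
        b = self.bottleneck(self.pool(e2))                     # 1/4 resolution
        d2 = self.dec2(torch.cat([self.up2(b), e2], dim=1))    # skip connection
        d1 = self.dec1(torch.cat([self.up1(d2), e1], dim=1))   # skip connection
        return self.head(d1)                                   # per-pixel class logits
```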
Our second model replaces the encoder, i.e. the downsampling path, of the original U-Net with a Split-Attention Network (ResNeSt). This architecture stacks several split-attention blocks, as seen above, in ResNet style to form what is known as ResNeSt. The implementation is inspired by the original paper, with the addition of the skip connections needed for the U-Net decoder.
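To make the split-attention mechanism concrete, here is a heavily simplified PyTorch sketch using radix 2, no cardinality/grouped convolutions and no normalisation layers. It illustrates the mechanism only and is not our ResNeSt encoder; the class name and the `reduction` parameter are made up for this example.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SplitAttention(nn.Module):
    # Simplified split attention: the input is mapped to `radix` parallel splits,
    # and per-channel attention weights (softmax over the radix axis) decide how
    # the splits are recombined.
    def __init__(self, channels, radix=2, reduction=4):
        super().__init__()
        self.radix = radix
        self.conv = nn.Conv2d(channels, channels * radix, 3, padding=1)
        inter = max(channels // reduction, 8)
        self.fc1 = nn.Conv2d(channels, inter, 1)
        self.fc2 = nn.Conv2d(inter, channels * radix, 1)

    def forward(self, x):
        b, c = x.shape[0], x.shape[1]
        splits = self.conv(x).view(b, self.radix, c, *x.shape[2:])  # (B, r, C, H, W)
        gap = splits.sum(dim=1).mean(dim=(2, 3), keepdim=True)      # global context, (B, C, 1, 1)
        attn = self.fc2(F.relu(self.fc1(gap)))                      # (B, r*C, 1, 1)
        attn = F.softmax(attn.view(b, self.radix, c, 1, 1), dim=1)  # softmax over the radix axis
        return (attn * splits).sum(dim=1)                           # recombined output, (B, C, H, W)
```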
The final model is also implemented like the original, with only minor changes. It consists of two nested U-structures, with the aim of extracting even finer details and relationships in the data, which makes it well suited for segmentation tasks.
| Model | mIoU | Pixel accuracy | Runtime |
|---|---|---|---|
| U-Net | 51.1% | 88.3% | 264 ms |
| ResNeSt-50 | 50.6% | 88.5% | 68 ms |
| ResNeSt-101 | 54.6% | 90.0% | 83 ms |
| ResNeSt-200 | 56.1% | 90.2% | 124 ms |
| Final model (nested U-structures) | 59.5% | 90.6% | 402 ms |
Above, each model's performance on one of the images from the original test data can be seen. The model using ResNeSt was trained with three different depths, i.e. numbers of layers: 50, 101 and 200 respectively.

The combined performance on all 30 test images is shown in the table, which displays the mean intersection over union (mIoU, i.e. how much of the predicted mask overlaps the ground-truth mask for each class), the pixel accuracy and the runtime of the models.
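For reference, both metrics can be computed from a per-pixel confusion matrix. The sketch below shows one common way to do this with NumPy; the function name and the handling of classes that do not occur in an image are assumptions, so the evaluation notebook may compute the numbers slightly differently.

```python
import numpy as np

def miou_and_pixel_accuracy(pred, target, num_classes):
    """pred, target: integer class maps of identical shape."""
    pred = pred.ravel().astype(np.int64)
    target = target.ravel().astype(np.int64)
    # Confusion matrix: rows = ground-truth class, columns = predicted class.
    cm = np.bincount(target * num_classes + pred,
                     minlength=num_classes ** 2).reshape(num_classes, num_classes)
    tp = np.diag(cm).astype(float)
    union = cm.sum(axis=0) + cm.sum(axis=1) - tp
    iou = tp / np.maximum(union, 1)      # per-class intersection over union
    miou = iou[union > 0].mean()         # average only over classes that actually occur
    pixel_acc = tp.sum() / cm.sum()      # fraction of correctly classified pixels
    return miou, pixel_acc
```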