This repository contains the code to run the NOCS model trained on the CHOC mixed-reality dataset. The code is adapted from the original implementation of Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation (NOCS).
[dataset] [webpage] [arxiv pre-print] [trained model]
Here, we detail the changes we made with respect to the original NOCS repository.
- Changed the dataloader and the number of categories, so that images from the CHOC dataset are loaded properly.
- Added EPnP as an alternative to Umeyama in the post-processing pose estimation step (a sketch follows below).
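As a rough illustration of the EPnP option, the 6D pose can be recovered from 2D-3D correspondences between the predicted NOCS coordinates and their pixel locations using OpenCV's EPnP solver. This is a minimal sketch under assumed inputs, not the repository's exact code; the function name, inputs, and intrinsics `K` are placeholders:

```python
import cv2
import numpy as np

def estimate_pose_epnp(nocs_points, pixel_coords, K):
    """Estimate a 6D pose from NOCS/pixel correspondences with EPnP.

    nocs_points:  (N, 3) predicted NOCS coordinates for one object.
    pixel_coords: (N, 2) matching 2D pixel locations in the RGB image.
    K:            (3, 3) pinhole camera intrinsics.
    """
    object_points = np.ascontiguousarray(nocs_points, dtype=np.float32)
    image_points = np.ascontiguousarray(pixel_coords, dtype=np.float32)

    # EPnP needs at least 4 correspondences and, unlike the Umeyama
    # alignment, no depth image.
    success, rvec, tvec = cv2.solvePnP(
        object_points, image_points, K.astype(np.float32), None,
        flags=cv2.SOLVEPNP_EPNP)
    if not success:
        return None

    R, _ = cv2.Rodrigues(rvec)  # axis-angle -> (3, 3) rotation matrix
    return R, tvec
```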
This code has been tested on an Ubuntu 18.04 machine with CUDA 11.6 and cuDNN 7.5.0, and the following software/libraries:
- Python 3.5
- Tensorflow 1.14.0
- Keras 2.3.0
- Anaconda/Miniconda 22.9.0
- Open3D 0.16.0
- SciPy 1.2.2
- OpenCV 4.4.0
- scikit-image 0.15.0
- Install the essentials
sudo apt-get update
sudo apt-get install build-essential libssl-dev libffi-dev python-dev
- Setup the conda environment (optional but strongly recommended)
Install Anaconda or Miniconda (please follow: https://docs.conda.io/en/latest/miniconda.html#linux-installers).
conda create --name choc-nocs-env python=3.5
conda activate choc-nocs-env
- Install the dependencies
Make sure to upgrade pip first:
pip install --upgrade pip
Install the libraries as follows (if there are errors, try installing the libraries one at a time):
pip install tensorflow-gpu==1.14.0 keras==2.3.0
python3.5 -m pip install opencv-python moviepy open3d scipy scikit-image cython "git+https://github.com/philferriere/cocoapi.git#egg=pycocotools&subdirectory=PythonAPI"
- Verify the installation on the CPU:
python3 -c "import tensorflow as tf; print(tf.reduce_sum(tf.random.normal([1000, 1000])))"
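To also check that TensorFlow can access the GPU (assuming the CUDA/cuDNN setup above), you can run:
python3 -c "import tensorflow as tf; print(tf.test.is_gpu_available())"
Note that `tf.test.is_gpu_available` is the TensorFlow 1.x API; it prints True when a GPU is visible.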
You can use the demo to run the model on your own RGB images (with optional depth). First, download the trained model from here.
The general command to run the demo is:
python demo.py --ckpt_path <path_to_model> --input_folder <path_to_inputs> --pp <post-processing_technique> --draw
Arguments:
- ckpt_path: local path to the trained model (.h5 format)
- input_folder: local path to the input folder
- output_folder: local path to the desired output folder; if unspecified, results are saved to an output sub-folder inside input_folder
- pp: post-processing technique to compute the 6D pose, umeyama or epnp (default: umeyama); a sketch of the Umeyama alignment follows this list
- draw: boolean flag to visualise the results
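For reference, the Umeyama option aligns the predicted NOCS coordinates with the points back-projected from the depth image by estimating a similarity transform (scale, rotation, translation). Below is a minimal NumPy sketch of the Umeyama (1991) alignment; it is illustrative, not the repository's exact implementation:

```python
import numpy as np

def umeyama_alignment(src, dst):
    """Similarity transform (s, R, t) mapping src -> dst, Umeyama (1991).

    src: (N, 3) points, e.g. predicted NOCS coordinates.
    dst: (N, 3) points, e.g. points back-projected from the depth image.
    Returns scale s, rotation R (3, 3), translation t (3,) such that
    dst ~= s * R @ src + t.
    """
    mu_src, mu_dst = src.mean(axis=0), dst.mean(axis=0)
    src_c, dst_c = src - mu_src, dst - mu_dst

    # Cross-covariance between the centred point sets.
    cov = dst_c.T @ src_c / src.shape[0]
    U, D, Vt = np.linalg.svd(cov)

    # Reflection handling keeps R a proper rotation (det(R) = +1).
    S = np.eye(3)
    if np.linalg.det(U) * np.linalg.det(Vt) < 0:
        S[2, 2] = -1.0

    R = U @ S @ Vt
    var_src = (src_c ** 2).sum() / src.shape[0]
    s = np.trace(np.diag(D) @ S) / var_src
    t = mu_dst - s * R @ mu_src
    return s, R, t
```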
The input folder should be structured as follows (note that depth is optional; it is only needed for the Umeyama post-processing):
input_folder
|--rgb
| |--0001.png
| |--0002.png
| | ...
|--depth
| |--0001.png
| |--0002.png
| | ...
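If helpful, a small check like the one below (a hypothetical helper, not part of the repository) can confirm the layout before running the demo:

```python
import os

def check_input_folder(input_folder, require_depth=False):
    """Check the demo input layout: rgb/ (required) and depth/ (optional).

    Depth is only needed for the Umeyama post-processing; when present,
    every RGB image should have a depth image with the same filename.
    """
    rgb_dir = os.path.join(input_folder, "rgb")
    depth_dir = os.path.join(input_folder, "depth")
    rgb_files = sorted(os.listdir(rgb_dir))
    if not rgb_files:
        raise ValueError("no images found in %s" % rgb_dir)
    if require_depth and not os.path.isdir(depth_dir):
        raise ValueError("depth/ is required for Umeyama but is missing")
    if os.path.isdir(depth_dir):
        depth_files = set(os.listdir(depth_dir))
        missing = [f for f in rgb_files if f not in depth_files]
        if missing:
            raise ValueError("missing depth for: %s" % ", ".join(missing))
    return rgb_files
```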
We provide a sample in sample_folder.
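For example, to run the demo on the provided sample with EPnP post-processing and visualisation (the model filename here is illustrative):
python demo.py --ckpt_path model.h5 --input_folder sample_folder --pp epnp --draw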
You can re-train the NOCS model on the CHOC dataset or on another dataset.
The general command to run the training is:
python train.py --dataset <dataset_type> --datapath <path_to_dataset> --modeldir <path_to_models> --weight_init_mode <weight_initialization> --gpu --calcmean
Arguments:
- dataset: type of dataset; CHOC or NOCS
- datapath: local path to the dataset folder
- modeldir: local path to the location of the stored models (usually /logs)
- weight_init_mode: weight initialisation technique; imagenet, coco or last (default: last)
- gpu: boolean flag to run on the GPU
- calcmean: boolean flag to calculate the RGB mean of the entire training dataset (a sketch follows this list)
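As a rough illustration of what --calcmean computes, the per-channel RGB mean over the training images can be obtained as below. This is a sketch assuming standard per-channel averaging; the path handling and function name are hypothetical, not the repository's API:

```python
import glob
import numpy as np
from skimage import io

def compute_rgb_mean(rgb_dir):
    """Average R, G, B values over all training images in rgb_dir.

    Accumulates per-channel sums and pixel counts, so images of
    different sizes are weighted by their number of pixels.
    """
    channel_sum = np.zeros(3, dtype=np.float64)
    pixel_count = 0
    for path in glob.glob(rgb_dir + "/*.png"):
        img = io.imread(path)[..., :3].astype(np.float64)  # drop alpha if any
        channel_sum += img.reshape(-1, 3).sum(axis=0)
        pixel_count += img.shape[0] * img.shape[1]
    return channel_sum / pixel_count  # per-channel mean, in [0, 255]
```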
For convenience, we also provide a bash script that runs this command. Change the variables for the arguments in the run_training.sh script and then run:
$ bash run_training.sh
- Python 3.5 reached the end of its life on September 13th, 2020 [DEPRECATION]
If you have any further enquiries, questions, or comments, or if you would like to file a bug report or a feature request, please use the GitHub issue tracker.
This work is licensed under the MIT License. To view a copy of this license, see LICENSE.