Skip to content

haruishi43/py-edge-eval

Repository files navigation

Python Edge Evaluation Tools

Edge detection tasks heavily rely on the original codes introduced in the BSDS300/500 benchmark which were targeted for MATLAB users. In the field of computer vision, various edge detection algorithms are now resorting to Python which supports various machine learning libraries such as PyTorch and Tensorflow. However, not everyone has access to MATLAB and the original benchmark codes are outdated. I created this open-source library, pyEdgeEval, to make it easier to evaluate and reproduce recent deep learning models for edge and boundary detection. The original C++ codes used in the MATLAB benchmarks are ported with Cython and the evaluation scripts are rewritten in Python3. This is especially useful for evaluating algorithms on remote linux servers (just run pip install pyEdgeEval) and docker containers, which have been difficult before due to MATLAB's constrains. The codebase is designed to be extensible and supports various tasks and datasets as well as different evaluation protocols. To test the validity of the evaluation code, pyEdgeEval's results are compared with the results of the original MATLAB codes. Besides benchmarking, pyEdgeEval adds various tools for edge detection such as mask2edge transformation.

pyEdgeEval is:

  • a Python alternative to the original MATLAB benchmark
  • light with minimal dependencies
  • modular architecture and easily customizable
  • relatively fast (uses multiprocessing and Cython)
  • implements common preprocessing algorithms
  • supports various tasks and datasets (extensible to other datasets)
  • supports various evaluation protocols
  • edge generation tools
  • etc...

Supported tasks:

  • Edge Detection
  • Semantic Boundary Detection

Supported datasets:

Disclaimers:

  • The evaluation code does not output results that exactly match the original MATLAB benchmark. This could be for various reasons such as random seeds for matching algorithm. The results are, for the most part, close enough (around 0.01% difference).
  • The evaluation speed should, in theory, be around the same as the original MATLAB benchmark as we use the same C/C++ code. However, from my experience, using pyEdgeEval is around 3x ~ 5x faster than the original MATLAB benchmark (tested using 10-threads). This is a night-and-day improvement in quality-of-life ('raw' protocol evaluation for SBD on MATLAB took around 35 hours, whereas pyEdgeEval took around 9 hours).
  • The codes and algorithms are not perfect. I will not take responsibility for how the code is used (check the license(s)).
  • If you find some bugs or want to improve this project, please submit issues or pull requests.

Please cite the following if you found this project useful:

@misc{pyEdgeEval,
  author = {Haruya Ishikawa},
  title = {pyEdgeEval: Python Edge Evaluation Tools},
  year = {2022},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/haruishi43/py-edge-eval}}
}

Help Wanted:

  • If anyone knows a better way to package Cython codes, please let me know. I'm not a Cython expert and I'm not sure if I'm doing it right. I'm having trouble supporting various platforms since I'm only creating a wheel for Linux (manylinux).
  • I'm aware that the documentation is a little rough. If anyone is willing to help, please let me know.
  • For the other TODOs, check out dev.md.

Installation

Dependencies

  • python >= 3.8 (tested on 3.8.x)
  • cv2

Installation guide

# Install dependencies
pip install -r requirements.txt

# install cv2 (e.g. use pip)
pip install opencv-python

# Option 1. install without cloning the project (only tested on ubuntu with python 3.8)
pip install pyEdgeEval

# Option 2. install as a pip package (install as a package)
git clone https://github.com/haruishi43/py-edge-eval.git
pip install -e .

Converting Cityscapes Dataset for SBD

Script:

python scripts/convert_datasets/cityscapes.py --nproc=8

NOTE:

  • Beaware that using multi-processing will consume at most 10GB per process (I'm working on debugging memory allocation issues).
  • --nonIS will generate non-IS boundaries.
  • The script will generate full resolution training dataset, full resolution validation dataset, and half resolution validation dataset (both raw/thin for validation).

Evaluation for each datasets

BSDS500

# get complete list of options
$ python scripts/evaluate/bsds500.py -h

Options:

Evaluate BSDS output

positional arguments:
  bsds_path             the root path of the BSDS-500 dataset
  pred_path             the root path of the predictions

optional arguments:
  -h, --help            show this help message and exit
  --output-path OUTPUT_PATH
                        the root path of where the results are populated
  --use-val             val or test
  --max-dist MAX_DIST   tolerance distance (default: 0.0075)
  --thresholds THRESHOLDS
                        the number of thresholds (could be a
                        list of floats); use 99 for eval
  --raw                 option to remove the thinning process (i.e. uses raw predition)
  --apply-nms           applies NMS before evaluation
  --nproc NPROC         the number of parallel threads
# Example:
python scripts/evaluate/bsds500.py <path/to/bsds500> <path/to/pred> <path/to/output> \
    --thresholds=5 --nproc=8
  • Tested with @xwjabc's HED implementation.
  • Due to the randomness in the original MATLAB (C++) codebase, the results will be different (at most +-0.001 difference).
  • Setting --nproc to the number of available cores will drastically improve the speed of evaluation.

SBD

First, you need to generate the GT using the scripts/convert_dataset/sbd.py script.

Example evaluation commands:

python scripts/evaluate/sbd.py <path/to/sbd> <path/to/pred> --categories=15 --thresholds=5 --nproc=8

Cityscapes

First, create GT data using this script:

# if you plan on evaluating with instance-sensitive edges (IS edges)
python scripts/convert_dataset/cityscapes.py --nproc 8
# if you plan on evaluating with non-instance-sensitive edges
python scripts/convert_dataset/cityscapes.py --nonIS --nproc 8

The scripts will create two types of edges (raw and thin) for two different scales (half and full).

Evaluation script:

# thin protocol
python scripts/evaluate/cityscapes_thin.py <path/to/cityscapes> <path/to/predictions> <path/to/output> --categories='[1, 14]' --thresholds 99 --nproc 8

# raw protocol
python scripts/evaluate/cityscapes_raw.py <path/to/cityscapes> <path/to/predictions> <path/to/output> --categories='[1, 14]' --thresholds 99 --nproc 8
  • For instance-insensitive edges, you would need to supply --pre-seal argument.
  • You can also preprocess the predictions by passing --apply-thinning and/or --apply-nms for thinning and NMS respectively.

License

  • The code is released under the MIT License (please refer to the LICENSE file for details).
  • I modified codes from other projects and their licenses applies to those files (please refer to Licenses).

Development and Contribution

See dev.md.