Comprehensive benchmark of dimension reduction methods for single-cell data analysis

Software requirements

Nextflow
Docker

The list of benchmarking datasets

scATAC-seq:

How to prepare new benchmark datasets

Benchmark data is stored in anndata format. The anndata object should contain:

adata.X: cell by feature count matrix. For scATAC-seq, the format of feature names should be: chr:start-end.
adata.obs["cell_type"]: cluster/cell-type annotation of each barcode.
data.obs['batch']: batch annotation of each barcode (optional).
data.obs['compare']:differentially accessible region between two groups ref vs case
Groundtruth have at least two columns: ['cell_type'] and DARs named by ['index']

How to run benchmark

The pipeline uses docker containers to run the benchmark. Therefore, you need to install either docker on your machine. The pipeline needs to download large data files, so make sure you have enough disk space (>20G) and a fast internet connection.

Use ./bench.sh -profile singularity or ./bench.sh -profile docker to run benchmarks. The benchmark results will be stored in ./results folder.

See nextflow.config for additional configuration options.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
docker		docker
src		src
.gitignore		.gitignore
README.md		README.md
bench.sh		bench.sh
nb_CD14_Mono_Memory_data.yaml		nb_CD14_Mono_Memory_data.yaml
nextflow-21.04.3-all		nextflow-21.04.3-all
nextflow.config		nextflow.config

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Comprehensive benchmark of dimension reduction methods for single-cell data analysis

Software requirements

The list of benchmarking datasets

How to prepare new benchmark datasets

How to run benchmark

About

Releases

Packages

Languages

regulatory-genomics/atac_da_benchmark

Folders and files

Latest commit

History

Repository files navigation

Comprehensive benchmark of dimension reduction methods for single-cell data analysis

Software requirements

The list of benchmarking datasets

How to prepare new benchmark datasets

How to run benchmark

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages