Cord CSA on UK biobank brain MRI database

This repository contains the analysis pipeline to normalize spinal cord cross-sectional area (CSA).

👉 Please cite this work if you use it or if you are inspired by it:

Bédard S, Cohen-Adad J. Automatic measure and normalization of spinal cord cross-sectional area using the pontomedullary junction. Frontiers in Neuroimaging [Internet]. 2022;1. Available from: https://www.frontiersin.org/articles/10.3389/fnimg.2022.1031253

Spinal cord CSA was computed on T1w MRI scans for 804 participants from the UK Biobank database. In addition to computing cross-sectional at the C2-C3 vertebral disc, it was also measured at 64 mm caudal from the pontomedullary junction (PMJ). The effect of various biological, demographic and anatomical factors was explored by computing Pearson’s correlation coefficients. A stepwise linear regression found significant predictors; the coefficients of the best fit model were used to normalize CSA.

Measure of the averaged cross-sectional area (CSA) between C2 and C3 of the spinal cord with UK Biobank Brain MRI dataset.

The brain MRI data of UK biobank follows the DICOM convention. The spinal cord of the processed brain MRI images is cut off. Because the purpose of this project is to measure the CSA between C2 and C3 of the spinal cord, the raw MRI images for T1w structural images and for T2w FLAIR are used as an input of the pipeline:

T1_orig_defaced.nii.gz
T2_FLAIR_orig_defaced.nii.gz

The analysis pipeline was applied on T1w images only.

The raw images have gradient distortion, correction will be applied in the preprocessing steps of the analysis pipeline.

The DICOM dataset is under: duke:mri/uk_biobank

Data conversion: DICOM to BIDS

For this project, a BIDS standard dataset is used. A conversion of DICOM to BIDS is necessary for the UK Biobank dataset. The data from the DICOM dataset in the BIDS standard for this project have the following correspondance for each subjects:

T1_orig_defaced.nii.gz in the BIDS standard is sub-XXXXXXX_T1w.nii.gz
T2_FLAIR_orig_defaced.nii.gz in the BIDS standard issub-XXXXXXX_T2w.nii.gz

To convert the DICOM dataset in a BIDS structure for this project, run the following line:

curate_project.py -path-in <path_DICOM_dataset> -path-output <path_BIDS_dataset>

The BIDS datset with raw data is under: data.neuro.polymtl.ca:datasets/uk-biobank. The dataset resulting from preprocessing is under data.neuro.polymtl.ca:datasets/uk-biobank-processed.

Here is an example of the BIDS data structure of uk-biobank-processed:

uk-biobank-processed
│
├── dataset_description.json
├── participants.json
├── participants.tsv
├── README
├── sub-1000032
├── sub-1000083
├── sub-1000252
├── sub-1000498
├── sub-1000537
├── sub-1000710
│   │
│   └── anat
│       ├── sub-1000710_T1w.json
│       ├── sub-1000710_T1w.nii.gz
│       ├── sub-1000710_T2w.json
│       └── sub-1000710_T2w.nii.gz
└── derivatives
    │
    └── labels
        └── sub-1000710
            │
            └── anat
                ├── sub-1000710_T1w_seg-manual.nii.gz  <---------- manually-corrected spinal cord segmentation
                ├── sub-1000710_T1w_seg-manual.json  <------------ information about origin of segmentation
                ├── sub-1000710_T1w_labels-manual.nii.gz  <------- manual vertebral labels
                ├── sub-1000710_T1w_labels-manual.json
                ├── sub-1000710_T1w_pmj-manual.nii.gz  <------- manual pmj label
                ├── sub-1000710_T1w_pmj-manual.json
                ├── sub-1000710_T2w_seg-manual.nii.gz  <---------- manually-corrected spinal cord segmentation
                └── sub-1000710_T2w_seg-manual.json

Acquisition parameters

Scanner: Siemens Skyra 3T running VD13A SP4 with a standard Siemens 32-channel RF receive head coil

T1-weighted structural imaging

Resolution: 1x1x1 mm
Field-of-view: 208x256x256 matrix

T2-weighted FLAIR structural imaging

Resolution: 1.05x1x1 mm
Field-of-view: 192x256x256 matrix

Analysis pipeline

This repository includes a collection of scripts to analyse a BIDS-structured MRI dataset.

Dependencies

MANDATORY:

SCT 5.3.0 for processing
gradunwarp for gradient correction
ANTs for processing
Python 3.7 for statistical analysis

OPTIONAL:

ITK-SNAP for correcting cord segmentations

NOTE: Make sure to add ITK-SNAP to the system path:
- For Windows, select the option during installation.
- For macOS, after installation, go to Help->Install Command-Line Tools.

Installation

Download this repository:

git clone https://github.com/sct-pipeline/ukbiobank-spinalcord-csa.git

Install:

cd ukbiobank-spinalcord-csa
pip install -e ./

Note on gradient distortion correction

A coeff.grad associated with the MRI scanner used for the data is necessary if it has not been applied yet. In this project, the gradient distortion correction is done in preprocess_data.sh with gradunwarp and Siemens coeff.grad file.

Preprocessing

Preprocessing generates a dataset with gradient distortion correction.

First, initialize shell variable with the path to the folder with the coeff.grad file:

PATH_GRADCORR_FILE=<path-gradcorr>

Launch preprocessing:

sct_run_batch -jobs -1 -path-data <PATH-DATA> -path-output ~/ukbiobank_preprocess -script preprocess_data.sh -script-args $PATH_GRADCORR_FILE

The results to use as the new dataset wil be in ~/ukbiobank_preprocess/data_processed/.

Processing

Processing will generate spinal cord segmentation, vertebral labels, pmj label and compute cord CSA. Specify the path of preprocessed dataset with the flag path-data.

Launch processing:

sct_run_batch -jobs -1 -path-data <PATH_DATA> -path-output ~/ukbiobank_results/ -script process_data.sh

Or you can launch processing with a config file instead by using the flag -config and by adjusting the file config_sct_run_batch.yml according to your setups. See sct_run_batch -h to look at the available options. To launch processing:

sct_run_batch -config config_sct_run_batch.yml

Quality control

After running the analysis, check your Quality Control (qc) report by opening the file ~/ukbiobank_results/qc/index.html. Use the "search" feature of the QC report to quickly jump to segmentations or labeling issues.

1. Assess quality of segmentation, vertebral and pontomedullary junction (PMJ) labeling

If segmentation or labeling issues are noticed while checking the quality report, proceed to manual correction using the procedure below:

In QC report, search for "deepseg" to only display results of spinal cord segmentation, search for "vertebrae" to only display vertebral labeling and "pmj" to only display PMJ labeling.
Review segmentation and spinal cord and PMJ labeling, note that the segmentation et vertebral labeling need to be accurate only between C2-C3, for cord CSA.
Click on the F key to indicate if the segmentation/label is OK ✅, needs manual correction ❌ or if the data is not usable ⚠️ (artifact). Two .yml lists, one for manual corrections and one for the unusable data, will automatically be generated.
Download the lists by clicking on Download QC Fails and on Download Qc Artifacts.

Note: Proceed to QC separately for cord segmentation and vertebral labeling to generate 2 separate lists.

The lists will have the following format:

.yml list for correcting cord segmentation:

FILES_SEG:
- sub-1000032_T1w.nii.gz
- sub-1000083_T2w.nii.gz

.yml list for correcting vertebral labeling:

FILES_LABEL:
- sub-1000032_T1w.nii.gz
- sub-1000710_T1w.nii.gz

.yml list for correcting pontomedullary junction (PMJ) labeling:

FILES_PMJ:
 - sub-1000032_T1w.nii.gz
 - sub-1000710_T1w.nii.gz

FILES_SEG: Images associated with spinal cord segmentation
FILES_LABEL Images associated with vertebral labeling
FILES_PMJ Images associated with PMJ labeling

For the next steps, the script uk_manual_correction loops through all the files listed in .yml file and opens an interactive window to either manually correct segmentation, vertebral or PMJ labeling. Each manually-corrected label is saved under derivatives/labels/ folder at the root of PATH_DATA according to the BIDS convention. Each manually-corrected file has the suffix -manual. The procedure is described bellow for cord segmentation and for vertebral labeling.

2. Correct segmentations

For manual segmentation, you will need ITK-SNAP and this repository only. See installation instructions and dependencies.

Here is a tutorial for manually correcting segmentations. Note that the new QC report format with interactive features (✅/❌/⚠️) is not included in the tutorial.

Run the following line and specify the .yml list for cord segmentation with the flag -config:

uk_manual_correction -config <.yml file> -path-in ~/ukbiobank_results/data_processed -path-out <PATH_DATA>

After all corrections are done, you can generate a QC report by adding the flag -qc-only- to the command above. Note that SCT is required for generating QC report.

3. Vertebral labeling

Note that manual labeling uses SCT and the QC report is generated automatically.

Here is a tutorial for manual vertebral labeling:

Run the following line and specify the .yml list for vertebral labeling with the flag -config:

uk_manual_correction -config <.yml file> -path-in ~/ukbiobank_results/data_processed -path-out <PATH_DATA>

To create disc labels, click at the posterior tip of the disc for C1-C2, C2-C3 and C3-C4 as shown in the following image:

4. PMJ labeling

Note that manual PMJ labeling uses SCT and the QC report is generated automatically.

Run the following line and specify the .yml list for PMJ labeling with the flag -config:

uk_manual_correction -config <.yml file> -path-in ~/ukbiobank_results/data_processed -path-out <PATH_DATA>

To create PMJ label, click at the posterior tip of the pontomedullary junction (PMJ) as shown in the following image:

See here for more examples.

Upload the manually-corrected files

A QC report of the manually corrected files is created in a zip file. To update the dataset, add all manually-corrected files derivatives/labels/, and include the qc zip file in the body of the PR. See our internal procedure for more details.

Add automatic segmentations to `derivatives/` folder

After all segmentations are manually QC-ed, you can add them to the derivatives/ by running again the script manual_correction.py and adding the flag -add-seg-only:

uk_manual_correction -config <.yml file> -path-in ~/ukbiobank_results/data_processed -path-out <PATH_DATA> -add-seg-only

The automatic segmentations that did not require manual correction (files in .yml file) are added to the derivatives/ folder. You can upload the folder following the instructions specified in Upload the manually-corrected files.

Re-run the analysis

After all the necessary segmentation and labels are corrected, re-run the analysis (sct_run_batch command in Processing). If manually-corrected files exist, they will be used instead of proceeding to automatic segmentation and labeling. Make sure to put the output results in another folder (flag -path-output) if you don't want the previous results to be overwritten.

Statistical analysis

Generate datafile:

To generate a data file with the CSA results from process_data.sh and a .csv file including the fields from the UK Biobank dataset, run the follwing line:

uk_get_subject_info -path-data <PATH_DATA> -path-output ~/ukbiobank_results/ -datafile <FILENAME>

Compute statistical analysis

To compute the statistical analysis of cord CSA results, use uk_compute_stats. If the datafile ouput of uk_get_subject_info is not data_ukbiobank.csv, add the flag -dataFile <FILENAME>. Run this script in /results folder or specify path to folder that contains output files of analysis pipeline in with -path-output flag. The flag -exclude points to a yml file containing the subjects to be excluded from the statistical analysis:

uk_compute_stats -path-output ~/ukbiobank_results/ -dataFile <DATAFILE> -exclude <EXCLUDE.yml>

The output of uk_compute_statshas the following data structure:

ukbiobank_results
│
├── data_processed
├── log
├── qc
└── results
    ├── csa-SC_c2c3.csv
    ├── csa-SC_pmj.csv
    ├── data_ukbiobank.csv
    └── stats_results
        ├── metrics
        |   ├── comparasion_c2c3_pmj
        |   |   └── scatterplots_c2c3_pmj_csa.png
        |   ├── scatter_plots
        |   |   ├── Age.jpeg
        |   |   ├── Brain GM volume.png
        |   |   ├── Brain WM volume.png
        |   |   ...
        |   ├── corr_table.csv
        |   ├── corr_table_and_pvalue.csv
        |   ├── corr_table_pvalue.csv        
        |   ├── stats_csa.csv
        |   ├── scatter_plot_age_brain_vol.png
        |   ├── scatter_plot_age_thalamus_vol.png
        |   └── stats_param.csv
        └── models
            ├── age
            |   ├── coeff
            |   |  ├── coeff_linear_fit.csv
            |   |  └── coeff_quadratic_fit.csv
            |   ├── summary
            |   |   ├── summary_linear_fit.txt
            |   |   └── summary_quadratic_fit.txt
            |   └── quadratic_fit.png
            ├── model_1
            |   └── CSA_PMJ
            |      ├── coeff
            |      |   ├── coeff_fullLin_CSA_PMJ.csv
            |      |   └── coeff_stepwise_CSA_PMJ.csv
            |      ├── residuals
            |      |    ├── res_plots_fullLin_CSA_PMJ.png
            |      |    └── res_plots_stepwise_CSA_PMJ.png
            |      ├── summary
            |      |   ├── summary_fullLin_CSA_PMJ.txt
            |      |   └── summary_stepwise_CSA_PMJ.txt
            |      └── compared_models.csv
            ├── model_2
            |  └── ...
            ├── sex
            |   └── violin_plot.csv
            └──norm_COV.csv

The coefficient files (coeff_*.csv) are included in SCT to normalize CSA.

Name		Name	Last commit message	Last commit date
Latest commit History 182 Commits
dist		dist
pipeline_ukbiobank		pipeline_ukbiobank
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
compare_yml_list.py		compare_yml_list.py
compute_dice.py		compute_dice.py
config_sct_run_batch.yml		config_sct_run_batch.yml
curate_project.py		curate_project.py
exclude.yml		exclude.yml
get_distance_pmj_disc.py		get_distance_pmj_disc.py
gradunwarp_installation.md		gradunwarp_installation.md
job_template.sh		job_template.sh
preprocess_data.sh		preprocess_data.sh
process_data.sh		process_data.sh
requirements.txt		requirements.txt
run_all.py		run_all.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cord CSA on UK biobank brain MRI database

Table of contents

Data collection and organization

Uk Biobank database

Data conversion: DICOM to BIDS

Acquisition parameters

T1-weighted structural imaging

T2-weighted FLAIR structural imaging

Analysis pipeline

Dependencies

Installation

Note on gradient distortion correction

Preprocessing

Processing

Quality control

1. Assess quality of segmentation, vertebral and pontomedullary junction (PMJ) labeling

2. Correct segmentations

3. Vertebral labeling

4. PMJ labeling

Upload the manually-corrected files

Add automatic segmentations to `derivatives/` folder

Re-run the analysis

Statistical analysis

Generate datafile:

Compute statistical analysis

About

Releases 2

Packages

Contributors 4

Languages

License

sct-pipeline/ukbiobank-spinalcord-csa

Folders and files

Latest commit

History

Repository files navigation

Cord CSA on UK biobank brain MRI database

Table of contents

Data collection and organization

Uk Biobank database

Data conversion: DICOM to BIDS

Acquisition parameters

T1-weighted structural imaging

T2-weighted FLAIR structural imaging

Analysis pipeline

Dependencies

Installation

Note on gradient distortion correction

Preprocessing

Processing

Quality control

1. Assess quality of segmentation, vertebral and pontomedullary junction (PMJ) labeling

2. Correct segmentations

3. Vertebral labeling

4. PMJ labeling

Upload the manually-corrected files

Add automatic segmentations to derivatives/ folder

Re-run the analysis

Statistical analysis

Generate datafile:

Compute statistical analysis

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 4

Languages

Add automatic segmentations to `derivatives/` folder

Packages