NCLscan-hybrid, a tool using long-read sequencing (Pabio/Nanopore) to validate non-col-linear (NCL) transcripts (fusion, trans-splicing, and circular RNA)
- Python
- bedtools==v2.25.0
- samtools
- minimap2
- seqtk
We recommand to use conda to install the dependencies.
git clone https://github.com/TreesLab/NCLscan-hybrid.git
./NCLscan-hybrid.sh \
-long [input long read fasta/fastq file] \
-long_type [pb or ont] \
-nclscan [NCLscan result file] \
-c [configure file] \
-o [out_prefix_name] \
-t [number of threads]
Parameter | Description |
---|---|
-long FILE | Long reads dataset.(FASTA or FASTQ) |
-long_type TYPE | The type of the long reads dataset. ('pb' or 'ont') |
-nclscan FILE | The results file from NCLscan. |
-c CONFIG_FILE | Config file. |
-o PREFIX | Prefix for output files. |
-t INT | Number of threads. |
# | Column |
---|---|
1 | chr (donor) |
2 | pos (donor) |
3 | strand (donor) |
4 | chr (acceptor) |
5 | pos (acceptor) |
6 | strand (acceptor) |
7 | gene_symbol (donor) |
8 | gene_symbol (acceptor) |
9 | is_intragenic |
The remaining columns generated by NCLscan are optional for NCLscan-hybrid.
- PREFIX.long_intra.result
- PREFIX.long_inter.result
# | Column | Description |
---|---|---|
1 | NCL_event_id | |
2 | #supporting_reads | |
3 | has_reads_out_of_circle | |
4 | #reads_out_of_circle | |
5 | has_reads_rolling_circle | |
6 | #reads_rolling_circle | |
7 ~ N | The remaining columns are from the original input file. |
# | Column | Description |
---|---|---|
1 | NCL_event_id | |
2 | #supporting_reads | |
3 ~ N | The remaining columns are from the original input file. |
To visualize the alignments of supporting reads of an supported NCL event, upload the BED files in the following directories to the UCSC genome browser.
- pass2_intra_BrowserView/
- WithinCircle_events_BrowserView/
- pass2_inter_BrowserView/