Skip to content

Commit

Permalink
Merge branch 'master' of github.com:dewyman/TALON
Browse files Browse the repository at this point in the history
  • Loading branch information
Dana Elizabeth Wyman committed Mar 20, 2020
2 parents 003ce73 + 50b980b commit d481546
Show file tree
Hide file tree
Showing 4 changed files with 11 additions and 11 deletions.
14 changes: 7 additions & 7 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,8 +11,8 @@ works from mapped SAM files, allowing data from different sequencing platforms
* [Running TALON](#how_to_run)
* [Flagging reads for internal priming](#label_reads)
* [Initializing a TALON database](#db_init)
* [Running TALON](#run_talon)
* [Post-TALON utilities](#talon_utils)
* [Annotating reads with TALON](#run_talon)
* [Working with the TALON results](#talon_utils)
* [Accessing abundance information](#talon_abundance)
* [Filtering transcript models](#talon_filter)
* [Citing TALON](#talon_cite)
Expand Down Expand Up @@ -60,8 +60,6 @@ Options:
```

## <a name="db_init"></a>Initializing a TALON database
For documentation of TALON versions 4.4.2 and lower, see https://github.com/mortazavilab/TALON/wiki/Archived-TALON-documentation.

The first step in using TALON is to initialize a SQLite database from the GTF annotation of your choice (i.e. GENCODE). This step is done using **`talon_initialize_database`**, and only needs to be performed once for your analysis. Keep track of the build and annotation names you choose, as these will be used downstream when running TALON and its utilities.

NOTE: The GTF file you use must contain genes, transcripts, and exons. If the file does not contain explicit gene and/or transcript entries, key tables of the database will be empty and you will experience problems in the downstream analysis. Please see our [GTF troubleshooting section](https://github.com/mortazavilab/TALON/wiki/Formatting-a-GTF-annotation-to-work-with-TALON) for help.
Expand All @@ -74,7 +72,7 @@ Options:
--f GTF annotation file
--g The name of the reference genome build that the annotation describes. Use a short and memorable name since you will need to specify the genome build when you run TALON later.
--a The name of the annotation (for metadata purposes)
--l Minimum required transcript length (default = 300 bp)
--l Minimum required transcript length (default = 0 bp)
--idprefix Prefix for naming novel discoveries in eventual TALON runs (default = 'TALON')
--5p Maximum allowable distance (bp) at the 5' end during annotation (default = 500 bp)
--3p Maximum allowable distance (bp) at the 3' end during annotation (default = 300 bp)
Expand Down Expand Up @@ -230,7 +228,7 @@ Before quantifying your results on the isoform level, it is important to filter
The **`talon_filter_transcripts`** module generates a whitelist of transcripts that are either:
a) Known
b) Observed at least n times in each of k datasets.
The default value for n is 5 and the default for k is the total number of datasets you provide for filtering. In addition, the filter requires that all n reads used to support a novel transcript must not have evidence of internal priming (default: internal priming defined as > 0.5 fraction As).
The default value for n is 5 and the default for k is the total number of datasets you provide for filtering. In addition, the filter requires that all n reads used to support a novel transcript must not have evidence of internal priming (default: internal priming defined as > 0.5 fraction As). If you wish to disregard internal priming, set --maxFracA to 1 (not generally recommended).
```
Usage: talon_filter_transcripts [options]
Expand Down Expand Up @@ -300,7 +298,9 @@ Options:
# <a name="talon_cite"></a>Citing TALON
Please cite our preprint when using TALON:

<add citation for new preprint>
*A technology-agnostic long-read analysis pipeline for transcriptome discovery and quantification.*
Dana Wyman, Gabriela Balderrama-Gutierrez, Fairlie Reese, Shan Jiang, Sorena Rahmanian, Weihua Zeng, Brian Williams, Diane Trout, Whitney England, Sophie Chu, Robert C. Spitale, Andrea J. Tenner, Barbara J. Wold, Ali Mortazavi
bioRxiv 672931; doi: https://doi.org/10.1101/672931

# License
MIT, see LICENSE
2 changes: 1 addition & 1 deletion example/README.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
# TALON example

## User note
This example was initially constructed for TALON v4.2. In the packaged versions since, the commands have changed since TALON does not need to be run from a specific path. If you are using a TALON version below 4.3, please see the archived documentation [here](https://github.com/mortazavilab/TALON/wiki/Archived-TALON-Example-Instructions-(v4.2)) for the correct commands. In addition, the full tutorial as of v4.4.2 is available [here]().
This example was initially constructed for TALON v4.2. In the packaged versions since, the commands have changed since TALON does not need to be run from a specific path. If you are using a TALON version below 4.3, please see the archived documentation [here](https://github.com/mortazavilab/TALON/wiki/Archived-TALON-Example-Instructions-(v4.2)) for the correct commands. In addition, the full tutorial as of v4.4.2 is available [here](https://github.com/mortazavilab/TALON/wiki/Archived-TALON-Example-Instructions-(v4.4)).

## Tutorial
Use the provided files to try TALON out on spike-in RNA variant (SIRV) reads sequenced on the PacBio Sequel II platform.
Expand Down
2 changes: 1 addition & 1 deletion requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
pandas
pyfaidx
pysam==0.15.2
pysam==0.15.4
pybedtools
4 changes: 2 additions & 2 deletions setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -25,7 +25,7 @@

setup(
name="talon",
version="5.0-beta",
version="5.0",
description="TALON is a Python program for identifying known and novel "
"genes/isoforms in long read transcriptome data sets",
author="Dana Wyman",
Expand All @@ -49,7 +49,7 @@
python_requires=">=3.6",
install_requires=[
"pandas",
"pysam==0.15.2",
"pysam==0.15.4",
"pybedtools",
"pyfaidx"
],
Expand Down

0 comments on commit d481546

Please sign in to comment.