Taffy

This is an MIT-licensed C and Python library with a CLI for reading, writing, and manipulating multiple sequence alignments in TAF (described below) and MAF formats. It converts between the two formats and provides a number of utilities for preparing alignments for different use cases. The Python library is built on top of the C library and is therefore quite fast.
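For readers new to MAF, here is a minimal sketch of what a MAF alignment block contains. It is plain Python and does not use the taffy Python API. The names `MafRow` and `parse_maf_blocks` are hypothetical, and the example alignment is made up for illustration. It assumes the standard MAF layout in which each `s` row holds `src start size strand srcSize text`.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, List


@dataclass
class MafRow:
    """One 's' (sequence) row of a MAF alignment block."""
    src: str       # sequence name, e.g. "hg38.chr1"
    start: int     # zero-based start of the aligned region
    size: int      # number of non-gap bases in the row
    strand: str    # "+" or "-"
    src_size: int  # total length of the source sequence
    text: str      # aligned bases, with "-" for gaps


def parse_maf_blocks(lines: Iterable[str]) -> Iterator[List[MafRow]]:
    """Yield each alignment block as a list of its 's' rows.

    Illustrative only: other row types and header lines are ignored.
    """
    block: List[MafRow] = []
    for raw in lines:
        fields = raw.split()
        # A blank line or a new 'a' line closes the current block.
        if not fields or fields[0] == "a":
            if block:
                yield block
            block = []
        elif fields[0] == "s" and len(fields) == 7:
            block.append(MafRow(fields[1], int(fields[2]), int(fields[3]),
                                fields[4], int(fields[5]), fields[6]))
    if block:
        yield block


# Made-up example data, not taken from the taffy repository.
EXAMPLE_MAF = """##maf version=1
a score=10.0
s hg38.chr1 100 8 + 1000 ACGT-ACGT
s mm10.chr2 200 7 + 2000 ACG--ACGT

a score=5.0
s hg38.chr1 108 4 + 1000 TTGA
"""

if __name__ == "__main__":
    for i, rows in enumerate(parse_maf_blocks(EXAMPLE_MAF.splitlines())):
        print(f"block {i}: {len(rows)} row(s)")
        for row in rows:
            print(f"  {row.src} {row.start} {row.size} {row.strand} {row.text}")
```

For real work, the taffy CLI and Python API described in the sections below are the intended way to read these files.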

TAF Format Specification

See the TAF format page for a specification of the TAF format and an example.

Installation

See C/CLI Install for how to build and install from source in order to use the C library and CLI utilities.

See Python install for how to install the Python library.

CLI Utilities

See taffy utilities for a description of the many useful taffy utilities, including:

view - MAF / TAF conversion and region extraction
norm - normalize TAF blocks
add-gap-bases - add sequences from HAL or FASTA files into TAF gaps
index - create a .tai index (required for region extraction)
sort - sort the rows of a TAF file to a desired order
stats - print statistics of a TAF file
coverage - print coverage statistics of a given genome in a TAF file

Python scripts

See taffy scripts for a description of useful Python scripts, including:

alignment plot - A (relatively) fast MSA visualization, with coverage, copy-number, identity, and dotplot options.

Using the Python API

See using the Python API for how to work with MAF/TAF alignments using a convenient Python API designed to complement the CLI.

See the example notebook for a quick worked example of using the Python API for machine learning with PyTorch.

Using the C Library

There is also a simple C library for working with TAF/MAF files. See taf.h in the inc directory.

Comparing MAF and TAF file sizes

See quick file size comparison.

ComparativeGenomicsToolkit/taffy
