Coder Social home page Coder Social logo

tigar2's Introduction

====== TIGAR2

TIGAR2: sensitive and accurate estimation of transcript isoform expression with longer RNA-Seq reads

Naoki Nariai, Kaname Kojima, Takahiro Mimori, Yukuto Sato, Yosuke Kawai, Yumi Yamaguchi-Kabata and Masao Nagasaki

BMC Genomics, 15(Suppl 10):S5 (2014).

Latest news
Feb 19, 2015: TIGAR2.1 was released. Multi-threading is now available.
Dec 12, 2014: TIGAR2.0 was released.

Please download the jar file by clicking Download ZIP on the right panel.

Usage: java -jar Tigar2_1.jar FASTA SAM OUT
 
 FASTA           : reference FASTA file
 SAM             : target SAM/BAM file
 OUT             : output file
 
 Options:

 --thread_num INT : number of thread
 --alpha_zero DOUBLE : tuning parameter alpha_zero
 --is_paired     : paired-end data. default = FALSE.
 --frag_dist_mean DOUBLE: mean of the fragment length distribution. default = estimation from data
 --frag_dist_std DOUBLE:  standard dev of the fragment length distribution. default = estimation from data

Recommended pipeline to run TIGAR2

1. Prepare cDNA reference sequences in FASTA format.

e.g.) human
http://hgdownload.soe.ucsc.edu/goldenPath/hg19/bigZips/refMrna.fa.gz

e.g.) mouse
http://hgdownload.soe.ucsc.edu/goldenPath/mm9/bigZips/refMrna.fa.gz

2. Build bowtie2 index

mkdir ref
bowtie2-build refMrna.fa ./ref/refMrna

3. Run bowtie2

For single-end data

bowtie2 -p 8 -k 100 --very-sensitive ./ref/refMrna sample.fastq > sample.sam

For paired-end data

bowtie2 -p 8 -k 100 --very-sensitive ./ref/refMrna -1 sample_1.fastq -2 sample_2.fastq > sample.sam

4. Run TIGAR2

For single-end data

java -jar Tigar2_1.jar --thread_num 8 refMrna.fa sample.sam --alpha_zero 0.1 sample_out.txt

For paired-end data

java -jar Tigar2_1.jar --thread_num 8 refMrna.fa sample.sam --is_paired --alpha_zero 0.1 sample_out.txt

Output format

ID: transcript (mRNA) ID that the program predicted

LENGTH: transcript length

Z: the number of expected fragments that the program assigned to the transcript

FPKM: normalized expression level (Fragments Per Kilobase of exon per Million mapped fragments)

THETA: estimated parameter (transcript abundance), essentially Z divided by total mapped reads.

5. Visualization

You can visualize the optimized alignment by TIGAR2 as follows:

samtools sort sample_out.txt.opt.bam sample_opt_sorted
samtools index sample_opt_sorted.bam

Please start IGV_2.3.14 or later, and load refMrna.fa as Genome, and sample_opt_sorted.bam. You can look at the optimized alignment of reads on each transcript isoform.


Please note that the current implementation of TIGAR2 might require large memory size for large sam/bam files. In such cases, please specify:

e.g.) java -Xmx16g -Xms16g -jar Tigar2_1.jar --thread_num 8 FASTA SAM OUT --alpha_zero 0.1
e.g.) java -Xmx32g -Xms32g -jar Tigar2_1.jar --thread_num 8 FASTA SAM OUT --alpha_zero 0.1
e.g.) java -Xmx64g -Xms64g -jar Tigar2_1.jar --thread_num 8 FASTA SAM OUT --alpha_zero 0.1

You can also choose BWA-MEM as an aligner as follows:

* Build FM-index for BWA-MEM alignment

bwa index refMrna.fa

* Run BMA-MEM

For single-end data

bwa mem -t 8 -L 10000 -a refMrna.fa sample.fastq > sample.sam

For paired-end data

bwa mem -t 8 -P -L 10000 -a refMrna.fa sample_1.fastq sample_2.fastq > sample.sam

Please DO NOT sort sam files

Then, you can run TIGAR2 exactly the same as the way described previously.


References
Naoki Nariai, Osamu Hirose, Kaname Kojima, Masao Nagasaki.
TIGAR: transcript isoform abundance estimation method with gapped alignment of RNA-Seq data by variational Bayesian inference.
Bioinformatics, 29(18):2292-2299 (2013).

Naoki Nariai, Kaname Kojima, Takahiro Mimori, Yukuto Sato, Yosuke Kawai, Yumi Yamaguchi-Kabata and Masao Nagasaki TIGAR2: sensitive and accurate estimation of transcript isoform expression with longer RNA-Seq reads. BMC Genomics, 15(Suppl 10):S5 (2014).

This site is maintained by: Naoki Nariai

Contact:
nariai [at] megabank.tohoku.ac.jp

Last updated on 2015/02/19

tigar2's People

Contributors

nariai avatar

Watchers

James Cloos avatar Wayne Fang avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.