Coder Social home page Coder Social logo

leangreen / sctcrseq Goto Github PK

View Code? Open in Web Editor NEW

This project forked from elementolab/sctcrseq

0.0 2.0 0.0 40 KB

Processing of single cell RNAseq data for the recovery of TCRs in python

License: GNU Affero General Public License v3.0

Perl 5.15% Python 94.85%

sctcrseq's Introduction

scTCRseq

Introduction

For specific questions/problems please email David Redmond at: [email protected]

This project is an implementation of a pipeline for Single-cell RNAseq package for recovering TCR data in python

Github Project

Configuration and Dependencies

The pipeline needs for the following programs to be installed and the paths :

###SEQTK: https://github.com/lh3/seqtk

###Blastall: http://mirrors.vbi.vt.edu/mirrors/ftp.ncbi.nih.gov/blast/executables/release/2.2.15/

###GapFiller: http://www.baseclear.com/genomics/bioinformatics/basetools/gapfiller

###Vidjil: https://github.com/vidjil/vidjil

And their accompanying paths need to be changed in the script cmd_line_sctcrseq.py:

seqTkDir="/path/to/seqtk/"

blastallDir="/path/to/blastall/"

gapFillerDir="/path/to/GapFiller_v1-10_linux-x86_64/"

vidjildir="/path/to/vidjil/"

lengthScript="/path/to/calc.median.read.length.pl"

###Reference TCR sequences:

Also the user can select their chosen TCR alpha and beta V and C reference databases (we recommend downloading from imgt.org) and enter their locations:

location for FASTA BLAST reference sequences downloadable from imgt.org - (needs to be changed manually)

humanTRAVblast="/path/to/TRAV.human.fa"

humanTRBVblast="/path/to/TRBV.human.fa"

humanTRACblast="/path/to/TRAC.human.fa"

humanTRBCblast="/path/to/TRBC.human.fa"

mouseTRAVblast="/path/to/TRAV.mouse.fa"

mouseTRBVblast="/path/to/TRBV.mouse.fa"

mouseTRACblast="/path/to/TRAC.mouse.fa"

mouseTRBCblast="/path/to/TRBC.mouse.fa"

location for Vidjil BLAST reference sequences in vidjil program - (needs to be changed manually)

humanVidjilRef="/path/to/tr_germline/human"

mouseVidjilRef="/path/to/tr_germline/mouse"

Example Command Line

We recommend running the pipeline on paired end fluidigm single cell RNA seq data.

The usage is as follows:

####python cmd_line_sctcrseq.py --fastq1 FASTQ1 --fastq2 FASTQ2 --species human/mouse --outdir OUTPUT DIRECTORY --label OUTPUT LABEL

Also running:

####python cmd_line_sctcrseq.py

Will give command line options

To test the program we recommend either using your own single cell RNA sequencing data or downloading data such as at

http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSM1104129

sctcrseq's People

Contributors

oelemento avatar happyclusters3 avatar

Watchers

James Cloos avatar LeanGreen avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.