Coder Social home page Coder Social logo

Detailed parameters of STAR about pseudou-bidseq HOT 5 CLOSED

y9c avatar y9c commented on September 20, 2024
Detailed parameters of STAR

from pseudou-bidseq.

Comments (5)

y9c avatar y9c commented on September 20, 2024

Hi @xiaohe0404, do you want to start with trimmed fastq files rather than raw sequencing files? Extracting the UMI sequence in the cutadapt step is essential for the downstream analysis. I am not sure if your processed files still fit this pipeline. Could you provide more details about how you trim the fastq files?

from pseudou-bidseq.

xiaohe0404 avatar xiaohe0404 commented on September 20, 2024

Thanks for your timely reply!
Here are my detailed parameters:

  1. I cut 5' and 3' SR adapters by cutadapt;
  2. I cut 5'UMI(6bp)+GGG(TSO) of Read1 and added these infomation to query name by using fastp with following parameters: -A -Q -L -U --umi_loc=read1 --umi_len=6 --umi_prefix=UMI --umi_skip=3.
  3. I cut 3' barcode (6bp) of read1 by using seqkit subseq -r 1:-7. And then I used these output as clean trimmed fastq files and as the input of STAR.

from pseudou-bidseq.

y9c avatar y9c commented on September 20, 2024

fastp can trimming the adapter in your sample, but the output format (UMI_NNNNNN) is not compatible with this pipeline.

Suppose you are using the template switch with dual UMI strategy for your library construction, it is highly recommended that you can run this pipeline with barcode: NNNNNNXXX-XXXNNNNNN12 setting. No additional settings need to be modified.

Footnotes

  1. XXX after the - symbol is for trimming mismatch tail at the 3'. For your description, you might use the random RT method, which would also create mismatches at the 3' end of the reads.

  2. NNNNNN at the end is for extracting "3' barcode" you mentioned in step 3. If you do not need this sequence, replace NNNNNN with XXXXXX would help.

from pseudou-bidseq.

xiaohe0404 avatar xiaohe0404 commented on September 20, 2024

Thanks for your reply, this is really helpful!

from pseudou-bidseq.

y9c avatar y9c commented on September 20, 2024

You are welcome. If you have any question about this pipeline, do not hesitate to raise new issues.

from pseudou-bidseq.

Related Issues (12)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.