Comments (5)
Hi @xiaohe0404, do you want to start with trimmed fastq files rather than raw sequencing files? Extracting the UMI sequence in the cutadapt step is essential for the downstream analysis. I am not sure if your processed files still fit this pipeline. Could you provide more details about how you trim the fastq files?
from pseudou-bidseq.
Thanks for your timely reply!
Here are my detailed parameters:
- I cut the 5' and 3' SR adapters with cutadapt;
- I cut the 5' UMI (6 bp) + GGG (TSO) from Read1 and added this information to the query name using fastp with the following parameters: -A -Q -L -U --umi_loc=read1 --umi_len=6 --umi_prefix=UMI --umi_skip=3;
- I cut the 3' barcode (6 bp) from Read1 using seqkit subseq -r 1:-7. I then used the output as clean, trimmed fastq files and as the input to STAR.
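For readers following along, the net effect of the fastp and seqkit steps above on one adapter-free Read1 can be sketched roughly as below. This only illustrates the read layout (6 bp UMI, 3 skipped bases, insert, 6 bp 3' barcode). The function name and the example read are made up, and the sketch does not reproduce what fastp or seqkit do internally.

```python
def trim_read1(seq, umi_len=6, skip=3, barcode_len=6):
    """Hypothetical helper: mimic the net effect of the trimming steps on one read.

    Layout assumed: [UMI][skipped bases, e.g. GGG][insert][3' barcode].
    """
    if len(seq) < umi_len + skip + barcode_len:
        raise ValueError("read is shorter than the parts to be removed")
    umi = seq[:umi_len]                      # --umi_loc=read1 --umi_len=6
    rest = seq[umi_len + skip:]              # --umi_skip=3 drops the GGG
    insert = rest[:len(rest) - barcode_len]  # subseq -r 1:-7 drops the last 6 bases
    tag = f"UMI_{umi}"                       # --umi_prefix=UMI gives UMI_NNNNNN
    return tag, insert


# Made-up read: UMI + GGG + insert + 3' barcode
read = "ACGTAC" + "GGG" + "TTTTCCCCAAAA" + "GATTAC"
tag, insert = trim_read1(read)
print(tag, insert)
```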
fastp can trim the adapters in your sample, but its output format (UMI_NNNNNN) is not compatible with this pipeline.
If you are using the template-switch, dual-UMI strategy for your library construction, I highly recommend running this pipeline with the barcode: NNNNNNXXX-XXXNNNNNN setting [1][2]. No additional settings need to be modified.
Footnotes
[1] XXX after the - symbol trims the mismatched tail at the 3' end. From your description, you might be using the random RT method, which would also create mismatches at the 3' end of the reads.
[2] NNNNNN at the end extracts the "3' barcode" you mentioned in step 3. If you do not need this sequence, replacing NNNNNN with XXXXXX would help.
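One way to read a pattern such as NNNNNNXXX-XXXNNNNNN is sketched below. It assumes that N marks a base kept as UMI or barcode, X marks a base that is discarded, and - stands for the insert, which is how the footnotes describe it. The function name is hypothetical and this is not the pipeline's own code, only an illustration of the pattern's meaning.

```python
def apply_barcode_pattern(seq, pattern):
    """Hypothetical helper: split a read according to a pattern like NNNNNNXXX-XXXNNNNNN.

    Assumed meaning: N = extract, X = discard, '-' = the insert between both ends.
    Returns (5' extracted bases, insert, 3' extracted bases).
    """
    left, right = pattern.split("-")
    if len(seq) < len(left) + len(right):
        raise ValueError("read is shorter than the pattern")
    head = seq[:len(left)]
    tail = seq[len(seq) - len(right):]
    insert = seq[len(left):len(seq) - len(right)]
    # Keep only the bases that sit under an N in the pattern
    five_prime = "".join(b for b, p in zip(head, left) if p == "N")
    three_prime = "".join(b for b, p in zip(tail, right) if p == "N")
    return five_prime, insert, three_prime


# Made-up read: UMI + GGG + insert + mismatched tail + 3' barcode
read = "ACGTAC" + "GGG" + "TTTTCCCCAAAA" + "CAT" + "GATTAC"
print(apply_barcode_pattern(read, "NNNNNNXXX-XXXNNNNNN"))
# Swapping the final NNNNNN for XXXXXX discards the 3' barcode instead
print(apply_barcode_pattern(read, "NNNNNNXXX-XXXXXXXXX"))
```

With the second pattern the last nine bases are still removed from the insert, but nothing is extracted from the 3' end.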
Thanks for your reply, this is really helpful!
You are welcome. If you have any questions about this pipeline, do not hesitate to open a new issue.