Comments (5)
This looks pretty related to that: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3989762/
from chipseq.
"Duplicated reads can be seen to contribute to both artifact and ChIP signal and the removal of duplicated reads significantly reduces ChIP signal across samples. The evaluation of ChIP quality following duplicate removal may therefore underestimate the extent of ChIP enrichment relative to background and so careful consideration of the contribution of duplicates to artifact regions and ChIP signal must be made prior to evaluation of NSC and RSC metrics."
from chipseq.
Yup, that's the same publication that's linked in the original comment I think.
I'm not keen on using dedup data for some steps and not others, as originally suggested by @sifakise. But we could certainly have a --nodedup
flag to skip the deduplication.
We're calculating NSC and RSC in the pipeline, as well as collecting % duplication stats, so then it would be up to the user to choose whether to do it or not.
Phil
from chipseq.
Oh, sorry I didn't spot that :-(
I'd also say keeping things consistent is better and if a user knows whether dedup makes sense they should be able to set that with a parameter as you suggested.
from chipseq.
Fixed in PR #38
from chipseq.
Related Issues (20)
- mergeBed ERROR: Requested column 10, but database file - only has fields 1 - 9. HOT 18
- Normalisation of bigwig files
- package or namespace load failed for ‘UpSetR’ HOT 3
- No Space left on device error HOT 3
- Make subworkflows & modules available for nf-core tools HOT 1
- Default values for p-value and FDR
- Get rid of checkIfExists for params paths
- minor "samplesheet_pe.csv" format issue
- PHANTOMPEAKQUALTOOLS throws stack overflow exception HOT 1
- MACS2: Too few paired peaks (0) so I can not build the model!
- Error with NextSeq trimming
- Error running the pipline test in the BWA index step
- jobs failing with sigbus and unknown userid errors HOT 1
- Update MACS to v3
- There are multiple files for each of the following file names
- Can't run if there is no control
- Improve website on parameters section Alignment Options for argument --save_unaligned
- IDR analysis
- Remove lib folder to get ready for v2.13.1 template merge
- Macs2 Output
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from chipseq.