Coder Social home page Coder Social logo

Comments (5)

apeltzer avatar apeltzer commented on May 31, 2024

This looks pretty related to that: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3989762/

from chipseq.

apeltzer avatar apeltzer commented on May 31, 2024

"Duplicated reads can be seen to contribute to both artifact and ChIP signal and the removal of duplicated reads significantly reduces ChIP signal across samples. The evaluation of ChIP quality following duplicate removal may therefore underestimate the extent of ChIP enrichment relative to background and so careful consideration of the contribution of duplicates to artifact regions and ChIP signal must be made prior to evaluation of NSC and RSC metrics."

from chipseq.

ewels avatar ewels commented on May 31, 2024

Yup, that's the same publication that's linked in the original comment I think.

I'm not keen on using dedup data for some steps and not others, as originally suggested by @sifakise. But we could certainly have a --nodedup flag to skip the deduplication.

We're calculating NSC and RSC in the pipeline, as well as collecting % duplication stats, so then it would be up to the user to choose whether to do it or not.

Phil

from chipseq.

apeltzer avatar apeltzer commented on May 31, 2024

Oh, sorry I didn't spot that :-(

I'd also say keeping things consistent is better and if a user knows whether dedup makes sense they should be able to set that with a parameter as you suggested.

from chipseq.

chuan-wang avatar chuan-wang commented on May 31, 2024

Fixed in PR #38

from chipseq.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.