Coder Social home page Coder Social logo

Comments (7)

buske avatar buske commented on June 16, 2024

We're switching over to using the variants.tsv output, and here too it would be useful to have it pre-filtered to PASS rows, since this leads to a reduction in file size of ~100x for an average exome.

from exomiser.

visze avatar visze commented on June 16, 2024

If we implement this cli option it should affect every output. Otherwise it will be confusing for the user.

But in piplines (or if you have a bash start script) it will be easy to implement a postprocessing step like:
cat variants.tsv | egrep "^#|PASS" > variants_filtered.tsv

(maybe Pass instead of PASS. Can't remeber it.)

from exomiser.

buske avatar buske commented on June 16, 2024

Agreed. Yep, that's exactly what we do. It is definitely not a critical issue at all, just a minor feature request. It was the ~200 mb of disk usage I was trying to avoid, since it slows things down when we're processing hundreds of VCF files, especially if we wanted to run them all in batch with --batch-settings (since you're going to have accumulated several hundred GB of data by the end, when all you care about is several hundred MB of it). :)

from exomiser.

julesjacobsen avatar julesjacobsen commented on June 16, 2024

Made a snappy new command for the console:

output-pass-variants-only

This defaults to false when not mentioned. Can be called from a settings file like so:

output-pass-variants-only=true

or from the command line

--output-pass-variants-only

from exomiser.

julesjacobsen avatar julesjacobsen commented on June 16, 2024

That was a lot of code for what could be done as a grep or awk command! Damn!

from exomiser.

visze avatar visze commented on June 16, 2024

:-)

2015-03-31 16:46 GMT+02:00 Jules Jacobsen [email protected]:

That was a lot of code for what could be done as a grep or awk command!
Damn!


Reply to this email directly or view it on GitHub
#35 (comment).

from exomiser.

buske avatar buske commented on June 16, 2024

:( Sorry!

from exomiser.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.