Coder Social home page Coder Social logo

Proper usage of index about sting HOT 4 CLOSED

jordanlab avatar jordanlab commented on August 18, 2024
Proper usage of index

from sting.

Comments (4)

ar0ch avatar ar0ch commented on August 18, 2024 1

The VFDB and CARD databases will need to be used with detector not typer, I'll note that in the STing docs.

➜  detector -x vfdb/db/gene_set -1 ERR008581_1.fastq -2 ERR008581_2.fastq -s ERR008581  --print-tidy | tee output.tsv
Sample  Gene    Presence        Total_hits      Total_k-mers    Total_reads     Input_files
ERR008581       vfdb_774        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_775        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_776        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_777        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_778        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_779        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_780        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_781        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_782        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_783        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_784        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_785        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_786        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_787        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_788        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_789        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_790        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_791        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_792        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_793        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_794        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_795        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_796        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_797        0       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_798        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_799        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_800        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_801        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_802        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_803        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_804        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_805        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_806        0       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_807        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_808        1       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq
ERR008581       vfdb_1389       0       2983466 2942836 377323  ERR008581_1.fastq,ERR008581_2.fastq

(I also just noticed that I did a horrible job naming the vfdb tar, too much time working with VCFs... I pushed an update to rename the archive)

from sting.

ar0ch avatar ar0ch commented on August 18, 2024 1

In both applications, --fastq-1-files and --fastq-2-files are accumulators, so they can be given multiple times or a list of files can be given. The ordering of inputs is preserved and they're assumed to be paired between 1 <> 2. However, both applications only operate on one sample at a time -- do not provide reads from multiple samples in one run. The classification problem, as solved by STing, is embarrassingly parallel and parallelism is achieved through additional instances.

from sting.

MatteoSchiavinato avatar MatteoSchiavinato commented on August 18, 2024

Thanks! That explains the issue.

Maybe another thing that could be clearer is whether I can declare --sample-name item1,item2,...,itemN the same way I would do with --fastq-1-files file1,file2,…,fileN. I did it like that for now and it did not complain but I never got to see an output due to this issue with the index.

from sting.

MatteoSchiavinato avatar MatteoSchiavinato commented on August 18, 2024

Roger that. Thanks!

from sting.

Related Issues (15)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.