Coder Social home page Coder Social logo

mashpit's Issues

Requirements for python branch

Need to add requirements for Python branch into a future README.md

This is my first attempt:

  • Mash >= v2.0
  • Skesa >= v2.4.0
  • sratoolkit
  • Python3 modules
    • Biopython
    • sqlite

Results to screen

Could you change it so that the results are printed to stdout instead of to a file? And if so, document it in the readme?

database should be a parameter

Could you make the database a parameter for each of the scripts? It should not always be ./mashpit.db. For example, if I had a separate database for each species, then I might have differently named databases. Or I might be in a different directory.

Unit tests

Need to have unit tests. Need to use standard python methods for unit tests, e.g., https://docs.python.org/3/library/unittest.html

Suggestions for your first unit tests:

  • test --version and --help flags to make sure there is a desired stdout and exit code
  • Creating a small (3-genome?) database
    • test metadata against expected output
    • test signatures file against expected output
    • test a dist operation against expected distance

Test database

Please create one markdown file describing how to create a test database with about 1000 samples. It should be such that any typical user can follow the instructions and that the database will come out exactly the same in my hands and in your hands and in another person's hands, etc.

Comparison capability

Need a method so that we can run

Mashpit query Mashpit.sqlite sketch1.msh [sketch2.msh...]

Where Mashpit.sqlite is a database created by the Mashpit creation scripts and sketch1.msh is a regular mash sketch file.

README.md

Need to add main documentation

  • Synopsis
  • Installation
  • Requirements
  • For the impatient -- some basic wording on how to run tests.sh

Sourmash

Consider using sourmash instead of system calls to mash

Merging two databases

Just a thought for a future version (and not now) but it might be kind of awesome if we could merge two databases. If it works, it could be a mechanism for users to update local databases without having to remake them.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.