Coder Social home page Coder Social logo

Comments (7)

fconstancias avatar fconstancias commented on July 18, 2024 2

Thanks for the interesting discussion. An alternative approach could be to use lulu https://github.com/tobiasgf/lulu.

from astrobiomike.github.io.

difontaine avatar difontaine commented on July 18, 2024 1

Hi again Mike,

I ended up using vsearch (thank you for the suggestion!) -- the documentation is here. I used 0.99 for my percent identity and went from 160 ASVs down to 91 clusters based on similarity. I used vsearch in qiime2 and was able to get the appropriate fasta and count table for the new clusters. Thanks for the help!
Diana

from astrobiomike.github.io.

AstrobioMike avatar AstrobioMike commented on July 18, 2024

Hey there, Diana!

Thanks for the kind words :)

And I absolutely follow what you're saying and think it's a great idea to do it based on sequence similarity rather than taxonomic groupings :)

I unfortunately haven't actually gone through the process myself yet though, and don't have much useful help for you at the moment :/

But if i were doing it, i'd start with looking at vsearch to see how to just cluster my ASVs based on the percent ID i wanted, and would then have to iron out how to track that back to a regular count table afterwards – i don't know if there will be a straightforward way to do that already, or if it's something we'd have to workout ourselves (like getting a file connecting the new clusters to all the ASVs that were clustered together, and then summing our initial ASV count table based on which were clustered together). Sometimes usearch has some additional convenience functionality that hasn't been implemented in it's open-source vsearch counter-part, so if i were having trouble doing it with vsearch, i might look there too.

I think it'd be a great idea to get this on the site at some point, so thank you for bringing it up and posting a suggestion for it :) I'm just not sure when i'll be able to do that and add it :/

Sorry i can't be of more help right now! If try the above out, and you get to having a mapping file of which ASVs were clustered together but have some trouble trying to work out how to generate the new count table, I'd be happy to try to help with that for now. Feel free to post here again or email me the files (which ASVs were clustered together, and the original ASV count table) at michael.lee0517<at>gmail<dot>com and i can help workout how to make the new count table :)

Good luck!
-Mike

from astrobiomike.github.io.

AstrobioMike avatar AstrobioMike commented on July 18, 2024

And actually i just found this helpful thread on the dada2 github, with example code. I'm not familiar with the differences between how vsearch would cluster things vs how the process there would, but I suspect going either way would still get the job done just fine – if you want to try what they have there :)

from astrobiomike.github.io.

difontaine avatar difontaine commented on July 18, 2024

Hi Mike,

Thank you so much for these suggestions -- I actually did find that thread too a couple of days ago and tried it out with all of my initial ASVs (before I used phyloseq::tax_glom() to get just the taxonomic class I'm interested in. That didn't work so well but maybe it's because I need to do the clustering with just the ASVs I'm interested in (diatoms) so that there aren't so many ASVs to begin with...I'll give that another shot and look into the suggestions you offered.

Thank you!
Diana

from astrobiomike.github.io.

AstrobioMike avatar AstrobioMike commented on July 18, 2024

Excellent! Thanks for sharing what you ended up doing, Diana :)

from astrobiomike.github.io.

AstrobioMike avatar AstrobioMike commented on July 18, 2024

Thanks for sharing, @fconstancias!

from astrobiomike.github.io.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.