Coder Social home page Coder Social logo

Comments (3)

luissian avatar luissian commented on July 1, 2024

I have been testing ALFAClust with 1 allele in the schema. It is really fast in writing the result. Using the parameter "-l 0.925" to get an average estimated of 0.925, the alleles were classified in 21 clusters.
Output result is a file with the cluster name followed with the locus name of this cluster, then next line the next cluster with the locus. See the example bellow:

`#Cluster 1
lmo0002_36
lmo0002_79
lmo0002_90

.....
....

#Cluster 2
lmo0002_98
lmo0002_101
and so on
`

By grepping the output file searching for "#Cluster" and -n to get the number lines, we get something like this.
`
1:#Cluster 1
78:#Cluster 2
81:#Cluster 3
795:#Cluster 4

`
Meaning that locus name from line 2 up to 77 belongs to Cluster 1.
I am guessing (maybe I am wrong) that I could take any of this locus, randomly, to map all the locus on the cluster with an identity of 90% , using blast.
Unfortunately, I did not have time to verify it, yet.
If I am right we could use this alternative because of the fast output

from taranis.

luissian avatar luissian commented on July 1, 2024

I did more testing, based on the clustered file generated by alfaclust.
I selected some cluster randomize and for each cluster a random allele was selected.
Then using blast and the random allele, I get that every time run the script for checking the number of alleles that blast matches wit a 90% of identity I get that all alleles were included in the match.
However the first time running for testing, I select the last allele in the first group of cluster, I get only 71 of 76 matches. But it was the only time that get a value bellow 100% of matches.
To be sure we could run blast each cluster to verify that the selected random allele matches all alleles in the cluster.

I hope this test was useful for making the final decision.

from taranis.

luissian avatar luissian commented on July 1, 2024

I will star coding reference_allele feature using the approach of alfaclust

from taranis.

Related Issues (1)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.