Coder Social home page Coder Social logo

Comments (2)

spond avatar spond commented on August 22, 2024

Dear Kailey,

I haven't thought about using RELAX this way, to be honest with you. The "proper" statistical interpretation of what RELAX is doing is testing whether or not K≠1 for a given gene. It will provide an estimate of K, but, as with any estimate, there's an error associated with it. So in that sense, K, itself is not a test statistic, like the likelihood ratio test, but just a point estimate of a model parameter. I am also not 100% sure what is the exact null that is being tested by reshuffling branch labels; you'll have to run some sort of a distributional test (e.g. Wilcoxson or Kolmogorov-Smirnov) on the estimates, which are noisy.

May I ask what is the underlying hypotheses you are trying to test?

Incidentally, there is an option in RELAX which will analyze all genes jointly and estimate a single K from all genes together. So instead of having ~300 K values where some are <1, some are >1, some are significant and some are not, you get a single K ("average" for all genes), and a single p-value to test if it's different from one.

Is that something you might find useful? It's going to be slow to run, but less slow than 100 replicates x 300 genes. If so, I can provide additional instructions for how to run it.

We originally designed it to boost power of detection for smaller alignments (e.g. all genes from a viral genome).

Best,
Sergei

from hyphy.

kferger320 avatar kferger320 commented on August 22, 2024

Hi Sergei,
Thanks for getting back to me so quickly. The underlying hypothesis of the permutation tests would be that my original Test group, chosen because they all share a particular trait, produces an overall mean K that is significantly lower than what would be expected by chance. The reshuffling of branches many times would then provide estimates of the possible values that the mean K (over all genes) might take if the comparisons were instead between my Reference group and other Test groups, chosen randomly with respect to the shared trait. I have many species in my tree that do not share this trait (I’m actually testing several other traits separately as well), so many branches in the permutation Test groups would likely not possess my target trait.

I actually have tried to run RELAX on the entire gene set at once, basically stitching them together into a single alignment to produce a single K value. I didn’t realize there might be a specific mode in RELAX to do this though, would you mind sharing what that command would look like?

Previously the whole-alignment run kept running into convergence issues and didn’t produce a reliable K value. Do you think it would be valid to ‘prune’ the problematic genes that produced convergence issues when run on their own from the dataset, and then run the test on the rest of the un-problematic genes jointly?

Thanks,
Kailey

from hyphy.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.