Comments (4)
Dear @NatJWalker-Hale,
RELAX checks the proportion of sites in ω categories 1 or 2, but only for the general exploratory
model. Are you seeing this happen in your data?
Best,
Sergei
from hyphy.
Dear @spond,
Thanks very much for the reply! I think that I'm understanding the output a little bit better now. I also ought to have mentioned that this is v2.5.60. For my example, the stdout contains:
### Fitting the general descriptive (separate k per branch) model
* Log(L) = -6522.20, AIC-c = 13138.83 (47 estimated parameters)
* The following baseline rate distribution for branch-site combinations was inferred
| Selection mode | dN/dS |Proportion, %| Notes |
|-----------------------------------|---------------|-------------|-----------------------------------|
| Negative selection | 0.000 | 4.060 | |
| Negative selection | 0.000 | 93.895 | |
| Diversifying selection | 48.917 | 2.044 | |
and then
### Fitting the alternative model to test K != 1
* Log(L) = -6527.00, AIC-c = 13132.30 (39 estimated parameters)
* Relaxation/intensification parameter (K) = 0.64
* The following rate distribution was inferred for **test** branches
| Selection mode | dN/dS |Proportion, %| Notes |
|-----------------------------------|---------------|-------------|-----------------------------------|
| Negative selection | 0.572 | 79.236 | |
| Neutral evolution | 1.000 | 20.764 | |
| Diversifying selection | 1.007 | 0.000 | Not supported by data |
* The following rate distribution was inferred for **reference** branches
| Selection mode | dN/dS |Proportion, %| Notes |
|-----------------------------------|---------------|-------------|-----------------------------------|
| Negative selection | 0.418 | 79.236 | |
| Neutral evolution | 1.000 | 20.764 | |
| Diversifying selection | 1.011 | 0.000 | Not supported by data |
followed by
### Fitting the null (K := 1) model
* Log(L) = -6527.12, AIC-c = 13130.52 (38 estimated parameters)
* The following rate distribution for test/reference branches was inferred
| Selection mode | dN/dS |Proportion, %| Notes |
|-----------------------------------|---------------|-------------|-----------------------------------|
| Negative selection | 0.457 | 77.869 | |
| Negative selection | 0.878 | 22.131 | |
| Diversifying selection | 1.014 | 0.000 | Not supported by data |
(so no significant evidence)
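For reference, the "no significant evidence" reading can be reproduced from the two log-likelihoods above. This is a minimal sketch, assuming the test is a standard likelihood ratio test with a chi-square null on 1 degree of freedom (the alternative has 39 parameters and the null 38); the function name `lrt_p_value` is made up for illustration and is not part of HyPhy.

```python
import math


def lrt_p_value(log_l_alt: float, log_l_null: float) -> tuple[float, float]:
    """Return (LRT statistic, p-value), assuming a chi-square null with 1 df."""
    # The statistic is twice the log-likelihood gain of the alternative model;
    # clamp at 0 in case of tiny negative differences from optimisation noise.
    lrt = max(0.0, 2.0 * (log_l_alt - log_l_null))
    # For 1 degree of freedom, P(X > x) = erfc(sqrt(x / 2)).
    p = math.erfc(math.sqrt(lrt / 2.0))
    return lrt, p


# Log(L) values copied from the alternative and null fits quoted above.
lrt, p = lrt_p_value(-6527.00, -6527.12)
print(f"LRT = {lrt:.2f}, p = {p:.3f}")
```

The p-value is far above 0.05, which is consistent with the null model also having the lower AIC-c.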
But just to be clear, even though the proportion is 0, the alternative and null are still fitting that site class, correct?
The `Not supported by data` message presumably appears when any proportion is estimated as 0. Is this also the case for the output JSON? For this example, none of the model fits have `p2` or `omega2` in the output.
On the other hand, I have just noticed that some of the sites in the alignment were empty (due to a pruning error), and now that I've removed those sites and re-run, the output JSON does have p2 and omega2 for each model (with p2 still 0). So I'm not sure what is going on there, but sanitising the input seems to have fixed it anyway.
Thanks again!
Nat
Dear @NatJWalker-Hale,
The intended behavior for the `Not supported by data` screen output is as you surmised: the specified number of rates is always fitted, but if one (or more) of them has no support (very low weight) or essentially the same dN/dS value as another, the code will simply flag it as such for the user's information.
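The two conditions described here (very low weight, or a dN/dS value that duplicates another class) can be sketched as a small check. This is only an illustration, not HyPhy's actual code: the function name `flag_unsupported` and both thresholds are assumptions chosen for the example.

```python
def flag_unsupported(classes, min_weight=1e-6, omega_rel_tol=1e-4):
    """Flag rate classes that look unsupported.

    classes: list of (omega, proportion) pairs, with proportion in [0, 1].
    Returns a list of (class index, reason) tuples.
    Both thresholds are hypothetical, not values taken from HyPhy.
    """
    flagged = []
    for i, (omega, weight) in enumerate(classes):
        # Condition 1: the class carries essentially no weight.
        if weight < min_weight:
            flagged.append((i, "near-zero weight"))
            continue
        # Condition 2: the class has essentially the same dN/dS as an earlier one.
        for j in range(i):
            other = classes[j][0]
            scale = max(abs(omega), abs(other), 1e-12)
            if abs(omega - other) <= omega_rel_tol * scale:
                flagged.append((i, f"same dN/dS as class {j}"))
                break
    return flagged


# Test-branch distribution from the alternative model quoted earlier in the thread.
test_branches = [(0.572, 0.79236), (1.000, 0.20764), (1.007, 0.0)]
print(flag_unsupported(test_branches))
```

With these thresholds only the third class is flagged, matching the single `Not supported by data` note in that table.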
One exception is that if you fit `All` models (including the general exploratory one), the code can reduce the number of rates if it finds a lack of support for the user-specified number of rates.
This will be indicated by the following echo to stdout:
### Because some of the rate classes were collapsed to 0, the model is likely overparameterized. RELAX will reduce the number of site rate classes by one and repeat the fit now.
----
My guess is that's what happened when you had no corresponding rates in the JSON file (otherwise they should have been there, just with near 0 weights).
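One way to confirm this after the fact is to search the saved stdout for that message. A minimal sketch, assuming the run's stdout was captured as text; the function name `rate_classes_were_reduced` is hypothetical, and the search string is copied from the message quoted above.

```python
# Fragment of the stdout notice quoted above; matching on a fragment
# tolerates small differences in the surrounding markdown formatting.
COLLAPSE_NOTICE = "RELAX will reduce the number of site rate classes"


def rate_classes_were_reduced(stdout_text: str) -> bool:
    """Return True if the captured stdout contains the rate-class reduction notice."""
    return COLLAPSE_NOTICE in stdout_text


example_log = (
    "### Because some of the rate classes were collapsed to 0, the model is "
    "likely overparameterized. RELAX will reduce the number of site rate "
    "classes by one and repeat the fit now.\n"
)
print(rate_classes_were_reduced(example_log))
```

If this returns `True` for a run, fewer rate classes in the JSON than were requested would be the expected outcome rather than a parsing problem.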
Best,
Sergei
Dear @spond,
Okay, that's good to know! Thanks very much for clarifying.
Cheers!
Nat