Comments (5)
Alternatively (just noticed in your call): reduce "--accW"... should also reduce the problem! (right now you are trying to compute "exact" accessibilities for the WHOLE sequence, which are most likely wrong for such sequences anyway.. that's exactly why windows-based computation was developed: beside reduced runtime it focusses on local structure rather than extreeemly long base pairs (that are unlikely for many types of RNA)..
from intarna.
Hi @AbeerMM
fortunately, I found unexpectedly some time to look into the issue and a solution how to integrate the partition function scaling into the IntaRNA interface. This will be part of the upcoming version of IntaRNA.
You can use the feature already by downloading the 3.4.beta version from my space OR by cloning the recent version from GitHub.
You find respective pfScale documentation already on GitHub (thanks for the error image!)
Hope that helps.
Best,
Martin
from intarna.
Hi, the problem is not with IntaRNA but withe the ViennaRNA library used to compute the accessibility.
The pf_scale
parameter is used to keep the extreemly large partition function values (for long RNAs) within managable ranges with some computational tricks. The ViennaRNA developers implemented various things to dynamically adapt the parameter, to keep things working for large RNAs.
It seems, the tricks were not working for your RNA and the error reported stems from within the library..
https://github.com/ViennaRNA/ViennaRNA/blob/master/src/ViennaRNA/part_func.c#L400/part_func.c#L400
Unfortunately, there is no parameter in IntaRNA to alter pf_scale
.
The only workaround would be to
- run RNAplfold manually using similar parameters as when calling IntaRNA as discussed here
- it supports the
-S
/--pfScale
argument to alter the parameter, see manual - store the output in a file
- load accessibilities from this file when calling IntaRNA
- it supports the
Sorry for not having another possibility at hand. Please leave the issue open. Hope to find some time to incorporate a respective option into IntaRNA, but this needs some time that I dont have at the moment.. 😢
Best,
Martin
from intarna.
Alternatively (just noticed in your call): reduce "--accW"... should also reduce the problem! (right now you are trying to compute "exact" accessibilities for the WHOLE sequence, which are most likely wrong for such sequences anyway.. that's exactly why windows-based computation was developed: beside reduced runtime it focusses on local structure rather than extreeemly long base pairs (that are unlikely for many types of RNA)..
@martin-raden Thank you so much for taking the time to resolve the issue and answer my question.
About --accW=0, the reason why I used this value was based on the documentation of IntaRNA. I was trying to reproduce similar results to RNAup calculation.
I'm working with novel lncRNAs that have length range between couple hundreds to couple thousands. if you could recommend some options that give me the most reliable results, I'd appreciate it.
Thank you
from intarna.
Hi @AbeerMM ,
you are welcome.
Concerning your question what accessibility contraints to use for lncRNA.. mhh.. hard to say and I havent worked with them so far.
It boils down to the question: what is the maximal distance between base pair partners you want to consider/allow? For mRNAs, most people restrict that a lot with the assumption that "full structure formation", i.e. between any segments of the mRNA, is hindered by the continuous translation process and the involved molecules. Thus, a "local folding" using a window-based approach is reasonable and (beside runtime reduction) might exclude artificially long base pairing (eg of sequence ends etc.).
For lncRNA, there is (as far as I know) often no such continous processing and the RNAs can form whatever structure they want.. Thus, local folding will not show you the full (accessibility) figure of your RNA and might even consider regions accessible that are occupied by long range interactions. Thus, your selection of an RNAup-like full-range accessibility computation (accw=0) without bp-span constraint (accL=0) is a reasonable choice, but computationally demanding.
This is one of the reasons, why in mechRNA still local folding (accW=200 accL=150) is applied for both lncRNAs and the target genomes, even with the knowledge that this is only an estimate.
Does this help? :)
Best,
Martin
from intarna.
Related Issues (20)
- bug with predefined single-bp seed HOT 2
- Etotal CSV output with SHAPE data
- Reg: Prediction of miRNAs targets using 3' UTRs HOT 1
- intaRNA API in python HOT 2
- IntaRNAhelix parameters HOT 2
- Vienna package not found HOT 4
- scalebar issuses HOT 2
- Source compile error HOT 4
- Make sure program works HOT 3
- Install error: /usr/bin/ld: -f may not be used without -shared HOT 9
- Attaching IntaRNA docker container to VS Code HOT 2
- Segmentation fault (core dumped) HOT 1
- [R|python] CSV input wrapper call script
- --outOverlap=N might be buggy..
- enable sequence subsetting when accessibilities are read from file
- [MS WSL] file output for longer target lists crashes
- Conda command line and web based do not show the same output result HOT 2
- IntaRNA --version does not reflect the installed conda package version? HOT 3
- IntaRNAsTar Exception raised
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from intarna.