
Comments (7)

a7032018 commented on July 28, 2024

Hi HajkD,

Yes. I'd like to run all 300K elements against Dfam for annotation.
Taking one representative per family, rather than scanning many homologous sequences, is a good idea. I will give it a try.

Thanks!

from ltrpred.

HajkD commented on July 28, 2024

Fatal exception (source file esl_hmm.c, line 198):
malloc of size -307968 failed
Aborted (core dumped)
Error running command:

It seems that you don't have enough memory to perform the search. I would recommend using a computer with more RAM.

I hope this helps.


sadikmu commented on July 28, 2024

Thanks for pointing that out. I ran it on a machine with 500 GB of RAM and 32 CPUs and still get the same error. I could not find an option for setting the memory available to LTRpred among the parameters listed in the documentation.

Another question: is there a way to make LTRpred pick up dfamscan from a local installation or a conda environment? That would help when testing on a server where one doesn't have admin privileges to install dfamscan in /usr/local/bin/.


sadikmu commented on July 28, 2024

Any suggestions on this, please?


HajkD commented on July 28, 2024

Hi Sadik,

Since this issue comes from the dfamscan script provided by the Dfam community, I would suggest contacting them.

Alternatively, have you tried running the failed command directly?

nhmmscan --noali -E 0.001 --dfamtblout /tmp/nXeK2iJYcP --cpu=16 dfam/Dfam.hmm /home/ltrpred/epo_ltrdigest/epo-ltrdigest_complete.fas

Maybe this yields more informative error messages?

Just so I understand: does the yeast example from the documentation work for you on this machine, or does it also fail? It clearly seems to be a memory allocation issue, and you would now need to troubleshoot in detail where it comes from.

Regarding your question:

"Another question: is there a way to make LTRpred pick up dfamscan from a local installation or a conda environment? That would help when testing on a server where one doesn't have admin privileges to install dfamscan in /usr/local/bin/."

You can run dfamscan.pl directly, without sudo rights, from any folder by typing:

perl dfamscan.pl -fastafile [[seq_file]] -hmmfile path/to/Dfam.hmm -dfam_outfile DfamAnnotation.out -E 1E-5 -cpu 16 --log_file logfile.txt --masking_thresh
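
For a server without admin rights, the lookup of the script can also be scripted. The following is only a minimal sketch: the environment variable name `DFAMSCAN_PATH` and both helper functions are hypothetical, and nothing in this thread confirms that LTRpred itself can be pointed at such a path. It only locates a user-installed dfamscan.pl and assembles the same kind of command as above, using a subset of the flags shown there.

```python
import os
import shutil
from pathlib import Path
from typing import List, Optional


def find_dfamscan(explicit: Optional[str] = None) -> Optional[Path]:
    """Return the first existing dfamscan.pl candidate, or None.

    Lookup order (an assumption for this sketch): an explicit path,
    the hypothetical DFAMSCAN_PATH environment variable, then PATH
    (which covers an activated conda environment).
    """
    candidates = [
        explicit,
        os.environ.get("DFAMSCAN_PATH"),  # hypothetical variable name
        shutil.which("dfamscan.pl"),
    ]
    for candidate in candidates:
        if candidate and Path(candidate).is_file():
            return Path(candidate)
    return None


def build_dfamscan_command(script: Path, fasta: str, hmm: str,
                           outfile: str, cpu: int = 16) -> List[str]:
    """Assemble the command as an argument list for subprocess.run.

    Only flags that appear in the command quoted in this thread are used.
    """
    return [
        "perl", str(script),
        "-fastafile", fasta,
        "-hmmfile", hmm,
        "-dfam_outfile", outfile,
        "-E", "1E-5",
        "-cpu", str(cpu),
    ]
```

The resulting list can be passed to `subprocess.run(cmd, check=True)`; passing a list rather than a shell string avoids quoting problems with unusual file names.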

I hope this helps.

Cheers,
Hajk


a7032018 commented on July 28, 2024

I am facing the same issue when feeding 300K sequences to the LTRpred Dfam scan on a machine with 1 TB of RAM and 112 threads.


Fatal exception (source file esl_hmm.c, line 198):
malloc of size -148920 failed


If RAM is the limiting factor when feeding in so many sequences, is it possible to run the Dfam nhmmscan in batches? For example: scan sequences 1-1000 -> store the result in tmp -> scan sequences 1001-2000 -> store the result in tmp -> ... -> combine the chunks into the final output.
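
The batching idea above could be sketched outside LTRpred roughly as follows. This is an illustration under assumptions, not an LTRpred feature: all function names are hypothetical, the nhmmscan flags are copied from the failed command quoted in this thread, and the merge step assumes that lines starting with `#` in the `--dfamtblout` tables are comments that only need to be kept once. Whether batching actually avoids the malloc error has not been verified here.

```python
import itertools
import subprocess
from pathlib import Path
from typing import Iterable, Iterator, List, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header, parts = None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(parts)
            header, parts = line[1:], []
        elif line:
            parts.append(line)
    if header is not None:
        yield header, "".join(parts)


def write_batches(fasta_path, out_dir, size: int = 1000) -> List[Path]:
    """Split a FASTA file into files of at most `size` sequences each."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    with open(fasta_path) as fh:
        records = read_fasta(fh)
        for i in itertools.count():
            batch = list(itertools.islice(records, size))
            if not batch:
                break
            path = out_dir / f"batch_{i:05d}.fas"
            with open(path, "w") as out:
                for header, seq in batch:
                    out.write(f">{header}\n{seq}\n")
            paths.append(path)
    return paths


def nhmmscan_command(hmm_db, batch_fasta, tbl_out, cpu: int = 16) -> List[str]:
    """Build the nhmmscan call for one batch (flags as quoted in the thread)."""
    return ["nhmmscan", "--noali", "-E", "0.001",
            "--dfamtblout", str(tbl_out), f"--cpu={cpu}",
            str(hmm_db), str(batch_fasta)]


def merge_tables(table_paths, merged_path) -> None:
    """Concatenate per-batch tables, keeping '#' lines from the first only."""
    with open(merged_path, "w") as out:
        for i, path in enumerate(table_paths):
            with open(path) as fh:
                for line in fh:
                    if i > 0 and line.startswith("#"):
                        continue
                    out.write(line)


def run_batches(fasta_path, hmm_db, work_dir, merged_path, size=1000, cpu=16):
    """Scan each batch in turn, then combine the per-batch tables."""
    tables = []
    for batch in write_batches(fasta_path, work_dir, size):
        tbl = batch.with_suffix(".tbl")
        subprocess.run(nhmmscan_command(hmm_db, batch, tbl, cpu), check=True)
        tables.append(tbl)
    merge_tables(tables, merged_path)
```

A call such as `run_batches("epo-ltrdigest_complete.fas", "dfam/Dfam.hmm", "tmp_batches", "dfam_all.tbl")` would then process the input 1000 sequences at a time; the paths here are placeholders.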


HajkD commented on July 28, 2024

Hi @a7032018

This is an excellent idea.

Do I understand correctly that LTRpred annotated 300k elements and you would like to run all 300k elements against Dfam?

By any chance, did you enable the TE family clustering option in LTRpred to check whether some elements form huge clusters, so that only one cluster representative (family member) needs to be searched against Dfam? This could be an alternative option.
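
As a rough illustration of the representative idea, the sketch below picks one element per cluster. The input layout (element ID, cluster ID, sequence) and the "longest member wins" rule are assumptions made for this example; the format of LTRpred's clustering output is not shown in this thread, so the records would have to be built from it first.

```python
from typing import Dict, Iterable, Tuple


def pick_representatives(
    records: Iterable[Tuple[str, str, str]]
) -> Dict[str, Tuple[str, str]]:
    """Map each cluster ID to (element_id, sequence) of its longest member.

    `records` holds (element_id, cluster_id, sequence) tuples; this layout
    is hypothetical. Ties keep the member seen first.
    """
    best: Dict[str, Tuple[str, str]] = {}
    for element_id, cluster_id, seq in records:
        current = best.get(cluster_id)
        if current is None or len(seq) > len(current[1]):
            best[cluster_id] = (element_id, seq)
    return best


# Toy input: two clusters, three elements.
example = [
    ("ltr_1", "cl_A", "ACGTACGT"),
    ("ltr_2", "cl_A", "ACGTACGTAA"),
    ("ltr_3", "cl_B", "TTGA"),
]
representatives = pick_representatives(example)
```

Only the representatives would then be written to a FASTA file and scanned, which reduces the number of query sequences from the number of elements to the number of clusters.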

Regarding the batch Dfam scans, I have noted this down as a feature request and will work on it when time permits.

