Comments (7)
Helllo Ashmita,
Pplacer appears to be crashing which is likely because you are running out of memory. CheckM requires ~32Gb to place genomes into its full reference tree. You can reduce this to just under 16Gb by using the --reduced_tree
flag. Alternatively, you can use the taxonomy_wf if you know the taxonomic identity of the bins you wish to process with CheckM.
from checkm.
Hello Ashmita,
Can you please post the full output from CheckM. It is clear what the problem is just from the IOError message. Also, you might just try running CheckM on a set of your own bins/genomes to see if that works or at least provides a more informative error message.
Cheers,
Donovan
from checkm.
Hello Donovan
So as requested I have used Check M on a set of my own bins and here is the full output received.
ashmita@ashmita-VirtualBox:~/Documents/programms$ checkm lineage_wf -t 5 -x fa /home/ashmita/Documents/bins /home/ashmita/Documents/resultscm
[CheckM - tree] Placing bins in reference genome tree.
Identifying marker genes in 26 bins with 5 threads:
Finished processing 26 of 26 (100.00%) bins.
Saving HMM info to file.
Calculating genome statistics for 26 bins with 5 threads:
Finished processing 26 of 26 (100.00%) bins.
Extracting marker genes to align.
Parsing HMM hits to marker genes:
Finished parsing hits for 26 of 26 (100.00%) bins.
Extracting 43 HMMs with 5 threads:
Finished extracting 43 of 43 (100.00%) HMMs.
Aligning 43 marker genes with 5 threads:
Finished aligning 43 of 43 (100.00%) marker genes.
Reading marker alignment files.
Concatenating alignments.
Placing 26 bins into the genome tree with pplacer (be patient).
Killed
Uncaught exception: Sys_error("/home/ashmita/Documents/resultscm/storage/tree/concatenated.pplacer.json: No such file or directory")
Fatal error: exception Sys_error("/home/ashmita/Documents/resultscm/storage/tree/concatenated.pplacer.json: No such file or directory")
{ Current stage: 0:01:39.226 || Total: 0:01:39.226 }
[CheckM - lineage_set] Inferring lineage-specific marker sets.
Reading HMM info from file.
Parsing HMM hits to marker genes:
Finished parsing hits for 26 of 26 (100.00%) bins.
Determining marker sets for each genome bin.
Unexpected error: <type 'exceptions.IOError'>
Traceback (most recent call last):
File "/usr/local/bin/checkm", line 710, in
checkmParser.parseOptions(args)
File "/usr/local/lib/python2.7/dist-packages/checkm/main.py", line 1241, in parseOptions
self.lineageSet(options)
File "/usr/local/lib/python2.7/dist-packages/checkm/main.py", line 226, in lineageSet
resultsParser, options.unique, options.multi)
File "/usr/local/lib/python2.7/dist-packages/checkm/treeParser.py", line 477, in getBinMarkerSets
tree = dendropy.Tree.get_from_path(treeFile, schema='newick', rooting="force-rooted", preserve_underscores=True)
File "/usr/local/lib/python2.7/dist-packages/dendropy/datamodel/basemodel.py", line 217, in get_from_path
with open(src, "r", newline=None) as fsrc:
File "/usr/local/lib/python2.7/dist-packages/dendropy/utility/filesys.py", line 60, in pre_py34_open
buffering=buffering)
IOError: [Errno 2] No such file or directory: '/home/ashmita/Documents/resultscm/storage/tree/concatenated.tre'
It is roughly the same error as when I try to run the installation test..
Your help will be appreciated
Thank you
Ashmita
from checkm.
Thank you Donovan, It now works
from checkm.
@dparks1134 I get this error too when running CheckM under AWS Batch
from checkm.
It is likely a memory issue. CheckM requires ~32Gb to place genomes into its full reference tree. You can reduce this to just under 16Gb by using the --reduced_tree flag. Alternatively, you can use the taxonomy_wf if you know the taxonomic identity of the bins you wish to process with CheckM.
from checkm.
@dparks1134 Yep that's the issue, when setting an AWS Batch job to run with a minimum amount of memory of 32 GB, it works fine. Would it not be useful for CheckM to error out at the start or at least give a warning if the memory requirements are not met?
from checkm.
Related Issues (20)
- Understanding contamination value HOT 2
- ERROR conda.core.link:_execute(952): An error occurred while installing package 'bioconda::checkm-genome-1.2.2-pyhdfd78af_1'. HOT 1
- Unexpected error: <class 'KeyError'>
- FileNotFoundError: [Errno 2] No such file or directory: '/home/majorram/anaconda3/envs/checkm/hmms/phylo.hmm' HOT 3
- Chekm test: line 1: 16491 Bus error (core dumped) pplacer -j 1 HOT 2
- maximum number of genomes/MAGs to run checkm HOT 2
- WARNING: Expected all files to contain sequences in amino acid space. HOT 1
- local variable 'seqId' referenced before assignment
- 'utf-8' codec can't decode byte 0xb0 in position 37: invalid start byte HOT 1
- checkm ssu_finder HOT 2
- Fatal exception (source file p7_hmmfile.c, line 2095): hmm write failed. system error: No space left on device HOT 2
- checkm_coverage dropped some bins
- NameError: free variable 'mask' referenced before assignment in enclosing scope
- Having trouble interpreting the meaning of "root" as a marker lineage HOT 1
- Make sure pplacer is on your system path. HOT 1
- Having trouble running CheckM: Models must be parsed before identifying HMM hits.
- Unexpected error: <class 'FileNotFoundError'> HOT 1
- question:Anaconda ——conda install pytorch==1.10.1 torchvision==0.11.2 torchaudio==0.10.1 cudatoolkit=11.3 -c pytorch -c conda-forge
- No space left on device error in HMM step not caught HOT 2
- For a specific bin, why is the Marker lineage in lineage.ms different with it in quality report?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from checkm.