Comments (5)
What is the exact error you get for the header? `parse` takes the bwa mem output, nothing we can do about that...
128 GB is a lot. Is it possible your duplication rate is super high? Regardless, one of the options above should help with that.
from pairtools.
Can you clarify at what stage you have to run `pairtools generate header`?
How much memory do you have available when you get the out-of-memory error? To avoid out-of-memory errors, you can use the old backend for dedup by passing `--backend cython` in the dedup command. Alternatively, you can reduce the chunk size, e.g. `--chunksize 10000`.
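For reference, the two options above can be sketched as command lines. This is only a minimal sketch: the helper `build_dedup_command` and the file names are hypothetical, and it assumes `pairtools dedup` takes the input as a positional argument and the output via `-o`. Only `--backend cython` and `--chunksize 10000` come from this thread.

```python
import shlex


def build_dedup_command(pairs_in, pairs_out, backend=None, chunksize=None):
    """Assemble a `pairtools dedup` argument list (hypothetical helper)."""
    cmd = ["pairtools", "dedup"]
    if backend is not None:
        # Option 1 from the thread: fall back to the old cython backend.
        cmd += ["--backend", backend]
    if chunksize is not None:
        # Option 2 from the thread: process fewer pairs per chunk.
        cmd += ["--chunksize", str(chunksize)]
    # Assumption: output goes through -o, input is positional.
    cmd += ["-o", pairs_out, pairs_in]
    return cmd


# Hypothetical file names, for illustration only.
cython_cmd = build_dedup_command(
    "sample.sorted.pairs.gz", "sample.dedup.pairs.gz", backend="cython"
)
small_chunk_cmd = build_dedup_command(
    "sample.sorted.pairs.gz", "sample.dedup.pairs.gz", chunksize=10000
)

# Print shell-quoted commands that could be pasted into a job script.
print(shlex.join(cython_cmd))
print(shlex.join(small_chunk_cmd))
```

The printed lines can be copied into a shell or job script; which of the two options helps more likely depends on the data.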
from pairtools.
Thanks for the quick reply!
An error cropped up at `pairtools parse`, requiring me to run `pairtools generate header`, but I instead downgraded to an older version of pairtools, v1.0.1, and pinned an older version of numpy. This seems to have circumvented the error.
If I use a full normal node, I get 128 GB of RAM.
from pairtools.
I encountered the same error during the `dedup` stage. The error comes from `stats.py`, where `np.int` is deprecated and just `int` should be used when declaring the type. The error:
AttributeError: module 'numpy' has no attribute 'int'.
`np.int` was a deprecated alias for the builtin `int`. To avoid this error in existing code, use `int` by itself. Doing this will not modify any behavior and is safe. When replacing `np.int`, you may wish to use e.g. `np.int64` or `np.int32` to specify the precision. If you wish to review your current use, check the release note link for additional information.
I also had the header error, but it was a result of the dedup stage failing, so its output, which is the input for the next step, was empty.
Hope this helps.
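In case it is useful for checking a local checkout, here is a minimal sketch for locating leftover uses of the `np.int` alias. The helper `find_np_int` and the sample snippet are hypothetical and are not taken from `stats.py`. The regular expression deliberately skips sized types such as `np.int64`.

```python
import re

# Matches the bare alias `np.int` / `numpy.int`, but not `np.int64`, `np.int32` or `np.int_`.
DEPRECATED_NP_INT = re.compile(r"\b(?:np|numpy)\.int\b")


def find_np_int(source):
    """Return (line number, stripped line) for each line using the bare alias."""
    hits = []
    for lineno, line in enumerate(source.splitlines(), start=1):
        if DEPRECATED_NP_INT.search(line):
            hits.append((lineno, line.strip()))
    return hits


# Hypothetical source text, for illustration only.
sample = """\
counts = np.zeros(10, dtype=np.int)
sizes = np.zeros(10, dtype=np.int64)
total = int(counts.sum())
"""

hits = find_np_int(sample)
for lineno, text in hits:
    print(f"line {lineno}: {text}")
```

Each reported line can then be changed to use `int`, or `np.int64` / `np.int32` if a specific precision is wanted, as the error message suggests.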
from pairtools.
From what I can tell, all instances of `np.int` have been eliminated from the master branch. In the next release this error won't appear.
Assuming the other problems are resolved, I'll close this issue, feel free to reopen!
from pairtools.