Comments (7)
It looks like you are using the 0.6.0 pre-release. Could you please upgrade to either 0.6.0 or 0.6.1? Several issues along the lines of which you are experiencing were discovered in the pre-release and were resolved in the release version.
from medaka.
Thanks. I'll do that and get back to you.
Slightly off topic note; medaka 0.6.1 doesn't seem to be available on the bioconda channel.
from medaka.
Hello,
In my case, I'm trying to run mekada for short read, (300 to 600 bp), I have the version 0.6.0, I'll tried to download from github repository and It's the same version, so I used 0.6.0 and the pipeline worked but I could n't obtain result(I have the fasta but it's empty).
mini_assemble -i ${BASECALLS} -o /home/ivan/Escritorio/medaka -p assm -t ${NPROC}
Copying FASTX input to workspace: /home/ivan/medaka/C26_carb_f.fasta > /home/ivan/Escritorio/medaka/assm.fa.gz
Skipped adapter trimming.
Skipped pre-assembly correction.
Overlapping reads...
[M::mm_idx_gen::0.0241.07] collected minimizers
[M::mm_idx_gen::0.0301.94] sorted minimizers
[M::main::0.0301.94] loaded/built the index for 1428 target sequence(s)
[M::mm_mapopt_update::0.0311.92] mid_occ = 487
[M::mm_idx_stat] kmer size: 15; skip: 5; is_hpc: 0; #seq: 1428
[M::mm_idx_stat::0.0321.90] distinct minimizers: 20842 (0.00% are singletons); average occurrences: 6.932; average spacing: 3.071
[M::worker_pipeline::0.7575.43] mapped 1428 sequences
[M::main] Version: 2.14-r883
[M::main] CMD: minimap2 -x ava-ont -K 500M -t 8 assm.fa.gz assm.fa.gz
[M::main] Real time: 0.762 sec; CPU: 4.120 sec; Peak RSS: 0.035 GB
Assembling graph...
[M::main] ===> Step 1: reading read mappings <===
[M::ma_hit_read::0.2081.00] read 243144 hits; stored 486288 hits and 680 sequences (210170 bp)
[M::main] ===> Step 2: 1-pass (crude) read selection <===
[M::ma_hit_sub::0.2601.00] 680 query sequences remain after sub
[M::ma_hit_cut::0.2671.00] 486288 hits remain after cut
[M::ma_hit_flt::0.2751.00] 484120 hits remain after filtering; crude coverage after filtering: 525.25
[M::main] ===> Step 3: 2-pass (fine) read selection <===
[M::ma_hit_sub::0.2951.00] 680 query sequences remain after sub
[M::ma_hit_cut::0.3031.00] 484120 hits remain after cut
[M::ma_hit_contained::0.3111.00] 10 sequences and 32 hits remain after containment removal
[M::main] ===> Step 4: graph cleaning <===
[M::ma_sg_gen] read 16 arcs
[M::main] ===> Step 4.1: transitive reduction <===
[M::asg_arc_del_trans] transitively reduced 0 arcs
[M::main] ===> Step 4.2: initial tip cutting and bubble popping <===
[M::asg_cut_tip] cut 10 tips
[M::asg_arc_del_multi] removed 0 multi-arcs
[M::asg_arc_del_asymm] removed 0 asymmetric arcs
[M::asg_pop_bubble] popped 0 bubbles and trimmed 0 tips
[M::main] ===> Step 4.3: cutting short overlaps (3 rounds in total) <===
[M::asg_arc_del_short] removed 0 short overlaps
[M::asg_arc_del_short] removed 0 short overlaps
[M::asg_arc_del_short] removed 0 short overlaps
[M::main] ===> Step 4.4: removing short internal sequences and bi-loops <===
[M::asg_cut_internal] cut 0 internal sequences
[M::asg_cut_biloop] cut 0 small bi-loops
[M::asg_cut_tip] cut 0 tips
[M::asg_pop_bubble] popped 0 bubbles and trimmed 0 tips
[M::main] ===> Step 4.5: aggressively cutting short overlaps <===
[M::asg_arc_del_short] removed 0 short overlaps
[M::main] ===> Step 5: generating unitigs <===
[M::main] Version: 0.3-r179
[M::main] CMD: miniasm -s 100 -e 3 -f assm.fa.gz assm.paf.gz
[M::main] Real time: 0.315 sec; CPU: 0.316 sec
Running racon read shuffle 1...
Running round 1 consensus...
[M::mm_idx_gen::0.0002.72] collected minimizers
[M::mm_idx_gen::0.0013.95] sorted minimizers
[M::main::0.0013.91] loaded/built the index for 0 target sequence(s)
[M::mm_mapopt_update::0.0013.81] mid_occ = 1
[M::mm_idx_stat] kmer size: 15; skip: 10; is_hpc: 0; #seq: 0
[M::mm_idx_stat::0.0013.73] distinct minimizers: 0 (-nan% are singletons); average occurrences: -nan; average spacing: -nan
[M::worker_pipeline::0.0073.27] mapped 1428 sequences
[M::main] Version: 2.14-r883
[M::main] CMD: minimap2 -K 500M -t 8 assm.gfa.fa.gz assm.fa.gz
[M::main] Real time: 0.008 sec; CPU: 0.024 sec; Peak RSS: 0.003 GB
[racon::Polisher::initialize] error: empty target sequences set!
Running round 2 consensus...
[M::mm_idx_gen::0.0004.02] collected minimizers
[M::mm_idx_gen::0.0014.91] sorted minimizers
[M::main::0.0014.87] loaded/built the index for 0 target sequence(s)
[M::mm_mapopt_update::0.0014.74] mid_occ = 1
[M::mm_idx_stat] kmer size: 15; skip: 10; is_hpc: 0; #seq: 0
[M::mm_idx_stat::0.0014.61] distinct minimizers: 0 (-nan% are singletons); average occurrences: -nan; average spacing: -nan
[M::worker_pipeline::0.0073.37] mapped 1428 sequences
[M::main] Version: 2.14-r883
[M::main] CMD: minimap2 -K 500M -t 8 racon_1_1.fa.gz assm.fa.gz
[M::main] Real time: 0.008 sec; CPU: 0.025 sec; Peak RSS: 0.003 GB
[racon::Polisher::initialize] error: empty target sequences set!
Running round 3 consensus...
[M::mm_idx_gen::0.0003.61] collected minimizers
[M::mm_idx_gen::0.0014.48] sorted minimizers
[M::main::0.0014.44] loaded/built the index for 0 target sequence(s)
[M::mm_mapopt_update::0.0014.34] mid_occ = 1
[M::mm_idx_stat] kmer size: 15; skip: 10; is_hpc: 0; #seq: 0
[M::mm_idx_stat::0.0014.25] distinct minimizers: 0 (-nan% are singletons); average occurrences: -nan; average spacing: -nan
[M::worker_pipeline::0.0083.07] mapped 1428 sequences
[M::main] Version: 2.14-r883
[M::main] CMD: minimap2 -K 500M -t 8 racon_1_2.fa.gz assm.fa.gz
[M::main] Real time: 0.008 sec; CPU: 0.024 sec; Peak RSS: 0.003 GB
[racon::Polisher::initialize] error: empty target sequences set!
Running round 4 consensus...
[M::mm_idx_gen::0.0003.84] collected minimizers
[M::mm_idx_gen::0.0014.45] sorted minimizers
[M::main::0.0014.42] loaded/built the index for 0 target sequence(s)
[M::mm_mapopt_update::0.0014.29] mid_occ = 1
[M::mm_idx_stat] kmer size: 15; skip: 10; is_hpc: 0; #seq: 0
[M::mm_idx_stat::0.0014.18] distinct minimizers: 0 (-nan% are singletons); average occurrences: -nan; average spacing: -nan
[M::worker_pipeline::0.008*3.18] mapped 1428 sequences
[M::main] Version: 2.14-r883
[M::main] CMD: minimap2 -K 500M -t 8 racon_1_3.fa.gz assm.fa.gz
[M::main] Real time: 0.008 sec; CPU: 0.026 sec; Peak RSS: 0.003 GB
[racon::Polisher::initialize] error: empty target sequences set!
Waiting for cleanup.
rm: can not delete 'shuffled ': the file or directory does not exist
rm: can not delete ' paf *': the file or directory does not exist
Final assembly written in /home/ivan/medaka_2/assm_final.fa. You have a good day.
I checked the pipeline with the tutorial data and work perfectly. Could be in this case that I have to adjust something from minimap2, every time apperearing in the message minimap2 -K 500M -t 8 racon_1_3.fa.gz assm.fa.gz. I checked the final length of consensus.fasta from data tutorial is 47018010(4700M). Could be de reason for don't get consensus with my data?Do you know how to modificate minimap requeriments or another requeriments for get data with this small size?
Thank you very much
from medaka.
Did the newer versions work for you? There is now a v0.6.2 in bioconda.
from medaka.
Thank your very much,
I'll try it.
Cheers
from medaka.
After I posted, I updated to 0.6.1 and restarted the run. It is still running and hasn't yet hit the "processing short regions" part of the execution. I'll update when I have more information.
from medaka.
Well, Medaka is still running but seems to be functioning. It hit the processing short regions step a couple days ago, spent about 12 hours on it, and is now stitching everything up. So, I think this issue is closed...
from medaka.
Related Issues (20)
- Medaka Compatibility with Fungal Reads HOT 6
- I run medaka consensus in HPC, it only generated HDF5 data, how to generate consensus. fasta? HOT 6
- Is it possible to use medaka in offline mode? HOT 11
- Unable to install medaka on Mac M3 HOT 9
- help please with minimap2, tabix, bgzip and bcftools binary files
- Python 3.12 compatibility for pip HOT 1
- batch size and GPU use HOT 4
- ModelStoreTF exception <class 'tensorflow.python.framework.errors_impl.InternalError'> HOT 1
- Need help with 'AVX instructions not available' error HOT 2
- Medaka consensus error when stitching consensus chunks together HOT 7
- Medaka v1.11.3 ImportError - undefined symbol: libdeflate_free_compressor HOT 2
- Failed to run medaka consensus HOT 4
- [Question] Seeking Guidance on Selecting Appropriate Model for ONT Sequencing HOT 2
- failed to predict model HOT 6
- error of -d must be specified. HOT 2
- Use what kinds of reads for medaka consensus? HOT 4
- Suggestions of threshold of minimum mapping quality and read quality score HOT 2
- missing the bam file when running medaka_consensus HOT 1
- installing medaka (with pip) on M1 HOT 1
- Distribution for v1.12.0 on Bioconda please! HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from medaka.