Coder Social home page Coder Social logo

Comments (7)

cjw85 avatar cjw85 commented on July 17, 2024

It looks like you are using the 0.6.0 pre-release. Could you please upgrade to either 0.6.0 or 0.6.1? Several issues along the lines of which you are experiencing were discovered in the pre-release and were resolved in the release version.

from medaka.

nhartwic avatar nhartwic commented on July 17, 2024

Thanks. I'll do that and get back to you.

Slightly off topic note; medaka 0.6.1 doesn't seem to be available on the bioconda channel.

from medaka.

Ivanbh214 avatar Ivanbh214 commented on July 17, 2024

Hello,
In my case, I'm trying to run mekada for short read, (300 to 600 bp), I have the version 0.6.0, I'll tried to download from github repository and It's the same version, so I used 0.6.0 and the pipeline worked but I could n't obtain result(I have the fasta but it's empty).

mini_assemble -i ${BASECALLS} -o /home/ivan/Escritorio/medaka -p assm -t ${NPROC}
Copying FASTX input to workspace: /home/ivan/medaka/C26_carb_f.fasta > /home/ivan/Escritorio/medaka/assm.fa.gz

Skipped adapter trimming.
Skipped pre-assembly correction.
Overlapping reads...
[M::mm_idx_gen::0.0241.07] collected minimizers
[M::mm_idx_gen::0.030
1.94] sorted minimizers
[M::main::0.0301.94] loaded/built the index for 1428 target sequence(s)
[M::mm_mapopt_update::0.031
1.92] mid_occ = 487
[M::mm_idx_stat] kmer size: 15; skip: 5; is_hpc: 0; #seq: 1428
[M::mm_idx_stat::0.0321.90] distinct minimizers: 20842 (0.00% are singletons); average occurrences: 6.932; average spacing: 3.071
[M::worker_pipeline::0.757
5.43] mapped 1428 sequences
[M::main] Version: 2.14-r883
[M::main] CMD: minimap2 -x ava-ont -K 500M -t 8 assm.fa.gz assm.fa.gz
[M::main] Real time: 0.762 sec; CPU: 4.120 sec; Peak RSS: 0.035 GB
Assembling graph...
[M::main] ===> Step 1: reading read mappings <===
[M::ma_hit_read::0.2081.00] read 243144 hits; stored 486288 hits and 680 sequences (210170 bp)
[M::main] ===> Step 2: 1-pass (crude) read selection <===
[M::ma_hit_sub::0.260
1.00] 680 query sequences remain after sub
[M::ma_hit_cut::0.2671.00] 486288 hits remain after cut
[M::ma_hit_flt::0.275
1.00] 484120 hits remain after filtering; crude coverage after filtering: 525.25
[M::main] ===> Step 3: 2-pass (fine) read selection <===
[M::ma_hit_sub::0.2951.00] 680 query sequences remain after sub
[M::ma_hit_cut::0.303
1.00] 484120 hits remain after cut
[M::ma_hit_contained::0.3111.00] 10 sequences and 32 hits remain after containment removal
[M::main] ===> Step 4: graph cleaning <===
[M::ma_sg_gen] read 16 arcs
[M::main] ===> Step 4.1: transitive reduction <===
[M::asg_arc_del_trans] transitively reduced 0 arcs
[M::main] ===> Step 4.2: initial tip cutting and bubble popping <===
[M::asg_cut_tip] cut 10 tips
[M::asg_arc_del_multi] removed 0 multi-arcs
[M::asg_arc_del_asymm] removed 0 asymmetric arcs
[M::asg_pop_bubble] popped 0 bubbles and trimmed 0 tips
[M::main] ===> Step 4.3: cutting short overlaps (3 rounds in total) <===
[M::asg_arc_del_short] removed 0 short overlaps
[M::asg_arc_del_short] removed 0 short overlaps
[M::asg_arc_del_short] removed 0 short overlaps
[M::main] ===> Step 4.4: removing short internal sequences and bi-loops <===
[M::asg_cut_internal] cut 0 internal sequences
[M::asg_cut_biloop] cut 0 small bi-loops
[M::asg_cut_tip] cut 0 tips
[M::asg_pop_bubble] popped 0 bubbles and trimmed 0 tips
[M::main] ===> Step 4.5: aggressively cutting short overlaps <===
[M::asg_arc_del_short] removed 0 short overlaps
[M::main] ===> Step 5: generating unitigs <===
[M::main] Version: 0.3-r179
[M::main] CMD: miniasm -s 100 -e 3 -f assm.fa.gz assm.paf.gz
[M::main] Real time: 0.315 sec; CPU: 0.316 sec
Running racon read shuffle 1...
Running round 1 consensus...
[M::mm_idx_gen::0.000
2.72] collected minimizers
[M::mm_idx_gen::0.0013.95] sorted minimizers
[M::main::0.001
3.91] loaded/built the index for 0 target sequence(s)
[M::mm_mapopt_update::0.0013.81] mid_occ = 1
[M::mm_idx_stat] kmer size: 15; skip: 10; is_hpc: 0; #seq: 0
[M::mm_idx_stat::0.001
3.73] distinct minimizers: 0 (-nan% are singletons); average occurrences: -nan; average spacing: -nan
[M::worker_pipeline::0.0073.27] mapped 1428 sequences
[M::main] Version: 2.14-r883
[M::main] CMD: minimap2 -K 500M -t 8 assm.gfa.fa.gz assm.fa.gz
[M::main] Real time: 0.008 sec; CPU: 0.024 sec; Peak RSS: 0.003 GB
[racon::Polisher::initialize] error: empty target sequences set!
Running round 2 consensus...
[M::mm_idx_gen::0.000
4.02] collected minimizers
[M::mm_idx_gen::0.0014.91] sorted minimizers
[M::main::0.001
4.87] loaded/built the index for 0 target sequence(s)
[M::mm_mapopt_update::0.0014.74] mid_occ = 1
[M::mm_idx_stat] kmer size: 15; skip: 10; is_hpc: 0; #seq: 0
[M::mm_idx_stat::0.001
4.61] distinct minimizers: 0 (-nan% are singletons); average occurrences: -nan; average spacing: -nan
[M::worker_pipeline::0.0073.37] mapped 1428 sequences
[M::main] Version: 2.14-r883
[M::main] CMD: minimap2 -K 500M -t 8 racon_1_1.fa.gz assm.fa.gz
[M::main] Real time: 0.008 sec; CPU: 0.025 sec; Peak RSS: 0.003 GB
[racon::Polisher::initialize] error: empty target sequences set!
Running round 3 consensus...
[M::mm_idx_gen::0.000
3.61] collected minimizers
[M::mm_idx_gen::0.0014.48] sorted minimizers
[M::main::0.001
4.44] loaded/built the index for 0 target sequence(s)
[M::mm_mapopt_update::0.0014.34] mid_occ = 1
[M::mm_idx_stat] kmer size: 15; skip: 10; is_hpc: 0; #seq: 0
[M::mm_idx_stat::0.001
4.25] distinct minimizers: 0 (-nan% are singletons); average occurrences: -nan; average spacing: -nan
[M::worker_pipeline::0.0083.07] mapped 1428 sequences
[M::main] Version: 2.14-r883
[M::main] CMD: minimap2 -K 500M -t 8 racon_1_2.fa.gz assm.fa.gz
[M::main] Real time: 0.008 sec; CPU: 0.024 sec; Peak RSS: 0.003 GB
[racon::Polisher::initialize] error: empty target sequences set!
Running round 4 consensus...
[M::mm_idx_gen::0.000
3.84] collected minimizers
[M::mm_idx_gen::0.0014.45] sorted minimizers
[M::main::0.001
4.42] loaded/built the index for 0 target sequence(s)
[M::mm_mapopt_update::0.0014.29] mid_occ = 1
[M::mm_idx_stat] kmer size: 15; skip: 10; is_hpc: 0; #seq: 0
[M::mm_idx_stat::0.001
4.18] distinct minimizers: 0 (-nan% are singletons); average occurrences: -nan; average spacing: -nan
[M::worker_pipeline::0.008*3.18] mapped 1428 sequences
[M::main] Version: 2.14-r883
[M::main] CMD: minimap2 -K 500M -t 8 racon_1_3.fa.gz assm.fa.gz
[M::main] Real time: 0.008 sec; CPU: 0.026 sec; Peak RSS: 0.003 GB
[racon::Polisher::initialize] error: empty target sequences set!
Waiting for cleanup.
rm: can not delete 'shuffled ': the file or directory does not exist
rm: can not delete '
paf *': the file or directory does not exist
Final assembly written in /home/ivan/medaka_2/assm_final.fa. You have a good day.

I checked the pipeline with the tutorial data and work perfectly. Could be in this case that I have to adjust something from minimap2, every time apperearing in the message minimap2 -K 500M -t 8 racon_1_3.fa.gz assm.fa.gz. I checked the final length of consensus.fasta from data tutorial is 47018010(4700M). Could be de reason for don't get consensus with my data?Do you know how to modificate minimap requeriments or another requeriments for get data with this small size?

Thank you very much

from medaka.

cjw85 avatar cjw85 commented on July 17, 2024

@nhartwic

Did the newer versions work for you? There is now a v0.6.2 in bioconda.

from medaka.

Ivanbh214 avatar Ivanbh214 commented on July 17, 2024

Thank your very much,

I'll try it.

Cheers

from medaka.

nhartwic avatar nhartwic commented on July 17, 2024

@cjw85

After I posted, I updated to 0.6.1 and restarted the run. It is still running and hasn't yet hit the "processing short regions" part of the execution. I'll update when I have more information.

from medaka.

nhartwic avatar nhartwic commented on July 17, 2024

Well, Medaka is still running but seems to be functioning. It hit the processing short regions step a couple days ago, spent about 12 hours on it, and is now stitching everything up. So, I think this issue is closed...

from medaka.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.