Coder Social home page Coder Social logo

medaka_variant hanging about medaka HOT 6 CLOSED

nanoporetech avatar nanoporetech commented on August 16, 2024
medaka_variant hanging

from medaka.

Comments (6)

mwykes avatar mwykes commented on August 16, 2024

Yes, that sounds very much like #42. I would recommend upgrading to medaka v0.6.5. If you want to speed things up, you could try running on GPU, and parallelising over chromosomes, see here for details.

from medaka.

wdecoster avatar wdecoster commented on August 16, 2024

Thanks, I'll do that. Do you think the round_0_hap_mixed_probs.hdf is valid to restart from, or should I clean it up and entirely start over?

from medaka.

cjw85 avatar cjw85 commented on August 16, 2024

Restarting with that existing file should be fine, on the assumption it was closed correctly etc. This is the output of the initial pass over, you should delete any files created after this file if you want to pick up the workflow after this point.

from medaka.

wdecoster avatar wdecoster commented on August 16, 2024

I don't know for sure if that assumption is valid, as I have to kill the running job. What I get when restarting is the following:

...
Not running medaka consensus, round_0_hap_mixed_probs.hdf exists.
======================================
Running medaka snp with threshold 0.04
======================================

[12:21:26 - DataIndex] Could not find samples in round_0_hap_mixed_probs.hdf
[12:21:26 - DataIndex] Loaded sample-index from 1/1 (100.00%) of feature files.
VCF written to path/round_0_hap_mixed_thresh_0.04_unphased.vcf.
Using existing output:  path/GRCh38.fa.fai.

=============================================
Running whatshap phase to phase vcf round_0_hap_mixed_thresh_0.04_unphased.vcf.
=============================================

This is WhatsHap 0.18 running under Python 3.6.7
Working on 1 samples from 1 family

== SUMMARY ==
Maximum memory usage: 0.239 GB
Time spent reading BAM/CRAM:                    0.0 s
Time spent parsing VCF:                         0.0 s
Time spent selecting reads:                     0.0 s
Time spent phasing:                             0.0 s
Time spent writing VCF:                         0.0 s
Time spent finding components:                  0.0 s
Time spent on rest:                             0.9 s
Total elapsed time:                             0.9 s
Phased VCF written to path/round_0_hap_mixed_thresh_0.04_phased.vcf.
Compressing round_0_hap_mixed_thresh_0.04_phased.vcf.
Compressed file written to path/round_0_hap_mixed_thresh_0.04_phased.vcf.gz.
Indexing round_0_hap_mixed_thresh_0.04_phased.vcf.gz.
Compressed file written to path/round_0_hap_mixed_thresh_0.04_phased.vcf.gz.tbi.

=============================================
Running whatshap tag using round_0_hap_mixed_thresh_0.04_phased.vcf.gz.
=============================================

Found 1 samples in VCF file
Samples in BAM file: 
Processing alignments on chromosome chr1
No variants given for chromosome chr1 in the input VCF.

It's now running whatshap. The vcf file called round_0_hap_mixed_thresh_0.04_phased.vcf.gz is empty though, so that worries me.

from medaka.

cjw85 avatar cjw85 commented on August 16, 2024

I that case I would advise starting from scratch, perhaps testing first on a 1-10Mb region (medaka_variant has a -R option, or simply subset your .bam).

from medaka.

oneillkza avatar oneillkza commented on August 16, 2024

@wdecoster, very likely the same issue as #42 , since I'm using your NA19240 data. ;)

from medaka.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.