Coder Social home page Coder Social logo

Comments (5)

droazen avatar droazen commented on July 21, 2024

@DarioS How much memory are you providing to Java via the -Xmx option, and how much physical memory do you have available? You can see how to pass the -Xmx option in to GATK here: https://github.com/broadinstitute/gatk?tab=readme-ov-file#jvmoptions

from gatk.

DarioS avatar DarioS commented on July 21, 2024

-Xmx52g was used. Compute node has 1.5 TB physical RAM. I use af-only-gnomad.hg38.vcf.gz for -V and -L.

from gatk.

droazen avatar droazen commented on July 21, 2024

@DarioS You could try increasing the size of the Java heap (say, doubling it to 104g). Does your bam/cram have extremely high depth?

from gatk.

DarioS avatar DarioS commented on July 21, 2024

I copied 60× BAM file to an interactive Linux server with 768 GB physical RAM and eighty cores and used version 4.5.0.0.

%Cpu(s):  1.3 us,  0.0 sy,  0.1 ni, 98.7 id,  0.0 wa,  0.0 hi,  0.0 si,  0.0 st
GiB Mem :    754.5 total,     52.1 free,    107.3 used,    600.3 buff/cache     
GiB Swap:    931.3 total,    924.9 free,      6.4 used.    647.3 avail Mem 

    PID USER      PR  NI    VIRT    RES    SHR S  %CPU  %MEM     TIME+ COMMAND                                                                                                                                                 
 171365 dario     20   0   35.0g  31.3g  23040 S 100.0   4.1  32:18.12 java   

I removed -Xmx and using top to see the process is consistently at about 32 GB. So, -Xmx is irrelevant to the problem.

12:15:04.531 INFO  ProgressMeter -        Current Locus  Elapsed Minutes        Loci Processed      Loci/Minute
12:57:32.208 INFO  GetPileupSummaries - Shutting down engine
[January 13, 2024 at 12:57:32 PM AEDT] org.broadinstitute.hellbender.tools.walkers.contamination.GetPileupSummaries done. Elapsed time: 50.36 minutes.
Runtime.totalMemory()=20753416192
java.lang.OutOfMemoryError: Java heap space: failed reallocation of scalar replaced objects

What does "reallocation of scalar replaced objects" mean? I don't think it could possibly have run out of memory.

from gatk.

amarinderthind avatar amarinderthind commented on July 21, 2024

I am in a similar boat. Xmx has a default value which is small. Using a specified 448 GB limit shows that this module is inefficient.

    PID USER      PR  NI    VIRT    RES    SHR S  %CPU  %MEM     TIME+ COMMAND
 172833 thind     20   0  468.7g 378.7g  31360 S  99.9  50.2  29:16.31 java

The analysis dies a few seconds later because GATK tries to create impossibly-large Java array.

org.broadinstitute.hellbender.tools.walkers.contamination.GetPileupSummaries done. Elapsed time: 25.63 minutes.
Runtime.totalMemory()=481036337152
java.lang.OutOfMemoryError: Required array length 2147483640 + 16 is too large
        at java.base/jdk.internal.util.ArraysSupport.hugeLength(ArraysSupport.java:649)
        at java.base/jdk.internal.util.ArraysSupport.newLength(ArraysSupport.java:642)

I can independently reproduce Dario's problem on the same Linux server.

from gatk.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.