This is a shot in the dark, but I notice you are running with -Xmx100g.
You might try a less aggressive heap allocation.
The stochastic nature of the failures might be due to contention over system resources.
It looks like some of your runs are, in fact, growing the java heap to 100g.
-Bob

On 8/21/14, 12:18 PM, Nils Homer wrote:
This is certainly a strange issue. Perhaps there is a problem using the intel deflator with your JVM and environment. Could you retry by adding to your java command "-Dtry_use_intel_deflater=false"? You could also move the "libIntelDeflater.so" away from the path. You should see a difference in the version line that the tool outputs to stderr:

amd64; Java HotSpot(TM) 64-Bit Server VM 1.7.0_67-b01; Picard version: 1.118(2329276ea55d31ab6b19bab55b9ee7b51e4a446e_1406559781) IntelDeflater

The IntelDeflator should say JdkDeflator if you turned that feature off properly.

Nils


On Wed, Aug 20, 2014 at 7:21 PM, Thomas Sibley <[email protected] <mailto:[email protected]>> wrote:

    On Aug 20, 2014, at 15:58 , Thomas Sibley <[email protected]
    <mailto:[email protected]>> wrote:
    > On Aug 20, 2014, at 14:25 , Thomas Sibley <[email protected]
    <mailto:[email protected]>> wrote:
    >> I'm running into problems using Picard's MarkDuplicates
    utility.  Sometimes I get a JVM crash, sometimes the process just
    seems to hang.  After the initial startup, strace shows nothing
    but sched_yield() and futex() calls.  While the input data which
    hangs has ~900k reads, MarkDuplicates runs in about 1min on ~400k
    reads from a different input file.
    >>
    >> Example informational output from both crashing and hanging
    runs are attached, as well as the crash log of the JVM.
    >>
    >> $ java -version
    >> java version "1.7.0_67"
    >> Java(TM) SE Runtime Environment (build 1.7.0_67-b01)
    >> Java HotSpot(TM) 64-Bit Server VM (build 24.65-b04, mixed mode)
    >
    > Additional information: jconsole never connects to the java
    process and jstack -F produces the attached thread trace.

    The process I ran jstack on just completed without error, after
    130 minutes.  It took a few retry attempts to get a run all the
    way through without a JVM crash.  Attached is the informational
    output from the MarkDuplicates command.

    I'm still interested in tracking down the reason for the JVM
    crashes, if anyone has pointers or ideas.

    I'd also like to know more about what affects the runtime of
    MarkDuplicates.  It's a little wild to me that a ~2 fold increase
    in read number alone might produce a runtime difference from 1
    minute to 130 minutes.  Are there other properties of the input
    data that would affect this?

    Thanks,
    Thomas






    
------------------------------------------------------------------------------
    Slashdot TV.
    Video for Nerds.  Stuff that matters.
    http://tv.slashdot.org/
    _______________________________________________
    Samtools-help mailing list
    [email protected]
    <mailto:[email protected]>
    https://lists.sourceforge.net/lists/listinfo/samtools-help




------------------------------------------------------------------------------
Slashdot TV.
Video for Nerds.  Stuff that matters.
http://tv.slashdot.org/


_______________________________________________
Samtools-help mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/samtools-help

------------------------------------------------------------------------------
Slashdot TV.  
Video for Nerds.  Stuff that matters.
http://tv.slashdot.org/
_______________________________________________
Samtools-help mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/samtools-help

Reply via email to