Thank you very much, Coleen. I have converted the flag into a diagnostic flag. The revised webrev is at:

http://cr.openjdk.java.net/~jgeorge/8200613/webrev.03/

A plain diff between this patch and the earlier one is at:

http://cr.openjdk.java.net/~jgeorge/8200613/incremental_diff

I will withdraw the associated CSR.

Thanks!
Jini.

On 12/10/2018 11:55 PM, coleen.phillim...@oracle.com wrote:

I see.  In this case, the flag should be diagnostic and not require a CSR.  I suppose some documentation should be added so that sustaining will know about the option in this case (Hi Kevin!)

thanks,
Coleen


On 12/10/18 1:38 AM, Ioi Lam wrote:
Hi Coleen,

I was one of the people who suggested the DumpPrivateMappingsInCore flag. It's enabled by default, so by default all the contents of mmap'ed files with MAP_PRIVATE will be saved to the core files.

The worry is, there may be some extreme cases where the JVM has mapped very large files (with NIO or JNI, etc). For example, you could have a 100GB in-memory database. For those cases, if the user is experiencing crashes, but they are unable to get a core dump (because it would be too large), they can try running with -XX:-DumpPrivateMappingsInCore.

Thanks

- Ioi


On 12/7/18 12:03 PM, coleen.phillim...@oracle.com wrote:

Hi Jini,  We were just talking about this new option.  If someone gets a crash, I don't think they're going to run their application again with -XX:-DumpPrivateMappingsInCore in the case of the core file being too large.  So I don't know how generally useful this option is. I think it would be better to not add it and set the bit to include the mappings unconditionally.

How much larger is this core file (we had trouble understanding from the bug)?  If you need the mappings to understand and use the SA tools on it, we want to have them.

Thanks,
Coleen


On 12/7/18 2:22 PM, Jini George wrote:
I have the revised webrev here:

http://cr.openjdk.java.net/~jgeorge/8200613/webrev.02/index.html

The extra changes here are to:

* Introduce a new option DumpPrivateMappingsInCore to control the dumping of the file backed private regions into the corefile.

* Close the modules file before dumping core in os::abort(). Currently, there is a small bug (https://bugs.openjdk.java.net/browse/JDK-8215026) which prevents the closure of the image file in unmapping the entire file.

I plan to take up the unmapping of NIO MapMode.PRIVATE files as a separate task (https://bugs.openjdk.java.net/browse/JDK-8215027) since this seems a bit involved.

Thanks a bunch,
Jini.

On 11/12/2018 10:26 AM, Jini George wrote:
Thank you very much, Chris, Kevin and Ioi for your comments!

I will send another webrev with this change enabled under an opt-out flag, as you suggest, and would look at unmapping the JDK modules file and if possible, the NIO mapped files too in the signal handler.

Thanks a bunch,
Jini.

On 11/9/2018 11:53 PM, Ioi Lam wrote:
Hi Jini,

Thanks for investigating the size expansion issue.

I agree that the size increase is worth it. Even when not using SA, if we open the core file inside GDB, we cannot read certain sections in the CDS archive (such as the RO section and strings sections). That would make debugging difficult. So I am in favor of this change.

For the JDK modules file, maybe we can unmap it in the signal handler, before going ahead with the core dump? I think it's hardly needed for debugging purposes. (Perhaps we can also do the same for the NIO mapped files?)

A opt-flag as suggested by Kevin is a good idea.

Thanks

- Ioi

On 11/9/18 3:29 AM, Kevin Walls wrote:
Hi Jini,

Looks good to me.  It might be a significant increase in size of _some_ core files, but so many core files we see are much larger, in gigabytes++ of course, so the CDS data size should not be such a significant increase on (I think) most files.

The flexibiity of always having the CDS data there is very significant.  A core file should ideally be usable, without additionally requiring the CDS archive from the machine. That additional human round-trip upload request on every transmitted core that needs investigating, seems like a less efficient route...).

Is there an opt-out?  It's conditional on UseSharedSpaces but could there be a flag to disable, in case we see crashes with gigabytes of private mappings that we really don't want to retain (the user would have to know to set a flag, to disable the new coredump filter ahead of time).

Thanks!
Kevin


On 29/10/2018 06:02, Jini George wrote:
Thank you very much, Ioi, for looking into this, and the clarification offline. My bad, I had missed the earlier mail from you. :-( My responses below.

Yes, I had tested this on MacOS. The issue does not exist on MacOS since the file backed private mmap()-ed regions get dumped into the MacOS corefiles by default.

The corefile sizes on Linux do increase due to this change. And the increase would also include any file mapped using NIO with MapMode.PRIVATE. The typical corefile size increase with this change would include the following components at a high level:

* Any NIO file mapping with MapMode.PRIVATE.
* Any file mmap()-ed by any native library with MAP_PRIVATE.
* The read only CDS regions (ro and od): Of the order of a few MB.
* The shared strings CDS region. (typically less than 1 MB).
* 2 MB per native shared library (regions with ---p permissions mapped by the dynamic linker for better alignment and for keeping libraries efficiently shareable).
* The JDK 'modules' file. (About 140 MB).

So, without including any NIO mapping, I typically see around 250-300 MB increase in the corefile sizes. I agree that the size increase could be a cause for concern, but for FWIW, these privately mapped files get dumped into the corefile for MacOS too. And the corefile sizes for the same program on MacOS are way larger (of the order of a few GB as against about 300 MB on Linux (without the change)).

The advantage of fixing this by modifying the coredump_filter v/s doing it in SA (by reading in more sections of the shared archive file) is that this would benefit other debuggers like gdb also. (And reduces the dependence on having the shared archive file being available at the time of debugging). If folks still think this is a cause for concern, I could make modifications to fix this by reading in the regions from the shared archive file in the SA code. I also wonder if it is worth taking a relook at the mapping types of the various CDS regions also.

Thank you,
Jini.

On 10/22/2018 10:27 AM, Ioi Lam wrote:
Hi Jini,

Did you see my earlier reply? I might have sent it out during the mail server outage days :-(

http://openjdk.5641.n7.nabble.com/RFR-S-JDK-8200613-SA-jstack-throws-UnmappedAddressException-with-a-CDS-core-file-td352681.html#a353026

Here was my reply again:

Hi Jini,

The changes looks good to me.

Have you tested this on MacOS? CDS heap support is also enabled on
MacOS. See macros.hpp:

     #if INCLUDE_CDS && INCLUDE_G1GC && defined(_LP64) && !defined(_WINDOWS)
     #define INCLUDE_CDS_JAVA_HEAP 1

Also, besides CDS, do we know how often other files will be mmaped with MAP_PRIVATE? Will users get huge core files because CDS is enabled? In JDK 12, CDS will be enabled by default (see JDK-8202951), so all users
will be affected by the following line:

   if (UseSharedSpaces) {
     set_coredump_filter(FILE_BACKED_PVT_BIT);
   }

Maybe you can run an big app such as Eclipse, trigger a core dump, and
compare the size of the core file before/after this change?

Thanks

- Ioi






Thanks

- Ioi


On 10/21/18 8:58 PM, Jini George wrote:
Gentle reminder!

Thanks,
- Jini

On 10/9/2018 11:31 AM, Jini George wrote:
Hello!

[Including runtime-dev since the changes are in runtime code]

Requesting reviews for:

Webrev: http://cr.openjdk.java.net/~jgeorge/8200613/webrev.01/
BugID: https://bugs.openjdk.java.net/browse/JDK-8200613

Issue: jhsdb jstack would throw an UnmappedAddressException with a core file generated from a CDS enabled java process. This is seen only with Linux and with G1GC, while trying to read in data from the shared strings region (the closed archive heap space). This region (which is a file backed private memory region) is not dumped into the corefile for Linux. This, being a heap region (and therefore being a read-write region) is also not read in from the classes.jsa file in SA since only the read only regions are read in while processing the core file. (The expectation being that the read write regions are in the core file).

Proposed solution: The proposed solution is to have the coredump_filter value corresponding to the CDS process to include bit 2 (file-backed private memory), so that the file-backed private memory region also gets dumped into the corefile. The proposed fix is in src/hotspot/os/linux/os_linux.cpp.

Thanks,
Jini.







Reply via email to