RFR (S): 8237777 "Dumping core ..." is shown despite claiming that "# No core dump will be written."

Yasumasa Suenaga suenaga at oss.nttdata.com
Thu Apr 23 00:45:04 UTC 2020


Hi Gerard,

os_linux.cpp:

      if (core_pattern[0] == '|') {
+      // on some systems, PID is not included by default, so don't add it here either
        written = jio_snprintf(buffer, bufferSize,
-                             "\"%s\" (or dumping to %s/core.%d)",
-                             &core_pattern[1], p, current_process_id());
+                             "\"%s\", or dumping to %s/core",
+                             &core_pattern[1], p);

I think it is better to "%s" instead of "%s/core".
I added this code is for the behavior of ABRT. If the handler does not add extension to core file, it is better to show default location (directory).


>> I have filed JDK-8243196 to address related issues next.
> 
> I don't think we have an issue to fix. We do not want to be parsing the core pattern to go and determine what program is going to be used to process core files and then whether that program is installed or not, or then enabled or not.

In Fedora / RHEL / CentOS (maybe Oracle Linux) has different coredump handler such as systemd-coredump, and it is enabled by default.
I left my comment to JBS about them.


Thanks,

Yasumasa


On 2020/04/23 8:21, David Holmes wrote:
> On 23/04/2020 3:10 am, gerard ziemski wrote:
>> hi David,
>>
>> Thank you very much for your review!
>>
>>> Hi Gerard,
>>>
>>> On 21/04/2020 4:33 am, gerard ziemski wrote:
>>>> hi all,
>>>>
>>>> Please review this fix, where we use the knowledge available to us, like whether the ulimit is set, before deciding whether to tell the user in hs_err crash log that we dumped the core file, to match what actually happens.
>>>>
>>>> Before this fix, we often assumed that the core file would be created, when in actuality that might not be the case.
>>> General approach seems fine but some comments below ...
>>>
>>>> I also extend an existing test to go with this fix.
>>>>
>>>> bug:https://bugs.openjdk.java.net/browse/JDK-8237777
>>>> webrev:http://cr.openjdk.java.net/~gziemski/8237777_rev1
>>>> tests:  tested with Mach5 tier-hs1,2,3,4,5
>>> I'm not quite following all of the changes in relation to what gets printed:
>>>
>>> src/hotspot/os/linux/os_linux.cpp
>>>
>>> - "\"%s\" (or dumping to %s/core.%d)",
>>> - &core_pattern[1], p, current_process_id());
>>> + "\"%s\", or dumping to %s/core",
>>> + &core_pattern[1], p);
>>>
>>> why did you change the pattern? I get core files of the form core.<pid> on my linux box.
>>
>> I removed it because I thought it was better to be conservative and print the core file name at its minimum - on my Ubuntu OS, for example, the PID is not added to the filename and the OS only uses "core".
>>
>> I added a comment explaining the assumption here.
> 
> The existing code shows the user there are two possible locations for the core file - addressing both your case and mine. There is no reason for that to be changed.
> 
>>
>>> src/hotspot/os/posix/os_posix.cpp
>>>
>>> Why did you move the ifdef linux case? What if I don't have infinite stack limit?
>>
>> That was one of the main issues here. We would print the message about the core file before checking the getrlimit(RLIMIT_CORE), so it would get printed even for the "case 0 stack limit", when no core file gets created.
> 
> It stated "may". If core dumps are disabled you should see:
> 
> Core dumps may be processed with xxx
> Core dumps have been disabled. To enable core dumping, try \"ulimit -c unlimited\" before starting Java again
> 
> It's not perfect but I don't see any real problem with that - and it avoided duplicating the message line. What you have now only reports the first message in the RLIM_INFINITY case and skips the case where its non-infinite and non-zero. So potentially useful information has been lost.
> 
>> Having said that, I do not like that message to begin with for 2 reasons:
>>
>> #1 it's awkwardly phrased, i.e. "Core dump will be written. Default location: Core dumps may be processed with "/usr/share/apport/apport %p %s %c %d %P %E", or dumping to /home/gerard/Desktop/core.13857"
>>
>> #2 on Ubuntu "apport" (which is the program that processes the core dump) is not enabled by default, so the message is noisy.
> 
> It's awkward because this core processing mechanism is awkward. And yes broken Linux distributions do actually set the core pattern to a program that may not even be installed! That's not our problem. This was all covered in depth when this code was put in.
> 
>> Not having that message in the "case default" is therefore not a big deal, however, for consistency I can either add it to both or remove it from both cases. Or  leave it as in my fix.
> 
> It needs to be kept as-is in existing code or reported in both places. To avoid code duplication I'd keep as-is.
> 
> These message changes are not directly addressing the core (excuse the pun) issue of this bug, which is simply that the dump_core argument to os::abort() was not taking into account whether we could actually dump core. Your fix addresses that nicely.
> 
>> I have filed JDK-8243196 to address related issues next.
> 
> I don't think we have an issue to fix. We do not want to be parsing the core pattern to go and determine what program is going to be used to process core files and then whether that program is installed or not, or then enabled or not.
> 
>>
>>> The printing logic was examined in detail under JDK-8059586: hs_err report should treat redirected core pattern:
>>>
>>> http://hg.openjdk.java.net/jdk9/jdk9/hotspot/rev/30ed7423ae23
>>
>> Thank you for the link.
>>
>>
>>> ---
>>>
>>> src/hotspot/share/utilities/vmError.cpp
>>>
>>> Typo: core_dump_ppossible (double p)
>>
>> Fixed.
>>
>>
>>> ---
>>>
>>> test/hotspot/jtreg/runtime/ErrorHandling/CreateCoredumpOnCrash.java
>>>
>>> ! public static void main(String[] args) throws Exception, Throwable {
>>>
>>> Exception is redundant now you have added Throwable. But if you change runTest() to throws Exception then you don't need Throwable. But the overload of runTest() seems a bit awkward. I'd just add the new tests directly to main.
>>
>> How about if I keep the new tests in a separate method, to keep the new code nicely organized, but rename the overloaded method from simply "runTests()" to "checkFix8237777()" or some other suitable name?
> 
> Some other suitable name please - we avoid using bug numbers in tests like that, other than adding to the @bug line in the test.
> 
> Thanks,
> David
> -----
> 
>>
>> new webrev:http://cr.openjdk.java.net/~gziemski/8237777_rev2
>> testing: in progress...
>>
>>
>> cheers
>>


More information about the hotspot-runtime-dev mailing list