From erik.joelsson at oracle.com Thu Jan 2 07:57:07 2020 From: erik.joelsson at oracle.com (Erik Joelsson) Date: Thu, 2 Jan 2020 08:57:07 +0100 Subject: [15] Review Request: 8235975 Update copyright year to match last edit in jdk repository for 2014/15/16/17/18 In-Reply-To: <3460a6f6-6178-cc45-5840-0f215eebc53f@oracle.com> References: <1e7d0395-fc57-4d5b-9cfa-c33e0f6462d5@oracle.com> <3460a6f6-6178-cc45-5840-0f215eebc53f@oracle.com> Message-ID: <7b67e39d-752c-50a8-8e18-9a8f86bd641c@oracle.com> Build files look good. /Erik On 2019-12-24 19:22, Sergey Bylokhov wrote: > Hello. > > Here is an updated version: > ? Bug: https://bugs.openjdk.java.net/browse/JDK-8235975 > ? Patch (2 Mb): > http://cr.openjdk.java.net/~serb/8235975/webrev.03/open.patch > ? Fix: http://cr.openjdk.java.net/~serb/8235975/webrev.03/ > > ?- "jdk.internal.vm.compiler" is removed from the patch. > ?- "Aes128CtsHmacSha2EType.java" is updated to "Copyright (c) 2018" > > On 12/22/19 11:24 pm, Sergey Bylokhov wrote: >> Hello. >> Please review the fix for JDK 15. >> >> Bug: https://bugs.openjdk.java.net/browse/JDK-8235975 >> Patch (2 Mb): >> http://cr.openjdk.java.net/~serb/8235975/webrev.02/open.patch >> Fix: http://cr.openjdk.java.net/~serb/8235975/webrev.02 >> >> I have updated the source code copyrights by the >> "update_copyright_year.sh" >> script for 2014/15/16/18/19 years, unfortunately, cannot run it for 2017 >> because of: "JDK-8187443: Forest Consolidation: Move files to unified >> layout" >> which touched all files. 
>> >> > > From Sergey.Bylokhov at oracle.com Thu Jan 2 12:02:14 2020 From: Sergey.Bylokhov at oracle.com (Sergey Bylokhov) Date: Thu, 2 Jan 2020 15:02:14 +0300 Subject: [15] Review Request: 8235975 Update copyright year to match last edit in jdk repository for 2014/15/16/17/18 In-Reply-To: <7b67e39d-752c-50a8-8e18-9a8f86bd641c@oracle.com> References: <1e7d0395-fc57-4d5b-9cfa-c33e0f6462d5@oracle.com> <3460a6f6-6178-cc45-5840-0f215eebc53f@oracle.com> <7b67e39d-752c-50a8-8e18-9a8f86bd641c@oracle.com> Message-ID: <0caab700-28e3-16e7-db00-b698557443f0@oracle.com> I guess it is too late to fix it, will need to update the files at the end of 2020. On 1/2/20 10:57 am, Erik Joelsson wrote: > Build files look good. > > /Erik > > On 2019-12-24 19:22, Sergey Bylokhov wrote: >> Hello. >> >> Here is an updated version: >> ? Bug: https://bugs.openjdk.java.net/browse/JDK-8235975 >> ? Patch (2 Mb): http://cr.openjdk.java.net/~serb/8235975/webrev.03/open.patch >> ? Fix: http://cr.openjdk.java.net/~serb/8235975/webrev.03/ >> >> ?- "jdk.internal.vm.compiler" is removed from the patch. >> ?- "Aes128CtsHmacSha2EType.java" is updated to "Copyright (c) 2018" >> >> On 12/22/19 11:24 pm, Sergey Bylokhov wrote: >>> Hello. >>> Please review the fix for JDK 15. >>> >>> Bug: https://bugs.openjdk.java.net/browse/JDK-8235975 >>> Patch (2 Mb): http://cr.openjdk.java.net/~serb/8235975/webrev.02/open.patch >>> Fix: http://cr.openjdk.java.net/~serb/8235975/webrev.02 >>> >>> I have updated the source code copyrights by the "update_copyright_year.sh" >>> script for 2014/15/16/18/19 years, unfortunately, cannot run it for 2017 >>> because of: "JDK-8187443: Forest Consolidation: Move files to unified layout" >>> which touched all files. >>> >>> >> >> -- Best regards, Sergey. 
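The update described in the thread above — setting each file's copyright header to the year of its last repository edit — boils down to a per-line rewrite. A minimal Java sketch of that rewrite (hypothetical class and method names; the real work is done by the update_copyright_year.sh script, not this code):

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch of the rewrite update_copyright_year.sh performs on
// each copyright header: bump (or add) the second year of the
// "Copyright (c) FIRST, LAST," range to the file's last-edit year.
public class CopyrightYear {
    private static final Pattern HEADER =
        Pattern.compile("Copyright \\(c\\) (\\d{4})(?:, (\\d{4}))?,");

    static String update(String line, int lastEditYear) {
        Matcher m = HEADER.matcher(line);
        if (!m.find()) {
            return line;                      // no recognizable header
        }
        int first = Integer.parseInt(m.group(1));
        if (lastEditYear <= first) {
            return line;                      // nothing to bump
        }
        // Replace the matched range with "FIRST, LAST-EDIT-YEAR,"
        return m.replaceFirst("Copyright (c) " + first + ", " + lastEditYear + ",");
    }
}
```

For example, `update("* Copyright (c) 2003, 2015, Oracle ...", 2018)` would yield a `2003, 2018` range. The 2017 problem mentioned above remains outside this sketch: when one changeset (JDK-8187443) touched every file, "last edit" no longer reflects a real content change, so that year has to be excluded from the history query feeding the script.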
From matthias.baesken at sap.com Thu Jan 2 13:26:15 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Thu, 2 Jan 2020 13:26:15 +0000 Subject: RFR: 8236617: jtreg test containers/docker/TestMemoryAwareness.java fails after 8226575 Message-ID: Hello, please review this small adjustment to jtreg test containers/docker/TestMemoryAwareness.java . After change "8226575: OperatingSystemMXBean should be made container aware" has been pushed, we observe failures on linux s390x / ppc64le in the docker related jtreg tests . The test runs into the following error : java.lang.RuntimeException: 'OperatingSystemMXBean.getTotalSwapSpaceSize: 52428800' missing from stdout/stderr at jdk.test.lib.process.OutputAnalyzer.shouldContain(OutputAnalyzer.java:187) at TestMemoryAwareness.testOperatingSystemMXBeanAwareness(TestMemoryAwareness.java:154) at TestMemoryAwareness.main(TestMemoryAwareness.java:65) at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.base/java.lang.reflect.Method.invoke(Method.java:564) at com.sun.javatest.regtest.agent.MainWrapper$MainThread.run(MainWrapper.java:127) at java.base/java.lang.Thread.run(Thread.java:832) The reason is that the value found is instead OperatingSystemMXBean.getTotalSwapSpaceSize: -104857600 . When looking into the getTotalSwapSpaceSize() function, we get values of 0 for "limit" and 104857600 for "memLimit" : 57 long limit = containerMetrics.getMemoryAndSwapLimit(); .... 62 long memLimit = containerMetrics.getMemoryLimit(); 63 if (limit >= 0 && memLimit >= 0) { 64 return limit - memLimit; 65 } That explains the value "-104857600" . We see messages "Your kernel does not support swap limit capabilities or the cgroup is not mounted. 
Memory limited without swap" , this most likely causes the unexpected limit == 0 value . Bug/webrev : https://bugs.openjdk.java.net/browse/JDK-8236617 http://cr.openjdk.java.net/~mbaesken/webrevs/8236617.0/ Thanks, Matthias From martin.doerr at sap.com Thu Jan 2 13:46:51 2020 From: martin.doerr at sap.com (Doerr, Martin) Date: Thu, 2 Jan 2020 13:46:51 +0000 Subject: RFR: 8236617: jtreg test containers/docker/TestMemoryAwareness.java fails after 8226575 In-Reply-To: References: Message-ID: Hi Matthias, thanks for fixing it. I suggest to put "Your kernel does not support swap limit capabilities or the cgroup is not mounted" into a String instead of having it 3 times. Looks good to me otherwise. Best regards, Martin > -----Original Message----- > From: hotspot-dev On Behalf Of > Baesken, Matthias > Sent: Donnerstag, 2. Januar 2020 14:26 > To: 'hotspot-dev at openjdk.java.net' > Subject: RFR: 8236617: jtreg test > containers/docker/TestMemoryAwareness.java fails after 8226575 > > Hello, please review this small adjustment to jtreg test > containers/docker/TestMemoryAwareness.java . > > After change "8226575: OperatingSystemMXBean should be made container > aware" has been pushed, > we observe failures on linux s390x / ppc64le in the docker related jtreg tests > . 
> > > The test runs into the following error : > java.lang.RuntimeException: > 'OperatingSystemMXBean.getTotalSwapSpaceSize: 52428800' missing from > stdout/stderr > > at > jdk.test.lib.process.OutputAnalyzer.shouldContain(OutputAnalyzer.java:187) > at > TestMemoryAwareness.testOperatingSystemMXBeanAwareness(TestMem > oryAwareness.java:154) > at TestMemoryAwareness.main(TestMemoryAwareness.java:65) > at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native > Method) > at > java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMet > hodAccessorImpl.java:62) > at > java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(Delega > tingMethodAccessorImpl.java:43) > at java.base/java.lang.reflect.Method.invoke(Method.java:564) > at > com.sun.javatest.regtest.agent.MainWrapper$MainThread.run(MainWrapp > er.java:127) > at java.base/java.lang.Thread.run(Thread.java:832) > > > The reason is that the value found is instead > OperatingSystemMXBean.getTotalSwapSpaceSize: -104857600 . > When looking into the getTotalSwapSpaceSize() function, we get values of 0 > for "limit" and 104857600 for "memLimit" : > > 57 long limit = containerMetrics.getMemoryAndSwapLimit(); > .... > 62 long memLimit = containerMetrics.getMemoryLimit(); > 63 if (limit >= 0 && memLimit >= 0) { > 64 return limit - memLimit; > 65 } > > That explains the value "-104857600" . We see messages "Your kernel does > not support swap limit capabilities or the cgroup is not mounted. Memory > limited without swap" , this most likely > causes the unexpected limit == 0 value . 
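Martin's suggestion above — putting the kernel warning into a single String instead of repeating the literal three times — might look like this (an illustrative sketch with invented names, not the actual TestMemoryAwareness code):

```java
// Hypothetical sketch: keep the docker/kernel warning the test matches
// against in one constant so the three call sites cannot drift apart.
public class SwapWarningCheck {
    static final String NO_SWAP_LIMIT_WARNING =
        "Your kernel does not support swap limit capabilities"
        + " or the cgroup is not mounted. Memory limited without swap";

    // Each place in the test can then reuse the constant, e.g. the real
    // test would call out.shouldContain(NO_SWAP_LIMIT_WARNING).
    static boolean reportsNoSwapLimit(String dockerOutput) {
        return dockerOutput.contains(NO_SWAP_LIMIT_WARNING);
    }
}
```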
> > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8236617 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236617.0/ > > > Thanks, Matthias From bob.vandette at oracle.com Thu Jan 2 16:45:24 2020 From: bob.vandette at oracle.com (Bob Vandette) Date: Thu, 2 Jan 2020 11:45:24 -0500 Subject: RFR: 8236617: jtreg test containers/docker/TestMemoryAwareness.java fails after 8226575 In-Reply-To: References: Message-ID: Matthias, I really don't like testing for some Docker message that could possibly change or go away in the future. There may be other reasons that the getTotalSwapSpaceSize function will fail and return 0. The real problem here is that the OperatingSystemMXBean.getTotalSwapSpaceSize is returning a negative value when there is no swap available. I'd prefer that we fix this problem by correcting the getTotalSwapSpaceSize function to properly return 0 under these conditions and allow 0 to be a valid expected result in the test. if (limit >= 0 && memLimit >= 0) { return (limit < memLimit) ? 0 : limit - memLimit; } Note: My suggestion assumes that there is no swap available when the kernel swap limit capability is not enabled. I have not verified this. The message does claim that this is the case "Memory limited without swap". Bob. > On Jan 2, 2020, at 8:26 AM, Baesken, Matthias wrote: > > Hello, please review this small adjustment to jtreg test containers/docker/TestMemoryAwareness.java . > > After change "8226575: OperatingSystemMXBean should be made container aware" has been pushed, > we observe failures on linux s390x / ppc64le in the docker related jtreg tests . 
> > > The test runs into the following error : > java.lang.RuntimeException: 'OperatingSystemMXBean.getTotalSwapSpaceSize: 52428800' missing from stdout/stderr > > at jdk.test.lib.process.OutputAnalyzer.shouldContain(OutputAnalyzer.java:187) > at TestMemoryAwareness.testOperatingSystemMXBeanAwareness(TestMemoryAwareness.java:154) > at TestMemoryAwareness.main(TestMemoryAwareness.java:65) > at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) > at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > at java.base/java.lang.reflect.Method.invoke(Method.java:564) > at com.sun.javatest.regtest.agent.MainWrapper$MainThread.run(MainWrapper.java:127) > at java.base/java.lang.Thread.run(Thread.java:832) > > > The reason is that the value found is instead OperatingSystemMXBean.getTotalSwapSpaceSize: -104857600 . > When looking into the getTotalSwapSpaceSize() function, we get values of 0 for "limit" and 104857600 for "memLimit" : > > 57 long limit = containerMetrics.getMemoryAndSwapLimit(); > .... > 62 long memLimit = containerMetrics.getMemoryLimit(); > 63 if (limit >= 0 && memLimit >= 0) { > 64 return limit - memLimit; > 65 } > > That explains the value "-104857600" . We see messages "Your kernel does not support swap limit capabilities or the cgroup is not mounted. Memory limited without swap" , this most likely > causes the unexpected limit == 0 value . 
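Bob's proposed correction can be sketched as a standalone method (hypothetical names and fallback value; the real logic lives in the container-aware OperatingSystemMXBean implementation):

```java
// Sketch of the proposed fix: never report a negative swap size when the
// container's memory+swap limit ("limit") is not larger than the memory
// limit ("memLimit"), as happens when the kernel lacks swap limit
// capabilities and limit comes back as 0.
public class SwapSize {
    static long totalSwapSpaceSize(long limit, long memLimit) {
        if (limit >= 0 && memLimit >= 0) {
            // Old code returned limit - memLimit unconditionally, which
            // produced -104857600 in the failing test run.
            return (limit < memLimit) ? 0 : limit - memLimit;
        }
        return -1; // limits unavailable (sketch only; the real fallback differs)
    }
}
```

With the failing configuration from the bug report (limit 0, memLimit 104857600) this returns 0 instead of -104857600, so the test can accept 0 as a valid result.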
> > > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8236617 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236617.0/ > > > Thanks, Matthias From joe.darcy at oracle.com Thu Jan 2 21:26:36 2020 From: joe.darcy at oracle.com (Joe Darcy) Date: Thu, 2 Jan 2020 13:26:36 -0800 Subject: RFR(S): 8236111 : narrow allowSmartActionArgs disabling In-Reply-To: <0BA46866-3DEA-44BF-B87C-2B59B84196C9@oracle.com> References: <423ea31a-ebf8-4cba-72a4-6fbb934f7789@oracle.com> <0BA46866-3DEA-44BF-B87C-2B59B84196C9@oracle.com> Message-ID: <916e0375-abba-9945-b845-0fd4198513f0@oracle.com> The removal of the existing TEST.properties files look fine. Please also solicit feedback from the security libs team as their area is affected. Roger, FYI the serial filter tests are updated as part of this changeset. Cheers, -Joe On 12/23/2019 8:13 PM, Igor Ignatyev wrote: > Thanks David. > > core-libs folks, could you please review jdk part of this patch? > > Thanks, > -- Igor > >> On Dec 23, 2019, at 1:33 PM, David Holmes wrote: >> >> Hi Igor, >> >> Hotspot changes seem fine. Can't comment on jdk tests. >> >> Thanks, >> David >> >> On 24/12/2019 6:42 am, Igor Ignatyev wrote: >>> ping? >>>> On Dec 17, 2019, at 11:30 AM, Igor Ignatyev wrote: >>>> >>>> http://cr.openjdk.java.net/~iignatyev/8236111/webrev.00/ >>>>> 31 lines changed: 20 ins; 11 del; 0 mod; >>>> Hi all, >>>> >>>> could you please review this small patch which enables allowSmartActionArgs in hotspot and jdk test suites and disables them in a small number of test directories? the patch also removes TEST.properties files which enabled allowSmartActionArgs as they aren't needed anymore. >>>> >>>> from JBS: >>>>> currently, allowSmartActionArgs is disabled for the whole hotspot and jdk test suites and enabled just in few places. this makes it a bit harder for people to use smart action arguments in these test suites as they have to not to forget to enable them. 
and given in all the other test suites, smart action arguments are enabled, it can be confusing and frustrating. >>>> >>>> testing: tier1-5 >>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8236111 >>>> webrev: http://cr.openjdk.java.net/~iignatyev/8236111/webrev.00/ >>>> >>>> Thanks, >>>> -- Igor From matthias.baesken at sap.com Fri Jan 3 10:15:16 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Fri, 3 Jan 2020 10:15:16 +0000 Subject: RFR: 8236617: jtreg test containers/docker/TestMemoryAwareness.java fails after 8226575 In-Reply-To: References: Message-ID: Hi Bob, Looking at the docker sources, the message seems to come from here : daemon_unix.go : // verifyPlatformContainerResources performs platform-specific validation of the container's resource-configuration func verifyPlatformContainerResources(resources *containertypes.Resources, sysInfo *sysinfo.SysInfo, update bool) (warnings []string, err error) { ..... // // means resources have positive Memory limit, memory+swap is not unlimited AND SwapLimit (memory.memsw.limit_in_bytes ?) is not enabled [comment added by me] if resources.Memory > 0 && resources.MemorySwap != -1 && !sysInfo.SwapLimit { warnings = append(warnings, "Your kernel does not support swap limit capabilities or the cgroup is not mounted. Memory limited without swap.") resources.MemorySwap = -1 } with Resources from hostconfig.go : // Resources contains container's resources (cgroups config, ulimits...) type Resources struct { Memory int64 // Memory limit (in bytes) ... MemorySwap int64 // Total memory usage (memory + swap); set `-1` to enable unlimited swap So I think your suggestion to return 0 in that special case in function getTotalSwapSpaceSize sounds reasonable to me (at least better than returning a large negative value). 
New webrev : http://cr.openjdk.java.net/~mbaesken/webrevs/8236617.2/ Thanks, Matthias > > Matthias, > > I really don?t like testing for some Docker message that could possibly change > or go away in the future. > There may be other reasons that the getTotalSwapSpaceSize function will fail > and return 0. > > The real problem here is that the > OperatingSystemMXBean.getTotalSwapSpaceSize is returning a > negative value when there is no swap available. > > I?d prefer that we fix this problem by correcting the getTotalSwapSpaceSize > function to properly return 0 > under these conditions and allow 0 to be a valid expected result in the test. > > if (limit >= 0 && memLimit >= 0) { > return (limit < memLimit) ? 0 : limit - memLimit; > } > > Note: My suggestion assumes that there is no swap available when the > kernel swap limit capability is not enabled. > I have not verified this. The message does claim that this is the case > "Memory limited without swap?. > > Bob. > > > > On Jan 2, 2020, at 8:26 AM, Baesken, Matthias > wrote: > > > > Hello, please review this small adjustment to jtreg test > containers/docker/TestMemoryAwareness.java . > > > > After change "8226575: OperatingSystemMXBean should be made > container aware" has been pushed, > > we observe failures on linux s390x / ppc64le in the docker related jtreg > tests . 
> > > > > > The test runs into the following error : > > java.lang.RuntimeException: > 'OperatingSystemMXBean.getTotalSwapSpaceSize: 52428800' missing from > stdout/stderr > > > > at > jdk.test.lib.process.OutputAnalyzer.shouldContain(OutputAnalyzer.java:187) > > at > TestMemoryAwareness.testOperatingSystemMXBeanAwareness(TestMem > oryAwareness.java:154) > > at TestMemoryAwareness.main(TestMemoryAwareness.java:65) > > at > java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native > Method) > > at > java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMet > hodAccessorImpl.java:62) > > at > java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(Delega > tingMethodAccessorImpl.java:43) > > at java.base/java.lang.reflect.Method.invoke(Method.java:564) > > at > com.sun.javatest.regtest.agent.MainWrapper$MainThread.run(MainWrapp > er.java:127) > > at java.base/java.lang.Thread.run(Thread.java:832) > > > > > > The reason is that the value found is instead > OperatingSystemMXBean.getTotalSwapSpaceSize: -104857600 . > > When looking into the getTotalSwapSpaceSize() function, we get values of > 0 for "limit" and 104857600 for "memLimit" : > > > > 57 long limit = containerMetrics.getMemoryAndSwapLimit(); > > .... > > 62 long memLimit = containerMetrics.getMemoryLimit(); > > 63 if (limit >= 0 && memLimit >= 0) { > > 64 return limit - memLimit; > > 65 } > > > > That explains the value "-104857600" . We see messages "Your kernel > does not support swap limit capabilities or the cgroup is not mounted. > Memory limited without swap" , this most likely > > causes the unexpected limit == 0 value . 
> > > > > > Bug/webrev : > > > > https://bugs.openjdk.java.net/browse/JDK-8236617 > > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236617.0/ > > > > > > Thanks, Matthias From glaubitz at physik.fu-berlin.de Sat Jan 4 11:18:16 2020 From: glaubitz at physik.fu-berlin.de (John Paul Adrian Glaubitz) Date: Sat, 4 Jan 2020 12:18:16 +0100 Subject: Status for JDK-8199138, riscv64 support for Zero Message-ID: <96a56c15-5952-ab7c-8427-4860918f3c4a@physik.fu-berlin.de> Hi! Debian and several other distribution already have already been bootstrapped for riscv64 with a large number of packages building fine. Support for OpenJDK Zero has been added through a slightly modified version of JDK-8199138 [1]. Looking at the bug report for JDK-8199138, it seems that the patch was retracted back in 2018. However, since the Debian version of the patch works fine, I was wondering whether we could get it merged in one form or another? Thanks, Adrian > [1] https://git.launchpad.net/~openjdk/ubuntu/+source/openjdk/+git/openjdk/tree/debian/patches/riscv64.diff?h=openjdk-13 > [2] https://bugs.openjdk.java.net/browse/JDK-8199138 -- .''`. John Paul Adrian Glaubitz : :' : Debian Developer - glaubitz at debian.org `. `' Freie Universitaet Berlin - glaubitz at physik.fu-berlin.de `- GPG: 62FF 8A75 84E0 2956 9546 0006 7426 3B37 F5B5 F913 From christoph.langer at sap.com Sun Jan 5 22:21:40 2020 From: christoph.langer at sap.com (Langer, Christoph) Date: Sun, 5 Jan 2020 22:21:40 +0000 Subject: RFR: 8236617: jtreg test containers/docker/TestMemoryAwareness.java fails after 8226575 In-Reply-To: References: Message-ID: Hi Matthias, this change looks good to me. Best regards Christoph > -----Original Message----- > From: hotspot-dev On Behalf Of > Baesken, Matthias > Sent: Freitag, 3. 
Januar 2020 11:15 > To: Bob Vandette > Cc: hotspot-dev at openjdk.java.net > Subject: RE: RFR: 8236617: jtreg test > containers/docker/TestMemoryAwareness.java fails after 8226575 > > HI Bob, > Looking at the docker sources, the message seems to come from here : > > daemon_unix.go : > > // verifyPlatformContainerResources performs platform-specific validation of > the container's resource-configuration > func verifyPlatformContainerResources(resources > *containertypes.Resources, sysInfo *sysinfo.SysInfo, update bool) (warnings > []string, err error) { > ..... > // > // means resources have positive Memory limit, memory+swap is not > unlimited AND SwapLimit (memory.memsw.limit_in_bytes ?) is not enabled > [comment added by me) > if resources.Memory > 0 && resources.MemorySwap != -1 && > !sysInfo.SwapLimit { > warnings = append(warnings, "Your kernel does not support > swap limit capabilities or the cgroup is not mounted. Memory limited without > swap.") > resources.MemorySwap = -1 > } > > with Resources from hostconfig.go : > > // Resources contains container's resources (cgroups config, ulimits...) > type Resources struct { > Memory int64 // Memory limit (in bytes) > ... > MemorySwap int64 // Total memory usage (memory + > swap); set `-1` to enable unlimited swap > > > > So I think your suggestion to return 0 in that special case in function > getTotalSwapSpaceSize sounds reasonable to me ( at least better than > return a large negative value ). > New webrev : > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236617.2/ > > > > Thanks, Matthias > > > > > > > > Matthias, > > > > I really don?t like testing for some Docker message that could possibly > change > > or go away in the future. > > There may be other reasons that the getTotalSwapSpaceSize function will > fail > > and return 0. > > > > The real problem here is that the > > OperatingSystemMXBean.getTotalSwapSpaceSize is returning a > > negative value when there is no swap available. 
> > > > I?d prefer that we fix this problem by correcting the getTotalSwapSpaceSize > > function to properly return 0 > > under these conditions and allow 0 to be a valid expected result in the test. > > > > if (limit >= 0 && memLimit >= 0) { > > return (limit < memLimit) ? 0 : limit - memLimit; > > } > > > > Note: My suggestion assumes that there is no swap available when the > > kernel swap limit capability is not enabled. > > I have not verified this. The message does claim that this is the case > > "Memory limited without swap?. > > > > Bob. > > > > > > > On Jan 2, 2020, at 8:26 AM, Baesken, Matthias > > wrote: > > > > > > Hello, please review this small adjustment to jtreg test > > containers/docker/TestMemoryAwareness.java . > > > > > > After change "8226575: OperatingSystemMXBean should be made > > container aware" has been pushed, > > > we observe failures on linux s390x / ppc64le in the docker related jtreg > > tests . > > > > > > > > > The test runs into the following error : > > > java.lang.RuntimeException: > > 'OperatingSystemMXBean.getTotalSwapSpaceSize: 52428800' missing from > > stdout/stderr > > > > > > at > > > jdk.test.lib.process.OutputAnalyzer.shouldContain(OutputAnalyzer.java:187) > > > at > > > TestMemoryAwareness.testOperatingSystemMXBeanAwareness(TestMem > > oryAwareness.java:154) > > > at TestMemoryAwareness.main(TestMemoryAwareness.java:65) > > > at > > java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native > > Method) > > > at > > > java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMet > > hodAccessorImpl.java:62) > > > at > > > java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(Delega > > tingMethodAccessorImpl.java:43) > > > at java.base/java.lang.reflect.Method.invoke(Method.java:564) > > > at > > > com.sun.javatest.regtest.agent.MainWrapper$MainThread.run(MainWrapp > > er.java:127) > > > at java.base/java.lang.Thread.run(Thread.java:832) > > > > > > > > > The reason is that the value 
found is instead > > OperatingSystemMXBean.getTotalSwapSpaceSize: -104857600 . > > > When looking into the getTotalSwapSpaceSize() function, we get values > of > > 0 for "limit" and 104857600 for "memLimit" : > > > > > > 57 long limit = containerMetrics.getMemoryAndSwapLimit(); > > > .... > > > 62 long memLimit = containerMetrics.getMemoryLimit(); > > > 63 if (limit >= 0 && memLimit >= 0) { > > > 64 return limit - memLimit; > > > 65 } > > > > > > That explains the value "-104857600" . We see messages "Your kernel > > does not support swap limit capabilities or the cgroup is not mounted. > > Memory limited without swap" , this most likely > > > causes the unexpected limit == 0 value . > > > > > > > > > Bug/webrev : > > > > > > https://bugs.openjdk.java.net/browse/JDK-8236617 > > > > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236617.0/ > > > > > > > > > Thanks, Matthias From aph at redhat.com Mon Jan 6 18:16:37 2020 From: aph at redhat.com (Andrew Haley) Date: Mon, 6 Jan 2020 18:16:37 +0000 Subject: RFR: 8210498: nmethod entry barriers In-Reply-To: <5BBB538B.5070404@oracle.com> References: <5BB77E18.6040401@oracle.com> <399d9f0b-e66b-30e1-16b3-4873845367a9@redhat.com> <5BBB538B.5070404@oracle.com> Message-ID: <22b7f727-929c-b159-316e-78b76543b4fd@redhat.com> On 10/8/18 1:54 PM, Erik Österlund wrote: > Also note that the implementation space of the barrier itself has some > flexibility. Rickard's first prototype involved having an unconditional > branch patched in over a nop. Since the nop is removed in the frontend, > it seemed like the most conservative starting point. But since there was > no measurable difference to the conditional branch, that was more > favourable in the end, since it had the additional advantage of not > requiring a code cache walk in the safepoint. But if you have a platform > where the trade off is not as obvious, both mechanisms could easily be > supported. Can you describe this mechanism a little more? 
I don't really understand how that would work, even on x86. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From kim.barrett at oracle.com Tue Jan 7 02:46:22 2020 From: kim.barrett at oracle.com (Kim Barrett) Date: Mon, 6 Jan 2020 21:46:22 -0500 Subject: RFR: 8235669: G1: Stack walking API can expose AS_NO_KEEPALIVE oops In-Reply-To: <3b2a6c47-b958-5e41-d7c3-a4d25000b17e@oracle.com> References: <3b2a6c47-b958-5e41-d7c3-a4d25000b17e@oracle.com> Message-ID: <76376D1F-0BDE-41CF-A702-CC121E01C1FE@oracle.com> > On Dec 10, 2019, at 11:02 AM, erik.osterlund at oracle.com wrote: > > Hi, > > When the stack is walked and e.g. locals are turned into StackValues, it might be that said local was made a constant oop by the JIT. In such cases, it is read from the nmethod using ON_STRONG_OOP_REF | AS_NO_KEEPALIVE. However, these oops need to be kept alive when concurrent marking is ongoing. > While I haven't seen crashes obviously linked to this yet, I don't think we should wait until we do, because it certainly will eventually. > > Bug: > https://bugs.openjdk.java.net/browse/JDK-8235669 > > Webrev: > http://cr.openjdk.java.net/~eosterlund/8235669/webrev.00/ > > Thanks, > /Erik Change looks good. I think it's kind of gross that oop_at returns an AS_NO_KEEPALIVE value to some more or less arbitrary context without any indication that this can happen. The scope of AS_NO_KEEPALIVE values really ought to be more constrained than that. I wonder if oop_at should do the phantom access, and there should be a different function for use by those places that want / can cope with an AS_NO_KEEPALIVE value. So I think there might be some naming / API issues in this neighborhood, but that can be addressed in a post-14 RFE. 
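As background for the nmethod entry-barrier exchange above (Erik's description of the conditional-branch variant that avoids a code cache walk), the idea can be illustrated in plain Java — emphatically not HotSpot code, all names invented for the sketch:

```java
// Conceptual illustration of a conditional-branch entry barrier: every
// compiled method carries a guard value, and the method entry compares it
// against a global "epoch". Arming every method at once is then a single
// epoch bump - no code cache walk and no code patching required.
public class EntryBarrierSketch {
    static int globalEpoch = 1;

    int guard = 0;        // per-"nmethod" guard value
    int slowPathRuns = 0; // bookkeeping for the demo

    void enter() {
        if (guard != globalEpoch) {   // the "conditional branch" at entry
            slowPath();
        }
        // ... fall through into the method body ...
    }

    void slowPath() {
        // In the real VM this is where e.g. embedded oops would be
        // processed; here we only record the call and disarm the barrier.
        slowPathRuns++;
        guard = globalEpoch;
    }
}
```

The patched-branch variant Andrew asks about would instead overwrite a nop at each entry with an unconditional jump to the slow path, which is why it needs to visit every nmethod when arming — the trade-off mentioned in the quoted text.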
From coleen.phillimore at oracle.com Tue Jan 7 03:25:21 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Mon, 6 Jan 2020 22:25:21 -0500 Subject: RFR (S) 8236224: Obsolete the FieldsAllocationStyle and CompactFields options Message-ID: <2de68ab4-f913-29d9-97ca-a8fcc5dc94b7@oracle.com> Summary: Remove the options and code for options deprecated in JDK 14 open webrev at http://cr.openjdk.java.net/~coleenp/2019/8236224.01/webrev bug link https://bugs.openjdk.java.net/browse/JDK-8236224 Ran tier1 on all oracle platforms, and 2, 3 on linux/windows-x64-debug and hs-tier4-graal because there were jvmci changes. thanks, Coleen From david.holmes at oracle.com Tue Jan 7 05:14:27 2020 From: david.holmes at oracle.com (David Holmes) Date: Tue, 7 Jan 2020 15:14:27 +1000 Subject: RFR (S) 8236224: Obsolete the FieldsAllocationStyle and CompactFields options In-Reply-To: <2de68ab4-f913-29d9-97ca-a8fcc5dc94b7@oracle.com> References: <2de68ab4-f913-29d9-97ca-a8fcc5dc94b7@oracle.com> Message-ID: <731c3127-46ce-15a5-f7b3-02b11dd5cbf3@oracle.com> Hi Coleen, On 7/01/2020 1:25 pm, coleen.phillimore at oracle.com wrote: > Summary: Remove the options and code for options deprecated in JDK 14 Generally looks good. > open webrev at http://cr.openjdk.java.net/~coleenp/2019/8236224.01/webrev > bug link https://bugs.openjdk.java.net/browse/JDK-8236224 src/hotspot/share/aot/aotCodeHeap.hpp typedef struct { ! enum { CONFIG_SIZE = 7 * jintSize + 9 }; // 8 int values Now 7 int values // byte[11] array map to boolean values here Now byte[10]. Or should that be byte[9]? I think the original code may be off by one. --- src/hotspot/share/classfile/classFileParser.cpp 4133 bool allocate_oops_first = false; // was allocation_style == 0 The comment has no context now that there is no selectable allocation style. 
I don't understand why you removed a bunch of classes from this check: 4143 (_class_name == vmSymbols::java_lang_AssertionStatusDirectives() || 4144 _class_name == vmSymbols::java_lang_Class() || 4145 _class_name == vmSymbols::java_lang_ClassLoader() || 4147 _class_name == vmSymbols::java_lang_ref_SoftReference() || 4148 _class_name == vmSymbols::java_lang_StackTraceElement() || 4149 _class_name == vmSymbols::java_lang_String() || 4150 _class_name == vmSymbols::java_lang_Throwable() || ?? --- Thanks, David > Ran tier1 on all oracle platforms, and 2, 3 on linux/windows-x64-debug > and hs-tier4-graal because there were jvmci changes. > > thanks, > Coleen From coleen.phillimore at oracle.com Tue Jan 7 05:25:08 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Tue, 7 Jan 2020 00:25:08 -0500 Subject: RFR (S) 8236224: Obsolete the FieldsAllocationStyle and CompactFields options In-Reply-To: <731c3127-46ce-15a5-f7b3-02b11dd5cbf3@oracle.com> References: <2de68ab4-f913-29d9-97ca-a8fcc5dc94b7@oracle.com> <731c3127-46ce-15a5-f7b3-02b11dd5cbf3@oracle.com> Message-ID: <7cc12a90-159b-cdaf-2a48-7fec1e2540c9@oracle.com> On 1/7/20 12:14 AM, David Holmes wrote: > Hi Coleen, > > On 7/01/2020 1:25 pm, coleen.phillimore at oracle.com wrote: >> Summary: Remove the options and code for options deprecated in JDK 14 > > Generally looks good. > >> open webrev at >> http://cr.openjdk.java.net/~coleenp/2019/8236224.01/webrev >> bug link https://bugs.openjdk.java.net/browse/JDK-8236224 > > src/hotspot/share/aot/aotCodeHeap.hpp > > ? typedef struct { > !?? enum { CONFIG_SIZE = 7 * jintSize + 9 }; > ??? // 8 int values > > Now 7 int values > > ??? // byte[11] array map to boolean values here > > Now byte[10]. Or should that be byte[9]? I think the original code may > be off by one. Yes, it was wrong. I fixed the comments. > > --- > > src/hotspot/share/classfile/classFileParser.cpp > > 4133?? 
bool allocate_oops_first = false; // was allocation_style == 0 > > The comment has no context now that there is no selectable allocation > style. > Removed.? It was mostly to remind myself. > I don't understand why you removed a bunch of classes from this check: > > 4143?????? (_class_name == > vmSymbols::java_lang_AssertionStatusDirectives() || > 4144??????? _class_name == vmSymbols::java_lang_Class() || > 4145??????? _class_name == vmSymbols::java_lang_ClassLoader() || > > 4147??????? _class_name == vmSymbols::java_lang_ref_SoftReference() || > 4148??????? _class_name == vmSymbols::java_lang_StackTraceElement() || > 4149??????? _class_name == vmSymbols::java_lang_String() || > 4150??????? _class_name == vmSymbols::java_lang_Throwable() || > > ?? The classes removed no longer have hardcoded offsets so did not need to follow the oops-first allocation style.? This was not cleaned up when the hardcoded offsets were removed from these classes. ? Fred also fixes this with his field layout patch in perhaps another place. Thanks, Coleen > > --- > > Thanks, > David > >> Ran tier1 on all oracle platforms, and 2, 3 on >> linux/windows-x64-debug and hs-tier4-graal because there were jvmci >> changes. >> >> thanks, >> Coleen From david.holmes at oracle.com Tue Jan 7 06:05:49 2020 From: david.holmes at oracle.com (David Holmes) Date: Tue, 7 Jan 2020 16:05:49 +1000 Subject: RFR (S) 8236224: Obsolete the FieldsAllocationStyle and CompactFields options In-Reply-To: <7cc12a90-159b-cdaf-2a48-7fec1e2540c9@oracle.com> References: <2de68ab4-f913-29d9-97ca-a8fcc5dc94b7@oracle.com> <731c3127-46ce-15a5-f7b3-02b11dd5cbf3@oracle.com> <7cc12a90-159b-cdaf-2a48-7fec1e2540c9@oracle.com> Message-ID: Hi Coleen, On 7/01/2020 3:25 pm, coleen.phillimore at oracle.com wrote: > > > On 1/7/20 12:14 AM, David Holmes wrote: >> Hi Coleen, >> >> On 7/01/2020 1:25 pm, coleen.phillimore at oracle.com wrote: >>> Summary: Remove the options and code for options deprecated in JDK 14 >> >> Generally looks good. 
>> >>> open webrev at >>> http://cr.openjdk.java.net/~coleenp/2019/8236224.01/webrev >>> bug link https://bugs.openjdk.java.net/browse/JDK-8236224 >> >> src/hotspot/share/aot/aotCodeHeap.hpp >> >> typedef struct { >> ! enum { CONFIG_SIZE = 7 * jintSize + 9 }; >> // 8 int values >> >> Now 7 int values >> >> // byte[11] array map to boolean values here >> >> Now byte[10]. Or should that be byte[9]? I think the original code may >> be off by one. > > Yes, it was wrong. I fixed the comments. >> >> --- >> >> src/hotspot/share/classfile/classFileParser.cpp >> >> 4133 bool allocate_oops_first = false; // was allocation_style == 0 >> >> The comment has no context now that there is no selectable allocation >> style. >> > Removed. It was mostly to remind myself. >> I don't understand why you removed a bunch of classes from this check: >> >> 4143 (_class_name == >> vmSymbols::java_lang_AssertionStatusDirectives() || >> 4144 _class_name == vmSymbols::java_lang_Class() || >> 4145 _class_name == vmSymbols::java_lang_ClassLoader() || >> >> 4147 _class_name == vmSymbols::java_lang_ref_SoftReference() || >> 4148 _class_name == vmSymbols::java_lang_StackTraceElement() || >> 4149 _class_name == vmSymbols::java_lang_String() || >> 4150 _class_name == vmSymbols::java_lang_Throwable() || >> >> ?? > > The classes removed no longer have hardcoded offsets so did not need to > follow the oops-first allocation style. This was not cleaned up when > the hardcoded offsets were removed from these classes. Fred also fixes > this with his field layout patch in perhaps another place. Okay thanks for clarifying. David > Thanks, > Coleen >> >> --- >> >> Thanks, >> David >> >>> Ran tier1 on all oracle platforms, and 2, 3 on >>> linux/windows-x64-debug and hs-tier4-graal because there were jvmci >>> changes.
>>> >>> thanks, >>> Coleen > From matthias.baesken at sap.com Tue Jan 7 08:09:13 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Tue, 7 Jan 2020 08:09:13 +0000 Subject: RFR: 8236617: jtreg test containers/docker/TestMemoryAwareness.java fails after 8226575 In-Reply-To: References: Message-ID: Hi Christoph, thanks for the review ! Bob are you fine with the latest version ? Best regards, Matthias > Hi Matthias, > > this change looks good to me. > > Best regards > Christoph > > > -----Original Message----- > > From: hotspot-dev On Behalf > Of > > Baesken, Matthias > > Sent: Freitag, 3. Januar 2020 11:15 > > To: Bob Vandette > > Cc: hotspot-dev at openjdk.java.net > > Subject: RE: RFR: 8236617: jtreg test > > containers/docker/TestMemoryAwareness.java fails after 8226575 > > > > HI Bob, > > Looking at the docker sources, the message seems to come from here : > > > > daemon_unix.go : > > > > // verifyPlatformContainerResources performs platform-specific validation > of > > the container's resource-configuration > > func verifyPlatformContainerResources(resources > > *containertypes.Resources, sysInfo *sysinfo.SysInfo, update bool) > (warnings > > []string, err error) { > > ..... > > // > > // means resources have positive Memory limit, memory+swap is not > > unlimited AND SwapLimit (memory.memsw.limit_in_bytes ?) is not > enabled > > [comment added by me) > > if resources.Memory > 0 && resources.MemorySwap != -1 && > > !sysInfo.SwapLimit { > > warnings = append(warnings, "Your kernel does not support > > swap limit capabilities or the cgroup is not mounted. Memory limited > without > > swap.") > > resources.MemorySwap = -1 > > } > > > > with Resources from hostconfig.go : > > > > // Resources contains container's resources (cgroups config, ulimits...) > > type Resources struct { > > Memory int64 // Memory limit (in bytes) > > ... 
> > MemorySwap int64 // Total memory usage (memory + > > swap); set `-1` to enable unlimited swap > > > > > > > > So I think your suggestion to return 0 in that special case in function > > getTotalSwapSpaceSize sounds reasonable to me ( at least better than > > return a large negative value ). > > New webrev : > > > > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236617.2/ > > > > > > > > Thanks, Matthias > > > > > > > > > > > > > > Matthias, > > > > > > I really don't like testing for some Docker message that could possibly > > change > > > or go away in the future. > > > There may be other reasons that the getTotalSwapSpaceSize function will > > fail > > > and return 0. > > > > > > The real problem here is that the > > > OperatingSystemMXBean.getTotalSwapSpaceSize is returning a > > > negative value when there is no swap available. > > > > > > I'd prefer that we fix this problem by correcting the > getTotalSwapSpaceSize > > > function to properly return 0 > > > under these conditions and allow 0 to be a valid expected result in the test. > > > > > > if (limit >= 0 && memLimit >= 0) { > > > return (limit < memLimit) ? 0 : limit - memLimit; > > > } > > > > > > Note: My suggestion assumes that there is no swap available when the > > > kernel swap limit capability is not enabled. > > > I have not verified this. The message does claim that this is the case > > > "Memory limited without swap". > > > > > > Bob. > > > > > > > On Jan 2, 2020, at 8:26 AM, Baesken, Matthias > > > wrote: > > > > > > > > Hello, please review this small adjustment to jtreg test > > > containers/docker/TestMemoryAwareness.java . > > > > > > > > After change "8226575: OperatingSystemMXBean should be made > > > container aware" has been pushed, > > > > we observe failures on linux s390x / ppc64le in the docker related jtreg > > > tests .
> > > > > > > > > > > > The test runs into the following error : > > > > java.lang.RuntimeException: > > > 'OperatingSystemMXBean.getTotalSwapSpaceSize: 52428800' missing > from > > > stdout/stderr > > > > > > > > at > > > > > > jdk.test.lib.process.OutputAnalyzer.shouldContain(OutputAnalyzer.java:187) > > > > at > > > > > > TestMemoryAwareness.testOperatingSystemMXBeanAwareness(TestMem > > > oryAwareness.java:154) > > > > at TestMemoryAwareness.main(TestMemoryAwareness.java:65) > > > > at > > > > java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native > > > Method) > > > > at > > > > > > java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMet > > > hodAccessorImpl.java:62) > > > > at > > > > > > java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(Delega > > > tingMethodAccessorImpl.java:43) > > > > at java.base/java.lang.reflect.Method.invoke(Method.java:564) > > > > at > > > > > > com.sun.javatest.regtest.agent.MainWrapper$MainThread.run(MainWrapp > > > er.java:127) > > > > at java.base/java.lang.Thread.run(Thread.java:832) > > > > > > > > > > > > The reason is that the value found is instead > > > OperatingSystemMXBean.getTotalSwapSpaceSize: -104857600 . > > > > When looking into the getTotalSwapSpaceSize() function, we get values > > of > > > 0 for "limit" and 104857600 for "memLimit" : > > > > > > > > 57 long limit = containerMetrics.getMemoryAndSwapLimit(); > > > > .... > > > > 62 long memLimit = containerMetrics.getMemoryLimit(); > > > > 63 if (limit >= 0 && memLimit >= 0) { > > > > 64 return limit - memLimit; > > > > 65 } > > > > > > > > That explains the value "-104857600" . We see messages "Your kernel > > > does not support swap limit capabilities or the cgroup is not mounted. > > > Memory limited without swap" , this most likely > > > > causes the unexpected limit == 0 value . 
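The arithmetic behind the failure can be sketched as a standalone function, following Bob's suggested clamp quoted earlier in the thread (a hedged model with invented names; the real getTotalSwapSpaceSize also falls back to host metrics, which is reduced to a -1 placeholder here):

```cpp
#include <cassert>

// Model of the suggested fix: when the reported memory+swap limit is
// below the memory limit (e.g. the kernel lacks swap limit capabilities
// and the combined limit reads as 0), return 0 instead of a negative
// number. Names and the -1 fallback are illustrative only.
long containerSwapSize(long memAndSwapLimit, long memLimit) {
  if (memAndSwapLimit >= 0 && memLimit >= 0) {
    return (memAndSwapLimit < memLimit) ? 0 : memAndSwapLimit - memLimit;
  }
  return -1;  // placeholder for "fall back to host values"
}
```

With the values from the failing test (a combined limit of 0 and a memory limit of 104857600), this yields 0 rather than -104857600, so 0 can become a valid expected result in the test.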
> > > > > > > > > > > > Bug/webrev : > > > > > > > > https://bugs.openjdk.java.net/browse/JDK-8236617 > > > > > > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236617.0/ > > > > > > > > > > > > Thanks, Matthias From erik.osterlund at oracle.com Tue Jan 7 08:55:30 2020 From: erik.osterlund at oracle.com (erik.osterlund at oracle.com) Date: Tue, 7 Jan 2020 09:55:30 +0100 Subject: RFR: 8235669: G1: Stack walking API can expose AS_NO_KEEPALIVE oops In-Reply-To: <76376D1F-0BDE-41CF-A702-CC121E01C1FE@oracle.com> References: <3b2a6c47-b958-5e41-d7c3-a4d25000b17e@oracle.com> <76376D1F-0BDE-41CF-A702-CC121E01C1FE@oracle.com> Message-ID: <0949e751-cae1-45f0-bbc8-4ba9abc54406@oracle.com> Hi Kim, Thanks for the review. I agree the naming should be fixed (in 14). Thanks, /Erik On 1/7/20 3:46 AM, Kim Barrett wrote: >> On Dec 10, 2019, at 11:02 AM, erik.osterlund at oracle.com wrote: >> >> Hi, >> >> When the stack is walked and e.g. locals are turned into StackValues, it might be that said local was made a constant oop by the JIT. In such cases, it is read from the nmethod using ON_STRONG_OOP_REF | AS_NO_KEEPALIVE. However, these oops need to be kept alive when concurrent marking is ongoing. >> While I haven't seen crashes obviously linked to this yet, I don't think we should wait until we do, because it certainly will eventually. >> >> Bug: >> https://bugs.openjdk.java.net/browse/JDK-8235669 >> >> Webrev: >> http://cr.openjdk.java.net/~eosterlund/8235669/webrev.00/ >> >> Thanks, >> /Erik > Change looks good. > > I think it's kind of gross that oop_at returns an AS_NO_KEEPALIVE > value to some more or less arbitrary context without any indication > that this can happen. The scope of AS_NO_KEEPALIVE values really > ought to be more constrained than that. > > I wonder if oop_at should do the phantom access, and there should be a > different function for use by those places that want / can cope with > an AS_NO_KEEPALIVE value. 
So I think there might be some naming / API > issues in this neighborhood, but that can be addressed in a post-14 RFE. > > From erik.osterlund at oracle.com Tue Jan 7 09:22:44 2020 From: erik.osterlund at oracle.com (erik.osterlund at oracle.com) Date: Tue, 7 Jan 2020 10:22:44 +0100 Subject: RFR: 8210498: nmethod entry barriers In-Reply-To: <22b7f727-929c-b159-316e-78b76543b4fd@redhat.com> References: <5BB77E18.6040401@oracle.com> <399d9f0b-e66b-30e1-16b3-4873845367a9@redhat.com> <5BBB538B.5070404@oracle.com> <22b7f727-929c-b159-316e-78b76543b4fd@redhat.com> Message-ID: Hi Andrew, On 1/6/20 7:16 PM, Andrew Haley wrote: > On 10/8/18 1:54 PM, Erik Österlund wrote: >> Also note that the implementation space of the barrier itself has some >> flexibility. Rickard's first prototype involved having an unconditional >> branch patched in over a nop. Since the nop is removed in the frontend, >> it seemed like the most conservative starting point. But since there was >> no measurable difference to the conditional branch, that was more >> favourable in the end, since it had the additional advantage of not >> requiring a code cache walk in the safepoint. But if you have a platform >> where the trade off is not as obvious, both mechanisms could easily be >> supported. > Can you describe this mechanism a little more? I don't really understand > how that would work, even on x86. Presuming you would like to hear about the solution we didn't go for (unconditional branch)... The nmethod entry barriers are armed in a safepoint operation. Today that safepoint operation flips some epoch counter that the conditional branch will consider armed once the safepoint is released. In the alternative solution that biases the cost towards arming, instead of calling, you would instead walk the code cache and explicitly arm nmethods by patching in a jump over nops in the verified entry (for all nmethods).
Disarming would be done by patching back nops over the jump on individual nmethods as they become safe to disarm. In the end, the hypothetical overhead of performing a conditional branch instead of executing nops was never observed to make a difference, and therefore we went with the conditional branch as the latency cost of walking the code cache was conversely not hypothetical. Note that since the entry barrier is used to protect mutators from observing stale oops, the current solution (and this alternative solution) relies on instruction cache coherency. Since there are oops embedded in the code stream, we rely on the disarming being a code modification such that a mutator observing the disarmed barrier implies it will also observe the fixed oops. If you are looking for an AArch64 solution, Stuart Monteith is cooking up a solution that we discussed, which does not rely on that for AArch64, which you might be interested in. Although, perhaps that is not what you are fishing for. Hope this helps. Thanks, /Erik From matthias.baesken at sap.com Tue Jan 7 09:27:50 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Tue, 7 Jan 2020 09:27:50 +0000 Subject: RFR [XS]: 8236709: struct SwitchRange in HS violates C++ One Definition Rule - was RE: struct SwitchRange and C++ One Definition Rule [-Wodr] Message-ID: Hello, please review this small fix for an issue that I was running into when experimenting with gcc8 and the -flto compiler flag . When building with those flags, the gcc8 warns that the SwitchRange classes in HS code violate the C++ One Definition Rule . So I renamed one of those 2 SwitchRange classes . 
Bug/webrev : https://bugs.openjdk.java.net/browse/JDK-8236709 http://cr.openjdk.java.net/~mbaesken/webrevs/8236709.0/ Thanks, Matthias > > Hello, when experimenting with gcc8 and the -flto compiler flag I was running into these warnings in the c1 coding : > > > > /open_jdk/jdk_3/jdk/src/hotspot/share/c1/c1_LIRGenerator.hpp:50:7: > > warning: type 'struct SwitchRange' violates the C++ One Definition Rule [- > > Wodr] > > class SwitchRange: public CompilationResourceObj { > > ^ > > /open_jdk/jdk_3/jdk/src/hotspot/share/opto/parse2.cpp:319: note: a > > different type is defined in another translation unit > > class SwitchRange : public StackObj { > > > > > > > /usr/work/d040975/open_jdk/jdk_3/jdk/src/hotspot/share/c1/c1_LIRGener > > ator.hpp:52:7: note: the first difference of corresponding definitions is field > > '_low_key' > > int _low_key; > > ^ > > /open_jdk/jdk_3/jdk/src/hotspot/share/opto/parse2.cpp:321: note: a > > field with different name is defined in another translation unit > > jint _lo; // inclusive lower limit > > > > > > Do you think this should be fixed ( renaming one SwitchRange ) ? > > > > > > Martin suggested that even without flto added it could be problematic . > > Yes, please file a bug and let's get this fixed quickly. This sort of thing can > lead > to really weird looking problems, like using the same vtable (which one > picked > ?at random?) for instances of both classes. From aph at redhat.com Tue Jan 7 10:04:22 2020 From: aph at redhat.com (Andrew Haley) Date: Tue, 7 Jan 2020 10:04:22 +0000 Subject: RFR: 8210498: nmethod entry barriers In-Reply-To: References: <5BB77E18.6040401@oracle.com> <399d9f0b-e66b-30e1-16b3-4873845367a9@redhat.com> <5BBB538B.5070404@oracle.com> <22b7f727-929c-b159-316e-78b76543b4fd@redhat.com> Message-ID: On 1/7/20 9:22 AM, erik.osterlund at oracle.com wrote: > Presuming you would like to hear about the solution we didn't go for > (unconditional branch)... 
> > The nmethod entry barriers are armed in a safepoint operation. Today > that safepoint operation > flips some epoch counter that the conditional branch will consider armed > once the safepoint is > released. > > In the alternative solution that biases the cost towards arming, instead > of calling, you would > instead walk the code cache and explicitly arm nmethods by patching in a > jump over nops in the > verified entry (for all nmethods). > > Disarming would be done by patching back nops over the jump on > individual nmethods as they > become safe to disarm. Aha! That'd be a much simpler method for AArch64, for sure. We already have a nop at the start of every method, so we could rewrite it as a simple jump. > In the end, the hypothetical overhead of performing a conditional branch > instead of executing > nops was never observed to make a difference, and therefore we went with > the conditional branch > as the latency cost of walking the code cache was conversely not > hypothetical. Totally. However, that walk is not inline in the mutator code, and there's no reason not to run it concurrently. > Note that since the entry barrier is used to protect mutators from > observing stale oops, the > current solution (and this alternative solution) relies on instruction > cache coherency. I'm not sure it [the alternative] does, exactly. It requires that mutators see the changed jump once the cache flush has been done, but that's less of a requirement than icache coherency. > Since > there are oops embedded in the code stream, we rely on the disarming > being a code modification > such that a mutator observing the disarmed barrier implies it will also > observe the fixed oops. Sure, but there's little reason that oops should be embedded in the code stream. It's an optimization, but a pretty minor one. 
> If you are looking for an AArch64 solution, Stuart Monteith is cooking > up a solution that we > discussed, which does not rely on that for AArch64, which you might be > interested in. i haven't seen that. Was it discussed anywhere? I'll ask him. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From per.liden at oracle.com Tue Jan 7 10:25:08 2020 From: per.liden at oracle.com (Per Liden) Date: Tue, 7 Jan 2020 11:25:08 +0100 Subject: RFR: 8210498: nmethod entry barriers In-Reply-To: References: <5BB77E18.6040401@oracle.com> <399d9f0b-e66b-30e1-16b3-4873845367a9@redhat.com> <5BBB538B.5070404@oracle.com> <22b7f727-929c-b159-316e-78b76543b4fd@redhat.com> Message-ID: <1b5081ca-7f4b-64ec-b212-b7dc4110ac4c@oracle.com> Hi, On 1/7/20 11:04 AM, Andrew Haley wrote: > On 1/7/20 9:22 AM, erik.osterlund at oracle.com wrote: [...] >> In the alternative solution that biases the cost towards arming, instead >> of calling, you would >> instead walk the code cache and explicitly arm nmethods by patching in a >> jump over nops in the >> verified entry (for all nmethods). >> >> Disarming would be done by patching back nops over the jump on >> individual nmethods as they >> become safe to disarm. > > Aha! That'd be a much simpler method for AArch64, for sure. We already have > a nop at the start of every method, so we could rewrite it as a simple > jump. But as Erik hinted, the main problem with this alternative is that arming becomes an O(n) stop-the-world operation (where n is the number of nmethods) rather than O(1), which is highly undesirable for ZGC. 
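The O(1)-versus-O(n) arming trade-off discussed in this thread can be illustrated with a toy model (ordinary C++, nothing like the real patched machine code; all names are invented):

```cpp
#include <cassert>
#include <vector>

// Toy model of the two arming strategies. In the epoch scheme a
// safepoint arms every nmethod at once by bumping one counter (O(1));
// in the patching scheme the safepoint walks all nmethods (O(n)).
// Disarming is per-nmethod in both schemes. Invented names throughout.
struct NMethodModel {
  int disarmed_epoch = 0;    // epoch scheme: stamp checked on entry
  bool jump_patched = false; // patching scheme: jump written over a nop
};

struct CodeCacheModel {
  std::vector<NMethodModel> nmethods;
  int global_epoch = 0;

  void arm_epoch() { ++global_epoch; }    // O(1) arming
  void arm_patching() {                   // O(n) arming
    for (NMethodModel& nm : nmethods) nm.jump_patched = true;
  }
  bool armed_epoch(const NMethodModel& nm) const {
    return nm.disarmed_epoch != global_epoch;  // entry-barrier check
  }
  void disarm(NMethodModel& nm) {         // per nmethod, when safe
    nm.disarmed_epoch = global_epoch;
    nm.jump_patched = false;
  }
};
```

The epoch bump arms every nmethod with a single store, which is why the conditional-branch scheme keeps the safepoint short; the patching loop is where the O(n) cost shows up.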
cheers, Per From aph at redhat.com Tue Jan 7 10:36:57 2020 From: aph at redhat.com (Andrew Haley) Date: Tue, 7 Jan 2020 10:36:57 +0000 Subject: RFR: 8210498: nmethod entry barriers In-Reply-To: <1b5081ca-7f4b-64ec-b212-b7dc4110ac4c@oracle.com> References: <5BB77E18.6040401@oracle.com> <399d9f0b-e66b-30e1-16b3-4873845367a9@redhat.com> <5BBB538B.5070404@oracle.com> <22b7f727-929c-b159-316e-78b76543b4fd@redhat.com> <1b5081ca-7f4b-64ec-b212-b7dc4110ac4c@oracle.com> Message-ID: <772a1cb2-9486-6d38-32fa-a7f6bb56b7fd@redhat.com> On 1/7/20 10:25 AM, Per Liden wrote: > > On 1/7/20 11:04 AM, Andrew Haley wrote: >> On 1/7/20 9:22 AM, erik.osterlund at oracle.com wrote: > [...] >>> In the alternative solution that biases the cost towards arming, instead >>> of calling, you would >>> instead walk the code cache and explicitly arm nmethods by patching in a >>> jump over nops in the >>> verified entry (for all nmethods). >>> >>> Disarming would be done by patching back nops over the jump on >>> individual nmethods as they >>> become safe to disarm. >> >> Aha! That'd be a much simpler method for AArch64, for sure. We already have >> a nop at the start of every method, so we could rewrite it as a simple >> jump. > > But as Erik hinted, the main problem with this alternative is that > arming becomes an O(n) stop-the-world operation O(n) I understand, but why stop the world? > (where n is the number > of nmethods) rather than O(1), which is highly undesirable for ZGC. Yeah, I get that. It's a choice between the devil and the deep blue sea. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. 
https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From erik.osterlund at oracle.com Tue Jan 7 11:15:10 2020 From: erik.osterlund at oracle.com (erik.osterlund at oracle.com) Date: Tue, 7 Jan 2020 12:15:10 +0100 Subject: RFR: 8210498: nmethod entry barriers In-Reply-To: References: <5BB77E18.6040401@oracle.com> <399d9f0b-e66b-30e1-16b3-4873845367a9@redhat.com> <5BBB538B.5070404@oracle.com> <22b7f727-929c-b159-316e-78b76543b4fd@redhat.com> Message-ID: <384efad9-2957-1159-1577-ab0207ad0d66@oracle.com> Hi Andrew, On 1/7/20 11:04 AM, Andrew Haley wrote: > On 1/7/20 9:22 AM, erik.osterlund at oracle.com wrote: >> Presuming you would like to hear about the solution we didn't go for >> (unconditional branch)... >> >> The nmethod entry barriers are armed in a safepoint operation. Today >> that safepoint operation >> flips some epoch counter that the conditional branch will consider armed >> once the safepoint is >> released. >> >> In the alternative solution that biases the cost towards arming, instead >> of calling, you would >> instead walk the code cache and explicitly arm nmethods by patching in a >> jump over nops in the >> verified entry (for all nmethods). >> >> Disarming would be done by patching back nops over the jump on >> individual nmethods as they >> become safe to disarm. > Aha! That'd be a much simpler method for AArch64, for sure. We already have > a nop at the start of every method, so we could rewrite it as a simple > jump. I'm presuming you are referring to the nop that we plaster a jump over when making the nmethod not_entrant. Conceptually that would be absolutely fine. However, note that 1) This nop is before the frame of the callee is constructed. The way the barrier works today is that we wait for the frame to be constructed before calling the slow path, mostly out of convenience because then the dispatch machinery has selected the callee nmethod of the call, and we can easily acquire the
callee nmethod from the slowpath code. Other miss handlers used in the VM typically resolve the call instead, figuring out what the callee nmethod should be (before the selection is done). I think it's possible to rewrite the code in a style where this is done before the frame is constructed, but I'm just noting that there might be some headache involved in that. 2) Unless care is taken, you might run into scenarios where two threads race, one making the nmethod not_entrant, and another one disarming it. If the same nop is reused, you will need some additional synchronization to ensure the monotonicity of the jump injected by not_entrant transitions. In other words, while reusing that nop is possible, I think it will be significantly more painful compared to putting another one right at the end of frame construction. > >> In the end, the hypothetical overhead of performing a conditional branch >> instead of executing >> nops was never observed to make a difference, and therefore we went with >> the conditional branch >> as the latency cost of walking the code cache was conversely not >> hypothetical. > Totally. However, that walk is not inline in the mutator code, and there's > no reason not to run it concurrently. There is. The disarming of nmethods must happen in the safepoint. The reason is that if an nmethod dies due to class unloading (an oop in the nmethod is dead), then subsequent calls to that nmethod from stale inline caches must be trapped so that we can unroll the frame and re-resolve the call. Since marking terminates in a safepoint for all current GCs, that same safepoint must disarm the nmethods before being released. >> Note that since the entry barrier is used to protect mutators from >> observing stale oops, the >> current solution (and this alternative solution) relies on instruction >> cache coherency. > I'm not sure it [the alternative] does, exactly.
It requires that > mutators see the changed jump once the cache flush has been done, but > that's less of a requirement than icache coherency. Consider the following race during concurrent execution: JavaThread 1: Take nmethod entry barrier slow path JavaThread 1: Patch instruction oops JavaThread 1: Patch barrier jump to nop (disarm) JavaThread 2: Execute nop written by JavaThread 1 JavaThread 2: <--- surely we need at least isb here ---> JavaThread 2: Execute instruction oop As long as the oops are embedded as instructions, I presume we need at least an isb as indicated in my example above. Perhaps you are talking about if oops are data instead, in which case I am still not sure that there are global cross-CPU acquire-like semantics when performing instruction cache flushing. I certainly don't know of any such guarantees, but perhaps you know better. So I really don't know how we expect the oop loaded (which is concurrently modified) to be the new value and not a stale value. Also, as mentioned above, the arm operation really has to become globally observable in the safepoint. >> Since >> there are oops embedded in the code stream, we rely on the disarming >> being a code modification >> such that a mutator observing the disarmed barrier implies it will also >> observe the fixed oops. > Sure, but there's little reason that oops should be embedded in the code > stream. It's an optimization, but a pretty minor one. Agreed. I would love to see that disappear. >> If you are looking for an AArch64 solution, Stuart Monteith is cooking >> up a solution that we >> discussed, which does not rely on that for AArch64, which you might be >> interested in. > i haven't seen that. Was it discussed anywhere? I'll ask him. Stuart and I discussed it off-list. I proposed to him the following crazy solution: 1) Move oops to data by reserving a table of content (TOC) register, which is initialized at nmethod entry time by loading the TOC from the nmethod (ldr). Each oop used by JIT
Each oop used by JIT ?? is simply loaded with ldr relative to the TOC register (it's like a lookup table). 2) Let the entry barrier compare the established TOC low order bits to the current GC phase ?? (load a global/TLS-local bit battern similar to the cmpl used in the x86 code) and take ?? the slow path if TOC has the wrong low order bits. 3) The TOC reserves N-1 extra slots, where N is the number of states observed by the barrier ?? (N == 3 for ZGC). This allows having different TOC pointers for each phase. 4) The nmethod entry barrier slow path selects a new TOC pointer and copies the oops in-place ?? such that after selecting the new TOC, each offset points as the same oops as before. In this scenario, the entry barrier dodges an ldar by relying on dependent loads not reordering instead. If the correct TOC is observed, then subsequent ldr of its oops will observe the correct oop as well (due to being dependent). Oh, and returns into compiled code from non-leaf calls must re-establish the TOC pointer with a new load in case a safepoint flipped it. Stuart is working on something similar ish to that, but instead of the TOC dependent load trick, he is exploring using ldar instead in the VEP and not reserving a TOC register, instead performing PC relative loads when accessing oops, which is sanity checking if my solution is a premature optimization or not before considering doing that fully, which seems to make sense to me as what I proposed is a bit tricky to cook up. So in both solutions the idea is to keep both the barrier check and the oops as data, and possibly optimize away acquire with some data dependency trick. Hope this makes sense. 
Thanks, /Erik From thomas.schatzl at oracle.com Tue Jan 7 11:15:41 2020 From: thomas.schatzl at oracle.com (Thomas Schatzl) Date: Tue, 7 Jan 2020 12:15:41 +0100 Subject: RFR: 8235669: G1: Stack walking API can expose AS_NO_KEEPALIVE oops In-Reply-To: <76376D1F-0BDE-41CF-A702-CC121E01C1FE@oracle.com> References: <3b2a6c47-b958-5e41-d7c3-a4d25000b17e@oracle.com> <76376D1F-0BDE-41CF-A702-CC121E01C1FE@oracle.com> Message-ID: <9b97e138-885d-2a3f-a82d-da94a51010ed@oracle.com> Hi, On 07.01.20 03:46, Kim Barrett wrote: >> On Dec 10, 2019, at 11:02 AM, erik.osterlund at oracle.com wrote: >> >> Hi, >> >> When the stack is walked and e.g. locals are turned into StackValues, it might be that said local was made a constant oop by the JIT. In such cases, it is read from the nmethod using ON_STRONG_OOP_REF | AS_NO_KEEPALIVE. However, these oops need to be kept alive when concurrent marking is ongoing. >> While I haven't seen crashes obviously linked to this yet, I don't think we should wait until we do, because it certainly will eventually. >> >> Bug: >> https://bugs.openjdk.java.net/browse/JDK-8235669 >> >> Webrev: >> http://cr.openjdk.java.net/~eosterlund/8235669/webrev.00/ >> >> Thanks, >> /Erik > > Change looks good. To me too. > > I think it's kind of gross that oop_at returns an AS_NO_KEEPALIVE > value to some more or less arbitrary context without any indication > that this can happen. The scope of AS_NO_KEEPALIVE values really > ought to be more constrained than that. > > I wonder if oop_at should do the phantom access, and there should be a > different function for use by those places that want / can cope with > an AS_NO_KEEPALIVE value. So I think there might be some naming / API > issues in this neighborhood, but that can be addressed in a post-14 RFE. 
> > +1 Thanks, Thomas From erik.osterlund at oracle.com Tue Jan 7 11:16:34 2020 From: erik.osterlund at oracle.com (erik.osterlund at oracle.com) Date: Tue, 7 Jan 2020 12:16:34 +0100 Subject: RFR: 8235669: G1: Stack walking API can expose AS_NO_KEEPALIVE oops In-Reply-To: <9b97e138-885d-2a3f-a82d-da94a51010ed@oracle.com> References: <3b2a6c47-b958-5e41-d7c3-a4d25000b17e@oracle.com> <76376D1F-0BDE-41CF-A702-CC121E01C1FE@oracle.com> <9b97e138-885d-2a3f-a82d-da94a51010ed@oracle.com> Message-ID: <13797ce3-7e95-b598-80fa-fa830ab03e02@oracle.com> Hi Thomas, Thanks for the review! /Erik On 1/7/20 12:15 PM, Thomas Schatzl wrote: > Hi, > > On 07.01.20 03:46, Kim Barrett wrote: >>> On Dec 10, 2019, at 11:02 AM, erik.osterlund at oracle.com wrote: >>> >>> Hi, >>> >>> When the stack is walked and e.g. locals are turned into >>> StackValues, it might be that said local was made a constant oop by >>> the JIT. In such cases, it is read from the nmethod using >>> ON_STRONG_OOP_REF | AS_NO_KEEPALIVE. However, these oops need to be >>> kept alive when concurrent marking is ongoing. >>> While I haven't seen crashes obviously linked to this yet, I don't >>> think we should wait until we do, because it certainly will eventually. >>> >>> Bug: >>> https://bugs.openjdk.java.net/browse/JDK-8235669 >>> >>> Webrev: >>> http://cr.openjdk.java.net/~eosterlund/8235669/webrev.00/ >>> >>> Thanks, >>> /Erik >> >> Change looks good. > > To me too. > >> >> I think it's kind of gross that oop_at returns an AS_NO_KEEPALIVE >> value to some more or less arbitrary context without any indication >> that this can happen.? The scope of AS_NO_KEEPALIVE values really >> ought to be more constrained than that. >> >> I wonder if oop_at should do the phantom access, and there should be a >> different function for use by those places that want / can cope with >> an AS_NO_KEEPALIVE value. So I think there might be some naming / API >> issues in this neighborhood, but that can be addressed in a post-14 RFE. 
>> >> > > +1 > > Thanks, > Thomas From harold.seigel at oracle.com Tue Jan 7 14:14:46 2020 From: harold.seigel at oracle.com (Harold Seigel) Date: Tue, 7 Jan 2020 09:14:46 -0500 Subject: RFR (S) 8236224: Obsolete the FieldsAllocationStyle and CompactFields options In-Reply-To: <7cc12a90-159b-cdaf-2a48-7fec1e2540c9@oracle.com> References: <2de68ab4-f913-29d9-97ca-a8fcc5dc94b7@oracle.com> <731c3127-46ce-15a5-f7b3-02b11dd5cbf3@oracle.com> <7cc12a90-159b-cdaf-2a48-7fec1e2540c9@oracle.com> Message-ID: Hi Coleen, The change looks good! Thanks, Harold On 1/7/2020 12:25 AM, coleen.phillimore at oracle.com wrote: > > > On 1/7/20 12:14 AM, David Holmes wrote: >> Hi Coleen, >> >> On 7/01/2020 1:25 pm, coleen.phillimore at oracle.com wrote: >>> Summary: Remove the options and code for options deprecated in JDK 14 >> >> Generally looks good. >> >>> open webrev at >>> http://cr.openjdk.java.net/~coleenp/2019/8236224.01/webrev >>> bug link https://bugs.openjdk.java.net/browse/JDK-8236224 >> >> src/hotspot/share/aot/aotCodeHeap.hpp >> >> typedef struct { >> ! enum { CONFIG_SIZE = 7 * jintSize + 9 }; >> // 8 int values >> >> Now 7 int values >> >> // byte[11] array map to boolean values here >> >> Now byte[10]. Or should that be byte[9]? I think the original code >> may be off by one. > > Yes, it was wrong. I fixed the comments. >> >> --- >> >> src/hotspot/share/classfile/classFileParser.cpp >> >> 4133 bool allocate_oops_first = false; // was allocation_style == 0 >> >> The comment has no context now that there is no selectable allocation >> style. >> > Removed. It was mostly to remind myself. >> I don't understand why you removed a bunch of classes from this check: >> >> 4143 (_class_name == >> vmSymbols::java_lang_AssertionStatusDirectives() || >> 4144 _class_name == vmSymbols::java_lang_Class() || >> 4145 _class_name == vmSymbols::java_lang_ClassLoader() || >> >> 4147
_class_name == vmSymbols::java_lang_ref_SoftReference() || >> 4148??????? _class_name == vmSymbols::java_lang_StackTraceElement() || >> 4149??????? _class_name == vmSymbols::java_lang_String() || >> 4150??????? _class_name == vmSymbols::java_lang_Throwable() || >> >> ?? > > The classes removed no longer have hardcoded offsets so did not need > to follow the oops-first allocation style.? This was not cleaned up > when the hardcoded offsets were removed from these classes. ? Fred > also fixes this with his field layout patch in perhaps another place. > > Thanks, > Coleen >> >> --- >> >> Thanks, >> David >> >>> Ran tier1 on all oracle platforms, and 2, 3 on >>> linux/windows-x64-debug and hs-tier4-graal because there were jvmci >>> changes. >>> >>> thanks, >>> Coleen > From frederic.parain at oracle.com Tue Jan 7 14:33:46 2020 From: frederic.parain at oracle.com (Frederic Parain) Date: Tue, 7 Jan 2020 09:33:46 -0500 Subject: RFR (S) 8236224: Obsolete the FieldsAllocationStyle and CompactFields options In-Reply-To: <7cc12a90-159b-cdaf-2a48-7fec1e2540c9@oracle.com> References: <2de68ab4-f913-29d9-97ca-a8fcc5dc94b7@oracle.com> <731c3127-46ce-15a5-f7b3-02b11dd5cbf3@oracle.com> <7cc12a90-159b-cdaf-2a48-7fec1e2540c9@oracle.com> Message-ID: Coleen, Thank you for cleaning up this code. Changes look good to me. Fred On 1/7/20 12:25 AM, coleen.phillimore at oracle.com wrote: > > > On 1/7/20 12:14 AM, David Holmes wrote: >> Hi Coleen, >> >> On 7/01/2020 1:25 pm, coleen.phillimore at oracle.com wrote: >>> Summary: Remove the options and code for options deprecated in JDK 14 >> >> Generally looks good. >> >>> open webrev at >>> http://cr.openjdk.java.net/~coleenp/2019/8236224.01/webrev >>> bug link https://bugs.openjdk.java.net/browse/JDK-8236224 >> >> src/hotspot/share/aot/aotCodeHeap.hpp >> >> ? typedef struct { >> !?? enum { CONFIG_SIZE = 7 * jintSize + 9 }; >> ??? // 8 int values >> >> Now 7 int values >> >> ??? // byte[11] array map to boolean values here >> >> Now byte[10]. 
Or should that be byte[9]? I think the original code may >> be off by one. > > Yes, it was wrong. I fixed the comments. >> >> --- >> >> src/hotspot/share/classfile/classFileParser.cpp >> >> 4133?? bool allocate_oops_first = false; // was allocation_style == 0 >> >> The comment has no context now that there is no selectable allocation >> style. >> > Removed.? It was mostly to remind myself. >> I don't understand why you removed a bunch of classes from this check: >> >> 4143?????? (_class_name == >> vmSymbols::java_lang_AssertionStatusDirectives() || >> 4144??????? _class_name == vmSymbols::java_lang_Class() || >> 4145??????? _class_name == vmSymbols::java_lang_ClassLoader() || >> >> 4147??????? _class_name == vmSymbols::java_lang_ref_SoftReference() || >> 4148??????? _class_name == vmSymbols::java_lang_StackTraceElement() || >> 4149??????? _class_name == vmSymbols::java_lang_String() || >> 4150??????? _class_name == vmSymbols::java_lang_Throwable() || >> >> ?? > > The classes removed no longer have hardcoded offsets so did not need to > follow the oops-first allocation style.? This was not cleaned up when > the hardcoded offsets were removed from these classes. ? Fred also fixes > this with his field layout patch in perhaps another place. > > Thanks, > Coleen >> >> --- >> >> Thanks, >> David >> >>> Ran tier1 on all oracle platforms, and 2, 3 on >>> linux/windows-x64-debug and hs-tier4-graal because there were jvmci >>> changes. >>> >>> thanks, >>> Coleen > From coleen.phillimore at oracle.com Tue Jan 7 14:41:57 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Tue, 7 Jan 2020 09:41:57 -0500 Subject: RFR (S) 8236224: Obsolete the FieldsAllocationStyle and CompactFields options In-Reply-To: References: <2de68ab4-f913-29d9-97ca-a8fcc5dc94b7@oracle.com> <731c3127-46ce-15a5-f7b3-02b11dd5cbf3@oracle.com> <7cc12a90-159b-cdaf-2a48-7fec1e2540c9@oracle.com> Message-ID: Thanks Harold! 
Coleen On 1/7/20 9:14 AM, Harold Seigel wrote: > Hi Coleen, > > The change looks good! > > Thanks, Harold > > On 1/7/2020 12:25 AM, coleen.phillimore at oracle.com wrote: >> >> >> On 1/7/20 12:14 AM, David Holmes wrote: >>> Hi Coleen, >>> >>> On 7/01/2020 1:25 pm, coleen.phillimore at oracle.com wrote: >>>> Summary: Remove the options and code for options deprecated in JDK 14 >>> >>> Generally looks good. >>> >>>> open webrev at >>>> http://cr.openjdk.java.net/~coleenp/2019/8236224.01/webrev >>>> bug link https://bugs.openjdk.java.net/browse/JDK-8236224 >>> >>> src/hotspot/share/aot/aotCodeHeap.hpp >>> >>> ? typedef struct { >>> !?? enum { CONFIG_SIZE = 7 * jintSize + 9 }; >>> ??? // 8 int values >>> >>> Now 7 int values >>> >>> ??? // byte[11] array map to boolean values here >>> >>> Now byte[10]. Or should that be byte[9]? I think the original code >>> may be off by one. >> >> Yes, it was wrong. I fixed the comments. >>> >>> --- >>> >>> src/hotspot/share/classfile/classFileParser.cpp >>> >>> 4133?? bool allocate_oops_first = false; // was allocation_style == 0 >>> >>> The comment has no context now that there is no selectable >>> allocation style. >>> >> Removed.? It was mostly to remind myself. >>> I don't understand why you removed a bunch of classes from this check: >>> >>> 4143?????? (_class_name == >>> vmSymbols::java_lang_AssertionStatusDirectives() || >>> 4144??????? _class_name == vmSymbols::java_lang_Class() || >>> 4145??????? _class_name == vmSymbols::java_lang_ClassLoader() || >>> >>> 4147??????? _class_name == vmSymbols::java_lang_ref_SoftReference() || >>> 4148??????? _class_name == vmSymbols::java_lang_StackTraceElement() || >>> 4149??????? _class_name == vmSymbols::java_lang_String() || >>> 4150??????? _class_name == vmSymbols::java_lang_Throwable() || >>> >>> ?? >> >> The classes removed no longer have hardcoded offsets so did not need >> to follow the oops-first allocation style.? 
This was not cleaned up >> when the hardcoded offsets were removed from these classes. ? Fred >> also fixes this with his field layout patch in perhaps another place. >> >> Thanks, >> Coleen >>> >>> --- >>> >>> Thanks, >>> David >>> >>>> Ran tier1 on all oracle platforms, and 2, 3 on >>>> linux/windows-x64-debug and hs-tier4-graal because there were jvmci >>>> changes. >>>> >>>> thanks, >>>> Coleen >> From coleen.phillimore at oracle.com Tue Jan 7 14:42:33 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Tue, 7 Jan 2020 09:42:33 -0500 Subject: RFR (S) 8236224: Obsolete the FieldsAllocationStyle and CompactFields options In-Reply-To: References: <2de68ab4-f913-29d9-97ca-a8fcc5dc94b7@oracle.com> <731c3127-46ce-15a5-f7b3-02b11dd5cbf3@oracle.com> <7cc12a90-159b-cdaf-2a48-7fec1e2540c9@oracle.com> Message-ID: Thanks Fred and for the prereview and hope it helps you a little. Now you don't have to worry about these flags. Coleen On 1/7/20 9:33 AM, Frederic Parain wrote: > Coleen, > > Thank you for cleaning up this code. > Changes look good to me. > > Fred > > On 1/7/20 12:25 AM, coleen.phillimore at oracle.com wrote: >> >> >> On 1/7/20 12:14 AM, David Holmes wrote: >>> Hi Coleen, >>> >>> On 7/01/2020 1:25 pm, coleen.phillimore at oracle.com wrote: >>>> Summary: Remove the options and code for options deprecated in JDK 14 >>> >>> Generally looks good. >>> >>>> open webrev at >>>> http://cr.openjdk.java.net/~coleenp/2019/8236224.01/webrev >>>> bug link https://bugs.openjdk.java.net/browse/JDK-8236224 >>> >>> src/hotspot/share/aot/aotCodeHeap.hpp >>> >>> ? typedef struct { >>> !?? enum { CONFIG_SIZE = 7 * jintSize + 9 }; >>> ??? // 8 int values >>> >>> Now 7 int values >>> >>> ??? // byte[11] array map to boolean values here >>> >>> Now byte[10]. Or should that be byte[9]? I think the original code >>> may be off by one. >> >> Yes, it was wrong. I fixed the comments. 
>>> >>> --- >>> >>> src/hotspot/share/classfile/classFileParser.cpp >>> >>> 4133?? bool allocate_oops_first = false; // was allocation_style == 0 >>> >>> The comment has no context now that there is no selectable >>> allocation style. >>> >> Removed.? It was mostly to remind myself. >>> I don't understand why you removed a bunch of classes from this check: >>> >>> 4143?????? (_class_name == >>> vmSymbols::java_lang_AssertionStatusDirectives() || >>> 4144??????? _class_name == vmSymbols::java_lang_Class() || >>> 4145??????? _class_name == vmSymbols::java_lang_ClassLoader() || >>> >>> 4147??????? _class_name == vmSymbols::java_lang_ref_SoftReference() || >>> 4148??????? _class_name == vmSymbols::java_lang_StackTraceElement() || >>> 4149??????? _class_name == vmSymbols::java_lang_String() || >>> 4150??????? _class_name == vmSymbols::java_lang_Throwable() || >>> >>> ?? >> >> The classes removed no longer have hardcoded offsets so did not need >> to follow the oops-first allocation style.? This was not cleaned up >> when the hardcoded offsets were removed from these classes. ? Fred >> also fixes this with his field layout patch in perhaps another place. >> >> Thanks, >> Coleen >>> >>> --- >>> >>> Thanks, >>> David >>> >>>> Ran tier1 on all oracle platforms, and 2, 3 on >>>> linux/windows-x64-debug and hs-tier4-graal because there were jvmci >>>> changes. >>>> >>>> thanks, >>>> Coleen >> From coleen.phillimore at oracle.com Tue Jan 7 14:43:10 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Tue, 7 Jan 2020 09:43:10 -0500 Subject: RFR (S) 8236224: Obsolete the FieldsAllocationStyle and CompactFields options In-Reply-To: References: <2de68ab4-f913-29d9-97ca-a8fcc5dc94b7@oracle.com> <731c3127-46ce-15a5-f7b3-02b11dd5cbf3@oracle.com> <7cc12a90-159b-cdaf-2a48-7fec1e2540c9@oracle.com> Message-ID: <8c288b90-ea49-1eb9-aa10-d02d061ca22d@oracle.com> Thanks for the code review, David. 
Coleen On 1/7/20 1:05 AM, David Holmes wrote: > Hi Coleen, > > On 7/01/2020 3:25 pm, coleen.phillimore at oracle.com wrote: >> >> >> On 1/7/20 12:14 AM, David Holmes wrote: >>> Hi Coleen, >>> >>> On 7/01/2020 1:25 pm, coleen.phillimore at oracle.com wrote: >>>> Summary: Remove the options and code for options deprecated in JDK 14 >>> >>> Generally looks good. >>> >>>> open webrev at >>>> http://cr.openjdk.java.net/~coleenp/2019/8236224.01/webrev >>>> bug link https://bugs.openjdk.java.net/browse/JDK-8236224 >>> >>> src/hotspot/share/aot/aotCodeHeap.hpp >>> >>> ? typedef struct { >>> !?? enum { CONFIG_SIZE = 7 * jintSize + 9 }; >>> ??? // 8 int values >>> >>> Now 7 int values >>> >>> ??? // byte[11] array map to boolean values here >>> >>> Now byte[10]. Or should that be byte[9]? I think the original code >>> may be off by one. >> >> Yes, it was wrong. I fixed the comments. >>> >>> --- >>> >>> src/hotspot/share/classfile/classFileParser.cpp >>> >>> 4133?? bool allocate_oops_first = false; // was allocation_style == 0 >>> >>> The comment has no context now that there is no selectable >>> allocation style. >>> >> Removed.? It was mostly to remind myself. >>> I don't understand why you removed a bunch of classes from this check: >>> >>> 4143?????? (_class_name == >>> vmSymbols::java_lang_AssertionStatusDirectives() || >>> 4144??????? _class_name == vmSymbols::java_lang_Class() || >>> 4145??????? _class_name == vmSymbols::java_lang_ClassLoader() || >>> >>> 4147??????? _class_name == vmSymbols::java_lang_ref_SoftReference() || >>> 4148??????? _class_name == vmSymbols::java_lang_StackTraceElement() || >>> 4149??????? _class_name == vmSymbols::java_lang_String() || >>> 4150??????? _class_name == vmSymbols::java_lang_Throwable() || >>> >>> ?? >> >> The classes removed no longer have hardcoded offsets so did not need >> to follow the oops-first allocation style.? This was not cleaned up >> when the hardcoded offsets were removed from these classes. ? 
Fred >> also fixes this with his field layout patch in perhaps another place. > > Okay thanks for clarifying. > > David > >> Thanks, >> Coleen >>> >>> --- >>> >>> Thanks, >>> David >>> >>>> Ran tier1 on all oracle platforms, and 2, 3 on >>>> linux/windows-x64-debug and hs-tier4-graal because there were jvmci >>>> changes. >>>> >>>> thanks, >>>> Coleen >> From david.holmes at oracle.com Wed Jan 8 00:23:46 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 8 Jan 2020 10:23:46 +1000 Subject: RFR [XS]: 8236709: struct SwitchRange in HS violates C++ One Definition Rule - was RE: struct SwitchRange and C++ One Definition Rule [-Wodr] In-Reply-To: References: Message-ID: <5d1e6853-ae4d-e229-c6b6-a8ea66402e8a@oracle.com> Hi Matthias, On 7/01/2020 7:27 pm, Baesken, Matthias wrote: > Hello, please review this small fix for an issue that I was running into when experimenting with gcc8 and the -flto compiler flag . > When building with those flags, the gcc8 warns that the SwitchRange classes in HS code violate the C++ One Definition Rule . > So I renamed one of those 2 SwitchRange classes . Could you instead put the ./share/opto/parse2.cpp version inside an anonymous namespace? It seems to me that both SwitchRange classes are intended for use in a single compilation unit, so if we can make that more obvious that seems better than solving the name clash. Otherwise I have to wonder whether you could rename the C1 version as you have in the hpp file (to avoid the global name clash) but add a typedef to allow the cpp file (and latter part of the header) to be unchanged? Also minor nit:

  class Invoke;
! class SwitchRangeC1;
  class LIRItem;

The forward declaration seems unnecessary. 
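[Editorial note] David's anonymous-namespace alternative can be sketched as follows. The class members below are invented for illustration — they are not the real fields of either SwitchRange:

```cpp
#include <cassert>

// parse2.cpp's local class, given internal linkage as suggested.
namespace {  // internal to this translation unit

class SwitchRange {
  int _lo;   // inclusive lower limit
  int _hi;   // inclusive upper limit
public:
  SwitchRange(int lo, int hi) : _lo(lo), _hi(hi) {}
  bool contains(int v) const { return _lo <= v && v <= _hi; }
};

}  // anonymous namespace

// Any other translation unit (e.g. one including c1_LIRGenerator.hpp)
// may now define its own SwitchRange without violating the One
// Definition Rule, because this definition is not visible outside
// this file.
```

Each translation unit then gets a distinct type, which is exactly the property the -Wodr warning says is currently missing.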
Thanks, David ----- > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8236709 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236709.0/ > > > Thanks, Matthias > > >>> Hello, when experimenting with gcc8 and the -flto compiler flag I was running into these warnings in the c1 coding : >>> >>> /open_jdk/jdk_3/jdk/src/hotspot/share/c1/c1_LIRGenerator.hpp:50:7: >>> warning: type 'struct SwitchRange' violates the C++ One Definition Rule [- >>> Wodr] >>> class SwitchRange: public CompilationResourceObj { >>> ^ >>> /open_jdk/jdk_3/jdk/src/hotspot/share/opto/parse2.cpp:319: note: a >>> different type is defined in another translation unit >>> class SwitchRange : public StackObj { >>> >>> >>> >> /usr/work/d040975/open_jdk/jdk_3/jdk/src/hotspot/share/c1/c1_LIRGener >>> ator.hpp:52:7: note: the first difference of corresponding definitions is field >>> '_low_key' >>> int _low_key; >>> ^ >>> /open_jdk/jdk_3/jdk/src/hotspot/share/opto/parse2.cpp:321: note: a >>> field with different name is defined in another translation unit >>> jint _lo; // inclusive lower limit >>> >>> >>> Do you think this should be fixed ( renaming one SwitchRange ) ? >>> >>> >>> Martin suggested that even without flto added it could be problematic . >> >> Yes, please file a bug and let's get this fixed quickly. This sort of thing can >> lead >> to really weird looking problems, like using the same vtable (which one >> picked >> ?at random?) for instances of both classes. 
> From coleen.phillimore at oracle.com Wed Jan 8 00:23:27 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Tue, 7 Jan 2020 19:23:27 -0500 Subject: RFR 8232759: Remove or simplify GC.class_stats Message-ID: <2faa21b1-c5d3-0caf-d7ed-6e7852814669@oracle.com> Summary: Make the GC.class_stats option obsolete open webrev at http://cr.openjdk.java.net/~coleenp/2019/8232759.01/webrev bug link https://bugs.openjdk.java.net/browse/JDK-8232759 Tested with tier1 on all Oracle platforms and tier2,3 on linux-x64-debug. Thanks, Coleen From kim.barrett at oracle.com Wed Jan 8 00:58:30 2020 From: kim.barrett at oracle.com (Kim Barrett) Date: Tue, 7 Jan 2020 19:58:30 -0500 Subject: RFR [XS]: 8236709: struct SwitchRange in HS violates C++ One Definition Rule - was RE: struct SwitchRange and C++ One Definition Rule [-Wodr] In-Reply-To: <5d1e6853-ae4d-e229-c6b6-a8ea66402e8a@oracle.com> References: <5d1e6853-ae4d-e229-c6b6-a8ea66402e8a@oracle.com> Message-ID: <37D2B3E8-2CB8-47DB-824F-78ABD737B094@oracle.com> > On Jan 7, 2020, at 7:23 PM, David Holmes wrote: > > Hi Matthias, > > On 7/01/2020 7:27 pm, Baesken, Matthias wrote: >> Hello, please review this small fix for an issue that I was running into when experimenting with gcc8 and the -flto compiler flag . >> When building with those flags, the gcc8 warns that the SwitchRange classes in HS code violate the C++ One Definition Rule . >> So I renamed one of those 2 SwitchRange classes . > > Could you instead put the ./share/opto/parse2.cpp version inside an anonymous namespace? It seems to me that both SwitchRange classes are intended for use in a single compilation unit, so if we can make that more obvious that seems better than solving the name clash. At one time I considered proposing changing the HotSpot Style Guide to permit (and indeed encourage, as there are some performance benefits too) the use of anonymous namespaces. 
However, I discovered that debuggers don't seem to like them at all, so dropped that idea. https://groups.google.com/forum/#!topic/mozilla.dev.platform/KsaG3lEEaRM Suggests Visual Studio debugger might not be able to refer to anonoymous namespace symbols, so can't set breakpoints in them &etc. Though the discussion seems to go back and forth on that. https://firefox-source-docs.mozilla.org/tools/lint/coding-style/coding_style_cpp.html Search for "Anonymous namespaces" Suggests preferring "static" to anonymous namespaces where applicable, because of poor debugger support for anonymous namespaces. https://sourceware.org/bugzilla/show_bug.cgi?id=16874 Bug for similar gdb problems. From kim.barrett at oracle.com Wed Jan 8 01:16:31 2020 From: kim.barrett at oracle.com (Kim Barrett) Date: Tue, 7 Jan 2020 20:16:31 -0500 Subject: RFR [XS]: 8236709: struct SwitchRange in HS violates C++ One Definition Rule - was RE: struct SwitchRange and C++ One Definition Rule [-Wodr] In-Reply-To: References: Message-ID: <062FA66D-4C78-4B11-AB2B-AE9B792596A6@oracle.com> > On Jan 7, 2020, at 4:27 AM, Baesken, Matthias wrote: > > Hello, please review this small fix for an issue that I was running into when experimenting with gcc8 and the -flto compiler flag . > When building with those flags, the gcc8 warns that the SwitchRange classes in HS code violate the C++ One Definition Rule . > So I renamed one of those 2 SwitchRange classes . > > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8236709 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236709.0/ I think I would prefer the C1 "namespace" disambiguator to be a prefix rather than a suffix, to be consistent with usage in other parts of HotSpot. (GC code does that somewhat consistently, for example.) I don't recall seeing "namespace" suffixes used anywhere in HotSpot. I've occasionally considered proposing that C2 code be wrapped in a namespace. It grabs a bunch of very generic global type names ("Type", "Block", &etc; really!?) 
that can be annoying. (Though the diet for precompiled.hpp helped with that.) But anything like that is probably out of scope for the immediate problem. Giving C2 primacy of place (renaming the C1 class and letting C2 keep the "good" name) seems consistent with existing practice. From david.holmes at oracle.com Wed Jan 8 05:55:49 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 8 Jan 2020 15:55:49 +1000 Subject: RFR 8232759: Remove or simplify GC.class_stats In-Reply-To: <2faa21b1-c5d3-0caf-d7ed-6e7852814669@oracle.com> References: <2faa21b1-c5d3-0caf-d7ed-6e7852814669@oracle.com> Message-ID: <6b3b3556-1ae2-41a1-bfef-30c60e7e4526@oracle.com> Hi Coleen, On 8/01/2020 10:23 am, coleen.phillimore at oracle.com wrote: > Summary: Make the GC.class_stats option obsolete > > open webrev at http://cr.openjdk.java.net/~coleenp/2019/8232759.01/webrev > bug link https://bugs.openjdk.java.net/browse/JDK-8232759 The fan out from that was larger than I was expecting :) Change appears fine. May I suggest updating the bug to get rid of the "or simplify" part. Thanks, David > Tested with tier1 on all Oracle platforms and tier2,3 on linux-x64-debug. > > Thanks, > Coleen > From matthias.baesken at sap.com Wed Jan 8 08:10:30 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Wed, 8 Jan 2020 08:10:30 +0000 Subject: RFR [XS]: 8236709: struct SwitchRange in HS violates C++ One Definition Rule - was RE: struct SwitchRange and C++ One Definition Rule [-Wodr] In-Reply-To: <062FA66D-4C78-4B11-AB2B-AE9B792596A6@oracle.com> References: <062FA66D-4C78-4B11-AB2B-AE9B792596A6@oracle.com> Message-ID: Hello, thanks for the input . I renamed SwitchRangeC1 to C1SwitchRange and removed the unnecessary forward declaration . New webrev : http://cr.openjdk.java.net/~mbaesken/webrevs/8236709.1/ Best regards, Matthias > > > > Hello, please review this small fix for an issue that I was running into when > experimenting with gcc8 and the -flto compiler flag . 
> > When building with those flags, the gcc8 warns that the SwitchRange > classes in HS code violate the C++ One Definition Rule . > > So I renamed one of those 2 SwitchRange classes . > > > > Bug/webrev : > > > > https://bugs.openjdk.java.net/browse/JDK-8236709 > > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236709.0/ > > I think I would prefer the C1 "namespace" disambiguator to be a prefix > rather than a suffix, to be consistent with usage in other parts of > HotSpot. (GC code does that somewhat consistently, for example.) I > don't recall seeing "namespace" suffixes used anywhere in HotSpot. > > I've occasionally considered proposing that C2 code be wrapped in a > namespace. It grabs a bunch of very generic global type names ("Type", > "Block", &etc; really!?) that can be annoying. (Though the diet for > precompiled.hpp helped with that.) But anything like that is probably > out of scope for the immediate problem. > > Giving C2 primacy of place (renaming the C1 class and letting C2 keep > the "good" name) seems consistent with existing practice. From thomas.schatzl at oracle.com Wed Jan 8 08:34:13 2020 From: thomas.schatzl at oracle.com (Thomas Schatzl) Date: Wed, 8 Jan 2020 09:34:13 +0100 Subject: RFR [XS]: 8236709: struct SwitchRange in HS violates C++ One Definition Rule - was RE: struct SwitchRange and C++ One Definition Rule [-Wodr] In-Reply-To: References: <062FA66D-4C78-4B11-AB2B-AE9B792596A6@oracle.com> Message-ID: Hi, On 08.01.20 09:10, Baesken, Matthias wrote: > Hello, thanks for the input . > > I renamed SwitchRangeC1 to C1SwitchRange and removed the unnecessary forward declaration . > > New webrev : > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236709.1/ > looks good. 
Thomas From david.holmes at oracle.com Wed Jan 8 09:51:11 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 8 Jan 2020 19:51:11 +1000 Subject: RFR [XS]: 8236709: struct SwitchRange in HS violates C++ One Definition Rule - was RE: struct SwitchRange and C++ One Definition Rule [-Wodr] In-Reply-To: References: <062FA66D-4C78-4B11-AB2B-AE9B792596A6@oracle.com> Message-ID: <56bd4688-7e48-c443-86f8-554539c3f325@oracle.com> On 8/01/2020 6:10 pm, Baesken, Matthias wrote: > Hello, thanks for the input . > > I renamed SwitchRangeC1 to C1SwitchRange and removed the unnecessary forward declaration . > > New webrev : > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236709.1/ Seems fine. Thanks, David > > Best regards, Matthias > > > > >>> >>> Hello, please review this small fix for an issue that I was running into when >> experimenting with gcc8 and the -flto compiler flag . >>> When building with those flags, the gcc8 warns that the SwitchRange >> classes in HS code violate the C++ One Definition Rule . >>> So I renamed one of those 2 SwitchRange classes . >>> >>> Bug/webrev : >>> >>> https://bugs.openjdk.java.net/browse/JDK-8236709 >>> >>> http://cr.openjdk.java.net/~mbaesken/webrevs/8236709.0/ >> >> I think I would prefer the C1 "namespace" disambiguator to be a prefix >> rather than a suffix, to be consistent with usage in other parts of >> HotSpot. (GC code does that somewhat consistently, for example.) I >> don't recall seeing "namespace" suffixes used anywhere in HotSpot. >> >> I've occasionally considered proposing that C2 code be wrapped in a >> namespace. It grabs a bunch of very generic global type names ("Type", >> "Block", &etc; really!?) that can be annoying. (Though the diet for >> precompiled.hpp helped with that.) But anything like that is probably >> out of scope for the immediate problem. >> >> Giving C2 primacy of place (renaming the C1 class and letting C2 keep >> the "good" name) seems consistent with existing practice. 
> From matthias.baesken at sap.com Wed Jan 8 12:08:29 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Wed, 8 Jan 2020 12:08:29 +0000 Subject: RFR [XS]: 8236709: struct SwitchRange in HS violates C++ One Definition Rule - was RE: struct SwitchRange and C++ One Definition Rule [-Wodr] In-Reply-To: <56bd4688-7e48-c443-86f8-554539c3f325@oracle.com> References: <062FA66D-4C78-4B11-AB2B-AE9B792596A6@oracle.com> <56bd4688-7e48-c443-86f8-554539c3f325@oracle.com> Message-ID: Thanks ! Kim are you fine as well with the latest webrev ? Best regards, Matthias > On 8/01/2020 6:10 pm, Baesken, Matthias wrote: > > Hello, thanks for the input . > > > > I renamed SwitchRangeC1 to C1SwitchRange and removed the > unnecessary forward declaration . > > > > New webrev : > > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236709.1/ > > Seems fine. > > Thanks, > David > From kim.barrett at oracle.com Wed Jan 8 13:44:20 2020 From: kim.barrett at oracle.com (Kim Barrett) Date: Wed, 8 Jan 2020 08:44:20 -0500 Subject: RFR [XS]: 8236709: struct SwitchRange in HS violates C++ One Definition Rule - was RE: struct SwitchRange and C++ One Definition Rule [-Wodr] In-Reply-To: References: <062FA66D-4C78-4B11-AB2B-AE9B792596A6@oracle.com> Message-ID: <85D461D2-400E-4814-A250-19A124AA37F8@oracle.com> > On Jan 8, 2020, at 3:10 AM, Baesken, Matthias wrote: > > Hello, thanks for the input . > > I renamed SwitchRangeC1 to C1SwitchRange and removed the unnecessary forward declaration . > > New webrev : > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236709.1/ Looks good. 
From coleen.phillimore at oracle.com Wed Jan 8 14:26:05 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Wed, 8 Jan 2020 09:26:05 -0500 Subject: RFR 8232759: Remove or simplify GC.class_stats In-Reply-To: <6b3b3556-1ae2-41a1-bfef-30c60e7e4526@oracle.com> References: <2faa21b1-c5d3-0caf-d7ed-6e7852814669@oracle.com> <6b3b3556-1ae2-41a1-bfef-30c60e7e4526@oracle.com> Message-ID: <9ec8d0f8-c92d-5896-8482-98a2e80f3321@oracle.com> On 1/8/20 12:55 AM, David Holmes wrote: > Hi Coleen, > > On 8/01/2020 10:23 am, coleen.phillimore at oracle.com wrote: >> Summary: Make the GC.class_stats option obsolete >> >> open webrev at >> http://cr.openjdk.java.net/~coleenp/2019/8232759.01/webrev >> bug link https://bugs.openjdk.java.net/browse/JDK-8232759 > > The fan out from that was larger than I was expecting :) Yes.? If we wanted such a thing to affect all metadata classes like this, there is metaspace_pointers_do() now that should be used instead. > > Change appears fine. May I suggest updating the bug to get rid of the > "or simplify" part. Ok. Thanks! Coleen > > Thanks, > David > >> Tested with tier1 on all Oracle platforms and tier2,3 on >> linux-x64-debug. >> >> Thanks, >> Coleen >> From stefan.karlsson at oracle.com Wed Jan 8 15:34:05 2020 From: stefan.karlsson at oracle.com (Stefan Karlsson) Date: Wed, 8 Jan 2020 16:34:05 +0100 Subject: RFR: 8236778: Add Atomic::fetch_and_add Message-ID: <3ca1aee9-0edf-e093-68d3-f77ce4dc02e4@oracle.com> Hi all, Please review this patch to introduce Atomic::fetch_and_add. 
https://cr.openjdk.java.net/~stefank/8236778/webrev.01 https://bugs.openjdk.java.net/browse/JDK-8236778 There are a number of places where we have this pattern: int result = Atomic::add(_index, amount) - amount; I'd like to introduce Atomic::fetch_and_add so that we can write: int result = Atomic::fetch_and_add(_index, amount); The current implementation already has support for both "add and fetch" and "fetch and add" but it's not exposed to the upper layers. Previously, the platform-specific code either implemented "add and fetch" or "fetch and add", and then exposed it as an "add and fetch" implementation by using CRTP and inheriting from either AddAndFetch or FetchAndAdd. My first implementation of this continued in this track, but got push-back because the code was non-intuitive and/or used non-intuitive names. Therefore, I've removed FetchAndAdd/AddAndFetch and opted to duplicate the trivial functionality in the platform files instead. For example:

+ template
+ D add_and_fetch(D volatile* dest, I add_value, atomic_memory_order order) const {
+   return fetch_and_add(dest, add_value, order) + add_value;
+ }

There have been some thoughts that maybe we should have:

void Atomic::add(...)
D Atomic::add_and_fetch(...)
D Atomic::fetch_and_add(...)

Not sure if it's worth changing to this, but if others think this is good, I can do that change. Tested with tier123, but only compiled on platforms I have access to. Thanks, StefanK From ioi.lam at oracle.com Wed Jan 8 17:05:28 2020 From: ioi.lam at oracle.com (Ioi Lam) Date: Wed, 8 Jan 2020 09:05:28 -0800 Subject: RFR 8232759: Remove or simplify GC.class_stats In-Reply-To: <9ec8d0f8-c92d-5896-8482-98a2e80f3321@oracle.com> References: <2faa21b1-c5d3-0caf-d7ed-6e7852814669@oracle.com> <6b3b3556-1ae2-41a1-bfef-30c60e7e4526@oracle.com> <9ec8d0f8-c92d-5896-8482-98a2e80f3321@oracle.com> Message-ID: <9789ea24-3f4c-6c82-1d69-670856f6e28d@oracle.com> Hi Coleen, The changes look good to me. 
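[Editorial note] The two flavors Stefan distinguishes can be illustrated with std::atomic, which stands in here for HotSpot's own Atomic class; the wrapper names mirror the proposal but are otherwise invented:

```cpp
#include <atomic>

// std::atomic's fetch_add returns the value *before* the addition
// ("fetch and add"); adding the addend back yields the value *after*
// it ("add and fetch").  The latter derivation is exactly the
//   Atomic::add(_index, amount) - amount
// pattern the patch replaces, seen from the other direction.
int fetch_and_add(std::atomic<int>& dest, int add_value) {
  return dest.fetch_add(add_value);              // old value
}

int add_and_fetch(std::atomic<int>& dest, int add_value) {
  return dest.fetch_add(add_value) + add_value;  // new value
}
```

Starting from 10 and adding 5, fetch_and_add returns 10 while add_and_fetch returns 15; both leave 15 in the destination.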
Thanks for cleaning up the mess that I introduced :-) - Ioi On 1/8/20 6:26 AM, coleen.phillimore at oracle.com wrote: > > > On 1/8/20 12:55 AM, David Holmes wrote: >> Hi Coleen, >> >> On 8/01/2020 10:23 am, coleen.phillimore at oracle.com wrote: >>> Summary: Make the GC.class_stats option obsolete >>> >>> open webrev at >>> http://cr.openjdk.java.net/~coleenp/2019/8232759.01/webrev >>> bug link https://bugs.openjdk.java.net/browse/JDK-8232759 >> >> The fan out from that was larger than I was expecting :) > > Yes.? If we wanted such a thing to affect all metadata classes like > this, there is metaspace_pointers_do() now that should be used instead. >> >> Change appears fine. May I suggest updating the bug to get rid of the >> "or simplify" part. > > Ok. > Thanks! > Coleen >> >> Thanks, >> David >> >>> Tested with tier1 on all Oracle platforms and tier2,3 on >>> linux-x64-debug. >>> >>> Thanks, >>> Coleen >>> > From coleen.phillimore at oracle.com Wed Jan 8 21:14:21 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Wed, 8 Jan 2020 16:14:21 -0500 Subject: RFR 8232759: Remove or simplify GC.class_stats In-Reply-To: <9789ea24-3f4c-6c82-1d69-670856f6e28d@oracle.com> References: <2faa21b1-c5d3-0caf-d7ed-6e7852814669@oracle.com> <6b3b3556-1ae2-41a1-bfef-30c60e7e4526@oracle.com> <9ec8d0f8-c92d-5896-8482-98a2e80f3321@oracle.com> <9789ea24-3f4c-6c82-1d69-670856f6e28d@oracle.com> Message-ID: <89e463a9-0845-e0c9-8774-3af045bf5a72@oracle.com> On 1/8/20 12:05 PM, Ioi Lam wrote: > Hi Coleen, > > The changes look good to me. Thanks for cleaning up the mess that I > introduced :-) Thanks Ioi!? It was useful for a while though. 
Coleen > > - Ioi > > On 1/8/20 6:26 AM, coleen.phillimore at oracle.com wrote: >> >> >> On 1/8/20 12:55 AM, David Holmes wrote: >>> Hi Coleen, >>> >>> On 8/01/2020 10:23 am, coleen.phillimore at oracle.com wrote: >>>> Summary: Make the GC.class_stats option obsolete >>>> >>>> open webrev at >>>> http://cr.openjdk.java.net/~coleenp/2019/8232759.01/webrev >>>> bug link https://bugs.openjdk.java.net/browse/JDK-8232759 >>> >>> The fan out from that was larger than I was expecting :) >> >> Yes.? If we wanted such a thing to affect all metadata classes like >> this, there is metaspace_pointers_do() now that should be used instead. >>> >>> Change appears fine. May I suggest updating the bug to get rid of >>> the "or simplify" part. >> >> Ok. >> Thanks! >> Coleen >>> >>> Thanks, >>> David >>> >>>> Tested with tier1 on all Oracle platforms and tier2,3 on >>>> linux-x64-debug. >>>> >>>> Thanks, >>>> Coleen >>>> >> > From kim.barrett at oracle.com Thu Jan 9 00:00:34 2020 From: kim.barrett at oracle.com (Kim Barrett) Date: Wed, 8 Jan 2020 19:00:34 -0500 Subject: RFR: 8236778: Add Atomic::fetch_and_add In-Reply-To: <3ca1aee9-0edf-e093-68d3-f77ce4dc02e4@oracle.com> References: <3ca1aee9-0edf-e093-68d3-f77ce4dc02e4@oracle.com> Message-ID: > On Jan 8, 2020, at 10:34 AM, Stefan Karlsson wrote: > > Hi all, > > Please review this patch to introduce Atomic::fetch_and_add. > > https://cr.openjdk.java.net/~stefank/8236778/webrev.01 > https://bugs.openjdk.java.net/browse/JDK-8236778 > > There are a number of places where we have this pattern: > int result = Atomic::add(_index, amount) - amount; > > I'd like to introduce Atomic::fetch_and_add so that we can write: > int result = Atomic::fetch_and_add(_index, amount); ------------------------------------------------------------------------------ src/hotspot/share/runtime/atomic.hpp Removed: 240 // - platform_add is an object of type PlatformAdd. 
241 // 242 // Then 243 // platform_add(dest, add_value) 244 // must be a valid expression, returning a result convertible to D. and 250 // Helper base classes for defining PlatformAdd. To use, define ... 275 // caller. These comments should have been updated to describe the new protocol for PlatformAdd, rather than simply removed. Documentation of extension points like this is important. Something like // - platform_add is an object of type PlatformAdd. // // Then both // platform_add.add_and_fetch(dest, add_value) // platform_add.fetch_and_add(dest, add_value) // must be valid expressions returning a result convertible to D. // // add_and_fetch atomically adds add_value to the value of dest, // returning the new value. // // fetch_and_add atomically adds add_value to the value of dest, // returning the old value. // // When D is a pointer type P*, both add_and_fetch and fetch_and_add // treat it as if it were an uintptr_t; they do not perform any // scaling of add_value, as that has already been done by the caller. ------------------------------------------------------------------------------ src/hotspot/share/runtime/atomic.hpp 679 static I scale_addend(I add_value) { 680 return add_value * sizeof(P); 681 } 682 683 static P* add_and_fetch(P* volatile* dest, I add_value, atomic_memory_order order) { 684 CI addend = add_value; 685 return PlatformAdd().add_and_fetch(dest, scale_addend(addend), order); 686 } This is converting add_value from I to CI then back to I, the latter possibly being a narrowing conversion. Better would be static CI scale_addend(CI add_value) { return add_value * sizeof(P); } and then either static P* add_and_fetch(P* volatile* dest, I add_value, atomic_memory_order order) { CI addend = scale_addend(add_value); return PlatformAdd().add_and_fetch(dest, addend, order); } or don't bother with the addend variable and just pass the scaled result directly to add_and_fetch / fetch_and_add. 
------------------------------------------------------------------------------ src/hotspot/os_cpu/windows_x86/atomic_windows_x86.hpp 54 #pragma warning(disable: 4035) // Disables warnings reporting missing return statement Pre-existing: This warning suppression (and the corresponding restoration) appears to only be needed for the !defined(AMD64) case (where __asm statements are being used), and should be moved accordingly. ------------------------------------------------------------------------------ > The current implementation already has support for both "add and fetch" and "fetch and add" but it's not exposed to the upper layers. > > Previously, the platform-specific code either implemented "add and fetch" or "fetch and add", and then exposed it as an "add and fetch" implementation by using CRTP and inheriting from either AddAndFetch or FetchAndAdd. > > My first implementation of this continued in this track, but got push-back because the code was non-intuitive and/or used non-intuitive names. Therefore, I've removed FetchAndAdd/AddAndFetch and opted to duplicate the trivial functionality in the platform files instead. For example: > > + template<typename D, typename I> > + D add_and_fetch(D volatile* dest, I add_value, atomic_memory_order order) const { > + return fetch_and_add(dest, add_value, order) + add_value; > + } For the record, I would have been fine with (and actually preferred; I dislike roughly a dozen copies of one or the other of the translation functions) CRTP-based AddAndFetch and FetchAndAdd that provided one operation in terms of the other. The move of the scaling of add_value in the pointer case to AddImpl is an improvement (even on the pre-existing code) that might have made such a CRTP approach clearer, possibly with some name changes. Then the helper CRTP base class just provides one of the functions in terms of Derived's other function, and nothing else. Perhaps the hardest part of that approach is naming the helper base classes.
FetchAndAddUsingAddAndFetch is explicit but quite a mouthful. Perhaps FetchAndAddHelper, which provides fetch_and_add in terms of Derived's add_and_fetch? Apologies for not having followed the internal pre-review discussion of this and so not commenting there. > There has been some thoughts that maybe we should have: > > void Atomic::add(...) > D Atomic::add_and_fetch(...) > D Atomic::fetch_and_add(...) > > Not sure if it's worth changing to this, but if others think this is good, I can do that change. I would be okay with renaming "add" to "add_and_fetch", though I don't at the moment have a strong preference either way. I'm not sure also providing "add" that returns void is really all that useful. We have "inc" and "dec" that return void; my recollection is that they were at one time thought to provide an opportunity for a more efficient implementation on some platforms, but discussion during the templatization project showed that compilers could eliminate an unused return value anyway. For symmetry it seems like we should have fetch_and_sub, but I don't see any current uses of sub that would need that. If add_and_fetch is added (either instead of or in addition to add), then we should definitely treat the subtraction case similarly. > Tested with tier123, but only compiled on platforms I have access to. mach5 has the ability to cross-compile to several OpenJDK platforms not otherwise supported by Oracle, though can't run tests. That can be a useful smoke test to at least avoid things like typos that don't build. For example, "-b linux-aarch64-debug".
From robbin.ehn at oracle.com Thu Jan 9 09:32:12 2020 From: robbin.ehn at oracle.com (Robbin Ehn) Date: Thu, 9 Jan 2020 10:32:12 +0100 Subject: Status for JDK-8199138, riscv64 support for Zero In-Reply-To: <96a56c15-5952-ab7c-8427-4860918f3c4a@physik.fu-berlin.de> References: <96a56c15-5952-ab7c-8427-4860918f3c4a@physik.fu-berlin.de> Message-ID: <44d74403-9424-6d91-b090-f8c626436562@oracle.com> Hi Adrian, As Magnus wrote: "For this patch to be accepted in mainline, you will need the remove the changes to the build-aux/autoconf-config* files (we cannot change them due to copyright), and instead patch the OpenJDK wrapper scripts build-aux/config*." Which you seem to have done in the current debian patch. So AFAICT just re-open and send-out that patch. (note I'm not a build file guy...) Thanks, Robbin On 2020-01-04 12:18, John Paul Adrian Glaubitz wrote: > Hi! > > Debian and several other distribution already have already been bootstrapped > for riscv64 with a large number of packages building fine. > > Support for OpenJDK Zero has been added through a slightly modified version > of JDK-8199138 [1]. Looking at the bug report for JDK-8199138, it seems that > the patch was retracted back in 2018. > > However, since the Debian version of the patch works fine, I was wondering > whether we could get it merged in one form or another? > > Thanks, > Adrian > >> [1] https://git.launchpad.net/~openjdk/ubuntu/+source/openjdk/+git/openjdk/tree/debian/patches/riscv64.diff?h=openjdk-13 >> [2] https://bugs.openjdk.java.net/browse/JDK-8199138 > From aph at redhat.com Thu Jan 9 15:04:02 2020 From: aph at redhat.com (Andrew Haley) Date: Thu, 9 Jan 2020 15:04:02 +0000 Subject: 8236856: AArch64: Spurious GCC warnings Message-ID: <7a729234-0f79-e39d-00c8-27fd48b51bd4@redhat.com> With some versions of GCC we get this at compile time, which causes build failures when warnings-as-errors is enabled. It's a false positive, and should be fixed in GCC, but we need to shut it up. 
Compiling macroAssembler_aarch64.cpp (for libjvm.so) In file included from /home/aph/jdk-jdk/src/hotspot/share/runtime/atomic.hpp:632:0, from /home/aph/jdk-jdk/src/hotspot/share/oops/oop.hpp:33, from /home/aph/jdk-jdk/src/hotspot/share/runtime/handles.hpp:29, from /home/aph/jdk-jdk/src/hotspot/share/code/oopRecorder.hpp:28, from /home/aph/jdk-jdk/src/hotspot/share/asm/codeBuffer.hpp:28, from /home/aph/jdk-jdk/src/hotspot/share/asm/assembler.hpp:28, from /home/aph/jdk-jdk/src/hotspot/cpu/aarch64/macroAssembler_aarch64.cpp:30: /home/aph/jdk-jdk/src/hotspot/os_cpu/linux_aarch64/atomic_linux_aarch64.hpp: In instantiation of 'T Atomic::PlatformCmpxchg::operator()(volatile T*, T, T, atomic_memory_order) const [with T = signed char; long unsigned int byte_size = 1ul]': /home/aph/jdk-jdk/src/hotspot/share/runtime/atomic.hpp:783:46: required from 'T Atomic::CmpxchgImpl::value || IsRegisteredEnum::value)>::type>::operator()(volatile T*, T, T, atomic_memory_order) const [with T = signed char; typename EnableIf<(IsIntegral::value || IsRegisteredEnum::value)>::type = void]' /home/aph/jdk-jdk/src/hotspot/share/runtime/atomic.hpp:756:75: required from 'static D Atomic::cmpxchg(volatile D*, U, T, atomic_memory_order) [with D = signed char; U = signed char; T = signed char]' /home/aph/jdk-jdk/src/hotspot/share/gc/shenandoah/shenandoahSharedVariables.hpp:77:113: required from here /home/aph/jdk-jdk/src/hotspot/os_cpu/linux_aarch64/atomic_linux_aarch64.hpp:60:10: Fixed thusly. OK? -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. 
https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 -------------- next part -------------- # HG changeset patch # User aph # Date 1578582061 18000 # Thu Jan 09 10:01:01 2020 -0500 # Node ID a6c0679606c37ad3c7c21537bd338f4d272aa1e3 # Parent 6d23020e3da0ed7b276e10f60e0c8d178d7c049f 8236856: AArch64: Spurious GCC warnings Reviewed-by: adinn diff -r 6d23020e3da0 -r a6c0679606c3 src/hotspot/os_cpu/linux_aarch64/atomic_linux_aarch64.hpp --- a/src/hotspot/os_cpu/linux_aarch64/atomic_linux_aarch64.hpp Thu Jan 09 09:30:49 2020 -0500 +++ b/src/hotspot/os_cpu/linux_aarch64/atomic_linux_aarch64.hpp Thu Jan 09 10:01:01 2020 -0500 @@ -55,9 +55,10 @@ return res; } +// __attribute__((unused)) on dest is to get rid of spurious GCC warnings. template<size_t byte_size> template<typename T> -inline T Atomic::PlatformCmpxchg<byte_size>::operator()(T volatile* dest, +inline T Atomic::PlatformCmpxchg<byte_size>::operator()(T volatile* dest __attribute__((unused)), T compare_value, T exchange_value, atomic_memory_order order) const { From glaubitz at physik.fu-berlin.de Thu Jan 9 15:25:14 2020 From: glaubitz at physik.fu-berlin.de (John Paul Adrian Glaubitz) Date: Thu, 9 Jan 2020 16:25:14 +0100 Subject: Status for JDK-8199138, riscv64 support for Zero In-Reply-To: <44d74403-9424-6d91-b090-f8c626436562@oracle.com> References: <96a56c15-5952-ab7c-8427-4860918f3c4a@physik.fu-berlin.de> <44d74403-9424-6d91-b090-f8c626436562@oracle.com> Message-ID: Hi! On 1/9/20 10:32 AM, Robbin Ehn wrote: > As Magnus wrote: > "For this patch to be accepted in mainline, you will need the remove the changes to the build-aux/autoconf-config* files (we cannot change them due to copyright), and instead patch the OpenJDK wrapper scripts build-aux/config*." > > Which you seem to have done in the current debian patch. > > So AFAICT just re-open and send-out that patch. (note I'm not a build file guy...) Okay, I'll put myself as owner for the bug report first, then post a new RFR. Adrian -- .''`.
John Paul Adrian Glaubitz : :' : Debian Developer - glaubitz at debian.org `. `' Freie Universitaet Berlin - glaubitz at physik.fu-berlin.de `- GPG: 62FF 8A75 84E0 2956 9546 0006 7426 3B37 F5B5 F913 From aph at redhat.com Thu Jan 9 16:09:52 2020 From: aph at redhat.com (Andrew Haley) Date: Thu, 9 Jan 2020 16:09:52 +0000 Subject: [resend] 8236856: AArch64: Spurious GCC warnings Message-ID: <2c9e41f3-44e3-9a56-1c6b-9ff86ebfb21a@redhat.com> With some versions of GCC we get this at compile time, which causes build failures when warnings-as-errors is enabled. It's a false positive, and should be fixed in GCC, but we need to shut it up. Compiling macroAssembler_aarch64.cpp (for libjvm.so) In file included from /home/aph/jdk-jdk/src/hotspot/share/runtime/atomic.hpp:632:0, from /home/aph/jdk-jdk/src/hotspot/share/oops/oop.hpp:33, from /home/aph/jdk-jdk/src/hotspot/share/runtime/handles.hpp:29, from /home/aph/jdk-jdk/src/hotspot/share/code/oopRecorder.hpp:28, from /home/aph/jdk-jdk/src/hotspot/share/asm/codeBuffer.hpp:28, from /home/aph/jdk-jdk/src/hotspot/share/asm/assembler.hpp:28, from /home/aph/jdk-jdk/src/hotspot/cpu/aarch64/macroAssembler_aarch64.cpp:30: /home/aph/jdk-jdk/src/hotspot/os_cpu/linux_aarch64/atomic_linux_aarch64.hpp: In instantiation of 'T Atomic::PlatformCmpxchg::operator()(volatile T*, T, T, atomic_memory_order) const [with T = signed char; long unsigned int byte_size = 1ul]': /home/aph/jdk-jdk/src/hotspot/share/runtime/atomic.hpp:783:46: required from 'T Atomic::CmpxchgImpl::value || IsRegisteredEnum::value)>::type>::operator()(volatile T*, T, T, atomic_memory_order) const [with T = signed char; typename EnableIf<(IsIntegral::value || IsRegisteredEnum::value)>::type = void]' /home/aph/jdk-jdk/src/hotspot/share/runtime/atomic.hpp:756:75: required from 'static D Atomic::cmpxchg(volatile D*, U, T, atomic_memory_order) [with D = signed char; U = signed char; T = signed char]' 
/home/aph/jdk-jdk/src/hotspot/share/gc/shenandoah/shenandoahSharedVariables.hpp:77:113: required from here /home/aph/jdk-jdk/src/hotspot/os_cpu/linux_aarch64/atomic_linux_aarch64.hpp:60:10: warning: parameter 'dest' set but not used [-Wunused-but-set-parameter] Fixed thusly. OK? -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 -------------- next part -------------- # HG changeset patch # User aph # Date 1578582061 18000 # Thu Jan 09 10:01:01 2020 -0500 # Node ID a6c0679606c37ad3c7c21537bd338f4d272aa1e3 # Parent 6d23020e3da0ed7b276e10f60e0c8d178d7c049f 8236856: AArch64: Spurious GCC warnings Reviewed-by: adinn diff -r 6d23020e3da0 -r a6c0679606c3 src/hotspot/os_cpu/linux_aarch64/atomic_linux_aarch64.hpp --- a/src/hotspot/os_cpu/linux_aarch64/atomic_linux_aarch64.hpp Thu Jan 09 09:30:49 2020 -0500 +++ b/src/hotspot/os_cpu/linux_aarch64/atomic_linux_aarch64.hpp Thu Jan 09 10:01:01 2020 -0500 @@ -55,9 +55,10 @@ return res; } +// __attribute__((unused)) on dest is to get rid of spurious GCC warnings. template<size_t byte_size> template<typename T> -inline T Atomic::PlatformCmpxchg<byte_size>::operator()(T volatile* dest, +inline T Atomic::PlatformCmpxchg<byte_size>::operator()(T volatile* dest __attribute__((unused)), T compare_value, T exchange_value, atomic_memory_order order) const { From adinn at redhat.com Thu Jan 9 16:12:46 2020 From: adinn at redhat.com (Andrew Dinn) Date: Thu, 9 Jan 2020 16:12:46 +0000 Subject: [resend] 8236856: AArch64: Spurious GCC warnings In-Reply-To: <2c9e41f3-44e3-9a56-1c6b-9ff86ebfb21a@redhat.com> References: <2c9e41f3-44e3-9a56-1c6b-9ff86ebfb21a@redhat.com> Message-ID: <8577e9bd-acf6-3536-056f-a3043e040bf2@redhat.com> On 09/01/2020 16:09, Andrew Haley wrote: > With some versions of GCC we get this at compile time, which causes build failures > when warnings-as-errors is enabled. It's a false positive, and should be fixed in > GCC, but we need to shut it up.
> > Compiling macroAssembler_aarch64.cpp (for libjvm.so) > In file included from /home/aph/jdk-jdk/src/hotspot/share/runtime/atomic.hpp:632:0, > from /home/aph/jdk-jdk/src/hotspot/share/oops/oop.hpp:33, > from /home/aph/jdk-jdk/src/hotspot/share/runtime/handles.hpp:29, > from /home/aph/jdk-jdk/src/hotspot/share/code/oopRecorder.hpp:28, > from /home/aph/jdk-jdk/src/hotspot/share/asm/codeBuffer.hpp:28, > from /home/aph/jdk-jdk/src/hotspot/share/asm/assembler.hpp:28, > from /home/aph/jdk-jdk/src/hotspot/cpu/aarch64/macroAssembler_aarch64.cpp:30: > /home/aph/jdk-jdk/src/hotspot/os_cpu/linux_aarch64/atomic_linux_aarch64.hpp: In instantiation of 'T Atomic::PlatformCmpxchg::operator()(volatile T*, T, T, atomic_memory_order) const [with T = signed char; long unsigned int byte_size = 1ul]': > /home/aph/jdk-jdk/src/hotspot/share/runtime/atomic.hpp:783:46: required from 'T Atomic::CmpxchgImpl::value || IsRegisteredEnum::value)>::type>::operator()(volatile T*, T, T, atomic_memory_order) const [with T = signed char; typename EnableIf<(IsIntegral::value || IsRegisteredEnum::value)>::type = void]' > /home/aph/jdk-jdk/src/hotspot/share/runtime/atomic.hpp:756:75: required from 'static D Atomic::cmpxchg(volatile D*, U, T, atomic_memory_order) [with D = signed char; U = signed char; T = signed char]' > /home/aph/jdk-jdk/src/hotspot/share/gc/shenandoah/shenandoahSharedVariables.hpp:77:113: required from here > /home/aph/jdk-jdk/src/hotspot/os_cpu/linux_aarch64/atomic_linux_aarch64.hpp:60:10: warning: parameter 'dest' set but not used [-Wunused-but-set-parameter] > > Fixed thusly. OK? Yes, that looks fine /and/ trivial. regards, Andrew Dinn ----------- Senior Principal Software Engineer Red Hat UK Ltd Registered in England and Wales under Company Registration No. 
03798903 Directors: Michael Cunningham, Michael ("Mike") O'Neill From stefan.karlsson at oracle.com Thu Jan 9 22:17:40 2020 From: stefan.karlsson at oracle.com (Stefan Karlsson) Date: Thu, 9 Jan 2020 23:17:40 +0100 Subject: RFR: 8236778: Add Atomic::fetch_and_add In-Reply-To: References: <3ca1aee9-0edf-e093-68d3-f77ce4dc02e4@oracle.com> Message-ID: <7b2a936a-e4c0-55fe-3c8a-4627fb59475a@oracle.com> Updated webrev: https://cr.openjdk.java.net/~stefank/8236778/webrev.02.delta https://cr.openjdk.java.net/~stefank/8236778/webrev.02 Comments below: On 2020-01-09 01:00, Kim Barrett wrote: >> On Jan 8, 2020, at 10:34 AM, Stefan Karlsson wrote: >> >> Hi all, >> >> Please review this patch to introduce Atomic::fetch_and_add. >> >> https://cr.openjdk.java.net/~stefank/8236778/webrev.01 >> https://bugs.openjdk.java.net/browse/JDK-8236778 >> >> There are a number of places where we have this pattern: >> int result = Atomic::add(_index, amount) - amount; >> >> I'd like to introduce Atomic::fetch_and_add so that we can write: >> int result = Atomic::fetch_and_add(_index, amount); > ------------------------------------------------------------------------------ > src/hotspot/share/runtime/atomic.hpp > Removed: > 240 // - platform_add is an object of type PlatformAdd. > 241 // > 242 // Then > 243 // platform_add(dest, add_value) > 244 // must be a valid expression, returning a result convertible to D. > > and > > 250 // Helper base classes for defining PlatformAdd. To use, define > ... > 275 // caller. > > These comments should have been updated to describe the new protocol > for PlatformAdd, rather than simply removed. Documentation of > extension points like this is important. The comments seemed to have already started deteriorating. It seems to be missing an *if* that matches the *then*, and it doesn't mention *order*. So, instead of trying to figure out what the comment tried to say, I opted to get rid of those problems by removing that part of the comment.
I thought the code was pretty self-explanatory, and didn't need that part. I'm not opposed to fixing the comments if you have suggestions. > Something like > > // - platform_add is an object of type PlatformAdd. > // > // Then both > // platform_add.add_and_fetch(dest, add_value) > // platform_add.fetch_and_add(dest, add_value) > // must be valid expressions returning a result convertible to D. > // > // add_and_fetch atomically adds add_value to the value of dest, > // returning the new value. > // > // fetch_and_add atomically adds add_value to the value of dest, > // returning the old value. > // > // When D is a pointer type P*, both add_and_fetch and fetch_and_add > // treat it as if it were an uintptr_t; they do not perform any > // scaling of add_value, as that has already been done by the caller. I'll update the code verbatim with what you've suggested. > > ------------------------------------------------------------------------------ > src/hotspot/share/runtime/atomic.hpp > 679 static I scale_addend(I add_value) { > 680 return add_value * sizeof(P); > 681 } > 682 > 683 static P* add_and_fetch(P* volatile* dest, I add_value, atomic_memory_order order) { > 684 CI addend = add_value; > 685 return PlatformAdd().add_and_fetch(dest, scale_addend(addend), order); > 686 } > > This is converting add_value from I to CI then back to I, the latter > possibly being a narrowing conversion. Better would be > > static CI scale_addend(CI add_value) { > return add_value * sizeof(P); > } > > and then either > > static P* add_and_fetch(P* volatile* dest, I add_value, atomic_memory_order order) { > CI addend = scale_addend(add_value); > return PlatformAdd().add_and_fetch(dest, addend, order); > } > > or don't bother with the addend variable and just pass the scaled > result directly to add_and_fetch / fetch_and_add. Thanks. This was an unintentional change from the original code. 
> > ------------------------------------------------------------------------------ > src/hotspot/os_cpu/windows_x86/atomic_windows_x86.hpp > 54 #pragma warning(disable: 4035) // Disables warnings reporting missing return statement > > Pre-existing: This warning suppression (and the corresponding > restoration) appears to only be needed for the !defined(AMD64) case > (where __asm statements are being used), and should be moved accordingly. Will you create a bug report for that? > > ------------------------------------------------------------------------------ > >> The current implementation already has support for both "add and fetch" and "fetch and add" but it's not exposed to the upper layers. >> >> Previously, the platform-specific code either implemented "add and fetch" or "fetch and add", and then exposed it as an "add and fetch" implementation by using CRTP and inheriting from either AddAndFetch or FetchAndAdd. >> >> My first implementation of this continued in this track, but got push-back because the code was non-intuitive and/or used non-intuitive names. Therefore, I've removed FetchAndAdd/AddAndFetch and opted to duplicate the trivial functionality in the platform files instead. For example: >> >> + template >> + D add_and_fetch(D volatile* dest, I add_value, atomic_memory_order order) const { >> + return fetch_and_add(dest, add_value, order) + add_value; >> + } > For the record, I would have been fine with (and actually preferred; I > dislike roughly a dozen copies of one or the other of the translation > functions) CRTP-based AddAndFetch and FetchAndAdd that provided one > operation in terms of the other. > > The move of the scaling of add_value in the pointer case to AddImpl is > an improvement (even on the pre-existing code) that might have made > such a CRTP approach clearer, possibly with some name changes. Then > the helper CRTP base class just provides one of the functions in terms > of Derived's other function, and nothing else. 
> > Perhaps the hardest part of that approach is naming the helper base > classes. FetchAndAddUsingAddAndFetch is explicit but quite a > mouthful. Perhaps FetchAndAddHelper, which provides fetch_and_add in > terms of Derived's add_and_fetch? > > Apologies for not having followed the internal pre-review discussion > of this and so not commenting there. > >> There has been some thoughts that maybe we should have: >> >> void Atomic::add(...) >> D Atomic::add_and_fetch(...) >> D Atomic::fetch_and_add(...) >> >> Not sure if it's worth changing to this, but if others think this is good, I can do that change. > I would be okay with renaming "add" to "add_and_fetch", though I don't > at the moment have a strong preference either way. I'm not sure also > providing "add" that returns void is really all that useful. We have > "inc" and "dec" that return void; my recollection is that they were at > one time thought to provide an opportunity for a more efficient > implementation on some platforms, but discussion during the > templatization project showed that compilers could eliminate an unused > return value anyway. > > For symmetry it seems like we should have fetch_and_sub, but I don't > see any current uses of sub that would need that. > > If add_and_fetch is added (either instead of or in addition to add), > then we should definitely treat the subtraction case similarly. I'll interpret your comments above as meaning that there might be opportunities to do more changes, but none of those needs to be done for this patch. > >> Tested with tier123, but only compiled on platforms I have access to. > mach5 has the ability to cross-compile to several OpenJDK platforms > not otherwise supported by Oracle, though can't run tests. That can > be a useful smoke test to at least avoid things like typos that don't > build. For example, "-b linux-aarch64-debug". Right. I wasn't explicit about it, but I've compiled locally (linux) for aarch64, arm32, ppc64, s390, zero, minimal.
Thanks, StefanK > From kim.barrett at oracle.com Fri Jan 10 01:50:08 2020 From: kim.barrett at oracle.com (Kim Barrett) Date: Thu, 9 Jan 2020 20:50:08 -0500 Subject: RFR: 8236778: Add Atomic::fetch_and_add In-Reply-To: <7b2a936a-e4c0-55fe-3c8a-4627fb59475a@oracle.com> References: <3ca1aee9-0edf-e093-68d3-f77ce4dc02e4@oracle.com> <7b2a936a-e4c0-55fe-3c8a-4627fb59475a@oracle.com> Message-ID: <4F9B2E5D-DF2E-492C-A384-430C09B0DB5B@oracle.com> > On Jan 9, 2020, at 5:17 PM, Stefan Karlsson wrote: > > Updated webrev: > https://cr.openjdk.java.net/~stefank/8236778/webrev.02.delta > https://cr.openjdk.java.net/~stefank/8236778/webrev.02 Looks good except for a couple more comment tweaks described below. >> src/hotspot/share/runtime/atomic.hpp >> Removed: >> 240 // - platform_add is an object of type PlatformAdd. >> 241 // >> 242 // Then >> 243 // platform_add(dest, add_value) >> 244 // must be a valid expression, returning a result convertible to D. >> >> and >> >> 250 // Helper base classes for defining PlatformAdd. To use, define >> ... >> 275 // caller. >> >> These comments should have been updated to describe the new protocol >> for PlatformAdd, rather than simply removed. Documentation of >> extension points like this is important. > > The comments seemed to already have started deteriorating. It's seems to be missing an *if* that matches the *then, The structure is "Given then "; there was never an "if". All of the similar comments in this file were written that way. But there's a further update that I just noticed is needed by your changes: 233 // class is a function object that must be default constructable, Delete "is a function object that" in the above. > and it doesn't mention *order*. It's true that *order* isn't mentioned; that seems to have been missed when the order argument was added here. (It *is* described in the xchg and cmpxchg commentary, and we could do similarly here.) Add // - order is of type atomic_memory_order. 
to the requirements list, and change the expressions to include an order argument, e.g. // Then both // platform_add.add_and_fetch(dest, add_value, order) // platform_add.fetch_and_add(dest, add_value, order) > So, instead of trying to figure out what the comment tried to say, I opted to get rid of those problems by removing that part of the comment. I thought the code was pretty self-explanatory, and didn't need that part. I'm not opposed to fixing the comments if you have suggestions. Unfortunately, documenting templates can be pretty hard. But because of the looseness of templates, differentiating between intentional semantics and accidents of today's implementation based only on the code can also be pretty hard. Yet failure to do so leads to brittleness and uncertainties, especially when there are third-party specializations. So I think it's worth making an attempt at documentation, even if it's hard and the results not always perfect. > I'll update the code verbatim with what you've suggested. Plus the couple further modifications mentioned above? >> src/hotspot/os_cpu/windows_x86/atomic_windows_x86.hpp >> 54 #pragma warning(disable: 4035) // Disables warnings reporting missing return statement >> >> Pre-existing: This warning suppression (and the corresponding >> restoration) appears to only be needed for the !defined(AMD64) case >> (where __asm statements are being used), and should be moved accordingly. > > Will you create a bug report for that? https://bugs.openjdk.java.net/browse/JDK-8236900 atomic_windows_x86.hpp disables warning 4035 more widely than needed > I'll interpret your comments above as meaning that there might be opportunities to do more changes, but none of those needs to be done for this patch. OK. > Right. I wasn't explicit about it, but I've compiled locally (linux) for aarch64, arm32, ppc64, s390, zero, minimal. Ah, that wasn't clear to me from what you said. Good.
From david.holmes at oracle.com Fri Jan 10 02:05:01 2020 From: david.holmes at oracle.com (David Holmes) Date: Fri, 10 Jan 2020 12:05:01 +1000 Subject: RFR: 8236778: Add Atomic::fetch_and_add In-Reply-To: <7b2a936a-e4c0-55fe-3c8a-4627fb59475a@oracle.com> References: <3ca1aee9-0edf-e093-68d3-f77ce4dc02e4@oracle.com> <7b2a936a-e4c0-55fe-3c8a-4627fb59475a@oracle.com> Message-ID: <48a620f5-a562-b3b6-3c80-7c6c33fa8ba9@oracle.com> Hi Stefan, On 10/01/2020 8:17 am, Stefan Karlsson wrote: > Updated webrev: > ?https://cr.openjdk.java.net/~stefank/8236778/webrev.02.delta > ?https://cr.openjdk.java.net/~stefank/8236778/webrev.02 That all seems okay to me (not that I fully understand the details of the template code). I'll leave you and Kim to work out the details of the commentary. Thanks, David > Comments below: > > On 2020-01-09 01:00, Kim Barrett wrote: >>> On Jan 8, 2020, at 10:34 AM, Stefan Karlsson >>> wrote: >>> >>> Hi all, >>> >>> Please review this patch to introduce Atomic::fetch_and_add. >>> >>> https://cr.openjdk.java.net/~stefank/8236778/webrev.01 >>> https://bugs.openjdk.java.net/browse/JDK-8236778 >>> >>> There are a number of places where we have this pattern: >>> int result = Atomic::add(_index, amount) - amount; >>> >>> I'd like to introduce Atomic::fetch_and_add so that we can write: >>> int result = Atomic::fetch_and_add(_index, amount); >> ------------------------------------------------------------------------------ >> >> src/hotspot/share/runtime/atomic.hpp >> Removed: >> ? 240?? // - platform_add is an object of type PlatformAdd. >> ? 241?? // >> ? 242?? // Then >> ? 243?? //?? platform_add(dest, add_value) >> ? 244?? // must be a valid expression, returning a result convertible >> to D. >> >> and >> >> ? 250?? // Helper base classes for defining PlatformAdd.? To use, define >> ... >> ? 275?? // caller. >> >> These comments should have been updated to describe the new protocol >> for PlatformAdd, rather than simply removed.? 
Documentation of >> extension points like this is important. > > The comments seemed to already have started deteriorating. It seems to > be missing an *if* that matches the *then*, and it doesn't mention > *order*. So, instead of trying to figure out what the comment tried to > say, I opted for getting rid of those problems by getting rid of that > part of the comment. I thought the code was pretty self-explanatory, and > didn't need that part. I'm not opposed to fixing the comments if you > have suggestions. > >> Something like >> >> // - platform_add is an object of type PlatformAdd. >> // >> // Then both >> //   platform_add.add_and_fetch(dest, add_value) >> //   platform_add.fetch_and_add(dest, add_value) >> // must be valid expressions returning a result convertible to D. >> // >> // add_and_fetch atomically adds add_value to the value of dest, >> // returning the new value. >> // >> // fetch_and_add atomically adds add_value to the value of dest, >> // returning the old value. >> // >> // When D is a pointer type P*, both add_and_fetch and fetch_and_add >> // treat it as if it were an uintptr_t; they do not perform any >> // scaling of add_value, as that has already been done by the caller. > > I'll update the code verbatim with what you've suggested. > >> >> ------------------------------------------------------------------------------ >> >> src/hotspot/share/runtime/atomic.hpp >> 679   static I scale_addend(I add_value) { >> 680     return add_value * sizeof(P); >> 681   } >> 682 >> 683   static P* add_and_fetch(P* volatile* dest, I add_value, >> atomic_memory_order order) { >> 684     CI addend = add_value; >> 685     return PlatformAdd().add_and_fetch(dest, >> scale_addend(addend), order); >> 686   } >> >> This is converting add_value from I to CI then back to I, the latter >> possibly being a narrowing conversion. Better would be >> >> static CI scale_addend(CI add_value) { >> return add_value * sizeof(P); >> } >> >> and then either >> >> static P* add_and_fetch(P* volatile* dest, I add_value, >> atomic_memory_order order) { >> CI addend = scale_addend(add_value); >> return PlatformAdd().add_and_fetch(dest, addend, order); >> } >> >> or don't bother with the addend variable and just pass the scaled >> result directly to add_and_fetch / fetch_and_add. > > Thanks. This was an unintentional change from the original code. >> >> ------------------------------------------------------------------------------ >> >> src/hotspot/os_cpu/windows_x86/atomic_windows_x86.hpp >> 54 #pragma warning(disable: 4035) // Disables warnings reporting >> missing return statement >> >> Pre-existing: This warning suppression (and the corresponding >> restoration) appears to only be needed for the !defined(AMD64) case >> (where __asm statements are being used), and should be moved accordingly. > > Will you create a bug report for that? > >> >> ------------------------------------------------------------------------------ >> >> >>> The current implementation already has support for both "add and >>> fetch" and "fetch and add" but it's not exposed to the upper layers. >>> >>> Previously, the platform-specific code either implemented "add and >>> fetch" or "fetch and add", and then exposed it as an "add and fetch" >>> implementation by using CRTP and inheriting from either AddAndFetch >>> or FetchAndAdd. >>> >>> My first implementation of this continued in this track, but got >>> push-back because the code was non-intuitive and/or used >>> non-intuitive names. Therefore, I've removed FetchAndAdd/AddAndFetch >>> and opted to duplicate the trivial functionality in the platform >>> files instead. For example: >>> >>> + template >>> + D add_and_fetch(D volatile* dest, I add_value, atomic_memory_order >>> order) const { >>> +   return fetch_and_add(dest, add_value, order) + add_value; >>> + } >> For the record, I would have been fine with (and actually preferred; I >> dislike roughly a dozen copies of one or the other of the translation >> functions) CRTP-based AddAndFetch and FetchAndAdd that provided one >> operation in terms of the other. >> >> The move of the scaling of add_value in the pointer case to AddImpl is >> an improvement (even on the pre-existing code) that might have made >> such a CRTP approach clearer, possibly with some name changes. Then >> the helper CRTP base class just provides one of the functions in terms >> of Derived's other function, and nothing else. >> >> Perhaps the hardest part of that approach is naming the helper base >> classes. FetchAndAddUsingAddAndFetch is explicit but quite a >> mouthful. Perhaps FetchAndAddHelper, which provides fetch_and_add in >> terms of Derived's add_and_fetch? >> >> Apologies for not having followed the internal pre-review discussion >> of this and so not commenting there. >> >>> There have been some thoughts that maybe we should have: >>> >>> void Atomic::add(...) >>> D    Atomic::add_and_fetch(...) >>> D    Atomic::fetch_and_add(...) >>> >>> Not sure if it's worth changing to this, but if others think this is >>> good, I can do that change. >> I would be okay with renaming "add" to "add_and_fetch", though I don't >> at the moment have a strong preference either way. I'm not sure also >> providing "add" that returns void is really all that useful. We have >> "inc" and "dec" that return void; my recollection is that they were at >> one time thought to provide an opportunity for a more efficient >> implementation on some platforms, but discussion during the >> templatization project showed that compilers could eliminate an unused >> return value anyway. >> >> For symmetry it seems like we should have fetch_and_sub, but I don't >> see any current uses of sub that would need that. 
>> >> If add_and_fetch is added (either instead of or in addition to add), >> then we should definitely treat the subtraction case similarly. > > I'll interpret your comments above that there might be opportunities to > do more changes, but none of those needs to be done for this patch. > >> >>> Tested with tier123, but only compiled on platforms I have access to. >> mach5 has the ability to cross-compile to several OpenJDK platforms >> not otherwise supported by Oracle, though can't run tests. That can >> be a useful smoke test to at least avoid things like typos that don't >> build. For example, "-b linux-aarch64-debug". > > Right. I wasn't explicit about it, but I've compiled locally (linux) for > aarch64, arm32, ppc64, s390, zero, minimal. > > Thanks, > StefanK >> >
From kim.barrett at oracle.com Fri Jan 10 07:38:35 2020 From: kim.barrett at oracle.com (Kim Barrett) Date: Fri, 10 Jan 2020 02:38:35 -0500 Subject: RFR: 8236778: Add Atomic::fetch_and_add In-Reply-To: <4F9B2E5D-DF2E-492C-A384-430C09B0DB5B@oracle.com> References: <3ca1aee9-0edf-e093-68d3-f77ce4dc02e4@oracle.com> <7b2a936a-e4c0-55fe-3c8a-4627fb59475a@oracle.com> <4F9B2E5D-DF2E-492C-A384-430C09B0DB5B@oracle.com> Message-ID: <8A76CC7A-7CD1-470C-A008-9E3AB94C0BFF@oracle.com> > On Jan 9, 2020, at 8:50 PM, Kim Barrett wrote: > >> On Jan 9, 2020, at 5:17 PM, Stefan Karlsson wrote: >> >> Updated webrev: >> https://cr.openjdk.java.net/~stefank/8236778/webrev.02.delta >> https://cr.openjdk.java.net/~stefank/8236778/webrev.02 > > Looks good except for a couple more comment tweaks described below. BTW, I don't need a new webrev for those additional comment tweaks.
From matthias.baesken at sap.com Fri Jan 10 08:51:29 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Fri, 10 Jan 2020 08:51:29 +0000 Subject: RFR: 8236617: jtreg test containers/docker/TestMemoryAwareness.java fails after 8226575 References: Message-ID: Ping ... Martin/Bob are you fine with the latest rev too ? 
Best regards, Matthias > Hi Christoph, thanks for the review ! > > Bob are you fine with the latest version ? > > Best regards, Matthias > > > > > > > > So I think your suggestion to return 0 in that special case in function > > > getTotalSwapSpaceSize sounds reasonable to me ( at least better than > > > return a large negative value ). > > > New webrev : > > > > > > > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236617.2/ > > > > > > > > >
From stefan.karlsson at oracle.com Fri Jan 10 08:56:06 2020 From: stefan.karlsson at oracle.com (Stefan Karlsson) Date: Fri, 10 Jan 2020 09:56:06 +0100 Subject: RFR: 8236778: Add Atomic::fetch_and_add In-Reply-To: <4F9B2E5D-DF2E-492C-A384-430C09B0DB5B@oracle.com> References: <3ca1aee9-0edf-e093-68d3-f77ce4dc02e4@oracle.com> <7b2a936a-e4c0-55fe-3c8a-4627fb59475a@oracle.com> <4F9B2E5D-DF2E-492C-A384-430C09B0DB5B@oracle.com> Message-ID: On 2020-01-10 02:50, Kim Barrett wrote: >> On Jan 9, 2020, at 5:17 PM, Stefan Karlsson wrote: >> >> Updated webrev: >> https://cr.openjdk.java.net/~stefank/8236778/webrev.02.delta >> https://cr.openjdk.java.net/~stefank/8236778/webrev.02 > Looks good except for a couple more comment tweaks described below. > >>> src/hotspot/share/runtime/atomic.hpp >>> Removed: >>> 240 // - platform_add is an object of type PlatformAdd. >>> 241 // >>> 242 // Then >>> 243 // platform_add(dest, add_value) >>> 244 // must be a valid expression, returning a result convertible to D. >>> >>> and >>> >>> 250 // Helper base classes for defining PlatformAdd. To use, define >>> ... >>> 275 // caller. >>> >>> These comments should have been updated to describe the new protocol >>> for PlatformAdd, rather than simply removed. Documentation of >>> extension points like this is important. >> The comments seemed to already have started deteriorating. It seems to be missing an *if* that matches the *then*, > The structure is "Given then "; > there was never an "if". 
All of the similar comments in this file > were written that way. Well, there's no explicit *given* in the comments either. So, to me, it still sounds weird. Updated webrev with your comments below: https://cr.openjdk.java.net/~stefank/8236778/webrev.03.delta https://cr.openjdk.java.net/~stefank/8236778/webrev.03 StefanK > But there's a further update that I just noticed is needed by your changes: > > 233 // class is a function object that must be default constructable, > > Delete "is a function object that" in the above. > >> and it doesn't mention *order*. > It's true that *order* isn't mentioned; that seems to have been missed > when the order argument was added here. (It *is* described in the > xchg and cmpxchg commentary, and we could do similarly here.) > > Add > > // - order is of type atomic_memory_order. > > to the requirements list, and change the expressions to include an > order argument, e.g. > > // Then both > // platform_add.add_and_fetch(dest, add_value, order) > // platform_add.fetch_and_add(dest, add_value, order) > >> So, instead of trying to figure out what the comment tried to say, I opted for getting rid of those problems by getting rid of that part of the comment. I thought the code was pretty self-explanatory, and didn't need that part. I'm not opposed to fixing the comments if you have suggestions. > Unfortunately, documenting templates can be pretty hard. But because > of the looseness of templates, differentiating between intentional > semantics and accidents of today's implementation based only on the > code can also be pretty hard. Yet failure to do so leads to > brittleness and uncertainties, especially when there are third-party > specializations. So I think it's worth making an attempt at > documentation, even if it's hard and the results not always perfect. > >> I'll update the code verbatim with what you've suggested. > Plus the couple further modifications mentioned above? 
> >>> src/hotspot/os_cpu/windows_x86/atomic_windows_x86.hpp >>> 54 #pragma warning(disable: 4035) // Disables warnings reporting missing return statement >>> >>> Pre-existing: This warning suppression (and the corresponding >>> restoration) appears to only be needed for the !defined(AMD64) case >>> (where __asm statements are being used), and should be moved accordingly. >> Will you create a bug report for that? > https://bugs.openjdk.java.net/browse/JDK-8236900 > atomic_windows_x86.hpp disables warning 4035 more widely than needed > >> I'll interpret your comments above that there might be opportunities to do more changes, but none of those needs to be done for this patch. > OK. > >> Right. I wasn't explicit about it, but I've compiled locally (linux) for aarch64, arm32, ppc64, s390, zero, minimal. > Ah, that wasn't clear to me from what you said. Good. >
From stefan.karlsson at oracle.com Fri Jan 10 08:57:26 2020 From: stefan.karlsson at oracle.com (Stefan Karlsson) Date: Fri, 10 Jan 2020 09:57:26 +0100 Subject: RFR: 8236778: Add Atomic::fetch_and_add In-Reply-To: <48a620f5-a562-b3b6-3c80-7c6c33fa8ba9@oracle.com> References: <3ca1aee9-0edf-e093-68d3-f77ce4dc02e4@oracle.com> <7b2a936a-e4c0-55fe-3c8a-4627fb59475a@oracle.com> <48a620f5-a562-b3b6-3c80-7c6c33fa8ba9@oracle.com> Message-ID: Thanks for reviewing, David. StefanK On 2020-01-10 03:05, David Holmes wrote: > Hi Stefan, > > On 10/01/2020 8:17 am, Stefan Karlsson wrote: >> Updated webrev: >> https://cr.openjdk.java.net/~stefank/8236778/webrev.02.delta >> https://cr.openjdk.java.net/~stefank/8236778/webrev.02 > > That all seems okay to me (not that I fully understand the details of > the template code). I'll leave you and Kim to work out the details of > the commentary. > > Thanks, > David > >> Comments below: >> >> On 2020-01-09 01:00, Kim Barrett wrote: >>>> On Jan 8, 2020, at 10:34 AM, Stefan Karlsson >>>> wrote: >>>> >>>> Hi all, >>>> >>>> Please review this patch to introduce Atomic::fetch_and_add. 
>>>> >>>> https://cr.openjdk.java.net/~stefank/8236778/webrev.01 >>>> https://bugs.openjdk.java.net/browse/JDK-8236778 >>>> >>>> There are a number of places where we have this pattern: >>>> int result = Atomic::add(_index, amount) - amount; >>>> >>>> I'd like to introduce Atomic::fetch_and_add so that we can write: >>>> int result = Atomic::fetch_and_add(_index, amount); >>> ------------------------------------------------------------------------------ >>> >>> src/hotspot/share/runtime/atomic.hpp >>> Removed: >>> 240   // - platform_add is an object of type PlatformAdd. >>> 241   // >>> 242   // Then >>> 243   //   platform_add(dest, add_value) >>> 244   // must be a valid expression, returning a result >>> convertible to D. >>> >>> and >>> >>> 250   // Helper base classes for defining PlatformAdd. To use, >>> define >>> ... >>> 275   // caller. >>> >>> These comments should have been updated to describe the new protocol >>> for PlatformAdd, rather than simply removed. Documentation of >>> extension points like this is important. >> >> The comments seemed to already have started deteriorating. It seems >> to be missing an *if* that matches the *then*, and it doesn't mention >> *order*. So, instead of trying to figure out what the comment tried >> to say, I opted for getting rid of those problems by getting rid of >> that part of the comment. I thought the code was pretty >> self-explanatory, and didn't need that part. I'm not opposed to >> fixing the comments if you have suggestions. >> >>> Something like >>> >>> // - platform_add is an object of type PlatformAdd. >>> // >>> // Then both >>> //   platform_add.add_and_fetch(dest, add_value) >>> //   platform_add.fetch_and_add(dest, add_value) >>> // must be valid expressions returning a result convertible to D. >>> // >>> // add_and_fetch atomically adds add_value to the value of dest, >>> // returning the new value. >>> // >>> // fetch_and_add atomically adds add_value to the value of dest, >>> // returning the old value. >>> // >>> // When D is a pointer type P*, both add_and_fetch and fetch_and_add >>> // treat it as if it were an uintptr_t; they do not perform any >>> // scaling of add_value, as that has already been done by the >>> caller. >> >> I'll update the code verbatim with what you've suggested. >> >>> >>> ------------------------------------------------------------------------------ >>> >>> src/hotspot/share/runtime/atomic.hpp >>> 679   static I scale_addend(I add_value) { >>> 680     return add_value * sizeof(P); >>> 681   } >>> 682 >>> 683   static P* add_and_fetch(P* volatile* dest, I add_value, >>> atomic_memory_order order) { >>> 684     CI addend = add_value; >>> 685     return PlatformAdd().add_and_fetch(dest, >>> scale_addend(addend), order); >>> 686   } >>> >>> This is converting add_value from I to CI then back to I, the latter >>> possibly being a narrowing conversion. Better would be >>> >>> static CI scale_addend(CI add_value) { >>> return add_value * sizeof(P); >>> } >>> >>> and then either >>> >>> static P* add_and_fetch(P* volatile* dest, I add_value, >>> atomic_memory_order order) { >>> CI addend = scale_addend(add_value); >>> return PlatformAdd().add_and_fetch(dest, addend, >>> order); >>> } >>> >>> or don't bother with the addend variable and just pass the scaled >>> result directly to add_and_fetch / fetch_and_add. >> >> Thanks. This was an unintentional change from the original code. >>> >>> ------------------------------------------------------------------------------ >>> >>> src/hotspot/os_cpu/windows_x86/atomic_windows_x86.hpp >>> 
54 #pragma warning(disable: 4035) // Disables warnings reporting >>> missing return statement >>> >>> Pre-existing: This warning suppression (and the corresponding >>> restoration) appears to only be needed for the !defined(AMD64) case >>> (where __asm statements are being used), and should be moved >>> accordingly. >> >> Will you create a bug report for that? >> >>> >>> ------------------------------------------------------------------------------ >>> >>> >>>> The current implementation already has support for both "add and >>>> fetch" and "fetch and add" but it's not exposed to the upper layers. >>>> >>>> Previously, the platform-specific code either implemented "add and >>>> fetch" or "fetch and add", and then exposed it as an "add and >>>> fetch" implementation by using CRTP and inheriting from either >>>> AddAndFetch or FetchAndAdd. >>>> >>>> My first implementation of this continued in this track, but got >>>> push-back because the code was non-intuitive and/or used >>>> non-intuitive names. Therefore, I've removed >>>> FetchAndAdd/AddAndFetch and opted to duplicate the trivial >>>> functionality in the platform files instead. For example: >>>> >>>> + template >>>> + D add_and_fetch(D volatile* dest, I add_value, >>>> atomic_memory_order order) const { >>>> +   return fetch_and_add(dest, add_value, order) + add_value; >>>> + } >>> For the record, I would have been fine with (and actually preferred; I >>> dislike roughly a dozen copies of one or the other of the translation >>> functions) CRTP-based AddAndFetch and FetchAndAdd that provided one >>> operation in terms of the other. >>> >>> The move of the scaling of add_value in the pointer case to AddImpl is >>> an improvement (even on the pre-existing code) that might have made >>> such a CRTP approach clearer, possibly with some name changes. Then >>> the helper CRTP base class just provides one of the functions in terms >>> of Derived's other function, and nothing else. 
>>> >>> Perhaps the hardest part of that approach is naming the helper base >>> classes. FetchAndAddUsingAddAndFetch is explicit but quite a >>> mouthful. Perhaps FetchAndAddHelper, which provides fetch_and_add in >>> terms of Derived's add_and_fetch? >>> >>> Apologies for not having followed the internal pre-review discussion >>> of this and so not commenting there. >>> >>>> There have been some thoughts that maybe we should have: >>>> >>>> void Atomic::add(...) >>>> D    Atomic::add_and_fetch(...) >>>> D    Atomic::fetch_and_add(...) >>>> >>>> Not sure if it's worth changing to this, but if others think this >>>> is good, I can do that change. >>> I would be okay with renaming "add" to "add_and_fetch", though I don't >>> at the moment have a strong preference either way. I'm not sure also >>> providing "add" that returns void is really all that useful. We have >>> "inc" and "dec" that return void; my recollection is that they were at >>> one time thought to provide an opportunity for a more efficient >>> implementation on some platforms, but discussion during the >>> templatization project showed that compilers could eliminate an unused >>> return value anyway. >>> >>> For symmetry it seems like we should have fetch_and_sub, but I don't >>> see any current uses of sub that would need that. >>> >>> If add_and_fetch is added (either instead of or in addition to add), >>> then we should definitely treat the subtraction case similarly. >> >> I'll interpret your comments above that there might be opportunities >> to do more changes, but none of those needs to be done for this patch. >> >>> >>>> Tested with tier123, but only compiled on platforms I have access to. >>> mach5 has the ability to cross-compile to several OpenJDK platforms >>> not otherwise supported by Oracle, though can't run tests. That can >>> be a useful smoke test to at least avoid things like typos that don't >>> build. For example, "-b linux-aarch64-debug". >> >> Right. 
I wasn't explicit about it, but I've compiled locally (linux) >> for aarch64, arm32, ppc64, s390, zero, minimal. >> >> Thanks, >> StefanK >>> >>
From matthias.baesken at sap.com Fri Jan 10 10:01:47 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Fri, 10 Jan 2020 10:01:47 +0000 Subject: serviceability agent : problems when using gcc LTO (link time optimization) Message-ID: Hello, I recently looked into the gcc LTO optimization mode (see for some details https://gcc.gnu.org/onlinedocs/gccint/LTO-Overview.html and http://hubicka.blogspot.com/2019/05/gcc-9-link-time-and-inter-procedural.html ). This mode can lead to more compact binaries (~10% smaller); it also might bring small performance improvements, but that wasn't my (main) goal. The changes for this are rather small: one needs to use a recent gcc and add -flto to the compile flags, for example --- a/make/autoconf/flags-cflags.m4 Wed Jan 01 03:08:45 2020 +0100 +++ b/make/autoconf/flags-cflags.m4 Wed Jan 08 17:39:10 2020 +0100 @@ -530,8 +530,13 @@ fi if test "x$TOOLCHAIN_TYPE" = xgcc; then - TOOLCHAIN_CFLAGS_JVM="$TOOLCHAIN_CFLAGS_JVM -fcheck-new -fstack-protector" - TOOLCHAIN_CFLAGS_JDK="-pipe -fstack-protector" + TOOLCHAIN_CFLAGS_JVM="$TOOLCHAIN_CFLAGS_JVM -fcheck-new -fstack-protector -flto" + TOOLCHAIN_CFLAGS_JDK="-pipe -fstack-protector -flto" .... and you have to make sure to use gcc-ar and gcc-nm instead of ar / nm. Build and test(s) work, however with one exception. The serviceability tests like serviceability/sa seem to rely heavily on the "normal" structure of libjvm.so (from what I understand, e.g. in LinuxVtblAccess it is attempted to access internal symbols like _ZTV ). 
Errors in the sa tests look like : java.lang.InternalError: Metadata does not appear to be polymorphic at jdk.hotspot.agent/sun.jvm.hotspot.types.basic.BasicTypeDataBase.findDynamicTypeForAddress(BasicTypeDataBase.java:279) at jdk.hotspot.agent/sun.jvm.hotspot.runtime.VirtualBaseConstructor.instantiateWrapperFor(VirtualBaseConstructor.java:102) at jdk.hotspot.agent/sun.jvm.hotspot.oops.Metadata.instantiateWrapperFor(Metadata.java:74) at jdk.hotspot.agent/sun.jvm.hotspot.memory.SystemDictionary.getClassLoaderKlass(SystemDictionary.java:96) at jdk.hotspot.agent/sun.jvm.hotspot.tools.ClassLoaderStats.printClassLoaderStatistics(ClassLoaderStats.java:93) at jdk.hotspot.agent/sun.jvm.hotspot.tools.ClassLoaderStats.run(ClassLoaderStats.java:78) at jdk.hotspot.agent/sun.jvm.hotspot.tools.JMap.run(JMap.java:115) at jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.startInternal(Tool.java:262) at jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.start(Tool.java:225) at jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.execute(Tool.java:118) at jdk.hotspot.agent/sun.jvm.hotspot.tools.JMap.main(JMap.java:176) at jdk.hotspot.agent/sun.jvm.hotspot.SALauncher.runJMAP(SALauncher.java:321) at jdk.hotspot.agent/sun.jvm.hotspot.SALauncher.main(SALauncher.java:406) Has anyone experimented with LTO optimization ? And to the serviceability agent experts - any idea how to make the jdk.hotspot.agent more independent from optimization settings ? Best regards, Matthias From martin.doerr at sap.com Fri Jan 10 10:02:20 2020 From: martin.doerr at sap.com (Doerr, Martin) Date: Fri, 10 Jan 2020 10:02:20 +0000 Subject: RFR: 8236617: jtreg test containers/docker/TestMemoryAwareness.java fails after 8226575 In-Reply-To: References: Message-ID: Hi Matthias, yes, looks good to me. Thanks, Martin > -----Original Message----- > From: Baesken, Matthias > Sent: Freitag, 10. 
Januar 2020 09:51 > To: Langer, Christoph ; Bob Vandette > ; Doerr, Martin > Cc: hotspot-dev at openjdk.java.net > Subject: RE: RFR: 8236617: jtreg test > containers/docker/TestMemoryAwareness.java fails after 8226575 > > Ping ... Martin/Bob are you fine with the latest rev too ? > > Best regards, Matthias > > > Hi Christoph, thanks for the review ! > > > > Bob are you fine with the latest version ? > > > > Best regards, Matthias > > > > > > > > > > > > So I think your suggestion to return 0 in that special case in function > > > > getTotalSwapSpaceSize sounds reasonable to me ( at least better than > > > > return a large negative value ). > > > > New webrev : > > > > > > > > > > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236617.2/ > > > > > > > > > > > > From bob.vandette at oracle.com Fri Jan 10 16:56:25 2020 From: bob.vandette at oracle.com (Bob Vandette) Date: Fri, 10 Jan 2020 11:56:25 -0500 Subject: RFR: 8236617: jtreg test containers/docker/TestMemoryAwareness.java fails after 8226575 In-Reply-To: References: Message-ID: <43D28D45-87E2-42CD-BE49-3575EC1D5153@oracle.com> Yes, the fix is fine. Bob. > On Jan 10, 2020, at 3:51 AM, Baesken, Matthias wrote: > > Ping ... Martin/Bob are you fine with the latest rev too ? > > Best regards, Matthias > >> Hi Christoph, thanks for the review ! >> >> Bob are you fine with the latest version ? >> >> Best regards, Matthias >> >> >>>> >>>> So I think your suggestion to return 0 in that special case in function >>>> getTotalSwapSpaceSize sounds reasonable to me ( at least better than >>>> return a large negative value ). 
>>>> New webrev : >>>> >>>> >>>> http://cr.openjdk.java.net/~mbaesken/webrevs/8236617.2/ >>>> >>>> >>>> >
From volker.simonis at gmail.com Sat Jan 11 13:38:12 2020 From: volker.simonis at gmail.com (Volker Simonis) Date: Sat, 11 Jan 2020 14:38:12 +0100 Subject: serviceability agent : problems when using gcc LTO (link time optimization) In-Reply-To: References: Message-ID: SA pretends to know the exact types of objects in the JVM and for polymorphic objects it wants to read their vtable from the shared library. If LTO de-virtualizes methods and thus changes polymorphic to non-polymorphic types, this won't work. But if LTO can de-virtualize a type, maybe you can do that manually (and update the corresponding representation in the SA), because it doesn't seem to be needed. Notice that other places in the VM may also rely on this. E.g. 
> > The changes for this are rather small , one needs to use a recent gcc , > add -flto to the compile flags , for example > > --- a/make/autoconf/flags-cflags.m4 Wed Jan 01 03:08:45 2020 +0100 > +++ b/make/autoconf/flags-cflags.m4 Wed Jan 08 17:39:10 2020 +0100 > @@ -530,8 +530,13 @@ > fi > if test "x$TOOLCHAIN_TYPE" = xgcc; then > - TOOLCHAIN_CFLAGS_JVM="$TOOLCHAIN_CFLAGS_JVM -fcheck-new > -fstack-protector" > - TOOLCHAIN_CFLAGS_JDK="-pipe -fstack-protector" > + TOOLCHAIN_CFLAGS_JVM="$TOOLCHAIN_CFLAGS_JVM -fcheck-new > -fstack-protector -flto" > + TOOLCHAIN_CFLAGS_JDK="-pipe -fstack-protector -flto" > > .... and you have to make sure to use gcc-ar and gcc-nm instead > of ar / nm . > Build and test(s) work, however with one exception. > The serviceability tests like serviceability/sa seems to rely > heavily on the "normal" structure of libjvm.so (from what I > understand e.g. in LinuxVtblAccess it is attempted to access internal > symbols like _ZTV ). > > Errors in the sa tests look like : > > > java.lang.InternalError: Metadata does not appear to be polymorphic > at > jdk.hotspot.agent/sun.jvm.hotspot.types.basic.BasicTypeDataBase.findDynamicTypeForAddress(BasicTypeDataBase.java:279) > at > jdk.hotspot.agent/sun.jvm.hotspot.runtime.VirtualBaseConstructor.instantiateWrapperFor(VirtualBaseConstructor.java:102) > at > jdk.hotspot.agent/sun.jvm.hotspot.oops.Metadata.instantiateWrapperFor(Metadata.java:74) > at > jdk.hotspot.agent/sun.jvm.hotspot.memory.SystemDictionary.getClassLoaderKlass(SystemDictionary.java:96) > at > jdk.hotspot.agent/sun.jvm.hotspot.tools.ClassLoaderStats.printClassLoaderStatistics(ClassLoaderStats.java:93) > at > jdk.hotspot.agent/sun.jvm.hotspot.tools.ClassLoaderStats.run(ClassLoaderStats.java:78) > at jdk.hotspot.agent/sun.jvm.hotspot.tools.JMap.run(JMap.java:115) > at > jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.startInternal(Tool.java:262) > at > jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.start(Tool.java:225) > at > 
jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.execute(Tool.java:118) > at > jdk.hotspot.agent/sun.jvm.hotspot.tools.JMap.main(JMap.java:176) > at > jdk.hotspot.agent/sun.jvm.hotspot.SALauncher.runJMAP(SALauncher.java:321) > at > jdk.hotspot.agent/sun.jvm.hotspot.SALauncher.main(SALauncher.java:406) > > Has anyone experimented with LTO optimization ? > > And to the serviceability agent experts - any idea how to make the > jdk.hotspot.agent more independent from optimization settings ? > > > Best regards, Matthias > From chris.plummer at oracle.com Sat Jan 11 18:27:03 2020 From: chris.plummer at oracle.com (Chris Plummer) Date: Sat, 11 Jan 2020 10:27:03 -0800 Subject: serviceability agent : problems when using gcc LTO (link time optimization) In-Reply-To: References: Message-ID: <00f43a29-1000-e5b7-1e36-c6b9b6177d21@oracle.com> cds is also disabled for minimalVM so testing of cds with LTO probably has not been done. There are a number of features that minimalVM excludes such as jvmti, cds and SA (which I think falls under "services"), and there was very little testing done with these features individually disabled. They would all at least build (if any one was disabled) and I think heartbeat testing was done, but probably no more than that. Also various combinations were not tested, other than the one combination that minimalVM used. Search for NON_MINIMAL_FEATURES in hotspot.m4 to see which features are disabled for minimalVM. Chris On 1/11/20 5:38 AM, Volker Simonis wrote: > SA pretends to know the exact types of objects in the JVM and for > polymorphic objects it wants to read their vtable from the shared library. > If LTO de-virtulizes methods and thus changes polymorphic to > non-polymorphic types, this won't work. But if LTO can de-virtulizes a > type, maybe you can do that manually (and update the corresponding > representation in the SA), because it doesn't seem to be needed. > > Notice that other places in the VM may also rely on this. E.g. 
CDS stores > Metadata objects in the CDS archive and restores their vtable pointers when > they are loaded. On the other hand, if the CDS tests have passed, this > doesn't seem to be a problem. > > Baesken, Matthias schrieb am Fr., 10. Jan. 2020, > 11:03: > >> Hello, I recently looked into the gcc lto optimization mode (see for >> some details https://gcc.gnu.org/onlinedocs/gccint/LTO-Overview.html >> and >> http://hubicka.blogspot.com/2019/05/gcc-9-link-time-and-inter-procedural.html >> ). >> This mode can lead to more compact binaries (~10% smaller) , it also >> might bring small performance improvements but that wasn't my (main) >> goal . >> >> The changes for this are rather small , one needs to use a recent gcc , >> add -flto to the compile flags , for example >> >> --- a/make/autoconf/flags-cflags.m4 Wed Jan 01 03:08:45 2020 +0100 >> +++ b/make/autoconf/flags-cflags.m4 Wed Jan 08 17:39:10 2020 +0100 >> @@ -530,8 +530,13 @@ >> fi >> if test "x$TOOLCHAIN_TYPE" = xgcc; then >> - TOOLCHAIN_CFLAGS_JVM="$TOOLCHAIN_CFLAGS_JVM -fcheck-new >> -fstack-protector" >> - TOOLCHAIN_CFLAGS_JDK="-pipe -fstack-protector" >> + TOOLCHAIN_CFLAGS_JVM="$TOOLCHAIN_CFLAGS_JVM -fcheck-new >> -fstack-protector -flto" >> + TOOLCHAIN_CFLAGS_JDK="-pipe -fstack-protector -flto" >> >> .... and you have to make sure to use gcc-ar and gcc-nm instead >> of ar / nm . >> Build and test(s) work, however with one exception. >> The serviceability tests like serviceability/sa seems to rely >> heavily on the "normal" structure of libjvm.so (from what I >> understand e.g. in LinuxVtblAccess it is attempted to access internal >> symbols like _ZTV ). 
>> >> Errors in the sa tests look like : >> >> >> java.lang.InternalError: Metadata does not appear to be polymorphic >> at >> jdk.hotspot.agent/sun.jvm.hotspot.types.basic.BasicTypeDataBase.findDynamicTypeForAddress(BasicTypeDataBase.java:279) >> at >> jdk.hotspot.agent/sun.jvm.hotspot.runtime.VirtualBaseConstructor.instantiateWrapperFor(VirtualBaseConstructor.java:102) >> at >> jdk.hotspot.agent/sun.jvm.hotspot.oops.Metadata.instantiateWrapperFor(Metadata.java:74) >> at >> jdk.hotspot.agent/sun.jvm.hotspot.memory.SystemDictionary.getClassLoaderKlass(SystemDictionary.java:96) >> at >> jdk.hotspot.agent/sun.jvm.hotspot.tools.ClassLoaderStats.printClassLoaderStatistics(ClassLoaderStats.java:93) >> at >> jdk.hotspot.agent/sun.jvm.hotspot.tools.ClassLoaderStats.run(ClassLoaderStats.java:78) >> at jdk.hotspot.agent/sun.jvm.hotspot.tools.JMap.run(JMap.java:115) >> at >> jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.startInternal(Tool.java:262) >> at >> jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.start(Tool.java:225) >> at >> jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.execute(Tool.java:118) >> at >> jdk.hotspot.agent/sun.jvm.hotspot.tools.JMap.main(JMap.java:176) >> at >> jdk.hotspot.agent/sun.jvm.hotspot.SALauncher.runJMAP(SALauncher.java:321) >> at >> jdk.hotspot.agent/sun.jvm.hotspot.SALauncher.main(SALauncher.java:406) >> >> Has anyone experimented with LTO optimization ? >> >> And to the serviceability agent experts - any idea how to make the >> jdk.hotspot.agent more independent from optimization settings ? 
>> >> >> Best regards, Matthias >> From david.holmes at oracle.com Mon Jan 13 07:13:34 2020 From: david.holmes at oracle.com (David Holmes) Date: Mon, 13 Jan 2020 17:13:34 +1000 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() Message-ID: webrev: http://cr.openjdk.java.net/~dholmes/8235741/webrev/ bug: https://bugs.openjdk.java.net/browse/JDK-8235741 Full details in the bug report about the existing uses of javaTimeMillis(), many of which just want an elapsed time in ms and so should be using javaTimeNanos() and convert to ms. This covers areas all across the VM. Only non-simple change is in os_perf_linux.cpp (and the same code will be in os_perf_aix.cpp once it has been validated). There we are tracking an elapsed time in ms but relative to the boot time, which is seconds since the epoch. Consequently the first interval has to be calculated using javaTimeMillis, but after that we can use javaTimeNanos (using a new 'first time' captured at the same time we used javaTimeMillis). I think I have the logic right but other than through JFR this code seems unused and I have limited means of testing it. The JFR test jdk/jfr/event/os/TestThreadContextSwitches.java exercises the code but the results of running that test seems to exhibit arbitrary randomness in the rates reported - e.g. 0 to 16000Hz - both with and without my change, so not really that useful. Stefan K. suggested a gtest which I may look into - though it is frustrating to have to expend such effort to validate this. Other testing tiers 1-3. 
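[Editor's note: the clock-selection principle described above, measuring elapsed time with a monotonic clock and converting to milliseconds at the end, can be sketched in plain Java. This is only an illustration of the principle, not the HotSpot C++ code under review; `System.nanoTime()` plays the role of `os::javaTimeNanos()` and `System.currentTimeMillis()` that of `os::javaTimeMillis()`.]

```java
public class ElapsedMillis {
    public static void main(String[] args) throws InterruptedException {
        // For elapsed-time measurement, use the monotonic nanosecond clock
        // (nanoTime), NOT the wall clock (currentTimeMillis), which can jump
        // forwards or backwards when the system time is adjusted.
        long startNanos = System.nanoTime();
        Thread.sleep(50);
        // Convert the nanosecond delta to milliseconds only at the end.
        long elapsedMs = (System.nanoTime() - startNanos) / 1_000_000L;
        System.out.println("elapsed ~" + elapsedMs + " ms");
    }
}
```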
Thanks, David From shade at redhat.com Mon Jan 13 07:52:03 2020 From: shade at redhat.com (Aleksey Shipilev) Date: Mon, 13 Jan 2020 08:52:03 +0100 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: References: Message-ID: <7e4e82f2-b1e6-e79a-755e-39c86d957e9c@redhat.com> On 1/13/20 8:13 AM, David Holmes wrote: > webrev: http://cr.openjdk.java.net/~dholmes/8235741/webrev/ > bug: https://bugs.openjdk.java.net/browse/JDK-8235741 Shenandoah change looks good. From the cursory look over the other changes, those look good too. -- Thanks, -Aleksey From matthias.baesken at sap.com Mon Jan 13 08:27:28 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Mon, 13 Jan 2020 08:27:28 +0000 Subject: RFR: 8236617: jtreg test containers/docker/TestMemoryAwareness.java fails after 8226575 In-Reply-To: <43D28D45-87E2-42CD-BE49-3575EC1D5153@oracle.com> References: <43D28D45-87E2-42CD-BE49-3575EC1D5153@oracle.com> Message-ID: Thanks for the reviews ! > > Yes, the fix is fine. > > Bob. > > > > On Jan 10, 2020, at 3:51 AM, Baesken, Matthias > wrote: > > > > Ping ... Martin/Bob are you fine with the latest rev too ? > > > > Best regards, Matthias > > > >> Hi Christoph, thanks for the review ! > >> > >> Bob are you fine with the latest version ? > >> > >> Best regards, Matthias > >> > >> > >>>> > >>>> So I think your suggestion to return 0 in that special case in function > >>>> getTotalSwapSpaceSize sounds reasonable to me ( at least better than > >>>> return a large negative value ). 
> >>>> New webrev : > >>>> > >>>> > >>>> http://cr.openjdk.java.net/~mbaesken/webrevs/8236617.2/ > >>>> > >>>> > >>>> > > From matthias.baesken at sap.com Mon Jan 13 09:28:22 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Mon, 13 Jan 2020 09:28:22 +0000 Subject: serviceability agent : problems when using gcc LTO (link time optimization) In-Reply-To: <00f43a29-1000-e5b7-1e36-c6b9b6177d21@oracle.com> References: <00f43a29-1000-e5b7-1e36-c6b9b6177d21@oracle.com> Message-ID: Hello, thanks for the info - seems that for the minimal VM , lto is fine but currently not for the other (server/...) VM builds . Btw. I noticed similar issues with the SA when using link-time-gc . Looks like this eliminates the vtable info too that the SA coding ( LinuxVtblAccess class ?) wants to look into . Best regards, Matthias > cds is also disabled for minimalVM so testing of cds with LTO probably > has not been done. There are a number of features that minimalVM > excludes such as jvmti, cds and SA (which I think falls under > "services"), and there was very little testing done with these features > individually disabled. They would all at least build (if any one was > disabled) and I think heartbeat testing was done, but probably no more > than that. Also various combinations were not tested, other than the one > combination that minimalVM used. Search for NON_MINIMAL_FEATURES in > hotspot.m4 to see which features are disabled for minimalVM. > > Chris > > On 1/11/20 5:38 AM, Volker Simonis wrote: > > SA pretends to know the exact types of objects in the JVM and for > > polymorphic objects it wants to read their vtable from the shared library. > > If LTO de-virtualizes methods and thus changes polymorphic to > > non-polymorphic types, this won't work. But if LTO can de-virtualize a > > type, maybe you can do that manually (and update the corresponding > > representation in the SA), because it doesn't seem to be needed. 
> > > > Notice that other places in the VM may also rely on this. E.g. CDS stores > > Metadata objects in the CDS archive and restores their vtable pointers > when > > they are loaded. On the other hand, if the CDS tests have passed, this > > doesn't seem to be a problem. > > > > Baesken, Matthias schrieb am Fr., 10. Jan. > 2020, > > 11:03: > > > >> Hello, I recently looked into the gcc lto optimization mode (see for > >> some details https://gcc.gnu.org/onlinedocs/gccint/LTO-Overview.html > >> and > >> http://hubicka.blogspot.com/2019/05/gcc-9-link-time-and-inter- > procedural.html > >> ). > >> This mode can lead to more compact binaries (~10% smaller) , it also > >> might bring small performance improvements but that wasn't my (main) > >> goal . > >> > >> The changes for this are rather small , one needs to use a recent gcc , > >> add -flto to the compile flags , for example > >> > >> --- a/make/autoconf/flags-cflags.m4 Wed Jan 01 03:08:45 2020 +0100 > >> +++ b/make/autoconf/flags-cflags.m4 Wed Jan 08 17:39:10 2020 +0100 > >> @@ -530,8 +530,13 @@ > >> fi > >> if test "x$TOOLCHAIN_TYPE" = xgcc; then > >> - TOOLCHAIN_CFLAGS_JVM="$TOOLCHAIN_CFLAGS_JVM -fcheck-new > >> -fstack-protector" > >> - TOOLCHAIN_CFLAGS_JDK="-pipe -fstack-protector" > >> + TOOLCHAIN_CFLAGS_JVM="$TOOLCHAIN_CFLAGS_JVM -fcheck-new > >> -fstack-protector -flto" > >> + TOOLCHAIN_CFLAGS_JDK="-pipe -fstack-protector -flto" > >> > >> .... and you have to make sure to use gcc-ar and gcc-nm instead > >> of ar / nm . > >> Build and test(s) work, however with one exception. > >> The serviceability tests like serviceability/sa seems to rely > >> heavily on the "normal" structure of libjvm.so (from what I > >> understand e.g. in LinuxVtblAccess it is attempted to access internal > >> symbols like _ZTV ). 
> >> > >> Errors in the sa tests look like : > >> > >> > >> java.lang.InternalError: Metadata does not appear to be polymorphic > >> at > >> > jdk.hotspot.agent/sun.jvm.hotspot.types.basic.BasicTypeDataBase.findDyna > micTypeForAddress(BasicTypeDataBase.java:279) > >> at > >> > jdk.hotspot.agent/sun.jvm.hotspot.runtime.VirtualBaseConstructor.instanti > ateWrapperFor(VirtualBaseConstructor.java:102) > >> at > >> > jdk.hotspot.agent/sun.jvm.hotspot.oops.Metadata.instantiateWrapperFor( > Metadata.java:74) > >> at > >> > jdk.hotspot.agent/sun.jvm.hotspot.memory.SystemDictionary.getClassLoad > erKlass(SystemDictionary.java:96) > >> at > >> > jdk.hotspot.agent/sun.jvm.hotspot.tools.ClassLoaderStats.printClassLoaderS > tatistics(ClassLoaderStats.java:93) > >> at > >> > jdk.hotspot.agent/sun.jvm.hotspot.tools.ClassLoaderStats.run(ClassLoaderS > tats.java:78) > >> at > jdk.hotspot.agent/sun.jvm.hotspot.tools.JMap.run(JMap.java:115) > >> at > >> jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.startInternal(Tool.java:262) > >> at > >> jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.start(Tool.java:225) > >> at > >> jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.execute(Tool.java:118) > >> at > >> jdk.hotspot.agent/sun.jvm.hotspot.tools.JMap.main(JMap.java:176) > >> at > >> > jdk.hotspot.agent/sun.jvm.hotspot.SALauncher.runJMAP(SALauncher.java:3 > 21) > >> at > >> > jdk.hotspot.agent/sun.jvm.hotspot.SALauncher.main(SALauncher.java:406) > >> > >> Has anyone experimented with LTO optimization ? > >> > >> And to the serviceability agent experts - any idea how to make the > >> jdk.hotspot.agent more independent from optimization settings ? 
> >> > >> > >> Best regards, Matthias > >> From christoph.langer at sap.com Mon Jan 13 10:23:38 2020 From: christoph.langer at sap.com (Langer, Christoph) Date: Mon, 13 Jan 2020 10:23:38 +0000 Subject: RFR (XS): 8237008: Exclude serviceability/sa/TestInstanceKlassSizeForInterface.java on linuxppc64 and linuxppc64le Message-ID: Hi, after JDK-8232759 "Remove GC.class_stats" [0] was pushed, we see the test "serviceability/sa/TestInstanceKlassSizeForInterface.java" failing on linuxppc64 and linuxppc64le the same way as "serviceability/sa/TestInstanceKlassSize.java". Hence, I'm requesting the same exclusion of TestInstanceKlassSizeForInterface (referring to JDK-8230664 [1] for resolution). Bug: https://bugs.openjdk.java.net/browse/JDK-8237008 Webrev: http://cr.openjdk.java.net/~clanger/webrevs/8237008.0/ Thanks Christoph [0] https://hg.openjdk.java.net/jdk/jdk/rev/d8f6e926cedc [1] https://bugs.openjdk.java.net/browse/JDK-8230664 From martin.doerr at sap.com Mon Jan 13 10:36:34 2020 From: martin.doerr at sap.com (Doerr, Martin) Date: Mon, 13 Jan 2020 10:36:34 +0000 Subject: RFR (XS): 8237008: Exclude serviceability/sa/TestInstanceKlassSizeForInterface.java on linuxppc64 and linuxppc64le In-Reply-To: References: Message-ID: Hi Christoph, looks good to me. Thanks, Martin From: Langer, Christoph Sent: Montag, 13. Januar 2020 11:24 To: hotspot-dev at openjdk.java.net; Doerr, Martin Cc: OpenJDK Serviceability Subject: RFR (XS): 8237008: Exclude serviceability/sa/TestInstanceKlassSizeForInterface.java on linuxppc64 and linuxppc64le Hi, after JDK-8232759 "Remove GC.class_stats" [0] was pushed, we see the test "serviceability/sa/TestInstanceKlassSizeForInterface.java" failing on linuxppc64 and linuxppc64le the same way as "serviceability/sa/TestInstanceKlassSize.java". Hence, I'm requesting the same exclusion of TestInstanceKlassSizeForInterface (referring to JDK-8230664 [1] for resolution). 
Bug: https://bugs.openjdk.java.net/browse/JDK-8237008 Webrev: http://cr.openjdk.java.net/~clanger/webrevs/8237008.0/ Thanks Christoph [0] https://hg.openjdk.java.net/jdk/jdk/rev/d8f6e926cedc [1] https://bugs.openjdk.java.net/browse/JDK-8230664 From david.holmes at oracle.com Mon Jan 13 12:52:26 2020 From: david.holmes at oracle.com (David Holmes) Date: Mon, 13 Jan 2020 22:52:26 +1000 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: <7e4e82f2-b1e6-e79a-755e-39c86d957e9c@redhat.com> References: <7e4e82f2-b1e6-e79a-755e-39c86d957e9c@redhat.com> Message-ID: <2ca62f20-8748-c043-d3f1-ab3f3a78a2a6@oracle.com> On 13/01/2020 5:52 pm, Aleksey Shipilev wrote: > On 1/13/20 8:13 AM, David Holmes wrote: >> webrev: http://cr.openjdk.java.net/~dholmes/8235741/webrev/ >> bug: https://bugs.openjdk.java.net/browse/JDK-8235741 > > Shenandoah change looks good. > > From the cursory look over the other changes, those look good too. Thanks Aleksey! David From david.holmes at oracle.com Mon Jan 13 12:57:00 2020 From: david.holmes at oracle.com (David Holmes) Date: Mon, 13 Jan 2020 22:57:00 +1000 Subject: RFR (XS): 8237008: Exclude serviceability/sa/TestInstanceKlassSizeForInterface.java on linuxppc64 and linuxppc64le In-Reply-To: References: Message-ID: <8a6a9f3d-88ab-9733-7c7f-f822da148d1e@oracle.com> Hi Christoph, I think those tests are currently failing on all platforms - see JDK-8236917. The failures after GC.class_stats removal are unrelated to the failures reported in 8230664 AFAICS. David On 13/01/2020 8:23 pm, Langer, Christoph wrote: > Hi, > > after JDK-8232759 "Remove GC.class_stats" [0] was pushed, we see the > test "serviceability/sa/TestInstanceKlassSizeForInterface.java" failing > on linuxppc64 and linuxppc64le the same way as > "serviceability/sa/TestInstanceKlassSize.java". Hence, I'm requesting > the same exclusion of TestInstanceKlassSizeForInterface (referring to > JDK-8230664 [1] for resolution). 
> > Bug: https://bugs.openjdk.java.net/browse/JDK-8237008 > > Webrev: http://cr.openjdk.java.net/~clanger/webrevs/8237008.0/ > > Thanks > > Christoph > > [0] https://hg.openjdk.java.net/jdk/jdk/rev/d8f6e926cedc > > [1] https://bugs.openjdk.java.net/browse/JDK-8230664 > From robbin.ehn at oracle.com Mon Jan 13 13:08:21 2020 From: robbin.ehn at oracle.com (Robbin Ehn) Date: Mon, 13 Jan 2020 14:08:21 +0100 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: References: Message-ID: <75f4cd8d-d291-2cf5-b0dd-91a1d76a6130@oracle.com> Hi David, looks good, thanks! /Robbin On 1/13/20 8:13 AM, David Holmes wrote: > webrev: http://cr.openjdk.java.net/~dholmes/8235741/webrev/ > bug: https://bugs.openjdk.java.net/browse/JDK-8235741 > > Full details in the bug report about the existing uses of javaTimeMillis(), many > of which just want an elapsed time in ms and so should be using javaTimeNanos() > and convert to ms. This covers areas all across the VM. > > Only non-simple change is in os_perf_linux.cpp (and the same code will be in > os_perf_aix.cpp once it has been validated). There we are tracking an elapsed > time in ms but relative to the boot time, which is seconds since the epoch. > Consequently the first interval has to be calculated using javaTimeMillis, but > after that we can use javaTimeNanos (using a new 'first time' captured at the > same time we used javaTimeMillis). I think I have the logic right but other than > through JFR this code seems unused and I have limited means of testing it. The > JFR test jdk/jfr/event/os/TestThreadContextSwitches.java exercises the code but > the results of running that test seems to exhibit arbitrary randomness in the > rates reported - e.g. 0 to 16000Hz - both with and without my change, so not > really that useful. Stefan K. suggested a gtest which I may look into - though > it is frustrating to have to expend such effort to validate this. > > Other testing tiers 1-3. 
> > Thanks, > David From christoph.langer at sap.com Mon Jan 13 13:26:46 2020 From: christoph.langer at sap.com (Langer, Christoph) Date: Mon, 13 Jan 2020 13:26:46 +0000 Subject: RFR (XS): 8237008: Exclude serviceability/sa/TestInstanceKlassSizeForInterface.java on linuxppc64 and linuxppc64le In-Reply-To: <8a6a9f3d-88ab-9733-7c7f-f822da148d1e@oracle.com> References: <8a6a9f3d-88ab-9733-7c7f-f822da148d1e@oracle.com> Message-ID: Hi David, thanks for the heads up. However, in our CI, these tests pass on all platforms except linuxppc64/linuxppc64le consistently. I think I'll push the exclusion and once JDK-8236917 has been resolved I'll try on the ppc linuxes again... Best regards Christoph > -----Original Message----- > From: David Holmes > Sent: Montag, 13. Januar 2020 13:57 > To: Langer, Christoph ; hotspot- > dev at openjdk.java.net; Doerr, Martin > Cc: OpenJDK Serviceability > Subject: Re: RFR (XS): 8237008: Exclude > serviceability/sa/TestInstanceKlassSizeForInterface.java on linuxppc64 and > linuxppc64le > > Hi Christoph, > > I think those tests are currnetly failing on all platforms - see > JDK-8236917. The failures after GC.class_stats removal are unrelated to > the failures reported in 8230664 AFAICS. > > David > > On 13/01/2020 8:23 pm, Langer, Christoph wrote: > > Hi, > > > > after JDK-8232759 "Remove GC.class_stats" [0] was pushed, we see the > > test "serviceability/sa/TestInstanceKlassSizeForInterface.java" failing > > on linuxppc64 and linuxppc64le the same way as > > "serviceability/sa/TestInstanceKlassSize.java". Hence, I'm requesting > > the same exclusion of TestInstanceKlassSizeForInterface (referring to > > JDK-8230664 [1] for resolution). 
> > > > Bug: https://bugs.openjdk.java.net/browse/JDK-8237008 > > > > Webrev: http://cr.openjdk.java.net/~clanger/webrevs/8237008.0/ > > > > Thanks > > > > Christoph > > > > [0] https://hg.openjdk.java.net/jdk/jdk/rev/d8f6e926cedc > > > > [1] https://bugs.openjdk.java.net/browse/JDK-8230664 > > From coleen.phillimore at oracle.com Mon Jan 13 14:22:07 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Mon, 13 Jan 2020 09:22:07 -0500 Subject: RFR (XS): 8237008: Exclude serviceability/sa/TestInstanceKlassSizeForInterface.java on linuxppc64 and linuxppc64le In-Reply-To: References: <8a6a9f3d-88ab-9733-7c7f-f822da148d1e@oracle.com> Message-ID: <60869e5e-d0ae-86e7-b514-897b5ae0bdc4@oracle.com> Hi, I didn't see this with my filtering. Do you know why it fails for ppcle? It fails for us on all platforms because test.vm.opts isn't set in our CI jobs. Thanks, Coleen On 1/13/20 8:26 AM, Langer, Christoph wrote: > Hi David, > > thanks for the heads up. However, in our CI, these tests pass on all platforms except linuxppc64/linuxppc64le consistently. > > I think I'll push the exclusion and once JDK-8236917 has been resolved I'll try on the ppc linuxes again... > > Best regards > Christoph > >> -----Original Message----- >> From: David Holmes >> Sent: Montag, 13. Januar 2020 13:57 >> To: Langer, Christoph ; hotspot- >> dev at openjdk.java.net; Doerr, Martin >> Cc: OpenJDK Serviceability >> Subject: Re: RFR (XS): 8237008: Exclude >> serviceability/sa/TestInstanceKlassSizeForInterface.java on linuxppc64 and >> linuxppc64le >> >> Hi Christoph, >> >> I think those tests are currnetly failing on all platforms - see >> JDK-8236917. The failures after GC.class_stats removal are unrelated to >> the failures reported in 8230664 AFAICS. 
>> >> David >> >> On 13/01/2020 8:23 pm, Langer, Christoph wrote: >>> Hi, >>> >>> after JDK-8232759 "Remove GC.class_stats" [0] was pushed, we see the >>> test "serviceability/sa/TestInstanceKlassSizeForInterface.java" failing >>> on linuxppc64 and linuxppc64le the same way as >>> "serviceability/sa/TestInstanceKlassSize.java". Hence, I'm requesting >>> the same exclusion of TestInstanceKlassSizeForInterface (referring to >>> JDK-8230664 [1] for resolution). >>> >>> Bug: https://bugs.openjdk.java.net/browse/JDK-8237008 >>> >>> Webrev: http://cr.openjdk.java.net/~clanger/webrevs/8237008.0/ >>> >>> Thanks >>> >>> Christoph >>> >>> [0] https://hg.openjdk.java.net/jdk/jdk/rev/d8f6e926cedc >>> >>> [1] https://bugs.openjdk.java.net/browse/JDK-8230664 >>> From christoph.langer at sap.com Mon Jan 13 14:49:34 2020 From: christoph.langer at sap.com (Langer, Christoph) Date: Mon, 13 Jan 2020 14:49:34 +0000 Subject: RFR (XS): 8237008: Exclude serviceability/sa/TestInstanceKlassSizeForInterface.java on linuxppc64 and linuxppc64le In-Reply-To: <60869e5e-d0ae-86e7-b514-897b5ae0bdc4@oracle.com> References: <8a6a9f3d-88ab-9733-7c7f-f822da148d1e@oracle.com> <60869e5e-d0ae-86e7-b514-897b5ae0bdc4@oracle.com> Message-ID: Hi Coleen, why it's failing on ppc or ppcle was analyzed in https://bugs.openjdk.java.net/browse/JDK-8230664. You can read in the description: The test retrieves the size of Java classes by 2 different APIs and expects the result to be equal: - SA reports the size of the reserved memory. Rounded up to a multiple of 16 on PPC64. - Jcmd GC.class_stats reports the size of the space needed for a number of 8 Byte blocks. (The number is from an internal statistic.) So there may be a difference of 8 Bytes on platforms which reserve memory 16 Byte wise. I assume that the functionality from Jcmd GC.class_stats was replaced in your change but goes back to the same statistics. In our tests we set test.vm.opts, so I guess that's why we don't see failures on other platforms. 
Best regards Christoph > -----Original Message----- > From: coleen.phillimore at oracle.com > Sent: Montag, 13. Januar 2020 15:22 > To: Langer, Christoph ; David Holmes > ; hotspot-dev at openjdk.java.net; Doerr, Martin > > Cc: OpenJDK Serviceability > Subject: Re: RFR (XS): 8237008: Exclude > serviceability/sa/TestInstanceKlassSizeForInterface.java on linuxppc64 and > linuxppc64le > > Hi, I didn't see this with my filtering.? Do you know why it fails for > ppcle? > > It fails for us on all platforms because test.vm.opts isn't set in our > CI jobs. > > Thanks, > Coleen > > On 1/13/20 8:26 AM, Langer, Christoph wrote: > > Hi David, > > > > thanks for the heads up. However, in our CI, these tests pass on all > platforms except linuxppc64/linuxppc64le consistently. > > > > I think I'll push the exclusion and once JDK-8236917 has been resolved I'll > try on the ppc linuxes again... > > > > Best regards > > Christoph > > > >> -----Original Message----- > >> From: David Holmes > >> Sent: Montag, 13. Januar 2020 13:57 > >> To: Langer, Christoph ; hotspot- > >> dev at openjdk.java.net; Doerr, Martin > >> Cc: OpenJDK Serviceability > >> Subject: Re: RFR (XS): 8237008: Exclude > >> serviceability/sa/TestInstanceKlassSizeForInterface.java on linuxppc64 > and > >> linuxppc64le > >> > >> Hi Christoph, > >> > >> I think those tests are currnetly failing on all platforms - see > >> JDK-8236917. The failures after GC.class_stats removal are unrelated to > >> the failures reported in 8230664 AFAICS. > >> > >> David > >> > >> On 13/01/2020 8:23 pm, Langer, Christoph wrote: > >>> Hi, > >>> > >>> after JDK-8232759 "Remove GC.class_stats" [0] was pushed, we see the > >>> test "serviceability/sa/TestInstanceKlassSizeForInterface.java" failing > >>> on linuxppc64 and linuxppc64le the same way as > >>> "serviceability/sa/TestInstanceKlassSize.java". Hence, I'm requesting > >>> the same exclusion of TestInstanceKlassSizeForInterface (referring to > >>> JDK-8230664 [1] for resolution). 
> >>> > >>> Bug: https://bugs.openjdk.java.net/browse/JDK-8237008 > >>> > >>> Webrev: http://cr.openjdk.java.net/~clanger/webrevs/8237008.0/ > >>> > >>> Thanks > >>> > >>> Christoph > >>> > >>> [0] https://hg.openjdk.java.net/jdk/jdk/rev/d8f6e926cedc > >>> > >>> [1] https://bugs.openjdk.java.net/browse/JDK-8230664 > >>> From ioi.lam at oracle.com Mon Jan 13 17:46:02 2020 From: ioi.lam at oracle.com (Ioi Lam) Date: Mon, 13 Jan 2020 09:46:02 -0800 Subject: RFR (XS): 8237008: Exclude serviceability/sa/TestInstanceKlassSizeForInterface.java on linuxppc64 and linuxppc64le In-Reply-To: <60869e5e-d0ae-86e7-b514-897b5ae0bdc4@oracle.com> References: <8a6a9f3d-88ab-9733-7c7f-f822da148d1e@oracle.com> <60869e5e-d0ae-86e7-b514-897b5ae0bdc4@oracle.com> Message-ID: <75785e0b-adef-b61c-707d-5ff9604711b1@oracle.com> This test passed on linux/x64 on my machine, but failed on our linux/x64 CI as well. I think there's a fundamental bug that's triggered only by certain CPU and/or environment combinations. So this test should be excluded on all platforms for now until the fundamental issue is fixed. Thanks - Ioi On 1/13/20 6:22 AM, coleen.phillimore at oracle.com wrote: > Hi, I didn't see this with my filtering.? Do you know why it fails for > ppcle? > > It fails for us on all platforms because test.vm.opts isn't set in our > CI jobs. > > Thanks, > Coleen > > On 1/13/20 8:26 AM, Langer, Christoph wrote: >> Hi David, >> >> thanks for the heads up. However, in our CI, these tests pass on all >> platforms except linuxppc64/linuxppc64le consistently. >> >> I think I'll push the exclusion and once JDK-8236917 has been >> resolved I'll try on the ppc linuxes again... >> >> Best regards >> Christoph >> >>> -----Original Message----- >>> From: David Holmes >>> Sent: Montag, 13. 
Januar 2020 13:57 >>> To: Langer, Christoph ; hotspot- >>> dev at openjdk.java.net; Doerr, Martin >>> Cc: OpenJDK Serviceability >>> Subject: Re: RFR (XS): 8237008: Exclude >>> serviceability/sa/TestInstanceKlassSizeForInterface.java on >>> linuxppc64 and >>> linuxppc64le >>> >>> Hi Christoph, >>> >>> I think those tests are currnetly failing on all platforms - see >>> JDK-8236917. The failures after GC.class_stats removal are unrelated to >>> the failures reported in 8230664 AFAICS. >>> >>> David >>> >>> On 13/01/2020 8:23 pm, Langer, Christoph wrote: >>>> Hi, >>>> >>>> after JDK-8232759 "Remove GC.class_stats" [0] was pushed, we see the >>>> test "serviceability/sa/TestInstanceKlassSizeForInterface.java" >>>> failing >>>> on linuxppc64 and linuxppc64le the same way as >>>> "serviceability/sa/TestInstanceKlassSize.java". Hence, I'm requesting >>>> the same exclusion of TestInstanceKlassSizeForInterface (referring to >>>> JDK-8230664 [1] for resolution). >>>> >>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8237008 >>>> >>>> Webrev: http://cr.openjdk.java.net/~clanger/webrevs/8237008.0/ >>>> >>>> Thanks >>>> >>>> Christoph >>>> >>>> [0] https://hg.openjdk.java.net/jdk/jdk/rev/d8f6e926cedc >>>> >>>> [1] https://bugs.openjdk.java.net/browse/JDK-8230664 >>>> > From david.holmes at oracle.com Mon Jan 13 21:35:56 2020 From: david.holmes at oracle.com (David Holmes) Date: Tue, 14 Jan 2020 07:35:56 +1000 Subject: RFR (XS): 8237008: Exclude serviceability/sa/TestInstanceKlassSizeForInterface.java on linuxppc64 and linuxppc64le In-Reply-To: References: <8a6a9f3d-88ab-9733-7c7f-f822da148d1e@oracle.com> Message-ID: <0d2c6e0c-9f68-c478-ebfd-69de072d0e80@oracle.com> On 13/01/2020 11:26 pm, Langer, Christoph wrote: > Hi David, > > thanks for the heads up. However, in our CI, these tests pass on all platforms except linuxppc64/linuxppc64le consistently. Sorry I wasn't specific enough. 
This test fails on all platforms in some configuration due to a problem with test.vm.opts/test.java.opts not getting passed through as expected and causing a problem when CDS is disabled. All in all the failure behaviour is very confusing. :( David > I think I'll push the exclusion and once JDK-8236917 has been resolved I'll try on the ppc linuxes again... > > Best regards > Christoph > >> -----Original Message----- >> From: David Holmes >> Sent: Montag, 13. Januar 2020 13:57 >> To: Langer, Christoph ; hotspot- >> dev at openjdk.java.net; Doerr, Martin >> Cc: OpenJDK Serviceability >> Subject: Re: RFR (XS): 8237008: Exclude >> serviceability/sa/TestInstanceKlassSizeForInterface.java on linuxppc64 and >> linuxppc64le >> >> Hi Christoph, >> >> I think those tests are currnetly failing on all platforms - see >> JDK-8236917. The failures after GC.class_stats removal are unrelated to >> the failures reported in 8230664 AFAICS. >> >> David >> >> On 13/01/2020 8:23 pm, Langer, Christoph wrote: >>> Hi, >>> >>> after JDK-8232759 "Remove GC.class_stats" [0] was pushed, we see the >>> test "serviceability/sa/TestInstanceKlassSizeForInterface.java" failing >>> on linuxppc64 and linuxppc64le the same way as >>> "serviceability/sa/TestInstanceKlassSize.java". Hence, I'm requesting >>> the same exclusion of TestInstanceKlassSizeForInterface (referring to >>> JDK-8230664 [1] for resolution). 
>>> >>> Bug: https://bugs.openjdk.java.net/browse/JDK-8237008 >>> >>> Webrev: http://cr.openjdk.java.net/~clanger/webrevs/8237008.0/ >>> >>> Thanks >>> >>> Christoph >>> >>> [0] https://hg.openjdk.java.net/jdk/jdk/rev/d8f6e926cedc >>> >>> [1] https://bugs.openjdk.java.net/browse/JDK-8230664 >>> From david.holmes at oracle.com Mon Jan 13 22:31:12 2020 From: david.holmes at oracle.com (David Holmes) Date: Tue, 14 Jan 2020 08:31:12 +1000 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: <75f4cd8d-d291-2cf5-b0dd-91a1d76a6130@oracle.com> References: <75f4cd8d-d291-2cf5-b0dd-91a1d76a6130@oracle.com> Message-ID: <9e2804c6-3441-2e0a-5c38-7e5ecc96ecae@oracle.com> Hi Robbin, Thanks for looking at this. I'm going to wait to get more eyes on this as it does cover various parts of the VM. David On 13/01/2020 11:08 pm, Robbin Ehn wrote: > Hi David, looks good, thanks! > > /Robbin > > On 1/13/20 8:13 AM, David Holmes wrote: >> webrev: http://cr.openjdk.java.net/~dholmes/8235741/webrev/ >> bug: https://bugs.openjdk.java.net/browse/JDK-8235741 >> >> Full details in the bug report about the existing uses of >> javaTimeMillis(), many of which just want an elapsed time in ms and so >> should be using javaTimeNanos() and convert to ms. This covers areas >> all across the VM. >> >> Only non-simple change is in os_perf_linux.cpp (and the same code will >> be in os_perf_aix.cpp once it has been validated). There we are >> tracking an elapsed time in ms but relative to the boot time, which is >> seconds since the epoch. Consequently the first interval has to be >> calculated using javaTimeMillis, but after that we can use >> javaTimeNanos (using a new 'first time' captured at the same time we >> used javaTimeMillis). I think I have the logic right but other than >> through JFR this code seems unused and I have limited means of testing >> it. 
The JFR test jdk/jfr/event/os/TestThreadContextSwitches.java >> exercises the code but the results of running that test seems to >> exhibit arbitrary randomness in the rates reported - e.g. 0 to 16000Hz >> - both with and without my change, so not really that useful. Stefan >> K. suggested a gtest which I may look into - though it is frustrating >> to have to expend such effort to validate this. >> >> Other testing tiers 1-3. >> >> Thanks, >> David From bwmat.reloaded at gmail.com Mon Jan 13 23:56:28 2020 From: bwmat.reloaded at gmail.com (Bwmat .) Date: Mon, 13 Jan 2020 15:56:28 -0800 Subject: Need some help in debugging jvm corruption/crashes in C++/JNI code In-Reply-To: References: Message-ID: Hello, I just joined this mailing list to hopefully get a bit of help from someone familiar with hotspot. We found a crash in a C++ library we have that uses JNI, and I've been trying to debug it for a couple of days. I'm working on windows, and I managed to get a TTD (time travel debugging, a feature in the preview version of WinDBG) trace of when the problem occurs, and I built myself a debug JVM from source to help debug, but I'm completely unfamiliar with hotspot. I've also attached a log file created one time that the crash occurred (the symptoms aren't always the same, sometimes it doesn't crash at all, but just return an invalid value from a method call, so my test app quits early, sometimes it crashes from an assertion from within hotspot, not always the same one). One weird thing is that I found the issue while doing some testing, and a certain SQL query triggers the issue, but another, analogous query does not. This is significant, since all of the SQL processing is written in Java, so very little changes in the native part of the application between the case that works and the case that fails, which makes me think it's a jvm issue. 
I'm currently debugging a java 8 openjdk (since I used an article about how to build it that used that version, and that's what we're using internally anyways, but I also reproduced the issue on a java 13 oracle jvm, so if it IS a JVM bug, doesn't seem like it's been fixed yet. I'm quite aware that it's probably still our fault somehow though.) The issue seems to somehow be caused by the wrong java method being invoked by a JNI call, or maybe the right method, but on the wrong object. Early on, while debugging using Eclipse's remote debugger, I found that, right before crashing, I was able to hit a breakpoint in a method on a type T, but the debugger told me that the "this" reference was actually of type String! Later on, while debugging the internals of hotspot in my TTD trace (which allowed me to get quite far without much understanding of what's going on), I found that the crash in the trace occurs when the wrong method is invoked on an object, returning a long, which is later interpreted as a string reference, and the jvm notices it isn't actually a string when the native code tries to get the string length. To lay out the situation without going into too many details of our code, we have an interface IColumn, that has a few methods that get attributes about a SQL column. Some methods return primitives, some return strings. The native code is trying to call IColumn.GetLabel(), so it passes a jmethodID that was generated via a successful call to JNI's GetMethodID, passing the class object of IColumn. For the receiver of the call, it passes an instance of a type which implements IColumn, and has no superclasses, but is a private static inner class. In jni_invoke_nonstatic(), it goes into the "else if (!m->has_itable_index())" 
branch, which seems wrong from what little I gleaned from https://wiki.openjdk.java.net/display/HotSpot/InterfaceCalls

It then resolves the wrong method, getDisplaySize(), which returns a long (I note that getDisplaySize() is the method declared directly before getLabel() in the declaration of IColumn, so maybe an off-by-one error somewhere?), and then invokes it, dooming the process to a future crash.

If I go backwards in time to when the _vtable_index field of the Method (the one originally resolved using GetMethodID, and later passed into CallObjectMethodV) is set, it's in KlassVtable::initialize_vtable(), on the line "mh()->set_vtable_index(initialized); // set primary vtable index", and that's the last time it's set.

Am I right in thinking that it should have instead been set to a negative value (so that it was an "itable index"), since it's an interface method? If so, where would that happen? What else should I check?

Any suggestions are welcome; thanks in advance.

From david.holmes at oracle.com Tue Jan 14 02:22:15 2020 From: david.holmes at oracle.com (David Holmes) Date: Tue, 14 Jan 2020 12:22:15 +1000 Subject: Need some help in debugging jvm corruption/crashes in C++/JNI code In-Reply-To: References: Message-ID: <83b068d7-464c-56ac-e0e0-56db59c31648@oracle.com> Hi,

On 14/01/2020 9:56 am, Bwmat . wrote:
> Hello,
>
> I just joined this mailing list to hopefully get a bit of help from someone
> familiar with hotspot.

These mailing lists are not really for end user application debugging help. However, as you have deep dived into hotspot internals ... :)

In a debug build you can use itable/vtable logging to see how the itable/vtable is constructed and see if anything odd appears there.

-Xlog:itables*=trace,vtables*=trace

You can also try running with -Xcheck:jni.

FYI most file attachments are stripped by the mailing list so your log did not get included.
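The two diagnostics suggested above can be combined into a single launch line. A minimal sketch, where "MyJniApp" is a placeholder main class (not from the thread) and the -Xlog selectors only produce output on a debug build of the JVM:

```shell
# Hypothetical launch line combining the suggested diagnostics.
# "MyJniApp" is a placeholder; the itable/vtable trace selectors
# need a debug/fastdebug JVM build to print anything.
DIAG_FLAGS='-Xlog:itables*=trace,vtables*=trace -Xcheck:jni'
echo "java $DIAG_FLAGS MyJniApp"
# prints: java -Xlog:itables*=trace,vtables*=trace -Xcheck:jni MyJniApp
```

-Xcheck:jni works on product builds as well, so it is the cheaper first step when a debug build is not at hand.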
Cheers,
David

> [original message quoted in full; trimmed]

From bwmat.reloaded at gmail.com Tue Jan 14 02:35:43 2020 From: bwmat.reloaded at gmail.com (Bwmat .) Date: Mon, 13 Jan 2020 18:35:43 -0800 Subject: Need some help in debugging jvm corruption/crashes in C++/JNI code In-Reply-To: <83b068d7-464c-56ac-e0e0-56db59c31648@oracle.com> References: <83b068d7-464c-56ac-e0e0-56db59c31648@oracle.com> Message-ID: Thanks for the response, but I JUST figured it out.

It was our fault, of course. I was staring at the trace, wondering why it wasn't doing an interface call, when I realized that what I had described as IColumn wasn't actually an interface, it was a class! I had assumed we had properly written the compatibility layer to use the interface, but the *real* interface is actually rarely used, with the aforementioned class being used in the vast majority of cases... (and I haven't used this code much for a couple of years). Not here though! Really kicking myself for not realizing this sooner.

Kinda wish that -Xcheck:jni checked this (or the debug assertions in hotspot, though maybe they do in a newer version). Has this been considered, anyone know?

Thanks again,
Matthew w.

On Mon., Jan. 13, 2020, 6:22 p.m. David Holmes wrote:
> [quoted text trimmed]

From david.holmes at oracle.com Tue Jan 14 04:21:33 2020 From: david.holmes at oracle.com (David Holmes) Date: Tue, 14 Jan 2020 14:21:33 +1000 Subject: Need some help in debugging jvm corruption/crashes in C++/JNI code In-Reply-To: References: <83b068d7-464c-56ac-e0e0-56db59c31648@oracle.com> Message-ID: <1f77a5f2-9056-d14e-ca95-3d4c68ce5397@oracle.com> On 14/01/2020 12:35 pm, Bwmat . wrote:
> [quoted text trimmed]

Funny you should mention that: https://bugs.openjdk.java.net/browse/JDK-8229900 , a change in JDK 14, should provide the receiver check you were looking for.
Cheers,
David

> [earlier messages quoted in full; trimmed]

From magnus.ihse.bursie at oracle.com Tue Jan 14 11:27:52 2020 From: magnus.ihse.bursie at oracle.com (Magnus Ihse Bursie) Date: Tue, 14 Jan 2020 12:27:52 +0100 Subject: serviceability agent : problems when using gcc LTO (link time optimization) In-Reply-To: References: Message-ID: <131b1189-d0e8-3d3e-6137-81ffc8eeeb84@oracle.com> On 2020-01-10 11:01, Baesken, Matthias wrote:
> Hello, I recently looked into the gcc lto optimization mode (see for some details https://gcc.gnu.org/onlinedocs/gccint/LTO-Overview.html and http://hubicka.blogspot.com/2019/05/gcc-9-link-time-and-inter-procedural.html ).
> This mode can lead to more compact binaries (~10% smaller); it also might bring small performance improvements, but that wasn't my (main) goal.
> The changes for this are rather small: one needs to use a recent gcc and add -flto to the compile flags, for example
>
> --- a/make/autoconf/flags-cflags.m4 Wed Jan 01 03:08:45 2020 +0100
> +++ b/make/autoconf/flags-cflags.m4 Wed Jan 08 17:39:10 2020 +0100
> @@ -530,8 +530,13 @@
>    fi
>    if test "x$TOOLCHAIN_TYPE" = xgcc; then
> -    TOOLCHAIN_CFLAGS_JVM="$TOOLCHAIN_CFLAGS_JVM -fcheck-new -fstack-protector"
> -    TOOLCHAIN_CFLAGS_JDK="-pipe -fstack-protector"
> +    TOOLCHAIN_CFLAGS_JVM="$TOOLCHAIN_CFLAGS_JVM -fcheck-new -fstack-protector -flto"
> +    TOOLCHAIN_CFLAGS_JDK="-pipe -fstack-protector -flto"
>
> ... and you have to make sure to use gcc-ar and gcc-nm instead of ar / nm.
>
> Build and test(s) work, however, with one exception: the serviceability tests like serviceability/sa seem to rely heavily on the "normal" structure of libjvm.so (from what I understand, e.g. in LinuxVtblAccess it is attempted to access internal symbols like _ZTV).
>
> Errors in the sa tests look like:
>
> java.lang.InternalError: Metadata does not appear to be polymorphic
> at jdk.hotspot.agent/sun.jvm.hotspot.types.basic.BasicTypeDataBase.findDynamicTypeForAddress(BasicTypeDataBase.java:279)
> at jdk.hotspot.agent/sun.jvm.hotspot.runtime.VirtualBaseConstructor.instantiateWrapperFor(VirtualBaseConstructor.java:102)
> at jdk.hotspot.agent/sun.jvm.hotspot.oops.Metadata.instantiateWrapperFor(Metadata.java:74)
> at jdk.hotspot.agent/sun.jvm.hotspot.memory.SystemDictionary.getClassLoaderKlass(SystemDictionary.java:96)
> at jdk.hotspot.agent/sun.jvm.hotspot.tools.ClassLoaderStats.printClassLoaderStatistics(ClassLoaderStats.java:93)
> at jdk.hotspot.agent/sun.jvm.hotspot.tools.ClassLoaderStats.run(ClassLoaderStats.java:78)
> at jdk.hotspot.agent/sun.jvm.hotspot.tools.JMap.run(JMap.java:115)
> at jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.startInternal(Tool.java:262)
> at jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.start(Tool.java:225)
> at
jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.execute(Tool.java:118)
> at jdk.hotspot.agent/sun.jvm.hotspot.tools.JMap.main(JMap.java:176)
> at jdk.hotspot.agent/sun.jvm.hotspot.SALauncher.runJMAP(SALauncher.java:321)
> at jdk.hotspot.agent/sun.jvm.hotspot.SALauncher.main(SALauncher.java:406)
>
> Has anyone experimented with LTO optimization ?

Hi Matthias,

We used to have LTO enabled on the old, closed-source Oracle arm-32 builds. There is still a "link-time-opt" JVM feature present; afaik it still works and adds the -flto flag. The main drawback of this is the *extremely* long link times of libjvm.so.

I don't think serviceability was ever supported for that platform, so I'm not surprised this does not work.

/Magnus

> And to the serviceability agent experts - any idea how to make the jdk.hotspot.agent more independent from optimization settings ?
>
> Best regards, Matthias

From matthias.baesken at sap.com Tue Jan 14 12:49:43 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Tue, 14 Jan 2020 12:49:43 +0000 Subject: serviceability agent : problems when using gcc LTO (link time optimization) In-Reply-To: <131b1189-d0e8-3d3e-6137-81ffc8eeeb84@oracle.com> References: <131b1189-d0e8-3d3e-6137-81ffc8eeeb84@oracle.com> Message-ID: Hi Magnus,

thanks for the info; I already noticed yesterday the setting for arm-32 in the minimal build.

Do you think we could set it too for the other Linux platforms in the minimal build? (The serviceability agent is not supported there either, so the observed issue wouldn't be a problem.)

Best regards, Matthias

[earlier messages quoted in full; trimmed]

From harold.seigel at oracle.com Tue Jan 14 14:00:13 2020 From: harold.seigel at oracle.com (Harold Seigel) Date: Tue, 14 Jan 2020 09:00:13 -0500 Subject: RFR 8235678: Remove unnecessary calls to Thread::current() in MutexLocker calls Message-ID: <0a7cbbe4-3613-bd5a-962b-78dee06dde8a@oracle.com> Hi,

Please review this small change to reduce unnecessary calls to Thread::current() in MutexLocker calls, by passing the current thread as an argument. A few ResourceMark declarations were also changed.

Open Webrev: http://cr.openjdk.java.net/~hseigel/bug_8235678/webrev/index.html

JBS Bug: https://bugs.openjdk.java.net/browse/JDK-8235678

The fix was regression tested by running Mach5 tiers 1 and 2 tests and builds on Linux-x64, Solaris, Windows, and Mac OS X, by running Mach5 tiers 3-5 tests on Linux-x64, and JCK lang and VM tests on Linux-x64.
Thanks, Harold

From magnus.ihse.bursie at oracle.com Tue Jan 14 14:04:11 2020 From: magnus.ihse.bursie at oracle.com (Magnus Ihse Bursie) Date: Tue, 14 Jan 2020 15:04:11 +0100 Subject: serviceability agent : problems when using gcc LTO (link time optimization) In-Reply-To: References: <131b1189-d0e8-3d3e-6137-81ffc8eeeb84@oracle.com> Message-ID: <4fdc39f0-9ed3-5878-93b2-b536b8779125@oracle.com> On 2020-01-14 13:49, Baesken, Matthias wrote:
> Hi Magnus, thanks for the info; I already noticed yesterday the setting for arm-32 in the minimal build.
>
> Do you think we could set it too for the other Linux platforms in the minimal build? (The serviceability agent is not supported there either, so the observed issue wouldn't be a problem.)

You mean if you could enable it on your builds without any issues? I'd guess so, but I don't know. Just try it: --with-jvm-features="link-time-opt".

If you mean that it should be turned on by default on minimal builds for all platforms? No, I don't think that's a good idea. The link time is really a killer. I remember arm-32 going from like a couple of minutes to half an hour for linking libjvm.so.

Things might be different with gold, though. I know they have done work with at least some kind of "lightweight" LTO, that might be worth at least looking into.

/Magnus

> Best regards, Matthias
>
> [earlier messages quoted in full; trimmed]

From matthias.baesken at sap.com Tue Jan 14 14:07:16 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Tue, 14 Jan 2020 14:07:16 +0000 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code Message-ID: Hello,

the following change enables the link-time section-gc for linux. gcc and ld support enabling "garbage collection" of unused input sections. This can be used to eliminate unused code from native libraries (especially when the objects are already compiled with the compiler flags -ffunction-sections -fdata-sections). See for details the --gc-sections and --print-gc-sections parts of the ld documentation: https://linux.die.net/man/1/ld

We had this enabled already for linux s390x, with https://bugs.openjdk.java.net/browse/JDK-8234525 "8234525: enable link-time section-gc for linux s390x to remove unused code". This time we enable it too for the other linux platforms. For the other platforms I do not enable it for the JVM, just for the JDK libs.
The reason is that the serviceability agent (not supported on linux s390x ) is not (yet) ready for the optimization .
Below you see the results , for some libraries a significant size reduction can be achieved .

Results from linux x86_64 product builds :

without / with ltgc

320K / 300K /images/jdk/lib/libsunec.so <-------------------------
36K / 36K /images/jdk/lib/libdt_socket.so
280K / 276K /images/jdk/lib/libjdwp.so
23M / 23M /images/jdk/lib/server/libjvm.so <---- not set for libjvm.so for x86_64
16K / 16K /images/jdk/lib/server/libjsig.so
72K / 72K /images/jdk/lib/libverify.so
84K / 84K /images/jdk/lib/libjli.so
16K / 16K /images/jdk/lib/libjsig.so
196K / 196K /images/jdk/lib/libjava.so
44K / 44K /images/jdk/lib/libzip.so
144K / 136K /images/jdk/lib/libjimage.so
112K / 112K /images/jdk/lib/libnet.so
100K / 100K /images/jdk/lib/libnio.so
36K / 36K /images/jdk/lib/libsctp.so
576K / 556K /images/jdk/lib/libmlib_image.so
752K / 752K /images/jdk/lib/libawt.so
260K / 252K /images/jdk/lib/libjavajpeg.so
784K / 784K /images/jdk/lib/libfreetype.so
368K / 236K /images/jdk/lib/libsplashscreen.so <-------------------------
88K / 88K /images/jdk/lib/libjsound.so
472K / 468K /images/jdk/lib/libawt_xawt.so
564K / 404K /images/jdk/lib/liblcms.so <--------------------------
48K / 48K /images/jdk/lib/libawt_headless.so
12K / 12K /images/jdk/lib/libjawt.so
1.5M / 900K /images/jdk/lib/libfontmanager.so <------------------------------
12K / 12K /images/jdk/lib/libjaas.so
92K / 92K /images/jdk/lib/libj2pkcs11.so
16K / 16K /images/jdk/lib/libattach.so
8.0K / 8.0K /images/jdk/lib/librmi.so
56K / 56K /images/jdk/lib/libinstrument.so
16K / 16K /images/jdk/lib/libprefs.so
52K / 52K /images/jdk/lib/libj2gss.so
12K / 12K /images/jdk/lib/libmanagement_agent.so
36K / 32K /images/jdk/lib/libmanagement.so
16K / 16K /images/jdk/lib/libextnet.so
20K / 20K /images/jdk/lib/libj2pcsc.so
40K / 40K /images/jdk/lib/libmanagement_ext.so
60K / 60K /images/jdk/lib/libsaproc.so

Bug/webrev :
https://bugs.openjdk.java.net/browse/JDK-8236714
http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.2/

Thanks, Matthias

From lois.foltan at oracle.com Tue Jan 14 15:05:34 2020
From: lois.foltan at oracle.com (Lois Foltan)
Date: Tue, 14 Jan 2020 10:05:34 -0500
Subject: RFR 8235678: Remove unnecessary calls to Thread::current() in MutexLocker calls
In-Reply-To: <0a7cbbe4-3613-bd5a-962b-78dee06dde8a@oracle.com>
References: <0a7cbbe4-3613-bd5a-962b-78dee06dde8a@oracle.com>
Message-ID:

On 1/14/2020 9:00 AM, Harold Seigel wrote:
> Hi,
>
> Please review this small change, to reduce unnecessary calls to
> Thread::current() in MutexLocker calls, by passing the current thread
> as an argument. A few ResourceMark declarations were also changed.
>
> Open Webrev:
> http://cr.openjdk.java.net/~hseigel/bug_8235678/webrev/index.html
>
> JBS Bug: https://bugs.openjdk.java.net/browse/JDK-8235678
>
> The fix was regression tested by running Mach5 tiers 1 and 2 tests and
> builds on Linux-x64, Solaris, Windows, and Mac OS X, by running Mach5
> tiers 3-5 tests on Linux-x64, and JCK lang and VM tests on Linux-x64.
>
> Thanks, Harold
>

Overall looks great. One comment:

- prims/methodHandles.cpp
line #1507: Curious to know why you use "THREAD" and the MutexLocker mu1 at line #1502 uses "thread"?

Thanks,
Lois

From aleksei.voitylov at bell-sw.com Tue Jan 14 15:15:14 2020
From: aleksei.voitylov at bell-sw.com (Aleksei Voitylov)
Date: Tue, 14 Jan 2020 18:15:14 +0300
Subject: serviceability agent : problems when using gcc LTO (link time optimization)
In-Reply-To: <4fdc39f0-9ed3-5878-93b2-b536b8779125@oracle.com>
References: <131b1189-d0e8-3d3e-6137-81ffc8eeeb84@oracle.com> <4fdc39f0-9ed3-5878-93b2-b536b8779125@oracle.com>
Message-ID: <249cba82-1469-03f8-d205-e5895eab2cc3@bell-sw.com>

Magnus, Matthias,

for me, lto is a little heavyweight for development.
x86_64 build time with gcc 7:

Server 1m32.484s
Server+Minimal 1m42.166s
Server+Minimal (--with-jvm-features="link-time-opt") 5m29.422s

If the change to enable lto by default is proposed, what would be the recommended strategy for development?

For ARM32 Minimal, please keep in mind that it's not uncommon to disable LTO plugin in commodity ARM32 gcc compiler distributions, so for some it does not matter what settings we have in OpenJDK. I believe there could be other reasons for that on top of build time (bugs?).

-Aleksei

On 14/01/2020 17:04, Magnus Ihse Bursie wrote:
> On 2020-01-14 13:49, Baesken, Matthias wrote:
>>
>> Hi Magnus, thanks for the info , I already noticed yesterday the setting for arm-32 in the minimal build .
>>
>> Do you think we could set it too for the other Linux platforms in the minimal build ? ( serviceability agent is not supported there as well so the observed issue wouldn't be a problem).
>>
>
> You mean if you could enable it on your builds without any issues? I'd
> guess so, but I don't know. Just try it:
> --with-jvm-features="link-time-opt".
>
> If you mean that it should be turned on by default on minimal builds
> for all platforms? No, I don't think that's a good idea. The link time
> is really a killer. I remember arm-32 going from like a couple of
> minutes to half an hour for linking libjvm.so.
>
> Things might be different with gold, though. I know they have done
> work with at least some kind of "lightweight" LTO, that might be worth
> at least looking into.
>
> /Magnus
>
>> Best regards, Matthias
>>
>> On 2020-01-10 11:01, Baesken, Matthias wrote:
>>
>> Hello, I recently looked into the gcc lto optimization mode (see for some details https://gcc.gnu.org/onlinedocs/gccint/LTO-Overview.html and http://hubicka.blogspot.com/2019/05/gcc-9-link-time-and-inter-procedural.html ).
>>
>> This mode can lead to more compact binaries (~10% smaller) , it also might bring
small performance improvements but that wasn't my (main) goal .
>>
>> The changes for this are rather small , one needs to use a recent gcc , add -flto to the compile flags , for example
>>
>> --- a/make/autoconf/flags-cflags.m4 Wed Jan 01 03:08:45 2020 +0100
>> +++ b/make/autoconf/flags-cflags.m4 Wed Jan 08 17:39:10 2020 +0100
>> @@ -530,8 +530,13 @@
>>    fi
>>    if test "x$TOOLCHAIN_TYPE" = xgcc; then
>> -    TOOLCHAIN_CFLAGS_JVM="$TOOLCHAIN_CFLAGS_JVM -fcheck-new -fstack-protector"
>> -    TOOLCHAIN_CFLAGS_JDK="-pipe -fstack-protector"
>> +    TOOLCHAIN_CFLAGS_JVM="$TOOLCHAIN_CFLAGS_JVM -fcheck-new -fstack-protector -flto"
>> +    TOOLCHAIN_CFLAGS_JDK="-pipe -fstack-protector -flto"
>>
>> .... and you have to make sure to use gcc-ar and gcc-nm instead of ar / nm .
>>
>> Build and test(s) work, however with one exception.
>>
>> The serviceability tests like serviceability/sa seem to rely heavily on the "normal" structure of libjvm.so (from what I understand, e.g. in LinuxVtblAccess it is attempted to access internal symbols like _ZTV ).
>>
>> Errors in the sa tests look like :
>>
>> java.lang.InternalError: Metadata does not appear to be polymorphic
>>         at jdk.hotspot.agent/sun.jvm.hotspot.types.basic.BasicTypeDataBase.findDynamicTypeForAddress(BasicTypeDataBase.java:279)
>>         at jdk.hotspot.agent/sun.jvm.hotspot.runtime.VirtualBaseConstructor.instantiateWrapperFor(VirtualBaseConstructor.java:102)
>>         at jdk.hotspot.agent/sun.jvm.hotspot.oops.Metadata.instantiateWrapperFor(Metadata.java:74)
>>         at jdk.hotspot.agent/sun.jvm.hotspot.memory.SystemDictionary.getClassLoaderKlass(SystemDictionary.java:96)
>>         at jdk.hotspot.agent/sun.jvm.hotspot.tools.ClassLoaderStats.printClassLoaderStatistics(ClassLoaderStats.java:93)
>>         at jdk.hotspot.agent/sun.jvm.hotspot.tools.ClassLoaderStats.run(ClassLoaderStats.java:78)
>>         at jdk.hotspot.agent/sun.jvm.hotspot.tools.JMap.run(JMap.java:115)
>>         at jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.startInternal(Tool.java:262)
>>         at jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.start(Tool.java:225)
>>         at jdk.hotspot.agent/sun.jvm.hotspot.tools.Tool.execute(Tool.java:118)
>>         at jdk.hotspot.agent/sun.jvm.hotspot.tools.JMap.main(JMap.java:176)
>>         at jdk.hotspot.agent/sun.jvm.hotspot.SALauncher.runJMAP(SALauncher.java:321)
>>         at jdk.hotspot.agent/sun.jvm.hotspot.SALauncher.main(SALauncher.java:406)
>>
>> Has anyone experimented with LTO optimization ?
>>
>> Hi Matthias,
>>
>> We used to have LTO enabled on the old, closed-source Oracle arm-32
>> builds. There is still a "link-time-opt" JVM feature present; afaik it
>> still works and adds the -flto flag. The main drawback of this is the
>> *extremely* long link times of libjvm.so.
>>
>> I don't think serviceability was ever supported for that platform, so
>> I'm not surprised this does not work.
>>
>> /Magnus
>>
>> And to the serviceability agent experts - any idea how to make the jdk.hotspot.agent more independent from optimization settings ?
>>
>> Best regards, Matthias
>>
>

From erik.joelsson at oracle.com Tue Jan 14 16:12:24 2020
From: erik.joelsson at oracle.com (Erik Joelsson)
Date: Tue, 14 Jan 2020 08:12:24 -0800
Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code
In-Reply-To:
References:
Message-ID:

(adding core-libs-dev)

Change looks good to me, but would like input from at least someone in core-libs.
/Erik

On 2020-01-14 06:07, Baesken, Matthias wrote:
> Hello, the following change enables the link-time section-gc for linux .
>
> gcc and ld support enabling "garbage collection" of unused input sections.
> This can be used to eliminate unused code from native libraries (especially when already compiling the objects with compiler flags -ffunction-sections -fdata-sections).
> See for details the --gc-sections and --print-gc-sections parts of the ld documentation :
>
> https://linux.die.net/man/1/ld
>
> We had this enabled already for linux s390x , with https://bugs.openjdk.java.net/browse/JDK-8234525
> 8234525: enable link-time section-gc for linux s390x to remove unused code .
>
> This time we enable it too for the other linux platforms .
>
> For the other platforms I do not enable it for JVM, just for the JDK libs. The reason is that the serviceability agent (not supported on linux s390x ) is not (yet) ready for the optimization .
> Below you see the results , for some libraries a significant size reduction can be achieved .
>
> Results from linux x86_64 product builds :
>
> without / with ltgc
>
> 320K / 300K /images/jdk/lib/libsunec.so <-------------------------
> 36K / 36K /images/jdk/lib/libdt_socket.so
> 280K / 276K /images/jdk/lib/libjdwp.so
> 23M / 23M /images/jdk/lib/server/libjvm.so <---- not set for libjvm.so for x86_64
> 16K / 16K /images/jdk/lib/server/libjsig.so
> 72K / 72K /images/jdk/lib/libverify.so
> 84K / 84K /images/jdk/lib/libjli.so
> 16K / 16K /images/jdk/lib/libjsig.so
> 196K / 196K /images/jdk/lib/libjava.so
> 44K / 44K /images/jdk/lib/libzip.so
> 144K / 136K /images/jdk/lib/libjimage.so
> 112K / 112K /images/jdk/lib/libnet.so
> 100K / 100K /images/jdk/lib/libnio.so
> 36K / 36K /images/jdk/lib/libsctp.so
> 576K / 556K /images/jdk/lib/libmlib_image.so
> 752K / 752K /images/jdk/lib/libawt.so
> 260K / 252K /images/jdk/lib/libjavajpeg.so
> 784K / 784K /images/jdk/lib/libfreetype.so
> 368K / 236K /images/jdk/lib/libsplashscreen.so <-------------------------
> 88K / 88K /images/jdk/lib/libjsound.so
> 472K / 468K /images/jdk/lib/libawt_xawt.so
> 564K / 404K /images/jdk/lib/liblcms.so <--------------------------
> 48K / 48K /images/jdk/lib/libawt_headless.so
> 12K / 12K /images/jdk/lib/libjawt.so
> 1.5M / 900K /images/jdk/lib/libfontmanager.so <------------------------------
> 12K / 12K /images/jdk/lib/libjaas.so
> 92K / 92K /images/jdk/lib/libj2pkcs11.so
> 16K / 16K /images/jdk/lib/libattach.so
> 8.0K / 8.0K /images/jdk/lib/librmi.so
> 56K / 56K /images/jdk/lib/libinstrument.so
> 16K / 16K /images/jdk/lib/libprefs.so
> 52K / 52K /images/jdk/lib/libj2gss.so
> 12K / 12K /images/jdk/lib/libmanagement_agent.so
> 36K / 32K /images/jdk/lib/libmanagement.so
> 16K / 16K /images/jdk/lib/libextnet.so
> 20K / 20K /images/jdk/lib/libj2pcsc.so
> 40K / 40K /images/jdk/lib/libmanagement_ext.so
> 60K / 60K /images/jdk/lib/libsaproc.so
>
> Bug/webrev :
>
> https://bugs.openjdk.java.net/browse/JDK-8236714
>
> http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.2/
>
> Thanks,
Matthias

From coleen.phillimore at oracle.com Tue Jan 14 16:24:43 2020
From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com)
Date: Tue, 14 Jan 2020 11:24:43 -0500
Subject: RFR 8235678: Remove unnecessary calls to Thread::current() in MutexLocker calls
In-Reply-To: <0a7cbbe4-3613-bd5a-962b-78dee06dde8a@oracle.com>
References: <0a7cbbe4-3613-bd5a-962b-78dee06dde8a@oracle.com>
Message-ID: <1fcf8439-2b5e-a776-9142-d9dc14a58faf@oracle.com>

Hi Harold,

I really wanted this change to move Thread to the first argument like many of the other calls in the VM that take THREAD as an argument.

Written like this:

+ MutexLocker mu(Threads_lock, THREAD);

It's too easy for someone who's cut/pasting to think the last THREAD argument should really be CHECK, which is completely wrong.

Can you switch the arguments?

Thanks,
Coleen

On 1/14/20 9:00 AM, Harold Seigel wrote:
> Hi,
>
> Please review this small change, to reduce unnecessary calls to
> Thread::current() in MutexLocker calls, by passing the current thread
> as an argument. A few ResourceMark declarations were also changed.
>
> Open Webrev:
> http://cr.openjdk.java.net/~hseigel/bug_8235678/webrev/index.html
>
> JBS Bug: https://bugs.openjdk.java.net/browse/JDK-8235678
>
> The fix was regression tested by running Mach5 tiers 1 and 2 tests and
> builds on Linux-x64, Solaris, Windows, and Mac OS X, by running Mach5
> tiers 3-5 tests on Linux-x64, and JCK lang and VM tests on Linux-x64.
>
> Thanks, Harold
>

From harold.seigel at oracle.com Tue Jan 14 16:27:17 2020
From: harold.seigel at oracle.com (Harold Seigel)
Date: Tue, 14 Jan 2020 11:27:17 -0500
Subject: RFR 8235678: Remove unnecessary calls to Thread::current() in MutexLocker calls
In-Reply-To:
References: <0a7cbbe4-3613-bd5a-962b-78dee06dde8a@oracle.com>
Message-ID: <50019f6b-d94f-eb1a-de1c-622b437edf9f@oracle.com>

Hi Lois,

Thanks for the review!

>> line #1507: Curious to know why you use "THREAD" and the MutexLocker mu1 at line #1502 uses "thread"?
I used THREAD when available because it is more typically used, but I didn't change those calls that used JavaThread* objects.

Harold

On 1/14/2020 10:05 AM, Lois Foltan wrote:
> On 1/14/2020 9:00 AM, Harold Seigel wrote:
>> Hi,
>>
>> Please review this small change, to reduce unnecessary calls to
>> Thread::current() in MutexLocker calls, by passing the current thread
>> as an argument. A few ResourceMark declarations were also changed.
>>
>> Open Webrev:
>> http://cr.openjdk.java.net/~hseigel/bug_8235678/webrev/index.html
>>
>> JBS Bug: https://bugs.openjdk.java.net/browse/JDK-8235678
>>
>> The fix was regression tested by running Mach5 tiers 1 and 2 tests
>> and builds on Linux-x64, Solaris, Windows, and Mac OS X, by running
>> Mach5 tiers 3-5 tests on Linux-x64, and JCK lang and VM tests on
>> Linux-x64.
>>
>> Thanks, Harold
>>
>
> Overall looks great. One comment:
>
> - prims/methodHandles.cpp
> line #1507: Curious to know why you use "THREAD" and the MutexLocker
> mu1 at line #1502 uses "thread"?
>
> Thanks,
> Lois

From harold.seigel at oracle.com Tue Jan 14 16:33:00 2020
From: harold.seigel at oracle.com (Harold Seigel)
Date: Tue, 14 Jan 2020 11:33:00 -0500
Subject: RFR 8235678: Remove unnecessary calls to Thread::current() in MutexLocker calls
In-Reply-To: <1fcf8439-2b5e-a776-9142-d9dc14a58faf@oracle.com>
References: <0a7cbbe4-3613-bd5a-962b-78dee06dde8a@oracle.com> <1fcf8439-2b5e-a776-9142-d9dc14a58faf@oracle.com>
Message-ID:

Hi Coleen,

I'll go ahead and switch the order and put out a new webrev.

Thanks for looking at it.

Harold

On 1/14/2020 11:24 AM, coleen.phillimore at oracle.com wrote:
>
> Hi Harold,
>
> I really wanted this change to move Thread to the first argument like
> many of the other calls in the VM that take THREAD as an argument.
>
> Written like this:
>
> + MutexLocker mu(Threads_lock, THREAD);
>
> It's too easy for someone who's cut/pasting to think the last THREAD
> argument should really be CHECK, which is completely wrong.
>
> Can you switch the arguments?
>
> Thanks,
> Coleen
>
> On 1/14/20 9:00 AM, Harold Seigel wrote:
>> Hi,
>>
>> Please review this small change, to reduce unnecessary calls to
>> Thread::current() in MutexLocker calls, by passing the current thread
>> as an argument. A few ResourceMark declarations were also changed.
>>
>> Open Webrev:
>> http://cr.openjdk.java.net/~hseigel/bug_8235678/webrev/index.html
>>
>> JBS Bug: https://bugs.openjdk.java.net/browse/JDK-8235678
>>
>> The fix was regression tested by running Mach5 tiers 1 and 2 tests
>> and builds on Linux-x64, Solaris, Windows, and Mac OS X, by running
>> Mach5 tiers 3-5 tests on Linux-x64, and JCK lang and VM tests on
>> Linux-x64.
>>
>> Thanks, Harold
>>
>

From matthias.baesken at sap.com Tue Jan 14 16:57:33 2020
From: matthias.baesken at sap.com (Baesken, Matthias)
Date: Tue, 14 Jan 2020 16:57:33 +0000
Subject: serviceability agent : problems when using gcc LTO (link time optimization)
In-Reply-To: <249cba82-1469-03f8-d205-e5895eab2cc3@bell-sw.com>
References: <131b1189-d0e8-3d3e-6137-81ffc8eeeb84@oracle.com> <4fdc39f0-9ed3-5878-93b2-b536b8779125@oracle.com> <249cba82-1469-03f8-d205-e5895eab2cc3@bell-sw.com>
Message-ID:

Hello Magnus and Aleksei, thanks for the input .

The times you provided really look like they make a big difference at least for people often building minimal-vm .
Guess I have to measure myself a bit (maybe the difference is not that big on our linux s390x / ppc64(le) ) .

> If the change to enable lto by default is proposed, what would be the
> recommended strategy for development?

Probably we should a) do not enable it by default but just make sure it can be enabled easily and works for the minimal-vm or b) make it easy to disable it for local development.

Best regards, Matthias

> Magnus, Matthias,
>
> for me, lto is a little heavyweight for development.
x86_64 build time > with gcc 7: > > Server 1m32.484s > Server+Minimal 1m42.166s > Server+Minimal (--with-jvm-features="link-time-opt") 5m29.422s > > If the change to enable lto by default is proposed, what would be the > recommended strategy for development? > > For ARM32 Minimal, please keep in mind that it's not uncommon to disable > LTO plugin in commodity ARM32 gcc compiler distributions, so for some it > does not matter what settings we have in OpenJDK. I believe there could > be other reasons for that on top of build time (bugs?). > From igor.ignatyev at oracle.com Tue Jan 14 17:03:38 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Tue, 14 Jan 2020 09:03:38 -0800 Subject: RFR(S): 8236111 : narrow allowSmartActionArgs disabling In-Reply-To: <916e0375-abba-9945-b845-0fd4198513f0@oracle.com> References: <423ea31a-ebf8-4cba-72a4-6fbb934f7789@oracle.com> <0BA46866-3DEA-44BF-B87C-2B59B84196C9@oracle.com> <916e0375-abba-9945-b845-0fd4198513f0@oracle.com> Message-ID: Joe and Roger, thank you for your reviews. security-libs guys, could you please take a look? Thanks, -- Igor > On Jan 2, 2020, at 12:58 PM, Roger Riggs wrote: > > The core lib changes look ok. > > Roger > On Jan 2, 2020, at 1:26 PM, Joe Darcy wrote: > > The removal of the existing TEST.properties files look fine. > > Please also solicit feedback from the security libs team as their area is affected. > > Roger, FYI the serial filter tests are updated as part of this changeset. > > Cheers, > > -Joe > > On 12/23/2019 8:13 PM, Igor Ignatyev wrote: >> Thanks David. >> >> core-libs folks, could you please review jdk part of this patch? >> >> Thanks, >> -- Igor >> >>> On Dec 23, 2019, at 1:33 PM, David Holmes wrote: >>> >>> Hi Igor, >>> >>> Hotspot changes seem fine. Can't comment on jdk tests. >>> >>> Thanks, >>> David >>> >>> On 24/12/2019 6:42 am, Igor Ignatyev wrote: >>>> ping? 
>>>>> On Dec 17, 2019, at 11:30 AM, Igor Ignatyev wrote: >>>>> >>>>> http://cr.openjdk.java.net/~iignatyev/8236111/webrev.00/ >>>>>> 31 lines changed: 20 ins; 11 del; 0 mod; >>>>> Hi all, >>>>> >>>>> could you please review this small patch which enables allowSmartActionArgs in hotspot and jdk test suites and disables them in a small number of test directories? the patch also removes TEST.properties files which enabled allowSmartActionArgs as they aren't needed anymore. >>>>> >>>>> from JBS: >>>>>> currently, allowSmartActionArgs is disabled for the whole hotspot and jdk test suites and enabled just in few places. this makes it a bit harder for people to use smart action arguments in these test suites as they have to not to forget to enable them. and given in all the other test suites, smart action arguments are enabled, it can be confusing and frustrating. >>>>> >>>>> testing: tier1-5 >>>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8236111 >>>>> webrev: http://cr.openjdk.java.net/~iignatyev/8236111/webrev.00/ >>>>> >>>>> Thanks, >>>>> -- Igor From aleksei.voitylov at bell-sw.com Tue Jan 14 17:54:36 2020 From: aleksei.voitylov at bell-sw.com (Aleksei Voitylov) Date: Tue, 14 Jan 2020 20:54:36 +0300 Subject: serviceability agent : problems when using gcc LTO (link time optimization) In-Reply-To: References: <131b1189-d0e8-3d3e-6137-81ffc8eeeb84@oracle.com> <4fdc39f0-9ed3-5878-93b2-b536b8779125@oracle.com> <249cba82-1469-03f8-d205-e5895eab2cc3@bell-sw.com> Message-ID: <9908f346-f376-5bfc-8d5b-bc250e336032@bell-sw.com> On 14/01/2020 19:57, Baesken, Matthias wrote: > Hello Magnus and Aleksei, thanks for the input . > > The times you provided really look like they make a big difference at least for people often building minimal-vm . > Guess I have to measure myself a bit (maybe the difference is not that big on our linux s390x / ppc64(le) ) . > >> If the change to enable lto by default is proposed, what would be the >> recommended strategy for development? 
> Probably we should a) do not enable it by default but just make sure it can be enabled easily and works for the minimal-vm

That would be welcome. I have high hopes to LTO the VM some time by default, and the tendency observed is that the compiler time overhead for GCC becomes smaller. At the same time there is no reason why vendors that invested in testing and can absorb the build time hit could not provide binaries with LTO built VMs by passing an additional option flag.

> or b) make it easy to disable it for local development.
>
> Best regards, Matthias
>
>> Magnus, Matthias,
>>
>> for me, lto is a little heavyweight for development. x86_64 build time
>> with gcc 7:
>>
>> Server 1m32.484s
>> Server+Minimal 1m42.166s
>> Server+Minimal (--with-jvm-features="link-time-opt") 5m29.422s
>>
>> If the change to enable lto by default is proposed, what would be the
>> recommended strategy for development?
>>
>> For ARM32 Minimal, please keep in mind that it's not uncommon to disable
>> LTO plugin in commodity ARM32 gcc compiler distributions, so for some it
>> does not matter what settings we have in OpenJDK. I believe there could
>> be other reasons for that on top of build time (bugs?).
>>

From bob.vandette at oracle.com Tue Jan 14 20:04:55 2020
From: bob.vandette at oracle.com (Bob Vandette)
Date: Tue, 14 Jan 2020 15:04:55 -0500
Subject: [PING2!] RFR: 8230305: Cgroups v2: Container awareness
In-Reply-To:
References: <072f66ee8c44034831b4e38f6470da4bff6edd07.camel@redhat.com> <7540a208e306ab957032b18178a53c6afa105d33.camel@redhat.com> <5eec97c04d86562346243c1db3832e86e13697a1.camel@redhat.com>
Message-ID: <52110EA5-26C9-43F2-8C2A-21D4E03ED3CC@oracle.com>

Cgroup V2 is about to go mainstream this year for popular distros such as Oracle Linux 8, Redhat Linux 8 and Fedora, so it's important to get this fix into JDK 15 so we can start shaking out this support.

Please take a look and help get this change reviewed.
Thanks,
Bob Vandette

> On Nov 29, 2019, at 4:04 AM, Severin Gehwolf wrote:
>
> On Fri, 2019-11-15 at 17:56 +0100, Severin Gehwolf wrote:
>> On Fri, 2019-11-08 at 15:21 +0100, Severin Gehwolf wrote:
>>> Hi Bob,
>>>
>>> On Wed, 2019-11-06 at 10:47 +0100, Severin Gehwolf wrote:
>>>> On Tue, 2019-11-05 at 16:54 -0500, Bob Vandette wrote:
>>>>> Severin,
>>>>>
>>>>> Thanks for taking on this cgroup v2 improvement.
>>>>>
>>>>> In general I like the implementation and the refactoring. The CachedMetric class is nice.
>>>>> We can add any metric we want to cache in a more general way.
>>>>>
>>>>> Is this the latest version of the webrev?
>>>>>
>>>>> http://cr.openjdk.java.net/~sgehwolf/webrevs/cgroupsv2-hotspot/03/webrev/src/hotspot/os/linux/cgroupV2Subsystem_linux.cpp.html
>>>>>
>>>>> It looks like you need to add the caching support for active_processor_count (JDK-8227006).
>>> [...]
>>>> I'll do a proper rebase ASAP.
>>>
>>> Latest webrev:
>>> http://cr.openjdk.java.net/~sgehwolf/webrevs/cgroupsv2-hotspot/05/webrev/
>>>
>>>>> I'm not sure it's worth providing different strings for Unlimited versus Max or Scaled shares.
>>>>> I'd just try to be compatible with the cgroupv2 output so you don't have to change the test.
>>>>
>>>> OK. Will do.
>>>
>>> Unfortunately, there is no way of NOT changing TestCPUAwareness.java as
>>> it expects CPU Shares to be written to the cgroup filesystem verbatim.
>>> That's no longer the case for cgroups v2 (at least for crun). Either
>>> way, most test changes are gone now.
>>>
>>>>> I wonder if it's worth trying to synthesize memory_max_usage_in_bytes() by keeping the highest
>>>>> value ever returned by the API.
>>>>
>>>> Interesting idea. I'll ponder this a bit and get back to you.
>>>
>>> This has been implemented. I'm not sure this is correct, though. It
>>> merely piggy-backs on calls to memory_usage_in_bytes() and keeps the
>>> high watermark value of that.
>>>
>>> Testing passed on F31 with cgroups v2 controllers properly configured
>>> (podman) and hybrid (legacy hierarchy) with docker/podman.
>>>
>>> Thoughts?
>>
>> Ping?
>
> Anyone willing to review this? It would be nice to make some progress.
>
> Thanks,
> Severin
>
>> Metrics work proposed for RFR here:
>> http://mail.openjdk.java.net/pipermail/core-libs-dev/2019-November/063464.html
>>
>> Thanks,
>> Severin
>

From david.holmes at oracle.com Tue Jan 14 23:04:46 2020
From: david.holmes at oracle.com (David Holmes)
Date: Wed, 15 Jan 2020 09:04:46 +1000
Subject: RFR 8235678: Remove unnecessary calls to Thread::current() in MutexLocker calls
In-Reply-To:
References: <0a7cbbe4-3613-bd5a-962b-78dee06dde8a@oracle.com>
Message-ID:

Hi Lois,

On 15/01/2020 1:05 am, Lois Foltan wrote:
> On 1/14/2020 9:00 AM, Harold Seigel wrote:
>> Hi,
>>
>> Please review this small change, to reduce unnecessary calls to
>> Thread::current() in MutexLocker calls, by passing the current thread
>> as an argument. A few ResourceMark declarations were also changed.
>>
>> Open Webrev:
>> http://cr.openjdk.java.net/~hseigel/bug_8235678/webrev/index.html
>>
>> JBS Bug: https://bugs.openjdk.java.net/browse/JDK-8235678
>>
>> The fix was regression tested by running Mach5 tiers 1 and 2 tests and
>> builds on Linux-x64, Solaris, Windows, and Mac OS X, by running Mach5
>> tiers 3-5 tests on Linux-x64, and JCK lang and VM tests on Linux-x64.
>>
>> Thanks, Harold
>>
>
> Overall looks great. One comment:
>
> - prims/methodHandles.cpp
> line #1507: Curious to know why you use "THREAD" and the MutexLocker mu1
> at line #1502 uses "thread"?

Just for reference THREAD is a Thread*, but thread is a JavaThread* introduced by some of the *ENTRY macros. It can be confusing when they both get used by code that only needs a Thread* - as per the line you quoted. We probably have a few places where we explicitly cast THREAD to JavaThread* unnecessarily because it isn't obvious that the thread variable exists.
David > Thanks, > Lois From kim.barrett at oracle.com Wed Jan 15 06:24:40 2020 From: kim.barrett at oracle.com (Kim Barrett) Date: Wed, 15 Jan 2020 01:24:40 -0500 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: References: Message-ID: <160821DF-BE15-435C-BDBE-AB553F794590@oracle.com> > On Jan 13, 2020, at 2:13 AM, David Holmes wrote: > > webrev: http://cr.openjdk.java.net/~dholmes/8235741/webrev/ > bug: https://bugs.openjdk.java.net/browse/JDK-8235741 > > Full details in the bug report about the existing uses of javaTimeMillis(), many of which just want an elapsed time in ms and so should be using javaTimeNanos() and convert to ms. This covers areas all across the VM. > > Only non-simple change is in os_perf_linux.cpp (and the same code will be in os_perf_aix.cpp once it has been validated). There we are tracking an elapsed time in ms but relative to the boot time, which is seconds since the epoch. Consequently the first interval has to be calculated using javaTimeMillis, but after that we can use javaTimeNanos (using a new 'first time' captured at the same time we used javaTimeMillis). I think I have the logic right but other than through JFR this code seems unused and I have limited means of testing it. The JFR test jdk/jfr/event/os/TestThreadContextSwitches.java exercises the code but the results of running that test seems to exhibit arbitrary randomness in the rates reported - e.g. 0 to 16000Hz - both with and without my change, so not really that useful. Stefan K. suggested a gtest which I may look into - though it is frustrating to have to expend such effort to validate this. > > Other testing tiers 1-3. > > Thanks, > David Thanks for the audit of uses of os::javaTimeMillis() in the bug report. I wonder if some of that ought to be captured as comments in the relevant code. It's not always obvious to me that an external time base is involved and thus making javaTimeMillis not a mistake. 
There are a lot of places where conversions from nanoseconds to milliseconds are being done to maintain existing units. Some of those places look like they could just as well be in nanoseconds. But I can see how changing the units for some of those could lead to a lot of fanout, so okay.

------------------------------------------------------------------------------
src/hotspot/os/windows/os_perf_windows.cpp
 100   s8 lastUpdate; // Last time query was updated (current millis).
...
 290   const s8 now = os::javaTimeNanos();
 291   if (NANOS_TO_MILLIS(now - update_query->lastUpdate) > min_update_interval_millis) {
...
 295     update_query->lastUpdate = now;

now and update_query->lastUpdate are now in nanos, but comment for lastUpdate still says it's in millis. Looks like the comment needs updating.

------------------------------------------------------------------------------
src/hotspot/share/utilities/globalDefinitions.hpp
 262 // time unit conversion macros
 263
 264 #define NANOS_TO_MILLIS(ns) ((ns) / NANOSECS_PER_MILLISEC)
 265 #define MILLIS_TO_NANOS(ms) ((ms) * NANOSECS_PER_MILLISEC)

Why are these macros, rather than (template) functions? Also, depending on the type and value of ms, MILLIS_TO_NANOS could easily overflow, e.g. if ms type is a 32 bit type with a value of more than ~4 seconds. (I checked the two uses, and they happen to be okay.)

inline int64_t nanos_to_millis(int64_t ns) { return ns / NANOSECS_PER_MILLISEC; }
inline int64_t millis_to_nanos(int64_t ms) { return ms * NANOSECS_PER_MILLISEC; }

Also, the names don't suggest time conversions, but potentially arbitrary unit conversions, e.g. between something in NANOUNITS and something in MILLIUNITS.

------------------------------------------------------------------------------
Regarding this from the audit:

--- begin ---
./share/gc/parallel/psParallelCompact.cpp: // os::javaTimeMillis() does not guarantee monotonicity. ...
./share/gc/shared/referenceProcessor.cpp: // os::javaTimeMillis() does not guarantee monotonicity. These are all describing why the subsequent code uses javaTimeNanos not javaTimeMillis. --- end --- Do we really still support platforms that don't have a monotonic clock? I guess we appear to at least try. But it's really wrong that callers of os::javaTimeNanos should even think they need to cope with that function being non-monotonic. Hm, I always thought System.nanoTime() was a monotonic clock, but I don't see any such guarantee. So I guess Java just doesn't have such a thing. Wow! So I guess none of this is really relevant to the change at hand after all. ------------------------------------------------------------------------------ From david.holmes at oracle.com Wed Jan 15 07:12:05 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 15 Jan 2020 17:12:05 +1000 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: <160821DF-BE15-435C-BDBE-AB553F794590@oracle.com> References: <160821DF-BE15-435C-BDBE-AB553F794590@oracle.com> Message-ID: Hi Kim, Thanks for taking a look at this. On 15/01/2020 4:24 pm, Kim Barrett wrote: >> On Jan 13, 2020, at 2:13 AM, David Holmes wrote: >> >> webrev: http://cr.openjdk.java.net/~dholmes/8235741/webrev/ >> bug: https://bugs.openjdk.java.net/browse/JDK-8235741 >> >> Full details in the bug report about the existing uses of javaTimeMillis(), many of which just want an elapsed time in ms and so should be using javaTimeNanos() and convert to ms. This covers areas all across the VM. >> >> Only non-simple change is in os_perf_linux.cpp (and the same code will be in os_perf_aix.cpp once it has been validated). There we are tracking an elapsed time in ms but relative to the boot time, which is seconds since the epoch. 
Consequently the first interval has to be calculated using javaTimeMillis, but after that we can use javaTimeNanos (using a new 'first time' captured at the same time we used javaTimeMillis). I think I have the logic right but other than through JFR this code seems unused and I have limited means of testing it. The JFR test jdk/jfr/event/os/TestThreadContextSwitches.java exercises the code but the results of running that test seem to exhibit arbitrary randomness in the rates reported - e.g. 0 to 16000Hz - both with and without my change, so not really that useful. Stefan K. suggested a gtest which I may look into - though it is frustrating to have to expend such effort to validate this. >> >> Other testing tiers 1-3. >> >> Thanks, >> David > > Thanks for the audit of uses of os::javaTimeMillis() in the bug report. > I wonder if some of that ought to be captured as comments in the > relevant code. It's not always obvious to me that an external time > base is involved and thus making javaTimeMillis not a mistake. Okay, I will add comments to the other uses of currentTimeMillis(). > There are a lot of places where conversions from nanoseconds to > milliseconds are being done to maintain existing units. Some of those > places look like they could just as well be in nanoseconds. But I can > see how changing the units for some of those could lead to a lot of > fan-out, so okay. Yes I tried to minimise the changes. In many cases a granularity of ms seems somewhat arbitrary. > ------------------------------------------------------------------------------ > src/hotspot/os/windows/os_perf_windows.cpp > 100 s8 lastUpdate; // Last time query was updated (current millis). > ... > 290 const s8 now = os::javaTimeNanos(); > 291 if (NANOS_TO_MILLIS(now - update_query->lastUpdate) > min_update_interval_millis) { > ... > 295 update_query->lastUpdate = now; > > now and update_query->lastUpdate are now in nanos, but comment for > lastUpdate still says it's in millis.
Looks like the comment needs > updating. Yes - good catch. > ------------------------------------------------------------------------------ > src/hotspot/share/utilities/globalDefinitions.hpp > 262 // time unit conversion macros > 263 > 264 #define NANOS_TO_MILLIS(ns) ((ns) / NANOSECS_PER_MILLISEC) > 265 #define MILLIS_TO_NANOS(ms) ((ms) * NANOSECS_PER_MILLISEC) > > Why are these macros, rather than (template) functions? Just because I just wanted a simple textual replacement to make it clearer that I'm converting from millis to nanos or vice versa. I reach for macros for such simple cases. > Also, depending on the type and value of ms, MILLIS_TO_NANOS could > easily overflow, e.g. if ms type is a 32-bit type with a value of more > than ~4 seconds. (I checked the two uses, and they happen to be okay.) These are not trying to be mathematically sound. The conversion from millis to nanos is used in two cases: 1. Converting a current timestamp in ms to ns. Unless the current time is set far in the future I don't think we have any issue with overflow of such a value. 2. Converting an elapsed time in ms to ns. These will be small values so no overflow is possible. > inline int64_t nanos_to_millis(int64_t ns) { > return ns / NANOSECS_PER_MILLISEC; > } > > inline int64_t millis_to_nanos(int64_t ms) { > return ms * NANOSECS_PER_MILLISEC; > } > > Also, the names don't suggest time conversions, but potentially > arbitrary unit conversions, e.g. between something in NANOUNITS and > something in MILLIUNITS. They don't have to be time conversions - the calculation is unit-less in practice.
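[Editor's note: David's two-case overflow argument can be made concrete. The sketch below mirrors the inline functions Kim suggests, not the code in the webrev: because the parameter is int64_t, a narrower (e.g. 32-bit) millisecond count is promoted before the multiply, which is exactly the overflow a macro multiplying in the argument's own type could hit.]

```cpp
#include <cstdint>

const int64_t NANOSECS_PER_MILLISEC = 1000000;

// Typed conversions along the lines Kim sketches. The int64_t parameter
// widens the argument before the multiplication, so millis_to_nanos
// cannot overflow for any realistic timestamp, unlike the
// MILLIS_TO_NANOS macro when handed a 32-bit value.
inline int64_t nanos_to_millis(int64_t ns) {
  return ns / NANOSECS_PER_MILLISEC;
}

inline int64_t millis_to_nanos(int64_t ms) {
  return ms * NANOSECS_PER_MILLISEC;
}
```

For example, 5000 ms converts to 5,000,000,000 ns, which already exceeds INT32_MAX; that is the ~4-second cliff Kim points out for a 32-bit intermediate.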
The fact we have NANOSECS_PER_MILLISEC et al is just an artifact of introducing those values for timeout calculations/conversions - it could just be NANOS_PER_MILLI etc > ------------------------------------------------------------------------------ > Regarding this from the audit: > > --- begin --- > ./share/gc/parallel/psParallelCompact.cpp: // os::javaTimeMillis() does not guarantee monotonicity. > ... > ./share/gc/shared/referenceProcessor.cpp: // os::javaTimeMillis() does not guarantee monotonicity. > > These are all describing why the subsequent code uses javaTimeNanos not javaTimeMillis. > --- end --- > > Do we really still support platforms that don't have a monotonic > clock? I guess we appear to at least try. But it's really wrong that > callers of os::javaTimeNanos should even think they need to cope with > that function being non-monotonic. > > Hm, I always thought System.nanoTime() was a monotonic clock, but I > don't see any such guarantee. So I guess Java just doesn't have such a > thing. Wow! > > So I guess none of this is really relevant to the change at hand after all. I think you read the comments the wrong way round. The code uses javaTimeNanos not javaTimeMillis because javaTimeMillis is not monotonic and the code wants a monotonic clock. These comments were mostly inserted when the incorrect use of javaTimeMillis was replaced with javaTimeNanos. Thanks, David ----- > ------------------------------------------------------------------------------ > > From matthias.baesken at sap.com Wed Jan 15 08:27:09 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Wed, 15 Jan 2020 08:27:09 +0000 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: References: Message-ID: Hi Erik, thanks for the review and for forwarding; you are correct, core-libs-dev is probably interested in this as well.
Best regards, Matthias > (adding core-libs-dev) > > Change looks good to me, but would like input from at least someone in > core-libs. > > /Erik > > On 2020-01-14 06:07, Baesken, Matthias wrote: > > Hello, the following change enables the link-time section-gc for linux . > > > > gcc and ld support enabling "garbage collection" of unused input sections. > > This can be used to eliminate unused code from native libraries > (especially when already compiling the objects with compiler flags -ffunction-sections -fdata-sections). > > See for details the --gc-sections and --print-gc-sections parts of the ld > documentation : > > > > https://linux.die.net/man/1/ld > > > > > > We had this enabled already for linux s390x , with > https://bugs.openjdk.java.net/browse/JDK-8234525 > > 8234525: enable link-time section-gc for linux s390x to remove unused code > . > > > > This time we enable it too for the other linux platforms . > > > > For the other platforms I do not enable it for JVM, just for the JDK libs. The > reason is that the serviceability agent (not supported on linux s390x ) is not > (yet) ready for the optimization . > > Below you see the results , for some libraries a significant size reduction > can be achieved .
> > > > > > Results from linux x86_64 product builds : > > > > without / with ltgc > > > > 320K / 300K /images/jdk/lib/libsunec.so <------------------------- > > 36K / 36K /images/jdk/lib/libdt_socket.so > > 280K / 276K /images/jdk/lib/libjdwp.so > > 23M / 23M /images/jdk/lib/server/libjvm.so <---- not set for libjvm.so > for x86_64 > > 16K / 16K /images/jdk/lib/server/libjsig.so > > 72K / 72K /images/jdk/lib/libverify.so > > 84K / 84K /images/jdk/lib/libjli.so > > 16K / 16K /images/jdk/lib/libjsig.so > > 196K / 196K /images/jdk/lib/libjava.so > > 44K / 44K /images/jdk/lib/libzip.so > > 144K / 136K /images/jdk/lib/libjimage.so > > 112K / 112K /images/jdk/lib/libnet.so > > 100K / 100K /images/jdk/lib/libnio.so > > 36K / 36K /images/jdk/lib/libsctp.so > > 576K / 556K /images/jdk/lib/libmlib_image.so > > 752K / 752K /images/jdk/lib/libawt.so > > 260K / 252K /images/jdk/lib/libjavajpeg.so > > 784K / 784K /images/jdk/lib/libfreetype.so > > 368K / 236K /images/jdk/lib/libsplashscreen.so <------------------------- > > 88K / 88K /images/jdk/lib/libjsound.so > > 472K / 468K /images/jdk/lib/libawt_xawt.so > > 564K / 404K /images/jdk/lib/liblcms.so <-------------------------- > > 48K / 48K /images/jdk/lib/libawt_headless.so > > 12K / 12K /images/jdk/lib/libjawt.so > > 1.5M / 900K /images/jdk/lib/libfontmanager.so <------------------------------ > > 12K / 12K /images/jdk/lib/libjaas.so > > 92K / 92K /images/jdk/lib/libj2pkcs11.so > > 16K / 16K /images/jdk/lib/libattach.so > > 8.0K / 8.0K /images/jdk/lib/librmi.so > > 56K / 56K /images/jdk/lib/libinstrument.so > > 16K / 16K /images/jdk/lib/libprefs.so > > 52K / 52K /images/jdk/lib/libj2gss.so > > 12K / 12K /images/jdk/lib/libmanagement_agent.so > > 36K / 32K /images/jdk/lib/libmanagement.so > > 16K / 16K /images/jdk/lib/libextnet.so > > 20K / 20K /images/jdk/lib/libj2pcsc.so > > 40K / 40K /images/jdk/lib/libmanagement_ext.so > > 60K / 60K /images/jdk/lib/libsaproc.so > > > > > > Bug/webrev : > > > >
https://bugs.openjdk.java.net/browse/JDK-8236714 > > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.2/ > > > > > > Thanks, Matthias From harold.seigel at oracle.com Wed Jan 15 12:57:24 2020 From: harold.seigel at oracle.com (Harold Seigel) Date: Wed, 15 Jan 2020 07:57:24 -0500 Subject: RFR 8235678: Remove unnecessary calls to Thread::current() in MutexLocker calls In-Reply-To: <1fcf8439-2b5e-a776-9142-d9dc14a58faf@oracle.com> References: <0a7cbbe4-3613-bd5a-962b-78dee06dde8a@oracle.com> <1fcf8439-2b5e-a776-9142-d9dc14a58faf@oracle.com> Message-ID: <0758d1d1-b74b-31e6-82fb-a1223a3481f7@oracle.com> Hi, Please review this new webrev that also makes Thread* the first argument to the relevant MutexLocker and MonitorLocker constructors as requested by Coleen. Updated Webrev: http://cr.openjdk.java.net/~hseigel/bug_8235678.2/webrev/index.html Thanks, Harold On 1/14/2020 11:24 AM, coleen.phillimore at oracle.com wrote: > > Hi Harold, > > I really wanted this change to move Thread to the first argument like > many of the other calls in the VM that take THREAD as an argument. > > Written like this: > > + MutexLocker mu(Threads_lock, THREAD); > > > It's too easy for someone who's cut/pasting to think the last THREAD > argument should really be CHECK, which is completely wrong. > > Can you switch the arguments? > > Thanks, > Coleen > > On 1/14/20 9:00 AM, Harold Seigel wrote: >> Hi, >> >> Please review this small change, to reduce unnecessary calls to >> Thread::current() in MutexLocker calls, by passing the current thread >> as an argument. A few ResourceMark declarations were also changed. >> >> Open Webrev: >> http://cr.openjdk.java.net/~hseigel/bug_8235678/webrev/index.html >> >> JBS Bug: https://bugs.openjdk.java.net/browse/JDK-8235678 >> >> The fix was regression tested by running Mach5 tiers 1 and 2 tests >> and builds on Linux-x64, Solaris, Windows, and Mac OS X, by running >> Mach5 tiers 3-5 tests on Linux-x64, and JCK lang and VM tests on >> Linux-x64.
>> >> Thanks, Harold >> > From volker.simonis at gmail.com Wed Jan 15 13:40:02 2020 From: volker.simonis at gmail.com (Volker Simonis) Date: Wed, 15 Jan 2020 05:40:02 -0800 Subject: serviceability agent : problems when using gcc LTO (link time optimization) In-Reply-To: <9908f346-f376-5bfc-8d5b-bc250e336032@bell-sw.com> References: <131b1189-d0e8-3d3e-6137-81ffc8eeeb84@oracle.com> <4fdc39f0-9ed3-5878-93b2-b536b8779125@oracle.com> <249cba82-1469-03f8-d205-e5895eab2cc3@bell-sw.com> <9908f346-f376-5bfc-8d5b-bc250e336032@bell-sw.com> Message-ID: While we are speaking about all the drawbacks of LTO, it's still not clear what the benefits are? In the very first mail Matthias mentioned that there might be performance improvements but that performance is not the main driving factor behind this initiative. So is it the reduced code size (Matthias mentioned something around ~10%)? It would be nice to see some real numbers on various platform for both, the performance improvements for native parts like JIT/GC as well as for the size reduction. Aleksei Voitylov schrieb am Di., 14. Jan. 2020, 09:54: > > On 14/01/2020 19:57, Baesken, Matthias wrote: > > Hello Magnus and Aleksei, thanks for the input . > > > > The times you provided really look like they make a big difference at > least for people often building minimal-vm . > > Guess I have to measure myself a bit (maybe the difference is not that > big on our linux s390x / ppc64(le) ) . > > > >> If the change to enable lto by default is proposed, what would be the > >> recommended strategy for development? > >> > > Probably we should a) do not enable it by default but just make sure > it can be enabled easily and works for the minimal-vm > That would be welcome. I have high hopes to LTO the VM some time by > default, and the tendency observed is that the compiler time overhead > for GCC becomes smaller. 
At the same time there is no reason why vendors > that invested in testing and can absorb the build time hit could provide > binaries with LTO built VMs by passing an additional option flag. > > or b) take it easy to disable it for local development. > > > > Best regards, Matthias > > > > > > > >> Magnus, Matthias, > >> > >> for me, lto is a little heavyweight for development. x86_64 build time > >> with gcc 7: > >> > >> Server 1m32.484s > >> Server+Minimal 1m42.166s > >> Server+Minimal (--with-jvm-features="link-time-opt") 5m29.422s > >> > >> If the change to enable lto by default is proposed, what would be the > >> recommended strategy for development? > >> > >> For ARM32 Minimal, please keep in mind that it's not uncommon to disable > >> LTO plugin in commodity ARM32 gcc compiler distributions, so for some it > >> does not matter what settings we have in OpenJDK. I believe there could > >> be other reasons for that on top of build time (bugs?). > >> > > From erik.joelsson at oracle.com Wed Jan 15 14:06:59 2020 From: erik.joelsson at oracle.com (Erik Joelsson) Date: Wed, 15 Jan 2020 06:06:59 -0800 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: References: Message-ID: <998c4a08-5670-f51d-4625-9c7e984b4b5d@oracle.com> Given the discussion regarding lto on hotspot and the extreme increased build time, have you noticed any difference in build times with this patch? /Erik On 2020-01-15 00:27, Baesken, Matthias wrote: > Hi Erik, thanks for the review and for forwarding , you are correct corelibs-dev is probably interested in this as well . > > Best regards, Matthias > > >> (adding core-libs-dev) >> >> Change looks good to me, but would like input from at least someone in >> core-libs. >> >> /Erik >> >> On 2020-01-14 06:07, Baesken, Matthias wrote: >>> Hello, the following change enables the link-time section-gc for linux . >>> >>> gcc and ld support enabling "garbage collection" of unused input sections. 
>>> This can be used to eliminate unused code from native libraries >> (especially when already compiling the objects with compiler flags -ffunction-sections >> -fdata-sections). >>> See for details the --gc-sections and --print-gc-sections parts of the ld >> documentation : >>> https://linux.die.net/man/1/ld >>> >>> >>> We had this enabled already for linux s390x , with >> https://bugs.openjdk.java.net/browse/JDK-8234525 >>> 8234525: enable link-time section-gc for linux s390x to remove unused code >> . >>> This time we enable it too for the other linux platforms . >>> >>> For the other platforms I do not enable it for JVM, just for the JDK libs. The >> reason is that the serviceability agent (not supported on linux s390x ) is not >> (yet) ready for the optimization . >>> Below you see the results , for some libraries a significant size reduction >> can be achieved . >>> >>> Results from linux x86_64 product builds : >>> >>> without / with ltgc >>> >>> 320K / 300K /images/jdk/lib/libsunec.so <------------------------- >>> 36K / 36K /images/jdk/lib/libdt_socket.so >>> 280K / 276K /images/jdk/lib/libjdwp.so >>> 23M / 23M /images/jdk/lib/server/libjvm.so <---- not set for libjvm.so >> for x86_64 >>> 16K / 16K /images/jdk/lib/server/libjsig.so >>> 72K / 72K /images/jdk/lib/libverify.so >>> 84K / 84K /images/jdk/lib/libjli.so >>> 16K / 16K /images/jdk/lib/libjsig.so >>> 196K / 196K /images/jdk/lib/libjava.so >>> 44K / 44K /images/jdk/lib/libzip.so >>> 144K / 136K /images/jdk/lib/libjimage.so >>> 112K / 112K /images/jdk/lib/libnet.so >>> 100K / 100K /images/jdk/lib/libnio.so >>> 36K / 36K /images/jdk/lib/libsctp.so >>> 576K / 556K /images/jdk/lib/libmlib_image.so >>> 752K / 752K /images/jdk/lib/libawt.so >>> 260K / 252K /images/jdk/lib/libjavajpeg.so >>> 784K / 784K /images/jdk/lib/libfreetype.so >>> 368K / 236K /images/jdk/lib/libsplashscreen.so <------------------------- >>> 88K / 88K /images/jdk/lib/libjsound.so >>> 472K / 468K /images/jdk/lib/libawt_xawt.so >>>
564K / 404K /images/jdk/lib/liblcms.so <-------------------------- >>> 48K / 48K /images/jdk/lib/libawt_headless.so >>> 12K / 12K /images/jdk/lib/libjawt.so >>> 1.5M / 900K /images/jdk/lib/libfontmanager.so <------------------------------ >>> 12K / 12K /images/jdk/lib/libjaas.so >>> 92K / 92K /images/jdk/lib/libj2pkcs11.so >>> 16K / 16K /images/jdk/lib/libattach.so >>> 8.0K / 8.0K /images/jdk/lib/librmi.so >>> 56K / 56K /images/jdk/lib/libinstrument.so >>> 16K / 16K /images/jdk/lib/libprefs.so >>> 52K / 52K /images/jdk/lib/libj2gss.so >>> 12K / 12K /images/jdk/lib/libmanagement_agent.so >>> 36K / 32K /images/jdk/lib/libmanagement.so >>> 16K / 16K /images/jdk/lib/libextnet.so >>> 20K / 20K /images/jdk/lib/libj2pcsc.so >>> 40K / 40K /images/jdk/lib/libmanagement_ext.so >>> 60K / 60K /images/jdk/lib/libsaproc.so >>> >>> >>> Bug/webrev : >>> >>> https://bugs.openjdk.java.net/browse/JDK-8236714 >>> >>> http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.2/ >>> >>> >>> Thanks, Matthias From aleksei.voitylov at bell-sw.com Wed Jan 15 14:57:11 2020 From: aleksei.voitylov at bell-sw.com (Aleksei Voitylov) Date: Wed, 15 Jan 2020 17:57:11 +0300 Subject: serviceability agent : problems when using gcc LTO (link time optimization) In-Reply-To: References: <131b1189-d0e8-3d3e-6137-81ffc8eeeb84@oracle.com> <4fdc39f0-9ed3-5878-93b2-b536b8779125@oracle.com> <249cba82-1469-03f8-d205-e5895eab2cc3@bell-sw.com> <9908f346-f376-5bfc-8d5b-bc250e336032@bell-sw.com> Message-ID: <58b2c73c-c49e-49cf-71fc-7d6c2225b880@bell-sw.com> Volker, not a full answer, but here is some static size stats:

Server     x86_64   AArch64
regular    23M      20M
lto        17M      14M

Minimal    x86_64   AArch64
regular    4.9M     3.9M
lto        4.7M     3.6M

-Aleksei On 15/01/2020 16:40, Volker Simonis wrote: > While we are speaking about all the drawbacks of LTO, it's still not > clear what the benefits are?
In the very first mail Matthias mentioned > that there might be performance improvements but that performance is > not the main driving factor behind this initiative. So is it the > reduced code size (Matthias mentioned something around ~10%)? > > It would be nice to see some real numbers on various platform for > both, the performance improvements for native parts like JIT/GC as > well as for the size reduction. > > Aleksei Voitylov > schrieb am Di., 14. Jan. 2020, > 09:54: > > > On 14/01/2020 19:57, Baesken, Matthias wrote: > > Hello Magnus and Aleksei, thanks for the input . > > > > The times you provided really look like they make a big > difference at least for people often building minimal-vm . > > Guess I have to measure myself a bit (maybe the difference is > not that big on our linux s390x / ppc64(le) ) . > > > >> If the change to enable lto by default is proposed, what would > be the > >> recommended strategy for development? > >> > > Probably we should a) do not enable it by default but just > make sure it can be enabled easily and works for the minimal-vm > That would be welcome. I have high hopes to LTO the VM some time by > default, and the tendency observed is that the compiler time overhead > for GCC becomes smaller. At the same time there is no reason why > vendors > that invested in testing and can absorb the build time hit could > provide > binaries with LTO built VMs by passing an additional option flag. > > or b) take it easy to disable it for local development. > > > > Best regards, Matthias > > > > > > > >> Magnus, Matthias, > >> > >> for me, lto is a little heavyweight for development. x86_64 > build time > >> with gcc 7: > >> > >> Server 1m32.484s > >> Server+Minimal 1m42.166s > >> Server+Minimal (--with-jvm-features="link-time-opt") 5m29.422s > >> > >> If the change to enable lto by default is proposed, what would > be the > >> recommended strategy for development?
> >> > >> For ARM32 Minimal, please keep in mind that it's not uncommon > to disable > >> LTO plugin in commodity ARM32 gcc compiler distributions, so > for some it > >> does not matter what settings we have in OpenJDK. I believe > there could > >> be other reasons for that on top of build time (bugs?). > >> > From matthias.baesken at sap.com Wed Jan 15 15:02:37 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Wed, 15 Jan 2020 15:02:37 +0000 Subject: serviceability agent : problems when using gcc LTO (link time optimization) In-Reply-To: References: <131b1189-d0e8-3d3e-6137-81ffc8eeeb84@oracle.com> <4fdc39f0-9ed3-5878-93b2-b536b8779125@oracle.com> <249cba82-1469-03f8-d205-e5895eab2cc3@bell-sw.com> <9908f346-f376-5bfc-8d5b-bc250e336032@bell-sw.com> Message-ID: Hello , I can comment on the code size . This is what I get when comparing a build without and with -flto . gcc7 linux x86_64 product build, normal / with -flto ---------------------------------------------------------------------------------- du -sh on the *.so files gives : 16K / 16K ./lib/libattach.so 48K / 44K ./lib/libawt_headless.so 752K / 760K ./lib/libawt.so <------------------ this one gets a bit larger with flto 472K / 456K ./lib/libawt_xawt.so <------------------ small gain 36K / 32K ./lib/libdt_socket.so 16K /16K ./lib/libextnet.so 1.5M / 824K ./lib/libfontmanager.so <------------------ HUGE gain 784K / 792K ./lib/libfreetype.so <------------------ this one gets a bit larger with flto 56K / 56K ./lib/libinstrument.so 52K / 52K ./lib/libj2gss.so 20K / 20K ./lib/libj2pcsc.so 92K / 84K ./lib/libj2pkcs11.so 12K / 12k ./lib/libjaas.so 260K / 244K ./lib/libjavajpeg.so <----------------- small gain 196K / 188K ./lib/libjava.so 12K / 12K ./lib/libjawt.so 280K / 256K ./lib/libjdwp.so <----------------- small gain 144K / 140K ./lib/libjimage.so 84K / 76K ./lib/libjli.so 16K / 16K ./lib/libjsig.so 88K / 80K ./lib/libjsound.so 564K / 420K ./lib/liblcms.so <----------------- large gain 12K / 12K 
./lib/libmanagement_agent.so 40K / 36K ./lib/libmanagement_ext.so 36K / 32K ./lib/libmanagement.so 576K / 496K ./lib/libmlib_image.so <----------------- large gain 112K / 108K ./lib/libnet.so 100K / 100K ./lib/libnio.so 16K / 16K ./lib/libprefs.so 8.0K / 8.0K ./lib/librmi.so 60K / 60K ./lib/libsaproc.so 36K / 32K ./lib/libsctp.so 368K / 212K ./lib/libsplashscreen.so <----------------- large gain 320K / 296K ./lib/libsunec.so <----------------- medium gain 72K / 72K ./lib/libverify.so 44K / 44K ./lib/libzip.so 16K / 16K ./lib/server/libjsig.so 23M / 17M ./lib/server/libjvm.so <----------------- big gain maybe because it is C++ ? So for some libs you see 10% and more , but not for all . But most large libs like libjvm.so, libfontmanager.so or liblcms.so we see good results regarding reduced code size. I Cannot say much about performance improvements , probably it would be small . For SPEC you find something at http://hubicka.blogspot.com/2019/05/gcc-9-link-time-and-inter-procedural.html (not that these results would say too much about JVM performance ). Best regards, Matthias From: Volker Simonis Sent: Mittwoch, 15. Januar 2020 14:40 To: Aleksei Voitylov Cc: Baesken, Matthias ; Magnus Ihse Bursie ; serviceability-dev at openjdk.java.net; build-dev ; hotspot-dev at openjdk.java.net Subject: Re: serviceability agent : problems when using gcc LTO (link time optimization) While we are speaking about all the drawbacks of LTO, it's still not clear what the benefits are? In the very first mail Matthias mentioned that there might be performance improvements but that performance is not the main driving factor behind this initiative. So is it the reduced code size (Matthias mentioned something around ~10%)? It would be nice to see some real numbers on various platform for both, the performance improvements for native parts like JIT/GC as well as for the size reduction. Aleksei Voitylov > schrieb am Di., 14. Jan. 
2020, 09:54: On 14/01/2020 19:57, Baesken, Matthias wrote: > Hello Magnus and Aleksei, thanks for the input . > > The times you provided really look like they make a big difference at least for people often building minimal-vm . > Guess I have to measure myself a bit (maybe the difference is not that big on our linux s390x / ppc64(le) ) . > >> If the change to enable lto by default is proposed, what would be the >> recommended strategy for development? >> > Probably we should a) do not enable it by default but just make sure it can be enabled easily and works for the minimal-vm That would be welcome. I have high hopes to LTO the VM some time by default, and the tendency observed is that the compiler time overhead for GCC becomes smaller. At the same time there is no reason why vendors that invested in testing and can absorb the build time hit could provide binaries with LTO built VMs by passing an additional option flag. > or b) take it easy to disable it for local development. > > Best regards, Matthias > > > >> Magnus, Matthias, >> >> for me, lto is a little heavyweight for development. x86_64 build time >> with gcc 7: >> >> Server 1m32.484s >> Server+Minimal 1m42.166s >> Server+Minimal (--with-jvm-features="link-time-opt") 5m29.422s >> >> If the change to enable lto by default is proposed, what would be the >> recommended strategy for development? >> >> For ARM32 Minimal, please keep in mind that it's not uncommon to disable >> LTO plugin in commodity ARM32 gcc compiler distributions, so for some it >> does not matter what settings we have in OpenJDK. I believe there could >> be other reasons for that on top of build time (bugs?). 
>> From volker.simonis at gmail.com Wed Jan 15 15:29:48 2020 From: volker.simonis at gmail.com (Volker Simonis) Date: Wed, 15 Jan 2020 07:29:48 -0800 Subject: serviceability agent : problems when using gcc LTO (link time optimization) In-Reply-To: References: <131b1189-d0e8-3d3e-6137-81ffc8eeeb84@oracle.com> <4fdc39f0-9ed3-5878-93b2-b536b8779125@oracle.com> <249cba82-1469-03f8-d205-e5895eab2cc3@bell-sw.com> <9908f346-f376-5bfc-8d5b-bc250e336032@bell-sw.com> Message-ID: Aleksei, Matthias, thanks for the numbers. The size reduction on libjvm.so looks not bad, indeed. Do you know if newer versions of GCC use the gold linker by default? I remember from some experiments which I did many years ago that gold was considerably faster compared to the default ld linker. Unfortunately, the documentation I found about LTO/ld/gold [1,2] seems to be quite old and not very precise. Do you have gained any experience with LTO/gold and know if gold could maybe improve linking times with LTO? [1] https://gcc.gnu.org/wiki/LinkTimeOptimization [2] https://stackoverflow.com/questions/31688069/requirements-to-use-flto Baesken, Matthias schrieb am Mi., 15. Jan. 2020, 07:02: > Hello , I can comment on the code size . This is what I get when > comparing a build without and with -flto . 
> > > > gcc7 linux x86_64 product build, normal / with -flto > > > ---------------------------------------------------------------------------------- > > > > du -sh on the *.so files gives : > > > > 16K / 16K ./lib/libattach.so > > 48K / 44K ./lib/libawt_headless.so > > 752K / 760K ./lib/libawt.so <------------------ this one > gets a bit larger with flto > > 472K / 456K ./lib/libawt_xawt.so <------------------ small gain > > 36K / 32K ./lib/libdt_socket.so > > 16K /16K ./lib/libextnet.so > > 1.5M / 824K ./lib/libfontmanager.so <------------------ HUGE gain > > 784K / 792K ./lib/libfreetype.so <------------------ this one > gets a bit larger with flto > > 56K / 56K ./lib/libinstrument.so > > 52K / 52K ./lib/libj2gss.so > > 20K / 20K ./lib/libj2pcsc.so > > 92K / 84K ./lib/libj2pkcs11.so > > 12K / 12k ./lib/libjaas.so > > 260K / 244K ./lib/libjavajpeg.so <----------------- small gain > > 196K / 188K ./lib/libjava.so > > 12K / 12K ./lib/libjawt.so > > 280K / 256K ./lib/libjdwp.so <----------------- small gain > > 144K / 140K ./lib/libjimage.so > > 84K / 76K ./lib/libjli.so > > 16K / 16K ./lib/libjsig.so > > 88K / 80K ./lib/libjsound.so > > 564K / 420K ./lib/liblcms.so <----------------- large gain > > 12K / 12K ./lib/libmanagement_agent.so > > 40K / 36K ./lib/libmanagement_ext.so > > 36K / 32K ./lib/libmanagement.so > > 576K / 496K ./lib/libmlib_image.so <----------------- large gain > > 112K / 108K ./lib/libnet.so > > 100K / 100K ./lib/libnio.so > > 16K / 16K ./lib/libprefs.so > > 8.0K / 8.0K ./lib/librmi.so > > 60K / 60K ./lib/libsaproc.so > > 36K / 32K ./lib/libsctp.so > > 368K / 212K ./lib/libsplashscreen.so <----------------- large gain > > 320K / 296K ./lib/libsunec.so <----------------- medium gain > > 72K / 72K ./lib/libverify.so > > 44K / 44K ./lib/libzip.so > > 16K / 16K ./lib/server/libjsig.so > > 23M / 17M ./lib/server/libjvm.so <----------------- big gain > maybe because it is C++ ? > > > > > > So for some libs you see 10% and more , but not for all . 
But most > large libs like libjvm.so, libfontmanager.so or liblcms.so > we see good results regarding reduced code size. > > > > I Cannot say much about performance improvements , probably it would be > small . > > > > For SPEC you find something at > > > > > http://hubicka.blogspot.com/2019/05/gcc-9-link-time-and-inter-procedural.html > > > > (not that these results would say too much about JVM performance ). > > > > > > Best regards, Matthias > > > > *From:* Volker Simonis > *Sent:* Mittwoch, 15. Januar 2020 14:40 > *To:* Aleksei Voitylov > *Cc:* Baesken, Matthias ; Magnus Ihse Bursie < > magnus.ihse.bursie at oracle.com>; serviceability-dev at openjdk.java.net; > build-dev ; hotspot-dev at openjdk.java.net > *Subject:* Re: serviceability agent : problems when using gcc LTO (link > time optimization) > > > > While we are speaking about all the drawbacks of LTO, it's still not clear > what the benefits are? In the very first mail Matthias mentioned that there > might be performance improvements but that performance is not the main > driving factor behind this initiative. So is it the reduced code size > (Matthias mentioned something around ~10%)? > > > > It would be nice to see some real numbers on various platform for both, > the performance improvements for native parts like JIT/GC as well as for > the size reduction. > > Aleksei Voitylov schrieb am Di., 14. Jan. > 2020, 09:54: > > > On 14/01/2020 19:57, Baesken, Matthias wrote: > > Hello Magnus and Aleksei, thanks for the input . > > > > The times you provided really look like they make a big difference at > least for people often building minimal-vm . > > Guess I have to measure myself a bit (maybe the difference is not that > big on our linux s390x / ppc64(le) ) . > > > >> If the change to enable lto by default is proposed, what would be the > >> recommended strategy for development? 
> >> > > Probably we should a) do not enable it by default but just make sure > it can be enabled easily and works for the minimal-vm > That would be welcome. I have high hopes to LTO the VM some time by > default, and the tendency observed is that the compiler time overhead > for GCC becomes smaller. At the same time there is no reason why vendors > that invested in testing and can absorb the build time hit could provide > binaries with LTO built VMs by passing an additional option flag. > > or b) take it easy to disable it for local development. > > > > Best regards, Matthias > > > > > > > >> Magnus, Matthias, > >> > >> for me, lto is a little heavyweight for development. x86_64 build time > >> with gcc 7: > >> > >> Server 1m32.484s > >> Server+Minimal 1m42.166s > >> Server+Minimal (--with-jvm-features="link-time-opt") 5m29.422s > >> > >> If the change to enable lto by default is proposed, what would be the > >> recommended strategy for development? > >> > >> For ARM32 Minimal, please keep in mind that it's not uncommon to disable > >> LTO plugin in commodity ARM32 gcc compiler distributions, so for some it > >> does not matter what settings we have in OpenJDK. I believe there could > >> be other reasons for that on top of build time (bugs?). > >> > > From erik.joelsson at oracle.com Wed Jan 15 15:47:57 2020 From: erik.joelsson at oracle.com (Erik Joelsson) Date: Wed, 15 Jan 2020 07:47:57 -0800 Subject: serviceability agent : problems when using gcc LTO (link time optimization) In-Reply-To: References: <131b1189-d0e8-3d3e-6137-81ffc8eeeb84@oracle.com> <4fdc39f0-9ed3-5878-93b2-b536b8779125@oracle.com> <249cba82-1469-03f8-d205-e5895eab2cc3@bell-sw.com> <9908f346-f376-5bfc-8d5b-bc250e336032@bell-sw.com> Message-ID: <824316f1-43f6-b61b-c764-ad0c0996b325@oracle.com> On 2020-01-15 07:29, Volker Simonis wrote: > Do you know if newer versions of GCC use the gold linker by default? 
I > remember from some experiments which I did many years ago that gold was > considerably faster compared to the default ld linker. The default linker is system configured so it depends on your Linux distro. The devkits generated by the current devkit makefiles configure gold as default. /Erik From matthias.baesken at sap.com Wed Jan 15 16:00:38 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Wed, 15 Jan 2020 16:00:38 +0000 Subject: serviceability agent : problems when using gcc LTO (link time optimization) In-Reply-To: <824316f1-43f6-b61b-c764-ad0c0996b325@oracle.com> References: <131b1189-d0e8-3d3e-6137-81ffc8eeeb84@oracle.com> <4fdc39f0-9ed3-5878-93b2-b536b8779125@oracle.com> <249cba82-1469-03f8-d205-e5895eab2cc3@bell-sw.com> <9908f346-f376-5bfc-8d5b-bc250e336032@bell-sw.com> <824316f1-43f6-b61b-c764-ad0c0996b325@oracle.com> Message-ID: Hello, I used the "normal" linker, so I think what https://stackoverflow.com/questions/31688069/requirements-to-use-flto says is true: one can also use the "normal" linker. I haven't checked for any performance (or other) improvements when using gold instead. Best regards, Matthias > On 2020-01-15 07:29, Volker Simonis wrote: > > Do you know if newer versions of GCC use the gold linker by default? I > > remember from some experiments which I did many years ago that gold > was > > considerably faster compared to the default ld linker. > > The default linker is system configured so it depends on your Linux > distro. The devkits generated by the current devkit makefiles configure > gold as default.
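[For anyone who wants to try this locally, here is a small sketch of how to check which linker gcc will invoke and how to opt into gold explicitly. It is illustrative only, not the OpenJDK build's mechanism; the `/tmp` file name is made up, and `ld.gold` availability varies by distro.]

```shell
# Ask gcc which ld it will actually run (reflects the distro/devkit default).
gcc -print-prog-name=ld
ld --version | head -n 1

# Opt into gold explicitly for an LTO link; fall back gracefully if
# ld.gold is not installed on this machine.
echo 'int main(void) { return 0; }' > /tmp/lto_probe.c
if gcc -flto -fuse-ld=gold /tmp/lto_probe.c -o /tmp/lto_probe 2>/dev/null; then
    echo "linked with gold"
else
    echo "ld.gold not available; plain ld with the LTO linker plugin also works"
fi
```

Both BFD ld and gold can perform `-flto` links through the linker plugin, which is consistent with what the Stack Overflow answer above says.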
> > /Erik > From matthias.baesken at sap.com Wed Jan 15 16:11:03 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Wed, 15 Jan 2020 16:11:03 +0000 Subject: serviceability agent : problems when using gcc LTO (link time optimization) In-Reply-To: References: <131b1189-d0e8-3d3e-6137-81ffc8eeeb84@oracle.com> <4fdc39f0-9ed3-5878-93b2-b536b8779125@oracle.com> <249cba82-1469-03f8-d205-e5895eab2cc3@bell-sw.com> <9908f346-f376-5bfc-8d5b-bc250e336032@bell-sw.com> Message-ID: Hello, here is another comparison for the larger JDK shared libs, this time with the sizes of build with linktime-gc (--gc-sections) added . ( just for the larger libs ) ( I had not enabled linktime-gc for libjvm in our test build , just for the JDK libs . ) Linuxx86_64 / gcc7 normal / with -flto / with linktime-gc (--gc-sections) ----------------------------------------------------------- 752K / 760K / 752K ./lib/libawt.so <------------------ this one gets a bit larger but only with flto 472K / 456K / 468K ./lib/libawt_xawt.so <------------------ small gain 1.5M / 824K / 900K ./lib/libfontmanager.so <------------------ HUGE gain , not as good with ltgc but still good 784K / 792K / 784K ./lib/libfreetype.so <------------------ this one gets a bit larger (but not with ltgc) 260K / 244K / 252K ./lib/libjavajpeg.so <----------------- small gain 196K / 188K / 196K ./lib/libjava.so 280K / 256K / 276K ./lib/libjdwp.so <----------------- small gain 144K / 140K / 136K ./lib/libjimage.so 564K / 420K / 404K ./lib/liblcms.so <----------------- large gain , even better with ltgc 576K / 496K / 556K ./lib/libmlib_image.so <----------------- large gain with flto , small one with ltgc 368K / 212K / 236K ./lib/libsplashscreen.so <----------------- large gain 320K / 296K / 300K ./lib/libsunec.so <----------------- medium gain 23M / 17M / --not enabled--- ./lib/server/libjvm.so <----------------- big gain maybe because it is C++ ? 
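[As an aside, the mechanism behind the link-time-gc column above can be sketched with a toy example (file and function names here are invented for illustration, assuming gcc and binutils are installed): `-ffunction-sections`/`-fdata-sections` give every function its own section, `--gc-sections` lets the linker drop the unreferenced ones, and `--print-gc-sections` reports what was eliminated.]

```shell
cat > /tmp/secgc.c <<'EOF'
int used(void)   { return 42; }
int unused(void) { return 13; }  /* never called from anywhere */
int main(void)   { return used(); }
EOF

# Link with section-gc; --print-gc-sections reports the dropped sections,
# among them .text.unused from our toy file.
gcc -ffunction-sections -fdata-sections /tmp/secgc.c \
    -Wl,--gc-sections,--print-gc-sections -o /tmp/secgc 2>&1 | grep '\.text\.unused'

# Confirm the symbol really is gone from the final binary.
if nm /tmp/secgc | grep -q ' unused$'; then
    echo "unused() still present"
else
    echo "unused() eliminated"
fi
```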
So one can see, that flto is usually a bit better than link-time-gc when it comes to improving lib sizes, but not always . However linktime-gc seems to be faster when comparing build times , I did not really notice much build time slowdown because of it . ( we have it enabled for linux s390x for some time in OpenJDK ). The linktime-gc also offers a nice feature to print out the eliminated stuff , that can be used to remove unused code cross-platform . e.g. the removed symbols from https://bugs.openjdk.java.net/browse/JDK-8234629 has been found this way . Best regards, Matthias Aleksei, Matthias, thanks for the numbers. The size reduction on libjvm.so looks not bad, indeed. Do you know if newer versions of GCC use the gold linker by default? I remember from some experiments which I did many years ago that gold was considerably faster compared to the default ld linker. Unfortunately, the documentation I found about LTO/ld/gold [1,2] seems to be quite old and not very precise. Do you have gained any experience with LTO/gold and know if gold could maybe improve linking times with LTO? [1] https://gcc.gnu.org/wiki/LinkTimeOptimization [2] https://stackoverflow.com/questions/31688069/requirements-to-use-flto Baesken, Matthias > schrieb am Mi., 15. Jan. 2020, 07:02: Hello , I can comment on the code size . This is what I get when comparing a build without and with -flto . 
gcc7 linux x86_64 product build, normal / with -flto ---------------------------------------------------------------------------------- du -sh on the *.so files gives : 16K / 16K ./lib/libattach.so 48K / 44K ./lib/libawt_headless.so 752K / 760K ./lib/libawt.so <------------------ this one gets a bit larger with flto 472K / 456K ./lib/libawt_xawt.so <------------------ small gain 36K / 32K ./lib/libdt_socket.so 16K /16K ./lib/libextnet.so 1.5M / 824K ./lib/libfontmanager.so <------------------ HUGE gain 784K / 792K ./lib/libfreetype.so <------------------ this one gets a bit larger with flto 56K / 56K ./lib/libinstrument.so 52K / 52K ./lib/libj2gss.so 20K / 20K ./lib/libj2pcsc.so 92K / 84K ./lib/libj2pkcs11.so 12K / 12k ./lib/libjaas.so 260K / 244K ./lib/libjavajpeg.so <----------------- small gain 196K / 188K ./lib/libjava.so 12K / 12K ./lib/libjawt.so 280K / 256K ./lib/libjdwp.so <----------------- small gain 144K / 140K ./lib/libjimage.so 84K / 76K ./lib/libjli.so 16K / 16K ./lib/libjsig.so 88K / 80K ./lib/libjsound.so 564K / 420K ./lib/liblcms.so <----------------- large gain 12K / 12K ./lib/libmanagement_agent.so 40K / 36K ./lib/libmanagement_ext.so 36K / 32K ./lib/libmanagement.so 576K / 496K ./lib/libmlib_image.so <----------------- large gain 112K / 108K ./lib/libnet.so 100K / 100K ./lib/libnio.so 16K / 16K ./lib/libprefs.so 8.0K / 8.0K ./lib/librmi.so 60K / 60K ./lib/libsaproc.so 36K / 32K ./lib/libsctp.so 368K / 212K ./lib/libsplashscreen.so <----------------- large gain 320K / 296K ./lib/libsunec.so <----------------- medium gain 72K / 72K ./lib/libverify.so 44K / 44K ./lib/libzip.so 16K / 16K ./lib/server/libjsig.so 23M / 17M ./lib/server/libjvm.so <----------------- big gain maybe because it is C++ ? So for some libs you see 10% and more , but not for all . But most large libs like libjvm.so, libfontmanager.so or liblcms.so we see good results regarding reduced code size. I Cannot say much about performance improvements , probably it would be small . 
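[The dead-code elimination behind the size numbers above can be reproduced in miniature. This is toy code with made-up names, assuming a reasonably recent gcc where -flto uses the linker plugin by default; the absolute numbers say nothing about the JDK libraries themselves.]

```shell
cat > /tmp/lto_lib.c <<'EOF'
static const char big_table[64 * 1024] = {1};   /* only helper() uses it */
int helper(void) { return big_table[0]; }
int api(void)    { return 7; }                  /* the only symbol main() calls */
EOF
cat > /tmp/lto_main.c <<'EOF'
int api(void);
int main(void) { return api(); }
EOF

gcc -O2       /tmp/lto_lib.c /tmp/lto_main.c -o /tmp/no_lto
gcc -O2 -flto /tmp/lto_lib.c /tmp/lto_main.c -o /tmp/with_lto

# With LTO the linker plugin sees that helper() (and its 64K table) is
# unreachable from main() and discards it; without LTO it is kept.
ls -l /tmp/no_lto /tmp/with_lto
```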
For SPEC you find something at http://hubicka.blogspot.com/2019/05/gcc-9-link-time-and-inter-procedural.html (not that these results would say too much about JVM performance ). Best regards, Matthias From: Volker Simonis > Sent: Mittwoch, 15. Januar 2020 14:40 To: Aleksei Voitylov > Cc: Baesken, Matthias >; Magnus Ihse Bursie >; serviceability-dev at openjdk.java.net; build-dev >; hotspot-dev at openjdk.java.net Subject: Re: serviceability agent : problems when using gcc LTO (link time optimization) While we are speaking about all the drawbacks of LTO, it's still not clear what the benefits are? In the very first mail Matthias mentioned that there might be performance improvements but that performance is not the main driving factor behind this initiative. So is it the reduced code size (Matthias mentioned something around ~10%)? It would be nice to see some real numbers on various platform for both, the performance improvements for native parts like JIT/GC as well as for the size reduction. Aleksei Voitylov > schrieb am Di., 14. Jan. 2020, 09:54: On 14/01/2020 19:57, Baesken, Matthias wrote: > Hello Magnus and Aleksei, thanks for the input . > > The times you provided really look like they make a big difference at least for people often building minimal-vm . > Guess I have to measure myself a bit (maybe the difference is not that big on our linux s390x / ppc64(le) ) . > >> If the change to enable lto by default is proposed, what would be the >> recommended strategy for development? >> > Probably we should a) do not enable it by default but just make sure it can be enabled easily and works for the minimal-vm That would be welcome. I have high hopes to LTO the VM some time by default, and the tendency observed is that the compiler time overhead for GCC becomes smaller. At the same time there is no reason why vendors that invested in testing and can absorb the build time hit could provide binaries with LTO built VMs by passing an additional option flag. 
> or b) take it easy to disable it for local development. > > Best regards, Matthias > > > >> Magnus, Matthias, >> >> for me, lto is a little heavyweight for development. x86_64 build time >> with gcc 7: >> >> Server 1m32.484s >> Server+Minimal 1m42.166s >> Server+Minimal (--with-jvm-features="link-time-opt") 5m29.422s >> >> If the change to enable lto by default is proposed, what would be the >> recommended strategy for development? >> >> For ARM32 Minimal, please keep in mind that it's not uncommon to disable >> LTO plugin in commodity ARM32 gcc compiler distributions, so for some it >> does not matter what settings we have in OpenJDK. I believe there could >> be other reasons for that on top of build time (bugs?). >> From matthias.baesken at sap.com Wed Jan 15 16:14:54 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Wed, 15 Jan 2020 16:14:54 +0000 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: <998c4a08-5670-f51d-4625-9c7e984b4b5d@oracle.com> References: <998c4a08-5670-f51d-4625-9c7e984b4b5d@oracle.com> Message-ID: Hi Erik, I did not notice slowdowns in our night makes . Looking at a specific test machine I use (x86_64, build JOBS hardwired set to 12 ) I get around 6 minutes build time with and without the feature . ( but you have to take into account that the link-time section-gc on x86_64 in my patch is only enabled for the smaller JDK libs and not libjvm.so ) Best regards, Matthias > > Given the discussion regarding lto on hotspot and the extreme increased > build time, have you noticed any difference in build times with this patch? > > /Erik > From david.holmes at oracle.com Wed Jan 15 22:13:29 2020 From: david.holmes at oracle.com (David Holmes) Date: Thu, 16 Jan 2020 08:13:29 +1000 Subject: [PING2!] 
RFR: 8230305: Cgroups v2: Container awareness In-Reply-To: <52110EA5-26C9-43F2-8C2A-21D4E03ED3CC@oracle.com> References: <072f66ee8c44034831b4e38f6470da4bff6edd07.camel@redhat.com> <7540a208e306ab957032b18178a53c6afa105d33.camel@redhat.com> <5eec97c04d86562346243c1db3832e86e13697a1.camel@redhat.com> <52110EA5-26C9-43F2-8C2A-21D4E03ED3CC@oracle.com> Message-ID: Hi Bob, Severin, On 15/01/2020 6:04 am, Bob Vandette wrote: > > Cgroup V2 is about to go mainstream this year for popular distros such as Oracle Linux 8, Redhat Linux 8 and Fedora, > so this fix is important to get into JDK 15 so we can start shaking out this support. > > Please take a look and help get this change reviewed. I've taken a look and the overall structure and approach seems fine. I can't attest to the details of cgroups v1 versus v2 but trust the experts. Reviewed. Please ensure all copyrights are updated to 2020. Bob: I assume you have tested this on our systems? Thanks, David > Thanks, > Bob Vandette > > > >> On Nov 29, 2019, at 4:04 AM, Severin Gehwolf wrote: >> >> On Fri, 2019-11-15 at 17:56 +0100, Severin Gehwolf wrote: >>> On Fri, 2019-11-08 at 15:21 +0100, Severin Gehwolf wrote: >>>> Hi Bob, >>>> >>>> On Wed, 2019-11-06 at 10:47 +0100, Severin Gehwolf wrote: >>>>> On Tue, 2019-11-05 at 16:54 -0500, Bob Vandette wrote: >>>>>> Severin, >>>>>> >>>>>> Thanks for taking on this cgroup v2 improvement. >>>>>> >>>>>> In general I like the implementation and the refactoring. The CachedMetric class is nice. >>>>>> We can add any metric we want to cache in a more general way. >>>>>> >>>>>> Is this the latest version of the webrev? >>>>>> >>>>>> http://cr.openjdk.java.net/~sgehwolf/webrevs/cgroupsv2-hotspot/03/webrev/src/hotspot/os/linux/cgroupV2Subsystem_linux.cpp.html >>>>>> >>>>>> It looks like you need to add the caching support for active_processor_count (JDK-8227006). >>>> [...] >>>>> I'll do a proper rebase ASAP.
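[For reviewers who want to see what the code under review has to parse, here is a quick probe of the two hierarchies. The interface file names are the standard kernel ones, but which files exist depends on the host or container this runs in; in particular, the root cgroup omits most controller files, so the `cat` below may print nothing outside a container.]

```shell
if [ -f /sys/fs/cgroup/cgroup.controllers ]; then
    echo "cgroups v2 (unified hierarchy)"
    # v2 replacements for the v1 files the JDK used to read:
    #   memory.limit_in_bytes      -> memory.max  ("max" means unlimited)
    #   cpu.cfs_quota_us/period_us -> cpu.max     ("<quota> <period>")
    #   cpu.shares                 -> cpu.weight  (rescaled, not written verbatim)
    cat /sys/fs/cgroup/memory.max /sys/fs/cgroup/cpu.max \
        /sys/fs/cgroup/cpu.weight 2>/dev/null
else
    echo "cgroups v1 (legacy or hybrid hierarchy)"
fi
```

The `cpu.weight` rescaling is why TestCPUAwareness.java can no longer expect the CPU Shares value to appear verbatim in the cgroup filesystem under v2 runtimes such as crun.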
>>>> >>>> Latest webrev: >>>> http://cr.openjdk.java.net/~sgehwolf/webrevs/cgroupsv2-hotspot/05/webrev/ >>>> >>>>>> I'm not sure it's worth providing different strings for Unlimited versus Max or Scaled shares. >>>>>> I'd just try to be compatible with the cgroupv2 output so you don't have to change the test. >>>>> >>>>> OK. Will do. >>>> >>>> Unfortunately, there is no way of NOT changing TestCPUAwareness.java as >>>> it expects CPU Shares to be written to the cgroup filesystem verbatim. >>>> That's no longer the case for cgroups v2 (at least for crun). Either >>>> way, most test changes are gone now. >>>> >>>>>> I wonder if it's worth trying to synthesize memory_max_usage_in_bytes() by keeping the highest >>>>>> value ever returned by the API. >>>>> >>>>> Interesting idea. I'll ponder this a bit and get back to you. >>>> >>>> This has been implemented. I'm not sure this is correct, though. It >>>> merely piggy-backs on calls to memory_usage_in_bytes() and keeps the >>>> high watermark value of that. >>>> >>>> Testing passed on F31 with cgroups v2 controllers properly configured >>>> (podman) and hybrid (legacy hierarchy) with docker/podman. >>>> >>>> Thoughts? >>> >>> Ping? >> >> Anyone willing to review this? It would be nice to make some progress. >> >> Thanks, >> Severin >> >>> Metrics work proposed for RFR here: >>> http://mail.openjdk.java.net/pipermail/core-libs-dev/2019-November/063464.html >>> >>> Thanks, >>> Severin >> > From david.holmes at oracle.com Wed Jan 15 22:25:09 2020 From: david.holmes at oracle.com (David Holmes) Date: Thu, 16 Jan 2020 08:25:09 +1000 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: References: Message-ID: Hi Matthias, I have reservations about turning this on by default and with no way to control it. This seems like it should be something you have to opt in to initially while we gain some experience with it and ensure there are no unexpected side-effects.
After that it could be enabled by default. David On 15/01/2020 12:07 am, Baesken, Matthias wrote: > Hello, the following change enables the link-time section-gc for linux . > > gcc and ld support enabling "garbage collection" of unused input sections. > This can be used to eliminate unused coding from native libraries (especially when already compiling the objects with compiler flags -ffunction-sections -fdata-sections . > See for details the --gc-sections and --print-gc-sections parts of the ld documentation : > > https://linux.die.net/man/1/ld > > > We had this enabled already for linux s390x , with https://bugs.openjdk.java.net/browse/JDK-8234525 > 8234525: enable link-time section-gc for linux s390x to remove unused code . > > This time we enable it too for the other linux platforms . > > For the other platforms I do not enable it for JVM, just for the JDK libs. The reason is that the serviceability agent (not supported on linux s390x ) is not (yet) ready for the optimization . > Below you see the results , for some libraries a significant size reduction can be achieved . 
> > > Results from linux x86_64 product builds : > > without / with ltgc > > 320K / 300K /images/jdk/lib/libsunec.so <------------------------- > 36K / 36K /images/jdk/lib/libdt_socket.so > 280K / 276K /images/jdk/lib/libjdwp.so > 23M / 23M /images/jdk/lib/server/libjvm.so <---- not set for libjvm.so for x86_64 > 16K / 16K /images/jdk/lib/server/libjsig.so > 72K / 72K /images/jdk/lib/libverify.so > 84K / 84K /images/jdk/lib/libjli.so > 16K / 16K /images/jdk/lib/libjsig.so > 196K / 196K /images/jdk/lib/libjava.so > 44K / 44K /images/jdk/lib/libzip.so > 144K / 136K /images/jdk/lib/libjimage.so > 112K / 112K /images/jdk/lib/libnet.so > 100K / 100K /images/jdk/lib/libnio.so > 36K / 36K /images/jdk/lib/libsctp.so > 576K / 556K /images/jdk/lib/libmlib_image.so > 752K / 752K /images/jdk/lib/libawt.so > 260K / 252K /images/jdk/lib/libjavajpeg.so > 784K / 784K /images/jdk/lib/libfreetype.so > 368K / 236K /images/jdk/lib/libsplashscreen.so <------------------------- > 88K / 88K /images/jdk/lib/libjsound.so > 472K / 468K /images/jdk/lib/libawt_xawt.so > 564K / 404K /images/jdk/lib/liblcms.so <-------------------------- > 48K / 48K /images/jdk/lib/libawt_headless.so > 12K / 12K /images/jdk/lib/libjawt.so > 1.5M / 900K /images/jdk/lib/libfontmanager.so <------------------------------ > 12K / 12K /images/jdk/lib/libjaas.so > 92K / 92K /images/jdk/lib/libj2pkcs11.so > 16K / 16K /images/jdk/lib/libattach.so > 8.0K / 8.0K /images/jdk/lib/librmi.so > 56K / 56K /images/jdk/lib/libinstrument.so > 16K / 16K /images/jdk/lib/libprefs.so > 52K / 52K /images/jdk/lib/libj2gss.so > 12K / 12K /images/jdk/lib/libmanagement_agent.so > 36K / 32K /images/jdk/lib/libmanagement.so > 16K / 16K /images/jdk/lib/libextnet.so > 20K / 20K /images/jdk/lib/libj2pcsc.so > 40K / 40K /images/jdk/lib/libmanagement_ext.so > 60K / 60K /images/jdk/lib/libsaproc.so > > > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8236714 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.2/ > > > Thanks,
Matthias > From david.holmes at oracle.com Wed Jan 15 23:19:21 2020 From: david.holmes at oracle.com (David Holmes) Date: Thu, 16 Jan 2020 09:19:21 +1000 Subject: RFR 8235678: Remove unnecessary calls to Thread::current() in MutexLocker calls In-Reply-To: <0758d1d1-b74b-31e6-82fb-a1223a3481f7@oracle.com> References: <0a7cbbe4-3613-bd5a-962b-78dee06dde8a@oracle.com> <1fcf8439-2b5e-a776-9142-d9dc14a58faf@oracle.com> <0758d1d1-b74b-31e6-82fb-a1223a3481f7@oracle.com> Message-ID: Hi Harold, That all seems fine to me. Thanks, David On 15/01/2020 10:57 pm, Harold Seigel wrote: > Hi, > > Please review this new webrev that also makes Thread* the first argument > to the relevant MutexLocker and MonitorLocker constructors as requested > by Coleen. > > Updated Webrev: > http://cr.openjdk.java.net/~hseigel/bug_8235678.2/webrev/index.html > > Thanks, Harold > > On 1/14/2020 11:24 AM, coleen.phillimore at oracle.com wrote: >> >> Hi Harold, >> >> I really wanted this change to move Thread to the first argument like >> many of the other calls in the VM that take THREAD as an argument. >> >> Written like this: >> >> + MutexLocker mu(Threads_lock, THREAD); >> >> >> It's too easy for someone who's cut/pasting to think the last THREAD >> argument should really be CHECK, which is completely wrong. >> >> Can you switch the arguments? >> >> Thanks, >> Coleen >> >> On 1/14/20 9:00 AM, Harold Seigel wrote: >>> Hi, >>> >>> Please review this small change, to reduce unnecessary calls to >>> Thread::current() in MutexLocker calls, by passing the current thread >>> as an argument. A few ResourceMark declarations were also changed.
>>> >>> Open Webrev: >>> http://cr.openjdk.java.net/~hseigel/bug_8235678/webrev/index.html >>> >>> JBS Bug: https://bugs.openjdk.java.net/browse/JDK-8235678 >>> >>> The fix was regression tested by running Mach5 tiers 1 and 2 tests >>> and builds on Linux-x64, Solaris, Windows, and Mac OS X, by running >>> Mach5 tiers 3-5 tests on Linux-x64, and JCK lang and VM tests on >>> Linux-x64. >>> >>> Thanks, Harold >>> >> From coleen.phillimore at oracle.com Wed Jan 15 23:51:51 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Wed, 15 Jan 2020 18:51:51 -0500 Subject: RFR 8235678: Remove unnecessary calls to Thread::current() in MutexLocker calls In-Reply-To: <0758d1d1-b74b-31e6-82fb-a1223a3481f7@oracle.com> References: <0a7cbbe4-3613-bd5a-962b-78dee06dde8a@oracle.com> <1fcf8439-2b5e-a776-9142-d9dc14a58faf@oracle.com> <0758d1d1-b74b-31e6-82fb-a1223a3481f7@oracle.com> Message-ID: <1f4d63f0-ac4f-7b4a-cad2-5f9d35803b4f@oracle.com> This looks really good! Thanks Harold. I wonder if we see any reduction in instructions in startup, since some similar change did. Maybe we can get Claes to measure it. :) Thanks, Coleen On 1/15/20 7:57 AM, Harold Seigel wrote: > Hi, > > Please review this new webrev that also makes Thread* the first > argument to the relevant MutexLocker and MonitorLocker constructors as > requested by Coleen. > > Updated Webrev: > http://cr.openjdk.java.net/~hseigel/bug_8235678.2/webrev/index.html > > Thanks, Harold > > On 1/14/2020 11:24 AM, coleen.phillimore at oracle.com wrote: >> >> Hi Harold, >> >> I really wanted this change to move Thread to the first argument like >> many of the other calls in the VM that take THREAD as an argument. >> >> Written like this: >> >> + MutexLocker mu(Threads_lock, THREAD); >> >> >> It's too easy for someone who's cut/pasting to think the last THREAD >> argument should really be CHECK, which is completely wrong. >> >> Can you switch the arguments?
>> >> Thanks, >> Coleen >> >> On 1/14/20 9:00 AM, Harold Seigel wrote: >>> Hi, >>> >>> Please review this small change, to reduce unnecessary calls to >>> Thread::current() in MutexLocker calls, by passing the current >>> thread as an argument. A few ResourceMark declarations were also >>> changed. >>> >>> Open Webrev: >>> http://cr.openjdk.java.net/~hseigel/bug_8235678/webrev/index.html >>> >>> JBS Bug: https://bugs.openjdk.java.net/browse/JDK-8235678 >>> >>> The fix was regression tested by running Mach5 tiers 1 and 2 tests >>> and builds on Linux-x64, Solaris, Windows, and Mac OS X, by running >>> Mach5 tiers 3-5 tests on Linux-x64, and JCK lang and VM tests on >>> Linux-x64. >>> >>> Thanks, Harold >>> >> From david.holmes at oracle.com Thu Jan 16 04:26:02 2020 From: david.holmes at oracle.com (David Holmes) Date: Thu, 16 Jan 2020 14:26:02 +1000 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: References: <160821DF-BE15-435C-BDBE-AB553F794590@oracle.com> Message-ID: <3b9fd351-6222-b425-fada-382edc4799ff@oracle.com> While awaiting further comments/feedback I've made some updates: - added comments to existing use of javaTimeMillis() where it wasn't (fairly) obvious why it was being used - fixed now incorrect comment pointed out by Kim - copied os_perf_linux.cpp changes to os_perf_aix.cpp (optimistically hoping for a thumbs up from Erik G. on this part :) ). Full and incr webrevs at: http://cr.openjdk.java.net/~dholmes/8235741/webrev.v2/ http://cr.openjdk.java.net/~dholmes/8235741/webrev.v2-incr/ Thanks, David On 15/01/2020 5:12 pm, David Holmes wrote: > Hi Kim, > > Thanks for taking a look at this.
> > On 15/01/2020 4:24 pm, Kim Barrett wrote: >>> On Jan 13, 2020, at 2:13 AM, David Holmes >>> wrote: >>> >>> webrev: http://cr.openjdk.java.net/~dholmes/8235741/webrev/ >>> bug: https://bugs.openjdk.java.net/browse/JDK-8235741 >>> >>> Full details in the bug report about the existing uses of >>> javaTimeMillis(), many of which just want an elapsed time in ms and >>> so should be using javaTimeNanos() and convert to ms. This covers >>> areas all across the VM. >>> >>> Only non-simple change is in os_perf_linux.cpp (and the same code >>> will be in os_perf_aix.cpp once it has been validated). There we are >>> tracking an elapsed time in ms but relative to the boot time, which >>> is seconds since the epoch. Consequently the first interval has to be >>> calculated using javaTimeMillis, but after that we can use >>> javaTimeNanos (using a new 'first time' captured at the same time we >>> used javaTimeMillis). I think I have the logic right but other than >>> through JFR this code seems unused and I have limited means of >>> testing it. The JFR test >>> jdk/jfr/event/os/TestThreadContextSwitches.java exercises the code >>> but the results of running that test seems to exhibit arbitrary >>> randomness in the rates reported - e.g. 0 to 16000Hz - both with and >>> without my change, so not really that useful. Stefan K. suggested a >>> gtest which I may look into - though it is frustrating to have to >>> expend such effort to validate this. >>> >>> Other testing tiers 1-3. >>> >>> Thanks, >>> David >> >> Thanks for the audit of uses of os::javaTimeMillis() in the bug report. >> I wonder if some of that ought to be captured as comments in the >> relevant code.? It's not always obvious to me that an external time >> base is involved and thus making javaTimeMillis not a mistake. > > Okay, I will add comments to the other uses of currentTimeMillis(). 
> >> There are a lot of places where conversions from nanoseconds to >> milliseconds are being done to maintain existing units.? Some of those >> places look like they could just as well be in nanoseconds.? But I can >> see how changing the units for some of those could lead to a lot of >> fannout, so okay. > > Yes I tried to minimise the changes. In many cases a granularity of ms > seems somewhat arbitrary. > >> ------------------------------------------------------------------------------ >> >> src/hotspot/os/windows/os_perf_windows.cpp >> ? 100?? s8???? lastUpdate; // Last time query was updated (current >> millis). >> ... >> ? 290?? const s8 now = os::javaTimeNanos(); >> ? 291?? if (NANOS_TO_MILLIS(now - update_query->lastUpdate) > >> min_update_interval_millis) { >> ... >> ? 295???? update_query->lastUpdate = now; >> >> now and update_query->lastUpdate are now in nanos, but comment for >> lastUpdate still says it's in millis.? Looks like the comment needs >> updating. > > Yes - good catch. > >> ------------------------------------------------------------------------------ >> >> src/hotspot/share/utilities/globalDefinitions.hpp >> ? 262 // time unit conversion macros >> ? 263 >> ? 264 #define NANOS_TO_MILLIS(ns) ((ns) / NANOSECS_PER_MILLISEC) >> ? 265 #define MILLIS_TO_NANOS(ms) ((ms) * NANOSECS_PER_MILLISEC) >> >> Why are these macros, rather than (template) functions? > > Just because I just wanted a simple textual replacement to make it > clearer that I'm converting from millis to nanos or vice versa. I reach > for macros for such simple cases. > >> Also, depending on the type and value of ms, MILLIS_TO_NANOS could >> easily overflow, e.g. if ms type is a 32 bit type with a value of more >> than ~4 seconds.? (I checked the two uses, and they happen to be okay.) > > These are not trying to be mathematically sound. The conversion from > millis to nanos is used in two cases: > > 1. Converting a current timestamp in ms to ns. 
Unless the current time > is set far in the future I don't think we have any issue with overflow > of? such a value. > > 2. converting an elapsed time in ms to ns. These will be small values so > no overflow is possible. > >> inline int64_t nanos_to_millis(int64_t ns) { >> ?? return ns / NANOSECS_PER_MILLISECOND; >> } >> >> inline int64_t millis_to_nanos(int64_t ms) { >> ?? return ms * NANOSECONDS_PER_MILLISEC; >> } >> >> Also, the names don't suggest time conversions, but potentially >> arbitrary unit conversions, e.g. between something in NANOUNITS and >> something in MILLIUNITS. > > They don't have to be time conversions - the calculation is unit-less in > practice. The fact we have NANOSEC_PER_MILLISECOND et al is just an > artifact of introducing those values for timeout > calculations/conversions - it could just be NANOS_PER_MILLI etc > >> ------------------------------------------------------------------------------ >> >> Regarding this from the audit: >> >> --- begin --- >> ./share/gc/parallel/psParallelCompact.cpp: // os::javaTimeMillis() >> does not guarantee monotonicity. >> ... >> ./share/gc/shared/referenceProcessor.cpp: // os::javaTimeMillis() does >> not guarantee monotonicity. >> >> These are all describing why the subsequent code uses javaTimeNanos >> not javaTimeMillis. >> --- end --- >> >> Do we really still support platforms that don't have a monotonic >> clock?? I guess we appear to at least try.? But it's really wrong that >> callers of os::javaTimeNanos should even think they need to cope with >> that function being non-monotonic. >> >> Hm, I always thought System.nanoTime() was a monotonic clock, but I >> don't see any such guarantee. So I guess Java just doesn't have such a >> thing. Wow! >> >> So I guess none of this is really relevant to the change at hand after >> all. > > I think you read the comments the wrong way round. 
The code uses > javaTimeNanos not javaTimeMillis because javaTimeMillis is not monotonic > and the code wants a monotonic clock. These comments were mostly > inserted when the incorrect use of javaTimeMillis was replaced with > javaTimeNanos. > > Thanks, > David > ----- > >> ------------------------------------------------------------------------------ >> >> >> From david.holmes at oracle.com Thu Jan 16 05:57:12 2020 From: david.holmes at oracle.com (David Holmes) Date: Thu, 16 Jan 2020 15:57:12 +1000 Subject: RFR: 8235966: Process obsolete flags less aggressively In-Reply-To: References: <674e4f0d-563f-3bb4-dee0-74aaacb9b270@oracle.com> Message-ID: <6a3de8da-746d-7e5d-9a0e-2157a0963ed8@oracle.com> Getting back to this ... Please see updated webrev at: http://cr.openjdk.java.net/~dholmes/8235966/webrev/ Apologies as I mistakenly overwrote the original instead of creating v3. This version expands on the original proposal by uncommenting the warnings about obsolete/expired flags that have not been removed from globals*.hpp, so that we don't forget to do this work. However these warnings are only enabled from build 20. I used 20 as being approx 2/3 through the release cycle - long enough that the work should have been performed by then, whilst still leaving time to perform the work before RDP2. Of course we can tweak this number if there are issues with that choice. Thanks, David On 20/12/2019 8:20 am, David Holmes wrote: > Thanks Dan. > > FTR I've updated the bug report with an extension to this proposal, > which is to add back the flag table validation checks to use via a gtest > that we only enable after a certain build in the release cycle (it > always passes before then). That way we avoid the problems I've outlined > with the initial version bump but also have a safety net in place to > ensure we don't forget to actually obsolete/expire flags. > > Cheers, > David > > On 19/12/2019 3:37 am, Daniel D. 
Daugherty wrote: >> Hi David, >> >> On 12/17/19 5:03 PM, David Holmes wrote: >>> Hi Dan, >>> >>> Thanks for taking a look. Updated webrev: >>> >>> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v2/ >> >> src/hotspot/share/runtime/arguments.cpp >> ???? I like the updates to header comment for verify_special_jvm_flags(). >> >> Thumbs up. >> >> >>> >>> Discussion below. >> >> Replies below. >> >> >>> >>> On 18/12/2019 1:47 am, Daniel D. Daugherty wrote: >>>> On 12/16/19 12:16 AM, David Holmes wrote: >>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8235966 >>>>> webrev: http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>>> >>>> src/hotspot/share/runtime/arguments.cpp >>>> ???? L745: ????? // if flag has become obsolete it should not have a >>>> "globals" flag defined anymore. >>>> ???? L746: ????? if (!version_less_than(JDK_Version::current(), >>>> flag.obsolete_in)) { >>>> ???? L747: ??????? if (JVMFlag::find_declared_flag(flag.name) != >>>> NULL) { >>>> ???? L748: ????????? // Temporarily disable the warning: 8196739 >>>> ???? L749: ????????? // warning("Global variable for obsolete >>>> special flag entry \"%s\" should be removed", flag.name); >>>> ???? L750: ??????? } >>>> ???? L751: ????? } >>>> ???????? It seems like we've been down a similar road before: >>>> >>>> ???????? JDK-8196739 Disable obsolete/expired VM flag transitional >>>> warnings >>>> ???????? https://bugs.openjdk.java.net/browse/JDK-8196739 >>>> >>>> ???????? This one may ring a bell... Fixed by dholmes in >>>> jdk11-b01... :-) >>>> >>>> ???????? And this followup sub-task to re-enable that warning: >>>> >>>> ???????? JDK-8196741 Re-enable obsolete/expired VM flag transitional >>>> warnings >>>> ???????? https://bugs.openjdk.java.net/browse/JDK-8196741 >>>> >>>> ???????? was closed as "Won't fix" on 2019.08.02. >>>> >>>> ???????? So the obvious questions: >>>> >>>> ???????? - Why is the new warning less problematic to tests that don't >>>> ?????????? tolerate unexpected output? 
>>> >>> Two different situations. The commented out warning happens >>> unconditionally when you run the VM and it finds any flag marked >>> obsolete that hasn't been removed. Hence every single test will >>> encounter this warning. >> >> Ouch on such verbosity. >> >> >>> The situation I am modifying is when a test uses a flag that is >>> marked for obsoletion. In the majority of cases the flag is already >>> deprecated and so already issuing a deprecation warning that the test >>> has to handle. Without my change there would still be an obsoletion >>> warning, so this test is in for a warning no matter what. >> >> Good that your change only comes into play when the flag is used. >> >> >>> Also note that for hotspot at least we have strived to make tests >>> tolerate unexpected output. The reason JDK-8196741 was closed as >>> "won't fix" was because other areas wouldn't commit to doing that. >> >> Yup. Got it. >> >> >>> >>>> ???????? - If you move forward with this fix, then I think think code >>>> ?????????? block needs to be removed or modified or am I missing >>>> something? >>> >>> I've rewritten the comment at the head of verify_special_jvm_flags to >>> explain why we can't issue a warning, and have deleted the block. >> >> Thanks for deleting the stale code. >> >> >>> >>>> ???????? There's a similar commented out check on L757-L765, but >>>> that one >>>> ???????? is for an expired flag... You might want to adjust/delete >>>> it also? >>> >>> Deleted. >> >> Thanks. >> >> >>> >>>> ???? L753: ??????? warning("Special flag entry \"%s\" must be >>>> explicitly obsoleted before expired.", flag.name); >>>> ???? L754: ??????? success = false; >>>> ???????? nit - s/before expired/before being expired/ >>>> ???????? Update: I now see that "style" is in several places in this >>>> ???????????? function. I'm not sure what to think here... it grates, >>>> ?? ? ? ? ? ? but I can live with it. >>>> >>>> ???????? nit - L75[34] indented too much by two spaces. >>> >>> Fixed. 
>>> >>>> ???? L962: ????????? return real_name; >>>> ???????? nit - indented too much by two spaces. >>> >>> Fixed. >>> >>>> >>>> Trying to understand the modified logic in argument processing is >>>> making my head spin... >>> >>> Mine too. It took a few attempts to put the logic in the right place >>> and make adjustments so that it all works as expected for a correctly >>> specified flag and an erroneous one. >> >> I keep trying to convince myself that we're improving this flag and >> options code with each release... :-) >> >> >>> >>>> - You've added a call to is_obsolete_flag() in >>>> ?? handle_aliases_and_deprecation() and is_obsolete_flag() >>>> ?? is where the new warning is output: >>>> >>>> ???? warning("Temporarily processing option %s; support is scheduled >>>> for removal in %s" >>>> >>>> ?? handle_aliases_and_deprecation() is called from six different >>>> places, >>>> ?? but the call sites are different based on the argument pattern so I >>>> ?? have (mostly) convinced myself that there should not be any >>>> duplicate >>>> ?? warning lines. >>> >>> Right - handle_aliases_and_deprecation is only called for a >>> syntactically correct flag based on those patterns. It normally >>> filters out obsoleted/expired flags and lets them fall through to >>> later error processing (in process_argument after parse_arg returns >>> false). That error processing is where the normal obsoletion check is >>> performed. So I had to not filter the flag in >>> handle_aliases_and_deprecation in this case, but still produce the >>> warning for a malformed flag. E.g. 
>>> >>> java -XX:+UseParallelOldGC -version >>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily processing >>> option UseParallelOldGC; support is scheduled for removal in 15.0 >>> java version "15-internal" 2020-09-15 >>> >>> java -XX:UseParallelOldGC -version >>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily processing >>> option UseParallelOldGC; support is scheduled for removal in 15.0 >>> Missing +/- setting for VM option 'UseParallelOldGC' >>> Error: Could not create the Java Virtual Machine. >> >> Thanks for the example. That helps a lot. >> >> >>> >>>> So I now understand the new logic that allows an obsoleted option >>>> to be specified with a warning as long as the option still exists. >>>> I'm good with the technical change, but... >>>> >>>> I'm worried about tests that don't tolerate the new warning mesg, >>>> i.e., why wouldn't this become an issue again: >>>> >>>> JDK-8196739 Disable obsolete/expired VM flag transitional warnings >>> >>> Explained above. >> >> Yup and thanks. >> >> Dan >> >> >>> >>> Thanks, >>> David >>> >>>> Dan >>>> >>>> >>>>> >>>>> When a flag is marked as obsolete in the special-flags table we >>>>> will ignore it and issue a warning that it is being ignored, as >>>>> soon as we bump the version of the JDK. That means that any tests >>>>> still using the obsolete flag may start to fail, leading to a surge >>>>> of test failures at the start of a release cycle. For example for >>>>> JDK 15 we have a whole bunch of JFR tests that fail because they >>>>> still try to work with UseParallelOldGC. In another case >>>>> runtime/cds/appcds/FieldLayoutFlags.java passes only be accident. >>>>> >>>>> When a flag is marked as obsolete for a release, all code involving >>>>> that flag (including tests that use it) must be updated within that >>>>> release and the flag itself removed. 
Whilst this is typically >>>>> scheduled early in a release cycle it isn't reasonable to expect it >>>>> to all occur within the first couple of days of the release cycle, >>>>> nor do we want to have to ProblemList a bunch of tests when they >>>>> start failing. >>>>> >>>>> What I propose is to instead allow an obsolete flag to continue to >>>>> be processed as long as that code removal has not actually occurred >>>>> - with an adjusted warning. The change I propose: >>>>> >>>>> - only treats an obsolete flag as obsolete if the flag cannot be found >>>>> - added a new flag verification rule that disallows obsoletion in >>>>> an undefined version, but expiration in a specific version i.e. we >>>>> must always explicitly obsolete a flag before we expire it. >>>>> >>>>> The only downside here is that if we actually forget to file an >>>>> issue for the actual obsoletion work we may not notice via testing. >>>>> Of course whenever a change is made to the flags table to add an >>>>> entry then the issue to do the obsoletion should be filed at the >>>>> same time. >>>>> >>>>> Thanks, >>>>> David >>>>> ----- >>>>> >>>> >> From matthias.baesken at sap.com Thu Jan 16 08:10:30 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Thu, 16 Jan 2020 08:10:30 +0000 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: References: Message-ID: Hi David, sure we can introduce a way to switch this on/off. There is already something similar for the link-time optimization (flto) , see the feature JvmFeatures.gmk 180 ifeq ($(call check-jvm-feature, link-time-opt), true) 190 ifeq ($(call check-jvm-feature, link-time-opt), false) hotspot.m4 29 static-build link-time-opt aot jfr" 502 JVM_FEATURES_link_time_opt="link-time-opt" Should we have "link-time-gc" additionally to " link-time-opt" ? (however it would be a bit misleading that it is a "JVM" feature , but except linux s390x it is only changing the build of the JDK libs) . 
Best regards, Matthias > > Hi Matthias, > > I have reservations about turning this on by default and with no way to > control it. This seems like it should be something you have to opt-in to > initially while we gain some experience with it and ensure there are no > unexpected side-effects. After that it could be enabled by default. > From sgehwolf at redhat.com Thu Jan 16 09:19:36 2020 From: sgehwolf at redhat.com (Severin Gehwolf) Date: Thu, 16 Jan 2020 10:19:36 +0100 Subject: [PING2!] RFR: 8230305: Cgroups v2: Container awareness In-Reply-To: References: <072f66ee8c44034831b4e38f6470da4bff6edd07.camel@redhat.com> <7540a208e306ab957032b18178a53c6afa105d33.camel@redhat.com> <5eec97c04d86562346243c1db3832e86e13697a1.camel@redhat.com> <52110EA5-26C9-43F2-8C2A-21D4E03ED3CC@oracle.com> Message-ID: <5fdc573eab5d2368e23fff99b181d3d10f51edcd.camel@redhat.com> Hi David, On Thu, 2020-01-16 at 08:13 +1000, David Holmes wrote: > Hi Bob, Severin, > > On 15/01/2020 6:04 am, Bob Vandette wrote: > > Cgroup V2 is about to go mainstream this year for popular distros such as Oracle Linux 8, Redhat Linux 8 and Fedora > > so this fix it's important to get into JDK 15 so we can start shaking out this support. > > > > Please take a look and help get this change reviewed. > > I've taken a look and the overall structure and approach seems fine. I > can't attest to the details of cgroups v1 versus v2 but trust the experts. > > Reviewed. Thanks for the review! > Please ensure all copyrights updated to 2020. Yes, I'll update copyrights, remove the synthesized memory_max_usage (to preserve symmetry with Java changes), run through jdk/submit, and my local testing (cgroup v2 and cgroup v1) and push.
Thanks, Severin From david.holmes at oracle.com Thu Jan 16 09:30:38 2020 From: david.holmes at oracle.com (David Holmes) Date: Thu, 16 Jan 2020 19:30:38 +1000 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: References: Message-ID: <172d4100-5cd2-2a99-f940-8aa25414c87a@oracle.com> Hi Matthias, On 16/01/2020 6:10 pm, Baesken, Matthias wrote: > Hi David, sure we can introduce a way to switch this on/off. Thanks. > There is already something similar for the link-time optimization (flto) , see the feature > > JvmFeatures.gmk > 180 ifeq ($(call check-jvm-feature, link-time-opt), true) > 190 ifeq ($(call check-jvm-feature, link-time-opt), false) > > hotspot.m4 > 29 static-build link-time-opt aot jfr" > 502 JVM_FEATURES_link_time_opt="link-time-opt" Yep familiar with that from Minimal VM and SE Embedded days :) > Should we have "link-time-gc" additionally to " link-time-opt" ? (however it would be a bit misleading that it is a "JVM" feature , but except linux s390x it is only changing the build of the JDK libs) . I agree the definition of this as a "JVM" feature is a bit odd/misleading. Perhaps the build folk have a suggestion on how to refactor this kind of option into something more general? In the meantime having link-time-gc sit alongside link-time-opt seems acceptable to me. Thanks, David > Best regards, Matthias > > > >> >> Hi Matthias, >> >> I have reservations about turning this on by default and with no way to >> control it. This seems like it should be something you have to opt-in to >> initially while we gain some experience with it and ensure there are no >> unexpected side-effects. After that it could be enabled by default. 
>> > From erik.gahlin at oracle.com Thu Jan 16 11:54:18 2020 From: erik.gahlin at oracle.com (Erik Gahlin) Date: Thu, 16 Jan 2020 12:54:18 +0100 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: <3b9fd351-6222-b425-fada-382edc4799ff@oracle.com> References: <160821DF-BE15-435C-BDBE-AB553F794590@oracle.com> <3b9fd351-6222-b425-fada-382edc4799ff@oracle.com> Message-ID: <0af8dd6d-f84b-7d0a-a99e-ea75e269398e@oracle.com> Hi David, I'm reasonably convinced the JFR code works as before. We should probably rewrite the code, but it's outside the scope of this bug. Thanks for fixing Erik On 2020-01-16 05:26, David Holmes wrote: > While awaiting further comments/feedback I've made some updates: > > - added comments to existing use of javaTimeMillis() where it wasn't > (fairly) obvious why it was being used > - fixed now incorrect comment pointed out by Kim > - copied os_perf_linux.cpp changes to os_perf_aix.cpp (optimistically > hoping for a thumbs up from Erik G. on this part :) ). > > Full and incr webrevs at: > > http://cr.openjdk.java.net/~dholmes/8235741/webrev.v2/ > http://cr.openjdk.java.net/~dholmes/8235741/webrev.v2-incr/ > > Thanks, > David > > On 15/01/2020 5:12 pm, David Holmes wrote: >> Hi Kim, >> >> Thanks for taking a look at this. >> >> On 15/01/2020 4:24 pm, Kim Barrett wrote: >>>> On Jan 13, 2020, at 2:13 AM, David Holmes >>>> wrote: >>>> >>>> webrev: http://cr.openjdk.java.net/~dholmes/8235741/webrev/ >>>> bug: https://bugs.openjdk.java.net/browse/JDK-8235741 >>>> >>>> Full details in the bug report about the existing uses of >>>> javaTimeMillis(), many of which just want an elapsed time in ms and >>>> so should be using javaTimeNanos() and convert to ms. This covers >>>> areas all across the VM. >>>> >>>> Only non-simple change is in os_perf_linux.cpp (and the same code >>>> will be in os_perf_aix.cpp once it has been validated).
There we >>>> are tracking an elapsed time in ms but relative to the boot time, >>>> which is seconds since the epoch. Consequently the first interval >>>> has to be calculated using javaTimeMillis, but after that we can >>>> use javaTimeNanos (using a new 'first time' captured at the same >>>> time we used javaTimeMillis). I think I have the logic right but >>>> other than through JFR this code seems unused and I have limited >>>> means of testing it. The JFR test >>>> jdk/jfr/event/os/TestThreadContextSwitches.java exercises the code >>>> but the results of running that test seems to exhibit arbitrary >>>> randomness in the rates reported - e.g. 0 to 16000Hz - both with >>>> and without my change, so not really that useful. Stefan K. >>>> suggested a gtest which I may look into - though it is frustrating >>>> to have to expend such effort to validate this. >>>> >>>> Other testing tiers 1-3. >>>> >>>> Thanks, >>>> David >>> >>> Thanks for the audit of uses of os::javaTimeMillis() in the bug report. >>> I wonder if some of that ought to be captured as comments in the >>> relevant code.? It's not always obvious to me that an external time >>> base is involved and thus making javaTimeMillis not a mistake. >> >> Okay, I will add comments to the other uses of currentTimeMillis(). >> >>> There are a lot of places where conversions from nanoseconds to >>> milliseconds are being done to maintain existing units.? Some of those >>> places look like they could just as well be in nanoseconds. But I can >>> see how changing the units for some of those could lead to a lot of >>> fannout, so okay. >> >> Yes I tried to minimise the changes. In many cases a granularity of >> ms seems somewhat arbitrary. >> >>> ------------------------------------------------------------------------------ >>> >>> src/hotspot/os/windows/os_perf_windows.cpp >>> ? 100?? s8???? lastUpdate; // Last time query was updated (current >>> millis). >>> ... >>> ? 290?? const s8 now = os::javaTimeNanos(); >>> ? 
291   if (NANOS_TO_MILLIS(now - update_query->lastUpdate) > >>> min_update_interval_millis) { >>> ... >>>   295     update_query->lastUpdate = now; >>> >>> now and update_query->lastUpdate are now in nanos, but comment for >>> lastUpdate still says it's in millis. Looks like the comment needs >>> updating. >> >> Yes - good catch. >> >>> ------------------------------------------------------------------------------ >>> >>> src/hotspot/share/utilities/globalDefinitions.hpp >>>   262 // time unit conversion macros >>>   263 >>>   264 #define NANOS_TO_MILLIS(ns) ((ns) / NANOSECS_PER_MILLISEC) >>>   265 #define MILLIS_TO_NANOS(ms) ((ms) * NANOSECS_PER_MILLISEC) >>> >>> Why are these macros, rather than (template) functions? >> >> Just because I just wanted a simple textual replacement to make it >> clearer that I'm converting from millis to nanos or vice versa. I >> reach for macros for such simple cases. >> >>> Also, depending on the type and value of ms, MILLIS_TO_NANOS could >>> easily overflow, e.g. if ms type is a 32 bit type with a value of more >>> than ~4 seconds. (I checked the two uses, and they happen to be okay.) >> >> These are not trying to be mathematically sound. The conversion from >> millis to nanos is used in two cases: >> >> 1. Converting a current timestamp in ms to ns. Unless the current >> time is set far in the future I don't think we have any issue with >> overflow of such a value. >> >> 2. converting an elapsed time in ms to ns. These will be small values >> so no overflow is possible. >> >>> inline int64_t nanos_to_millis(int64_t ns) { >>>    return ns / NANOSECS_PER_MILLISECOND; >>> } >>> >>> inline int64_t millis_to_nanos(int64_t ms) { >>>    return ms * NANOSECONDS_PER_MILLISEC; >>> } >>> >>> Also, the names don't suggest time conversions, but potentially >>> arbitrary unit conversions, e.g. between something in NANOUNITS and >>> something in MILLIUNITS.
>> >> They don't have to be time conversions - the calculation is unit-less >> in practice. The fact we have NANOSEC_PER_MILLISECOND et al is just >> an artifact of introducing those values for timeout >> calculations/conversions - it could just be NANOS_PER_MILLI etc >> >>> ------------------------------------------------------------------------------ >>> >>> Regarding this from the audit: >>> >>> --- begin --- >>> ./share/gc/parallel/psParallelCompact.cpp: // os::javaTimeMillis() >>> does not guarantee monotonicity. >>> ... >>> ./share/gc/shared/referenceProcessor.cpp: // os::javaTimeMillis() >>> does not guarantee monotonicity. >>> >>> These are all describing why the subsequent code uses javaTimeNanos >>> not javaTimeMillis. >>> --- end --- >>> >>> Do we really still support platforms that don't have a monotonic >>> clock? I guess we appear to at least try. But it's really wrong that >>> callers of os::javaTimeNanos should even think they need to cope with >>> that function being non-monotonic. >>> >>> Hm, I always thought System.nanoTime() was a monotonic clock, but I >>> don't see any such guarantee. So I guess Java just doesn't have such a >>> thing. Wow! >>> >>> So I guess none of this is really relevant to the change at hand >>> after all. >> >> I think you read the comments the wrong way round. The code uses >> javaTimeNanos not javaTimeMillis because javaTimeMillis is not >> monotonic and the code wants a monotonic clock. These comments were >> mostly inserted when the incorrect use of javaTimeMillis was replaced >> with javaTimeNanos.
>> >> Thanks, >> David >> ----- >> >>> ------------------------------------------------------------------------------ >>> >>> >>> From harold.seigel at oracle.com Thu Jan 16 13:35:30 2020 From: harold.seigel at oracle.com (Harold Seigel) Date: Thu, 16 Jan 2020 08:35:30 -0500 Subject: RFR 8235678: Remove unnecessary calls to Thread::current() in MutexLocker calls In-Reply-To: References: <0a7cbbe4-3613-bd5a-962b-78dee06dde8a@oracle.com> <1fcf8439-2b5e-a776-9142-d9dc14a58faf@oracle.com> <0758d1d1-b74b-31e6-82fb-a1223a3481f7@oracle.com> Message-ID: <807a034c-41ed-82af-1bba-f120245b1ce1@oracle.com> Thanks David! Harold On 1/15/2020 6:19 PM, David Holmes wrote: > Hi Harold, > > That all seems fine to me. > > Thanks, > David > > On 15/01/2020 10:57 pm, Harold Seigel wrote: >> Hi, >> >> Please review this new webrev that also makes Thread* the first >> argument to the relevant MutexLocker and MonitorLocker constructors >> as requested by Coleen. >> >> Updated Webrev: >> http://cr.openjdk.java.net/~hseigel/bug_8235678.2/webrev/index.html >> >> Thanks, Harold >> >> On 1/14/2020 11:24 AM, coleen.phillimore at oracle.com wrote: >>> >>> Hi Harold, >>> >>> I really wanted this change to move Thread to the first argument >>> like many of the other calls in the VM that take THREAD as an argument. >>> >>> Written like this: >>> >>> + MutexLocker mu(Threads_lock, THREAD); >>> >>> >>> It's too easy for someone who's cut/pasting to think the last THREAD >>> argument should really be CHECK, which is completely wrong. >>> >>> Can you switch the arguments? >>> >>> Thanks, >>> Coleen >>> >>> On 1/14/20 9:00 AM, Harold Seigel wrote: >>>> Hi, >>>> >>>> Please review this small change, to reduce unnecessary calls to >>>> Thread::current() in MutexLocker calls, by passing the current >>>> thread as an argument.? A few ResoureMark declarations were also >>>> changed. 
>>>> >>>> Open Webrev: >>>> http://cr.openjdk.java.net/~hseigel/bug_8235678/webrev/index.html >>>> >>>> JBS Bug: https://bugs.openjdk.java.net/browse/JDK-8235678 >>>> >>>> The fix was regression tested by running Mach5 tiers 1 and 2 tests >>>> and builds on Linux-x64, Solaris, Windows, and Mac OS X, by running >>>> Mach5 tiers 3-5 tests on Linux-x64, and JCK lang and VM tests on >>>> Linux-x64. >>>> >>>> Thanks, Harold >>>> >>> From harold.seigel at oracle.com Thu Jan 16 13:36:16 2020 From: harold.seigel at oracle.com (Harold Seigel) Date: Thu, 16 Jan 2020 08:36:16 -0500 Subject: RFR 8235678: Remove unnecessary calls to Thread::current() in MutexLocker calls In-Reply-To: <1f4d63f0-ac4f-7b4a-cad2-5f9d35803b4f@oracle.com> References: <0a7cbbe4-3613-bd5a-962b-78dee06dde8a@oracle.com> <1fcf8439-2b5e-a776-9142-d9dc14a58faf@oracle.com> <0758d1d1-b74b-31e6-82fb-a1223a3481f7@oracle.com> <1f4d63f0-ac4f-7b4a-cad2-5f9d35803b4f@oracle.com> Message-ID: Thanks Coleen! Harold On 1/15/2020 6:51 PM, coleen.phillimore at oracle.com wrote: > > This looks really good!? Thanks Harold. > > I wonder if we see any reduction in instructions in startup, since > some similar change did.? Maybe we can get Claes to measure it. :) > > Thanks, > Coleen > > On 1/15/20 7:57 AM, Harold Seigel wrote: >> Hi, >> >> Please review this new webrev that also makes Thread* the first >> argument to the relevant MutexLocker and MonitorLocker constructors >> as requested by Coleen. >> >> Updated Webrev: >> http://cr.openjdk.java.net/~hseigel/bug_8235678.2/webrev/index.html >> >> Thanks, Harold >> >> On 1/14/2020 11:24 AM, coleen.phillimore at oracle.com wrote: >>> >>> Hi Harold, >>> >>> I really wanted this change to move Thread to the first argument >>> like many of the other calls in the VM that take THREAD as an argument. 
>>> >>> Written like this: >>> >>> + MutexLocker mu(Threads_lock, THREAD); >>> >>> >>> It's too easy for someone who's cut/pasting to think the last THREAD >>> argument should really be CHECK, which is completely wrong. >>> >>> Can you switch the arguments? >>> >>> Thanks, >>> Coleen >>> >>> On 1/14/20 9:00 AM, Harold Seigel wrote: >>>> Hi, >>>> >>>> Please review this small change, to reduce unnecessary calls to >>>> Thread::current() in MutexLocker calls, by passing the current >>>> thread as an argument.? A few ResoureMark declarations were also >>>> changed. >>>> >>>> Open Webrev: >>>> http://cr.openjdk.java.net/~hseigel/bug_8235678/webrev/index.html >>>> >>>> JBS Bug: https://bugs.openjdk.java.net/browse/JDK-8235678 >>>> >>>> The fix was regression tested by running Mach5 tiers 1 and 2 tests >>>> and builds on Linux-x64, Solaris, Windows, and Mac OS X, by running >>>> Mach5 tiers 3-5 tests on Linux-x64, and JCK lang and VM tests on >>>> Linux-x64. >>>> >>>> Thanks, Harold >>>> >>> > From bob.vandette at oracle.com Thu Jan 16 14:13:02 2020 From: bob.vandette at oracle.com (Bob Vandette) Date: Thu, 16 Jan 2020 09:13:02 -0500 Subject: [PING2!] RFR: 8230305: Cgroups v2: Container awareness In-Reply-To: References: <072f66ee8c44034831b4e38f6470da4bff6edd07.camel@redhat.com> <7540a208e306ab957032b18178a53c6afa105d33.camel@redhat.com> <5eec97c04d86562346243c1db3832e86e13697a1.camel@redhat.com> <52110EA5-26C9-43F2-8C2A-21D4E03ED3CC@oracle.com> Message-ID: > On Jan 15, 2020, at 5:13 PM, David Holmes wrote: > > Hi Bob, Severin, > > On 15/01/2020 6:04 am, Bob Vandette wrote: >> Cgroup V2 is about to go mainstream this year for popular distros such as Oracle Linux 8, Redhat Linux 8 and Fedora >> so this fix it?s important to get into JDK 15 so we can start shaking out this support. >> Please take a look and help get this change reviewed. > > I've taken a look and the overall structure and approach seems fine. 
I can't attest to the details of cgroups v1 versus v2 but trust the exports. > > Reviewed. > > Please ensure all copyrights updated to 2020. > > Bob: I assume you have tested this on our systems? I?m in the process of setting up a cgroup v2 system to test with but I?ll grab the patch and verify the container tests on my cgroup v1 setup today. Bob. > > Thanks, > David > >> Thanks, >> Bob Vandette >>> On Nov 29, 2019, at 4:04 AM, Severin Gehwolf wrote: >>> >>> On Fri, 2019-11-15 at 17:56 +0100, Severin Gehwolf wrote: >>>> On Fri, 2019-11-08 at 15:21 +0100, Severin Gehwolf wrote: >>>>> Hi Bob, >>>>> >>>>> On Wed, 2019-11-06 at 10:47 +0100, Severin Gehwolf wrote: >>>>>> On Tue, 2019-11-05 at 16:54 -0500, Bob Vandette wrote: >>>>>>> Severin, >>>>>>> >>>>>>> Thanks for taking on this cgroup v2 improvement. >>>>>>> >>>>>>> In general I like the implementation and the refactoring. The CachedMetric class is nice. >>>>>>> We can add any metric we want to cache in a more general way. >>>>>>> >>>>>>> Is this the latest version of the webrev? >>>>>>> >>>>>>> http://cr.openjdk.java.net/~sgehwolf/webrevs/cgroupsv2-hotspot/03/webrev/src/hotspot/os/linux/cgroupV2Subsystem_linux.cpp.html >>>>>>> >>>>>>> It looks like you need to add the caching support for active_processor_count (JDK-8227006). >>>>> [...] >>>>>> I'll do a proper rebase ASAP. >>>>> >>>>> Latest webrev: >>>>> http://cr.openjdk.java.net/~sgehwolf/webrevs/cgroupsv2-hotspot/05/webrev/ >>>>> >>>>>>> I?m not sure it?s worth providing different strings for Unlimited versus Max or Scaled shares. >>>>>>> I?d just try to be compatible with the cgroupv2 output so you don?t have to change the test. >>>>>> >>>>>> OK. Will do. >>>>> >>>>> Unfortunately, there is no way of NOT changing TestCPUAwareness.java as >>>>> it expects CPU Shares to be written to the cgroup filesystem verbatim. >>>>> That's no longer the case for cgroups v2 (at least for crun). Either >>>>> way, most test changes are gone now. 
>>>>> >>>>>>> I wonder if it?s worth trying to synthesize memory_max_usage_in_bytes() by keeping the highest >>>>>>> value ever returned by the API. >>>>>> >>>>>> Interesting idea. I'll ponder this a bit and get back to you. >>>>> >>>>> This has been implemented. I'm not sure this is correct, though. It >>>>> merely piggy-backs on calls to memory_usage_in_bytes() and keeps the >>>>> high watermark value of that. >>>>> >>>>> Testing passed on F31 with cgroups v2 controllers properly configured >>>>> (podman) and hybrid (legacy hierarchy) with docker/podman. >>>>> >>>>> Thoughts? >>>> >>>> Ping? >>> >>> Anyone willing to review this? It would be nice to make some progress. >>> >>> Thanks, >>> Severin >>> >>>> Metrics work proposed for RFR here: >>>> http://mail.openjdk.java.net/pipermail/core-libs-dev/2019-November/063464.html >>>> >>>> Thanks, >>>> Severin >>> From zgu at redhat.com Thu Jan 16 17:47:36 2020 From: zgu at redhat.com (Zhengyu Gu) Date: Thu, 16 Jan 2020 12:47:36 -0500 Subject: Submit repo status Message-ID: <32a462b0-8f1d-4cbc-4a4a-13e3899b8971@redhat.com> Hi, What's the status of submit repo? I did not receive any test results [0][1][2], or just me? 
[0] http://mail.openjdk.java.net/pipermail/jdk-submit-changes/2020-January/009312.html [1] http://mail.openjdk.java.net/pipermail/jdk-submit-changes/2020-January/009351.html [2] http://mail.openjdk.java.net/pipermail/jdk-submit-changes/2020-January/009419.html Thanks, -Zhengyu From stanislav.smirnov at oracle.com Thu Jan 16 18:50:05 2020 From: stanislav.smirnov at oracle.com (Stanislav Smirnov) Date: Thu, 16 Jan 2020 13:50:05 -0500 Subject: Submit repo status In-Reply-To: <32a462b0-8f1d-4cbc-4a4a-13e3899b8971@redhat.com> References: <32a462b0-8f1d-4cbc-4a4a-13e3899b8971@redhat.com> Message-ID: <3BAB0D44-7251-453D-8B1D-CC655FF246D9@oracle.com> Hi Zhengyu, Next emails with results were sent to you [Mach5] mach5-one-zgu-JDK-8236878-1-20200113-1356-8016418: FAILED sent: zgu at redhat.com date: 01/15 [Mach5] mach5-one-zgu-JDK-8236878-2-20200116-1349-8103008: PASSED sent: zgu at redhat.com date: 01/16 Recently I received similar requests from your colleagues from Redhat. Is it possible, that there is some firewall on your side, that prevents such emails from being delivered? Best regards, Stanislav Smirnov > On Jan 16, 2020, at 12:47 PM, Zhengyu Gu wrote: > > Hi, > > What's the status of submit repo? I did not receive any test results [0][1][2], or just me? > > > [0] http://mail.openjdk.java.net/pipermail/jdk-submit-changes/2020-January/009312.html > [1] http://mail.openjdk.java.net/pipermail/jdk-submit-changes/2020-January/009351.html > [2] http://mail.openjdk.java.net/pipermail/jdk-submit-changes/2020-January/009419.html > > Thanks, > > -Zhengyu > From zgu at redhat.com Thu Jan 16 19:00:15 2020 From: zgu at redhat.com (Zhengyu Gu) Date: Thu, 16 Jan 2020 14:00:15 -0500 Subject: Submit repo status In-Reply-To: <3BAB0D44-7251-453D-8B1D-CC655FF246D9@oracle.com> References: <32a462b0-8f1d-4cbc-4a4a-13e3899b8971@redhat.com> <3BAB0D44-7251-453D-8B1D-CC655FF246D9@oracle.com> Message-ID: <9258e91a-ff84-3f2c-d511-30fcd5ba24a3@redhat.com> Thanks, Stanislav. 
On 1/16/20 1:50 PM, Stanislav Smirnov wrote: > Hi Zhengyu, > > Next emails with results were sent to you > > [Mach5] mach5-one-zgu-JDK-8236878-1-20200113-1356-8016418: FAILED > sent: zgu at redhat.com > date: 01/15 > > [Mach5] mach5-one-zgu-JDK-8236878-2-20200116-1349-8103008: PASSED > sent: zgu at redhat.com > date: 01/16 > > Recently I received similar requests from your colleagues from Redhat. > Is it possible, that there is some firewall on your side, that prevents > such emails from being delivered? I will try to find it out. -Zhengyu > > Best regards, > Stanislav Smirnov > > >> On Jan 16, 2020, at 12:47 PM, Zhengyu Gu > > wrote: >> >> Hi, >> >> What's the status of submit repo? I did not receive any test results >> [0][1][2], or just me? >> >> >> [0] >> http://mail.openjdk.java.net/pipermail/jdk-submit-changes/2020-January/009312.html >> [1] >> http://mail.openjdk.java.net/pipermail/jdk-submit-changes/2020-January/009351.html >> [2] >> http://mail.openjdk.java.net/pipermail/jdk-submit-changes/2020-January/009419.html >> >> Thanks, >> >> -Zhengyu >> > From sgehwolf at redhat.com Thu Jan 16 20:08:15 2020 From: sgehwolf at redhat.com (Severin Gehwolf) Date: Thu, 16 Jan 2020 21:08:15 +0100 Subject: [PING2!] RFR: 8230305: Cgroups v2: Container awareness In-Reply-To: References: <072f66ee8c44034831b4e38f6470da4bff6edd07.camel@redhat.com> <7540a208e306ab957032b18178a53c6afa105d33.camel@redhat.com> <5eec97c04d86562346243c1db3832e86e13697a1.camel@redhat.com> <52110EA5-26C9-43F2-8C2A-21D4E03ED3CC@oracle.com> Message-ID: Hi Bob, David, On Thu, 2020-01-16 at 09:13 -0500, Bob Vandette wrote: > > On Jan 15, 2020, at 5:13 PM, David Holmes wrote: > > > > Hi Bob, Severin, > > > > On 15/01/2020 6:04 am, Bob Vandette wrote: > > > Cgroup V2 is about to go mainstream this year for popular distros such as Oracle Linux 8, Redhat Linux 8 and Fedora > > > so this fix it?s important to get into JDK 15 so we can start shaking out this support. 
> > > Please take a look and help get this change reviewed. > > > > I've taken a look and the overall structure and approach seems fine. I can't attest to the details of cgroups v1 versus v2 but trust the experts. > > > > Reviewed. > > > > Please ensure all copyrights updated to 2020. > > > > Bob: I assume you have tested this on our systems? > > I'm in the process of setting up a cgroup v2 system to test with but I'll grab the patch and verify the > container tests on my cgroup v1 setup today. Final webrev I intend to push tomorrow for this: http://cr.openjdk.java.net/~sgehwolf/webrevs/JDK-8230305/07/webrev/ This passed jdk/submit and testing on cgroups v1 hotspot docker tests. On cgroups v2 this fails the following test since it also tests the OperatingSystemMXBean's memory awareness which uses core-libs Metrics code not updated with this webrev (passes all others): hotspot/jtreg/containers/docker/TestMemoryAwareness.java It's caused by JDK-8226575 which went in during the time I've started this and intended push time. This test failure will be fixed with JDK- 8231111 once I get another review from a Reviewer. I doubt many people will notice though since cgroupv2 enabled systems aren't that widespread just yet. Please let me know if there are any objections. Thanks, Severin > Bob. > > > > Thanks, > > David > > > > > Thanks, > > > Bob Vandette > > > > On Nov 29, 2019, at 4:04 AM, Severin Gehwolf wrote: > > > > > > > > On Fri, 2019-11-15 at 17:56 +0100, Severin Gehwolf wrote: > > > > > On Fri, 2019-11-08 at 15:21 +0100, Severin Gehwolf wrote: > > > > > > Hi Bob, > > > > > > > > > > > > On Wed, 2019-11-06 at 10:47 +0100, Severin Gehwolf wrote: > > > > > > > On Tue, 2019-11-05 at 16:54 -0500, Bob Vandette wrote: > > > > > > > > Severin, > > > > > > > > > > > > > > > > Thanks for taking on this cgroup v2 improvement. > > > > > > > > > > > > > > > > In general I like the implementation and the refactoring. The CachedMetric class is nice.
> > > > > > > > We can add any metric we want to cache in a more general way. > > > > > > > > > > > > > > > > Is this the latest version of the webrev? > > > > > > > > > > > > > > > > http://cr.openjdk.java.net/~sgehwolf/webrevs/cgroupsv2-hotspot/03/webrev/src/hotspot/os/linux/cgroupV2Subsystem_linux.cpp.html > > > > > > > > > > > > > > > > It looks like you need to add the caching support for active_processor_count (JDK-8227006). > > > > > > [...] > > > > > > > I'll do a proper rebase ASAP. > > > > > > > > > > > > Latest webrev: > > > > > > http://cr.openjdk.java.net/~sgehwolf/webrevs/cgroupsv2-hotspot/05/webrev/ > > > > > > > > > > > > > > I'm not sure it's worth providing different strings for Unlimited versus Max or Scaled shares. > > > > > > > > I'd just try to be compatible with the cgroupv2 output so you don't have to change the test. > > > > > > > > > > > > > > OK. Will do. > > > > > > > > > > > > Unfortunately, there is no way of NOT changing TestCPUAwareness.java as > > > > > > it expects CPU Shares to be written to the cgroup filesystem verbatim. > > > > > > That's no longer the case for cgroups v2 (at least for crun). Either > > > > > > way, most test changes are gone now. > > > > > > > > > > > > > > I wonder if it's worth trying to synthesize memory_max_usage_in_bytes() by keeping the highest > > > > > > > > value ever returned by the API. > > > > > > > > > > > > > > Interesting idea. I'll ponder this a bit and get back to you. > > > > > > > > > > > > This has been implemented. I'm not sure this is correct, though. It > > > > > > merely piggy-backs on calls to memory_usage_in_bytes() and keeps the > > > > > > high watermark value of that. > > > > > > > > > > > > Testing passed on F31 with cgroups v2 controllers properly configured > > > > > > (podman) and hybrid (legacy hierarchy) with docker/podman. > > > > > > > > > > > > Thoughts? > > > > > > > > > > Ping? > > > > > > > > Anyone willing to review this? It would be nice to make some progress.
> > > > > > > > Thanks, > > > > Severin > > > > > > > > > Metrics work proposed for RFR here: > > > > > http://mail.openjdk.java.net/pipermail/core-libs-dev/2019-November/063464.html > > > > > > > > > > Thanks, > > > > > Severin From bob.vandette at oracle.com Thu Jan 16 20:18:54 2020 From: bob.vandette at oracle.com (Bob Vandette) Date: Thu, 16 Jan 2020 15:18:54 -0500 Subject: [PING2!] RFR: 8230305: Cgroups v2: Container awareness In-Reply-To: References: <072f66ee8c44034831b4e38f6470da4bff6edd07.camel@redhat.com> <7540a208e306ab957032b18178a53c6afa105d33.camel@redhat.com> <5eec97c04d86562346243c1db3832e86e13697a1.camel@redhat.com> <52110EA5-26C9-43F2-8C2A-21D4E03ED3CC@oracle.com> Message-ID: <67D9E772-9949-4DE2-95B4-F4811E4EA06C@oracle.com> > On Jan 16, 2020, at 3:08 PM, Severin Gehwolf wrote: > > Hi Bob, David, > > On Thu, 2020-01-16 at 09:13 -0500, Bob Vandette wrote: >>> On Jan 15, 2020, at 5:13 PM, David Holmes wrote: >>> >>> Hi Bob, Severin, >>> >>> On 15/01/2020 6:04 am, Bob Vandette wrote: >>>> Cgroup V2 is about to go mainstream this year for popular distros such as Oracle Linux 8, Redhat Linux 8 and Fedora >>>> so this fix it's important to get into JDK 15 so we can start shaking out this support. >>>> Please take a look and help get this change reviewed. >>> >>> I've taken a look and the overall structure and approach seems fine. I can't attest to the details of cgroups v1 versus v2 but trust the experts. >>> >>> Reviewed. >>> >>> Please ensure all copyrights updated to 2020. >>> >>> Bob: I assume you have tested this on our systems? >> >> I'm in the process of setting up a cgroup v2 system to test with but I'll grab the patch and verify the >> container tests on my cgroup v1 setup today. > > Final webrev I intend to push tomorrow for this: > http://cr.openjdk.java.net/~sgehwolf/webrevs/JDK-8230305/07/webrev/ > > This passed jdk/submit and testing on cgroups v1 hotspot docker tests.
> On cgroups v2 this fails the following test since it also tests the > OperatingSystemMXBean's memory awareness which uses core-libs Metrics > code not updated with this webrev (passes all others): > > hotspot/jtreg/containers/docker/TestMemoryAwareness.java > > It's caused by JDK-8226575 which went in during the time I've started > this and intended push time. This test failure will be fixed with JDK- > 8231111 once I get another review from a Reviewer. I doubt many people > will notice though since cgroupv2 enabled systems aren't that > widespread just yet. > > Please let me know if there are any objections. I ran your changes on a local cgroupv1 enabled system and didn't see any failures specific to your changes. I saw one test failure on my system but it was pre-existing. The TestDockerMemoryMetrics.testMemoryFailCount test gets killed by OOM Killer on my system. This needs more investigation. There was an existing bug filed for this issue which is resolved but the problem persists on my system. https://bugs.openjdk.java.net/browse/JDK-8224506 I'm good with the Metrics problem since that code is not cgroupv2 aware yet. Bob. > > Thanks, > Severin > > >> Bob. >> >> >>> Thanks, >>> David >>> >>>> Thanks, >>>> Bob Vandette >>>>> On Nov 29, 2019, at 4:04 AM, Severin Gehwolf wrote: >>>>> >>>>> On Fri, 2019-11-15 at 17:56 +0100, Severin Gehwolf wrote: >>>>>> On Fri, 2019-11-08 at 15:21 +0100, Severin Gehwolf wrote: >>>>>>> Hi Bob, >>>>>>> >>>>>>> On Wed, 2019-11-06 at 10:47 +0100, Severin Gehwolf wrote: >>>>>>>> On Tue, 2019-11-05 at 16:54 -0500, Bob Vandette wrote: >>>>>>>>> Severin, >>>>>>>>> >>>>>>>>> Thanks for taking on this cgroup v2 improvement. >>>>>>>>> >>>>>>>>> In general I like the implementation and the refactoring. The CachedMetric class is nice. >>>>>>>>> We can add any metric we want to cache in a more general way. >>>>>>>>> >>>>>>>>> Is this the latest version of the webrev?
>>>>>>>>> >>>>>>>>> http://cr.openjdk.java.net/~sgehwolf/webrevs/cgroupsv2-hotspot/03/webrev/src/hotspot/os/linux/cgroupV2Subsystem_linux.cpp.html >>>>>>>>> >>>>>>>>> It looks like you need to add the caching support for active_processor_count (JDK-8227006). >>>>>>> [...] >>>>>>>> I'll do a proper rebase ASAP. >>>>>>> >>>>>>> Latest webrev: >>>>>>> http://cr.openjdk.java.net/~sgehwolf/webrevs/cgroupsv2-hotspot/05/webrev/ >>>>>>> >>>>>>>>> I'm not sure it's worth providing different strings for Unlimited versus Max or Scaled shares. >>>>>>>>> I'd just try to be compatible with the cgroupv2 output so you don't have to change the test. >>>>>>>> >>>>>>>> OK. Will do. >>>>>>> >>>>>>> Unfortunately, there is no way of NOT changing TestCPUAwareness.java as >>>>>>> it expects CPU Shares to be written to the cgroup filesystem verbatim. >>>>>>> That's no longer the case for cgroups v2 (at least for crun). Either >>>>>>> way, most test changes are gone now. >>>>>>> >>>>>>>>> I wonder if it's worth trying to synthesize memory_max_usage_in_bytes() by keeping the highest >>>>>>>>> value ever returned by the API. >>>>>>>> >>>>>>>> Interesting idea. I'll ponder this a bit and get back to you. >>>>>>> >>>>>>> This has been implemented. I'm not sure this is correct, though. It >>>>>>> merely piggy-backs on calls to memory_usage_in_bytes() and keeps the >>>>>>> high watermark value of that. >>>>>>> >>>>>>> Testing passed on F31 with cgroups v2 controllers properly configured >>>>>>> (podman) and hybrid (legacy hierarchy) with docker/podman. >>>>>>> >>>>>>> Thoughts? >>>>>> >>>>>> Ping? >>>>> >>>>> Anyone willing to review this? It would be nice to make some progress.
>>>>> >>>>> Thanks, >>>>> Severin >>>>> >>>>>> Metrics work proposed for RFR here: >>>>>> http://mail.openjdk.java.net/pipermail/core-libs-dev/2019-November/063464.html >>>>>> >>>>>> Thanks, >>>>>> Severin From david.holmes at oracle.com Thu Jan 16 22:01:56 2020 From: david.holmes at oracle.com (David Holmes) Date: Fri, 17 Jan 2020 08:01:56 +1000 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: <0af8dd6d-f84b-7d0a-a99e-ea75e269398e@oracle.com> References: <160821DF-BE15-435C-BDBE-AB553F794590@oracle.com> <3b9fd351-6222-b425-fada-382edc4799ff@oracle.com> <0af8dd6d-f84b-7d0a-a99e-ea75e269398e@oracle.com> Message-ID: <2ddf2d38-2c51-b1ae-8c7b-8fafa9772c17@oracle.com> Hi Erik, On 16/01/2020 9:54 pm, Erik Gahlin wrote: > Hi David, > > I'm reasonably convinced the JFR code works as before. We should > probably rewrite the code, but it's outside the scope of this bug. Great! Thanks for taking a detailed look at that part. David > Thanks for fixing > Erik > > On 2020-01-16 05:26, David Holmes wrote: >> While awaiting further comments/feedback I've made some updates: >> >> - added comments to existing use of javaTimeMillis() where it wasn't >> (fairly) obvious why it was being used >> - fixed now incorrect comment pointed out by Kim >> - copied os_perf_linux.cpp changes to os_perf_aix.cpp (optimistically >> hoping for a thumbs up from Erik G. on this part :) ). >> >> Full and incr webrevs at: >> >> http://cr.openjdk.java.net/~dholmes/8235741/webrev.v2/ >> http://cr.openjdk.java.net/~dholmes/8235741/webrev.v2-incr/ >> >> Thanks, >> David >> >> On 15/01/2020 5:12 pm, David Holmes wrote: >>> Hi Kim, >>> >>> Thanks for taking a look at this.
>>> >>> On 15/01/2020 4:24 pm, Kim Barrett wrote: >>>>> On Jan 13, 2020, at 2:13 AM, David Holmes >>>>> wrote: >>>>> >>>>> webrev: http://cr.openjdk.java.net/~dholmes/8235741/webrev/ >>>>> bug: https://bugs.openjdk.java.net/browse/JDK-8235741 >>>>> >>>>> Full details in the bug report about the existing uses of >>>>> javaTimeMillis(), many of which just want an elapsed time in ms and >>>>> so should be using javaTimeNanos() and convert to ms. This covers >>>>> areas all across the VM. >>>>> >>>>> Only non-simple change is in os_perf_linux.cpp (and the same code >>>>> will be in os_perf_aix.cpp once it has been validated). There we >>>>> are tracking an elapsed time in ms but relative to the boot time, >>>>> which is seconds since the epoch. Consequently the first interval >>>>> has to be calculated using javaTimeMillis, but after that we can >>>>> use javaTimeNanos (using a new 'first time' captured at the same >>>>> time we used javaTimeMillis). I think I have the logic right but >>>>> other than through JFR this code seems unused and I have limited >>>>> means of testing it. The JFR test >>>>> jdk/jfr/event/os/TestThreadContextSwitches.java exercises the code >>>>> but the results of running that test seem to exhibit arbitrary >>>>> randomness in the rates reported - e.g. 0 to 16000Hz - both with >>>>> and without my change, so not really that useful. Stefan K. >>>>> suggested a gtest which I may look into - though it is frustrating >>>>> to have to expend such effort to validate this. >>>>> >>>>> Other testing tiers 1-3. >>>>> >>>>> Thanks, >>>>> David >>>> >>>> Thanks for the audit of uses of os::javaTimeMillis() in the bug report. >>>> I wonder if some of that ought to be captured as comments in the >>>> relevant code. It's not always obvious to me that an external time >>>> base is involved and thus making javaTimeMillis not a mistake. >>> >>> Okay, I will add comments to the other uses of currentTimeMillis().
>>>> There are a lot of places where conversions from nanoseconds to >>>> milliseconds are being done to maintain existing units. Some of those >>>> places look like they could just as well be in nanoseconds. But I can >>>> see how changing the units for some of those could lead to a lot of >>>> fanout, so okay. >>> >>> Yes I tried to minimise the changes. In many cases a granularity of >>> ms seems somewhat arbitrary. >>> >>>> ------------------------------------------------------------------------------ >>>> >>>> src/hotspot/os/windows/os_perf_windows.cpp >>>> 100 s8 lastUpdate; // Last time query was updated (current >>>> millis). >>>> ... >>>> 290 const s8 now = os::javaTimeNanos(); >>>> 291 if (NANOS_TO_MILLIS(now - update_query->lastUpdate) > >>>> min_update_interval_millis) { >>>> ... >>>> 295 update_query->lastUpdate = now; >>>> >>>> now and update_query->lastUpdate are now in nanos, but comment for >>>> lastUpdate still says it's in millis. Looks like the comment needs >>>> updating. >>> >>> Yes - good catch. >>> >>>> ------------------------------------------------------------------------------ >>>> >>>> src/hotspot/share/utilities/globalDefinitions.hpp >>>> 262 // time unit conversion macros >>>> 263 >>>> 264 #define NANOS_TO_MILLIS(ns) ((ns) / NANOSECS_PER_MILLISEC) >>>> 265 #define MILLIS_TO_NANOS(ms) ((ms) * NANOSECS_PER_MILLISEC) >>>> >>>> Why are these macros, rather than (template) functions? >>> >>> Just because I just wanted a simple textual replacement to make it >>> clearer that I'm converting from millis to nanos or vice versa. I >>> reach for macros for such simple cases. >>> >>>> Also, depending on the type and value of ms, MILLIS_TO_NANOS could >>>> easily overflow, e.g. if ms type is a 32 bit type with a value of more >>>> than ~4 seconds. (I checked the two uses, and they happen to be okay.) >>> >>> These are not trying to be mathematically sound.
The conversion from >>> millis to nanos is used in two cases: >>> >>> 1. Converting a current timestamp in ms to ns. Unless the current >>> time is set far in the future I don't think we have any issue with >>> overflow of such a value. >>> >>> 2. converting an elapsed time in ms to ns. These will be small values >>> so no overflow is possible. >>> >>>> inline int64_t nanos_to_millis(int64_t ns) { >>>> return ns / NANOSECS_PER_MILLISECOND; >>>> } >>>> inline int64_t millis_to_nanos(int64_t ms) { >>>> return ms * NANOSECONDS_PER_MILLISEC; >>>> } >>>> Also, the names don't suggest time conversions, but potentially >>>> arbitrary unit conversions, e.g. between something in NANOUNITS and >>>> something in MILLIUNITS. >>> >>> They don't have to be time conversions - the calculation is unit-less >>> in practice. The fact we have NANOSEC_PER_MILLISECOND et al is just >>> an artifact of introducing those values for timeout >>> calculations/conversions - it could just be NANOS_PER_MILLI etc >>> >>>> ------------------------------------------------------------------------------ >>>> >>>> Regarding this from the audit: >>>> >>>> --- begin --- >>>> ./share/gc/parallel/psParallelCompact.cpp: // os::javaTimeMillis() >>>> does not guarantee monotonicity. >>>> ... >>>> ./share/gc/shared/referenceProcessor.cpp: // os::javaTimeMillis() >>>> does not guarantee monotonicity. >>>> >>>> These are all describing why the subsequent code uses javaTimeNanos >>>> not javaTimeMillis. >>>> --- end --- >>>> >>>> Do we really still support platforms that don't have a monotonic >>>> clock? I guess we appear to at least try. But it's really wrong that >>>> callers of os::javaTimeNanos should even think they need to cope with >>>> that function being non-monotonic. >>>> >>>> Hm, I always thought System.nanoTime() was a monotonic clock, but I >>>> don't see any such guarantee. So I guess Java just doesn't have such a >>>> thing. Wow!
>>>> >>>> So I guess none of this is really relevant to the change at hand >>>> after all. >>> >>> I think you read the comments the wrong way round. The code uses >>> javaTimeNanos not javaTimeMillis because javaTimeMillis is not >>> monotonic and the code wants a monotonic clock. These comments were >>> mostly inserted when the incorrect use of javaTimeMillis was replaced >>> with javaTimeNanos. >>> >>> Thanks, >>> David >>> ----- >>> >>>> ------------------------------------------------------------------------------ >>>> >>>> >>>> From david.holmes at oracle.com Thu Jan 16 22:09:33 2020 From: david.holmes at oracle.com (David Holmes) Date: Fri, 17 Jan 2020 08:09:33 +1000 Subject: [PING2!] RFR: 8230305: Cgroups v2: Container awareness In-Reply-To: References: <072f66ee8c44034831b4e38f6470da4bff6edd07.camel@redhat.com> <7540a208e306ab957032b18178a53c6afa105d33.camel@redhat.com> <5eec97c04d86562346243c1db3832e86e13697a1.camel@redhat.com> <52110EA5-26C9-43F2-8C2A-21D4E03ED3CC@oracle.com> Message-ID: No objections from me. Thanks, David On 17/01/2020 6:08 am, Severin Gehwolf wrote: > Hi Bob, David, > > On Thu, 2020-01-16 at 09:13 -0500, Bob Vandette wrote: >>> On Jan 15, 2020, at 5:13 PM, David Holmes wrote: >>> >>> Hi Bob, Severin, >>> >>> On 15/01/2020 6:04 am, Bob Vandette wrote: >>>> Cgroup V2 is about to go mainstream this year for popular distros such as Oracle Linux 8, Redhat Linux 8 and Fedora >>>> so this fix it's important to get into JDK 15 so we can start shaking out this support. >>>> Please take a look and help get this change reviewed. >>> >>> I've taken a look and the overall structure and approach seems fine. I can't attest to the details of cgroups v1 versus v2 but trust the experts. >>> >>> Reviewed. >>> >>> Please ensure all copyrights updated to 2020. >>> >>> Bob: I assume you have tested this on our systems?
>> >> I'm in the process of setting up a cgroup v2 system to test with but I'll grab the patch and verify the >> container tests on my cgroup v1 setup today. > > Final webrev I intend to push tomorrow for this: > http://cr.openjdk.java.net/~sgehwolf/webrevs/JDK-8230305/07/webrev/ > > This passed jdk/submit and testing on cgroups v1 hotspot docker tests. > On cgroups v2 this fails the following test since it also tests the > OperatingSystemMXBean's memory awareness which uses core-libs Metrics > code not updated with this webrev (passes all others): > > hotspot/jtreg/containers/docker/TestMemoryAwareness.java > > It's caused by JDK-8226575 which went in during the time I've started > this and intended push time. This test failure will be fixed with JDK- > 8231111 once I get another review from a Reviewer. I doubt many people > will notice though since cgroupv2 enabled systems aren't that > widespread just yet. > > Please let me know if there are any objections. > > Thanks, > Severin > > >> Bob. >> >> >>> Thanks, >>> David >>> >>>> Thanks, >>>> Bob Vandette >>>>> On Nov 29, 2019, at 4:04 AM, Severin Gehwolf wrote: >>>>> >>>>> On Fri, 2019-11-15 at 17:56 +0100, Severin Gehwolf wrote: >>>>>> On Fri, 2019-11-08 at 15:21 +0100, Severin Gehwolf wrote: >>>>>>> Hi Bob, >>>>>>> >>>>>>> On Wed, 2019-11-06 at 10:47 +0100, Severin Gehwolf wrote: >>>>>>>> On Tue, 2019-11-05 at 16:54 -0500, Bob Vandette wrote: >>>>>>>>> Severin, >>>>>>>>> >>>>>>>>> Thanks for taking on this cgroup v2 improvement. >>>>>>>>> >>>>>>>>> In general I like the implementation and the refactoring. The CachedMetric class is nice. >>>>>>>>> We can add any metric we want to cache in a more general way. >>>>>>>>> >>>>>>>>> Is this the latest version of the webrev?
>>>>>>>>> >>>>>>>>> http://cr.openjdk.java.net/~sgehwolf/webrevs/cgroupsv2-hotspot/03/webrev/src/hotspot/os/linux/cgroupV2Subsystem_linux.cpp.html >>>>>>>>> >>>>>>>>> It looks like you need to add the caching support for active_processor_count (JDK-8227006). >>>>>>> [...] >>>>>>>> I'll do a proper rebase ASAP. >>>>>>> >>>>>>> Latest webrev: >>>>>>> http://cr.openjdk.java.net/~sgehwolf/webrevs/cgroupsv2-hotspot/05/webrev/ >>>>>>> >>>>>>>>> I'm not sure it's worth providing different strings for Unlimited versus Max or Scaled shares. >>>>>>>>> I'd just try to be compatible with the cgroupv2 output so you don't have to change the test. >>>>>>>> >>>>>>>> OK. Will do. >>>>>>> >>>>>>> Unfortunately, there is no way of NOT changing TestCPUAwareness.java as >>>>>>> it expects CPU Shares to be written to the cgroup filesystem verbatim. >>>>>>> That's no longer the case for cgroups v2 (at least for crun). Either >>>>>>> way, most test changes are gone now. >>>>>>> >>>>>>>>> I wonder if it's worth trying to synthesize memory_max_usage_in_bytes() by keeping the highest >>>>>>>>> value ever returned by the API. >>>>>>>> >>>>>>>> Interesting idea. I'll ponder this a bit and get back to you. >>>>>>> >>>>>>> This has been implemented. I'm not sure this is correct, though. It >>>>>>> merely piggy-backs on calls to memory_usage_in_bytes() and keeps the >>>>>>> high watermark value of that. >>>>>>> >>>>>>> Testing passed on F31 with cgroups v2 controllers properly configured >>>>>>> (podman) and hybrid (legacy hierarchy) with docker/podman. >>>>>>> >>>>>>> Thoughts? >>>>>> >>>>>> Ping? >>>>> >>>>> Anyone willing to review this? It would be nice to make some progress.
>>>>> >>>>> Thanks, >>>>> Severin >>>>> >>>>>> Metrics work proposed for RFR here: >>>>>> http://mail.openjdk.java.net/pipermail/core-libs-dev/2019-November/063464.html >>>>>> >>>>>> Thanks, >>>>>> Severin > From kim.barrett at oracle.com Fri Jan 17 01:39:48 2020 From: kim.barrett at oracle.com (Kim Barrett) Date: Thu, 16 Jan 2020 20:39:48 -0500 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: References: <160821DF-BE15-435C-BDBE-AB553F794590@oracle.com> Message-ID: > On Jan 15, 2020, at 2:12 AM, David Holmes wrote: >> src/hotspot/share/utilities/globalDefinitions.hpp >> 262 // time unit conversion macros >> 263 >> 264 #define NANOS_TO_MILLIS(ns) ((ns) / NANOSECS_PER_MILLISEC) >> 265 #define MILLIS_TO_NANOS(ms) ((ms) * NANOSECS_PER_MILLISEC) >> Why are these macros, rather than (template) functions? > > Just because I just wanted a simple textual replacement to make it clearer that I'm converting from millis to nanos or vice versa. I reach for macros for such simple cases. > >> Also, depending on the type and value of ms, MILLIS_TO_NANOS could >> easily overflow, e.g. if ms type is a 32 bit type with a value of more >> than ~4 seconds. (I checked the two uses, and they happen to be okay.) > > These are not trying to be mathematically sound. The conversion from millis to nanos is used in two cases: > > 1. Converting a current timestamp in ms to ns. Unless the current time is set far in the future I don't think we have any issue with overflow of such a value. > > 2. converting an elapsed time in ms to ns. These will be small values so no overflow is possible. In this case, "not trying to be mathematically sound" is equivalent to actively but subtly dangerous to use, because of possible overflow leading to either UB or catastrophically wrong results. Consider "int millis = 10 * 1000;", e.g. 10 seconds worth of milliseconds. I hope you will agree that this falls under your case 2, e.g. that 10 seconds is not an especially big interval. 
I hope you will also agree that declaring a variable that holds a millisecond interval as an int is reasonable in some contexts. Applying MILLIS_TO_NANOS with the jint-typed NANOSECS_PER_MILLISEC, and the result about 1.4 seconds worth of nanoseconds, assuming the compiler doesn't do something strange because of the UB on integer overflow. (This is all in addition to the Style Guide's admonishment to use inline functions rather than macros. Macros should only be used for syntactic extensions.) >> inline int64_t nanos_to_millis(int64_t ns) { >> return ns / NANOSECS_PER_MILLISECOND; >> } >> inline int64_t millis_to_nanos(int64_t ms) { >> return ms * NANOSECONDS_PER_MILLISEC; >> } >> Also, the names don't suggest time conversions, but potentially >> arbitrary unit conversions, e.g. between something in NANOUNITS and >> something in MILLIUNITS. > > They don't have to be time conversions - the calculation is unit-less in practice. The fact we have NANOSEC_PER_MILLISECOND et al is just an artifact of introducing those values for timeout calculations/conversions - it could just be NANOS_PER_MILLI etc If they aren't time conversions then I don't think they should be using the time factors; just use NANOUNITS/MILLIUNITS. >> Regarding this from the audit: >> --- begin --- >> ./share/gc/parallel/psParallelCompact.cpp: // os::javaTimeMillis() does not guarantee monotonicity. >> ... >> ./share/gc/shared/referenceProcessor.cpp: // os::javaTimeMillis() does not guarantee monotonicity. >> These are all describing why the subsequent code uses javaTimeNanos not javaTimeMillis. >> --- end --- >> Do we really still support platforms that don't have a monotonic >> clock? I guess we appear to at least try. But it's really wrong that >> callers of os::javaTimeNanos should even think they need to cope with >> that function being non-monotonic. >> Hm, I always thought System.nanoTime() was a monotonic clock, but I >> don't see any such guarantee. 
So I guess Java just doesn't have such a >> thing. Wow! >> So I guess none of this is really relevant to the change at hand after all. > > I think you read the comments the wrong way round. The code uses javaTimeNanos not javaTimeMillis because javaTimeMillis is not monotonic and the code wants a monotonic clock. These comments were mostly inserted when the incorrect use of javaTimeMillis was replaced with javaTimeNanos. No, I did not read the comments the wrong way around. I was pointing out that (1) Some uses of os::javaTimeNanos attempt to cope with a possible lack of monotonicity, while some do not and may be assuming or even requiring monotonicity. I'm wondering whether the coping attempts are dead code, or instead, the latter are bugs. That is, do we require all platforms to provide a monotonic os::javaTimeNanos or not? Personally, I think in today's world we should make such a requirement. (2) There is no documented monotonicity guarantee for System.nanoTime, nor any hint of a possible issue there, which surprised me. From david.holmes at oracle.com Fri Jan 17 02:48:59 2020 From: david.holmes at oracle.com (David Holmes) Date: Fri, 17 Jan 2020 12:48:59 +1000 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: References: <160821DF-BE15-435C-BDBE-AB553F794590@oracle.com> Message-ID: Hi Kim, On 17/01/2020 11:39 am, Kim Barrett wrote: >> On Jan 15, 2020, at 2:12 AM, David Holmes wrote: >>> src/hotspot/share/utilities/globalDefinitions.hpp >>> 262 // time unit conversion macros >>> 263 >>> 264 #define NANOS_TO_MILLIS(ns) ((ns) / NANOSECS_PER_MILLISEC) >>> 265 #define MILLIS_TO_NANOS(ms) ((ms) * NANOSECS_PER_MILLISEC) >>> Why are these macros, rather than (template) functions? >> >> Just because I just wanted a simple textual replacement to make it clearer that I'm converting from millis to nanos or vice versa. I reach for macros for such simple cases. 
>> >>> Also, depending on the type and value of ms, MILLIS_TO_NANOS could >>> easily overflow, e.g. if ms type is a 32 bit type with a value of more >>> than ~4 seconds. (I checked the two uses, and they happen to be okay.) >> >> These are not trying to be mathematically sound. The conversion from millis to nanos is used in two cases: >> >> 1. Converting a current timestamp in ms to ns. Unless the current time is set far in the future I don't think we have any issue with overflow of such a value. >> >> 2. converting an elapsed time in ms to ns. These will be small values so no overflow is possible. > > In this case, "not trying to be mathematically sound" is equivalent to > actively but subtly dangerous to use, because of possible overflow > leading to either UB or catastrophically wrong results. > > Consider "int millis = 10 * 1000;", e.g. 10 seconds worth of > milliseconds. I hope you will agree that this falls under your case > 2, e.g. that 10 seconds is not an especially big interval. I hope you > will also agree that declaring a variable that holds a millisecond > interval as an int is reasonable in some contexts. Applying > MILLIS_TO_NANOS with the jint-typed NANOSECS_PER_MILLISEC, and the > result about 1.4 seconds worth of nanoseconds, assuming the compiler > doesn't do something strange because of the UB on integer overflow. So your concern is that someone will use the macros outside of existing usage contexts (where time values always use 64-bit variables) and thus introduce potential overflow bugs. Okay I will use the typed functions. > (This is all in addition to the Style Guide's admonishment to use > inline functions rather than macros. Macros should only be used for > syntactic extensions.)
>>> inline int64_t nanos_to_millis(int64_t ns) { >>> return ns / NANOSECS_PER_MILLISECOND; >>> } >>> inline int64_t millis_to_nanos(int64_t ms) { >>> return ms * NANOSECONDS_PER_MILLISEC; >>> } >>> Also, the names don't suggest time conversions, but potentially >>> arbitrary unit conversions, e.g. between something in NANOUNITS and >>> something in MILLIUNITS. >> >> They don't have to be time conversions - the calculation is unit-less in practice. The fact we have NANOSEC_PER_MILLISECOND et al is just an artifact of introducing those values for timeout calculations/conversions - it could just be NANOS_PER_MILLI etc > > If they aren't time conversions then I don't think they should be > using the time factors; just use NANOUNITS/MILLIUNITS. Okay. >>> Regarding this from the audit: >>> --- begin --- >>> ./share/gc/parallel/psParallelCompact.cpp: // os::javaTimeMillis() does not guarantee monotonicity. >>> ... >>> ./share/gc/shared/referenceProcessor.cpp: // os::javaTimeMillis() does not guarantee monotonicity. >>> These are all describing why the subsequent code uses javaTimeNanos not javaTimeMillis. >>> --- end --- >>> Do we really still support platforms that don't have a monotonic >>> clock? I guess we appear to at least try. But it's really wrong that >>> callers of os::javaTimeNanos should even think they need to cope with >>> that function being non-monotonic. >>> Hm, I always thought System.nanoTime() was a monotonic clock, but I >>> don't see any such guarantee. So I guess Java just doesn't have such a >>> thing. Wow! >>> So I guess none of this is really relevant to the change at hand after all. >> >> I think you read the comments the wrong way round. The code uses javaTimeNanos not javaTimeMillis because javaTimeMillis is not monotonic and the code wants a monotonic clock. These comments were mostly inserted when the incorrect use of javaTimeMillis was replaced with javaTimeNanos. > > No, I did not read the comments the wrong way around. 
> > I was pointing out that > > (1) Some uses of os::javaTimeNanos attempt to cope with a possible > lack of monotonicity, while some do not and may be assuming or even > requiring monotonicity. I'm wondering whether the coping attempts are > dead code, or instead, the latter are bugs. That is, do we require > all platforms to provide a monotonic os::javaTimeNanos or not? > Personally, I think in today's world we should make such a requirement. I assume you are looking at code where we explicitly ensure that time never jumps backwards? This is historical. The sad fact is that over the past 20+ years we have seen bug after bug after bug in the OS and virtualization layers that caused the system monotonic clock source to actually not behave monotonically. These bugs were prevalent enough in some situations that we had to implement the workarounds in the JVM. These workarounds have had costs and have varied over time. Ensuring that javaTimeNanos() itself is guaranteed monotonic imposes too high a cost - see related discussions in: https://bugs.openjdk.java.net/browse/JDK-6864866 https://bugs.openjdk.java.net/browse/JDK-6784100 regarding the situation on Solaris. In general we try to trust the system monotonic clock and deal with issues if they arise. So in a bugfree world: - os::javaTimeNanos is "guaranteed" monotonic (assuming the underlying monotonic clock exists on that platform) - os::currentTimeMillis() is not monotonic as it is (required to be) subject to time-of-day adjustments (e.g. ntp) > (2) There is no documented monotonicity guarantee for System.nanoTime, > nor any hint of a possible issue there, which surprised me. Again historical. Back in 2003 when we (JSR166 EG) specified this we also had to consider the implications for Java ME and the constrained environments in which it would execute, as well as the various environments supported by Java SE**. 
As a practical consideration it was necessary that a valid/conforming implementation of nanoTime() could be currentTimeMillis()*1000000 and without making heroic, and costly, efforts to ensure monotonicity - which was basically left as a quality of implementation issue. ** Windows versions without QueryPerformanceCounter; Linux 2.x kernels without CLOCK_MONOTONIC Thanks, David ---- > From david.holmes at oracle.com Fri Jan 17 05:48:14 2020 From: david.holmes at oracle.com (David Holmes) Date: Fri, 17 Jan 2020 15:48:14 +1000 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: References: <160821DF-BE15-435C-BDBE-AB553F794590@oracle.com> Message-ID: <50a1ee24-61a9-44fa-2590-69173765e9de@oracle.com> I tried but failed to generate an incremental webrev at: http://cr.openjdk.java.net/~dholmes/8235741/webrev.v3-incr/ I hadn't qrefresh'd and so it contains v2 and v3 changes :( Full webrev at: http://cr.openjdk.java.net/~dholmes/8235741/webrev.v3/ Changes: - conversion macros are now typed functions - os_posix.cpp had a name clash so fixed that and updated to use the new conversion functions Thanks, David On 17/01/2020 12:48 pm, David Holmes wrote: > Hi Kim, > > On 17/01/2020 11:39 am, Kim Barrett wrote: >>> On Jan 15, 2020, at 2:12 AM, David Holmes >>> wrote: >>>> src/hotspot/share/utilities/globalDefinitions.hpp >>>> 262 // time unit conversion macros >>>> 263 >>>> 264 #define NANOS_TO_MILLIS(ns) ((ns) / NANOSECS_PER_MILLISEC) >>>> 265 #define MILLIS_TO_NANOS(ms) ((ms) * NANOSECS_PER_MILLISEC) >>>> Why are these macros, rather than (template) functions? >>> >>> Just because I just wanted a simple textual replacement to make it >>> clearer that I'm converting from millis to nanos or vice versa. I >>> reach for macros for such simple cases. >>> >>>> Also, depending on the type and value of ms, MILLIS_TO_NANOS could >>>> easily overflow, e.g. if ms type is a 32 bit type with a value of more >>>> than ~4 seconds.
(I checked the two uses, and they happen to be okay.) >>> >>> These are not trying to be mathematically sound. The conversion from >>> millis to nanos is used in two cases: >>> >>> 1. Converting a current timestamp in ms to ns. Unless the current >>> time is set far in the future I don't think we have any issue with >>> overflow of such a value. >>> >>> 2. converting an elapsed time in ms to ns. These will be small values >>> so no overflow is possible. >> >> In this case, "not trying to be mathematically sound" is equivalent to >> actively but subtly dangerous to use, because of possible overflow >> leading to either UB or catastrophically wrong results. >> >> Consider "int millis = 10 * 1000;", e.g. 10 seconds worth of >> milliseconds. I hope you will agree that this falls under your case >> 2, e.g. that 10 seconds is not an especially big interval. I hope you >> will also agree that declaring a variable that holds a millisecond >> interval as an int is reasonable in some contexts. Applying >> MILLIS_TO_NANOS with the jint-typed NANOSECS_PER_MILLISEC, and the >> result is about 1.4 seconds worth of nanoseconds, assuming the compiler >> doesn't do something strange because of the UB on integer overflow. > > So your concern is that someone will use the macros outside of existing > usage contexts (where time values always use 64-bit variables) and > thus introduce potential overflow bugs. Okay I will use the typed > functions. > >> (This is all in addition to the Style Guide's admonishment to use >> inline functions rather than macros. Macros should only be used for >> syntactic extensions.) > > >>>> inline int64_t nanos_to_millis(int64_t ns) { >>>> return ns / NANOSECS_PER_MILLISEC; >>>> } >>>> inline int64_t millis_to_nanos(int64_t ms) { >>>> return ms * NANOSECS_PER_MILLISEC; >>>> } >>>> Also, the names don't suggest time conversions, but potentially >>>> arbitrary unit conversions, e.g.
between something in NANOUNITS and >>>> something in MILLIUNITS. >>> >>> They don't have to be time conversions - the calculation is unit-less >>> in practice. The fact we have NANOSECS_PER_MILLISEC et al is just >>> an artifact of introducing those values for timeout >>> calculations/conversions - it could just be NANOS_PER_MILLI etc >> >> If they aren't time conversions then I don't think they should be >> using the time factors; just use NANOUNITS/MILLIUNITS. > > Okay. > > >>>> Regarding this from the audit: >>>> --- begin --- >>>> ./share/gc/parallel/psParallelCompact.cpp: // os::javaTimeMillis() >>>> does not guarantee monotonicity. >>>> ... >>>> ./share/gc/shared/referenceProcessor.cpp: // os::javaTimeMillis() >>>> does not guarantee monotonicity. >>>> These are all describing why the subsequent code uses javaTimeNanos >>>> not javaTimeMillis. >>>> --- end --- >>>> Do we really still support platforms that don't have a monotonic >>>> clock? I guess we appear to at least try. But it's really wrong that >>>> callers of os::javaTimeNanos should even think they need to cope with >>>> that function being non-monotonic. >>>> Hm, I always thought System.nanoTime() was a monotonic clock, but I >>>> don't see any such guarantee. So I guess Java just doesn't have such a >>>> thing. Wow! >>>> So I guess none of this is really relevant to the change at hand >>>> after all. >>> >>> I think you read the comments the wrong way round. The code uses >>> javaTimeNanos not javaTimeMillis because javaTimeMillis is not >>> monotonic and the code wants a monotonic clock. These comments were >>> mostly inserted when the incorrect use of javaTimeMillis was replaced >>> with javaTimeNanos. >> >> No, I did not read the comments the wrong way around. >> >> I was pointing out that >> >> (1) Some uses of os::javaTimeNanos attempt to cope with a possible >> lack of monotonicity, while some do not and may be assuming or even >> requiring monotonicity.
I'm wondering whether the coping attempts are >> dead code, or instead, the latter are bugs. That is, do we require >> all platforms to provide a monotonic os::javaTimeNanos or not? >> Personally, I think in today's world we should make such a requirement. > > I assume you are looking at code where we explicitly ensure that time > never jumps backwards? This is historical. The sad fact is that over the > past 20+ years we have seen bug after bug after bug in the OS and > virtualization layers that caused the system monotonic clock source to > actually not behave monotonically. These bugs were prevalent enough in > some situations that we had to implement the workarounds in the JVM. > These workarounds have had costs and have varied over time. Ensuring > that javaTimeNanos() itself is guaranteed monotonic imposes too high a > cost - see related discussions in: > > https://bugs.openjdk.java.net/browse/JDK-6864866 > https://bugs.openjdk.java.net/browse/JDK-6784100 > > regarding the situation on Solaris. In general we try to trust the > system monotonic clock and deal with issues if they arise. > > So in a bugfree world: > - os::javaTimeNanos is "guaranteed" monotonic (assuming the underlying > monotonic clock exists on that platform) > - os::currentTimeMillis() is not monotonic as it is (required to be) > subject to time-of-day adjustments (e.g. ntp) > >> (2) There is no documented monotonicity guarantee for System.nanoTime, >> nor any hint of a possible issue there, which surprised me. > > Again historical. Back in 2003 when we (JSR166 EG) specified this we > also had to consider the implications for Java ME and the constrained > environments in which it would execute, as well as the various > environments supported by Java SE**.
As a practical consideration it was > necessary that a valid/conforming implementation of nanoTime() could be > currentTimeMillis()*1000000 and without making heroic, and costly, > efforts to ensure monotonicity - which was basically left as a quality > of implementation issue. > > ** Windows versions without QueryPerformanceCounter; Linux 2.x kernels > without CLOCK_MONOTONIC > > Thanks, > David > ---- > >> From matthias.baesken at sap.com Fri Jan 17 06:47:36 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Fri, 17 Jan 2020 06:47:36 +0000 Subject: RFR [XXS]: 8237382: Cleanup the OPT_SPEED_SRC file list in JvmFeatures.gmk Message-ID: Hello, please review this very small change . It removes files that are not present any more from the OPT_SPEED_SRC file list in JvmFeatures.gmk . ( this is a list of files to be optimized for speed when we otherwise optimize for size in the minimal-JVM build) Bug/webrev : https://bugs.openjdk.java.net/browse/JDK-8237382 http://cr.openjdk.java.net/~mbaesken/webrevs/8237382.0/ Thanks, Matthias From matthias.baesken at sap.com Fri Jan 17 08:07:49 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Fri, 17 Jan 2020 08:07:49 +0000 Subject: RFR [XS]: 8237374: configuring with --with-jvm-variants=minimal,server makes cds disappear in server Message-ID: Hello, please review this small patch . When building 2 VM variants minimal and server in one build and using --with-jvm-variants=minimal,server to configure this setup , the build works nicely. But I noticed that in the server VM, cds is removed. Instead of checking if cds should be enabled... yes I get (with some tracing added ) : configure: WARNING: ENABLE_CDS set to false because we found a minimal, core or zero JVM. checking if cds should be enabled... no The checks in hotspot.m4 disable cds in error for the server variant, because they match "minimal" as a substring of the string "minimal,server" .
The configure option --with-jvm-variants=minimal,server enables a multi-JVM variants build (variable BUILDING_MULTIPLE_JVM_VARIANTS in hotspot.m4) . This special build is only supported for VALID_MULTIPLE_JVM_VARIANTS="server client minimal" . So we had better not disable cds in a BUILDING_MULTIPLE_JVM_VARIANTS build (meaning minimal + server/client) ; minimal has cds disabled by default anyway. Bug/webrev : https://bugs.openjdk.java.net/browse/JDK-8237374 http://cr.openjdk.java.net/~mbaesken/webrevs/8237374.0/ Thanks, Matthias From matthias.baesken at sap.com Fri Jan 17 08:44:29 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Fri, 17 Jan 2020 08:44:29 +0000 Subject: RFR: 8237192: Generate stripped/public pdbs on Windows for jdk images Message-ID: Hello, please review this change related to stripped/"public" pdb file generation on Windows . Currently the JDK bundle on Windows does not contain pdb files (full pdb files are in a separate symbols bundle). This currently leads to bad native stack traces, e.g. when crashes occur. One reason not to deliver the full pdb files might be the large size of these files. However, there also exist "public" or stripped pdb files on Windows, see : https://docs.microsoft.com/en-us/cpp/build/reference/pdbstripped-strip-private-symbols?view=vs-2017 Those are much smaller (often only 10-20% of the full pdb files) and they offer a good compromise (no "file:linenumber" info in the native stacks but at least the function name+hex-offset is visible) compared to delivering full pdbs in the JDK. Example sizes for the currently built full pdbs / stripped pdbs from VS2017 based 64bit build of jdk/jdk : jvm.pdb : 73,1 MB / 9,46 MB awt.pdb : 7,05 MB / 1,48 MB The patch adds generation of stripped pdb files to the Windows build. Additionally those files are put into the JDK bundle (while the symbols bundle still gets the full pdb files ) .
Bug/webrev : https://bugs.openjdk.java.net/browse/JDK-8237192 http://cr.openjdk.java.net/~mbaesken/webrevs/8237192.0/ Thanks, Matthias From magnus.ihse.bursie at oracle.com Fri Jan 17 09:22:15 2020 From: magnus.ihse.bursie at oracle.com (Magnus Ihse Bursie) Date: Fri, 17 Jan 2020 10:22:15 +0100 Subject: RFR: 8237192: Generate stripped/public pdbs on Windows for jdk images In-Reply-To: References: Message-ID: On 2020-01-17 09:44, Baesken, Matthias wrote: > Hello, please review this change related to stripped/"public" pdb file generation on Windows . > > Currently the JDK bundle on Windows does not contain pdb files (full pdb files are in a separate symbols bundle). > This leads currently to bad native stack traces e.g. when crashes occur. > One reason not to deliver the full pdb files might be the large size of these files. > > However there exist also "public" or stripped pdb files on Windows, see : > > https://docs.microsoft.com/en-us/cpp/build/reference/pdbstripped-strip-private-symbols?view=vs-2017 > > Those are much smaller (often only 10-20% of the full pdb files) and they offer a good compromise (no "file:linenumber" info in the native stacks but at least the function name+hex-offset is visible) > to delivering full pdbs in the JDK. > > Example sizes for the currently built full pdbs / stripped pdbs from VS2017 based 64bit build of jdk/jdk : > jvm.pdb : 73,1 MB / 9,46 MB > awt.pdb : 7,05 MB / 1,48 MB > > The patch adds generation of stripped pdb files to the Windows build. > Additionally those files are put into the JDK bundle (while the symbols bundle still gets the full pdb files ) . > > > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8237192 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8237192.0/ What is the extra payload of all the *.stripped.pdb files together? 
/Magnus > > > Thanks, Matthias From matthias.baesken at sap.com Fri Jan 17 09:25:01 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Fri, 17 Jan 2020 09:25:01 +0000 Subject: RFR: 8237192: Generate stripped/public pdbs on Windows for jdk images In-Reply-To: References: Message-ID: Hello, my example product build (64 bit Windows / VS2017) shows the following sizes for the uncompressed pdb files : sum of size of all full pdbs : 117 MB (jvm.pdb is 73,1 MB ) sum of size of all stripped pdbs: 18,2 MB (jvm.pdb is 9,46 MB = ~ 50 % of all) Best regards, Matthias On 2020-01-17 09:44, Baesken, Matthias wrote: Hello, please review this change related to stripped/"public" pdb file generation on Windows . Currently the JDK bundle on Windows does not contain pdb files (full pdb files are in a separate symbols bundle). This leads currently to bad native stack traces e.g. when crashes occur. One reason not to deliver the full pdb files might be the large size of these files. However there exist also "public" or stripped pdb files on Windows, see : https://docs.microsoft.com/en-us/cpp/build/reference/pdbstripped-strip-private-symbols?view=vs-2017 Those are much smaller (often only 10-20% of the full pdb files) and they offer a good compromise (no "file:linenumber" info in the native stacks but at least the function name+hex-offset is visible) to delivering full pdbs in the JDK. Example sizes for the currently built full pdbs / stripped pdbs from VS2017 based 64bit build of jdk/jdk : jvm.pdb : 73,1 MB / 9,46 MB awt.pdb : 7,05 MB / 1,48 MB The patch adds generation of stripped pdb files to the Windows build. Additionally those files are put into the JDK bundle (while the symbols bundle still gets the full pdb files ) . Bug/webrev : https://bugs.openjdk.java.net/browse/JDK-8237192 http://cr.openjdk.java.net/~mbaesken/webrevs/8237192.0/ What is the extra payload of all the *.stripped.pdb files together? 
From magnus.ihse.bursie at oracle.com Fri Jan 17 09:31:06 2020 From: magnus.ihse.bursie at oracle.com (Magnus Ihse Bursie) Date: Fri, 17 Jan 2020 10:31:06 +0100 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: <172d4100-5cd2-2a99-f940-8aa25414c87a@oracle.com> References: <172d4100-5cd2-2a99-f940-8aa25414c87a@oracle.com> Message-ID: <5d4de23f-a89d-7217-1776-51a1f3611d9a@oracle.com> On 2020-01-16 10:30, David Holmes wrote: > Hi Matthias, > > On 16/01/2020 6:10 pm, Baesken, Matthias wrote: >> Hi David, sure we can introduce a way to switch this on/off. > > Thanks. > >> There is already something similar for the link-time optimization >> (flto) , see the feature >> >> JvmFeatures.gmk >> 180 ifeq ($(call check-jvm-feature, link-time-opt), true) >> 190 ifeq ($(call check-jvm-feature, link-time-opt), false) >> >> hotspot.m4 >> 29 static-build link-time-opt aot jfr" >> 502 JVM_FEATURES_link_time_opt="link-time-opt" > > Yep familiar with that from Minimal VM and SE Embedded days :) > >> Should we have "link-time-gc" additionally to "link-time-opt"? >> (however it would be a bit misleading that it is a "JVM" >> feature , but except linux s390x it is only changing the build of >> the JDK libs) . > > I agree the definition of this as a "JVM" feature is a bit > odd/misleading. Perhaps the build folk have a suggestion on how to > refactor this kind of option into something more general? In the > meantime having link-time-gc sit alongside link-time-opt seems > acceptable to me. We don't have the concept of "JDK features", akin to "JVM features". Maybe we should have. It's an idea worth exploring, anyway. The way we currently do on/off features for the entire JDK is by using autoconf options. So, in this case, --enable-link-time-gc, or something like that. It might just as well be on by default, which gives us a --disable-link-time-gc instead.
(I understand David's reservation is not about this being the default, just that it is not possible to simply turn it off.) Matthias: Have a look at some recently added options to get an indication of the best practice in adding new options. There are some ways to easily make this incorrect, and our code is full of old examples that do this in unnecessarily complex, or downright wrong, ways. (These should be fixed, and we should probably introduce a simpler API for doing this, and so on... I'll address those as soon as time permits.) /Magnus > > Thanks, > David > >> Best regards, Matthias >> >> >> >>> >>> Hi Matthias, >>> >>> I have reservations about turning this on by default and with no way to >>> control it. This seems like it should be something you have to >>> opt-in to >>> initially while we gain some experience with it and ensure there are no >>> unexpected side-effects. After that it could be enabled by default. >>> >> From matthias.baesken at sap.com Fri Jan 17 11:44:28 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Fri, 17 Jan 2020 11:44:28 +0000 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: <5d4de23f-a89d-7217-1776-51a1f3611d9a@oracle.com> References: <172d4100-5cd2-2a99-f940-8aa25414c87a@oracle.com> <5d4de23f-a89d-7217-1776-51a1f3611d9a@oracle.com> Message-ID: * Matthias: Have a look at some recently added options to get an indication of the best practice in adding new options. There are some ways to easily make this incorrect Hi Magnus, do you have a good/"best practice" example (not that I catch a bad one :) ) ? Best regards, Matthias On 2020-01-16 10:30, David Holmes wrote: Hi Matthias, On 16/01/2020 6:10 pm, Baesken, Matthias wrote: Hi David, sure we can introduce a way to switch this on/off. Thanks.
There is already something similar for the link-time optimization (flto) , see the feature JvmFeatures.gmk 180 ifeq ($(call check-jvm-feature, link-time-opt), true) 190 ifeq ($(call check-jvm-feature, link-time-opt), false) hotspot.m4 29 static-build link-time-opt aot jfr" 502 JVM_FEATURES_link_time_opt="link-time-opt" Yep familiar with that from Minimal VM and SE Embedded days :) Should we have "link-time-gc" additionally to " link-time-opt" ? (however it would be a bit misleading that it is a "JVM" feature , but except linux s390x it is only changing the build of the JDK libs) . I agree the definition of this as a "JVM" feature is a bit odd/misleading. Perhaps the build folk have a suggestion on how to refactor this kind of option into something more general? In the meantime having link-time-gc sit alongside link-time-opt seems acceptable to me. We don't have the concept of "JDK features", akin to "JVM features". Maybe we should have. It's an idea worth exploring, anyway. The way we currently do on/off features for the entire JDK is by using autoconf options. So, in this case, --enable-link-time-gc, or something like that. It might just as well be on by default, which gives us a --disable-link-time-gc instead. (I understand David's reservation not about this being the default, just that it is not possible to simply turn off.) Matthias: Have a look at some recently added option to get an indication of the best practice in adding new options. There are some ways to easily make this incorrect, and our code is full of old examples that does this unnecessary complex, or downright wrong. (These should be fixed, and we should probably introduce a simpler API for doing this, and so on... I'll address those as soon as time permits.) 
/Magnus From erik.joelsson at oracle.com Fri Jan 17 14:09:05 2020 From: erik.joelsson at oracle.com (Erik Joelsson) Date: Fri, 17 Jan 2020 06:09:05 -0800 Subject: RFR [XXS]: 8237382: Cleanup the OPT_SPEED_SRC file list in JvmFeatures.gmk In-Reply-To: References: Message-ID: Looks good. /Erik On 2020-01-16 22:47, Baesken, Matthias wrote: > Hello, please review this very small change . > > It removes file that are not present any more from the OPT_SPEED_SRC file list in JvmFeatures.gmk . > > ( this is a list of files to be optimized for speed when we otherwise optimize for size in the minimal-JVM build) > > > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8237382 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8237382.0/ > > Thanks, Matthias > From erik.joelsson at oracle.com Fri Jan 17 14:17:34 2020 From: erik.joelsson at oracle.com (Erik Joelsson) Date: Fri, 17 Jan 2020 06:17:34 -0800 Subject: RFR [XS]: 8237374: configuring with --with-jvm-variants=minimal,server makes cds disappear in server In-Reply-To: References: Message-ID: <57f43f5a-ae37-5d98-e9e4-bb30c3607300@oracle.com> Hello Matthias, Using BUILDING_MULTIPLE_JVM_VARIANTS as condition is clever and happens to coincide with the set of variants that also support CDS, but I would say this correlation is incidental. I would still prefer an explicit test for if any of the variants that do support CDS is in the set of variants being built. This will make it much easier to read and understand the logic. Simply: if ! HOTSPOT_CHECK_JVM_VARIANT(server) && ! HOTSPOT_CHECK_JVM_VARIANT(client); then ? ENABLE_CDS="false" ? ... /Erik On 2020-01-17 00:07, Baesken, Matthias wrote: > Hello, please review this small patch . > > When building 2 VM variants minimal and server in one build and using > > --with-jvm-variants=minimal,server > > to configure this setup , the build works nicely. But I notice that in the server VM, cds is removed. > Instead of > > checking if cds should be enabled... 
yes > > I get (with some tracing added ) : > > configure: WARNING: ENABLE_CDS set to false because we found a minimal, core or zero JVM. > checking if cds should be enabled... no > > The checks in hotspot.m4 disables cds by error for the server v, because it matches to minimal in the string "minimal,server" . > > The configure option --with-jvm-variants=minimal,server enables a multi-JVM variants build (variable BUILDING_MULTIPLE_JVM_VARIANTS in hotspot.m4) . > This special build is only supported for VALID_MULTIPLE_JVM_VARIANTS="server client minimal" . > So we better do not disable cds in a BUILDING_MULTIPLE_JVM_VARIANTS - build (means minimal + server/client ) ; minimal has cds disabled by default anyway. > > > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8237374 > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8237374.0/ > > > Thanks, Matthias From erik.joelsson at oracle.com Fri Jan 17 15:20:09 2020 From: erik.joelsson at oracle.com (Erik Joelsson) Date: Fri, 17 Jan 2020 07:20:09 -0800 Subject: RFR: 8237192: Generate stripped/public pdbs on Windows for jdk images In-Reply-To: References: Message-ID: Hello Matthias, Providing these stripped pdb files in the distribution is a good idea, but finding a good solution in the build is unfortunately more complicated than this. The JDK image we ship should (with very few exceptions) be the result of running jlink on all the jmods. If a user runs jlink and includes all the jmods we ship with the JDK, the result should be essentially equivalent to the original JDK image. The way the stripped pdb files are included in the bundles sort of at the last second of the build here breaks this property. Any user generated image would miss the stripped pdb files since they aren't packaged in the jmods. To properly implement this, care will need to be taken to juggle the two sets of pdb files around, making sure each build and test use case has the correct one in place where and when it's needed. 
Quite possibly, we cannot cover all use cases with one build configuration. Developers needing the full debug symbols when debugging locally would likely need to disable the stripped symbols so they get the full symbols everywhere. Possibly this would need to be the default for debug builds and configurable for release builds. /Erik On 2020-01-17 00:44, Baesken, Matthias wrote: > Hello, please review this change related to stripped/"public" pdb file generation on Windows . > > Currently the JDK bundle on Windows does not contain pdb files (full pdb files are in a separate symbols bundle). > This leads currently to bad native stack traces e.g. when crashes occur. > One reason not to deliver the full pdb files might be the large size of these files. > > However there exist also "public" or stripped pdb files on Windows, see : > > https://docs.microsoft.com/en-us/cpp/build/reference/pdbstripped-strip-private-symbols?view=vs-2017 > > Those are much smaller (often only 10-20% of the full pdb files) and they offer a good compromise (no "file:linenumber" info in the native stacks but at least the function name+hex-offset is visible) > to delivering full pdbs in the JDK. > > Example sizes for the currently built full pdbs / stripped pdbs from VS2017 based 64bit build of jdk/jdk : > jvm.pdb : 73,1 MB / 9,46 MB > awt.pdb : 7,05 MB / 1,48 MB > > The patch adds generation of stripped pdb files to the Windows build. > Additionally those files are put into the JDK bundle (while the symbols bundle still gets the full pdb files ) . > > > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8237192 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8237192.0/ > > > Thanks, Matthias From daniel.daugherty at oracle.com Fri Jan 17 15:37:51 2020 From: daniel.daugherty at oracle.com (Daniel D. 
Daugherty) Date: Fri, 17 Jan 2020 10:37:51 -0500 Subject: RFR: 8235966: Process obsolete flags less aggressively In-Reply-To: <6a3de8da-746d-7e5d-9a0e-2157a0963ed8@oracle.com> References: <674e4f0d-563f-3bb4-dee0-74aaacb9b270@oracle.com> <6a3de8da-746d-7e5d-9a0e-2157a0963ed8@oracle.com> Message-ID: On 1/16/20 12:57 AM, David Holmes wrote: > Getting back to this ... You added this update to the bug report: > Update: after further discussion it has been proposed that we use the > build number as the trigger for a whitebox or gtest that performs the > currently disabled full verification of the flag table. So if a flag > has not been obsoleted or expired as it should by build N** then we fail > the gtest. This will complement the relaxing of the obsoletion check > at the start of the release cycle. I was expecting a new test in this latest webrev that would start failing at Build 20... let me see what the latest webrev says... > > Please see updated webrev at: > > http://cr.openjdk.java.net/~dholmes/8235966/webrev/ src/hotspot/share/runtime/arguments.cpp    No comments. So the changes are the same as the last round with the addition of enabling the following at B20: L755:           warning("Global variable for obsolete special flag entry \"%s\" should be removed", flag.name); L769:           warning("Global variable for expired flag entry \"%s\" should be removed", flag.name); > Apologies as I mistakenly overwrote the original instead of creating v3. > > This version expands on the original proposal by uncommenting the > warnings about obsolete/expired flags that have not been removed from > globals*.hpp, so that we don't forget to do this work. However these > warnings are only enabled from build 20. I used 20 as being approx 2/3 > through the release cycle - long enough that the work should have been > performed by then, whilst still leaving time to perform the work > before RDP2. Of course we can tweak this number if there are issues > with that choice. Okay...
but doesn't this mean that every test would issue these warnings as of B20 if we have not completed the work? So we have the potential of a raft (some unknown number) of test failures due to unexpected output in the form of these warning messages. And worse, these failures would be unrelated to the actual issue... :-( How about adding a diagnostic flag that enables these two warning messages (in addition to the B20 check). Add a single test that runs:   java -XX:+UnlockDiagnosticVMOptions -XX:+FlagShouldNotBeDefinedCheck -version and when we hit B20, if there are still flags that haven't been removed, then the test will fail and we'll have one test that fails (times the number of configs that run the test). Dan > > Thanks, > David > > On 20/12/2019 8:20 am, David Holmes wrote: >> Thanks Dan. >> >> FTR I've updated the bug report with an extension to this proposal, >> which is to add back the flag table validation checks via a >> gtest that we only enable after a certain build in the release cycle >> (it always passes before then). That way we avoid the problems I've >> outlined with the initial version bump but also have a safety net in >> place to ensure we don't forget to actually obsolete/expire flags. >> >> Cheers, >> David >> >> On 19/12/2019 3:37 am, Daniel D. Daugherty wrote: >>> Hi David, >>> >>> On 12/17/19 5:03 PM, David Holmes wrote: >>>> Hi Dan, >>>> >>>> Thanks for taking a look. Updated webrev: >>>> >>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v2/ >>> >>> src/hotspot/share/runtime/arguments.cpp >>>     I like the updates to header comment for >>> verify_special_jvm_flags(). >>> >>> Thumbs up. >>> >>> >>>> >>>> Discussion below. >>> >>> Replies below. >>> >>> >>>> >>>> On 18/12/2019 1:47 am, Daniel D.
Daugherty wrote: >>>>> On 12/16/19 12:16 AM, David Holmes wrote: >>>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8235966 >>>>>> webrev: http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>>>> >>>>> src/hotspot/share/runtime/arguments.cpp >>>>>     L745:      // if flag has become obsolete it should not have >>>>> a "globals" flag defined anymore. >>>>>     L746:      if (!version_less_than(JDK_Version::current(), >>>>> flag.obsolete_in)) { >>>>>     L747:        if (JVMFlag::find_declared_flag(flag.name) != >>>>> NULL) { >>>>>     L748:          // Temporarily disable the warning: 8196739 >>>>>     L749:          // warning("Global variable for obsolete >>>>> special flag entry \"%s\" should be removed", flag.name); >>>>>     L750:        } >>>>>     L751:      } >>>>>         It seems like we've been down a similar road before: >>>>> >>>>>         JDK-8196739 Disable obsolete/expired VM flag transitional >>>>> warnings >>>>>         https://bugs.openjdk.java.net/browse/JDK-8196739 >>>>> >>>>>         This one may ring a bell... Fixed by dholmes in >>>>> jdk11-b01... :-) >>>>> >>>>>         And this followup sub-task to re-enable that warning: >>>>> >>>>>         JDK-8196741 Re-enable obsolete/expired VM flag >>>>> transitional warnings >>>>>         https://bugs.openjdk.java.net/browse/JDK-8196741 >>>>> >>>>>         was closed as "Won't fix" on 2019.08.02. >>>>> >>>>>         So the obvious questions: >>>>> >>>>>         - Why is the new warning less problematic to tests that >>>>> don't >>>>>           tolerate unexpected output? >>>> >>>> Two different situations. The commented out warning happens >>>> unconditionally when you run the VM and it finds any flag marked >>>> obsolete that hasn't been removed. Hence every single test will >>>> encounter this warning. >>> >>> Ouch on such verbosity. >>> >>> >>>> The situation I am modifying is when a test uses a flag that is >>>> marked for obsoletion.
In the majority of cases the flag is already >>>> deprecated and so already issuing a deprecation warning that the >>>> test has to handle. Without my change there would still be an >>>> obsoletion warning, so this test is in for a warning no matter what. >>> >>> Good that your change only comes into play when the flag is used. >>> >>> >>>> Also note that for hotspot at least we have strived to make tests >>>> tolerate unexpected output. The reason JDK-8196741 was closed as >>>> "won't fix" was because other areas wouldn't commit to doing that. >>> >>> Yup. Got it. >>> >>> >>>> >>>>> - If you move forward with this fix, then I think this code >>>>>   block needs to be removed or modified or am I missing >>>>> something? >>>> >>>> I've rewritten the comment at the head of verify_special_jvm_flags >>>> to explain why we can't issue a warning, and have deleted the block. >>> >>> Thanks for deleting the stale code. >>> >>> >>>> >>>>> There's a similar commented out check on L757-L765, but >>>>> that one >>>>> is for an expired flag... You might want to adjust/delete >>>>> it also? >>>> >>>> Deleted. >>> >>> Thanks. >>> >>> >>>> >>>>>     L753:        warning("Special flag entry \"%s\" must be >>>>> explicitly obsoleted before expired.", flag.name); >>>>>     L754:        success = false; >>>>>         nit - s/before expired/before being expired/ >>>>>         Update: I now see that "style" is in several places in this >>>>>             function. I'm not sure what to think here... it grates, >>>>>             but I can live with it. >>>>> >>>>>         nit - L75[34] indented too much by two spaces. >>>> >>>> Fixed. >>>> >>>>>     L962:          return real_name; >>>>>         nit - indented too much by two spaces. >>>> >>>> Fixed. >>>> >>>>> >>>>> Trying to understand the modified logic in argument processing is >>>>> making my head spin... >>>> >>>> Mine too.
It took a few attempts to put the logic in the right >>>> place and make adjustments so that it all works as expected for a >>>> correctly specified flag and an erroneous one. >>> >>> I keep trying to convince myself that we're improving this flag and >>> options code with each release... :-) >>> >>> >>>> >>>>> - You've added a call to is_obsolete_flag() in >>>>>   handle_aliases_and_deprecation() and is_obsolete_flag() >>>>>   is where the new warning is output: >>>>> >>>>>     warning("Temporarily processing option %s; support is >>>>> scheduled for removal in %s" >>>>> >>>>>   handle_aliases_and_deprecation() is called from six different >>>>> places, >>>>>   but the call sites are different based on the argument pattern >>>>> so I >>>>>   have (mostly) convinced myself that there should not be any >>>>> duplicate >>>>>   warning lines. >>>> >>>> Right - handle_aliases_and_deprecation is only called for a >>>> syntactically correct flag based on those patterns. It normally >>>> filters out obsoleted/expired flags and lets them fall through to >>>> later error processing (in process_argument after parse_arg returns >>>> false). That error processing is where the normal obsoletion check >>>> is performed. So I had to not filter the flag in >>>> handle_aliases_and_deprecation in this case, but still produce the >>>> warning for a malformed flag. E.g. >>>> >>>> java -XX:+UseParallelOldGC -version >>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily processing >>>> option UseParallelOldGC; support is scheduled for removal in 15.0 >>>> java version "15-internal" 2020-09-15 >>>> >>>> java -XX:UseParallelOldGC -version >>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily processing >>>> option UseParallelOldGC; support is scheduled for removal in 15.0 >>>> Missing +/- setting for VM option 'UseParallelOldGC' >>>> Error: Could not create the Java Virtual Machine. >>> >>> Thanks for the example. That helps a lot.
>>> >>> >>>> >>>>> So I now understand the new logic that allows an obsoleted option >>>>> to be specified with a warning as long as the option still exists. >>>>> I'm good with the technical change, but... >>>>> >>>>> I'm worried about tests that don't tolerate the new warning mesg, >>>>> i.e., why wouldn't this become an issue again: >>>>> >>>>> JDK-8196739 Disable obsolete/expired VM flag transitional warnings >>>> >>>> Explained above. >>> >>> Yup and thanks. >>> >>> Dan >>> >>> >>>> >>>> Thanks, >>>> David >>>> >>>>> Dan >>>>> >>>>> >>>>>> >>>>>> When a flag is marked as obsolete in the special-flags table we >>>>>> will ignore it and issue a warning that it is being ignored, as >>>>>> soon as we bump the version of the JDK. That means that any tests >>>>>> still using the obsolete flag may start to fail, leading to a >>>>>> surge of test failures at the start of a release cycle. For >>>>>> example for JDK 15 we have a whole bunch of JFR tests that fail >>>>>> because they still try to work with UseParallelOldGC. In another >>>>>> case runtime/cds/appcds/FieldLayoutFlags.java passes only be >>>>>> accident. >>>>>> >>>>>> When a flag is marked as obsolete for a release, all code >>>>>> involving that flag (including tests that use it) must be updated >>>>>> within that release and the flag itself removed. Whilst this is >>>>>> typically scheduled early in a release cycle it isn't reasonable >>>>>> to expect it to all occur within the first couple of days of the >>>>>> release cycle, nor do we want to have to ProblemList a bunch of >>>>>> tests when they start failing. >>>>>> >>>>>> What I propose is to instead allow an obsolete flag to continue >>>>>> to be processed as long as that code removal has not actually >>>>>> occurred - with an adjusted warning. 
The change I propose: >>>>>> - only treats an obsolete flag as obsolete if the flag cannot be >>>>>> found >>>>>> - added a new flag verification rule that disallows obsoletion in >>>>>> an undefined version, but expiration in a specific version i.e. >>>>>> we must always explicitly obsolete a flag before we expire it. >>>>>> >>>>>> The only downside here is that if we actually forget to file an >>>>>> issue for the actual obsoletion work we may not notice via >>>>>> testing. Of course whenever a change is made to the flags table >>>>>> to add an entry then the issue to do the obsoletion should be >>>>>> filed at the same time. >>>>>> >>>>>> Thanks, >>>>>> David >>>>>> ----- >>>>>> >>>>> >>> From david.holmes at oracle.com Fri Jan 17 22:25:16 2020 From: david.holmes at oracle.com (David Holmes) Date: Sat, 18 Jan 2020 08:25:16 +1000 Subject: RFR: 8237192: Generate stripped/public pdbs on Windows for jdk images In-Reply-To: References: Message-ID: <215dd116-d0f6-6e35-6b02-20d3986fa85a@oracle.com> Hi Matthias, This also needs to be a configurable option, not one enabled by default, as there can be non-technical issues relating to shipping symbol files in a product. Thanks, David On 17/01/2020 6:44 pm, Baesken, Matthias wrote: > Hello, please review this change related to stripped/"public" pdb file generation on Windows. > > Currently the JDK bundle on Windows does not contain pdb files (full pdb files are in a separate symbols bundle). > Currently this leads to bad native stack traces, e.g. when crashes occur. > One reason not to deliver the full pdb files might be the large size of these files. 
> > However, stripped ("public") pdb files also exist on Windows, see: > > https://docs.microsoft.com/en-us/cpp/build/reference/pdbstripped-strip-private-symbols?view=vs-2017 > > Those are much smaller (often only 10-20% of the full pdb files) and they offer a good compromise (no "file:linenumber" info in the native stacks, but at least the function name + hex offset is visible) > compared to delivering full pdbs in the JDK. > > Example sizes for the currently built full pdbs / stripped pdbs from a VS2017-based 64-bit build of jdk/jdk: > jvm.pdb : 73.1 MB / 9.46 MB > awt.pdb : 7.05 MB / 1.48 MB > > The patch adds generation of stripped pdb files to the Windows build. > Additionally, those files are put into the JDK bundle (while the symbols bundle still gets the full pdb files). > > > Bug/webrev: > > https://bugs.openjdk.java.net/browse/JDK-8237192 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8237192.0/ > > > Thanks, Matthias > From sgehwolf at redhat.com Sat Jan 18 20:57:55 2020 From: sgehwolf at redhat.com (Severin Gehwolf) Date: Sat, 18 Jan 2020 21:57:55 +0100 Subject: [15] RFR(XS): 8237479: 8230305 causes slowdebug build failure Message-ID: Hi, My push of hotspot's cgroup v2 support (JDK-8230305) broke the slowdebug build. Sorry about this :-/ The issue is that CgroupController and CgroupSubsystem are abstract classes and I neglected to declare their virtual functions as pure. This patch fixes that. Bug: https://bugs.openjdk.java.net/browse/JDK-8237479 webrev: http://cr.openjdk.java.net/~sgehwolf/webrevs/JDK-8237479/01/webrev/ Testing: slowdebug/fastdebug/release builds work on Linux x86_64. Currently running through jdk/submit. Thoughts? 
Thanks, Severin From david.holmes at oracle.com Sun Jan 19 00:07:55 2020 From: david.holmes at oracle.com (David Holmes) Date: Sun, 19 Jan 2020 10:07:55 +1000 Subject: [15] RFR(XS): 8237479: 8230305 causes slowdebug build failure In-Reply-To: References: Message-ID: <9405f2f3-d573-0b79-8f92-77f7e221d433@oracle.com> Hi Severin, Looks fine and trivial. Thanks, David On 19/01/2020 6:57 am, Severin Gehwolf wrote: > Hi, > > My push of hotspot's cgroup v2 support (JDK-8230305) broke the > slowdebug build. Sorry about this :-/ The issue is that > CgroupController and CgroupSubsystem are abstract classes and I missed > to declare virtual functions as pure. > > This patch fixes that. > > Bug: https://bugs.openjdk.java.net/browse/JDK-8237479 > webrev: http://cr.openjdk.java.net/~sgehwolf/webrevs/JDK-8237479/01/webrev/ > > Testing: slowdebug/fastdebug/release builds work on Linux x86_64. > Currently running through jdk/submit. > > Thoughts? > > Thanks, > Severin > From daniel.daugherty at oracle.com Sun Jan 19 14:16:32 2020 From: daniel.daugherty at oracle.com (Daniel D. Daugherty) Date: Sun, 19 Jan 2020 09:16:32 -0500 Subject: [15] RFR(XS): 8237479: 8230305 causes slowdebug build failure In-Reply-To: <9405f2f3-d573-0b79-8f92-77f7e221d433@oracle.com> References: <9405f2f3-d573-0b79-8f92-77f7e221d433@oracle.com> Message-ID: Thumbs up! Please push as soon as you can to unbreak our CI's Tier2 builds. Thanks! Dan On 1/18/20 7:07 PM, David Holmes wrote: > Hi Severin, > > Looks fine and trivial. > > Thanks, > David > > On 19/01/2020 6:57 am, Severin Gehwolf wrote: >> Hi, >> >> My push of hotspot's cgroup v2 support (JDK-8230305) broke the >> slowdebug build. Sorry about this :-/ The issue is that >> CgroupController and CgroupSubsystem are abstract classes and I missed >> to declare virtual functions as pure. >> >> This patch fixes that. 
>> >> Bug: https://bugs.openjdk.java.net/browse/JDK-8237479 >> webrev: >> http://cr.openjdk.java.net/~sgehwolf/webrevs/JDK-8237479/01/webrev/ >> >> Testing: slowdebug/fastdebug/release builds work on Linux x86_64. >> ????????? Currently running through jdk/submit. >> >> Thoughts? >> >> Thanks, >> Severin >> From sgehwolf at redhat.com Sun Jan 19 19:38:13 2020 From: sgehwolf at redhat.com (Severin Gehwolf) Date: Sun, 19 Jan 2020 20:38:13 +0100 Subject: [15] RFR(XS): 8237479: 8230305 causes slowdebug build failure In-Reply-To: References: <9405f2f3-d573-0b79-8f92-77f7e221d433@oracle.com> Message-ID: <48cfeafd45448bda4232382dc540cf5df0118a5f.camel@redhat.com> Thanks for the quick reviews! Pushed. Cheers, Severin On Sun, 2020-01-19 at 09:16 -0500, Daniel D. Daugherty wrote: > Thumbs up! > > Please push as soon as you can to unbreak our CI's Tier2 builds. > Thanks! > > Dan > > > On 1/18/20 7:07 PM, David Holmes wrote: > > Hi Severin, > > > > Looks fine and trivial. > > > > Thanks, > > David > > > > On 19/01/2020 6:57 am, Severin Gehwolf wrote: > > > Hi, > > > > > > My push of hotspot's cgroup v2 support (JDK-8230305) broke the > > > slowdebug build. Sorry about this :-/ The issue is that > > > CgroupController and CgroupSubsystem are abstract classes and I > > > missed > > > to declare virtual functions as pure. > > > > > > This patch fixes that. > > > > > > Bug: https://bugs.openjdk.java.net/browse/JDK-8237479 > > > webrev: > > > http://cr.openjdk.java.net/~sgehwolf/webrevs/JDK-8237479/01/webrev/ > > > > > > Testing: slowdebug/fastdebug/release builds work on Linux x86_64. > > > Currently running through jdk/submit. > > > > > > Thoughts? 
> > > > > > Thanks, > > > Severin From david.holmes at oracle.com Sun Jan 19 21:46:53 2020 From: david.holmes at oracle.com (David Holmes) Date: Mon, 20 Jan 2020 07:46:53 +1000 Subject: RFR: 8235966: Process obsolete flags less aggressively In-Reply-To: References: <674e4f0d-563f-3bb4-dee0-74aaacb9b270@oracle.com> <6a3de8da-746d-7e5d-9a0e-2157a0963ed8@oracle.com> Message-ID: <25156ff2-9dd2-53ae-16bc-83d60c0afb1a@oracle.com> Hi Dan, On 18/01/2020 1:37 am, Daniel D. Daugherty wrote: > On 1/16/20 12:57 AM, David Holmes wrote: >> Getting back to this ... > > You added this update to the bug report: > >> Update: after further discussion it has been proposed that we use the >> build number as the trigger for a whitebox or gtest that performs the >> currently disabled full verification of the flag table. So if a flag >> has not been obsoleted or expired as it should by build N** then we fail >> the gtest. This will complement the relaxing of the obsoletion check >> at the start of the release cycle. > > I was expecting a new test in this latest webrev that would start failing > at Build 20... let me see what the latest webrev says... Mea culpa. When I started thinking about the test it was evident the test logic would just involve the existing commented-out warning logic. So I thought let's just turn those warnings on at build 20 but ... >> >> Please see updated webrev at: >> >> http://cr.openjdk.java.net/~dholmes/8235966/webrev/ > > src/hotspot/share/runtime/arguments.cpp > No comments. > > So the changes are the same as the last round with the addition of > enabling the following at B20: > > L755: warning("Global variable for obsolete special flag entry > \"%s\" should be removed", flag.name); > L769: warning("Global variable for expired flag entry \"%s\" > should be removed", flag.name); > > >> Apologies as I mistakenly overwrote the original instead of creating v3. 
>> >> This version expands on the original proposal by uncommenting the >> warnings about obsolete/expired flags that have not been removed from >> globals*.hpp, so that we don't forget to do this work. However these >> warnings are only enabled from build 20. I used 20 as being approx 2/3 >> through the release cycle - long enough that the work should have been >> performed by then, whilst still leaving time to perform the work >> before RDP2. Of course we can tweak this number if there are issues >> with that choice. > > Okay... but doesn't this mean that every test would issue these warnings > as of B20 if we have not completed the work? So we have the potential of > a raft (some unknown number) of test failures due to unexpected output in > the form of these warning messages. And worse, these failures would be > unrelated to actual issue... :-( ... as you note the end result is not really what we want - a single clearly failing test. > How about adding a diagnostic flag that enables these two warning > messages (in addition to the B20 check). Add a single test that runs: > > ? java -XX:+UnlockDiagnosticVMOptions -XX:+FlagShouldNotBeDefinedCheck > -version > > and when we hit B20, if there are still flags that haven't been removed, > then the test will fail and we'll have one test that fails (X the number > of configs that run the test). I definitely do not want to add another VM flag :) but will look at how to (re)run that logic as part of a gtest. Thanks, David > Dan > > >> >> Thanks, >> David >> >> On 20/12/2019 8:20 am, David Holmes wrote: >>> Thanks Dan. >>> >>> FTR I've updated the bug report with an extension to this proposal, >>> which is to add back the flag table validation checks to use via a >>> gtest that we only enable after a certain build in the release cycle >>> (it always passes before then). 
That way we avoid the problems I've >>> outlined with the initial version bump but also have a safety net in >>> place to ensure we don't forget to actually obsolete/expire flags. >>> >>> Cheers, >>> David >>> >>> On 19/12/2019 3:37 am, Daniel D. Daugherty wrote: >>>> Hi David, >>>> >>>> On 12/17/19 5:03 PM, David Holmes wrote: >>>>> Hi Dan, >>>>> >>>>> Thanks for taking a look. Updated webrev: >>>>> >>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v2/ >>>> >>>> src/hotspot/share/runtime/arguments.cpp >>>> ???? I like the updates to header comment for >>>> verify_special_jvm_flags(). >>>> >>>> Thumbs up. >>>> >>>> >>>>> >>>>> Discussion below. >>>> >>>> Replies below. >>>> >>>> >>>>> >>>>> On 18/12/2019 1:47 am, Daniel D. Daugherty wrote: >>>>>> On 12/16/19 12:16 AM, David Holmes wrote: >>>>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8235966 >>>>>>> webrev: http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>>>>> >>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>> ???? L745: ????? // if flag has become obsolete it should not have >>>>>> a "globals" flag defined anymore. >>>>>> ???? L746: ????? if (!version_less_than(JDK_Version::current(), >>>>>> flag.obsolete_in)) { >>>>>> ???? L747: ??????? if (JVMFlag::find_declared_flag(flag.name) != >>>>>> NULL) { >>>>>> ???? L748: ????????? // Temporarily disable the warning: 8196739 >>>>>> ???? L749: ????????? // warning("Global variable for obsolete >>>>>> special flag entry \"%s\" should be removed", flag.name); >>>>>> ???? L750: ??????? } >>>>>> ???? L751: ????? } >>>>>> ???????? It seems like we've been down a similar road before: >>>>>> >>>>>> ???????? JDK-8196739 Disable obsolete/expired VM flag transitional >>>>>> warnings >>>>>> ???????? https://bugs.openjdk.java.net/browse/JDK-8196739 >>>>>> >>>>>> ???????? This one may ring a bell... Fixed by dholmes in >>>>>> jdk11-b01... :-) >>>>>> >>>>>> ???????? And this followup sub-task to re-enable that warning: >>>>>> >>>>>> ???????? 
JDK-8196741 Re-enable obsolete/expired VM flag >>>>>> transitional warnings >>>>>> ???????? https://bugs.openjdk.java.net/browse/JDK-8196741 >>>>>> >>>>>> ???????? was closed as "Won't fix" on 2019.08.02. >>>>>> >>>>>> ???????? So the obvious questions: >>>>>> >>>>>> ???????? - Why is the new warning less problematic to tests that >>>>>> don't >>>>>> ?????????? tolerate unexpected output? >>>>> >>>>> Two different situations. The commented out warning happens >>>>> unconditionally when you run the VM and it finds any flag marked >>>>> obsolete that hasn't been removed. Hence every single test will >>>>> encounter this warning. >>>> >>>> Ouch on such verbosity. >>>> >>>> >>>>> The situation I am modifying is when a test uses a flag that is >>>>> marked for obsoletion. In the majority of cases the flag is already >>>>> deprecated and so already issuing a deprecation warning that the >>>>> test has to handle. Without my change there would still be an >>>>> obsoletion warning, so this test is in for a warning no matter what. >>>> >>>> Good that your change only comes into play when the flag is used. >>>> >>>> >>>>> Also note that for hotspot at least we have strived to make tests >>>>> tolerate unexpected output. The reason JDK-8196741 was closed as >>>>> "won't fix" was because other areas wouldn't commit to doing that. >>>> >>>> Yup. Got it. >>>> >>>> >>>>> >>>>>> ???????? - If you move forward with this fix, then I think think code >>>>>> ?????????? block needs to be removed or modified or am I missing >>>>>> something? >>>>> >>>>> I've rewritten the comment at the head of verify_special_jvm_flags >>>>> to explain why we can't issue a warning, and have deleted the block. >>>> >>>> Thanks for deleting the stale code. >>>> >>>> >>>>> >>>>>> ???????? There's a similar commented out check on L757-L765, but >>>>>> that one >>>>>> ???????? is for an expired flag... You might want to adjust/delete >>>>>> it also? >>>>> >>>>> Deleted. >>>> >>>> Thanks. 
>>>> >>>> >>>>> >>>>>> ???? L753: ??????? warning("Special flag entry \"%s\" must be >>>>>> explicitly obsoleted before expired.", flag.name); >>>>>> ???? L754: ??????? success = false; >>>>>> ???????? nit - s/before expired/before being expired/ >>>>>> ???????? Update: I now see that "style" is in several places in this >>>>>> ???????????? function. I'm not sure what to think here... it grates, >>>>>> ?? ? ? ? ? ? but I can live with it. >>>>>> >>>>>> ???????? nit - L75[34] indented too much by two spaces. >>>>> >>>>> Fixed. >>>>> >>>>>> ???? L962: ????????? return real_name; >>>>>> ???????? nit - indented too much by two spaces. >>>>> >>>>> Fixed. >>>>> >>>>>> >>>>>> Trying to understand the modified logic in argument processing is >>>>>> making my head spin... >>>>> >>>>> Mine too. It took a few attempts to put the logic in the right >>>>> place and make adjustments so that it all works as expected for a >>>>> correctly specified flag and an erroneous one. >>>> >>>> I keep trying to convince myself that we're improving this flag and >>>> options code with each release... :-) >>>> >>>> >>>>> >>>>>> - You've added a call to is_obsolete_flag() in >>>>>> ?? handle_aliases_and_deprecation() and is_obsolete_flag() >>>>>> ?? is where the new warning is output: >>>>>> >>>>>> ???? warning("Temporarily processing option %s; support is >>>>>> scheduled for removal in %s" >>>>>> >>>>>> ?? handle_aliases_and_deprecation() is called from six different >>>>>> places, >>>>>> ?? but the call sites are different based on the argument pattern >>>>>> so I >>>>>> ?? have (mostly) convinced myself that there should not be any >>>>>> duplicate >>>>>> ?? warning lines. >>>>> >>>>> Right - handle_aliases_and_deprecation is only called for a >>>>> syntactically correct flag based on those patterns. It normally >>>>> filters out obsoleted/expired flags and lets them fall through to >>>>> later error processing (in process_argument after parse_arg returns >>>>> false). 
That error processing is where the normal obsoletion check >>>>> is performed. So I had to not filter the flag in >>>>> handle_aliases_and_deprecation in this case, but still produce the >>>>> warning for a malformed flag. E.g. >>>>> >>>>> java -XX:+UseParallelOldGC -version >>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily processing >>>>> option UseParallelOldGC; support is scheduled for removal in 15.0 >>>>> java version "15-internal" 2020-09-15 >>>>> >>>>> java -XX:UseParallelOldGC -version >>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily processing >>>>> option UseParallelOldGC; support is scheduled for removal in 15.0 >>>>> Missing +/- setting for VM option 'UseParallelOldGC' >>>>> Error: Could not create the Java Virtual Machine. >>>> >>>> Thanks for the example. That helps a lot. >>>> >>>> >>>>> >>>>>> So I now understand the new logic that allows an obsoleted option >>>>>> to be specified with a warning as long as the option still exists. >>>>>> I'm good with the technical change, but... >>>>>> >>>>>> I'm worried about tests that don't tolerate the new warning mesg, >>>>>> i.e., why wouldn't this become an issue again: >>>>>> >>>>>> JDK-8196739 Disable obsolete/expired VM flag transitional warnings >>>>> >>>>> Explained above. >>>> >>>> Yup and thanks. >>>> >>>> Dan >>>> >>>> >>>>> >>>>> Thanks, >>>>> David >>>>> >>>>>> Dan >>>>>> >>>>>> >>>>>>> >>>>>>> When a flag is marked as obsolete in the special-flags table we >>>>>>> will ignore it and issue a warning that it is being ignored, as >>>>>>> soon as we bump the version of the JDK. That means that any tests >>>>>>> still using the obsolete flag may start to fail, leading to a >>>>>>> surge of test failures at the start of a release cycle. For >>>>>>> example for JDK 15 we have a whole bunch of JFR tests that fail >>>>>>> because they still try to work with UseParallelOldGC. In another >>>>>>> case runtime/cds/appcds/FieldLayoutFlags.java passes only be >>>>>>> accident. 
>>>>>>> >>>>>>> When a flag is marked as obsolete for a release, all code >>>>>>> involving that flag (including tests that use it) must be updated >>>>>>> within that release and the flag itself removed. Whilst this is >>>>>>> typically scheduled early in a release cycle it isn't reasonable >>>>>>> to expect it to all occur within the first couple of days of the >>>>>>> release cycle, nor do we want to have to ProblemList a bunch of >>>>>>> tests when they start failing. >>>>>>> >>>>>>> What I propose is to instead allow an obsolete flag to continue >>>>>>> to be processed as long as that code removal has not actually >>>>>>> occurred - with an adjusted warning. The change I propose: >>>>>>> >>>>>>> - only treats an obsolete flag as obsolete if the flag cannot be >>>>>>> found >>>>>>> - added a new flag verification rule that disallows obsoletion in >>>>>>> an undefined version, but expiration in a specific version i.e. >>>>>>> we must always explicitly obsolete a flag before we expire it. >>>>>>> >>>>>>> The only downside here is that if we actually forget to file an >>>>>>> issue for the actual obsoletion work we may not notice via >>>>>>> testing. Of course whenever a change is made to the flags table >>>>>>> to add an entry then the issue to do the obsoletion should be >>>>>>> filed at the same time. >>>>>>> >>>>>>> Thanks, >>>>>>> David >>>>>>> ----- >>>>>>> >>>>>> >>>> > From daniel.daugherty at oracle.com Sun Jan 19 22:09:37 2020 From: daniel.daugherty at oracle.com (Daniel D. Daugherty) Date: Sun, 19 Jan 2020 17:09:37 -0500 Subject: [15] RFR(XS): 8237479: 8230305 causes slowdebug build failure In-Reply-To: <48cfeafd45448bda4232382dc540cf5df0118a5f.camel@redhat.com> References: <9405f2f3-d573-0b79-8f92-77f7e221d433@oracle.com> <48cfeafd45448bda4232382dc540cf5df0118a5f.camel@redhat.com> Message-ID: Thanks. Tier2 passed... Dan On 1/19/20 2:38 PM, Severin Gehwolf wrote: > Thanks for the quick reviews! Pushed. 
> > Cheers, > Severin > > On Sun, 2020-01-19 at 09:16 -0500, Daniel D. Daugherty wrote: >> Thumbs up! >> >> Please push as soon as you can to unbreak our CI's Tier2 builds. >> Thanks! >> >> Dan >> >> >> On 1/18/20 7:07 PM, David Holmes wrote: >>> Hi Severin, >>> >>> Looks fine and trivial. >>> >>> Thanks, >>> David >>> >>> On 19/01/2020 6:57 am, Severin Gehwolf wrote: >>>> Hi, >>>> >>>> My push of hotspot's cgroup v2 support (JDK-8230305) broke the >>>> slowdebug build. Sorry about this :-/ The issue is that >>>> CgroupController and CgroupSubsystem are abstract classes and I >>>> missed >>>> to declare virtual functions as pure. >>>> >>>> This patch fixes that. >>>> >>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8237479 >>>> webrev: >>>> http://cr.openjdk.java.net/~sgehwolf/webrevs/JDK-8237479/01/webrev/ >>>> >>>> Testing: slowdebug/fastdebug/release builds work on Linux x86_64. >>>> Currently running through jdk/submit. >>>> >>>> Thoughts? >>>> >>>> Thanks, >>>> Severin >>>> From david.holmes at oracle.com Mon Jan 20 01:49:59 2020 From: david.holmes at oracle.com (David Holmes) Date: Mon, 20 Jan 2020 11:49:59 +1000 Subject: RFR: 8235966: Process obsolete flags less aggressively In-Reply-To: <25156ff2-9dd2-53ae-16bc-83d60c0afb1a@oracle.com> References: <674e4f0d-563f-3bb4-dee0-74aaacb9b270@oracle.com> <6a3de8da-746d-7e5d-9a0e-2157a0963ed8@oracle.com> <25156ff2-9dd2-53ae-16bc-83d60c0afb1a@oracle.com> Message-ID: gtest version: http://cr.openjdk.java.net/~dholmes/8235966/webrev.v4/ Bug report updated to show gtest output. Thanks, David On 20/01/2020 7:46 am, David Holmes wrote: > Hi Dan, > > On 18/01/2020 1:37 am, Daniel D. Daugherty wrote: >> On 1/16/20 12:57 AM, David Holmes wrote: >>> Getting back to this ... 
>> >> You added this update to the bug report: >> >>> Update: after further discussion it has been proposed that we use the >>> build number as the trigger for a whitebox or gtest that performs the >>> currently disabled full verification of the flag table. So if a flag >>> has not be obsoleted or expired as it should by build N** then we >>> fail the gtest. This will complement the relaxing of the obsoletion >>> check at the start of the release cycle. >> >> I was expecting a new test in this latest webrev that would start failing >> at Build 20... let me see what the latest webrev says... > > Mea culpa. When I started thinking about the test it was evident the > test logic would just involve the existing commented out warning logic. > So I thought lets just turn those warnings on at build 20 but ... > >>> >>> Please see updated webrev at: >>> >>> http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >> >> src/hotspot/share/runtime/arguments.cpp >> ???? No comments. >> >> So the changes are the same as the last round with the addition of >> enabling the following at B20: >> >> L755: ????????? warning("Global variable for obsolete special flag >> entry \"%s\" should be removed", flag.name); >> L769: ????????? warning("Global variable for expired flag entry \"%s\" >> should be removed", flag.name); >> >> >>> Apologies as I mistakenly overwrote the original instead of creating v3. >>> >>> This version expands on the original proposal by uncommenting the >>> warnings about obsolete/expired flags that have not been removed from >>> globals*.hpp, so that we don't forget to do this work. However these >>> warnings are only enabled from build 20. I used 20 as being approx >>> 2/3 through the release cycle - long enough that the work should have >>> been performed by then, whilst still leaving time to perform the work >>> before RDP2. Of course we can tweak this number if there are issues >>> with that choice. >> >> Okay... 
but doesn't this mean that every test would issue these warnings >> as of B20 if we have not completed the work? So we have the potential of >> a raft (some unknown number) of test failures due to unexpected output in >> the form of these warning messages. And worse, these failures would be >> unrelated to actual issue... :-( > > ... as you note the end result is not really what we want - a single > clearly failing test. > >> How about adding a diagnostic flag that enables these two warning >> messages (in addition to the B20 check). Add a single test that runs: >> >> ?? java -XX:+UnlockDiagnosticVMOptions >> -XX:+FlagShouldNotBeDefinedCheck -version >> >> and when we hit B20, if there are still flags that haven't been removed, >> then the test will fail and we'll have one test that fails (X the number >> of configs that run the test). > > I definitely do not want to add another VM flag :) but will look at how > to (re)run that logic as part of a gtest. > > Thanks, > David > >> Dan >> >> >>> >>> Thanks, >>> David >>> >>> On 20/12/2019 8:20 am, David Holmes wrote: >>>> Thanks Dan. >>>> >>>> FTR I've updated the bug report with an extension to this proposal, >>>> which is to add back the flag table validation checks to use via a >>>> gtest that we only enable after a certain build in the release cycle >>>> (it always passes before then). That way we avoid the problems I've >>>> outlined with the initial version bump but also have a safety net in >>>> place to ensure we don't forget to actually obsolete/expire flags. >>>> >>>> Cheers, >>>> David >>>> >>>> On 19/12/2019 3:37 am, Daniel D. Daugherty wrote: >>>>> Hi David, >>>>> >>>>> On 12/17/19 5:03 PM, David Holmes wrote: >>>>>> Hi Dan, >>>>>> >>>>>> Thanks for taking a look. Updated webrev: >>>>>> >>>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v2/ >>>>> >>>>> src/hotspot/share/runtime/arguments.cpp >>>>> ???? I like the updates to header comment for >>>>> verify_special_jvm_flags(). >>>>> >>>>> Thumbs up. 
>>>>> >>>>> >>>>>> >>>>>> Discussion below. >>>>> >>>>> Replies below. >>>>> >>>>> >>>>>> >>>>>> On 18/12/2019 1:47 am, Daniel D. Daugherty wrote: >>>>>>> On 12/16/19 12:16 AM, David Holmes wrote: >>>>>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8235966 >>>>>>>> webrev: http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>>>>>> >>>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>>> ???? L745: ????? // if flag has become obsolete it should not >>>>>>> have a "globals" flag defined anymore. >>>>>>> ???? L746: ????? if (!version_less_than(JDK_Version::current(), >>>>>>> flag.obsolete_in)) { >>>>>>> ???? L747: ??????? if (JVMFlag::find_declared_flag(flag.name) != >>>>>>> NULL) { >>>>>>> ???? L748: ????????? // Temporarily disable the warning: 8196739 >>>>>>> ???? L749: ????????? // warning("Global variable for obsolete >>>>>>> special flag entry \"%s\" should be removed", flag.name); >>>>>>> ???? L750: ??????? } >>>>>>> ???? L751: ????? } >>>>>>> ???????? It seems like we've been down a similar road before: >>>>>>> >>>>>>> ???????? JDK-8196739 Disable obsolete/expired VM flag >>>>>>> transitional warnings >>>>>>> ???????? https://bugs.openjdk.java.net/browse/JDK-8196739 >>>>>>> >>>>>>> ???????? This one may ring a bell... Fixed by dholmes in >>>>>>> jdk11-b01... :-) >>>>>>> >>>>>>> ???????? And this followup sub-task to re-enable that warning: >>>>>>> >>>>>>> ???????? JDK-8196741 Re-enable obsolete/expired VM flag >>>>>>> transitional warnings >>>>>>> ???????? https://bugs.openjdk.java.net/browse/JDK-8196741 >>>>>>> >>>>>>> ???????? was closed as "Won't fix" on 2019.08.02. >>>>>>> >>>>>>> ???????? So the obvious questions: >>>>>>> >>>>>>> ???????? - Why is the new warning less problematic to tests that >>>>>>> don't >>>>>>> ?????????? tolerate unexpected output? >>>>>> >>>>>> Two different situations. The commented out warning happens >>>>>> unconditionally when you run the VM and it finds any flag marked >>>>>> obsolete that hasn't been removed. 
Hence every single test will >>>>>> encounter this warning. >>>>> >>>>> Ouch on such verbosity. >>>>> >>>>> >>>>>> The situation I am modifying is when a test uses a flag that is >>>>>> marked for obsoletion. In the majority of cases the flag is >>>>>> already deprecated and so already issuing a deprecation warning >>>>>> that the test has to handle. Without my change there would still >>>>>> be an obsoletion warning, so this test is in for a warning no >>>>>> matter what. >>>>> >>>>> Good that your change only comes into play when the flag is used. >>>>> >>>>> >>>>>> Also note that for hotspot at least we have strived to make tests >>>>>> tolerate unexpected output. The reason JDK-8196741 was closed as >>>>>> "won't fix" was because other areas wouldn't commit to doing that. >>>>> >>>>> Yup. Got it. >>>>> >>>>> >>>>>> >>>>>>> ???????? - If you move forward with this fix, then I think think >>>>>>> code >>>>>>> ?????????? block needs to be removed or modified or am I missing >>>>>>> something? >>>>>> >>>>>> I've rewritten the comment at the head of verify_special_jvm_flags >>>>>> to explain why we can't issue a warning, and have deleted the block. >>>>> >>>>> Thanks for deleting the stale code. >>>>> >>>>> >>>>>> >>>>>>> ???????? There's a similar commented out check on L757-L765, but >>>>>>> that one >>>>>>> ???????? is for an expired flag... You might want to >>>>>>> adjust/delete it also? >>>>>> >>>>>> Deleted. >>>>> >>>>> Thanks. >>>>> >>>>> >>>>>> >>>>>>> ???? L753: ??????? warning("Special flag entry \"%s\" must be >>>>>>> explicitly obsoleted before expired.", flag.name); >>>>>>> ???? L754: ??????? success = false; >>>>>>> ???????? nit - s/before expired/before being expired/ >>>>>>> ???????? Update: I now see that "style" is in several places in this >>>>>>> ???????????? function. I'm not sure what to think here... it grates, >>>>>>> ?? ? ? ? ? ? but I can live with it. >>>>>>> >>>>>>> ???????? nit - L75[34] indented too much by two spaces. 
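The verification rule quoted above (a special-flag entry must not expire in a concrete version while its obsoletion version is left undefined) boils down to a simple table check. A minimal model follows; the entries and the helper name are invented for illustration, and the real check lives in verify_special_jvm_flags() in arguments.cpp:

```python
def verify_special_flag(name, obsolete_in, expired_in):
    """A flag with a concrete expiration version must also have a concrete
    obsoletion version, i.e. it is explicitly obsoleted before being expired."""
    if expired_in is not None and obsolete_in is None:
        print('warning: Special flag entry "%s" must be explicitly '
              'obsoleted before being expired.' % name)
        return False
    return True

print(verify_special_flag("GoodFlag", obsolete_in=15, expired_in=16))   # True
print(verify_special_flag("BadFlag", obsolete_in=None, expired_in=16))  # warning, then False
```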
>>>>>> >>>>>> Fixed. >>>>>> >>>>>>> ???? L962: ????????? return real_name; >>>>>>> ???????? nit - indented too much by two spaces. >>>>>> >>>>>> Fixed. >>>>>> >>>>>>> >>>>>>> Trying to understand the modified logic in argument processing is >>>>>>> making my head spin... >>>>>> >>>>>> Mine too. It took a few attempts to put the logic in the right >>>>>> place and make adjustments so that it all works as expected for a >>>>>> correctly specified flag and an erroneous one. >>>>> >>>>> I keep trying to convince myself that we're improving this flag and >>>>> options code with each release... :-) >>>>> >>>>> >>>>>> >>>>>>> - You've added a call to is_obsolete_flag() in >>>>>>> ?? handle_aliases_and_deprecation() and is_obsolete_flag() >>>>>>> ?? is where the new warning is output: >>>>>>> >>>>>>> ???? warning("Temporarily processing option %s; support is >>>>>>> scheduled for removal in %s" >>>>>>> >>>>>>> ?? handle_aliases_and_deprecation() is called from six different >>>>>>> places, >>>>>>> ?? but the call sites are different based on the argument pattern >>>>>>> so I >>>>>>> ?? have (mostly) convinced myself that there should not be any >>>>>>> duplicate >>>>>>> ?? warning lines. >>>>>> >>>>>> Right - handle_aliases_and_deprecation is only called for a >>>>>> syntactically correct flag based on those patterns. It normally >>>>>> filters out obsoleted/expired flags and lets them fall through to >>>>>> later error processing (in process_argument after parse_arg >>>>>> returns false). That error processing is where the normal >>>>>> obsoletion check is performed. So I had to not filter the flag in >>>>>> handle_aliases_and_deprecation in this case, but still produce the >>>>>> warning for a malformed flag. E.g. 
>>>>>> >>>>>> java -XX:+UseParallelOldGC -version >>>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily processing >>>>>> option UseParallelOldGC; support is scheduled for removal in 15.0 >>>>>> java version "15-internal" 2020-09-15 >>>>>> >>>>>> java -XX:UseParallelOldGC -version >>>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily processing >>>>>> option UseParallelOldGC; support is scheduled for removal in 15.0 >>>>>> Missing +/- setting for VM option 'UseParallelOldGC' >>>>>> Error: Could not create the Java Virtual Machine. >>>>> >>>>> Thanks for the example. That helps a lot. >>>>> >>>>> >>>>>> >>>>>>> So I now understand the new logic that allows an obsoleted option >>>>>>> to be specified with a warning as long as the option still exists. >>>>>>> I'm good with the technical change, but... >>>>>>> >>>>>>> I'm worried about tests that don't tolerate the new warning mesg, >>>>>>> i.e., why wouldn't this become an issue again: >>>>>>> >>>>>>> JDK-8196739 Disable obsolete/expired VM flag transitional warnings >>>>>> >>>>>> Explained above. >>>>> >>>>> Yup and thanks. >>>>> >>>>> Dan >>>>> >>>>> >>>>>> >>>>>> Thanks, >>>>>> David >>>>>> >>>>>>> Dan >>>>>>> >>>>>>> >>>>>>>> >>>>>>>> When a flag is marked as obsolete in the special-flags table we >>>>>>>> will ignore it and issue a warning that it is being ignored, as >>>>>>>> soon as we bump the version of the JDK. That means that any >>>>>>>> tests still using the obsolete flag may start to fail, leading >>>>>>>> to a surge of test failures at the start of a release cycle. For >>>>>>>> example for JDK 15 we have a whole bunch of JFR tests that fail >>>>>>>> because they still try to work with UseParallelOldGC. In another >>>>>>>> case runtime/cds/appcds/FieldLayoutFlags.java passes only by >>>>>>>> accident.
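The behaviour in the example output can be modeled compactly: a flag marked obsolete is only rejected once its global definition has actually been removed, and until then it is processed with the "Temporarily processing option" warning. The sketch below uses hypothetical helper and data shapes, not the actual arguments.cpp logic:

```python
def process_flag(name, special_obsolete, declared_flags, version="15.0"):
    """Model of the relaxed obsoletion handling: warn but keep processing
    while the flag's global still exists; treat it as obsolete only once
    the code removal has actually happened."""
    if name in special_obsolete:
        if name in declared_flags:
            print("warning: Temporarily processing option %s; "
                  "support is scheduled for removal in %s" % (name, version))
            return "processed"
        return "obsolete-ignored"
    return "processed"

special = {"UseParallelOldGC"}
# Global still defined: warn and keep going, as in the output above.
print(process_flag("UseParallelOldGC", special, {"UseParallelOldGC"}))
# Global removed: now the flag really is obsolete and gets ignored.
print(process_flag("UseParallelOldGC", special, set()))
```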
>>>>>>>> >>>>>>>> When a flag is marked as obsolete for a release, all code >>>>>>>> involving that flag (including tests that use it) must be >>>>>>>> updated within that release and the flag itself removed. Whilst >>>>>>>> this is typically scheduled early in a release cycle it isn't >>>>>>>> reasonable to expect it to all occur within the first couple of >>>>>>>> days of the release cycle, nor do we want to have to ProblemList >>>>>>>> a bunch of tests when they start failing. >>>>>>>> >>>>>>>> What I propose is to instead allow an obsolete flag to continue >>>>>>>> to be processed as long as that code removal has not actually >>>>>>>> occurred - with an adjusted warning. The change I propose: >>>>>>>> >>>>>>>> - only treats an obsolete flag as obsolete if the flag cannot be >>>>>>>> found >>>>>>>> - added a new flag verification rule that disallows obsoletion >>>>>>>> in an undefined version, but expiration in a specific version >>>>>>>> i.e. we must always explicitly obsolete a flag before we expire it. >>>>>>>> >>>>>>>> The only downside here is that if we actually forget to file an >>>>>>>> issue for the actual obsoletion work we may not notice via >>>>>>>> testing. Of course whenever a change is made to the flags table >>>>>>>> to add an entry then the issue to do the obsoletion should be >>>>>>>> filed at the same time. >>>>>>>> >>>>>>>> Thanks, >>>>>>>> David >>>>>>>> ----- >>>>>>>> >>>>>>> >>>>> >> From nick.gasson at arm.com Mon Jan 20 08:15:08 2020 From: nick.gasson at arm.com (Nick Gasson) Date: Mon, 20 Jan 2020 16:15:08 +0800 Subject: RFR: 8237512: AArch64: aarch64TestHook leaks a BufferBlob Message-ID: <20ac5b8d-4e38-db8d-4579-ff4a742365ef@arm.com> Hello, Bug: https://bugs.openjdk.java.net/browse/JDK-8237512 Webrev: http://cr.openjdk.java.net/~ngasson/8237512/webrev.0/ aarch64TestHook() is called from ICache::initialize() and allocates a 500k BufferBlob that it passes to entry() in assembler_aarch64.cpp which performs a self-test of the assembler. 
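As a toy model of the leak and the fix under review (allocate the test buffer only in debug builds, and free it once the self-test finishes), consider the sketch below. The CodeCache class, the 500k size, and the hook name are illustrative stand-ins, not the HotSpot implementation:

```python
class CodeCache:
    """Tiny stand-in for the JVM code cache: tracks used space in Kb."""
    def __init__(self, size_kb):
        self.size_kb = size_kb
        self.used_kb = 0
    def allocate(self, kb):
        self.used_kb += kb
        return kb
    def free(self, kb):
        self.used_kb -= kb

def run_test_hook(cache, debug_build=True):
    if not debug_build:
        return                    # skip entirely when the self-test is compiled out
    blob = cache.allocate(500)    # the ~500k test BufferBlob
    # ... assembler self-test would run here ...
    cache.free(blob)              # the fix: release the blob afterwards

cache = CodeCache(49152)
run_test_hook(cache)
print(cache.used_kb)  # 0 -- nothing left behind
```

Freeing the blob after the self-test is what accounts for the used=1427Kb to used=940Kb drop reported in the PrintCodeCache output.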
The BufferBlob needs to be explicitly freed or else it will hang around forever. Also we can just skip the allocation if ASSERT is not defined as the test is not performed in this case. java -XX:+PrintCodeCache -Xint -version before: CodeCache: size=49152Kb used=1427Kb max_used=1427Kb free=47724Kb After: CodeCache: size=49152Kb used=940Kb max_used=940Kb free=48212Kb Tested jtreg tier1. Thanks, Nick From adinn at redhat.com Mon Jan 20 08:40:15 2020 From: adinn at redhat.com (Andrew Dinn) Date: Mon, 20 Jan 2020 08:40:15 +0000 Subject: RFR: 8237512: AArch64: aarch64TestHook leaks a BufferBlob In-Reply-To: <20ac5b8d-4e38-db8d-4579-ff4a742365ef@arm.com> References: <20ac5b8d-4e38-db8d-4579-ff4a742365ef@arm.com> Message-ID: On 20/01/2020 08:15, Nick Gasson wrote: > Hello, > > Bug: https://bugs.openjdk.java.net/browse/JDK-8237512 > Webrev: http://cr.openjdk.java.net/~ngasson/8237512/webrev.0/ > > aarch64TestHook() is called from ICache::initialize() and allocates a > 500k BufferBlob that it passes to entry() in assembler_aarch64.cpp which > performs a self-test of the assembler. The BufferBlob needs to be > explicitly freed or else it will hang around forever. Also we can just > skip the allocation if ASSERT is not defined as the test is not > performed in this case. > > java -XX:+PrintCodeCache -Xint -version before: > > CodeCache: size=49152Kb used=1427Kb max_used=1427Kb free=47724Kb > > After: > > CodeCache: size=49152Kb used=940Kb max_used=940Kb free=48212Kb > > Tested jtreg tier1. Reviewed, thanks. regards, Andrew Dinn ----------- Senior Principal Software Engineer Red Hat UK Ltd Registered in England and Wales under Company Registration No.
03798903 Directors: Michael Cunningham, Michael ("Mike") O'Neill From nick.gasson at arm.com Mon Jan 20 08:47:46 2020 From: nick.gasson at arm.com (Nick Gasson) Date: Mon, 20 Jan 2020 16:47:46 +0800 Subject: RFR: 8237512: AArch64: aarch64TestHook leaks a BufferBlob In-Reply-To: References: <20ac5b8d-4e38-db8d-4579-ff4a742365ef@arm.com> Message-ID: On 20/01/2020 16:40, Andrew Dinn wrote: > Reviewed, thanks. > Thanks Andrew. Would you mind pushing it for me? My colleague with commit access is away at the moment. Nick From aph at redhat.com Mon Jan 20 09:54:33 2020 From: aph at redhat.com (Andrew Haley) Date: Mon, 20 Jan 2020 09:54:33 +0000 Subject: [aarch64-port-dev ] RFR: 8237512: AArch64: aarch64TestHook leaks a BufferBlob In-Reply-To: References: <20ac5b8d-4e38-db8d-4579-ff4a742365ef@arm.com> Message-ID: <2e202576-00f9-8ca1-e52b-33968059e8a2@redhat.com> On 1/20/20 8:47 AM, Nick Gasson wrote: > On 20/01/2020 16:40, Andrew Dinn wrote: >> Reviewed, thanks. >> > > Thanks Andrew. Would you mind pushing it for me? My colleague with > commit access is away at the moment. Please check the copyright dates: the RH copyright might want updating, but the Oracle one probably doesn't.
> I'm not sure what the rules are for this: I bumped the copyright year in the first line of each file I modified to 2020, as this seems to be what everyone else does. Is aarch64Test.cpp missing an "Oracle and/or its affiliates" copyright line? Thanks, Nick From adinn at redhat.com Mon Jan 20 10:29:23 2020 From: adinn at redhat.com (Andrew Dinn) Date: Mon, 20 Jan 2020 10:29:23 +0000 Subject: [aarch64-port-dev ] RFR: 8237512: AArch64: aarch64TestHook leaks a BufferBlob In-Reply-To: References: <20ac5b8d-4e38-db8d-4579-ff4a742365ef@arm.com> <2e202576-00f9-8ca1-e52b-33968059e8a2@redhat.com> Message-ID: On 20/01/2020 10:07, Nick Gasson wrote: > On 20/01/2020 17:54, Andrew Haley wrote: >> >> Please check the copyright dates: the RH copyright might want updating, >> but the Oracle one probably doesn't. >> > > I'm not sure what the rules are for this: I bumped the copyright year in > the first line of each file I modified to 2020, as this seems to be what > everyone else does. Is aarch64Test.cpp missing an "Oracle and/or its > affiliates" copyright line? Well, I'm confused by this. As I understand it the Oracle copyright is supposed to be up to date with whatever is the latest change while the Red Hat copyright does not need to be kept up to date (in which case before your patch the other files were in the erroneous state of having an Oracle copyright that was behind the Red Hat one). However, Andrew's comment does not seem to support my view. Perhaps he or someone else could clarify? At which point I will be happy to push your patch with whatever copyrights are appropriate. regards, Andrew Dinn ----------- Senior Principal Software Engineer Red Hat UK Ltd Registered in England and Wales under Company Registration No. 
03798903 Directors: Michael Cunningham, Michael ("Mike") O'Neill From martin.doerr at sap.com Mon Jan 20 11:10:53 2020 From: martin.doerr at sap.com (Doerr, Martin) Date: Mon, 20 Jan 2020 11:10:53 +0000 Subject: RFR [XXS]: 8237382: Cleanup the OPT_SPEED_SRC file list in JvmFeatures.gmk In-Reply-To: References: Message-ID: Hi Matthias, thanks for removing no longer existing files from the list. I guess the list will need further updates to become really useful, but your change looks good. Best regards, Martin > -----Original Message----- > From: hotspot-dev On Behalf Of > Erik Joelsson > Sent: Freitag, 17. Januar 2020 15:09 > To: Baesken, Matthias ; 'build- > dev at openjdk.java.net' ; 'hotspot- > dev at openjdk.java.net' > Subject: Re: RFR [XXS]: 8237382: Cleanup the OPT_SPEED_SRC file list in > JvmFeatures.gmk > > Looks good. > > /Erik > > On 2020-01-16 22:47, Baesken, Matthias wrote: > > Hello, please review this very small change . > > > > It removes file that are not present any more from the OPT_SPEED_SRC > file list in JvmFeatures.gmk . > > > > ( this is a list of files to be optimized for speed when we otherwise optimize > for size in the minimal-JVM build) > > > > > > Bug/webrev : > > > > https://bugs.openjdk.java.net/browse/JDK-8237382 > > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8237382.0/ > > > > Thanks, Matthias > > From aph at redhat.com Tue Jan 21 09:36:23 2020 From: aph at redhat.com (Andrew Haley) Date: Tue, 21 Jan 2020 09:36:23 +0000 Subject: [aarch64-port-dev ] RFR: 8237512: AArch64: aarch64TestHook leaks a BufferBlob In-Reply-To: References: <20ac5b8d-4e38-db8d-4579-ff4a742365ef@arm.com> <2e202576-00f9-8ca1-e52b-33968059e8a2@redhat.com> Message-ID: On 1/20/20 10:29 AM, Andrew Dinn wrote: > On 20/01/2020 10:07, Nick Gasson wrote: >> On 20/01/2020 17:54, Andrew Haley wrote: >>> >>> Please check the copyright dates: the RH copyright might want updating, >>> but the Oracle one probably doesn't. 
>>> >> >> I'm not sure what the rules are for this: I bumped the copyright year in >> the first line of each file I modified to 2020, as this seems to be what >> everyone else does. Is aarch64Test.cpp missing an "Oracle and/or its >> affiliates" copyright line? > > Well, I'm confused by this. As I understand it the Oracle copyright is > supposed to be up to date with whatever is the latest change while the > Red Hat copyright does not need to be kept up to date I have never seen this rule before. What is your source for it? But in any case, the Oracle copyright message here is because this file was originally derived from Oracle code; the code being patched is by us. > (in which case before your patch the other files were in the > erroneous state of having an Oracle copyright that was behind the > Red Hat one). However, Andrew's comment does not seem to support my > view. > > Perhaps he or someone else could clarify? At which point I will be happy > to push your patch with whatever copyrights are appropriate. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From matthias.baesken at sap.com Tue Jan 21 10:02:48 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Tue, 21 Jan 2020 10:02:48 +0000 Subject: RFR: 8237192: Generate stripped/public pdbs on Windows for jdk images In-Reply-To: <215dd116-d0f6-6e35-6b02-20d3986fa85a@oracle.com> References: <215dd116-d0f6-6e35-6b02-20d3986fa85a@oracle.com> Message-ID: Hi David , yes I think it makes sense to have a configure option for this . Not everyone would like to have a larger JDK (even if it is only a bit larger). Best regards, Matthias > > Hi Matthias, > > This also needs to be a configurable option not one done by default as > there can be non-technical issues relating to shipping symbol files in a > product.
> > Thanks, > David > > On 17/01/2020 6:44 pm, Baesken, Matthias wrote: > > Hello, please review this change related to stripped/"public" pdb file > generation on Windows . > > > > Currently the JDK bundle on Windows does not contain pdb files (full pdb > files are in a separate symbols bundle). > > This leads currently to bad native stack traces e.g. when crashes occur. > > One reason not to deliver the full pdb files might be the large size of these > files. > > > > However there exist also "public" or stripped pdb files on Windows, see : > > > > https://docs.microsoft.com/en-us/cpp/build/reference/pdbstripped-strip- > private-symbols?view=vs-2017 > > > > Those are much smaller (often only 10-20% of the full pdb files) and they > offer a good compromise (no "file:linenumber" info in the native stacks but > at least the function name+hex-offset is visible) > > to delivering full pdbs in the JDK. > > > > Example sizes for the currently built full pdbs / stripped pdbs from VS2017 > based 64bit build of jdk/jdk : > > jvm.pdb : 73,1 MB / 9,46 MB > > awt.pdb : 7,05 MB / 1,48 MB > > > > The patch adds generation of stripped pdb files to the Windows build. > > Additionally those files are put into the JDK bundle (while the symbols > bundle still gets the full pdb files ) . 
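For reference, the stripped PDB is produced by the MSVC linker's /PDBSTRIPPED option, which is what the Microsoft page linked above documents. A hypothetical invocation producing both variants might look like this (illustrative only, not the actual JDK build rule):

```text
REM Emit the full PDB for the symbols bundle and a stripped one for the JDK bundle.
link /DEBUG /PDB:jvm.pdb /PDBSTRIPPED:jvm.stripped.pdb jvm.obj ...
```

The stripped file keeps public symbols and function names but drops source-line and local-symbol information, which is why it comes out at a fraction of the full PDB's size while still allowing the function name+hex-offset stack traces described above.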
> > > > > > Bug/webrev : > > > > https://bugs.openjdk.java.net/browse/JDK-8237192 > > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8237192.0/ > > > > > > Thanks, Matthias > > From matthias.baesken at sap.com Tue Jan 21 14:45:47 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Tue, 21 Jan 2020 14:45:47 +0000 Subject: RFR [XS]: 8237374: configuring with --with-jvm-variants=minimal,server makes cds disappear in server In-Reply-To: <57f43f5a-ae37-5d98-e9e4-bb30c3607300@oracle.com> References: <57f43f5a-ae37-5d98-e9e4-bb30c3607300@oracle.com> Message-ID: Hi Erik, new webrev : http://cr.openjdk.java.net/~mbaesken/webrevs/8237374.1/ Best regards, Matthias -----Original Message----- From: Erik Joelsson Sent: Freitag, 17. Januar 2020 15:18 To: Baesken, Matthias ; 'build-dev at openjdk.java.net' ; 'hotspot-dev at openjdk.java.net' Subject: Re: RFR [XS]: 8237374: configuring with --with-jvm-variants=minimal,server makes cds disappear in server Hello Matthias, Using BUILDING_MULTIPLE_JVM_VARIANTS as condition is clever and happens to coincide with the set of variants that also support CDS, but I would say this correlation is incidental. I would still prefer an explicit test for if any of the variants that do support CDS is in the set of variants being built. This will make it much easier to read and understand the logic. Simply: if ! HOTSPOT_CHECK_JVM_VARIANT(server) && ! HOTSPOT_CHECK_JVM_VARIANT(client); then ENABLE_CDS="false" ... /Erik From daniel.daugherty at oracle.com Tue Jan 21 15:37:03 2020 From: daniel.daugherty at oracle.com (Daniel D.
Daugherty) Date: Tue, 21 Jan 2020 10:37:03 -0500 Subject: RFR: 8235966: Process obsolete flags less aggressively In-Reply-To: References: <674e4f0d-563f-3bb4-dee0-74aaacb9b270@oracle.com> <6a3de8da-746d-7e5d-9a0e-2157a0963ed8@oracle.com> <25156ff2-9dd2-53ae-16bc-83d60c0afb1a@oracle.com> Message-ID: <3d066cd2-87f3-56ef-803b-a1f194ac761e@oracle.com> On 1/19/20 8:49 PM, David Holmes wrote: > gtest version: > > http://cr.openjdk.java.net/~dholmes/8235966/webrev.v4/ src/hotspot/share/runtime/arguments.hpp ??? No comments. src/hotspot/share/runtime/arguments.cpp ??? L715: // check for stale flags when we hit build 20 (which is far enough into the 6 monthly ??????? typo: s/monthly/month/ test/hotspot/gtest/runtime/test_special_flags.cpp ??? Nice! Thumbs up. I don't need to see a new webrev to fix the typo. Dan > > Bug report updated to show gtest output. > > Thanks, > David > > On 20/01/2020 7:46 am, David Holmes wrote: >> Hi Dan, >> >> On 18/01/2020 1:37 am, Daniel D. Daugherty wrote: >>> On 1/16/20 12:57 AM, David Holmes wrote: >>>> Getting back to this ... >>> >>> You added this update to the bug report: >>> >>>> Update: after further discussion it has been proposed that we use >>>> the build number as the trigger for a whitebox or gtest that >>>> performs the currently disabled full verification of the flag >>>> table. So if a flag has not be obsoleted or expired as it should by >>>> build N** then we fail the gtest. This will complement the relaxing >>>> of the obsoletion check at the start of the release cycle. >>> >>> I was expecting a new test in this latest webrev that would start >>> failing >>> at Build 20... let me see what the latest webrev says... >> >> Mea culpa. When I started thinking about the test it was evident the >> test logic would just involve the existing commented out warning >> logic. So I thought lets just turn those warnings on at build 20 but ... 
>> >>>> >>>> Please see updated webrev at: >>>> >>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>> >>> src/hotspot/share/runtime/arguments.cpp >>> ???? No comments. >>> >>> So the changes are the same as the last round with the addition of >>> enabling the following at B20: >>> >>> L755: ????????? warning("Global variable for obsolete special flag >>> entry \"%s\" should be removed", flag.name); >>> L769: ????????? warning("Global variable for expired flag entry >>> \"%s\" should be removed", flag.name); >>> >>> >>>> Apologies as I mistakenly overwrote the original instead of >>>> creating v3. >>>> >>>> This version expands on the original proposal by uncommenting the >>>> warnings about obsolete/expired flags that have not been removed >>>> from globals*.hpp, so that we don't forget to do this work. However >>>> these warnings are only enabled from build 20. I used 20 as being >>>> approx 2/3 through the release cycle - long enough that the work >>>> should have been performed by then, whilst still leaving time to >>>> perform the work before RDP2. Of course we can tweak this number if >>>> there are issues with that choice. >>> >>> Okay... but doesn't this mean that every test would issue these >>> warnings >>> as of B20 if we have not completed the work? So we have the >>> potential of >>> a raft (some unknown number) of test failures due to unexpected >>> output in >>> the form of these warning messages. And worse, these failures would be >>> unrelated to actual issue... :-( >> >> ... as you note the end result is not really what we want - a single >> clearly failing test. >> >>> How about adding a diagnostic flag that enables these two warning >>> messages (in addition to the B20 check). Add a single test that runs: >>> >>> ?? 
java -XX:+UnlockDiagnosticVMOptions >>> -XX:+FlagShouldNotBeDefinedCheck -version >>> >>> and when we hit B20, if there are still flags that haven't been >>> removed, >>> then the test will fail and we'll have one test that fails (X the >>> number >>> of configs that run the test). >> >> I definitely do not want to add another VM flag :) but will look at >> how to (re)run that logic as part of a gtest. >> >> Thanks, >> David >> >>> Dan >>> >>> >>>> >>>> Thanks, >>>> David >>>> >>>> On 20/12/2019 8:20 am, David Holmes wrote: >>>>> Thanks Dan. >>>>> >>>>> FTR I've updated the bug report with an extension to this >>>>> proposal, which is to add back the flag table validation checks to >>>>> use via a gtest that we only enable after a certain build in the >>>>> release cycle (it always passes before then). That way we avoid >>>>> the problems I've outlined with the initial version bump but also >>>>> have a safety net in place to ensure we don't forget to actually >>>>> obsolete/expire flags. >>>>> >>>>> Cheers, >>>>> David >>>>> >>>>> On 19/12/2019 3:37 am, Daniel D. Daugherty wrote: >>>>>> Hi David, >>>>>> >>>>>> On 12/17/19 5:03 PM, David Holmes wrote: >>>>>>> Hi Dan, >>>>>>> >>>>>>> Thanks for taking a look. Updated webrev: >>>>>>> >>>>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v2/ >>>>>> >>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>> ???? I like the updates to header comment for >>>>>> verify_special_jvm_flags(). >>>>>> >>>>>> Thumbs up. >>>>>> >>>>>> >>>>>>> >>>>>>> Discussion below. >>>>>> >>>>>> Replies below. >>>>>> >>>>>> >>>>>>> >>>>>>> On 18/12/2019 1:47 am, Daniel D. Daugherty wrote: >>>>>>>> On 12/16/19 12:16 AM, David Holmes wrote: >>>>>>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8235966 >>>>>>>>> webrev: http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>>>>>>> >>>>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>>>> ???? L745: ????? 
// if flag has become obsolete it should not >>>>>>>> have a "globals" flag defined anymore. >>>>>>>> ???? L746: ????? if (!version_less_than(JDK_Version::current(), >>>>>>>> flag.obsolete_in)) { >>>>>>>> ???? L747: ??????? if (JVMFlag::find_declared_flag(flag.name) >>>>>>>> != NULL) { >>>>>>>> ???? L748: ????????? // Temporarily disable the warning: 8196739 >>>>>>>> ???? L749: ????????? // warning("Global variable for obsolete >>>>>>>> special flag entry \"%s\" should be removed", flag.name); >>>>>>>> ???? L750: ??????? } >>>>>>>> ???? L751: ????? } >>>>>>>> ???????? It seems like we've been down a similar road before: >>>>>>>> >>>>>>>> ???????? JDK-8196739 Disable obsolete/expired VM flag >>>>>>>> transitional warnings >>>>>>>> https://bugs.openjdk.java.net/browse/JDK-8196739 >>>>>>>> >>>>>>>> ???????? This one may ring a bell... Fixed by dholmes in >>>>>>>> jdk11-b01... :-) >>>>>>>> >>>>>>>> ???????? And this followup sub-task to re-enable that warning: >>>>>>>> >>>>>>>> ???????? JDK-8196741 Re-enable obsolete/expired VM flag >>>>>>>> transitional warnings >>>>>>>> https://bugs.openjdk.java.net/browse/JDK-8196741 >>>>>>>> >>>>>>>> ???????? was closed as "Won't fix" on 2019.08.02. >>>>>>>> >>>>>>>> ???????? So the obvious questions: >>>>>>>> >>>>>>>> ???????? - Why is the new warning less problematic to tests >>>>>>>> that don't >>>>>>>> ?????????? tolerate unexpected output? >>>>>>> >>>>>>> Two different situations. The commented out warning happens >>>>>>> unconditionally when you run the VM and it finds any flag marked >>>>>>> obsolete that hasn't been removed. Hence every single test will >>>>>>> encounter this warning. >>>>>> >>>>>> Ouch on such verbosity. >>>>>> >>>>>> >>>>>>> The situation I am modifying is when a test uses a flag that is >>>>>>> marked for obsoletion. In the majority of cases the flag is >>>>>>> already deprecated and so already issuing a deprecation warning >>>>>>> that the test has to handle. 
Without my change there would still >>>>>>> be an obsoletion warning, so this test is in for a warning no >>>>>>> matter what. >>>>>> >>>>>> Good that your change only comes into play when the flag is used. >>>>>> >>>>>> >>>>>>> Also note that for hotspot at least we have strived to make >>>>>>> tests tolerate unexpected output. The reason JDK-8196741 was >>>>>>> closed as "won't fix" was because other areas wouldn't commit to >>>>>>> doing that. >>>>>> >>>>>> Yup. Got it. >>>>>> >>>>>> >>>>>>> >>>>>>>> ???????? - If you move forward with this fix, then I think >>>>>>>> think code >>>>>>>> ?????????? block needs to be removed or modified or am I >>>>>>>> missing something? >>>>>>> >>>>>>> I've rewritten the comment at the head of >>>>>>> verify_special_jvm_flags to explain why we can't issue a >>>>>>> warning, and have deleted the block. >>>>>> >>>>>> Thanks for deleting the stale code. >>>>>> >>>>>> >>>>>>> >>>>>>>> ???????? There's a similar commented out check on L757-L765, >>>>>>>> but that one >>>>>>>> ???????? is for an expired flag... You might want to >>>>>>>> adjust/delete it also? >>>>>>> >>>>>>> Deleted. >>>>>> >>>>>> Thanks. >>>>>> >>>>>> >>>>>>> >>>>>>>> ???? L753: warning("Special flag entry \"%s\" must be >>>>>>>> explicitly obsoleted before expired.", flag.name); >>>>>>>> ???? L754: ??????? success = false; >>>>>>>> ???????? nit - s/before expired/before being expired/ >>>>>>>> ???????? Update: I now see that "style" is in several places in >>>>>>>> this >>>>>>>> ???????????? function. I'm not sure what to think here... it >>>>>>>> grates, >>>>>>>> ?? ? ? ? ? ? but I can live with it. >>>>>>>> >>>>>>>> ???????? nit - L75[34] indented too much by two spaces. >>>>>>> >>>>>>> Fixed. >>>>>>> >>>>>>>> ???? L962: ????????? return real_name; >>>>>>>> ???????? nit - indented too much by two spaces. >>>>>>> >>>>>>> Fixed. >>>>>>> >>>>>>>> >>>>>>>> Trying to understand the modified logic in argument processing is >>>>>>>> making my head spin... 
>>>>>>> >>>>>>> Mine too. It took a few attempts to put the logic in the right >>>>>>> place and make adjustments so that it all works as expected for >>>>>>> a correctly specified flag and an erroneous one. >>>>>> >>>>>> I keep trying to convince myself that we're improving this flag and >>>>>> options code with each release... :-) >>>>>> >>>>>> >>>>>>> >>>>>>>> - You've added a call to is_obsolete_flag() in >>>>>>>> ?? handle_aliases_and_deprecation() and is_obsolete_flag() >>>>>>>> ?? is where the new warning is output: >>>>>>>> >>>>>>>> ???? warning("Temporarily processing option %s; support is >>>>>>>> scheduled for removal in %s" >>>>>>>> >>>>>>>> ?? handle_aliases_and_deprecation() is called from six >>>>>>>> different places, >>>>>>>> ?? but the call sites are different based on the argument >>>>>>>> pattern so I >>>>>>>> ?? have (mostly) convinced myself that there should not be any >>>>>>>> duplicate >>>>>>>> ?? warning lines. >>>>>>> >>>>>>> Right - handle_aliases_and_deprecation is only called for a >>>>>>> syntactically correct flag based on those patterns. It normally >>>>>>> filters out obsoleted/expired flags and lets them fall through >>>>>>> to later error processing (in process_argument after parse_arg >>>>>>> returns false). That error processing is where the normal >>>>>>> obsoletion check is performed. So I had to not filter the flag >>>>>>> in handle_aliases_and_deprecation in this case, but still >>>>>>> produce the warning for a malformed flag. E.g. 
>>>>>>> >>>>>>> java -XX:+UseParallelOldGC -version >>>>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily >>>>>>> processing option UseParallelOldGC; support is scheduled for >>>>>>> removal in 15.0 >>>>>>> java version "15-internal" 2020-09-15 >>>>>>> >>>>>>> java -XX:UseParallelOldGC -version >>>>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily >>>>>>> processing option UseParallelOldGC; support is scheduled for >>>>>>> removal in 15.0 >>>>>>> Missing +/- setting for VM option 'UseParallelOldGC' >>>>>>> Error: Could not create the Java Virtual Machine. >>>>>> >>>>>> Thanks for the example. That helps a lot. >>>>>> >>>>>> >>>>>>> >>>>>>>> So I now understand the new logic that allows an obsoleted option >>>>>>>> to be specified with a warning as long as the option still exists. >>>>>>>> I'm good with the technical change, but... >>>>>>>> >>>>>>>> I'm worried about tests that don't tolerate the new warning mesg, >>>>>>>> i.e., why wouldn't this become an issue again: >>>>>>>> >>>>>>>> JDK-8196739 Disable obsolete/expired VM flag transitional warnings >>>>>>> >>>>>>> Explained above. >>>>>> >>>>>> Yup and thanks. >>>>>> >>>>>> Dan >>>>>> >>>>>> >>>>>>> >>>>>>> Thanks, >>>>>>> David >>>>>>> >>>>>>>> Dan >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>> When a flag is marked as obsolete in the special-flags table >>>>>>>>> we will ignore it and issue a warning that it is being >>>>>>>>> ignored, as soon as we bump the version of the JDK. That means >>>>>>>>> that any tests still using the obsolete flag may start to >>>>>>>>> fail, leading to a surge of test failures at the start of a >>>>>>>>> release cycle. For example for JDK 15 we have a whole bunch of >>>>>>>>> JFR tests that fail because they still try to work with >>>>>>>>> UseParallelOldGC. In another case >>>>>>>>> runtime/cds/appcds/FieldLayoutFlags.java passes only be accident. 
>>>>>>>>> >>>>>>>>> When a flag is marked as obsolete for a release, all code >>>>>>>>> involving that flag (including tests that use it) must be >>>>>>>>> updated within that release and the flag itself removed. >>>>>>>>> Whilst this is typically scheduled early in a release cycle it >>>>>>>>> isn't reasonable to expect it to all occur within the first >>>>>>>>> couple of days of the release cycle, nor do we want to have to >>>>>>>>> ProblemList a bunch of tests when they start failing. >>>>>>>>> >>>>>>>>> What I propose is to instead allow an obsolete flag to >>>>>>>>> continue to be processed as long as that code removal has not >>>>>>>>> actually occurred - with an adjusted warning. The change I >>>>>>>>> propose: >>>>>>>>> >>>>>>>>> - only treats an obsolete flag as obsolete if the flag cannot >>>>>>>>> be found >>>>>>>>> - added a new flag verification rule that disallows obsoletion >>>>>>>>> in an undefined version, but expiration in a specific version >>>>>>>>> i.e. we must always explicitly obsolete a flag before we >>>>>>>>> expire it. >>>>>>>>> >>>>>>>>> The only downside here is that if we actually forget to file >>>>>>>>> an issue for the actual obsoletion work we may not notice via >>>>>>>>> testing. Of course whenever a change is made to the flags >>>>>>>>> table to add an entry then the issue to do the obsoletion >>>>>>>>> should be filed at the same time. >>>>>>>>> >>>>>>>>> Thanks, >>>>>>>>> David >>>>>>>>> ----- >>>>>>>>> >>>>>>>> >>>>>> >>> From erik.joelsson at oracle.com Tue Jan 21 16:35:20 2020 From: erik.joelsson at oracle.com (Erik Joelsson) Date: Tue, 21 Jan 2020 08:35:20 -0800 Subject: RFR [XS]: 8237374: configuring with --with-jvm-variants=minimal,server makes cds disappear in server In-Reply-To: References: <57f43f5a-ae37-5d98-e9e4-bb30c3607300@oracle.com> Message-ID: <2f32b930-79b2-d043-faf3-ba3975b72af1@oracle.com> That looks good. 
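In the 8237374 thread here, Erik's preference (quoted below) is an explicit test for whether any CDS-capable variant is among the variants being built, rather than relying on BUILDING_MULTIPLE_JVM_VARIANTS. A plain-shell sketch of that check, where `check_jvm_variant` is a hypothetical stand-in for the autoconf macro HOTSPOT_CHECK_JVM_VARIANT:

```shell
# Enable CDS only if at least one CDS-capable variant (server or client)
# is among the comma-separated list of variants being built.
check_jvm_variant() {
  variant="$1"
  case ",$JVM_VARIANTS," in
    *",$variant,"*) return 0 ;;
    *) return 1 ;;
  esac
}

compute_enable_cds() {
  if ! check_jvm_variant server && ! check_jvm_variant client; then
    ENABLE_CDS="false"
  else
    ENABLE_CDS="true"
  fi
}
```

For `--with-jvm-variants=minimal,server` this keeps CDS enabled for the server variant, which is the behavior the fix restores.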
/Erik On 2020-01-21 06:45, Baesken, Matthias wrote: > Hi Erik, new webrev : > > http://cr.openjdk.java.net/~mbaesken/webrevs/8237374.1/ > > Best regards, Matthias > > -----Original Message----- > From: Erik Joelsson > Sent: Freitag, 17. Januar 2020 15:18 > To: Baesken, Matthias ; 'build-dev at openjdk.java.net' ; 'hotspot-dev at openjdk.java.net' > Subject: Re: RFR [XS]: 8237374: configuring with --with-jvm-variants=minimal,server makes cds disappear in server > > Hello Matthias, > > Using BUILDING_MULTIPLE_JVM_VARIANTS as condition is clever and happens > to coincide with the set of variants that also support CDS, but I would > say this correlation is incidental. I would still prefer an explicit > test for if any of the variants that do support CDS is in the set of > variants being built. This will make it much easier to read and > understand the logic. Simply: > > if ! HOTSPOT_CHECK_JVM_VARIANT(server) && ! > HOTSPOT_CHECK_JVM_VARIANT(client); then > ? ENABLE_CDS="false" > ? ... > > /Erik > > From aph at redhat.com Tue Jan 21 17:12:10 2020 From: aph at redhat.com (Andrew Haley) Date: Tue, 21 Jan 2020 17:12:10 +0000 Subject: 8230392: Define AArch64 as MULTI_COPY_ATOMIC Message-ID: http://cr.openjdk.java.net/~aph/8230392/ I tested this in a bunch of ways, including bootstrapping and jtreg, but I'm not sure anything really stress tests this. I expect it'll be fine. OK to commit? -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. 
https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From david.holmes at oracle.com Tue Jan 21 22:58:21 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 22 Jan 2020 08:58:21 +1000 Subject: 8230392: Define AArch64 as MULTI_COPY_ATOMIC In-Reply-To: References: Message-ID: <053adeb2-5f00-5cf6-c96e-97d46d37f5a3@oracle.com> Andrew, On 22/01/2020 3:12 am, Andrew Haley wrote: > http://cr.openjdk.java.net/~aph/8230392/ > > I tested this in a bunch of ways, including bootstrapping and jtreg, but > I'm not sure anything really stress tests this. I expect it'll be fine. > > OK to commit? You obviously missed my response on the other email thread regarding the taskQueue code. > "acquire" isn't used to order loads it is used to pair with a "release" associated with the store of the variable now being loaded. > > If this is the code referred to: > > Age oldAge = _age.get(); > // Architectures with weak memory model require a barrier here > // to guarantee that bottom is not older than age, > // which is crucial for the correctness of the algorithm. > #ifndef CPU_MULTI_COPY_ATOMIC > OrderAccess::fence(); > #endif > uint localBot = Atomic::load_acquire(&_bottom); > > then I think there is an assumption (perhaps incorrect) that the load_acquire will prevent reordering as well as performing the necessary "acquire" semantics. If the load_acquire doesn't prevent reordering then surely a loadload() barrier is what is needed. Cheers, David ----- From david.holmes at oracle.com Tue Jan 21 23:10:37 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 22 Jan 2020 09:10:37 +1000 Subject: RFR: 8235966: Process obsolete flags less aggressively In-Reply-To: <3d066cd2-87f3-56ef-803b-a1f194ac761e@oracle.com> References: <674e4f0d-563f-3bb4-dee0-74aaacb9b270@oracle.com> <6a3de8da-746d-7e5d-9a0e-2157a0963ed8@oracle.com> <25156ff2-9dd2-53ae-16bc-83d60c0afb1a@oracle.com> <3d066cd2-87f3-56ef-803b-a1f194ac761e@oracle.com> Message-ID: Thanks Dan. 
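The taskQueue question above is whether the load_acquire alone orders the earlier load of _age, or whether a separate barrier is needed. In portable C++11 terms the two are distinct tools (a sketch only; HotSpot uses its own OrderAccess/Atomic primitives, and the seq_cst fence below is a stand-in for OrderAccess::fence()):

```cpp
#include <atomic>

std::atomic<int> age{0};
std::atomic<int> bottom{0};

// An acquire load pairs with a release store to the *same* variable:
// nothing after the load may be reordered before it.
int load_bottom_acquire() {
  return bottom.load(std::memory_order_acquire);
}

// But acquire does not stop the *earlier* load of 'age' from being
// reordered after it; for that the quoted taskQueue code inserts a full
// fence between the two loads when CPU_MULTI_COPY_ATOMIC is not defined.
int load_bottom_after_age(int* age_out) {
  int a = age.load(std::memory_order_relaxed);
  std::atomic_thread_fence(std::memory_order_seq_cst);  // stand-in for fence()
  int b = bottom.load(std::memory_order_acquire);
  *age_out = a;
  return b;
}
```

A loadload-style barrier between the two loads, as David suggests, would express the ordering requirement directly rather than relying on the acquire semantics of the second load.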
Will fix the "typo" - though not actually a typo as it was a deliberate (mis)use of "monthly". I have always quantified "monthly" this way, but seems it is not correct. :) Cheers, David On 22/01/2020 1:37 am, Daniel D. Daugherty wrote: > On 1/19/20 8:49 PM, David Holmes wrote: >> gtest version: >> >> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v4/ > > src/hotspot/share/runtime/arguments.hpp > ??? No comments. > > src/hotspot/share/runtime/arguments.cpp > ??? L715: // check for stale flags when we hit build 20 (which is far > enough into the 6 monthly > ??????? typo: s/monthly/month/ > > test/hotspot/gtest/runtime/test_special_flags.cpp > ??? Nice! > > Thumbs up. I don't need to see a new webrev to fix the typo. > > Dan > > >> >> Bug report updated to show gtest output. >> >> Thanks, >> David >> >> On 20/01/2020 7:46 am, David Holmes wrote: >>> Hi Dan, >>> >>> On 18/01/2020 1:37 am, Daniel D. Daugherty wrote: >>>> On 1/16/20 12:57 AM, David Holmes wrote: >>>>> Getting back to this ... >>>> >>>> You added this update to the bug report: >>>> >>>>> Update: after further discussion it has been proposed that we use >>>>> the build number as the trigger for a whitebox or gtest that >>>>> performs the currently disabled full verification of the flag >>>>> table. So if a flag has not be obsoleted or expired as it should by >>>>> build N** then we fail the gtest. This will complement the relaxing >>>>> of the obsoletion check at the start of the release cycle. >>>> >>>> I was expecting a new test in this latest webrev that would start >>>> failing >>>> at Build 20... let me see what the latest webrev says... >>> >>> Mea culpa. When I started thinking about the test it was evident the >>> test logic would just involve the existing commented out warning >>> logic. So I thought lets just turn those warnings on at build 20 but ... 
>>> >>>>> >>>>> Please see updated webrev at: >>>>> >>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>>> >>>> src/hotspot/share/runtime/arguments.cpp >>>> ???? No comments. >>>> >>>> So the changes are the same as the last round with the addition of >>>> enabling the following at B20: >>>> >>>> L755: ????????? warning("Global variable for obsolete special flag >>>> entry \"%s\" should be removed", flag.name); >>>> L769: ????????? warning("Global variable for expired flag entry >>>> \"%s\" should be removed", flag.name); >>>> >>>> >>>>> Apologies as I mistakenly overwrote the original instead of >>>>> creating v3. >>>>> >>>>> This version expands on the original proposal by uncommenting the >>>>> warnings about obsolete/expired flags that have not been removed >>>>> from globals*.hpp, so that we don't forget to do this work. However >>>>> these warnings are only enabled from build 20. I used 20 as being >>>>> approx 2/3 through the release cycle - long enough that the work >>>>> should have been performed by then, whilst still leaving time to >>>>> perform the work before RDP2. Of course we can tweak this number if >>>>> there are issues with that choice. >>>> >>>> Okay... but doesn't this mean that every test would issue these >>>> warnings >>>> as of B20 if we have not completed the work? So we have the >>>> potential of >>>> a raft (some unknown number) of test failures due to unexpected >>>> output in >>>> the form of these warning messages. And worse, these failures would be >>>> unrelated to actual issue... :-( >>> >>> ... as you note the end result is not really what we want - a single >>> clearly failing test. >>> >>>> How about adding a diagnostic flag that enables these two warning >>>> messages (in addition to the B20 check). Add a single test that runs: >>>> >>>> ?? 
java -XX:+UnlockDiagnosticVMOptions >>>> -XX:+FlagShouldNotBeDefinedCheck -version >>>> >>>> and when we hit B20, if there are still flags that haven't been >>>> removed, >>>> then the test will fail and we'll have one test that fails (X the >>>> number >>>> of configs that run the test). >>> >>> I definitely do not want to add another VM flag :) but will look at >>> how to (re)run that logic as part of a gtest. >>> >>> Thanks, >>> David >>> >>>> Dan >>>> >>>> >>>>> >>>>> Thanks, >>>>> David >>>>> >>>>> On 20/12/2019 8:20 am, David Holmes wrote: >>>>>> Thanks Dan. >>>>>> >>>>>> FTR I've updated the bug report with an extension to this >>>>>> proposal, which is to add back the flag table validation checks to >>>>>> use via a gtest that we only enable after a certain build in the >>>>>> release cycle (it always passes before then). That way we avoid >>>>>> the problems I've outlined with the initial version bump but also >>>>>> have a safety net in place to ensure we don't forget to actually >>>>>> obsolete/expire flags. >>>>>> >>>>>> Cheers, >>>>>> David >>>>>> >>>>>> On 19/12/2019 3:37 am, Daniel D. Daugherty wrote: >>>>>>> Hi David, >>>>>>> >>>>>>> On 12/17/19 5:03 PM, David Holmes wrote: >>>>>>>> Hi Dan, >>>>>>>> >>>>>>>> Thanks for taking a look. Updated webrev: >>>>>>>> >>>>>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v2/ >>>>>>> >>>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>>> ???? I like the updates to header comment for >>>>>>> verify_special_jvm_flags(). >>>>>>> >>>>>>> Thumbs up. >>>>>>> >>>>>>> >>>>>>>> >>>>>>>> Discussion below. >>>>>>> >>>>>>> Replies below. >>>>>>> >>>>>>> >>>>>>>> >>>>>>>> On 18/12/2019 1:47 am, Daniel D. Daugherty wrote: >>>>>>>>> On 12/16/19 12:16 AM, David Holmes wrote: >>>>>>>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8235966 >>>>>>>>>> webrev: http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>>>>>>>> >>>>>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>>>>> ???? L745: ????? 
// if flag has become obsolete it should not >>>>>>>>> have a "globals" flag defined anymore. >>>>>>>>> ???? L746: ????? if (!version_less_than(JDK_Version::current(), >>>>>>>>> flag.obsolete_in)) { >>>>>>>>> ???? L747: ??????? if (JVMFlag::find_declared_flag(flag.name) >>>>>>>>> != NULL) { >>>>>>>>> ???? L748: ????????? // Temporarily disable the warning: 8196739 >>>>>>>>> ???? L749: ????????? // warning("Global variable for obsolete >>>>>>>>> special flag entry \"%s\" should be removed", flag.name); >>>>>>>>> ???? L750: ??????? } >>>>>>>>> ???? L751: ????? } >>>>>>>>> ???????? It seems like we've been down a similar road before: >>>>>>>>> >>>>>>>>> ???????? JDK-8196739 Disable obsolete/expired VM flag >>>>>>>>> transitional warnings >>>>>>>>> https://bugs.openjdk.java.net/browse/JDK-8196739 >>>>>>>>> >>>>>>>>> ???????? This one may ring a bell... Fixed by dholmes in >>>>>>>>> jdk11-b01... :-) >>>>>>>>> >>>>>>>>> ???????? And this followup sub-task to re-enable that warning: >>>>>>>>> >>>>>>>>> ???????? JDK-8196741 Re-enable obsolete/expired VM flag >>>>>>>>> transitional warnings >>>>>>>>> https://bugs.openjdk.java.net/browse/JDK-8196741 >>>>>>>>> >>>>>>>>> ???????? was closed as "Won't fix" on 2019.08.02. >>>>>>>>> >>>>>>>>> ???????? So the obvious questions: >>>>>>>>> >>>>>>>>> ???????? - Why is the new warning less problematic to tests >>>>>>>>> that don't >>>>>>>>> ?????????? tolerate unexpected output? >>>>>>>> >>>>>>>> Two different situations. The commented out warning happens >>>>>>>> unconditionally when you run the VM and it finds any flag marked >>>>>>>> obsolete that hasn't been removed. Hence every single test will >>>>>>>> encounter this warning. >>>>>>> >>>>>>> Ouch on such verbosity. >>>>>>> >>>>>>> >>>>>>>> The situation I am modifying is when a test uses a flag that is >>>>>>>> marked for obsoletion. In the majority of cases the flag is >>>>>>>> already deprecated and so already issuing a deprecation warning >>>>>>>> that the test has to handle. 
Without my change there would still >>>>>>>> be an obsoletion warning, so this test is in for a warning no >>>>>>>> matter what. >>>>>>> >>>>>>> Good that your change only comes into play when the flag is used. >>>>>>> >>>>>>> >>>>>>>> Also note that for hotspot at least we have strived to make >>>>>>>> tests tolerate unexpected output. The reason JDK-8196741 was >>>>>>>> closed as "won't fix" was because other areas wouldn't commit to >>>>>>>> doing that. >>>>>>> >>>>>>> Yup. Got it. >>>>>>> >>>>>>> >>>>>>>> >>>>>>>>> ???????? - If you move forward with this fix, then I think >>>>>>>>> think code >>>>>>>>> ?????????? block needs to be removed or modified or am I >>>>>>>>> missing something? >>>>>>>> >>>>>>>> I've rewritten the comment at the head of >>>>>>>> verify_special_jvm_flags to explain why we can't issue a >>>>>>>> warning, and have deleted the block. >>>>>>> >>>>>>> Thanks for deleting the stale code. >>>>>>> >>>>>>> >>>>>>>> >>>>>>>>> ???????? There's a similar commented out check on L757-L765, >>>>>>>>> but that one >>>>>>>>> ???????? is for an expired flag... You might want to >>>>>>>>> adjust/delete it also? >>>>>>>> >>>>>>>> Deleted. >>>>>>> >>>>>>> Thanks. >>>>>>> >>>>>>> >>>>>>>> >>>>>>>>> ???? L753: warning("Special flag entry \"%s\" must be >>>>>>>>> explicitly obsoleted before expired.", flag.name); >>>>>>>>> ???? L754: ??????? success = false; >>>>>>>>> ???????? nit - s/before expired/before being expired/ >>>>>>>>> ???????? Update: I now see that "style" is in several places in >>>>>>>>> this >>>>>>>>> ???????????? function. I'm not sure what to think here... it >>>>>>>>> grates, >>>>>>>>> ?? ? ? ? ? ? but I can live with it. >>>>>>>>> >>>>>>>>> ???????? nit - L75[34] indented too much by two spaces. >>>>>>>> >>>>>>>> Fixed. >>>>>>>> >>>>>>>>> ???? L962: ????????? return real_name; >>>>>>>>> ???????? nit - indented too much by two spaces. >>>>>>>> >>>>>>>> Fixed. 
>>>>>>>> >>>>>>>>> >>>>>>>>> Trying to understand the modified logic in argument processing is >>>>>>>>> making my head spin... >>>>>>>> >>>>>>>> Mine too. It took a few attempts to put the logic in the right >>>>>>>> place and make adjustments so that it all works as expected for >>>>>>>> a correctly specified flag and an erroneous one. >>>>>>> >>>>>>> I keep trying to convince myself that we're improving this flag and >>>>>>> options code with each release... :-) >>>>>>> >>>>>>> >>>>>>>> >>>>>>>>> - You've added a call to is_obsolete_flag() in >>>>>>>>> ?? handle_aliases_and_deprecation() and is_obsolete_flag() >>>>>>>>> ?? is where the new warning is output: >>>>>>>>> >>>>>>>>> ???? warning("Temporarily processing option %s; support is >>>>>>>>> scheduled for removal in %s" >>>>>>>>> >>>>>>>>> ?? handle_aliases_and_deprecation() is called from six >>>>>>>>> different places, >>>>>>>>> ?? but the call sites are different based on the argument >>>>>>>>> pattern so I >>>>>>>>> ?? have (mostly) convinced myself that there should not be any >>>>>>>>> duplicate >>>>>>>>> ?? warning lines. >>>>>>>> >>>>>>>> Right - handle_aliases_and_deprecation is only called for a >>>>>>>> syntactically correct flag based on those patterns. It normally >>>>>>>> filters out obsoleted/expired flags and lets them fall through >>>>>>>> to later error processing (in process_argument after parse_arg >>>>>>>> returns false). That error processing is where the normal >>>>>>>> obsoletion check is performed. So I had to not filter the flag >>>>>>>> in handle_aliases_and_deprecation in this case, but still >>>>>>>> produce the warning for a malformed flag. E.g. 
>>>>>>>> >>>>>>>> java -XX:+UseParallelOldGC -version >>>>>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily >>>>>>>> processing option UseParallelOldGC; support is scheduled for >>>>>>>> removal in 15.0 >>>>>>>> java version "15-internal" 2020-09-15 >>>>>>>> >>>>>>>> java -XX:UseParallelOldGC -version >>>>>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily >>>>>>>> processing option UseParallelOldGC; support is scheduled for >>>>>>>> removal in 15.0 >>>>>>>> Missing +/- setting for VM option 'UseParallelOldGC' >>>>>>>> Error: Could not create the Java Virtual Machine. >>>>>>> >>>>>>> Thanks for the example. That helps a lot. >>>>>>> >>>>>>> >>>>>>>> >>>>>>>>> So I now understand the new logic that allows an obsoleted option >>>>>>>>> to be specified with a warning as long as the option still exists. >>>>>>>>> I'm good with the technical change, but... >>>>>>>>> >>>>>>>>> I'm worried about tests that don't tolerate the new warning mesg, >>>>>>>>> i.e., why wouldn't this become an issue again: >>>>>>>>> >>>>>>>>> JDK-8196739 Disable obsolete/expired VM flag transitional warnings >>>>>>>> >>>>>>>> Explained above. >>>>>>> >>>>>>> Yup and thanks. >>>>>>> >>>>>>> Dan >>>>>>> >>>>>>> >>>>>>>> >>>>>>>> Thanks, >>>>>>>> David >>>>>>>> >>>>>>>>> Dan >>>>>>>>> >>>>>>>>> >>>>>>>>>> >>>>>>>>>> When a flag is marked as obsolete in the special-flags table >>>>>>>>>> we will ignore it and issue a warning that it is being >>>>>>>>>> ignored, as soon as we bump the version of the JDK. That means >>>>>>>>>> that any tests still using the obsolete flag may start to >>>>>>>>>> fail, leading to a surge of test failures at the start of a >>>>>>>>>> release cycle. For example for JDK 15 we have a whole bunch of >>>>>>>>>> JFR tests that fail because they still try to work with >>>>>>>>>> UseParallelOldGC. In another case >>>>>>>>>> runtime/cds/appcds/FieldLayoutFlags.java passes only be accident. 
>>>>>>>>>> >>>>>>>>>> When a flag is marked as obsolete for a release, all code >>>>>>>>>> involving that flag (including tests that use it) must be >>>>>>>>>> updated within that release and the flag itself removed. >>>>>>>>>> Whilst this is typically scheduled early in a release cycle it >>>>>>>>>> isn't reasonable to expect it to all occur within the first >>>>>>>>>> couple of days of the release cycle, nor do we want to have to >>>>>>>>>> ProblemList a bunch of tests when they start failing. >>>>>>>>>> >>>>>>>>>> What I propose is to instead allow an obsolete flag to >>>>>>>>>> continue to be processed as long as that code removal has not >>>>>>>>>> actually occurred - with an adjusted warning. The change I >>>>>>>>>> propose: >>>>>>>>>> >>>>>>>>>> - only treats an obsolete flag as obsolete if the flag cannot >>>>>>>>>> be found >>>>>>>>>> - added a new flag verification rule that disallows obsoletion >>>>>>>>>> in an undefined version, but expiration in a specific version >>>>>>>>>> i.e. we must always explicitly obsolete a flag before we >>>>>>>>>> expire it. >>>>>>>>>> >>>>>>>>>> The only downside here is that if we actually forget to file >>>>>>>>>> an issue for the actual obsoletion work we may not notice via >>>>>>>>>> testing. Of course whenever a change is made to the flags >>>>>>>>>> table to add an entry then the issue to do the obsoletion >>>>>>>>>> should be filed at the same time. 
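The two checks in the proposal above — a flag must be explicitly obsoleted before it can expire, and a lingering globals definition for an already-obsolete flag should eventually be flagged — can be sketched like this (hypothetical types and a hypothetical build-20 cutoff mirroring the gtest discussion; the real code is verify_special_jvm_flags in arguments.cpp):

```cpp
#include <cassert>

// 0 means "undefined" for a version field in this sketch.
struct SpecialFlagEntry {
  const char* name;
  int obsolete_in;     // release in which the flag becomes obsolete (0 = never)
  int expired_in;      // release in which the flag expires (0 = never)
  bool still_defined;  // does a globals.hpp definition still exist?
};

// New verification rule from the proposal: expiring a flag that was never
// explicitly obsoleted is a table error.
static bool entry_is_well_formed(const SpecialFlagEntry& e) {
  if (e.expired_in != 0 && e.obsolete_in == 0) {
    return false;  // must obsolete before expire
  }
  return true;
}

// Stale-definition check: lenient early in the release cycle, strict from
// (hypothetically) build 20 on, as in the gtest version of the fix.
static bool entry_is_stale(const SpecialFlagEntry& e, int current_release,
                           int current_build) {
  const int kStrictFromBuild = 20;
  if (current_build < kStrictFromBuild) {
    return false;
  }
  bool obsolete_now = e.obsolete_in != 0 && current_release >= e.obsolete_in;
  return obsolete_now && e.still_defined;
}
```

Running the stale check from a gtest, rather than at every VM startup, gives the single clearly failing test the thread settles on instead of a warning in every test's output.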
>>>>>>>>>> >>>>>>>>>> Thanks, >>>>>>>>>> David >>>>>>>>>> ----- >>>>>>>>>> >>>>>>>>> >>>>>>> >>>> > From david.holmes at oracle.com Wed Jan 22 02:06:23 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 22 Jan 2020 12:06:23 +1000 Subject: RFR: 8235966: Process obsolete flags less aggressively In-Reply-To: References: <674e4f0d-563f-3bb4-dee0-74aaacb9b270@oracle.com> <6a3de8da-746d-7e5d-9a0e-2157a0963ed8@oracle.com> <25156ff2-9dd2-53ae-16bc-83d60c0afb1a@oracle.com> <3d066cd2-87f3-56ef-803b-a1f194ac761e@oracle.com> Message-ID: <311314c6-7e70-49be-41de-23ae3fa7f23e@oracle.com> Can I get a second review please? Thanks, David On 22/01/2020 9:10 am, David Holmes wrote: > Thanks Dan. > > Will fix the "typo" - though not actually a typo as it was a deliberate > (mis)use of "monthly". I have always quantified "monthly" this way, but > seems it is not correct. :) > > Cheers, > David > > On 22/01/2020 1:37 am, Daniel D. Daugherty wrote: >> On 1/19/20 8:49 PM, David Holmes wrote: >>> gtest version: >>> >>> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v4/ >> >> src/hotspot/share/runtime/arguments.hpp >> ???? No comments. >> >> src/hotspot/share/runtime/arguments.cpp >> ???? L715: // check for stale flags when we hit build 20 (which is far >> enough into the 6 monthly >> ???????? typo: s/monthly/month/ >> >> test/hotspot/gtest/runtime/test_special_flags.cpp >> ???? Nice! >> >> Thumbs up. I don't need to see a new webrev to fix the typo. >> >> Dan >> >> >>> >>> Bug report updated to show gtest output. >>> >>> Thanks, >>> David >>> >>> On 20/01/2020 7:46 am, David Holmes wrote: >>>> Hi Dan, >>>> >>>> On 18/01/2020 1:37 am, Daniel D. Daugherty wrote: >>>>> On 1/16/20 12:57 AM, David Holmes wrote: >>>>>> Getting back to this ... 
>>>>> >>>>> You added this update to the bug report: >>>>> >>>>>> Update: after further discussion it has been proposed that we use >>>>>> the build number as the trigger for a whitebox or gtest that >>>>>> performs the currently disabled full verification of the flag >>>>>> table. So if a flag has not be obsoleted or expired as it should >>>>>> by build N** then we fail the gtest. This will complement the >>>>>> relaxing of the obsoletion check at the start of the release cycle. >>>>> >>>>> I was expecting a new test in this latest webrev that would start >>>>> failing >>>>> at Build 20... let me see what the latest webrev says... >>>> >>>> Mea culpa. When I started thinking about the test it was evident the >>>> test logic would just involve the existing commented out warning >>>> logic. So I thought lets just turn those warnings on at build 20 but >>>> ... >>>> >>>>>> >>>>>> Please see updated webrev at: >>>>>> >>>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>>>> >>>>> src/hotspot/share/runtime/arguments.cpp >>>>> ???? No comments. >>>>> >>>>> So the changes are the same as the last round with the addition of >>>>> enabling the following at B20: >>>>> >>>>> L755: ????????? warning("Global variable for obsolete special flag >>>>> entry \"%s\" should be removed", flag.name); >>>>> L769: ????????? warning("Global variable for expired flag entry >>>>> \"%s\" should be removed", flag.name); >>>>> >>>>> >>>>>> Apologies as I mistakenly overwrote the original instead of >>>>>> creating v3. >>>>>> >>>>>> This version expands on the original proposal by uncommenting the >>>>>> warnings about obsolete/expired flags that have not been removed >>>>>> from globals*.hpp, so that we don't forget to do this work. >>>>>> However these warnings are only enabled from build 20. 
I used 20 >>>>>> as being approx 2/3 through the release cycle - long enough that >>>>>> the work should have been performed by then, whilst still leaving >>>>>> time to perform the work before RDP2. Of course we can tweak this >>>>>> number if there are issues with that choice. >>>>> >>>>> Okay... but doesn't this mean that every test would issue these >>>>> warnings >>>>> as of B20 if we have not completed the work? So we have the >>>>> potential of >>>>> a raft (some unknown number) of test failures due to unexpected >>>>> output in >>>>> the form of these warning messages. And worse, these failures would be >>>>> unrelated to actual issue... :-( >>>> >>>> ... as you note the end result is not really what we want - a single >>>> clearly failing test. >>>> >>>>> How about adding a diagnostic flag that enables these two warning >>>>> messages (in addition to the B20 check). Add a single test that runs: >>>>> >>>>> ?? java -XX:+UnlockDiagnosticVMOptions >>>>> -XX:+FlagShouldNotBeDefinedCheck -version >>>>> >>>>> and when we hit B20, if there are still flags that haven't been >>>>> removed, >>>>> then the test will fail and we'll have one test that fails (X the >>>>> number >>>>> of configs that run the test). >>>> >>>> I definitely do not want to add another VM flag :) but will look at >>>> how to (re)run that logic as part of a gtest. >>>> >>>> Thanks, >>>> David >>>> >>>>> Dan >>>>> >>>>> >>>>>> >>>>>> Thanks, >>>>>> David >>>>>> >>>>>> On 20/12/2019 8:20 am, David Holmes wrote: >>>>>>> Thanks Dan. >>>>>>> >>>>>>> FTR I've updated the bug report with an extension to this >>>>>>> proposal, which is to add back the flag table validation checks >>>>>>> to use via a gtest that we only enable after a certain build in >>>>>>> the release cycle (it always passes before then). That way we >>>>>>> avoid the problems I've outlined with the initial version bump >>>>>>> but also have a safety net in place to ensure we don't forget to >>>>>>> actually obsolete/expire flags. 
>>>>>>> >>>>>>> Cheers, >>>>>>> David >>>>>>> >>>>>>> On 19/12/2019 3:37 am, Daniel D. Daugherty wrote: >>>>>>>> Hi David, >>>>>>>> >>>>>>>> On 12/17/19 5:03 PM, David Holmes wrote: >>>>>>>>> Hi Dan, >>>>>>>>> >>>>>>>>> Thanks for taking a look. Updated webrev: >>>>>>>>> >>>>>>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v2/ >>>>>>>> >>>>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>>>> ???? I like the updates to header comment for >>>>>>>> verify_special_jvm_flags(). >>>>>>>> >>>>>>>> Thumbs up. >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>> Discussion below. >>>>>>>> >>>>>>>> Replies below. >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>> On 18/12/2019 1:47 am, Daniel D. Daugherty wrote: >>>>>>>>>> On 12/16/19 12:16 AM, David Holmes wrote: >>>>>>>>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8235966 >>>>>>>>>>> webrev: http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>>>>>>>>> >>>>>>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>>>>>> ???? L745: ????? // if flag has become obsolete it should not >>>>>>>>>> have a "globals" flag defined anymore. >>>>>>>>>> ???? L746: ????? if >>>>>>>>>> (!version_less_than(JDK_Version::current(), flag.obsolete_in)) { >>>>>>>>>> ???? L747: ??????? if (JVMFlag::find_declared_flag(flag.name) >>>>>>>>>> != NULL) { >>>>>>>>>> ???? L748: ????????? // Temporarily disable the warning: 8196739 >>>>>>>>>> ???? L749: ????????? // warning("Global variable for obsolete >>>>>>>>>> special flag entry \"%s\" should be removed", flag.name); >>>>>>>>>> ???? L750: ??????? } >>>>>>>>>> ???? L751: ????? } >>>>>>>>>> ???????? It seems like we've been down a similar road before: >>>>>>>>>> >>>>>>>>>> ???????? JDK-8196739 Disable obsolete/expired VM flag >>>>>>>>>> transitional warnings >>>>>>>>>> https://bugs.openjdk.java.net/browse/JDK-8196739 >>>>>>>>>> >>>>>>>>>> ???????? This one may ring a bell... Fixed by dholmes in >>>>>>>>>> jdk11-b01... :-) >>>>>>>>>> >>>>>>>>>> ???????? 
And this followup sub-task to re-enable that warning: >>>>>>>>>> >>>>>>>>>> ???????? JDK-8196741 Re-enable obsolete/expired VM flag >>>>>>>>>> transitional warnings >>>>>>>>>> https://bugs.openjdk.java.net/browse/JDK-8196741 >>>>>>>>>> >>>>>>>>>> ???????? was closed as "Won't fix" on 2019.08.02. >>>>>>>>>> >>>>>>>>>> ???????? So the obvious questions: >>>>>>>>>> >>>>>>>>>> ???????? - Why is the new warning less problematic to tests >>>>>>>>>> that don't >>>>>>>>>> ?????????? tolerate unexpected output? >>>>>>>>> >>>>>>>>> Two different situations. The commented out warning happens >>>>>>>>> unconditionally when you run the VM and it finds any flag >>>>>>>>> marked obsolete that hasn't been removed. Hence every single >>>>>>>>> test will encounter this warning. >>>>>>>> >>>>>>>> Ouch on such verbosity. >>>>>>>> >>>>>>>> >>>>>>>>> The situation I am modifying is when a test uses a flag that is >>>>>>>>> marked for obsoletion. In the majority of cases the flag is >>>>>>>>> already deprecated and so already issuing a deprecation warning >>>>>>>>> that the test has to handle. Without my change there would >>>>>>>>> still be an obsoletion warning, so this test is in for a >>>>>>>>> warning no matter what. >>>>>>>> >>>>>>>> Good that your change only comes into play when the flag is used. >>>>>>>> >>>>>>>> >>>>>>>>> Also note that for hotspot at least we have strived to make >>>>>>>>> tests tolerate unexpected output. The reason JDK-8196741 was >>>>>>>>> closed as "won't fix" was because other areas wouldn't commit >>>>>>>>> to doing that. >>>>>>>> >>>>>>>> Yup. Got it. >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>>> ???????? - If you move forward with this fix, then I think >>>>>>>>>> think code >>>>>>>>>> ?????????? block needs to be removed or modified or am I >>>>>>>>>> missing something? >>>>>>>>> >>>>>>>>> I've rewritten the comment at the head of >>>>>>>>> verify_special_jvm_flags to explain why we can't issue a >>>>>>>>> warning, and have deleted the block. 
>>>>>>>> >>>>>>>> Thanks for deleting the stale code. >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>>> ???????? There's a similar commented out check on L757-L765, >>>>>>>>>> but that one >>>>>>>>>> ???????? is for an expired flag... You might want to >>>>>>>>>> adjust/delete it also? >>>>>>>>> >>>>>>>>> Deleted. >>>>>>>> >>>>>>>> Thanks. >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>>> ???? L753: warning("Special flag entry \"%s\" must be >>>>>>>>>> explicitly obsoleted before expired.", flag.name); >>>>>>>>>> ???? L754: ??????? success = false; >>>>>>>>>> ???????? nit - s/before expired/before being expired/ >>>>>>>>>> ???????? Update: I now see that "style" is in several places >>>>>>>>>> in this >>>>>>>>>> ???????????? function. I'm not sure what to think here... it >>>>>>>>>> grates, >>>>>>>>>> ?? ? ? ? ? ? but I can live with it. >>>>>>>>>> >>>>>>>>>> ???????? nit - L75[34] indented too much by two spaces. >>>>>>>>> >>>>>>>>> Fixed. >>>>>>>>> >>>>>>>>>> ???? L962: ????????? return real_name; >>>>>>>>>> ???????? nit - indented too much by two spaces. >>>>>>>>> >>>>>>>>> Fixed. >>>>>>>>> >>>>>>>>>> >>>>>>>>>> Trying to understand the modified logic in argument processing is >>>>>>>>>> making my head spin... >>>>>>>>> >>>>>>>>> Mine too. It took a few attempts to put the logic in the right >>>>>>>>> place and make adjustments so that it all works as expected for >>>>>>>>> a correctly specified flag and an erroneous one. >>>>>>>> >>>>>>>> I keep trying to convince myself that we're improving this flag and >>>>>>>> options code with each release... :-) >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>>> - You've added a call to is_obsolete_flag() in >>>>>>>>>> ?? handle_aliases_and_deprecation() and is_obsolete_flag() >>>>>>>>>> ?? is where the new warning is output: >>>>>>>>>> >>>>>>>>>> ???? warning("Temporarily processing option %s; support is >>>>>>>>>> scheduled for removal in %s" >>>>>>>>>> >>>>>>>>>> ?? 
handle_aliases_and_deprecation() is called from six >>>>>>>>>> different places, >>>>>>>>>> ?? but the call sites are different based on the argument >>>>>>>>>> pattern so I >>>>>>>>>> ?? have (mostly) convinced myself that there should not be any >>>>>>>>>> duplicate >>>>>>>>>> ?? warning lines. >>>>>>>>> >>>>>>>>> Right - handle_aliases_and_deprecation is only called for a >>>>>>>>> syntactically correct flag based on those patterns. It normally >>>>>>>>> filters out obsoleted/expired flags and lets them fall through >>>>>>>>> to later error processing (in process_argument after parse_arg >>>>>>>>> returns false). That error processing is where the normal >>>>>>>>> obsoletion check is performed. So I had to not filter the flag >>>>>>>>> in handle_aliases_and_deprecation in this case, but still >>>>>>>>> produce the warning for a malformed flag. E.g. >>>>>>>>> >>>>>>>>> java -XX:+UseParallelOldGC -version >>>>>>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily >>>>>>>>> processing option UseParallelOldGC; support is scheduled for >>>>>>>>> removal in 15.0 >>>>>>>>> java version "15-internal" 2020-09-15 >>>>>>>>> >>>>>>>>> java -XX:UseParallelOldGC -version >>>>>>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily >>>>>>>>> processing option UseParallelOldGC; support is scheduled for >>>>>>>>> removal in 15.0 >>>>>>>>> Missing +/- setting for VM option 'UseParallelOldGC' >>>>>>>>> Error: Could not create the Java Virtual Machine. >>>>>>>> >>>>>>>> Thanks for the example. That helps a lot. >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>>> So I now understand the new logic that allows an obsoleted option >>>>>>>>>> to be specified with a warning as long as the option still >>>>>>>>>> exists. >>>>>>>>>> I'm good with the technical change, but... 
>>>>>>>>>> >>>>>>>>>> I'm worried about tests that don't tolerate the new warning mesg, >>>>>>>>>> i.e., why wouldn't this become an issue again: >>>>>>>>>> >>>>>>>>>> JDK-8196739 Disable obsolete/expired VM flag transitional >>>>>>>>>> warnings >>>>>>>>> >>>>>>>>> Explained above. >>>>>>>> >>>>>>>> Yup and thanks. >>>>>>>> >>>>>>>> Dan >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>> Thanks, >>>>>>>>> David >>>>>>>>> >>>>>>>>>> Dan >>>>>>>>>> >>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> When a flag is marked as obsolete in the special-flags table >>>>>>>>>>> we will ignore it and issue a warning that it is being >>>>>>>>>>> ignored, as soon as we bump the version of the JDK. That >>>>>>>>>>> means that any tests still using the obsolete flag may start >>>>>>>>>>> to fail, leading to a surge of test failures at the start of >>>>>>>>>>> a release cycle. For example for JDK 15 we have a whole bunch >>>>>>>>>>> of JFR tests that fail because they still try to work with >>>>>>>>>>> UseParallelOldGC. In another case >>>>>>>>>>> runtime/cds/appcds/FieldLayoutFlags.java passes only by >>>>>>>>>>> accident. >>>>>>>>>>> >>>>>>>>>>> When a flag is marked as obsolete for a release, all code >>>>>>>>>>> involving that flag (including tests that use it) must be >>>>>>>>>>> updated within that release and the flag itself removed. >>>>>>>>>>> Whilst this is typically scheduled early in a release cycle >>>>>>>>>>> it isn't reasonable to expect it to all occur within the >>>>>>>>>>> first couple of days of the release cycle, nor do we want to >>>>>>>>>>> have to ProblemList a bunch of tests when they start failing. >>>>>>>>>>> >>>>>>>>>>> What I propose is to instead allow an obsolete flag to >>>>>>>>>>> continue to be processed as long as that code removal has not >>>>>>>>>>> actually occurred - with an adjusted warning.
The change I >>>>>>>>>>> propose: >>>>>>>>>>> >>>>>>>>>>> - only treats an obsolete flag as obsolete if the flag cannot >>>>>>>>>>> be found >>>>>>>>>>> - added a new flag verification rule that disallows >>>>>>>>>>> obsoletion in an undefined version, but expiration in a >>>>>>>>>>> specific version i.e. we must always explicitly obsolete a >>>>>>>>>>> flag before we expire it. >>>>>>>>>>> >>>>>>>>>>> The only downside here is that if we actually forget to file >>>>>>>>>>> an issue for the actual obsoletion work we may not notice via >>>>>>>>>>> testing. Of course whenever a change is made to the flags >>>>>>>>>>> table to add an entry then the issue to do the obsoletion >>>>>>>>>>> should be filed at the same time. >>>>>>>>>>> >>>>>>>>>>> Thanks, >>>>>>>>>>> David >>>>>>>>>>> ----- >>>>>>>>>>> >>>>>>>>>> >>>>>>>> >>>>> >> From kim.barrett at oracle.com Wed Jan 22 05:01:41 2020 From: kim.barrett at oracle.com (Kim Barrett) Date: Wed, 22 Jan 2020 00:01:41 -0500 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: <50a1ee24-61a9-44fa-2590-69173765e9de@oracle.com> References: <160821DF-BE15-435C-BDBE-AB553F794590@oracle.com> <50a1ee24-61a9-44fa-2590-69173765e9de@oracle.com> Message-ID: > On Jan 17, 2020, at 12:48 AM, David Holmes wrote: > > I tried but failed to generate an incremental webrev at: > > http://cr.openjdk.java.net/~dholmes/8235741/webrev.v3-incr/ > > I hadn't qfresh'd and so it contains v2 and v3 changes :( > > Full webrev at: > > http://cr.openjdk.java.net/~dholmes/8235741/webrev.v3/ > > Changes: > - conversion macros are now typed functions > - os_posix.cpp had a name clash so fixed that and updated to use the new conversion functions Looks good. 
From david.holmes at oracle.com Wed Jan 22 06:53:57 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 22 Jan 2020 16:53:57 +1000 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: References: <160821DF-BE15-435C-BDBE-AB553F794590@oracle.com> <50a1ee24-61a9-44fa-2590-69173765e9de@oracle.com> Message-ID: <86333217-20d5-0c3f-e3ea-7330cfbdc677@oracle.com> Thanks Kim :) David On 22/01/2020 3:01 pm, Kim Barrett wrote: >> On Jan 17, 2020, at 12:48 AM, David Holmes wrote: >> >> I tried but failed to generate an incremental webrev at: >> >> http://cr.openjdk.java.net/~dholmes/8235741/webrev.v3-incr/ >> >> I hadn't qfresh'd and so it contains v2 and v3 changes :( >> >> Full webrev at: >> >> http://cr.openjdk.java.net/~dholmes/8235741/webrev.v3/ >> >> Changes: >> - conversion macros are now typed functions >> - os_posix.cpp had a name clash so fixed that and updated to use the new conversion functions > > Looks good. > From david.holmes at oracle.com Wed Jan 22 06:56:48 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 22 Jan 2020 16:56:48 +1000 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: <75f4cd8d-d291-2cf5-b0dd-91a1d76a6130@oracle.com> References: <75f4cd8d-d291-2cf5-b0dd-91a1d76a6130@oracle.com> Message-ID: <90b6c9fe-d0fb-3889-ec42-b9db607a9334@oracle.com> Hi Robbin, Can you confirm you are okay with latest changes please. http://cr.openjdk.java.net/~dholmes/8235741/webrev.v3/ Thanks, David ----- On 13/01/2020 11:08 pm, Robbin Ehn wrote: > Hi David, looks good, thanks! > > /Robbin > > On 1/13/20 8:13 AM, David Holmes wrote: >> webrev: http://cr.openjdk.java.net/~dholmes/8235741/webrev/ >> bug: https://bugs.openjdk.java.net/browse/JDK-8235741 >> >> Full details in the bug report about the existing uses of >> javaTimeMillis(), many of which just want an elapsed time in ms and so >> should be using javaTimeNanos() and convert to ms. This covers areas >> all across the VM. 
>> >> Only non-simple change is in os_perf_linux.cpp (and the same code will >> be in os_perf_aix.cpp once it has been validated). There we are >> tracking an elapsed time in ms but relative to the boot time, which is >> seconds since the epoch. Consequently the first interval has to be >> calculated using javaTimeMillis, but after that we can use >> javaTimeNanos (using a new 'first time' captured at the same time we >> used javaTimeMillis). I think I have the logic right but other than >> through JFR this code seems unused and I have limited means of testing >> it. The JFR test jdk/jfr/event/os/TestThreadContextSwitches.java >> exercises the code but the results of running that test seems to >> exhibit arbitrary randomness in the rates reported - e.g. 0 to 16000Hz >> - both with and without my change, so not really that useful. Stefan >> K. suggested a gtest which I may look into - though it is frustrating >> to have to expend such effort to validate this. >> >> Other testing tiers 1-3. >> >> Thanks, >> David From robbin.ehn at oracle.com Wed Jan 22 08:54:50 2020 From: robbin.ehn at oracle.com (Robbin Ehn) Date: Wed, 22 Jan 2020 09:54:50 +0100 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: <90b6c9fe-d0fb-3889-ec42-b9db607a9334@oracle.com> References: <75f4cd8d-d291-2cf5-b0dd-91a1d76a6130@oracle.com> <90b6c9fe-d0fb-3889-ec42-b9db607a9334@oracle.com> Message-ID: <32ef609e-c43b-76e9-f801-e70dac35cf80@oracle.com> Hi David, yes, thanks! /Robbin On 1/22/20 7:56 AM, David Holmes wrote: > Hi Robbin, > > Can you confirm you are okay with latest changes please. > > http://cr.openjdk.java.net/~dholmes/8235741/webrev.v3/ > > Thanks, > David > ----- > > On 13/01/2020 11:08 pm, Robbin Ehn wrote: >> Hi David, looks good, thanks! 
>> >> /Robbin >> >> On 1/13/20 8:13 AM, David Holmes wrote: >>> webrev: http://cr.openjdk.java.net/~dholmes/8235741/webrev/ >>> bug: https://bugs.openjdk.java.net/browse/JDK-8235741 >>> >>> Full details in the bug report about the existing uses of javaTimeMillis(), >>> many of which just want an elapsed time in ms and so should be using >>> javaTimeNanos() and convert to ms. This covers areas all across the VM. >>> >>> Only non-simple change is in os_perf_linux.cpp (and the same code will be in >>> os_perf_aix.cpp once it has been validated). There we are tracking an elapsed >>> time in ms but relative to the boot time, which is seconds since the epoch. >>> Consequently the first interval has to be calculated using javaTimeMillis, >>> but after that we can use javaTimeNanos (using a new 'first time' captured at >>> the same time we used javaTimeMillis). I think I have the logic right but >>> other than through JFR this code seems unused and I have limited means of >>> testing it. The JFR test jdk/jfr/event/os/TestThreadContextSwitches.java >>> exercises the code but the results of running that test seems to exhibit >>> arbitrary randomness in the rates reported - e.g. 0 to 16000Hz - both with >>> and without my change, so not really that useful. Stefan K. suggested a gtest >>> which I may look into - though it is frustrating to have to expend such >>> effort to validate this. >>> >>> Other testing tiers 1-3. >>> >>> Thanks, >>> David From david.holmes at oracle.com Wed Jan 22 09:48:46 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 22 Jan 2020 19:48:46 +1000 Subject: RFR (M): 8235741: Inappropriate uses of os::javaTimeMillis() In-Reply-To: <32ef609e-c43b-76e9-f801-e70dac35cf80@oracle.com> References: <75f4cd8d-d291-2cf5-b0dd-91a1d76a6130@oracle.com> <90b6c9fe-d0fb-3889-ec42-b9db607a9334@oracle.com> <32ef609e-c43b-76e9-f801-e70dac35cf80@oracle.com> Message-ID: <9ba98e60-8069-d37d-3ed4-74408f10306f@oracle.com> Thanks Robbin! 
David On 22/01/2020 6:54 pm, Robbin Ehn wrote: > Hi David, yes, thanks! > > /Robbin > > On 1/22/20 7:56 AM, David Holmes wrote: >> Hi Robbin, >> >> Can you confirm you are okay with latest changes please. >> >> http://cr.openjdk.java.net/~dholmes/8235741/webrev.v3/ >> >> Thanks, >> David >> ----- >> >> On 13/01/2020 11:08 pm, Robbin Ehn wrote: >>> Hi David, looks good, thanks! >>> >>> /Robbin >>> >>> On 1/13/20 8:13 AM, David Holmes wrote: >>>> webrev: http://cr.openjdk.java.net/~dholmes/8235741/webrev/ >>>> bug: https://bugs.openjdk.java.net/browse/JDK-8235741 >>>> >>>> Full details in the bug report about the existing uses of >>>> javaTimeMillis(), many of which just want an elapsed time in ms and >>>> so should be using javaTimeNanos() and convert to ms. This covers >>>> areas all across the VM. >>>> >>>> Only non-simple change is in os_perf_linux.cpp (and the same code >>>> will be in os_perf_aix.cpp once it has been validated). There we are >>>> tracking an elapsed time in ms but relative to the boot time, which >>>> is seconds since the epoch. Consequently the first interval has to >>>> be calculated using javaTimeMillis, but after that we can use >>>> javaTimeNanos (using a new 'first time' captured at the same time we >>>> used javaTimeMillis). I think I have the logic right but other than >>>> through JFR this code seems unused and I have limited means of >>>> testing it. The JFR test >>>> jdk/jfr/event/os/TestThreadContextSwitches.java exercises the code >>>> but the results of running that test seems to exhibit arbitrary >>>> randomness in the rates reported - e.g. 0 to 16000Hz - both with and >>>> without my change, so not really that useful. Stefan K. suggested a >>>> gtest which I may look into - though it is frustrating to have to >>>> expend such effort to validate this. >>>> >>>> Other testing tiers 1-3. 
>>>> >>>> Thanks, >>>> David From vladimir.x.ivanov at oracle.com Wed Jan 22 11:52:25 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Wed, 22 Jan 2020 14:52:25 +0300 Subject: [15] RFR (L): 7175279: Don't use x87 FPU on x86-64 In-Reply-To: <7063EB29-D415-4A48-BA4F-B16C5A1F52F8@oracle.com> References: <0b0897e0-1dbc-306d-b2cb-31de13fb8b34@oracle.com> <7063EB29-D415-4A48-BA4F-B16C5A1F52F8@oracle.com> Message-ID: <16d75ed8-896b-25fe-0a0a-babc9087e631@oracle.com> Thanks, Vladimir. Anybody for a second (R)eview, please? http://cr.openjdk.java.net/~vlivanov/7175279/webrev.00/ + http://cr.openjdk.java.net/~vlivanov/7175279/webrev.01-00/ Best regards, Vladimir Ivanov On 20.12.2019 01:00, Vladimir Kozlov wrote: > Good > > Thanks > Vladimir > >> On Dec 19, 2019, at 1:58 PM, Vladimir Ivanov wrote: >> >> ? >>>>> Make ConversionStub x86_32-specific only if possible. From what I see it is only LIR_OpConvert in c1_LIR.hpp we have to deal with. I actually can't see how it could be only 32-specific. Hmm? >>>> >>>> I experimented with it, but it requires #ifdefs in c1_LIR.cpp/hpp which I don't like. So, I don't consider it as an > option right now. >>> Okay, NOT_IA32( ShouldNotReachHere() ) with comment in c1_CodeStubs.hpp should be enough for now. >> >> Incremental diff: >> http://cr.openjdk.java.net/~vlivanov/7175279/webrev.01-00/ >> >> Best regards, >> Vladimir Ivanov >> >>>>>>> c1_LinearScan.cpp - I think IA64 is used for Itanium. For 64-bit x86 we use AMD64: >>>>>>> >>>>>>> https://hg.openjdk.java.net/jdk/jdk/file/cfaa2457a60a/src/hotspot/share/utilities/macros.hpp#l456 >>>>>> >>>>>> >>>>>> >>>>>> Yes, you are right. Good catch! 
:-) >>>>>> >>>>>> Best regards, >>>>>> Vladimir Ivanov >>>>>> >>>>>>> On 12/17/19 4:50 AM, Vladimir Ivanov wrote: >>>>>>>> http://cr.openjdk.java.net/~vlivanov/7175279/webrev.00/ >>>>>>>> https://bugs.openjdk.java.net/browse/JDK-7175279 >>>>>>>> >>>>>>>> There was a major rewrite of math intrinsics in the JDK 9 time frame which almost completely eliminated x87 code in the x86-64 code base. >>>>>>>> >>>>>>>> Proposed patch removes the rest and makes x86-64 code x87-free. >>>>>>>> >>>>>>>> The main motivation for the patch is to completely eliminate non-strictfp behaving code in order to prepare the JVM for JEP 306 [1] and related enhancements [2]. >>>>>>>> >>>>>>>> Most of the changes are in C1, but there is one case in template interpreter (java_lang_math_abs) which now uses StubRoutines::x86::double_sign_mask(). It forces its initialization to be moved to StubRoutines::initialize1(). >>>>>>>> >>>>>>>> x87 instructions are made available only on x86-32. >>>>>>>> >>>>>>>> C1 changes involve removing FPU support on x86-64 and effectively make x86-specific support in linear scan allocator [3] x86-32-only. >>>>>>>> >>>>>>>> Testing: tier1-6, build on x86-32, linux-aarch64, linux-arm32. >>>>>>>> >>>>>>>> Best regards, >>>>>>>> Vladimir Ivanov >>>>>>>> >>>>>>>> [1] https://bugs.openjdk.java.net/browse/JDK-8175916 >>>>>>>> >>>>>>>> [2] https://bugs.openjdk.java.net/browse/JDK-8136414 >>>>>>>> >>>>>>>> [3] http://hg.openjdk.java.net/jdk/jdk/file/ff7cd49f2aef/src/hotspot/cpu/x86/c1_LinearScan_x86.cpp > From stefan.karlsson at oracle.com Wed Jan 22 13:33:19 2020 From: stefan.karlsson at oracle.com (Stefan Karlsson) Date: Wed, 22 Jan 2020 14:33:19 +0100 Subject: RFR: 8237637: Remove dubious type conversions from oop Message-ID: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com> Hi all, Please review this patch to remove some dubious type conversions from oop.
https://bugs.openjdk.java.net/browse/JDK-8237637 When running with fastdebug and CHECK_UNHANDLED_OOPS oop is not a simple oopDesc* but a class. This class has a number of type conversions with the comment: // Explict user conversions However, these are not only *explicit* conversions. They can be invoked implicitly as well. I propose that we get rid of most of these and only leave these two: operator void* () const operator oopDesc* () const so that we get better type safety in the code. I've split this up into multiple webrevs to make it easier to review: https://cr.openjdk.java.net/~stefank/8237637/webrev.01/00.promotedObject/ https://cr.openjdk.java.net/~stefank/8237637/webrev.01/01.oopStar/ https://cr.openjdk.java.net/~stefank/8237637/webrev.01/02.jobject/ https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/ https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/ https://cr.openjdk.java.net/~stefank/8237637/webrev.01/05.voidStarVolatiles/ https://cr.openjdk.java.net/~stefank/8237637/webrev.01/06.heapWord/ All changes combined: https://cr.openjdk.java.net/~stefank/8237637/webrev.01/all/ Testing: builds pass, tier1-3 running Thanks, StefanK From matthias.baesken at sap.com Wed Jan 22 13:33:24 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Wed, 22 Jan 2020 13:33:24 +0000 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: References: <172d4100-5cd2-2a99-f940-8aa25414c87a@oracle.com> <5d4de23f-a89d-7217-1776-51a1f3611d9a@oracle.com> Message-ID: Hi Magnus / David, here is a new webrev : http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.4/ it supports now a configure switch --enable-linktime-gc=yes that needs to be set to enable the link time section gc . Exception is linuxs390x where we already have the feature enabled (and keep it enabled always for LIB_JVM). Best regards, Matthias From: Baesken, Matthias Sent: Freitag, 17. 
Januar 2020 12:44 To: Magnus Ihse Bursie ; David Holmes ; 'build-dev at openjdk.java.net' ; 'hotspot-dev at openjdk.java.net' Subject: RE: RFR: 8236714: enable link-time section-gc for linux to remove unused code * Matthias: Have a look at some recently added option to get an indication of the best practice in adding new options. There are some ways to easily make this incorrect Hi Magnus, do you have a good/"best practice" example (not that I catch a bad one)? Best regards, Matthias From coleen.phillimore at oracle.com Wed Jan 22 13:52:37 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Wed, 22 Jan 2020 08:52:37 -0500 Subject: RFR: 8237637: Remove dubious type conversions from oop In-Reply-To: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com> References: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com> Message-ID: <9a2f3283-a807-8a5c-f122-958662909e97@oracle.com> I really like this and have only started clicking. 174 template <class T> inline T cast_from_oop(oop o) { 175 return (T)(CHECK_UNHANDLED_OOPS_ONLY((void*))o); 176 } Why did you leave void* as a conversion??? Can you define cast_from_oop<> to not need it? Thanks, Coleen On 1/22/20 8:33 AM, Stefan Karlsson wrote: > Hi all, > > Please review this patch to remove some dubious type conversions from > oop. > > https://bugs.openjdk.java.net/browse/JDK-8237637 > > When running with fastdebug and CHECK_UNHANDLED_OOPS oop is not a > simple oopDesc* but a class. This class has a number of type > conversions with the comment: > > // Explict user conversions > > However, these are not only *explicit* conversions. They can be > invoked implicitly as well. > > I propose that we get rid of most of these and only leave these two: > operator void* () const > operator oopDesc* () const > > so that we get better type safety in the code.
> > I've split this up into multiple webrevs to make it easier to review: > > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/00.promotedObject/ > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/01.oopStar/ > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/02.jobject/ > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/ > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/ > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/05.voidStarVolatiles/ > > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/06.heapWord/ > > All changes combined: > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/all/ > > Testing: builds pass, tier1-3 running > > Thanks, > StefanK > From stefan.karlsson at oracle.com Wed Jan 22 14:33:07 2020 From: stefan.karlsson at oracle.com (Stefan Karlsson) Date: Wed, 22 Jan 2020 15:33:07 +0100 Subject: RFR: 8237637: Remove dubious type conversions from oop In-Reply-To: <9a2f3283-a807-8a5c-f122-958662909e97@oracle.com> References: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com> <9a2f3283-a807-8a5c-f122-958662909e97@oracle.com> Message-ID: On 2020-01-22 14:52, coleen.phillimore at oracle.com wrote: > > I really like this and have only started clicking. > > 174 template <class T> inline T cast_from_oop(oop o) { > 175 return (T)(CHECK_UNHANDLED_OOPS_ONLY((void*))o); > 176 } > > > Why did you leave void* as a conversion??? Can you define > cast_from_oop<> to not need it? I thought it made sense to allow it to be converted to void*, but I now see that it already works because it can be converted through oopDesc: oop -> oopDesc* -> void*. I removed the following line and compiled locally: operator void* () const { return (void *)obj(); } I'll make sure that it compiles with other compilers as well. Thanks, StefanK > > Thanks, > Coleen > > On 1/22/20 8:33 AM, Stefan Karlsson wrote: >> Hi all, >> >> Please review this patch to remove some dubious type conversions from >> oop.
>> >> https://bugs.openjdk.java.net/browse/JDK-8237637 >> >> When running with fastdebug and CHECK_UNHANDLED_OOPS oop is not a >> simple oopDesc* but a class. This class has a number of type >> conversions with the comment: >> >> ? // Explict user conversions >> >> However, these are not only *explicit* conversions. They can be >> invoked implicitly as well. >> >> I propose that we get rid of most of these and only leave these two: >> ?operator void* () const >> ?operator oopDesc* () const >> >> so that we get better type safety in the code. >> >> I've split this up into multiple webrevs to make it easier to review: >> >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/00.promotedObject/ >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/01.oopStar/ >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/02.jobject/ >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/ >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/ >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/05.voidStarVolatiles/ >> >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/06.heapWord/ >> >> All changes combined: >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/all/ >> >> Testing: builds pass, tier1-3 running >> >> Thanks, >> StefanK >> > From erik.joelsson at oracle.com Wed Jan 22 14:38:55 2020 From: erik.joelsson at oracle.com (Erik Joelsson) Date: Wed, 22 Jan 2020 06:38:55 -0800 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: References: <172d4100-5cd2-2a99-f940-8aa25414c87a@oracle.com> <5d4de23f-a89d-7217-1776-51a1f3611d9a@oracle.com> Message-ID: Hello Matthias, You can keep the setting up of all the flags in flags-cflags.m4 and flags-ldflags.m4 based on the value of ENABLE_LINKTIME_GC. You can also default the value of this new parameter to true for s390x to keep the current behavior for that platform. 
As it is in this patch, the JVM flags for s390x are setup in configure while the JDK flags are in make, which gets confusing I think. /Erik On 2020-01-22 05:33, Baesken, Matthias wrote: > Hi Magnus / David, here is a new webrev : > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.4/ > > > it supports now a configure switch --enable-linktime-gc=yes that needs to be set to enable the link time section gc . > > Exception is linuxs390x where we already have the feature enabled (and keep it enabled always for LIB_JVM). > > Best regards, Matthias > > > > From: Baesken, Matthias > Sent: Freitag, 17. Januar 2020 12:44 > To: Magnus Ihse Bursie ; David Holmes ; 'build-dev at openjdk.java.net' ; 'hotspot-dev at openjdk.java.net' > Subject: RE: RFR: 8236714: enable link-time section-gc for linux to remove unused code > > > > * Matthias: Have a look at some recently added option to get an indication of the best practice in adding new options. There are some ways to easily make this incorrect > > Hi Magnus, do you have a good/?best practice? example (not that I catch a bad one ?? ) ? > > Best regards, Matthias > From erik.osterlund at oracle.com Wed Jan 22 14:50:53 2020 From: erik.osterlund at oracle.com (erik.osterlund at oracle.com) Date: Wed, 22 Jan 2020 15:50:53 +0100 Subject: RFR: 8237637: Remove dubious type conversions from oop In-Reply-To: References: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com> <9a2f3283-a807-8a5c-f122-958662909e97@oracle.com> Message-ID: Hi Stefan, Nice! Ship it! Thanks, /Erik On 1/22/20 3:33 PM, Stefan Karlsson wrote: > On 2020-01-22 14:52, coleen.phillimore at oracle.com wrote: >> >> I really like this and have only started clicking. >> >> ??174 template inline T cast_from_oop(oop o) { >> ??175?? return (T)(CHECK_UNHANDLED_OOPS_ONLY((void*))o); >> ??176 } >> >> >> Why did you leave void* as a conversion??? Can you define >> cast_from_oop<> to not need it? 
> > I thought it made sense to allow it to be converted to void*, but I > now see that it already works because it can be converted through > oopDesc: oop -> oopDesc* -> void*. > > I removed the following line and compiled locally: > ? operator void* () const???????????? { return (void *)obj(); } > > I'll make sure that it compiles with other compilers as well. > > Thanks, > StefanK > >> >> Thanks, >> Coleen >> >> On 1/22/20 8:33 AM, Stefan Karlsson wrote: >>> Hi all, >>> >>> Please review this patch to remove some dubious type conversions >>> from oop. >>> >>> https://bugs.openjdk.java.net/browse/JDK-8237637 >>> >>> When running with fastdebug and CHECK_UNHANDLED_OOPS oop is not a >>> simple oopDesc* but a class. This class has a number of type >>> conversions with the comment: >>> >>> ? // Explict user conversions >>> >>> However, these are not only *explicit* conversions. They can be >>> invoked implicitly as well. >>> >>> I propose that we get rid of most of these and only leave these two: >>> ?operator void* () const >>> ?operator oopDesc* () const >>> >>> so that we get better type safety in the code. 
>>> >>> I've split this up into multiple webrevs to make it easier to review: >>> >>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/00.promotedObject/ >>> >>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/01.oopStar/ >>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/02.jobject/ >>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/ >>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/ >>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/05.voidStarVolatiles/ >>> >>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/06.heapWord/ >>> >>> All changes combined: >>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/all/ >>> >>> Testing: builds pass, tier1-3 running >>> >>> Thanks, >>> StefanK >>> >> From matthias.baesken at sap.com Wed Jan 22 15:46:48 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Wed, 22 Jan 2020 15:46:48 +0000 Subject: RFR: 8237192: Generate stripped/public pdbs on Windows for jdk images References: <215dd116-d0f6-6e35-6b02-20d3986fa85a@oracle.com> Message-ID: Hello, here is an updated version : http://cr.openjdk.java.net/~mbaesken/webrevs/8237192.3/ this one supports a configure switch "--enable-stripped-pdbs" to enable the feature . Best regards, Matthias > -----Original Message----- > From: Baesken, Matthias > Sent: Dienstag, 21. Januar 2020 11:03 > To: 'David Holmes' ; 'build- > dev at openjdk.java.net' ; 'hotspot- > dev at openjdk.java.net' > Subject: RE: RFR: 8237192: Generate stripped/public pdbs on Windows for > jdk images > > > Hi David , yes I think it makes sense to have a configure option for this . > Not everyone would like to have a larger JDK (even it is only a bit larger). 
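[Editor's note] The mechanism the 8236714 thread is enabling behind --enable-linktime-gc can be sketched with the GNU toolchain directly: compile every function and data item into its own section, then let the linker discard unreferenced sections. This is a hedged illustration, not the JDK build change itself; it assumes GNU gcc/ld (macOS uses -Wl,-dead_strip instead), and the file names are made up.

```shell
# A function that nothing references:
cat > gcdemo.c <<'EOF'
#include <stdio.h>
void used(void)         { puts("used"); }
void never_called(void) { puts("never"); }   /* dead code */
int main(void) { used(); return 0; }
EOF

# -ffunction-sections/-fdata-sections give each symbol its own section;
# --gc-sections lets the linker garbage-collect unreferenced sections.
gcc -ffunction-sections -fdata-sections -c gcdemo.c -o gcdemo.o
gcc -Wl,--gc-sections -o gcdemo gcdemo.o

./gcdemo    # prints: used
# never_called (and its code) is gone from the final binary:
if ! nm gcdemo | grep -q never_called; then echo "never_called removed"; fi
```

The trade-off being discussed in the thread is exactly this: smaller binaries, at the cost of extra link time and toolchain-specific flags, hence the opt-in configure switch.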
> From matthias.baesken at sap.com Wed Jan 22 16:01:59 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Wed, 22 Jan 2020 16:01:59 +0000 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: References: <172d4100-5cd2-2a99-f940-8aa25414c87a@oracle.com> <5d4de23f-a89d-7217-1776-51a1f3611d9a@oracle.com> Message-ID: Hi Erik, okay I'll check that . I had the impression I would have ordering issues of the m4 files and how they end up in the generated-configure.sh but looks like that?s not the case . Best regards, Matthias > > Hello Matthias, > > You can keep the setting up of all the flags in flags-cflags.m4 and > flags-ldflags.m4 based on the value of ENABLE_LINKTIME_GC. You can also > default the value of this new parameter to true for s390x to keep the > current behavior for that platform. As it is in this patch, the JVM > flags for s390x are setup in configure while the JDK flags are in make, > which gets confusing I think. > > /Erik > From erik.joelsson at oracle.com Wed Jan 22 16:36:50 2020 From: erik.joelsson at oracle.com (Erik Joelsson) Date: Wed, 22 Jan 2020 08:36:50 -0800 Subject: RFR: 8237192: Generate stripped/public pdbs on Windows for jdk images In-Reply-To: References: <215dd116-d0f6-6e35-6b02-20d3986fa85a@oracle.com> Message-ID: This still does not address anything in my objection. /Erik On 2020-01-22 07:46, Baesken, Matthias wrote: > Hello, here is an updated version : > > http://cr.openjdk.java.net/~mbaesken/webrevs/8237192.3/ > > this one supports a configure switch "--enable-stripped-pdbs" to enable the feature . > > Best regards, Matthias > > >> -----Original Message----- >> From: Baesken, Matthias >> Sent: Dienstag, 21. 
Januar 2020 11:03 >> To: 'David Holmes' ; 'build- >> dev at openjdk.java.net' ; 'hotspot- >> dev at openjdk.java.net' >> Subject: RE: RFR: 8237192: Generate stripped/public pdbs on Windows for >> jdk images >> >> >> Hi David , yes I think it makes sense to have a configure option for this . >> Not everyone would like to have a larger JDK (even it is only a bit larger). >> From david.holmes at oracle.com Wed Jan 22 23:56:42 2020 From: david.holmes at oracle.com (David Holmes) Date: Thu, 23 Jan 2020 09:56:42 +1000 Subject: RFR: 8237637: Remove dubious type conversions from oop In-Reply-To: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com> References: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com> Message-ID: <63365943-2b6a-2388-5934-62d8f179b291@oracle.com> Hi Stefan, I don't have any concerns with this, but if cast_to_oop is for "numerical conversions" then can't we get rid of the secondary casts to jlong and u8 where used i.e can --- old/src/hotspot/share/jfr/leakprofiler/checkpoint/objectSampleWriter.cpp 2020-01-22 14:24:21.194074776 +0100 +++ new/src/hotspot/share/jfr/leakprofiler/checkpoint/objectSampleWriter.cpp 2020-01-22 14:24:20.714066939 +0100 @@ -206,7 +206,7 @@ oop object = oosi->_data._object; assert(object != NULL, "invariant"); writer->write(oosi->_id); - writer->write((u8)(const HeapWord*)object); + writer->write((u8)cast_from_oop(object)); not use: writer->write(cast_from_oop(object)); and: --- old/src/hotspot/share/jvmci/jvmciCompilerToVM.cpp 2020-01-22 14:24:22.674098943 +0100 +++ new/src/hotspot/share/jvmci/jvmciCompilerToVM.cpp 2020-01-22 14:24:22.142090256 +0100 @@ -431,7 +431,7 @@ } else if (JVMCIENV->isa_HotSpotObjectConstantImpl(base_object)) { Handle base_oop = JVMCIENV->asConstant(base_object, JVMCI_CHECK_NULL); if (base_oop->is_a(SystemDictionary::Class_klass())) { - base_address = (jlong) (address) base_oop(); + base_address = (jlong) cast_from_oop
(base_oop()); not use base_address = cast_from_oop(base_oop()); ? Thanks, David On 22/01/2020 11:33 pm, Stefan Karlsson wrote: > Hi all, > > Please review this patch to remove some dubious type conversions from oop. > > https://bugs.openjdk.java.net/browse/JDK-8237637 > > When running with fastdebug and CHECK_UNHANDLED_OOPS oop is not a simple > oopDesc* but a class. This class has a number of type conversions with > the comment: > > ? // Explict user conversions > > However, these are not only *explicit* conversions. They can be invoked > implicitly as well. > > I propose that we get rid of most of these and only leave these two: > ?operator void* () const > ?operator oopDesc* () const > > so that we get better type safety in the code. > > I've split this up into multiple webrevs to make it easier to review: > > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/00.promotedObject/ > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/01.oopStar/ > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/02.jobject/ > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/ > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/ > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/05.voidStarVolatiles/ > > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/06.heapWord/ > > All changes combined: > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/all/ > > Testing: builds pass, tier1-3 running > > Thanks, > StefanK > From kim.barrett at oracle.com Thu Jan 23 07:06:57 2020 From: kim.barrett at oracle.com (Kim Barrett) Date: Thu, 23 Jan 2020 02:06:57 -0500 Subject: RFR: 8237637: Remove dubious type conversions from oop In-Reply-To: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com> References: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com> Message-ID: > On Jan 22, 2020, at 8:33 AM, Stefan Karlsson wrote: > > Hi all, > > Please review this patch to remove some dubious type conversions from oop. 
> 
> https://bugs.openjdk.java.net/browse/JDK-8237637
> 
> When running with fastdebug and CHECK_UNHANDLED_OOPS oop is not a simple oopDesc* but a class. This class has a number of type conversions with the comment:
> 
>   // Explict user conversions
> 
> However, these are not only *explicit* conversions. They can be invoked implicitly as well.
> 
> I propose that we get rid of most of these and only leave these two:
>   operator void* () const
>   operator oopDesc* () const
> 
> so that we get better type safety in the code.
> 
> I've split this up into multiple webrevs to make it easier to review:
> 
> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/00.promotedObject/
> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/01.oopStar/
> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/02.jobject/
> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/
> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/
> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/05.voidStarVolatiles/
> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/06.heapWord/
> 
> All changes combined:
> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/all/
> 
> Testing: builds pass, tier1-3 running
> 
> Thanks,
> StefanK

I like this.  A few minor cleanups below.  I have some additional
suggestions, but I think they are more suited to followup RFEs that
I'm planning to write up.

------------------------------------------------------------------------------
src/hotspot/share/compiler/oopMap.cpp
 795     *derived_loc = (oop)(cast_from_oop<address>(base) + offset);

Seems like that should also be using cast_to_oop.

------------------------------------------------------------------------------
src/hotspot/share/gc/g1/g1FullGCOopClosures.cpp
  80   HeapRegion* to = _g1h->heap_region_containing(cast_from_oop<HeapWord*>(obj));

src/hotspot/share/gc/g1/g1ParScanThreadState.hpp
 132     HeapRegion* const hr_obj = _g1h->heap_region_containing(cast_from_oop<HeapWord*>(o));

src/hotspot/share/gc/g1/heapRegion.cpp
 569     HeapRegion* to = _g1h->heap_region_containing(cast_from_oop<HeapWord*>(obj));

No casts were needed in these places, so remove them rather than
converting them to cast_from_oop.

Similarly, there are some calls to operations on mark bit maps that
don't need casts because there are overloads on oop, so should have
the casts removed rather than being converted to cast_from_oop.

Similarly for is_in_cset.  And there might be other functions like
this that I didn't notice.

------------------------------------------------------------------------------
src/hotspot/share/gc/parallel/psCardTable.cpp
  53       _unmarked_addr = (HeapWord*)(void*)p;

The addition of the cast to void* seems unnecessary.

------------------------------------------------------------------------------
src/hotspot/share/oops/oopsHierarchy.hpp

When removing this line (as was mentioned in the thread)
 113   operator void* () const { return (void *)obj(); }

I suggest also changing cast_from_oop to use oopDesc* rather than void*:
 175   return (T)(CHECK_UNHANDLED_OOPS_ONLY((oopDesc*))o);

Note that there are a lot of implicit conversions (mostly to void* I
think) that prevent us from removing (or making explicit with C++11)
the oopDesc* conversion operator.
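[Editorial note: the hazard discussed in this thread — implicit conversion operators silently converting an oop to unrelated types, versus an explicit cast_from_oop<T>() that names the target type at the call site — can be illustrated with a small standalone sketch. The types below are simplified stand-ins for illustration only, not the real HotSpot oopDesc/oop classes.]

```cpp
#include <cassert>
#include <cstdint>

// Stand-in for the object header type; NOT the real HotSpot oopDesc.
struct oopDesc {
  int field;
};

// A CHECK_UNHANDLED_OOPS-style wrapper: oop is a class, not a plain oopDesc*.
class oop {
  oopDesc* _o;
 public:
  explicit oop(oopDesc* o) : _o(o) {}
  oopDesc* obj() const { return _o; }
  // The one conversion kept in the proposal: oop -> oopDesc*.
  // An additional 'operator void*()' would also allow silent, implicit
  // conversion to unrelated pointer types, which is what the patch removes.
  operator oopDesc*() const { return _o; }
};

// Explicit, named conversion for "numerical" uses: the target type is
// spelled out at the call site instead of being picked up implicitly.
template <typename T>
T cast_from_oop(oop o) {
  return (T)o.obj();
}
```

With only the named cast available for numeric/address uses, a conversion like `cast_from_oop<uintptr_t>(o)` documents its target type, while the intended `oopDesc*` conversion still works implicitly.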
------------------------------------------------------------------------------
From matthias.baesken at sap.com  Thu Jan 23 08:03:10 2020
From: matthias.baesken at sap.com (Baesken, Matthias)
Date: Thu, 23 Jan 2020 08:03:10 +0000
Subject: RFR: 8237192: Generate stripped/public pdbs on Windows for jdk images
In-Reply-To: 
References: <215dd116-d0f6-6e35-6b02-20d3986fa85a@oracle.com>
Message-ID: 

Hi Erik, yes true, sorry for answering your comments a bit late.

> If a user runs jlink and includes all the jmods we ship with the JDK, the result
> should be essentially equivalent to the original JDK image. The way the
> stripped pdb files are included in the bundles sort of at the last
> second of the build here breaks this property.

I think we should address this in a separate bug/CR.

Looking for example into a Linux build, I see a lot of debuginfo files in the
jdk image (more or less for every shared lib). But when looking into the
jmods of that jdk image, no debuginfo files are in there (I checked the
java.base jmod). So putting the files with debug information into the jmods
seems to be something that was not done so far cross-platform (or is there
some build switch for it that I did not check?). Maybe to keep the jmods as
small as possible.

> To properly implement this, care will need to be taken to juggle the two
> sets of pdb files around, making sure each build and test use case has
> the correct one in place where and when it's needed. Quite possibly, we
> cannot cover all use cases with one build configuration. Developers
> needing the full debug symbols when debugging locally would likely need
> to disable the stripped symbols so they get the full symbols everywhere.
> Possibly this would need to be the default for debug builds and
> configurable for release builds.

From my limited experience, the developers do not work with the bundles
(which would now, after my patch, contain the stripped pdbs) but with a
"normal" jdk image that is created by make all.

Best regards, Matthias

> 
> This still does not address anything in my objection.
> 
> /Erik
> 
> On 2020-01-22 07:46, Baesken, Matthias wrote:
> > Hello, here is an updated version :
> >
> > http://cr.openjdk.java.net/~mbaesken/webrevs/8237192.3/
> >
> > this one supports a configure switch "--enable-stripped-pdbs" to enable the feature.
> >
> > Best regards, Matthias
> >
> >
> >> -----Original Message-----
> >> From: Baesken, Matthias
> >> Sent: Dienstag, 21. Januar 2020 11:03
> >> To: 'David Holmes' ; 'build-dev at openjdk.java.net' ; 'hotspot-dev at openjdk.java.net'
> >> Subject: RE: RFR: 8237192: Generate stripped/public pdbs on Windows for
> >> jdk images
> >>
> >>
> >> Hi David, yes I think it makes sense to have a configure option for this.
> >> Not everyone would like to have a larger JDK (even if it is only a bit larger).
> >>

From rwestrel at redhat.com  Thu Jan 23 08:05:57 2020
From: rwestrel at redhat.com (Roland Westrelin)
Date: Thu, 23 Jan 2020 09:05:57 +0100
Subject: [15] RFR (L): 7175279: Don't use x87 FPU on x86-64
In-Reply-To: <16d75ed8-896b-25fe-0a0a-babc9087e631@oracle.com>
References: <0b0897e0-1dbc-306d-b2cb-31de13fb8b34@oracle.com> <7063EB29-D415-4A48-BA4F-B16C5A1F52F8@oracle.com> <16d75ed8-896b-25fe-0a0a-babc9087e631@oracle.com>
Message-ID: <87h80my4uy.fsf@redhat.com>

> http://cr.openjdk.java.net/~vlivanov/7175279/webrev.00/
> +
> http://cr.openjdk.java.net/~vlivanov/7175279/webrev.01-00/

Looks good to me.

Roland.

From stefan.karlsson at oracle.com  Thu Jan 23 08:59:09 2020
From: stefan.karlsson at oracle.com (Stefan Karlsson)
Date: Thu, 23 Jan 2020 09:59:09 +0100
Subject: RFR: 8237637: Remove dubious type conversions from oop
In-Reply-To: 
References: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com>
Message-ID: 

On 2020-01-23 08:06, Kim Barrett wrote:
>> On Jan 22, 2020, at 8:33 AM, Stefan Karlsson wrote:
>>
>> Hi all,
>>
>> Please review this patch to remove some dubious type conversions from oop.
>> 
>> https://bugs.openjdk.java.net/browse/JDK-8237637
>> 
>> When running with fastdebug and CHECK_UNHANDLED_OOPS oop is not a simple oopDesc* but a class. This class has a number of type conversions with the comment:
>> 
>>   // Explict user conversions
>> 
>> However, these are not only *explicit* conversions. They can be invoked implicitly as well.
>> 
>> I propose that we get rid of most of these and only leave these two:
>>   operator void* () const
>>   operator oopDesc* () const
>> 
>> so that we get better type safety in the code.
>> 
>> I've split this up into multiple webrevs to make it easier to review:
>> 
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/00.promotedObject/
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/01.oopStar/
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/02.jobject/
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/05.voidStarVolatiles/
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/06.heapWord/
>> 
>> All changes combined:
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/all/
>> 
>> Testing: builds pass, tier1-3 running
>> 
>> Thanks,
>> StefanK
> 
> I like this.  A few minor cleanups below.  I have some additional
> suggestions, but I think they are more suited to followup RFEs that
> I'm planning to write up.
> 
> ------------------------------------------------------------------------------
> src/hotspot/share/compiler/oopMap.cpp
>  795     *derived_loc = (oop)(cast_from_oop<address>(base) + offset);
> 
> Seems like that should also be using cast_to_oop.

This patch is about fixing conversions *from* oop to other types. There
are numerous places in the VM where we convert *to* oop with casts
instead of cast_to_oop. I don't want to do those kinds of changes in
this RFE.

> ------------------------------------------------------------------------------
> src/hotspot/share/gc/g1/g1FullGCOopClosures.cpp
>   80   HeapRegion* to = _g1h->heap_region_containing(cast_from_oop<HeapWord*>(obj));
> 
> src/hotspot/share/gc/g1/g1ParScanThreadState.hpp
>  132     HeapRegion* const hr_obj = _g1h->heap_region_containing(cast_from_oop<HeapWord*>(o));
> 
> src/hotspot/share/gc/g1/heapRegion.cpp
>  569     HeapRegion* to = _g1h->heap_region_containing(cast_from_oop<HeapWord*>(obj));
> 
> No casts were needed in these places, so remove them rather than
> converting them to cast_from_oop.

Changed.

> 
> Similarly, there are some calls to operations on mark bit maps that
> don't need casts because there are overloads on oop, so should have
> the casts removed rather than being converted to cast_from_oop.

Changed.

> 
> Similarly for is_in_cset.  And there might be other functions like
> this that I didn't notice.

I can't find what you are referring to.

> 
> ------------------------------------------------------------------------------
> src/hotspot/share/gc/parallel/psCardTable.cpp
>   53       _unmarked_addr = (HeapWord*)(void*)p;
> 
> The addition of the cast to void* seems unnecessary.

Reverted.

> ------------------------------------------------------------------------------
> src/hotspot/share/oops/oopsHierarchy.hpp
> 
> When removing this line (as was mentioned in the thread)
>  113   operator void* () const { return (void *)obj(); }

Done.

> I suggest also changing cast_from_oop to use oopDesc* rather than void*:
>  175   return (T)(CHECK_UNHANDLED_OOPS_ONLY((oopDesc*))o);

Done.
> 
> Note that there are a lot of implicit conversions (mostly to void* I
> think) that prevent us from removing (or making explicit with C++11)
> the oopDesc* conversion operator.

It's not obvious to me that we want to remove the implicit conversions
to oopDesc*.

I'm going to run this (and David's suggestions) through tier1, and then
I'll push the patch.

Thanks,
StefanK

> 
> ------------------------------------------------------------------------------
> 

From stefan.karlsson at oracle.com  Thu Jan 23 09:27:53 2020
From: stefan.karlsson at oracle.com (Stefan Karlsson)
Date: Thu, 23 Jan 2020 10:27:53 +0100
Subject: RFR: 8237637: Remove dubious type conversions from oop
In-Reply-To: <63365943-2b6a-2388-5934-62d8f179b291@oracle.com>
References: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com> <63365943-2b6a-2388-5934-62d8f179b291@oracle.com>
Message-ID: <03eb9621-75c2-f583-bd3e-c3064b3daddc@oracle.com>

Hi David,

Thanks for looking at this. I've incorporated your suggestions.

StefanK

On 2020-01-23 00:56, David Holmes wrote:
> Hi Stefan,
>
> I don't have any concerns with this, but if cast_to_oop is for
> "numerical conversions" then can't we get rid of the secondary casts to
> jlong and u8 where used, i.e. can
>
>
> --- old/src/hotspot/share/jfr/leakprofiler/checkpoint/objectSampleWriter.cpp 2020-01-22 14:24:21.194074776 +0100
> +++ new/src/hotspot/share/jfr/leakprofiler/checkpoint/objectSampleWriter.cpp 2020-01-22 14:24:20.714066939 +0100
> @@ -206,7 +206,7 @@
>    oop object = oosi->_data._object;
>    assert(object != NULL, "invariant");
>    writer->write(oosi->_id);
> -  writer->write((u8)(const HeapWord*)object);
> +  writer->write((u8)cast_from_oop<const HeapWord*>(object));
>
> not use:
>
> writer->write(cast_from_oop<u8>(object));
>
> and:
>
> --- old/src/hotspot/share/jvmci/jvmciCompilerToVM.cpp 2020-01-22 14:24:22.674098943 +0100
> +++ new/src/hotspot/share/jvmci/jvmciCompilerToVM.cpp 2020-01-22 14:24:22.142090256 +0100
> @@ -431,7 +431,7 @@
>        } else if (JVMCIENV->isa_HotSpotObjectConstantImpl(base_object)) {
>          Handle base_oop = JVMCIENV->asConstant(base_object, JVMCI_CHECK_NULL);
>          if (base_oop->is_a(SystemDictionary::Class_klass())) {
> -          base_address = (jlong) (address) base_oop();
> +          base_address = (jlong) cast_from_oop<address>(base_oop());
>
> not use
>
>           base_address = cast_from_oop<jlong>(base_oop());
>
> ?
>
> Thanks,
> David
>
> On 22/01/2020 11:33 pm, Stefan Karlsson wrote:
>> Hi all,
>>
>> Please review this patch to remove some dubious type conversions from
>> oop.
>>
>> https://bugs.openjdk.java.net/browse/JDK-8237637
>>
>> When running with fastdebug and CHECK_UNHANDLED_OOPS oop is not a
>> simple oopDesc* but a class. This class has a number of type
>> conversions with the comment:
>>
>>   // Explict user conversions
>>
>> However, these are not only *explicit* conversions. They can be
>> invoked implicitly as well.
>>
>> I propose that we get rid of most of these and only leave these two:
>>   operator void* () const
>>   operator oopDesc* () const
>>
>> so that we get better type safety in the code.
>>
>> I've split this up into multiple webrevs to make it easier to review:
>>
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/00.promotedObject/
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/01.oopStar/
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/02.jobject/
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/05.voidStarVolatiles/
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/06.heapWord/
>>
>> All changes combined:
>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/all/
>>
>> Testing: builds pass, tier1-3 running
>>
>> Thanks,
>> StefanK
>>

From martin.doerr at sap.com  Thu Jan 23 11:57:17 2020
From: martin.doerr at sap.com (Doerr, Martin)
Date: Thu, 23 Jan 2020 11:57:17 +0000
Subject: RFR: 8237637: Remove dubious type conversions from oop
In-Reply-To: <03eb9621-75c2-f583-bd3e-c3064b3daddc@oracle.com>
References: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com> <63365943-2b6a-2388-5934-62d8f179b291@oracle.com> <03eb9621-75c2-f583-bd3e-c3064b3daddc@oracle.com>
Message-ID: 

Hi
Stefan, thanks a lot for cleaning this up. Could you add the following s390 part, please? Otherwise it would break the build. PPC64 seems to be fine. Best regards, Martin diff -r 55fa1be40f39 src/hotspot/cpu/s390/assembler_s390.hpp --- a/src/hotspot/cpu/s390/assembler_s390.hpp Thu Jan 23 12:44:02 2020 +0100 +++ b/src/hotspot/cpu/s390/assembler_s390.hpp Thu Jan 23 12:54:47 2020 +0100 @@ -351,14 +351,6 @@ : _address((address) addr), _rspec(rspec_from_rtype(rtype, (address) addr)) {} - AddressLiteral(oop addr, relocInfo::relocType rtype = relocInfo::none) - : _address((address) addr), - _rspec(rspec_from_rtype(rtype, (address) addr)) {} - - AddressLiteral(oop* addr, relocInfo::relocType rtype = relocInfo::none) - : _address((address) addr), - _rspec(rspec_from_rtype(rtype, (address) addr)) {} - AddressLiteral(float* addr, relocInfo::relocType rtype = relocInfo::none) : _address((address) addr), _rspec(rspec_from_rtype(rtype, (address) addr)) {} @@ -390,7 +382,6 @@ public: ExternalAddress(address target) : AddressLiteral(target, reloc_for_target( target)) {} - ExternalAddress(oop* target) : AddressLiteral(target, reloc_for_target((address) target)) {} }; // Argument is an abstraction used to represent an outgoing actual > -----Original Message----- > From: hotspot-dev On Behalf Of > Stefan Karlsson > Sent: Donnerstag, 23. Januar 2020 10:28 > To: David Holmes ; hotspot-dev dev at openjdk.java.net> > Subject: Re: RFR: 8237637: Remove dubious type conversions from oop > > Hi David, > > Thanks for looking at this. I've incorporated your suggestions. 
> > StefanK > > On 2020-01-23 00:56, David Holmes wrote: > > Hi Stefan, > > > > I don't have any concerns with this, but if cast_to_oop is for > > "numerical conversions" then can't we get rid of the secondary casts to > > jlong and u8 where used i.e can > > > > > > --- > > > old/src/hotspot/share/jfr/leakprofiler/checkpoint/objectSampleWriter.cpp > > 2020-01-22 14:24:21.194074776 +0100 > > +++ > > > new/src/hotspot/share/jfr/leakprofiler/checkpoint/objectSampleWriter.cpp > > 2020-01-22 14:24:20.714066939 +0100 > > @@ -206,7 +206,7 @@ > > ?? oop object = oosi->_data._object; > > ?? assert(object != NULL, "invariant"); > > ?? writer->write(oosi->_id); > > -? writer->write((u8)(const HeapWord*)object); > > +? writer->write((u8)cast_from_oop(object)); > > > > not use: > > > > writer->write(cast_from_oop(object)); > > > > and: > > > > --- old/src/hotspot/share/jvmci/jvmciCompilerToVM.cpp??? 2020-01-22 > > 14:24:22.674098943 +0100 > > +++ new/src/hotspot/share/jvmci/jvmciCompilerToVM.cpp??? 2020-01-22 > > 14:24:22.142090256 +0100 > > @@ -431,7 +431,7 @@ > > ?????? } else if (JVMCIENV->isa_HotSpotObjectConstantImpl(base_object)) { > > ???????? Handle base_oop = JVMCIENV->asConstant(base_object, > > JVMCI_CHECK_NULL); > > ???????? if (base_oop->is_a(SystemDictionary::Class_klass())) { > > -????????? base_address = (jlong) (address) base_oop(); > > +????????? base_address = (jlong) cast_from_oop
(base_oop()); > > > > not use > > > > ????????? base_address = cast_from_oop(base_oop()); > > > > ? > > > > Thanks, > > David > > > > On 22/01/2020 11:33 pm, Stefan Karlsson wrote: > >> Hi all, > >> > >> Please review this patch to remove some dubious type conversions from > >> oop. > >> > >> https://bugs.openjdk.java.net/browse/JDK-8237637 > >> > >> When running with fastdebug and CHECK_UNHANDLED_OOPS oop is not > a > >> simple oopDesc* but a class. This class has a number of type > >> conversions with the comment: > >> > >> ?? // Explict user conversions > >> > >> However, these are not only *explicit* conversions. They can be > >> invoked implicitly as well. > >> > >> I propose that we get rid of most of these and only leave these two: > >> ??operator void* () const > >> ??operator oopDesc* () const > >> > >> so that we get better type safety in the code. > >> > >> I've split this up into multiple webrevs to make it easier to review: > >> > >> > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/00.promotedObje > ct/ > >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/01.oopStar/ > >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/02.jobject/ > >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/ > >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/ > >> > https://cr.openjdk.java.net/~stefank/8237637/webrev.01/05.voidStarVolatil > es/ > >> > >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/06.heapWord/ > >> > >> All changes combined: > >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/all/ > >> > >> Testing: builds pass, tier1-3 running > >> > >> Thanks, > >> StefanK > >> From matthias.baesken at sap.com Thu Jan 23 13:15:28 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Thu, 23 Jan 2020 13:15:28 +0000 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: References: <172d4100-5cd2-2a99-f940-8aa25414c87a@oracle.com> 
<5d4de23f-a89d-7217-1776-51a1f3611d9a@oracle.com>
Message-ID: 

Hi Erik, new webrev:

http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.6/

I moved the settings back into the m4 files.

Best regards, Matthias

> 
> Hello Matthias,
> 
> You can keep the setting up of all the flags in flags-cflags.m4 and
> flags-ldflags.m4 based on the value of ENABLE_LINKTIME_GC. You can also
> default the value of this new parameter to true for s390x to keep the
> current behavior for that platform. As it is in this patch, the JVM
> flags for s390x are setup in configure while the JDK flags are in make,
> which gets confusing I think.
> 
> /Erik
> 
> 
> On 2020-01-22 05:33, Baesken, Matthias wrote:
> > Hi Magnus / David, here is a new webrev :
> >
> > http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.4/
> >
> > it supports now a configure switch --enable-linktime-gc=yes that needs to be set to enable the link-time section gc.
> >
> > Exception is linuxs390x where we already have the feature enabled (and keep it enabled always for LIB_JVM).
> >
> > Best regards, Matthias
> >
> >
> > From: Baesken, Matthias
> > Sent: Freitag, 17. Januar 2020 12:44
> > To: Magnus Ihse Bursie ; David Holmes ; 'build-dev at openjdk.java.net' ; 'hotspot-dev at openjdk.java.net'
> > Subject: RE: RFR: 8236714: enable link-time section-gc for linux to remove unused code
> >
> >
> > * Matthias: Have a look at some recently added option to get an indication of the best practice in adding new options. There are some ways to easily make this incorrect
> >
> > Hi Magnus, do you have a good/"best practice" example (not that I catch a bad one)?
> > > > Best regards, Matthias > > From vladimir.x.ivanov at oracle.com Thu Jan 23 13:23:20 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Thu, 23 Jan 2020 16:23:20 +0300 Subject: [15] RFR (L): 7175279: Don't use x87 FPU on x86-64 In-Reply-To: <87h80my4uy.fsf@redhat.com> References: <0b0897e0-1dbc-306d-b2cb-31de13fb8b34@oracle.com> <7063EB29-D415-4A48-BA4F-B16C5A1F52F8@oracle.com> <16d75ed8-896b-25fe-0a0a-babc9087e631@oracle.com> <87h80my4uy.fsf@redhat.com> Message-ID: Thanks, Roland. Best regards, Vladimir Ivanov On 23.01.2020 11:05, Roland Westrelin wrote: > >> http://cr.openjdk.java.net/~vlivanov/7175279/webrev.00/ >> + >> http://cr.openjdk.java.net/~vlivanov/7175279/webrev.01-00/ > > Looks good to me. > > Roland. > From stefan.karlsson at oracle.com Thu Jan 23 13:33:21 2020 From: stefan.karlsson at oracle.com (Stefan Karlsson) Date: Thu, 23 Jan 2020 14:33:21 +0100 Subject: RFR: 8237637: Remove dubious type conversions from oop In-Reply-To: References: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com> <63365943-2b6a-2388-5934-62d8f179b291@oracle.com> <03eb9621-75c2-f583-bd3e-c3064b3daddc@oracle.com> Message-ID: Hi Martin, On 2020-01-23 12:57, Doerr, Martin wrote: > Hi Stefan, > > thanks a lot for cleaning this up. > Could you add the following s390 part, please? > Otherwise it would break the build. PPC64 seems to be fine. Thanks for testing the patch. I compiled against aarch64 and Shenandoah and found more places that needed to be fixed. 
I've rolled up all changes into a new webrev: https://cr.openjdk.java.net/~stefank/8237637/webrev.02.delta/ https://cr.openjdk.java.net/~stefank/8237637/webrev.02/ Thanks, StefanK > > Best regards, > Martin > > > diff -r 55fa1be40f39 src/hotspot/cpu/s390/assembler_s390.hpp > --- a/src/hotspot/cpu/s390/assembler_s390.hpp Thu Jan 23 12:44:02 2020 +0100 > +++ b/src/hotspot/cpu/s390/assembler_s390.hpp Thu Jan 23 12:54:47 2020 +0100 > @@ -351,14 +351,6 @@ > : _address((address) addr), > _rspec(rspec_from_rtype(rtype, (address) addr)) {} > > - AddressLiteral(oop addr, relocInfo::relocType rtype = relocInfo::none) > - : _address((address) addr), > - _rspec(rspec_from_rtype(rtype, (address) addr)) {} > - > - AddressLiteral(oop* addr, relocInfo::relocType rtype = relocInfo::none) > - : _address((address) addr), > - _rspec(rspec_from_rtype(rtype, (address) addr)) {} > - > AddressLiteral(float* addr, relocInfo::relocType rtype = relocInfo::none) > : _address((address) addr), > _rspec(rspec_from_rtype(rtype, (address) addr)) {} > @@ -390,7 +382,6 @@ > > public: > ExternalAddress(address target) : AddressLiteral(target, reloc_for_target( target)) {} > - ExternalAddress(oop* target) : AddressLiteral(target, reloc_for_target((address) target)) {} > }; > > // Argument is an abstraction used to represent an outgoing actual > > > >> -----Original Message----- >> From: hotspot-dev On Behalf Of >> Stefan Karlsson >> Sent: Donnerstag, 23. Januar 2020 10:28 >> To: David Holmes ; hotspot-dev > dev at openjdk.java.net> >> Subject: Re: RFR: 8237637: Remove dubious type conversions from oop >> >> Hi David, >> >> Thanks for looking at this. I've incorporated your suggestions. 
>> >> StefanK >> >> On 2020-01-23 00:56, David Holmes wrote: >>> Hi Stefan, >>> >>> I don't have any concerns with this, but if cast_to_oop is for >>> "numerical conversions" then can't we get rid of the secondary casts to >>> jlong and u8 where used i.e can >>> >>> >>> --- >>> >> old/src/hotspot/share/jfr/leakprofiler/checkpoint/objectSampleWriter.cpp >>> 2020-01-22 14:24:21.194074776 +0100 >>> +++ >>> >> new/src/hotspot/share/jfr/leakprofiler/checkpoint/objectSampleWriter.cpp >>> 2020-01-22 14:24:20.714066939 +0100 >>> @@ -206,7 +206,7 @@ >>> ?? oop object = oosi->_data._object; >>> ?? assert(object != NULL, "invariant"); >>> ?? writer->write(oosi->_id); >>> -? writer->write((u8)(const HeapWord*)object); >>> +? writer->write((u8)cast_from_oop(object)); >>> >>> not use: >>> >>> writer->write(cast_from_oop(object)); >>> >>> and: >>> >>> --- old/src/hotspot/share/jvmci/jvmciCompilerToVM.cpp??? 2020-01-22 >>> 14:24:22.674098943 +0100 >>> +++ new/src/hotspot/share/jvmci/jvmciCompilerToVM.cpp??? 2020-01-22 >>> 14:24:22.142090256 +0100 >>> @@ -431,7 +431,7 @@ >>> ?????? } else if (JVMCIENV->isa_HotSpotObjectConstantImpl(base_object)) { >>> ???????? Handle base_oop = JVMCIENV->asConstant(base_object, >>> JVMCI_CHECK_NULL); >>> ???????? if (base_oop->is_a(SystemDictionary::Class_klass())) { >>> -????????? base_address = (jlong) (address) base_oop(); >>> +????????? base_address = (jlong) cast_from_oop
(base_oop()); >>> >>> not use >>> >>> ????????? base_address = cast_from_oop(base_oop()); >>> >>> ? >>> >>> Thanks, >>> David >>> >>> On 22/01/2020 11:33 pm, Stefan Karlsson wrote: >>>> Hi all, >>>> >>>> Please review this patch to remove some dubious type conversions from >>>> oop. >>>> >>>> https://bugs.openjdk.java.net/browse/JDK-8237637 >>>> >>>> When running with fastdebug and CHECK_UNHANDLED_OOPS oop is not >> a >>>> simple oopDesc* but a class. This class has a number of type >>>> conversions with the comment: >>>> >>>> ?? // Explict user conversions >>>> >>>> However, these are not only *explicit* conversions. They can be >>>> invoked implicitly as well. >>>> >>>> I propose that we get rid of most of these and only leave these two: >>>> ??operator void* () const >>>> ??operator oopDesc* () const >>>> >>>> so that we get better type safety in the code. >>>> >>>> I've split this up into multiple webrevs to make it easier to review: >>>> >>>> >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/00.promotedObje >> ct/ >>>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/01.oopStar/ >>>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/02.jobject/ >>>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/ >>>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/ >>>> >> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/05.voidStarVolatil >> es/ >>>> >>>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/06.heapWord/ >>>> >>>> All changes combined: >>>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/all/ >>>> >>>> Testing: builds pass, tier1-3 running >>>> >>>> Thanks, >>>> StefanK >>>> From martin.doerr at sap.com Thu Jan 23 14:40:23 2020 From: martin.doerr at sap.com (Doerr, Martin) Date: Thu, 23 Jan 2020 14:40:23 +0000 Subject: RFR: 8237637: Remove dubious type conversions from oop In-Reply-To: References: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com> 
<63365943-2b6a-2388-5934-62d8f179b291@oracle.com> <03eb9621-75c2-f583-bd3e-c3064b3daddc@oracle.com> Message-ID: Hi Stefan, thanks for adding my patch and for checking other platforms. Best regards, Martin > -----Original Message----- > From: Stefan Karlsson > Sent: Donnerstag, 23. Januar 2020 14:33 > To: Doerr, Martin ; David Holmes > ; hotspot-dev dev at openjdk.java.net> > Subject: Re: RFR: 8237637: Remove dubious type conversions from oop > > Hi Martin, > > On 2020-01-23 12:57, Doerr, Martin wrote: > > Hi Stefan, > > > > thanks a lot for cleaning this up. > > Could you add the following s390 part, please? > > Otherwise it would break the build. PPC64 seems to be fine. > > Thanks for testing the patch. I compiled against aarch64 and Shenandoah > and found more places that needed to be fixed. I've rolled up all > changes into a new webrev: > > https://cr.openjdk.java.net/~stefank/8237637/webrev.02.delta/ > https://cr.openjdk.java.net/~stefank/8237637/webrev.02/ > > Thanks, > StefanK > > > > > Best regards, > > Martin > > > > > > diff -r 55fa1be40f39 src/hotspot/cpu/s390/assembler_s390.hpp > > --- a/src/hotspot/cpu/s390/assembler_s390.hpp Thu Jan 23 12:44:02 2020 > +0100 > > +++ b/src/hotspot/cpu/s390/assembler_s390.hpp Thu Jan 23 12:54:47 > 2020 +0100 > > @@ -351,14 +351,6 @@ > > : _address((address) addr), > > _rspec(rspec_from_rtype(rtype, (address) addr)) {} > > > > - AddressLiteral(oop addr, relocInfo::relocType rtype = relocInfo::none) > > - : _address((address) addr), > > - _rspec(rspec_from_rtype(rtype, (address) addr)) {} > > - > > - AddressLiteral(oop* addr, relocInfo::relocType rtype = relocInfo::none) > > - : _address((address) addr), > > - _rspec(rspec_from_rtype(rtype, (address) addr)) {} > > - > > AddressLiteral(float* addr, relocInfo::relocType rtype = relocInfo::none) > > : _address((address) addr), > > _rspec(rspec_from_rtype(rtype, (address) addr)) {} > > @@ -390,7 +382,6 @@ > > > > public: > > ExternalAddress(address target) : 
AddressLiteral(target, > reloc_for_target( target)) {} > > - ExternalAddress(oop* target) : AddressLiteral(target, > reloc_for_target((address) target)) {} > > }; > > > > // Argument is an abstraction used to represent an outgoing actual > > > > > > > >> -----Original Message----- > >> From: hotspot-dev On Behalf > Of > >> Stefan Karlsson > >> Sent: Donnerstag, 23. Januar 2020 10:28 > >> To: David Holmes ; hotspot-dev >> dev at openjdk.java.net> > >> Subject: Re: RFR: 8237637: Remove dubious type conversions from oop > >> > >> Hi David, > >> > >> Thanks for looking at this. I've incorporated your suggestions. > >> > >> StefanK > >> > >> On 2020-01-23 00:56, David Holmes wrote: > >>> Hi Stefan, > >>> > >>> I don't have any concerns with this, but if cast_to_oop is for > >>> "numerical conversions" then can't we get rid of the secondary casts to > >>> jlong and u8 where used i.e can > >>> > >>> > >>> --- > >>> > >> > old/src/hotspot/share/jfr/leakprofiler/checkpoint/objectSampleWriter.cpp > >>> 2020-01-22 14:24:21.194074776 +0100 > >>> +++ > >>> > >> > new/src/hotspot/share/jfr/leakprofiler/checkpoint/objectSampleWriter.cpp > >>> 2020-01-22 14:24:20.714066939 +0100 > >>> @@ -206,7 +206,7 @@ > >>> ?? oop object = oosi->_data._object; > >>> ?? assert(object != NULL, "invariant"); > >>> ?? writer->write(oosi->_id); > >>> -? writer->write((u8)(const HeapWord*)object); > >>> +? writer->write((u8)cast_from_oop(object)); > >>> > >>> not use: > >>> > >>> writer->write(cast_from_oop(object)); > >>> > >>> and: > >>> > >>> --- old/src/hotspot/share/jvmci/jvmciCompilerToVM.cpp??? 2020-01-22 > >>> 14:24:22.674098943 +0100 > >>> +++ new/src/hotspot/share/jvmci/jvmciCompilerToVM.cpp??? 2020-01-22 > >>> 14:24:22.142090256 +0100 > >>> @@ -431,7 +431,7 @@ > >>> ?????? } else if (JVMCIENV- > >isa_HotSpotObjectConstantImpl(base_object)) { > >>> ???????? Handle base_oop = JVMCIENV->asConstant(base_object, > >>> JVMCI_CHECK_NULL); > >>> ???????? 
> >>>         if (base_oop->is_a(SystemDictionary::Class_klass())) {
> >>> -          base_address = (jlong) (address) base_oop();
> >>> +          base_address = (jlong) cast_from_oop<address>(base_oop());
> >>>
> >>> not use
> >>>
> >>>           base_address = cast_from_oop<jlong>(base_oop());
> >>>
> >>> ?
> >>>
> >>> Thanks,
> >>> David
> >>>
> >>> On 22/01/2020 11:33 pm, Stefan Karlsson wrote:
> >>>> Hi all,
> >>>>
> >>>> Please review this patch to remove some dubious type conversions from
> >>>> oop.
> >>>>
> >>>> https://bugs.openjdk.java.net/browse/JDK-8237637
> >>>>
> >>>> When running with fastdebug and CHECK_UNHANDLED_OOPS oop is not a
> >>>> simple oopDesc* but a class. This class has a number of type
> >>>> conversions with the comment:
> >>>>
> >>>>    // Explict user conversions
> >>>>
> >>>> However, these are not only *explicit* conversions. They can be
> >>>> invoked implicitly as well.
> >>>>
> >>>> I propose that we get rid of most of these and only leave these two:
> >>>>   operator void* () const
> >>>>   operator oopDesc* () const
> >>>>
> >>>> so that we get better type safety in the code.
> >>>>
> >>>> I've split this up into multiple webrevs to make it easier to review:
> >>>>
> >>>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/00.promotedObject/
> >>>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/01.oopStar/
> >>>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/02.jobject/
> >>>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/
> >>>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/03.address/
> >>>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/05.voidStarVolatiles/
> >>>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/06.heapWord/
> >>>>
> >>>> All changes combined:
> >>>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/all/
> >>>>
> >>>> Testing: builds pass, tier1-3 running
> >>>>
> >>>> Thanks,
> >>>> StefanK
> >>>>
From frederic.parain at oracle.com Thu Jan 23 15:03:11 2020 From: frederic.parain at oracle.com (Frederic Parain) Date: Thu, 23 Jan 2020 10:03:11 -0500 Subject: RFR[L]: 8237767 Field layout
computation overhaul Message-ID: Greetings, Please review this change proposing new code to compute field layouts. CR: https://bugs.openjdk.java.net/browse/JDK-8237767 webrev: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/ The CR includes a detailed description of the motivation and the implementation. The current version keeps the old code accessible (with a VM flag) in case the small changes in computed layouts cause trouble for some applications or tools. The goal is to get rid of this old code, preferably as soon as possible. Testing tier1-8. Thank you, Fred From aph at redhat.com Thu Jan 23 16:47:52 2020 From: aph at redhat.com (Andrew Haley) Date: Thu, 23 Jan 2020 16:47:52 +0000 Subject: 8230392: Define AArch64 as MULTI_COPY_ATOMIC In-Reply-To: <053adeb2-5f00-5cf6-c96e-97d46d37f5a3@oracle.com> References: <053adeb2-5f00-5cf6-c96e-97d46d37f5a3@oracle.com> Message-ID: <7c242604-aac9-243f-5a89-70853351aa12@redhat.com> Hi, On 1/21/20 10:58 PM, David Holmes wrote: > On 22/01/2020 3:12 am, Andrew Haley wrote: >> http://cr.openjdk.java.net/~aph/8230392/ >> >> I tested this in a bunch of ways, including bootstrapping and jtreg, but >> I'm not sure anything really stress tests this. I expect it'll be fine. >> >> OK to commit? > You obviously missed my response on the other email thread regarding the > taskQueue code. http://cr.openjdk.java.net/~aph/8230392-2/ Better? Thanks, -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From erik.joelsson at oracle.com Thu Jan 23 17:06:34 2020 From: erik.joelsson at oracle.com (Erik Joelsson) Date: Thu, 23 Jan 2020 09:06:34 -0800 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: References: <172d4100-5cd2-2a99-f940-8aa25414c87a@oracle.com> <5d4de23f-a89d-7217-1776-51a1f3611d9a@oracle.com> Message-ID: Hello, That's better, but there are still some issues.
flags-cflags.m4 Code is repeated in both the if and else blocks. jdk-options.m4 The default is now true for all platforms. I would suggest moving the s390x conditional down into an elif after the elif for "no". LibCommon.gmk Please revert whole file. /Erik On 2020-01-23 05:15, Baesken, Matthias wrote: > Hi Erik, new webrev: > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.6/ > > I moved the settings back into the m4 files. > > Best regards, Matthias > >> Hello Matthias, >> >> You can keep the setting up of all the flags in flags-cflags.m4 and >> flags-ldflags.m4 based on the value of ENABLE_LINKTIME_GC. You can also >> default the value of this new parameter to true for s390x to keep the >> current behavior for that platform. As it is in this patch, the JVM >> flags for s390x are set up in configure while the JDK flags are in make, >> which gets confusing I think. >> >> /Erik >> >> >> On 2020-01-22 05:33, Baesken, Matthias wrote: >>> Hi Magnus / David, here is a new webrev: >>> >>> http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.4/ >>> >>> it now supports a configure switch --enable-linktime-gc=yes that needs to >> be set to enable the link-time section gc. >>> The exception is linux s390x where we already have the feature enabled (and >> keep it enabled always for LIB_JVM). >>> Best regards, Matthias >>> >>> From: Baesken, Matthias >>> Sent: Friday, 17 January 2020 12:44 >>> To: Magnus Ihse Bursie; David Holmes; 'build-dev at openjdk.java.net'; 'hotspot-dev at openjdk.java.net' >>> Subject: RE: RFR: 8236714: enable link-time section-gc for linux to remove >> unused code >>> >>> * Matthias: Have a look at some recently added option to get an >> indication of the best practice in adding new options. There are some ways to >> easily make this incorrect >>> Hi Magnus, do you have a good "best practice" example (not that I catch a >> bad one :-)) ?
>>> Best regards, Matthias >>> From erik.joelsson at oracle.com Thu Jan 23 17:48:34 2020 From: erik.joelsson at oracle.com (Erik Joelsson) Date: Thu, 23 Jan 2020 09:48:34 -0800 Subject: RFR: 8237192: Generate stripped/public pdbs on Windows for jdk images In-Reply-To: References: <215dd116-d0f6-6e35-6b02-20d3986fa85a@oracle.com> Message-ID: On 2020-01-23 00:03, Baesken, Matthias wrote: > Hi Erik, yes true sorry for answering your comments a bit late . > >> If a user runs jlink and includes all the jmods we ship with the JDK, the result >> should be essentially equivalent to the original JDK image. The way the >> stripped pdb files are included in the bundles sort of at the last >> second of the build here breaks this property. > I think we should address this in a separate bug/CR . Maybe. I realize that my proposal below is quite a big task. But on the other hand, I don't think breaking the relationship between the jmods and the distribution bundles is on the table really. > Looking for example into a Linux build, I see a lot of debuginfo files in the jdk image (more or less for every shared lib) . > But when looking into the jmods of that jdk image , no debuginfo files are in there ( I checked the java.base jmod). > So putting the files with debug information into the jmods seems to be something that was not done so far cross platform (or is there some build switch for it that I did not check?) . > Maybe to keep the jmods as small as possible . No, we do not put the debuginfo files in the jmods nor the bundles because those are not intended to be shipped to customers. We are currently overlaying them into images/jdk in the build output to make them available for local debugging. (This is convoluted and I would very much like to get away from this practice at some point so that there is a 1-1 mapping between images/jdk and bundles/jdk*-bin.tar.gz.) 
The stripped pdb files you are proposing are on the contrary intended for shipping to customers (as I understand your proposal) so comparing them with the debuginfo files is not relevant. Now if MS had been kind enough to define a separate file type for the stripped pdbs, so that they could live alongside the full pdbs, we wouldn't have this issue. The heart of the problem is that only one set of files (either stripped or full) can be present and usable in images/jdk at a time. We have 2 main uses for images/jdk. 1. Developer running and debugging locally 2. Serve as the source for generating the distribution bundles We currently have one image serving both of these purposes, which is already creating complicated and convoluted build steps. To properly solve this I would argue for splitting these two apart into two different images for the two different purposes. The build procedure would then be, first build the images for distribution, only containing what should go into each bundle. Then create the developer jdk image by copying files from the distribution images. On Windows, the first image would contain the stripped pdbs and when building the second, those would get overwritten with the full pdbs. Now that I figured out a working model that would solve a bunch of other problems as well, I would love to implement it, but I doubt I will have time in the near future. /Erik > >> To properly implement this, care will need to be taken to juggle the two >> sets of pdb files around, making sure each build and test use case has >> the correct one in place where and when it's needed. Quite possibly, we >> cannot cover all use cases with one build configuration. Developers >> needing the full debug symbols when debugging locally would likely need >> to disable the stripped symbols so they get the full symbols everywhere. >> Possibly this would need to be the default for debug builds and >> configurable for release builds. 
> From my limited experience, the developers do not work with the bundles (which would now, after my patch, contain the stripped pdbs) but with a "normal" jdk image that is created by make all. > > Best regards, Matthias > > > >> This still does not address anything in my objection. >> >> /Erik >> >> On 2020-01-22 07:46, Baesken, Matthias wrote: >>> Hello, here is an updated version: >>> >>> http://cr.openjdk.java.net/~mbaesken/webrevs/8237192.3/ >>> >>> this one supports a configure switch "--enable-stripped-pdbs" to enable >> the feature. >>> Best regards, Matthias >>> >>>> -----Original Message----- >>>> From: Baesken, Matthias >>>> Sent: Tuesday, 21 January 2020 11:03 >>>> To: 'David Holmes'; 'build-dev at openjdk.java.net'; 'hotspot-dev at openjdk.java.net' >>>> Subject: RE: RFR: 8237192: Generate stripped/public pdbs on Windows for >>>> jdk images >>>> >>>> Hi David, yes I think it makes sense to have a configure option for this. >>>> Not everyone would like to have a larger JDK (even it is only a bit larger). >>>> From kim.barrett at oracle.com Thu Jan 23 19:51:52 2020 From: kim.barrett at oracle.com (Kim Barrett) Date: Thu, 23 Jan 2020 14:51:52 -0500 Subject: RFR: 8237637: Remove dubious type conversions from oop In-Reply-To: References: <0d5e0216-6de5-2bd9-d258-0e5ef7b3a3f3@oracle.com> Message-ID: <06CDB92D-5D76-4566-B96E-DFEAFC475F81@oracle.com> > On Jan 23, 2020, at 3:59 AM, Stefan Karlsson wrote: > > On 2020-01-23 08:06, Kim Barrett wrote: >>> On Jan 22, 2020, at 8:33 AM, Stefan Karlsson wrote: >>> [...] >>> All changes combined: >>> https://cr.openjdk.java.net/~stefank/8237637/webrev.01/all/ >>> >>> Testing: builds pass, tier1-3 running >>> >>> Thanks, >>> StefanK >> src/hotspot/share/compiler/oopMap.cpp >> 795 *derived_loc = (oop)(cast_from_oop<address>
(base) + offset); >> Seems like that should also be using cast_to_oop. > > This patch is about fixing conversions *from* oop to other types. There are numerous places in the VM where we convert *to* oop with casts instead of cast_to_oop. I don't want to do those kind of changes in this RFE. OK. >> Similarly for is_in_cset. And there might be other functions like >> this that I didn't notice. > > I can't find what you are referring to. Neither can I now. Sorry for the noise. >> Note that there are a lot of implicit conversions (mostly to void* I >> think) that prevent us from removing (or making explicit with C++11) >> the oopDesc* conversion operator. > > It's not obvious to me that we want to remove the implicit conversions to oopDesc*. I was looking at that possibility on the basis that implicit conversions are often problematic, so wondering if we really need this one. It turns out we do use it quite a bit. That's not surprising, since it makes the CHECK_UNHANDLED_OOPS case more like the !CHECK_UNHANDLED_OOPS case, where oop is just a typedef for oopDesc*. But it seemed worth the check. I should have mentioned the behavioral consistency as another reason to keep it though. > I'm going to run this (and David's suggestions) through tier1, I'm somewhat ambivalent about David's suggestions. One of my not yet written up suggestions for followup involves putting some constraints on the types that can be used in cast_to/from_oop, rather than allow arbitrary types (some of which are probably at least questionable, including some that are currently used). But any issues there can be dealt with as part of that followup. > and then I'll push the patch. OK. 
From david.holmes at oracle.com Thu Jan 23 22:41:14 2020 From: david.holmes at oracle.com (David Holmes) Date: Fri, 24 Jan 2020 08:41:14 +1000 Subject: 8230392: Define AArch64 as MULTI_COPY_ATOMIC In-Reply-To: <7c242604-aac9-243f-5a89-70853351aa12@redhat.com> References: <053adeb2-5f00-5cf6-c96e-97d46d37f5a3@oracle.com> <7c242604-aac9-243f-5a89-70853351aa12@redhat.com> Message-ID: <53962163-d04e-e742-20c1-441ea47ea234@oracle.com> Hi Andrew, On 24/01/2020 2:47 am, Andrew Haley wrote: > Hi, > > On 1/21/20 10:58 PM, David Holmes wrote: >> On 22/01/2020 3:12 am, Andrew Haley wrote: >>> http://cr.openjdk.java.net/~aph/8230392/ >>> >>> I tested this in a bunch of ways, including bootstrapping and jtreg, but >>> I'm not sure anything really stress tests this. I expect it'll be fine. >>> >>> OK to commit? >> You obviously missed my response on the other email thread regarding the >> taskQueue code. >> > > http://cr.openjdk.java.net/~aph/8230392-2/ > > Better? That seems fine to me. This does make me wonder whether other lock-free code in the VM needs special handling for non-CPU_MULTI_COPY_ATOMIC ?? Thanks, David > Thanks, > From david.holmes at oracle.com Thu Jan 23 22:51:59 2020 From: david.holmes at oracle.com (David Holmes) Date: Fri, 24 Jan 2020 08:51:59 +1000 Subject: RFR: 8235966: Process obsolete flags less aggressively In-Reply-To: <311314c6-7e70-49be-41de-23ae3fa7f23e@oracle.com> References: <674e4f0d-563f-3bb4-dee0-74aaacb9b270@oracle.com> <6a3de8da-746d-7e5d-9a0e-2157a0963ed8@oracle.com> <25156ff2-9dd2-53ae-16bc-83d60c0afb1a@oracle.com> <3d066cd2-87f3-56ef-803b-a1f194ac761e@oracle.com> <311314c6-7e70-49be-41de-23ae3fa7f23e@oracle.com> Message-ID: <5e54498c-5eae-76f6-3661-f053d75d0f98@oracle.com> Ping! Anyone? David On 22/01/2020 12:06 pm, David Holmes wrote: > Can I get a second review please? > > Thanks, > David > > On 22/01/2020 9:10 am, David Holmes wrote: >> Thanks Dan. 
>>
>> Will fix the "typo" - though not actually a typo as it was a
>> deliberate (mis)use of "monthly". I have always quantified "monthly"
>> this way, but it seems it is not correct. :)
>>
>> Cheers,
>> David
>>
>> On 22/01/2020 1:37 am, Daniel D. Daugherty wrote:
>>> On 1/19/20 8:49 PM, David Holmes wrote:
>>>> gtest version:
>>>>
>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v4/
>>>
>>> src/hotspot/share/runtime/arguments.hpp
>>>     No comments.
>>>
>>> src/hotspot/share/runtime/arguments.cpp
>>>     L715: // check for stale flags when we hit build 20 (which is
>>> far enough into the 6 monthly
>>>         typo: s/monthly/month/
>>>
>>> test/hotspot/gtest/runtime/test_special_flags.cpp
>>>     Nice!
>>>
>>> Thumbs up. I don't need to see a new webrev to fix the typo.
>>>
>>> Dan
>>>
>>>
>>>>
>>>> Bug report updated to show gtest output.
>>>>
>>>> Thanks,
>>>> David
>>>>
>>>> On 20/01/2020 7:46 am, David Holmes wrote:
>>>>> Hi Dan,
>>>>>
>>>>> On 18/01/2020 1:37 am, Daniel D. Daugherty wrote:
>>>>>> On 1/16/20 12:57 AM, David Holmes wrote:
>>>>>>> Getting back to this ...
>>>>>>
>>>>>> You added this update to the bug report:
>>>>>>
>>>>>>> Update: after further discussion it has been proposed that we use
>>>>>>> the build number as the trigger for a whitebox or gtest that
>>>>>>> performs the currently disabled full verification of the flag
>>>>>>> table. So if a flag has not be obsoleted or expired as it should
>>>>>>> by build N** then we fail the gtest. This will complement the
>>>>>>> relaxing of the obsoletion check at the start of the release cycle.
>>>>>>
>>>>>> I was expecting a new test in this latest webrev that would start
>>>>>> failing
>>>>>> at Build 20... let me see what the latest webrev says...
>>>>>
>>>>> Mea culpa. When I started thinking about the test it was evident
>>>>> the test logic would just involve the existing commented out
>>>>> warning logic. So I thought let's just turn those warnings on at
>>>>> build 20 but ...
>>>>> >>>>>>> >>>>>>> Please see updated webrev at: >>>>>>> >>>>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>>>>> >>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>> ???? No comments. >>>>>> >>>>>> So the changes are the same as the last round with the addition of >>>>>> enabling the following at B20: >>>>>> >>>>>> L755: ????????? warning("Global variable for obsolete special flag >>>>>> entry \"%s\" should be removed", flag.name); >>>>>> L769: ????????? warning("Global variable for expired flag entry >>>>>> \"%s\" should be removed", flag.name); >>>>>> >>>>>> >>>>>>> Apologies as I mistakenly overwrote the original instead of >>>>>>> creating v3. >>>>>>> >>>>>>> This version expands on the original proposal by uncommenting the >>>>>>> warnings about obsolete/expired flags that have not been removed >>>>>>> from globals*.hpp, so that we don't forget to do this work. >>>>>>> However these warnings are only enabled from build 20. I used 20 >>>>>>> as being approx 2/3 through the release cycle - long enough that >>>>>>> the work should have been performed by then, whilst still leaving >>>>>>> time to perform the work before RDP2. Of course we can tweak this >>>>>>> number if there are issues with that choice. >>>>>> >>>>>> Okay... but doesn't this mean that every test would issue these >>>>>> warnings >>>>>> as of B20 if we have not completed the work? So we have the >>>>>> potential of >>>>>> a raft (some unknown number) of test failures due to unexpected >>>>>> output in >>>>>> the form of these warning messages. And worse, these failures >>>>>> would be >>>>>> unrelated to actual issue... :-( >>>>> >>>>> ... as you note the end result is not really what we want - a >>>>> single clearly failing test. >>>>> >>>>>> How about adding a diagnostic flag that enables these two warning >>>>>> messages (in addition to the B20 check). Add a single test that runs: >>>>>> >>>>>> ?? 
java -XX:+UnlockDiagnosticVMOptions >>>>>> -XX:+FlagShouldNotBeDefinedCheck -version >>>>>> >>>>>> and when we hit B20, if there are still flags that haven't been >>>>>> removed, >>>>>> then the test will fail and we'll have one test that fails (X the >>>>>> number >>>>>> of configs that run the test). >>>>> >>>>> I definitely do not want to add another VM flag :) but will look at >>>>> how to (re)run that logic as part of a gtest. >>>>> >>>>> Thanks, >>>>> David >>>>> >>>>>> Dan >>>>>> >>>>>> >>>>>>> >>>>>>> Thanks, >>>>>>> David >>>>>>> >>>>>>> On 20/12/2019 8:20 am, David Holmes wrote: >>>>>>>> Thanks Dan. >>>>>>>> >>>>>>>> FTR I've updated the bug report with an extension to this >>>>>>>> proposal, which is to add back the flag table validation checks >>>>>>>> to use via a gtest that we only enable after a certain build in >>>>>>>> the release cycle (it always passes before then). That way we >>>>>>>> avoid the problems I've outlined with the initial version bump >>>>>>>> but also have a safety net in place to ensure we don't forget to >>>>>>>> actually obsolete/expire flags. >>>>>>>> >>>>>>>> Cheers, >>>>>>>> David >>>>>>>> >>>>>>>> On 19/12/2019 3:37 am, Daniel D. Daugherty wrote: >>>>>>>>> Hi David, >>>>>>>>> >>>>>>>>> On 12/17/19 5:03 PM, David Holmes wrote: >>>>>>>>>> Hi Dan, >>>>>>>>>> >>>>>>>>>> Thanks for taking a look. Updated webrev: >>>>>>>>>> >>>>>>>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v2/ >>>>>>>>> >>>>>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>>>>> ???? I like the updates to header comment for >>>>>>>>> verify_special_jvm_flags(). >>>>>>>>> >>>>>>>>> Thumbs up. >>>>>>>>> >>>>>>>>> >>>>>>>>>> >>>>>>>>>> Discussion below. >>>>>>>>> >>>>>>>>> Replies below. >>>>>>>>> >>>>>>>>> >>>>>>>>>> >>>>>>>>>> On 18/12/2019 1:47 am, Daniel D. 
Daugherty wrote: >>>>>>>>>>> On 12/16/19 12:16 AM, David Holmes wrote: >>>>>>>>>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8235966 >>>>>>>>>>>> webrev: http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>>>>>>>>>> >>>>>>>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>>>>>>> ???? L745: ????? // if flag has become obsolete it should not >>>>>>>>>>> have a "globals" flag defined anymore. >>>>>>>>>>> ???? L746: ????? if >>>>>>>>>>> (!version_less_than(JDK_Version::current(), flag.obsolete_in)) { >>>>>>>>>>> ???? L747: ??????? if (JVMFlag::find_declared_flag(flag.name) >>>>>>>>>>> != NULL) { >>>>>>>>>>> ???? L748: ????????? // Temporarily disable the warning: 8196739 >>>>>>>>>>> ???? L749: ????????? // warning("Global variable for obsolete >>>>>>>>>>> special flag entry \"%s\" should be removed", flag.name); >>>>>>>>>>> ???? L750: ??????? } >>>>>>>>>>> ???? L751: ????? } >>>>>>>>>>> ???????? It seems like we've been down a similar road before: >>>>>>>>>>> >>>>>>>>>>> ???????? JDK-8196739 Disable obsolete/expired VM flag >>>>>>>>>>> transitional warnings >>>>>>>>>>> https://bugs.openjdk.java.net/browse/JDK-8196739 >>>>>>>>>>> >>>>>>>>>>> ???????? This one may ring a bell... Fixed by dholmes in >>>>>>>>>>> jdk11-b01... :-) >>>>>>>>>>> >>>>>>>>>>> ???????? And this followup sub-task to re-enable that warning: >>>>>>>>>>> >>>>>>>>>>> ???????? JDK-8196741 Re-enable obsolete/expired VM flag >>>>>>>>>>> transitional warnings >>>>>>>>>>> https://bugs.openjdk.java.net/browse/JDK-8196741 >>>>>>>>>>> >>>>>>>>>>> ???????? was closed as "Won't fix" on 2019.08.02. >>>>>>>>>>> >>>>>>>>>>> ???????? So the obvious questions: >>>>>>>>>>> >>>>>>>>>>> ???????? - Why is the new warning less problematic to tests >>>>>>>>>>> that don't >>>>>>>>>>> ?????????? tolerate unexpected output? >>>>>>>>>> >>>>>>>>>> Two different situations. 
The commented out warning happens >>>>>>>>>> unconditionally when you run the VM and it finds any flag >>>>>>>>>> marked obsolete that hasn't been removed. Hence every single >>>>>>>>>> test will encounter this warning. >>>>>>>>> >>>>>>>>> Ouch on such verbosity. >>>>>>>>> >>>>>>>>> >>>>>>>>>> The situation I am modifying is when a test uses a flag that >>>>>>>>>> is marked for obsoletion. In the majority of cases the flag is >>>>>>>>>> already deprecated and so already issuing a deprecation >>>>>>>>>> warning that the test has to handle. Without my change there >>>>>>>>>> would still be an obsoletion warning, so this test is in for a >>>>>>>>>> warning no matter what. >>>>>>>>> >>>>>>>>> Good that your change only comes into play when the flag is used. >>>>>>>>> >>>>>>>>> >>>>>>>>>> Also note that for hotspot at least we have strived to make >>>>>>>>>> tests tolerate unexpected output. The reason JDK-8196741 was >>>>>>>>>> closed as "won't fix" was because other areas wouldn't commit >>>>>>>>>> to doing that. >>>>>>>>> >>>>>>>>> Yup. Got it. >>>>>>>>> >>>>>>>>> >>>>>>>>>> >>>>>>>>>>> ???????? - If you move forward with this fix, then I think >>>>>>>>>>> think code >>>>>>>>>>> ?????????? block needs to be removed or modified or am I >>>>>>>>>>> missing something? >>>>>>>>>> >>>>>>>>>> I've rewritten the comment at the head of >>>>>>>>>> verify_special_jvm_flags to explain why we can't issue a >>>>>>>>>> warning, and have deleted the block. >>>>>>>>> >>>>>>>>> Thanks for deleting the stale code. >>>>>>>>> >>>>>>>>> >>>>>>>>>> >>>>>>>>>>> ???????? There's a similar commented out check on L757-L765, >>>>>>>>>>> but that one >>>>>>>>>>> ???????? is for an expired flag... You might want to >>>>>>>>>>> adjust/delete it also? >>>>>>>>>> >>>>>>>>>> Deleted. >>>>>>>>> >>>>>>>>> Thanks. >>>>>>>>> >>>>>>>>> >>>>>>>>>> >>>>>>>>>>> ???? L753: warning("Special flag entry \"%s\" must be >>>>>>>>>>> explicitly obsoleted before expired.", flag.name); >>>>>>>>>>> ???? L754: ??????? 
success = false; >>>>>>>>>>> ???????? nit - s/before expired/before being expired/ >>>>>>>>>>> ???????? Update: I now see that "style" is in several places >>>>>>>>>>> in this >>>>>>>>>>> ???????????? function. I'm not sure what to think here... it >>>>>>>>>>> grates, >>>>>>>>>>> ?? ? ? ? ? ? but I can live with it. >>>>>>>>>>> >>>>>>>>>>> ???????? nit - L75[34] indented too much by two spaces. >>>>>>>>>> >>>>>>>>>> Fixed. >>>>>>>>>> >>>>>>>>>>> ???? L962: ????????? return real_name; >>>>>>>>>>> ???????? nit - indented too much by two spaces. >>>>>>>>>> >>>>>>>>>> Fixed. >>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> Trying to understand the modified logic in argument >>>>>>>>>>> processing is >>>>>>>>>>> making my head spin... >>>>>>>>>> >>>>>>>>>> Mine too. It took a few attempts to put the logic in the right >>>>>>>>>> place and make adjustments so that it all works as expected >>>>>>>>>> for a correctly specified flag and an erroneous one. >>>>>>>>> >>>>>>>>> I keep trying to convince myself that we're improving this flag >>>>>>>>> and >>>>>>>>> options code with each release... :-) >>>>>>>>> >>>>>>>>> >>>>>>>>>> >>>>>>>>>>> - You've added a call to is_obsolete_flag() in >>>>>>>>>>> ?? handle_aliases_and_deprecation() and is_obsolete_flag() >>>>>>>>>>> ?? is where the new warning is output: >>>>>>>>>>> >>>>>>>>>>> ???? warning("Temporarily processing option %s; support is >>>>>>>>>>> scheduled for removal in %s" >>>>>>>>>>> >>>>>>>>>>> ?? handle_aliases_and_deprecation() is called from six >>>>>>>>>>> different places, >>>>>>>>>>> ?? but the call sites are different based on the argument >>>>>>>>>>> pattern so I >>>>>>>>>>> ?? have (mostly) convinced myself that there should not be >>>>>>>>>>> any duplicate >>>>>>>>>>> ?? warning lines. >>>>>>>>>> >>>>>>>>>> Right - handle_aliases_and_deprecation is only called for a >>>>>>>>>> syntactically correct flag based on those patterns. 
It >>>>>>>>>> normally filters out obsoleted/expired flags and lets them >>>>>>>>>> fall through to later error processing (in process_argument >>>>>>>>>> after parse_arg returns false). That error processing is where >>>>>>>>>> the normal obsoletion check is performed. So I had to not >>>>>>>>>> filter the flag in handle_aliases_and_deprecation in this >>>>>>>>>> case, but still produce the warning for a malformed flag. E.g. >>>>>>>>>> >>>>>>>>>> java -XX:+UseParallelOldGC -version >>>>>>>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily >>>>>>>>>> processing option UseParallelOldGC; support is scheduled for >>>>>>>>>> removal in 15.0 >>>>>>>>>> java version "15-internal" 2020-09-15 >>>>>>>>>> >>>>>>>>>> java -XX:UseParallelOldGC -version >>>>>>>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily >>>>>>>>>> processing option UseParallelOldGC; support is scheduled for >>>>>>>>>> removal in 15.0 >>>>>>>>>> Missing +/- setting for VM option 'UseParallelOldGC' >>>>>>>>>> Error: Could not create the Java Virtual Machine. >>>>>>>>> >>>>>>>>> Thanks for the example. That helps a lot. >>>>>>>>> >>>>>>>>> >>>>>>>>>> >>>>>>>>>>> So I now understand the new logic that allows an obsoleted >>>>>>>>>>> option >>>>>>>>>>> to be specified with a warning as long as the option still >>>>>>>>>>> exists. >>>>>>>>>>> I'm good with the technical change, but... >>>>>>>>>>> >>>>>>>>>>> I'm worried about tests that don't tolerate the new warning >>>>>>>>>>> mesg, >>>>>>>>>>> i.e., why wouldn't this become an issue again: >>>>>>>>>>> >>>>>>>>>>> JDK-8196739 Disable obsolete/expired VM flag transitional >>>>>>>>>>> warnings >>>>>>>>>> >>>>>>>>>> Explained above. >>>>>>>>> >>>>>>>>> Yup and thanks. 
>>>>>>>>> >>>>>>>>> Dan >>>>>>>>> >>>>>>>>> >>>>>>>>>> >>>>>>>>>> Thanks, >>>>>>>>>> David >>>>>>>>>> >>>>>>>>>>> Dan >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> When a flag is marked as obsolete in the special-flags table >>>>>>>>>>>> we will ignore it and issue a warning that it is being >>>>>>>>>>>> ignored, as soon as we bump the version of the JDK. That >>>>>>>>>>>> means that any tests still using the obsolete flag may start >>>>>>>>>>>> to fail, leading to a surge of test failures at the start of >>>>>>>>>>>> a release cycle. For example for JDK 15 we have a whole >>>>>>>>>>>> bunch of JFR tests that fail because they still try to work >>>>>>>>>>>> with UseParallelOldGC. In another case >>>>>>>>>>>> runtime/cds/appcds/FieldLayoutFlags.java passes only be >>>>>>>>>>>> accident. >>>>>>>>>>>> >>>>>>>>>>>> When a flag is marked as obsolete for a release, all code >>>>>>>>>>>> involving that flag (including tests that use it) must be >>>>>>>>>>>> updated within that release and the flag itself removed. >>>>>>>>>>>> Whilst this is typically scheduled early in a release cycle >>>>>>>>>>>> it isn't reasonable to expect it to all occur within the >>>>>>>>>>>> first couple of days of the release cycle, nor do we want to >>>>>>>>>>>> have to ProblemList a bunch of tests when they start failing. >>>>>>>>>>>> >>>>>>>>>>>> What I propose is to instead allow an obsolete flag to >>>>>>>>>>>> continue to be processed as long as that code removal has >>>>>>>>>>>> not actually occurred - with an adjusted warning. The change >>>>>>>>>>>> I propose: >>>>>>>>>>>> >>>>>>>>>>>> - only treats an obsolete flag as obsolete if the flag >>>>>>>>>>>> cannot be found >>>>>>>>>>>> - added a new flag verification rule that disallows >>>>>>>>>>>> obsoletion in an undefined version, but expiration in a >>>>>>>>>>>> specific version i.e. we must always explicitly obsolete a >>>>>>>>>>>> flag before we expire it. 
>>>>>>>>>>>> >>>>>>>>>>>> The only downside here is that if we actually forget to file >>>>>>>>>>>> an issue for the actual obsoletion work we may not notice >>>>>>>>>>>> via testing. Of course whenever a change is made to the >>>>>>>>>>>> flags table to add an entry then the issue to do the >>>>>>>>>>>> obsoletion should be filed at the same time. >>>>>>>>>>>> >>>>>>>>>>>> Thanks, >>>>>>>>>>>> David >>>>>>>>>>>> ----- >>>>>>>>>>>> >>>>>>>>>>> >>>>>>>>> >>>>>> >>> From igor.ignatyev at oracle.com Thu Jan 23 23:00:13 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Thu, 23 Jan 2020 15:00:13 -0800 Subject: RFR: 8235966: Process obsolete flags less aggressively In-Reply-To: <5e54498c-5eae-76f6-3661-f053d75d0f98@oracle.com> References: <674e4f0d-563f-3bb4-dee0-74aaacb9b270@oracle.com> <6a3de8da-746d-7e5d-9a0e-2157a0963ed8@oracle.com> <25156ff2-9dd2-53ae-16bc-83d60c0afb1a@oracle.com> <3d066cd2-87f3-56ef-803b-a1f194ac761e@oracle.com> <311314c6-7e70-49be-41de-23ae3fa7f23e@oracle.com> <5e54498c-5eae-76f6-3661-f053d75d0f98@oracle.com> Message-ID: Hi David, in test_special_flags.cpp, I'd put the whole TEST_VM(special_flags, verify_special_flags) in #ifdef ASSERT / endif, other than that LGTM. -- Igor > On Jan 23, 2020, at 2:51 PM, David Holmes wrote: > > Ping! Anyone? > > David > > On 22/01/2020 12:06 pm, David Holmes wrote: >> Can I get a second review please? >> Thanks, >> David >> On 22/01/2020 9:10 am, David Holmes wrote: >>> Thanks Dan. >>> >>> Will fix the "typo" - though not actually a typo as it was a deliberate (mis)use of "monthly". I have always quantified "monthly" this way, but seems it is not correct. :) >>> >>> Cheers, >>> David >>> >>> On 22/01/2020 1:37 am, Daniel D. Daugherty wrote: >>>> On 1/19/20 8:49 PM, David Holmes wrote: >>>>> gtest version: >>>>> >>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v4/ >>>> >>>> src/hotspot/share/runtime/arguments.hpp >>>> No comments. 
>>>> >>>> src/hotspot/share/runtime/arguments.cpp >>>> L715: // check for stale flags when we hit build 20 (which is far enough into the 6 monthly >>>> typo: s/monthly/month/ >>>> >>>> test/hotspot/gtest/runtime/test_special_flags.cpp >>>> Nice! >>>> >>>> Thumbs up. I don't need to see a new webrev to fix the typo. >>>> >>>> Dan >>>> >>>> >>>>> >>>>> Bug report updated to show gtest output. >>>>> >>>>> Thanks, >>>>> David >>>>> >>>>> On 20/01/2020 7:46 am, David Holmes wrote: >>>>>> Hi Dan, >>>>>> >>>>>> On 18/01/2020 1:37 am, Daniel D. Daugherty wrote: >>>>>>> On 1/16/20 12:57 AM, David Holmes wrote: >>>>>>>> Getting back to this ... >>>>>>> >>>>>>> You added this update to the bug report: >>>>>>> >>>>>>>> Update: after further discussion it has been proposed that we use the build number as the trigger for a whitebox or gtest that performs the currently disabled full verification of the flag table. So if a flag has not been obsoleted or expired as it should by build N** then we fail the gtest. This will complement the relaxing of the obsoletion check at the start of the release cycle. >>>>>>> >>>>>>> I was expecting a new test in this latest webrev that would start failing >>>>>>> at Build 20... let me see what the latest webrev says... >>>>>> >>>>>> Mea culpa. When I started thinking about the test it was evident the test logic would just involve the existing commented out warning logic. So I thought let's just turn those warnings on at build 20 but ...
>>>>>>> >>>>>>> So the changes are the same as the last round with the addition of >>>>>>> enabling the following at B20: >>>>>>> >>>>>>> L755: warning("Global variable for obsolete special flag entry \"%s\" should be removed", flag.name); >>>>>>> L769: warning("Global variable for expired flag entry \"%s\" should be removed", flag.name); >>>>>>> >>>>>>> >>>>>>>> Apologies as I mistakenly overwrote the original instead of creating v3. >>>>>>>> >>>>>>>> This version expands on the original proposal by uncommenting the warnings about obsolete/expired flags that have not been removed from globals*.hpp, so that we don't forget to do this work. However these warnings are only enabled from build 20. I used 20 as being approx 2/3 through the release cycle - long enough that the work should have been performed by then, whilst still leaving time to perform the work before RDP2. Of course we can tweak this number if there are issues with that choice. >>>>>>> >>>>>>> Okay... but doesn't this mean that every test would issue these warnings >>>>>>> as of B20 if we have not completed the work? So we have the potential of >>>>>>> a raft (some unknown number) of test failures due to unexpected output in >>>>>>> the form of these warning messages. And worse, these failures would be >>>>>>> unrelated to the actual issue... :-( >>>>>> >>>>>> ... as you note the end result is not really what we want - a single clearly failing test. >>>>>> >>>>>>> How about adding a diagnostic flag that enables these two warning >>>>>>> messages (in addition to the B20 check). Add a single test that runs: >>>>>>> >>>>>>> java -XX:+UnlockDiagnosticVMOptions -XX:+FlagShouldNotBeDefinedCheck -version >>>>>>> >>>>>>> and when we hit B20, if there are still flags that haven't been removed, >>>>>>> then the test will fail and we'll have one test that fails (X the number >>>>>>> of configs that run the test).
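[Editor's note] The build-number gate being debated here can be sketched as a standalone model. The names below are hypothetical, not HotSpot's actual API (the real check lives in verify_special_jvm_flags in arguments.cpp): the idea is simply that the "stale globals*.hpp entry" warnings stay silent until a chosen build, here 20.

```cpp
#include <cassert>

// Hypothetical standalone model of the rule discussed above: warnings about
// obsolete/expired special-flag entries whose global variable definitions
// have not yet been removed are suppressed until build 20, roughly 2/3
// through a six-month release cycle.
const int kStaleFlagWarningBuild = 20;

bool should_warn_about_stale_entry(int current_build, bool global_still_defined) {
  // Warn only when the cleanup work is overdue AND the global variable for
  // the obsolete/expired flag is still present in the flag table.
  return global_still_defined && current_build >= kStaleFlagWarningBuild;
}
```

Before build 20 the sketch stays quiet even if cleanup is pending; from build 20 on, any lingering entry would trigger the warning (and, in the eventual gtest form, a single clear test failure).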
>>>>>> >>>>>> I definitely do not want to add another VM flag :) but will look at how to (re)run that logic as part of a gtest. >>>>>> >>>>>> Thanks, >>>>>> David >>>>>> >>>>>>> Dan >>>>>>> >>>>>>> >>>>>>>> >>>>>>>> Thanks, >>>>>>>> David >>>>>>>> >>>>>>>> On 20/12/2019 8:20 am, David Holmes wrote: >>>>>>>>> Thanks Dan. >>>>>>>>> >>>>>>>>> FTR I've updated the bug report with an extension to this proposal, which is to add back the flag table validation checks to use via a gtest that we only enable after a certain build in the release cycle (it always passes before then). That way we avoid the problems I've outlined with the initial version bump but also have a safety net in place to ensure we don't forget to actually obsolete/expire flags. >>>>>>>>> >>>>>>>>> Cheers, >>>>>>>>> David >>>>>>>>> >>>>>>>>> On 19/12/2019 3:37 am, Daniel D. Daugherty wrote: >>>>>>>>>> Hi David, >>>>>>>>>> >>>>>>>>>> On 12/17/19 5:03 PM, David Holmes wrote: >>>>>>>>>>> Hi Dan, >>>>>>>>>>> >>>>>>>>>>> Thanks for taking a look. Updated webrev: >>>>>>>>>>> >>>>>>>>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v2/ >>>>>>>>>> >>>>>>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>>>>>> I like the updates to header comment for verify_special_jvm_flags(). >>>>>>>>>> >>>>>>>>>> Thumbs up. >>>>>>>>>> >>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> Discussion below. >>>>>>>>>> >>>>>>>>>> Replies below. >>>>>>>>>> >>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> On 18/12/2019 1:47 am, Daniel D. Daugherty wrote: >>>>>>>>>>>> On 12/16/19 12:16 AM, David Holmes wrote: >>>>>>>>>>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8235966 >>>>>>>>>>>>> webrev: http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>>>>>>>>>>> >>>>>>>>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>>>>>>>> L745: // if flag has become obsolete it should not have a "globals" flag defined anymore. 
>>>>>>>>>>>> L746: if (!version_less_than(JDK_Version::current(), flag.obsolete_in)) { >>>>>>>>>>>> L747: if (JVMFlag::find_declared_flag(flag.name) != NULL) { >>>>>>>>>>>> L748: // Temporarily disable the warning: 8196739 >>>>>>>>>>>> L749: // warning("Global variable for obsolete special flag entry \"%s\" should be removed", flag.name); >>>>>>>>>>>> L750: } >>>>>>>>>>>> L751: } >>>>>>>>>>>> It seems like we've been down a similar road before: >>>>>>>>>>>> >>>>>>>>>>>> JDK-8196739 Disable obsolete/expired VM flag transitional warnings >>>>>>>>>>>> https://bugs.openjdk.java.net/browse/JDK-8196739 >>>>>>>>>>>> >>>>>>>>>>>> This one may ring a bell... Fixed by dholmes in jdk11-b01... :-) >>>>>>>>>>>> >>>>>>>>>>>> And this followup sub-task to re-enable that warning: >>>>>>>>>>>> >>>>>>>>>>>> JDK-8196741 Re-enable obsolete/expired VM flag transitional warnings >>>>>>>>>>>> https://bugs.openjdk.java.net/browse/JDK-8196741 >>>>>>>>>>>> >>>>>>>>>>>> was closed as "Won't fix" on 2019.08.02. >>>>>>>>>>>> >>>>>>>>>>>> So the obvious questions: >>>>>>>>>>>> >>>>>>>>>>>> - Why is the new warning less problematic to tests that don't >>>>>>>>>>>> tolerate unexpected output? >>>>>>>>>>> >>>>>>>>>>> Two different situations. The commented out warning happens unconditionally when you run the VM and it finds any flag marked obsolete that hasn't been removed. Hence every single test will encounter this warning. >>>>>>>>>> >>>>>>>>>> Ouch on such verbosity. >>>>>>>>>> >>>>>>>>>> >>>>>>>>>>> The situation I am modifying is when a test uses a flag that is marked for obsoletion. In the majority of cases the flag is already deprecated and so already issuing a deprecation warning that the test has to handle. Without my change there would still be an obsoletion warning, so this test is in for a warning no matter what. >>>>>>>>>> >>>>>>>>>> Good that your change only comes into play when the flag is used. 
>>>>>>>>>> >>>>>>>>>> >>>>>>>>>>> Also note that for hotspot at least we have strived to make tests tolerate unexpected output. The reason JDK-8196741 was closed as "won't fix" was because other areas wouldn't commit to doing that. >>>>>>>>>> >>>>>>>>>> Yup. Got it. >>>>>>>>>> >>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> - If you move forward with this fix, then I think this code >>>>>>>>>>>> block needs to be removed or modified or am I missing something? >>>>>>>>>>> >>>>>>>>>>> I've rewritten the comment at the head of verify_special_jvm_flags to explain why we can't issue a warning, and have deleted the block. >>>>>>>>>> >>>>>>>>>> Thanks for deleting the stale code. >>>>>>>>>> >>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> There's a similar commented out check on L757-L765, but that one >>>>>>>>>>>> is for an expired flag... You might want to adjust/delete it also? >>>>>>>>>>> >>>>>>>>>>> Deleted. >>>>>>>>>> >>>>>>>>>> Thanks. >>>>>>>>>> >>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> L753: warning("Special flag entry \"%s\" must be explicitly obsoleted before expired.", flag.name); >>>>>>>>>>>> L754: success = false; >>>>>>>>>>>> nit - s/before expired/before being expired/ >>>>>>>>>>>> Update: I now see that "style" is in several places in this >>>>>>>>>>>> function. I'm not sure what to think here... it grates, >>>>>>>>>>>> but I can live with it. >>>>>>>>>>>> >>>>>>>>>>>> nit - L75[34] indented too much by two spaces. >>>>>>>>>>> >>>>>>>>>>> Fixed. >>>>>>>>>>> >>>>>>>>>>>> L962: return real_name; >>>>>>>>>>>> nit - indented too much by two spaces. >>>>>>>>>>> >>>>>>>>>>> Fixed. >>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> Trying to understand the modified logic in argument processing is >>>>>>>>>>>> making my head spin... >>>>>>>>>>> >>>>>>>>>>> Mine too. It took a few attempts to put the logic in the right place and make adjustments so that it all works as expected for a correctly specified flag and an erroneous one.
>>>>>>>>>> >>>>>>>>>> I keep trying to convince myself that we're improving this flag and >>>>>>>>>> options code with each release... :-) >>>>>>>>>> >>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> - You've added a call to is_obsolete_flag() in >>>>>>>>>>>> handle_aliases_and_deprecation() and is_obsolete_flag() >>>>>>>>>>>> is where the new warning is output: >>>>>>>>>>>> >>>>>>>>>>>> warning("Temporarily processing option %s; support is scheduled for removal in %s" >>>>>>>>>>>> >>>>>>>>>>>> handle_aliases_and_deprecation() is called from six different places, >>>>>>>>>>>> but the call sites are different based on the argument pattern so I >>>>>>>>>>>> have (mostly) convinced myself that there should not be any duplicate >>>>>>>>>>>> warning lines. >>>>>>>>>>> >>>>>>>>>>> Right - handle_aliases_and_deprecation is only called for a syntactically correct flag based on those patterns. It normally filters out obsoleted/expired flags and lets them fall through to later error processing (in process_argument after parse_arg returns false). That error processing is where the normal obsoletion check is performed. So I had to not filter the flag in handle_aliases_and_deprecation in this case, but still produce the warning for a malformed flag. E.g. >>>>>>>>>>> >>>>>>>>>>> java -XX:+UseParallelOldGC -version >>>>>>>>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily processing option UseParallelOldGC; support is scheduled for removal in 15.0 >>>>>>>>>>> java version "15-internal" 2020-09-15 >>>>>>>>>>> >>>>>>>>>>> java -XX:UseParallelOldGC -version >>>>>>>>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily processing option UseParallelOldGC; support is scheduled for removal in 15.0 >>>>>>>>>>> Missing +/- setting for VM option 'UseParallelOldGC' >>>>>>>>>>> Error: Could not create the Java Virtual Machine. >>>>>>>>>> >>>>>>>>>> Thanks for the example. That helps a lot. 
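[Editor's note] The relaxed obsoletion check illustrated by the UseParallelOldGC example above can be sketched as a standalone model. All names and the two-entry flag table below are hypothetical, not HotSpot's real data structures; the real logic sits in arguments.cpp around JVMFlag::find_declared_flag.

```cpp
#include <cassert>
#include <cstring>

// Hypothetical stand-in for the declared-flags table (globals*.hpp entries).
static const char* const kDeclaredFlags[] = { "UseParallelOldGC", "UseCompressedOops" };
static const int kNumDeclaredFlags = 2;

bool find_declared_flag(const char* name) {
  for (int i = 0; i < kNumDeclaredFlags; i++) {
    if (strcmp(kDeclaredFlags[i], name) == 0) return true;
  }
  return false;
}

// The relaxed rule: a flag marked obsolete in the special-flags table is only
// rejected once its global definition has actually been removed, i.e. once it
// can no longer be found among the declared flags. A still-declared flag is
// processed normally, with only the "Temporarily processing option ..." warning.
bool reject_as_obsolete(const char* name, bool marked_obsolete) {
  return marked_obsolete && !find_declared_flag(name);
}

// The companion verification rule from the proposal: an entry may only expire
// in a specific version if it was also explicitly obsoleted first
// (0 models an undefined version here).
bool special_entry_is_valid(int obsolete_in, int expire_in) {
  return !(obsolete_in == 0 && expire_in != 0);
}
```

So a marked-obsolete flag whose code still exists (like UseParallelOldGC at the start of the JDK 15 cycle) keeps working with a warning, while a flag that was genuinely removed is rejected as before.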
>>>>>>>>>> >>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> So I now understand the new logic that allows an obsoleted option >>>>>>>>>>>> to be specified with a warning as long as the option still exists. >>>>>>>>>>>> I'm good with the technical change, but... >>>>>>>>>>>> >>>>>>>>>>>> I'm worried about tests that don't tolerate the new warning mesg, >>>>>>>>>>>> i.e., why wouldn't this become an issue again: >>>>>>>>>>>> >>>>>>>>>>>> JDK-8196739 Disable obsolete/expired VM flag transitional warnings >>>>>>>>>>> >>>>>>>>>>> Explained above. >>>>>>>>>> >>>>>>>>>> Yup and thanks. >>>>>>>>>> >>>>>>>>>> Dan >>>>>>>>>> >>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> Thanks, >>>>>>>>>>> David >>>>>>>>>>> >>>>>>>>>>>> Dan >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> When a flag is marked as obsolete in the special-flags table we will ignore it and issue a warning that it is being ignored, as soon as we bump the version of the JDK. That means that any tests still using the obsolete flag may start to fail, leading to a surge of test failures at the start of a release cycle. For example for JDK 15 we have a whole bunch of JFR tests that fail because they still try to work with UseParallelOldGC. In another case runtime/cds/appcds/FieldLayoutFlags.java passes only by accident. >>>>>>>>>>>>> >>>>>>>>>>>>> When a flag is marked as obsolete for a release, all code involving that flag (including tests that use it) must be updated within that release and the flag itself removed. Whilst this is typically scheduled early in a release cycle it isn't reasonable to expect it to all occur within the first couple of days of the release cycle, nor do we want to have to ProblemList a bunch of tests when they start failing. >>>>>>>>>>>>> >>>>>>>>>>>>> What I propose is to instead allow an obsolete flag to continue to be processed as long as that code removal has not actually occurred - with an adjusted warning.
The change I propose: >>>>>>>>>>>>> >>>>>>>>>>>>> - only treats an obsolete flag as obsolete if the flag cannot be found >>>>>>>>>>>>> - added a new flag verification rule that disallows obsoletion in an undefined version, but expiration in a specific version i.e. we must always explicitly obsolete a flag before we expire it. >>>>>>>>>>>>> >>>>>>>>>>>>> The only downside here is that if we actually forget to file an issue for the actual obsoletion work we may not notice via testing. Of course whenever a change is made to the flags table to add an entry then the issue to do the obsoletion should be filed at the same time. >>>>>>>>>>>>> >>>>>>>>>>>>> Thanks, >>>>>>>>>>>>> David >>>>>>>>>>>>> ----- >>>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>> >>>>>>> >>>> From david.holmes at oracle.com Thu Jan 23 23:10:17 2020 From: david.holmes at oracle.com (David Holmes) Date: Fri, 24 Jan 2020 09:10:17 +1000 Subject: RFR: 8235966: Process obsolete flags less aggressively In-Reply-To: References: <674e4f0d-563f-3bb4-dee0-74aaacb9b270@oracle.com> <6a3de8da-746d-7e5d-9a0e-2157a0963ed8@oracle.com> <25156ff2-9dd2-53ae-16bc-83d60c0afb1a@oracle.com> <3d066cd2-87f3-56ef-803b-a1f194ac761e@oracle.com> <311314c6-7e70-49be-41de-23ae3fa7f23e@oracle.com> <5e54498c-5eae-76f6-3661-f053d75d0f98@oracle.com> Message-ID: <4a8f2653-ee54-6d05-bf8e-0332840807cf@oracle.com> Hi Igor, On 24/01/2020 9:00 am, Igor Ignatyev wrote: > Hi David, > > in test_special_flags.cpp, I'd put the whole TEST_VM(special_flags, verify_special_flags) in #ifdef ASSERT / endif, other than that LGTM. I can try that - I wasn't sure if an empty test file would cause a problem. Thanks for the review! David > -- Igor > >> On Jan 23, 2020, at 2:51 PM, David Holmes wrote: >> >> Ping! Anyone? >> >> David >> >> On 22/01/2020 12:06 pm, David Holmes wrote: >>> Can I get a second review please? >>> Thanks, >>> David >>> On 22/01/2020 9:10 am, David Holmes wrote: >>>> Thanks Dan. 
>>>> >>>> Will fix the "typo" - though not actually a typo as it was a deliberate (mis)use of "monthly". I have always quantified "monthly" this way, but seems it is not correct. :) >>>> >>>> Cheers, >>>> David >>>> >>>> On 22/01/2020 1:37 am, Daniel D. Daugherty wrote: >>>>> On 1/19/20 8:49 PM, David Holmes wrote: >>>>>> gtest version: >>>>>> >>>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v4/ >>>>> >>>>> src/hotspot/share/runtime/arguments.hpp >>>>> No comments. >>>>> >>>>> src/hotspot/share/runtime/arguments.cpp >>>>> L715: // check for stale flags when we hit build 20 (which is far enough into the 6 monthly >>>>> typo: s/monthly/month/ >>>>> >>>>> test/hotspot/gtest/runtime/test_special_flags.cpp >>>>> Nice! >>>>> >>>>> Thumbs up. I don't need to see a new webrev to fix the typo. >>>>> >>>>> Dan >>>>> >>>>> >>>>>> >>>>>> Bug report updated to show gtest output. >>>>>> >>>>>> Thanks, >>>>>> David >>>>>> >>>>>> On 20/01/2020 7:46 am, David Holmes wrote: >>>>>>> Hi Dan, >>>>>>> >>>>>>> On 18/01/2020 1:37 am, Daniel D. Daugherty wrote: >>>>>>>> On 1/16/20 12:57 AM, David Holmes wrote: >>>>>>>>> Getting back to this ... >>>>>>>> >>>>>>>> You added this update to the bug report: >>>>>>>> >>>>>>>>> Update: after further discussion it has been proposed that we use the build number as the trigger for a whitebox or gtest that performs the currently disabled full verification of the flag table. So if a flag has not been obsoleted or expired as it should by build N** then we fail the gtest. This will complement the relaxing of the obsoletion check at the start of the release cycle. >>>>>>>> >>>>>>>> I was expecting a new test in this latest webrev that would start failing >>>>>>>> at Build 20... let me see what the latest webrev says... >>>>>>> >>>>>>> Mea culpa. When I started thinking about the test it was evident the test logic would just involve the existing commented out warning logic. So I thought let's just turn those warnings on at build 20 but ...
>>>>>>> >>>>>>>>> >>>>>>>>> Please see updated webrev at: >>>>>>>>> >>>>>>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>>>>>>> >>>>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>>>> No comments. >>>>>>>> >>>>>>>> So the changes are the same as the last round with the addition of >>>>>>>> enabling the following at B20: >>>>>>>> >>>>>>>> L755: warning("Global variable for obsolete special flag entry \"%s\" should be removed", flag.name); >>>>>>>> L769: warning("Global variable for expired flag entry \"%s\" should be removed", flag.name); >>>>>>>> >>>>>>>> >>>>>>>>> Apologies as I mistakenly overwrote the original instead of creating v3. >>>>>>>>> >>>>>>>>> This version expands on the original proposal by uncommenting the warnings about obsolete/expired flags that have not been removed from globals*.hpp, so that we don't forget to do this work. However these warnings are only enabled from build 20. I used 20 as being approx 2/3 through the release cycle - long enough that the work should have been performed by then, whilst still leaving time to perform the work before RDP2. Of course we can tweak this number if there are issues with that choice. >>>>>>>> >>>>>>>> Okay... but doesn't this mean that every test would issue these warnings >>>>>>>> as of B20 if we have not completed the work? So we have the potential of >>>>>>>> a raft (some unknown number) of test failures due to unexpected output in >>>>>>>> the form of these warning messages. And worse, these failures would be >>>>>>>> unrelated to the actual issue... :-( >>>>>>> >>>>>>> ... as you note the end result is not really what we want - a single clearly failing test. >>>>>>> >>>>>>>> How about adding a diagnostic flag that enables these two warning >>>>>>>> messages (in addition to the B20 check).
Add a single test that runs: >>>>>>>> >>>>>>>> java -XX:+UnlockDiagnosticVMOptions -XX:+FlagShouldNotBeDefinedCheck -version >>>>>>>> >>>>>>>> and when we hit B20, if there are still flags that haven't been removed, >>>>>>>> then the test will fail and we'll have one test that fails (X the number >>>>>>>> of configs that run the test). >>>>>>> >>>>>>> I definitely do not want to add another VM flag :) but will look at how to (re)run that logic as part of a gtest. >>>>>>> >>>>>>> Thanks, >>>>>>> David >>>>>>> >>>>>>>> Dan >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>> Thanks, >>>>>>>>> David >>>>>>>>> >>>>>>>>> On 20/12/2019 8:20 am, David Holmes wrote: >>>>>>>>>> Thanks Dan. >>>>>>>>>> >>>>>>>>>> FTR I've updated the bug report with an extension to this proposal, which is to add back the flag table validation checks to use via a gtest that we only enable after a certain build in the release cycle (it always passes before then). That way we avoid the problems I've outlined with the initial version bump but also have a safety net in place to ensure we don't forget to actually obsolete/expire flags. >>>>>>>>>> >>>>>>>>>> Cheers, >>>>>>>>>> David >>>>>>>>>> >>>>>>>>>> On 19/12/2019 3:37 am, Daniel D. Daugherty wrote: >>>>>>>>>>> Hi David, >>>>>>>>>>> >>>>>>>>>>> On 12/17/19 5:03 PM, David Holmes wrote: >>>>>>>>>>>> Hi Dan, >>>>>>>>>>>> >>>>>>>>>>>> Thanks for taking a look. Updated webrev: >>>>>>>>>>>> >>>>>>>>>>>> http://cr.openjdk.java.net/~dholmes/8235966/webrev.v2/ >>>>>>>>>>> >>>>>>>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>>>>>>> I like the updates to header comment for verify_special_jvm_flags(). >>>>>>>>>>> >>>>>>>>>>> Thumbs up. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> Discussion below. >>>>>>>>>>> >>>>>>>>>>> Replies below. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> On 18/12/2019 1:47 am, Daniel D. 
Daugherty wrote: >>>>>>>>>>>>> On 12/16/19 12:16 AM, David Holmes wrote: >>>>>>>>>>>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8235966 >>>>>>>>>>>>>> webrev: http://cr.openjdk.java.net/~dholmes/8235966/webrev/ >>>>>>>>>>>>> >>>>>>>>>>>>> src/hotspot/share/runtime/arguments.cpp >>>>>>>>>>>>> L745: // if flag has become obsolete it should not have a "globals" flag defined anymore. >>>>>>>>>>>>> L746: if (!version_less_than(JDK_Version::current(), flag.obsolete_in)) { >>>>>>>>>>>>> L747: if (JVMFlag::find_declared_flag(flag.name) != NULL) { >>>>>>>>>>>>> L748: // Temporarily disable the warning: 8196739 >>>>>>>>>>>>> L749: // warning("Global variable for obsolete special flag entry \"%s\" should be removed", flag.name); >>>>>>>>>>>>> L750: } >>>>>>>>>>>>> L751: } >>>>>>>>>>>>> It seems like we've been down a similar road before: >>>>>>>>>>>>> >>>>>>>>>>>>> JDK-8196739 Disable obsolete/expired VM flag transitional warnings >>>>>>>>>>>>> https://bugs.openjdk.java.net/browse/JDK-8196739 >>>>>>>>>>>>> >>>>>>>>>>>>> This one may ring a bell... Fixed by dholmes in jdk11-b01... :-) >>>>>>>>>>>>> >>>>>>>>>>>>> And this followup sub-task to re-enable that warning: >>>>>>>>>>>>> >>>>>>>>>>>>> JDK-8196741 Re-enable obsolete/expired VM flag transitional warnings >>>>>>>>>>>>> https://bugs.openjdk.java.net/browse/JDK-8196741 >>>>>>>>>>>>> >>>>>>>>>>>>> was closed as "Won't fix" on 2019.08.02. >>>>>>>>>>>>> >>>>>>>>>>>>> So the obvious questions: >>>>>>>>>>>>> >>>>>>>>>>>>> - Why is the new warning less problematic to tests that don't >>>>>>>>>>>>> tolerate unexpected output? >>>>>>>>>>>> >>>>>>>>>>>> Two different situations. The commented out warning happens unconditionally when you run the VM and it finds any flag marked obsolete that hasn't been removed. Hence every single test will encounter this warning. >>>>>>>>>>> >>>>>>>>>>> Ouch on such verbosity. 
>>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> The situation I am modifying is when a test uses a flag that is marked for obsoletion. In the majority of cases the flag is already deprecated and so already issuing a deprecation warning that the test has to handle. Without my change there would still be an obsoletion warning, so this test is in for a warning no matter what. >>>>>>>>>>> >>>>>>>>>>> Good that your change only comes into play when the flag is used. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> Also note that for hotspot at least we have strived to make tests tolerate unexpected output. The reason JDK-8196741 was closed as "won't fix" was because other areas wouldn't commit to doing that. >>>>>>>>>>> >>>>>>>>>>> Yup. Got it. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>>> - If you move forward with this fix, then I think this code >>>>>>>>>>>>> block needs to be removed or modified or am I missing something? >>>>>>>>>>>> >>>>>>>>>>>> I've rewritten the comment at the head of verify_special_jvm_flags to explain why we can't issue a warning, and have deleted the block. >>>>>>>>>>> >>>>>>>>>>> Thanks for deleting the stale code. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>>> There's a similar commented out check on L757-L765, but that one >>>>>>>>>>>>> is for an expired flag... You might want to adjust/delete it also? >>>>>>>>>>>> >>>>>>>>>>>> Deleted. >>>>>>>>>>> >>>>>>>>>>> Thanks. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>>> L753: warning("Special flag entry \"%s\" must be explicitly obsoleted before expired.", flag.name); >>>>>>>>>>>>> L754: success = false; >>>>>>>>>>>>> nit - s/before expired/before being expired/ >>>>>>>>>>>>> Update: I now see that "style" is in several places in this >>>>>>>>>>>>> function. I'm not sure what to think here... it grates, >>>>>>>>>>>>> but I can live with it. >>>>>>>>>>>>> >>>>>>>>>>>>> nit - L75[34] indented too much by two spaces. >>>>>>>>>>>> >>>>>>>>>>>> Fixed.
>>>>>>>>>>>> >>>>>>>>>>>>> L962: return real_name; >>>>>>>>>>>>> nit - indented too much by two spaces. >>>>>>>>>>>> >>>>>>>>>>>> Fixed. >>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> Trying to understand the modified logic in argument processing is >>>>>>>>>>>>> making my head spin... >>>>>>>>>>>> >>>>>>>>>>>> Mine too. It took a few attempts to put the logic in the right place and make adjustments so that it all works as expected for a correctly specified flag and an erroneous one. >>>>>>>>>>> >>>>>>>>>>> I keep trying to convince myself that we're improving this flag and >>>>>>>>>>> options code with each release... :-) >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>>> - You've added a call to is_obsolete_flag() in >>>>>>>>>>>>> handle_aliases_and_deprecation() and is_obsolete_flag() >>>>>>>>>>>>> is where the new warning is output: >>>>>>>>>>>>> >>>>>>>>>>>>> warning("Temporarily processing option %s; support is scheduled for removal in %s" >>>>>>>>>>>>> >>>>>>>>>>>>> handle_aliases_and_deprecation() is called from six different places, >>>>>>>>>>>>> but the call sites are different based on the argument pattern so I >>>>>>>>>>>>> have (mostly) convinced myself that there should not be any duplicate >>>>>>>>>>>>> warning lines. >>>>>>>>>>>> >>>>>>>>>>>> Right - handle_aliases_and_deprecation is only called for a syntactically correct flag based on those patterns. It normally filters out obsoleted/expired flags and lets them fall through to later error processing (in process_argument after parse_arg returns false). That error processing is where the normal obsoletion check is performed. So I had to not filter the flag in handle_aliases_and_deprecation in this case, but still produce the warning for a malformed flag. E.g. 
>>>>>>>>>>>> >>>>>>>>>>>> java -XX:+UseParallelOldGC -version >>>>>>>>>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily processing option UseParallelOldGC; support is scheduled for removal in 15.0 >>>>>>>>>>>> java version "15-internal" 2020-09-15 >>>>>>>>>>>> >>>>>>>>>>>> java -XX:UseParallelOldGC -version >>>>>>>>>>>> Java HotSpot(TM) 64-Bit Server VM warning: Temporarily processing option UseParallelOldGC; support is scheduled for removal in 15.0 >>>>>>>>>>>> Missing +/- setting for VM option 'UseParallelOldGC' >>>>>>>>>>>> Error: Could not create the Java Virtual Machine. >>>>>>>>>>> >>>>>>>>>>> Thanks for the example. That helps a lot. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>>> So I now understand the new logic that allows an obsoleted option >>>>>>>>>>>>> to be specified with a warning as long as the option still exists. >>>>>>>>>>>>> I'm good with the technical change, but... >>>>>>>>>>>>> >>>>>>>>>>>>> I'm worried about tests that don't tolerate the new warning mesg, >>>>>>>>>>>>> i.e., why wouldn't this become an issue again: >>>>>>>>>>>>> >>>>>>>>>>>>> JDK-8196739 Disable obsolete/expired VM flag transitional warnings >>>>>>>>>>>> >>>>>>>>>>>> Explained above. >>>>>>>>>>> >>>>>>>>>>> Yup and thanks. >>>>>>>>>>> >>>>>>>>>>> Dan >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> Thanks, >>>>>>>>>>>> David >>>>>>>>>>>> >>>>>>>>>>>>> Dan >>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> When a flag is marked as obsolete in the special-flags table we will ignore it and issue a warning that it is being ignored, as soon as we bump the version of the JDK. That means that any tests still using the obsolete flag may start to fail, leading to a surge of test failures at the start of a release cycle. For example for JDK 15 we have a whole bunch of JFR tests that fail because they still try to work with UseParallelOldGC. In another case runtime/cds/appcds/FieldLayoutFlags.java passes only by accident.
>>>>>>>>>>>>>> >>>>>>>>>>>>>> When a flag is marked as obsolete for a release, all code involving that flag (including tests that use it) must be updated within that release and the flag itself removed. Whilst this is typically scheduled early in a release cycle it isn't reasonable to expect it to all occur within the first couple of days of the release cycle, nor do we want to have to ProblemList a bunch of tests when they start failing. >>>>>>>>>>>>>> >>>>>>>>>>>>>> What I propose is to instead allow an obsolete flag to continue to be processed as long as that code removal has not actually occurred - with an adjusted warning. The change I propose: >>>>>>>>>>>>>> >>>>>>>>>>>>>> - only treats an obsolete flag as obsolete if the flag cannot be found >>>>>>>>>>>>>> - added a new flag verification rule that disallows obsoletion in an undefined version, but expiration in a specific version i.e. we must always explicitly obsolete a flag before we expire it. >>>>>>>>>>>>>> >>>>>>>>>>>>>> The only downside here is that if we actually forget to file an issue for the actual obsoletion work we may not notice via testing. Of course whenever a change is made to the flags table to add an entry then the issue to do the obsoletion should be filed at the same time. >>>>>>>>>>>>>> >>>>>>>>>>>>>> Thanks, >>>>>>>>>>>>>> David >>>>>>>>>>>>>> ----- >>>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>> >>>>>>>> >>>>> > From shade at redhat.com Fri Jan 24 08:46:05 2020 From: shade at redhat.com (Aleksey Shipilev) Date: Fri, 24 Jan 2020 09:46:05 +0100 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: References: Message-ID: Hi Frederic, On 1/23/20 4:03 PM, Frederic Parain wrote: > CR: https://bugs.openjdk.java.net/browse/JDK-8237767 > webrev: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/ > > The CR includes a detailed description of the motivation and the implementation. I ran the patched jdk through JOL Samples [1], and it works impressively well. 
I think these RFRs can be closed as duplicates now: https://bugs.openjdk.java.net/browse/JDK-8024912 https://bugs.openjdk.java.net/browse/JDK-8024912 Nit: I think the flag should be UseNew*Field*Layout. -- Thanks, -Aleksey [1] https://hg.openjdk.java.net/code-tools/jol/file/f1e978b895a0/jol-samples/src/main/java/org/openjdk/jol/samples From shade at redhat.com Fri Jan 24 08:46:55 2020 From: shade at redhat.com (Aleksey Shipilev) Date: Fri, 24 Jan 2020 09:46:55 +0100 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: References: Message-ID: <668eab07-adc7-548e-546d-1cc6b314f13e@redhat.com> On 1/24/20 9:46 AM, Aleksey Shipilev wrote: > I ran the patched jdk through JOL Samples [1], and it works impressively well. I think these RFRs > can be closed as duplicates now: > https://bugs.openjdk.java.net/browse/JDK-8024912 > https://bugs.openjdk.java.net/browse/JDK-8024912 I meant https://bugs.openjdk.java.net/browse/JDK-8024913 on the second line, of course. -- Thanks, -Aleksey From adinn at redhat.com Fri Jan 24 09:01:09 2020 From: adinn at redhat.com (Andrew Dinn) Date: Fri, 24 Jan 2020 09:01:09 +0000 Subject: 8230392: Define AArch64 as MULTI_COPY_ATOMIC In-Reply-To: <53962163-d04e-e742-20c1-441ea47ea234@oracle.com> References: <053adeb2-5f00-5cf6-c96e-97d46d37f5a3@oracle.com> <7c242604-aac9-243f-5a89-70853351aa12@redhat.com> <53962163-d04e-e742-20c1-441ea47ea234@oracle.com> Message-ID: <82bdb7e1-2526-75e6-709a-f3eb74678fa9@redhat.com> On 23/01/2020 22:41, David Holmes wrote: > On 24/01/2020 2:47 am, Andrew Haley wrote: >> http://cr.openjdk.java.net/~aph/8230392-2/ >> >> Better? > > That seems fine to me. Yes, indeed, it looks good. > This does make me wonder whether other lock-free code in the VM needs > special handling for non-CPU_MULTI_COPY_ATOMIC ?? The only other place I could find a mention (not a use) of symbol CPU_MULTI_COPY_ATOMIC -- excluding the point where it is defined -- was in a comment in parse1.cpp. 
That comment talks about barriers for final fields which I believe is handled via definition of support_IRIW_for_not_multiple_copy_atomic_cpu. regards, Andrew Dinn ----------- Senior Principal Software Engineer Red Hat UK Ltd Registered in England and Wales under Company Registration No. 03798903 Directors: Michael Cunningham, Michael ("Mike") O'Neill From adinn at redhat.com Fri Jan 24 09:06:18 2020 From: adinn at redhat.com (Andrew Dinn) Date: Fri, 24 Jan 2020 09:06:18 +0000 Subject: 8230392: Define AArch64 as MULTI_COPY_ATOMIC In-Reply-To: <82bdb7e1-2526-75e6-709a-f3eb74678fa9@redhat.com> References: <053adeb2-5f00-5cf6-c96e-97d46d37f5a3@oracle.com> <7c242604-aac9-243f-5a89-70853351aa12@redhat.com> <53962163-d04e-e742-20c1-441ea47ea234@oracle.com> <82bdb7e1-2526-75e6-709a-f3eb74678fa9@redhat.com> Message-ID: <3861243d-828b-fd9e-dfd2-8b5ea60ab12e@redhat.com> On 24/01/2020 09:01, Andrew Dinn wrote: > On 23/01/2020 22:41, David Holmes wrote: >> On 24/01/2020 2:47 am, Andrew Haley wrote: >>> http://cr.openjdk.java.net/~aph/8230392-2/ >>> >>> Better? >> >> That seems fine to me. > > Yes, indeed, it looks good. > >> This does make me wonder whether other lock-free code in the VM needs >> special handling for non-CPU_MULTI_COPY_ATOMIC ?? > The only other place I could find a mention (not a use) of symbol > CPU_MULTI_COPY_ATOMIC -- excluding the point where it is defined -- was > in a comment in parse1.cpp. That comment talks about barriers for final > fields which I believe is handled via definition of > support_IRIW_for_not_multiple_copy_atomic_cpu. Ah sorry, David, having reread I see now your concern may be whether other lock-free code /should/ be considering the setting of CPU_MULTI_COPY_ATOMIC but currently may /not/ be. That's an interesting question . . . :-) regards, Andrew Dinn ----------- Senior Principal Software Engineer Red Hat UK Ltd Registered in England and Wales under Company Registration No. 
03798903 Directors: Michael Cunningham, Michael ("Mike") O'Neill From aph at redhat.com Fri Jan 24 09:06:29 2020 From: aph at redhat.com (Andrew Haley) Date: Fri, 24 Jan 2020 09:06:29 +0000 Subject: 8230392: Define AArch64 as MULTI_COPY_ATOMIC In-Reply-To: <53962163-d04e-e742-20c1-441ea47ea234@oracle.com> References: <053adeb2-5f00-5cf6-c96e-97d46d37f5a3@oracle.com> <7c242604-aac9-243f-5a89-70853351aa12@redhat.com> <53962163-d04e-e742-20c1-441ea47ea234@oracle.com> Message-ID: On 1/23/20 10:41 PM, David Holmes wrote: > This does make me wonder whether other lock-free code in the VM needs > special handling for non-CPU_MULTI_COPY_ATOMIC ?? I don't think we should do exactly that. I would have thought that where an algorithm needs anything stronger than acquire/release consistency we should use explicit sequentially-consistent loads and stores. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From aph at redhat.com Fri Jan 24 09:08:38 2020 From: aph at redhat.com (Andrew Haley) Date: Fri, 24 Jan 2020 09:08:38 +0000 Subject: 8230392: Define AArch64 as MULTI_COPY_ATOMIC In-Reply-To: <82bdb7e1-2526-75e6-709a-f3eb74678fa9@redhat.com> References: <053adeb2-5f00-5cf6-c96e-97d46d37f5a3@oracle.com> <7c242604-aac9-243f-5a89-70853351aa12@redhat.com> <53962163-d04e-e742-20c1-441ea47ea234@oracle.com> <82bdb7e1-2526-75e6-709a-f3eb74678fa9@redhat.com> Message-ID: On 1/24/20 9:01 AM, Andrew Dinn wrote: > On 23/01/2020 22:41, David Holmes wrote: > >> This does make me wonder whether other lock-free code in the VM needs >> special handling for non-CPU_MULTI_COPY_ATOMIC ?? > > The only other place I could find a mention (not a use) of symbol > CPU_MULTI_COPY_ATOMIC -- excluding the point where it is defined -- was > in a comment in parse1.cpp. That comment talks about barriers for final > fields which I believe is handled via definition of > support_IRIW_for_not_multiple_copy_atomic_cpu.
Mmm, but that's an answer to a different question. I'm wondering how many places there are where multi-copy atomicity is assumed but not stated (or even realized by the programmer.) -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From matthias.baesken at sap.com Fri Jan 24 09:27:19 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Fri, 24 Jan 2020 09:27:19 +0000 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: References: <172d4100-5cd2-2a99-f940-8aa25414c87a@oracle.com> <5d4de23f-a89d-7217-1776-51a1f3611d9a@oracle.com> Message-ID: Hi Erik, thanks for the comments, new webrev : http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.7/ Best regards, Matthias > Hello, > > That's better, but there are still some issues. > > flags-cflags.m4 > > Code is repeated in both if and else block. > > jdk-options.m4 > > The default is now true for all platforms. I would suggest moving the > s390x conditional down into an elif after the elif for "no". > > LibCommon.gmk > > Please revert whole file. > > /Erik > > On 2020-01-23 05:15, Baesken, Matthias wrote: > > Hi Erik, new webrev : > > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.6/ > > > > I moved the settings back into the m4 files . > > > > Best regards, Matthias > > > >> Hello Matthias, > >> > >> You can keep the setting up of all the flags in flags-cflags.m4 and > >> flags-ldflags.m4 based on the value of ENABLE_LINKTIME_GC. You can > also > >> default the value of this new parameter to true for s390x to keep the > >> current behavior for that platform. As it is in this patch, the JVM > >> flags for s390x are setup in configure while the JDK flags are in make, > >> which gets confusing I think. 
> >> > >> /Erik > >> > >> > >> On 2020-01-22 05:33, Baesken, Matthias wrote: > >>> Hi Magnus / David, here is a new webrev : > >>> > >>> > >>> http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.4/ > >>> > >>> > >>> it supports now a configure switch --enable-linktime-gc=yes that needs > to > >> be set to enable the link time section gc . > >>> Exception is linuxs390x where we already have the feature enabled > (and > >> keep it enabled always for LIB_JVM). > >>> Best regards, Matthias > >>> > >>> > >>> > >>> From: Baesken, Matthias > >>> Sent: Freitag, 17. Januar 2020 12:44 > >>> To: Magnus Ihse Bursie ; David > Holmes > >> ; 'build-dev at openjdk.java.net' >> dev at openjdk.java.net>; 'hotspot-dev at openjdk.java.net' >> dev at openjdk.java.net> > >>> Subject: RE: RFR: 8236714: enable link-time section-gc for linux to > remove > >> unused code > >>> > >>> > >>> * Matthias: Have a look at some recently added option to get an > >> indication of the best practice in adding new options. There are some > ways to > >> easily make this incorrect > >>> Hi Magnus, do you have a good/"best practice" example (not that I > catch a > >> bad one ?? ) ? > >>> Best regards, Matthias > >>> From magnus.ihse.bursie at oracle.com Fri Jan 24 10:40:13 2020 From: magnus.ihse.bursie at oracle.com (Magnus Ihse Bursie) Date: Fri, 24 Jan 2020 11:40:13 +0100 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: References: <172d4100-5cd2-2a99-f940-8aa25414c87a@oracle.com> <5d4de23f-a89d-7217-1776-51a1f3611d9a@oracle.com> Message-ID: On 2020-01-24 10:27, Baesken, Matthias wrote: > Hi Erik, thanks for the comments, new webrev : > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.7/ Looks good to me. Sorry for not being able to provide an example in a timely manner. Good thing you found one yourself.
>> >> flags-cflags.m4 >> >> Code is repeated in both if and else block. >> >> jdk-options.m4 >> >> The default is now true for all platforms. I would suggest moving the >> s390x conditional down into an elif after the elif for "no". >> >> LibCommon.gmk >> >> Please revert whole file. >> >> /Erik >> >> On 2020-01-23 05:15, Baesken, Matthias wrote: >>> Hi Erik, new webrev : >>> >>> http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.6/ >>> >>> I moved the settings back into the m4 files . >>> >>> Best regards, Matthias >>> >>>> Hello Matthias, >>>> >>>> You can keep the setting up of all the flags in flags-cflags.m4 and >>>> flags-ldflags.m4 based on the value of ENABLE_LINKTIME_GC. You can >> also >>>> default the value of this new parameter to true for s390x to keep the >>>> current behavior for that platform. As it is in this patch, the JVM >>>> flags for s390x are setup in configure while the JDK flags are in make, >>>> which gets confusing I think. >>>> >>>> /Erik >>>> >>>> >>>> On 2020-01-22 05:33, Baesken, Matthias wrote: >>>>> Hi Magnus / David, here is a new webrev : >>>>> >>>>> >>>>> http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.4/ >>>>> >>>>> >>>>> it supports now a configure switch --enable-linktime-gc=yes that needs >> to >>>> be set to enable the link time section gc . >>>>> Exception is linuxs390x where we already have the feature enabled >> (and >>>> keep it enabled always for LIB_JVM). >>>>> Best regards, Matthias >>>>> >>>>> >>>>> >>>>> From: Baesken, Matthias >>>>> Sent: Freitag, 17. Januar 2020 12:44 >>>>> To: Magnus Ihse Bursie ; David >> Holmes >>>> ; 'build-dev at openjdk.java.net' >>> dev at openjdk.java.net>; 'hotspot-dev at openjdk.java.net' >>> dev at openjdk.java.net> >>>>> Subject: RE: RFR: 8236714: enable link-time section-gc for linux to >> remove >>>> unused code >>>>> >>>>> * Matthias: Have a look at some recently added option to get an >>>> indication of the best practice in adding new options. 
There are some >> ways to >>>> easily make this incorrect >>>>> Hi Magnus, do you have a good/?best practice? example (not that I >> catch a >>>> bad one ?? ) ? >>>>> Best regards, Matthias >>>>> From david.holmes at oracle.com Fri Jan 24 13:19:10 2020 From: david.holmes at oracle.com (David Holmes) Date: Fri, 24 Jan 2020 23:19:10 +1000 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: References: Message-ID: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> Hi Fred, This is quite a mini-project. :) On 24/01/2020 1:03 am, Frederic Parain wrote: > Greetings, > > Please review this change proposing a new code to compute field layouts. > > CR: https://bugs.openjdk.java.net/browse/JDK-8237767 > webrev: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/ > > The CR includes a detailed description of the motivation and the implementation. Thanks for the detailed discussion. I can't comment on the details. This is enabling technology for Valhalla so as long as it is functionally correct without significant performance impact, then it seems good. I have a number of comments, mainly code style issues and typos etc. Thanks, David ----- src/hotspot/share/ci/ciInstanceKlass.cpp 216 assert(self->is_loaded(), "must be loaded to field info"); Seems to be a word missing in the message - "access field info" ? 465 int super_fsize = super->nonstatic_field_size() * heapOopSize; Seems to be an unused local variable now. 466 int super_flen = super->nof_nonstatic_fields(); Could be folded directly into the assert so we don't call in product. --- src/hotspot/share/ci/ciInstanceKlass.hpp 229 bool contains_field_offset(int offset) { 230 fieldDescriptor fd; 231 return this->get_instanceKlass()->find_field_from_offset(offset, false, &fd); 232 } Seems odd to fill in the fieldDescriptor but not use it at all. Suggests to me that instanceKlass may need a similar query for contains_field_offset. 
Oh it does - which has the same problem but at least the above code could be: return get_instanceKlass()->contains_field_offset(offset); Style nit: no need for this-> --- src/hotspot/share/classfile/classFileParser.cpp 3935 OopMapBlocksBuilder::OopMapBlocksBuilder(unsigned int max_blocks, TRAPS) { Style nit: extra space before max_blocks. General style issue: when breaking a long line with a method call, the new line (containing arguments) should be indented to the opening ( of the method call e.g. 3941 nonstatic_oop_maps = NEW_RESOURCE_ARRAY_IN_THREAD( 3942 THREAD, OopMapBlock, max_nonstatic_oop_maps); should be 3941 nonstatic_oop_maps = NEW_RESOURCE_ARRAY_IN_THREAD(THREAD, 3942 OopMapBlock, max_nonstatic_oop_maps); or alternatively something like: 3941 nonstatic_oop_maps = 3942 NEW_RESOURCE_ARRAY_IN_THREAD(THREAD, OopMapBlock, max_nonstatic_oop_maps); depending on line length and number of lines. Second example: 3954 assert(nof_blocks && nonstatic_oop_map_count == 0 && 3955 nof_blocks <= max_nonstatic_oop_maps, "invariant"); should be: 3954 assert(nof_blocks && nonstatic_oop_map_count == 0 && 3955 nof_blocks <= max_nonstatic_oop_maps, "invariant"); etc. This applies across all files. -- Style nit: 4137 int super_oop_map_count = (_super_klass == NULL) ? 0 :_super_klass->nonstatic_oop_map_count(); 4138 int max_oop_map_count = 4139 super_oop_map_count + 4140 fac->count[NONSTATIC_OOP]; Given the length of 4137, splitting 4138 over three lines seems odd. At most you want two lines. --- src/hotspot/share/oops/instanceKlass.hpp You need to be careful with _extra_flags usage if there can be concurrently updated bits. At the moment it looks like redefinition is a mutable dynamic property, whilst "contended annotations" should be a static immutable property - is that right? 761 return (_extra_flags & _extra_has_contended_annotations); Style nit: use of implicit boolean, please change to: 761 return ((_extra_flags & _extra_has_contended_annotations) != 0); Ditto for line 774.
--- src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotResolvedObjectTypeImpl.java 714 public int compare(ResolvedJavaField a, ResolvedJavaField b) 715 { Style nit: opening brace should be on the same line. 746 Arrays.sort(result, new SortByOffset()); No need to create a comparator on every call. Declare a static instance in the SortByOffset class. Ditto line 808. I have to wonder about the performance of this code now that it has to do a linear search and post-sort the results? (But that's for the compiler folk to raise. :) ). --- src/hotspot/share/classfile/fieldLayoutBuilder.hpp 41 // All RawBlock must have a size and a alignment. s/RawBlock/RawBlocks/ s/and a/and an/ 42 // exact size of the field expressed in bytes. The alignment is 43 // the alignment constraint of the field (1 for byte, 2 for short, 44 // 4 for int, 8 for long, etc.) I thought we could have stronger alignment constraints than that. For example if an int field is subject to atomic updates via CAS then depending on platform it may need to be 64-bit aligned? Or are we lucky that all (both :) ) of our platforms with hardware cas support direct 32-bit cas? (Good job we don't support AtomicByte or AtomicShort :) ). 53 class LayoutRawBlock : public ResourceObj { Why is the class LayoutRawBlock when you refer to plain RawBlock when describing things? Is there some other kind of RawBlock that will come in with inline types? 61 FLATTENED, // flattened field Does this have any meaning before inline types come in?
205 // and two classes with hard coded offsets (java,lang.ref.Reference s/two/two for/ 207 // is that each kind of classes has a different set goals regarding s/classes/class/ 215 // 2 - Field sorting: fields are sorted out according to their s/out// 220 // 4 - Epilogue: oopmaps are generated, layout information are s/are/is/ 221 // prepared so other VM components can use them (instance size, s/them/it/ --- src/hotspot/share/classfile/fieldLayoutBuilder.cpp 2 * Copyright (c) 2019, 2019, Oracle and/or its affiliates. All rights reserved. Copyright format is incorrect but will be okay when the second 2019 is changed to 2020. :) In FieldLayoutBuilder::epilogue you have a number of calls to Thread::current() as well as an implicit call when you use ResourceMarks. You should capture the current thread once in a local and reuse it. --- That's it. > The current version keeps the old code accessible (with a VM flag) in case the > small changes in computed layouts cause troubles to some applications or tools. > The goal is to get rid of this old code, preferably as soon as possible. > > Testing tier1-8. > > Thank you, > > Fred > From matthias.baesken at sap.com Fri Jan 24 13:21:02 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Fri, 24 Jan 2020 13:21:02 +0000 Subject: RFR [XXS]: 8237819: s390x - remove unused pd_zero_to_words_large Message-ID: Hello, please review this small change . On s390x we have an unused function pd_zero_to_words_large in the code that can be removed, this is done here . 
Bug/webrev : https://bugs.openjdk.java.net/browse/JDK-8237819 http://cr.openjdk.java.net/~mbaesken/webrevs/8237819.0/ Thanks, Matthias From david.holmes at oracle.com Fri Jan 24 13:23:49 2020 From: david.holmes at oracle.com (David Holmes) Date: Fri, 24 Jan 2020 23:23:49 +1000 Subject: 8230392: Define AArch64 as MULTI_COPY_ATOMIC In-Reply-To: References: <053adeb2-5f00-5cf6-c96e-97d46d37f5a3@oracle.com> <7c242604-aac9-243f-5a89-70853351aa12@redhat.com> <53962163-d04e-e742-20c1-441ea47ea234@oracle.com> Message-ID: <7b36d33f-e3b9-e572-be38-1dd3e72ef6f5@oracle.com> On 24/01/2020 7:06 pm, Andrew Haley wrote: > On 1/23/20 10:41 PM, David Holmes wrote: >> This does make me wonder whether other lock-free code in the VM needs >> special handling for non-CPU_MULTI_COPY_ATOMIC ?? > > I don't think we should do exactly that. I would have thoght that > where an algorithm needs anything stronger than acquire/release > consistency we should use explicit sequentially-consistent loads and > stores. What I meant was, "I wonder whether other lock-free code in the VM is assuming multi-copy-atomicity?". David From matthias.baesken at sap.com Fri Jan 24 13:36:05 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Fri, 24 Jan 2020 13:36:05 +0000 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: References: <172d4100-5cd2-2a99-f940-8aa25414c87a@oracle.com> <5d4de23f-a89d-7217-1776-51a1f3611d9a@oracle.com> Message-ID: Thanks for the review ! Erik, are you fine with the latest change too ? Best regards, Matthias > On 2020-01-24 10:27, Baesken, Matthias wrote: > > Hi Erik, thanks for the comments, new webrev : > > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.7/ > Looks good to me. Sorry for not being able to provide an example in a > timely matter. Good thing you found one yourself. 
:) > /Magnus > > Best regards, Matthias From matthias.baesken at sap.com Fri Jan 24 13:48:16 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Fri, 24 Jan 2020 13:48:16 +0000 Subject: RFR: 8237192: Generate stripped/public pdbs on Windows for jdk images In-Reply-To: References: <215dd116-d0f6-6e35-6b02-20d3986fa85a@oracle.com> Message-ID: Hi Erik, maybe we can just rename the configure option to --enable-stripped-pdbs-for-bundle AND make the default = no/false . Then without setting the configure flag, everything stays as it is for JDK vendors/distributors who do not want the stripped pdbs in the bundle. Others who set the flag, have to "teach" the developers that the bundle already contains stripped pdbs that need to be replaced by full/"private" pdbs in case better symbols/stacks are wanted . I think that's a good compromise. > Now if MS had been kind enough to define a separate file type for the > stripped pdbs, so that they could live alongside the full pdbs, we > wouldn't have this issue. Unfortunately that seems not to work, I tried to use the stripped pdb-files with another extension but no success. An alternative could be to create 2 bundles when "--enable-stripped-pdbs-for-bundle" is set to yes , one with, one without stripped pdbs . Best regards, Matthias > > On 2020-01-23 00:03, Baesken, Matthias wrote: > > Hi Erik, yes true sorry for answering your comments a bit late . > > > >> If a user runs jlink and includes all the jmods we ship with the JDK, the > result > >> should be essentially equivalent to the original JDK image. The way the > >> stripped pdb files are included in the bundles sort of at the last > >> second of the build here breaks this property. > > I think we should address this in a separate bug/CR . > Maybe. I realize that my proposal below is quite a big task. But on the > other hand, I don't think breaking the relationship between the jmods > and the distribution bundles is on the table really.
> > Looking for example into a Linux build, I see a lot of debuginfo files in the > jdk image (more or less for every shared lib) . > > But when looking into the jmods of that jdk image , no debuginfo files are > in there ( I checked the java.base jmod). > > So putting the files with debug information into the jmods seems to be > something that was not done so far cross platform (or is there some build > switch for it that I did not check?) . > > Maybe to keep the jmods as small as possible . > > No, we do not put the debuginfo files in the jmods nor the bundles > because those are not intended to be shipped to customers. We are > currently overlaying them into images/jdk in the build output to make > them available for local debugging. (This is convoluted and I would very > much like to get away from this practice at some point so that there is > a 1-1 mapping between images/jdk and bundles/jdk*-bin.tar.gz.) The > stripped pdb files you are proposing are on the contrary intended for > shipping to customers (as I understand your proposal) so comparing them > with the debuginfo files is not relevant. > > Now if MS had been kind enough to define a separate file type for the > stripped pdbs, so that they could live alongside the full pdbs, we > wouldn't have this issue. The heart of the problem is that only one set > of files (either stripped or full) can be present and usable in > images/jdk at a time. We have 2 main uses for images/jdk. > > 1. Developer running and debugging locally > > 2. Serve as the source for generating the distribution bundles > > We currently have one image serving both of these purposes, which is > already creating complicated and convoluted build steps. To properly > solve this I would argue for splitting these two apart into two > different images for the two different purposes. The build procedure > would then be, first build the images for distribution, only containing > what should go into each bundle. 
Then create the developer jdk image by > copying files from the distribution images. On Windows, the first image > would contain the stripped pdbs and when building the second, those > would get overwritten with the full pdbs. > > Now that I figured out a working model that would solve a bunch of other > problems as well, I would love to implement it, but I doubt I will have > time in the near future. > > /Erik > > > > >> To properly implement this, care will need to be taken to juggle the two > >> sets of pdb files around, making sure each build and test use case has > >> the correct one in place where and when it's needed. Quite possibly, we > >> cannot cover all use cases with one build configuration. Developers > >> needing the full debug symbols when debugging locally would likely need > >> to disable the stripped symbols so they get the full symbols everywhere. > >> Possibly this would need to be the default for debug builds and > >> configurable for release builds. > > From my limited experience , the developers do not work with the > bundles (that would contain now after my patch the stripped pds) but with > a "normal" jdk image that is created my make all. > > > > Best regards, Matthias > > > > > > > >> This still does not address anything in my objection. > >> > >> /Erik > >> > >> On 2020-01-22 07:46, Baesken, Matthias wrote: > >>> Hello, here is an updated version : > >>> > >>> http://cr.openjdk.java.net/~mbaesken/webrevs/8237192.3/ > >>> > >>> this one supports a configure switch "--enable-stripped-pdbs" to > enable > >> the feature . > >>> Best regards, Matthias > >>> > >>> > >>>> -----Original Message----- > >>>> From: Baesken, Matthias > >>>> Sent: Dienstag, 21. 
Januar 2020 11:03 > >>>> To: 'David Holmes' ; 'build- > >>>> dev at openjdk.java.net' ; 'hotspot- > >>>> dev at openjdk.java.net' > >>>> Subject: RE: RFR: 8237192: Generate stripped/public pdbs on Windows > for > >>>> jdk images > >>>> > >>>> > >>>> Hi David , yes I think it makes sense to have a configure option for this . > >>>> Not everyone would like to have a larger JDK (even it is only a bit > larger). > >>>> From aph at redhat.com Fri Jan 24 14:00:10 2020 From: aph at redhat.com (Andrew Haley) Date: Fri, 24 Jan 2020 14:00:10 +0000 Subject: 8230392: Define AArch64 as MULTI_COPY_ATOMIC In-Reply-To: <7b36d33f-e3b9-e572-be38-1dd3e72ef6f5@oracle.com> References: <053adeb2-5f00-5cf6-c96e-97d46d37f5a3@oracle.com> <7c242604-aac9-243f-5a89-70853351aa12@redhat.com> <53962163-d04e-e742-20c1-441ea47ea234@oracle.com> <7b36d33f-e3b9-e572-be38-1dd3e72ef6f5@oracle.com> Message-ID: <8fa806dc-5bf5-aa82-814b-47cb21bbeca3@redhat.com> On 1/24/20 1:23 PM, David Holmes wrote: > On 24/01/2020 7:06 pm, Andrew Haley wrote: >> On 1/23/20 10:41 PM, David Holmes wrote: >>> This does make me wonder whether other lock-free code in the VM needs >>> special handling for non-CPU_MULTI_COPY_ATOMIC ?? >> >> I don't think we should do exactly that. I would have thoght that >> where an algorithm needs anything stronger than acquire/release >> consistency we should use explicit sequentially-consistent loads and >> stores. > > What I meant was, "I wonder whether other lock-free code in the VM is > assuming multi-copy-atomicity?". Yes, I did understand that. What I meant to say is that even if we could find it (which we probably can't) we shouldn't special-case it. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. 
https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From erik.joelsson at oracle.com Fri Jan 24 14:22:53 2020 From: erik.joelsson at oracle.com (Erik Joelsson) Date: Fri, 24 Jan 2020 06:22:53 -0800 Subject: RFR: 8236714: enable link-time section-gc for linux to remove unused code In-Reply-To: References: <172d4100-5cd2-2a99-f940-8aa25414c87a@oracle.com> <5d4de23f-a89d-7217-1776-51a1f3611d9a@oracle.com> Message-ID: <4067336a-cd36-7ec3-97e7-00cd3b8e86ab@oracle.com> Looks good. /Erik On 2020-01-24 01:27, Baesken, Matthias wrote: > Hi Erik, thanks for the comments, new webrev : > > http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.7/ > > Best regards, Matthias > > > >> Hello, >> >> That's better, but there are still some issues. >> >> flags-cflags.m4 >> >> Code is repeated in both if and else block. >> >> jdk-options.m4 >> >> The default is now true for all platforms. I would suggest moving the >> s390x conditional down into an elif after the elif for "no". >> >> LibCommon.gmk >> >> Please revert whole file. >> >> /Erik >> >> On 2020-01-23 05:15, Baesken, Matthias wrote: >>> Hi Erik, new webrev : >>> >>> http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.6/ >>> >>> I moved the settings back into the m4 files . >>> >>> Best regards, Matthias >>> >>>> Hello Matthias, >>>> >>>> You can keep the setting up of all the flags in flags-cflags.m4 and >>>> flags-ldflags.m4 based on the value of ENABLE_LINKTIME_GC. You can >> also >>>> default the value of this new parameter to true for s390x to keep the >>>> current behavior for that platform. As it is in this patch, the JVM >>>> flags for s390x are setup in configure while the JDK flags are in make, >>>> which gets confusing I think. 
>>>> >>>> /Erik >>>> >>>> >>>> On 2020-01-22 05:33, Baesken, Matthias wrote: >>>>> Hi Magnus / David, here is a new webrev : >>>>> >>>>> >>>>> http://cr.openjdk.java.net/~mbaesken/webrevs/8236714.4/ >>>>> >>>>> >>>>> it supports now a configure switch --enable-linktime-gc=yes that needs >> to >>>> be set to enable the link time section gc . >>>>> Exception is linuxs390x where we already have the feature enabled >> (and >>>> keep it enabled always for LIB_JVM). >>>>> Best regards, Matthias >>>>> >>>>> >>>>> >>>>> From: Baesken, Matthias >>>>> Sent: Freitag, 17. Januar 2020 12:44 >>>>> To: Magnus Ihse Bursie ; David >> Holmes >>>> ; 'build-dev at openjdk.java.net' >>> dev at openjdk.java.net>; 'hotspot-dev at openjdk.java.net' >>> dev at openjdk.java.net> >>>>> Subject: RE: RFR: 8236714: enable link-time section-gc for linux to >> remove >>>> unused code >>>>> >>>>> * Matthias: Have a look at some recently added option to get an >>>> indication of the best practice in adding new options. There are some >> ways to >>>> easily make this incorrect >>>>> Hi Magnus, do you have a good/?best practice? example (not that I >> catch a >>>> bad one ?? ) ? >>>>> Best regards, Matthias >>>>> From martin.doerr at sap.com Fri Jan 24 15:18:11 2020 From: martin.doerr at sap.com (Doerr, Martin) Date: Fri, 24 Jan 2020 15:18:11 +0000 Subject: RFR [XXS]: 8237819: s390x - remove unused pd_zero_to_words_large In-Reply-To: References: Message-ID: Hi Matthias, looks good. Best regards, Martin > -----Original Message----- > From: hotspot-dev On Behalf Of > Baesken, Matthias > Sent: Freitag, 24. Januar 2020 14:21 > To: 'hotspot-dev at openjdk.java.net' > Subject: RFR [XXS]: 8237819: s390x - remove unused > pd_zero_to_words_large > > Hello, please review this small change . > > On s390x we have an unused function pd_zero_to_words_large in the code > that can be removed, this is done here . 
> > > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8237819 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8237819.0/ > > Thanks, Matthias From frederic.parain at oracle.com Fri Jan 24 17:20:15 2020 From: frederic.parain at oracle.com (Frederic Parain) Date: Fri, 24 Jan 2020 12:20:15 -0500 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> Message-ID: <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com> Hi David, > On Jan 24, 2020, at 08:19, David Holmes wrote: > > Hi Fred, > > This is quite a mini-project. :) Yes, indeed! > > On 24/01/2020 1:03 am, Frederic Parain wrote: >> Greetings, >> Please review this change proposing a new code to compute field layouts. >> CR: https://bugs.openjdk.java.net/browse/JDK-8237767 >> webrev: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/ >> The CR includes a detailed description of the motivation and the implementation. > > Thanks for the detailed discussion. I can't comment on the details. This is enabling technology for Valhalla so as long as it is functionally correct without significant performance impact, then it seems good. > > I have a number of comments, mainly code style issues and typos etc. Thank you for the detailed review, I've fixed all issues you mentioned and my answers to your questions are inlined below. The new webrev is available here: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.05/index.html Rerunning tier1-3 right now. > > Thanks, > David > ----- > > src/hotspot/share/ci/ciInstanceKlass.cpp > > 216 assert(self->is_loaded(), "must be loaded to field info"); > > Seems to be a word missing in the message - "access field info" ? Fixed > > 465 int super_fsize = super->nonstatic_field_size() * heapOopSize; > > Seems to be an unused local variable now. Removed.
> > 466 int super_flen = super->nof_nonstatic_fields(); > > Could be folded directly into the assert so we don't call in product. Calling nof_nonstatic_fields() has the side effect of computing non-static fields, which is required to get a correct value when reading super->_nonstatic_fields, so the call is needed even in product builds. > > --- > > src/hotspot/share/ci/ciInstanceKlass.hpp > > 229 bool contains_field_offset(int offset) { > 230 fieldDescriptor fd; > 231 return this->get_instanceKlass()->find_field_from_offset(offset, false, &fd); > 232 } > > Seems odd to fill in the fieldDescriptor but not use it at all. Suggests to me that instanceKlass may need a similar query for contains_field_offset. Oh it does - which has the same problem but at least the above code could be: > > return get_instanceKlass()->contains_field_offset(offset); > > Style nit: no need for this-> Fixed > > --- > > src/hotspot/share/classfile/classFileParser.cpp > > 3935 OopMapBlocksBuilder::OopMapBlocksBuilder(unsigned int max_blocks, TRAPS) { > > Style nit: extra space before max_blocks. Fixed > > General style issue: when breaking a long line with a method call, the new line (containing arguments) should be indented to the opening ( of the method call e.g. > > 3941 nonstatic_oop_maps = NEW_RESOURCE_ARRAY_IN_THREAD( > 3942 THREAD, OopMapBlock, max_nonstatic_oop_maps); > > should be > > 3941 nonstatic_oop_maps = NEW_RESOURCE_ARRAY_IN_THREAD(THREAD, > 3942 OopMapBlock, max_nonstatic_oop_maps); > > or alternatively something like: > > 3941 nonstatic_oop_maps = > 3942 NEW_RESOURCE_ARRAY_IN_THREAD(THREAD, OopMapBlock, max_nonstatic_oop_maps); > > depending on line length and number of lines. Fixed > > Second example: > > 3954 assert(nof_blocks && nonstatic_oop_map_count == 0 && > 3955 nof_blocks <= max_nonstatic_oop_maps, "invariant"); Fixed > > etc.
This applies across all files. Fixes applied lines 4003, 4011, 4041, 4138, 4143. > > -- > > Style nit: > > 4137 int super_oop_map_count = (_super_klass == NULL) ? 0 :_super_klass->nonstatic_oop_map_count(); > 4138 int max_oop_map_count = > 4139 super_oop_map_count + > 4140 fac->count[NONSTATIC_OOP]; > > Given the length of 4137, splitting 4138 over three lines seems odd. At most you want two lines. Fixed > > --- > > src/hotspot/share/oops/instanceKlass.hpp > > You need to be careful with _extra_flags usage if there can be concurrently updated bits. At the moment it looks like redefinition is a mutable dynamic property, whilst "contended annotations" should be a static immutable property - is that right? Correct, _has_contended_annotations is a static immutable property, while _is_being_redefined is a mutable one. > > 761 return (_extra_flags & _extra_has_contended_annotations); > > Style nit: use of implicit boolean, please change to: > > 761 return (_extra_flags & _extra_has_contended_annotations != 0); > > Ditto for line 774. Fixed > > --- > > src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotResolvedObjectTypeImpl.java > > 714 public int compare(ResolvedJavaField a, ResolvedJavaField b) > 715 { > > Style nit: opening brace should be on the same line. Fixed > > 746 Arrays.sort(result, new SortByOffset()); > > No need to create a comparator on every call. Declare a static instance in the SortByOffset class. Ditto line 808. Fixed > > I have to wonder about the performance of this code now that it has to do a linear search and post-sort the results? (But that's for the compiler folk to raise. :) ). The Graal team already had a look at this code (the new field layout had an impact on some Graal unit tests, so I've discussed the issue with them and tests have been fixed), and even if this is not an official review yet, they didn't express strong objections. I'll double check with them again before pushing the code.
> > --- > > src/hotspot/share/classfile/fieldLayoutBuilder.hpp > > 41 // All RawBlock must have a size and a alignment. > > s/RawBlock/RawBlocks/ > s/and a/and an/ Fixed > > 42 // exact size of the field expressed in bytes. The alignment is > 43 // the alignment constraint of the field (1 for byte, 2 for short, > 44 // 4 for int, 8 for long, etc.) > > I thought we could have stronger alignment constraints than that. For example if an int field is subject to atomic updates via CAS then depending on platform it may need to be 64-bit aligned? Or are we lucky that all (both :) ) of our platforms with hardware cas support direct 32-bit cas? (Good job we don't support AtomicByte or AtomicShort :) ). There are no such stronger alignment constraints in the old field layout code. Fortunately, the new code allows adding such constraints if needed (it was designed to support hyper-alignment in the future). > > 53 class LayoutRawBlock : public ResourceObj { > > Why is the class LayoutRawBlock when you refer to plain RawBlock when describing things? Is there some other kind of RawBlock that will come in with inline types? During a code walk through last week, Lois and Harold suggested that "RawBlock" was too generic and could cause name conflicts, so it has been changed to "LayoutRawBlock". Unfortunately, the refactoring didn't apply automatically to comments, I've fixed that. There's no other kind of data structure coming in with Valhalla, only new methods and additional cases in existing methods. > > 61 FLATTENED, // flattened field > > Does this have any meaning before inline types come in? Yes, I wanted to reserve the entry in the enum.
> > 205 // and two classes with hard coded offsets (java,lang.ref.Reference > > s/two/two for/ Fixed > > 207 // is that each kind of classes has a different set goals regarding > > s/classes/class/ Fixed > > 215 // 2 - Field sorting: fields are sorted out according to their > > s/out// Fixed > > 220 // 4 - Epilogue: oopmaps are generated, layout information are > > s/are/is/ Fixed > > 221 // prepared so other VM components can use them (instance size, > > s/them/it/ Fixed > > --- > > src/hotspot/share/classfile/fieldLayoutBuilder.cpp > > 2 * Copyright (c) 2019, 2019, Oracle and/or its affiliates. All rights reserved. > > Copyright format is incorrect but will be okay when the second 2019 is changed to 2020. :) Fixed with a single year format in .cpp and .hpp and test files. > > In FieldLayoutBuilder::epilogue you have a number of calls to Thread::current() as well as an implicit call when you use ResourceMarks. You should capture the current thread once in a local and reuse it. Fixed > > --- > > That's it. Thank you, Fred > > >> The current version keeps the old code accessible (with a VM flag) in case the >> small changes in computed layouts cause troubles to some applications or tools. >> The goal is to get rid of this old code, preferably as soon as possible. >> Testing tier1-8. >> Thank you, >> Fred From frederic.parain at oracle.com Fri Jan 24 17:40:44 2020 From: frederic.parain at oracle.com (Frederic Parain) Date: Fri, 24 Jan 2020 12:40:44 -0500 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: References: Message-ID: <46146AB6-5300-44DF-AA44-3E7C379DCD28@oracle.com> Hi Aleksey, Thank you for giving this change a try. I've updated and closed the two RFRs as duplicates, and I've changed the flag name. The new webrev is here (with updates from David's review): http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.05/index.
Regards, Fred > On Jan 24, 2020, at 03:46, Aleksey Shipilev wrote: > > Hi Frederic, > > On 1/23/20 4:03 PM, Frederic Parain wrote: >> CR: https://bugs.openjdk.java.net/browse/JDK-8237767 >> webrev: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/ >> >> The CR includes a detailed description of the motivation and the implementation. > > I ran the patched jdk through JOL Samples [1], and it works impressively well. I think these RFRs > can be closed as duplicates now: > https://bugs.openjdk.java.net/browse/JDK-8024912 > https://bugs.openjdk.java.net/browse/JDK-8024912 > > Nit: I think the flag should be UseNew*Field*Layout. > > -- > Thanks, > -Aleksey > > [1] > https://hg.openjdk.java.net/code-tools/jol/file/f1e978b895a0/jol-samples/src/main/java/org/openjdk/jol/samples > From coleen.phillimore at oracle.com Fri Jan 24 21:17:06 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Fri, 24 Jan 2020 16:17:06 -0500 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: References: Message-ID: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/ci/ciInstanceKlass.hpp.udiff.html bool contains_field_offset(int offset) { - return instanceOopDesc::contains_field_offset(offset, nonstatic_field_size()); + fieldDescriptor fd; + return this->get_instanceKlass()->find_field_from_offset(offset, false, &fd); } This has to go into the VM if it's going to access metadata, with VM_ENTRY_MARK, so probably belongs in the cpp file. Also, why doesn't this call contains_field_offset() from InstanceKlass? 
from here: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/oops/instanceKlass.cpp.udiff.html http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/oops/instanceKlass.hpp.udiff.html + public: + enum { + _extra_is_being_redefined = 1 << 0, // used for locking redefinition + _extra_has_contended_annotations = 1 << 1 // has @Contended annotation + }; + Why is this enum public? Also, I think you should make these misc_flags and make _flags a u4. There's already an alignment gap in InstanceKlass and we can file an RFE to fix that after this change. http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/runtime/globals.hpp.udiff.html I think you should make UseNewLayout a diagnostic flag since we would like to remove the old code once it has gotten more testing. The only reason someone would use it would be to diagnose some differing behavior. http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotResolvedObjectTypeImpl.java.udiff.html Someone who knows the compilers should have a peek at this. http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/classfile/fieldLayoutBuilder.hpp.html 37 // A RawBlock describes an element of a layout. 38 // Each field is represented by a RawBlock. Thank you for changing RawBlock to LayoutRawBlock. Can you update the comments here and the description in the RFE? 29 #include "classfile/classLoaderData.inline.hpp" .hpp files shouldn't include .inline.hpp files. Whatever uses the inline code should go into the cpp file (or if critical, add an inline.hpp file). 132 static const int INITIAL_LIST_SIZE; It's odd to see this without an initialization. 227 class FieldLayoutBuilder : public ResourceObj { FieldLayoutBuilder is a StackObj isn't it? It might be nice to line up the fields, which is part of the coding standard that I only somewhat agree on.
Here it would enhance readability. 264 protected: Why are these functions "protected"? I don't see anything that inherits from FieldLayoutBuilder that might want to call these functions and not the other functions. http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/classfile/fieldLayoutBuilder.cpp.html 544 _contended_groups = new (ResourceObj::RESOURCE_AREA, mtInternal) GrowableArray(8); Growable arrays are default ResourceObj, so I think the 'new' arguments aren't needed (there were a couple of these). 698 OopMapBlocksBuilder* nonstatic_oop_maps = 699 new OopMapBlocksBuilder(max_oop_map_count, Thread::current()); The code uses OopMapBlocksBuilder as a StackObj, which seems to make more sense. Can you make it StackObj and not have it allocate from resourceArea? 531 void FieldLayoutBuilder::regular_field_sorting(TRAPS) { 635 regular_field_sorting(CHECK); None of these functions throw Java class exceptions. They shouldn't pass TRAPS and CHECK which are for Java exceptions. If you want to avoid JavaThread::current, you can pass JavaThread* thread as the first parameter. I think for this change, it might be better to let this: + OopMapBlock* oop_maps_copy = NEW_RESOURCE_ARRAY_IN_THREAD(THREAD, OopMapBlock, be + OopMapBlock* oop_maps_copy = NEW_RESOURCE_ARRAY(OopMapBlock, and get its own Thread::current(). I don't think it saves many instructions and definitely not any time. +void OopMapBlocksBuilder::compact(TRAPS) { This shouldn't take a thread argument, so shouldn't be declared with TRAPS, see above. 748 if (PrintFieldLayout) { In some future change, this should be turned into unified logging. http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/classfile/classFileParser.cpp.udiff.html Why was AnnotationCollector moved to the header file? It seems only used by classFileParser.cpp? This is a really nice change and your comments are really good. These are all pretty small comments.
Thanks, Coleen On 1/23/20 10:03 AM, Frederic Parain wrote: > Greetings, > > Please review this change proposing a new code to compute field layouts. > > CR: https://bugs.openjdk.java.net/browse/JDK-8237767 > webrev: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/ > > The CR includes a detailed description of the motivation and the implementation. > > The current version keeps the old code accessible (with a VM flag) in case the > small changes in computed layouts cause troubles to some applications or tools. > The goal is to get rid of this old code, preferably as soon as possible. > > Testing tier1-8. > > Thank you, > > Fred > From coleen.phillimore at oracle.com Fri Jan 24 21:21:04 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Fri, 24 Jan 2020 16:21:04 -0500 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> Message-ID: <63ae9764-9d47-4305-a4f4-aa66df7bb79c@oracle.com> One comment below on David's comments. On 1/24/20 8:19 AM, David Holmes wrote: > Hi Fred, > > This is quite a mini-project. :) > > On 24/01/2020 1:03 am, Frederic Parain wrote: >> Greetings, >> >> Please review this change proposing a new code to compute field layouts. >> >> CR: https://bugs.openjdk.java.net/browse/JDK-8237767 >> webrev: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/ >> >> The CR includes a detailed description of the motivation and the >> implementation. > > Thanks for the detailed discussion. I can't comment on the details. > This is enabling technology for Valhalla so as long as it is > functionally correct without significant performance impact, then it > seems good. > > I have a number of comments, mainly code style issues and typos etc. > > Thanks, > David > ----- > > src/hotspot/share/ci/ciInstanceKlass.cpp > > 216?? 
assert(self->is_loaded(), "must be loaded to field info"); > > Seems to be a word missing in the message - "access field info" ? > > 465 int super_fsize = super->nonstatic_field_size() * heapOopSize; > > Seems to be an unused local variable now. > > 466 int super_flen = super->nof_nonstatic_fields(); > > Could be folded directly into the assert so we don't call in product. > > --- > > src/hotspot/share/ci/ciInstanceKlass.hpp > > 229 bool contains_field_offset(int offset) { > 230 fieldDescriptor fd; > 231 return > this->get_instanceKlass()->find_field_from_offset(offset, false, &fd); > 232 } > > Seems odd to fill in the fieldDescriptor but not use it at all. > Suggests to me that instanceKlass may need a similar query for > contains_field_offset. Oh it does - which has the same problem but at > least the above code could be: > > return get_instanceKlass()->contains_field_offset(offset); > > Style nit: no need for this-> > > --- > > src/hotspot/share/classfile/classFileParser.cpp > > 3935 OopMapBlocksBuilder::OopMapBlocksBuilder(unsigned int max_blocks, > TRAPS) { > > Style nit: extra space before max_blocks. > > General style issue: when breaking a long line with a method call, the > new line (containing arguments) should be indented to the opening ( of > the method call e.g. > > 3941 nonstatic_oop_maps = NEW_RESOURCE_ARRAY_IN_THREAD( > 3942 THREAD, OopMapBlock, max_nonstatic_oop_maps); > > should be > > 3941 nonstatic_oop_maps = NEW_RESOURCE_ARRAY_IN_THREAD(THREAD, > 3942 OopMapBlock, max_nonstatic_oop_maps); > > or alternatively something like: > > 3941 nonstatic_oop_maps = > 3942 NEW_RESOURCE_ARRAY_IN_THREAD(THREAD, OopMapBlock, > max_nonstatic_oop_maps); > > depending on line length and number of lines. > > Second example: > > 3954 assert(nof_blocks && nonstatic_oop_map_count == 0 && > 3955 nof_blocks <= max_nonstatic_oop_maps, "invariant"); > > should be: > > 3954 assert(nof_blocks && nonstatic_oop_map_count == 0 && > 3955 nof_blocks <= max_nonstatic_oop_maps, "invariant"); > > etc. This applies across all files. > > -- > > Style nit: > > 4137 int super_oop_map_count = (_super_klass == NULL) ? 0 > :_super_klass->nonstatic_oop_map_count(); > 4138 int max_oop_map_count = > 4139 super_oop_map_count + > 4140 fac->count[NONSTATIC_OOP]; > > Given the length of 4137, splitting 4138 over three lines seems odd. > At most you want two lines. > > --- > > src/hotspot/share/oops/instanceKlass.hpp > > You need to be careful with _extra_flags usage if there can be > concurrently updated bits. At the moment it looks like redefinition is > a mutable dynamic property, whilst "contended annotations" should be a > static immutable property - is that right? > > 761 return (_extra_flags & _extra_has_contended_annotations); > > Style nit: use of implicit boolean, please change to: > > 761 return (_extra_flags & _extra_has_contended_annotations != 0); > > Ditto for line 774. I had a look at this because I had originally added _is_being_redefined thinking it would be something that multiple threads tested and should be volatile and have memory ordering. So that's why it had its own bool field. Looking again, all threads access this inside of the RedefineClasses_lock so it's okay for this to be a non-atomic bit field. Coleen > > --- > > src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotResolvedObjectTypeImpl.java > > > 714 public int compare(ResolvedJavaField a, ResolvedJavaField b) > 715 { > > Style nit: opening brace should be on the same line. > > 746 Arrays.sort(result, new SortByOffset()); > > No need to create a comparator on every call. Declare a static > instance in the SortByOffset class. Ditto line 808. > > I have to wonder about the performance of this code now that it has to > do a linear search and post-sort the results? (But that's for the > compiler folk to raise. :) ). > > --- > > src/hotspot/share/classfile/fieldLayoutBuilder.hpp > > 41 // All RawBlock must have a size and a alignment. > > s/RawBlock/RawBlocks/ > s/and a/and an/ > > 42 // exact size of the field expressed in bytes. The alignment is > 43 // the alignment constraint of the field (1 for byte, 2 for short, > 44 // 4 for int, 8 for long, etc.) > > I thought we could have stronger alignment constraints than that. For > example if an int field is subject to atomic updates via CAS then > depending on platform it may need to be 64-bit aligned? Or are we > lucky that all (both :) ) of our platforms with hardware cas support > direct 32-bit cas? (Good job we don't support AtomicByte or > AtomicShort :) ). > > 53 class LayoutRawBlock : public ResourceObj { > > Why is the class LayoutRawBlock when you refer to plain RawBlock when > describing things? Is there some other kind of RawBlock that will come > in with inline types? > > 61 FLATTENED, // flattened field > > Does this have any meaning before inline types come in? > > 205 // and two classes with hard coded offsets (java,lang.ref.Reference > > s/two/two for/ > > 207 // is that each kind of classes has a different set goals regarding > > s/classes/class/ > > 215 // 2 - Field sorting: fields are sorted out according to their > > s/out// > > 220 // 4 - Epilogue: oopmaps are generated, layout information are > > s/are/is/ > > 221 // prepared so other VM components can use them (instance size, > > s/them/it/ > > --- > > src/hotspot/share/classfile/fieldLayoutBuilder.cpp > > 2 * Copyright (c) 2019, 2019, Oracle and/or its affiliates. All > rights reserved. > > Copyright format is incorrect but will be okay when the second 2019 is > changed to 2020. :) > > In FieldLayoutBuilder::epilogue you have a number of calls to > Thread::current() as well as an implicit call when you use > ResourceMarks.
You should capture the current thread once in a local > and reuse it. > > --- > > That's it. > > >> The current version keeps the old code accessible (with a VM flag) in >> case the >> small changes in computed layouts cause troubles to some applications >> or tools. >> The goal is to get rid of this old code, preferably as soon as possible. >> >> Testing tier1-8. >> >> Thank you, >> >> Fred >> From david.holmes at oracle.com Fri Jan 24 22:56:07 2020 From: david.holmes at oracle.com (David Holmes) Date: Sat, 25 Jan 2020 08:56:07 +1000 Subject: Fwd: New candidate JEP: 371: Hidden Classes In-Reply-To: <20200124210734.D1173315210@eggemoggin.niobe.net> References: <20200124210734.D1173315210@eggemoggin.niobe.net> Message-ID: FYI. Cheers, David -------------- next part -------------- An embedded message was scrubbed... From: mark.reinhold at oracle.com Subject: New candidate JEP: 371: Hidden Classes Date: Fri, 24 Jan 2020 13:07:34 -0800 (PST) Size: 3883 URL: From shade at redhat.com Sat Jan 25 09:46:59 2020 From: shade at redhat.com (Aleksey Shipilev) Date: Sat, 25 Jan 2020 10:46:59 +0100 Subject: RFR (XS/T) 8237847: Zero builds fail after JDK-8237637 (Remove dubious type conversions from oop) Message-ID: <05de70b1-33fb-6fa2-f78a-c491f9122e9e@redhat.com> Bug: https://bugs.openjdk.java.net/browse/JDK-8237847 Seems like JDK-8237637 missed the spot there. 
Fix: diff -r d3cdf4b2b45b src/hotspot/share/interpreter/bytecodeInterpreter.cpp --- a/src/hotspot/share/interpreter/bytecodeInterpreter.cpp Fri Jan 24 21:57:19 2020 +0000 +++ b/src/hotspot/share/interpreter/bytecodeInterpreter.cpp Sat Jan 25 10:18:22 2020 +0100 @@ -2171,11 +2171,11 @@ } #endif if (result != NULL) { // Initialize object (if nonzero size and need) and then the header if (need_zero ) { - HeapWord* to_zero = (HeapWord*) result + sizeof(oopDesc) / oopSize; + HeapWord* to_zero = cast_from_oop(result) + sizeof(oopDesc) / oopSize; obj_size -= sizeof(oopDesc) / oopSize; if (obj_size > 0 ) { memset(to_zero, 0, obj_size * HeapWordSize); } } Testing: Linux x86_64 zero {fastdebug,release} build -- Thanks, -Aleksey From daniel.daugherty at oracle.com Sat Jan 25 13:52:52 2020 From: daniel.daugherty at oracle.com (Daniel D. Daugherty) Date: Sat, 25 Jan 2020 08:52:52 -0500 Subject: RFR (XS/T) 8237847: Zero builds fail after JDK-8237637 (Remove dubious type conversions from oop) In-Reply-To: <05de70b1-33fb-6fa2-f78a-c491f9122e9e@redhat.com> References: <05de70b1-33fb-6fa2-f78a-c491f9122e9e@redhat.com> Message-ID: <26dd305e-a605-042b-521d-9e7640acc059@oracle.com> Thumbs up. I agree that this is a trivial change. Dan On 1/25/20 4:46 AM, Aleksey Shipilev wrote: > Bug: > https://bugs.openjdk.java.net/browse/JDK-8237847 > > Seems like JDK-8237637 missed the spot there. 
> > Fix: > > diff -r d3cdf4b2b45b src/hotspot/share/interpreter/bytecodeInterpreter.cpp > --- a/src/hotspot/share/interpreter/bytecodeInterpreter.cpp Fri Jan 24 21:57:19 2020 +0000 > +++ b/src/hotspot/share/interpreter/bytecodeInterpreter.cpp Sat Jan 25 10:18:22 2020 +0100 > @@ -2171,11 +2171,11 @@ > } > #endif > if (result != NULL) { > // Initialize object (if nonzero size and need) and then the header > if (need_zero ) { > - HeapWord* to_zero = (HeapWord*) result + sizeof(oopDesc) / oopSize; > + HeapWord* to_zero = cast_from_oop(result) + sizeof(oopDesc) / oopSize; > obj_size -= sizeof(oopDesc) / oopSize; > if (obj_size > 0 ) { > memset(to_zero, 0, obj_size * HeapWordSize); > } > } > > > Testing: Linux x86_64 zero {fastdebug,release} build > From christoph.langer at sap.com Sat Jan 25 14:16:15 2020 From: christoph.langer at sap.com (Langer, Christoph) Date: Sat, 25 Jan 2020 14:16:15 +0000 Subject: RFR [XXS]: 8237819: s390x - remove unused pd_zero_to_words_large In-Reply-To: References: Message-ID: Hi Matthias, looks good. Small nit: The copyright year for SAP should read "Copyright (c) 2016, 2020 SAP SE. All rights reserved.". There's no comma after 2020 as opposed to the Oracle line. Best regards Christoph > -----Original Message----- > From: hotspot-dev On Behalf Of > Baesken, Matthias > Sent: Freitag, 24. Januar 2020 14:21 > To: 'hotspot-dev at openjdk.java.net' > Subject: RFR [XXS]: 8237819: s390x - remove unused > pd_zero_to_words_large > > Hello, please review this small change . > > On s390x we have an unused function pd_zero_to_words_large in the code > that can be removed, this is done here . 
> > > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8237819 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8237819.0/ > > > Thanks, Matthias From shade at redhat.com Sun Jan 26 16:09:48 2020 From: shade at redhat.com (Aleksey Shipilev) Date: Sun, 26 Jan 2020 17:09:48 +0100 Subject: RFR (XS/T) 8237847: Zero builds fail after JDK-8237637 (Remove dubious type conversions from oop) In-Reply-To: <26dd305e-a605-042b-521d-9e7640acc059@oracle.com> References: <05de70b1-33fb-6fa2-f78a-c491f9122e9e@redhat.com> <26dd305e-a605-042b-521d-9e7640acc059@oracle.com> Message-ID: <1965888e-aeab-569a-c7cf-01c2a011329d@redhat.com> On 1/25/20 2:52 PM, Daniel D. Daugherty wrote: > Thumbs up. I agree that this is a trivial change. Thanks. Pushed! -- -Aleksey From matthias.baesken at sap.com Mon Jan 27 07:58:24 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Mon, 27 Jan 2020 07:58:24 +0000 Subject: RFR [XXS]: 8237819: s390x - remove unused pd_zero_to_words_large In-Reply-To: References: Message-ID: Thanks for the review ! I removed the comma . Best regards, Matthias > -----Original Message----- > From: Langer, Christoph > Sent: Samstag, 25. Januar 2020 15:16 > To: Baesken, Matthias ; 'hotspot- > dev at openjdk.java.net' > Subject: RE: RFR [XXS]: 8237819: s390x - remove unused > pd_zero_to_words_large > > Hi Matthias, > > looks good. Small nit: The copyright year for SAP should read "Copyright (c) > 2016, 2020 SAP SE. All rights reserved.". There's no comma after 2020 as > opposed to the Oracle line. > > Best regards > Christoph From matthias.baesken at sap.com Mon Jan 27 12:47:12 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Mon, 27 Jan 2020 12:47:12 +0000 Subject: RFR: 8237830: support O_CLOEXEC in os::open on other OS than Linux Message-ID: Hello, please review my change that adds O_CLOEXEC in HS os::open for AIX and BSD/macOS . Change https://bugs.openjdk.java.net/browse/JDK-8043780 introduced the usage of O_CLOEXEC in os::open on Linux. 
We might do the same on other UNIX OS for example BSD/Mac and AIX (AIX 7.X has O_CLOEXEC). Bug/webrev : https://bugs.openjdk.java.net/browse/JDK-8237830 http://cr.openjdk.java.net/~mbaesken/webrevs/8237830.0/ Thanks, Matthias From david.lloyd at redhat.com Mon Jan 27 13:56:57 2020 From: david.lloyd at redhat.com (David Lloyd) Date: Mon, 27 Jan 2020 07:56:57 -0600 Subject: New candidate JEP: 371: Hidden Classes In-Reply-To: References: <20200124210734.D1173315210@eggemoggin.niobe.net> Message-ID: On Fri, Jan 24, 2020 at 4:56 PM David Holmes wrote: > > FYI. I'm a bit curious about this: > To invoke private nestmate instance methods from code in the hidden class, use invokevirtual or invokeinterface instead of invokespecial. Generated bytecode that uses invokespecial to invoke a private nestmate instance method will fail verification. invokespecial should only be used to invoke private nestmate constructors. This seems pretty unusual. Don't we normally use invokespecial for private method invocations? What is the reasoning for this? -- - DML From forax at univ-mlv.fr Mon Jan 27 14:31:38 2020 From: forax at univ-mlv.fr (Remi Forax) Date: Mon, 27 Jan 2020 15:31:38 +0100 (CET) Subject: New candidate JEP: 371: Hidden Classes In-Reply-To: References: <20200124210734.D1173315210@eggemoggin.niobe.net> Message-ID: <1721041649.427206.1580135498603.JavaMail.zimbra@u-pem.fr> ----- Mail original ----- > De: "David Lloyd" > À: "David Holmes" > Cc: "hotspot-dev" , "core-libs-dev" > Envoyé: Lundi 27 Janvier 2020 14:56:57 > Objet: Re: New candidate JEP: 371: Hidden Classes > On Fri, Jan 24, 2020 at 4:56 PM David Holmes wrote: >> >> FYI. > > I'm a bit curious about this: > >> To invoke private nestmate instance methods from code in the hidden class, use >> invokevirtual or invokeinterface instead of invokespecial. Generated bytecode >> that uses invokespecial to invoke a private nestmate instance method will fail >> verification.
invokespecial should only be used to invoke private nestmate >> constructors. > > This seems pretty unusual. Don't we normally use invokespecial for > private method invocations? What is the reasoning for this? It's the new usual since 11 and the introduction of nestmates; before, javac was generating an accessor method (the access$000 methods). class Foo { private void privateMethod() { } public static class Bar { public void m() { new Foo().privateMethod(); } } public static void main(String[] args) { new Bar().m(); } } The bytecode of 'm()' is public void m(); Code: 0: new #7 // class Foo 3: dup 4: invokespecial #9 // Method Foo."<init>":()V 7: invokevirtual #10 // Method Foo.privateMethod:()V 10: return When calling a method (this is different for a constructor), invokespecial semantics has not been extended to be able to call private methods of classes of the same nest; only invokevirtual/invokeinterface and invokestatic have that capability. The reason is that invokespecial semantics is already a patchwork of several semantics: you can call a constructor, a private method of the same class, and a method of the super class, and each semantics has its own verification rule. Given that invokevirtual already had to be extended, adding a new mode for invokespecial was not necessary. As a bit of trivia, if you look at the dex file used by Android, the opcode invokespecial doesn't exist; instead you have several other opcodes, one for each semantics.
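The nest relationship that makes the invokevirtual call above legal is also observable at runtime through the java.lang.Class API added in JDK 11. A small self-contained sketch (JDK 11+; the class names are illustrative, not from the JEP):

```java
public class NestDemo {
    private int answer() { return 42; }

    static class Inner {
        // Compiled with javac 11+, this private call is emitted as an
        // invokevirtual on NestDemo.answer (visible with javap -c), and
        // no synthetic access$000 bridge method is generated.
        int call() { return new NestDemo().answer(); }
    }

    public static void main(String[] args) {
        // Both classes report the same nest host, which is what lets the
        // JVM permit direct private member access between them.
        if (NestDemo.class.getNestHost() != NestDemo.Inner.class.getNestHost()) {
            throw new AssertionError("expected a common nest host");
        }
        if (!NestDemo.class.isNestmateOf(NestDemo.Inner.class)) {
            throw new AssertionError("expected nestmates");
        }
        System.out.println(new Inner().call()); // prints 42
    }
}
```

Running `javap -c NestDemo$Inner` on the compiled result is an easy way to confirm the invokevirtual that Remi describes.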
> > -- > - DML Rémi From mandy.chung at oracle.com Mon Jan 27 18:29:51 2020 From: mandy.chung at oracle.com (Mandy Chung) Date: Mon, 27 Jan 2020 10:29:51 -0800 Subject: New candidate JEP: 371: Hidden Classes In-Reply-To: <1721041649.427206.1580135498603.JavaMail.zimbra@u-pem.fr> References: <20200124210734.D1173315210@eggemoggin.niobe.net> <1721041649.427206.1580135498603.JavaMail.zimbra@u-pem.fr> Message-ID: <76d55980-1e9e-5f3c-1340-6784448496c1@oracle.com> On 1/27/20 6:31 AM, Remi Forax wrote: >> I'm a bit curious about this: >> >>> To invoke private nestmate instance methods from code in the hidden class, use >>> invokevirtual or invokeinterface instead of invokespecial. Generated bytecode >>> that uses invokespecial to invoke a private nestmate instance method will fail >>> verification. invokespecial should only be used to invoke private nestmate >>> constructors. >> This seems pretty unusual. Don't we normally use invokespecial for >> private method invocations? What is the reasoning for this? > It's the new usual since 11 and the introduction of nestmates, > before javac was generating an accessor method (the access$000 methods). > > : > As Remi said, this feature has been there since 11. You can reference JEP 181 for more details.
Mandy [1] https://openjdk.java.net/jeps/181 From david.lloyd at redhat.com Mon Jan 27 19:58:25 2020 From: david.lloyd at redhat.com (David Lloyd) Date: Mon, 27 Jan 2020 13:58:25 -0600 Subject: New candidate JEP: 371: Hidden Classes In-Reply-To: <76d55980-1e9e-5f3c-1340-6784448496c1@oracle.com> References: <20200124210734.D1173315210@eggemoggin.niobe.net> <1721041649.427206.1580135498603.JavaMail.zimbra@u-pem.fr> <76d55980-1e9e-5f3c-1340-6784448496c1@oracle.com> Message-ID: On Mon, Jan 27, 2020 at 12:30 PM Mandy Chung wrote: > > > > On 1/27/20 6:31 AM, Remi Forax wrote: > > I'm a bit curious about this: > > To invoke private nestmate instance methods from code in the hidden class, use > invokevirtual or invokeinterface instead of invokespecial. Generated bytecode > that uses invokespecial to invoke a private nestmate instance method will fail > verification. invokespecial should only be used to invoke private nestmate > constructors. > > This seems pretty unusual. Don't we normally use invokespecial for > private method invocations? What is the reasoning for this? > > It's the new usual since 11 and the introduction of nestmates, > before javac was generating an accessor method (the access$000 methods). > > : > > > > As Remi said, this feature has been there since 11. You can reference JEP 181 for more details. > > Mandy > [1] https://openjdk.java.net/jeps/181 This is very helpful, thanks! -- - DML From david.holmes at oracle.com Tue Jan 28 04:26:44 2020 From: david.holmes at oracle.com (David Holmes) Date: Tue, 28 Jan 2020 14:26:44 +1000 Subject: RFR: 8237830: support O_CLOEXEC in os::open on other OS than Linux In-Reply-To: References: Message-ID: Hi Matthias, On 27/01/2020 10:47 pm, Baesken, Matthias wrote: > Hello, please review my change that adds O_CLOEXEC in HS os::open for AIX and BSD/macOS . Do we know when O_CLOEXEC support was added to those platforms? Do we know that the flag is simply ignored if used on a platform prior to that support existing? 
Thanks, David > Change > > https://bugs.openjdk.java.net/browse/JDK-8043780 > > introduced the usage of O_CLOEXEC in os::open on Linux. > > We might do the same on other UNIX OS for example BSD/Mac and AIX (AIX 7.X has O_CLOEXEC). > > > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8237830 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8237830.0/ > > Thanks, Matthias > > From matthias.baesken at sap.com Tue Jan 28 08:22:41 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Tue, 28 Jan 2020 08:22:41 +0000 Subject: RFR: 8223699: cleanup perfMemory_aix.cpp O_NOFOLLOW coding on aix Message-ID: Hello, please review this AIX specific cleanup. In perfMemory_aix.cpp we still have O_NOFOLLOW related fallback coding for AIX version < 7.X. This is no longer needed, because we do not support any more AIX versions < 7.X in coming jdk13, so cleanup is possible. Bug/webrev : https://bugs.openjdk.java.net/browse/JDK-8223699 http://cr.openjdk.java.net/~mbaesken/webrevs/8223699.1/ Thanks, Matthias From matthias.baesken at sap.com Tue Jan 28 08:52:38 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Tue, 28 Jan 2020 08:52:38 +0000 Subject: RFR: 8237830: support O_CLOEXEC in os::open on other OS than Linux In-Reply-To: References: Message-ID: Hi David , on AIX I see O_CLOEXEC since AIX 7.1. Seems it was not supported in 6.1, at least I cannot find it there . Regarding macOS, this thread https://groups.google.com/forum/#!topic/golang-dev/7mmT7o9GYb4 claims it is there since 10.7 . Maybe someone can provide more detailed information ? Best regards, Matthias > > Hi Matthias, > On 27/01/2020 10:47 pm, Baesken, Matthias wrote: > > Hello, please review my change that adds O_CLOEXEC in HS os::open for > AIX and BSD/macOS . > > Do we know when O_CLOEXEC support was added to those platforms? Do > we > know that the flag is simply ignored if used on a platform prior to that > support existing? 
> > Thanks, > David > > > Change > > > > https://bugs.openjdk.java.net/browse/JDK-8043780 > > > > introduced the usage of O_CLOEXEC in os::open on Linux. > > > > We might do the same on other UNIX OS for example BSD/Mac and AIX > (AIX 7.X has O_CLOEXEC). > > > > > > Bug/webrev : > > > > https://bugs.openjdk.java.net/browse/JDK-8237830 > > > > http://cr.openjdk.java.net/~mbaesken/webrevs/8237830.0/ > > > > Thanks, Matthias > > > >
From martin.doerr at sap.com Tue Jan 28 10:34:40 2020 From: martin.doerr at sap.com (Doerr, Martin) Date: Tue, 28 Jan 2020 10:34:40 +0000 Subject: RFR: 8223699: cleanup perfMemory_aix.cpp O_NOFOLLOW coding on aix In-Reply-To: References: Message-ID:
Hi Matthias, I agree. According to https://wiki.openjdk.java.net/display/Build/Supported+Build+Platforms we require AIX 7.1 for JDK11 and 7.2 since JDK13. Early versions of these AIX releases were affected by the following O_NOFOLLOW issue: https://bugs.openjdk.java.net/browse/JDK-8223701 But the affected Tech Levels have already reached "End of Service Pack Support": https://www.ibm.com/support/pages/aix-support-lifecycle-information Only AIX 7.1 TL5 and AIX 7.2 TL2 (and later) are still supported. Change looks good. Best regards, Martin
> -----Original Message----- > From: hotspot-dev On Behalf Of > Baesken, Matthias > Sent: Dienstag, 28. Januar 2020 09:23 > To: 'hotspot-dev at openjdk.java.net' > Subject: RFR: 8223699: cleanup perfMemory_aix.cpp O_NOFOLLOW coding > on aix > > Hello, please review this AIX specific cleanup. > > In perfMemory_aix.cpp we still have O_NOFOLLOW related fallback coding > for AIX version < 7.X. > This is no longer needed, because we do not support any more AIX versions > < 7.X in coming jdk13, so cleanup is possible.
> > > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8223699 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8223699.1/ > > > Thanks, Matthias From matthias.baesken at sap.com Tue Jan 28 11:31:44 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Tue, 28 Jan 2020 11:31:44 +0000 Subject: RFR: 8237830: support O_CLOEXEC in os::open on other OS than Linux References: Message-ID: New webrev : http://cr.openjdk.java.net/~mbaesken/webrevs/8237830.1/ (one 'c' got lost somehow in the older webrev) Best regards, Matthias > > Hi David , on AIX I see O_CLOEXEC since AIX 7.1. Seems it was not > supported in 6.1, at least I cannot find it there . > > Regarding macOS, this thread > > https://groups.google.com/forum/#!topic/golang-dev/7mmT7o9GYb4 > > claims it is there since 10.7 . > Maybe someone can provide more detailed information ? > > Best regards, Matthias > > > From matthias.baesken at sap.com Tue Jan 28 12:51:46 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Tue, 28 Jan 2020 12:51:46 +0000 Subject: fopen vs. os::fopen and automatic closing of the file on exec Message-ID: Hello, I noticed while looking at https://bugs.openjdk.java.net/browse/JDK-8237830 ( support O_CLOEXEC in os::open on other OS than Linux ) that os::fopen also has some support for setting FD_CLOEXEC / O_CLOEXEC on the file opened . See : 1253// This function is a proxy to fopen, it tries to add a non standard flag ('e' or 'N') 1254// that ensures automatic closing of the file on exec. If it can not find support in 1255// the underlying c library, it will make an extra system call (fcntl) to ensure automatic 1256// closing of the file on exec. 
1257 FILE* os::fopen(const char* path, const char* mode) {
1258   char modified_mode[20];
1259   assert(strlen(mode) + 1 < sizeof(modified_mode), "mode chars plus one extra must fit in buffer");
1260   sprintf(modified_mode, "%s" LINUX_ONLY("e") BSD_ONLY("e") WINDOWS_ONLY("N"), mode);
1261   FILE* file = ::fopen(path, modified_mode);
1262
1263 #if !(defined LINUX || defined BSD || defined _WINDOWS)
1264   // assume fcntl FD_CLOEXEC support as a backup solution when 'e' or 'N'
1265   // is not supported as mode in fopen
1266   if (file != NULL) {
1267     int fd = fileno(file);
1268     if (fd != -1) {
1269       int fd_flags = fcntl(fd, F_GETFD);
1270       if (fd_flags != -1) {
1271         fcntl(fd, F_SETFD, fd_flags | FD_CLOEXEC);
1272       }
1273     }
1274   }
1275 #endif

However some questions arise here:

1. Usage: os::fopen is only used sometimes in HS code; should most of the calls to fopen be adjusted to os::fopen (see list below)?
2. ::fopen vs. ::fcntl: in os_linux os::open we try to set the "closing of the file on exec" flag when calling ::open, but we later check that it really worked, so we seem not to trust it fully. Should this be done here too for Linux? Or is that checking in os_linux os::open these days not needed any more?

Best regards, Matthias

Grep showed me these fopen-calls in HS; a lot do not go to os::fopen:
cpu/aarch64/vm_version_aarch64.cpp:171: if (FILE *f = fopen("/proc/cpuinfo", "r")) {
cpu/ppc/vm_version_ppc.cpp:412: FILE* fp = fopen(info_file, "r");
os/aix/os_aix.cpp:3756: // - might cause an fopen in the subprocess to fail on a system
os/aix/os_perf_aix.cpp:243: if ((f = fopen(procfile, "r")) == NULL) {
os/aix/os_perf_aix.cpp:666: if ((fp = fopen(buffer, "r")) != NULL) {
os/aix/os_perf_aix.cpp:694: if ((fp = fopen(buffer, "r")) != NULL) {
os/bsd/os_bsd.cpp:3479: // - might cause an fopen in the subprocess to fail on a system
os/linux/decoder_linux.cpp:61: FILE* file = fopen(filepath, "r");
os/linux/os_linux.cpp:257: if ((fh = fopen("/proc/stat", "r")) == NULL) {
os/linux/os_linux.cpp:364: FILE *fp = fopen(fname, "r");
os/linux/os_linux.cpp:1106: FILE *fp = fopen("/proc/self/maps", "r");
os/linux/os_linux.cpp:1218: fp = fopen("/proc/self/stat", "r");
os/linux/os_linux.cpp:2075: if ((procmapsFile = fopen("/proc/self/maps", "r")) != NULL) {
os/linux/os_linux.cpp:2244: FILE* fp = fopen(file, "r");
os/linux/os_linux.cpp:2458: FILE *fp = fopen("/proc/cpuinfo", "r");
os/linux/os_linux.cpp:2515: FILE* fp = fopen("/proc/cpuinfo", "r");
os/linux/os_linux.cpp:3645: FILE *fp = fopen("/proc/self/maps", "r");
os/linux/os_linux.cpp:3689: if ((f = fopen("/proc/self/coredump_filter", "r+")) == NULL) {
os/linux/os_linux.cpp:3741: FILE *fp = fopen("/proc/meminfo", "r");
os/linux/os_linux.cpp:5572: // - might cause an fopen in the subprocess to fail on a system
os/linux/os_linux.cpp:5797: fp = fopen(proc_name, "r");
os/linux/os_perf_linux.cpp:238: if ((f = fopen(procfile, "r")) == NULL) {
os/linux/os_perf_linux.cpp:275: if ((f = fopen("/proc/stat", "r")) == NULL) {
os/linux/os_perf_linux.cpp:726: if ((fp = fopen(buffer, "r")) != NULL) {
os/linux/os_perf_linux.cpp:754: if ((fp = fopen(buffer, "r")) != NULL) {
os/linux/perfMemory_linux.cpp:659: FILE *fp = fopen(fname, "r");
os/linux/gc/z/zPhysicalMemoryBacking_linux.cpp:312: FILE* const file = fopen(filename, "r");
os/linux/gc/z/zMountPoint_linux.cpp:72: FILE* fd = fopen(PROC_SELF_MOUNTINFO, "r");
os/linux/cgroupSubsystem_linux.cpp:66: cgroups = fopen("/proc/cgroups", "r");
os/linux/cgroupSubsystem_linux.cpp:121: cgroup = fopen("/proc/self/cgroup", "r");
os/linux/cgroupSubsystem_linux.cpp:170: mntinfo = fopen("/proc/self/mountinfo", "r");
os/linux/cgroupSubsystem_linux.cpp:224: mntinfo = fopen("/proc/self/mountinfo", "r");
os/linux/cgroupSubsystem_linux.hpp:96: fp = fopen(file, "r");
os/solaris/os_perf_solaris.cpp:521: if ((fp = fopen(buffer, "r")) != NULL) {
os/solaris/os_perf_solaris.cpp:557: if ((fp = fopen(psinfo_path, "r")) == NULL) {
os/solaris/os_solaris.cpp:1605: FILE* fp = fopen("/etc/release", "r");
os/solaris/os_solaris.cpp:4091: // fopen must be less than 256, _even_ when the first limit above
os/solaris/os_solaris.cpp:4092: // has been raised. This can cause calls to fopen (but not calls to
os/solaris/os_solaris.cpp:4094: // native code (although the JDK itself uses fopen). One can hardly
os/solaris/os_solaris.cpp:4103: // stdio fopen limit by calling function enable_extended_FILE_stdio.
os/solaris/os_solaris.cpp:4138: // - might cause an fopen in the subprocess to fail on a system
os_cpu/linux_sparc/vm_version_linux_sparc.cpp:39: FILE* fp = fopen("/proc/cpuinfo", "r");
share/logging/logFileOutput.cpp:272: _stream = os::fopen(_file_name, FileOpenMode);
share/logging/logFileOutput.cpp:361: _stream = os::fopen(_file_name, FileOpenMode);
share/runtime/arguments.cpp:1345: FILE* stream = fopen(file_name, "rb");
share/runtime/memprofiler.cpp:76: _log_fp = fopen(log_name , "w+");
share/runtime/os.cpp:1253:// This function is a proxy to fopen, it tries to add a non standard flag ('e' or 'N')
share/runtime/os.cpp:1257:FILE* os::fopen(const char* path, const char* mode) {
share/runtime/os.cpp:1261: FILE* file = ::fopen(path, modified_mode);
share/runtime/os.cpp:1265: // is not supported as mode in fopen
share/runtime/os.hpp:510: static FILE* fopen(const char* path, const char* mode);
share/runtime/abstract_vm_version.cpp:308: FILE* fp = fopen(filename, "r");
share/utilities/elfFile.cpp:171: _file = fopen(filepath, "r");
share/utilities/ostream.cpp:513: _file = fopen(file_name, "w");
share/utilities/ostream.cpp:523: _file = fopen(file_name, opentype);
share/adlc/main.cpp:363: (ADF._fp = fopen(ADF._name, action)) == NULL) {
share/c1/c1_Compilation.cpp:697: fileStream stream(fopen("c1_compile_only", "wt"));
share/c1/c1_Compilation.cpp:713: fileStream stream(fopen(".hotspot_compiler", "at"));
share/ci/ciReplay.cpp:127: _stream = fopen(filename, "rt");
share/classfile/classListParser.cpp:51: // Use os::open() because neither fopen() nor os::fopen()
share/compiler/compileBroker.cpp:1891: fp = fopen(file_name, "wt");
share/compiler/compilerOracle.cpp:703: FILE* stream = fopen(cc_file(), "rt");
share/compiler/disassembler.cpp:272: if ((fp = fopen(file, "r")) == NULL) {
From david.holmes at oracle.com Tue Jan 28 13:06:38 2020 From: david.holmes at oracle.com (David Holmes) Date: Tue, 28 Jan 2020 23:06:38 +1000 Subject: fopen vs.
os::fopen and automatic closing of the file on exec In-Reply-To: References: Message-ID: <462ea46a-cef8-79a0-ccf0-71699e619089@oracle.com> Hi Matthias, I don't have any info of most below but one follow up .... On 28/01/2020 10:51 pm, Baesken, Matthias wrote: > Hello, I noticed while looking at https://bugs.openjdk.java.net/browse/JDK-8237830 ( support O_CLOEXEC in os::open on other OS than Linux ) > that os::fopen also has some support for setting FD_CLOEXEC / O_CLOEXEC on the file opened . > See : > > 1253// This function is a proxy to fopen, it tries to add a non standard flag ('e' or 'N') > 1254// that ensures automatic closing of the file on exec. If it can not find support in > 1255// the underlying c library, it will make an extra system call (fcntl) to ensure automatic > 1256// closing of the file on exec. > 1257FILE* os::fopen(const char* path, const char* mode) { > 1258 char modified_mode[20]; > 1259 assert(strlen(mode) + 1 < sizeof(modified_mode), "mode chars plus one extra must fit in buffer"); > 1260 sprintf(modified_mode, "%s" LINUX_ONLY("e") BSD_ONLY("e") WINDOWS_ONLY("N"), mode); > 1261 FILE* file = ::fopen(path, modified_mode); > 1262 > 1263#if !(defined LINUX || defined BSD || defined _WINDOWS) > 1264 // assume fcntl FD_CLOEXEC support as a backup solution when 'e' or 'N' > 1265 // is not supported as mode in fopen > 1266 if (file != NULL) { > 1267 int fd = fileno(file); > 1268 if (fd != -1) { > 1269 int fd_flags = fcntl(fd, F_GETFD); > 1270 if (fd_flags != -1) { > 1271 fcntl(fd, F_SETFD, fd_flags | FD_CLOEXEC); > 1272 } > 1273 } > 1274 } > 1275#endif > > However some questions arise here : > > 1. Usage : os::fopen is only used sometimes in HS code , should most of the calls to fopen be adjusted to os::fopen (see list below ) > 2. ::fopen vs. 
::fcntl : is os_linux os::open we try to set the "closing of the file on exec" flag when calling ::open but we later check that it really worked so we seem not to trust it fully ; The check is for running on older Linuxes that do not support O_CLOEXEC - where the flag is ignored. That is why I asked about what happens on BSD/macOS and AIX in that situation. > Should this be done here too for Linux ? Or is that checking in os_linux os::open these days not needed any more ? It's possible the most recent version of Linux without O_CLOEXEC supported is no longer supported by OpenJDK, in which case we can remove it. But I'm not sure what that version is. I have no idea if fopen with "e" support has the same history as ::open and O_CLOEXEC. David > Best regards, Matthias > > > > Grep showed me these fopen-calls in HS, a lot go not to os::fopen ? > > cpu/aarch64/vm_version_aarch64.cpp:171: if (FILE *f = fopen("/proc/cpuinfo", "r")) { > cpu/ppc/vm_version_ppc.cpp:412: FILE* fp = fopen(info_file, "r"); > os/aix/os_aix.cpp:3756: // - might cause an fopen in the subprocess to fail on a system > os/aix/os_perf_aix.cpp:243: if ((f = fopen(procfile, "r")) == NULL) { > os/aix/os_perf_aix.cpp:666: if ((fp = fopen(buffer, "r")) != NULL) { > os/aix/os_perf_aix.cpp:694: if ((fp = fopen(buffer, "r")) != NULL) { > os/bsd/os_bsd.cpp:3479: // - might cause an fopen in the subprocess to fail on a system > os/linux/decoder_linux.cpp:61: FILE* file = fopen(filepath, "r"); > os/linux/os_linux.cpp:257: if ((fh = fopen("/proc/stat", "r")) == NULL) { > os/linux/os_linux.cpp:364: FILE *fp = fopen(fname, "r"); > os/linux/os_linux.cpp:1106: FILE *fp = fopen("/proc/self/maps", "r"); > os/linux/os_linux.cpp:1218: fp = fopen("/proc/self/stat", "r"); > os/linux/os_linux.cpp:2075: if ((procmapsFile = fopen("/proc/self/maps", "r")) != NULL) { > os/linux/os_linux.cpp:2244: FILE* fp = fopen(file, "r"); > os/linux/os_linux.cpp:2458: FILE *fp = fopen("/proc/cpuinfo", "r"); > os/linux/os_linux.cpp:2515: 
FILE* fp = fopen("/proc/cpuinfo", "r"); > os/linux/os_linux.cpp:3645: FILE *fp = fopen("/proc/self/maps", "r"); > os/linux/os_linux.cpp:3689: if ((f = fopen("/proc/self/coredump_filter", "r+")) == NULL) { > os/linux/os_linux.cpp:3741: FILE *fp = fopen("/proc/meminfo", "r"); > os/linux/os_linux.cpp:5572: // - might cause an fopen in the subprocess to fail on a system > os/linux/os_linux.cpp:5797: fp = fopen(proc_name, "r"); > os/linux/os_perf_linux.cpp:238: if ((f = fopen(procfile, "r")) == NULL) { > os/linux/os_perf_linux.cpp:275: if ((f = fopen("/proc/stat", "r")) == NULL) { > os/linux/os_perf_linux.cpp:726: if ((fp = fopen(buffer, "r")) != NULL) { > os/linux/os_perf_linux.cpp:754: if ((fp = fopen(buffer, "r")) != NULL) { > os/linux/perfMemory_linux.cpp:659: FILE *fp = fopen(fname, "r"); > os/linux/gc/z/zPhysicalMemoryBacking_linux.cpp:312: FILE* const file = fopen(filename, "r"); > os/linux/gc/z/zMountPoint_linux.cpp:72: FILE* fd = fopen(PROC_SELF_MOUNTINFO, "r"); > os/linux/cgroupSubsystem_linux.cpp:66: cgroups = fopen("/proc/cgroups", "r"); > os/linux/cgroupSubsystem_linux.cpp:121: cgroup = fopen("/proc/self/cgroup", "r"); > os/linux/cgroupSubsystem_linux.cpp:170: mntinfo = fopen("/proc/self/mountinfo", "r"); > os/linux/cgroupSubsystem_linux.cpp:224: mntinfo = fopen("/proc/self/mountinfo", "r"); > os/linux/cgroupSubsystem_linux.hpp:96: fp = fopen(file, "r"); > os/solaris/os_perf_solaris.cpp:521: if ((fp = fopen(buffer, "r")) != NULL) { > os/solaris/os_perf_solaris.cpp:557: if ((fp = fopen(psinfo_path, "r")) == NULL) { > os/solaris/os_solaris.cpp:1605: FILE* fp = fopen("/etc/release", "r"); > os/solaris/os_solaris.cpp:4091: // fopen must be less than 256, _even_ when the first limit above > os/solaris/os_solaris.cpp:4092: // has been raised. This can cause calls to fopen (but not calls to > os/solaris/os_solaris.cpp:4094: // native code (although the JDK itself uses fopen). 
One can hardly > os/solaris/os_solaris.cpp:4103: // stdio fopen limit by calling function enable_extended_FILE_stdio. > os/solaris/os_solaris.cpp:4138: // - might cause an fopen in the subprocess to fail on a system > os_cpu/linux_sparc/vm_version_linux_sparc.cpp:39: FILE* fp = fopen("/proc/cpuinfo", "r"); > share/logging/logFileOutput.cpp:272: _stream = os::fopen(_file_name, FileOpenMode); > share/logging/logFileOutput.cpp:361: _stream = os::fopen(_file_name, FileOpenMode); > share/runtime/arguments.cpp:1345: FILE* stream = fopen(file_name, "rb"); > share/runtime/memprofiler.cpp:76: _log_fp = fopen(log_name , "w+"); > share/runtime/os.cpp:1253:// This function is a proxy to fopen, it tries to add a non standard flag ('e' or 'N') > share/runtime/os.cpp:1257:FILE* os::fopen(const char* path, const char* mode) { > share/runtime/os.cpp:1261: FILE* file = ::fopen(path, modified_mode); > share/runtime/os.cpp:1265: // is not supported as mode in fopen > share/runtime/os.hpp:510: static FILE* fopen(const char* path, const char* mode); > share/runtime/abstract_vm_version.cpp:308: FILE* fp = fopen(filename, "r"); > share/utilities/elfFile.cpp:171: _file = fopen(filepath, "r"); > share/utilities/ostream.cpp:513: _file = fopen(file_name, "w"); > share/utilities/ostream.cpp:523: _file = fopen(file_name, opentype); > share/adlc/main.cpp:363: (ADF._fp = fopen(ADF._name, action)) == NULL) { > share/c1/c1_Compilation.cpp:697: fileStream stream(fopen("c1_compile_only", "wt")); > share/c1/c1_Compilation.cpp:713: fileStream stream(fopen(".hotspot_compiler", "at")); > share/ci/ciReplay.cpp:127: _stream = fopen(filename, "rt"); > share/classfile/classListParser.cpp:51: // Use os::open() because neither fopen() nor os::fopen() > share/compiler/compileBroker.cpp:1891: fp = fopen(file_name, "wt"); > share/compiler/compilerOracle.cpp:703: FILE* stream = fopen(cc_file(), "rt"); > share/compiler/disassembler.cpp:272: if ((fp = fopen(file, "r")) == NULL) { > From matthias.baesken at sap.com 
Tue Jan 28 13:44:19 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Tue, 28 Jan 2020 13:44:19 +0000 Subject: fopen vs. os::fopen and automatic closing of the file on exec In-Reply-To: <462ea46a-cef8-79a0-ccf0-71699e619089@oracle.com> References: <462ea46a-cef8-79a0-ccf0-71699e619089@oracle.com> Message-ID:

Hi David, thanks for looking into it. While checking other open(64) calls, I found the fileOpen / handleOpen calls in java.base. Looks like they so far miss setting FD_CLOEXEC / O_CLOEXEC; it isn't done in the callers of those functions, and it is also not (silently) added in the fileOpen/handleOpen function itself (like HS does in os::open). See:

73 FD
74 handleOpen(const char *path, int oflag, int mode) {
75     FD fd;
76     RESTARTABLE(open64(path, oflag, mode), fd);
77     if (fd != -1) {
78         struct stat64 buf64;
79         int result;
80         RESTARTABLE(fstat64(fd, &buf64), result);
81         if (result != -1) {
82             if (S_ISDIR(buf64.st_mode)) {
83                 close(fd);
84                 errno = EISDIR;
85                 fd = -1;
86             }
87         } else {
88             close(fd);
89             fd = -1;
90         }
91     }
92     return fd;
93 }

95 void
96 fileOpen(JNIEnv *env, jobject this, jstring path, jfieldID fid, int flags)
97 {
.....
107     fd = handleOpen(ps, flags, 0666);
....
122 }

56 JNIEXPORT void JNICALL
57 Java_java_io_FileOutputStream_open0(JNIEnv *env, jobject this,
58                                     jstring path, jboolean append) {
59     fileOpen(env, this, path, fos_fd,
60              O_WRONLY | O_CREAT | (append ? O_APPEND : O_TRUNC));
61 }

59 JNIEXPORT void JNICALL
60 Java_java_io_FileInputStream_open0(JNIEnv *env, jobject this, jstring path) {
61     fileOpen(env, this, path, fis_fd, O_RDONLY);
62 }

Is this something that should be changed too? Best regards, Matthias

> > Hi Matthias, > > I don't have any info of most below but one follow up ....
> > On 28/01/2020 10:51 pm, Baesken, Matthias wrote: > > Hello, I noticed while looking at > https://bugs.openjdk.java.net/browse/JDK-8237830 ( support O_CLOEXEC > in os::open on other OS than Linux ) > > that os::fopen also has some support for setting FD_CLOEXEC / > O_CLOEXEC on the file opened . > > See : > > > > 1253// This function is a proxy to fopen, it tries to add a non standard flag > ('e' or 'N') > > 1254// that ensures automatic closing of the file on exec. If it can not find > support in > > 1255// the underlying c library, it will make an extra system call (fcntl) to > ensure automatic > > 1256// closing of the file on exec. > > 1257FILE* os::fopen(const char* path, const char* mode) { > > 1258 char modified_mode[20]; > > 1259 assert(strlen(mode) + 1 < sizeof(modified_mode), "mode chars plus > one extra must fit in buffer"); > > 1260 sprintf(modified_mode, "%s" LINUX_ONLY("e") BSD_ONLY("e") > WINDOWS_ONLY("N"), mode); > > 1261 FILE* file = ::fopen(path, modified_mode); > > 1262 > > 1263#if !(defined LINUX || defined BSD || defined _WINDOWS) > > 1264 // assume fcntl FD_CLOEXEC support as a backup solution when 'e' or > 'N' > > 1265 // is not supported as mode in fopen > > 1266 if (file != NULL) { > > 1267 int fd = fileno(file); > > 1268 if (fd != -1) { > > 1269 int fd_flags = fcntl(fd, F_GETFD); > > 1270 if (fd_flags != -1) { > > 1271 fcntl(fd, F_SETFD, fd_flags | FD_CLOEXEC); > > 1272 } > > 1273 } > > 1274 } > > 1275#endif > > > > However some questions arise here : > > > > 1. Usage : os::fopen is only used sometimes in HS code , should > most of the calls to fopen be adjusted to os::fopen (see list below ) > > 2. ::fopen vs. ::fcntl : is os_linux os::open we try to set the "closing > of the file on exec" flag when calling ::open but we later check that it really > worked so we seem not to trust it fully ; > > The check is for running on older Linuxes that do not support O_CLOEXEC > - where the flag is ignored. 
That is why I asked about what happens on > BSD/macOS and AIX in that situation. > > > Should this be done here too for Linux ? Or is that checking in os_linux > os::open these days not needed any more ? > > It's possible the most recent version of Linux without O_CLOEXEC > supported is no longer supported by OpenJDK, in which case we can remove > it. But I'm not sure what that version is. I have no idea if fopen with > "e" support has the same history as ::open and O_CLOEXEC. > > David > From frederic.parain at oracle.com Tue Jan 28 14:09:06 2020 From: frederic.parain at oracle.com (Frederic Parain) Date: Tue, 28 Jan 2020 09:09:06 -0500 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: References: Message-ID: <3649812A-E1D6-43A5-88BE-DD3E392FA0EB@oracle.com> Coleen, Thank you for reviewing this change. New webrev: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/index.html My answers are inlined below. > On Jan 24, 2020, at 16:17, coleen.phillimore at oracle.com wrote: > > > http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/ci/ciInstanceKlass.hpp.udiff.html > > bool contains_field_offset(int offset) { > - return instanceOopDesc::contains_field_offset(offset, nonstatic_field_size()); > + fieldDescriptor fd; > + return this->get_instanceKlass()->find_field_from_offset(offset, false, &fd); > } > > > This has to go into the VM if it's going to access metadata, with VM_ENTRY_MARK, so probably belongs in the cpp file. Also, why doesn't this call contains_field_offset() from InstanceKlass? from here: > > http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/oops/instanceKlass.cpp.udiff.html > > http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/oops/instanceKlass.hpp.udiff.html Good catch! 
I've changed the code to:

bool ciInstanceKlass::contains_field_offset(int offset) {
  VM_ENTRY_MARK;
  return get_instanceKlass()->contains_field_offset(offset);
}

> > + public: > + enum { > + _extra_is_being_redefined = 1 << 0, // used for locking redefinition > + _extra_has_contended_annotations = 1 << 1 // has @Contended annotation > + }; > > > Why is this enum public? Also, I think you should make these misc_flags and make _flags a u4. There's already an alignment gap in InstanceKlass and we can file an RFE to fix that after this change.

I've extended _misc_flags to a u4 and moved all flags to it. Note: the type change impacts vmStruct and JVMCI.

> > http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/runtime/globals.hpp.udiff.html > > I think you should make UseNewLayout a diagnostic flag since we would like to remove the old code once it has gotten more testing. The only reason someone would use it would be to diagnose some differing behavior.

Done.

> > http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotResolvedObjectTypeImpl.java.udiff.html > > Someone who knows the compilers should have a peek at this. > > http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/classfile/fieldLayoutBuilder.hpp.html > > 37 // A RawBlock describes an element of a layout. > 38 // Each field is represented by a RawBlock. > > Thank you for changing RawBlock to LayoutRawBlock. Can you update the comments here and the description in the RFE?

Comments and RFE fixed.

> > 29 #include "classfile/classLoaderData.inline.hpp" > > > .hpp files shouldn't include .inline.hpp files. Whatever uses the inline code should go into the cpp file (or if critical, add an inline.hpp file).

Fixed.

> > 132 static const int INITIAL_LIST_SIZE; > > > It's odd to see this without an initialization.

I moved the initialization from the .cpp to the .hpp file.
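For background on the INITIAL_LIST_SIZE exchange above: C++ lets a static const member of integral type carry its initializer inside the class definition, which is why the value can move into the .hpp next to the declaration. A generic sketch (the class name is made up; this is not the FieldLayout code):

```cpp
// In-class initialization is legal for integral static const members,
// so the declaration and the value can live together in a header.
struct FieldListExample {
  static const int INITIAL_LIST_SIZE = 32;
};

// Before C++17, if the member is odr-used (e.g. passed by const
// reference or its address taken), one out-of-line definition is still
// required in exactly one .cpp file; it repeats no initializer:
const int FieldListExample::INITIAL_LIST_SIZE;
```

With C++17's inline variables the out-of-line definition becomes redundant, but keeping it remains well-formed.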
> > 227 class FieldLayoutBuilder : public ResourceObj { > > > FieldLayoutBuilder is a StackObj isn't it? > > It might be nice to line up the fields, which is part of the coding standard that I only somewhat agree on. Here it would enhance readability.

Done.

> > 264 protected: > > > Why are these functions "protected"? I don't see anything that inherits from FieldLayoutBuilder that might want to call these functions and not the other functions.

It's a leftover from a previous design. Changed to private.

> > http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/classfile/fieldLayoutBuilder.cpp.html > > 544 _contended_groups = new (ResourceObj::RESOURCE_AREA, mtInternal) GrowableArray(8); > > > Growable arrays are default ResourceObj, so I think the 'new' arguments aren't needed (there were a couple of these).

I've changed _contended_groups' type to GrowableArray instead of GrowableArray*. It removes the 'new' and simplifies the management of the _contended_groups field.

> > 698 OopMapBlocksBuilder* nonstatic_oop_maps = > 699 new OopMapBlocksBuilder(max_oop_map_count, Thread::current()); > > The code uses OopMapBlocksBuilder as a StackObj, which seems to make more sense. Can you make it StackObj and not have it allocate from resourceArea?

nonstatic_oop_maps is not used as a StackObj, but as a ResourceObj; it escapes the method at the end, when the FieldLayoutInfo is filled.

> > 531 void FieldLayoutBuilder::regular_field_sorting(TRAPS) { > > 635 regular_field_sorting(CHECK); > > None of these functions throw Java class exceptions. They shouldn't pass TRAPS and CHECK which are for Java exceptions. If you want to avoid JavaThread::current, you can pass JavaThread* thread as the first parameter.
Fixed

> I think for this change, it might be better to let this: > > + OopMapBlock* oop_maps_copy = NEW_RESOURCE_ARRAY_IN_THREAD(THREAD, OopMapBlock, > > be > > + OopMapBlock* oop_maps_copy = NEW_RESOURCE_ARRAY(OopMapBlock, > > > and get its own Thread::current(). I don't think it saves many instructions and definitely not any time.

Fixed

> > +void OopMapBlocksBuilder::compact(TRAPS) { > > > This shouldn't take a thread argument, so shouldn't be declared with TRAPS, see above.

Fixed

> > 748 if (PrintFieldLayout) { > > > In some future change, this should be turned into unified logging.

Regarding the complexity of the task (considering the amount of data that can be generated and the variations in the output format), I'd prefer to address this conversion in a separate changeset.

> > http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/classfile/classFileParser.cpp.udiff.html > > Why was AnnotationCollector moved to the header file? It seems only used by classFileParser.cpp?

I needed it here during some steps of the development, but this move is not required anymore; I moved the class back to the .cpp file.

> > This is a really nice change and your comments are really good. These are all pretty small comments.

> Thank you for this in-depth review. Fred

> Thanks, > Coleen > > On 1/23/20 10:03 AM, Frederic Parain wrote: >> Greetings, >> >> Please review this change proposing a new code to compute field layouts. >> >> CR: https://bugs.openjdk.java.net/browse/JDK-8237767 >> webrev: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/ >> >> The CR includes a detailed description of the motivation and the implementation. >> >> The current version keeps the old code accessible (with a VM flag) in case the >> small changes in computed layouts cause troubles to some applications or tools. >> The goal is to get rid of this old code, preferably as soon as possible. >> >> Testing tier1-8.
>> >> Thank you, >> >> Fred >> >
From leo.korinth at oracle.com Tue Jan 28 15:36:44 2020 From: leo.korinth at oracle.com (Leo Korinth) Date: Tue, 28 Jan 2020 16:36:44 +0100 Subject: fopen vs. os::fopen and automatic closing of the file on exec In-Reply-To: <462ea46a-cef8-79a0-ccf0-71699e619089@oracle.com> References: <462ea46a-cef8-79a0-ccf0-71699e619089@oracle.com> Message-ID: <76ff9efd-4274-29dd-c048-02e55aa70890@oracle.com>
On 28/01/2020 14:06, David Holmes wrote:
> Hi Matthias,
>
> I don't have any info of most below but one follow up ....
>
> On 28/01/2020 10:51 pm, Baesken, Matthias wrote:
>> Hello, I noticed while looking at
>> https://bugs.openjdk.java.net/browse/JDK-8237830 ( support O_CLOEXEC
>> in os::open on other OS than Linux )
>> that os::fopen also has some support for setting FD_CLOEXEC /
>> O_CLOEXEC on the file opened .
>> See :
>>
>> 1253// This function is a proxy to fopen, it tries to add a non
>> standard flag ('e' or 'N')
>> 1254// that ensures automatic closing of the file on exec. If it can
>> not find support in
>> 1255// the underlying c library, it will make an extra system call
>> (fcntl) to ensure automatic
>> 1256// closing of the file on exec.
>> 1257FILE* os::fopen(const char* path, const char* mode) {
>> 1258  char modified_mode[20];
>> 1259  assert(strlen(mode) + 1 < sizeof(modified_mode), "mode chars
>> plus one extra must fit in buffer");
>> 1260  sprintf(modified_mode, "%s" LINUX_ONLY("e") BSD_ONLY("e")
>> WINDOWS_ONLY("N"), mode);
>> 1261  FILE* file = ::fopen(path, modified_mode);
>> 1262
>> 1263#if !(defined LINUX || defined BSD || defined _WINDOWS)
>> 1264  // assume fcntl FD_CLOEXEC support as a backup solution when 'e'
>> or 'N'
>> 1265  // is not supported as mode in fopen
>> 1266  if (file != NULL) {
>> 1267    int fd = fileno(file);
>> 1268    if (fd != -1) {
>> 1269      int fd_flags = fcntl(fd, F_GETFD);
>> 1270      if (fd_flags != -1) {
>> 1271
fcntl(fd, F_SETFD, fd_flags | FD_CLOEXEC);
>> 1272      }
>> 1273    }
>> 1274  }
>> 1275#endif
>>
>> However some questions arise here :
>>
>>   1. Usage : os::fopen is only used sometimes in HS code
>> , should most of the calls to fopen be adjusted to os::fopen (see
>> list below )
>>   2. ::fopen vs. ::fcntl : in os_linux os::open we try
>> to set the "closing of the file on exec" flag when calling
>> ::open but we later check that it really worked so we seem not to
>> trust it fully ;
>
> The check is for running on older Linuxes that do not support O_CLOEXEC
> - where the flag is ignored. That is why I asked about what happens on
> BSD/macOS and AIX in that situation.
>
>> Should this be done here too for Linux ? Or is that checking in
>> os_linux os::open these days not needed any more ?
>
> It's possible the most recent version of Linux without O_CLOEXEC
> supported is no longer supported by OpenJDK, in which case we can remove
> it. But I'm not sure what that version is. I have no idea if fopen with
> "e" support has the same history as ::open and O_CLOEXEC.

"e" is supported since glibc 2.7, released in 2007. Any support of libc versions older than 2.7 today would surprise me.

Something that is not obvious is that on unix-like operating systems, ProcessImpl_md.c tries to close (most) open files between fork and exec. That is not the case for Windows (I opened https://bugs.openjdk.java.net/browse/JDK-8202720 for this). Thus (if I understand correctly) the impact on unix-like operating systems will be less for adding this support than it is for Windows.

os::fopen was created to solve a specific bug on windows (logging), and was renamed to the more generic os::fopen during review. I guess most uses of ::fopen /should/ use the more restricted os::fopen, but the gain would probably be small.

Thanks, Leo

> David
>
>> Best regards, Matthias
>>
>>
>>
>> Grep showed me these fopen-calls in HS, a lot go not to os::fopen ?
>> >> cpu/aarch64/vm_version_aarch64.cpp:171:  if (FILE *f = fopen("/proc/cpuinfo", "r")) {
>> cpu/ppc/vm_version_ppc.cpp:412:  FILE* fp = fopen(info_file, "r");
>> os/aix/os_aix.cpp:3756:  // - might cause an fopen in the subprocess to fail on a system
>> os/aix/os_perf_aix.cpp:243:  if ((f = fopen(procfile, "r")) == NULL) {
>> os/aix/os_perf_aix.cpp:666:  if ((fp = fopen(buffer, "r")) != NULL) {
>> os/aix/os_perf_aix.cpp:694:  if ((fp = fopen(buffer, "r")) != NULL) {
>> os/bsd/os_bsd.cpp:3479:  // - might cause an fopen in the subprocess to fail on a system
>> os/linux/decoder_linux.cpp:61:  FILE* file = fopen(filepath, "r");
>> os/linux/os_linux.cpp:257:  if ((fh = fopen("/proc/stat", "r")) == NULL) {
>> os/linux/os_linux.cpp:364:    FILE *fp = fopen(fname, "r");
>> os/linux/os_linux.cpp:1106:  FILE *fp = fopen("/proc/self/maps", "r");
>> os/linux/os_linux.cpp:1218:    fp = fopen("/proc/self/stat", "r");
>> os/linux/os_linux.cpp:2075:  if ((procmapsFile = fopen("/proc/self/maps", "r")) != NULL) {
>> os/linux/os_linux.cpp:2244:  FILE* fp = fopen(file, "r");
>> os/linux/os_linux.cpp:2458:  FILE *fp = fopen("/proc/cpuinfo", "r");
>> os/linux/os_linux.cpp:2515:  FILE* fp = fopen("/proc/cpuinfo", "r");
>> os/linux/os_linux.cpp:3645:    FILE *fp = fopen("/proc/self/maps", "r");
>> os/linux/os_linux.cpp:3689:  if ((f = fopen("/proc/self/coredump_filter", "r+")) == NULL) {
>> os/linux/os_linux.cpp:3741:  FILE *fp = fopen("/proc/meminfo", "r");
>> os/linux/os_linux.cpp:5572:  // - might cause an fopen in the subprocess to fail on a system
>> os/linux/os_linux.cpp:5797:  fp = fopen(proc_name, "r");
>> os/linux/os_perf_linux.cpp:238:  if ((f = fopen(procfile, "r")) == NULL) {
>> os/linux/os_perf_linux.cpp:275:  if ((f = fopen("/proc/stat", "r")) == NULL) {
>> os/linux/os_perf_linux.cpp:726:  if ((fp = fopen(buffer, "r")) != NULL) {
>> os/linux/os_perf_linux.cpp:754:  if ((fp = fopen(buffer, "r")) != NULL) {
>> os/linux/perfMemory_linux.cpp:659:  FILE *fp = fopen(fname, "r");
>> os/linux/gc/z/zPhysicalMemoryBacking_linux.cpp:312:  FILE* const file = fopen(filename, "r");
>> os/linux/gc/z/zMountPoint_linux.cpp:72:  FILE* fd = fopen(PROC_SELF_MOUNTINFO, "r");
>> os/linux/cgroupSubsystem_linux.cpp:66:  cgroups = fopen("/proc/cgroups", "r");
>> os/linux/cgroupSubsystem_linux.cpp:121:  cgroup = fopen("/proc/self/cgroup", "r");
>> os/linux/cgroupSubsystem_linux.cpp:170:    mntinfo = fopen("/proc/self/mountinfo", "r");
>> os/linux/cgroupSubsystem_linux.cpp:224:  mntinfo = fopen("/proc/self/mountinfo", "r");
>> os/linux/cgroupSubsystem_linux.hpp:96:  fp = fopen(file, "r");
>> os/solaris/os_perf_solaris.cpp:521:      if ((fp = fopen(buffer, "r")) != NULL) {
>> os/solaris/os_perf_solaris.cpp:557:  if ((fp = fopen(psinfo_path, "r")) == NULL) {
>> os/solaris/os_solaris.cpp:1605:  FILE* fp = fopen("/etc/release", "r");
>> os/solaris/os_solaris.cpp:4091:  //   fopen must be less than 256, _even_ when the first limit above
>> os/solaris/os_solaris.cpp:4092:  //   has been raised. This can cause calls to fopen (but not calls to
>> os/solaris/os_solaris.cpp:4094:  //   native code (although the JDK itself uses fopen). One can hardly
>> os/solaris/os_solaris.cpp:4103:  //   stdio fopen limit by calling function enable_extended_FILE_stdio.
>> os/solaris/os_solaris.cpp:4138:  // - might cause an fopen in the subprocess to fail on a system
>> os_cpu/linux_sparc/vm_version_linux_sparc.cpp:39:    FILE* fp = fopen("/proc/cpuinfo", "r");
>> share/logging/logFileOutput.cpp:272:  _stream = os::fopen(_file_name, FileOpenMode);
>> share/logging/logFileOutput.cpp:361:  _stream = os::fopen(_file_name, FileOpenMode);
>> share/runtime/arguments.cpp:1345:  FILE* stream = fopen(file_name, "rb");
>> share/runtime/memprofiler.cpp:76:    _log_fp = fopen(log_name, "w+");
>> share/runtime/os.cpp:1253:  // This function is a proxy to fopen, it tries to add a non standard flag ('e' or 'N')
>> share/runtime/os.cpp:1257:  FILE* os::fopen(const char* path, const char* mode) {
>> share/runtime/os.cpp:1261:  FILE* file = ::fopen(path, modified_mode);
>> share/runtime/os.cpp:1265:  // is not supported as mode in fopen
>> share/runtime/os.hpp:510:  static FILE* fopen(const char* path, const char* mode);
>> share/runtime/abstract_vm_version.cpp:308:  FILE* fp = fopen(filename, "r");
>> share/utilities/elfFile.cpp:171:  _file = fopen(filepath, "r");
>> share/utilities/ostream.cpp:513:  _file = fopen(file_name, "w");
>> share/utilities/ostream.cpp:523:  _file = fopen(file_name, opentype);
>> share/adlc/main.cpp:363:      (ADF._fp = fopen(ADF._name, action)) == NULL) {
>> share/c1/c1_Compilation.cpp:697:  fileStream stream(fopen("c1_compile_only", "wt"));
>> share/c1/c1_Compilation.cpp:713:  fileStream stream(fopen(".hotspot_compiler", "at"));
>> share/ci/ciReplay.cpp:127:    _stream = fopen(filename, "rt");
>> share/classfile/classListParser.cpp:51:  // Use os::open() because neither fopen() nor os::fopen()
>> share/compiler/compileBroker.cpp:1891:      fp = fopen(file_name, "wt");
>> share/compiler/compilerOracle.cpp:703:  FILE* stream = fopen(cc_file(), "rt");
>> share/compiler/disassembler.cpp:272:        if ((fp = fopen(file, "r")) == NULL) {
>> From thomas.stuefe at gmail.com Tue Jan 28 18:10:44 2020 From: thomas.stuefe at gmail.com (=?UTF-8?Q?Thomas_St=C3=BCfe?=) Date: Tue, 28 Jan 2020 19:10:44 +0100 Subject: fopen vs. os::fopen and automatic closing of the file on exec In-Reply-To: <76ff9efd-4274-29dd-c048-02e55aa70890@oracle.com> References: <462ea46a-cef8-79a0-ccf0-71699e619089@oracle.com> <76ff9efd-4274-29dd-c048-02e55aa70890@oracle.com> Message-ID: Hi Leo, On Tue, Jan 28, 2020 at 4:37 PM Leo Korinth wrote: ...
> Something that is not obvious is that on unix-like operating systems,
> ProcessImpl_md.c tries to close (most) open files between fork and exec.
> That is not the case for Windows (I opened
> https://bugs.openjdk.java.net/browse/JDK-8202720 for this). Thus (if I
> understand correctly) the impact on unix-like operating systems will be
> smaller when adding this support than it is for Windows. os::fopen was
> created to solve a specific bug on Windows (logging), and was renamed to
> the more generic os::fopen during review.
>
Just a note: it is still possible for file descriptors to escape into child processes, since you cannot guarantee that all forks happen via Runtime.exec():
- native third-party code may fork.
- hotspot may fork to run tools for error reporting.
Also note that the code which closes fds in Runtime.exec() may in theory fail to close all fds. So I think Matthias' suggestion makes sense on POSIX platforms as well.
> I guess most uses of ::fopen /should/ use the more restricted os::fopen,
> but the gain would probably be small.
>
> Thanks,
> Leo
> David
>
>> Best regards, Matthias
>>
>> Grep showed me these fopen-calls in HS; a lot go not to os::fopen:
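The fallback the quoted os::fopen code implements, fopen followed by an fcntl(F_SETFD) call for platforms where the non-standard 'e'/'N' mode flags may be unsupported, can be tried in isolation. The sketch below is a hedged, standalone version of that pattern; `fopen_cloexec` is an illustrative name, not the actual HotSpot function:

```cpp
#include <stdio.h>
#include <fcntl.h>

// Open a FILE* and ensure FD_CLOEXEC is set even when the C library does not
// understand glibc's "e" (or the Windows CRT's "N") mode flag. This mirrors
// the backup path in the quoted os::fopen; the name is hypothetical.
FILE* fopen_cloexec(const char* path, const char* mode) {
  FILE* file = fopen(path, mode);
  if (file != NULL) {
    int fd = fileno(file);
    if (fd != -1) {
      int fd_flags = fcntl(fd, F_GETFD);
      if (fd_flags != -1) {
        // FD_CLOEXEC makes the kernel close this descriptor across exec().
        fcntl(fd, F_SETFD, fd_flags | FD_CLOEXEC);
      }
    }
  }
  return file;
}
```

On Linux/BSD the "e" mode flag (glibc >= 2.7) or O_CLOEXEC to ::open achieves the same thing atomically; the fcntl path above is the portable backup, but note it leaves a window in which another thread could fork+exec before the flag is set.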
> >> [...] > >> > From
coleen.phillimore at oracle.com Tue Jan 28 18:57:41 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Tue, 28 Jan 2020 13:57:41 -0500 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: <3649812A-E1D6-43A5-88BE-DD3E392FA0EB@oracle.com> References: <3649812A-E1D6-43A5-88BE-DD3E392FA0EB@oracle.com> Message-ID: Fred, some more minor comments.

http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/src/hotspot/share/classfile/classFileParser.cpp.udiff.html

+#include

This has brackets.

http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/src/hotspot/share/classfile/classFileParser.hpp.udiff.html

OopMapBlockBuilder and FieldLayoutInfo field names should probably start with _ (preexisting problem but since you moved it, it should follow the coding style).

+#include "oops/instanceKlass.hpp"

This is out of alphabetical order.

http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/src/hotspot/share/classfile/fieldLayoutBuilder.cpp.html

My comment about TRAPS/CHECK also applied to compute_regular_layout, compute_java_lang_ref_Reference_layout, compute_boxing_class_layout and build_layout too. None of these throw C++ exceptions and appear to need a THREAD parameter passed in.

http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/test/hotspot/jtreg/runtime/FieldLayout/FieldDensityTest.java.html

I think this needs @bug 8237767

Some small comments below.

On 1/28/20 9:09 AM, Frederic Parain wrote:
> Coleen,
>
> Thank you for reviewing this change.
>
> New webrev:
> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/index.html
>
> My answers are inlined below.
> >> On Jan 24, 2020, at 16:17, coleen.phillimore at oracle.com wrote:
>>
>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/ci/ciInstanceKlass.hpp.udiff.html
>>
>> bool contains_field_offset(int offset) {
>> - return instanceOopDesc::contains_field_offset(offset, nonstatic_field_size());
>> + fieldDescriptor fd;
>> + return this->get_instanceKlass()->find_field_from_offset(offset, false, &fd);
>> }
>>
>> This has to go into the VM if it's going to access metadata, with VM_ENTRY_MARK, so probably belongs in the cpp file. Also, why doesn't this call contains_field_offset() from InstanceKlass? from here:
>>
>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/oops/instanceKlass.cpp.udiff.html
>>
>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/oops/instanceKlass.hpp.udiff.html
> Good catch! I've changed the code to:
>
> bool ciInstanceKlass::contains_field_offset(int offset) {
>   VM_ENTRY_MARK;
>   return get_instanceKlass()->contains_field_offset(offset);
> }
>
>> + public:
>> + enum {
>> +   _extra_is_being_redefined        = 1 << 0, // used for locking redefinition
>> +   _extra_has_contended_annotations = 1 << 1  // has @Contended annotation
>> + };
>> +
>>
>> Why is this enum public? Also, I think you should make these misc_flags and make _flags a u4. There's already an alignment gap in InstanceKlass and we can file an RFE to fix that after this change.
> I've extended _misc_flags to a u4 and moved all flags to it.
> Note: the type change impacts vmStruct and JVMCI.

Great. I'll file a new bug to reduce the alignment gaps.
>
>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/runtime/globals.hpp.udiff.html
>>
>> I think you should make UseNewLayout a diagnostic flag since we would like to remove the old code once it has gotten more testing. The only reason someone would use it would be to diagnose some differing behavior.
> Done.
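The _misc_flags change discussed above, several booleans packed into one u4-sized field instead of separate members, follows a common pattern that can be sketched in isolation. The class and member names below are hypothetical stand-ins, not the actual InstanceKlass code:

```cpp
#include <cstdint>

// Several boolean properties stored in the bits of one 32-bit (u4-sized)
// field. Each flag gets a mask; accessors test and set individual bits.
// Names here are illustrative only.
class KlassFlags {
 private:
  uint32_t _misc_flags;

  enum {
    _misc_is_being_redefined        = 1 << 0,  // used for locking redefinition
    _misc_has_contended_annotations = 1 << 1   // has @Contended annotation
  };

 public:
  KlassFlags() : _misc_flags(0) {}

  bool is_being_redefined() const {
    return (_misc_flags & _misc_is_being_redefined) != 0;
  }
  void set_is_being_redefined(bool b) {
    if (b) {
      _misc_flags |= _misc_is_being_redefined;
    } else {
      _misc_flags &= ~static_cast<uint32_t>(_misc_is_being_redefined);
    }
  }
  bool has_contended_annotations() const {
    return (_misc_flags & _misc_has_contended_annotations) != 0;
  }
  void set_has_contended_annotations(bool b) {
    if (b) {
      _misc_flags |= _misc_has_contended_annotations;
    } else {
      _misc_flags &= ~static_cast<uint32_t>(_misc_has_contended_annotations);
    }
  }
};
```

The win is layout-related, as in the review above: widening one existing flags field to u4 and folding separate bools into it avoids adding new padding gaps to the object.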
>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotResolvedObjectTypeImpl.java.udiff.html
>>
>> Someone who knows the compilers should have a peek at this.
>>
>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/classfile/fieldLayoutBuilder.hpp.html
>>
>> 37 // A RawBlock describes an element of a layout.
>> 38 // Each field is represented by a RawBlock.
>>
>> Thank you for changing RawBlock to LayoutRawBlock. Can you update the comments here and the description in the RFE?
> Comments and RFE fixed.
>
>> 29 #include "classfile/classLoaderData.inline.hpp"
>>
>> .hpp files shouldn't include .inline.hpp files. Whatever uses the inline code should go into the cpp file (or if critical, add an inline.hpp file).
> Fixed.
>
>> 132 static const int INITIAL_LIST_SIZE;
>>
>> It's odd to see this without an initialization.
> I moved the initialization from the .cpp to the .hpp file.
>
>> 227 class FieldLayoutBuilder : public ResourceObj {
>>
>> FieldLayoutBuilder is a StackObj isn't it?
>>
>> It might be nice to line up the fields, which is part of the coding standard that I only somewhat agree on. Here it would enhance readability.
> Done.
>
>> 264 protected:
>>
>> Why are these functions "protected"? I don't see anything that inherits from FieldLayoutBuilder that might want to call these functions and not the other functions.
> It's a leftover from a previous design.
> Changed to private.
>
>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/classfile/fieldLayoutBuilder.cpp.html
>>
>> 544 _contended_groups = new (ResourceObj::RESOURCE_AREA, mtInternal) GrowableArray(8);
>>
>> Growable arrays are default ResourceObj, so I think the 'new' arguments aren't needed (there were a couple of these).
> I've changed _contended_groups' type to GrowableArray instead of GrowableArray*
> It removes the 'new'
and simplifies the management of the _contended_groups field.
>
>> 698 OopMapBlocksBuilder* nonstatic_oop_maps =
>> 699   new OopMapBlocksBuilder(max_oop_map_count, Thread::current());
>>
>> The code uses OopMapBlocksBuilder as a StackObj, which seems to make more sense. Can you make it StackObj and not have it allocate from resourceArea?
> nonstatic_oop_maps is not used as a StackObj, but as a ResourceObj; it escapes the method at the end,
> when the FieldLayoutInfo is filled.

Ok.
>
>> 531 void FieldLayoutBuilder::regular_field_sorting(TRAPS) {
>>
>> 635 regular_field_sorting(CHECK);
>>
>> None of these functions throw Java class exceptions. They shouldn't pass TRAPS and CHECK which are for Java exceptions. If you want to avoid JavaThread::current, you can pass JavaThread* thread as the first parameter.
> Fixed
>
>> I think for this change, it might be better to let this:
>>
>> + OopMapBlock* oop_maps_copy = NEW_RESOURCE_ARRAY_IN_THREAD(THREAD, OopMapBlock,
>>
>> be
>>
>> + OopMapBlock* oop_maps_copy = NEW_RESOURCE_ARRAY(OopMapBlock,
>>
>> and get its own Thread::current(). I don't think it saves many instructions and definitely not any time.
> Fixed
>
>> +void OopMapBlocksBuilder::compact(TRAPS) {
>>
>> This shouldn't take a thread argument, so shouldn't be declared with TRAPS, see above.
> Fixed
>
>> 748 if (PrintFieldLayout) {
>>
>> In some future change, this should be turned into unified logging.
> Regarding the complexity of the task (considering the amount of data that can be generated
> and the variations in the output format), I'd prefer to address this conversion in a separate
> changeset.

Definitely should be a follow-up RFE. This was one Print* flag that we weren't sure if we wanted to convert in the first pass, so it should be looked at on its own.
>
>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/classfile/classFileParser.cpp.udiff.html
>>
>> Why was AnnotationCollector moved to the header file?
It seems only used by classFileParser.cpp?
>
> I needed it here during some steps of the development, but this move is not required anymore;
> I moved the class back to the .cpp file.

Great.
>
>> This is a really nice change and your comments are really good. These are all pretty small comments.
>
> Thank you for this in-depth review.

Thanks for making the changes.

Coleen
>
> Fred
>
>> Thanks,
>> Coleen
>>
>> On 1/23/20 10:03 AM, Frederic Parain wrote:
>>> Greetings,
>>>
>>> Please review this change proposing new code to compute field layouts.
>>>
>>> CR: https://bugs.openjdk.java.net/browse/JDK-8237767
>>> webrev: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/
>>>
>>> The CR includes a detailed description of the motivation and the implementation.
>>>
>>> The current version keeps the old code accessible (with a VM flag) in case the
>>> small changes in computed layouts cause trouble to some applications or tools.
>>> The goal is to get rid of this old code, preferably as soon as possible.
>>>
>>> Testing tier1-8.
>>>
>>> Thank you,
>>>
>>> Fred
>>> From frederic.parain at oracle.com Tue Jan 28 21:30:18 2020 From: frederic.parain at oracle.com (Frederic Parain) Date: Tue, 28 Jan 2020 16:30:18 -0500 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: References: <3649812A-E1D6-43A5-88BE-DD3E392FA0EB@oracle.com> Message-ID: <66190B15-943A-4A74-B2C5-453FDCD56FDA@oracle.com> Coleen, Thank you for this second review:
> On Jan 28, 2020, at 13:57, coleen.phillimore at oracle.com wrote:
>
> Fred, some more minor comments.
>
> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/src/hotspot/share/classfile/classFileParser.cpp.udiff.html
>
> +#include
> This has brackets.
Fixed.
> > http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/src/hotspot/share/classfile/classFileParser.hpp.udiff.html > > OopMapBlockBuilder and FieldLayoutInfo field names should probably start with _ (preexisting problem but since you moved it, it should follow the coding style). > Fixed. > +#include "oops/instanceKlass.hpp" > > This is out of alphabetical order. > Fixed > > http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/src/hotspot/share/classfile/fieldLayoutBuilder.cpp.html > > My comment about TRAPS/CHECK also applied to compute_regular_layout, compute_java_lang_ref_Reference_layout, compute_boxing_class_layout and build_layout too. None of these throw C++ exceptions and appear to need a THREAD parameter passed in. > I missed the scope of your comment in your first review. Fixed now, none of the methods in fieldLayoutBuilder.[ch]pp is using TRAPS/CHECK now. > http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/test/hotspot/jtreg/runtime/FieldLayout/FieldDensityTest.java.html > > I think this needs @bug 8237767 Added. New webrev with changes above and updated copyright years: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.07/index.html Thank you, Fred > > On 1/28/20 9:09 AM, Frederic Parain wrote: >> Coleen, >> >> Thank you for reviewing this change. >> >> New webrev: >> >> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/index.html >> >> >> My answers are inlined below. 
>>>> [...]
>>>> >>>> Thank you, >>>> Fred >>>> >>>> > From coleen.phillimore at oracle.com Tue Jan 28 21:58:43 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Tue, 28 Jan 2020 21:58:43 +0000 (UTC) Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: <66190B15-943A-4A74-B2C5-453FDCD56FDA@oracle.com> References: <3649812A-E1D6-43A5-88BE-DD3E392FA0EB@oracle.com> <66190B15-943A-4A74-B2C5-453FDCD56FDA@oracle.com> Message-ID: <910ad01f-5c55-aa11-41d6-d220f645d800@oracle.com> Looks great! Thanks!

Coleen

On 1/28/20 4:30 PM, Frederic Parain wrote:
> Coleen,
>
> Thank you for this second review:
>
>> On Jan 28, 2020, at 13:57, coleen.phillimore at oracle.com wrote:
>>
>> Fred, some more minor comments.
>>
>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/src/hotspot/share/classfile/classFileParser.cpp.udiff.html
>>
>> +#include
>> This has brackets.
> Fixed.
>
>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/src/hotspot/share/classfile/classFileParser.hpp.udiff.html
>>
>> OopMapBlockBuilder and FieldLayoutInfo field names should probably start with _ (preexisting problem but since you moved it, it should follow the coding style).
>>
> Fixed.
>
>> +#include "oops/instanceKlass.hpp"
>>
>> This is out of alphabetical order.
>>
> Fixed
>
>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/src/hotspot/share/classfile/fieldLayoutBuilder.cpp.html
>>
>> My comment about TRAPS/CHECK also applied to compute_regular_layout, compute_java_lang_ref_Reference_layout, compute_boxing_class_layout and build_layout too. None of these throw C++ exceptions and appear to need a THREAD parameter passed in.
>>
> I missed the scope of your comment in your first review.
> Fixed now, none of the methods in fieldLayoutBuilder.[ch]pp is using TRAPS/CHECK now.
> >> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/test/hotspot/jtreg/runtime/FieldLayout/FieldDensityTest.java.html >> >> I think this needs @bug 8237767 > Added. > > New webrev with changes above and updated copyright years: > > http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.07/index.html > > Thank you, > > Fred > > >> On 1/28/20 9:09 AM, Frederic Parain wrote: >>> Coleen, >>> >>> Thank you for reviewing this change. >>> >>> New webrev: >>> >>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.06/index.html >>> >>> >>> My answers are inlined below. >>> >>> >>>> On Jan 24, 2020, at 16:17, coleen.phillimore at oracle.com >>>> wrote: >>>> >>>> >>>> >>>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/ci/ciInstanceKlass.hpp.udiff.html >>>> >>>> >>>> bool contains_field_offset(int offset) { >>>> - return instanceOopDesc::contains_field_offset(offset, nonstatic_field_size()); >>>> + fieldDescriptor fd; >>>> + return this->get_instanceKlass()->find_field_from_offset(offset, false, &fd); >>>> } >>>> >>>> >>>> This has to go into the VM if it's going to access metadata, with VM_ENTRY_MARK, so probably belongs in the cpp file. Also, why doesn't this call contains_field_offset() from InstanceKlass? from here: >>>> >>>> >>>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/oops/instanceKlass.cpp.udiff.html >>>> >>>> >>>> >>>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/oops/instanceKlass.hpp.udiff.html >>> Good catch! I?ve changed the code to: >>> >>> bool ciInstanceKlass::contains_field_offset(int offset) { >>> VM_ENTRY_MARK; >>> return get_instanceKlass()->contains_field_offset(offset); >>> } >>> >>> >>>> + public: >>>> + enum { >>>> + _extra_is_being_redefined = 1 << 0, // used for locking redefinition >>>> + _extra_has_contended_annotations = 1 << 1 // has @Contended annotation >>>> + }; >>>> + >>>> >>>> >>>> Why is this enum public? 
Also, I think you should make these misc_flags and make _flags a u4. There's already an alignment gap in InstanceKlass and we can file an RFE to fix that after this change. >>>> >>> I?ve extended _misc_flags to a u4 and moved all flags to it. >>> Note: the type change impacts vmStruct and JVMCI. >>> >> Great. I'll file a new bug to reduce the alignment gaps. >>>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/runtime/globals.hpp.udiff.html >>>> >>>> >>>> I think you should make UseNewLayout a diagnostic flag since we would like to remove the old code once it has gotten more testing. The only reason someone would use it would be to diagnose some differing behavior. >>>> >>> Done. >>> >>> >>>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotResolvedObjectTypeImpl.java.udiff.html >>>> >>>> >>>> Someone who knows the compilers should have a peek at this. >>>> >>>> >>>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/classfile/fieldLayoutBuilder.hpp.html >>>> >>>> >>>> 37 // A RawBlock describes an element of a layout. >>>> 38 // Each field is represented by a RawBlock. >>>> >>>> Thank you for changing RawBlock to LayoutRawBlock. Can you update the comments here and the description in the RFE? >>>> >>> Comments and RFE fixed. >>> >>> >>>> 29 #include "classfile/classLoaderData.inline.hpp" >>>> >>>> >>>> .hpp files shouldn't include .inline.hpp files. Whatever uses the inline code should go into the cpp file (or if critical, add an inline.hpp file). >>>> >>> Fixed. >>> >>> >>>> 132 static const int INITIAL_LIST_SIZE; >>>> >>>> >>>> It's odd to see this without an initialization. >>>> >>> I moved the initialization from the .cpp to the .hpp file. >>> >>> >>>> 227 class FieldLayoutBuilder : public ResourceObj { >>>> >>>> >>>> FieldLayoutBuilder is a StackObj isn't it? 
>>>> >>>> It might be nice to line up the fields, which is part of the coding standard that I only somewhat agree on. Here it would enhance readability. >>>> >>> Done. >>> >>> >>>> 264 protected: >>>> >>>> >>>> Why are these functions "protected"? I don't see anything that inherits from FieldLayoutBuilder that might want to call these functions and not the other functions. >>>> >>> It's a leftover from a previous design. >>> Changed to private. >>> >>> >>>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/classfile/fieldLayoutBuilder.cpp.html >>>> >>>> >>>> 544 _contended_groups = new (ResourceObj::RESOURCE_AREA, mtInternal) GrowableArray(8); >>>> >>>> >>>> Growable arrays are default ResourceObj, so I think the 'new' arguments aren't needed (there were a couple of these). >>>> >>> I've changed _contended_groups' type to GrowableArray instead of GrowableArray* >>> It removes the 'new' and simplifies the management of the _contended_groups field. >>> >>> >>>> 698 OopMapBlocksBuilder* nonstatic_oop_maps = >>>> 699 new OopMapBlocksBuilder(max_oop_map_count, Thread::current()); >>>> >>>> The code uses OopMapBlocksBuilder as a StackObj, which seems to make more sense. Can you make it StackObj and not have it allocate from resourceArea? >>>> >>> nonstatic_oop_maps is not used as a StackObj but as a ResourceObj; it escapes the method at the end, >>> when the FieldLayoutInfo is filled. >>> >> Ok. >>>> 531 void FieldLayoutBuilder::regular_field_sorting(TRAPS) { >>>> >>>> 635 regular_field_sorting(CHECK); >>>> >>>> None of these functions throw Java class exceptions. They shouldn't pass TRAPS and CHECK, which are for Java exceptions. If you want to avoid JavaThread::current, you can pass JavaThread* thread as the first parameter. 
>>>> >>> Fixed >>> >>> >>>> I think for this change, it might be better to let this: >>>> >>>> + OopMapBlock* oop_maps_copy = NEW_RESOURCE_ARRAY_IN_THREAD(THREAD, OopMapBlock, >>>> >>>> be >>>> >>>> + OopMapBlock* oop_maps_copy = NEW_RESOURCE_ARRAY(OopMapBlock, >>>> >>>> >>>> and get it's own Thread::current(). I don't think it saves many instructions and definitely not any time. >>>> >>> Fixed >>> >>> >>>> +void OopMapBlocksBuilder::compact(TRAPS) { >>>> >>>> >>>> This shouldn't take a thread argument, so shouldn't be declared with TRAPS, see above. >>>> >>> Fixed >>> >>> >>>> 748 if (PrintFieldLayout) { >>>> >>>> >>>> In some future change, this should be turned into unified logging. >>>> >>> Regarding the complexity of the task (considering the amount of data that can be generated >>> and the variations in the output format), I?d prefer to address this conversion in a separate >>> changeset. >>> >> Definitely should be a follow-up RFE. This was one Print* flag that we weren't sure if we wanted to convert in the first pass, so it should be looked at on its own. >> >>>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/src/hotspot/share/classfile/classFileParser.cpp.udiff.html >>>> >>>> >>>> Why was AnnotationCollector moved to the header file? It seems only used by classFileParser.cpp? >>>> >>> I needed it here during some steps of the development, but this move is not required anymore, >>> I moved the class back to the .cpp file. >>> >> Great. >>>> This is a really nice change and your comments are really good. These are all pretty small comments. >>>> >>>> >>> Thank you for this in depth review. >> Thanks for making the changes. >> >> Coleen >> >>> Fred >>> >>> >>>> Thanks, >>>> Coleen >>>> >>>> On 1/23/20 10:03 AM, Frederic Parain wrote: >>>> >>>>> Greetings, >>>>> >>>>> Please review this change proposing a new code to compute field layouts. 
>>>>> >>>>> CR: >>>>> https://bugs.openjdk.java.net/browse/JDK-8237767 >>>>> >>>>> webrev: >>>>> http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.04/ >>>>> >>>>> >>>>> The CR includes a detailed description of the motivation and the implementation. >>>>> >>>>> The current version keeps the old code accessible (with a VM flag) in case the >>>>> small changes in computed layouts cause troubles to some applications or tools. >>>>> The goal is to get rid of this old code, preferably as soon as possible. >>>>> >>>>> Testing tier1-8. >>>>> >>>>> Thank you, >>>>> >>>>> Fred >>>>> >>>>> From david.holmes at oracle.com Tue Jan 28 23:27:03 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 29 Jan 2020 09:27:03 +1000 Subject: fopen vs. os::fopen and automatic closing of the file on exec In-Reply-To: References: <462ea46a-cef8-79a0-ccf0-71699e619089@oracle.com> Message-ID: <852b12ce-8a80-eba0-4b6c-ca13a487d7c3@oracle.com> Hi Matthias, On 28/01/2020 11:44 pm, Baesken, Matthias wrote: > Hi David, thanks for looking into it . > > While checking other open(64) calls, I found the fileOpen / handleOpen calls in java.base . > Looks like they miss so far setting FD_CLOEXEC / O_CLOEXEC , it isn?t done in the callers of those functions and it is also not (silently) added in the fileOpen/handleOpen function itself (like HS does in os::open ). > See : > > 73FD > 74handleOpen(const char *path, int oflag, int mode) { > 75 FD fd; > 76 RESTARTABLE(open64(path, oflag, mode), fd); > 77 if (fd != -1) { > 78 struct stat64 buf64; > 79 int result; > 80 RESTARTABLE(fstat64(fd, &buf64), result); > 81 if (result != -1) { > 82 if (S_ISDIR(buf64.st_mode)) { > 83 close(fd); > 84 errno = EISDIR; > 85 fd = -1; > 86 } > 87 } else { > 88 close(fd); > 89 fd = -1; > 90 } > 91 } > 92 return fd; > 93} > > 95void > 96fileOpen(JNIEnv *env, jobject this, jstring path, jfieldID fid, int flags) > 97{ > ..... > 107 fd = handleOpen(ps, flags, 0666); > .... 
> 122} > > 56JNIEXPORT void JNICALL > 57Java_java_io_FileOutputStream_open0(JNIEnv *env, jobject this, > 58 jstring path, jboolean append) { > 59 fileOpen(env, this, path, fos_fd, > 60 O_WRONLY | O_CREAT | (append ? O_APPEND : O_TRUNC)); > 61} > > 59JNIEXPORT void JNICALL > 60Java_java_io_FileInputStream_open0(JNIEnv *env, jobject this, jstring path) { > 61 fileOpen(env, this, path, fis_fd, O_RDONLY); > 62} > > > Is this something that should be changed too ? I think not. Unless those public API's are specified to open a file in a specific mode, like close-on-exec, then they should not do that by default. We should only be doing close-on-exec on internally used files. David ----- > Best regards, Matthias > > >> >> Hi Matthias, >> >> I don't have any info of most below but one follow up .... >> >> On 28/01/2020 10:51 pm, Baesken, Matthias wrote: >>> Hello, I noticed while looking at >> https://bugs.openjdk.java.net/browse/JDK-8237830 ( support O_CLOEXEC >> in os::open on other OS than Linux ) >>> that os::fopen also has some support for setting FD_CLOEXEC / >> O_CLOEXEC on the file opened . >>> See : >>> >>> 1253// This function is a proxy to fopen, it tries to add a non standard flag >> ('e' or 'N') >>> 1254// that ensures automatic closing of the file on exec. If it can not find >> support in >>> 1255// the underlying c library, it will make an extra system call (fcntl) to >> ensure automatic >>> 1256// closing of the file on exec. 
>>> 1257FILE* os::fopen(const char* path, const char* mode) { >>> 1258 char modified_mode[20]; >>> 1259 assert(strlen(mode) + 1 < sizeof(modified_mode), "mode chars plus >> one extra must fit in buffer"); >>> 1260 sprintf(modified_mode, "%s" LINUX_ONLY("e") BSD_ONLY("e") >> WINDOWS_ONLY("N"), mode); >>> 1261 FILE* file = ::fopen(path, modified_mode); >>> 1262 >>> 1263#if !(defined LINUX || defined BSD || defined _WINDOWS) >>> 1264 // assume fcntl FD_CLOEXEC support as a backup solution when 'e' or >> 'N' >>> 1265 // is not supported as mode in fopen >>> 1266 if (file != NULL) { >>> 1267 int fd = fileno(file); >>> 1268 if (fd != -1) { >>> 1269 int fd_flags = fcntl(fd, F_GETFD); >>> 1270 if (fd_flags != -1) { >>> 1271 fcntl(fd, F_SETFD, fd_flags | FD_CLOEXEC); >>> 1272 } >>> 1273 } >>> 1274 } >>> 1275#endif >>> >>> However some questions arise here : >>> >>> 1. Usage : os::fopen is only used sometimes in HS code , should >> most of the calls to fopen be adjusted to os::fopen (see list below ) >>> 2. ::fopen vs. ::fcntl : is os_linux os::open we try to set the "closing >> of the file on exec" flag when calling ::open but we later check that it really >> worked so we seem not to trust it fully ; >> >> The check is for running on older Linuxes that do not support O_CLOEXEC >> - where the flag is ignored. That is why I asked about what happens on >> BSD/macOS and AIX in that situation. >> >>> Should this be done here too for Linux ? Or is that checking in os_linux >> os::open these days not needed any more ? >> >> It's possible the most recent version of Linux without O_CLOEXEC >> supported is no longer supported by OpenJDK, in which case we can remove >> it. But I'm not sure what that version is. I have no idea if fopen with >> "e" support has the same history as ::open and O_CLOEXEC. 
>> >> David >> > From david.holmes at oracle.com Wed Jan 29 05:13:29 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 29 Jan 2020 15:13:29 +1000 Subject: RFR (S): 8237857: LogDecorations::uptimenanos is implemented incorrectly Message-ID: <42f25d16-7117-ae55-2dd5-2c5929f5340d@oracle.com> Bug: https://bugs.openjdk.java.net/browse/JDK-8237857 Webrev: http://cr.openjdk.java.net/~dholmes/8237857/webrev/ After my changes in JDK-8235741 which changed uptimemillis to be the same as uptimenanos*1000000 the gtest for the logging time decorators started to fail intermittently on Windows where the uptimemillis values was not advancing sometimes after a 5ms sleep. It turned out this was actually a day one bug in the logging code that my change had exposed - the code uptimenanos was using os::elapsed_counter() but that isn't a time value it's just a counter value. On Linux/AIX/BSD it happens to count at 1ns resolution but not so on Solaris and in particular not so on Windows. When this counter value was truncated to "millis" there was no guarantee the value would increase as expected by the test. 
Changes: - use os::elapsedTimer() not os::elapsed_counter() for uptimemillis and uptimenanos - remove dead code that should have been removed in JDK-8235741 - update the test by: - adding diagnostic printouts of the time values read - fix a bug where the sleep time can be too short for Windows (see details in bug report) Testing: - gtests on all platforms with manual analysis of log files Thanks, David ----- From david.holmes at oracle.com Wed Jan 29 07:06:58 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 29 Jan 2020 17:06:58 +1000 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com> References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com> Message-ID: <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com> Hi Fred, I've looked at the v7 version. A few more stylistic comments on that first. Note, no need for an item by item response unless that makes it easier for you to track :) src/hotspot/share/classfile/classFileParser.cpp #include "classfile/defaultMethods.hpp" +#include "classfile/fieldLayoutBuilder.hpp" #include "classfile/dictionary.hpp" Include files are not in alphabetical order. + * This may well change: FixMe if doesn't, s/if/if it/ + //Make a temp copy, and iterate through and copy back into the orig Space after // s/orig/original/ + OopMapBlock* nonstatic_oop_map = _nonstatic_oop_maps; Extra space after * + assert(ik->nonstatic_oop_map_count() == _field_info->oop_map_blocks->_nonstatic_oop_map_count, + "sanity"); Second line needs to be indented further: assert(ik->nonstatic_oop_map_count() == _field_info->oop_map_blocks->_nonstatic_oop_map_count, "sanity"); --- src/hotspot/share/classfile/classFileParser.hpp +public: + OopMapBlock* _nonstatic_oop_maps; + unsigned int _nonstatic_oop_map_count; + unsigned int _max_nonstatic_oop_maps; + + public: Second public uneeded. 
First public may be indented wrong (I'm not sure what the rule is - single space indent?) class ClassFileParser { + friend class FieldLayoutBuilder; + friend class FieldLayout; class ClassAnnotationCollector; class FieldAllocationCount; class FieldAnnotationCollector; Indents are different. I think the class forward declarations should have extra space. --- src/hotspot/share/oops/instanceKlass.hpp + void increment_count(int diff) { _count += diff; } Extra spaces before { --- src/hotspot/share/runtime/globals.hpp + diagnostic(bool, UseNewFieldLayout, true, \ + "Use new algorithm to compute layouts") \ + \ + product(bool, UseEmptySlotsInSupers, true, Not sure I see why one flag is diagnostic and the other product. Do you expect people to need to disable using empty slots more so than needing to disable using the new field layout altogether? --- src/hotspot/share/classfile/fieldLayoutBuilder.cpp + assert(kind == EMPTY || kind == RESERVED || kind == PADDING || kind == INHERITED, + "Otherwise, should use the constructor with a field index argument"); Indentation of second line is wrong. + assert(kind == REGULAR || kind == FLATTENED || kind == INHERITED, + "Other kind do not have a field index"); Ditto. + if (list == NULL) return; + if (start == NULL) { + start = this->_start; + } Inconsistent style for single statement if-blocks. Same thing later in the file. + output->print_cr(" @%d \"%s\" %s %d/%d %s", + b->offset(), + fi->name(_cp)->as_C_string(), + fi->signature(_cp)->as_C_string(), + b->size(), + b->alignment(), + "REGULAR"); Incorrect identation of continuing line. Same for all the following print blocks. 
+ } else if (_classname == vmSymbols::java_lang_Boolean() || + _classname == vmSymbols::java_lang_Character() || + _classname == vmSymbols::java_lang_Float() || + _classname == vmSymbols::java_lang_Double() || + _classname == vmSymbols::java_lang_Byte() || + _classname == vmSymbols::java_lang_Short() || + _classname == vmSymbols::java_lang_Integer() || + _classname == vmSymbols::java_lang_Long()) { Incorrect identation of continuing line. --- src/hotspot/share/classfile/fieldLayoutBuilder.hpp +// and the boxing classes). The rational for having multiple methods s/rational/rationale/ + FieldLayoutBuilder(const Symbol* classname, const InstanceKlass* super_klass, ConstantPool* constant_pool, + Array* fields, bool is_contended, FieldLayoutInfo* info); Indentation wrong for continuing line. + int get_alignment() { + assert(_alignment != -1, "Uninitialized"); + return _alignment; + } Indenting appears off by one. --- test/hotspot/jtreg/runtime/FieldLayout/FieldDensityTest.java + * @run main/othervm -XX:+UseCompressedOops -XX:+UseCompressedClassPointers FieldDensityTest + * @run main/othervm -XX:+UseCompressedOops -XX:-UseCompressedClassPointers FieldDensityTest + * @run main/othervm -XX:-UseCompressedOops -XX:-UseCompressedClassPointers FieldDensityTest The test won't run on 32-bit platforms as the compressed oops flags won't exist. --- Some follow up comments below ... With trimming ... On 25/01/2020 3:20 am, Frederic Parain wrote: >> On Jan 24, 2020, at 08:19, David Holmes wrote: >> >> 466 int super_flen = super->nof_nonstatic_fields(); >> >> Could be folded directly into the assert so we don't call in product. > > Calling not_nonstatic_fields() has the side effect to compute non-static fields, > which is required to get a correct value when reading super->_nonstatic_fields, > so the call is needed even in product builds. Yuck! That's a horrible side-effect - but not your fault obviously. 
:) It would be better to have a nonstatic_fields() accessor that has the same lazy initialization side-effect. >> General style issue: when breaking a long line with a method call, the new line (containing arguments) should be indented to the opening ( of the method call e.g. ... >> etc. This applies across all files. > > Fixes applied lines 4003, 4011, 4041, 4138, 4143. Fix was also needed in other files. Current issues highlighted above. >> >> src/hotspot/share/oops/instanceKlass.hpp >> >> You need to be careful with _extra_flags usage if there can be concurrently updated bits. At the moment it looks like redefinition is a mutable dynamic property, whilst "contended annotations" should be a static immutable property - is that right? > > Correct, _has_contended_annotations is a static immutable property, while _is_being_redefined is a mutable one. Good to know. My concern is that if someone adds a new mutable flag bit the need for atomic updates may not be noticed. We got bitten by this in the past with a flag field and I think we eventually migrated all of the mutable bits out into their own field. (Coleen should recall that :) ). >> 61 FLATTENED, // flattened field >> >> Does this have any meaning before inline types come in? > > Yes, I wanted to reserved the entry in the enum. Hmmm a tenuous "okay". Seems odd to require this now to support code that is still some way from joining mainline. >> In FieldLayoutBuilder::epilogue you have a number of calls to Thread::current() as well as an implicit call when you use ResourceMarks. You should capture the current thread once in a local and reuse it. > > Fixed It seems that this fix is now not needed as there is only one use left of the new "thread" variable in the ResourceMark. So that can return to being: ResourceMark rm; Thanks, David ----- From matthias.baesken at sap.com Wed Jan 29 09:38:58 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Wed, 29 Jan 2020 09:38:58 +0000 Subject: fopen vs. 
os::fopen and automatic closing of the file on exec In-Reply-To: <852b12ce-8a80-eba0-4b6c-ca13a487d7c3@oracle.com> References: <462ea46a-cef8-79a0-ccf0-71699e619089@oracle.com> <852b12ce-8a80-eba0-4b6c-ca13a487d7c3@oracle.com> Message-ID: Hi David , yes you are probably right, simply adding it always in the jdk coding would lead to problems . On the other hand, do I have the option to set O_CLOEXEC in some way for the jdk open calls , in case I want it ? Best regards, Matthias > Hi Matthias, > > On 28/01/2020 11:44 pm, Baesken, Matthias wrote: > > Hi David, thanks for looking into it . > > > > While checking other open(64) calls, I found the fileOpen / handleOpen > calls in java.base . > > Looks like they miss so far setting FD_CLOEXEC / O_CLOEXEC , it isn?t > done in the callers of those functions and it is also not (silently) added in the > fileOpen/handleOpen function itself (like HS does in os::open ). > > See : > > > > 73FD > > 74handleOpen(const char *path, int oflag, int mode) { > > 75 FD fd; > > 76 RESTARTABLE(open64(path, oflag, mode), fd); > > 77 if (fd != -1) { > > 78 struct stat64 buf64; > > 79 int result; > > 80 RESTARTABLE(fstat64(fd, &buf64), result); > > 81 if (result != -1) { > > 82 if (S_ISDIR(buf64.st_mode)) { > > 83 close(fd); > > 84 errno = EISDIR; > > 85 fd = -1; > > 86 } > > 87 } else { > > 88 close(fd); > > 89 fd = -1; > > 90 } > > 91 } > > 92 return fd; > > 93} > > > > 95void > > 96fileOpen(JNIEnv *env, jobject this, jstring path, jfieldID fid, int flags) > > 97{ > > ..... > > 107 fd = handleOpen(ps, flags, 0666); > > .... > > 122} > > > > 56JNIEXPORT void JNICALL > > 57Java_java_io_FileOutputStream_open0(JNIEnv *env, jobject this, > > 58 jstring path, jboolean append) { > > 59 fileOpen(env, this, path, fos_fd, > > 60 O_WRONLY | O_CREAT | (append ? 
O_APPEND : O_TRUNC)); > > 61} > > > > 59JNIEXPORT void JNICALL > > 60Java_java_io_FileInputStream_open0(JNIEnv *env, jobject this, jstring > path) { > > 61 fileOpen(env, this, path, fis_fd, O_RDONLY); > > 62} > > > > > > Is this something that should be changed too ? > > I think not. Unless those public API's are specified to open a file in a > specific mode, like close-on-exec, then they should not do that by default. > > We should only be doing close-on-exec on internally used files. > > David > ----- From thomas.stuefe at gmail.com Wed Jan 29 10:35:15 2020 From: thomas.stuefe at gmail.com (=?UTF-8?Q?Thomas_St=C3=BCfe?=) Date: Wed, 29 Jan 2020 11:35:15 +0100 Subject: fopen vs. os::fopen and automatic closing of the file on exec In-Reply-To: <852b12ce-8a80-eba0-4b6c-ca13a487d7c3@oracle.com> References: <462ea46a-cef8-79a0-ccf0-71699e619089@oracle.com> <852b12ce-8a80-eba0-4b6c-ca13a487d7c3@oracle.com> Message-ID: Hi David and Matthias, On Wed 29. Jan 2020 at 00:27, David Holmes wrote: > Hi Matthias, > > On 28/01/2020 11:44 pm, Baesken, Matthias wrote: > > Hi David, thanks for looking into it . > > > > While checking other open(64) calls, I found the fileOpen / > handleOpen calls in java.base . > > Looks like they miss so far setting FD_CLOEXEC / O_CLOEXEC , it > isn?t done in the callers of those functions and it is also not (silently) > added in the fileOpen/handleOpen function itself (like HS does in os::open > ). 
> > See : > > > > 73FD > > 74handleOpen(const char *path, int oflag, int mode) { > > 75 FD fd; > > 76 RESTARTABLE(open64(path, oflag, mode), fd); > > 77 if (fd != -1) { > > 78 struct stat64 buf64; > > 79 int result; > > 80 RESTARTABLE(fstat64(fd, &buf64), result); > > 81 if (result != -1) { > > 82 if (S_ISDIR(buf64.st_mode)) { > > 83 close(fd); > > 84 errno = EISDIR; > > 85 fd = -1; > > 86 } > > 87 } else { > > 88 close(fd); > > 89 fd = -1; > > 90 } > > 91 } > > 92 return fd; > > 93} > > > > 95void > > 96fileOpen(JNIEnv *env, jobject this, jstring path, jfieldID fid, int > flags) > > 97{ > > ..... > > 107 fd = handleOpen(ps, flags, 0666); > > .... > > 122} > > > > 56JNIEXPORT void JNICALL > > 57Java_java_io_FileOutputStream_open0(JNIEnv *env, jobject this, > > 58 jstring path, jboolean append) { > > 59 fileOpen(env, this, path, fos_fd, > > 60 O_WRONLY | O_CREAT | (append ? O_APPEND : O_TRUNC)); > > 61} > > > > 59JNIEXPORT void JNICALL > > 60Java_java_io_FileInputStream_open0(JNIEnv *env, jobject this, jstring > path) { > > 61 fileOpen(env, this, path, fis_fd, O_RDONLY); > > 62} > > > > > > Is this something that should be changed too ? > > I think not. Unless those public API's are specified to open a file in a > specific mode, like close-on-exec, then they should not do that by default. > > We should only be doing close-on-exec on internally used files. > > David > ----- Sorry for chiming in sideways, but I am not so sure. I think it makes sense to specify close on exec here. When the Vm forks via Runtime.exec we take pains to manually close all open file handles in the child process. That means to me that this is the desired behavior. Leaking file descriptors to child processes is not wanted. However, we cannot close file descriptors if someone does a naked fork. I?d think this is a design problem we?d like to fix. 
The chance that someone deliberately relies on nakedly forked children inheriting descriptors opened in the parent VM is smaller than the chance that leaked file descriptors in children cause trouble. These errors are also hard to find. Btw, maybe that would be better discussed in core-libs? Cheers, thomas > > > From matthias.baesken at sap.com Wed Jan 29 13:03:19 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Wed, 29 Jan 2020 13:03:19 +0000 Subject: fopen vs. os::fopen and automatic closing of the file on exec In-Reply-To: References: <462ea46a-cef8-79a0-ccf0-71699e619089@oracle.com> <76ff9efd-4274-29dd-c048-02e55aa70890@oracle.com> Message-ID: Hi Leo / Thomas / David, Not sure if it is a small or 'medium' gain, but after the discussion so far I think it makes sense to use os::fopen in HS code where possible (what (f)open does in the jdk C coding is a different thing, we might discuss this on core-libs-dev in more detail). I opened https://bugs.openjdk.java.net/browse/JDK-8238161 use os::fopen in HS code where possible Best regards, Matthias Hi Leo, On Tue, Jan 28, 2020 at 4:37 PM Leo Korinth > wrote: ... Something that is not obvious is that on unix-like operating systems, ProcessImpl_md.c tries to close (most) open files between fork and exec. That is not the case for Windows (I opened https://bugs.openjdk.java.net/browse/JDK-8202720 for this). Thus (if I understand correctly) the impact on unix-like operating systems will be less for adding this support than it is for Windows. os::fopen was created to solve a specific bug on windows (logging), and was renamed to the more generic os::fopen during review. Just a note, it is still possible for file descriptors to escape into child processes since you cannot guarantee that all forks happen via Runtime.exec(): - native third party code may fork. - hotspot may fork to run tools for error reporting Also note that the code which closes fds in Runtime.exec() may in theory fail to close all fds. 
So I think Matthias makes sense on Posix platforms. I guess most uses of ::fopen /should/ use the more restricted os::fopen, but the gain would probably be small. Thanks, Leo From david.holmes at oracle.com Wed Jan 29 13:22:33 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 29 Jan 2020 23:22:33 +1000 Subject: fopen vs. os::fopen and automatic closing of the file on exec In-Reply-To: References: <462ea46a-cef8-79a0-ccf0-71699e619089@oracle.com> <852b12ce-8a80-eba0-4b6c-ca13a487d7c3@oracle.com> Message-ID: <48a38575-b9d1-97ec-7872-c0d11e3131a5@oracle.com> On 29/01/2020 7:38 pm, Baesken, Matthias wrote: > Hi David , yes you are probably right, simply adding it always in the jdk coding would lead to problems . > On the other hand, do I have the option to set O_CLOEXEC in some way for the jdk open calls , in case I want it ? Not O_CLOEXEC AFAICS: https://docs.oracle.com/en/java/javase/13/docs/api/java.base/java/nio/file/StandardOpenOption.html David > Best regards, Matthias > > > >> Hi Matthias, >> >> On 28/01/2020 11:44 pm, Baesken, Matthias wrote: >>> Hi David, thanks for looking into it . >>> >>> While checking other open(64) calls, I found the fileOpen / handleOpen >> calls in java.base . >>> Looks like they miss so far setting FD_CLOEXEC / O_CLOEXEC , it isn?t >> done in the callers of those functions and it is also not (silently) added in the >> fileOpen/handleOpen function itself (like HS does in os::open ). 
>>> See : >>> >>> 73FD >>> 74handleOpen(const char *path, int oflag, int mode) { >>> 75 FD fd; >>> 76 RESTARTABLE(open64(path, oflag, mode), fd); >>> 77 if (fd != -1) { >>> 78 struct stat64 buf64; >>> 79 int result; >>> 80 RESTARTABLE(fstat64(fd, &buf64), result); >>> 81 if (result != -1) { >>> 82 if (S_ISDIR(buf64.st_mode)) { >>> 83 close(fd); >>> 84 errno = EISDIR; >>> 85 fd = -1; >>> 86 } >>> 87 } else { >>> 88 close(fd); >>> 89 fd = -1; >>> 90 } >>> 91 } >>> 92 return fd; >>> 93} >>> >>> 95void >>> 96fileOpen(JNIEnv *env, jobject this, jstring path, jfieldID fid, int flags) >>> 97{ >>> ..... >>> 107 fd = handleOpen(ps, flags, 0666); >>> .... >>> 122} >>> >>> 56JNIEXPORT void JNICALL >>> 57Java_java_io_FileOutputStream_open0(JNIEnv *env, jobject this, >>> 58 jstring path, jboolean append) { >>> 59 fileOpen(env, this, path, fos_fd, >>> 60 O_WRONLY | O_CREAT | (append ? O_APPEND : O_TRUNC)); >>> 61} >>> >>> 59JNIEXPORT void JNICALL >>> 60Java_java_io_FileInputStream_open0(JNIEnv *env, jobject this, jstring >> path) { >>> 61 fileOpen(env, this, path, fis_fd, O_RDONLY); >>> 62} >>> >>> >>> Is this something that should be changed too ? >> >> I think not. Unless those public API's are specified to open a file in a >> specific mode, like close-on-exec, then they should not do that by default. >> >> We should only be doing close-on-exec on internally used files. >> >> David >> ----- > From david.holmes at oracle.com Wed Jan 29 13:25:52 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 29 Jan 2020 23:25:52 +1000 Subject: fopen vs. os::fopen and automatic closing of the file on exec In-Reply-To: References: <462ea46a-cef8-79a0-ccf0-71699e619089@oracle.com> <852b12ce-8a80-eba0-4b6c-ca13a487d7c3@oracle.com> Message-ID: <580ee126-e180-f04f-3ef7-7f2e3e525121@oracle.com> Hi Thomas, On 29/01/2020 8:35 pm, Thomas St?fe wrote: > Hi David and Matthias, > > On Wed 29. 
Jan 2020 at 00:27, David Holmes > wrote: > > Hi Matthias, > > On 28/01/2020 11:44 pm, Baesken, Matthias wrote: > > Hi David, thanks for looking into it . > > > > While checking other open(64) calls, I found the fileOpen / > handleOpen calls in java.base . > > Looks like they miss so far setting FD_CLOEXEC / > O_CLOEXEC , it isn't done in the callers of those functions and > it is also not (silently) added in the fileOpen/handleOpen function > itself (like HS does in os::open ). > > See : > > > > 73FD > > 74handleOpen(const char *path, int oflag, int mode) { > > 75 FD fd; > > 76 RESTARTABLE(open64(path, oflag, mode), fd); > > 77 if (fd != -1) { > > 78 struct stat64 buf64; > > 79 int result; > > 80 RESTARTABLE(fstat64(fd, &buf64), result); > > 81 if (result != -1) { > > 82 if (S_ISDIR(buf64.st_mode)) { > > 83 close(fd); > > 84 errno = EISDIR; > > 85 fd = -1; > > 86 } > > 87 } else { > > 88 close(fd); > > 89 fd = -1; > > 90 } > > 91 } > > 92 return fd; > > 93} > > > > 95void > > 96fileOpen(JNIEnv *env, jobject this, jstring path, jfieldID fid, > int flags) > > 97{ > > ..... > > 107 fd = handleOpen(ps, flags, 0666); > > .... > > 122} > > > > 56JNIEXPORT void JNICALL > > 57Java_java_io_FileOutputStream_open0(JNIEnv *env, jobject this, > > 58 jstring path, jboolean > append) { > > 59 fileOpen(env, this, path, fos_fd, > > 60 O_WRONLY | O_CREAT | (append ? O_APPEND : O_TRUNC)); > > 61} > > > > 59JNIEXPORT void JNICALL > > 60Java_java_io_FileInputStream_open0(JNIEnv *env, jobject this, > jstring path) { > > 61 fileOpen(env, this, path, fis_fd, O_RDONLY); > > 62} > > > > > > Is this something that should be changed too ? > > I think not. 
Unless those public APIs are specified to open a file
>     in a
>     specific mode, like close-on-exec, then they should not do that by
>     default.
>
>     We should only be doing close-on-exec on internally used files.
>
>     David
>     -----
>
> Sorry for chiming in sideways, but I am not so sure. I think it makes
> sense to specify close on exec here.

It may well make sense but it isn't specified behaviour so would require
an API specification change.

> When the VM forks via Runtime.exec we take pains to manually close all
> open file handles in the child process. That means to me that this is
> the desired behavior. Leaking file descriptors to child processes is not
> wanted.
>
> However, we cannot close file descriptors if someone does a naked fork.
> I'd think this is a design problem we'd like to fix. The chance that
> someone deliberately relies on nakedly forked children inheriting
> descriptors opened in the parent VM is smaller than the chance that
> leaked file descriptors in children cause trouble. These errors are also
> hard to find.
>
> Btw, maybe that would be better discussed in core libs?

Yes this is a topic for core-libs or nio-dev. I'm certain this has been
considered at some point.

Cheers,
David

> Cheers, thomas

From coleen.phillimore at oracle.com  Wed Jan 29 13:37:42 2020
From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com)
Date: Wed, 29 Jan 2020 08:37:42 -0500
Subject: RFR[L]: 8237767 Field layout computation overhaul
In-Reply-To: <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com>
References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com> <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com>
Message-ID:

Snippet reply.

On 1/29/20 2:06 AM, David Holmes wrote:
>
>>>
>>> src/hotspot/share/oops/instanceKlass.hpp
>>>
>>> You need to be careful with _extra_flags usage if there can be
>>> concurrently updated bits.
At the moment it looks like redefinition
>>> is a mutable dynamic property, whilst "contended annotations" should
>>> be a static immutable property - is that right?
>>
>> Correct, _has_contended_annotations is a static immutable property,
>> while _is_being_redefined is a mutable one.
>
> Good to know. My concern is that if someone adds a new mutable flag
> bit the need for atomic updates may not be noticed. We got bitten by
> this in the past with a flag field and I think we eventually migrated
> all of the mutable bits out into their own field. (Coleen should
> recall that :) ).

A lot of the _misc_flags are mutable. I think we should notice if
someone adds a new flag that requires concurrent access, and not waste
space in InstanceKlass for this flag. The other mutable bool doesn't
seem to have any memory synchronization, iirc, which seems wrong.

>
>>> 61     FLATTENED,     // flattened field
>>>
>>> Does this have any meaning before inline types come in?
>>
>> Yes, I wanted to reserve the entry in the enum.
>
> Hmmm a tenuous "okay". Seems odd to require this now to support code
> that is still some way from joining mainline.

I didn't think it's a big deal if some small pieces of future code are
in this change.

Coleen

From david.holmes at oracle.com  Wed Jan 29 13:31:56 2020
From: david.holmes at oracle.com (David Holmes)
Date: Wed, 29 Jan 2020 23:31:56 +1000
Subject: RFR[L]: 8237767 Field layout computation overhaul
In-Reply-To: <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com>
References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com> <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com>
Message-ID: <77ca9ddd-e633-6b72-4301-b6d16c61509d@oracle.com>

PS. I've now seen the CSR and have commented there. I don't agree with
the use of a diagnostic flag. It doesn't buy us anything other than not
needing a CSR request when we decide to deprecate it.
But if we're going to tell people to use these flags to revert behaviour
then they will need to follow the deprecation process regardless - and a
product flag is better from an end user perspective.

Cheers,
David

On 29/01/2020 5:06 pm, David Holmes wrote:
> Hi Fred,
>
> I've looked at the v7 version. A few more stylistic comments on that
> first. Note, no need for an item by item response unless that makes it
> easier for you to track :)
>
> src/hotspot/share/classfile/classFileParser.cpp
>
>  #include "classfile/defaultMethods.hpp"
> +#include "classfile/fieldLayoutBuilder.hpp"
>  #include "classfile/dictionary.hpp"
>
> Include files are not in alphabetical order.
>
> +   * This may well change: FixMe if doesn't,
>
> s/if/if it/
>
> +  //Make a temp copy, and iterate through and copy back into the orig
>
> Space after //
>
> s/orig/original/
>
> +  OopMapBlock*  nonstatic_oop_map = _nonstatic_oop_maps;
>
> Extra space after *
>
> +  assert(ik->nonstatic_oop_map_count() == _field_info->oop_map_blocks->_nonstatic_oop_map_count,
> +     "sanity");
>
> Second line needs to be indented further:
>
>   assert(ik->nonstatic_oop_map_count() == _field_info->oop_map_blocks->_nonstatic_oop_map_count,
>          "sanity");
>
> ---
>
> src/hotspot/share/classfile/classFileParser.hpp
>
> +public:
> +  OopMapBlock* _nonstatic_oop_maps;
> +  unsigned int _nonstatic_oop_map_count;
> +  unsigned int _max_nonstatic_oop_maps;
> +
> + public:
>
> Second public unneeded. First public may be indented wrong (I'm not sure
> what the rule is - single space indent?)
>
>  class ClassFileParser {
> +  friend class FieldLayoutBuilder;
> +  friend class FieldLayout;
>
>   class ClassAnnotationCollector;
>   class FieldAllocationCount;
>   class FieldAnnotationCollector;
>
> Indents are different. I think the class forward declarations should
> have extra space.
>
> ---
>
> src/hotspot/share/oops/instanceKlass.hpp
>
> +  void increment_count(int diff)     { _count += diff; }
>
> Extra spaces before {
>
> ---
>
> src/hotspot/share/runtime/globals.hpp
>
> +  diagnostic(bool, UseNewFieldLayout, true,              \
> +               "Use new algorithm to compute layouts")   \
> +                                                         \
> +  product(bool, UseEmptySlotsInSupers, true,
>
> Not sure I see why one flag is diagnostic and the other product. Do you
> expect people to need to disable using empty slots more so than needing
> to disable using the new field layout altogether?
>
> ---
>
> src/hotspot/share/classfile/fieldLayoutBuilder.cpp
>
> +  assert(kind == EMPTY || kind == RESERVED || kind == PADDING || kind == INHERITED,
> +      "Otherwise, should use the constructor with a field index argument");
>
> Indentation of second line is wrong.
>
> +  assert(kind == REGULAR || kind == FLATTENED || kind == INHERITED,
> +      "Other kind do not have a field index");
>
> Ditto.
>
>
> +  if (list == NULL) return;
> +  if (start == NULL) {
> +    start = this->_start;
> +  }
>
> Inconsistent style for single statement if-blocks. Same thing later in
> the file.
>
> +      output->print_cr(" @%d \"%s\" %s %d/%d %s",
> +          b->offset(),
> +          fi->name(_cp)->as_C_string(),
> +          fi->signature(_cp)->as_C_string(),
> +          b->size(),
> +          b->alignment(),
> +          "REGULAR");
>
> Incorrect indentation of continuing line. Same for all the following
> print blocks.
>
> +  } else if (_classname == vmSymbols::java_lang_Boolean() ||
> +      _classname == vmSymbols::java_lang_Character() ||
> +      _classname == vmSymbols::java_lang_Float() ||
> +      _classname == vmSymbols::java_lang_Double() ||
> +      _classname == vmSymbols::java_lang_Byte() ||
> +      _classname == vmSymbols::java_lang_Short() ||
> +      _classname == vmSymbols::java_lang_Integer() ||
> +      _classname == vmSymbols::java_lang_Long()) {
>
> Incorrect indentation of continuing line.
> > --- > > src/hotspot/share/classfile/fieldLayoutBuilder.hpp > > +// and the boxing classes). The rational for having multiple methods > > s/rational/rationale/ > > +? FieldLayoutBuilder(const Symbol* classname, const InstanceKlass* > super_klass, ConstantPool* constant_pool, > +????? Array* fields, bool is_contended, FieldLayoutInfo* info); > > Indentation wrong for continuing line. > > +? int get_alignment() { > +???? assert(_alignment != -1, "Uninitialized"); > +???? return _alignment; > +?? } > > Indenting appears off by one. > > --- > > test/hotspot/jtreg/runtime/FieldLayout/FieldDensityTest.java > > + * @run main/othervm -XX:+UseCompressedOops > -XX:+UseCompressedClassPointers FieldDensityTest > + * @run main/othervm -XX:+UseCompressedOops > -XX:-UseCompressedClassPointers FieldDensityTest > + * @run main/othervm -XX:-UseCompressedOops > -XX:-UseCompressedClassPointers FieldDensityTest > > The test won't run on 32-bit platforms as the compressed oops flags > won't exist. > > --- > > Some follow up comments below ... > > With trimming ... > > On 25/01/2020 3:20 am, Frederic Parain wrote: >>> On Jan 24, 2020, at 08:19, David Holmes wrote: >>> >>> 466???? int super_flen?? = super->nof_nonstatic_fields(); >>> >>> Could be folded directly into the assert so we don't call in product. >> >> Calling not_nonstatic_fields() has the side effect to compute >> non-static fields, >> which is required to get a correct value when reading >> super->_nonstatic_fields, >> so the call is needed even in product builds. > > Yuck! That's a horrible side-effect - but not your fault obviously. :) > It would be better to have a nonstatic_fields() accessor that has the > same lazy initialization side-effect. > >>> General style issue: when breaking a long line with a method call, >>> the new line (containing arguments) should be indented to the opening >>> ( of the method call e.g. > ... >>> etc. This applies across all files. >> >> Fixes applied lines 4003, 4011, 4041, 4138, 4143. 
> > Fix was also needed in other files. Current issues highlighted above. > >>> >>> src/hotspot/share/oops/instanceKlass.hpp >>> >>> You need to be careful with _extra_flags usage if there can be >>> concurrently updated bits. At the moment it looks like redefinition >>> is a mutable dynamic property, whilst "contended annotations" should >>> be a static immutable property - is that right? >> >> Correct, _has_contended_annotations is a static immutable property, >> while _is_being_redefined is a mutable one. > > Good to know. My concern is that if someone adds a new mutable flag bit > the need for atomic updates may not be noticed. We got bitten by this in > the past with a flag field and I think we eventually migrated all of the > mutable bits out into their own field. (Coleen should recall that :) ). > >>> ? 61???? FLATTENED,???? // flattened field >>> >>> Does this have any meaning before inline types come in? >> >> Yes, I wanted to reserved the entry in the enum. > > Hmmm a tenuous "okay". Seems odd to require this now to support code > that is still some way from joining mainline. > >>> In FieldLayoutBuilder::epilogue you have a number of calls to >>> Thread::current() as well as an implicit call when you use >>> ResourceMarks. You should capture the current thread once in a local >>> and reuse it. >> >> Fixed > > It seems that this fix is now not needed as there is only one use left > of the new "thread" variable in the ResourceMark. 
So that can return to > being: > > ResourceMark rm; > > Thanks, > David > ----- > From david.holmes at oracle.com Wed Jan 29 13:33:14 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 29 Jan 2020 23:33:14 +1000 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com> <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com> Message-ID: <4b1fd6f6-b0a6-b2bb-8eb6-71ac2cf49f3a@oracle.com> Hi Coleen, On 29/01/2020 11:37 pm, coleen.phillimore at oracle.com wrote: > > Snippet reply. > > On 1/29/20 2:06 AM, David Holmes wrote: >> >>>> >>>> src/hotspot/share/oops/instanceKlass.hpp >>>> >>>> You need to be careful with _extra_flags usage if there can be >>>> concurrently updated bits. At the moment it looks like redefinition >>>> is a mutable dynamic property, whilst "contended annotations" should >>>> be a static immutable property - is that right? >>> >>> Correct, _has_contended_annotations is a static immutable property, >>> while _is_being_redefined is a mutable one. >> >> Good to know. My concern is that if someone adds a new mutable flag >> bit the need for atomic updates may not be noticed. We got bitten by >> this in the past with a flag field and I think we eventually migrated >> all of the mutable bits out into their own field. (Coleen should >> recall that :) ). > > A lot of the _misc_flags are mutable.? I think we should notice if > someone adds a new flag that requires concurrent access, and not waste > space in InstanceKlass for this flag.?? The other mutable bool doesn't > seem to have any memory synchronization, iirc, which seems wrong. I have a distinct recollection that we were bitten by this in the past. Cheers, David >> >>>> ? 61 FLATTENED,???? // flattened field >>>> >>>> Does this have any meaning before inline types come in? >>> >>> Yes, I wanted to reserved the entry in the enum. >> >> Hmmm a tenuous "okay". 
Seems odd to require this now to support code >> that is still some way from joining mainline. > > I didn't think it's a big deal if some small pieces of future code are > in this change. > > Coleen > > > From coleen.phillimore at oracle.com Wed Jan 29 13:44:57 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Wed, 29 Jan 2020 08:44:57 -0500 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: <77ca9ddd-e633-6b72-4301-b6d16c61509d@oracle.com> References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com> <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com> <77ca9ddd-e633-6b72-4301-b6d16c61509d@oracle.com> Message-ID: <4bd88392-338a-6287-a16e-1fb3949b85f7@oracle.com> On 1/29/20 8:31 AM, David Holmes wrote: > PS. I've now seen the CSR and have commented there. I don't agree with > the use of a diagnostic flag. It doesn't buy us anything other than > not needing a CSR request when we decide to deprecate it. But if we're > going to tell people to use these flags to revert behaviour then they > will need to follow the deprecation process regardless - and a product > flag is better from an end user perspective. Obviously I disagree.? We don't want to publicise this flag *unless* someone has a problem, which makes it a diagnostic flag.?? We don't want to have to carry the old code 2 releases when there is honestly no reason to suspect it is problematic. Coleen > > Cheers, > David > > On 29/01/2020 5:06 pm, David Holmes wrote: >> Hi Fred, >> >> I've looked at the v7 version. A few more stylistic comments on that >> first. Note, no need for an item by item response unless that makes >> it easier for you to track :) >> >> src/hotspot/share/classfile/classFileParser.cpp >> >> ??#include "classfile/defaultMethods.hpp" >> +#include "classfile/fieldLayoutBuilder.hpp" >> ??#include "classfile/dictionary.hpp" >> >> Include files are not in alphabetical order. >> >> +?? 
* This may well change: FixMe if doesn't, >> >> s/if/if it/ >> >> +? //Make a temp copy, and iterate through and copy back into the orig >> >> Space after // >> >> s/orig/original/ >> >> +? OopMapBlock*? nonstatic_oop_map = _nonstatic_oop_maps; >> >> Extra space after * >> >> +? assert(ik->nonstatic_oop_map_count() == >> _field_info->oop_map_blocks->_nonstatic_oop_map_count, >> +???? "sanity"); >> >> Second line needs to be indented further: >> >> ??? assert(ik->nonstatic_oop_map_count() == >> _field_info->oop_map_blocks->_nonstatic_oop_map_count, >> ?????????? "sanity"); >> >> --- >> >> src/hotspot/share/classfile/classFileParser.hpp >> >> +public: >> +? OopMapBlock* _nonstatic_oop_maps; >> +? unsigned int _nonstatic_oop_map_count; >> +? unsigned int _max_nonstatic_oop_maps; >> + >> + public: >> >> Second public uneeded. First public may be indented wrong (I'm not >> sure what the rule is - single space indent?) >> >> ??class ClassFileParser { >> +? friend class FieldLayoutBuilder; >> +? friend class FieldLayout; >> >> ?? class ClassAnnotationCollector; >> ?? class FieldAllocationCount; >> ?? class FieldAnnotationCollector; >> >> Indents are different. I think the class forward declarations should >> have extra space. >> >> --- >> >> src/hotspot/share/oops/instanceKlass.hpp >> >> +? void increment_count(int diff)???? { _count += diff; } >> >> Extra spaces before { >> >> --- >> >> src/hotspot/share/runtime/globals.hpp >> >> +? diagnostic(bool, UseNewFieldLayout, true, ??? \ >> +?????????????? "Use new algorithm to compute layouts") ??? \ >> + ??? \ >> +? product(bool, UseEmptySlotsInSupers, true, >> >> Not sure I see why one flag is diagnostic and the other product. Do >> you expect people to need to disable using empty slots more so than >> needing to disable using the new field layout altogether? >> >> --- >> >> src/hotspot/share/classfile/fieldLayoutBuilder.cpp >> >> +? 
assert(kind == EMPTY || kind == RESERVED || kind == PADDING || >> kind == INHERITED, >> +????? "Otherwise, should use the constructor with a field index >> argument"); >> >> Indentation of second line is wrong. >> >> +? assert(kind == REGULAR || kind == FLATTENED || kind == INHERITED, >> +????? "Other kind do not have a field index"); >> >> Ditto. >> >> >> +? if (list == NULL) return; >> +? if (start == NULL) { >> +??? start = this->_start; >> +? } >> >> Inconsistent style for single statement if-blocks. Same thing later >> in the file. >> >> +????? output->print_cr(" @%d \"%s\" %s %d/%d %s", >> +????????? b->offset(), >> +????????? fi->name(_cp)->as_C_string(), >> +????????? fi->signature(_cp)->as_C_string(), >> +????????? b->size(), >> +????????? b->alignment(), >> +????????? "REGULAR"); >> >> Incorrect identation of continuing line. Same for all the following >> print blocks. >> >> +? } else if (_classname == vmSymbols::java_lang_Boolean() || >> +????? _classname == vmSymbols::java_lang_Character() || >> +????? _classname == vmSymbols::java_lang_Float() || >> +????? _classname == vmSymbols::java_lang_Double() || >> +????? _classname == vmSymbols::java_lang_Byte() || >> +????? _classname == vmSymbols::java_lang_Short() || >> +????? _classname == vmSymbols::java_lang_Integer() || >> +????? _classname == vmSymbols::java_lang_Long()) { >> >> Incorrect identation of continuing line. >> >> --- >> >> src/hotspot/share/classfile/fieldLayoutBuilder.hpp >> >> +// and the boxing classes). The rational for having multiple methods >> >> s/rational/rationale/ >> >> +? FieldLayoutBuilder(const Symbol* classname, const InstanceKlass* >> super_klass, ConstantPool* constant_pool, >> +????? Array* fields, bool is_contended, FieldLayoutInfo* info); >> >> Indentation wrong for continuing line. >> >> +? int get_alignment() { >> +???? assert(_alignment != -1, "Uninitialized"); >> +???? return _alignment; >> +?? } >> >> Indenting appears off by one. 
>> >> --- >> >> test/hotspot/jtreg/runtime/FieldLayout/FieldDensityTest.java >> >> + * @run main/othervm -XX:+UseCompressedOops >> -XX:+UseCompressedClassPointers FieldDensityTest >> + * @run main/othervm -XX:+UseCompressedOops >> -XX:-UseCompressedClassPointers FieldDensityTest >> + * @run main/othervm -XX:-UseCompressedOops >> -XX:-UseCompressedClassPointers FieldDensityTest >> >> The test won't run on 32-bit platforms as the compressed oops flags >> won't exist. >> >> --- >> >> Some follow up comments below ... >> >> With trimming ... >> >> On 25/01/2020 3:20 am, Frederic Parain wrote: >>>> On Jan 24, 2020, at 08:19, David Holmes >>>> wrote: >>>> >>>> 466???? int super_flen?? = super->nof_nonstatic_fields(); >>>> >>>> Could be folded directly into the assert so we don't call in product. >>> >>> Calling not_nonstatic_fields() has the side effect to compute >>> non-static fields, >>> which is required to get a correct value when reading >>> super->_nonstatic_fields, >>> so the call is needed even in product builds. >> >> Yuck! That's a horrible side-effect - but not your fault obviously. >> :) It would be better to have a nonstatic_fields() accessor that has >> the same lazy initialization side-effect. >> >>>> General style issue: when breaking a long line with a method call, >>>> the new line (containing arguments) should be indented to the >>>> opening ( of the method call e.g. >> ... >>>> etc. This applies across all files. >>> >>> Fixes applied lines 4003, 4011, 4041, 4138, 4143. >> >> Fix was also needed in other files. Current issues highlighted above. >> >>>> >>>> src/hotspot/share/oops/instanceKlass.hpp >>>> >>>> You need to be careful with _extra_flags usage if there can be >>>> concurrently updated bits. At the moment it looks like redefinition >>>> is a mutable dynamic property, whilst "contended annotations" >>>> should be a static immutable property - is that right? 
>>> >>> Correct, _has_contended_annotations is a static immutable property, >>> while _is_being_redefined is a mutable one. >> >> Good to know. My concern is that if someone adds a new mutable flag >> bit the need for atomic updates may not be noticed. We got bitten by >> this in the past with a flag field and I think we eventually migrated >> all of the mutable bits out into their own field. (Coleen should >> recall that :) ). >> >>>> ? 61???? FLATTENED,???? // flattened field >>>> >>>> Does this have any meaning before inline types come in? >>> >>> Yes, I wanted to reserved the entry in the enum. >> >> Hmmm a tenuous "okay". Seems odd to require this now to support code >> that is still some way from joining mainline. >> >>>> In FieldLayoutBuilder::epilogue you have a number of calls to >>>> Thread::current() as well as an implicit call when you use >>>> ResourceMarks. You should capture the current thread once in a >>>> local and reuse it. >>> >>> Fixed >> >> It seems that this fix is now not needed as there is only one use >> left of the new "thread" variable in the ResourceMark. So that can >> return to being: >> >> ResourceMark rm; >> >> Thanks, >> David >> ----- >> From coleen.phillimore at oracle.com Wed Jan 29 13:49:38 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Wed, 29 Jan 2020 08:49:38 -0500 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: <4b1fd6f6-b0a6-b2bb-8eb6-71ac2cf49f3a@oracle.com> References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com> <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com> <4b1fd6f6-b0a6-b2bb-8eb6-71ac2cf49f3a@oracle.com> Message-ID: <4702c40b-1bff-0135-1c5e-e37813ae4ee8@oracle.com> On 1/29/20 8:33 AM, David Holmes wrote: > Hi Coleen, > > On 29/01/2020 11:37 pm, coleen.phillimore at oracle.com wrote: >> >> Snippet reply. 
>>
>> On 1/29/20 2:06 AM, David Holmes wrote:
>>>
>>>>>
>>>>> src/hotspot/share/oops/instanceKlass.hpp
>>>>>
>>>>> You need to be careful with _extra_flags usage if there can be
>>>>> concurrently updated bits. At the moment it looks like
>>>>> redefinition is a mutable dynamic property, whilst "contended
>>>>> annotations" should be a static immutable property - is that right?
>>>>
>>>> Correct, _has_contended_annotations is a static immutable property,
>>>> while _is_being_redefined is a mutable one.
>>>
>>> Good to know. My concern is that if someone adds a new mutable flag
>>> bit the need for atomic updates may not be noticed. We got bitten by
>>> this in the past with a flag field and I think we eventually
>>> migrated all of the mutable bits out into their own field. (Coleen
>>> should recall that :) ).
>>
>> A lot of the _misc_flags are mutable. I think we should notice if
>> someone adds a new flag that requires concurrent access, and not
>> waste space in InstanceKlass for this flag. The other mutable bool
>> doesn't seem to have any memory synchronization, iirc, which seems
>> wrong.
>
> I have a distinct recollection that we were bitten by this in the past.

We were bitten by this one:

  bool            _is_marked_dependent;  // used for marking during flushing and deoptimization

We think, but my memory is bad. If we want to make some access to the
misc flags thread safe, we can use the same mechanism as accessFlags.
This seems unnecessary at this time.

Coleen

>
> Cheers,
> David
>
>>>
>>>>> 61     FLATTENED,     // flattened field
>>>>>
>>>>> Does this have any meaning before inline types come in?
>>>>
>>>> Yes, I wanted to reserve the entry in the enum.
>>>
>>> Hmmm a tenuous "okay". Seems odd to require this now to support code
>>> that is still some way from joining mainline.
>>
>> I didn't think it's a big deal if some small pieces of future code
>> are in this change.
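[Editorial note: the synchronization concern in the exchange above — mutable bits sharing one flags field — can be illustrated with a small standalone sketch. The class and bit names below are hypothetical, not HotSpot's actual InstanceKlass code; HotSpot itself would use its own Atomic primitives rather than std::atomic. The point is that a plain `_flags |= bit` is a load-modify-store, so two threads setting different bits can lose an update, while an atomic fetch_or makes each bit-set indivisible.]

```cpp
#include <atomic>
#include <cstdint>

// Illustrative flags holder: some bits are set once at class-parse time,
// others are flipped at runtime and therefore need atomic updates.
class KlassFlags {
  std::atomic<uint16_t> _flags{0};
public:
  static constexpr uint16_t is_being_redefined        = 1 << 0; // mutable at runtime
  static constexpr uint16_t has_contended_annotations = 1 << 1; // immutable after parsing

  // fetch_or/fetch_and are atomic read-modify-write operations, so a
  // concurrent set of a *different* bit cannot be lost.
  void set_bit(uint16_t bit)   { _flags.fetch_or(bit); }
  void clear_bit(uint16_t bit) { _flags.fetch_and(static_cast<uint16_t>(~bit)); }
  bool test_bit(uint16_t bit) const { return (_flags.load() & bit) != 0; }
};
```

This is the trade-off being debated: immutable-after-parse bits can share a field safely, but as soon as one bit is runtime-mutable, every update to the shared field must be atomic.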
>>
>> Coleen
>>
>>

From daniel.daugherty at oracle.com  Wed Jan 29 14:34:05 2020
From: daniel.daugherty at oracle.com (Daniel D. Daugherty)
Date: Wed, 29 Jan 2020 09:34:05 -0500
Subject: RFR[L]: 8237767 Field layout computation overhaul
In-Reply-To: <4bd88392-338a-6287-a16e-1fb3949b85f7@oracle.com>
References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com> <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com> <77ca9ddd-e633-6b72-4301-b6d16c61509d@oracle.com> <4bd88392-338a-6287-a16e-1fb3949b85f7@oracle.com>
Message-ID: <0813a9ec-24e9-a407-d7dc-2e0d7380b0ca@oracle.com>

On 1/29/20 8:44 AM, coleen.phillimore at oracle.com wrote:
> On 1/29/20 8:31 AM, David Holmes wrote:
>> PS. I've now seen the CSR and have commented there. I don't agree
>> with the use of a diagnostic flag. It doesn't buy us anything other
>> than not needing a CSR request when we decide to deprecate it. But if
>> we're going to tell people to use these flags to revert behaviour
>> then they will need to follow the deprecation process regardless -
>> and a product flag is better from an end user perspective.
>
> Obviously I disagree. We don't want to publicise this flag *unless*
> someone has a problem, which makes it a diagnostic flag. We don't
> want to have to carry the old code 2 releases when there is honestly
> no reason to suspect it is problematic.
>
> Coleen

I'm only chiming in on this one snippet from the review.

I'm obviously a big fan of diagnostic flags instead of product flags.
Based on what I've read about this new flag, I would vote that it should
be a diagnostic flag and not a product flag. To be a product flag, you
should have to justify the expense of keeping/maintaining the alternate
code for the two releases that it would take to remove it.

My $0.02...
Dan

From frederic.parain at oracle.com  Wed Jan 29 14:57:43 2020
From: frederic.parain at oracle.com (Frederic Parain)
Date: Wed, 29 Jan 2020 09:57:43 -0500
Subject: RFR[L]: 8237767 Field layout computation overhaul
In-Reply-To: <4bd88392-338a-6287-a16e-1fb3949b85f7@oracle.com>
References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com> <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com> <77ca9ddd-e633-6b72-4301-b6d16c61509d@oracle.com> <4bd88392-338a-6287-a16e-1fb3949b85f7@oracle.com>
Message-ID:

The kind of the flag is a secondary issue, the main issue is "how fast
do we want to remove the old code?". I wanted to keep the old code for a
short transition period in case new field layouts cause issues for some
tools or customers. The goal was not to keep it forever and allow users
to select which algorithm they want to use.

Note that keeping the old code and its activation with a normal product
flag will be an issue for the first release of Valhalla. The way
flattened fields are allocated in the old code is very inefficient,
which means that instead of seeing an improvement in data density, users
will see a significant increase in space consumption if they use the old
code.

So, here's the three possible paths forward:
  1 - push the new code and remove the old code in a single change, so
no need for a VM option to select which code to use
  2 - push the new code while keeping the old one for a short period of
time, to remove the old code before Valhalla code is pushed
  3 - push the new code while keeping the old one with a long
deprecation period before removing the old code

Option 3 will be an issue for project Valhalla, as explained above.
Option 1 sounds a bit risky to me, from the experience I had while doing
this change in the JVM and the unexpected places where it caused issues.
Option 2 sounded like a good compromise, however, it seems there's no
consensus how to implement it.
Could we refine the details on how a short transition period could be implemented? Thank you, Fred > On Jan 29, 2020, at 08:44, coleen.phillimore at oracle.com wrote: > > > > On 1/29/20 8:31 AM, David Holmes wrote: >> PS. I've now seen the CSR and have commented there. I don't agree with the use of a diagnostic flag. It doesn't buy us anything other than not needing a CSR request when we decide to deprecate it. But if we're going to tell people to use these flags to revert behaviour then they will need to follow the deprecation process regardless - and a product flag is better from an end user perspective. > > Obviously I disagree. We don't want to publicise this flag *unless* someone has a problem, which makes it a diagnostic flag. We don't want to have to carry the old code 2 releases when there is honestly no reason to suspect it is problematic. > > Coleen > >> >> Cheers, >> David >> >> On 29/01/2020 5:06 pm, David Holmes wrote: >>> Hi Fred, >>> >>> I've looked at the v7 version. A few more stylistic comments on that first. Note, no need for an item by item response unless that makes it easier for you to track :) >>> >>> src/hotspot/share/classfile/classFileParser.cpp >>> >>> #include "classfile/defaultMethods.hpp" >>> +#include "classfile/fieldLayoutBuilder.hpp" >>> #include "classfile/dictionary.hpp" >>> >>> Include files are not in alphabetical order. 
>>> >>> + * This may well change: FixMe if doesn't, >>> >>> s/if/if it/ >>> >>> + //Make a temp copy, and iterate through and copy back into the orig >>> >>> Space after // >>> >>> s/orig/original/ >>> >>> + OopMapBlock* nonstatic_oop_map = _nonstatic_oop_maps; >>> >>> Extra space after * >>> >>> + assert(ik->nonstatic_oop_map_count() == _field_info->oop_map_blocks->_nonstatic_oop_map_count, >>> + "sanity"); >>> >>> Second line needs to be indented further: >>> >>> assert(ik->nonstatic_oop_map_count() == _field_info->oop_map_blocks->_nonstatic_oop_map_count, >>> "sanity"); >>> >>> --- >>> >>> src/hotspot/share/classfile/classFileParser.hpp >>> >>> +public: >>> + OopMapBlock* _nonstatic_oop_maps; >>> + unsigned int _nonstatic_oop_map_count; >>> + unsigned int _max_nonstatic_oop_maps; >>> + >>> + public: >>> >>> Second public uneeded. First public may be indented wrong (I'm not sure what the rule is - single space indent?) >>> >>> class ClassFileParser { >>> + friend class FieldLayoutBuilder; >>> + friend class FieldLayout; >>> >>> class ClassAnnotationCollector; >>> class FieldAllocationCount; >>> class FieldAnnotationCollector; >>> >>> Indents are different. I think the class forward declarations should have extra space. >>> >>> --- >>> >>> src/hotspot/share/oops/instanceKlass.hpp >>> >>> + void increment_count(int diff) { _count += diff; } >>> >>> Extra spaces before { >>> >>> --- >>> >>> src/hotspot/share/runtime/globals.hpp >>> >>> + diagnostic(bool, UseNewFieldLayout, true, \ >>> + "Use new algorithm to compute layouts") \ >>> + \ >>> + product(bool, UseEmptySlotsInSupers, true, >>> >>> Not sure I see why one flag is diagnostic and the other product. Do you expect people to need to disable using empty slots more so than needing to disable using the new field layout altogether? 
>>> >>> --- >>> >>> src/hotspot/share/classfile/fieldLayoutBuilder.cpp >>> >>> + assert(kind == EMPTY || kind == RESERVED || kind == PADDING || kind == INHERITED, >>> + "Otherwise, should use the constructor with a field index argument"); >>> >>> Indentation of second line is wrong. >>> >>> + assert(kind == REGULAR || kind == FLATTENED || kind == INHERITED, >>> + "Other kind do not have a field index"); >>> >>> Ditto. >>> >>> >>> + if (list == NULL) return; >>> + if (start == NULL) { >>> + start = this->_start; >>> + } >>> >>> Inconsistent style for single statement if-blocks. Same thing later in the file. >>> >>> + output->print_cr(" @%d \"%s\" %s %d/%d %s", >>> + b->offset(), >>> + fi->name(_cp)->as_C_string(), >>> + fi->signature(_cp)->as_C_string(), >>> + b->size(), >>> + b->alignment(), >>> + "REGULAR"); >>> >>> Incorrect identation of continuing line. Same for all the following print blocks. >>> >>> + } else if (_classname == vmSymbols::java_lang_Boolean() || >>> + _classname == vmSymbols::java_lang_Character() || >>> + _classname == vmSymbols::java_lang_Float() || >>> + _classname == vmSymbols::java_lang_Double() || >>> + _classname == vmSymbols::java_lang_Byte() || >>> + _classname == vmSymbols::java_lang_Short() || >>> + _classname == vmSymbols::java_lang_Integer() || >>> + _classname == vmSymbols::java_lang_Long()) { >>> >>> Incorrect identation of continuing line. >>> >>> --- >>> >>> src/hotspot/share/classfile/fieldLayoutBuilder.hpp >>> >>> +// and the boxing classes). The rational for having multiple methods >>> >>> s/rational/rationale/ >>> >>> + FieldLayoutBuilder(const Symbol* classname, const InstanceKlass* super_klass, ConstantPool* constant_pool, >>> + Array* fields, bool is_contended, FieldLayoutInfo* info); >>> >>> Indentation wrong for continuing line. >>> >>> + int get_alignment() { >>> + assert(_alignment != -1, "Uninitialized"); >>> + return _alignment; >>> + } >>> >>> Indenting appears off by one. 
>>> >>> --- >>> >>> test/hotspot/jtreg/runtime/FieldLayout/FieldDensityTest.java >>> >>> + * @run main/othervm -XX:+UseCompressedOops -XX:+UseCompressedClassPointers FieldDensityTest >>> + * @run main/othervm -XX:+UseCompressedOops -XX:-UseCompressedClassPointers FieldDensityTest >>> + * @run main/othervm -XX:-UseCompressedOops -XX:-UseCompressedClassPointers FieldDensityTest >>> >>> The test won't run on 32-bit platforms as the compressed oops flags won't exist. >>> >>> --- >>> >>> Some follow up comments below ... >>> >>> With trimming ... >>> >>> On 25/01/2020 3:20 am, Frederic Parain wrote: >>>>> On Jan 24, 2020, at 08:19, David Holmes wrote: >>>>> >>>>> 466 int super_flen = super->nof_nonstatic_fields(); >>>>> >>>>> Could be folded directly into the assert so we don't call in product. >>>> >>>> Calling not_nonstatic_fields() has the side effect to compute non-static fields, >>>> which is required to get a correct value when reading super->_nonstatic_fields, >>>> so the call is needed even in product builds. >>> >>> Yuck! That's a horrible side-effect - but not your fault obviously. :) It would be better to have a nonstatic_fields() accessor that has the same lazy initialization side-effect. >>> >>>>> General style issue: when breaking a long line with a method call, the new line (containing arguments) should be indented to the opening ( of the method call e.g. >>> ... >>>>> etc. This applies across all files. >>>> >>>> Fixes applied lines 4003, 4011, 4041, 4138, 4143. >>> >>> Fix was also needed in other files. Current issues highlighted above. >>> >>>>> >>>>> src/hotspot/share/oops/instanceKlass.hpp >>>>> >>>>> You need to be careful with _extra_flags usage if there can be concurrently updated bits. At the moment it looks like redefinition is a mutable dynamic property, whilst "contended annotations" should be a static immutable property - is that right? 
>>>> >>>> Correct, _has_contended_annotations is a static immutable property, while _is_being_redefined is a mutable one. >>> >>> Good to know. My concern is that if someone adds a new mutable flag bit the need for atomic updates may not be noticed. We got bitten by this in the past with a flag field and I think we eventually migrated all of the mutable bits out into their own field. (Coleen should recall that :) ). >>> >>>>> 61 FLATTENED, // flattened field >>>>> >>>>> Does this have any meaning before inline types come in? >>>> >>>> Yes, I wanted to reserve the entry in the enum. >>> >>> Hmmm a tenuous "okay". Seems odd to require this now to support code that is still some way from joining mainline. >>> >>>>> In FieldLayoutBuilder::epilogue you have a number of calls to Thread::current() as well as an implicit call when you use ResourceMarks. You should capture the current thread once in a local and reuse it. >>>> >>>> Fixed >>> >>> It seems that this fix is now not needed as there is only one use left of the new "thread" variable in the ResourceMark. So that can return to being: >>> >>> ResourceMark rm; >>> >>> Thanks, >>> David >>> ----- >>> > From lutz.schmidt at sap.com Wed Jan 29 16:48:47 2020 From: lutz.schmidt at sap.com (Schmidt, Lutz) Date: Wed, 29 Jan 2020 16:48:47 +0000 Subject: 8223699: cleanup perfMemory_aix.cpp O_NOFOLLOW coding on aix Message-ID: Hi Matthias, the change looks good. Please note: I'm not a Reviewer! Thanks for cleaning this up. Lutz On 28.01.20, 11:34, "hotspot-dev on behalf of Doerr, Martin" wrote: Hi Matthias, I agree. According to https://wiki.openjdk.java.net/display/Build/Supported+Build+Platforms we require AIX 7.1 for JDK11 and 7.2 since JDK13. 
Early versions of these AIX releases were affected by the following O_NOFOLLOW issue: https://bugs.openjdk.java.net/browse/JDK-8223701 But the affected Tech Levels have already reached "End of Service Pack Support": https://www.ibm.com/support/pages/aix-support-lifecycle-information Only AIX 7.1 TL5 and AIX 7.2 TL2 (and later) are still supported. Change looks good. Best regards, Martin > -----Original Message----- > From: hotspot-dev On Behalf Of > Baesken, Matthias > Sent: Dienstag, 28. Januar 2020 09:23 > To: 'hotspot-dev at openjdk.java.net' > Subject: RFR: 8223699: cleanup perfMemory_aix.cpp O_NOFOLLOW coding > on aix > > Hello, please review this AIX-specific cleanup. > > In perfMemory_aix.cpp we still have O_NOFOLLOW related fallback coding > for AIX versions < 7.X. > This is no longer needed, because we no longer support any AIX versions > < 7.X in the coming jdk13, so cleanup is possible. > > > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8223699 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8223699.1/ > > > Thanks, Matthias From daniel.daugherty at oracle.com Wed Jan 29 18:29:43 2020 From: daniel.daugherty at oracle.com (Daniel D. Daugherty) Date: Wed, 29 Jan 2020 13:29:43 -0500 Subject: RFR (S): 8237857: LogDecorations::uptimenanos is implemented incorrectly In-Reply-To: <42f25d16-7117-ae55-2dd5-2c5929f5340d@oracle.com> References: <42f25d16-7117-ae55-2dd5-2c5929f5340d@oracle.com> Message-ID: <8b25be64-94ce-32ab-7794-92aed8fd5189@oracle.com> On 1/29/20 12:13 AM, David Holmes wrote: > Bug: https://bugs.openjdk.java.net/browse/JDK-8237857 > Webrev: http://cr.openjdk.java.net/~dholmes/8237857/webrev/ src/hotspot/share/logging/logConfiguration.cpp No comments. src/hotspot/share/logging/logDecorations.hpp No comments. src/hotspot/share/logging/logDecorations.cpp I was trying to figure out where you were going with this fix and now I see. It would be good if you mentioned in the bug report why you are using os::elapsedTime(). 
Perhaps even mention your elapsed_time() helper. test/hotspot/gtest/logging/test_logDecorations.cpp Thanks for adding this comment: L117: // The sleep needs to be longer than the timer resolution to ensure L118: // we see updates to 'timemillis'. Windows has the lowest resolution L119: // at 15-16ms, so we use 20. Thumbs up. Dan > > After my changes in JDK-8235741 which changed uptimemillis to be the > same as uptimenanos*1000000 the gtest for the logging time decorators > started to fail intermittently on Windows where the uptimemillis > value was not advancing sometimes after a 5ms sleep. It turned out > this was actually a day one bug in the logging code that my change had > exposed - the code uptimenanos was using os::elapsed_counter() but > that isn't a time value, it's just a counter value. On Linux/AIX/BSD it > happens to count at 1ns resolution but not so on Solaris and in > particular not so on Windows. When this counter value was truncated to > "millis" there was no guarantee the value would increase as expected > by the test. > > Changes: > - use os::elapsedTimer() not os::elapsed_counter() for uptimemillis > and uptimenanos > - remove dead code that should have been removed in JDK-8235741 > - update the test by: > - adding diagnostic printouts of the time values read > - fix a bug where the sleep time can be too short for Windows (see > details in bug report) > > Testing: > 
- gtests on all platforms with manual analysis of log files > > Thanks, > David > ----- From david.holmes at oracle.com Wed Jan 29 23:16:53 2020 From: david.holmes at oracle.com (David Holmes) Date: Thu, 30 Jan 2020 09:16:53 +1000 Subject: RFR (S): 8237857: LogDecorations::uptimenanos is implemented incorrectly In-Reply-To: <8b25be64-94ce-32ab-7794-92aed8fd5189@oracle.com> References: <42f25d16-7117-ae55-2dd5-2c5929f5340d@oracle.com> <8b25be64-94ce-32ab-7794-92aed8fd5189@oracle.com> Message-ID: <8a75a825-d8b9-9818-0875-53f68e13bfd3@oracle.com> Thanks for the review Dan! I updated one of my comments in the bug report to explicitly list the fix. David On 30/01/2020 4:29 am, Daniel D. Daugherty wrote: > On 1/29/20 12:13 AM, David Holmes wrote: >> Bug: https://bugs.openjdk.java.net/browse/JDK-8237857 >> Webrev: http://cr.openjdk.java.net/~dholmes/8237857/webrev/ > > src/hotspot/share/logging/logConfiguration.cpp > No comments. > > src/hotspot/share/logging/logDecorations.hpp > No comments. > > src/hotspot/share/logging/logDecorations.cpp > I was trying to figure out where you were going with this fix > and now I see. It would be good if you mentioned in the bug > report why you are using os::elapsedTime(). Perhaps even > mention your elapsed_time() helper. > > test/hotspot/gtest/logging/test_logDecorations.cpp > Thanks for adding this comment: > > L117: // The sleep needs to be longer than the timer > resolution to ensure > L118: // we see updates to 'timemillis'. Windows has the > lowest resolution > L119: // at 15-16ms, so we use 20. > > Thumbs up. > > Dan > > >> >> After my changes in JDK-8235741 which changed uptimemillis to be the >> same as uptimenanos*1000000 the gtest for the logging time decorators >> started to fail intermittently on Windows where the uptimemillis >> value was not advancing sometimes after a 5ms sleep. 
It turned out >> this was actually a day one bug in the logging code that my change had >> exposed - the code uptimenanos was using os::elapsed_counter() but >> that isn't a time value, it's just a counter value. On Linux/AIX/BSD it >> happens to count at 1ns resolution but not so on Solaris and in >> particular not so on Windows. When this counter value was truncated to >> "millis" there was no guarantee the value would increase as expected >> by the test. >> >> Changes: >> - use os::elapsedTimer() not os::elapsed_counter() for uptimemillis >> and uptimenanos >> - remove dead code that should have been removed in JDK-8235741 >> - update the test by: >> - adding diagnostic printouts of the time values read >> - fix a bug where the sleep time can be too short for Windows (see >> details in bug report) >> >> Testing: >> - gtests on all platforms with manual analysis of log files >> >> Thanks, >> David >> ----- > From david.holmes at oracle.com Wed Jan 29 23:44:58 2020 From: david.holmes at oracle.com (David Holmes) Date: Thu, 30 Jan 2020 09:44:58 +1000 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com> <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com> <77ca9ddd-e633-6b72-4301-b6d16c61509d@oracle.com> <4bd88392-338a-6287-a16e-1fb3949b85f7@oracle.com> Message-ID: Hi Fred, On 30/01/2020 12:57 am, Frederic Parain wrote: > The kind of the flag is a secondary issue, the main issue is "how > fast do we want to remove the old code?". I wanted to keep the old > code for a short transition period in case new field layouts cause > issues for some tools or customers. The goal was not to keep it > forever and allow users to select which algorithm they want to use. Right. The problem with our 6 month release cadence is that we're not realistically going to get this new code exposed to users who will find potential problems in a single release. 
Even if we think this might mainly impact tools and those developers do tend to try and keep up, it is still unlikely to get enough visibility in a single release so ... > Note that keeping the old code and its activation with a normal > product flag will be an issue for the first release of Valhalla. > The way flattened fields are allocated in the old code is very > inefficient, which means that instead of seeing an improvement > in data density, users will see a significant increase in space > consumption if they use the old code. > > So, here's the three possible paths forward: > 1 - push the new code and remove the old code in a single change, > so no need for a VM option to select which code to use Seems potentially dangerous when we don't know the impact. > 2 - push the new code while keeping the old one for a short > period of time, to remove the old code before Valhalla code > is pushed Begs the question: when do we think something of Valhalla will ship? I'm guessing not before 17. If so then we deprecate the flag in 16 and obsolete in 17 - thus zero impact on Valhalla. If Valhalla comes earlier in 16 then we have some impact but will still have this out of the way before 17 (which should be next LTS release). > 3 - push the new code while keeping the old one with a long > deprecation period before removing the old code > > Option 3 will be an issue for project Valhalla, as explained > above. > > Option 1 sounds a bit risky to me, from the experience I had > while doing this change in the JVM and the unexpected places > where it caused issues. > > Option 2 sounded like a good compromise, however, it seems > there's no consensus on how to implement it. Could we refine > the details on how a short transition period could be implemented? Add flag for 15; deprecate in 16; obsolete in 17; expire in 18. I'm concerned we are developing a tendency to make flags diagnostic because it is considered convenient for us. 
There are customers whose own policies disallow running their production systems with anything other than product flags. To me a diagnostic flag aids in diagnosing a problem more than being a switch between old and new behaviour - but I agree the lines are blurry. Historically there was much more difference between product flags and diagnostic, when product flags would be supported in a number of releases spanning many years. That is no longer the case. We can effectively kill off a product flag after 12 months: 6 months full "support"; 6 months deprecated; then obsolete. We can even deprecate a flag in the release we introduce it if we really think that is warranted. If we intend to tell anyone to use the flag then we have to ensure both code paths remain fully functional during that period. Cheers, David > Thank you, > > Fred > > >> On Jan 29, 2020, at 08:44, coleen.phillimore at oracle.com wrote: >> >> >> >> On 1/29/20 8:31 AM, David Holmes wrote: >>> PS. I've now seen the CSR and have commented there. I don't agree with the use of a diagnostic flag. It doesn't buy us anything other than not needing a CSR request when we decide to deprecate it. But if we're going to tell people to use these flags to revert behaviour then they will need to follow the deprecation process regardless - and a product flag is better from an end user perspective. >> >> Obviously I disagree. We don't want to publicise this flag *unless* someone has a problem, which makes it a diagnostic flag. We don't want to have to carry the old code 2 releases when there is honestly no reason to suspect it is problematic. >> >> Coleen >> >>> >>> Cheers, >>> David >>> >>> On 29/01/2020 5:06 pm, David Holmes wrote: >>>> Hi Fred, >>>> >>>> I've looked at the v7 version. A few more stylistic comments on that first. 
Note, no need for an item by item response unless that makes it easier for you to track :) >>>> >>>> src/hotspot/share/classfile/classFileParser.cpp >>>> >>>> #include "classfile/defaultMethods.hpp" >>>> +#include "classfile/fieldLayoutBuilder.hpp" >>>> #include "classfile/dictionary.hpp" >>>> >>>> Include files are not in alphabetical order. >>>> >>>> + * This may well change: FixMe if doesn't, >>>> >>>> s/if/if it/ >>>> >>>> + //Make a temp copy, and iterate through and copy back into the orig >>>> >>>> Space after // >>>> >>>> s/orig/original/ >>>> >>>> + OopMapBlock* nonstatic_oop_map = _nonstatic_oop_maps; >>>> >>>> Extra space after * >>>> >>>> + assert(ik->nonstatic_oop_map_count() == _field_info->oop_map_blocks->_nonstatic_oop_map_count, >>>> + "sanity"); >>>> >>>> Second line needs to be indented further: >>>> >>>> assert(ik->nonstatic_oop_map_count() == _field_info->oop_map_blocks->_nonstatic_oop_map_count, >>>> "sanity"); >>>> >>>> --- >>>> >>>> src/hotspot/share/classfile/classFileParser.hpp >>>> >>>> +public: >>>> + OopMapBlock* _nonstatic_oop_maps; >>>> + unsigned int _nonstatic_oop_map_count; >>>> + unsigned int _max_nonstatic_oop_maps; >>>> + >>>> + public: >>>> >>>> Second public uneeded. First public may be indented wrong (I'm not sure what the rule is - single space indent?) >>>> >>>> class ClassFileParser { >>>> + friend class FieldLayoutBuilder; >>>> + friend class FieldLayout; >>>> >>>> class ClassAnnotationCollector; >>>> class FieldAllocationCount; >>>> class FieldAnnotationCollector; >>>> >>>> Indents are different. I think the class forward declarations should have extra space. 
>>>> >>>> --- >>>> >>>> src/hotspot/share/oops/instanceKlass.hpp >>>> >>>> + void increment_count(int diff) { _count += diff; } >>>> >>>> Extra spaces before { >>>> >>>> --- >>>> >>>> src/hotspot/share/runtime/globals.hpp >>>> >>>> + diagnostic(bool, UseNewFieldLayout, true, \ >>>> + "Use new algorithm to compute layouts") \ >>>> + \ >>>> + product(bool, UseEmptySlotsInSupers, true, >>>> >>>> Not sure I see why one flag is diagnostic and the other product. Do you expect people to need to disable using empty slots more so than needing to disable using the new field layout altogether? >>>> >>>> --- >>>> >>>> src/hotspot/share/classfile/fieldLayoutBuilder.cpp >>>> >>>> + assert(kind == EMPTY || kind == RESERVED || kind == PADDING || kind == INHERITED, >>>> + "Otherwise, should use the constructor with a field index argument"); >>>> >>>> Indentation of second line is wrong. >>>> >>>> + assert(kind == REGULAR || kind == FLATTENED || kind == INHERITED, >>>> + "Other kind do not have a field index"); >>>> >>>> Ditto. >>>> >>>> >>>> + if (list == NULL) return; >>>> + if (start == NULL) { >>>> + start = this->_start; >>>> + } >>>> >>>> Inconsistent style for single statement if-blocks. Same thing later in the file. >>>> >>>> + output->print_cr(" @%d \"%s\" %s %d/%d %s", >>>> + b->offset(), >>>> + fi->name(_cp)->as_C_string(), >>>> + fi->signature(_cp)->as_C_string(), >>>> + b->size(), >>>> + b->alignment(), >>>> + "REGULAR"); >>>> >>>> Incorrect identation of continuing line. Same for all the following print blocks. 
>>>> >>>> + } else if (_classname == vmSymbols::java_lang_Boolean() || >>>> + _classname == vmSymbols::java_lang_Character() || >>>> + _classname == vmSymbols::java_lang_Float() || >>>> + _classname == vmSymbols::java_lang_Double() || >>>> + _classname == vmSymbols::java_lang_Byte() || >>>> + _classname == vmSymbols::java_lang_Short() || >>>> + _classname == vmSymbols::java_lang_Integer() || >>>> + _classname == vmSymbols::java_lang_Long()) { >>>> >>>> Incorrect identation of continuing line. >>>> >>>> --- >>>> >>>> src/hotspot/share/classfile/fieldLayoutBuilder.hpp >>>> >>>> +// and the boxing classes). The rational for having multiple methods >>>> >>>> s/rational/rationale/ >>>> >>>> + FieldLayoutBuilder(const Symbol* classname, const InstanceKlass* super_klass, ConstantPool* constant_pool, >>>> + Array* fields, bool is_contended, FieldLayoutInfo* info); >>>> >>>> Indentation wrong for continuing line. >>>> >>>> + int get_alignment() { >>>> + assert(_alignment != -1, "Uninitialized"); >>>> + return _alignment; >>>> + } >>>> >>>> Indenting appears off by one. >>>> >>>> --- >>>> >>>> test/hotspot/jtreg/runtime/FieldLayout/FieldDensityTest.java >>>> >>>> + * @run main/othervm -XX:+UseCompressedOops -XX:+UseCompressedClassPointers FieldDensityTest >>>> + * @run main/othervm -XX:+UseCompressedOops -XX:-UseCompressedClassPointers FieldDensityTest >>>> + * @run main/othervm -XX:-UseCompressedOops -XX:-UseCompressedClassPointers FieldDensityTest >>>> >>>> The test won't run on 32-bit platforms as the compressed oops flags won't exist. >>>> >>>> --- >>>> >>>> Some follow up comments below ... >>>> >>>> With trimming ... >>>> >>>> On 25/01/2020 3:20 am, Frederic Parain wrote: >>>>>> On Jan 24, 2020, at 08:19, David Holmes wrote: >>>>>> >>>>>> 466 int super_flen = super->nof_nonstatic_fields(); >>>>>> >>>>>> Could be folded directly into the assert so we don't call in product. 
>>>>> >>>>> Calling not_nonstatic_fields() has the side effect to compute non-static fields, >>>>> which is required to get a correct value when reading super->_nonstatic_fields, >>>>> so the call is needed even in product builds. >>>> >>>> Yuck! That's a horrible side-effect - but not your fault obviously. :) It would be better to have a nonstatic_fields() accessor that has the same lazy initialization side-effect. >>>> >>>>>> General style issue: when breaking a long line with a method call, the new line (containing arguments) should be indented to the opening ( of the method call e.g. >>>> ... >>>>>> etc. This applies across all files. >>>>> >>>>> Fixes applied lines 4003, 4011, 4041, 4138, 4143. >>>> >>>> Fix was also needed in other files. Current issues highlighted above. >>>> >>>>>> >>>>>> src/hotspot/share/oops/instanceKlass.hpp >>>>>> >>>>>> You need to be careful with _extra_flags usage if there can be concurrently updated bits. At the moment it looks like redefinition is a mutable dynamic property, whilst "contended annotations" should be a static immutable property - is that right? >>>>> >>>>> Correct, _has_contended_annotations is a static immutable property, while _is_being_redefined is a mutable one. >>>> >>>> Good to know. My concern is that if someone adds a new mutable flag bit the need for atomic updates may not be noticed. We got bitten by this in the past with a flag field and I think we eventually migrated all of the mutable bits out into their own field. (Coleen should recall that :) ). >>>> >>>>>> 61 FLATTENED, // flattened field >>>>>> >>>>>> Does this have any meaning before inline types come in? >>>>> >>>>> Yes, I wanted to reserved the entry in the enum. >>>> >>>> Hmmm a tenuous "okay". Seems odd to require this now to support code that is still some way from joining mainline. >>>> >>>>>> In FieldLayoutBuilder::epilogue you have a number of calls to Thread::current() as well as an implicit call when you use ResourceMarks. 
You should capture the current thread once in a local and reuse it. >>>>> >>>>> Fixed >>>> >>>> It seems that this fix is now not needed as there is only one use left of the new "thread" variable in the ResourceMark. So that can return to being: >>>> >>>> ResourceMark rm; >>>> >>>> Thanks, >>>> David >>>> ----- >>>> >> > From kim.barrett at oracle.com Thu Jan 30 01:21:45 2020 From: kim.barrett at oracle.com (Kim Barrett) Date: Wed, 29 Jan 2020 20:21:45 -0500 Subject: RFR (S): 8237857: LogDecorations::uptimenanos is implemented incorrectly In-Reply-To: <42f25d16-7117-ae55-2dd5-2c5929f5340d@oracle.com> References: <42f25d16-7117-ae55-2dd5-2c5929f5340d@oracle.com> Message-ID: <5AF580E4-DA77-43D6-95FE-3E65366361E2@oracle.com> > On Jan 29, 2020, at 12:13 AM, David Holmes wrote: > > Bug: https://bugs.openjdk.java.net/browse/JDK-8237857 > Webrev: http://cr.openjdk.java.net/~dholmes/8237857/webrev/ Looks good. From david.holmes at oracle.com Thu Jan 30 01:39:30 2020 From: david.holmes at oracle.com (David Holmes) Date: Thu, 30 Jan 2020 11:39:30 +1000 Subject: RFR (S): 8237857: LogDecorations::uptimenanos is implemented incorrectly In-Reply-To: <5AF580E4-DA77-43D6-95FE-3E65366361E2@oracle.com> References: <42f25d16-7117-ae55-2dd5-2c5929f5340d@oracle.com> <5AF580E4-DA77-43D6-95FE-3E65366361E2@oracle.com> Message-ID: <7cad94c0-29ca-0077-cd50-d1a5a3ed548e@oracle.com> Thanks Kim! David On 30/01/2020 11:21 am, Kim Barrett wrote: >> On Jan 29, 2020, at 12:13 AM, David Holmes wrote: >> >> Bug: https://bugs.openjdk.java.net/browse/JDK-8237857 >> Webrev: http://cr.openjdk.java.net/~dholmes/8237857/webrev/ > > Looks good. > From leo.korinth at oracle.com Thu Jan 30 09:43:39 2020 From: leo.korinth at oracle.com (Leo Korinth) Date: Thu, 30 Jan 2020 10:43:39 +0100 Subject: fopen vs. 
os::fopen and automatic closing of the file on exec In-Reply-To: References: <462ea46a-cef8-79a0-ccf0-71699e619089@oracle.com> <76ff9efd-4274-29dd-c048-02e55aa70890@oracle.com> Message-ID: On 29/01/2020 14:03, Baesken, Matthias wrote: > Hi Leo / Thomas / David, > > Not sure if it is a "small" or "medium" gain, but after the > discussion so far I think it makes sense to use os::fopen in HS code > where possible > > (what (f)open does in the jdk C coding is a different thing, we > might discuss this on corelibs-dev in more detail). > > I opened > > https://bugs.openjdk.java.net/browse/JDK-8238161 I linked your bug to https://bugs.openjdk.java.net/browse/JDK-8202720 (process) Clarify file handler behaviour I have no idea why we (explicitly) keep files open on Windows on exec, I think maybe that issue should be dealt with first as it might affect this proposed change. Thanks, Leo > > use os::fopen in HS code where possible > > Best regards, Matthias > > Hi Leo, > > On Tue, Jan 28, 2020 at 4:37 PM Leo Korinth > wrote: > > ... > > Something that is not obvious is that on unix-like operating systems, > ProcessImpl_md.c tries to close (most) open files between fork and > exec. > That is not the case for Windows (I opened > https://bugs.openjdk.java.net/browse/JDK-8202720 for this). Thus (if I > understand correctly) the impact on unix-like operating systems will be > less for adding this support than it is for Windows. os::fopen was > created to solve a specific bug on windows (logging), and was > renamed to > the more generic os::fopen during review. > > Just a note, it is still possible for file descriptors to escape into > child processes since you cannot guarantee that all forks happen via > Runtime.exec(): > > - native third party code may fork. > > - hotspot may fork to run tools for error reporting > > Also note that the code which closes fds in Runtime.exec() may in theory > fail to close all fds. > > So I think Matthias makes sense on Posix platforms. 
> > I guess most uses of ::fopen /should/ use the more restricted > os::fopen, > but the gain would probably be small. > > Thanks, > Leo > From fweimer at redhat.com Thu Jan 30 13:37:46 2020 From: fweimer at redhat.com (Florian Weimer) Date: Thu, 30 Jan 2020 14:37:46 +0100 Subject: fopen vs. os::fopen and automatic closing of the file on exec In-Reply-To: <76ff9efd-4274-29dd-c048-02e55aa70890@oracle.com> (Leo Korinth's message of "Tue, 28 Jan 2020 16:36:44 +0100") References: <462ea46a-cef8-79a0-ccf0-71699e619089@oracle.com> <76ff9efd-4274-29dd-c048-02e55aa70890@oracle.com> Message-ID: <874kwd3w0l.fsf@oldenburg2.str.redhat.com> * Leo Korinth: >> It's possible the most recent version of Linux without O_CLOEXEC >> supported is no longer supported by OpenJDK, in which case we can >> remove it. But I'm not sure what that version is. I have no idea if >> fopen with "e" support has the same history as ::open and O_CLOEXEC. > > "e" is supported since glibc 2.7, released in 2007. Any support of > libc versions older than 2.7 today would surprise me. O_CLOEXEC has not been backported into the Red Hat Enterprise Linux 5 kernel (which is based on Linux 2.6.18). Its glibc (based on the 2.5 upstream version) does not support it, either. However, I expect that no one is going to support the next OpenJDK (LTS) release on that platform, so I don't think this is something that matters in the OpenJDK context. 
Thanks, Florian From coleen.phillimore at oracle.com Thu Jan 30 15:04:43 2020 From: coleen.phillimore at oracle.com (coleen.phillimore at oracle.com) Date: Thu, 30 Jan 2020 10:04:43 -0500 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com> <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com> <77ca9ddd-e633-6b72-4301-b6d16c61509d@oracle.com> <4bd88392-338a-6287-a16e-1fb3949b85f7@oracle.com> Message-ID: <88d83416-58c1-7026-aaff-208929d9b9d6@oracle.com> On 1/29/20 6:44 PM, David Holmes wrote: > Hi Fred, > > On 30/01/2020 12:57 am, Frederic Parain wrote: >> The kind of the flag is a secondary issue, the main issue is "how >> fast do we want to remove the old code?". I wanted to keep the old >> code for a short transition period in case new field layouts cause >> issues for some tools or customers. The goal was not to keep it >> forever and allow users to select which algorithm they want to use. > > Right. The problem with our 6 month release cadence is that we're not > realistically going to get this new code exposed to users who will > find potential problems in a single release. Even if we think this > might mainly impact tools and those developers do tend to try and keep > up, it is still unlikely to get enough visibility in a single release > so ... > >> Note that keeping the old code and its activation with a normal >> product flag will be an issue for the first release of Valhalla. >> The way flattened fields are allocated in the old code is very >> inefficient, which means that instead of seeing an improvement >> in data density, users will see a significant increase in space >> consumption if they use the old code. >> >> So, here's the three possible paths forward: >> 1 - push the new code and remove the old code in a single change, >> 
so no need for a VM option to select which code to use > Seems potentially dangerous when we don't know the impact. > >> 2 - push the new code while keeping the old one for a short >> period of time, to remove the old code before Valhalla code >> is pushed > Begs the question: when do we think something of Valhalla will ship? > I'm guessing not before 17. If so then we deprecate the flag in 16 and > obsolete in 17 - thus zero impact on Valhalla. If Valhalla comes > earlier in 16 then we have some impact but will still have this out of > the way before 17 (which should be next LTS release). > >> 3 - push the new code while keeping the old one with a long >> deprecation period before removing the old code >> >> Option 3 will be an issue for project Valhalla, as explained >> above. >> >> Option 1 sounds a bit risky to me, from the experience I had >> while doing this change in the JVM and the unexpected places >> where it caused issues. >> >> Option 2 sounded like a good compromise, however, it seems >> there's no consensus on how to implement it. Could we refine >> the details on how a short transition period could be implemented? > > Add flag for 15; deprecate in 16; obsolete in 17; expire in 18. How about add flag for 15, pre-deprecated, obsolete in 16 so that Valhalla work can go forward, especially if there's a preview in 16. Either way the old code is removed in JDK 17, the LTS release. > > I'm concerned we are developing a tendency to make flags diagnostic > because it is considered convenient for us. There are customers whose > own policies disallow running their production systems with anything > other than product flags. To me a diagnostic flag aids in diagnosing a > problem more than being a switch between old and new behaviour - but I > agree the lines are blurry. 
Historically there was much more > difference between product flags and diagnostic, when product flags > would be supported in a number of releases spanning many years. That > is no longer the case. We can effectively kill off a product flag > after 12 months: 6 months full "support"; 6 months deprecated; then > obsolete. We can even deprecate a flag in the release we introduce it > if we really think that is warranted. This 6 months of support vs. 9 months isn't going to make a practical difference to the customers. They can fall back on JDK 11, the previous LTS release, until they fix their code. 6 months of "support" is better for us however since we don't have code paths that we should be testing, and will be a broken (or preferably disabled) code path in the valhalla repo that we're working on and asking people to try right now. > > If we intend to tell anyone to use the flag then we have to ensure > both code paths remain fully functional during that period. That's the thing. We don't _want_ to tell anyone to use this flag unless they are diagnosing a problem with their code. The only reason I can see making it a product flag is because we want customers to fix their code, and not us to add hacks to our new code to support any corner case layout that _might_ exist. Coleen > > Cheers, > David > >> Thank you, >> >> Fred >> >> >>> On Jan 29, 2020, at 08:44, coleen.phillimore at oracle.com wrote: >>> >>> >>> >>> On 1/29/20 8:31 AM, David Holmes wrote: >>>> PS. I've now seen the CSR and have commented there. I don't agree with the use of a diagnostic flag. It doesn't buy us anything other than not needing a CSR request when we decide to deprecate it. But if we're going to tell people to use these flags to revert behaviour then they will need to follow the deprecation process regardless - and a product flag is better from an end user perspective. >>> >>> Obviously I disagree. 
We don't want to publicise this flag *unless* >>> someone has a problem, which makes it a diagnostic flag. We don't >>> want to have to carry the old code 2 releases when there is honestly >>> no reason to suspect it is problematic. >>> >>> Coleen >>> >>>> >>>> Cheers, >>>> David >>>> >>>> On 29/01/2020 5:06 pm, David Holmes wrote: >>>>> Hi Fred, >>>>> >>>>> I've looked at the v7 version. A few more stylistic comments on >>>>> that first. Note, no need for an item by item response unless that >>>>> makes it easier for you to track :) >>>>> >>>>> src/hotspot/share/classfile/classFileParser.cpp >>>>> >>>>>    #include "classfile/defaultMethods.hpp" >>>>> +#include "classfile/fieldLayoutBuilder.hpp" >>>>>    #include "classfile/dictionary.hpp" >>>>> >>>>> Include files are not in alphabetical order. >>>>> >>>>> +   * This may well change: FixMe if doesn't, >>>>> >>>>> s/if/if it/ >>>>> >>>>> +  //Make a temp copy, and iterate through and copy back into the >>>>> orig >>>>> >>>>> Space after // >>>>> >>>>> s/orig/original/ >>>>> >>>>> +  OopMapBlock*  nonstatic_oop_map = _nonstatic_oop_maps; >>>>> >>>>> Extra space after * >>>>> >>>>> +  assert(ik->nonstatic_oop_map_count() == >>>>> _field_info->oop_map_blocks->_nonstatic_oop_map_count, >>>>> +     "sanity"); >>>>> >>>>> Second line needs to be indented further: >>>>> >>>>>      assert(ik->nonstatic_oop_map_count() == >>>>> _field_info->oop_map_blocks->_nonstatic_oop_map_count, >>>>>             "sanity"); >>>>> >>>>> --- >>>>> >>>>> src/hotspot/share/classfile/classFileParser.hpp >>>>> >>>>> +public: >>>>> +  OopMapBlock* _nonstatic_oop_maps; >>>>> +  unsigned int _nonstatic_oop_map_count; >>>>> +  unsigned int _max_nonstatic_oop_maps; >>>>> + >>>>> + public: >>>>> >>>>> Second public unneeded. First public may be indented wrong (I'm not >>>>> sure what the rule is - single space indent?) >>>>> >>>>>    class ClassFileParser { >>>>> +  friend class FieldLayoutBuilder; >>>>> +
friend class FieldLayout; >>>>> >>>>> ??? class ClassAnnotationCollector; >>>>> ??? class FieldAllocationCount; >>>>> ??? class FieldAnnotationCollector; >>>>> >>>>> Indents are different. I think the class forward declarations >>>>> should have extra space. >>>>> >>>>> --- >>>>> >>>>> src/hotspot/share/oops/instanceKlass.hpp >>>>> >>>>> +? void increment_count(int diff)???? { _count += diff; } >>>>> >>>>> Extra spaces before { >>>>> >>>>> --- >>>>> >>>>> src/hotspot/share/runtime/globals.hpp >>>>> >>>>> +? diagnostic(bool, UseNewFieldLayout, true,???? \ >>>>> +?????????????? "Use new algorithm to compute layouts")???? \ >>>>> +???? \ >>>>> +? product(bool, UseEmptySlotsInSupers, true, >>>>> >>>>> Not sure I see why one flag is diagnostic and the other product. >>>>> Do you expect people to need to disable using empty slots more so >>>>> than needing to disable using the new field layout altogether? >>>>> >>>>> --- >>>>> >>>>> src/hotspot/share/classfile/fieldLayoutBuilder.cpp >>>>> >>>>> +? assert(kind == EMPTY || kind == RESERVED || kind == PADDING || >>>>> kind == INHERITED, >>>>> +????? "Otherwise, should use the constructor with a field index >>>>> argument"); >>>>> >>>>> Indentation of second line is wrong. >>>>> >>>>> +? assert(kind == REGULAR || kind == FLATTENED || kind == INHERITED, >>>>> +????? "Other kind do not have a field index"); >>>>> >>>>> Ditto. >>>>> >>>>> >>>>> +? if (list == NULL) return; >>>>> +? if (start == NULL) { >>>>> +??? start = this->_start; >>>>> +? } >>>>> >>>>> Inconsistent style for single statement if-blocks. Same thing >>>>> later in the file. >>>>> >>>>> +????? output->print_cr(" @%d \"%s\" %s %d/%d %s", >>>>> +????????? b->offset(), >>>>> +????????? fi->name(_cp)->as_C_string(), >>>>> +????????? fi->signature(_cp)->as_C_string(), >>>>> +????????? b->size(), >>>>> +????????? b->alignment(), >>>>> +????????? "REGULAR"); >>>>> >>>>> Incorrect identation of continuing line. Same for all the >>>>> following print blocks. 
>>>>> >>>>> +? } else if (_classname == vmSymbols::java_lang_Boolean() || >>>>> +????? _classname == vmSymbols::java_lang_Character() || >>>>> +????? _classname == vmSymbols::java_lang_Float() || >>>>> +????? _classname == vmSymbols::java_lang_Double() || >>>>> +????? _classname == vmSymbols::java_lang_Byte() || >>>>> +????? _classname == vmSymbols::java_lang_Short() || >>>>> +????? _classname == vmSymbols::java_lang_Integer() || >>>>> +????? _classname == vmSymbols::java_lang_Long()) { >>>>> >>>>> Incorrect identation of continuing line. >>>>> >>>>> --- >>>>> >>>>> src/hotspot/share/classfile/fieldLayoutBuilder.hpp >>>>> >>>>> +// and the boxing classes). The rational for having multiple methods >>>>> >>>>> s/rational/rationale/ >>>>> >>>>> +? FieldLayoutBuilder(const Symbol* classname, const >>>>> InstanceKlass* super_klass, ConstantPool* constant_pool, >>>>> +????? Array* fields, bool is_contended, FieldLayoutInfo* info); >>>>> >>>>> Indentation wrong for continuing line. >>>>> >>>>> +? int get_alignment() { >>>>> +???? assert(_alignment != -1, "Uninitialized"); >>>>> +???? return _alignment; >>>>> +?? } >>>>> >>>>> Indenting appears off by one. >>>>> >>>>> --- >>>>> >>>>> test/hotspot/jtreg/runtime/FieldLayout/FieldDensityTest.java >>>>> >>>>> + * @run main/othervm -XX:+UseCompressedOops >>>>> -XX:+UseCompressedClassPointers FieldDensityTest >>>>> + * @run main/othervm -XX:+UseCompressedOops >>>>> -XX:-UseCompressedClassPointers FieldDensityTest >>>>> + * @run main/othervm -XX:-UseCompressedOops >>>>> -XX:-UseCompressedClassPointers FieldDensityTest >>>>> >>>>> The test won't run on 32-bit platforms as the compressed oops >>>>> flags won't exist. >>>>> >>>>> --- >>>>> >>>>> Some follow up comments below ... >>>>> >>>>> With trimming ... >>>>> >>>>> On 25/01/2020 3:20 am, Frederic Parain wrote: >>>>>>> On Jan 24, 2020, at 08:19, David Holmes >>>>>>> wrote: >>>>>>> >>>>>>> 466???? int super_flen?? 
= super->nof_nonstatic_fields(); >>>>>>> >>>>>>> Could be folded directly into the assert so we don't call in >>>>>>> product. >>>>>> >>>>>> Calling nof_nonstatic_fields() has the side effect of computing the >>>>>> non-static fields, >>>>>> which is required to get a correct value when reading >>>>>> super->_nonstatic_fields, >>>>>> so the call is needed even in product builds. >>>>> >>>>> Yuck! That's a horrible side-effect - but not your fault >>>>> obviously. :) It would be better to have a nonstatic_fields() >>>>> accessor that has the same lazy initialization side-effect. >>>>> >>>>>>> General style issue: when breaking a long line with a method >>>>>>> call, the new line (containing arguments) should be indented to >>>>>>> the opening ( of the method call e.g. >>>>> ... >>>>>>> etc. This applies across all files. >>>>>> >>>>>> Fixes applied lines 4003, 4011, 4041, 4138, 4143. >>>>> >>>>> Fix was also needed in other files. Current issues highlighted above. >>>>>>> >>>>>>> src/hotspot/share/oops/instanceKlass.hpp >>>>>>> >>>>>>> You need to be careful with _extra_flags usage if there can be >>>>>>> concurrently updated bits. At the moment it looks like >>>>>>> redefinition is a mutable dynamic property, whilst "contended >>>>>>> annotations" should be a static immutable property - is that right? >>>>>> >>>>>> Correct, _has_contended_annotations is a static immutable >>>>>> property, while _is_being_redefined is a mutable one. >>>>> >>>>> Good to know. My concern is that if someone adds a new mutable >>>>> flag bit the need for atomic updates may not be noticed. We got >>>>> bitten by this in the past with a flag field and I think we >>>>> eventually migrated all of the mutable bits out into their own >>>>> field. (Coleen should recall that :) ). >>>>> >>>>>>>    61     FLATTENED,     // flattened field >>>>>>> >>>>>>> Does this have any meaning before inline types come in? >>>>>> >>>>>> Yes, I wanted to reserve the entry in the enum.
>>>>> Hmmm a tenuous "okay". Seems odd to require this now to support >>>>> code that is still some way from joining mainline. >>>>> >>>>>>> In FieldLayoutBuilder::epilogue you have a number of calls to >>>>>>> Thread::current() as well as an implicit call when you use >>>>>>> ResourceMarks. You should capture the current thread once in a >>>>>>> local and reuse it. >>>>>> >>>>>> Fixed >>>>> >>>>> It seems that this fix is now not needed as there is only one use >>>>> left of the new "thread" variable in the ResourceMark. So that can >>>>> return to being: >>>>> >>>>> ResourceMark rm; >>>>> >>>>> Thanks, >>>>> David >>>>> ----- >>>>> >>> >> From matthias.baesken at sap.com Thu Jan 30 15:47:50 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Thu, 30 Jan 2020 15:47:50 +0000 Subject: RFR: 8238161: use os::fopen in HS code where possible Message-ID: Please review this change, which adjusts a number of calls (in HS code) from fopen to os::fopen. A function os::fopen has existed in HS code for some time. This function sets the "close-on-exec" flag (when possible) on the opened file descriptor. It might be beneficial to use os::fopen more widely in HS code; currently there are still quite a few direct calls to fopen. (However, there are also already a number of calls to os::fopen, e.g. some calls in os_linux.cpp/os_solaris.cpp already use os::fopen, and logFileOutput.cpp uses os::fopen directly.) See also the thread https://mail.openjdk.java.net/pipermail/hotspot-dev/2020-January/040641.html on hotspot-dev about the topic.
Bug/webrev : https://bugs.openjdk.java.net/browse/JDK-8238161 http://cr.openjdk.java.net/~mbaesken/webrevs/8238161.0/ Thanks, Matthias From david.holmes at oracle.com Thu Jan 30 23:18:34 2020 From: david.holmes at oracle.com (David Holmes) Date: Fri, 31 Jan 2020 09:18:34 +1000 Subject: RFR: 8238161: use os::fopen in HS code where possible In-Reply-To: References: Message-ID: Hi Matthias, Does close-on-exec have any meaning when applied to the /proc filesystem? Thanks, David On 31/01/2020 1:47 am, Baesken, Matthias wrote: > Please review this change, which adjusts a number of calls (in HS code) from fopen to os::fopen. > > A function os::fopen has existed in HS code for some time. This function sets the "close-on-exec" flag (when possible) on the opened file descriptor. > It might be beneficial to use os::fopen more widely in HS code; currently there are still quite a few direct calls to fopen. > (However, there are also already a number of calls to os::fopen, e.g. some calls in os_linux.cpp/os_solaris.cpp already use os::fopen, and logFileOutput.cpp uses os::fopen directly.) > > > See also the thread > https://mail.openjdk.java.net/pipermail/hotspot-dev/2020-January/040641.html > on hotspot-dev about the topic.
> > > Bug/webrev : > > https://bugs.openjdk.java.net/browse/JDK-8238161 > > http://cr.openjdk.java.net/~mbaesken/webrevs/8238161.0/ > > > Thanks, Matthias > From david.holmes at oracle.com Thu Jan 30 23:40:58 2020 From: david.holmes at oracle.com (David Holmes) Date: Fri, 31 Jan 2020 09:40:58 +1000 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: <88d83416-58c1-7026-aaff-208929d9b9d6@oracle.com> References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com> <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com> <77ca9ddd-e633-6b72-4301-b6d16c61509d@oracle.com> <4bd88392-338a-6287-a16e-1fb3949b85f7@oracle.com> <88d83416-58c1-7026-aaff-208929d9b9d6@oracle.com> Message-ID: On 31/01/2020 1:04 am, coleen.phillimore at oracle.com wrote: > On 1/29/20 6:44 PM, David Holmes wrote: >> Hi Fred, >> >> On 30/01/2020 12:57 am, Frederic Parain wrote: >>> The kind of the flag is a secondary issue, the main issue is "how >>> fast do we want to remove the old code?". I wanted to keep the old >>> code for a short transition period in case new field layouts cause >>> issues for some tools or customers. The goal was not to keep it >>> forever and allow users to select which algorithm they want to use. >> >> Right. The problem with our 6 month release cadence is that we're not >> realistically going to get this new code exposed to users who will >> find potential problems in a single release. Even if we think this >> might mainly impact tools and those developers do tend to try and >> keep up, it is still unlikely to get enough visibility in a single >> release so ... >> >>> Note that keeping the old code and its activation with a normal >>> product flag will be an issue for the first release of Valhalla.
>>>> The way flattened fields are allocated in the old code is very >>>> inefficient, which means that instead of seeing an improvement >>>> in data density, users will see a significant increase in space >>>> consumption if they use the old code. >>>> >>>> So, here's the three possible paths forward: >>>>    1 - push the new code and remove the old code in a single change, >>>>        so no need for a VM option to select which code to use >>> >>> Seems potentially dangerous when we don't know the impact. >>> >>>>    2 - push the new code while keeping the old one for a short >>>>        period of time, to remove the old code before Valhalla code >>>>        is pushed >>> >>> Begs the question: when do we think something of Valhalla will ship? >>> I'm guessing not before 17. If so then we deprecate the flag in 16 >>> and obsolete in 17 - thus zero impact on Valhalla. If Valhalla comes >>> earlier in 16 then we have some impact but will still have this out >>> of the way before 17 (which should be next LTS release). >>> >>>>    3 - push the new code while keeping the old one with a long >>>>        deprecation period before removing the old code >>>> >>>> Option 3 will be an issue for project Valhalla, as explained >>>> above. >>>> >>>> Option 1 sounds a bit risky to me, from the experience I had >>>> while doing this change in the JVM and the unexpected places >>>> where it caused issues. >>>> >>>> Option 2 sounded like a good compromise, however, it seems >>>> there's no consensus how to implement it. Could we refine >>>> the details on how a short transition period could be implemented? >>> >>> Add flag for 15; deprecate in 16; obsolete in 17; expire in 18. >> >> How about add flag for 15, pre-deprecated, obsolete in 16 so that >> valhalla work can go forward, especially if there's a preview in 16. >> Either way the old code is removed in JDK 17, the LTS release. > > That is fine too.
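[For context on the product-versus-diagnostic distinction debated above: a product flag can be toggled directly on the java command line, while a diagnostic flag must first be unlocked with -XX:+UnlockDiagnosticVMOptions. A sketch, using the flag names from the quoted review (UseNewFieldLayout was proposed as diagnostic, UseEmptySlotsInSupers as product); the exact behaviour depends on the JDK build in use.]

```shell
# Product flag: can be set directly.
java -XX:-UseEmptySlotsInSupers -version

# Diagnostic flag: rejected unless diagnostic options are unlocked first.
java -XX:+UnlockDiagnosticVMOptions -XX:-UseNewFieldLayout -version
```

[This unlock requirement is part of why a diagnostic flag is less convenient for end users than a product flag, as the thread discusses.]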
There are customers whose >> own policies disallow running their production systems with anything >> other than product flags. To me a diagnostic flag aids in diagnosing a >> problem more than being a switch between old and new behaviour - but I >> agree the lines are blurry. Historically there was much more >> difference between product flags and diagnostic, when product flags >> would be supported in a number of releases spanning many years. That >> is no longer the case. We can effectively kill off a product flag >> after 12 months: 6 months full "support"; 6 months deprecated; then >> obsolete. We can even deprecate a flag in the release we introduce it >> if we really think that is warranted. > > This 6 months of support vs. 9 months isn't going to make a practical > difference to the customers.? They can fall back on JDK 11, the previous > LTS release until they fix their code.? 6 months of "support" is better > for us however since we don't have code paths that we should be testing, > and will be a broken (or preferably disabled) code path in the valhalla > repo that we're working on and asking people to try right now. >> >> If we intend to tell anyone to use the flag then we have to ensure >> both code paths remain fully functional during that period. > > That's the thing.? We don't _want_ to tell anyone to use this flag > unless they are diagnosing a problem with their code. Regardless of the "why" if we tell people to use this flag as a workaround then the code path has to be functional. > The only reason I > can see making it a product flag is because we want customers to fix > their code, and not us to add hacks to our new code to support any > corner case layout that _might_ exist. A product flag is good for users and has no downside for us. A diagnostic flag is not so good for users and has no upside for us. The old code is going to remain, and must continue to work, for the exact same length of time. 
Having one product flag and one diagnostic flag makes no sense to me. Cheers, David > Coleen >> >> Cheers, >> David >> >>> Thank you, >>> >>> Fred >>> >>> >>>> On Jan 29, 2020, at 08:44, coleen.phillimore at oracle.com wrote: >>>> >>>> >>>> >>>> On 1/29/20 8:31 AM, David Holmes wrote: >>>>> PS. I've now seen the CSR and have commented there. I don't agree >>>>> with the use of a diagnostic flag. It doesn't buy us anything other >>>>> than not needing a CSR request when we decide to deprecate it. But >>>>> if we're going to tell people to use these flags to revert >>>>> behaviour then they will need to follow the deprecation process >>>>> regardless - and a product flag is better from an end user >>>>> perspective. >>>> >>>> Obviously I disagree.? We don't want to publicise this flag *unless* >>>> someone has a problem, which makes it a diagnostic flag.?? We don't >>>> want to have to carry the old code 2 releases when there is honestly >>>> no reason to suspect it is problematic. >>>> >>>> Coleen >>>> >>>>> >>>>> Cheers, >>>>> David >>>>> >>>>> On 29/01/2020 5:06 pm, David Holmes wrote: >>>>>> Hi Fred, >>>>>> >>>>>> I've looked at the v7 version. A few more stylistic comments on >>>>>> that first. Note, no need for an item by item response unless that >>>>>> makes it easier for you to track :) >>>>>> >>>>>> src/hotspot/share/classfile/classFileParser.cpp >>>>>> >>>>>> ?? #include "classfile/defaultMethods.hpp" >>>>>> +#include "classfile/fieldLayoutBuilder.hpp" >>>>>> ?? #include "classfile/dictionary.hpp" >>>>>> >>>>>> Include files are not in alphabetical order. >>>>>> >>>>>> +?? * This may well change: FixMe if doesn't, >>>>>> >>>>>> s/if/if it/ >>>>>> >>>>>> +? //Make a temp copy, and iterate through and copy back into the >>>>>> orig >>>>>> >>>>>> Space after // >>>>>> >>>>>> s/orig/original/ >>>>>> >>>>>> +? OopMapBlock*? nonstatic_oop_map = _nonstatic_oop_maps; >>>>>> >>>>>> Extra space after * >>>>>> >>>>>> +? 
assert(ik->nonstatic_oop_map_count() == >>>>>> _field_info->oop_map_blocks->_nonstatic_oop_map_count, >>>>>> +???? "sanity"); >>>>>> >>>>>> Second line needs to be indented further: >>>>>> >>>>>> ???? assert(ik->nonstatic_oop_map_count() == >>>>>> _field_info->oop_map_blocks->_nonstatic_oop_map_count, >>>>>> ??????????? "sanity"); >>>>>> >>>>>> --- >>>>>> >>>>>> src/hotspot/share/classfile/classFileParser.hpp >>>>>> >>>>>> +public: >>>>>> +? OopMapBlock* _nonstatic_oop_maps; >>>>>> +? unsigned int _nonstatic_oop_map_count; >>>>>> +? unsigned int _max_nonstatic_oop_maps; >>>>>> + >>>>>> + public: >>>>>> >>>>>> Second public uneeded. First public may be indented wrong (I'm not >>>>>> sure what the rule is - single space indent?) >>>>>> >>>>>> ?? class ClassFileParser { >>>>>> +? friend class FieldLayoutBuilder; >>>>>> +? friend class FieldLayout; >>>>>> >>>>>> ??? class ClassAnnotationCollector; >>>>>> ??? class FieldAllocationCount; >>>>>> ??? class FieldAnnotationCollector; >>>>>> >>>>>> Indents are different. I think the class forward declarations >>>>>> should have extra space. >>>>>> >>>>>> --- >>>>>> >>>>>> src/hotspot/share/oops/instanceKlass.hpp >>>>>> >>>>>> +? void increment_count(int diff)???? { _count += diff; } >>>>>> >>>>>> Extra spaces before { >>>>>> >>>>>> --- >>>>>> >>>>>> src/hotspot/share/runtime/globals.hpp >>>>>> >>>>>> +? diagnostic(bool, UseNewFieldLayout, true,???? \ >>>>>> +?????????????? "Use new algorithm to compute layouts")???? \ >>>>>> +???? \ >>>>>> +? product(bool, UseEmptySlotsInSupers, true, >>>>>> >>>>>> Not sure I see why one flag is diagnostic and the other product. >>>>>> Do you expect people to need to disable using empty slots more so >>>>>> than needing to disable using the new field layout altogether? >>>>>> >>>>>> --- >>>>>> >>>>>> src/hotspot/share/classfile/fieldLayoutBuilder.cpp >>>>>> >>>>>> +? assert(kind == EMPTY || kind == RESERVED || kind == PADDING || >>>>>> kind == INHERITED, >>>>>> +????? 
"Otherwise, should use the constructor with a field index >>>>>> argument"); >>>>>> >>>>>> Indentation of second line is wrong. >>>>>> >>>>>> +? assert(kind == REGULAR || kind == FLATTENED || kind == INHERITED, >>>>>> +????? "Other kind do not have a field index"); >>>>>> >>>>>> Ditto. >>>>>> >>>>>> >>>>>> +? if (list == NULL) return; >>>>>> +? if (start == NULL) { >>>>>> +??? start = this->_start; >>>>>> +? } >>>>>> >>>>>> Inconsistent style for single statement if-blocks. Same thing >>>>>> later in the file. >>>>>> >>>>>> +????? output->print_cr(" @%d \"%s\" %s %d/%d %s", >>>>>> +????????? b->offset(), >>>>>> +????????? fi->name(_cp)->as_C_string(), >>>>>> +????????? fi->signature(_cp)->as_C_string(), >>>>>> +????????? b->size(), >>>>>> +????????? b->alignment(), >>>>>> +????????? "REGULAR"); >>>>>> >>>>>> Incorrect identation of continuing line. Same for all the >>>>>> following print blocks. >>>>>> >>>>>> +? } else if (_classname == vmSymbols::java_lang_Boolean() || >>>>>> +????? _classname == vmSymbols::java_lang_Character() || >>>>>> +????? _classname == vmSymbols::java_lang_Float() || >>>>>> +????? _classname == vmSymbols::java_lang_Double() || >>>>>> +????? _classname == vmSymbols::java_lang_Byte() || >>>>>> +????? _classname == vmSymbols::java_lang_Short() || >>>>>> +????? _classname == vmSymbols::java_lang_Integer() || >>>>>> +????? _classname == vmSymbols::java_lang_Long()) { >>>>>> >>>>>> Incorrect identation of continuing line. >>>>>> >>>>>> --- >>>>>> >>>>>> src/hotspot/share/classfile/fieldLayoutBuilder.hpp >>>>>> >>>>>> +// and the boxing classes). The rational for having multiple methods >>>>>> >>>>>> s/rational/rationale/ >>>>>> >>>>>> +? FieldLayoutBuilder(const Symbol* classname, const >>>>>> InstanceKlass* super_klass, ConstantPool* constant_pool, >>>>>> +????? Array* fields, bool is_contended, FieldLayoutInfo* info); >>>>>> >>>>>> Indentation wrong for continuing line. >>>>>> >>>>>> +? int get_alignment() { >>>>>> +???? 
assert(_alignment != -1, "Uninitialized"); >>>>>> +???? return _alignment; >>>>>> +?? } >>>>>> >>>>>> Indenting appears off by one. >>>>>> >>>>>> --- >>>>>> >>>>>> test/hotspot/jtreg/runtime/FieldLayout/FieldDensityTest.java >>>>>> >>>>>> + * @run main/othervm -XX:+UseCompressedOops >>>>>> -XX:+UseCompressedClassPointers FieldDensityTest >>>>>> + * @run main/othervm -XX:+UseCompressedOops >>>>>> -XX:-UseCompressedClassPointers FieldDensityTest >>>>>> + * @run main/othervm -XX:-UseCompressedOops >>>>>> -XX:-UseCompressedClassPointers FieldDensityTest >>>>>> >>>>>> The test won't run on 32-bit platforms as the compressed oops >>>>>> flags won't exist. >>>>>> >>>>>> --- >>>>>> >>>>>> Some follow up comments below ... >>>>>> >>>>>> With trimming ... >>>>>> >>>>>> On 25/01/2020 3:20 am, Frederic Parain wrote: >>>>>>>> On Jan 24, 2020, at 08:19, David Holmes >>>>>>>> wrote: >>>>>>>> >>>>>>>> 466???? int super_flen?? = super->nof_nonstatic_fields(); >>>>>>>> >>>>>>>> Could be folded directly into the assert so we don't call in >>>>>>>> product. >>>>>>> >>>>>>> Calling not_nonstatic_fields() has the side effect to compute >>>>>>> non-static fields, >>>>>>> which is required to get a correct value when reading >>>>>>> super->_nonstatic_fields, >>>>>>> so the call is needed even in product builds. >>>>>> >>>>>> Yuck! That's a horrible side-effect - but not your fault >>>>>> obviously. :) It would be better to have a nonstatic_fields() >>>>>> accessor that has the same lazy initialization side-effect. >>>>>> >>>>>>>> General style issue: when breaking a long line with a method >>>>>>>> call, the new line (containing arguments) should be indented to >>>>>>>> the opening ( of the method call e.g. >>>>>> ... >>>>>>>> etc. This applies across all files. >>>>>>> >>>>>>> Fixes applied lines 4003, 4011, 4041, 4138, 4143. >>>>>> >>>>>> Fix was also needed in other files. Current issues highlighted above. 
>>>>>> >>>>>>>> >>>>>>>> src/hotspot/share/oops/instanceKlass.hpp >>>>>>>> >>>>>>>> You need to be careful with _extra_flags usage if there can be >>>>>>>> concurrently updated bits. At the moment it looks like >>>>>>>> redefinition is a mutable dynamic property, whilst "contended >>>>>>>> annotations" should be a static immutable property - is that right? >>>>>>> >>>>>>> Correct, _has_contended_annotations is a static immutable >>>>>>> property, while _is_being_redefined is a mutable one. >>>>>> >>>>>> Good to know. My concern is that if someone adds a new mutable >>>>>> flag bit the need for atomic updates may not be noticed. We got >>>>>> bitten by this in the past with a flag field and I think we >>>>>> eventually migrated all of the mutable bits out into their own >>>>>> field. (Coleen should recall that :) ). >>>>>> >>>>>>>> ?? 61???? FLATTENED,???? // flattened field >>>>>>>> >>>>>>>> Does this have any meaning before inline types come in? >>>>>>> >>>>>>> Yes, I wanted to reserved the entry in the enum. >>>>>> >>>>>> Hmmm a tenuous "okay". Seems odd to require this now to support >>>>>> code that is still some way from joining mainline. >>>>>> >>>>>>>> In FieldLayoutBuilder::epilogue you have a number of calls to >>>>>>>> Thread::current() as well as an implicit call when you use >>>>>>>> ResourceMarks. You should capture the current thread once in a >>>>>>>> local and reuse it. >>>>>>> >>>>>>> Fixed >>>>>> >>>>>> It seems that this fix is now not needed as there is only one use >>>>>> left of the new "thread" variable in the ResourceMark. 
So that can >>>>>> return to being: >>>>>> >>>>>> ResourceMark rm; >>>>>> >>>>>> Thanks, >>>>>> David >>>>>> ----- >>>>>> >>>> >>> > From david.holmes at oracle.com Fri Jan 31 00:23:50 2020 From: david.holmes at oracle.com (David Holmes) Date: Fri, 31 Jan 2020 10:23:50 +1000 Subject: RFR[L]: 8237767 Field layout computation overhaul In-Reply-To: References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com> <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com> <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com> <77ca9ddd-e633-6b72-4301-b6d16c61509d@oracle.com> <4bd88392-338a-6287-a16e-1fb3949b85f7@oracle.com> <88d83416-58c1-7026-aaff-208929d9b9d6@oracle.com> Message-ID: Correction ... On 31/01/2020 9:40 am, David Holmes wrote: > On 31/01/2020 1:04 am, coleen.phillimore at oracle.com wrote: >> On 1/29/20 6:44 PM, David Holmes wrote: >>> Hi Fred, >>> >>> On 30/01/2020 12:57 am, Frederic Parain wrote: >>>> The kind of the flag is a secondary issue, the main issue is ?how >>>> fast do we want to remove the old code??. I wanted to keep the old >>>> code for a short transition period in case new field layouts cause >>>> issues for some tools or customers. The goal was not to keep it >>>> forever and allow users to select which algorithm they want to use. >>> >>> Right. The problem with our 6 month release cadence is that we're not >>> realistically going to get this new code exposed to users who will >>> find potential problems in a single release. Even if we think this >>> might mainly impact tools and those developers do tend to try and >>> keep up, it is still unlikely to get enough visibility in a single >>> release so ... >>> >>>> Note that keeping the old code and its activation with a normal >>>> product flag will be an issue for the first release of Valhalla. 
>>>> The way flattened fields are allocated in the old code is very >>>> inefficient, which means that instead of seeing an improvement >>>> in data density, users will see a significant increase in space >>>> consumption if they use the old code. >>>> >>>> So, here?s the three possible paths forward: >>>> ?? 1 - push the new code and remove the old code in a single change, >>>> ?????? so no need for a VM option to select which code to use >>> >>> Seems potentially dangerous when we don't know the impact. >>> >>>> ?? 2 - push the new code while keeping the old one for a short >>>> ?????? period of time, to remove the old code before Valhalla code >>>> ?????? is pushed >>> >>> Begs the question: when do we think something of Valhalla will ship? >>> I'm guessing not before 17. If so then we deprecate the flag in 16 >>> and obsolete in 17 - thus zero impact on Valhalla. If Valhalla comes >>> earlier in 16 then we have some impact but will still have this out >>> of the way before 17 (which should be next LTS release). >>> >>>> ?? 3 - push the new code while keeping the old one with a long >>>> ?????? deprecation period before removing the old code >>>> >>>> Option 3 will be an issue for project Valhalla, as explained >>>> above. >>>> >>>> Option 1 sounds a big risky to me, from the experience I had >>>> while doing this change in the JVM and the unexpected places >>>> where it caused issues. >>>> >>>> Option 2 sounded like a good compromise, however, it seems >>>> there?s no consensus how to implement it. Could we refine >>>> the details on how a short transition period could be implemented? >>> >>> Add flag for 15; deprecate in 16; obsolete in 17; expire in 18. >> >> How about add flag for 15, pre-deprecated, obsolete in 16 so that >> valhalla work can go forward, especially if there's a preview in 16. >> Either way the old code is removed in JDK 17, the LTS release. > > That is fine too. 
> >>> I'm concerned we are developing a tendency to make flags diagnostic >>> because it is considered convenient for us. There are customers whose >>> own policies disallow running their production systems with anything >>> other than product flags. To me a diagnostic flag aids in diagnosing >>> a problem more than being a switch between old and new behaviour - >>> but I agree the lines are blurry. Historically there was much more >>> difference between product flags and diagnostic, when product flags >>> would be supported in a number of releases spanning many years. That >>> is no longer the case. We can effectively kill off a product flag >>> after 12 months: 6 months full "support"; 6 months deprecated; then >>> obsolete. We can even deprecate a flag in the release we introduce it >>> if we really think that is warranted. >> >> This 6 months of support vs. 9 months isn't going to make a practical >> difference to the customers.? They can fall back on JDK 11, the >> previous LTS release until they fix their code.? 6 months of "support" >> is better for us however since we don't have code paths that we should >> be testing, and will be a broken (or preferably disabled) code path in >> the valhalla repo that we're working on and asking people to try right >> now. >>> >>> If we intend to tell anyone to use the flag then we have to ensure >>> both code paths remain fully functional during that period. >> >> That's the thing.? We don't _want_ to tell anyone to use this flag >> unless they are diagnosing a problem with their code. > > Regardless of the "why" if we tell people to use this flag as a > workaround then the code path has to be functional. > >> The only reason I can see making it a product flag is because we want >> customers to fix their code, and not us to add hacks to our new code >> to support any corner case layout that _might_ exist. > > A product flag is good for users and has no downside for us. 
A > diagnostic flag is not so good for users and has no upside for us. The > old code is going to remain, and must continue to work, for the exact > same length of time. Having one product flag and one diagnostic flag > makes no sense to me. Sorry I just realized that the two flags are unrelated. One pertains to using the old code versus new; the other controls a more aggressive optimisation in the new code. David > Cheers, > David > >> Coleen >>> >>> Cheers, >>> David >>> >>>> Thank you, >>>> >>>> Fred >>>> >>>> >>>>> On Jan 29, 2020, at 08:44, coleen.phillimore at oracle.com wrote: >>>>> >>>>> >>>>> >>>>> On 1/29/20 8:31 AM, David Holmes wrote: >>>>>> PS. I've now seen the CSR and have commented there. I don't agree >>>>>> with the use of a diagnostic flag. It doesn't buy us anything >>>>>> other than not needing a CSR request when we decide to deprecate >>>>>> it. But if we're going to tell people to use these flags to revert >>>>>> behaviour then they will need to follow the deprecation process >>>>>> regardless - and a product flag is better from an end user >>>>>> perspective. >>>>> >>>>> Obviously I disagree.? We don't want to publicise this flag >>>>> *unless* someone has a problem, which makes it a diagnostic flag. >>>>> We don't want to have to carry the old code 2 releases when there >>>>> is honestly no reason to suspect it is problematic. >>>>> >>>>> Coleen >>>>> >>>>>> >>>>>> Cheers, >>>>>> David >>>>>> >>>>>> On 29/01/2020 5:06 pm, David Holmes wrote: >>>>>>> Hi Fred, >>>>>>> >>>>>>> I've looked at the v7 version. A few more stylistic comments on >>>>>>> that first. Note, no need for an item by item response unless >>>>>>> that makes it easier for you to track :) >>>>>>> >>>>>>> src/hotspot/share/classfile/classFileParser.cpp >>>>>>> >>>>>>> ?? #include "classfile/defaultMethods.hpp" >>>>>>> +#include "classfile/fieldLayoutBuilder.hpp" >>>>>>> ?? #include "classfile/dictionary.hpp" >>>>>>> >>>>>>> Include files are not in alphabetical order. 
>>>>>>>
>>>>>>> +   * This may well change: FixMe if doesn't,
>>>>>>>
>>>>>>> s/if/if it/
>>>>>>>
>>>>>>> +  //Make a temp copy, and iterate through and copy back into the
>>>>>>> orig
>>>>>>>
>>>>>>> Space after //
>>>>>>>
>>>>>>> s/orig/original/
>>>>>>>
>>>>>>> +  OopMapBlock*  nonstatic_oop_map = _nonstatic_oop_maps;
>>>>>>>
>>>>>>> Extra space after *
>>>>>>>
>>>>>>> +  assert(ik->nonstatic_oop_map_count() ==
>>>>>>> _field_info->oop_map_blocks->_nonstatic_oop_map_count,
>>>>>>> +     "sanity");
>>>>>>>
>>>>>>> Second line needs to be indented further:
>>>>>>>
>>>>>>>      assert(ik->nonstatic_oop_map_count() ==
>>>>>>> _field_info->oop_map_blocks->_nonstatic_oop_map_count,
>>>>>>>             "sanity");
>>>>>>>
>>>>>>> ---
>>>>>>>
>>>>>>> src/hotspot/share/classfile/classFileParser.hpp
>>>>>>>
>>>>>>> +public:
>>>>>>> +  OopMapBlock* _nonstatic_oop_maps;
>>>>>>> +  unsigned int _nonstatic_oop_map_count;
>>>>>>> +  unsigned int _max_nonstatic_oop_maps;
>>>>>>> +
>>>>>>> + public:
>>>>>>>
>>>>>>> Second public uneeded. First public may be indented wrong (I'm
>>>>>>> not sure what the rule is - single space indent?)
>>>>>>>
>>>>>>>   class ClassFileParser {
>>>>>>> +  friend class FieldLayoutBuilder;
>>>>>>> +  friend class FieldLayout;
>>>>>>>
>>>>>>>    class ClassAnnotationCollector;
>>>>>>>    class FieldAllocationCount;
>>>>>>>    class FieldAnnotationCollector;
>>>>>>>
>>>>>>> Indents are different. I think the class forward declarations
>>>>>>> should have extra space.
>>>>>>>
>>>>>>> ---
>>>>>>>
>>>>>>> src/hotspot/share/oops/instanceKlass.hpp
>>>>>>>
>>>>>>> +  void increment_count(int diff)     { _count += diff; }
>>>>>>>
>>>>>>> Extra spaces before {
>>>>>>>
>>>>>>> ---
>>>>>>>
>>>>>>> src/hotspot/share/runtime/globals.hpp
>>>>>>>
>>>>>>> +  diagnostic(bool, UseNewFieldLayout, true,     \
>>>>>>> +               "Use new algorithm to compute layouts")     \
>>>>>>> +     \
>>>>>>> +  product(bool, UseEmptySlotsInSupers, true,
>>>>>>>
>>>>>>> Not sure I see why one flag is diagnostic and the other product.
>>>>>>> Do you expect people to need to disable using empty slots more so
>>>>>>> than needing to disable using the new field layout altogether?
>>>>>>>
>>>>>>> ---
>>>>>>>
>>>>>>> src/hotspot/share/classfile/fieldLayoutBuilder.cpp
>>>>>>>
>>>>>>> +  assert(kind == EMPTY || kind == RESERVED || kind == PADDING ||
>>>>>>> kind == INHERITED,
>>>>>>> +      "Otherwise, should use the constructor with a field index
>>>>>>> argument");
>>>>>>>
>>>>>>> Indentation of second line is wrong.
>>>>>>>
>>>>>>> +  assert(kind == REGULAR || kind == FLATTENED || kind == INHERITED,
>>>>>>> +      "Other kind do not have a field index");
>>>>>>>
>>>>>>> Ditto.
>>>>>>>
>>>>>>>
>>>>>>> +  if (list == NULL) return;
>>>>>>> +  if (start == NULL) {
>>>>>>> +    start = this->_start;
>>>>>>> +  }
>>>>>>>
>>>>>>> Inconsistent style for single statement if-blocks. Same thing
>>>>>>> later in the file.
>>>>>>>
>>>>>>> +      output->print_cr(" @%d \"%s\" %s %d/%d %s",
>>>>>>> +          b->offset(),
>>>>>>> +          fi->name(_cp)->as_C_string(),
>>>>>>> +          fi->signature(_cp)->as_C_string(),
>>>>>>> +          b->size(),
>>>>>>> +          b->alignment(),
>>>>>>> +          "REGULAR");
>>>>>>>
>>>>>>> Incorrect identation of continuing line. Same for all the
>>>>>>> following print blocks.
>>>>>>>
>>>>>>> +  } else if (_classname == vmSymbols::java_lang_Boolean() ||
>>>>>>> +      _classname == vmSymbols::java_lang_Character() ||
>>>>>>> +      _classname == vmSymbols::java_lang_Float() ||
>>>>>>> +      _classname == vmSymbols::java_lang_Double() ||
>>>>>>> +      _classname == vmSymbols::java_lang_Byte() ||
>>>>>>> +      _classname == vmSymbols::java_lang_Short() ||
>>>>>>> +      _classname == vmSymbols::java_lang_Integer() ||
>>>>>>> +      _classname == vmSymbols::java_lang_Long()) {
>>>>>>>
>>>>>>> Incorrect identation of continuing line.
>>>>>>>
>>>>>>> ---
>>>>>>>
>>>>>>> src/hotspot/share/classfile/fieldLayoutBuilder.hpp
>>>>>>>
>>>>>>> +// and the boxing classes). The rational for having multiple
>>>>>>> methods
>>>>>>>
>>>>>>> s/rational/rationale/
>>>>>>>
>>>>>>> +  FieldLayoutBuilder(const Symbol* classname, const
>>>>>>> InstanceKlass* super_klass, ConstantPool* constant_pool,
>>>>>>> +      Array* fields, bool is_contended, FieldLayoutInfo* info);
>>>>>>>
>>>>>>> Indentation wrong for continuing line.
>>>>>>>
>>>>>>> +  int get_alignment() {
>>>>>>> +     assert(_alignment != -1, "Uninitialized");
>>>>>>> +     return _alignment;
>>>>>>> +   }
>>>>>>>
>>>>>>> Indenting appears off by one.
>>>>>>>
>>>>>>> ---
>>>>>>>
>>>>>>> test/hotspot/jtreg/runtime/FieldLayout/FieldDensityTest.java
>>>>>>>
>>>>>>> + * @run main/othervm -XX:+UseCompressedOops
>>>>>>> -XX:+UseCompressedClassPointers FieldDensityTest
>>>>>>> + * @run main/othervm -XX:+UseCompressedOops
>>>>>>> -XX:-UseCompressedClassPointers FieldDensityTest
>>>>>>> + * @run main/othervm -XX:-UseCompressedOops
>>>>>>> -XX:-UseCompressedClassPointers FieldDensityTest
>>>>>>>
>>>>>>> The test won't run on 32-bit platforms as the compressed oops
>>>>>>> flags won't exist.
>>>>>>>
>>>>>>> ---
>>>>>>>
>>>>>>> Some follow up comments below ...
>>>>>>>
>>>>>>> With trimming ...
>>>>>>>
>>>>>>> On 25/01/2020 3:20 am, Frederic Parain wrote:
>>>>>>>>> On Jan 24, 2020, at 08:19, David Holmes
>>>>>>>>> wrote:
>>>>>>>>>
>>>>>>>>> 466     int super_flen   = super->nof_nonstatic_fields();
>>>>>>>>>
>>>>>>>>> Could be folded directly into the assert so we don't call in
>>>>>>>>> product.
>>>>>>>>
>>>>>>>> Calling not_nonstatic_fields() has the side effect to compute
>>>>>>>> non-static fields,
>>>>>>>> which is required to get a correct value when reading
>>>>>>>> super->_nonstatic_fields,
>>>>>>>> so the call is needed even in product builds.
>>>>>>>
>>>>>>> Yuck! That's a horrible side-effect - but not your fault
>>>>>>> obviously. :) It would be better to have a nonstatic_fields()
>>>>>>> accessor that has the same lazy initialization side-effect.
>>>>>>>
>>>>>>>>> General style issue: when breaking a long line with a method
>>>>>>>>> call, the new line (containing arguments) should be indented to
>>>>>>>>> the opening ( of the method call e.g.
>>>>>>> ...
>>>>>>>>> etc. This applies across all files.
>>>>>>>>
>>>>>>>> Fixes applied lines 4003, 4011, 4041, 4138, 4143.
>>>>>>>
>>>>>>> Fix was also needed in other files. Current issues highlighted
>>>>>>> above.
>>>>>>>
>>>>>>>>>
>>>>>>>>> src/hotspot/share/oops/instanceKlass.hpp
>>>>>>>>>
>>>>>>>>> You need to be careful with _extra_flags usage if there can be
>>>>>>>>> concurrently updated bits. At the moment it looks like
>>>>>>>>> redefinition is a mutable dynamic property, whilst "contended
>>>>>>>>> annotations" should be a static immutable property - is that
>>>>>>>>> right?
>>>>>>>>
>>>>>>>> Correct, _has_contended_annotations is a static immutable
>>>>>>>> property, while _is_being_redefined is a mutable one.
>>>>>>>
>>>>>>> Good to know. My concern is that if someone adds a new mutable
>>>>>>> flag bit the need for atomic updates may not be noticed. We got
>>>>>>> bitten by this in the past with a flag field and I think we
>>>>>>> eventually migrated all of the mutable bits out into their own
>>>>>>> field. (Coleen should recall that :) ).
>>>>>>>
>>>>>>>>>    61     FLATTENED,     // flattened field
>>>>>>>>>
>>>>>>>>> Does this have any meaning before inline types come in?
>>>>>>>>
>>>>>>>> Yes, I wanted to reserved the entry in the enum.
>>>>>>>
>>>>>>> Hmmm a tenuous "okay". Seems odd to require this now to support
>>>>>>> code that is still some way from joining mainline.
>>>>>>>
>>>>>>>>> In FieldLayoutBuilder::epilogue you have a number of calls to
>>>>>>>>> Thread::current() as well as an implicit call when you use
>>>>>>>>> ResourceMarks. You should capture the current thread once in a
>>>>>>>>> local and reuse it.
>>>>>>>>
>>>>>>>> Fixed
>>>>>>>
>>>>>>> It seems that this fix is now not needed as there is only one use
>>>>>>> left of the new "thread" variable in the ResourceMark. So that
>>>>>>> can return to being:
>>>>>>>
>>>>>>> ResourceMark rm;
>>>>>>>
>>>>>>> Thanks,
>>>>>>> David
>>>>>>> -----
>>>>>>>
>>>>>
>>>>
>>

From kim.barrett at oracle.com  Fri Jan 31 06:51:28 2020
From: kim.barrett at oracle.com (Kim Barrett)
Date: Fri, 31 Jan 2020 01:51:28 -0500
Subject: RFR[XS]: 8238272: Eliminate cast_from_oop to narrowOop*
Message-ID: <267AF7FC-B6EB-4232-A51F-035A7217BCC6@oracle.com>

Please review this change to ObjArrayKlass::oop_oop_iterate_range to
eliminate a conditional cast_from_oop to either narrowOop* or oop*.
Casting an oop to either of those types seems pretty questionable, and
is not actually necessary here.

(This appears to be the *only* place where cast_from_oop() is used.
There are other casts to oop* that should be examined.)

This change has the additional minor benefit that it allows the
compiler to eliminate the lower bounds clamp when start is a constant
zero, and eliminates the conditional test for zero when it isn't a
constant.

CR:
https://bugs.openjdk.java.net/browse/JDK-8238272

Webrev:
https://cr.openjdk.java.net/~kbarrett/8238272/open.00/

Testing:
mach5 tier1-5

Compared gcc generated code for linux-x64 with and without the change
for calls with and without a literal 0 start index.

From shade at redhat.com  Fri Jan 31 07:07:18 2020
From: shade at redhat.com (Aleksey Shipilev)
Date: Fri, 31 Jan 2020 08:07:18 +0100
Subject: RFR[XS]: 8238272: Eliminate cast_from_oop to narrowOop*
In-Reply-To: <267AF7FC-B6EB-4232-A51F-035A7217BCC6@oracle.com>
References: <267AF7FC-B6EB-4232-A51F-035A7217BCC6@oracle.com>
Message-ID: <44ad6bd5-b66a-3306-0cb9-f518d1b8e2c4@redhat.com>

On 1/31/20 7:51 AM, Kim Barrett wrote:
> Webrev:
> https://cr.openjdk.java.net/~kbarrett/8238272/open.00/

Hm, I am puzzled a bit.
a->obj_at_addr_raw(0) seems to point to the first array element
(objArrayOopDesc::base_raw + 0 = arrayOopDesc::base_raw(T_OBJECT) =
cast_from_oop(as_oop()) + base_offset_in_bytes(type) =
header_size(type) * HeapWordSize). Whereas cast_from_oop(a) points to
the object header.

So start == 0 is really a special case here, do we know why?

> Compared gcc generated code for linux-x64 with and without the change
> for calls with and without a literal 0 start index.

And it was the same, contrary to the idea above?

-- 
Thanks,
-Aleksey

From kim.barrett at oracle.com  Fri Jan 31 07:35:45 2020
From: kim.barrett at oracle.com (Kim Barrett)
Date: Fri, 31 Jan 2020 02:35:45 -0500
Subject: RFR[XS]: 8238272: Eliminate cast_from_oop to narrowOop*
In-Reply-To: <44ad6bd5-b66a-3306-0cb9-f518d1b8e2c4@redhat.com>
References: <267AF7FC-B6EB-4232-A51F-035A7217BCC6@oracle.com>
 <44ad6bd5-b66a-3306-0cb9-f518d1b8e2c4@redhat.com>
Message-ID: <805C4111-CF21-4EAF-AC78-742045A3DD73@oracle.com>

> On Jan 31, 2020, at 2:07 AM, Aleksey Shipilev wrote:
>
> On 1/31/20 7:51 AM, Kim Barrett wrote:
>> Webrev:
>> https://cr.openjdk.java.net/~kbarrett/8238272/open.00/
>
> Hm, I am puzzled a bit.
>
> a->obj_at_addr_raw(0) seems to point to the first array element (objArrayOopDesc::base_raw + 0 =
> arrayOopDesc::base_raw(T_OBJECT) = cast_from_oop(as_oop()) + base_offset_in_bytes(type) =
> header_size(type) * HeapWordSize). Whereas cast_from_oop(a) points to the object header.
>
> So start == 0 is really a special case here, do we know why?

I realized there's a problem shortly after sending out the RFR, and was
starting to write this when I saw your reply.

I should have realized this after looking at the end computation in the
line after the change. obj_at_addr_raw asserts the index is in range,
which is exclusive on the length. That's why the end computation
hand-inlines the same address calculation as done by obj_at_addr_raw,
rather than using that function.
With my change we'll get an assert if the array length is also 0, even
if start is also 0.

The fix is to use the same hand-inlined address calculation for start as
for end, perhaps packaging that up in a new function. I'll get back to
this next week. Sorry for the noise.

From jiefu at tencent.com  Fri Jan 31 11:01:16 2020
From: jiefu at tencent.com (=?utf-8?B?amllZnUo5YKF5p2wKQ==?=)
Date: Fri, 31 Jan 2020 11:01:16 +0000
Subject: RFR: 8238284: [macos] Zero VM build fails due to an obvious typo
Message-ID: <688CD21F-736B-4615-BC02-76DC6FB05D27@tencent.com>

Hi all,

JBS: https://bugs.openjdk.java.net/browse/JDK-8238284
Webrev: http://cr.openjdk.java.net/~jiefu/8238284/webrev.00/

Zero VM build is broken due to an obvious typo [1].

More errors can be triggered with clang 11, which can be fixed by adding `const`.
------------------------------------------
/Users/fool/workspace/open/jdk/src/hotspot/os_cpu/bsd_zero/os_bsd_zero.cpp:369:8: error: conflicting types for '_Copy_conjoint_jshorts_atomic'
void _Copy_conjoint_jshorts_atomic(jshort* from, jshort* to, size_t count) {
     ^
/Users/fool/workspace/open/jdk/src/hotspot/share/utilities/copy.hpp:47:8: note: previous declaration is here
void _Copy_conjoint_jshorts_atomic(const jshort* from, jshort* to, size_t count);
     ^
...

/Users/fool/workspace/open/jdk/src/hotspot/os_cpu/bsd_zero/os_bsd_zero.cpp:371:15: error: cannot initialize a variable of type 'jshort *' (aka 'short *') with an rvalue of type 'const jshort *' (aka 'const short *')
  jshort *end = from + count;
          ^     ~~~~~~~~~~~~
...
------------------------------------------

Could you please review it and give me some advice?

Thanks a lot.
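The same pair of clang diagnostics can be reproduced outside a JDK build; here is a minimal self-contained sketch (the function name is illustrative, not the real `_Copy_conjoint_jshorts_atomic`), showing the form that compiles cleanly because the declaration and the definition agree on the `const`:

```cpp
#include <cassert>
#include <cstddef>

typedef short jshort; // stand-in for the JDK's jshort typedef

// Declaration, as in copy.hpp: the source buffer is pointer-to-const.
void copy_conjoint_jshorts(const jshort* from, jshort* to, size_t count);

// The definition must repeat exactly the same parameter types; dropping
// the `const` on `from` here is what clang 11 reports as
// "conflicting types for ...".
void copy_conjoint_jshorts(const jshort* from, jshort* to, size_t count) {
  // `end` must also be pointer-to-const, since it is derived from `from`;
  // a plain `jshort* end = from + count;` is the second error above.
  const jshort* end = from + count;
  while (from < end) {
    *to++ = *from++;
  }
}
```

Dropping either `const` reintroduces the corresponding error, which is why the fix adds `const` in the zero definitions rather than removing it from the shared declarations.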
Best regards, Jie [1] http://hg.openjdk.java.net/jdk/jdk/file/0905868db490/src/hotspot/os_cpu/bsd_zero/atomic_bsd_zero.hpp#l192 From david.holmes at oracle.com Fri Jan 31 11:59:37 2020 From: david.holmes at oracle.com (David Holmes) Date: Fri, 31 Jan 2020 21:59:37 +1000 Subject: RFR: 8238284: [macos] Zero VM build fails due to an obvious typo In-Reply-To: <688CD21F-736B-4615-BC02-76DC6FB05D27@tencent.com> References: <688CD21F-736B-4615-BC02-76DC6FB05D27@tencent.com> Message-ID: <7c1194e6-0659-2694-2822-d3ea87dea296@oracle.com> Hi Jie, On 31/01/2020 9:01 pm, jiefu(??) wrote: > Hi all, > > JBS: https://bugs.openjdk.java.net/browse/JDK-8238284 > Webrev: http://cr.openjdk.java.net/~jiefu/8238284/webrev.00/ > > Zero VM build is broken due to an obvious typo [1]. > > More errors can be triggered with clang 11, which can be fixed by adding `const`. > ------------------------------------------ > /Users/fool/workspace/open/jdk/src/hotspot/os_cpu/bsd_zero/os_bsd_zero.cpp:369:8: error: conflicting types for '_Copy_conjoint_jshorts_atomic' > void _Copy_conjoint_jshorts_atomic(jshort* from, jshort* to, size_t count) { > ^ > /Users/fool/workspace/open/jdk/src/hotspot/share/utilities/copy.hpp:47:8: note: previous declaration is here > void _Copy_conjoint_jshorts_atomic(const jshort* from, jshort* to, size_t count); > ^ > ... > > /Users/fool/workspace/open/jdk/src/hotspot/os_cpu/bsd_zero/os_bsd_zero.cpp:371:15: error: cannot initialize a variable of type 'jshort *' (aka 'short *') with an rvalue of type 'const jshort *' (aka 'const short *') > jshort *end = from + count; > ^ ~~~~~~~~~~~~ > ... > ------------------------------------------ > > Could you please review it and give me some advice? Those fixes all look obviously correct. I note however that the same problem regarding lack of const exists in other zero implementations. Should they also be fixed? Thanks, David > Thanks a lot. 
> Best regards,
> Jie
>
> [1] http://hg.openjdk.java.net/jdk/jdk/file/0905868db490/src/hotspot/os_cpu/bsd_zero/atomic_bsd_zero.hpp#l192
>

From jiefu at tencent.com  Fri Jan 31 13:55:41 2020
From: jiefu at tencent.com (=?utf-8?B?amllZnUo5YKF5p2wKQ==?=)
Date: Fri, 31 Jan 2020 13:55:41 +0000
Subject: RFR: 8238284: [macos] Zero VM build fails due to an obvious typo
Message-ID: <85817F78-9D26-4FA0-8ABD-58830A714B15@tencent.com>

Hi David,

Thanks for your review and nice suggestion.

Updated: http://cr.openjdk.java.net/~jiefu/8238284/webrev.01/

Testing:
  - zero build tests on both macos and linux/x64.

Thanks a lot.
Best regards,
Jie

On 2020/1/31, 8:00 PM, "David Holmes" wrote:

    Hi Jie,

    I note however that the same problem regarding lack of const exists
    in other zero implementations. Should they also be fixed?

Fixed. Thanks.

From frederic.parain at oracle.com  Fri Jan 31 16:03:54 2020
From: frederic.parain at oracle.com (Frederic Parain)
Date: Fri, 31 Jan 2020 11:03:54 -0500
Subject: RFR[L]: 8237767 Field layout computation overhaul
In-Reply-To: <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com>
References: <26a3be7e-71d2-848a-be62-31c9c52c17ac@oracle.com>
 <50C2C1B1-53C8-4C0A-984C-4A4F7416E1F5@oracle.com>
 <498a199d-ad3e-8bf9-df6a-8386b9bc22cd@oracle.com>
Message-ID: 

Hi David,

I've fixed all the issues you mentioned below.

According to the discussion about the VM flags, the following
modifications have also been made:

globals.hpp:
  UseNewFieldLayout is now a deprecated product flag

fieldLayoutBuilder.cpp
  lines 350-362: UseEmptySlotsInSupers VM option now controls both new
  optimizations
  lines 449-464 Fix an issue with layout printing (inherited fields were
  printed only for the direct super-class, now all inherited fields are
  printed)

New webrev: http://cr.openjdk.java.net/~fparain/jdk_layout/webrev.08/index.html

CR and CSR have been updated accordingly (new VM flags explanations in
the CSR).
Thank you, Fred > On Jan 29, 2020, at 02:06, David Holmes wrote: > > Hi Fred, > > I've looked at the v7 version. A few more stylistic comments on that first. Note, no need for an item by item response unless that makes it easier for you to track :) > > src/hotspot/share/classfile/classFileParser.cpp > > #include "classfile/defaultMethods.hpp" > +#include "classfile/fieldLayoutBuilder.hpp" > #include "classfile/dictionary.hpp" > > Include files are not in alphabetical order. > > + * This may well change: FixMe if doesn't, > > s/if/if it/ > > + //Make a temp copy, and iterate through and copy back into the orig > > Space after // > > s/orig/original/ > > + OopMapBlock* nonstatic_oop_map = _nonstatic_oop_maps; > > Extra space after * > > + assert(ik->nonstatic_oop_map_count() == _field_info->oop_map_blocks->_nonstatic_oop_map_count, > + "sanity"); > > Second line needs to be indented further: > > assert(ik->nonstatic_oop_map_count() == _field_info->oop_map_blocks->_nonstatic_oop_map_count, > "sanity"); > > --- > > src/hotspot/share/classfile/classFileParser.hpp > > +public: > + OopMapBlock* _nonstatic_oop_maps; > + unsigned int _nonstatic_oop_map_count; > + unsigned int _max_nonstatic_oop_maps; > + > + public: > > Second public uneeded. First public may be indented wrong (I'm not sure what the rule is - single space indent?) > > class ClassFileParser { > + friend class FieldLayoutBuilder; > + friend class FieldLayout; > > class ClassAnnotationCollector; > class FieldAllocationCount; > class FieldAnnotationCollector; > > Indents are different. I think the class forward declarations should have extra space. 
> > --- > > src/hotspot/share/oops/instanceKlass.hpp > > + void increment_count(int diff) { _count += diff; } > > Extra spaces before { > > --- > > src/hotspot/share/runtime/globals.hpp > > + diagnostic(bool, UseNewFieldLayout, true, \ > + "Use new algorithm to compute layouts") \ > + \ > + product(bool, UseEmptySlotsInSupers, true, > > Not sure I see why one flag is diagnostic and the other product. Do you expect people to need to disable using empty slots more so than needing to disable using the new field layout altogether? > > --- > > src/hotspot/share/classfile/fieldLayoutBuilder.cpp > > + assert(kind == EMPTY || kind == RESERVED || kind == PADDING || kind == INHERITED, > + "Otherwise, should use the constructor with a field index argument"); > > Indentation of second line is wrong. > > + assert(kind == REGULAR || kind == FLATTENED || kind == INHERITED, > + "Other kind do not have a field index"); > > Ditto. > > > + if (list == NULL) return; > + if (start == NULL) { > + start = this->_start; > + } > > Inconsistent style for single statement if-blocks. Same thing later in the file. > > + output->print_cr(" @%d \"%s\" %s %d/%d %s", > + b->offset(), > + fi->name(_cp)->as_C_string(), > + fi->signature(_cp)->as_C_string(), > + b->size(), > + b->alignment(), > + "REGULAR"); > > Incorrect identation of continuing line. Same for all the following print blocks. > > + } else if (_classname == vmSymbols::java_lang_Boolean() || > + _classname == vmSymbols::java_lang_Character() || > + _classname == vmSymbols::java_lang_Float() || > + _classname == vmSymbols::java_lang_Double() || > + _classname == vmSymbols::java_lang_Byte() || > + _classname == vmSymbols::java_lang_Short() || > + _classname == vmSymbols::java_lang_Integer() || > + _classname == vmSymbols::java_lang_Long()) { > > Incorrect identation of continuing line. > > --- > > src/hotspot/share/classfile/fieldLayoutBuilder.hpp > > +// and the boxing classes). 
The rational for having multiple methods > > s/rational/rationale/ > > + FieldLayoutBuilder(const Symbol* classname, const InstanceKlass* super_klass, ConstantPool* constant_pool, > + Array* fields, bool is_contended, FieldLayoutInfo* info); > > Indentation wrong for continuing line. > > + int get_alignment() { > + assert(_alignment != -1, "Uninitialized"); > + return _alignment; > + } > > Indenting appears off by one. > > --- > > test/hotspot/jtreg/runtime/FieldLayout/FieldDensityTest.java > > + * @run main/othervm -XX:+UseCompressedOops -XX:+UseCompressedClassPointers FieldDensityTest > + * @run main/othervm -XX:+UseCompressedOops -XX:-UseCompressedClassPointers FieldDensityTest > + * @run main/othervm -XX:-UseCompressedOops -XX:-UseCompressedClassPointers FieldDensityTest > > The test won't run on 32-bit platforms as the compressed oops flags won't exist. > > --- > > Some follow up comments below ... > > With trimming ... > > On 25/01/2020 3:20 am, Frederic Parain wrote: >>> On Jan 24, 2020, at 08:19, David Holmes wrote: >>> >>> 466 int super_flen = super->nof_nonstatic_fields(); >>> >>> Could be folded directly into the assert so we don't call in product. >> Calling not_nonstatic_fields() has the side effect to compute non-static fields, >> which is required to get a correct value when reading super->_nonstatic_fields, >> so the call is needed even in product builds. > > Yuck! That's a horrible side-effect - but not your fault obviously. :) It would be better to have a nonstatic_fields() accessor that has the same lazy initialization side-effect. > >>> General style issue: when breaking a long line with a method call, the new line (containing arguments) should be indented to the opening ( of the method call e.g. > ... >>> etc. This applies across all files. >> Fixes applied lines 4003, 4011, 4041, 4138, 4143. > > Fix was also needed in other files. Current issues highlighted above. 
> >>> >>> src/hotspot/share/oops/instanceKlass.hpp >>> >>> You need to be careful with _extra_flags usage if there can be concurrently updated bits. At the moment it looks like redefinition is a mutable dynamic property, whilst "contended annotations" should be a static immutable property - is that right? >> Correct, _has_contended_annotations is a static immutable property, while _is_being_redefined is a mutable one. > > Good to know. My concern is that if someone adds a new mutable flag bit the need for atomic updates may not be noticed. We got bitten by this in the past with a flag field and I think we eventually migrated all of the mutable bits out into their own field. (Coleen should recall that :) ). > >>> 61 FLATTENED, // flattened field >>> >>> Does this have any meaning before inline types come in? >> Yes, I wanted to reserved the entry in the enum. > > Hmmm a tenuous "okay". Seems odd to require this now to support code that is still some way from joining mainline. > >>> In FieldLayoutBuilder::epilogue you have a number of calls to Thread::current() as well as an implicit call when you use ResourceMarks. You should capture the current thread once in a local and reuse it. >> Fixed > > It seems that this fix is now not needed as there is only one use left of the new "thread" variable in the ResourceMark. 
So that can return to being:
>
> ResourceMark rm;
>
> Thanks,
> David
> -----
>

From kim.barrett at oracle.com  Fri Jan 31 22:12:14 2020
From: kim.barrett at oracle.com (Kim Barrett)
Date: Fri, 31 Jan 2020 17:12:14 -0500
Subject: RFR[XS]: 8238272: Eliminate cast_from_oop to narrowOop*
In-Reply-To: <805C4111-CF21-4EAF-AC78-742045A3DD73@oracle.com>
References: <267AF7FC-B6EB-4232-A51F-035A7217BCC6@oracle.com>
 <44ad6bd5-b66a-3306-0cb9-f518d1b8e2c4@redhat.com>
 <805C4111-CF21-4EAF-AC78-742045A3DD73@oracle.com>
Message-ID: <0F754E34-462D-4183-8CE0-F2933D4F310F@oracle.com>

> On Jan 31, 2020, at 2:35 AM, Kim Barrett wrote:
>
>> On Jan 31, 2020, at 2:07 AM, Aleksey Shipilev wrote:
>>
>> On 1/31/20 7:51 AM, Kim Barrett wrote:
>>> Webrev:
>>> https://cr.openjdk.java.net/~kbarrett/8238272/open.00/
>>
>> Hm, I am puzzled a bit.
>>
>> a->obj_at_addr_raw(0) seems to point to the first array element (objArrayOopDesc::base_raw + 0 =
>> arrayOopDesc::base_raw(T_OBJECT) = cast_from_oop(as_oop()) + base_offset_in_bytes(type) =
>> header_size(type) * HeapWordSize). Whereas cast_from_oop(a) points to the object header.
>>
>> So start == 0 is really a special case here, do we know why?
>
> I realized there's a problem shortly after sending out the RFR, and was starting to write
> this when I saw your reply.
>
> I should have realized this after looking at the end computation in the line after the change.
> obj_at_addr_raw asserts the index is in range, which is exclusive on the length. That's
> why the end computation hand-inlines the same address calculation as done by
> obj_at_addr_raw, rather than using that function. With my change we'll get an assert
> if the array length is also 0, even if start is also 0.
>
> The fix is to use the same hand-inlined address calculation for start as for end, perhaps
> packaging that up in a new function. I'll get back to this next week. Sorry for the noise.
Here's a new webrev: https://cr.openjdk.java.net/~kbarrett/8238272/open.01/ I didn't bother with an incremental webrev, since the new version is just a one-line change replacing the previous one-line change. Testing: tier1
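The half-open range computation Kim describes — the same hand-inlined address calculation for both bounds, with no bounds assert — can be sketched in a toy model (all names and the layout here are illustrative, not HotSpot's real objArrayOopDesc definitions):

```cpp
#include <cassert>
#include <cstddef>

// Toy stand-in for an object array; illustrative only.
struct ToyObjArray {
  int length;
  void* elems[4]; // element storage; base() points at elems[0]

  void** base() { return elems; }

  // Analogous to obj_at_addr_raw(): asserts 0 <= i < length, so it can
  // neither name the one-past-the-end slot nor index 0 of an empty
  // array -- exactly the assert the open.00 patch would have tripped.
  void** at_addr(int i) {
    assert(0 <= i && i < length);
    return base() + i;
  }

  // Hand-inlined address calculation, usable for both bounds of the
  // half-open range [start, end): no assert, so for an empty array
  // start == end == base() and the loop simply runs zero times.
  void** range_addr(int i) { return base() + i; }
};

// Counts the slots visited in [start, length); mirrors the shape of the
// iterate_range loop without touching real oops.
inline int visit_range(ToyObjArray& a, int start) {
  int visited = 0;
  for (void** p = a.range_addr(start); p < a.range_addr(a.length); ++p) {
    ++visited;
  }
  return visited;
}
```

Because both bounds come from the assert-free `range_addr`, a zero-length array yields an empty range instead of an assertion failure, which is the point of computing start the same way as end in open.01.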