Hi David,
So the very last comment there was about not implicitly assuming memory_order_consume, yet that has not been addressed in the proposal.
Further the discussion on hotspot-runtime-dev through September and October was far more illuminating. I think my post here:
https://urldefense.proofpoint.com/v2/url?u=http-3A__mail.openjdk.java.net_pi...
and the closely following one from Thomas Schatzl summed up the concerns about the proposed changes. Thank you very much for pointing out the missing items I need to take into account.
This is a proposal to change the memory ordering semantics of part of the shared GC code _not_ just the PPC64 implementation, but I have seen no analysis to demonstrate the correctness of such a proposal. I do agree the necessity of demonstrating the correctness. I would try my best for this.
Best regards, -- Michihiro, IBM Research - Tokyo From: David Holmes <david.holmes@oracle.com> To: Michihiro Horie <HORIE@jp.ibm.com>, ppc-aix-port-dev@openjdk.java.net, hotspot-dev@openjdk.java.net, hotspot-gc-dev@openjdk.java.net Cc: Hiroshi H Horii <HORII@jp.ibm.com> Date: 2018/04/25 21:45 Subject: Re: RFR(M): 8154736: enhancement of cmpxchg and copy_to_survivor for ppc64 Hi Michihiro, On 23/04/2018 8:33 PM, Michihiro Horie wrote:
Dear all,
I would like to ask reviews on 8154736 “enhancement of cmpxchg and copy_to_survivor”. The change adds options to avoid expensive syncs with compare-and-exchange. An experiment by using SPECjbb2015 showed 6% improvement in critical-jOPS. This change focuses on ppc64 but would be potentially beneficial for aarch64.
Although discussions stopped at
https://urldefense.proofpoint.com/v2/url?u=http-3A__mail.openjdk.java.net_pi...
, I would like to restart the review by taking over Hiroshi's work if the discussion is still open.
So the very last comment there was about not implicitly assuming memory_order_consume, yet that has not been addressed in the proposal. Further the discussion on hotspot-runtime-dev through September and October was far more illuminating. I think my post here: https://urldefense.proofpoint.com/v2/url?u=http-3A__mail.openjdk.java.net_pi... and the closely following one from Thomas Schatzl summed up the concerns about the proposed changes. AFAICS the restarted proposal addresses none of those concerns but simply takes up where the previous implementation suggestion left off. This is a proposal to change the memory ordering semantics of part of the shared GC code _not_ just the PPC64 implementation, but I have seen no analysis to demonstrate the correctness of such a proposal. David -----
Bug: https://urldefense.proofpoint.com/v2/url?u=https-3A__bugs.openjdk.java.net_b...
Webrev: https://urldefense.proofpoint.com/v2/url?u=http-3A__cr.openjdk.java.net_-7Em...
Previous review had discussions on improper log output (
https://urldefense.proofpoint.com/v2/url?u=http-3A__mail.openjdk.java.net_pi...
). Logs can be generated properly with this change, but I would like to ask if we should use “if(log) OrderAccess:acquire()” as is in webrev or more general approach with a call to OrderAccess:consume() with empty implementation on all supported platforms.
Also, there were discussions on the problem of unawareness of copied obj (
https://urldefense.proofpoint.com/v2/url?u=http-3A__mail.openjdk.java.net_pi...
). This change adds “release” in cmpxchg_pre_membar. This was discussed in
https://urldefense.proofpoint.com/v2/url?u=http-3A__mail.openjdk.java.net_pi...
.
I measured SPECjbb2015 with its multi JVMs mode on a POWER8 node (for JDK11 , I modified MANIFEST in specjbb2015.jar to specify locations of JAXB related libraries). As a result, critical-jOPS improved by 6% due to this change.
Best regards, -- Michihiro, IBM Research - Tokyo