RFR(M) JDK-8014555: G1: Memory ordering problem with Conc refinement and card marking

Bengt Rutisson bengt.rutisson at oracle.com
Wed Oct 2 21:54:46 PDT 2013


Hi Mikael,

My knowledge about the compiler and interpreter code is not enough to 
formally review it, but as far as I can tell it looks good.

The GC code changes looks good. Reviewed.

Thanks for doing such a good job with this difficult change!
Bengt

On 10/2/13 2:28 PM, Mikael Gerdin wrote:
> Hi
>
> Please review my fix for the issue discussed in the "G1 question: 
> concurrent cleaning of dirty cards" thread on the hotspot-gc-dev 
> mailing list.
>
> I'd like someone from the compiler (and runtime? the interpreter uses 
> macroAssembler_*, right?) teams to at least look at the changes to:
> macroAssembler_*.cpp
> c1_Runtime_*.cpp
> graphKit.cpp
>
> Problem description:
> G1 has a race where the concurrent refinement thread may miss object 
> references in a dirty card.
> The problem arises if the CPU re-orders the load of the old card value 
> (which G1 checks to determine if it can skip the barrier)
> before the store to the actual object.
> If that occurs the concurrent refinement thread may have set the card 
> to "clean" and proceeded to scan the card but the java thread may have 
> seen the "dirty" value and skipped the post barrier.
>
> Suggested fix:
> * Add a memory fence between the store to a java object and the 
> reading of the previous card value.
> * Modify the code for handling young regions so that all writes to 
> young regions can skip the fence (since it will never be needed for 
> such writes). This introduces a new value in the card table for G1 
> which indicates a young region.
>
> Performance impact:
> * This fix has a negative throughput performance impact of 1-1.5% 
> (tested on x86-AMD x86-Intel and SPARC).
> * We may want to try to get rid of this race at some point by 
> redesigning G1's post barrier but there is not enough time to do that 
> for JDK8.
>
> Performance numbers for x86 platforms can be seen here:
> http://cr.openjdk.java.net/~mgerdin/8014555/perf.txt
>
> Unfortunately the JIRA issue is not externally visible, but the major 
> parts of the discussions about this are present in the mailing list 
> thread. The bug mostly contains my analysis of the crashes which seems 
> to have been caused by this bug.
>
> Bug link: https://bugs.openjdk.java.net/browse/JDK-8014555
>
> Webrev: http://cr.openjdk.java.net/~mgerdin/8014555/webrev.0
>



More information about the hotspot-dev mailing list