review for 7129164: JNI Get/ReleasePrimitiveArrayCritical doesn't scale
Tom Rodriguez
tom.rodriguez at oracle.com
Fri Jan 27 11:53:17 PST 2012
On Jan 27, 2012, at 11:25 AM, Vladimir Kozlov wrote:
> I change alias to hotspot-dev since it affects everyone.
Thanks. I meant to send it to hotspot-dev but my auto pilot had other ideas.
>
> In GC_locker::check_active_before_gc() move wait_begin initialization under Print* check since it used only under such checks.
Done.
> Could method check_active_before_gc() be called from different threads concurrently? Does it need lock? Note that other methods have lock.
check_active_before_gc is only called at a safepoint:
assert(SafepointSynchronize::is_at_safepoint(), "only read at safepoint");
And it's normally only called at the beginning of the collection in the VMThread. The existing one had no locking and I don't think this changes anything in regards to the safety of it.
tom
>
> Thanks,
> Vladimir
>
> Tom Rodriguez wrote:
>> http://cr.openjdk.java.net/~never/7129164
>> 200 lines changed: 126 ins; 33 del; 41 mod; 3958 unchg
>> 7129164: JNI Get/ReleasePrimitiveArrayCritical doesn't scale
>> Summary:
>> Reviewed-by:
>> The machinery for GC_locker which supports GetPrimitiveArrayCritical
>> maintains a count of the number of threads that currently have raw
>> pointers exposed. This is used to allow existing critical sections to
>> drain before creating new ones so that a GC can occur. Currently
>> these counts are updated using atomic all the time, even if a GC isn't
>> being requested. This creates scalability problem when a lot of
>> threads are hammering atomic operations on the jni_lock_count. The
>> count only needs to be valid when checking if a critical section is
>> currently active and when draining the existing sections. The fix is
>> to make the count be computed as part of the safepointing machinery
>> and to only decrement the counters when _needs_gc is true. In debug
>> mode the old count is maintained and validated against the lazily
>> computed count.
>> On a microbenchmark that stresses GetPrimitiveArrayCritical with many
>> threads and relatively short critical section it can more than double
>> the throughput. This also slightly speeds up the normal
>> GetPrimtiveArrayCritical calls. Mostly it's not easily measurable
>> with larger scale programs.
>> Tested with microbenchmark that stresses GetPrimtiveArrayCritical and
>> the crypto benchmarks of specjvm2008 on x86 and sparc. I also ran the
>> java.io regression tests from the JDK.
More information about the hotspot-dev
mailing list