Removing intrinsic of Thread.isInterrupted()
Yumin Qi
yumin.qi at oracle.com
Tue Feb 25 12:41:23 PST 2014
When I am writing email I saw your email.
to answer Vladimir's question, I come up with a scenario:
When Thread is in process of interrupt call:
OSThread* osthread = thread->osthread();
osthread->set_interrupted(true);
// More than one thread can get here with the same value of osthread,
// resulting in multiple notifications. We do, however, want the store
// to interrupted() to be visible to other threads before we post
// the interrupt event.
OrderAccess::release();
SetEvent(osthread->interrupt_event());
Before SetEvent, bit is set.
Now, with intrinsification, call
Thread.currentThread().isInterrupted() will return return 'true' due
to clear_int is 'false'.
The following call
Thread.isInterrupted() will go to slow path, since the Event is
not set, we got a 'false' with new fix.
Karen's suggestion is get ride of second fastpath (t ==
Thread.current() && !clear_int) so it will be
return (t == Thread.current() && !TLS._osthread._interrupted) ?
fast : slow
Thanks
Yumin
Thanks
Yumin
On 2/25/2014 11:56 AM, Karen Kinnear wrote:
> Vladimir,
>
> I updated the bug to reflect the code review email thread between Yumin, myself
> and David Holmes.
>
> To the best of my understanding there is a potential timing hole here, with the clearInterrupted
> case (study the webrev for the fix), i.e. in the isInterrupted(false) case.
>
> What I proposed was that we could keep the intrinsic quick check for isInterrupted bit, but not
> the logic for the isInterrupted(false) - unless you want to change it to add the Windows
> WaitForSingleObject call - which I assume removes the benefit of the intrinsic.
>
> Does that make sense to you?
> thanks,
> Karen
>
> On Feb 25, 2014, at 2:14 PM, Vladimir Kozlov wrote:
>
>> Yumin,
>>
>> On 2/24/14 5:46 PM, Yumin Qi wrote:
>>> Hi, Compiler team
>>>
>>> I worked on this bug: 6498581:ThreadInterruptTest3 produces wrong
>>> output on Windows. This is a problem thread sleep wakes up spuriously
>>> due to a race condition between os::interrupt and os::is_interruped.
>>> Detail please see bug comments.
>>>
>>> https://bugs.openjdk.java.net/browse/JDK-6498581
>>>
>>> The fix is (without removing intrinsic, but intend to remove it) :
>>> http://cr.openjdk.java.net/~minqi/6498581/webrev00/
>>>
>>> One problem is that Thread.isInterrupted() is intrinsic and there is
>>> chance that code like
>>>
>>> boolean a = Thread.currentThread().isInterrupted();
>>> boolean b = Thread.interrupted();
>>>
>>> Will get different value. (fast/slow path)
>> How you come to this conclusion? You my be mistaken. We intrinsify native boolean isInterrupted(boolean ClearInterrupted) and not java method: isInterrupted().
>>
>> Also you are comparing different code which is nothing to do with intrinsic:
>>
>> isInterrupted() passes ClearInterrupted=false:
>>
>> public boolean isInterrupted() {
>> return isInterrupted(false);
>> }
>>
>> when interrupted() passes 'true':
>>
>> public static boolean interrupted() {
>> return currentThread().isInterrupted(true);
>> }
>>
>> Both method calls native isInterrupted(bool) which is intrinsified. So both calls intrinsic. There should be no difference.
>>
>> From performance point of view, as Aleksey pointed, there is huge difference. We can't remove intrinsic.
>>
>> Thanks,
>> Vladimir
>>
>>> I tried to remove the intrinsic code and done a test using following
>>> code. The result showed there is no difference by removing the intrinsic
>>> of Thread.isInterrupted().
>>>
>>> // test performance of removing Thread.isInterrupted() inlining
>>> public class TestThreadInterrupted {
>>> public static void main(String... args) {
>>> Thread t = new Thread () {
>>> public void run() {
>>> boolean isInt = false;
>>> while (!isInt) {
>>> try {
>>> Thread.sleep(30);
>>> } catch (InterruptedException ie) {
>>> isInt = true;
>>> }
>>> }
>>> }
>>> };
>>>
>>> t.start();
>>> // run
>>> long start, finish, isum = 0L, osum = 0L;
>>> int NUM = 20000;
>>> for (int j = 0; j < 100; j++) {
>>> isum = 0L;
>>> for (int i = 0; i < NUM; i++) {
>>> start = System.currentTimeMillis();
>>> t.isInterrupted();
>>> finish = System.currentTimeMillis();
>>> isum += (finish - start);
>>> }
>>>
>>> System.out.println("Total cost of " + NUM + " calls is " + isum
>>> + " ms");
>>> osum += isum;
>>> }
>>> System.out.println("Average " + osum/100 + " ms");
>>> t.interrupt();
>>> try {
>>> t.join();
>>> } catch (InterruptedException e) {}
>>> }
>>> }
>>>
>>> And found there is no difference on Solaris-x64/sparcv9, Windows(32/64),
>>> linux(32/64) before and after the removing of intrinsic
>>> Thread.isInterrupted().
>>>
>>> Should I remove the intrinsic?
>>>
>>> Data (no major difference for both with/without intrinsic):
>>>
>>> 1)windows :
>>> ....
>>> Total cost of 20000 calls is 2 ms
>>> Total cost of 20000 calls is 1 ms
>>> Total cost of 20000 calls is 1 ms
>>> Total cost of 20000 calls is 1 ms
>>> Total cost of 20000 calls is 2 ms
>>> Total cost of 20000 calls is 1 ms
>>> Total cost of 20000 calls is 2 ms
>>> Total cost of 20000 calls is 2 ms
>>> Total cost of 20000 calls is 2 ms
>>> Total cost of 20000 calls is 0 ms
>>> Total cost of 20000 calls is 2 ms
>>> Total cost of 20000 calls is 2 ms
>>> Average 1 ms
>>>
>>> 2) Solaris-x64
>>> ....
>>> Total cost of 20000 calls is 3 ms
>>> Total cost of 20000 calls is 1 ms
>>> Total cost of 20000 calls is 4 ms
>>> Total cost of 20000 calls is 6 ms
>>> Total cost of 20000 calls is 6 ms
>>> Total cost of 20000 calls is 5 ms
>>> Total cost of 20000 calls is 7 ms
>>> Total cost of 20000 calls is 5 ms
>>> Total cost of 20000 calls is 5 ms
>>> Total cost of 20000 calls is 1 ms
>>> Total cost of 20000 calls is 3 ms
>>> Total cost of 20000 calls is 2 ms
>>> Total cost of 20000 calls is 3 ms
>>> Total cost of 20000 calls is 3 ms
>>> Total cost of 20000 calls is 5 ms
>>> Total cost of 20000 calls is 4 ms
>>> Total cost of 20000 calls is 4 ms
>>> Total cost of 20000 calls is 7 ms
>>> Average 4 ms
>>>
>>> 3) Linux:
>>>
>>> ....
>>> Total cost of 20000 calls is 30 ms
>>> Total cost of 20000 calls is 29 ms
>>> Total cost of 20000 calls is 26 ms
>>> Total cost of 20000 calls is 26 ms
>>> Total cost of 20000 calls is 26 ms
>>> Total cost of 20000 calls is 24 ms
>>> Total cost of 20000 calls is 29 ms
>>> Total cost of 20000 calls is 25 ms
>>> Total cost of 20000 calls is 20 ms
>>> Average 24 ms
>>>
>>>
>>> Thanks
>>> Yumin
More information about the hotspot-compiler-dev
mailing list