From rwestrel at redhat.com  Wed Jan  2 08:25:16 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Wed, 02 Jan 2019 09:25:16 +0100
Subject: [aarch64-port-dev ] RFR(S): 8214922: Add vectorization support
 for fmin/fmax
In-Reply-To: <DB7PR08MB3115C5F108D17017B98A1FF796B40@DB7PR08MB3115.eurprd08.prod.outlook.com>
References: <DB7PR08MB3115055477478B62D825C36196BE0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <87d0pv2iow.fsf@redhat.com> <c836cf2e-20a9-0ec4-212d-72326fb144a6@redhat.com>
 <877eg32bzq.fsf@redhat.com> <b91e56a1-dd8f-d9f1-40e8-af5d8c3c0d9d@redhat.com>
 <871s6a3map.fsf@redhat.com>
 <DB7PR08MB3115C5F108D17017B98A1FF796B40@DB7PR08MB3115.eurprd08.prod.outlook.com>
Message-ID: <87va371n6b.fsf@redhat.com>


> http://cr.openjdk.java.net/~pli/rfr/8214922/webrev.01/

That looks good to me.

Roland.

From erik.joelsson at oracle.com  Wed Jan  2 08:52:03 2019
From: erik.joelsson at oracle.com (Erik Joelsson)
Date: Wed, 2 Jan 2019 09:52:03 +0100
Subject: RFR(M): 8215902: Add support for SoftFloat-3e library
In-Reply-To: <4497ca084b9f48dbb8f6de1aa35c83653fd7acfb.camel@gmail.com>
References: <4497ca084b9f48dbb8f6de1aa35c83653fd7acfb.camel@gmail.com>
Message-ID: <22c3e7fe-b092-bae5-39d2-2a28c96d5412@oracle.com>

 From a build perspective, this looks very good. I think adding a link 
to the github project in the doc makes sense if you want to do that.

/Erik

On 2018-12-25 16:19, Jakub Van?k wrote:
> Hi,
>
> please review this webrev. It is a successor of the softfloat-3 [patch]
> thread (first email
> http://mail.openjdk.java.net/pipermail/hotspot-runtime-dev/2018-November/031311.html
> )
>
> Changes since the last patch (v6):
>
> - renamed --with-softloat* to --with-sflt* (it is more compact and it
>    corresponds to the old --with-sflt-lib=... option)
>
> - license is now obtained via --with-sflt-license switch (so it is not
>    included in OpenJDK source tree)
>
> - updated documentation (slight rewording, added the license option)
>
> - checks for default --with/--without behavior are in place again
>    (I forgot them when I changed the way the library is detected)
>
> - added a simple testcase - I found a disrepancy between softfloat and
>    system function behavior. When a float with bits 0x003FFFFF is
>    added to 0x00000001, the correct result is 0x00400000, but the
>    default software floating point implementation returns 0x00000000.
>    However I'm not sure where to put this test - now it is in
>    test/hotspot/jtreg/compiler/floatingpoint.
>
> - comments in code refer to CR 6757269 and newly JDK-8215902 too.
>
> I have created a repository with SoftFloat-3e with build configuration
> specifically for OpenJDK on armel:
> https://github.com/ev3dev-lang-java/softfloat-openjdk
>
> I can add a link to it to the documentation.
>
> Bug: https://bugs.openjdk.java.net/browse/JDK-8215902
> Webrev: http://cr.openjdk.java.net/~jakvanek/8215902/webrev.02/
> CI build: https://ci.adoptopenjdk.net/view/ev3dev/job/openjdk12_build_ev3_linux/67/
>
> Cheers,
>
> Jakub
>

From eric.caspole at oracle.com  Wed Jan  2 22:08:24 2019
From: eric.caspole at oracle.com (Eric Caspole)
Date: Wed, 2 Jan 2019 17:08:24 -0500
Subject: RFR 13 (S): 8196347: LogCompilation: generate log file on the fly for
 input to junits
Message-ID: <6060b883-65e7-d20a-fa1c-2cd4977f5e37@oracle.com>

Hi everybody,
Could I have reviews on this change to add setup methods to produce the 
test +LogCompilation files on the fly, so we can test the output of the 
JDK in the repo by adding it into the PATH etc, instead of reading 
static files. Also, this changeset removes several large static input 
files that did not have any special significance. Tested and builds with 
JDK 8 and 13. Nothing prevents adding back useful static log test files 
later, because it is hard to reproduce some constructs on the fly.

Thanks,
Eric


JBS:
https://bugs.openjdk.java.net/browse/JDK-8196347

webrev:
http://cr.openjdk.java.net/~ecaspole/JDK-8196347/01/webrev/

From vladimir.kozlov at oracle.com  Wed Jan  2 22:28:23 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 2 Jan 2019 14:28:23 -0800
Subject: RFR 13 (S): 8196347: LogCompilation: generate log file on the fly
 for input to junits
In-Reply-To: <6060b883-65e7-d20a-fa1c-2cd4977f5e37@oracle.com>
References: <6060b883-65e7-d20a-fa1c-2cd4977f5e37@oracle.com>
Message-ID: <eb6d0c8e-dcf1-7e81-ce25-deb9a04e2433@oracle.com>

Hi Eric,

May be add -Xbatch (-XX:-BackgroundCompilation) to make sure compilation and log is complete before execution is 
finished. Also I think with -version much less methods are compiled than without it when 'java' help output is printed 
by running without arguments.

Thanks,
Vladimir

On 1/2/19 2:08 PM, Eric Caspole wrote:
> Hi everybody,
> Could I have reviews on this change to add setup methods to produce the test +LogCompilation files on the fly, so we can 
> test the output of the JDK in the repo by adding it into the PATH etc, instead of reading static files. Also, this 
> changeset removes several large static input files that did not have any special significance. Tested and builds with 
> JDK 8 and 13. Nothing prevents adding back useful static log test files later, because it is hard to reproduce some 
> constructs on the fly.
> 
> Thanks,
> Eric
> 
> 
> JBS:
> https://bugs.openjdk.java.net/browse/JDK-8196347
> 
> webrev:
> http://cr.openjdk.java.net/~ecaspole/JDK-8196347/01/webrev/

From eric.caspole at oracle.com  Wed Jan  2 22:36:23 2019
From: eric.caspole at oracle.com (Eric Caspole)
Date: Wed, 2 Jan 2019 17:36:23 -0500
Subject: RFR 13 (S): 8196347: LogCompilation: generate log file on the fly
 for input to junits
In-Reply-To: <eb6d0c8e-dcf1-7e81-ce25-deb9a04e2433@oracle.com>
References: <6060b883-65e7-d20a-fa1c-2cd4977f5e37@oracle.com>
 <eb6d0c8e-dcf1-7e81-ce25-deb9a04e2433@oracle.com>
Message-ID: <d16029b4-c2f0-6143-b1e8-001b2a8689fa@oracle.com>

Hi Vladimir,
OK I will experiment with those and let you know.
Thanks,
Eric


On 1/2/19 17:28, Vladimir Kozlov wrote:
> Hi Eric,
> 
> May be add -Xbatch (-XX:-BackgroundCompilation) to make sure compilation 
> and log is complete before execution is finished. Also I think with 
> -version much less methods are compiled than without it when 'java' help 
> output is printed by running without arguments.
> 
> Thanks,
> Vladimir
> 
> On 1/2/19 2:08 PM, Eric Caspole wrote:
>> Hi everybody,
>> Could I have reviews on this change to add setup methods to produce 
>> the test +LogCompilation files on the fly, so we can test the output 
>> of the JDK in the repo by adding it into the PATH etc, instead of 
>> reading static files. Also, this changeset removes several large 
>> static input files that did not have any special significance. Tested 
>> and builds with JDK 8 and 13. Nothing prevents adding back useful 
>> static log test files later, because it is hard to reproduce some 
>> constructs on the fly.
>>
>> Thanks,
>> Eric
>>
>>
>> JBS:
>> https://bugs.openjdk.java.net/browse/JDK-8196347
>>
>> webrev:
>> http://cr.openjdk.java.net/~ecaspole/JDK-8196347/01/webrev/

From eric.caspole at oracle.com  Wed Jan  2 23:27:48 2019
From: eric.caspole at oracle.com (Eric Caspole)
Date: Wed, 2 Jan 2019 18:27:48 -0500
Subject: RFR 13 (S): 8196347: LogCompilation: generate log file on the fly
 for input to junits
In-Reply-To: <d16029b4-c2f0-6143-b1e8-001b2a8689fa@oracle.com>
References: <6060b883-65e7-d20a-fa1c-2cd4977f5e37@oracle.com>
 <eb6d0c8e-dcf1-7e81-ce25-deb9a04e2433@oracle.com>
 <d16029b4-c2f0-6143-b1e8-001b2a8689fa@oracle.com>
Message-ID: <bcbae797-b212-c6cd-6599-477a87d2b751@oracle.com>


On 1/2/19 17:36, Eric Caspole wrote:
> Hi Vladimir,
> OK I will experiment with those and let you know.
> Thanks,
> Eric

You are right, running with no -version produces more compilations, the 
log file is about 40% bigger. -Xbatch was sort of a wash. So I added 
combos of each since these are short running and will give more coverage.

New one:
http://cr.openjdk.java.net/~ecaspole/JDK-8196347/02/webrev/


> 
> 
> On 1/2/19 17:28, Vladimir Kozlov wrote:
>> Hi Eric,
>>
>> May be add -Xbatch (-XX:-BackgroundCompilation) to make sure 
>> compilation and log is complete before execution is finished. Also I 
>> think with -version much less methods are compiled than without it 
>> when 'java' help output is printed by running without arguments.
>>
>> Thanks,
>> Vladimir
>>
>> On 1/2/19 2:08 PM, Eric Caspole wrote:
>>> Hi everybody,
>>> Could I have reviews on this change to add setup methods to produce 
>>> the test +LogCompilation files on the fly, so we can test the output 
>>> of the JDK in the repo by adding it into the PATH etc, instead of 
>>> reading static files. Also, this changeset removes several large 
>>> static input files that did not have any special significance. Tested 
>>> and builds with JDK 8 and 13. Nothing prevents adding back useful 
>>> static log test files later, because it is hard to reproduce some 
>>> constructs on the fly.
>>>
>>> Thanks,
>>> Eric
>>>
>>>
>>> JBS:
>>> https://bugs.openjdk.java.net/browse/JDK-8196347
>>>
>>> webrev:
>>> http://cr.openjdk.java.net/~ecaspole/JDK-8196347/01/webrev/

From vladimir.kozlov at oracle.com  Thu Jan  3 00:41:07 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 2 Jan 2019 16:41:07 -0800
Subject: RFR 13 (S): 8196347: LogCompilation: generate log file on the fly
 for input to junits
In-Reply-To: <bcbae797-b212-c6cd-6599-477a87d2b751@oracle.com>
References: <6060b883-65e7-d20a-fa1c-2cd4977f5e37@oracle.com>
 <eb6d0c8e-dcf1-7e81-ce25-deb9a04e2433@oracle.com>
 <d16029b4-c2f0-6143-b1e8-001b2a8689fa@oracle.com>
 <bcbae797-b212-c6cd-6599-477a87d2b751@oracle.com>
Message-ID: <C92AC816-BCB0-4517-BA32-CFAE42D3FB2B@oracle.com>

Good.

Thanks,
Vladimir

> On Jan 2, 2019, at 3:27 PM, Eric Caspole <eric.caspole at oracle.com> wrote:
> 
> 
>> On 1/2/19 17:36, Eric Caspole wrote:
>> Hi Vladimir,
>> OK I will experiment with those and let you know.
>> Thanks,
>> Eric
> 
> You are right, running with no -version produces more compilations, the log file is about 40% bigger. -Xbatch was sort of a wash. So I added combos of each since these are short running and will give more coverage.
> 
> New one:
> http://cr.openjdk.java.net/~ecaspole/JDK-8196347/02/webrev/
> 
> 
> 
>>> On 1/2/19 17:28, Vladimir Kozlov wrote:
>>> Hi Eric,
>>> 
>>> May be add -Xbatch (-XX:-BackgroundCompilation) to make sure compilation and log is complete before execution is finished. Also I think with -version much less methods are compiled than without it when 'java' help output is printed by running without arguments.
>>> 
>>> Thanks,
>>> Vladimir
>>> 
>>>> On 1/2/19 2:08 PM, Eric Caspole wrote:
>>>> Hi everybody,
>>>> Could I have reviews on this change to add setup methods to produce the test +LogCompilation files on the fly, so we can test the output of the JDK in the repo by adding it into the PATH etc, instead of reading static files. Also, this changeset removes several large static input files that did not have any special significance. Tested and builds with JDK 8 and 13. Nothing prevents adding back useful static log test files later, because it is hard to reproduce some constructs on the fly.
>>>> 
>>>> Thanks,
>>>> Eric
>>>> 
>>>> 
>>>> JBS:
>>>> https://bugs.openjdk.java.net/browse/JDK-8196347
>>>> 
>>>> webrev:
>>>> http://cr.openjdk.java.net/~ecaspole/JDK-8196347/01/webrev/


From tobias.hartmann at oracle.com  Thu Jan  3 08:44:53 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Thu, 3 Jan 2019 09:44:53 +0100
Subject: RFR 13 (S): 8196347: LogCompilation: generate log file on the fly
 for input to junits
In-Reply-To: <bcbae797-b212-c6cd-6599-477a87d2b751@oracle.com>
References: <6060b883-65e7-d20a-fa1c-2cd4977f5e37@oracle.com>
 <eb6d0c8e-dcf1-7e81-ce25-deb9a04e2433@oracle.com>
 <d16029b4-c2f0-6143-b1e8-001b2a8689fa@oracle.com>
 <bcbae797-b212-c6cd-6599-477a87d2b751@oracle.com>
Message-ID: <7cbb403b-2898-fb42-a8b7-a1e9769a2ec2@oracle.com>

Hi Eric,

this looks good to me too.

Best regards,
Tobias

On 03.01.19 00:27, Eric Caspole wrote:
> 
> On 1/2/19 17:36, Eric Caspole wrote:
>> Hi Vladimir,
>> OK I will experiment with those and let you know.
>> Thanks,
>> Eric
> 
> You are right, running with no -version produces more compilations, the log file is about 40%
> bigger. -Xbatch was sort of a wash. So I added combos of each since these are short running and will
> give more coverage.
> 
> New one:
> http://cr.openjdk.java.net/~ecaspole/JDK-8196347/02/webrev/
> 
> 
> 
>>
>>
>> On 1/2/19 17:28, Vladimir Kozlov wrote:
>>> Hi Eric,
>>>
>>> May be add -Xbatch (-XX:-BackgroundCompilation) to make sure compilation and log is complete
>>> before execution is finished. Also I think with -version much less methods are compiled than
>>> without it when 'java' help output is printed by running without arguments.
>>>
>>> Thanks,
>>> Vladimir
>>>
>>> On 1/2/19 2:08 PM, Eric Caspole wrote:
>>>> Hi everybody,
>>>> Could I have reviews on this change to add setup methods to produce the test +LogCompilation
>>>> files on the fly, so we can test the output of the JDK in the repo by adding it into the PATH
>>>> etc, instead of reading static files. Also, this changeset removes several large static input
>>>> files that did not have any special significance. Tested and builds with JDK 8 and 13. Nothing
>>>> prevents adding back useful static log test files later, because it is hard to reproduce some
>>>> constructs on the fly.
>>>>
>>>> Thanks,
>>>> Eric
>>>>
>>>>
>>>> JBS:
>>>> https://bugs.openjdk.java.net/browse/JDK-8196347
>>>>
>>>> webrev:
>>>> http://cr.openjdk.java.net/~ecaspole/JDK-8196347/01/webrev/

From tobias.hartmann at oracle.com  Thu Jan  3 09:01:16 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Thu, 3 Jan 2019 10:01:16 +0100
Subject: RFR (XS): 8215888: Register to register spill may use AVX 512
 move instruction on unsupported platform
In-Reply-To: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A365AB@FMSMSX126.amr.corp.intel.com>
References: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A363D9@FMSMSX126.amr.corp.intel.com>
 <c20f41d3-2a5f-99f7-7cdf-068306ad996e@oracle.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A36514@FMSMSX126.amr.corp.intel.com>
 <7b19baa8-c48f-d0aa-02c0-aacaac3e984b@oracle.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A365AB@FMSMSX126.amr.corp.intel.com>
Message-ID: <78a0415c-9cce-dea0-dd37-3fa22692ac7b@oracle.com>

Hi Sandhya,

all webrevs look good to me (and the tests submitted by Vladimir passed).

You don't need to push to JDK 13 because patches pushed to JDK 12 will be synced with mainline
automatically:
https://mail.openjdk.java.net/pipermail/jdk-dev/2018-December/002376.html

So I would suggest to push the patch to jdk/jdk12 and request a backport to JDK 11u after some
iterations of nightly testing have passed.

Best regards,
Tobias


On 22.12.18 01:55, Viswanathan, Sandhya wrote:
> Thanks a lot!  I have also created backport patches for JDK 12 and JDK 11.0.2 as this bug affects those versions too. The below are for your consideration:
> 
> JDK 12:
> http://cr.openjdk.java.net/~sviswanathan/8215888/jdk12/webrev.01/
> JDK11u:
> http://cr.openjdk.java.net/~sviswanathan/8215888/jdk11u/webrev.01/
> 
> The compiler jtreg testing passes for these as well. 
> 
> Best Regards,
> Sandhya
> 
> 
> -----Original Message-----
> From: Vladimir Ivanov [mailto:vladimir.x.ivanov at oracle.com] 
> Sent: Friday, December 21, 2018 4:27 PM
> To: Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; hotspot compiler <hotspot-compiler-dev at openjdk.java.net>; vladimir.kozlov at oracle.com
> Subject: Re: RFR (XS): 8215888: Register to register spill may use AVX 512 move instruction on unsupported platform
> 
> 
>> Please find the updated webrev with your comments incorporated at:
>>
>> http://cr.openjdk.java.net/~sviswanathan/8215888/webrev.01/
> 
> Thanks, submitted for testing.
> 
> Best regards,
> Vladimir Ivanov
> 
>> -----Original Message-----
>> From: Vladimir Ivanov [mailto:vladimir.x.ivanov at oracle.com]
>> Sent: Friday, December 21, 2018 12:00 PM
>> To: Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; hotspot compiler <hotspot-compiler-dev at openjdk.java.net>; vladimir.kozlov at oracle.com
>> Subject: Re: RFR (XS): 8215888: Register to register spill may use AVX 512 move instruction on unsupported platform
>>
>> Sandhya,
>>
>> I'd prefer to see the check inverted:
>>
>>       if (UseAVX > 2 && !VM_Version::supports_avx512vl()) {
>>         int vector_len = 2;
>>         __ evmovdquq($dst$$XMMRegister, $src$$XMMRegister, vector_len);
>>       } else {
>>         __ movdqu($dst$$XMMRegister, $src$$XMMRegister);
>>       }
>>
>> It looks easier to read considering the code around is full of "UseAVX > 2" checks.
>>
>> By coincidence I was debugging the very same bug today and at first I didn't notice the problem with "UseAVX < 2" misreading it as "UseAVX > 2".
>>
>> Otherwise, looks good.
>>
>> Best regards,
>> Vladimir Ivanov
>>
>> On 21/12/2018 11:44, Viswanathan, Sandhya wrote:
>>> Hi All,
>>>
>>> We noticed that the register to register moves in x86.ad file attempt
>>> to generate emovdqu when UseAVX==2.
>>>
>>> The instruction emovdquq is only supported on platforms where UseAVX >
>>> 2 (AVX 512).
>>>
>>> The following rules in x86.ad file need to be corrected:
>>>
>>> MoveVecX2Leg
>>>
>>> MoveLeg2VecX
>>>
>>> MoveVecY2Leg
>>>
>>> MoveLeg2VecY
>>>
>>> The above move rules when activated through register allocator could
>>> result in illegal instruction exception.
>>>
>>> Bug:
>>>
>>> https://bugs.openjdk.java.net/browse/JDK-8215888
>>>
>>> This bug affects versions 11.0.2, 12 and the mainline.
>>>
>>> Webrev for jdk mainline:
>>>
>>> http://cr.openjdk.java.net/~sviswanathan/8215888/webrev.00/
>>>
>>> This webrev passes jtreg compiler tests on Haswell and SKX.
>>>
>>> Best Regards,
>>>
>>> Sandhya
>>>

From Pengfei.Li at arm.com  Thu Jan  3 09:42:30 2019
From: Pengfei.Li at arm.com (Pengfei Li (Arm Technology China))
Date: Thu, 3 Jan 2019 09:42:30 +0000
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect result
Message-ID: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>

Hi,

This is a patch to fix an AArch64 string intrinsics issue. It can be reproduced by below code and JVM options.

public class Test {
  public static void main(String[] args) {
    StringBuilder str = new StringBuilder("ABCDEFGHIJKLMNOPQRSTUVWXYZ01234567890123456789");
    str.setLength(str.length() - 10);
    System.out.println(str.indexOf("01234567890123456789"));
  }
}

$ java Test
-1
$ java -Xcomp -XX:-Inline Test
26

In the case above, we firstly have a long string "ABC...Z012...9012...9" (hereinafter called "the main string") and then truncate it by removing its last 10 characters. After doing this, we can incorrectly find the pattern string ("012...9012...9") inside the main string. This bug is caused by the boundary of the main string not being checked while working on the matching in AArch64 String.indexOf() intrinsics.

In the intrinsic implementation, we firstly find indexes of the first character of the pattern string (0x30 in this case) inside the main string. Each of the indexes could be a potential return value of the String.indexOf() method. And then for each index value, we compare the remaining characters inside the two strings. In this step, as Java strings in memory do not necessarily end with '\0' like C strings, we should explicitly check if the length of the remaining part of the main string is shorter than that of the pattern string.

In my fix, the length of the remaining part of the main string is calculated after we found a first-character-match. The length value is put into the ch2 register (as it can be used as a temp according to the code context) and then compared to the length of the pattern string (in cnt1). The compare and branch code is like below.

__ cmp(ch2, cnt1);
__ br(__ LT, NOMATCH);

Here we directly branch to the NOMATCH label since if the remaining part of the main string has fewer characters, there would not be any other pattern string match after current first-character-match index.

The length calculation and compare code is added at two positions in my patch, as there are two different first-character-match exits (L_HAS_ZERO and L_SMALL_HAS_ZERO) in the original intrinsic code. I also fixed the cnt2 value (which is used to count the number of bytes not processed in the main string) as well as some branch conditions in my patch. Because cnt2 always counts one more byte than the actual length. Fixing that makes the number of remaining bytes in the main string easier to be calculated.

JBS: https://bugs.openjdk.java.net/browse/JDK-8215792
webrev: http://cr.openjdk.java.net/~pli/rfr/8215792/webrev.00/

Could anyone help review this fix?

--
Thanks,
Pengfei


From aph at redhat.com  Thu Jan  3 12:12:58 2019
From: aph at redhat.com (Andrew Haley)
Date: Thu, 3 Jan 2019 12:12:58 +0000
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
Message-ID: <6959eea4-be01-5302-9ecb-1631066adb9e@redhat.com>

On 1/3/19 9:42 AM, Pengfei Li (Arm Technology China) wrote:
> JBS: https://bugs.openjdk.java.net/browse/JDK-8215792
> webrev: http://cr.openjdk.java.net/~pli/rfr/8215792/webrev.00/
> 
> Could anyone help review this fix?

I'm looking now. Thanks.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From dmitrij.pochepko at bell-sw.com  Thu Jan  3 13:19:26 2019
From: dmitrij.pochepko at bell-sw.com (dmitrij.pochepko at bell-sw.com)
Date: Thu, 03 Jan 2019 16:19:26 +0300
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
Message-ID: <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>

An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190103/459d62b3/attachment.html>

From martin.doerr at sap.com  Thu Jan  3 14:17:01 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Thu, 3 Jan 2019 14:17:01 +0000
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by
 interpreter and be faster for short arrays
Message-ID: <1c4646d554954551b73c077fa40f983d@sap.com>

Hi,

the JVM on PPC64 currently misses usage of the fast vector implementation in the interpreter code.
In addition, performance is not good for short arrays (unaligned 512 byte arrays or shorter arrays) because the current vector implementation needs at least 512 bytes.

Bug:
https://bugs.openjdk.java.net/browse/JDK-8216060

I have addressed these 2 issues + some cleanup with the following webrev:
http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/

Please review.

Best regards,
Martin

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190103/598ccdd5/attachment.html>

From gromero at linux.vnet.ibm.com  Thu Jan  3 16:13:23 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Thu, 3 Jan 2019 14:13:23 -0200
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used
 by interpreter and be faster for short arrays
In-Reply-To: <1c4646d554954551b73c077fa40f983d@sap.com>
References: <1c4646d554954551b73c077fa40f983d@sap.com>
Message-ID: <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>

Hi Martin,

oh that's nice. You removed the 512-byte block constraint and also wired it up to the Interpreter :)

For the worst case, unaligned 512 byte array, I see the gap to aligned 512 byte array reduced by about ~5.7x.

On the Interpreter I see an improvement of at least 50% for 1024 bytes.

This is all for the CRC32 class.

On CRC32C I'm getting a SIGSEV that can be reproduced running against ./test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java.

I've upload a full log into http://cr.openjdk.java.net/~gromero/logs/crc32c_sigsegv/

I'm leaving for the lunch and I'll take a closer look when back. But probably you will figure it out before I sit to appreciate the meal :)

Finally, since the change does some cleanup, I wonder if it would be worth fixing the following typos:

I think it's Barrett const., not Barret. Probably 'barret' is used in the code as a short version
for Barrett but it should be changed in

+  // Point to Barret constants
+  add_const_optimized(cur_const, constants, outer_consts_size + inner_consts_size);
+    

?

s/not/note/ in:
cpu/ppc/macroAssembler_ppc.cpp:3977:// A not on the lookup table address(es):

d/lives/ in:
cpu/ppc/macroAssembler_ppc.cpp:4265:  mtvrwz(VCRC, crc); // crc lives lives in VCRC, now

Best regards,
Gustavo

On 01/03/2019 12:17 PM, Doerr, Martin wrote:
> Hi,
> 
> the JVM on PPC64 currently misses usage of the fast vector implementation in the interpreter code.
> 
> In addition, performance is not good for short arrays (unaligned 512 byte arrays or shorter arrays) because the current vector implementation needs at least 512 bytes.
> 
> Bug:
> 
> https://bugs.openjdk.java.net/browse/JDK-8216060
> 
> I have addressed these 2 issues + some cleanup with the following webrev:
> 
> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8216060_PPC64_CRC/webrev.00/>
> 
> Please review.
> 
> Best regards,
> 
> Martin
> 


From martin.doerr at sap.com  Thu Jan  3 17:34:57 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Thu, 3 Jan 2019 17:34:57 +0000
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used
 by interpreter and be faster for short arrays
In-Reply-To: <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>
References: <1c4646d554954551b73c077fa40f983d@sap.com>
 <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>
Message-ID: <9863276de30643338249ead2a6ac7fe9@sap.com>

Hi Gustavo,

thanks for testing and your feedback. I just fixed the comment typos in place.

Unfortunately, I can't reproduce the crash. TestCRC32C works stable on our machine (with fastdbg build).
I guess that the frameless spills mess up the stack. Can you check if the patch below helps?

Best regards,
Martin


diff -r a33f49d5998c src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
--- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 17:30:03 2019 +0100
+++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 18:33:16 2019 +0100
@@ -1924,6 +1924,9 @@
       __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
     }

+    // Restore caller sp for c2i case.
+    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
+
     StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);

     if (!VM_Version::has_vpmsumb()) {
@@ -1933,8 +1936,6 @@
       __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, true);
     }

-    // Restore caller sp for c2i case and return.
-    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
     __ blr();

     // Generate a vanilla native entry as the slow path.
@@ -2014,6 +2015,9 @@
       __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
     }

+    // Restore caller sp for c2i case.
+    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
+
     StubRoutines::ppc64::generate_load_crc32c_table_addr(_masm, table);

     if (!VM_Version::has_vpmsumb()) {
@@ -2023,8 +2027,6 @@
       __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, false);
     }

-    // Restore caller sp for c2i case and return.
-    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
     __ blr();

     BLOCK_COMMENT("} CRC32C_update{Bytes|DirectByteBuffer}");


-----Original Message-----
From: Gustavo Romero <gromero at linux.vnet.ibm.com> 
Sent: Donnerstag, 3. Januar 2019 17:13
To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays

Hi Martin,

oh that's nice. You removed the 512-byte block constraint and also wired it up to the Interpreter :)

For the worst case, unaligned 512 byte array, I see the gap to aligned 512 byte array reduced by about ~5.7x.

On the Interpreter I see an improvement of at least 50% for 1024 bytes.

This is all for the CRC32 class.

On CRC32C I'm getting a SIGSEV that can be reproduced running against ./test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java.

I've upload a full log into http://cr.openjdk.java.net/~gromero/logs/crc32c_sigsegv/

I'm leaving for the lunch and I'll take a closer look when back. But probably you will figure it out before I sit to appreciate the meal :)

Finally, since the change does some cleanup, I wonder if it would be worth fixing the following typos:

I think it's Barrett const., not Barret. Probably 'barret' is used in the code as a short version
for Barrett but it should be changed in

+  // Point to Barret constants
+  add_const_optimized(cur_const, constants, outer_consts_size + inner_consts_size);
+    

?

s/not/note/ in:
cpu/ppc/macroAssembler_ppc.cpp:3977:// A not on the lookup table address(es):

d/lives/ in:
cpu/ppc/macroAssembler_ppc.cpp:4265:  mtvrwz(VCRC, crc); // crc lives lives in VCRC, now

Best regards,
Gustavo

On 01/03/2019 12:17 PM, Doerr, Martin wrote:
> Hi,
> 
> the JVM on PPC64 currently misses usage of the fast vector implementation in the interpreter code.
> 
> In addition, performance is not good for short arrays (unaligned 512 byte arrays or shorter arrays) because the current vector implementation needs at least 512 bytes.
> 
> Bug:
> 
> https://bugs.openjdk.java.net/browse/JDK-8216060
> 
> I have addressed these 2 issues + some cleanup with the following webrev:
> 
> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8216060_PPC64_CRC/webrev.00/>
> 
> Please review.
> 
> Best regards,
> 
> Martin
> 


From gromero at linux.vnet.ibm.com  Thu Jan  3 18:36:16 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Thu, 3 Jan 2019 16:36:16 -0200
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used
 by interpreter and be faster for short arrays
In-Reply-To: <9863276de30643338249ead2a6ac7fe9@sap.com>
References: <1c4646d554954551b73c077fa40f983d@sap.com>
 <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>
 <9863276de30643338249ead2a6ac7fe9@sap.com>
Message-ID: <452dcb69-189e-700d-5995-582ba13669b9@linux.vnet.ibm.com>

Hi Martin,

On 01/03/2019 03:34 PM, Doerr, Martin wrote:
> Unfortunately, I can't reproduce the crash. TestCRC32C works stable on our machine (with fastdbg build).
> I guess that the frameless spills mess up the stack. Can you check if the patch below helps?

Thanks for providing a fix so I can try it.
Yes, I confirm the patch below indeed fixes the sigsegv crash when CRC32C update() method is used.
I also confirm that I don't observe the crash on the fastdebug build, only on the release build.
It also only affects the Interpreter mode, so passing -Xcomp avoids the crash on the release build.

Just as reference, I can reproduce it on the release build with the following trivial code:

import java.util.zip.CRC32C;

class CRC32C_v1 {
   public static void main(String[] arg) {
     byte[] b = new byte[1024];
   
     CRC32C crc32c = new CRC32C();
     crc32c.update(b, 0, b.length);

     System.out.println(crc32c.getValue());
   }
}

Thanks for fixing the typos.


Best regards,
Gustavo
  
> Best regards,
> Martin
> 
> 
> diff -r a33f49d5998c src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 17:30:03 2019 +0100
> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 18:33:16 2019 +0100
> @@ -1924,6 +1924,9 @@
>         __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>       }
> 
> +    // Restore caller sp for c2i case.
> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
> +
>       StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
> 
>       if (!VM_Version::has_vpmsumb()) {
> @@ -1933,8 +1936,6 @@
>         __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, true);
>       }
> 
> -    // Restore caller sp for c2i case and return.
> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>       __ blr();
> 
>       // Generate a vanilla native entry as the slow path.
> @@ -2014,6 +2015,9 @@
>         __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>       }
> 
> +    // Restore caller sp for c2i case.
> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
> +
>       StubRoutines::ppc64::generate_load_crc32c_table_addr(_masm, table);
> 
>       if (!VM_Version::has_vpmsumb()) {
> @@ -2023,8 +2027,6 @@
>         __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, false);
>       }
> 
> -    // Restore caller sp for c2i case and return.
> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>       __ blr();
> 
>       BLOCK_COMMENT("} CRC32C_update{Bytes|DirectByteBuffer}");
> 
> 
> -----Original Message-----
> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> Sent: Donnerstag, 3. Januar 2019 17:13
> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
> 
> Hi Martin,
> 
> oh that's nice. You removed the 512-byte block constraint and also wired it up to the Interpreter :)
> 
> For the worst case, unaligned 512 byte array, I see the gap to aligned 512 byte array reduced by about ~5.7x.
> 
> On the Interpreter I see an improvement of at least 50% for 1024 bytes.
> 
> This is all for the CRC32 class.
> 
> On CRC32C I'm getting a SIGSEV that can be reproduced running against ./test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java.
> 
> I've upload a full log into http://cr.openjdk.java.net/~gromero/logs/crc32c_sigsegv/
> 
> I'm leaving for the lunch and I'll take a closer look when back. But probably you will figure it out before I sit to appreciate the meal :)
> 
> Finally, since the change does some cleanup, I wonder if it would be worth fixing the following typos:
> 
> I think it's Barrett const., not Barret. Probably 'barret' is used in the code as a short version
> for Barrett but it should be changed in
> 
> +  // Point to Barret constants
> +  add_const_optimized(cur_const, constants, outer_consts_size + inner_consts_size);
> +
> 
> ?
> 
> s/not/note/ in:
> cpu/ppc/macroAssembler_ppc.cpp:3977:// A not on the lookup table address(es):
> 
> d/lives/ in:
> cpu/ppc/macroAssembler_ppc.cpp:4265:  mtvrwz(VCRC, crc); // crc lives lives in VCRC, now
> 
> Best regards,
> Gustavo
> 
> On 01/03/2019 12:17 PM, Doerr, Martin wrote:
>> Hi,
>>
>> the JVM on PPC64 currently misses usage of the fast vector implementation in the interpreter code.
>>
>> In addition, performance is not good for short arrays (unaligned 512 byte arrays or shorter arrays) because the current vector implementation needs at least 512 bytes.
>>
>> Bug:
>>
>> https://bugs.openjdk.java.net/browse/JDK-8216060
>>
>> I have addressed these 2 issues + some cleanup with the following webrev:
>>
>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8216060_PPC64_CRC/webrev.00/>
>>
>> Please review.
>>
>> Best regards,
>>
>> Martin
>>
> 


From sandhya.viswanathan at intel.com  Thu Jan  3 18:35:25 2019
From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya)
Date: Thu, 3 Jan 2019 18:35:25 +0000
Subject: RFR (XS): 8215888: Register to register spill may use AVX 512
 move instruction on unsupported platform
In-Reply-To: <78a0415c-9cce-dea0-dd37-3fa22692ac7b@oracle.com>
References: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A363D9@FMSMSX126.amr.corp.intel.com>
 <c20f41d3-2a5f-99f7-7cdf-068306ad996e@oracle.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A36514@FMSMSX126.amr.corp.intel.com>
 <7b19baa8-c48f-d0aa-02c0-aacaac3e984b@oracle.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A365AB@FMSMSX126.amr.corp.intel.com>
 <78a0415c-9cce-dea0-dd37-3fa22692ac7b@oracle.com>
Message-ID: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A444E2@FMSMSX126.amr.corp.intel.com>

Thanks a lot Tobias! I will work with Vivek to push the patch to jdk/jdk12.

Best Regards,
Sandhya

-----Original Message-----
From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com] 
Sent: Thursday, January 03, 2019 1:01 AM
To: Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; Vladimir Ivanov <vladimir.x.ivanov at oracle.com>; hotspot compiler <hotspot-compiler-dev at openjdk.java.net>; vladimir.kozlov at oracle.com
Subject: Re: RFR (XS): 8215888: Register to register spill may use AVX 512 move instruction on unsupported platform

Hi Sandhya,

all webrevs look good to me (and the tests submitted by Vladimir passed).

You don't need to push to JDK 13 because patches pushed to JDK 12 will be synced with mainline
automatically:
https://mail.openjdk.java.net/pipermail/jdk-dev/2018-December/002376.html

So I would suggest to push the patch to jdk/jdk12 and request a backport to JDK 11u after some iterations of nightly testing have passed.

Best regards,
Tobias


On 22.12.18 01:55, Viswanathan, Sandhya wrote:
> Thanks a lot!  I have also created backport patches for JDK 12 and JDK 11.0.2 as this bug affects those versions too. The below are for your consideration:
> 
> JDK 12:
> http://cr.openjdk.java.net/~sviswanathan/8215888/jdk12/webrev.01/
> JDK11u:
> http://cr.openjdk.java.net/~sviswanathan/8215888/jdk11u/webrev.01/
> 
> The compiler jtreg testing passes for these as well. 
> 
> Best Regards,
> Sandhya
> 
> 
> -----Original Message-----
> From: Vladimir Ivanov [mailto:vladimir.x.ivanov at oracle.com]
> Sent: Friday, December 21, 2018 4:27 PM
> To: Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; hotspot 
> compiler <hotspot-compiler-dev at openjdk.java.net>; 
> vladimir.kozlov at oracle.com
> Subject: Re: RFR (XS): 8215888: Register to register spill may use AVX 
> 512 move instruction on unsupported platform
> 
> 
>> Please find the updated webrev with your comments incorporated at:
>>
>> http://cr.openjdk.java.net/~sviswanathan/8215888/webrev.01/
> 
> Thanks, submitted for testing.
> 
> Best regards,
> Vladimir Ivanov
> 
>> -----Original Message-----
>> From: Vladimir Ivanov [mailto:vladimir.x.ivanov at oracle.com]
>> Sent: Friday, December 21, 2018 12:00 PM
>> To: Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; hotspot 
>> compiler <hotspot-compiler-dev at openjdk.java.net>; 
>> vladimir.kozlov at oracle.com
>> Subject: Re: RFR (XS): 8215888: Register to register spill may use 
>> AVX 512 move instruction on unsupported platform
>>
>> Sandhya,
>>
>> I'd prefer to see the check inverted:
>>
>>       if (UseAVX > 2 && !VM_Version::supports_avx512vl()) {
>>         int vector_len = 2;
>>         __ evmovdquq($dst$$XMMRegister, $src$$XMMRegister, vector_len);
>>       } else {
>>         __ movdqu($dst$$XMMRegister, $src$$XMMRegister);
>>       }
>>
>> It looks easier to read considering the code around is full of "UseAVX > 2" checks.
>>
>> By coincidence I was debugging the very same bug today and at first I didn't notice the problem with "UseAVX < 2" misreading it as "UseAVX > 2".
>>
>> Otherwise, looks good.
>>
>> Best regards,
>> Vladimir Ivanov
>>
>> On 21/12/2018 11:44, Viswanathan, Sandhya wrote:
>>> Hi All,
>>>
>>> We noticed that the register to register moves in x86.ad file 
>>> attempt to generate emovdqu when UseAVX==2.
>>>
>>> The instruction emovdquq is only supported on platforms where UseAVX 
>>> >
>>> 2 (AVX 512).
>>>
>>> The following rules in x86.ad file need to be corrected:
>>>
>>> MoveVecX2Leg
>>>
>>> MoveLeg2VecX
>>>
>>> MoveVecY2Leg
>>>
>>> MoveLeg2VecY
>>>
>>> The above move rules when activated through register allocator could 
>>> result in illegal instruction exception.
>>>
>>> Bug:
>>>
>>> https://bugs.openjdk.java.net/browse/JDK-8215888
>>>
>>> This bug affects versions 11.0.2, 12 and the mainline.
>>>
>>> Webrev for jdk mainline:
>>>
>>> http://cr.openjdk.java.net/~sviswanathan/8215888/webrev.00/
>>>
>>> This webrev passes jtreg compiler tests on Haswell and SKX.
>>>
>>> Best Regards,
>>>
>>> Sandhya
>>>

From Pengfei.Li at arm.com  Fri Jan  4 08:52:17 2019
From: Pengfei.Li at arm.com (Pengfei Li (Arm Technology China))
Date: Fri, 4 Jan 2019 08:52:17 +0000
Subject: [aarch64-port-dev ] RFR(S): 8214922: Add vectorization support
 for fmin/fmax
In-Reply-To: <87va371n6b.fsf@redhat.com>
References: <DB7PR08MB3115055477478B62D825C36196BE0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <87d0pv2iow.fsf@redhat.com> <c836cf2e-20a9-0ec4-212d-72326fb144a6@redhat.com>
 <877eg32bzq.fsf@redhat.com> <b91e56a1-dd8f-d9f1-40e8-af5d8c3c0d9d@redhat.com>
 <871s6a3map.fsf@redhat.com>
 <DB7PR08MB3115C5F108D17017B98A1FF796B40@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <87va371n6b.fsf@redhat.com>
Message-ID: <DB7PR08MB3115836C7236BFBDC988D729968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>

Hi,
 
> > http://cr.openjdk.java.net/~pli/rfr/8214922/webrev.01/
> 
> That looks good to me.
> 

Thanks Roland. May I have other review comments for this 2nd webrev?

--
Thanks,
Pengfei

From martin.doerr at sap.com  Fri Jan  4 09:30:43 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Fri, 4 Jan 2019 09:30:43 +0000
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used
 by interpreter and be faster for short arrays
In-Reply-To: <452dcb69-189e-700d-5995-582ba13669b9@linux.vnet.ibm.com>
References: <1c4646d554954551b73c077fa40f983d@sap.com>
 <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>
 <9863276de30643338249ead2a6ac7fe9@sap.com>
 <452dcb69-189e-700d-5995-582ba13669b9@linux.vnet.ibm.com>
Message-ID: <37d9e6f3b2b4400d8963f54d2fe7767f@sap.com>

Hi Gustavo,

thank you very much for confirming. This makes sense. We use different frame headers depending on whether the frame is the top Java frame or not (and on whether it's a debug build or not). Setting R1_SP to sender_SP is a shortcut for leaf calls which relies on having an unmodified stack until this point. So the patch fixes the issue.

New webrev:
http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/

Best regards,
Martin


-----Original Message-----
From: Gustavo Romero <gromero at linux.vnet.ibm.com> 
Sent: Donnerstag, 3. Januar 2019 19:36
To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays

Hi Martin,

On 01/03/2019 03:34 PM, Doerr, Martin wrote:
> Unfortunately, I can't reproduce the crash. TestCRC32C works stable on our machine (with fastdbg build).
> I guess that the frameless spills mess up the stack. Can you check if the patch below helps?

Thanks for providing a fix so I can try it.
Yes, I confirm the patch below indeed fixes the sigsegv crash when CRC32C update() method is used.
I also confirm that I don't observe the crash on the fastdebug build, only on the release build.
It also only affects the Interpreter mode, so passing -Xcomp avoids the crash on the release build.

Just as reference, I can reproduce it on the release build with the following trivial code:

import java.util.zip.CRC32C;

class CRC32C_v1 {
   public static void main(String[] arg) {
     byte[] b = new byte[1024];
   
     CRC32C crc32c = new CRC32C();
     crc32c.update(b, 0, b.length);

     System.out.println(crc32c.getValue());
   }
}

Thanks for fixing the typos.


Best regards,
Gustavo
  
> Best regards,
> Martin
> 
> 
> diff -r a33f49d5998c src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 17:30:03 2019 +0100
> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 18:33:16 2019 +0100
> @@ -1924,6 +1924,9 @@
>         __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>       }
> 
> +    // Restore caller sp for c2i case.
> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
> +
>       StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
> 
>       if (!VM_Version::has_vpmsumb()) {
> @@ -1933,8 +1936,6 @@
>         __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, true);
>       }
> 
> -    // Restore caller sp for c2i case and return.
> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>       __ blr();
> 
>       // Generate a vanilla native entry as the slow path.
> @@ -2014,6 +2015,9 @@
>         __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>       }
> 
> +    // Restore caller sp for c2i case.
> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
> +
>       StubRoutines::ppc64::generate_load_crc32c_table_addr(_masm, table);
> 
>       if (!VM_Version::has_vpmsumb()) {
> @@ -2023,8 +2027,6 @@
>         __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, false);
>       }
> 
> -    // Restore caller sp for c2i case and return.
> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>       __ blr();
> 
>       BLOCK_COMMENT("} CRC32C_update{Bytes|DirectByteBuffer}");
> 
> 
> -----Original Message-----
> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> Sent: Donnerstag, 3. Januar 2019 17:13
> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
> 
> Hi Martin,
> 
> oh that's nice. You removed the 512-byte block constraint and also wired it up to the Interpreter :)
> 
> For the worst case, unaligned 512 byte array, I see the gap to aligned 512 byte array reduced by about ~5.7x.
> 
> On the Interpreter I see an improvement of at least 50% for 1024 bytes.
> 
> This is all for the CRC32 class.
> 
> On CRC32C I'm getting a SIGSEV that can be reproduced running against ./test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java.
> 
> I've upload a full log into http://cr.openjdk.java.net/~gromero/logs/crc32c_sigsegv/
> 
> I'm leaving for the lunch and I'll take a closer look when back. But probably you will figure it out before I sit to appreciate the meal :)
> 
> Finally, since the change does some cleanup, I wonder if it would be worth fixing the following typos:
> 
> I think it's Barrett const., not Barret. Probably 'barret' is used in the code as a short version
> for Barrett but it should be changed in
> 
> +  // Point to Barret constants
> +  add_const_optimized(cur_const, constants, outer_consts_size + inner_consts_size);
> +
> 
> ?
> 
> s/not/note/ in:
> cpu/ppc/macroAssembler_ppc.cpp:3977:// A not on the lookup table address(es):
> 
> d/lives/ in:
> cpu/ppc/macroAssembler_ppc.cpp:4265:  mtvrwz(VCRC, crc); // crc lives lives in VCRC, now
> 
> Best regards,
> Gustavo
> 
> On 01/03/2019 12:17 PM, Doerr, Martin wrote:
>> Hi,
>>
>> the JVM on PPC64 currently misses usage of the fast vector implementation in the interpreter code.
>>
>> In addition, performance is not good for short arrays (unaligned 512 byte arrays or shorter arrays) because the current vector implementation needs at least 512 bytes.
>>
>> Bug:
>>
>> https://bugs.openjdk.java.net/browse/JDK-8216060
>>
>> I have addressed these 2 issues + some cleanup with the following webrev:
>>
>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8216060_PPC64_CRC/webrev.00/>
>>
>> Please review.
>>
>> Best regards,
>>
>> Martin
>>
> 


From Pengfei.Li at arm.com  Fri Jan  4 11:04:40 2019
From: Pengfei.Li at arm.com (Pengfei Li (Arm Technology China))
Date: Fri, 4 Jan 2019 11:04:40 +0000
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
Message-ID: <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>

Hi Dmitrij,

Thanks a lot for your reply.

> since cnt2 is used as counter, wouldn't it be easier and shorter just to substract cnt1 from cnt2 at the beginning of this code. Total (cnt2 - cnt1 +1) combinations must be checked. That is why first sustraction is by (wordSize/str2_chr_size - 1).
> Then whole fix will be probably just 1 line at the beginning: sub(cnt2, cnt2, cnt1);

I don't think the whole fix could be as easy as "sub(cnt2, cnt2, cnt1)" because cnt2 is the counter which counts number of bytes not processed. It could be different from the number of bytes after current first-character-match index.

But this is just my thought. Perhaps I didn't understand your idea and code thoroughly. So could you post your shorter fix and let's test if it's right?

--
Thanks,
Pengfei


From aph at redhat.com  Fri Jan  4 12:13:26 2019
From: aph at redhat.com (Andrew Haley)
Date: Fri, 4 Jan 2019 12:13:26 +0000
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
 <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
Message-ID: <e1a00b25-55fa-8fa8-469a-dbd465d240eb@redhat.com>

On 1/4/19 11:04 AM, Pengfei Li (Arm Technology China) wrote:

> But this is just my thought. Perhaps I didn't understand your idea
> and code thoroughly. So could you post your shorter fix and let's
> test if it's right?

I agree, that's the best way to proceed.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From dmitrij.pochepko at bell-sw.com  Fri Jan  4 12:52:03 2019
From: dmitrij.pochepko at bell-sw.com (Dmitrij Pochepko)
Date: Fri, 4 Jan 2019 15:52:03 +0300
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
 <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
Message-ID: <efb075f1-5815-b2e0-31eb-3dfb8a841528@bell-sw.com>

Sure.

I could miss something, so, need to try it. I'll send webrev with patch 
once it's done.


Thanks,

Dmitrij


On 04.01.2019 14:04, Pengfei Li (Arm Technology China) wrote:
> Hi Dmitrij,
>
> Thanks a lot for your reply.
>
>> since cnt2 is used as counter, wouldn't it be easier and shorter just to substract cnt1 from cnt2 at the beginning of this code. Total (cnt2 - cnt1 +1) combinations must be checked. That is why first sustraction is by (wordSize/str2_chr_size - 1).
>> Then whole fix will be probably just 1 line at the beginning: sub(cnt2, cnt2, cnt1);
> I don't think the whole fix could be as easy as "sub(cnt2, cnt2, cnt1)" because cnt2 is the counter which counts number of bytes not processed. It could be different from the number of bytes after current first-character-match index.
>
> But this is just my thought. Perhaps I didn't understand your idea and code thoroughly. So could you post your shorter fix and let's test if it's right?
>
> --
> Thanks,
> Pengfei
>


From gromero at linux.vnet.ibm.com  Fri Jan  4 13:44:27 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Fri, 4 Jan 2019 11:44:27 -0200
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used
 by interpreter and be faster for short arrays
In-Reply-To: <37d9e6f3b2b4400d8963f54d2fe7767f@sap.com>
References: <1c4646d554954551b73c077fa40f983d@sap.com>
 <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>
 <9863276de30643338249ead2a6ac7fe9@sap.com>
 <452dcb69-189e-700d-5995-582ba13669b9@linux.vnet.ibm.com>
 <37d9e6f3b2b4400d8963f54d2fe7767f@sap.com>
Message-ID: <406db3e3-2ac3-dc16-f384-99a314e62a42@linux.vnet.ibm.com>

Hi Martin,

On 01/04/2019 07:30 AM, Doerr, Martin wrote:
> thank you very much for confirming. This makes sense. We use different frame headers depending on whether the frame is the top Java frame or not (and on whether it's a debug build or not). Setting R1_SP to sender_SP is a shortcut for leaf calls which relies on having an unmodified stack until this point. So the patch fixes the issue.

Glad to help! Thanks for the additional information, I was not aware that the
selection of different frame headers could be done at compile time. One last
question only for my education: what exactly advanced (incremented) R1_SP so it
has to be cut back using sender_SP value, i.e. sender_SP tracks the frame for
which function exactly or "who" is the caller exactly here?

Thank you.

Best regards,
Gustavo

> New webrev:
> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/
> 
> Best regards,
> Martin
> 
> 
> -----Original Message-----
> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> Sent: Donnerstag, 3. Januar 2019 19:36
> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
> 
> Hi Martin,
> 
> On 01/03/2019 03:34 PM, Doerr, Martin wrote:
>> Unfortunately, I can't reproduce the crash. TestCRC32C works stable on our machine (with fastdbg build).
>> I guess that the frameless spills mess up the stack. Can you check if the patch below helps?
> 
> Thanks for providing a fix so I can try it.
> Yes, I confirm the patch below indeed fixes the sigsegv crash when CRC32C update() method is used.
> I also confirm that I don't observe the crash on the fastdebug build, only on the release build.
> It also only affects the Interpreter mode, so passing -Xcomp avoids the crash on the release build.
> 
> Just as reference, I can reproduce it on the release build with the following trivial code:
> 
> import java.util.zip.CRC32C;
> 
> class CRC32C_v1 {
>     public static void main(String[] arg) {
>       byte[] b = new byte[1024];
>     
>       CRC32C crc32c = new CRC32C();
>       crc32c.update(b, 0, b.length);
> 
>       System.out.println(crc32c.getValue());
>     }
> }
> 
> Thanks for fixing the typos.
> 
> 
> Best regards,
> Gustavo
>    
>> Best regards,
>> Martin
>>
>>
>> diff -r a33f49d5998c src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
>> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 17:30:03 2019 +0100
>> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 18:33:16 2019 +0100
>> @@ -1924,6 +1924,9 @@
>>          __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>>        }
>>
>> +    // Restore caller sp for c2i case.
>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>> +
>>        StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
>>
>>        if (!VM_Version::has_vpmsumb()) {
>> @@ -1933,8 +1936,6 @@
>>          __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, true);
>>        }
>>
>> -    // Restore caller sp for c2i case and return.
>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>        __ blr();
>>
>>        // Generate a vanilla native entry as the slow path.
>> @@ -2014,6 +2015,9 @@
>>          __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>>        }
>>
>> +    // Restore caller sp for c2i case.
>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>> +
>>        StubRoutines::ppc64::generate_load_crc32c_table_addr(_masm, table);
>>
>>        if (!VM_Version::has_vpmsumb()) {
>> @@ -2023,8 +2027,6 @@
>>          __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, false);
>>        }
>>
>> -    // Restore caller sp for c2i case and return.
>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>        __ blr();
>>
>>        BLOCK_COMMENT("} CRC32C_update{Bytes|DirectByteBuffer}");
>>
>>
>> -----Original Message-----
>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>> Sent: Donnerstag, 3. Januar 2019 17:13
>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>
>> Hi Martin,
>>
>> oh that's nice. You removed the 512-byte block constraint and also wired it up to the Interpreter :)
>>
>> For the worst case, unaligned 512 byte array, I see the gap to aligned 512 byte array reduced by about ~5.7x.
>>
>> On the Interpreter I see an improvement of at least 50% for 1024 bytes.
>>
>> This is all for the CRC32 class.
>>
>> On CRC32C I'm getting a SIGSEV that can be reproduced running against ./test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java.
>>
>> I've upload a full log into http://cr.openjdk.java.net/~gromero/logs/crc32c_sigsegv/
>>
>> I'm leaving for the lunch and I'll take a closer look when back. But probably you will figure it out before I sit to appreciate the meal :)
>>
>> Finally, since the change does some cleanup, I wonder if it would be worth fixing the following typos:
>>
>> I think it's Barrett const., not Barret. Probably 'barret' is used in the code as a short version
>> for Barrett but it should be changed in
>>
>> +  // Point to Barret constants
>> +  add_const_optimized(cur_const, constants, outer_consts_size + inner_consts_size);
>> +
>>
>> ?
>>
>> s/not/note/ in:
>> cpu/ppc/macroAssembler_ppc.cpp:3977:// A not on the lookup table address(es):
>>
>> d/lives/ in:
>> cpu/ppc/macroAssembler_ppc.cpp:4265:  mtvrwz(VCRC, crc); // crc lives lives in VCRC, now
>>
>> Best regards,
>> Gustavo
>>
>> On 01/03/2019 12:17 PM, Doerr, Martin wrote:
>>> Hi,
>>>
>>> the JVM on PPC64 currently misses usage of the fast vector implementation in the interpreter code.
>>>
>>> In addition, performance is not good for short arrays (unaligned 512 byte arrays or shorter arrays) because the current vector implementation needs at least 512 bytes.
>>>
>>> Bug:
>>>
>>> https://bugs.openjdk.java.net/browse/JDK-8216060
>>>
>>> I have addressed these 2 issues + some cleanup with the following webrev:
>>>
>>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8216060_PPC64_CRC/webrev.00/>
>>>
>>> Please review.
>>>
>>> Best regards,
>>>
>>> Martin
>>>
>>
> 


From martin.doerr at sap.com  Fri Jan  4 16:13:56 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Fri, 4 Jan 2019 16:13:56 +0000
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used
 by interpreter and be faster for short arrays
In-Reply-To: <406db3e3-2ac3-dc16-f384-99a314e62a42@linux.vnet.ibm.com>
References: <1c4646d554954551b73c077fa40f983d@sap.com>
 <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>
 <9863276de30643338249ead2a6ac7fe9@sap.com>
 <452dcb69-189e-700d-5995-582ba13669b9@linux.vnet.ibm.com>
 <37d9e6f3b2b4400d8963f54d2fe7767f@sap.com>
 <406db3e3-2ac3-dc16-f384-99a314e62a42@linux.vnet.ibm.com>
Message-ID: <beff3d359c954a29962be71c40bc235b@sap.com>

Hi Gustavo,

when called from the interpreter (the scenario you observed), R21 is set before resizing the frame to avoid wasted stack space (InterpreterMacroAssembler::call_from_interpreter).
When called from compiled methods, R21 is set by a c2i adapter which extends the compiled frame by space for arguments (gen_c2i_adapter).

"mr(R1_SP, R21_sender_SP)" is more error-prone than "resize_frame_absolute" so I think the latter would be better (though it takes more registers and instructions), but I don't want to replace that as part of this CRC change.

Best regards,
Martin


-----Original Message-----
From: Gustavo Romero <gromero at linux.vnet.ibm.com> 
Sent: Freitag, 4. Januar 2019 14:44
To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays

Hi Martin,

On 01/04/2019 07:30 AM, Doerr, Martin wrote:
> thank you very much for confirming. This makes sense. We use different frame headers depending on whether the frame is the top Java frame or not (and on whether it's a debug build or not). Setting R1_SP to sender_SP is a shortcut for leaf calls which relies on having an unmodified stack until this point. So the patch fixes the issue.

Glad to help! Thanks for the additional information, I was not aware that the
selection of different frame headers could be done at compile time. One last
question only for my education: what exactly advanced (incremented) R1_SP so it
has to be cut back using sender_SP value, i.e. sender_SP tracks the frame for
which function exactly or "who" is the caller exactly here?

Thank you.

Best regards,
Gustavo

> New webrev:
> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/
> 
> Best regards,
> Martin
> 
> 
> -----Original Message-----
> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> Sent: Donnerstag, 3. Januar 2019 19:36
> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
> 
> Hi Martin,
> 
> On 01/03/2019 03:34 PM, Doerr, Martin wrote:
>> Unfortunately, I can't reproduce the crash. TestCRC32C works stable on our machine (with fastdbg build).
>> I guess that the frameless spills mess up the stack. Can you check if the patch below helps?
> 
> Thanks for providing a fix so I can try it.
> Yes, I confirm the patch below indeed fixes the sigsegv crash when CRC32C update() method is used.
> I also confirm that I don't observe the crash on the fastdebug build, only on the release build.
> It also only affects the Interpreter mode, so passing -Xcomp avoids the crash on the release build.
> 
> Just as reference, I can reproduce it on the release build with the following trivial code:
> 
> import java.util.zip.CRC32C;
> 
> class CRC32C_v1 {
>     public static void main(String[] arg) {
>       byte[] b = new byte[1024];
>     
>       CRC32C crc32c = new CRC32C();
>       crc32c.update(b, 0, b.length);
> 
>       System.out.println(crc32c.getValue());
>     }
> }
> 
> Thanks for fixing the typos.
> 
> 
> Best regards,
> Gustavo
>    
>> Best regards,
>> Martin
>>
>>
>> diff -r a33f49d5998c src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
>> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 17:30:03 2019 +0100
>> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 18:33:16 2019 +0100
>> @@ -1924,6 +1924,9 @@
>>          __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>>        }
>>
>> +    // Restore caller sp for c2i case.
>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>> +
>>        StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
>>
>>        if (!VM_Version::has_vpmsumb()) {
>> @@ -1933,8 +1936,6 @@
>>          __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, true);
>>        }
>>
>> -    // Restore caller sp for c2i case and return.
>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>        __ blr();
>>
>>        // Generate a vanilla native entry as the slow path.
>> @@ -2014,6 +2015,9 @@
>>          __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>>        }
>>
>> +    // Restore caller sp for c2i case.
>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>> +
>>        StubRoutines::ppc64::generate_load_crc32c_table_addr(_masm, table);
>>
>>        if (!VM_Version::has_vpmsumb()) {
>> @@ -2023,8 +2027,6 @@
>>          __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, false);
>>        }
>>
>> -    // Restore caller sp for c2i case and return.
>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>        __ blr();
>>
>>        BLOCK_COMMENT("} CRC32C_update{Bytes|DirectByteBuffer}");
>>
>>
>> -----Original Message-----
>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>> Sent: Donnerstag, 3. Januar 2019 17:13
>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>
>> Hi Martin,
>>
>> oh that's nice. You removed the 512-byte block constraint and also wired it up to the Interpreter :)
>>
>> For the worst case, unaligned 512 byte array, I see the gap to aligned 512 byte array reduced by about ~5.7x.
>>
>> On the Interpreter I see an improvement of at least 50% for 1024 bytes.
>>
>> This is all for the CRC32 class.
>>
>> On CRC32C I'm getting a SIGSEV that can be reproduced running against ./test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java.
>>
>> I've upload a full log into http://cr.openjdk.java.net/~gromero/logs/crc32c_sigsegv/
>>
>> I'm leaving for the lunch and I'll take a closer look when back. But probably you will figure it out before I sit to appreciate the meal :)
>>
>> Finally, since the change does some cleanup, I wonder if it would be worth fixing the following typos:
>>
>> I think it's Barrett const., not Barret. Probably 'barret' is used in the code as a short version
>> for Barrett but it should be changed in
>>
>> +  // Point to Barret constants
>> +  add_const_optimized(cur_const, constants, outer_consts_size + inner_consts_size);
>> +
>>
>> ?
>>
>> s/not/note/ in:
>> cpu/ppc/macroAssembler_ppc.cpp:3977:// A not on the lookup table address(es):
>>
>> d/lives/ in:
>> cpu/ppc/macroAssembler_ppc.cpp:4265:  mtvrwz(VCRC, crc); // crc lives lives in VCRC, now
>>
>> Best regards,
>> Gustavo
>>
>> On 01/03/2019 12:17 PM, Doerr, Martin wrote:
>>> Hi,
>>>
>>> the JVM on PPC64 currently misses usage of the fast vector implementation in the interpreter code.
>>>
>>> In addition, performance is not good for short arrays (unaligned 512 byte arrays or shorter arrays) because the current vector implementation needs at least 512 bytes.
>>>
>>> Bug:
>>>
>>> https://bugs.openjdk.java.net/browse/JDK-8216060
>>>
>>> I have addressed these 2 issues + some cleanup with the following webrev:
>>>
>>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8216060_PPC64_CRC/webrev.00/>
>>>
>>> Please review.
>>>
>>> Best regards,
>>>
>>> Martin
>>>
>>
> 


From gromero at linux.vnet.ibm.com  Fri Jan  4 18:54:32 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Fri, 4 Jan 2019 16:54:32 -0200
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used
 by interpreter and be faster for short arrays
In-Reply-To: <beff3d359c954a29962be71c40bc235b@sap.com>
References: <1c4646d554954551b73c077fa40f983d@sap.com>
 <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>
 <9863276de30643338249ead2a6ac7fe9@sap.com>
 <452dcb69-189e-700d-5995-582ba13669b9@linux.vnet.ibm.com>
 <37d9e6f3b2b4400d8963f54d2fe7767f@sap.com>
 <406db3e3-2ac3-dc16-f384-99a314e62a42@linux.vnet.ibm.com>
 <beff3d359c954a29962be71c40bc235b@sap.com>
Message-ID: <180a6c0b-7abe-9d5c-51e6-dffbb23570d3@linux.vnet.ibm.com>

Hi Martin,

On 01/04/2019 02:13 PM, Doerr, Martin wrote:
> Hi Gustavo,
> 
> when called from the interpreter (the scenario you observed), R21 is set before resizing the frame to avoid wasted stack space (InterpreterMacroAssembler::call_from_interpreter).

Got it. Thanks a lot for the explanations.

I think it doesn't currently matter in practice, but I'm wondering if to be
consistent we should cut back the stack back earlier also in
TemplateInterpreterGenerator::generate_CRC32_update_entry()?

diff -r a35f8c35d8c9 src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
--- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan 04 10:09:00 2019 +0100
+++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan 04 13:44:37 2019 -0500
@@ -1840,11 +1840,12 @@
  #endif
      __ lwz(crc,  2*wordSize, argP);    // Current crc state, zero extend to 64 bit to have a clean register.
  
+    // Restore caller sp for c2i case and return.
+    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
+
      StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
      __ kernel_crc32_singleByte(crc, data, dataLen, table, tmp, true);
  
-    // Restore caller sp for c2i case and return.
-    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
      __ blr();
  
      // Generate a vanilla native entry as the slow path.

Currently there is no issue probably because generated code is simpler and does
no spills.

Best regards,
Gustavo

> When called from compiled methods, R21 is set by a c2i adapter which extends the compiled frame by space for arguments (gen_c2i_adapter).
> 
> "mr(R1_SP, R21_sender_SP)" is more error-prone than "resize_frame_absolute" so I think the latter would be better (though it takes more registers and instructions), but I don't want to replace that as part of this CRC change.
> 
> Best regards,
> Martin
> 
> 
> -----Original Message-----
> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> Sent: Freitag, 4. Januar 2019 14:44
> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
> 
> Hi Martin,
> 
> On 01/04/2019 07:30 AM, Doerr, Martin wrote:
>> thank you very much for confirming. This makes sense. We use different frame headers depending on whether the frame is the top Java frame or not (and on whether it's a debug build or not). Setting R1_SP to sender_SP is a shortcut for leaf calls which relies on having an unmodified stack until this point. So the patch fixes the issue.
> 
> Glad to help! Thanks for the additional information, I was not aware that the
> selection of different frame headers could be done at compile time. One last
> question only for my education: what exactly advanced (incremented) R1_SP so it
> has to be cut back using sender_SP value, i.e. sender_SP tracks the frame for
> which function exactly or "who" is the caller exactly here?
> 
> Thank you.
> 
> Best regards,
> Gustavo
> 
>> New webrev:
>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/
>>
>> Best regards,
>> Martin
>>
>>
>> -----Original Message-----
>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>> Sent: Donnerstag, 3. Januar 2019 19:36
>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>
>> Hi Martin,
>>
>> On 01/03/2019 03:34 PM, Doerr, Martin wrote:
>>> Unfortunately, I can't reproduce the crash. TestCRC32C works stable on our machine (with fastdbg build).
>>> I guess that the frameless spills mess up the stack. Can you check if the patch below helps?
>>
>> Thanks for providing a fix so I can try it.
>> Yes, I confirm the patch below indeed fixes the sigsegv crash when CRC32C update() method is used.
>> I also confirm that I don't observe the crash on the fastdebug build, only on the release build.
>> It also only affects the Interpreter mode, so passing -Xcomp avoids the crash on the release build.
>>
>> Just as reference, I can reproduce it on the release build with the following trivial code:
>>
>> import java.util.zip.CRC32C;
>>
>> class CRC32C_v1 {
>>      public static void main(String[] arg) {
>>        byte[] b = new byte[1024];
>>      
>>        CRC32C crc32c = new CRC32C();
>>        crc32c.update(b, 0, b.length);
>>
>>        System.out.println(crc32c.getValue());
>>      }
>> }
>>
>> Thanks for fixing the typos.
>>
>>
>> Best regards,
>> Gustavo
>>     
>>> Best regards,
>>> Martin
>>>
>>>
>>> diff -r a33f49d5998c src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
>>> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 17:30:03 2019 +0100
>>> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 18:33:16 2019 +0100
>>> @@ -1924,6 +1924,9 @@
>>>           __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>>>         }
>>>
>>> +    // Restore caller sp for c2i case.
>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>> +
>>>         StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
>>>
>>>         if (!VM_Version::has_vpmsumb()) {
>>> @@ -1933,8 +1936,6 @@
>>>           __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, true);
>>>         }
>>>
>>> -    // Restore caller sp for c2i case and return.
>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>         __ blr();
>>>
>>>         // Generate a vanilla native entry as the slow path.
>>> @@ -2014,6 +2015,9 @@
>>>           __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>>>         }
>>>
>>> +    // Restore caller sp for c2i case.
>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>> +
>>>         StubRoutines::ppc64::generate_load_crc32c_table_addr(_masm, table);
>>>
>>>         if (!VM_Version::has_vpmsumb()) {
>>> @@ -2023,8 +2027,6 @@
>>>           __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, false);
>>>         }
>>>
>>> -    // Restore caller sp for c2i case and return.
>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>         __ blr();
>>>
>>>         BLOCK_COMMENT("} CRC32C_update{Bytes|DirectByteBuffer}");
>>>
>>>
>>> -----Original Message-----
>>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>>> Sent: Donnerstag, 3. Januar 2019 17:13
>>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>>
>>> Hi Martin,
>>>
>>> oh that's nice. You removed the 512-byte block constraint and also wired it up to the Interpreter :)
>>>
>>> For the worst case, unaligned 512 byte array, I see the gap to aligned 512 byte array reduced by about ~5.7x.
>>>
>>> On the Interpreter I see an improvement of at least 50% for 1024 bytes.
>>>
>>> This is all for the CRC32 class.
>>>
>>> On CRC32C I'm getting a SIGSEV that can be reproduced running against ./test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java.
>>>
>>> I've upload a full log into http://cr.openjdk.java.net/~gromero/logs/crc32c_sigsegv/
>>>
>>> I'm leaving for the lunch and I'll take a closer look when back. But probably you will figure it out before I sit to appreciate the meal :)
>>>
>>> Finally, since the change does some cleanup, I wonder if it would be worth fixing the following typos:
>>>
>>> I think it's Barrett const., not Barret. Probably 'barret' is used in the code as a short version
>>> for Barrett but it should be changed in
>>>
>>> +  // Point to Barret constants
>>> +  add_const_optimized(cur_const, constants, outer_consts_size + inner_consts_size);
>>> +
>>>
>>> ?
>>>
>>> s/not/note/ in:
>>> cpu/ppc/macroAssembler_ppc.cpp:3977:// A not on the lookup table address(es):
>>>
>>> d/lives/ in:
>>> cpu/ppc/macroAssembler_ppc.cpp:4265:  mtvrwz(VCRC, crc); // crc lives lives in VCRC, now
>>>
>>> Best regards,
>>> Gustavo
>>>
>>> On 01/03/2019 12:17 PM, Doerr, Martin wrote:
>>>> Hi,
>>>>
>>>> the JVM on PPC64 currently misses usage of the fast vector implementation in the interpreter code.
>>>>
>>>> In addition, performance is not good for short arrays (unaligned 512 byte arrays or shorter arrays) because the current vector implementation needs at least 512 bytes.
>>>>
>>>> Bug:
>>>>
>>>> https://bugs.openjdk.java.net/browse/JDK-8216060
>>>>
>>>> I have addressed these 2 issues + some cleanup with the following webrev:
>>>>
>>>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8216060_PPC64_CRC/webrev.00/>
>>>>
>>>> Please review.
>>>>
>>>> Best regards,
>>>>
>>>> Martin
>>>>
>>>
>>
> 


From yasuenag at gmail.com  Sat Jan  5 01:33:42 2019
From: yasuenag at gmail.com (Yasumasa Suenaga)
Date: Sat, 5 Jan 2019 10:33:42 +0900
Subject: RFR: 8216154: C4819 warnings at HotSpot sources on Windows
Message-ID: <9b1bd147-a261-95df-18f8-a5476415d0a4@gmail.com>

Hi all,

Please review this change:

   JBS: https://bugs.openjdk.java.net/browse/JDK-8216154
   webrev: http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.00/
   Discussion on build-dev: https://mail.openjdk.java.net/pipermail/build-dev/2019-January/024581.html


I tried to build OpenJDK on WSL (Windows 10 1809 + VS2017 (15.9.4) + Ubuntu 18.04 LTS).
However, I saw some C4819 warnings as below:

```
c:/OpenJDK/jdk/src/hotspot/share/compiler/methodMatcher.cpp(258): warning C4819: ???????????? ??? (0) ??????????????????????????????????? Unicode ????????????
```

* The locale of my laptop is set to Japanese (CP932)

I saw this warning at 2 files as below:

   - hotspot/share/code/codeHeapState.cpp
   - hotspot/share/compiler/methodMatcher.cpp

We can see the problem with iconv:
   $ iconv -f US-ASCII -t UTF8 <file>


This change passed submit repo tests.


Thanks,

Yasumasa

From kim.barrett at oracle.com  Sun Jan  6 07:14:22 2019
From: kim.barrett at oracle.com (Kim Barrett)
Date: Sun, 6 Jan 2019 02:14:22 -0500
Subject: RFR: 8216154: C4819 warnings at HotSpot sources on Windows
In-Reply-To: <9b1bd147-a261-95df-18f8-a5476415d0a4@gmail.com>
References: <9b1bd147-a261-95df-18f8-a5476415d0a4@gmail.com>
Message-ID: <E5EB73E3-26B8-49F9-8A64-545DAC2DE41C@oracle.com>

> On Jan 4, 2019, at 8:33 PM, Yasumasa Suenaga <yasuenag at gmail.com> wrote:
> 
> Hi all,
> 
> Please review this change:
> 
>  JBS: https://bugs.openjdk.java.net/browse/JDK-8216154
>  webrev: http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.00/
>  Discussion on build-dev: https://mail.openjdk.java.net/pipermail/build-dev/2019-January/024581.html

The preferred idiom to disable a warning over some scope is to use

#pragma warning(push)
#pragma warning(disable : 4819)
?
#pragma warning(pop)


From yasuenag at gmail.com  Sun Jan  6 12:53:21 2019
From: yasuenag at gmail.com (Yasumasa Suenaga)
Date: Sun, 6 Jan 2019 21:53:21 +0900
Subject: RFR: 8216154: C4819 warnings at HotSpot sources on Windows
In-Reply-To: <E5EB73E3-26B8-49F9-8A64-545DAC2DE41C@oracle.com>
References: <9b1bd147-a261-95df-18f8-a5476415d0a4@gmail.com>
 <E5EB73E3-26B8-49F9-8A64-545DAC2DE41C@oracle.com>
Message-ID: <aa401521-7166-996d-0bc3-47095f9ed391@gmail.com>

Hi Kim,

Thank you for your comment.
I uploaded new webrev to use pragma warning push/pop:

   http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.01/


Please review again.


Yasumasa


On 2019/01/06 16:14, Kim Barrett wrote:
>> On Jan 4, 2019, at 8:33 PM, Yasumasa Suenaga <yasuenag at gmail.com> wrote:
>>
>> Hi all,
>>
>> Please review this change:
>>
>>   JBS: https://bugs.openjdk.java.net/browse/JDK-8216154
>>   webrev: http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.00/
>>   Discussion on build-dev: https://mail.openjdk.java.net/pipermail/build-dev/2019-January/024581.html
> 
> The preferred idiom to disable a warning over some scope is to use
> 
> #pragma warning(push)
> #pragma warning(disable : 4819)
> ?
> #pragma warning(pop)
> 

From kim.barrett at oracle.com  Sun Jan  6 17:54:48 2019
From: kim.barrett at oracle.com (Kim Barrett)
Date: Sun, 6 Jan 2019 12:54:48 -0500
Subject: RFR: 8216154: C4819 warnings at HotSpot sources on Windows
In-Reply-To: <aa401521-7166-996d-0bc3-47095f9ed391@gmail.com>
References: <9b1bd147-a261-95df-18f8-a5476415d0a4@gmail.com>
 <E5EB73E3-26B8-49F9-8A64-545DAC2DE41C@oracle.com>
 <aa401521-7166-996d-0bc3-47095f9ed391@gmail.com>
Message-ID: <0E131429-5B2F-4A2E-980A-8966354AB7F4@oracle.com>

> On Jan 6, 2019, at 7:53 AM, Yasumasa Suenaga <yasuenag at gmail.com> wrote:
> 
> Hi Kim,
> 
> Thank you for your comment.
> I uploaded new webrev to use pragma warning push/pop:
> 
>  http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.01/
> 
> 
> Please review again.

Looks good.


From kim.barrett at oracle.com  Sun Jan  6 22:18:55 2019
From: kim.barrett at oracle.com (Kim Barrett)
Date: Sun, 6 Jan 2019 17:18:55 -0500
Subject: RFR: 8216154: C4819 warnings at HotSpot sources on Windows
In-Reply-To: <0E131429-5B2F-4A2E-980A-8966354AB7F4@oracle.com>
References: <9b1bd147-a261-95df-18f8-a5476415d0a4@gmail.com>
 <E5EB73E3-26B8-49F9-8A64-545DAC2DE41C@oracle.com>
 <aa401521-7166-996d-0bc3-47095f9ed391@gmail.com>
 <0E131429-5B2F-4A2E-980A-8966354AB7F4@oracle.com>
Message-ID: <F4A867AA-9BB2-480A-A421-0C05E018ADA7@oracle.com>

> On Jan 6, 2019, at 12:54 PM, Kim Barrett <kim.barrett at oracle.com> wrote:
> 
>> On Jan 6, 2019, at 7:53 AM, Yasumasa Suenaga <yasuenag at gmail.com> wrote:
>> 
>> Hi Kim,
>> 
>> Thank you for your comment.
>> I uploaded new webrev to use pragma warning push/pop:
>> 
>> http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.01/
>> 
>> 
>> Please review again.
> 
> Looks good.

It later occurred to me to wonder whether _WINDOWS was the right macro to conditionalize
on.  All other uses of #pragma warning push/pop (there are 5 in HotSpot) use _MSC_VER.

I also wonder why we don?t have a Visual Studio definition for PRAGMA_DIAG_PUSH/POP,
but that?s a different issue altogether.


From OGATAK at jp.ibm.com  Mon Jan  7 05:13:31 2019
From: OGATAK at jp.ibm.com (Kazunori Ogata)
Date: Mon, 7 Jan 2019 14:13:31 +0900
Subject: [8u] RFR for backport of 8154156: PPC64: improve array copy stubs
 by using vector instructions
In-Reply-To: <OFB23FA863.EF67A003-ON49258367.004E5542-49258367.0050AF0E@LocalDomain>
References: <OFB23FA863.EF67A003-ON49258367.004E5542-49258367.0050AF0E@LocalDomain>
Message-ID: <OF3558918E.E6D1DAF0-ON4925837B.001C7F7B-4925837B.001CB464@notes.na.collabserv.com>

Hi,

Ping.  Can anyone review this enhancement backport request?

Regards,
Ogata


Kazunori Ogata/Japan/IBM wrote on 2018/12/18 23:41:16:

> From: Kazunori Ogata/Japan/IBM
> To: hotspot-compiler-dev at openjdk.java.net, 
ppc-aix-port-dev at openjdk.java.net
> Date: 2018/12/18 23:41
> Subject: [8u] RFR for backport of 8154156: PPC64: improve array copy 
stubs
> by using vector instructions
> 
> Hi,
> 
> May I get review for enhancement backport of 8154156: PPC64: improve 
array
> copy stubs by using vector instructions?
> 
> To make this patch buildable (and usable by other planned backports 
listed
> in [1]), I cherry picked config_dscr() and its dependent code from [2,3] 

> and has_mfdscr() from [4].
> 
> Original patch: http://hg.openjdk.java.net/jdk/jdk/rev/c9d756fa846e
> Weberv: 
http://cr.openjdk.java.net/~horii/jdk8u_aes_be/8154156/webrev.01/
> 
> I confirmed it was buildable for both relase and fastdebug builds, and 
> JTREG caused no degradation.
> 
> Refs:
> [1] 
http://mail.openjdk.java.net/pipermail/ppc-aix-port-dev/2018-December/
> 003818.html
> [2] 8149655: PPC64: Implement CompactString intrinsics
>     http://hg.openjdk.java.net/jdk/jdk/rev/6241574f5982
> [3] 8080684: PPC64: Fix little-endian build after "8077838: Recent 
> developments for ppc"
>     http://hg.openjdk.java.net/jdk/jdk/rev/12ccf8b26eb0 
> [4] 8077838: Recent developments for ppc.
>     http://hg.openjdk.java.net/jdk/jdk/rev/c703c89fddbf
> 
> Regards,
> Ogata


From claes.redestad at oracle.com  Mon Jan  7 12:36:45 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Mon, 7 Jan 2019 13:36:45 +0100
Subject: RFR: 8216262: Remove develop flag DelayCompilationDuringStartup
Message-ID: <64522ad1-1473-8012-b63a-5309bc72c5cc@oracle.com>

Hi,

DelayCompilationAtStartup doesn't delay any compilations.

Webrev: http://cr.openjdk.java.net/~redestad/8216262/open.00/
Bug:    https://bugs.openjdk.java.net/browse/JDK-8216262

Testing: tier1

Thanks!

/Claes

From yasuenag at gmail.com  Mon Jan  7 12:36:24 2019
From: yasuenag at gmail.com (Yasumasa Suenaga)
Date: Mon, 7 Jan 2019 21:36:24 +0900
Subject: RFR: 8216154: C4819 warnings at HotSpot sources on Windows
In-Reply-To: <F4A867AA-9BB2-480A-A421-0C05E018ADA7@oracle.com>
References: <9b1bd147-a261-95df-18f8-a5476415d0a4@gmail.com>
 <E5EB73E3-26B8-49F9-8A64-545DAC2DE41C@oracle.com>
 <aa401521-7166-996d-0bc3-47095f9ed391@gmail.com>
 <0E131429-5B2F-4A2E-980A-8966354AB7F4@oracle.com>
 <F4A867AA-9BB2-480A-A421-0C05E018ADA7@oracle.com>
Message-ID: <cc76b240-9088-0a03-91a5-9304325417aa@gmail.com>

Hi Kim,

On 2019/01/07 7:18, Kim Barrett wrote:
>> On Jan 6, 2019, at 12:54 PM, Kim Barrett <kim.barrett at oracle.com> wrote:
>>
>>> On Jan 6, 2019, at 7:53 AM, Yasumasa Suenaga <yasuenag at gmail.com> wrote:
>>>
>>> Hi Kim,
>>>
>>> Thank you for your comment.
>>> I uploaded new webrev to use pragma warning push/pop:
>>>
>>> http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.01/
>>>
>>>
>>> Please review again.
>>
>> Looks good.
> 
> It later occurred to me to wonder whether _WINDOWS was the right macro to conditionalize
> on.  All other uses of #pragma warning push/pop (there are 5 in HotSpot) use _MSC_VER.

I updated webrev to use _MSC_VER. Is it ok?

   http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.02/


Thanks,

Yasumasa


> I also wonder why we don?t have a Visual Studio definition for PRAGMA_DIAG_PUSH/POP,
> but that?s a different issue altogether.
> 

From david.holmes at oracle.com  Mon Jan  7 12:54:17 2019
From: david.holmes at oracle.com (David Holmes)
Date: Mon, 7 Jan 2019 22:54:17 +1000
Subject: RFR: 8216262: Remove develop flag DelayCompilationDuringStartup
In-Reply-To: <64522ad1-1473-8012-b63a-5309bc72c5cc@oracle.com>
References: <64522ad1-1473-8012-b63a-5309bc72c5cc@oracle.com>
Message-ID: <9a9f466b-5ffc-06de-9fde-6a8ef78622ee@oracle.com>

Hi Claes,

On 7/01/2019 10:36 pm, Claes Redestad wrote:
> Hi,
> 
> DelayCompilationAtStartup doesn't delay any compilations.
> 
> Webrev: http://cr.openjdk.java.net/~redestad/8216262/open.00/
> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8216262

Normally we would follow a staged removal process: deprecate, obsolete, 
then expire - see arguments.cpp and special_jvm_flags table. In this 
case we can probably start at obsoletion, but that would leave 
expiration for JDK 14. Or compiler folk can argue for / justify 
immediate full expiration/removal.

Cheers,
David

> Testing: tier1
> 
> Thanks!
> 
> /Claes

From claes.redestad at oracle.com  Mon Jan  7 13:01:51 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Mon, 7 Jan 2019 14:01:51 +0100
Subject: RFR: 8216262: Remove develop flag DelayCompilationDuringStartup
In-Reply-To: <9a9f466b-5ffc-06de-9fde-6a8ef78622ee@oracle.com>
References: <64522ad1-1473-8012-b63a-5309bc72c5cc@oracle.com>
 <9a9f466b-5ffc-06de-9fde-6a8ef78622ee@oracle.com>
Message-ID: <2708906c-c27e-4514-3066-0f2a86fbae9e@oracle.com>

On 2019-01-07 13:54, David Holmes wrote:
> Hi Claes,
> 
> On 7/01/2019 10:36 pm, Claes Redestad wrote:
>> Hi,
>>
>> DelayCompilationAtStartup doesn't delay any compilations.
>>
>> Webrev: http://cr.openjdk.java.net/~redestad/8216262/open.00/
>> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8216262
> 
> Normally we would follow a staged removal process: deprecate, obsolete, 
> then expire - see arguments.cpp and special_jvm_flags table. In this 
> case we can probably start at obsoletion, but that would leave 
> expiration for JDK 14. Or compiler folk can argue for / justify 
> immediate full expiration/removal.

I'm under the impression this process does not apply to develop flags
(which are not visible an anything by debug builds)?

/Claes

From martin.doerr at sap.com  Mon Jan  7 13:08:57 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Mon, 7 Jan 2019 13:08:57 +0000
Subject: [8u] RFR for backport of 8154156: PPC64: improve array copy stubs
 by using vector instructions
In-Reply-To: <OF3558918E.E6D1DAF0-ON4925837B.001C7F7B-4925837B.001CB464@notes.na.collabserv.com>
References: <OFB23FA863.EF67A003-ON49258367.004E5542-49258367.0050AF0E@LocalDomain>
 <OF3558918E.E6D1DAF0-ON4925837B.001C7F7B-4925837B.001CB464@notes.na.collabserv.com>
Message-ID: <6adf0a283eda47b29df02a3a2d8550ee@sap.com>

Hi Ogata,

looks good to me. However, I'm not a jdk8u reviewer.

Best regards,
Martin


-----Original Message-----
From: ppc-aix-port-dev <ppc-aix-port-dev-bounces at openjdk.java.net> On Behalf Of Kazunori Ogata
Sent: Montag, 7. Januar 2019 06:14
To: hotspot-compiler-dev at openjdk.java.net; ppc-aix-port-dev at openjdk.java.net
Subject: Re: [8u] RFR for backport of 8154156: PPC64: improve array copy stubs by using vector instructions

Hi,

Ping.  Can anyone review this enhancement backport request?

Regards,
Ogata


Kazunori Ogata/Japan/IBM wrote on 2018/12/18 23:41:16:

> From: Kazunori Ogata/Japan/IBM
> To: hotspot-compiler-dev at openjdk.java.net, 
ppc-aix-port-dev at openjdk.java.net
> Date: 2018/12/18 23:41
> Subject: [8u] RFR for backport of 8154156: PPC64: improve array copy 
stubs
> by using vector instructions
> 
> Hi,
> 
> May I get review for enhancement backport of 8154156: PPC64: improve 
array
> copy stubs by using vector instructions?
> 
> To make this patch buildable (and usable by other planned backports 
listed
> in [1]), I cherry picked config_dscr() and its dependent code from [2,3] 

> and has_mfdscr() from [4].
> 
> Original patch: http://hg.openjdk.java.net/jdk/jdk/rev/c9d756fa846e
> Weberv: 
http://cr.openjdk.java.net/~horii/jdk8u_aes_be/8154156/webrev.01/
> 
> I confirmed it was buildable for both relase and fastdebug builds, and 
> JTREG caused no degradation.
> 
> Refs:
> [1] 
http://mail.openjdk.java.net/pipermail/ppc-aix-port-dev/2018-December/
> 003818.html
> [2] 8149655: PPC64: Implement CompactString intrinsics
>     http://hg.openjdk.java.net/jdk/jdk/rev/6241574f5982
> [3] 8080684: PPC64: Fix little-endian build after "8077838: Recent 
> developments for ppc"
>     http://hg.openjdk.java.net/jdk/jdk/rev/12ccf8b26eb0 
> [4] 8077838: Recent developments for ppc.
>     http://hg.openjdk.java.net/jdk/jdk/rev/c703c89fddbf
> 
> Regards,
> Ogata


From thomas.schatzl at oracle.com  Mon Jan  7 13:20:43 2019
From: thomas.schatzl at oracle.com (Thomas Schatzl)
Date: Mon, 07 Jan 2019 14:20:43 +0100
Subject: RFR: 8216154: C4819 warnings at HotSpot sources on Windows
In-Reply-To: <cc76b240-9088-0a03-91a5-9304325417aa@gmail.com>
References: <9b1bd147-a261-95df-18f8-a5476415d0a4@gmail.com>
 <E5EB73E3-26B8-49F9-8A64-545DAC2DE41C@oracle.com>
 <aa401521-7166-996d-0bc3-47095f9ed391@gmail.com>
 <0E131429-5B2F-4A2E-980A-8966354AB7F4@oracle.com>
 <F4A867AA-9BB2-480A-A421-0C05E018ADA7@oracle.com>
 <cc76b240-9088-0a03-91a5-9304325417aa@gmail.com>
Message-ID: <3a1836446f59ae6ac89a00dc09867c1d1e97e157.camel@oracle.com>

Hi,

On Mon, 2019-01-07 at 21:36 +0900, Yasumasa Suenaga wrote:
> Hi Kim,
> 
> On 2019/01/07 7:18, Kim Barrett wrote:
> > > On Jan 6, 2019, at 12:54 PM, Kim Barrett <kim.barrett at oracle.com>
> > > wrote:
> > > 
> > > > On Jan 6, 2019, at 7:53 AM, Yasumasa Suenaga <
> > > > yasuenag at gmail.com> wrote:
> > > > 
> > > > Hi Kim,
> > > > 
> > > > Thank you for your comment.
> > > > I uploaded new webrev to use pragma warning push/pop:
> > > > 
> > > > http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.01/
> > > > 
> > > > 
> > > > Please review again.
> > > 
> > > Looks good.

I tried to verify these problems on these two files as suggested with
"iconv -f US-ASCII -t UTF8 <file>" which errored out on
codeHeapState.cpp as expected but there has been no error with
methodMatcher.cpp. Am I doing something wrong?

I am fine with that change if it is really needed for successful
compliation :) I just can't find the non-US-ASCII character used in the
line indicated by the error message.

> > 
> > It later occurred to me to wonder whether _WINDOWS was the right
> > macro to conditionalize on.  All other uses of #pragma warning
> > push/pop (there are 5 in HotSpot) use _MSC_VER.
> 
> I updated webrev to use _MSC_VER. Is it ok?
> 
>    http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.02/
> 

Please add a "// warning C4189: The file contains a character that
cannot be represented in the current code page" comment above or next
to the pragma warning(disable) declaration.

Not many people know the VC warning numbers by default...

Looks good otherwise, I do not need a re-review for this comment
change.

Thanks,
  Thomas


From claes.redestad at oracle.com  Mon Jan  7 13:31:42 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Mon, 7 Jan 2019 14:31:42 +0100
Subject: RFR: 8216262: Remove develop flag DelayCompilationDuringStartup
In-Reply-To: <2708906c-c27e-4514-3066-0f2a86fbae9e@oracle.com>
References: <64522ad1-1473-8012-b63a-5309bc72c5cc@oracle.com>
 <9a9f466b-5ffc-06de-9fde-6a8ef78622ee@oracle.com>
 <2708906c-c27e-4514-3066-0f2a86fbae9e@oracle.com>
Message-ID: <818084dc-3e98-97da-20f4-aa00f3f6545e@oracle.com>


On 2019-01-07 14:01, Claes Redestad wrote:
>>
>> Normally we would follow a staged removal process: deprecate, 
>> obsolete, then expire - see arguments.cpp and special_jvm_flags table. 
>> In this case we can probably start at obsoletion, but that would leave 
>> expiration for JDK 14. Or compiler folk can argue for / justify 
>> immediate full expiration/removal.
> 
> I'm under the impression this process does not apply to develop flags
> (which are not visible an anything but debug builds)?

We've removed develop flags without obsoletion + expiry many times in
the past[1], and while this goes against the written down expiration
in arguments.cpp, I believe it to be a misguided recommendation for
develop flags.

/Claes

[1]
https://bugs.openjdk.java.net/browse/JDK-8191870
https://bugs.openjdk.java.net/browse/JDK-8132318
https://bugs.openjdk.java.net/browse/JDK-8186042
https://bugs.openjdk.java.net/browse/JDK-8180423
https://bugs.openjdk.java.net/browse/JDK-8058259

From martin.doerr at sap.com  Mon Jan  7 13:49:34 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Mon, 7 Jan 2019 13:49:34 +0000
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used
 by interpreter and be faster for short arrays
In-Reply-To: <180a6c0b-7abe-9d5c-51e6-dffbb23570d3@linux.vnet.ibm.com>
References: <1c4646d554954551b73c077fa40f983d@sap.com>
 <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>
 <9863276de30643338249ead2a6ac7fe9@sap.com>
 <452dcb69-189e-700d-5995-582ba13669b9@linux.vnet.ibm.com>
 <37d9e6f3b2b4400d8963f54d2fe7767f@sap.com>
 <406db3e3-2ac3-dc16-f384-99a314e62a42@linux.vnet.ibm.com>
 <beff3d359c954a29962be71c40bc235b@sap.com>
 <180a6c0b-7abe-9d5c-51e6-dffbb23570d3@linux.vnet.ibm.com>
Message-ID: <d898c13929a44afb82d477fd732d23e7@sap.com>

Hi Gustavo,

I want to check all places where we use "mr(R1_SP, R21_sender_SP)". There may be more issues with that. I'll probably handle that in a separate change and push this CRC change afterwards.

Best regards,
Martin


-----Original Message-----
From: Gustavo Romero <gromero at linux.vnet.ibm.com> 
Sent: Freitag, 4. Januar 2019 19:55
To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays

Hi Martin,

On 01/04/2019 02:13 PM, Doerr, Martin wrote:
> Hi Gustavo,
> 
> when called from the interpreter (the scenario you observed), R21 is set before resizing the frame to avoid wasted stack space (InterpreterMacroAssembler::call_from_interpreter).

Got it. Thanks a lot for the explanations.

I think it doesn't currently matter in practice, but I'm wondering if to be
consistent we should cut back the stack back earlier also in
TemplateInterpreterGenerator::generate_CRC32_update_entry()?

diff -r a35f8c35d8c9 src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
--- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan 04 10:09:00 2019 +0100
+++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan 04 13:44:37 2019 -0500
@@ -1840,11 +1840,12 @@
  #endif
      __ lwz(crc,  2*wordSize, argP);    // Current crc state, zero extend to 64 bit to have a clean register.
  
+    // Restore caller sp for c2i case and return.
+    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
+
      StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
      __ kernel_crc32_singleByte(crc, data, dataLen, table, tmp, true);
  
-    // Restore caller sp for c2i case and return.
-    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
      __ blr();
  
      // Generate a vanilla native entry as the slow path.

Currently there is no issue probably because generated code is simpler and does
no spills.

Best regards,
Gustavo

> When called from compiled methods, R21 is set by a c2i adapter which extends the compiled frame by space for arguments (gen_c2i_adapter).
> 
> "mr(R1_SP, R21_sender_SP)" is more error-prone than "resize_frame_absolute" so I think the latter would be better (though it takes more registers and instructions), but I don't want to replace that as part of this CRC change.
> 
> Best regards,
> Martin
> 
> 
> -----Original Message-----
> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> Sent: Freitag, 4. Januar 2019 14:44
> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
> 
> Hi Martin,
> 
> On 01/04/2019 07:30 AM, Doerr, Martin wrote:
>> thank you very much for confirming. This makes sense. We use different frame headers depending on whether the frame is the top Java frame or not (and on whether it's a debug build or not). Setting R1_SP to sender_SP is a shortcut for leaf calls which relies on having an unmodified stack until this point. So the patch fixes the issue.
> 
> Glad to help! Thanks for the additional information, I was not aware that the
> selection of different frame headers could be done at compile time. One last
> question only for my education: what exactly advanced (incremented) R1_SP so it
> has to be cut back using sender_SP value, i.e. sender_SP tracks the frame for
> which function exactly or "who" is the caller exactly here?
> 
> Thank you.
> 
> Best regards,
> Gustavo
> 
>> New webrev:
>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/
>>
>> Best regards,
>> Martin
>>
>>
>> -----Original Message-----
>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>> Sent: Donnerstag, 3. Januar 2019 19:36
>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>
>> Hi Martin,
>>
>> On 01/03/2019 03:34 PM, Doerr, Martin wrote:
>>> Unfortunately, I can't reproduce the crash. TestCRC32C works stable on our machine (with fastdbg build).
>>> I guess that the frameless spills mess up the stack. Can you check if the patch below helps?
>>
>> Thanks for providing a fix so I can try it.
>> Yes, I confirm the patch below indeed fixes the sigsegv crash when CRC32C update() method is used.
>> I also confirm that I don't observe the crash on the fastdebug build, only on the release build.
>> It also only affects the Interpreter mode, so passing -Xcomp avoids the crash on the release build.
>>
>> Just as reference, I can reproduce it on the release build with the following trivial code:
>>
>> import java.util.zip.CRC32C;
>>
>> class CRC32C_v1 {
>>      public static void main(String[] arg) {
>>        byte[] b = new byte[1024];
>>      
>>        CRC32C crc32c = new CRC32C();
>>        crc32c.update(b, 0, b.length);
>>
>>        System.out.println(crc32c.getValue());
>>      }
>> }
>>
>> Thanks for fixing the typos.
>>
>>
>> Best regards,
>> Gustavo
>>     
>>> Best regards,
>>> Martin
>>>
>>>
>>> diff -r a33f49d5998c src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
>>> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 17:30:03 2019 +0100
>>> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 18:33:16 2019 +0100
>>> @@ -1924,6 +1924,9 @@
>>>           __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>>>         }
>>>
>>> +    // Restore caller sp for c2i case.
>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>> +
>>>         StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
>>>
>>>         if (!VM_Version::has_vpmsumb()) {
>>> @@ -1933,8 +1936,6 @@
>>>           __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, true);
>>>         }
>>>
>>> -    // Restore caller sp for c2i case and return.
>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>         __ blr();
>>>
>>>         // Generate a vanilla native entry as the slow path.
>>> @@ -2014,6 +2015,9 @@
>>>           __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>>>         }
>>>
>>> +    // Restore caller sp for c2i case.
>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>> +
>>>         StubRoutines::ppc64::generate_load_crc32c_table_addr(_masm, table);
>>>
>>>         if (!VM_Version::has_vpmsumb()) {
>>> @@ -2023,8 +2027,6 @@
>>>           __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, false);
>>>         }
>>>
>>> -    // Restore caller sp for c2i case and return.
>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>         __ blr();
>>>
>>>         BLOCK_COMMENT("} CRC32C_update{Bytes|DirectByteBuffer}");
>>>
>>>
>>> -----Original Message-----
>>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>>> Sent: Donnerstag, 3. Januar 2019 17:13
>>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>>
>>> Hi Martin,
>>>
>>> oh that's nice. You removed the 512-byte block constraint and also wired it up to the Interpreter :)
>>>
>>> For the worst case, unaligned 512 byte array, I see the gap to aligned 512 byte array reduced by about ~5.7x.
>>>
>>> On the Interpreter I see an improvement of at least 50% for 1024 bytes.
>>>
>>> This is all for the CRC32 class.
>>>
>>> On CRC32C I'm getting a SIGSEV that can be reproduced running against ./test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java.
>>>
>>> I've upload a full log into http://cr.openjdk.java.net/~gromero/logs/crc32c_sigsegv/
>>>
>>> I'm leaving for the lunch and I'll take a closer look when back. But probably you will figure it out before I sit to appreciate the meal :)
>>>
>>> Finally, since the change does some cleanup, I wonder if it would be worth fixing the following typos:
>>>
>>> I think it's Barrett const., not Barret. Probably 'barret' is used in the code as a short version
>>> for Barrett but it should be changed in
>>>
>>> +  // Point to Barret constants
>>> +  add_const_optimized(cur_const, constants, outer_consts_size + inner_consts_size);
>>> +
>>>
>>> ?
>>>
>>> s/not/note/ in:
>>> cpu/ppc/macroAssembler_ppc.cpp:3977:// A not on the lookup table address(es):
>>>
>>> d/lives/ in:
>>> cpu/ppc/macroAssembler_ppc.cpp:4265:  mtvrwz(VCRC, crc); // crc lives lives in VCRC, now
>>>
>>> Best regards,
>>> Gustavo
>>>
>>> On 01/03/2019 12:17 PM, Doerr, Martin wrote:
>>>> Hi,
>>>>
>>>> the JVM on PPC64 currently misses usage of the fast vector implementation in the interpreter code.
>>>>
>>>> In addition, performance is not good for short arrays (unaligned 512 byte arrays or shorter arrays) because the current vector implementation needs at least 512 bytes.
>>>>
>>>> Bug:
>>>>
>>>> https://bugs.openjdk.java.net/browse/JDK-8216060
>>>>
>>>> I have addressed these 2 issues + some cleanup with the following webrev:
>>>>
>>>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8216060_PPC64_CRC/webrev.00/>
>>>>
>>>> Please review.
>>>>
>>>> Best regards,
>>>>
>>>> Martin
>>>>
>>>
>>
> 


From gromero at linux.vnet.ibm.com  Mon Jan  7 13:52:19 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Mon, 7 Jan 2019 11:52:19 -0200
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used
 by interpreter and be faster for short arrays
In-Reply-To: <d898c13929a44afb82d477fd732d23e7@sap.com>
References: <1c4646d554954551b73c077fa40f983d@sap.com>
 <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>
 <9863276de30643338249ead2a6ac7fe9@sap.com>
 <452dcb69-189e-700d-5995-582ba13669b9@linux.vnet.ibm.com>
 <37d9e6f3b2b4400d8963f54d2fe7767f@sap.com>
 <406db3e3-2ac3-dc16-f384-99a314e62a42@linux.vnet.ibm.com>
 <beff3d359c954a29962be71c40bc235b@sap.com>
 <180a6c0b-7abe-9d5c-51e6-dffbb23570d3@linux.vnet.ibm.com>
 <d898c13929a44afb82d477fd732d23e7@sap.com>
Message-ID: <301fd43a-e5b5-d970-7a1a-2458dbaeec36@linux.vnet.ibm.com>

Hi Martin,

On 01/07/2019 11:49 AM, Doerr, Martin wrote:
> I want to check all places where we use "mr(R1_SP, R21_sender_SP)". There may be more issues with that. I'll probably handle that in a separate change and push this CRC change afterwards.

I see. Thanks for letting me know.

Best regards,
Gustavo

> Best regards,
> Martin
> 
> 
> -----Original Message-----
> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> Sent: Freitag, 4. Januar 2019 19:55
> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
> 
> Hi Martin,
> 
> On 01/04/2019 02:13 PM, Doerr, Martin wrote:
>> Hi Gustavo,
>>
>> when called from the interpreter (the scenario you observed), R21 is set before resizing the frame to avoid wasted stack space (InterpreterMacroAssembler::call_from_interpreter).
> 
> Got it. Thanks a lot for the explanations.
> 
> I think it doesn't currently matter in practice, but I'm wondering if to be
> consistent we should cut back the stack back earlier also in
> TemplateInterpreterGenerator::generate_CRC32_update_entry()?
> 
> diff -r a35f8c35d8c9 src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan 04 10:09:00 2019 +0100
> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan 04 13:44:37 2019 -0500
> @@ -1840,11 +1840,12 @@
>    #endif
>        __ lwz(crc,  2*wordSize, argP);    // Current crc state, zero extend to 64 bit to have a clean register.
>    
> +    // Restore caller sp for c2i case and return.
> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
> +
>        StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
>        __ kernel_crc32_singleByte(crc, data, dataLen, table, tmp, true);
>    
> -    // Restore caller sp for c2i case and return.
> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>        __ blr();
>    
>        // Generate a vanilla native entry as the slow path.
> 
> Currently there is no issue probably because generated code is simpler and does
> no spills.
> 
> Best regards,
> Gustavo
> 
>> When called from compiled methods, R21 is set by a c2i adapter which extends the compiled frame by space for arguments (gen_c2i_adapter).
>>
>> "mr(R1_SP, R21_sender_SP)" is more error-prone than "resize_frame_absolute" so I think the latter would be better (though it takes more registers and instructions), but I don't want to replace that as part of this CRC change.
>>
>> Best regards,
>> Martin
>>
>>
>> -----Original Message-----
>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>> Sent: Freitag, 4. Januar 2019 14:44
>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>
>> Hi Martin,
>>
>> On 01/04/2019 07:30 AM, Doerr, Martin wrote:
>>> thank you very much for confirming. This makes sense. We use different frame headers depending on whether the frame is the top Java frame or not (and on whether it's a debug build or not). Setting R1_SP to sender_SP is a shortcut for leaf calls which relies on having an unmodified stack until this point. So the patch fixes the issue.
>>
>> Glad to help! Thanks for the additional information, I was not aware that the
>> selection of different frame headers could be done at compile time. One last
>> question only for my education: what exactly advanced (incremented) R1_SP so it
>> has to be cut back using sender_SP value, i.e. sender_SP tracks the frame for
>> which function exactly or "who" is the caller exactly here?
>>
>> Thank you.
>>
>> Best regards,
>> Gustavo
>>
>>> New webrev:
>>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/
>>>
>>> Best regards,
>>> Martin
>>>
>>>
>>> -----Original Message-----
>>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>>> Sent: Donnerstag, 3. Januar 2019 19:36
>>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>>
>>> Hi Martin,
>>>
>>> On 01/03/2019 03:34 PM, Doerr, Martin wrote:
>>>> Unfortunately, I can't reproduce the crash. TestCRC32C works stable on our machine (with fastdbg build).
>>>> I guess that the frameless spills mess up the stack. Can you check if the patch below helps?
>>>
>>> Thanks for providing a fix so I can try it.
>>> Yes, I confirm the patch below indeed fixes the sigsegv crash when CRC32C update() method is used.
>>> I also confirm that I don't observe the crash on the fastdebug build, only on the release build.
>>> It also only affects the Interpreter mode, so passing -Xcomp avoids the crash on the release build.
>>>
>>> Just as reference, I can reproduce it on the release build with the following trivial code:
>>>
>>> import java.util.zip.CRC32C;
>>>
>>> class CRC32C_v1 {
>>>       public static void main(String[] arg) {
>>>         byte[] b = new byte[1024];
>>>       
>>>         CRC32C crc32c = new CRC32C();
>>>         crc32c.update(b, 0, b.length);
>>>
>>>         System.out.println(crc32c.getValue());
>>>       }
>>> }
>>>
>>> Thanks for fixing the typos.
>>>
>>>
>>> Best regards,
>>> Gustavo
>>>      
>>>> Best regards,
>>>> Martin
>>>>
>>>>
>>>> diff -r a33f49d5998c src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
>>>> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 17:30:03 2019 +0100
>>>> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 18:33:16 2019 +0100
>>>> @@ -1924,6 +1924,9 @@
>>>>            __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>>>>          }
>>>>
>>>> +    // Restore caller sp for c2i case.
>>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>> +
>>>>          StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
>>>>
>>>>          if (!VM_Version::has_vpmsumb()) {
>>>> @@ -1933,8 +1936,6 @@
>>>>            __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, true);
>>>>          }
>>>>
>>>> -    // Restore caller sp for c2i case and return.
>>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>>          __ blr();
>>>>
>>>>          // Generate a vanilla native entry as the slow path.
>>>> @@ -2014,6 +2015,9 @@
>>>>            __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>>>>          }
>>>>
>>>> +    // Restore caller sp for c2i case.
>>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>> +
>>>>          StubRoutines::ppc64::generate_load_crc32c_table_addr(_masm, table);
>>>>
>>>>          if (!VM_Version::has_vpmsumb()) {
>>>> @@ -2023,8 +2027,6 @@
>>>>            __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, false);
>>>>          }
>>>>
>>>> -    // Restore caller sp for c2i case and return.
>>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>>          __ blr();
>>>>
>>>>          BLOCK_COMMENT("} CRC32C_update{Bytes|DirectByteBuffer}");
>>>>
>>>>
>>>> -----Original Message-----
>>>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>>>> Sent: Donnerstag, 3. Januar 2019 17:13
>>>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>>>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>>>
>>>> Hi Martin,
>>>>
>>>> oh that's nice. You removed the 512-byte block constraint and also wired it up to the Interpreter :)
>>>>
>>>> For the worst case, unaligned 512 byte array, I see the gap to aligned 512 byte array reduced by about ~5.7x.
>>>>
>>>> On the Interpreter I see an improvement of at least 50% for 1024 bytes.
>>>>
>>>> This is all for the CRC32 class.
>>>>
>>>> On CRC32C I'm getting a SIGSEV that can be reproduced running against ./test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java.
>>>>
>>>> I've upload a full log into http://cr.openjdk.java.net/~gromero/logs/crc32c_sigsegv/
>>>>
>>>> I'm leaving for the lunch and I'll take a closer look when back. But probably you will figure it out before I sit to appreciate the meal :)
>>>>
>>>> Finally, since the change does some cleanup, I wonder if it would be worth fixing the following typos:
>>>>
>>>> I think it's Barrett const., not Barret. Probably 'barret' is used in the code as a short version
>>>> for Barrett but it should be changed in
>>>>
>>>> +  // Point to Barret constants
>>>> +  add_const_optimized(cur_const, constants, outer_consts_size + inner_consts_size);
>>>> +
>>>>
>>>> ?
>>>>
>>>> s/not/note/ in:
>>>> cpu/ppc/macroAssembler_ppc.cpp:3977:// A not on the lookup table address(es):
>>>>
>>>> d/lives/ in:
>>>> cpu/ppc/macroAssembler_ppc.cpp:4265:  mtvrwz(VCRC, crc); // crc lives lives in VCRC, now
>>>>
>>>> Best regards,
>>>> Gustavo
>>>>
>>>> On 01/03/2019 12:17 PM, Doerr, Martin wrote:
>>>>> Hi,
>>>>>
>>>>> the JVM on PPC64 currently misses usage of the fast vector implementation in the interpreter code.
>>>>>
>>>>> In addition, performance is not good for short arrays (unaligned 512 byte arrays or shorter arrays) because the current vector implementation needs at least 512 bytes.
>>>>>
>>>>> Bug:
>>>>>
>>>>> https://bugs.openjdk.java.net/browse/JDK-8216060
>>>>>
>>>>> I have addressed these 2 issues + some cleanup with the following webrev:
>>>>>
>>>>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8216060_PPC64_CRC/webrev.00/>
>>>>>
>>>>> Please review.
>>>>>
>>>>> Best regards,
>>>>>
>>>>> Martin
>>>>>
>>>>
>>>
>>
> 


From leo.korinth at oracle.com  Mon Jan  7 14:32:06 2019
From: leo.korinth at oracle.com (Leo Korinth)
Date: Mon, 7 Jan 2019 15:32:06 +0100
Subject: RFR: 8216154: C4819 warnings at HotSpot sources on Windows
In-Reply-To: <3a1836446f59ae6ac89a00dc09867c1d1e97e157.camel@oracle.com>
References: <9b1bd147-a261-95df-18f8-a5476415d0a4@gmail.com>
 <E5EB73E3-26B8-49F9-8A64-545DAC2DE41C@oracle.com>
 <aa401521-7166-996d-0bc3-47095f9ed391@gmail.com>
 <0E131429-5B2F-4A2E-980A-8966354AB7F4@oracle.com>
 <F4A867AA-9BB2-480A-A421-0C05E018ADA7@oracle.com>
 <cc76b240-9088-0a03-91a5-9304325417aa@gmail.com>
 <3a1836446f59ae6ac89a00dc09867c1d1e97e157.camel@oracle.com>
Message-ID: <e998e444-6848-0255-11e0-653b374a24cb@oracle.com>

Hi!

Running: find -name "*.[ch]pp" | xargs file | grep -v ASCII
./src/hotspot/cpu/x86/macroAssembler_x86_sha.cpp: 
                          C source, UTF-8 Unicode text
./src/hotspot/cpu/aarch64/macroAssembler_aarch64_trig.cpp: 
                          C source, UTF-8 Unicode text
./src/hotspot/share/gc/parallel/gcTaskManager.hpp: 
                          data
./src/hotspot/share/code/codeHeapState.cpp: 
                          C source, UTF-8 Unicode text
./src/hotspot/share/oops/method.cpp: 
                                                                   C 
source, UTF-8 Unicode text
./test/hotspot/gtest/utilities/test_json.cpp: 
                                                                   C 
source, UTF-8 Unicode text


The single hpp file seems fine though (just file not understanding that 
it is a source file).

Some questions, as it seems like I am missing something.
1) Should not all of those files be fixed?
2) Why remove warning (in one file, methodMatcher.cpp) instead of 
changing encoding?
3) methodMatcher.cpp seems to be pure ASCII, why the change in that file 
at all?

$ grep --color -P -n "[^[:ascii:]]" is a good way to find the 
problematic line.

Thanks, Leo

On 07/01/2019 14:20, Thomas Schatzl wrote:
> Hi,
> 
> On Mon, 2019-01-07 at 21:36 +0900, Yasumasa Suenaga wrote:
>> Hi Kim,
>>
>> On 2019/01/07 7:18, Kim Barrett wrote:
>>>> On Jan 6, 2019, at 12:54 PM, Kim Barrett <kim.barrett at oracle.com>
>>>> wrote:
>>>>
>>>>> On Jan 6, 2019, at 7:53 AM, Yasumasa Suenaga <
>>>>> yasuenag at gmail.com> wrote:
>>>>>
>>>>> Hi Kim,
>>>>>
>>>>> Thank you for your comment.
>>>>> I uploaded new webrev to use pragma warning push/pop:
>>>>>
>>>>> http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.01/
>>>>>
>>>>>
>>>>> Please review again.
>>>>
>>>> Looks good.
> 
> I tried to verify these problems on these two files as suggested with
> "iconv -f US-ASCII -t UTF8 <file>" which errored out on
> codeHeapState.cpp as expected but there has been no error with
> methodMatcher.cpp. Am I doing something wrong?
> 
> I am fine with that change if it is really needed for successful
> compliation :) I just can't find the non-US-ASCII character used in the
> line indicated by the error message.
> 
>>>
>>> It later occurred to me to wonder whether _WINDOWS was the right
>>> macro to conditionalize on.  All other uses of #pragma warning
>>> push/pop (there are 5 in HotSpot) use _MSC_VER.
>>
>> I updated webrev to use _MSC_VER. Is it ok?
>>
>>     http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.02/
>>
> 
> Please add a "// warning C4189: The file contains a character that
> cannot be represented in the current code page" comment above or next
> to the pragma warning(disable) declaration.
> 
> Not many people know the VC warning numbers by default...
> 
> Looks good otherwise, I do not need a re-review for this comment
> change.
> 
> Thanks,
>    Thomas
> 
> 

From yasuenag at gmail.com  Mon Jan  7 14:38:42 2019
From: yasuenag at gmail.com (Yasumasa Suenaga)
Date: Mon, 7 Jan 2019 23:38:42 +0900
Subject: RFR: 8216154: C4819 warnings at HotSpot sources on Windows
In-Reply-To: <3a1836446f59ae6ac89a00dc09867c1d1e97e157.camel@oracle.com>
References: <9b1bd147-a261-95df-18f8-a5476415d0a4@gmail.com>
 <E5EB73E3-26B8-49F9-8A64-545DAC2DE41C@oracle.com>
 <aa401521-7166-996d-0bc3-47095f9ed391@gmail.com>
 <0E131429-5B2F-4A2E-980A-8966354AB7F4@oracle.com>
 <F4A867AA-9BB2-480A-A421-0C05E018ADA7@oracle.com>
 <cc76b240-9088-0a03-91a5-9304325417aa@gmail.com>
 <3a1836446f59ae6ac89a00dc09867c1d1e97e157.camel@oracle.com>
Message-ID: <97640e53-a344-636d-1005-b62bb364aaa7@gmail.com>

Hi Thomas,

On 2019/01/07 22:20, Thomas Schatzl wrote:
> Hi,
> 
> On Mon, 2019-01-07 at 21:36 +0900, Yasumasa Suenaga wrote:
>> Hi Kim,
>>
>> On 2019/01/07 7:18, Kim Barrett wrote:
>>>> On Jan 6, 2019, at 12:54 PM, Kim Barrett <kim.barrett at oracle.com>
>>>> wrote:
>>>>
>>>>> On Jan 6, 2019, at 7:53 AM, Yasumasa Suenaga <
>>>>> yasuenag at gmail.com> wrote:
>>>>>
>>>>> Hi Kim,
>>>>>
>>>>> Thank you for your comment.
>>>>> I uploaded new webrev to use pragma warning push/pop:
>>>>>
>>>>> http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.01/
>>>>>
>>>>>
>>>>> Please review again.
>>>>
>>>> Looks good.
> 
> I tried to verify these problems on these two files as suggested with
> "iconv -f US-ASCII -t UTF8 <file>" which errored out on
> codeHeapState.cpp as expected but there has been no error with
> methodMatcher.cpp. Am I doing something wrong?

Sorry, it's my mistake.
But the error will occur in its file because `RANGE0` which contains non-ASCII
characters passes to sscanf().


> I am fine with that change if it is really needed for successful
> compliation :) I just can't find the non-US-ASCII character used in the
> line indicated by the error message.
> 
>>>
>>> It later occurred to me to wonder whether _WINDOWS was the right
>>> macro to conditionalize on.  All other uses of #pragma warning
>>> push/pop (there are 5 in HotSpot) use _MSC_VER.
>>
>> I updated webrev to use _MSC_VER. Is it ok?
>>
>>     http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.02/
>>
> 
> Please add a "// warning C4189: The file contains a character that
> cannot be represented in the current code page" comment above or next
> to the pragma warning(disable) declaration.
> 
> Not many people know the VC warning numbers by default...
> 
> Looks good otherwise, I do not need a re-review for this comment
> change.

Ok, I will add the comment.


Thanks,

Yasumasa


> Thanks,
>    Thomas
> 
> 

From yasuenag at gmail.com  Mon Jan  7 14:53:31 2019
From: yasuenag at gmail.com (Yasumasa Suenaga)
Date: Mon, 7 Jan 2019 23:53:31 +0900
Subject: RFR: 8216154: C4819 warnings at HotSpot sources on Windows
In-Reply-To: <e998e444-6848-0255-11e0-653b374a24cb@oracle.com>
References: <9b1bd147-a261-95df-18f8-a5476415d0a4@gmail.com>
 <E5EB73E3-26B8-49F9-8A64-545DAC2DE41C@oracle.com>
 <aa401521-7166-996d-0bc3-47095f9ed391@gmail.com>
 <0E131429-5B2F-4A2E-980A-8966354AB7F4@oracle.com>
 <F4A867AA-9BB2-480A-A421-0C05E018ADA7@oracle.com>
 <cc76b240-9088-0a03-91a5-9304325417aa@gmail.com>
 <3a1836446f59ae6ac89a00dc09867c1d1e97e157.camel@oracle.com>
 <e998e444-6848-0255-11e0-653b374a24cb@oracle.com>
Message-ID: <ba49f0b3-d684-e76c-2078-10181a1efd42@gmail.com>

Hi Leo,

> 1) Should not all of those files be fixed?


> ./src/hotspot/cpu/x86/macroAssembler_x86_sha.cpp:                          C source, UTF-8 Unicode text
> ./src/hotspot/share/oops/method.cpp:                                                                   C source, UTF-8 Unicode text

Non-ASCII character(s) in comment line.


> ./src/hotspot/cpu/aarch64/macroAssembler_aarch64_trig.cpp:                          C source, UTF-8 Unicode text

It's not used for Windows (AArch64).


> ./src/hotspot/share/gc/parallel/gcTaskManager.hpp:                          data

I couldn't find why `file` detects it as "data". So I have no idea for it.


> ./src/hotspot/share/code/codeHeapState.cpp:                          C source, UTF-8 Unicode text

It's ASCII file on my laptop :-)


> ./test/hotspot/gtest/utilities/test_json.cpp:                                                                   C source, UTF-8 Unicode text

It's test code.


> 2) Why remove warning (in one file, methodMatcher.cpp) instead of changing encoding?
> 3) methodMatcher.cpp seems to be pure ASCII, why the change in that file at all?

The error occurs about `RANGE0`. It has binary data, so it might not be able to change encoding.


Thanks,

Yasumasa


On 2019/01/07 23:32, Leo Korinth wrote:
> Hi!
> 
> Running: find -name "*.[ch]pp" | xargs file | grep -v ASCII
> ./src/hotspot/cpu/x86/macroAssembler_x86_sha.cpp: ???????????????????????? C source, UTF-8 Unicode text
> ./src/hotspot/cpu/aarch64/macroAssembler_aarch64_trig.cpp: ???????????????????????? C source, UTF-8 Unicode text
> ./src/hotspot/share/gc/parallel/gcTaskManager.hpp: ???????????????????????? data
> ./src/hotspot/share/code/codeHeapState.cpp: ???????????????????????? C source, UTF-8 Unicode text
> ./src/hotspot/share/oops/method.cpp: ????????????????????????????????????????????????????????????????? C source, UTF-8 Unicode text
> ./test/hotspot/gtest/utilities/test_json.cpp: ????????????????????????????????????????????????????????????????? C source, UTF-8 Unicode text
> 
> 
> The single hpp file seems fine though (just file not understanding that it is a source file).
> 
> Some questions, as it seems like I am missing something.
> 1) Should not all of those files be fixed?
> 2) Why remove warning (in one file, methodMatcher.cpp) instead of changing encoding?
> 3) methodMatcher.cpp seems to be pure ASCII, why the change in that file at all?
> 
> $ grep --color -P -n "[^[:ascii:]]" is a good way to find the problematic line.
> 
> Thanks, Leo
> 
> On 07/01/2019 14:20, Thomas Schatzl wrote:
>> Hi,
>>
>> On Mon, 2019-01-07 at 21:36 +0900, Yasumasa Suenaga wrote:
>>> Hi Kim,
>>>
>>> On 2019/01/07 7:18, Kim Barrett wrote:
>>>>> On Jan 6, 2019, at 12:54 PM, Kim Barrett <kim.barrett at oracle.com>
>>>>> wrote:
>>>>>
>>>>>> On Jan 6, 2019, at 7:53 AM, Yasumasa Suenaga <
>>>>>> yasuenag at gmail.com> wrote:
>>>>>>
>>>>>> Hi Kim,
>>>>>>
>>>>>> Thank you for your comment.
>>>>>> I uploaded new webrev to use pragma warning push/pop:
>>>>>>
>>>>>> http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.01/
>>>>>>
>>>>>>
>>>>>> Please review again.
>>>>>
>>>>> Looks good.
>>
>> I tried to verify these problems on these two files as suggested with
>> "iconv -f US-ASCII -t UTF8 <file>" which errored out on
>> codeHeapState.cpp as expected but there has been no error with
>> methodMatcher.cpp. Am I doing something wrong?
>>
>> I am fine with that change if it is really needed for successful
>> compliation :) I just can't find the non-US-ASCII character used in the
>> line indicated by the error message.
>>
>>>>
>>>> It later occurred to me to wonder whether _WINDOWS was the right
>>>> macro to conditionalize on.? All other uses of #pragma warning
>>>> push/pop (there are 5 in HotSpot) use _MSC_VER.
>>>
>>> I updated webrev to use _MSC_VER. Is it ok?
>>>
>>> ??? http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.02/
>>>
>>
>> Please add a "// warning C4189: The file contains a character that
>> cannot be represented in the current code page" comment above or next
>> to the pragma warning(disable) declaration.
>>
>> Not many people know the VC warning numbers by default...
>>
>> Looks good otherwise, I do not need a re-review for this comment
>> change.
>>
>> Thanks,
>> ?? Thomas
>>
>>

From zgu at redhat.com  Mon Jan  7 15:38:12 2019
From: zgu at redhat.com (zgu at redhat.com)
Date: Mon, 07 Jan 2019 10:38:12 -0500
Subject: RFR(T) 8216199: Local variable arg defined but never used in
 BCEscapeAnalyzer::compute_escape_for_intrinsic()
Message-ID: <1546875492.3477.36.camel@redhat.com>

Please review this trivial change to remove unused local variable.

Bug: https://bugs.openjdk.java.net/browse/JDK-8216199
Webrev: http://cr.openjdk.java.net/~zgu/JDK-8216199/webrev.00/

Test:

  hotspot_compiler on Linux 64 (fastdebug and release)

Thanks,

-Zhengyu

From tobias.hartmann at oracle.com  Mon Jan  7 15:45:06 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 7 Jan 2019 16:45:06 +0100
Subject: RFR(T) 8216199: Local variable arg defined but never used in
 BCEscapeAnalyzer::compute_escape_for_intrinsic()
In-Reply-To: <1546875492.3477.36.camel@redhat.com>
References: <1546875492.3477.36.camel@redhat.com>
Message-ID: <d9c0c263-47b3-c69f-365b-197d2b636479@oracle.com>

Hi Zhengyu,

looks good and trivial to me.

Thanks,
Tobias

On 07.01.19 16:38, zgu at redhat.com wrote:
> Please review this trivial change to remove unused local variable.
> 
> Bug: https://bugs.openjdk.java.net/browse/JDK-8216199
> Webrev: http://cr.openjdk.java.net/~zgu/JDK-8216199/webrev.00/
> 
> Test:
> 
>   hotspot_compiler on Linux 64 (fastdebug and release)
> 
> Thanks,
> 
> -Zhengyu
> 

From zgu at redhat.com  Mon Jan  7 15:59:23 2019
From: zgu at redhat.com (zgu at redhat.com)
Date: Mon, 07 Jan 2019 10:59:23 -0500
Subject: RFR(T) 8216200: BCEscapeAnalyzer::ArgumentMap::set_intersect() is
 incorrect
Message-ID: <1546876763.3477.43.camel@redhat.com>

Please review this trivial change that removes unused/incorrect method.

BCEscapeAnalyzer::ArgumentMap::set_intersect()'s implementation is
wrong. The reason that it did not blowup anything, is that it does not
have users. 

We can fix it or remove it: based on Tobias' comment in bug, let's
simply remove it. 


Bug: https://bugs.openjdk.java.net/browse/JDK-8216200
Webrev: http://cr.openjdk.java.net/~zgu/JDK-8216200/webrev.00/

Test:

  hotspot_compiler on Linux 64.

Thanks,

-Zhengyu

From tobias.hartmann at oracle.com  Mon Jan  7 16:01:19 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 7 Jan 2019 17:01:19 +0100
Subject: RFR(T) 8216200: BCEscapeAnalyzer::ArgumentMap::set_intersect() is
 incorrect
In-Reply-To: <1546876763.3477.43.camel@redhat.com>
References: <1546876763.3477.43.camel@redhat.com>
Message-ID: <d8433230-dfcd-a6bf-cfb2-8c0d1a5beedd@oracle.com>

Hi Zhengyu,

looks good and trivial to me.

Best regards,
Tobias

On 07.01.19 16:59, zgu at redhat.com wrote:
> Please review this trivial change that removes unused/incorrect method.
> 
> BCEscapeAnalyzer::ArgumentMap::set_intersect()'s implementation is
> wrong. The reason that it did not blowup anything, is that it does not
> have users. 
> 
> We can fix it or remove it: based on Tobias' comment in bug, let's
> simply remove it. 
> 
> 
> Bug: https://bugs.openjdk.java.net/browse/JDK-8216200
> Webrev: http://cr.openjdk.java.net/~zgu/JDK-8216200/webrev.00/
> 
> Test:
> 
>   hotspot_compiler on Linux 64.
> 
> Thanks,
> 
> -Zhengyu
> 

From zgu at redhat.com  Mon Jan  7 16:03:46 2019
From: zgu at redhat.com (zgu at redhat.com)
Date: Mon, 07 Jan 2019 11:03:46 -0500
Subject: RFR(T) 8216200: BCEscapeAnalyzer::ArgumentMap::set_intersect()
 is incorrect
In-Reply-To: <d8433230-dfcd-a6bf-cfb2-8c0d1a5beedd@oracle.com>
References: <1546876763.3477.43.camel@redhat.com>
 <d8433230-dfcd-a6bf-cfb2-8c0d1a5beedd@oracle.com>
Message-ID: <1546877026.3477.44.camel@redhat.com>

Thanks for the quick review, Tobias.

-Zhengyu

On Mon, 2019-01-07 at 17:01 +0100, Tobias Hartmann wrote:
> Hi Zhengyu,
> 
> looks good and trivial to me.
> 
> Best regards,
> Tobias
> 
> On 07.01.19 16:59, zgu at redhat.com wrote:
> > Please review this trivial change that removes unused/incorrect
> > method.
> > 
> > BCEscapeAnalyzer::ArgumentMap::set_intersect()'s implementation is
> > wrong. The reason that it did not blowup anything, is that it does
> > not
> > have users. 
> > 
> > We can fix it or remove it: based on Tobias' comment in bug, let's
> > simply remove it. 
> > 
> > 
> > Bug: https://bugs.openjdk.java.net/browse/JDK-8216200
> > Webrev: http://cr.openjdk.java.net/~zgu/JDK-8216200/webrev.00/
> > 
> > Test:
> > 
> >   hotspot_compiler on Linux 64.
> > 
> > Thanks,
> > 
> > -Zhengyu
> > 

From vladimir.kozlov at oracle.com  Mon Jan  7 16:02:55 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 7 Jan 2019 08:02:55 -0800
Subject: RFR(S): 8214862: assert(proj != __null) at compile.cpp:3251
In-Reply-To: <878t0thi36.fsf@redhat.com>
References: <87d0qfhtyo.fsf@redhat.com>
 <3167fa2c-a9bd-ae8d-a084-7a09275b35e1@oracle.com> <874lbpish9.fsf@redhat.com>
 <19387ab7-2dbc-61ed-5722-ff5ecbcc3b51@oracle.com> <878t0thi36.fsf@redhat.com>
Message-ID: <1632cec4-c3a1-239a-9f95-3b991c68ff97@oracle.com>

On 12/13/18 1:49 AM, Roland Westrelin wrote:
> 
>> Do you hit next bailout (with fix)?:
>>
>> http://hg.openjdk.java.net/jdk/jdk/file/24525070d934/src/hotspot/share/opto/compile.cpp#l3669
> 
> yes.
> 
>> Is fall-through path eliminated because it is not reachable from Root because of infinite loop?
> 
> yes.
> 
>> I think we should detect infinite loop very early, after first PhaseRemoveUseless. Or may be just before or during
>> PhaseRemoveUseless when we still have path.
> 
> Isn't there a chance that the path that leads to the infinite loop can
> be optimized out during optimizations so bailing out early could cause a
> valid method to never be compiled?

It could be.  It seems your solution is simplest one. I agree with it.

Thanks,
vladimir

> 
>> What happens if a method has *only* infinite loop? In which phase we detect it and bailout?
> 
> This:
> 
> private static void test() {
>      while (true);
> }
> 
> bails out in:
> 
> bool Compile::final_graph_reshaping() {
>    // an infinite loop may have been eliminated by the optimizer,
>    // in which case the graph will be empty.
>    if (root()->req() == 1) {
>      record_method_not_compilable("trivial infinite loop");
>      return true;
>    }
> 
> 
> Roland.
> 

From leo.korinth at oracle.com  Mon Jan  7 16:46:26 2019
From: leo.korinth at oracle.com (Leo Korinth)
Date: Mon, 7 Jan 2019 17:46:26 +0100
Subject: RFR: 8216154: C4819 warnings at HotSpot sources on Windows
In-Reply-To: <ba49f0b3-d684-e76c-2078-10181a1efd42@gmail.com>
References: <9b1bd147-a261-95df-18f8-a5476415d0a4@gmail.com>
 <E5EB73E3-26B8-49F9-8A64-545DAC2DE41C@oracle.com>
 <aa401521-7166-996d-0bc3-47095f9ed391@gmail.com>
 <0E131429-5B2F-4A2E-980A-8966354AB7F4@oracle.com>
 <F4A867AA-9BB2-480A-A421-0C05E018ADA7@oracle.com>
 <cc76b240-9088-0a03-91a5-9304325417aa@gmail.com>
 <3a1836446f59ae6ac89a00dc09867c1d1e97e157.camel@oracle.com>
 <e998e444-6848-0255-11e0-653b374a24cb@oracle.com>
 <ba49f0b3-d684-e76c-2078-10181a1efd42@gmail.com>
Message-ID: <b7feaac0-aa50-8392-6eac-dec79ccf8a1a@oracle.com>

Hi!

On 07/01/2019 15:53, Yasumasa Suenaga wrote:
> Hi Leo,
> 
>> 1) Should not all of those files be fixed?
> 
> 
>> ./src/hotspot/cpu/x86/macroAssembler_x86_sha.cpp:                          
>> C source, UTF-8 Unicode text
>> ./src/hotspot/share/oops/method.cpp:                                                                   
>> C source, UTF-8 Unicode text
> 
> Non-ASCII character(s) in comment line.
> 
> 
>> ./src/hotspot/cpu/aarch64/macroAssembler_aarch64_trig.cpp:                          
>> C source, UTF-8 Unicode text
> 
> It's not used for Windows (AArch64).
> 
> 
>> ./src/hotspot/share/gc/parallel/gcTaskManager.hpp:                          
>> data
> 
> I couldn't find why `file` detects it as "data". So I have no idea for it.
> 
> 
>> ./src/hotspot/share/code/codeHeapState.cpp:????????????????????????? C 
>> source, UTF-8 Unicode text
> 
> It's ASCII file on my laptop :-)


Check line 1979:

00015f20: 7468 6579 2068 6176 6520 6e6f 2063 6f6d  they have no com
00015f30: 7069 6c61 7469 6f6e c2a0 4944 2061 7373  pilation..ID ass

c2a0 is not ASCII but Latin-1 Supplement (NO-BREAK SPACE) and is not in 
a comment.

I do not have a windows environment up and running. The compiler seems 
to warn in strange places, and _not_ warn in others. I can not get a 
confirmation in the style guide that we are to use only ASCII, so please 
ignore my questions as I do not know what to recommend.

Thanks,
Leo

> 
> 
>> ./test/hotspot/gtest/utilities/test_json.cpp:                                                                   
>> C source, UTF-8 Unicode text
> 
> It's test code.
> 
> 
>> 2) Why remove warning (in one file, methodMatcher.cpp) instead of 
>> changing encoding?
>> 3) methodMatcher.cpp seems to be pure ASCII, why the change in that 
>> file at all?
> 
> The error occurs about `RANGE0`. It has binary data, so it might not be 
> able to change encoding. >
> Thanks,
> 
> Yasumasa
> 
> 
> On 2019/01/07 23:32, Leo Korinth wrote:
>> Hi!
>>
>> Running: find -name "*.[ch]pp" | xargs file | grep -v ASCII
>> ./src/hotspot/cpu/x86/macroAssembler_x86_sha.cpp: 
>> ???????????????????????? C source, UTF-8 Unicode text
>> ./src/hotspot/cpu/aarch64/macroAssembler_aarch64_trig.cpp: 
>> ???????????????????????? C source, UTF-8 Unicode text
>> ./src/hotspot/share/gc/parallel/gcTaskManager.hpp: 
>> ???????????????????????? data
>> ./src/hotspot/share/code/codeHeapState.cpp: ???????????????????????? C 
>> source, UTF-8 Unicode text
>> ./src/hotspot/share/oops/method.cpp: 
>> ????????????????????????????????????????????????????????????????? C 
>> source, UTF-8 Unicode text
>> ./test/hotspot/gtest/utilities/test_json.cpp: 
>> ????????????????????????????????????????????????????????????????? C 
>> source, UTF-8 Unicode text
>>
>>
>> The single hpp file seems fine though (just file not understanding 
>> that it is a source file).
>>
>> Some questions, as it seems like I am missing something.
>> 1) Should not all of those files be fixed?
>> 2) Why remove warning (in one file, methodMatcher.cpp) instead of 
>> changing encoding?
>> 3) methodMatcher.cpp seems to be pure ASCII, why the change in that 
>> file at all?
>>
>> $ grep --color -P -n "[^[:ascii:]]" is a good way to find the 
>> problematic line.
>>
>> Thanks, Leo
>>
>> On 07/01/2019 14:20, Thomas Schatzl wrote:
>>> Hi,
>>>
>>> On Mon, 2019-01-07 at 21:36 +0900, Yasumasa Suenaga wrote:
>>>> Hi Kim,
>>>>
>>>> On 2019/01/07 7:18, Kim Barrett wrote:
>>>>>> On Jan 6, 2019, at 12:54 PM, Kim Barrett <kim.barrett at oracle.com>
>>>>>> wrote:
>>>>>>
>>>>>>> On Jan 6, 2019, at 7:53 AM, Yasumasa Suenaga <
>>>>>>> yasuenag at gmail.com> wrote:
>>>>>>>
>>>>>>> Hi Kim,
>>>>>>>
>>>>>>> Thank you for your comment.
>>>>>>> I uploaded new webrev to use pragma warning push/pop:
>>>>>>>
>>>>>>> http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.01/
>>>>>>>
>>>>>>>
>>>>>>> Please review again.
>>>>>>
>>>>>> Looks good.
>>>
>>> I tried to verify these problems on these two files as suggested with
>>> "iconv -f US-ASCII -t UTF8 <file>" which errored out on
>>> codeHeapState.cpp as expected but there has been no error with
>>> methodMatcher.cpp. Am I doing something wrong?
>>>
>>> I am fine with that change if it is really needed for successful
>>> compliation :) I just can't find the non-US-ASCII character used in the
>>> line indicated by the error message.
>>>
>>>>>
>>>>> It later occurred to me to wonder whether _WINDOWS was the right
>>>>> macro to conditionalize on.? All other uses of #pragma warning
>>>>> push/pop (there are 5 in HotSpot) use _MSC_VER.
>>>>
>>>> I updated webrev to use _MSC_VER. Is it ok?
>>>>
>>>> ??? http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.02/
>>>>
>>>
>>> Please add a "// warning C4189: The file contains a character that
>>> cannot be represented in the current code page" comment above or next
>>> to the pragma warning(disable) declaration.
>>>
>>> Not many people know the VC warning numbers by default...
>>>
>>> Looks good otherwise, I do not need a re-review for this comment
>>> change.
>>>
>>> Thanks,
>>> ?? Thomas
>>>
>>>

From sandhya.viswanathan at intel.com  Mon Jan  7 20:44:17 2019
From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya)
Date: Mon, 7 Jan 2019 20:44:17 +0000
Subject: RFR (XS): 8216290: Backport of 8215888 to JDK11u (Register to
 register spill may use AVX 512 move instruction on unsupported platform)
Message-ID: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A4AC0F@FMSMSX126.amr.corp.intel.com>

This is a request to backport the 8215888<https://bugs.openjdk.java.net/browse/JDK-8215888> fix to JDK11u. The fix has been in JDK 12 branch for a couple of days now and passed nightly testing.

The backport bug request is at:
https://bugs.openjdk.java.net/browse/JDK-8216290

The backport webrev is at:
http://cr.openjdk.java.net/~sviswanathan/8216290/webrev.00/

Best Regards,
Sandhya

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190107/5bfa9fbe/attachment-0001.html>

From kim.barrett at oracle.com  Mon Jan  7 20:50:54 2019
From: kim.barrett at oracle.com (Kim Barrett)
Date: Mon, 7 Jan 2019 15:50:54 -0500
Subject: RFR: 8216154: C4819 warnings at HotSpot sources on Windows
In-Reply-To: <3a1836446f59ae6ac89a00dc09867c1d1e97e157.camel@oracle.com>
References: <9b1bd147-a261-95df-18f8-a5476415d0a4@gmail.com>
 <E5EB73E3-26B8-49F9-8A64-545DAC2DE41C@oracle.com>
 <aa401521-7166-996d-0bc3-47095f9ed391@gmail.com>
 <0E131429-5B2F-4A2E-980A-8966354AB7F4@oracle.com>
 <F4A867AA-9BB2-480A-A421-0C05E018ADA7@oracle.com>
 <cc76b240-9088-0a03-91a5-9304325417aa@gmail.com>
 <3a1836446f59ae6ac89a00dc09867c1d1e97e157.camel@oracle.com>
Message-ID: <884D1538-A96A-41D3-8039-103C99BA6A30@oracle.com>

> On Jan 7, 2019, at 8:20 AM, Thomas Schatzl <thomas.schatzl at oracle.com> wrote:
> 
> Hi,
> 
> On Mon, 2019-01-07 at 21:36 +0900, Yasumasa Suenaga wrote:
>> Hi Kim,
>> 
>> On 2019/01/07 7:18, Kim Barrett wrote:
>>>> On Jan 6, 2019, at 12:54 PM, Kim Barrett <kim.barrett at oracle.com>
>>>> wrote:
>>>> 
>>>>> On Jan 6, 2019, at 7:53 AM, Yasumasa Suenaga <
>>>>> yasuenag at gmail.com> wrote:
>>>>> 
>>>>> Hi Kim,
>>>>> 
>>>>> Thank you for your comment.
>>>>> I uploaded new webrev to use pragma warning push/pop:
>>>>> 
>>>>> http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.01/
>>>>> 
>>>>> 
>>>>> Please review again.
>>>> 
>>>> Looks good.
> 
> I tried to verify these problems on these two files as suggested with
> "iconv -f US-ASCII -t UTF8 <file>" which errored out on
> codeHeapState.cpp as expected but there has been no error with
> methodMatcher.cpp. Am I doing something wrong?
> 
> I am fine with that change if it is really needed for successful
> compliation :) I just can't find the non-US-ASCII character used in the
> line indicated by the error message.

The problem is in RANGEBASE, which is referenced directly and appended
with other strings into RANGE0 and RANGESLASH, all of which are only
referenced in parse_method_pattern.

RANGEBASE is strings of '\xXX' encoded characters.  At the source
level this is all fine.  Even after preprocessing it should all be
fine, as the string/char encoding reduction doesn't happen until
translation phase 5, e.g. after preprocessing.

But during that encoding reduction the compiler is noticing that some
of the encodings (or sequence thereof?) don't map to valid characters
in the currently selected code page for the OS (Japanese in Yasumasa's
case).  So it complains.  It's kind of an annoying complaint, for
several reasons, but oh well.

It is because the warning is triggered by that encoding reduction
during compilation, rather than by literal characters in the source
code, that nothing problematic shows up with iconv/grep/&etc.


From kim.barrett at oracle.com  Mon Jan  7 20:54:42 2019
From: kim.barrett at oracle.com (Kim Barrett)
Date: Mon, 7 Jan 2019 15:54:42 -0500
Subject: RFR: 8216154: C4819 warnings at HotSpot sources on Windows
In-Reply-To: <cc76b240-9088-0a03-91a5-9304325417aa@gmail.com>
References: <9b1bd147-a261-95df-18f8-a5476415d0a4@gmail.com>
 <E5EB73E3-26B8-49F9-8A64-545DAC2DE41C@oracle.com>
 <aa401521-7166-996d-0bc3-47095f9ed391@gmail.com>
 <0E131429-5B2F-4A2E-980A-8966354AB7F4@oracle.com>
 <F4A867AA-9BB2-480A-A421-0C05E018ADA7@oracle.com>
 <cc76b240-9088-0a03-91a5-9304325417aa@gmail.com>
Message-ID: <34070646-9304-4EBE-BB9D-0FE5E6BFF2FF@oracle.com>

> On Jan 7, 2019, at 7:36 AM, Yasumasa Suenaga <yasuenag at gmail.com> wrote:
> 
> Hi Kim,
> 
> On 2019/01/07 7:18, Kim Barrett wrote:
>>> On Jan 6, 2019, at 12:54 PM, Kim Barrett <kim.barrett at oracle.com> wrote:
>>> 
>>>> On Jan 6, 2019, at 7:53 AM, Yasumasa Suenaga <yasuenag at gmail.com> wrote:
>>>> 
>>>> Hi Kim,
>>>> 
>>>> Thank you for your comment.
>>>> I uploaded new webrev to use pragma warning push/pop:
>>>> 
>>>> http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.01/
>>>> 
>>>> 
>>>> Please review again.
>>> 
>>> Looks good.
>> It later occurred to me to wonder whether _WINDOWS was the right macro to conditionalize
>> on.  All other uses of #pragma warning push/pop (there are 5 in HotSpot) use _MSC_VER.
> 
> I updated webrev to use _MSC_VER. Is it ok?
> 
>  http://cr.openjdk.java.net/~ysuenaga/JDK-8216154/webrev.02/

Thanks for doing that.  I don?t know that _WINDOWS was actually wrong, or that _MSC_VER is
actually better, but it seems better to be consistent about it.  And sorry for not noticing earlier.

Looks good.


From david.holmes at oracle.com  Mon Jan  7 21:21:18 2019
From: david.holmes at oracle.com (David Holmes)
Date: Tue, 8 Jan 2019 07:21:18 +1000
Subject: RFR: 8216262: Remove develop flag DelayCompilationDuringStartup
In-Reply-To: <818084dc-3e98-97da-20f4-aa00f3f6545e@oracle.com>
References: <64522ad1-1473-8012-b63a-5309bc72c5cc@oracle.com>
 <9a9f466b-5ffc-06de-9fde-6a8ef78622ee@oracle.com>
 <2708906c-c27e-4514-3066-0f2a86fbae9e@oracle.com>
 <818084dc-3e98-97da-20f4-aa00f3f6545e@oracle.com>
Message-ID: <2c837971-967c-285c-f85d-a88cfe82aac3@oracle.com>

On 7/01/2019 11:31 pm, Claes Redestad wrote:
> 
> 
> On 2019-01-07 14:01, Claes Redestad wrote:
>>>
>>> Normally we would follow a staged removal process: deprecate, 
>>> obsolete, then expire - see arguments.cpp and special_jvm_flags 
>>> table. In this case we can probably start at obsoletion, but that 
>>> would leave expiration for JDK 14. Or compiler folk can argue for / 
>>> justify immediate full expiration/removal.
>>
>> I'm under the impression this process does not apply to develop flags
>> (which are not visible an anything but debug builds)?
> 
> We've removed develop flags without obsoletion + expiry many times in
> the past[1], and while this goes against the written down expiration
> in arguments.cpp, I believe it to be a misguided recommendation for
> develop flags.

There have been and still can be exceptions depending on the actual flag 
but the general guideline is:

  * To remove internal options (e.g. diagnostic, experimental, develop 
options), use
  * a 2-step model adding major release numbers to the obsolete and 
expire columns.

Compiler folk can identify whether this flag can be expired immediately.

Thanks,
David

> /Claes
> 
> [1]
> https://bugs.openjdk.java.net/browse/JDK-8191870
> https://bugs.openjdk.java.net/browse/JDK-8132318
> https://bugs.openjdk.java.net/browse/JDK-8186042
> https://bugs.openjdk.java.net/browse/JDK-8180423
> https://bugs.openjdk.java.net/browse/JDK-8058259

From eric.caspole at oracle.com  Mon Jan  7 22:47:00 2019
From: eric.caspole at oracle.com (Eric Caspole)
Date: Mon, 7 Jan 2019 17:47:00 -0500
Subject: RFR: 8076988: reevaluate trivial method policy
Message-ID: <f0fc6d61-c5a4-d293-6ab8-4fbddcb88965@oracle.com>

Hi everyone,
Could I get reviews or comments for a fix/simplification of the trivial 
method policy. I have an internal benchmark where a very hot "trivial" 
method gets compiled at level 1 and it leads to a ~9% regression 
compared to getting compiled with C2 level 4. Others have expressed 
thoughts that this policy might now not as useful as originally 
intended. I have run performance testing of throughput and startup time 
with no noticeable regressions.

This webrev passed regular tier1 and tier 2 testing.
Thanks,
Eric


JBS:
https://bugs.openjdk.java.net/browse/JDK-8076988

Webrev:
http://cr.openjdk.java.net/~ecaspole/JDK-8076988/01/webrev/

From shade at redhat.com  Mon Jan  7 22:51:14 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Mon, 7 Jan 2019 23:51:14 +0100
Subject: RFR: 8076988: reevaluate trivial method policy
In-Reply-To: <f0fc6d61-c5a4-d293-6ab8-4fbddcb88965@oracle.com>
References: <f0fc6d61-c5a4-d293-6ab8-4fbddcb88965@oracle.com>
Message-ID: <245f8a33-56f4-fb63-a592-014db7d3db82@redhat.com>

On 1/7/19 11:47 PM, Eric Caspole wrote:
> JBS:
> https://bugs.openjdk.java.net/browse/JDK-8076988
> 
> Webrev:
> http://cr.openjdk.java.net/~ecaspole/JDK-8076988/01/webrev/

I like it. Accessors and constant getters may also compile better with C2, especially with advanced
GCs, but that might not be as significant.

-Aleksey

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 819 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190107/928b286e/signature-0001.asc>

From claes.redestad at oracle.com  Mon Jan  7 23:13:26 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Tue, 8 Jan 2019 00:13:26 +0100
Subject: RFR: 8216262: Remove develop flag DelayCompilationDuringStartup
In-Reply-To: <2c837971-967c-285c-f85d-a88cfe82aac3@oracle.com>
References: <64522ad1-1473-8012-b63a-5309bc72c5cc@oracle.com>
 <9a9f466b-5ffc-06de-9fde-6a8ef78622ee@oracle.com>
 <2708906c-c27e-4514-3066-0f2a86fbae9e@oracle.com>
 <818084dc-3e98-97da-20f4-aa00f3f6545e@oracle.com>
 <2c837971-967c-285c-f85d-a88cfe82aac3@oracle.com>
Message-ID: <cdc5b0bd-b1a3-265e-2cb4-bfcad8a72299@oracle.com>

On 2019-01-07 22:21, David Holmes wrote:
>>>
>>> I'm under the impression this process does not apply to develop flags
>>> (which are not visible an anything but debug builds)?
>>
>> We've removed develop flags without obsoletion + expiry many times in
>> the past[1], and while this goes against the written down expiration
>> in arguments.cpp, I believe it to be a misguided recommendation for
>> develop flags.
>
> There have been and still can be exceptions depending on the actual 
> flag but the general guideline is:
>
> ?* To remove internal options (e.g. diagnostic, experimental, develop 
> options), use
> ?* a 2-step model adding major release numbers to the obsolete and 
> expire columns.
>
> Compiler folk can identify whether this flag can be expired immediately.

To me it seems to be the general rule rather than an exception lately, 
and see no point in
sticking to that recommendation. I've filed 
https://bugs.openjdk.java.net/browse/JDK-8216311
to drop develop flags from that recommendation.

/Claes


From david.holmes at oracle.com  Tue Jan  8 01:27:47 2019
From: david.holmes at oracle.com (David Holmes)
Date: Tue, 8 Jan 2019 11:27:47 +1000
Subject: RFR: 8216262: Remove develop flag DelayCompilationDuringStartup
In-Reply-To: <cdc5b0bd-b1a3-265e-2cb4-bfcad8a72299@oracle.com>
References: <64522ad1-1473-8012-b63a-5309bc72c5cc@oracle.com>
 <9a9f466b-5ffc-06de-9fde-6a8ef78622ee@oracle.com>
 <2708906c-c27e-4514-3066-0f2a86fbae9e@oracle.com>
 <818084dc-3e98-97da-20f4-aa00f3f6545e@oracle.com>
 <2c837971-967c-285c-f85d-a88cfe82aac3@oracle.com>
 <cdc5b0bd-b1a3-265e-2cb4-bfcad8a72299@oracle.com>
Message-ID: <3a286b71-c723-0cf3-47d5-e663963d9b11@oracle.com>

On 8/01/2019 9:13 am, Claes Redestad wrote:
> On 2019-01-07 22:21, David Holmes wrote:
>>>>
>>>> I'm under the impression this process does not apply to develop flags
>>>> (which are not visible an anything but debug builds)?
>>>
>>> We've removed develop flags without obsoletion + expiry many times in
>>> the past[1], and while this goes against the written down expiration
>>> in arguments.cpp, I believe it to be a misguided recommendation for
>>> develop flags.
>>
>> There have been and still can be exceptions depending on the actual 
>> flag but the general guideline is:
>>
>> ?* To remove internal options (e.g. diagnostic, experimental, develop 
>> options), use
>> ?* a 2-step model adding major release numbers to the obsolete and 
>> expire columns.
>>
>> Compiler folk can identify whether this flag can be expired immediately.
> 
> To me it seems to be the general rule rather than an exception lately, 

Perhaps ... I didn't do a complete census. I see the process being used:

8198635: Remove unused safepoint message functions and ShowSafepointMsgs

and also not used:

6909265: assert(_OnDeck != Self->_MutexEvent,"invariant") with 
-XX:+PrintMallocFree

but in this case using the flag led to an assertion failure so it was a 
reasonable assumption that the flag was not actually being used in 
practice and so could be immediately removed.

> and see no point in
> sticking to that recommendation. I've filed 
> https://bugs.openjdk.java.net/browse/JDK-8216311
> to drop develop flags from that recommendation.

Noted.

David

> /Claes
> 
> 

From dean.long at oracle.com  Tue Jan  8 04:09:22 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Mon, 7 Jan 2019 20:09:22 -0800
Subject: RFR: 8076988: reevaluate trivial method policy
In-Reply-To: <f0fc6d61-c5a4-d293-6ab8-4fbddcb88965@oracle.com>
References: <f0fc6d61-c5a4-d293-6ab8-4fbddcb88965@oracle.com>
Message-ID: <66aa281f-ae60-5d89-7666-d33cafbb9b6c@oracle.com>

Eric, you should be able to revert 8145579 at the same time.

dl

On 1/7/19 2:47 PM, Eric Caspole wrote:
> Hi everyone,
> Could I get reviews or comments for a fix/simplification of the 
> trivial method policy. I have an internal benchmark where a very hot 
> "trivial" method gets compiled at level 1 and it leads to a ~9% 
> regression compared to getting compiled with C2 level 4. Others have 
> expressed thoughts that this policy might now not as useful as 
> originally intended. I have run performance testing of throughput and 
> startup time with no noticeable regressions.
>
> This webrev passed regular tier1 and tier 2 testing.
> Thanks,
> Eric
>
>
> JBS:
> https://bugs.openjdk.java.net/browse/JDK-8076988
>
> Webrev:
> http://cr.openjdk.java.net/~ecaspole/JDK-8076988/01/webrev/


From OGATAK at jp.ibm.com  Tue Jan  8 04:59:42 2019
From: OGATAK at jp.ibm.com (Kazunori Ogata)
Date: Tue, 8 Jan 2019 13:59:42 +0900
Subject: [8u] RFR for backport of 8154156: PPC64: improve array copy stubs
 by using vector instructions
In-Reply-To: <6adf0a283eda47b29df02a3a2d8550ee@sap.com>
References: <OFB23FA863.EF67A003-ON49258367.004E5542-49258367.0050AF0E@LocalDomain>
 <OF3558918E.E6D1DAF0-ON4925837B.001C7F7B-4925837B.001CB464@notes.na.collabserv.com>
 <6adf0a283eda47b29df02a3a2d8550ee@sap.com>
Message-ID: <OF1FFEC243.4719B215-ON4925837C.001AD2A1-4925837C.001B716C@notes.na.collabserv.com>

Hi Martin,

Thank you for reviewing the patch.  I'll submit RFR to jdk8u-dev mailing 
list referring your reply.


Regards,
Ogata


"Doerr, Martin" <martin.doerr at sap.com> wrote on 2019/01/07 22:08:57:

> From: "Doerr, Martin" <martin.doerr at sap.com>
> To: Kazunori Ogata <OGATAK at jp.ibm.com>, "hotspot-compiler-
> dev at openjdk.java.net" <hotspot-compiler-dev at openjdk.java.net>, "ppc-aix-
> port-dev at openjdk.java.net" <ppc-aix-port-dev at openjdk.java.net>
> Date: 2019/01/07 22:09
> Subject: RE: [8u] RFR for backport of 8154156: PPC64: improve array copy 

> stubs by using vector instructions
> 
> Hi Ogata,
> 
> looks good to me. However, I'm not a jdk8u reviewer.
> 
> Best regards,
> Martin
> 
> 
> -----Original Message-----
> From: ppc-aix-port-dev <ppc-aix-port-dev-bounces at openjdk.java.net> On 
> Behalf Of Kazunori Ogata
> Sent: Montag, 7. Januar 2019 06:14
> To: hotspot-compiler-dev at openjdk.java.net; 
ppc-aix-port-dev at openjdk.java.net
> Subject: Re: [8u] RFR for backport of 8154156: PPC64: improve array copy 

> stubs by using vector instructions
> 
> Hi,
> 
> Ping.  Can anyone review this enhancement backport request?
> 
> Regards,
> Ogata
> 
> 
> Kazunori Ogata/Japan/IBM wrote on 2018/12/18 23:41:16:
> 
> > From: Kazunori Ogata/Japan/IBM
> > To: hotspot-compiler-dev at openjdk.java.net, 
> ppc-aix-port-dev at openjdk.java.net
> > Date: 2018/12/18 23:41
> > Subject: [8u] RFR for backport of 8154156: PPC64: improve array copy 
> stubs
> > by using vector instructions
> > 
> > Hi,
> > 
> > May I get review for enhancement backport of 8154156: PPC64: improve 
> array
> > copy stubs by using vector instructions?
> > 
> > To make this patch buildable (and usable by other planned backports 
> listed
> > in [1]), I cherry picked config_dscr() and its dependent code from 
[2,3] 
> 
> > and has_mfdscr() from [4].
> > 
> > Original patch: http://hg.openjdk.java.net/jdk/jdk/rev/c9d756fa846e
> > Weberv: 
http://cr.openjdk.java.net/~horii/jdk8u_aes_be/8154156/webrev.01/
> > 
> > I confirmed it was buildable for both relase and fastdebug builds, and 

> > JTREG caused no degradation.
> > 
> > Refs:
> > [1] 
http://mail.openjdk.java.net/pipermail/ppc-aix-port-dev/2018-December/
> 003818.html
> > [2] 8149655: PPC64: Implement CompactString intrinsics
> >     http://hg.openjdk.java.net/jdk/jdk/rev/6241574f5982
> > [3] 8080684: PPC64: Fix little-endian build after "8077838: Recent 
> > developments for ppc"
> >     http://hg.openjdk.java.net/jdk/jdk/rev/12ccf8b26eb0  
> > [4] 8077838: Recent developments for ppc.
> >     http://hg.openjdk.java.net/jdk/jdk/rev/c703c89fddbf
> > 
> > Regards,
> > Ogata
> 
> 


From vladimir.kozlov at oracle.com  Tue Jan  8 06:27:24 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 7 Jan 2019 22:27:24 -0800
Subject: RFR: 8216262: Remove develop flag DelayCompilationDuringStartup
In-Reply-To: <64522ad1-1473-8012-b63a-5309bc72c5cc@oracle.com>
References: <64522ad1-1473-8012-b63a-5309bc72c5cc@oracle.com>
Message-ID: <13ef0777-a122-555d-4719-ff09c17e674e@oracle.com>

This mechanism was added before I joined the team. I see that it is present from day one.
I speculate it was added to avoid allocating CPU (very limited at that time) cycles to compilation during VM startup.
I agree that currently it is not the case since we have enough cyles and need to compile MHs very eagerly.
JVMCI has other mechanism which delay compilation until it is initialized.
And I don't think we should delay usage of AOT code.

In short - I agree with changes and removal this archaic feature.
I would suggest immediate removal (or shortest available).

Thanks,
Vladimir

On 1/7/19 4:36 AM, Claes Redestad wrote:
> Hi,
> 
> DelayCompilationAtStartup doesn't delay any compilations.
> 
> Webrev: http://cr.openjdk.java.net/~redestad/8216262/open.00/
> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8216262
> 
> Testing: tier1
> 
> Thanks!
> 
> /Claes

From Nick.Gasson at arm.com  Tue Jan  8 08:03:43 2019
From: Nick.Gasson at arm.com (Nick Gasson (Arm Technology China))
Date: Tue, 8 Jan 2019 08:03:43 +0000
Subject: RFR: 8216350: AArch64: monitor unlock fast path not called
Message-ID: <19afbcdf-6d69-88ca-1794-c03e8e81f171@arm.com>

Hi,

While looking at the profiling output of some micro-benchmarks for 
locking on AArch64, I noticed that the monitor unlock fast-path in 
aarch64_enc_fast_unlock in aarch64.ad (under label `object_has_monitor') 
is almost never executed, even though the lock in the test is inflated.

In order to branch to this fast-path we check if bit #1 is set in the 
displaced header word on the stack:

   __ tbnz(disp_hdr, exact_log2(markOopDesc::monitor_value), 
object_has_monitor);

But in the common case the value in the dhw is set by the monitor 
locking fast-path in aarch64_enc_fast_lock, where we use the pointer to 
the dhw as an arbitrary non-null value. But the lower three bits of this 
pointer will always be zero, and so won't trigger the unlock fast-path 
which is looking for bit #1 set, and we will fall through to call the 
runtime to unlock the monitor.

   // store a non-null value into the box.
   __ str(box, Address(box, BasicLock::displaced_header_offset_in_bytes()));

It seems that the unlock fast-path will only be executed when the 
monitor was originally locked by the runtime (e.g. when the lock was 
first inflated), because ObjectSynchronizer::slow_enter will store 
markOopDesc::unused_mark into the dhw, and this value has bit #1 set.

Can someone help me review this change to aarch64_enc_fast_lock to use 
markOopDesc::unused_mark as the arbitrary non-null value rather than 
`box' to match ObjectSynchronizer::slow_enter?

Webrev: http://cr.openjdk.java.net/~njian/8216350/webrev.0/
Bug: https://bugs.openjdk.java.net/browse/JDK-8216350

Also removed an unnecessary double branch in the unlock code.

Ran jtreg + jcstress.

I also added a new micro-benchmark to 
test/micro/org/openjdk/bench/vm/lang/LockUnlock.java so you can see this 
behaviour:

Without patch:

Result "org.openjdk.bench.vm.lang.LockUnlock.testContendedLock":
   597.855 ?(99.9%) 73.183 ns/op [Average]
   (min, avg, max) = (438.862, 597.855, 861.028), stdev = 97.697
   CI (99.9%): [524.672, 671.038] (assumes normal distribution)

With patch:

Result "org.openjdk.bench.vm.lang.LockUnlock.testContendedLock":
   219.067 ?(99.9%) 21.146 ns/op [Average]
   (min, avg, max) = (176.379, 219.067, 300.186), stdev = 28.229
   CI (99.9%): [197.921, 240.212] (assumes normal distribution)

This is with -XX:+UseLSE, -UseLSE has a similar improvement.

Thanks,
Nick

From aph at redhat.com  Tue Jan  8 08:49:21 2019
From: aph at redhat.com (Andrew Haley)
Date: Tue, 8 Jan 2019 08:49:21 +0000
Subject: RFR: 8216350: AArch64: monitor unlock fast path not called
In-Reply-To: <19afbcdf-6d69-88ca-1794-c03e8e81f171@arm.com>
References: <19afbcdf-6d69-88ca-1794-c03e8e81f171@arm.com>
Message-ID: <6890ad14-5af5-33ba-dcf1-e6e258633a29@redhat.com>

On 1/8/19 8:03 AM, Nick Gasson (Arm Technology China) wrote:
> It seems that the unlock fast-path will only be executed when the 
> monitor was originally locked by the runtime (e.g. when the lock was 
> first inflated), because ObjectSynchronizer::slow_enter will store 
> markOopDesc::unused_mark into the dhw, and this value has bit #1 set.
> 
> Can someone help me review this change to aarch64_enc_fast_lock to use 
> markOopDesc::unused_mark as the arbitrary non-null value rather than 
> `box' to match ObjectSynchronizer::slow_enter?

Thanks. How does this compare with the x86 code?

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From Nick.Gasson at arm.com  Tue Jan  8 09:00:25 2019
From: Nick.Gasson at arm.com (Nick Gasson (Arm Technology China))
Date: Tue, 8 Jan 2019 09:00:25 +0000
Subject: RFR: 8216350: AArch64: monitor unlock fast path not called
In-Reply-To: <6890ad14-5af5-33ba-dcf1-e6e258633a29@redhat.com>
References: <19afbcdf-6d69-88ca-1794-c03e8e81f171@arm.com>
 <6890ad14-5af5-33ba-dcf1-e6e258633a29@redhat.com>
Message-ID: <02950bd9-254a-ee81-041e-13ef7143a17b@arm.com>

Hi Andrew,

On 08/01/2019 16:49, Andrew Haley wrote:
> 
> Thanks. How does this compare with the x86 code?
> 

In macroAssembler_x86.cpp MacroAssembler::fast_lock the _LP64 version 
also uses markOopDesc::unused_mark()

   // Unconditionally set box->_displaced_header = 
markOopDesc::unused_mark().
   // Without cast to int32_t movptr will destroy r10 which is typically 
obj.
   movptr(Address(boxReg, 0), 
(int32_t)intptr_t(markOopDesc::unused_mark()));

(The !_LP64 version uses the literal "3" which is just 
markOopDesc::unused_mark anyway.)

And then in the x86 fast_unlock they are testing the same bit as AArch64:

   testptr(tmpReg, markOopDesc::monitor_value);    // Inflated?

Thanks,
Nick

From aph at redhat.com  Tue Jan  8 09:02:48 2019
From: aph at redhat.com (Andrew Haley)
Date: Tue, 8 Jan 2019 09:02:48 +0000
Subject: RFR: 8216350: AArch64: monitor unlock fast path not called
In-Reply-To: <02950bd9-254a-ee81-041e-13ef7143a17b@arm.com>
References: <19afbcdf-6d69-88ca-1794-c03e8e81f171@arm.com>
 <6890ad14-5af5-33ba-dcf1-e6e258633a29@redhat.com>
 <02950bd9-254a-ee81-041e-13ef7143a17b@arm.com>
Message-ID: <0c1968bf-d9c6-cbc5-70d9-cd576938dfd2@redhat.com>

On 1/8/19 9:00 AM, Nick Gasson (Arm Technology China) wrote:
> Hi Andrew,
> 
> On 08/01/2019 16:49, Andrew Haley wrote:
>>
>> Thanks. How does this compare with the x86 code?
>>
> 
> In macroAssembler_x86.cpp MacroAssembler::fast_lock the _LP64 version 
> also uses markOopDesc::unused_mark()
> 
>    // Unconditionally set box->_displaced_header = 
> markOopDesc::unused_mark().
>    // Without cast to int32_t movptr will destroy r10 which is typically 
> obj.
>    movptr(Address(boxReg, 0), 
> (int32_t)intptr_t(markOopDesc::unused_mark()));
> 
> (The !_LP64 version uses the literal "3" which is just 
> markOopDesc::unused_mark anyway.)
> 
> And then in the x86 fast_unlock they are testing the same bit as AArch64:
> 
>    testptr(tmpReg, markOopDesc::monitor_value);    // Inflated?

OK, the patch is good. Thanks.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From tobias.hartmann at oracle.com  Tue Jan  8 09:42:19 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 8 Jan 2019 10:42:19 +0100
Subject: RFR(S): 8214862: assert(proj != __null) at compile.cpp:3251
In-Reply-To: <1632cec4-c3a1-239a-9f95-3b991c68ff97@oracle.com>
References: <87d0qfhtyo.fsf@redhat.com>
 <3167fa2c-a9bd-ae8d-a084-7a09275b35e1@oracle.com> <874lbpish9.fsf@redhat.com>
 <19387ab7-2dbc-61ed-5722-ff5ecbcc3b51@oracle.com> <878t0thi36.fsf@redhat.com>
 <1632cec4-c3a1-239a-9f95-3b991c68ff97@oracle.com>
Message-ID: <1b9a2e27-b61c-97ec-fc9b-c7460237684a@oracle.com>

Hi Roland,

> http://cr.openjdk.java.net/~roland/8214862/webrev.00/

This looks reasonable to me as well. Ship it!

Best regards,
Tobias

From tobias.hartmann at oracle.com  Tue Jan  8 09:45:55 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 8 Jan 2019 10:45:55 +0100
Subject: RFR: 8216262: Remove develop flag DelayCompilationDuringStartup
In-Reply-To: <13ef0777-a122-555d-4719-ff09c17e674e@oracle.com>
References: <64522ad1-1473-8012-b63a-5309bc72c5cc@oracle.com>
 <13ef0777-a122-555d-4719-ff09c17e674e@oracle.com>
Message-ID: <f2e6266c-fb2e-9a8c-bb14-f8dac8eb0d84@oracle.com>

Hi Claes,

On 08.01.19 07:27, Vladimir Kozlov wrote:
> In short - I agree with changes and removal this archaic feature.
> I would suggest immediate removal (or shortest available).

+1

Best regards,
Tobias

From claes.redestad at oracle.com  Tue Jan  8 09:53:35 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Tue, 8 Jan 2019 10:53:35 +0100
Subject: RFR: 8216262: Remove develop flag DelayCompilationDuringStartup
In-Reply-To: <f2e6266c-fb2e-9a8c-bb14-f8dac8eb0d84@oracle.com>
References: <64522ad1-1473-8012-b63a-5309bc72c5cc@oracle.com>
 <13ef0777-a122-555d-4719-ff09c17e674e@oracle.com>
 <f2e6266c-fb2e-9a8c-bb14-f8dac8eb0d84@oracle.com>
Message-ID: <bd6740b9-163c-a19d-3e6d-1393685b9629@oracle.com>

Vladimir, Tobias, thanks for reviewing!

/Claes

On 2019-01-08 10:45, Tobias Hartmann wrote:
> Hi Claes,
> 
> On 08.01.19 07:27, Vladimir Kozlov wrote:
>> In short - I agree with changes and removal this archaic feature.
>> I would suggest immediate removal (or shortest available).
> 
> +1
> 
> Best regards,
> Tobias
> 

From tobias.hartmann at oracle.com  Tue Jan  8 09:52:15 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 8 Jan 2019 10:52:15 +0100
Subject: RFR: 8076988: reevaluate trivial method policy
In-Reply-To: <66aa281f-ae60-5d89-7666-d33cafbb9b6c@oracle.com>
References: <f0fc6d61-c5a4-d293-6ab8-4fbddcb88965@oracle.com>
 <66aa281f-ae60-5d89-7666-d33cafbb9b6c@oracle.com>
Message-ID: <eac8a360-df92-fed4-ade7-8454865231e0@oracle.com>

Hi Eric,

looks good to me too.

On 08.01.19 05:09, dean.long at oracle.com wrote:
> Eric, you should be able to revert 8145579 at the same time.

+1

Best regards,
Tobias

From tobias.hartmann at oracle.com  Tue Jan  8 10:00:52 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 8 Jan 2019 11:00:52 +0100
Subject: RFR (XS): 8216290: Backport of 8215888 to JDK11u (Register to
 register spill may use AVX 512 move instruction on unsupported platform)
In-Reply-To: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A4AC0F@FMSMSX126.amr.corp.intel.com>
References: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A4AC0F@FMSMSX126.amr.corp.intel.com>
Message-ID: <b5945875-9c84-e59d-63c4-3c7ce396a900@oracle.com>

Hi Sandhya,

this looks good to me. Please request approval according to:
https://openjdk.java.net/projects/jdk-updates/approval.html

Best regards,
Tobias

On 07.01.19 21:44, Viswanathan, Sandhya wrote:
> This is a request to backport the 8215888 <https://bugs.openjdk.java.net/browse/JDK-8215888> fix to
> JDK11u. The fix has been in JDK 12 branch for a couple of days now and passed nightly testing.
> 
> ?
> 
> The backport bug request is at:
> 
> https://bugs.openjdk.java.net/browse/JDK-8216290
> 
> ?
> 
> The backport webrev is at:
> 
> http://cr.openjdk.java.net/~sviswanathan/8216290/webrev.00/
> 
> ?
> 
> Best Regards,
> 
> Sandhya
> 
> ?
> 

From rwestrel at redhat.com  Tue Jan  8 12:27:27 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Tue, 08 Jan 2019 13:27:27 +0100
Subject: RFR(S): 8214862: assert(proj != __null) at compile.cpp:3251
In-Reply-To: <1b9a2e27-b61c-97ec-fc9b-c7460237684a@oracle.com>
References: <87d0qfhtyo.fsf@redhat.com>
 <3167fa2c-a9bd-ae8d-a084-7a09275b35e1@oracle.com> <874lbpish9.fsf@redhat.com>
 <19387ab7-2dbc-61ed-5722-ff5ecbcc3b51@oracle.com> <878t0thi36.fsf@redhat.com>
 <1632cec4-c3a1-239a-9f95-3b991c68ff97@oracle.com>
 <1b9a2e27-b61c-97ec-fc9b-c7460237684a@oracle.com>
Message-ID: <87k1jf1gi8.fsf@redhat.com>


Thanks for the reviews, Vladimir & Tobias.

Roland.

From claes.redestad at oracle.com  Tue Jan  8 13:01:06 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Tue, 8 Jan 2019 14:01:06 +0100
Subject: RFR: 8216359: Remove develop flags TraceCompilationPolicy and
 TimeCompilationPolicy
Message-ID: <d1efebcd-23a4-0b39-7e01-014063ddeb3f@oracle.com>

Hi,

the develop flags Trace- and TimeCompilationPolicy are not implemented
for any of the current default compilation policy implementations and
should be removed. (They _are_ implemented for StackWalkCompPolicy
which I'm proposing to be deprecated).

(This also removes the declaration of _in_vm_startup that should have
been removed by JDK-8216262)

Webrev: http://cr.openjdk.java.net/~redestad/8216359/open.00/
Bug:    https://bugs.openjdk.java.net/browse/JDK-8216359

Thanks!

/Claes

From rwestrel at redhat.com  Tue Jan  8 13:12:29 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Tue, 08 Jan 2019 14:12:29 +0100
Subject: RFR(S): 8214862: assert(proj != __null) at compile.cpp:3251
In-Reply-To: <3167fa2c-a9bd-ae8d-a084-7a09275b35e1@oracle.com>
References: <87d0qfhtyo.fsf@redhat.com>
 <3167fa2c-a9bd-ae8d-a084-7a09275b35e1@oracle.com>
Message-ID: <87h8ej1ef6.fsf@redhat.com>


> I would suggest to create a small function (in same file) to call from Compile::Optimize() - we 
> don't do graph transformations in it but call other functions.
>
> You don't need to use 'C->' because it is already Compile's method.

FTR, I'll push a fix that includes the suggestions above:

http://cr.openjdk.java.net/~roland/8214862/webrev.01/

Roland.

From nils.eliasson at oracle.com  Tue Jan  8 13:50:19 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Tue, 8 Jan 2019 14:50:19 +0100
Subject: RFR: 8216359: Remove develop flags TraceCompilationPolicy and
 TimeCompilationPolicy
In-Reply-To: <d1efebcd-23a4-0b39-7e01-014063ddeb3f@oracle.com>
References: <d1efebcd-23a4-0b39-7e01-014063ddeb3f@oracle.com>
Message-ID: <9eb59b35-1f96-18a6-252c-9776e115ac8c@oracle.com>

Looks great!

// Nils

On 2019-01-08 14:01, Claes Redestad wrote:
> Hi,
>
> the develop flags Trace- and TimeCompilationPolicy are not implemented
> for any of the current default compilation policy implementations and
> should be removed. (They _are_ implemented for StackWalkCompPolicy
> which I'm proposing to be deprecated).
>
> (This also removes the declaration of _in_vm_startup that should have
> been removed by JDK-8216262)
>
> Webrev: http://cr.openjdk.java.net/~redestad/8216359/open.00/
> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8216359
>
> Thanks!
>
> /Claes

From tobias.hartmann at oracle.com  Tue Jan  8 14:03:20 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 8 Jan 2019 15:03:20 +0100
Subject: RFR: 8216359: Remove develop flags TraceCompilationPolicy and
 TimeCompilationPolicy
In-Reply-To: <d1efebcd-23a4-0b39-7e01-014063ddeb3f@oracle.com>
References: <d1efebcd-23a4-0b39-7e01-014063ddeb3f@oracle.com>
Message-ID: <061751cf-4ff7-34fd-89a9-b2d924ee7cf3@oracle.com>

Hi Claes,

looks good to me.

Best regards,
Tobias

On 08.01.19 14:01, Claes Redestad wrote:
> Hi,
> 
> the develop flags Trace- and TimeCompilationPolicy are not implemented
> for any of the current default compilation policy implementations and
> should be removed. (They _are_ implemented for StackWalkCompPolicy
> which I'm proposing to be deprecated).
> 
> (This also removes the declaration of _in_vm_startup that should have
> been removed by JDK-8216262)
> 
> Webrev: http://cr.openjdk.java.net/~redestad/8216359/open.00/
> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8216359
> 
> Thanks!
> 
> /Claes

From per.liden at oracle.com  Tue Jan  8 14:32:39 2019
From: per.liden at oracle.com (Per Liden)
Date: Tue, 8 Jan 2019 15:32:39 +0100
Subject: RFR: 8215708: ZGC: Add missing LoadBarrierNode::size_of()
Message-ID: <ea0975d2-120d-b3e2-9899-38e56660955b@oracle.com>

LoadBarrierNode should implement size_of(). Otherwise cloning of such 
nodes is broken since only part of the object will be copied. This 
caused incorrect load barriers to be used in random places. For example, 
we could generate a weak barrier instead of a strong barrier, because 
the _weak member was not properly initialized when cloned.

This patch also implements three other methods (cmp, adr_type and 
match_edge) with an immediate call to ShouldNotReachHere(). This is a 
pure safety net to catch any misuse of these. These should never be 
called, but if they are called today we might not notice and instead 
silently do the wrong thing.

Bug: https://bugs.openjdk.java.net/browse/JDK-8215708
Webrev: http://cr.openjdk.java.net/~pliden/8215708/webrev.0

Testing: tier{1,6,7}

/Per

From erik.osterlund at oracle.com  Tue Jan  8 14:43:46 2019
From: erik.osterlund at oracle.com (=?UTF-8?Q?Erik_=c3=96sterlund?=)
Date: Tue, 8 Jan 2019 15:43:46 +0100
Subject: RFR: 8215708: ZGC: Add missing LoadBarrierNode::size_of()
In-Reply-To: <ea0975d2-120d-b3e2-9899-38e56660955b@oracle.com>
References: <ea0975d2-120d-b3e2-9899-38e56660955b@oracle.com>
Message-ID: <e1fb6626-92ff-b5f0-f4a0-7d273c04faa0@oracle.com>

Hi Per,

Looks good.

Thanks,
/Erik

On 2019-01-08 15:32, Per Liden wrote:
> LoadBarrierNode should implement size_of(). Otherwise cloning of such 
> nodes is broken since only part of the object will be copied. This 
> caused incorrect load barriers to be used in random places. For 
> example, we could generate a weak barrier instead of a strong barrier, 
> because the _weak member was not properly initialized when cloned.
>
> This patch also implements three other methods (cmp, adr_type and 
> match_edge) with an immediate call to ShouldNotReachHere(). This is a 
> pure safety net to catch any misuse of these. These should never be 
> called, but if they are called today we might not notice and instead 
> silently do the wrong thing.
>
> Bug: https://bugs.openjdk.java.net/browse/JDK-8215708
> Webrev: http://cr.openjdk.java.net/~pliden/8215708/webrev.0
>
> Testing: tier{1,6,7}
>
> /Per


From per.liden at oracle.com  Tue Jan  8 15:04:32 2019
From: per.liden at oracle.com (Per Liden)
Date: Tue, 8 Jan 2019 16:04:32 +0100
Subject: RFR: 8215708: ZGC: Add missing LoadBarrierNode::size_of()
In-Reply-To: <e1fb6626-92ff-b5f0-f4a0-7d273c04faa0@oracle.com>
References: <ea0975d2-120d-b3e2-9899-38e56660955b@oracle.com>
 <e1fb6626-92ff-b5f0-f4a0-7d273c04faa0@oracle.com>
Message-ID: <47472a6a-b852-afe7-a3e0-104fbbe449d8@oracle.com>

Thanks Erik!

/Per

On 1/8/19 3:43 PM, Erik ?sterlund wrote:
> Hi Per,
> 
> Looks good.
> 
> Thanks,
> /Erik
> 
> On 2019-01-08 15:32, Per Liden wrote:
>> LoadBarrierNode should implement size_of(). Otherwise cloning of such 
>> nodes is broken since only part of the object will be copied. This 
>> caused incorrect load barriers to be used in random places. For 
>> example, we could generate a weak barrier instead of a strong barrier, 
>> because the _weak member was not properly initialized when cloned.
>>
>> This patch also implements three other methods (cmp, adr_type and 
>> match_edge) with an immediate call to ShouldNotReachHere(). This is a 
>> pure safety net to catch any misuse of these. These should never be 
>> called, but if they are called today we might not notice and instead 
>> silently do the wrong thing.
>>
>> Bug: https://bugs.openjdk.java.net/browse/JDK-8215708
>> Webrev: http://cr.openjdk.java.net/~pliden/8215708/webrev.0
>>
>> Testing: tier{1,6,7}
>>
>> /Per
> 

From claes.redestad at oracle.com  Tue Jan  8 15:25:18 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Tue, 8 Jan 2019 16:25:18 +0100
Subject: RFR: 8216359: Remove develop flags TraceCompilationPolicy and
 TimeCompilationPolicy
In-Reply-To: <061751cf-4ff7-34fd-89a9-b2d924ee7cf3@oracle.com>
References: <d1efebcd-23a4-0b39-7e01-014063ddeb3f@oracle.com>
 <061751cf-4ff7-34fd-89a9-b2d924ee7cf3@oracle.com>
Message-ID: <59040f0d-d1d4-9dc9-0539-22da1fa34db1@oracle.com>

Nils, Tobias, thanks for reviewing!

/Claes

On 2019-01-08 15:03, Tobias Hartmann wrote:
> Hi Claes,
> 
> looks good to me.
> 
> Best regards,
> Tobias
> 
> On 08.01.19 14:01, Claes Redestad wrote:
>> Hi,
>>
>> the develop flags Trace- and TimeCompilationPolicy are not implemented
>> for any of the current default compilation policy implementations and
>> should be removed. (They _are_ implemented for StackWalkCompPolicy
>> which I'm proposing to be deprecated).
>>
>> (This also removes the declaration of _in_vm_startup that should have
>> been removed by JDK-8216262)
>>
>> Webrev: http://cr.openjdk.java.net/~redestad/8216359/open.00/
>> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8216359
>>
>> Thanks!
>>
>> /Claes

From eric.caspole at oracle.com  Tue Jan  8 15:22:05 2019
From: eric.caspole at oracle.com (Eric Caspole)
Date: Tue, 8 Jan 2019 10:22:05 -0500
Subject: RFR: 8076988: reevaluate trivial method policy
In-Reply-To: <66aa281f-ae60-5d89-7666-d33cafbb9b6c@oracle.com>
References: <f0fc6d61-c5a4-d293-6ab8-4fbddcb88965@oracle.com>
 <66aa281f-ae60-5d89-7666-d33cafbb9b6c@oracle.com>
Message-ID: <5d86dac1-364d-ce68-feb6-f01f8cdfcc9d@oracle.com>

Hi Dean, I will make a new CR to revert 8145579 and do that as a 
separate change next, ok?
Eric


On 1/7/19 23:09, dean.long at oracle.com wrote:
> Eric, you should be able to revert 8145579 at the same time.
> 
> dl
> 
> On 1/7/19 2:47 PM, Eric Caspole wrote:
>> Hi everyone,
>> Could I get reviews or comments for a fix/simplification of the 
>> trivial method policy. I have an internal benchmark where a very hot 
>> "trivial" method gets compiled at level 1 and it leads to a ~9% 
>> regression compared to getting compiled with C2 level 4. Others have 
>> expressed thoughts that this policy might now not as useful as 
>> originally intended. I have run performance testing of throughput and 
>> startup time with no noticeable regressions.
>>
>> This webrev passed regular tier1 and tier 2 testing.
>> Thanks,
>> Eric
>>
>>
>> JBS:
>> https://bugs.openjdk.java.net/browse/JDK-8076988
>>
>> Webrev:
>> http://cr.openjdk.java.net/~ecaspole/JDK-8076988/01/webrev/
> 

From dean.long at oracle.com  Tue Jan  8 16:11:22 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Tue, 8 Jan 2019 08:11:22 -0800
Subject: RFR: 8076988: reevaluate trivial method policy
In-Reply-To: <5d86dac1-364d-ce68-feb6-f01f8cdfcc9d@oracle.com>
References: <f0fc6d61-c5a4-d293-6ab8-4fbddcb88965@oracle.com>
 <66aa281f-ae60-5d89-7666-d33cafbb9b6c@oracle.com>
 <5d86dac1-364d-ce68-feb6-f01f8cdfcc9d@oracle.com>
Message-ID: <d1bc0304-99a5-146e-1189-8f1462a39430@oracle.com>

OK.

dl

On 1/8/19 7:22 AM, Eric Caspole wrote:
> Hi Dean, I will make a new CR to revert 8145579 and do that as a 
> separate change next, ok?
> Eric
>
>
> On 1/7/19 23:09, dean.long at oracle.com wrote:
>> Eric, you should be able to revert 8145579 at the same time.
>>
>> dl
>>
>> On 1/7/19 2:47 PM, Eric Caspole wrote:
>>> Hi everyone,
>>> Could I get reviews or comments for a fix/simplification of the 
>>> trivial method policy. I have an internal benchmark where a very hot 
>>> "trivial" method gets compiled at level 1 and it leads to a ~9% 
>>> regression compared to getting compiled with C2 level 4. Others have 
>>> expressed thoughts that this policy might now not as useful as 
>>> originally intended. I have run performance testing of throughput 
>>> and startup time with no noticeable regressions.
>>>
>>> This webrev passed regular tier1 and tier 2 testing.
>>> Thanks,
>>> Eric
>>>
>>>
>>> JBS:
>>> https://bugs.openjdk.java.net/browse/JDK-8076988
>>>
>>> Webrev:
>>> http://cr.openjdk.java.net/~ecaspole/JDK-8076988/01/webrev/
>>


From nils.eliasson at oracle.com  Tue Jan  8 17:04:29 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Tue, 8 Jan 2019 18:04:29 +0100
Subject: RFR: 8215708: ZGC: Add missing LoadBarrierNode::size_of()
In-Reply-To: <ea0975d2-120d-b3e2-9899-38e56660955b@oracle.com>
References: <ea0975d2-120d-b3e2-9899-38e56660955b@oracle.com>
Message-ID: <030aede6-259b-242f-738a-5f39d0ea8144@oracle.com>

Looks good!

// Nils

On 2019-01-08 15:32, Per Liden wrote:
> LoadBarrierNode should implement size_of(). Otherwise cloning of such 
> nodes is broken since only part of the object will be copied. This 
> caused incorrect load barriers to be used in random places. For 
> example, we could generate a weak barrier instead of a strong barrier, 
> because the _weak member was not properly initialized when cloned.
>
> This patch also implements three other methods (cmp, adr_type and 
> match_edge) with an immediate call to ShouldNotReachHere(). This is a 
> pure safety net to catch any misuse of these. These should never be 
> called, but if they are called today we might not notice and instead 
> silently do the wrong thing.
>
> Bug: https://bugs.openjdk.java.net/browse/JDK-8215708
> Webrev: http://cr.openjdk.java.net/~pliden/8215708/webrev.0
>
> Testing: tier{1,6,7}
>
> /Per

From nils.eliasson at oracle.com  Tue Jan  8 17:23:14 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Tue, 8 Jan 2019 18:23:14 +0100
Subject: RFR (S): 8216372: Put C2 load barrier stub routines in separate
 codeblobs
Message-ID: <d6edfa04-3a0a-e1e9-a3c4-c636a59995c4@oracle.com>

Hi,

Please review this small clean up of the load barrier stub routine 
generation. The main improvement is having separate blobs for strong and 
weak barriers. This gives us PrintAssembly output that is clearly 
annotated with the barrier type:

0x00007f8fb91b4b58: lea 0x10(%r9),%r11
 ??0x00007f8fb91b4b5c: callq 0x00007f8fb9009a60 ; {runtime_call 
zgc_load_barrier_weak_stubs}
 ??0x00007f8fb91b4b61: jmpq 0x00007f8fb91b4a23

Bug: https://bugs.openjdk.java.net/browse/JDK-8216372

Webrev: http://cr.openjdk.java.net/~neliasso/8216372/webrev.02/

Regards,

Nils

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190108/6c91eb7e/attachment-0001.html>

From erik.osterlund at oracle.com  Tue Jan  8 17:39:16 2019
From: erik.osterlund at oracle.com (=?UTF-8?Q?Erik_=c3=96sterlund?=)
Date: Tue, 8 Jan 2019 18:39:16 +0100
Subject: RFR (S): 8216372: Put C2 load barrier stub routines in separate
 codeblobs
In-Reply-To: <d6edfa04-3a0a-e1e9-a3c4-c636a59995c4@oracle.com>
References: <d6edfa04-3a0a-e1e9-a3c4-c636a59995c4@oracle.com>
Message-ID: <3b8359a7-7038-92f7-7030-69f6fce69e2c@oracle.com>

Hi Nils,

Looks good.

Thanks,
/Erik

On 2019-01-08 18:23, Nils Eliasson wrote:
> Hi,
> 
> Please review this small clean up of the load barrier stub routine 
> generation. The main improvement is having separate blobs for strong and 
> weak barriers. This gives us PrintAssembly output that is clearly 
> annotated with the barrier type:
> 
> 0x00007f8fb91b4b58: lea 0x10(%r9),%r11
>  ??0x00007f8fb91b4b5c: callq 0x00007f8fb9009a60 ; {runtime_call 
> zgc_load_barrier_weak_stubs}
>  ??0x00007f8fb91b4b61: jmpq 0x00007f8fb91b4a23
> 
> Bug: https://bugs.openjdk.java.net/browse/JDK-8216372
> 
> Webrev: http://cr.openjdk.java.net/~neliasso/8216372/webrev.02/
> 
> Regards,
> 
> Nils
> 

From sandhya.viswanathan at intel.com  Tue Jan  8 18:42:45 2019
From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya)
Date: Tue, 8 Jan 2019 18:42:45 +0000
Subject: RFR (XS): 8216290: Backport of 8215888 to JDK11u (Register to
 register spill may use AVX 512 move instruction on unsupported platform)
In-Reply-To: <b5945875-9c84-e59d-63c4-3c7ce396a900@oracle.com>
References: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A4AC0F@FMSMSX126.amr.corp.intel.com>
 <b5945875-9c84-e59d-63c4-3c7ce396a900@oracle.com>
Message-ID: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A4B182@FMSMSX126.amr.corp.intel.com>

Hi Tobias, 

Thanks, I have updated the bug request as per the steps described in the link you gave. 

Best Regards,
Sandhya


-----Original Message-----
From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com] 
Sent: Tuesday, January 08, 2019 2:01 AM
To: Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; hotspot compiler <hotspot-compiler-dev at openjdk.java.net>; vladimir.kozlov at oracle.com
Subject: Re: RFR (XS): 8216290: Backport of 8215888 to JDK11u (Register to register spill may use AVX 512 move instruction on unsupported platform)

Hi Sandhya,

this looks good to me. Please request approval according to:
https://openjdk.java.net/projects/jdk-updates/approval.html

Best regards,
Tobias

On 07.01.19 21:44, Viswanathan, Sandhya wrote:
> This is a request to backport the 8215888 
> <https://bugs.openjdk.java.net/browse/JDK-8215888> fix to JDK11u. The fix has been in JDK 12 branch for a couple of days now and passed nightly testing.
> 
> ?
> 
> The backport bug request is at:
> 
> https://bugs.openjdk.java.net/browse/JDK-8216290
> 
> ?
> 
> The backport webrev is at:
> 
> http://cr.openjdk.java.net/~sviswanathan/8216290/webrev.00/
> 
> ?
> 
> Best Regards,
> 
> Sandhya
> 
> ?
> 

From eric.caspole at oracle.com  Tue Jan  8 19:27:19 2019
From: eric.caspole at oracle.com (Eric Caspole)
Date: Tue, 8 Jan 2019 14:27:19 -0500
Subject: RFR (S) 8216375: Revert JDK-8145579 after JDK-8076988 is resolved
Message-ID: <916836c2-7a25-76ff-9fca-3ed0547a15c7@oracle.com>

Hi everybody,
Could I get reviews on this small change. As Dean suggested, now that 
JDK-8076988 to simplify the trivial method check is done, the change of 
JDK-8145579 is no longer needed, so this webrev reverts it.

This passed tier1 and tier2 testing.
Thanks,
Eric

JBS:
https://bugs.openjdk.java.net/browse/JDK-8216375

Webrev:
http://cr.openjdk.java.net/~ecaspole/JDK-8216375/01/webrev/

From per.liden at oracle.com  Tue Jan  8 20:32:36 2019
From: per.liden at oracle.com (Per Liden)
Date: Tue, 8 Jan 2019 21:32:36 +0100
Subject: RFR: 8215708: ZGC: Add missing LoadBarrierNode::size_of()
In-Reply-To: <030aede6-259b-242f-738a-5f39d0ea8144@oracle.com>
References: <ea0975d2-120d-b3e2-9899-38e56660955b@oracle.com>
 <030aede6-259b-242f-738a-5f39d0ea8144@oracle.com>
Message-ID: <7c2351e1-e89f-f3dd-8693-b960c2325eae@oracle.com>

Thanks Nils!

/Per

On 01/08/2019 06:04 PM, Nils Eliasson wrote:
> Looks good!
> 
> // Nils
> 
> On 2019-01-08 15:32, Per Liden wrote:
>> LoadBarrierNode should implement size_of(). Otherwise cloning of such 
>> nodes is broken since only part of the object will be copied. This 
>> caused incorrect load barriers to be used in random places. For 
>> example, we could generate a weak barrier instead of a strong barrier, 
>> because the _weak member was not properly initialized when cloned.
>>
>> This patch also implements three other methods (cmp, adr_type and 
>> match_edge) with an immediate call to ShouldNotReachHere(). This is a 
>> pure safety net to catch any misuse of these. These should never be 
>> called, but if they are called today we might not notice and instead 
>> silently do the wrong thing.
>>
>> Bug: https://bugs.openjdk.java.net/browse/JDK-8215708
>> Webrev: http://cr.openjdk.java.net/~pliden/8215708/webrev.0
>>
>> Testing: tier{1,6,7}
>>
>> /Per

From per.liden at oracle.com  Tue Jan  8 20:34:53 2019
From: per.liden at oracle.com (Per Liden)
Date: Tue, 8 Jan 2019 21:34:53 +0100
Subject: RFR (S): 8216372: Put C2 load barrier stub routines in separate
 codeblobs
In-Reply-To: <d6edfa04-3a0a-e1e9-a3c4-c636a59995c4@oracle.com>
References: <d6edfa04-3a0a-e1e9-a3c4-c636a59995c4@oracle.com>
Message-ID: <150d31c1-e59d-13e1-ecda-43d25e38dcce@oracle.com>

Looks good!

/Per

On 01/08/2019 06:23 PM, Nils Eliasson wrote:
> Hi,
> 
> Please review this small clean up of the load barrier stub routine 
> generation. The main improvement is having separate blobs for strong and 
> weak barriers. This gives us PrintAssembly output that is clearly 
> annotated with the barrier type:
> 
> 0x00007f8fb91b4b58: lea 0x10(%r9),%r11
>    0x00007f8fb91b4b5c: callq 0x00007f8fb9009a60 ; {runtime_call 
> zgc_load_barrier_weak_stubs}
>    0x00007f8fb91b4b61: jmpq 0x00007f8fb91b4a23
> 
> Bug: https://bugs.openjdk.java.net/browse/JDK-8216372
> 
> Webrev: http://cr.openjdk.java.net/~neliasso/8216372/webrev.02/
> 
> Regards,
> 
> Nils
> 

From dean.long at oracle.com  Wed Jan  9 02:00:02 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Tue, 8 Jan 2019 18:00:02 -0800
Subject: RFR (S) 8216375: Revert JDK-8145579 after JDK-8076988 is resolved
In-Reply-To: <916836c2-7a25-76ff-9fca-3ed0547a15c7@oracle.com>
References: <916836c2-7a25-76ff-9fca-3ed0547a15c7@oracle.com>
Message-ID: <27d62e29-aa4d-cdca-d0b8-9af1dd35bb9e@oracle.com>

Looks good to me.? Don't forget to update the copyright year.

dl

On 1/8/19 11:27 AM, Eric Caspole wrote:
> Hi everybody,
> Could I get reviews on this small change. As Dean suggested, now that 
> JDK-8076988 to simplify the trivial method check is done, the change 
> of JDK-8145579 is no longer needed, so this webrev reverts it.
>
> This passed tier1 and tier2 testing.
> Thanks,
> Eric
>
> JBS:
> https://bugs.openjdk.java.net/browse/JDK-8216375
>
> Webrev:
> http://cr.openjdk.java.net/~ecaspole/JDK-8216375/01/webrev/


From derekw at marvell.com  Wed Jan  9 02:19:19 2019
From: derekw at marvell.com (Derek White)
Date: Wed, 9 Jan 2019 02:19:19 +0000
Subject: [aarch64-port-dev ] RFR: 8216350: AArch64: monitor unlock fast
 path	not called
Message-ID: <MN2PR18MB2733F0D50D0E61B4EFE1D178D28B0@MN2PR18MB2733.namprd18.prod.outlook.com>

Hi Nick,

Very nice find!

My only comments are to fix up some comments (pre-existing), and some trivial cleanups of pre-existing code. These are judgement calls, and it would be good to get the approval of at least one Andrew.

Comments:
1) The TODO comment before aarch64_enc_fast_unlock() has been done since 2014, so it can go away.

2) In aarch64_enc_fast_lock() and aarch64_enc_fast_unlock(), there are three comment blocks referring to old code using cmpxchgptr. At this point in time I find the new code clearer, and these comments don't add much?

Cleanup suggestions (untested!):
3) In aarch64_enc_fast_lock():
    // we can use AArch64's bit test and branch here but
    // markoopDesc does not define a bit index just the bit value
    // so assert in case the bit pos changes
#   define __monitor_value_log2 1
    assert(markOopDesc::monitor_value == (1 << __monitor_value_log2), "incorrect bit position");
    __ tbnz(disp_hdr, __monitor_value_log2, object_has_monitor);
#   undef __monitor_value_log2

Can be replaced with:
     __ tbnz(disp_hdr, exact_log2(markOopDesc::monitor_value), object_has_monitor);
It looks like this was fixed in several places a long time ago, but this one got missed.

4)  Slightly better comment for last instruction of fast_unlock (and explicitly use zr).
    __ stlr(zr, tmp); // set unowned

- Derek


--------------------- Patch on original code (not your patch, sorry!) -----------------------------
--- src/hotspot/cpu/aarch64/aarch64.ad
+++ src/hotspot/cpu/aarch64/aarch64.ad
@@ -3418,13 +3418,7 @@
     }
 
     // Handle existing monitor
-    // we can use AArch64's bit test and branch here but
-    // markoopDesc does not define a bit index just the bit value
-    // so assert in case the bit pos changes
-#   define __monitor_value_log2 1
-    assert(markOopDesc::monitor_value == (1 << __monitor_value_log2), "incorrect bit position");
-    __ tbnz(disp_hdr, __monitor_value_log2, object_has_monitor);
-#   undef __monitor_value_log2
+    __ tbnz(disp_hdr, exact_log2(markOopDesc::monitor_value), object_has_monitor);
 
     // Set displaced_header to be (markOop of object | UNLOCK_VALUE).
     __ orr(disp_hdr, disp_hdr, markOopDesc::unlocked_value);
@@ -3455,14 +3449,6 @@
       __ b(retry_load);
     }
 
-    // Formerly:
-    // __ cmpxchgptr(/*oldv=*/disp_hdr,
-    //               /*newv=*/box,
-    //               /*addr=*/oop,
-    //               /*tmp=*/tmp,
-    //               cont,
-    //               /*fail*/NULL);
-
     assert(oopDesc::mark_offset_in_bytes() == 0, "offset of _mark is not 0");
 
     // If the compare-and-exchange succeeded, then we found an unlocked
@@ -3511,15 +3497,6 @@
       __ bind(fail);
     }
 
-    // Label next;
-    // __ cmpxchgptr(/*oldv=*/disp_hdr,
-    //               /*newv=*/rthread,
-    //               /*addr=*/tmp,
-    //               /*tmp=*/rscratch1,
-    //               /*succeed*/next,
-    //               /*fail*/NULL);
-    // __ bind(next);
-
     // store a non-null value into the box.
     __ str(box, Address(box, BasicLock::displaced_header_offset_in_bytes()));
 
@@ -3544,9 +3521,6 @@
 
   %}
 
-  // TODO
-  // reimplement this with custom cmpxchgptr code
-  // which avoids some of the unnecessary branching
   enc_class aarch64_enc_fast_unlock(iRegP object, iRegP box, iRegP tmp, iRegP tmp2) %{
     MacroAssembler _masm(&cbuf);
     Register oop = as_Register($object$$reg);
@@ -3597,12 +3571,6 @@
         __ b(retry_load);
       }
 
-    // __ cmpxchgptr(/*compare_value=*/box,
-    //               /*exchange_value=*/disp_hdr,
-    //               /*where=*/oop,
-    //               /*result=*/tmp,
-    //               cont,
-    //               /*cas_failed*/NULL);
     assert(oopDesc::mark_offset_in_bytes() == 0, "offset of _mark is not 0");
 
     __ bind(cas_failed);
@@ -3626,7 +3594,7 @@
     __ cbnz(rscratch1, cont);
     // need a release store here
     __ lea(tmp, Address(tmp, ObjectMonitor::owner_offset_in_bytes()));
-    __ stlr(rscratch1, tmp); // rscratch1 is zero
+    __ stlr(zr, tmp); // set unowned
 
     __ bind(cont);
     // flag == EQ indicates success


> -----Original Message-----
> From: aarch64-port-dev <aarch64-port-dev-bounces at openjdk.java.net> On
> Behalf Of Nick Gasson (Arm Technology China)
> Sent: Tuesday, January 08, 2019 3:04 AM
> To: hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-
> dev at openjdk.java.net>
> Cc: nd <nd at arm.com>; aarch64-port-dev at openjdk.java.net
> Subject: [EXT] [aarch64-port-dev ] RFR: 8216350: AArch64: monitor unlock
> fast path not called
> 
> ----------------------------------------------------------------------
> Hi,
> 
> While looking at the profiling output of some micro-benchmarks for locking
> on AArch64, I noticed that the monitor unlock fast-path in
> aarch64_enc_fast_unlock in aarch64.ad (under label `object_has_monitor') is
> almost never executed, even though the lock in the test is inflated.
> 
> In order to branch to this fast-path we check if bit #1 is set in the displaced
> header word on the stack:
> 
>    __ tbnz(disp_hdr, exact_log2(markOopDesc::monitor_value),
> object_has_monitor);
> 
> But in the common case the value in the dhw is set by the monitor locking
> fast-path in aarch64_enc_fast_lock, where we use the pointer to the dhw as
> an arbitrary non-null value. But the lower three bits of this pointer will
> always be zero, and so won't trigger the unlock fast-path which is looking for
> bit #1 set, and we will fall through to call the runtime to unlock the monitor.
> 
>    // store a non-null value into the box.
>    __ str(box, Address(box, BasicLock::displaced_header_offset_in_bytes()));
> 
> It seems that the unlock fast-path will only be executed when the monitor
> was originally locked by the runtime (e.g. when the lock was first inflated),
> because ObjectSynchronizer::slow_enter will store
> markOopDesc::unused_mark into the dhw, and this value has bit #1 set.
> 
> Can someone help me review this change to aarch64_enc_fast_lock to use
> markOopDesc::unused_mark as the arbitrary non-null value rather than `box'
> to match ObjectSynchronizer::slow_enter?
> 
> Webrev: http://cr.openjdk.java.net/~njian/8216350/webrev.0/
> Bug: https://bugs.openjdk.java.net/browse/JDK-8216350
> 
> Also removed an unnecessary double branch in the unlock code.
> 
> Ran jtreg + jcstress.
> 
> I also added a new micro-benchmark to
> test/micro/org/openjdk/bench/vm/lang/LockUnlock.java so you can see this
> behaviour:
> 
> Without patch:
> 
> Result "org.openjdk.bench.vm.lang.LockUnlock.testContendedLock":
>    597.855 ?(99.9%) 73.183 ns/op [Average]
>    (min, avg, max) = (438.862, 597.855, 861.028), stdev = 97.697
>    CI (99.9%): [524.672, 671.038] (assumes normal distribution)
> 
> With patch:
> 
> Result "org.openjdk.bench.vm.lang.LockUnlock.testContendedLock":
>    219.067 ?(99.9%) 21.146 ns/op [Average]
>    (min, avg, max) = (176.379, 219.067, 300.186), stdev = 28.229
>    CI (99.9%): [197.921, 240.212] (assumes normal distribution)
> 
> This is with -XX:+UseLSE, -UseLSE has a similar improvement.
> 
> Thanks,
> Nick

From Nick.Gasson at arm.com  Wed Jan  9 02:50:53 2019
From: Nick.Gasson at arm.com (Nick Gasson (Arm Technology China))
Date: Wed, 9 Jan 2019 02:50:53 +0000
Subject: [aarch64-port-dev ] RFR: 8216350: AArch64: monitor unlock fast
 path not called
In-Reply-To: <MN2PR18MB2733F0D50D0E61B4EFE1D178D28B0@MN2PR18MB2733.namprd18.prod.outlook.com>
References: <MN2PR18MB2733F0D50D0E61B4EFE1D178D28B0@MN2PR18MB2733.namprd18.prod.outlook.com>
Message-ID: <680089e7-ec26-a4cd-6143-4d36182e971a@arm.com>

Hi Derek

> My only comments are to fix up some comments (pre-existing), and some trivial cleanups of pre-existing code. These are judgement calls, and it would be good to get the approval of at least one Andrew.

I agree all of these are good, especially #3 which obscures the symmetry 
between the lock and unlock functions. But I think we ought to create a 
separate patch, to separate code cleanup with no functional change from 
this patch which is a bug fix / functional change?

Also two minor things:

* There is inconsistent (four space) indentation under "// Check if it 
is still a light weight lock ..." in aarch64_enc_fast_unlock.

* At the end of aarch64_enc_fast_lock there is a commented out block "// 
PPC port checks the following invariants": I guess we should either 
implement these if we think they're useful or delete this whole block. 
(FWIW x86 doesn't do any extra checking #ifdef ASSERT).

Finally we could also consider moving these two functions into 
macroAssembler_aarch64.cpp to match the other ports.

Thanks,
Nick

On 09/01/2019 10:19, Derek White wrote:
> Hi Nick,
> 
> Very nice find!
> 
> My only comments are to fix up some comments (pre-existing), and some trivial cleanups of pre-existing code. These are judgement calls, and it would be good to get the approval of at least one Andrew.
> 
> Comments:
> 1) The TODO comment before aarch64_enc_fast_unlock() has been done since 2014, so it can go away.
> 
> 2) In aarch64_enc_fast_lock() and aarch64_enc_fast_unlock(), there are three comment blocks referring to old code using cmpxchgptr. At this point in time I find the new code clearer, and these comments don't add much?
> 
> Cleanup suggestions (untested!):
> 3) In aarch64_enc_fast_lock():
>      // we can use AArch64's bit test and branch here but
>      // markoopDesc does not define a bit index just the bit value
>      // so assert in case the bit pos changes
> #   define __monitor_value_log2 1
>      assert(markOopDesc::monitor_value == (1 << __monitor_value_log2), "incorrect bit position");
>      __ tbnz(disp_hdr, __monitor_value_log2, object_has_monitor);
> #   undef __monitor_value_log2
> 
> Can be replaced with:
>       __ tbnz(disp_hdr, exact_log2(markOopDesc::monitor_value), object_has_monitor);
> It looks like this was fixed in several places a long time ago, but this one got missed.
> 
> 4)  Slightly better comment for last instruction of fast_unlock (and explicitly use zr).
>      __ stlr(zr, tmp); // set unowned
> 
> - Derek
> 
> 
> --------------------- Patch on original code (not your patch, sorry!) -----------------------------
> --- src/hotspot/cpu/aarch64/aarch64.ad
> +++ src/hotspot/cpu/aarch64/aarch64.ad
> @@ -3418,13 +3418,7 @@
>       }
>   
>       // Handle existing monitor
> -    // we can use AArch64's bit test and branch here but
> -    // markoopDesc does not define a bit index just the bit value
> -    // so assert in case the bit pos changes
> -#   define __monitor_value_log2 1
> -    assert(markOopDesc::monitor_value == (1 << __monitor_value_log2), "incorrect bit position");
> -    __ tbnz(disp_hdr, __monitor_value_log2, object_has_monitor);
> -#   undef __monitor_value_log2
> +    __ tbnz(disp_hdr, exact_log2(markOopDesc::monitor_value), object_has_monitor);
>   
>       // Set displaced_header to be (markOop of object | UNLOCK_VALUE).
>       __ orr(disp_hdr, disp_hdr, markOopDesc::unlocked_value);
> @@ -3455,14 +3449,6 @@
>         __ b(retry_load);
>       }
>   
> -    // Formerly:
> -    // __ cmpxchgptr(/*oldv=*/disp_hdr,
> -    //               /*newv=*/box,
> -    //               /*addr=*/oop,
> -    //               /*tmp=*/tmp,
> -    //               cont,
> -    //               /*fail*/NULL);
> -
>       assert(oopDesc::mark_offset_in_bytes() == 0, "offset of _mark is not 0");
>   
>       // If the compare-and-exchange succeeded, then we found an unlocked
> @@ -3511,15 +3497,6 @@
>         __ bind(fail);
>       }
>   
> -    // Label next;
> -    // __ cmpxchgptr(/*oldv=*/disp_hdr,
> -    //               /*newv=*/rthread,
> -    //               /*addr=*/tmp,
> -    //               /*tmp=*/rscratch1,
> -    //               /*succeed*/next,
> -    //               /*fail*/NULL);
> -    // __ bind(next);
> -
>       // store a non-null value into the box.
>       __ str(box, Address(box, BasicLock::displaced_header_offset_in_bytes()));
>   
> @@ -3544,9 +3521,6 @@
>   
>     %}
>   
> -  // TODO
> -  // reimplement this with custom cmpxchgptr code
> -  // which avoids some of the unnecessary branching
>     enc_class aarch64_enc_fast_unlock(iRegP object, iRegP box, iRegP tmp, iRegP tmp2) %{
>       MacroAssembler _masm(&cbuf);
>       Register oop = as_Register($object$$reg);
> @@ -3597,12 +3571,6 @@
>           __ b(retry_load);
>         }
>   
> -    // __ cmpxchgptr(/*compare_value=*/box,
> -    //               /*exchange_value=*/disp_hdr,
> -    //               /*where=*/oop,
> -    //               /*result=*/tmp,
> -    //               cont,
> -    //               /*cas_failed*/NULL);
>       assert(oopDesc::mark_offset_in_bytes() == 0, "offset of _mark is not 0");
>   
>       __ bind(cas_failed);
> @@ -3626,7 +3594,7 @@
>       __ cbnz(rscratch1, cont);
>       // need a release store here
>       __ lea(tmp, Address(tmp, ObjectMonitor::owner_offset_in_bytes()));
> -    __ stlr(rscratch1, tmp); // rscratch1 is zero
> +    __ stlr(zr, tmp); // set unowned
>   
>       __ bind(cont);
>       // flag == EQ indicates success
> 
> 
>> -----Original Message-----
>> From: aarch64-port-dev <aarch64-port-dev-bounces at openjdk.java.net> On
>> Behalf Of Nick Gasson (Arm Technology China)
>> Sent: Tuesday, January 08, 2019 3:04 AM
>> To: hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-
>> dev at openjdk.java.net>
>> Cc: nd <nd at arm.com>; aarch64-port-dev at openjdk.java.net
>> Subject: [EXT] [aarch64-port-dev ] RFR: 8216350: AArch64: monitor unlock
>> fast path not called
>>
>> ----------------------------------------------------------------------
>> Hi,
>>
>> While looking at the profiling output of some micro-benchmarks for locking
>> on AArch64, I noticed that the monitor unlock fast-path in
>> aarch64_enc_fast_unlock in aarch64.ad (under label `object_has_monitor') is
>> almost never executed, even though the lock in the test is inflated.
>>
>> In order to branch to this fast-path we check if bit #1 is set in the displaced
>> header word on the stack:
>>
>>     __ tbnz(disp_hdr, exact_log2(markOopDesc::monitor_value),
>> object_has_monitor);
>>
>> But in the common case the value in the dhw is set by the monitor locking
>> fast-path in aarch64_enc_fast_lock, where we use the pointer to the dhw as
>> an arbitrary non-null value. But the lower three bits of this pointer will
>> always be zero, and so won't trigger the unlock fast-path which is looking for
>> bit #1 set, and we will fall through to call the runtime to unlock the monitor.
>>
>>     // store a non-null value into the box.
>>     __ str(box, Address(box, BasicLock::displaced_header_offset_in_bytes()));
>>
>> It seems that the unlock fast-path will only be executed when the monitor
>> was originally locked by the runtime (e.g. when the lock was first inflated),
>> because ObjectSynchronizer::slow_enter will store
>> markOopDesc::unused_mark into the dhw, and this value has bit #1 set.
>>
>> Can someone help me review this change to aarch64_enc_fast_lock to use
>> markOopDesc::unused_mark as the arbitrary non-null value rather than `box'
>> to match ObjectSynchronizer::slow_enter?
>>
>> Webrev: http://cr.openjdk.java.net/~njian/8216350/webrev.0/
>> Bug: https://bugs.openjdk.java.net/browse/JDK-8216350
>>
>> Also removed an unnecessary double branch in the unlock code.
>>
>> Ran jtreg + jcstress.
>>
>> I also added a new micro-benchmark to
>> test/micro/org/openjdk/bench/vm/lang/LockUnlock.java so you can see this
>> behaviour:
>>
>> Without patch:
>>
>> Result "org.openjdk.bench.vm.lang.LockUnlock.testContendedLock":
>>     597.855 ?(99.9%) 73.183 ns/op [Average]
>>     (min, avg, max) = (438.862, 597.855, 861.028), stdev = 97.697
>>     CI (99.9%): [524.672, 671.038] (assumes normal distribution)
>>
>> With patch:
>>
>> Result "org.openjdk.bench.vm.lang.LockUnlock.testContendedLock":
>>     219.067 ?(99.9%) 21.146 ns/op [Average]
>>     (min, avg, max) = (176.379, 219.067, 300.186), stdev = 28.229
>>     CI (99.9%): [197.921, 240.212] (assumes normal distribution)
>>
>> This is with -XX:+UseLSE, -UseLSE has a similar improvement.
>>
>> Thanks,
>> Nick

From Pengfei.Li at arm.com  Wed Jan  9 06:50:35 2019
From: Pengfei.Li at arm.com (Pengfei Li (Arm Technology China))
Date: Wed, 9 Jan 2019 06:50:35 +0000
Subject: [aarch64-port-dev ] RFR(S): 8214922: Add vectorization support
 for fmin/fmax
In-Reply-To: <DB7PR08MB3115836C7236BFBDC988D729968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
References: <DB7PR08MB3115055477478B62D825C36196BE0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <87d0pv2iow.fsf@redhat.com> <c836cf2e-20a9-0ec4-212d-72326fb144a6@redhat.com>
 <877eg32bzq.fsf@redhat.com> <b91e56a1-dd8f-d9f1-40e8-af5d8c3c0d9d@redhat.com>
 <871s6a3map.fsf@redhat.com>
 <DB7PR08MB3115C5F108D17017B98A1FF796B40@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <87va371n6b.fsf@redhat.com>
 <DB7PR08MB3115836C7236BFBDC988D729968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
Message-ID: <DB7PR08MB3115E7B38864A31A1F60DE50968B0@DB7PR08MB3115.eurprd08.prod.outlook.com>

Hi Andrew,

Do you have further comments on my 2nd min/max vectorization patch?

> > > http://cr.openjdk.java.net/~pli/rfr/8214922/webrev.01/
> >

--
Thanks,
Pengfei


From aph at redhat.com  Wed Jan  9 09:23:32 2019
From: aph at redhat.com (Andrew Haley)
Date: Wed, 9 Jan 2019 09:23:32 +0000
Subject: [aarch64-port-dev ] RFR: 8216350: AArch64: monitor unlock fast
 path not called
In-Reply-To: <680089e7-ec26-a4cd-6143-4d36182e971a@arm.com>
References: <MN2PR18MB2733F0D50D0E61B4EFE1D178D28B0@MN2PR18MB2733.namprd18.prod.outlook.com>
 <680089e7-ec26-a4cd-6143-4d36182e971a@arm.com>
Message-ID: <51023960-0e8f-56aa-20a4-279017251585@redhat.com>

On 1/9/19 2:50 AM, Nick Gasson (Arm Technology China) wrote:

> I agree all of these are good, especially #3 which obscures the
> symmetry between the lock and unlock functions. But I think we ought
> to create a separate patch, to separate code cleanup with no
> functional change from this patch which is a bug fix / functional
> change?

HotSpot policy is that we can do minor cleanups as we go along:
experience has shown that unless you do so, cruft tends to
accumulate. These cleanups are OK for this patch.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From felix.yang at huawei.com  Wed Jan  9 09:29:00 2019
From: felix.yang at huawei.com (Yangfei (Felix))
Date: Wed, 9 Jan 2019 09:29:00 +0000
Subject: [aarch64-port-dev ] RFR: 8216350: AArch64: monitor unlock fast
 path	not called
In-Reply-To: <19afbcdf-6d69-88ca-1794-c03e8e81f171@arm.com>
References: <19afbcdf-6d69-88ca-1794-c03e8e81f171@arm.com>
Message-ID: <DA41BE1DDCA941489001C7FBD7A8820ED5F3E33A@dggeml527-mbx.china.huawei.com>

> Webrev: http://cr.openjdk.java.net/~njian/8216350/webrev.0/
> Bug: https://bugs.openjdk.java.net/browse/JDK-8216350
> 
> Also removed an unnecessary double branch in the unlock code.
> 
> Ran jtreg + jcstress.
> 

Hi,

I think the Copyright year for this file also needs to be updated as you changed it : src/hotspot/cpu/aarch64/aarch64.ad 

Otherwise, LGTM(Not a Reviewer)

Thanks,
Felix

From tobias.hartmann at oracle.com  Wed Jan  9 09:33:08 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Wed, 9 Jan 2019 10:33:08 +0100
Subject: RFR (S) 8216375: Revert JDK-8145579 after JDK-8076988 is resolved
In-Reply-To: <916836c2-7a25-76ff-9fca-3ed0547a15c7@oracle.com>
References: <916836c2-7a25-76ff-9fca-3ed0547a15c7@oracle.com>
Message-ID: <4e6e41f0-7a8c-e42e-033c-4928f89ae79a@oracle.com>

Hi Eric,

looks good.

Best regards,
Tobias

On 08.01.19 20:27, Eric Caspole wrote:
> Hi everybody,
> Could I get reviews on this small change. As Dean suggested, now that JDK-8076988 to simplify the
> trivial method check is done, the change of JDK-8145579 is no longer needed, so this webrev reverts it.
> 
> This passed tier1 and tier2 testing.
> Thanks,
> Eric
> 
> JBS:
> https://bugs.openjdk.java.net/browse/JDK-8216375
> 
> Webrev:
> http://cr.openjdk.java.net/~ecaspole/JDK-8216375/01/webrev/

From Nick.Gasson at arm.com  Wed Jan  9 09:40:56 2019
From: Nick.Gasson at arm.com (Nick Gasson (Arm Technology China))
Date: Wed, 9 Jan 2019 09:40:56 +0000
Subject: [aarch64-port-dev ] RFR: 8216350: AArch64: monitor unlock fast
 path not called
In-Reply-To: <51023960-0e8f-56aa-20a4-279017251585@redhat.com>
References: <MN2PR18MB2733F0D50D0E61B4EFE1D178D28B0@MN2PR18MB2733.namprd18.prod.outlook.com>
 <680089e7-ec26-a4cd-6143-4d36182e971a@arm.com>
 <51023960-0e8f-56aa-20a4-279017251585@redhat.com>
Message-ID: <5edeae5f-972f-72a3-8589-b72180f67949@arm.com>

Hi Andrew,

On 09/01/2019 17:23, Andrew Haley wrote:
> HotSpot policy is that we can do minor cleanups as we go along:
> experience has shown that unless you do so, cruft tends to
> accumulate. These cleanups are OK for this patch.
> 

Sure. I'll test with the cleanups and send the updated webrev tomorrow.

Thanks,
Nick

From rwestrel at redhat.com  Wed Jan  9 09:59:51 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Wed, 09 Jan 2019 10:59:51 +0100
Subject: RFR(M): 8216135: C2 assert(!had_error) failed: bad dominance
Message-ID: <87ef9m178o.fsf@redhat.com>


http://cr.openjdk.java.net/~roland/8216135/webrev.00/

Range check elimination is applied to a loop and then the loop is
unrolled. After the loop is unrolled, the range of values for the
induction variable conflicts with a range check CastII (the loop is over
unrolled and the main loop would never be executed), the CastII's value
becomes top, a data path dies but the corresponding control path is kept
alive. This results in a broken graph.

This scenario is supposed to be caught by the skeleton predicates added
by 8193130 but it's not for 2 reasons:

1- With 8203915 & 8205033, Tobias extended skeleton predicates to cover
  not only the first value of the induction variable of the first loop
  iteration but also the last value of an unrolled loop. But his changes
  only apply to loop predicates, not range check elimination.

2- With 8203915 & 8205033, Tobias used an Opaque1 node as a place holder
  so on each unrolling, he could update the skeleton predicate with the
  new stride. The problem is that the Opaque1 node blocks type
  propagation and the skeleton predicate only has a chance to remove a
  dead main loop after loop opts are over. In the case of this bug, the
  CastII becomes dead before loop opts are finished.

The problem with 2- is that if the Opaque1 node is not added, on the
next unrolling there's no way to find what predicate and what part of
the predicate to update. The fix I propose, is to keep 3 predicates
after the first unrolling:

1 for the first value of the first iteration
1 for the last value of the last iteration, without an Opaque1 node
1 with an Opaque1 node that can be used as a template

On the next unrolling pass, the 1st and 2nd predicates above could have
been optimized out. Rather than try to locate and update the 2nd
predicate, the 1st and 2nd predicates are removed if they are found and,
once the code finds the 3rd predicate, it clones it once to produce the
check on the first value again and a second time to produce an updated
check on the new last value.

Roland.

From rkennke at redhat.com  Wed Jan  9 12:13:22 2019
From: rkennke at redhat.com (Roman Kennke)
Date: Wed, 9 Jan 2019 13:13:22 +0100
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
Message-ID: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>

While poking around x86_64.ad's cmovP instructions (because I needed it
for an experiment in Shenandoah), I noticed that 2 of them are
disabled/commented-out:  cmovP_mem and  cmovP_memU. This means that a
cmovp with a 2nd argument that is a LoadP will generate two instructions:

mov %r1, $mem
cmov %r2, %1

instead of just one:

cmov %r2, $mem

The comment there says that adlc doesn't compute the bottom-type
correctly, and that implicit null-checking is broken, but I couldn't
confirm either of those. I checked hg annotate, but the commented-out
block stems from revision #1 and cannot be traced to a bug or so.

I did notice a bug though: the two instructions would encode to cmov to
32bit register instead to 64bit register. I added the missing
REX_reg_reg_wide(dst, src) and now everything seems to work fine and
generated code looks better.

I cannot say if if this has performance implication. I suspect not. If
it has, it's probably miniscule improvement. I can't see how it could be
worse though.

http://cr.openjdk.java.net/~rkennke/JDK-8216392/webrev.00/

Testing: tier1 (hotspot/jdk/langtools) passes on linux-x86

WDYT?

Roman

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190109/a0f0ebac/signature.asc>

From nils.eliasson at oracle.com  Wed Jan  9 12:31:57 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Wed, 9 Jan 2019 13:31:57 +0100
Subject: [12] RFR(XS): 8215755: ZGC: split_barrier_thru_phi: check number of
 inputs of phi
Message-ID: <c7489a50-7881-996f-d2cc-d66bc4962f65@oracle.com>

Hi,

This fix adds a check of number of inputs before a check of a specific 
input.

Bug: https://bugs.openjdk.java.net/browse/JDK-8215755

Webrev: http://cr.openjdk.java.net/~neliasso/8215755

Please review,

Nils


From per.liden at oracle.com  Wed Jan  9 13:40:13 2019
From: per.liden at oracle.com (Per Liden)
Date: Wed, 9 Jan 2019 14:40:13 +0100
Subject: [12] RFR(XS): 8215755: ZGC: split_barrier_thru_phi: check number
 of inputs of phi
In-Reply-To: <c7489a50-7881-996f-d2cc-d66bc4962f65@oracle.com>
References: <c7489a50-7881-996f-d2cc-d66bc4962f65@oracle.com>
Message-ID: <92147126-c7cf-bb23-d37d-063b2d2461aa@oracle.com>

Looks good!

/Per

On 2019-01-09 13:31, Nils Eliasson wrote:
> Hi,
> 
> This fix adds a check of number of inputs before a check of a specific 
> input.
> 
> Bug: https://bugs.openjdk.java.net/browse/JDK-8215755
> 
> Webrev: http://cr.openjdk.java.net/~neliasso/8215755
> 
> Please review,
> 
> Nils
> 
> 

From tobias.hartmann at oracle.com  Wed Jan  9 13:59:00 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Wed, 9 Jan 2019 14:59:00 +0100
Subject: [12] RFR(XS): 8215755: ZGC: split_barrier_thru_phi: check number
 of inputs of phi
In-Reply-To: <c7489a50-7881-996f-d2cc-d66bc4962f65@oracle.com>
References: <c7489a50-7881-996f-d2cc-d66bc4962f65@oracle.com>
Message-ID: <780f64cb-e0c9-516b-3d53-29b527718a97@oracle.com>

Hi Nils,

looks good and trivial.

Best regards,
Tobias

On 09.01.19 13:31, Nils Eliasson wrote:
> Hi,
> 
> This fix adds a check of number of inputs before a check of a specific input.
> 
> Bug: https://bugs.openjdk.java.net/browse/JDK-8215755
> 
> Webrev: http://cr.openjdk.java.net/~neliasso/8215755
> 
> Please review,
> 
> Nils
> 
> 

From nils.eliasson at oracle.com  Wed Jan  9 14:05:08 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Wed, 9 Jan 2019 15:05:08 +0100
Subject: [12] RFR(XS): 8215755: ZGC: split_barrier_thru_phi: check number
 of inputs of phi
In-Reply-To: <780f64cb-e0c9-516b-3d53-29b527718a97@oracle.com>
References: <c7489a50-7881-996f-d2cc-d66bc4962f65@oracle.com>
 <780f64cb-e0c9-516b-3d53-29b527718a97@oracle.com>
Message-ID: <7be32373-7275-6adc-b7f1-04e2c77f3341@oracle.com>

Thanks Per and Tobias!

// Nils

On 2019-01-09 14:59, Tobias Hartmann wrote:
> Hi Nils,
>
> looks good and trivial.
>
> Best regards,
> Tobias
>
> On 09.01.19 13:31, Nils Eliasson wrote:
>> Hi,
>>
>> This fix adds a check of number of inputs before a check of a specific input.
>>
>> Bug: https://bugs.openjdk.java.net/browse/JDK-8215755
>>
>> Webrev: http://cr.openjdk.java.net/~neliasso/8215755
>>
>> Please review,
>>
>> Nils
>>
>>

From claes.redestad at oracle.com  Wed Jan  9 14:20:25 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Wed, 9 Jan 2019 15:20:25 +0100
Subject: RFR (trivial): 8216423: Remove FillDelaySlots
Message-ID: <40c7616a-9daf-975f-2dfd-b0ab9802f950@oracle.com>

Hi,

remove unused and unimplemented flag FillDelaySlots (not to be
confused with LIRFillDelaySlots).

Bug:   https://bugs.openjdk.java.net/browse/JDK-8216423
Patch:

diff -r 48d09a9f4d2b src/hotspot/share/runtime/globals.hpp
--- a/src/hotspot/share/runtime/globals.hpp	Tue Jan 08 10:29:02 2019 -0500
+++ b/src/hotspot/share/runtime/globals.hpp	Wed Jan 09 15:09:26 2019 +0100
@@ -1330,9 +1330,6 @@
    develop(bool, TypeProfileCasts,  true, 
      \
            "treat casts like calls for purposes of type profiling") 
      \
 
      \
-  develop(bool, FillDelaySlots, true, 
     \
-          "Fill delay slots (on SPARC only)") 
     \
- 
     \
    develop(bool, TimeLivenessAnalysis, false, 
      \
            "Time computation of bytecode liveness analysis") 
      \
 
      \


Thanks!

/Claes

From tobias.hartmann at oracle.com  Wed Jan  9 14:17:58 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Wed, 9 Jan 2019 15:17:58 +0100
Subject: RFR (trivial): 8216423: Remove FillDelaySlots
In-Reply-To: <40c7616a-9daf-975f-2dfd-b0ab9802f950@oracle.com>
References: <40c7616a-9daf-975f-2dfd-b0ab9802f950@oracle.com>
Message-ID: <6ae8eeec-eceb-c2eb-42f7-d6d396362ec3@oracle.com>

Hi Claes,

looks good.

Best regards,
Tobias

On 09.01.19 15:20, Claes Redestad wrote:
> Hi,
> 
> remove unused and unimplemented flag FillDelaySlots (not to be
> confused with LIRFillDelaySlots).
> 
> Bug:?? https://bugs.openjdk.java.net/browse/JDK-8216423
> Patch:
> 
> diff -r 48d09a9f4d2b src/hotspot/share/runtime/globals.hpp
> --- a/src/hotspot/share/runtime/globals.hpp??? Tue Jan 08 10:29:02 2019 -0500
> +++ b/src/hotspot/share/runtime/globals.hpp??? Wed Jan 09 15:09:26 2019 +0100
> @@ -1330,9 +1330,6 @@
> ?? develop(bool, TypeProfileCasts,? true, ???? \
> ?????????? "treat casts like calls for purposes of type profiling") ???? \
> 
> ???? \
> -? develop(bool, FillDelaySlots, true, ??? \
> -????????? "Fill delay slots (on SPARC only)") ??? \
> - ??? \
> ?? develop(bool, TimeLivenessAnalysis, false, ???? \
> ?????????? "Time computation of bytecode liveness analysis") ???? \
> 
> ???? \
> 
> 
> Thanks!
> 
> /Claes

From claes.redestad at oracle.com  Wed Jan  9 14:41:00 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Wed, 9 Jan 2019 15:41:00 +0100
Subject: RFR (trivial): 8216423: Remove FillDelaySlots
In-Reply-To: <6ae8eeec-eceb-c2eb-42f7-d6d396362ec3@oracle.com>
References: <40c7616a-9daf-975f-2dfd-b0ab9802f950@oracle.com>
 <6ae8eeec-eceb-c2eb-42f7-d6d396362ec3@oracle.com>
Message-ID: <e2a8dbf2-8db6-494d-9c99-6fdb47611d4c@oracle.com>

Thanks, Tobias!

/Claes

On 2019-01-09 15:17, Tobias Hartmann wrote:
> Hi Claes,
> 
> looks good.
> 
> Best regards,
> Tobias
> 
> On 09.01.19 15:20, Claes Redestad wrote:
>> Hi,
>>
>> remove unused and unimplemented flag FillDelaySlots (not to be
>> confused with LIRFillDelaySlots).
>>
>> Bug:?? https://bugs.openjdk.java.net/browse/JDK-8216423
>> Patch:
>>
>> diff -r 48d09a9f4d2b src/hotspot/share/runtime/globals.hpp
>> --- a/src/hotspot/share/runtime/globals.hpp??? Tue Jan 08 10:29:02 2019 -0500
>> +++ b/src/hotspot/share/runtime/globals.hpp??? Wed Jan 09 15:09:26 2019 +0100
>> @@ -1330,9 +1330,6 @@
>>  ?? develop(bool, TypeProfileCasts,? true, ???? \
>>  ?????????? "treat casts like calls for purposes of type profiling") ???? \
>>
>>  ???? \
>> -? develop(bool, FillDelaySlots, true, ??? \
>> -????????? "Fill delay slots (on SPARC only)") ??? \
>> - ??? \
>>  ?? develop(bool, TimeLivenessAnalysis, false, ???? \
>>  ?????????? "Time computation of bytecode liveness analysis") ???? \
>>
>>  ???? \
>>
>>
>> Thanks!
>>
>> /Claes

From dmitrij.pochepko at bell-sw.com  Wed Jan  9 14:50:54 2019
From: dmitrij.pochepko at bell-sw.com (Dmitrij Pochepko)
Date: Wed, 9 Jan 2019 17:50:54 +0300
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <efb075f1-5815-b2e0-31eb-3dfb8a841528@bell-sw.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
 <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <efb075f1-5815-b2e0-31eb-3dfb8a841528@bell-sw.com>
Message-ID: <07582b62-ccdf-97c8-5bd9-f441b488fa03@bell-sw.com>

Hi all,


here is my version of this patch consisting of single "sub" instruction 
(I haven't changed test): 
http://cr.openjdk.java.net/~dpochepk/8215792/webrev.01/

cnt2 is a counter for characters yet to be checked. So, instead of 
checking all characters in source string for first character match 
(which was initial reason for this bug), now it check (str2len - str1len 
+ 1).


Actually I think this "sub" instruction was initially lost while working 
on this instrinsic and moving this instruction between this block 
(generate_string_indexof_linear) and caller code. Regular tests couldn't 
catch this problem.


I run some testing to ensure regular usecases are not affected and it 
seems fine. Affected testcase and your test pass as well.


btw: now this code is even faster, because less characters will be 
loaded and checked


Thanks,

Dmitrij

On 04/01/2019 3:52 PM, Dmitrij Pochepko wrote:
> Sure.
>
> I could miss something, so, need to try it. I'll send webrev with 
> patch once it's done.
>
>
> Thanks,
>
> Dmitrij
>
>
> On 04.01.2019 14:04, Pengfei Li (Arm Technology China) wrote:
>> Hi Dmitrij,
>>
>> Thanks a lot for your reply.
>>
>>> since cnt2 is used as counter, wouldn't it be easier and shorter 
>>> just to substract cnt1 from cnt2 at the beginning of this code. 
>>> Total (cnt2 - cnt1 +1) combinations must be checked. That is why 
>>> first sustraction is by (wordSize/str2_chr_size - 1).
>>> Then whole fix will be probably just 1 line at the beginning: 
>>> sub(cnt2, cnt2, cnt1);
>> I don't think the whole fix could be as easy as "sub(cnt2, cnt2, 
>> cnt1)" because cnt2 is the counter which counts number of bytes not 
>> processed. It could be different from the number of bytes after 
>> current first-character-match index.
>>
>> But this is just my thought. Perhaps I didn't understand your idea 
>> and code thoroughly. So could you post your shorter fix and let's 
>> test if it's right?
>>
>> -- 
>> Thanks,
>> Pengfei
>>
>

From claes.redestad at oracle.com  Wed Jan  9 15:10:40 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Wed, 9 Jan 2019 16:10:40 +0100
Subject: RFR: 8216424: Remove or clean up TimeLivenessAnalysis
Message-ID: <9ab53da1-d6d8-d5f8-10b6-da960444aa6c@oracle.com>

Hi,

implementation for the develop flag TimeLivenessAnalysis leaves a few
breadcrumbs in product builds (in particular TraceTime
constructors/destructors aren't being inlined, so the compiler doesn't
realize these objects aren't actually doing anything)

Bug: https://bugs.openjdk.java.net/browse/JDK-8216424

This should be either cleaned up:
http://cr.openjdk.java.net/~redestad/8216424/cleanup.00/

.. or the flag should be removed altogether:
http://cr.openjdk.java.net/~redestad/8216424/remove.00/

I favor removal since the statistics collected by this analysis does
not seem very useful and any real performance effect could/should be
estimated using real profiling tools on product builds, anyhow.

Thanks!

/Claes

From adinn at redhat.com  Wed Jan  9 15:55:59 2019
From: adinn at redhat.com (Andrew Dinn)
Date: Wed, 9 Jan 2019 15:55:59 +0000
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <07582b62-ccdf-97c8-5bd9-f441b488fa03@bell-sw.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
 <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <efb075f1-5815-b2e0-31eb-3dfb8a841528@bell-sw.com>
 <07582b62-ccdf-97c8-5bd9-f441b488fa03@bell-sw.com>
Message-ID: <79558b49-6375-f0d4-1278-f66a4f470b13@redhat.com>

Hello Dmitrij,

On 09/01/2019 14:50, Dmitrij Pochepko wrote:
> here is my version of this patch consisting of single "sub" instruction
> (I haven't changed test):
> http://cr.openjdk.java.net/~dpochepk/8215792/webrev.01/
> 
> cnt2 is a counter for characters yet to be checked. So, instead of
> checking all characters in source string for first character match
> (which was initial reason for this bug), now it check (str2len - str1len
> + 1).

That looks like a simpler fix than Pengfei's although I think his is
also correct. However, when I say 'correct' note that I can only make
that judgement relative to this current bug. I have no confidence that
there are no other bugs in your code.

> Actually I think this "sub" instruction was initially lost while working
> on this instrinsic and moving this instruction between this block
> (generate_string_indexof_linear) and caller code. Regular tests couldn't
> catch this problem.

That's a somewhat contentious and, I would suggest, dubious statement.
If you design code based on some algorithm -- especially a complex one
like the one employed here -- then you need to put at least as much work
into designing tests that check for problems in the encoding of that
algorithm as you put into the code. 'Regular' is rather a weasel word to
use at this point when it is clear that the test provision was not adequate.

Having looked at your code I am at a loss to see how it is accurately
described by the piece of C code -- i.e. the original Boyer-Moore
algorithm -- that sits in macroAssembler_aarch64.cpp and purports to
explain it. As happened with the trig/log code, your code actually
follows an algorithm that is significantly more complex that that C
original. Also, once again, it employs various coding tricks that are
not explained at all. The latter can be understood with study but proper
commenting would make maintenance and bug-fixing much easier and
quicker. This is exactly the same problem and just as major a problem as
it was with the trig/log code for *all* the same reasons.

> I run some testing to ensure regular usecases are not affected and it
> seems fine. Affected testcase and your test pass as well.

'some testing'? I'd really like to have full details of those tests.
Ideally, they should be comprehensive. That really means they should
come with a test plan that identifies all the different possible paths
through the code and provides a measure of the coverage the tests
actually provide that is high enough to instil some confidence in the
testing. There are indeed quite a few such paths (not just in the stubs
but also the intrinsics that cover the small cases) so I would expect
the test plan and test suite to be fairly large. Do you have such a test
plan and suite?

Given your previous lack of success at testing your own code I'm not at
all happy to accept your say so that 'oh, the code is fine'. I'm
currently more inclined to ask you to revert your first patch and go
back to the original Boyer-Moore code we had before you injected this
bug (and who knows what others?).

> btw: now this code is even faster, because less characters will be
> loaded and checked
Well, of course, you could make it even faster by deleting half the
code. If you don't place too much priority on correctness you can
achieve incredible performance.

Unfortunately, speed has to be secondary to correctness. So, you need to
stop concentrating on shaving cycles and concentrate on writing
readable, maintainable code that clearly implements a well-defined
algorithm. Can you provide any credible assurance that this code is
worth keeping? If not then I'd personally recommend reversion of all
your changes. Of course, I'll see what Andrew Haley has to say before
pressing for that action.

regards,


Andrew Dinn
-----------
Senior Principal Software Engineer
Red Hat UK Ltd
Registered in England and Wales under Company Registration No. 03798903
Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From adinn at redhat.com  Wed Jan  9 16:02:08 2019
From: adinn at redhat.com (Andrew Dinn)
Date: Wed, 9 Jan 2019 16:02:08 +0000
Subject: [aarch64-port-dev ] RFR(S): 8214922: Add vectorization support
 for fmin/fmax
In-Reply-To: <DB7PR08MB3115E7B38864A31A1F60DE50968B0@DB7PR08MB3115.eurprd08.prod.outlook.com>
References: <DB7PR08MB3115055477478B62D825C36196BE0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <87d0pv2iow.fsf@redhat.com> <c836cf2e-20a9-0ec4-212d-72326fb144a6@redhat.com>
 <877eg32bzq.fsf@redhat.com> <b91e56a1-dd8f-d9f1-40e8-af5d8c3c0d9d@redhat.com>
 <871s6a3map.fsf@redhat.com>
 <DB7PR08MB3115C5F108D17017B98A1FF796B40@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <87va371n6b.fsf@redhat.com>
 <DB7PR08MB3115836C7236BFBDC988D729968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <DB7PR08MB3115E7B38864A31A1F60DE50968B0@DB7PR08MB3115.eurprd08.prod.outlook.com>
Message-ID: <40d1a9a7-47f3-4e13-032d-70932b03d215@redhat.com>

Hi Pengfei,

On 09/01/2019 06:50, Pengfei Li (Arm Technology China) wrote:
> Hi Andrew,
> 
> Do you have further comments on my 2nd min/max vectorization patch?
> 
>>>> http://cr.openjdk.java.net/~pli/rfr/8214922/webrev.01/
I am ok with this version of the patch. If the use of the max/min2F
rules doesn't cause any regressions on all the architectures you tested
then it is probably ok to push it.

However, that said, I'm not clear what you mean by one comment:

"BTW: I'm also struggling to find a simple JMH case which can trigger
reduction auto-vectorization."

Do you mean that you have not been able to exercise the reduction code
at all? Or is it just that you cannot get it to work in a JMH test?

Obviously, it would be better if we would provide a JMH test that does
work. I'll see if I can provide a test.

regards,


Andrew Dinn
-----------
Senior Principal Software Engineer
Red Hat UK Ltd
Registered in England and Wales under Company Registration No. 03798903
Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From tobias.hartmann at oracle.com  Wed Jan  9 16:54:17 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Wed, 9 Jan 2019 17:54:17 +0100
Subject: RFR: 8216424: Remove or clean up TimeLivenessAnalysis
In-Reply-To: <9ab53da1-d6d8-d5f8-10b6-da960444aa6c@oracle.com>
References: <9ab53da1-d6d8-d5f8-10b6-da960444aa6c@oracle.com>
Message-ID: <c713a6f7-8a3c-709d-6770-cae518b2ec06@oracle.com>

Hi Claes,

Both webrevs look good to me but I would prefer removal as well. I haven't ever seen anyone using
that flag but let's wait for more opinions.

Best regards,
Tobias

On 09.01.19 16:10, Claes Redestad wrote:
> Hi,
> 
> implementation for the develop flag TimeLivenessAnalysis leaves a few
> breadcrumbs in product builds (in particular TraceTime
> constructors/destructors aren't being inlined, so the compiler doesn't
> realize these objects aren't actually doing anything)
> 
> Bug: https://bugs.openjdk.java.net/browse/JDK-8216424
> 
> This should be either cleaned up:
> http://cr.openjdk.java.net/~redestad/8216424/cleanup.00/
> 
> .. or the flag should be removed altogether:
> http://cr.openjdk.java.net/~redestad/8216424/remove.00/
> 
> I favor removal since the statistics collected by this analysis does
> not seem very useful and any real performance effect could/should be
> estimated using real profiling tools on product builds, anyhow.
> 
> Thanks!
> 
> /Claes

From claes.redestad at oracle.com  Wed Jan  9 17:33:17 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Wed, 9 Jan 2019 18:33:17 +0100
Subject: RFR: 8216424: Remove or clean up TimeLivenessAnalysis
In-Reply-To: <c713a6f7-8a3c-709d-6770-cae518b2ec06@oracle.com>
References: <9ab53da1-d6d8-d5f8-10b6-da960444aa6c@oracle.com>
 <c713a6f7-8a3c-709d-6770-cae518b2ec06@oracle.com>
Message-ID: <d10174eb-ff28-daf6-4ab5-758fdbb29efe@oracle.com>

Hi Tobias,

On 2019-01-09 17:54, Tobias Hartmann wrote:
> Hi Claes,
> 
> Both webrevs look good to me but I would prefer removal as well. I haven't ever seen anyone using
> that flag but let's wait for more opinions.

thanks, and your vote in favor of removal has been noted. I'll wait a
day or two for other opinions. :-)

/Claes

From igor.ignatyev at oracle.com  Wed Jan  9 21:48:15 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 9 Jan 2019 13:48:15 -0800
Subject: RFR(T) : 8216441 : problem list
 org.graalvm.compiler.hotspot.test.ExplicitExceptionTest
Message-ID: <D34FA1C4-DE4A-47B4-ADBD-4F5C8FC97DE8@oracle.com>

http://cr.openjdk.java.net/~iignatyev//8216441/webrev.00/index.html
> 2 lines changed: 2 ins; 0 del; 0 mod; 

Hi all,

could you please review this tiny and trivial patch which problem list graal unit test ExplicitExceptionTest till 8213249 is fixed?

webrev: http://cr.openjdk.java.net/~iignatyev//8216441/webrev.00/index.html
JBS: https://bugs.openjdk.java.net/browse/JDK-8216441

Thanks,
-- Igor

From david.holmes at oracle.com  Thu Jan 10 01:07:36 2019
From: david.holmes at oracle.com (David Holmes)
Date: Thu, 10 Jan 2019 11:07:36 +1000
Subject: RFR(T) : 8216441 : problem list
 org.graalvm.compiler.hotspot.test.ExplicitExceptionTest
In-Reply-To: <D34FA1C4-DE4A-47B4-ADBD-4F5C8FC97DE8@oracle.com>
References: <D34FA1C4-DE4A-47B4-ADBD-4F5C8FC97DE8@oracle.com>
Message-ID: <a5e53a0f-738c-81af-bacf-55369b32e0ac@oracle.com>

Hi Igor,

+ org.graalvm.compiler.hotspot.test.ExplicitExceptionTest          8216441

The bug id should be the bug that will fix the underlying problem 
(8213249), not the bug used to update the problem-list.

Thanks,
David

> Hi all,
> 
> could you please review this tiny and trivial patch which problem list graal unit test ExplicitExceptionTest till 8213249 is fixed?
> 
> webrev: http://cr.openjdk.java.net/~iignatyev//8216441/webrev.00/index.html
> JBS: https://bugs.openjdk.java.net/browse/JDK-8216441

From igor.ignatyev at oracle.com  Thu Jan 10 01:13:02 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 9 Jan 2019 17:13:02 -0800
Subject: RFR(T) : 8216441 : problem list
 org.graalvm.compiler.hotspot.test.ExplicitExceptionTest
In-Reply-To: <a5e53a0f-738c-81af-bacf-55369b32e0ac@oracle.com>
References: <D34FA1C4-DE4A-47B4-ADBD-4F5C8FC97DE8@oracle.com>
 <a5e53a0f-738c-81af-bacf-55369b32e0ac@oracle.com>
Message-ID: <648BF524-C3AE-4E6B-95F4-DA9DB8FFEB4A@oracle.com>

HI David,

thanks for spotting that, apparently I copy-pasted the wrong id. corrected and pushed.

-- Igor

> On Jan 9, 2019, at 5:07 PM, David Holmes <david.holmes at oracle.com> wrote:
> 
> Hi Igor,
> 
> + org.graalvm.compiler.hotspot.test.ExplicitExceptionTest          8216441
> 
> The bug id should be the bug that will fix the underlying problem (8213249), not the bug used to update the problem-list.
> 
> Thanks,
> David
> 
>> Hi all,
>> could you please review this tiny and trivial patch which problem list graal unit test ExplicitExceptionTest till 8213249 is fixed?
>> webrev: http://cr.openjdk.java.net/~iignatyev//8216441/webrev.00/index.html
>> JBS: https://bugs.openjdk.java.net/browse/JDK-8216441


From aph at redhat.com  Thu Jan 10 09:18:10 2019
From: aph at redhat.com (Andrew Haley)
Date: Thu, 10 Jan 2019 09:18:10 +0000
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
Message-ID: <ba49e1b8-b7c8-b804-f973-6e88cb2bcd8a@redhat.com>

On 1/9/19 12:13 PM, Roman Kennke wrote:
> I cannot say if if this has performance implication. I suspect not. If
> it has, it's probably miniscule improvement. I can't see how it could be
> worse though.

I can. x86 can have some very weird performance characteristics. It'd be
helpful to do some measurement.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From tobias.hartmann at oracle.com  Thu Jan 10 09:45:40 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Thu, 10 Jan 2019 10:45:40 +0100
Subject: RFR(M): 8216135: C2 assert(!had_error) failed: bad dominance
In-Reply-To: <87ef9m178o.fsf@redhat.com>
References: <87ef9m178o.fsf@redhat.com>
Message-ID: <7a2054ff-8bc8-7a5f-3233-4e45a3a577f8@oracle.com>

Hi Roland,

Nice analysis! Still took me a while to review but the fix looks good (I've also executed some
extended testing and all passed). Let's hope this skeleton predicate stuff is finally stable.

A 2nd review would be good.

Best regards,
Tobias

From Pengfei.Li at arm.com  Thu Jan 10 10:53:48 2019
From: Pengfei.Li at arm.com (Pengfei Li (Arm Technology China))
Date: Thu, 10 Jan 2019 10:53:48 +0000
Subject: [aarch64-port-dev ] RFR(S): 8214922: Add vectorization support
 for fmin/fmax
In-Reply-To: <40d1a9a7-47f3-4e13-032d-70932b03d215@redhat.com>
References: <DB7PR08MB3115055477478B62D825C36196BE0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <87d0pv2iow.fsf@redhat.com> <c836cf2e-20a9-0ec4-212d-72326fb144a6@redhat.com>
 <877eg32bzq.fsf@redhat.com> <b91e56a1-dd8f-d9f1-40e8-af5d8c3c0d9d@redhat.com>
 <871s6a3map.fsf@redhat.com>
 <DB7PR08MB3115C5F108D17017B98A1FF796B40@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <87va371n6b.fsf@redhat.com>
 <DB7PR08MB3115836C7236BFBDC988D729968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <DB7PR08MB3115E7B38864A31A1F60DE50968B0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <40d1a9a7-47f3-4e13-032d-70932b03d215@redhat.com>
Message-ID: <DB7PR08MB3115418F2B1993557DF7EA6196840@DB7PR08MB3115.eurprd08.prod.outlook.com>

Hi adinn, roland,

Sorry that I uploaded a new webrev for this today because I found that I made a mistake hidden in vectornode.cpp.
http://cr.openjdk.java.net/~pli/rfr/8214922/webrev.02/

The reduction min/max ops do not correspond to the original ones correctly in below part of code.
+    case Op_MinF:
+      assert(bt == T_FLOAT, "must be");
+      vopc = Op_MinReductionV;
+      break;
+    case Op_MinD:
+      assert(bt == T_DOUBLE, "must be");
+      vopc = Op_MaxReductionV;
+      break;
+    case Op_MaxF:
+      assert(bt == T_FLOAT, "must be");
+      vopc = Op_MinReductionV;
+      break;
+    case Op_MaxD:
+      assert(bt == T_DOUBLE, "must be");
+      vopc = Op_MaxReductionV;
+      break;

I've fixed it in my 3rd webrev. So could you help review it again?

And for Andrew Dinn's question:
> Do you mean that you have not been able to exercise the reduction code at
> all? Or is it just that you cannot get it to work in a JMH test?
> 
> Obviously, it would be better if we would provide a JMH test that does work.
> I'll see if I can provide a test.

I mean that I tried it hard and finally find one that works. As Vladimir Ivanov said the simple reduction auto-vectorization is disabled in current JDK, so we have to construct that in a more complex code shape. Below code in my previous uploaded JMH case[1] could generate the min/max reduction instructions.

      for (int i = 0; i < LENGTH; i++) {
        min = Math.min(min, fa[i] + fb[i]);
      }

Part of disassembly outputted by JMH perfasm is like below.
0x0000ffff9cca1650: fminv    s18, v19.4s
0x0000ffff9cca1654: fmin s18, s18, s16
0x0000ffff9cca1658: fminv    s19, v20.4s
0x0000ffff9cca165c: fmin s19, s19, s18
0x0000ffff9cca1660: fminv    s16, v22.4s
0x0000ffff9cca1664: fmin s16, s16, s19
0x0000ffff9cca1668: fminv    s19, v21.4s
0x0000ffff9cca166c: fmin s19, s19, s16

[1] http://cr.openjdk.java.net/~pli/rfr/8214922/TestSIMDFpMinMax.java

--
Thanks,
Pengfei


From rwestrel at redhat.com  Thu Jan 10 11:27:25 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Thu, 10 Jan 2019 12:27:25 +0100
Subject: RFR(M): 8216135: C2 assert(!had_error) failed: bad dominance
In-Reply-To: <7a2054ff-8bc8-7a5f-3233-4e45a3a577f8@oracle.com>
References: <87ef9m178o.fsf@redhat.com>
 <7a2054ff-8bc8-7a5f-3233-4e45a3a577f8@oracle.com>
Message-ID: <87tvigzraa.fsf@redhat.com>


Hi Tobias,

> Nice analysis! Still took me a while to review but the fix looks good (I've also executed some
> extended testing and all passed). Let's hope this skeleton predicate stuff is finally stable.

Thanks for the review.

FTR, I will also need to undo:

http://hg.openjdk.java.net/jdk/jdk12/rev/ea921dca7f33

when I push this.

Roland.

From tobias.hartmann at oracle.com  Thu Jan 10 11:30:10 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Thu, 10 Jan 2019 12:30:10 +0100
Subject: RFR(M): 8216135: C2 assert(!had_error) failed: bad dominance
In-Reply-To: <87tvigzraa.fsf@redhat.com>
References: <87ef9m178o.fsf@redhat.com>
 <7a2054ff-8bc8-7a5f-3233-4e45a3a577f8@oracle.com> <87tvigzraa.fsf@redhat.com>
Message-ID: <8a5b0525-3218-b6bf-a7ad-7442d98c1a2e@oracle.com>


On 10.01.19 12:27, Roland Westrelin wrote:
> FTR, I will also need to undo:
> 
> http://hg.openjdk.java.net/jdk/jdk12/rev/ea921dca7f33
> 
> when I push this.

Yes, good catch.

Thanks,
Tobias

From tobias.hartmann at oracle.com  Thu Jan 10 11:47:42 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Thu, 10 Jan 2019 12:47:42 +0100
Subject: [13] RFR(T): 8216480: Typo in
 test/hotspot/jtreg/compiler/graalunit/README.md
Message-ID: <d888f754-0b04-4b6e-dba6-6c5adea59c85@oracle.com>

Hi,

please review the following trivial patch that fixes a typo in
https://bugs.openjdk.java.net/browse/JDK-8216480
http://cr.openjdk.java.net/~thartmann/8216480/webrev.00/

Thanks,
Tobias

From rwestrel at redhat.com  Thu Jan 10 12:59:07 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Thu, 10 Jan 2019 13:59:07 +0100
Subject: RFR(T): 8216482: Shenandoah: typo in
 ShenandoahBarrierSetC2::clone_barrier_at_expansion() causes failed
 compilations
Message-ID: <87o98ozn1g.fsf@redhat.com>


This was observed to sometimes hurt performance:

http://cr.openjdk.java.net/~roland/8216482/webrev.00/

Roland.

From rkennke at redhat.com  Thu Jan 10 13:01:46 2019
From: rkennke at redhat.com (Roman Kennke)
Date: Thu, 10 Jan 2019 14:01:46 +0100
Subject: RFR(T): 8216482: Shenandoah: typo in
 ShenandoahBarrierSetC2::clone_barrier_at_expansion() causes failed
 compilations
In-Reply-To: <87o98ozn1g.fsf@redhat.com>
References: <87o98ozn1g.fsf@redhat.com>
Message-ID: <bbabcc8f-c656-c810-e07f-ee4956f436d0@redhat.com>

Looks good to me. Also, trivial.

Thanks for spotting this!

Roman

> This was observed to sometimes hurt performance:
> 
> http://cr.openjdk.java.net/~roland/8216482/webrev.00/
> 
> Roland.
> 

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190110/b8693450/signature.asc>

From shade at redhat.com  Thu Jan 10 13:19:07 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Thu, 10 Jan 2019 14:19:07 +0100
Subject: RFR(T): 8216482: Shenandoah: typo in
 ShenandoahBarrierSetC2::clone_barrier_at_expansion() causes failed
 compilations
In-Reply-To: <bbabcc8f-c656-c810-e07f-ee4956f436d0@redhat.com>
References: <87o98ozn1g.fsf@redhat.com>
 <bbabcc8f-c656-c810-e07f-ee4956f436d0@redhat.com>
Message-ID: <15790b53-877f-95f2-b94a-a5b17168f875@redhat.com>

Looks good. This can be pushed to jdk/jdk12, I think.

-Aleksey

On 1/10/19 2:01 PM, Roman Kennke wrote:
> Looks good to me. Also, trivial.
> 
> Thanks for spotting this!
> 
> Roman
> 
>> This was observed to sometimes hurt performance:
>>
>> http://cr.openjdk.java.net/~roland/8216482/webrev.00/
>>
>> Roland.
>>
> 


-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 819 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190110/e2bd9d2f/signature.asc>

From tobias.hartmann at oracle.com  Thu Jan 10 13:17:10 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Thu, 10 Jan 2019 14:17:10 +0100
Subject: RFR(T): 8216482: Shenandoah: typo in
 ShenandoahBarrierSetC2::clone_barrier_at_expansion() causes failed
 compilations
In-Reply-To: <87o98ozn1g.fsf@redhat.com>
References: <87o98ozn1g.fsf@redhat.com>
Message-ID: <9027c1f5-f562-5abf-6495-645284122da6@oracle.com>

Hi Roland,

looks good.

Best regards,
Tobias

On 10.01.19 13:59, Roland Westrelin wrote:
> 
> This was observed to sometimes hurt performance:
> 
> http://cr.openjdk.java.net/~roland/8216482/webrev.00/
> 
> Roland.
> 

From rwestrel at redhat.com  Thu Jan 10 13:23:12 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Thu, 10 Jan 2019 14:23:12 +0100
Subject: RFR(T): 8216482: Shenandoah: typo in
 ShenandoahBarrierSetC2::clone_barrier_at_expansion() causes failed
 compilations
In-Reply-To: <9027c1f5-f562-5abf-6495-645284122da6@oracle.com>
References: <87o98ozn1g.fsf@redhat.com>
 <9027c1f5-f562-5abf-6495-645284122da6@oracle.com>
Message-ID: <87lg3szlxb.fsf@redhat.com>


Thanks for the review, Tobias. Oracle doesn't build Shenandoah, right?
so I can push that straight to jdk 12, no need to go through the submit
repo?

Roland.

From rwestrel at redhat.com  Thu Jan 10 13:23:31 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Thu, 10 Jan 2019 14:23:31 +0100
Subject: RFR(T): 8216482: Shenandoah: typo in
 ShenandoahBarrierSetC2::clone_barrier_at_expansion() causes failed
 compilations
In-Reply-To: <15790b53-877f-95f2-b94a-a5b17168f875@redhat.com>
References: <87o98ozn1g.fsf@redhat.com>
 <bbabcc8f-c656-c810-e07f-ee4956f436d0@redhat.com>
 <15790b53-877f-95f2-b94a-a5b17168f875@redhat.com>
Message-ID: <87imywzlws.fsf@redhat.com>


Thank for the reviews, Roman & Aleksey.

Roland.

From tobias.hartmann at oracle.com  Thu Jan 10 13:30:09 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Thu, 10 Jan 2019 14:30:09 +0100
Subject: RFR(T): 8216482: Shenandoah: typo in
 ShenandoahBarrierSetC2::clone_barrier_at_expansion() causes failed
 compilations
In-Reply-To: <87lg3szlxb.fsf@redhat.com>
References: <87o98ozn1g.fsf@redhat.com>
 <9027c1f5-f562-5abf-6495-645284122da6@oracle.com> <87lg3szlxb.fsf@redhat.com>
Message-ID: <51c87854-36c0-903e-c647-0c612ac42c5d@oracle.com>

On 10.01.19 14:23, Roland Westrelin wrote:
> Thanks for the review, Tobias. Oracle doesn't build Shenandoah, right?
> so I can push that straight to jdk 12, no need to go through the submit
> repo?

Shenandoah is build by default, right? And there are some tests that set -XX:+UseShenandoahGC, not
sure though if they are executed with the submit repo.

Best regards,
Tobias

From rwestrel at redhat.com  Thu Jan 10 13:44:57 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Thu, 10 Jan 2019 14:44:57 +0100
Subject: RFR(T): 8216482: Shenandoah: typo in
 ShenandoahBarrierSetC2::clone_barrier_at_expansion() causes failed
 compilations
In-Reply-To: <51c87854-36c0-903e-c647-0c612ac42c5d@oracle.com>
References: <87o98ozn1g.fsf@redhat.com>
 <9027c1f5-f562-5abf-6495-645284122da6@oracle.com> <87lg3szlxb.fsf@redhat.com>
 <51c87854-36c0-903e-c647-0c612ac42c5d@oracle.com>
Message-ID: <87ftu0zkx2.fsf@redhat.com>


> Shenandoah is build by default, right? And there are some tests that set -XX:+UseShenandoahGC, not
> sure though if they are executed with the submit repo.

Right, but there is this:

https://bugs.openjdk.java.net/browse/JDK-8215030
"Disable shenandoah in Oracle builds"

Roland.

From tobias.hartmann at oracle.com  Thu Jan 10 13:48:56 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Thu, 10 Jan 2019 14:48:56 +0100
Subject: RFR(T): 8216482: Shenandoah: typo in
 ShenandoahBarrierSetC2::clone_barrier_at_expansion() causes failed
 compilations
In-Reply-To: <87ftu0zkx2.fsf@redhat.com>
References: <87o98ozn1g.fsf@redhat.com>
 <9027c1f5-f562-5abf-6495-645284122da6@oracle.com> <87lg3szlxb.fsf@redhat.com>
 <51c87854-36c0-903e-c647-0c612ac42c5d@oracle.com> <87ftu0zkx2.fsf@redhat.com>
Message-ID: <76c0f61e-9c2f-8d76-2beb-52ba927ed14c@oracle.com>

You are right, we don't build it in our CI. Feel free to push your fix then!

Best regards,
Tobias


On 10.01.19 14:44, Roland Westrelin wrote:
> 
>> Shenandoah is build by default, right? And there are some tests that set -XX:+UseShenandoahGC, not
>> sure though if they are executed with the submit repo.
> 
> Right, but there is this:
> 
> https://bugs.openjdk.java.net/browse/JDK-8215030
> "Disable shenandoah in Oracle builds"
> 
> Roland.
> 

From rwestrel at redhat.com  Thu Jan 10 13:54:41 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Thu, 10 Jan 2019 14:54:41 +0100
Subject: RFR(T): 8216482: Shenandoah: typo in
 ShenandoahBarrierSetC2::clone_barrier_at_expansion() causes failed
 compilations
In-Reply-To: <76c0f61e-9c2f-8d76-2beb-52ba927ed14c@oracle.com>
References: <87o98ozn1g.fsf@redhat.com>
 <9027c1f5-f562-5abf-6495-645284122da6@oracle.com> <87lg3szlxb.fsf@redhat.com>
 <51c87854-36c0-903e-c647-0c612ac42c5d@oracle.com> <87ftu0zkx2.fsf@redhat.com>
 <76c0f61e-9c2f-8d76-2beb-52ba927ed14c@oracle.com>
Message-ID: <87d0p4zkgu.fsf@redhat.com>


> You are right, we don't build it in our CI. Feel free to push your fix then!

Thanks for confirming.

Roland.

From dmitrij.pochepko at bell-sw.com  Thu Jan 10 15:10:55 2019
From: dmitrij.pochepko at bell-sw.com (Dmitrij Pochepko)
Date: Thu, 10 Jan 2019 18:10:55 +0300
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <79558b49-6375-f0d4-1278-f66a4f470b13@redhat.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
 <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <efb075f1-5815-b2e0-31eb-3dfb8a841528@bell-sw.com>
 <07582b62-ccdf-97c8-5bd9-f441b488fa03@bell-sw.com>
 <79558b49-6375-f0d4-1278-f66a4f470b13@redhat.com>
Message-ID: <75d28ca7-9e80-4bd4-11a6-c858048e4380@bell-sw.com>


Hi Andrew,

I?ll focus on addressing your technical questions about testing this 
patch and intrinsic first.

By 'Regular' in previous email I meant all JCK and current jtreg tests 
which were also run [1]. This was to highlight the difference with 
IndexOfTest and IndexOfSameTest tests [2] developed for this intrinsic 
which was run for the original webrev and this patch. These tests cover 
all combinations of strings and substring lengths up to a specified 
length (IndexOfTest uses unique characters as padding and 
IndexOfSameTest is using first character from pattern to have cases with 
partial match tested). These tests are parameterized and require 
source_size parameter as first argument. Calling it with 30 and 300 
results in testing all modified indexof algorithms by brute force:

A. pattern size = 1: covered if source_size >= 1
B. pattern size = 2: covered if source_size >= 2
C. pattern size = 3: covered if source_size >= 3
D. special case with pattern size = 4: algorithm wasn't changed within 
the original webrev
E. pattern_size in [8, 256) and pattern_size < source_size/4: 
Boyer-Moore implementation: covered if source_size > 32. This is the 
algorithm that has the comment you mentioned in your email.
F. "pattern_size in [5, 8)" or "pattern_size in [8, 15] and pattern_size 
 >= source_size/4": Simple linear search, which loads and compares 
char-by-char: covered if source_size in [5, 32]
G. This is the one added by me. "pattern_size in [16, 256) and 
pattern_size >= source_size/4" or pattern_size >= 256. Block linear 
search (loads data by 8 byte chunks in search of first symbol): covered 
if source_size >= 16.

Below is listing of all branches in algorithm G and coverage test cases 
with sample parameter values when the branches are taken (test name, 
test parameter, pattern_size and expected_index inputs used in test 
during iterations, which leads to each branch 
taken/not_taken(fallthrough) at least once. Suffix U/L/UL denotes 
different encoding cases, where U = UTF-16 source string, L = Latin1 
source string, UL = both cases).

line 4399: __ br(__ LE, L_SMALL);??????????????????? // IndexOfTest with 
parameter 300: taken(UL): pattern_size=298, expected_index=0. Not 
taken(UL): pattern_size=250, expected_index=0
line 4410: __ br(__ NE, L_HAS_ZERO);???????????????? // IndexOfTest with 
parameter 300: taken(UL): pattern_size=250, expected_index=0. Not 
taken(UL): pattern_size=250, expected_index=-1
line 4414: __ br(__ LT, L_POST_LOOP);??????????????? // IndexOfTest with 
parameter 300: taken(U):? pattern_size=295, expected_index=5. Not 
taken(U):? pattern_size=250, expected_index=8
 ???????????????????????????????????????????????????? // IndexOfTest 
with parameter 300: taken(L):? pattern_size=290, expected_index=8. Not 
taken(L):? pattern_size=250, expected_index=8
line 4421: __ br(__ NE, L_HAS_ZERO);???????????????? // IndexOfTest with 
parameter 300: taken(U):? pattern_size=250, expected_index=5. Not 
taken(U):? pattern_size=250, expected_index=8
 ???????????????????????????????????????????????????? // IndexOfTest 
with parameter 300: taken(L):? pattern_size=250, expected_index=8. Not 
taken(L):? pattern_size=250, expected_index=16
line 4426: __ br(__ GE, L_LOOP);???????????????????? // IndexOfTest with 
parameter 300: taken(U):? pattern_size=250, expected_index=20. Not 
taken(U):? pattern_size=290, expected_index=8
 ???????????????????????????????????????????????????? // IndexOfTest 
with parameter 300: taken(L):? pattern_size=250, expected_index=30. Not 
taken(L):? pattern_size=280, expected_index=16
line 4429: __ br(__ LE, NOMATCH);??????????????????? // IndexOfTest with 
parameter 300: taken(U):? pattern_size=293, expected_index=-1. Not 
taken(U):? pattern_size=290, expected_index=-1
 ???????????????????????????????????????????????????? // IndexOfTest 
with parameter 300: taken(L):? pattern_size=285, expected_index=-1. Not 
taken(L):? pattern_size=280, expected_index=-1
line 4455: __ br(__ EQ, NOMATCH);??????????????????? // IndexOfTest with 
parameter 300: taken(UL): pattern_size=298, expected_index=-1. Not 
taken(UL): pattern_size=298, expected_index=0
line 4459: __ br(__ LE, L_SMALL_CMP_LOOP_LAST_CMP2); // this branch is 
not reached with current performance heuristic for algorithm selection 
(see MacroAssembler_aarch64.cpp:4599). It was also tested with heuristic 
disabled to keep algorithm generic and allow changes to heuristics
line 4478: __ br(__ NE, L_SMALL_CMP_LOOP_NOMATCH);?? // IndexOfSameTest 
with parameter 300: taken(UL): pattern_size=298, expected_index=-1. Not 
taken(UL): pattern_size=298,expected_index=0
line 4486: __ br(__ GE, L_SMALL_CMP_LOOP_LAST_CMP);? // IndexOfTest with 
parameter 300: taken(UL): pattern_size=298, expected_index=0. Not 
taken(UL): pattern_size=298, expected_index=-1
line 4488: __ br(__ EQ, L_SMALL_CMP_LOOP);?????????? // IndexOfTest with 
parameter 300: taken(UL): pattern_size=298, expected_index=0. Not 
taken(UL): pattern_size=298, expected_index=-1
line 4490: __ cbz(tmp2, NOMATCH);??????????????????? // IndexOfSameTest 
with parameter 300: taken UL: pattern_size=298, expected_index=-1. Not 
taken(UL): pattern_size=298, expected_index=0
line 4498: __ br(__ NE, L_SMALL_CMP_LOOP_NOMATCH);?? // IndexOfTest with 
parameter 300: taken(UL): pattern_size=298, expected_index=-1. Not 
taken(UL): pattern_size=298, expected_index=0
line 4519: __ br(__ NE, L_SMALL_CMP_LOOP_NOMATCH);?? // this branch is 
not reached with current performance heuristic for algorithm selection 
(see MacroAssembler_aarch64.cpp:4599). It was also tested with heuristic 
disabled to keep algorithm generic and allow changes to heuristics
line 4533: __ br(__ GE, L_CMP_LOOP_LAST_CMP2);?????? // this branch is 
not taken with current performance heuristic for algorithm selection 
(see MacroAssembler_aarch64.cpp:4599). It was also tested with heuristic 
disabled to keep algorithm generic and allow changes to heuristics
line 4557: __ br(__ NE, L_CMP_LOOP_NOMATCH);???????? // IndexOfTest with 
parameter 300: taken(UL): pattern_size=250, expected_index=1. Not 
taken(UL): pattern_size=250, expected_index=0
line 4565: __ br(__ GE, L_CMP_LOOP_LAST_CMP);??????? // IndexOfTest with 
parameter 300: taken(UL): pattern_size=250, expected_index=0. Not 
taken(UL): pattern_size=250, expected_index=-1
line 4567: __ br(__ EQ, L_CMP_LOOP);???????????????? // IndexOfTest with 
parameter 300: taken(UL): pattern_size=250, expected_index=0. Not 
taken(UL): pattern_size=250, expected_index=-1
line 4570: __ cbz(tmp2, L_HAS_ZERO_LOOP_NOMATCH);??? // IndexOfSameTest 
with parameter 300: taken(UL): pattern_size=250, expected_index=20. Not 
taken(UL): pattern_size=250, expected_index=0
line 4577: __ br(__ NE, L_CMP_LOOP_NOMATCH);???????? // IndexOfSameTest 
with parameter 300: taken(UL): pattern_size=250, expected_index=-1. 
IndexOfTest with parameter 300: Not taken(UL): pattern_size=250, 
expected_index=0
line 4601: __ br(__ NE, L_CMP_LOOP_NOMATCH);???????? // this branch is 
not reached with current performance heuristic for algorithm selection 
(see MacroAssembler_aarch64.cpp:4599). It was also tested with heuristic 
disabled to keep algorithm generic and allow changes to heuristics

source_size = 0 is covered by a pre-condition and the intrinsic is not 
called.

I referenced this test in initial review request for this intrinsic. It 
takes a long time to run, so I did not include it in the webrev. I'm 
going to update the webrev to include a subset of this test as jtreg.

Even brute force tests with 100% code coverage don't guarantee 100% 
correctness. The search-garbage-after-string test case for "algorithm G" 
and StringBuilder::setLength usage is a good catch by Stefan and 
Pengfei. And recent webrev addresses this case. I also tested a case 
symmetric to Pengfei's case checking that no "garbage" is read before 
specified source string [4]. I also am going to include it in the webrev.

Indeed it is hard to review complex algorithms. The Boyer-Moore comments 
you referenced were updated as part of the original webrev to describe 
changes in algorithm E, which is in macroAssembler_aarch64.cpp. I once 
asked to validate the level of comments with you during pow function 
review [3]. If this is the level of comments you find reasonable, I?ll 
be happy to improve it here and elsewhere to this level.

Once again, this is to address your question around testing for this 
intrinsic and patch. We are working on testing and review complex 
intrinsics to handle the wider problem of ensuring better quality of 
AArch64 intrinsics. We?ll follow up in a different email on that.

-Dmitrij

[1] all JCK, hotspot jtreg and jdk tier1-tier3 jtreg tests, including 
http://hg.openjdk.java.net/jdk/jdk/file/tip/test/hotspot/jtreg/compiler/intrinsics 
and http://hg.openjdk.java.net/jdk/jdk/file/tip/test/jdk/java/lang/String/
[2] http://cr.openjdk.java.net/~dpochepk/8189103/IndexOfTest.java, 
http://cr.openjdk.java.net/~dpochepk/8189103/IndexOfSameTest.java
[3] 
https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2018-October/031092.html
[4] http://cr.openjdk.java.net/~dpochepk/8215792/IndexOfBeforeTest.java


On 09/01/2019 6:55 PM, Andrew Dinn wrote:
> Hello Dmitrij,
>
> On 09/01/2019 14:50, Dmitrij Pochepko wrote:
>> here is my version of this patch consisting of single "sub" instruction
>> (I haven't changed test):
>> http://cr.openjdk.java.net/~dpochepk/8215792/webrev.01/
>>
>> cnt2 is a counter for characters yet to be checked. So, instead of
>> checking all characters in source string for first character match
>> (which was initial reason for this bug), now it check (str2len - str1len
>> + 1).
> That looks like a simpler fix than Pengfei's although I think his is
> also correct. However, when I say 'correct' note that I can only make
> that judgement relative to this current bug. I have no confidence that
> there are no other bugs in your code.
>
>> Actually I think this "sub" instruction was initially lost while working
>> on this instrinsic and moving this instruction between this block
>> (generate_string_indexof_linear) and caller code. Regular tests couldn't
>> catch this problem.
> That's a somewhat contentious and, I would suggest, dubious statement.
> If you design code based on some algorithm -- especially a complex one
> like the one employed here -- then you need to put at least as much work
> into designing tests that check for problems in the encoding of that
> algorithm as you put into the code. 'Regular' is rather a weasel word to
> use at this point when it is clear that the test provision was not adequate.
>
> Having looked at your code I am at a loss to see how it is accurately
> described by the piece of C code -- i.e. the original Boyer-Moore
> algorithm -- that sits in macroAssembler_aarch64.cpp and purports to
> explain it. As happened with the trig/log code, your code actually
> follows an algorithm that is significantly more complex that that C
> original. Also, once again, it employs various coding tricks that are
> not explained at all. The latter can be understood with study but proper
> commenting would make maintenance and bug-fixing much easier and
> quicker. This is exactly the same problem and just as major a problem as
> it was with the trig/log code for *all* the same reasons.
>
>> I run some testing to ensure regular usecases are not affected and it
>> seems fine. Affected testcase and your test pass as well.
> 'some testing'? I'd really like to have full details of those tests.
> Ideally, they should be comprehensive. That really means they should
> come with a test plan that identifies all the different possible paths
> through the code and provides a measure of the coverage the tests
> actually provide that is high enough to instil some confidence in the
> testing. There are indeed quite a few such paths (not just in the stubs
> but also the intrinsics that cover the small cases) so I would expect
> the test plan and test suite to be fairly large. Do you have such a test
> plan and suite?
>
> Given your previous lack of success at testing your own code I'm not at
> all happy to accept your say so that 'oh, the code is fine'. I'm
> currently more inclined to ask you to revert your first patch and go
> back to the original Boyer-Moore code we had before you injected this
> bug (and who knows what others?).
>
>> btw: now this code is even faster, because less characters will be
>> loaded and checked
> Well, of course, you could make it even faster by deleting half the
> code. If you don't place too much priority on correctness you can
> achieve incredible performance.
>
> Unfortunately, speed has to be secondary to correctness. So, you need to
> stop concentrating on shaving cycles and concentrate on writing
> readable, maintainable code that clearly implements a well-defined
> algorithm. Can you provide any credible assurance that this code is
> worth keeping? If not then I'd personally recommend reversion of all
> your changes. Of course, I'll see what Andrew Haley has to say before
> pressing for that action.
>
> regards,
>
>
> Andrew Dinn
> -----------
> Senior Principal Software Engineer
> Red Hat UK Ltd
> Registered in England and Wales under Company Registration No. 03798903
> Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From doug.simon at oracle.com  Thu Jan 10 15:09:36 2019
From: doug.simon at oracle.com (Doug Simon)
Date: Thu, 10 Jan 2019 16:09:36 +0100
Subject: [12] RFR(S): 8215313: [AOT] java/lang/String/Split.java fails with
 AOTed java.base
Message-ID: <AC23DDA9-CFB7-414D-9386-82829B83B3DC@oracle.com>

Please review this fix supplied by Josef Haider for an incorrect compilation of String.split.

When the String.indexOf intrinsic on AMD64 reaches the end of a string, it tries to vectorize the last compare operations by reading past the bounds of the character/byte array. This is not safe if the out-of-bounds read would cross a page boundary, so in that case characters are compared one-by-one. This is done with a `cmpl`-instruction, which only works as long as the bytes/chars are not sign extended.

The fix is to simply `and` the characters we are searching for with `0xff`/`0xffff` in order to eliminate any erroneous sign extensions.

http://cr.openjdk.java.net/~dnsimon/8215313
https://bugs.openjdk.java.net/browse/JDK-8215313

-Doug

From dean.long at oracle.com  Thu Jan 10 18:04:12 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Thu, 10 Jan 2019 10:04:12 -0800
Subject: [12] RFR(S): 8215313: [AOT] java/lang/String/Split.java fails
 with AOTed java.base
In-Reply-To: <AC23DDA9-CFB7-414D-9386-82829B83B3DC@oracle.com>
References: <AC23DDA9-CFB7-414D-9386-82829B83B3DC@oracle.com>
Message-ID: <aa338452-5cc0-bfe1-8137-ce7788d4989f@oracle.com>

Is it OK to modify the values of searchValue[i]?? If the search value is 
already sign-extended, how about sign-extending cmpResult instead of 
zero-extending searchValue?

dl

On 1/10/19 7:09 AM, Doug Simon wrote:
> Please review this fix supplied by Josef Haider for an incorrect compilation of String.split.
>
> When the String.indexOf intrinsic on AMD64 reaches the end of a string, it tries to vectorize the last compare operations by reading past the bounds of the character/byte array. This is not safe if the out-of-bounds read would cross a page boundary, so in that case characters are compared one-by-one. This is done with a `cmpl`-instruction, which only works as long as the bytes/chars are not sign extended.
>
> The fix is to simply `and` the characters we are searching for with `0xff`/`0xffff` in order to eliminate any erroneous sign extensions.
>
> http://cr.openjdk.java.net/~dnsimon/8215313
> https://bugs.openjdk.java.net/browse/JDK-8215313
>
> -Doug


From dean.long at oracle.com  Thu Jan 10 18:53:51 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Thu, 10 Jan 2019 10:53:51 -0800
Subject: [12] RFR(S): 8215313: [AOT] java/lang/String/Split.java fails
 with AOTed java.base
In-Reply-To: <aa338452-5cc0-bfe1-8137-ce7788d4989f@oracle.com>
References: <AC23DDA9-CFB7-414D-9386-82829B83B3DC@oracle.com>
 <aa338452-5cc0-bfe1-8137-ce7788d4989f@oracle.com>
Message-ID: <e8a7a82c-46eb-a560-608d-561af23675b7@oracle.com>

Taking another look, it seems like cmpl could be replaced with the 
size-appropriate cmpb, cmpw, or cmpl based on byteMode(kind) and 
findTwoCharPrefix, but I guess AMD64Assembler only supports cmpl and 
cmpq right now.

dl

On 1/10/19 10:04 AM, dean.long at oracle.com wrote:
> Is it OK to modify the values of searchValue[i]?? If the search value 
> is already sign-extended, how about sign-extending cmpResult instead 
> of zero-extending searchValue?
>
> dl
>
> On 1/10/19 7:09 AM, Doug Simon wrote:
>> Please review this fix supplied by Josef Haider for an incorrect 
>> compilation of String.split.
>>
>> When the String.indexOf intrinsic on AMD64 reaches the end of a 
>> string, it tries to vectorize the last compare operations by reading 
>> past the bounds of the character/byte array. This is not safe if the 
>> out-of-bounds read would cross a page boundary, so in that case 
>> characters are compared one-by-one. This is done with a 
>> `cmpl`-instruction, which only works as long as the bytes/chars are 
>> not sign extended.
>>
>> The fix is to simply `and` the characters we are searching for with 
>> `0xff`/`0xffff` in order to eliminate any erroneous sign extensions.
>>
>> http://cr.openjdk.java.net/~dnsimon/8215313
>> https://bugs.openjdk.java.net/browse/JDK-8215313
>>
>> -Doug
>


From josef.haider at khg.jku.at  Thu Jan 10 21:52:53 2019
From: josef.haider at khg.jku.at (Josef Haider)
Date: Thu, 10 Jan 2019 22:52:53 +0100
Subject: [12] RFR(S): 8215313: [AOT] java/lang/String/Split.java fails
 with AOTed java.base
Message-ID: <dab7abd3-1abd-1652-a328-6a81960f934e@khg.jku.at>

Agreed, cmpw/cmpb would make more sense here, i just wanted
to keep the changeset minimal, since the entire method may soon be
changed again, anyway.

- Josef


> Taking another look, it seems like cmpl could be replaced with the 
> size-appropriate cmpb, cmpw, or cmpl based on byteMode(kind) and 
> findTwoCharPrefix, but I guess AMD64Assembler only supports cmpl and 
> cmpq right now.
>
> dl
>
> On 1/10/19 10:04 AM, dean.long at oracle.com <https://mail.openjdk.java.net/mailman/listinfo/hotspot-compiler-dev> wrote:
> >/Is it OK to modify the values of searchValue[i]?? If the search value />/is already sign-extended, how about sign-extending cmpResult instead />/of zero-extending searchValue? />//>/dl />//>/On 1/10/19 7:09 AM, Doug Simon wrote: />>/Please review this fix supplied by Josef Haider for an incorrect />>/compilation of String.split. />>//>>/When the String.indexOf intrinsic on AMD64 reaches the end of a />>/string, it tries to vectorize the last compare operations by reading />>/past the bounds of the character/byte array. This is not safe if the />>/out-of-bounds read would cross a page boundary, so in that case />>/characters are compared one-by-one. This is done with a />>/`cmpl`-instruction, which only works as long as the bytes/chars are />>/not sign extended. />>//>>/The fix is to simply `and` the characters we are searching for with />>/`0xff`/`0xffff` in order to eliminate any erroneous sign extensions. />>//>>/http://cr.openjdk.java.net/~dnsimon/8215313 />>/https://bugs.openjdk.java.net/browse/JDK-8215313 />>//>>/-Doug />//
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190110/0f5d683b/attachment.html>

From ekaterina.pavlova at oracle.com  Thu Jan 10 22:03:30 2019
From: ekaterina.pavlova at oracle.com (Ekaterina Pavlova)
Date: Thu, 10 Jan 2019 14:03:30 -0800
Subject: [13] RFR(T): 8216480: Typo in
 test/hotspot/jtreg/compiler/graalunit/README.md
In-Reply-To: <d888f754-0b04-4b6e-dba6-6c5adea59c85@oracle.com>
References: <d888f754-0b04-4b6e-dba6-6c5adea59c85@oracle.com>
Message-ID: <2ebd8dec-7f1f-7a13-72af-697e41bb63f7@oracle.com>

good, thanks for fixing this.

-katya

On 1/10/19 3:47 AM, Tobias Hartmann wrote:
> Hi,
> 
> please review the following trivial patch that fixes a typo in
> https://bugs.openjdk.java.net/browse/JDK-8216480
> http://cr.openjdk.java.net/~thartmann/8216480/webrev.00/
> 
> Thanks,
> Tobias
> 


From vladimir.x.ivanov at oracle.com  Fri Jan 11 02:01:59 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Thu, 10 Jan 2019 18:01:59 -0800
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes wrong
 post-dominating point
Message-ID: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>

http://cr.openjdk.java.net/~vlivanov/8215757/webrev.00/
https://bugs.openjdk.java.net/browse/JDK-8215757

Crash happens during SplitIf transformation when 
PhaseIdealLoop::spinup() erroneously uses the Region being eliminated as 
a post-dominating merge point (prior_n).

Sequence of events during PhaseIdealLoop pass which leads to the crash 
(IR in question [1]):

   #0: RegionNode 1722 (R1722) starts with IDOM(R1722) = IfNode 1511 (I1511)


   #1: Loop strip mining takes place and inserts new loop limit check:

     Loop: N1572/N1601  limit_check sfpts={ 1595 }
     ...
     Counted    Loop: N1866/N1601  counted [2,int),+1 (-1 iters)
     ...
     Loop: N1865/N1864  limit_check
       Loop: N1866/N1601  limit_check counted [2,int),+1 (-1 iters) 
has_sfpt strip_mined


   #2: As part of loop limit check insertion, new IfNode is created (If 
1854) and linked to R1722 as an input which causes R1722 IDOM to be 
updated [2]. It changes R1722 IDOM (I1511 => R1784), since dom_lca() 
normalizes the result using find_non_split_ctrl().


   #3: SplitIf is performed on I1511 and Phi 1790 is being processed. It 
has 3 users (197, 198, 199) which are attached to R1710, R1716, and 
R1722 respectively. At this point:

        IDOM(R1710) = I1511
        IDOM(R1716) = I1511
        IDOM(R1722) = R1784 <==


   #4: PhaseIdealLoop::handle_use() works fine for 197 & 198:

         197 =idom=> R1710 =idom=> I796 ( == iff_dom)
         198 =idom=> R1716 =idom=> I796

       but fails on 199 when it tries to process R1784 (being 
eliminated) in nested PhaseIdealLoop::spinup() call:

         199 =idom=> R1722 =idom=> R1784 =idom=> I796


The root cause is that while PhaseIdealLoop::do_split_if() updates IDOM 
for If & its projections, it doesn't do that for the corresponding 
Region (R1784) until the splitting is finished [3].

Proposed fix is to take into account delayed IDOM update (region -> 
region_dom) and explicitly check for old Region in 
PhaseIdealLoop::spinup() treating it as iff_dom.

Testing: failing test (replay), hs-precheckin-comp, hs-tier1, hs-tier2 
(in progress)

Thanks!

Best regards,
Vladimir Ivanov

[1] http://cr.openjdk.java.net/~vlivanov/8215757/split_if_1511.png

[2] 
http://hg.openjdk.java.net/jdk/jdk/file/2e1fd6414c4b/src/hotspot/share/opto/loopPredicate.cpp#l161

[3] 
http://hg.openjdk.java.net/jdk/jdk/file/2e1fd6414c4b/src/hotspot/share/opto/split_if.cpp#l525

   void PhaseIdealLoop::do_split_if( Node *iff ) {
     ...
     // Lazy replace IDOM info with the region's dominator
     lazy_replace( iff, region_dom );
     ...
     // Now make the original merge point go dead, by handling all its uses.
     ...
     lazy_replace( region, region_dom );
   }

From Nick.Gasson at arm.com  Fri Jan 11 02:36:47 2019
From: Nick.Gasson at arm.com (Nick Gasson (Arm Technology China))
Date: Fri, 11 Jan 2019 02:36:47 +0000
Subject: [aarch64-port-dev ] RFR: 8216350: AArch64: monitor unlock fast
 path not called
In-Reply-To: <51023960-0e8f-56aa-20a4-279017251585@redhat.com>
References: <MN2PR18MB2733F0D50D0E61B4EFE1D178D28B0@MN2PR18MB2733.namprd18.prod.outlook.com>
 <680089e7-ec26-a4cd-6143-4d36182e971a@arm.com>
 <51023960-0e8f-56aa-20a4-279017251585@redhat.com>
Message-ID: <a0b8de53-c4ca-98da-7d82-bea0e563fb31@arm.com>

Hi all,

On 09/01/2019 17:23, Andrew Haley wrote:
> 
> HotSpot policy is that we can do minor cleanups as we go along:
> experience has shown that unless you do so, cruft tends to
> accumulate. These cleanups are OK for this patch.
> 

Please see the updated webrev here:

http://cr.openjdk.java.net/~ngasson/8216350/webrev.1/

Includes cleanups according to Derek's comments and updated the 
copyright year (thanks Felix).

> 4)  Slightly better comment for last instruction of fast_unlock (and explicitly use zr).
>     __ stlr(zr, tmp); // set unowned

Note I needed to change the definition of load_store_exclusive to allow 
ZR here. I've checked that this is OK for the other instructions that 
use this.

Thanks,
Nick

From vivek.r.deshpande at intel.com  Fri Jan 11 06:58:05 2019
From: vivek.r.deshpande at intel.com (Deshpande, Vivek R)
Date: Fri, 11 Jan 2019 06:58:05 +0000
Subject: RFR(S):8216050:X86: Fix for Superword optimization fails with
 assert(0 <= i && i < _len) failed: illegal index
Message-ID: <53E8E64DB2403849AFD89B7D4DAC8B2A9A148B9C@ORSMSX106.amr.corp.intel.com>

Hi Tobias

I have webrev for the fixes for the problems with  the VNNI optimization.

This has 3 fixes.
1) Fix for the crash by matching the operand by swapping to right positions.
2) Cost based generation of vpdpwssd instruction.
3) Fix generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have different control RangeCheck nodes
    for a[i] and a[i+1] accesses in same MulAddS2I node
Bug ID: https://bugs.openjdk.java.net/browse/JDK-8216050
Webrev: http://cr.openjdk.java.net/~vdeshpande/8216050/webrev.00/

Could you please take a look and review it.

Regards,
Vivek
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190111/94b568a8/attachment.html>

From tobias.hartmann at oracle.com  Fri Jan 11 08:22:44 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Fri, 11 Jan 2019 09:22:44 +0100
Subject: [13] RFR(T): 8216480: Typo in
 test/hotspot/jtreg/compiler/graalunit/README.md
In-Reply-To: <2ebd8dec-7f1f-7a13-72af-697e41bb63f7@oracle.com>
References: <d888f754-0b04-4b6e-dba6-6c5adea59c85@oracle.com>
 <2ebd8dec-7f1f-7a13-72af-697e41bb63f7@oracle.com>
Message-ID: <46ebd33e-8e18-4b35-f1c7-fd8eac5d87e2@oracle.com>

Thanks Katya.

Best regards,
Tobias

On 10.01.19 23:03, Ekaterina Pavlova wrote:
> good, thanks for fixing this.
> 
> -katya
> 
> On 1/10/19 3:47 AM, Tobias Hartmann wrote:
>> Hi,
>>
>> please review the following trivial patch that fixes a typo in
>> https://bugs.openjdk.java.net/browse/JDK-8216480
>> http://cr.openjdk.java.net/~thartmann/8216480/webrev.00/
>>
>> Thanks,
>> Tobias
>>
> 

From doug.simon at oracle.com  Fri Jan 11 09:02:24 2019
From: doug.simon at oracle.com (Doug Simon)
Date: Fri, 11 Jan 2019 10:02:24 +0100
Subject: [12] RFR(S): 8215313: [AOT] java/lang/String/Split.java fails
 with AOTed java.base
In-Reply-To: <dab7abd3-1abd-1652-a328-6a81960f934e@khg.jku.at>
References: <dab7abd3-1abd-1652-a328-6a81960f934e@khg.jku.at>
Message-ID: <88068DB1-E235-448C-A7D8-B1208143CCCF@oracle.com>

Hi Josef,

> On 10 Jan 2019, at 22:52, Josef Haider <josef.haider at khg.jku.at> wrote:
> 
> Agreed, cmpw/cmpb would make more sense here, i just wanted
> to keep the changeset minimal, since the entire method may soon be
> changed again, anyway. 
> 
Can you please say more about this? Would you recommend applying your current patch as is to fix the crash or will you have the changes you mention ready soon?

-Doug
>> Taking another look, it seems like cmpl could be replaced with the 
>> size-appropriate cmpb, cmpw, or cmpl based on byteMode(kind) and 
>> findTwoCharPrefix, but I guess AMD64Assembler only supports cmpl and 
>> cmpq right now.
>> 
>> dl
>> 
>> On 1/10/19 10:04 AM, dean.long at oracle.com <https://mail.openjdk.java.net/mailman/listinfo/hotspot-compiler-dev> wrote:
>> > Is it OK to modify the values of searchValue[i]?  If the search value 
>> > is already sign-extended, how about sign-extending cmpResult instead 
>> > of zero-extending searchValue?
>> >
>> > dl
>> >
>> > On 1/10/19 7:09 AM, Doug Simon wrote:
>> >> Please review this fix supplied by Josef Haider for an incorrect 
>> >> compilation of String.split.
>> >>
>> >> When the String.indexOf intrinsic on AMD64 reaches the end of a 
>> >> string, it tries to vectorize the last compare operations by reading 
>> >> past the bounds of the character/byte array. This is not safe if the 
>> >> out-of-bounds read would cross a page boundary, so in that case 
>> >> characters are compared one-by-one. This is done with a 
>> >> `cmpl`-instruction, which only works as long as the bytes/chars are 
>> >> not sign extended.
>> >>
>> >> The fix is to simply `and` the characters we are searching for with 
>> >> `0xff`/`0xffff` in order to eliminate any erroneous sign extensions.
>> >>
>> >> http://cr.openjdk.java.net/~dnsimon/8215313 <http://cr.openjdk.java.net/~dnsimon/8215313>
>> >> https://bugs.openjdk.java.net/browse/JDK-8215313 <https://bugs.openjdk.java.net/browse/JDK-8215313>
>> >>
>> >> -Doug
>> >
>> 
> 
> 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190111/bb5f1867/attachment.html>

From rwestrel at redhat.com  Fri Jan 11 09:16:53 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Fri, 11 Jan 2019 10:16:53 +0100
Subject: RFR(XS): 8216549: Mismatched unsafe access to non escaping object
 fails
Message-ID: <877efbzh8a.fsf@redhat.com>


http://cr.openjdk.java.net/~roland/8216549/webrev.00/

test1(), test2() and test3() perform an unsafe access with a mismatched
access.

test1() compiles to an unschedulable graph and causes the compiler to
crash. The memory input of the load from a non escaping allocation
initially points to a membar but is set to bypass the membar while
control stays set to the membar. The load is not eliminated because it's
a mismatched memory access, an anti dependence is added between the
membar and the load and the graph is unschedulable.

test2() and test3() return wrong results: the access is mismatched and
misaligned, it's given its own alias by c2 but the MergeMem right after
the allocation only points to the allocation for actual fields of the
newly allocated object. So the load memory input is set to the memory
state on method entry and the load is optimized as zero.

I simply propose to make non escaping allocations with mismatched
accesses to be non scalar replaceable.

Roland.

From tobias.hartmann at oracle.com  Fri Jan 11 09:31:41 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Fri, 11 Jan 2019 10:31:41 +0100
Subject: [13] RFR(S): 8213249: compiler/graalunit/HotspotTest.java failed in
 ExplicitExceptionTest
Message-ID: <a71b4e27-0b3d-7f85-6f55-448d7ea2790b@oracle.com>

Hi,

please review the following patch:
https://bugs.openjdk.java.net/browse/JDK-8213249
http://cr.openjdk.java.net/~thartmann/8213249/webrev.01/

The problem is C2's -XX:OmitStackTraceInFastThrow which is enabled by default. Instead of
deoptimizing at frequent throws (such as the ArrayIndexOutOfBoundsException in this case), C2 emits
code to throw a pre-allocated exception object (see code in GraphKit::builtin_throw()) which has a
null message.

Thanks,
Tobias

From rwestrel at redhat.com  Fri Jan 11 09:53:41 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Fri, 11 Jan 2019 10:53:41 +0100
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes
 wrong post-dominating point
In-Reply-To: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
References: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
Message-ID: <874lafzfiy.fsf@redhat.com>


Hi Vladimir,

>    #2: As part of loop limit check insertion, new IfNode is created (If 
> 1854) and linked to R1722 as an input which causes R1722 IDOM to be 
> updated [2]. It changes R1722 IDOM (I1511 => R1784), since dom_lca() 
> normalizes the result using find_non_split_ctrl().

Isn't that the root cause: the idom of R1722 is still I1511 and not
R1784?

Roland.

From erik.osterlund at oracle.com  Fri Jan 11 09:53:23 2019
From: erik.osterlund at oracle.com (=?UTF-8?Q?Erik_=c3=96sterlund?=)
Date: Fri, 11 Jan 2019 10:53:23 +0100
Subject: RFR: 8216427: ciMethodData::load_extra_data() does not always unpack
 the last entry
Message-ID: <983e6611-417e-9c7a-0643-389fffa0e2cf@oracle.com>

Hi,

When unpacking the extra data section of the MDOs, the source and 
destination might not have the same number of entries, because there can 
be safepoints between cloning the extra data section of the MDO and 
unpacking the source entries to the destination entries.

Therefore the unpacking loop loops through all the source entries and 
copies them to the destination. Except the last 
DataLayout::arg_info_data_tag entry, that never gets copied form the 
source to the destination. Therefore, if a safepoint occurred between 
cloning the extra data section and unpacking its entries in 
ciMethodData::load_extra_data(), the last entry could contain random 
bogus memory.

It seems like the reason the last entry is not copied is because the 
copying of an entry requires a length which is currently calculated by 
taking the difference between the current entry and the next entry in 
the loop. But as there is no notion of a next entry when you are at the 
last DataLayout::arg_info_data_tag entry (because it is always the last 
one when present), so you can't do that. Therefore, the solution of 
choice seems to have been simply not copying the last 
DataLayout::arg_info_data_tag entry, instead of calculating what the 
length of that entry would be.

This patch appropriately calculates the length of the entries instead 
(which is also defined for DataLayout::arg_info_data_tag) in the copying 
loop, allowing the last DataLayout::arg_info_data_tag entry to be copied 
as well.

Webrev:
http://cr.openjdk.java.net/~eosterlund/8216427/webrev.00/

Bug:
https://bugs.openjdk.java.net/browse/JDK-8216427

Tested through hs-tier1-3.

Thanks,
/Erik

From tobias.hartmann at oracle.com  Fri Jan 11 12:48:32 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Fri, 11 Jan 2019 13:48:32 +0100
Subject: RFR(S):8216050:X86: Fix for Superword optimization fails with
 assert(0 <= i && i < _len) failed: illegal index
In-Reply-To: <53E8E64DB2403849AFD89B7D4DAC8B2A9A148B9C@ORSMSX106.amr.corp.intel.com>
References: <53E8E64DB2403849AFD89B7D4DAC8B2A9A148B9C@ORSMSX106.amr.corp.intel.com>
Message-ID: <045def80-a536-ae87-7384-09b30e5a8d78@oracle.com>

Hi Vivek,

On 11.01.19 07:58, Deshpande, Vivek R wrote:
> 1) Fix for the crash by matching the operand by swapping to right positions. 

Looks good but the change to loopopts.cpp:530 screwed up the indentation around the ifs, please fix.

> 2) Cost based generation of vpdpwssd instruction. 

Other instructions added by JDK-8214751 still miss a cost definition, for example:
http://hg.openjdk.java.net/jdk/jdk/rev/4bb6e0871bf7#l5.20

> 3) Fix generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have
> different control RangeCheck nodes?
> ????for a[i] and a[i+1] accesses in same MulAddS2I node

This is unrelated to the original bug, right? If so, this should be integrated with a separate RFE.

Thanks,
Tobias

From martin.doerr at sap.com  Fri Jan 11 12:55:22 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Fri, 11 Jan 2019 12:55:22 +0000
Subject: RFR(S): 8216556: Unnecessary liveness computation with JVMTI
Message-ID: <88842ba1a169406d9628ab06665bd787@sap.com>

Hi,

I'd like to contribute a small JIT improvement for JVMTI to avoid calling raw_liveness_at_bci when its result is not needed.

Bug with description:
https://bugs.openjdk.java.net/browse/JDK-8216556

Webrev:
http://cr.openjdk.java.net/~mdoerr/8216556_JVMTI_liveness/webrev.00/

Please review.

Best regards,
Martin

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190111/26f9464c/attachment.html>

From rwestrel at redhat.com  Fri Jan 11 13:35:56 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Fri, 11 Jan 2019 14:35:56 +0100
Subject: RFR(T): 8216482: Shenandoah: typo in
 ShenandoahBarrierSetC2::clone_barrier_at_expansion() causes failed
 compilations
In-Reply-To: <87d0p4zkgu.fsf@redhat.com>
References: <87o98ozn1g.fsf@redhat.com>
 <9027c1f5-f562-5abf-6495-645284122da6@oracle.com> <87lg3szlxb.fsf@redhat.com>
 <51c87854-36c0-903e-c647-0c612ac42c5d@oracle.com> <87ftu0zkx2.fsf@redhat.com>
 <76c0f61e-9c2f-8d76-2beb-52ba927ed14c@oracle.com> <87d0p4zkgu.fsf@redhat.com>
Message-ID: <87tvifxqo3.fsf@redhat.com>


FTR, I pushed this one by mistake to jdk/jdk instead of jdk12. I read
in:

https://mail.openjdk.java.net/pipermail/hotspot-gc-dev/2019-January/024470.html

that it's ok to push a change to 12 after jdk/jdk so I will do that.

Roland.

From rwestrel at redhat.com  Fri Jan 11 13:51:29 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Fri, 11 Jan 2019 14:51:29 +0100
Subject: RFR(XS): 8216549: Mismatched unsafe access to non escaping object
 fails
In-Reply-To: <877efbzh8a.fsf@redhat.com>
References: <877efbzh8a.fsf@redhat.com>
Message-ID: <87r2djxpy6.fsf@redhat.com>


Also: I targeted this to 13 but I don't really have a strong opinion
whether it should go in 12 or 13.

Roland.

From vladimir.x.ivanov at oracle.com  Fri Jan 11 18:23:40 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Fri, 11 Jan 2019 10:23:40 -0800
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes
 wrong post-dominating point
In-Reply-To: <874lafzfiy.fsf@redhat.com>
References: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
 <874lafzfiy.fsf@redhat.com>
Message-ID: <09abefa3-37b4-5657-22f9-e06f144a1867@oracle.com>


>>     #2: As part of loop limit check insertion, new IfNode is created (If
>> 1854) and linked to R1722 as an input which causes R1722 IDOM to be
>> updated [2]. It changes R1722 IDOM (I1511 => R1784), since dom_lca()
>> normalizes the result using find_non_split_ctrl().
> 
> Isn't that the root cause: the idom of R1722 is still I1511 and not
> R1784?

If it were the case, then PhaseIdealLoop::handle_use()/spinup() would 
reliably crash on all users of Phi 1790. There are 2 other Regions 
(R1710 and R1716) which keep their IDOM (I1511) intact and the 
transformation works fine for them.

R1722 is changed during strip mining transformation and its IDOM is 
recomputed (I1511 => R1784). Then PhaseIdealLoop::handle_use()/spinup() 
crashes trying to process R1722 user (CallStaticJava 199) and the 
problem is caused by IDOM(R1722) which is still R1784 and not I798 
(region_dom/iff_dom) as for the other Regions (that's the effect of 
"lazy_replace(iff, region_dom)" in PhaseIdealLoop::do_split_if()).

Best regards,
Vladimir Ivanov

From ekaterina.pavlova at oracle.com  Fri Jan 11 18:42:43 2019
From: ekaterina.pavlova at oracle.com (Ekaterina Pavlova)
Date: Fri, 11 Jan 2019 10:42:43 -0800
Subject: [13] RFR(S): 8213249: compiler/graalunit/HotspotTest.java failed
 in ExplicitExceptionTest
In-Reply-To: <a71b4e27-0b3d-7f85-6f55-448d7ea2790b@oracle.com>
References: <a71b4e27-0b3d-7f85-6f55-448d7ea2790b@oracle.com>
Message-ID: <d39c334e-0f6e-c9c8-a851-c045cc84c624@oracle.com>

The changes look good.

thanks.
-katya

On 1/11/19 1:31 AM, Tobias Hartmann wrote:
> Hi,
> 
> please review the following patch:
> https://bugs.openjdk.java.net/browse/JDK-8213249
> http://cr.openjdk.java.net/~thartmann/8213249/webrev.01/
> 
> The problem is C2's -XX:OmitStackTraceInFastThrow which is enabled by default. Instead of
> deoptimizing at frequent throws (such as the ArrayIndexOutOfBoundsException in this case), C2 emits
> code to throw a pre-allocated exception object (see code in GraphKit::builtin_throw()) which has a
> null message.
> 
> Thanks,
> Tobias
> 


From igor.ignatyev at oracle.com  Fri Jan 11 18:47:14 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Fri, 11 Jan 2019 10:47:14 -0800
Subject: [13] RFR(S): 8213249: compiler/graalunit/HotspotTest.java failed
 in ExplicitExceptionTest
In-Reply-To: <d39c334e-0f6e-c9c8-a851-c045cc84c624@oracle.com>
References: <a71b4e27-0b3d-7f85-6f55-448d7ea2790b@oracle.com>
 <d39c334e-0f6e-c9c8-a851-c045cc84c624@oracle.com>
Message-ID: <0BE98960-D1DE-4DF9-A15A-BB19C47BA28D@oracle.com>

Hi Tobias,

the fix looks good to me.

Thanks,
-- Igor

> On Jan 11, 2019, at 10:42 AM, Ekaterina Pavlova <ekaterina.pavlova at oracle.com> wrote:
> 
> The changes look good.
> 
> thanks.
> -katya
> 
> On 1/11/19 1:31 AM, Tobias Hartmann wrote:
>> Hi,
>> please review the following patch:
>> https://bugs.openjdk.java.net/browse/JDK-8213249
>> http://cr.openjdk.java.net/~thartmann/8213249/webrev.01/
>> The problem is C2's -XX:OmitStackTraceInFastThrow which is enabled by default. Instead of
>> deoptimizing at frequent throws (such as the ArrayIndexOutOfBoundsException in this case), C2 emits
>> code to throw a pre-allocated exception object (see code in GraphKit::builtin_throw()) which has a
>> null message.
>> Thanks,
>> Tobias
> 


From dean.long at oracle.com  Fri Jan 11 18:48:55 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Fri, 11 Jan 2019 10:48:55 -0800
Subject: [13] RFR(S): 8213249: compiler/graalunit/HotspotTest.java failed
 in ExplicitExceptionTest
In-Reply-To: <a71b4e27-0b3d-7f85-6f55-448d7ea2790b@oracle.com>
References: <a71b4e27-0b3d-7f85-6f55-448d7ea2790b@oracle.com>
Message-ID: <ed64b39c-ad4b-de21-9a0c-cf63ba662c36@oracle.com>

The fix seems reasonable.? It's a little strange that the test needs to 
know about a C2 flag, but these
tests are already strange because they care about exception messages 
exactly matching between
compilers.

dl

On 1/11/19 1:31 AM, Tobias Hartmann wrote:
> Hi,
>
> please review the following patch:
> https://bugs.openjdk.java.net/browse/JDK-8213249
> http://cr.openjdk.java.net/~thartmann/8213249/webrev.01/
>
> The problem is C2's -XX:OmitStackTraceInFastThrow which is enabled by default. Instead of
> deoptimizing at frequent throws (such as the ArrayIndexOutOfBoundsException in this case), C2 emits
> code to throw a pre-allocated exception object (see code in GraphKit::builtin_throw()) which has a
> null message.
>
> Thanks,
> Tobias


From vladimir.x.ivanov at oracle.com  Fri Jan 11 19:23:59 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Fri, 11 Jan 2019 11:23:59 -0800
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes
 wrong post-dominating point
In-Reply-To: <09abefa3-37b4-5657-22f9-e06f144a1867@oracle.com>
References: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
 <874lafzfiy.fsf@redhat.com> <09abefa3-37b4-5657-22f9-e06f144a1867@oracle.com>
Message-ID: <21fc0b3a-87d5-0c31-0d63-75eca3c05e5b@oracle.com>


On 11/01/2019 10:23, Vladimir Ivanov wrote:
> 
>>> ??? #2: As part of loop limit check insertion, new IfNode is created (If
>>> 1854) and linked to R1722 as an input which causes R1722 IDOM to be
>>> updated [2]. It changes R1722 IDOM (I1511 => R1784), since dom_lca()
>>> normalizes the result using find_non_split_ctrl().
>>
>> Isn't that the root cause: the idom of R1722 is still I1511 and not
>> R1784?
> 
> If it were the case, then PhaseIdealLoop::handle_use()/spinup() would 
> reliably crash on all users of Phi 1790. There are 2 other Regions 
> (R1710 and R1716) which keep their IDOM (I1511) intact and the 
> transformation works fine for them.
> 
> R1722 is changed during strip mining transformation and its IDOM is 
> recomputed (I1511 => R1784). 

To elaborate a bit more on that: the only reason IDOM changes is due to 
the way it is computed:
     // rgn = R1722, new_iff = I1854
     Node* ridom = idom(rgn); // ridom = I1522 = IDOM(R1722)
     Node* nrdom = dom_lca(ridom, new_iff); // nrdom = R1784
     set_idom(rgn, nrdom, dom_depth(rgn));

     Node *dom_lca( Node *n1, Node *n2 ) const {
       return find_non_split_ctrl(dom_lca_internal(n1, n2));
     }

     dom_lca_internal(I1522, I1854) = I1522
     find_non_split_ctrl(I1522) = R1784

If IDOM info is recomputed from scratch, IDOM(R1722) remains I1511.

So, eager IDOM normalization (during initial construcion) doesn't help: 
it would lead to consistently hitting the problem in 
PhaseIdealLoop::handle_use()/spinup() when processing dependent RegionNodes.

Best regards,
Vladimir Ivanov

From vivek.r.deshpande at intel.com  Fri Jan 11 19:38:16 2019
From: vivek.r.deshpande at intel.com (Deshpande, Vivek R)
Date: Fri, 11 Jan 2019 19:38:16 +0000
Subject: RFR(S):8216050:X86: Fix for Superword optimization fails with
 assert(0 <= i && i < _len) failed: illegal index
In-Reply-To: <045def80-a536-ae87-7384-09b30e5a8d78@oracle.com>
References: <53E8E64DB2403849AFD89B7D4DAC8B2A9A148B9C@ORSMSX106.amr.corp.intel.com>
 <045def80-a536-ae87-7384-09b30e5a8d78@oracle.com>
Message-ID: <53E8E64DB2403849AFD89B7D4DAC8B2A9A14A45C@ORSMSX106.amr.corp.intel.com>

Hi Tobias

Thanks for reviewing the patch.
I have made the changes according to your suggestion.
In this webrev: http://cr.openjdk.java.net/~vdeshpande/8216050/webrev.01/
I have fix for the crash reported in the 8216050.

The lower cost is needed for generation of vpdpwssd instruction, by combining AddVI and MulAddVS2VI.
For other instructions pmaddwd and vpmaddwd, they get generated on platforms upto skylake with default cost.

I have updated the bug also with the link to webrev.

I have created a different bug JDK-8216580 for  
 3) Fix generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have different control RangeCheck nodes
     for a[i] and a[i+1] accesses in same MulAddS2I node

Thank you.
Regards,
Vivek

-----Original Message-----
From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com] 
Sent: Friday, January 11, 2019 4:49 AM
To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-dev at openjdk.java.net>
Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; Raj, Guru <guru.raj at intel.com>
Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails with assert(0 <= i && i < _len) failed: illegal index

Hi Vivek,

On 11.01.19 07:58, Deshpande, Vivek R wrote:
> 1) Fix for the crash by matching the operand by swapping to right positions. 

Looks good but the change to loopopts.cpp:530 screwed up the indentation around the ifs, please fix.

> 2) Cost based generation of vpdpwssd instruction. 

Other instructions added by JDK-8214751 still miss a cost definition, for example:
http://hg.openjdk.java.net/jdk/jdk/rev/4bb6e0871bf7#l5.20

> 3) Fix generation of vector code by allowing adjacent LoadS nodes to 
> be isomorphic when they have different control RangeCheck nodes
> ????for a[i] and a[i+1] accesses in same MulAddS2I node

This is unrelated to the original bug, right? If so, this should be integrated with a separate RFE.

Thanks,
Tobias

From dean.long at oracle.com  Fri Jan 11 19:46:23 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Fri, 11 Jan 2019 11:46:23 -0800
Subject: RFR(S): 8216556: Unnecessary liveness computation with JVMTI
In-Reply-To: <88842ba1a169406d9628ab06665bd787@sap.com>
References: <88842ba1a169406d9628ab06665bd787@sap.com>
Message-ID: <f26c0fdd-70c7-704a-6627-2848f181ddf5@oracle.com>

Hi Martin.? Looks good to me.

dl

On 1/11/19 4:55 AM, Doerr, Martin wrote:
>
> Hi,
>
> I?d like to contribute a small JIT improvement for JVMTI to avoid 
> calling raw_liveness_at_bci when its result is not needed.
>
> Bug with description:
>
> https://bugs.openjdk.java.net/browse/JDK-8216556
>
> Webrev:
>
> http://cr.openjdk.java.net/~mdoerr/8216556_JVMTI_liveness/webrev.00/
>
> Please review.
>
> Best regards,
>
> Martin
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190111/b2923955/attachment.html>

From vladimir.x.ivanov at oracle.com  Fri Jan 11 19:49:29 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Fri, 11 Jan 2019 11:49:29 -0800
Subject: RFR(XS): 8216549: Mismatched unsafe access to non escaping object
 fails
In-Reply-To: <877efbzh8a.fsf@redhat.com>
References: <877efbzh8a.fsf@redhat.com>
Message-ID: <7962eba3-28c3-44d4-f88b-58ea9640f25e@oracle.com>

Looks good.

Best regards,
Vladimir Ivanov

On 11/01/2019 01:16, Roland Westrelin wrote:
> 
> http://cr.openjdk.java.net/~roland/8216549/webrev.00/
> 
> test1(), test2() and test3() perform an unsafe access with a mismatched
> access.
> 
> test1() compiles to an unschedulable graph and causes the compiler to
> crash. The memory input of the load from a non escaping allocation
> initially points to a membar but is set to bypass the membar while
> control stays set to the membar. The load is not eliminated because it's
> a mismatched memory access, an anti dependence is added between the
> membar and the load and the graph is unschedulable.
> 
> test2() and test3() return wrong results: the access is mismatched and
> misaligned, it's given its own alias by c2 but the MergeMem right after
> the allocation only points to the allocation for actual fields of the
> newly allocated object. So the load memory input is set to the memory
> state on method entry and the load is optimized as zero.
> 
> I simply propose to make non escaping allocations with mismatched
> accesses to be non scalar replaceable.
> 
> Roland.
> 

From vivek.r.deshpande at intel.com  Sat Jan 12 00:03:49 2019
From: vivek.r.deshpande at intel.com (Deshpande, Vivek R)
Date: Sat, 12 Jan 2019 00:03:49 +0000
Subject: RFR(XS):8216580:X86: Fix generation of VNNI vector code by
 allowing adjacent LoadS nodes to be isomorphic
Message-ID: <53E8E64DB2403849AFD89B7D4DAC8B2A9A14A6DA@ORSMSX106.amr.corp.intel.com>

Hi Tobias

The webrev for the bug JDK-821650 is here:
http://cr.openjdk.java.net/~vdeshpande/8216580/webrev.00/
This fixes generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have different control RangeCheck nodes for a[i] and a[i+1] accesses in same MulAddS2I node.
Could you please review it.

Regards,
Vivek

-----Original Message-----
From: Deshpande, Vivek R 
Sent: Friday, January 11, 2019 11:38 AM
To: 'Tobias Hartmann' <tobias.hartmann at oracle.com>; hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-dev at openjdk.java.net>
Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; Raj, Guru <guru.raj at intel.com>
Subject: RE: RFR(S):8216050:X86: Fix for Superword optimization fails with assert(0 <= i && i < _len) failed: illegal index

Hi Tobias

Thanks for reviewing the patch.
I have made the changes according to your suggestion.
In this webrev: http://cr.openjdk.java.net/~vdeshpande/8216050/webrev.01/
I have fix for the crash reported in the 8216050.

The lower cost is needed for generation of vpdpwssd instruction, by combining AddVI and MulAddVS2VI.
For other instructions pmaddwd and vpmaddwd, they get generated on platforms upto skylake with default cost.

I have updated the bug also with the link to webrev.

I have created a different bug JDK-8216580 for
 3) Fix generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have different control RangeCheck nodes
     for a[i] and a[i+1] accesses in same MulAddS2I node

Thank you.
Regards,
Vivek

-----Original Message-----
From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com]
Sent: Friday, January 11, 2019 4:49 AM
To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-dev at openjdk.java.net>
Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; Raj, Guru <guru.raj at intel.com>
Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails with assert(0 <= i && i < _len) failed: illegal index

Hi Vivek,

On 11.01.19 07:58, Deshpande, Vivek R wrote:
> 1) Fix for the crash by matching the operand by swapping to right positions. 

Looks good but the change to loopopts.cpp:530 screwed up the indentation around the ifs, please fix.

> 2) Cost based generation of vpdpwssd instruction. 

Other instructions added by JDK-8214751 still miss a cost definition, for example:
http://hg.openjdk.java.net/jdk/jdk/rev/4bb6e0871bf7#l5.20

> 3) Fix generation of vector code by allowing adjacent LoadS nodes to 
> be isomorphic when they have different control RangeCheck nodes
> ????for a[i] and a[i+1] accesses in same MulAddS2I node

This is unrelated to the original bug, right? If so, this should be integrated with a separate RFE.

Thanks,
Tobias

From doug.simon at oracle.com  Sat Jan 12 12:57:19 2019
From: doug.simon at oracle.com (Doug Simon)
Date: Sat, 12 Jan 2019 13:57:19 +0100
Subject: [12] RFR(S): 8215313: [AOT] java/lang/String/Split.java fails
 with AOTed java.base
In-Reply-To: <88068DB1-E235-448C-A7D8-B1208143CCCF@oracle.com>
References: <dab7abd3-1abd-1652-a328-6a81960f934e@khg.jku.at>
 <88068DB1-E235-448C-A7D8-B1208143CCCF@oracle.com>
Message-ID: <E101F893-ED66-418B-9CA8-64FBD0334EA3@oracle.com>


> On 11 Jan 2019, at 10:02, Doug Simon <doug.simon at oracle.com> wrote:
> 
> Hi Josef,
> 
>> On 10 Jan 2019, at 22:52, Josef Haider <josef.haider at khg.jku.at <mailto:josef.haider at khg.jku.at>> wrote:
>> 
>> Agreed, cmpw/cmpb would make more sense here, i just wanted
>> to keep the changeset minimal, since the entire method may soon be
>> changed again, anyway. 
>> 
> Can you please say more about this? Would you recommend applying your current patch as is to fix the crash or will you have the changes you mention ready soon?

Josef has updated his fix to use cmpw/cmpb:

http://cr.openjdk.java.net/~dnsimon/8215313/

Previous webrev is now at http://cr.openjdk.java.net/~dnsimon/8215313.old/20190112_1342

Dean, can you please re-review.

-Doug

> 
> -Doug
>>> Taking another look, it seems like cmpl could be replaced with the 
>>> size-appropriate cmpb, cmpw, or cmpl based on byteMode(kind) and 
>>> findTwoCharPrefix, but I guess AMD64Assembler only supports cmpl and 
>>> cmpq right now.
>>> 
>>> dl
>>> 
>>> On 1/10/19 10:04 AM, dean.long at oracle.com <https://mail.openjdk.java.net/mailman/listinfo/hotspot-compiler-dev> wrote:
>>> > Is it OK to modify the values of searchValue[i]?  If the search value 
>>> > is already sign-extended, how about sign-extending cmpResult instead 
>>> > of zero-extending searchValue?
>>> >
>>> > dl
>>> >
>>> > On 1/10/19 7:09 AM, Doug Simon wrote:
>>> >> Please review this fix supplied by Josef Haider for an incorrect 
>>> >> compilation of String.split.
>>> >>
>>> >> When the String.indexOf intrinsic on AMD64 reaches the end of a 
>>> >> string, it tries to vectorize the last compare operations by reading 
>>> >> past the bounds of the character/byte array. This is not safe if the 
>>> >> out-of-bounds read would cross a page boundary, so in that case 
>>> >> characters are compared one-by-one. This is done with a 
>>> >> `cmpl`-instruction, which only works as long as the bytes/chars are 
>>> >> not sign extended.
>>> >>
>>> >> The fix is to simply `and` the characters we are searching for with 
>>> >> `0xff`/`0xffff` in order to eliminate any erroneous sign extensions.
>>> >>
>>> >> http://cr.openjdk.java.net/~dnsimon/8215313 <http://cr.openjdk.java.net/~dnsimon/8215313>
>>> >> https://bugs.openjdk.java.net/browse/JDK-8215313 <https://bugs.openjdk.java.net/browse/JDK-8215313>
>>> >>
>>> >> -Doug
>>> >
>>> 
>> 
>> 
> 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190112/8379b5be/attachment.html>

From vladimir.kozlov at oracle.com  Sat Jan 12 22:40:27 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Sat, 12 Jan 2019 14:40:27 -0800
Subject: [13] RFR(S): 8213249: compiler/graalunit/HotspotTest.java failed
 in ExplicitExceptionTest
In-Reply-To: <a71b4e27-0b3d-7f85-6f55-448d7ea2790b@oracle.com>
References: <a71b4e27-0b3d-7f85-6f55-448d7ea2790b@oracle.com>
Message-ID: <256a33be-08f5-9670-edf9-ff640f19a54c@oracle.com>

Good.

Thanks,
Vladimir

On 1/11/19 1:31 AM, Tobias Hartmann wrote:
> Hi,
> 
> please review the following patch:
> https://bugs.openjdk.java.net/browse/JDK-8213249
> http://cr.openjdk.java.net/~thartmann/8213249/webrev.01/
> 
> The problem is C2's -XX:OmitStackTraceInFastThrow which is enabled by default. Instead of
> deoptimizing at frequent throws (such as the ArrayIndexOutOfBoundsException in this case), C2 emits
> code to throw a pre-allocated exception object (see code in GraphKit::builtin_throw()) which has a
> null message.
> 
> Thanks,
> Tobias
> 

From vladimir.kozlov at oracle.com  Sun Jan 13 02:20:19 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Sat, 12 Jan 2019 18:20:19 -0800
Subject: RFR: 8216424: Remove or clean up TimeLivenessAnalysis
In-Reply-To: <c713a6f7-8a3c-709d-6770-cae518b2ec06@oracle.com>
References: <9ab53da1-d6d8-d5f8-10b6-da960444aa6c@oracle.com>
 <c713a6f7-8a3c-709d-6770-cae518b2ec06@oracle.com>
Message-ID: <52e3cb2a-37d4-6e84-f33c-8d9bf572de7e@oracle.com>

Agree with removal. I never used it.

Thanks,
Vladimir

On 1/9/19 8:54 AM, Tobias Hartmann wrote:
> Hi Claes,
> 
> Both webrevs look good to me but I would prefer removal as well. I haven't ever seen anyone using
> that flag but let's wait for more opinions.
> 
> Best regards,
> Tobias
> 
> On 09.01.19 16:10, Claes Redestad wrote:
>> Hi,
>>
>> implementation for the develop flag TimeLivenessAnalysis leaves a few
>> breadcrumbs in product builds (in particular TraceTime
>> constructors/destructors aren't being inlined, so the compiler doesn't
>> realize these objects aren't actually doing anything)
>>
>> Bug: https://bugs.openjdk.java.net/browse/JDK-8216424
>>
>> This should be either cleaned up:
>> http://cr.openjdk.java.net/~redestad/8216424/cleanup.00/
>>
>> .. or the flag should be removed altogether:
>> http://cr.openjdk.java.net/~redestad/8216424/remove.00/
>>
>> I favor removal since the statistics collected by this analysis does
>> not seem very useful and any real performance effect could/should be
>> estimated using real profiling tools on product builds, anyhow.
>>
>> Thanks!
>>
>> /Claes

From vladimir.kozlov at oracle.com  Sun Jan 13 02:36:48 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Sat, 12 Jan 2019 18:36:48 -0800
Subject: RFR(S): 8216556: Unnecessary liveness computation with JVMTI
In-Reply-To: <f26c0fdd-70c7-704a-6627-2848f181ddf5@oracle.com>
References: <88842ba1a169406d9628ab06665bd787@sap.com>
 <f26c0fdd-70c7-704a-6627-2848f181ddf5@oracle.com>
Message-ID: <36d50534-5f4d-d443-bf3c-4286d977faa5@oracle.com>

+1

Thanks,
Vladimir

On 1/11/19 11:46 AM, dean.long at oracle.com wrote:
> Hi Martin.? Looks good to me.
> 
> dl
> 
> On 1/11/19 4:55 AM, Doerr, Martin wrote:
>>
>> Hi,
>>
>> I?d like to contribute a small JIT improvement for JVMTI to avoid calling raw_liveness_at_bci when its result is not 
>> needed.
>>
>> Bug with description:
>>
>> https://bugs.openjdk.java.net/browse/JDK-8216556
>>
>> Webrev:
>>
>> http://cr.openjdk.java.net/~mdoerr/8216556_JVMTI_liveness/webrev.00/
>>
>> Please review.
>>
>> Best regards,
>>
>> Martin
>>
> 

From vladimir.kozlov at oracle.com  Sun Jan 13 02:46:57 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Sat, 12 Jan 2019 18:46:57 -0800
Subject: [12] RFR(S) 8216151: [Graal] Module
 jdk.internal.vm.compiler.management has not been granted
 accessClassInPackage.org.graalvm.compiler.debug
Message-ID: <077fcff1-28fc-acbf-6a8b-c299978ae0a2@oracle.com>

http://cr.openjdk.java.net/~kvn/8216151/webrev.00/
https://bugs.openjdk.java.net/browse/JDK-8216151

Have to update default.policy after changes in jdk.internal.vm.compiler.management files done by JDK-8199755: "Update 
Graal".

Ran CheckAccessClassInPackagePermissions.java test.

-- 
Thanks,
Vladimir

From claes.redestad at oracle.com  Sun Jan 13 11:59:46 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Sun, 13 Jan 2019 12:59:46 +0100
Subject: RFR: 8216424: Remove or clean up TimeLivenessAnalysis
In-Reply-To: <52e3cb2a-37d4-6e84-f33c-8d9bf572de7e@oracle.com>
References: <9ab53da1-d6d8-d5f8-10b6-da960444aa6c@oracle.com>
 <c713a6f7-8a3c-709d-6770-cae518b2ec06@oracle.com>
 <52e3cb2a-37d4-6e84-f33c-8d9bf572de7e@oracle.com>
Message-ID: <c325aac7-5d44-8c6f-323a-12c542bcd362@oracle.com>

Thanks, Vladimir,

I'll go ahead and remove it, then.

/Claes

On 2019-01-13 03:20, Vladimir Kozlov wrote:
> Agree with removal. I never used it.
> 
> Thanks,
> Vladimir
> 
> On 1/9/19 8:54 AM, Tobias Hartmann wrote:
>> Hi Claes,
>>
>> Both webrevs look good to me but I would prefer removal as well. I 
>> haven't ever seen anyone using
>> that flag but let's wait for more opinions.
>>
>> Best regards,
>> Tobias
>>
>> On 09.01.19 16:10, Claes Redestad wrote:
>>> Hi,
>>>
>>> implementation for the develop flag TimeLivenessAnalysis leaves a few
>>> breadcrumbs in product builds (in particular TraceTime
>>> constructors/destructors aren't being inlined, so the compiler doesn't
>>> realize these objects aren't actually doing anything)
>>>
>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8216424
>>>
>>> This should be either cleaned up:
>>> http://cr.openjdk.java.net/~redestad/8216424/cleanup.00/
>>>
>>> .. or the flag should be removed altogether:
>>> http://cr.openjdk.java.net/~redestad/8216424/remove.00/
>>>
>>> I favor removal since the statistics collected by this analysis does
>>> not seem very useful and any real performance effect could/should be
>>> estimated using real profiling tools on product builds, anyhow.
>>>
>>> Thanks!
>>>
>>> /Claes

From bsrbnd at gmail.com  Sun Jan 13 17:10:07 2019
From: bsrbnd at gmail.com (B. Blaser)
Date: Sun, 13 Jan 2019 18:10:07 +0100
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <ba49e1b8-b7c8-b804-f973-6e88cb2bcd8a@redhat.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
 <ba49e1b8-b7c8-b804-f973-6e88cb2bcd8a@redhat.com>
Message-ID: <CAEgw74DXHF4cmVzOkHtvGuO937QqaypRFNvuB5sfDw3h8HOUPQ@mail.gmail.com>

On Thu, 10 Jan 2019 at 10:19, Andrew Haley <aph at redhat.com> wrote:
>
> On 1/9/19 12:13 PM, Roman Kennke wrote:
> > I cannot say if if this has performance implication. I suspect not. If
> > it has, it's probably miniscule improvement. I can't see how it could be
> > worse though.
>
> I can. x86 can have some very weird performance characteristics. It'd be
> helpful to do some measurement.

I'm not sure we are really able to conclude anything from performance
measurement on highly implementation-dependent instructions unless we
make an average on a significant number of different x86_64 processors
which might well change with future generations...

Shouldn't we follow a more pragmatic direction considering that less
instructions/registers and a better/smaller encoding is generally
preferable, as Roman suggested, which is the purpose of complex
instruction sets?

Bernard

From vladimir.kozlov at oracle.com  Sun Jan 13 21:34:20 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Sun, 13 Jan 2019 13:34:20 -0800
Subject: RFR(M): 8216135: C2 assert(!had_error) failed: bad dominance
In-Reply-To: <87ef9m178o.fsf@redhat.com>
References: <87ef9m178o.fsf@redhat.com>
Message-ID: <690e8952-5996-6a69-949d-9f196e2b84d8@oracle.com>

Looks reasonable.

Did you test with switched off UseLoopPredicate?

Thanks,
Vladimir

On 1/9/19 1:59 AM, Roland Westrelin wrote:
> 
> http://cr.openjdk.java.net/~roland/8216135/webrev.00/
> 
> Range check elimination is applied to a loop and then the loop is
> unrolled. After the loop is unrolled, the range of values for the
> induction variable conflicts with a range check CastII (the loop is over
> unrolled and the main loop would never be executed), the CastII's value
> becomes top, a data path dies but the corresponding control path is kept
> alive. This results in a broken graph.
> 
> This scenario is supposed to be caught by the skeleton predicates added
> by 8193130 but it's not for 2 reasons:
> 
> 1- With 8203915 & 8205033, Tobias extended skeleton predicates to cover
>    not only the first value of the induction variable of the first loop
>    iteration but also the last value of an unrolled loop. But his changes
>    only apply to loop predicates, not range check elimination.
> 
> 2- With 8203915 & 8205033, Tobias used an Opaque1 node as a place holder
>    so on each unrolling, he could update the skeleton predicate with the
>    new stride. The problem is that the Opaque1 node blocks type
>    propagation and the skeleton predicate only has a chance to remove a
>    dead main loop after loop opts are over. In the case of this bug, the
>    CastII becomes dead before loop opts are finished.
> 
> The problem with 2- is that if the Opaque1 node is not added, on the
> next unrolling there's no way to find what predicate and what part of
> the predicate to update. The fix I propose, is to keep 3 predicates
> after the first unrolling:
> 
> 1 for the first value of the first iteration
> 1 for the last value of the last iteration, without an Opaque1 node
> 1 with an Opaque1 node that can be used as a template
> 
> On the next unrolling pass, the 1st and 2nd predicates above could have
> been optimized out. Rather than try to locate and update the 2nd
> predicate, the 1st and 2nd predicates are removed if they are found and,
> once the code finds the 3rd predicate, it clones it once to produce the
> check on the first value again and a second time to produce an updated
> check on the new last value.
> 
> Roland.
> 

From vladimir.kozlov at oracle.com  Sun Jan 13 21:57:54 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Sun, 13 Jan 2019 13:57:54 -0800
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes
 wrong post-dominating point
In-Reply-To: <21fc0b3a-87d5-0c31-0d63-75eca3c05e5b@oracle.com>
References: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
 <874lafzfiy.fsf@redhat.com> <09abefa3-37b4-5657-22f9-e06f144a1867@oracle.com>
 <21fc0b3a-87d5-0c31-0d63-75eca3c05e5b@oracle.com>
Message-ID: <14b4d2c8-0cf4-4c49-309b-1838da8536a0@oracle.com>

On 1/11/19 11:23 AM, Vladimir Ivanov wrote:
> 
> 
> On 11/01/2019 10:23, Vladimir Ivanov wrote:
>>
>>>> ??? #2: As part of loop limit check insertion, new IfNode is created (If
>>>> 1854) and linked to R1722 as an input which causes R1722 IDOM to be
>>>> updated [2]. It changes R1722 IDOM (I1511 => R1784), since dom_lca()
>>>> normalizes the result using find_non_split_ctrl().
>>>
>>> Isn't that the root cause: the idom of R1722 is still I1511 and not
>>> R1784?
>>
>> If it were the case, then PhaseIdealLoop::handle_use()/spinup() would reliably crash on all users of Phi 1790. There 
>> are 2 other Regions (R1710 and R1716) which keep their IDOM (I1511) intact and the transformation works fine for them.
>>
>> R1722 is changed during strip mining transformation and its IDOM is recomputed (I1511 => R1784). 
> 
> To elaborate a bit more on that: the only reason IDOM changes is due to the way it is computed:
>  ??? // rgn = R1722, new_iff = I1854
>  ??? Node* ridom = idom(rgn); // ridom = I1522 = IDOM(R1722)

Is it typo? Should it be I1511? I don't see I1522 in graph's picture.

>  ??? Node* nrdom = dom_lca(ridom, new_iff); // nrdom = R1784
>  ??? set_idom(rgn, nrdom, dom_depth(rgn));
> 
>  ??? Node *dom_lca( Node *n1, Node *n2 ) const {
>  ????? return find_non_split_ctrl(dom_lca_internal(n1, n2));
>  ??? }
> 
>  ??? dom_lca_internal(I1522, I1854) = I1522

I assume it is 1511.

>  ??? find_non_split_ctrl(I1522) = R1784
> 
> If IDOM info is recomputed from scratch, IDOM(R1722) remains I1511.

Can you explain more this point? Why result is different if it is from scratch?

Thanks,
Vladimir K

> 
> So, eager IDOM normalization (during initial construcion) doesn't help: it would lead to consistently hitting the 
> problem in PhaseIdealLoop::handle_use()/spinup() when processing dependent RegionNodes.
> 
> Best regards,
> Vladimir Ivanov

From vladimir.kozlov at oracle.com  Sun Jan 13 22:05:04 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Sun, 13 Jan 2019 14:05:04 -0800
Subject: RFR(XS): 8216549: Mismatched unsafe access to non escaping object
 fails
In-Reply-To: <87r2djxpy6.fsf@redhat.com>
References: <877efbzh8a.fsf@redhat.com> <87r2djxpy6.fsf@redhat.com>
Message-ID: <579a07cb-7617-6c39-38ba-595e369a41b2@oracle.com>

I would suggest to push it into JDK 12. It is P3 and nasty one.

Thanks,
Vladimir

On 1/11/19 5:51 AM, Roland Westrelin wrote:
> 
> Also: I targeted this to 13 but I don't really have a strong opinion
> whether it should go in 12 or 13.
> 
> Roland.
> 

From vladimir.kozlov at oracle.com  Sun Jan 13 22:05:46 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Sun, 13 Jan 2019 14:05:46 -0800
Subject: RFR(XS): 8216549: Mismatched unsafe access to non escaping object
 fails
In-Reply-To: <7962eba3-28c3-44d4-f88b-58ea9640f25e@oracle.com>
References: <877efbzh8a.fsf@redhat.com>
 <7962eba3-28c3-44d4-f88b-58ea9640f25e@oracle.com>
Message-ID: <04834367-91bc-e60d-f510-736a8ec2fedd@oracle.com>

+1

Thanks,
Vladimir K.

On 1/11/19 11:49 AM, Vladimir Ivanov wrote:
> Looks good.
> 
> Best regards,
> Vladimir Ivanov
> 
> On 11/01/2019 01:16, Roland Westrelin wrote:
>>
>> http://cr.openjdk.java.net/~roland/8216549/webrev.00/
>>
>> test1(), test2() and test3() perform an unsafe access with a mismatched
>> access.
>>
>> test1() compiles to an unschedulable graph and causes the compiler to
>> crash. The memory input of the load from a non escaping allocation
>> initially points to a membar but is set to bypass the membar while
>> control stays set to the membar. The load is not eliminated because it's
>> a mismatched memory access, an anti dependence is added between the
>> membar and the load and the graph is unschedulable.
>>
>> test2() and test3() return wrong results: the access is mismatched and
>> misaligned, it's given its own alias by c2 but the MergeMem right after
>> the allocation only points to the allocation for actual fields of the
>> newly allocated object. So the load memory input is set to the memory
>> state on method entry and the load is optimized as zero.
>>
>> I simply propose to make non escaping allocations with mismatched
>> accesses to be non scalar replaceable.
>>
>> Roland.
>>

From dean.long at oracle.com  Sun Jan 13 22:07:14 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Sun, 13 Jan 2019 14:07:14 -0800
Subject: [12] RFR(S): 8215313: [AOT] java/lang/String/Split.java fails
 with AOTed java.base
In-Reply-To: <E101F893-ED66-418B-9CA8-64FBD0334EA3@oracle.com>
References: <dab7abd3-1abd-1652-a328-6a81960f934e@khg.jku.at>
 <88068DB1-E235-448C-A7D8-B1208143CCCF@oracle.com>
 <E101F893-ED66-418B-9CA8-64FBD0334EA3@oracle.com>
Message-ID: <be3eae6d-b4c1-45dd-6e16-488b8d202580@oracle.com>

Looks good.? Please update copyright with 2019 end year.

dl

On 1/12/19 4:57 AM, Doug Simon wrote:
>
>
>> On 11 Jan 2019, at 10:02, Doug Simon <doug.simon at oracle.com 
>> <mailto:doug.simon at oracle.com>> wrote:
>>
>> Hi Josef,
>>
>>> On 10 Jan 2019, at 22:52, Josef Haider <josef.haider at khg.jku.at 
>>> <mailto:josef.haider at khg.jku.at>> wrote:
>>>
>>> Agreed, cmpw/cmpb would make more sense here, i just wanted
>>> to keep the changeset minimal, since the entire method may soon be
>>> changed again, anyway.
>>>
>> Can you please say more about this? Would you recommend applying your 
>> current patch as is to fix the crash or will you have the changes you 
>> mention ready soon?
>
> Josef has updated his fix to use cmpw/cmpb:
>
> http://cr.openjdk.java.net/~dnsimon/8215313/
>
> Previous webrev is now at 
> http://cr.openjdk.java.net/~dnsimon/8215313.old/20190112_1342
>
> Dean, can you please re-review.
>
> -Doug
>
>>
>> -Doug
>>>> Taking another look, it seems like cmpl could be replaced with the
>>>> size-appropriate cmpb, cmpw, or cmpl based on byteMode(kind) and
>>>> findTwoCharPrefix, but I guess AMD64Assembler only supports cmpl and
>>>> cmpq right now.
>>>>
>>>> dl
>>>>
>>>> On 1/10/19 10:04 AM,dean.long at oracle.com  <https://mail.openjdk.java.net/mailman/listinfo/hotspot-compiler-dev>  wrote:
>>>> >/Is it OK to modify the values of searchValue[i]?? If the search value />/is already sign-extended, how about sign-extending cmpResult instead />/of zero-extending searchValue? />//>/dl />//>/On 1/10/19 7:09 AM, Doug Simon wrote: />>/Please review this fix supplied by Josef Haider for an incorrect />>/compilation of String.split. />>//>>/When the String.indexOf intrinsic on AMD64 reaches the end of a />>/string, it tries to vectorize the last compare operations by reading />>/past the bounds of the character/byte array. This is not safe if the />>/out-of-bounds read would cross a page boundary, so in that case />>/characters are compared one-by-one. This is done with a />>/`cmpl`-instruction, which only works as long as the bytes/chars are />>/not sign extended. />>//>>/The fix is to simply `and` the characters we are searching for with />>/`0xff`/`0xffff` in order to eliminate any erroneous sign extensions. />>//>>/http://cr.openjdk.java.net/~dnsimon/8215313 />>/https://bugs.openjdk.java.net/browse/JDK-8215313 />>//>>/-Doug />//
>>>>
>>>
>>>
>>
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190113/df3d7da4/attachment-0001.html>

From vladimir.kozlov at oracle.com  Sun Jan 13 22:11:20 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Sun, 13 Jan 2019 14:11:20 -0800
Subject: RFR: 8216427: ciMethodData::load_extra_data() does not always
 unpack the last entry
In-Reply-To: <983e6611-417e-9c7a-0643-389fffa0e2cf@oracle.com>
References: <983e6611-417e-9c7a-0643-389fffa0e2cf@oracle.com>
Message-ID: <a583290c-5393-7594-ab8c-21fba0a99574@oracle.com>

Looks good.

Please run hs-precheckin-comp too.

Thanks,
Vladimir

On 1/11/19 1:53 AM, Erik ?sterlund wrote:
> Hi,
> 
> When unpacking the extra data section of the MDOs, the source and destination might not have the same number of entries, 
> because there can be safepoints between cloning the extra data section of the MDO and unpacking the source entries to 
> the destination entries.
> 
> Therefore the unpacking loop loops through all the source entries and copies them to the destination. Except the last 
> DataLayout::arg_info_data_tag entry, that never gets copied form the source to the destination. Therefore, if a 
> safepoint occurred between cloning the extra data section and unpacking its entries in ciMethodData::load_extra_data(), 
> the last entry could contain random bogus memory.
> 
> It seems like the reason the last entry is not copied is because the copying of an entry requires a length which is 
> currently calculated by taking the difference between the current entry and the next entry in the loop. But as there is 
> no notion of a next entry when you are at the last DataLayout::arg_info_data_tag entry (because it is always the last 
> one when present), so you can't do that. Therefore, the solution of choice seems to have been simply not copying the 
> last DataLayout::arg_info_data_tag entry, instead of calculating what the length of that entry would be.
> 
> This patch appropriately calculates the length of the entries instead (which is also defined for 
> DataLayout::arg_info_data_tag) in the copying loop, allowing the last DataLayout::arg_info_data_tag entry to be copied 
> as well.
> 
> Webrev:
> http://cr.openjdk.java.net/~eosterlund/8216427/webrev.00/
> 
> Bug:
> https://bugs.openjdk.java.net/browse/JDK-8216427
> 
> Tested through hs-tier1-3.
> 
> Thanks,
> /Erik

From erik.osterlund at oracle.com  Sun Jan 13 22:24:24 2019
From: erik.osterlund at oracle.com (Erik Osterlund)
Date: Sun, 13 Jan 2019 14:24:24 -0800 (PST)
Subject: RFR: 8216427: ciMethodData::load_extra_data() does not always
 unpack the last entry
In-Reply-To: <a583290c-5393-7594-ab8c-21fba0a99574@oracle.com>
References: <983e6611-417e-9c7a-0643-389fffa0e2cf@oracle.com>
 <a583290c-5393-7594-ab8c-21fba0a99574@oracle.com>
Message-ID: <9605F7DB-5227-45B0-9F7D-FEA3FE050760@oracle.com>

Hi Vladimir,

Thanks for the review.

/Erik

> On 13 Jan 2019, at 23:11, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
> 
> Looks good.
> 
> Please run hs-precheckin-comp too.
> 
> Thanks,
> Vladimir
> 
>> On 1/11/19 1:53 AM, Erik ?sterlund wrote:
>> Hi,
>> When unpacking the extra data section of the MDOs, the source and destination might not have the same number of entries, because there can be safepoints between cloning the extra data section of the MDO and unpacking the source entries to the destination entries.
>> Therefore the unpacking loop loops through all the source entries and copies them to the destination. Except the last DataLayout::arg_info_data_tag entry, that never gets copied form the source to the destination. Therefore, if a safepoint occurred between cloning the extra data section and unpacking its entries in ciMethodData::load_extra_data(), the last entry could contain random bogus memory.
>> It seems like the reason the last entry is not copied is because the copying of an entry requires a length which is currently calculated by taking the difference between the current entry and the next entry in the loop. But as there is no notion of a next entry when you are at the last DataLayout::arg_info_data_tag entry (because it is always the last one when present), so you can't do that. Therefore, the solution of choice seems to have been simply not copying the last DataLayout::arg_info_data_tag entry, instead of calculating what the length of that entry would be.
>> This patch appropriately calculates the length of the entries instead (which is also defined for DataLayout::arg_info_data_tag) in the copying loop, allowing the last DataLayout::arg_info_data_tag entry to be copied as well.
>> Webrev:
>> http://cr.openjdk.java.net/~eosterlund/8216427/webrev.00/
>> Bug:
>> https://bugs.openjdk.java.net/browse/JDK-8216427
>> Tested through hs-tier1-3.
>> Thanks,
>> /Erik


From martin.doerr at sap.com  Mon Jan 14 08:30:33 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Mon, 14 Jan 2019 08:30:33 +0000
Subject: RFR(S): 8216556: Unnecessary liveness computation with JVMTI
In-Reply-To: <3a600790198e4bbbb6f253daf0af8ff0@sap.com>
References: <88842ba1a169406d9628ab06665bd787@sap.com>
 <9c7afb40-cc2b-9ae8-fb70-4ac3bacb72da@oracle.com>
 <3a600790198e4bbbb6f253daf0af8ff0@sap.com>
Message-ID: <d6a8ad3b4cb94e18ad2de7f05fd1c1dd@sap.com>

Hi Claes,

excellent proposal. Thanks. I had not noticed that it currently is in a cpp file.

New webrev:
http://cr.openjdk.java.net/~mdoerr/8216556_JVMTI_liveness/webrev.01/

What I still don't really like is that we're passing MethodLivenessResult objects on stack via 3 compilation units.
But I don't know if it's worth refactoring the code.

Best regards,
Martin


-----Original Message-----
From: Claes Redestad <claes.redestad at oracle.com> 
Sent: Freitag, 11. Januar 2019 16:45
To: Doerr, Martin <martin.doerr at sap.com>
Subject: Re: RFR(S): 8216556: Unnecessary liveness computation with JVMTI

Hi,

  just a random thought, but if you're optimizing this and got some
measure where it matters(?), maybe you should also try inlining
ciEnv::should_retain_local_variables(), i.e., move definition to
ciEnv.hpp. If it doesn't bloat static binary size it seems like it won't
hurt, at least.

/Claes

On 2019-01-11 13:55, Doerr, Martin wrote:
> Hi,
> 
> I'd like to contribute a small JIT improvement for JVMTI to avoid 
> calling raw_liveness_at_bci when its result is not needed.
> 
> Bug with description:
> 
> https://bugs.openjdk.java.net/browse/JDK-8216556
> 
> Webrev:
> 
> http://cr.openjdk.java.net/~mdoerr/8216556_JVMTI_liveness/webrev.00/
> 
> Please review.
> 
> Best regards,
> 
> Martin
> 

From rwestrel at redhat.com  Mon Jan 14 08:32:00 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Mon, 14 Jan 2019 09:32:00 +0100
Subject: RFR(M): 8216135: C2 assert(!had_error) failed: bad dominance
In-Reply-To: <690e8952-5996-6a69-949d-9f196e2b84d8@oracle.com>
References: <87ef9m178o.fsf@redhat.com>
 <690e8952-5996-6a69-949d-9f196e2b84d8@oracle.com>
Message-ID: <87k1j7y70f.fsf@redhat.com>


> Looks reasonable.

Thanks for the review.

> Did you test with switched off UseLoopPredicate?

I didn't and sure, that makes sense. I'll run that on my system
tonight. Or maybe Tobias can run the same testing he ran with
UseLoopPredicate off?

Roland.

From tobias.hartmann at oracle.com  Mon Jan 14 08:40:42 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 14 Jan 2019 09:40:42 +0100
Subject: [13] RFR(S): 8213249: compiler/graalunit/HotspotTest.java failed
 in ExplicitExceptionTest
In-Reply-To: <256a33be-08f5-9670-edf9-ff640f19a54c@oracle.com>
References: <a71b4e27-0b3d-7f85-6f55-448d7ea2790b@oracle.com>
 <256a33be-08f5-9670-edf9-ff640f19a54c@oracle.com>
Message-ID: <cc17aea5-b139-90a6-9637-2dea269faffc@oracle.com>

Katya, Igor, Dean, Vladimir, thanks for the reviews!

Best regards,
Tobias

On 12.01.19 23:40, Vladimir Kozlov wrote:
> Good.
> 
> Thanks,
> Vladimir
> 
> On 1/11/19 1:31 AM, Tobias Hartmann wrote:
>> Hi,
>>
>> please review the following patch:
>> https://bugs.openjdk.java.net/browse/JDK-8213249
>> http://cr.openjdk.java.net/~thartmann/8213249/webrev.01/
>>
>> The problem is C2's -XX:OmitStackTraceInFastThrow which is enabled by default. Instead of
>> deoptimizing at frequent throws (such as the ArrayIndexOutOfBoundsException in this case), C2 emits
>> code to throw a pre-allocated exception object (see code in GraphKit::builtin_throw()) which has a
>> null message.
>>
>> Thanks,
>> Tobias
>>

From tobias.hartmann at oracle.com  Mon Jan 14 08:42:31 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 14 Jan 2019 09:42:31 +0100
Subject: RFR(XS): 8216549: Mismatched unsafe access to non escaping object
 fails
In-Reply-To: <877efbzh8a.fsf@redhat.com>
References: <877efbzh8a.fsf@redhat.com>
Message-ID: <d4bc4a04-fd70-576a-c937-d00c3702931b@oracle.com>

Hi Roland,

looks good to me as well. I'm fine with pushing to JDK 12.

Best regards,
Tobias

On 11.01.19 10:16, Roland Westrelin wrote:
> 
> http://cr.openjdk.java.net/~roland/8216549/webrev.00/
> 
> test1(), test2() and test3() perform an unsafe access with a mismatched
> access.
> 
> test1() compiles to an unschedulable graph and causes the compiler to
> crash. The memory input of the load from a non escaping allocation
> initially points to a membar but is set to bypass the membar while
> control stays set to the membar. The load is not eliminated because it's
> a mismatched memory access, an anti dependence is added between the
> membar and the load and the graph is unschedulable.
> 
> test2() and test3() return wrong results: the access is mismatched and
> misaligned, it's given its own alias by c2 but the MergeMem right after
> the allocation only points to the allocation for actual fields of the
> newly allocated object. So the load memory input is set to the memory
> state on method entry and the load is optimized as zero.
> 
> I simply propose to make non escaping allocations with mismatched
> accesses to be non scalar replaceable.
> 
> Roland.
> 

From tobias.hartmann at oracle.com  Mon Jan 14 08:43:52 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 14 Jan 2019 09:43:52 +0100
Subject: RFR(M): 8216135: C2 assert(!had_error) failed: bad dominance
In-Reply-To: <87k1j7y70f.fsf@redhat.com>
References: <87ef9m178o.fsf@redhat.com>
 <690e8952-5996-6a69-949d-9f196e2b84d8@oracle.com> <87k1j7y70f.fsf@redhat.com>
Message-ID: <3a6364a3-0483-f454-87e9-7894cf8e8055@oracle.com>

Hi Roland,

On 14.01.19 09:32, Roland Westrelin wrote:
> I didn't and sure, that makes sense. I'll run that on my system
> tonight. Or maybe Tobias can run the same testing he ran with
> UseLoopPredicate off?

Yes, I'll do that.

Best regards,
Tobias

From doug.simon at oracle.com  Mon Jan 14 09:16:31 2019
From: doug.simon at oracle.com (Doug Simon)
Date: Mon, 14 Jan 2019 10:16:31 +0100
Subject: [12] RFR(S): 8215313: [AOT] java/lang/String/Split.java fails
 with AOTed java.base
In-Reply-To: <be3eae6d-b4c1-45dd-6e16-488b8d202580@oracle.com>
References: <dab7abd3-1abd-1652-a328-6a81960f934e@khg.jku.at>
 <88068DB1-E235-448C-A7D8-B1208143CCCF@oracle.com>
 <E101F893-ED66-418B-9CA8-64FBD0334EA3@oracle.com>
 <be3eae6d-b4c1-45dd-6e16-488b8d202580@oracle.com>
Message-ID: <B8D1BF63-F3A2-459E-8F55-51F626730505@oracle.com>


> On 13 Jan 2019, at 23:07, dean.long at oracle.com wrote:
> 
> Looks good. 

Thanks for the review.

> Please update copyright with 2019 end year.

Done.

-Doug

> 
> On 1/12/19 4:57 AM, Doug Simon wrote:
>> 
>> 
>>> On 11 Jan 2019, at 10:02, Doug Simon <doug.simon at oracle.com <mailto:doug.simon at oracle.com>> wrote:
>>> 
>>> Hi Josef,
>>> 
>>>> On 10 Jan 2019, at 22:52, Josef Haider <josef.haider at khg.jku.at <mailto:josef.haider at khg.jku.at>> wrote:
>>>> 
>>>> Agreed, cmpw/cmpb would make more sense here, i just wanted
>>>> to keep the changeset minimal, since the entire method may soon be
>>>> changed again, anyway. 
>>>> 
>>> Can you please say more about this? Would you recommend applying your current patch as is to fix the crash or will you have the changes you mention ready soon?
>> 
>> Josef has updated his fix to use cmpw/cmpb:
>> 
>> http://cr.openjdk.java.net/~dnsimon/8215313/ <http://cr.openjdk.java.net/~dnsimon/8215313/>
>> 
>> Previous webrev is now at http://cr.openjdk.java.net/~dnsimon/8215313.old/20190112_1342 <http://cr.openjdk.java.net/~dnsimon/8215313.old/20190112_1342>
>> 
>> Dean, can you please re-review.
>> 
>> -Doug
>> 
>>> 
>>> -Doug
>>>>> Taking another look, it seems like cmpl could be replaced with the 
>>>>> size-appropriate cmpb, cmpw, or cmpl based on byteMode(kind) and 
>>>>> findTwoCharPrefix, but I guess AMD64Assembler only supports cmpl and 
>>>>> cmpq right now.
>>>>> 
>>>>> dl
>>>>> 
>>>>> On 1/10/19 10:04 AM, dean.long at oracle.com <https://mail.openjdk.java.net/mailman/listinfo/hotspot-compiler-dev> wrote:
>>>>> > Is it OK to modify the values of searchValue[i]?  If the search value 
>>>>> > is already sign-extended, how about sign-extending cmpResult instead 
>>>>> > of zero-extending searchValue?
>>>>> >
>>>>> > dl
>>>>> >
>>>>> > On 1/10/19 7:09 AM, Doug Simon wrote:
>>>>> >> Please review this fix supplied by Josef Haider for an incorrect 
>>>>> >> compilation of String.split.
>>>>> >>
>>>>> >> When the String.indexOf intrinsic on AMD64 reaches the end of a 
>>>>> >> string, it tries to vectorize the last compare operations by reading 
>>>>> >> past the bounds of the character/byte array. This is not safe if the 
>>>>> >> out-of-bounds read would cross a page boundary, so in that case 
>>>>> >> characters are compared one-by-one. This is done with a 
>>>>> >> `cmpl`-instruction, which only works as long as the bytes/chars are 
>>>>> >> not sign extended.
>>>>> >>
>>>>> >> The fix is to simply `and` the characters we are searching for with 
>>>>> >> `0xff`/`0xffff` in order to eliminate any erroneous sign extensions.
>>>>> >>
>>>>> >> http://cr.openjdk.java.net/~dnsimon/8215313 <http://cr.openjdk.java.net/~dnsimon/8215313>
>>>>> >> https://bugs.openjdk.java.net/browse/JDK-8215313 <https://bugs.openjdk.java.net/browse/JDK-8215313>
>>>>> >>
>>>>> >> -Doug
>>>>> >
>>>>> 
>>>> 
>>>> 
>>> 
>> 
> 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190114/54b8c598/attachment.html>

From adinn at redhat.com  Mon Jan 14 09:55:59 2019
From: adinn at redhat.com (Andrew Dinn)
Date: Mon, 14 Jan 2019 09:55:59 +0000
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
Message-ID: <bd9409e3-728e-9a62-fe18-add9bacdd4aa@redhat.com>

On 09/01/2019 12:13, Roman Kennke wrote:
> While poking around x86_64.ad's cmovP instructions (because I needed it
> for an experiment in Shenandoah), I noticed that 2 of them are
> disabled/commented-out:  cmovP_mem and  cmovP_memU. This means that a
> cmovp with a 2nd argument that is a LoadP will generate two instructions:
> 
> mov %r1, $mem
> cmov %r2, %1
> 
> instead of just one:
> 
> cmov %r2, $mem
> 
> The comment there says that adlc doesn't compute the bottom-type
> correctly, and that implicit null-checking is broken, but I couldn't
> confirm either of those. I checked hg annotate, but the commented-out
> block stems from revision #1 and cannot be traced to a bug or so.

I'm not an expert on aldc but I suspect that the first comment cannot
simply be ignored -- even if it appears to work in the cases you have tried.

adlc needs to know bottom types both for memory nodes and for machine
nodes which coalesce memory ops via rule reductions. This is necessary
in order to ensure that ops which affect the same memory slices are
scheduled in the correct order.

Code in files output_h.cpp ad output_c.cpp generates implementations of
a virtual method that retrieves the bottom type for such nodes. CMoveP
instructions are handled as a special case (in output_h.cpp) by
computing the meet of the bottom types of the first and second ins for
the associated node.

That's ok when the ins correspond to standard form inputs. However, I'm
not sure it will correctly handle a rule containing a rule with a memory
form input. Memory inputs are a fiction which corresponds to more than
one in node. I think this may end up computing the bottom type using the
bottom type of the base address without taking into account any offset.
That might well cause nasty errors in the computation of some types.

Perhaps someone from the compiler team can comment on this?

regards,


Andrew Dinn
-----------
Senior Principal Software Engineer
Red Hat UK Ltd
Registered in England and Wales under Company Registration No. 03798903
Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From tobias.hartmann at oracle.com  Mon Jan 14 10:04:45 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 14 Jan 2019 11:04:45 +0100
Subject: [12] RFR(S) 8216151: [Graal] Module
 jdk.internal.vm.compiler.management has not been granted
 accessClassInPackage.org.graalvm.compiler.debug
In-Reply-To: <077fcff1-28fc-acbf-6a8b-c299978ae0a2@oracle.com>
References: <077fcff1-28fc-acbf-6a8b-c299978ae0a2@oracle.com>
Message-ID: <cfb4f0ab-3751-6aac-475a-23e5464a8a62@oracle.com>

Hi Vladimir,

looks good to me.

Best regards,
Tobias


On 13.01.19 03:46, Vladimir Kozlov wrote:
> http://cr.openjdk.java.net/~kvn/8216151/webrev.00/
> https://bugs.openjdk.java.net/browse/JDK-8216151
> 
> Have to update default.policy after changes in jdk.internal.vm.compiler.management files done by
> JDK-8199755: "Update Graal".
> 
> Ran CheckAccessClassInPackagePermissions.java test.
> 

From tobias.hartmann at oracle.com  Mon Jan 14 10:17:51 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 14 Jan 2019 11:17:51 +0100
Subject: RFR: 8216427: ciMethodData::load_extra_data() does not always
 unpack the last entry
In-Reply-To: <983e6611-417e-9c7a-0643-389fffa0e2cf@oracle.com>
References: <983e6611-417e-9c7a-0643-389fffa0e2cf@oracle.com>
Message-ID: <b845f86f-10b0-f704-8017-5f44d24f70e6@oracle.com>

Hi Erik,

looks good to me too.

Best regards,
Tobias

On 11.01.19 10:53, Erik ?sterlund wrote:
> Hi,
> 
> When unpacking the extra data section of the MDOs, the source and destination might not have the
> same number of entries, because there can be safepoints between cloning the extra data section of
> the MDO and unpacking the source entries to the destination entries.
> 
> Therefore the unpacking loop loops through all the source entries and copies them to the
> destination. Except the last DataLayout::arg_info_data_tag entry, that never gets copied form the
> source to the destination. Therefore, if a safepoint occurred between cloning the extra data section
> and unpacking its entries in ciMethodData::load_extra_data(), the last entry could contain random
> bogus memory.
> 
> It seems like the reason the last entry is not copied is because the copying of an entry requires a
> length which is currently calculated by taking the difference between the current entry and the next
> entry in the loop. But as there is no notion of a next entry when you are at the last
> DataLayout::arg_info_data_tag entry (because it is always the last one when present), so you can't
> do that. Therefore, the solution of choice seems to have been simply not copying the last
> DataLayout::arg_info_data_tag entry, instead of calculating what the length of that entry would be.
> 
> This patch appropriately calculates the length of the entries instead (which is also defined for
> DataLayout::arg_info_data_tag) in the copying loop, allowing the last DataLayout::arg_info_data_tag
> entry to be copied as well.
> 
> Webrev:
> http://cr.openjdk.java.net/~eosterlund/8216427/webrev.00/
> 
> Bug:
> https://bugs.openjdk.java.net/browse/JDK-8216427
> 
> Tested through hs-tier1-3.
> 
> Thanks,
> /Erik

From Alan.Bateman at oracle.com  Mon Jan 14 10:27:23 2019
From: Alan.Bateman at oracle.com (Alan Bateman)
Date: Mon, 14 Jan 2019 10:27:23 +0000
Subject: [12] RFR(S) 8216151: [Graal] Module
 jdk.internal.vm.compiler.management has not been granted
 accessClassInPackage.org.graalvm.compiler.debug
In-Reply-To: <077fcff1-28fc-acbf-6a8b-c299978ae0a2@oracle.com>
References: <077fcff1-28fc-acbf-6a8b-c299978ae0a2@oracle.com>
Message-ID: <d2c592ab-93c6-200d-bb80-094874528be4@oracle.com>

On 13/01/2019 02:46, Vladimir Kozlov wrote:
> http://cr.openjdk.java.net/~kvn/8216151/webrev.00/
> https://bugs.openjdk.java.net/browse/JDK-8216151
>
> Have to update default.policy after changes in 
> jdk.internal.vm.compiler.management files done by JDK-8199755: "Update 
> Graal".
>
> Ran CheckAccessClassInPackagePermissions.java test.
>
cc'ing security-dev as that is where is the security policy file is 
maintained.

One thing is double check is that code in 
jdk.internal.vm.compiler.management really needs to access members of 
classes in the listed packages. I ask because the module definition 
doesn't export some of these packages to 
jdk.internal.vm.compiler.management so they aren't accessible even when 
not running with a security manager.

-Alan

From tobias.hartmann at oracle.com  Mon Jan 14 10:40:13 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 14 Jan 2019 11:40:13 +0100
Subject: RFR(S):8216050:X86: Fix for Superword optimization fails with
 assert(0 <= i && i < _len) failed: illegal index
In-Reply-To: <53E8E64DB2403849AFD89B7D4DAC8B2A9A14A45C@ORSMSX106.amr.corp.intel.com>
References: <53E8E64DB2403849AFD89B7D4DAC8B2A9A148B9C@ORSMSX106.amr.corp.intel.com>
 <045def80-a536-ae87-7384-09b30e5a8d78@oracle.com>
 <53E8E64DB2403849AFD89B7D4DAC8B2A9A14A45C@ORSMSX106.amr.corp.intel.com>
Message-ID: <87483987-e5dd-a42b-ff94-da645e041019@oracle.com>

Hi Vivek,

thanks for making these changes. Looks good to me!

A second review would be good.

Best regards,
Tobias

On 11.01.19 20:38, Deshpande, Vivek R wrote:
> Hi Tobias
> 
> Thanks for reviewing the patch.
> I have made the changes according to your suggestion.
> In this webrev: http://cr.openjdk.java.net/~vdeshpande/8216050/webrev.01/
> I have fix for the crash reported in the 8216050.
> 
> The lower cost is needed for generation of vpdpwssd instruction, by combining AddVI and MulAddVS2VI.
> For other instructions pmaddwd and vpmaddwd, they get generated on platforms upto skylake with default cost.
> 
> I have updated the bug also with the link to webrev.
> 
> I have created a different bug JDK-8216580 for  
>  3) Fix generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have different control RangeCheck nodes
>      for a[i] and a[i+1] accesses in same MulAddS2I node
> 
> Thank you.
> Regards,
> Vivek
> 
> -----Original Message-----
> From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com] 
> Sent: Friday, January 11, 2019 4:49 AM
> To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-dev at openjdk.java.net>
> Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; Raj, Guru <guru.raj at intel.com>
> Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails with assert(0 <= i && i < _len) failed: illegal index
> 
> Hi Vivek,
> 
> On 11.01.19 07:58, Deshpande, Vivek R wrote:
>> 1) Fix for the crash by matching the operand by swapping to right positions. 
> 
> Looks good but the change to loopopts.cpp:530 screwed up the indentation around the ifs, please fix.
> 
>> 2) Cost based generation of vpdpwssd instruction. 
> 
> Other instructions added by JDK-8214751 still miss a cost definition, for example:
> http://hg.openjdk.java.net/jdk/jdk/rev/4bb6e0871bf7#l5.20
> 
>> 3) Fix generation of vector code by allowing adjacent LoadS nodes to 
>> be isomorphic when they have different control RangeCheck nodes
>> ????for a[i] and a[i+1] accesses in same MulAddS2I node
> 
> This is unrelated to the original bug, right? If so, this should be integrated with a separate RFE.
> 
> Thanks,
> Tobias
> 

From rkennke at redhat.com  Mon Jan 14 10:57:10 2019
From: rkennke at redhat.com (Roman Kennke)
Date: Mon, 14 Jan 2019 11:57:10 +0100
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <bd9409e3-728e-9a62-fe18-add9bacdd4aa@redhat.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
 <bd9409e3-728e-9a62-fe18-add9bacdd4aa@redhat.com>
Message-ID: <176827a6-32e5-8b9a-3441-3dcca9bc2759@redhat.com>

Hi Andrew,

>> While poking around x86_64.ad's cmovP instructions (because I needed it
>> for an experiment in Shenandoah), I noticed that 2 of them are
>> disabled/commented-out:  cmovP_mem and  cmovP_memU. This means that a
>> cmovp with a 2nd argument that is a LoadP will generate two instructions:
>>
>> mov %r1, $mem
>> cmov %r2, %1
>>
>> instead of just one:
>>
>> cmov %r2, $mem
>>
>> The comment there says that adlc doesn't compute the bottom-type
>> correctly, and that implicit null-checking is broken, but I couldn't
>> confirm either of those. I checked hg annotate, but the commented-out
>> block stems from revision #1 and cannot be traced to a bug or so.
> 
> I'm not an expert on aldc but I suspect that the first comment cannot
> simply be ignored -- even if it appears to work in the cases you have tried.
> 
> adlc needs to know bottom types both for memory nodes and for machine
> nodes which coalesce memory ops via rule reductions. This is necessary
> in order to ensure that ops which affect the same memory slices are
> scheduled in the correct order.
> 
> Code in files output_h.cpp ad output_c.cpp generates implementations of
> a virtual method that retrieves the bottom type for such nodes. CMoveP
> instructions are handled as a special case (in output_h.cpp) by
> computing the meet of the bottom types of the first and second ins for
> the associated node.
> 
> That's ok when the ins correspond to standard form inputs. However, I'm
> not sure it will correctly handle a rule containing a rule with a memory
> form input. Memory inputs are a fiction which corresponds to more than
> one in node. I think this may end up computing the bottom type using the
> bottom type of the base address without taking into account any offset.
> That might well cause nasty errors in the computation of some types.
> 
> Perhaps someone from the compiler team can comment on this?

Yeah, I agree, but I couldn't tell or figure out where and how exactly
it's wrong. And since the comment is from changeset #1, and no bug
referenced, etc, it's hard to find out. And it might also be possible
that it was due to the buggy impl that it was (using 32bit reg instead
of 64). Would be good if somebody from compiler team in Oracle could
comment on it.

Roman


-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190114/606459c2/signature.asc>

From erik.osterlund at oracle.com  Mon Jan 14 11:26:23 2019
From: erik.osterlund at oracle.com (Erik Osterlund)
Date: Mon, 14 Jan 2019 12:26:23 +0100
Subject: RFR: 8216427: ciMethodData::load_extra_data() does not always
 unpack the last entry
In-Reply-To: <b845f86f-10b0-f704-8017-5f44d24f70e6@oracle.com>
References: <983e6611-417e-9c7a-0643-389fffa0e2cf@oracle.com>
 <b845f86f-10b0-f704-8017-5f44d24f70e6@oracle.com>
Message-ID: <A66FC786-66C8-4697-87AD-8461684B9EB4@oracle.com>

Hi Tobias,

Thanks for the review.

/Erik

> On 14 Jan 2019, at 11:17, Tobias Hartmann <tobias.hartmann at oracle.com> wrote:
> 
> Hi Erik,
> 
> looks good to me too.
> 
> Best regards,
> Tobias
> 
>> On 11.01.19 10:53, Erik ?sterlund wrote:
>> Hi,
>> 
>> When unpacking the extra data section of the MDOs, the source and destination might not have the
>> same number of entries, because there can be safepoints between cloning the extra data section of
>> the MDO and unpacking the source entries to the destination entries.
>> 
>> Therefore the unpacking loop loops through all the source entries and copies them to the
>> destination. Except the last DataLayout::arg_info_data_tag entry, that never gets copied form the
>> source to the destination. Therefore, if a safepoint occurred between cloning the extra data section
>> and unpacking its entries in ciMethodData::load_extra_data(), the last entry could contain random
>> bogus memory.
>> 
>> It seems like the reason the last entry is not copied is because the copying of an entry requires a
>> length which is currently calculated by taking the difference between the current entry and the next
>> entry in the loop. But as there is no notion of a next entry when you are at the last
>> DataLayout::arg_info_data_tag entry (because it is always the last one when present), so you can't
>> do that. Therefore, the solution of choice seems to have been simply not copying the last
>> DataLayout::arg_info_data_tag entry, instead of calculating what the length of that entry would be.
>> 
>> This patch appropriately calculates the length of the entries instead (which is also defined for
>> DataLayout::arg_info_data_tag) in the copying loop, allowing the last DataLayout::arg_info_data_tag
>> entry to be copied as well.
>> 
>> Webrev:
>> http://cr.openjdk.java.net/~eosterlund/8216427/webrev.00/
>> 
>> Bug:
>> https://bugs.openjdk.java.net/browse/JDK-8216427
>> 
>> Tested through hs-tier1-3.
>> 
>> Thanks,
>> /Erik


From tobias.hartmann at oracle.com  Mon Jan 14 11:35:54 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 14 Jan 2019 12:35:54 +0100
Subject: RFR(M): 8216135: C2 assert(!had_error) failed: bad dominance
In-Reply-To: <3a6364a3-0483-f454-87e9-7894cf8e8055@oracle.com>
References: <87ef9m178o.fsf@redhat.com>
 <690e8952-5996-6a69-949d-9f196e2b84d8@oracle.com> <87k1j7y70f.fsf@redhat.com>
 <3a6364a3-0483-f454-87e9-7894cf8e8055@oracle.com>
Message-ID: <0989b7d2-00f5-20d9-6696-9f88029a47d2@oracle.com>

Hi Roland,

all tests passed.

Best regards,
Tobias

On 14.01.19 09:43, Tobias Hartmann wrote:
> Hi Roland,
> 
> On 14.01.19 09:32, Roland Westrelin wrote:
>> I didn't and sure, that makes sense. I'll run that on my system
>> tonight. Or maybe Tobias can run the same testing he ran with
>> UseLoopPredicate off?
> 
> Yes, I'll do that.
> 
> Best regards,
> Tobias
> 

From rwestrel at redhat.com  Mon Jan 14 12:51:32 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Mon, 14 Jan 2019 13:51:32 +0100
Subject: RFR(XS): 8216549: Mismatched unsafe access to non escaping object
 fails
In-Reply-To: <d4bc4a04-fd70-576a-c937-d00c3702931b@oracle.com>
References: <877efbzh8a.fsf@redhat.com>
 <d4bc4a04-fd70-576a-c937-d00c3702931b@oracle.com>
Message-ID: <87ef9fxuzv.fsf@redhat.com>


Thanks for the reviews, Vladimir I, Vladimir K and Tobias. I will push
it to 12.

Roland.

From rwestrel at redhat.com  Mon Jan 14 14:10:16 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Mon, 14 Jan 2019 15:10:16 +0100
Subject: RFR(M): 8216135: C2 assert(!had_error) failed: bad dominance
In-Reply-To: <0989b7d2-00f5-20d9-6696-9f88029a47d2@oracle.com>
References: <87ef9m178o.fsf@redhat.com>
 <690e8952-5996-6a69-949d-9f196e2b84d8@oracle.com> <87k1j7y70f.fsf@redhat.com>
 <3a6364a3-0483-f454-87e9-7894cf8e8055@oracle.com>
 <0989b7d2-00f5-20d9-6696-9f88029a47d2@oracle.com>
Message-ID: <87a7k3xrcn.fsf@redhat.com>


Thanks for testing it.

Roland.

From erik.osterlund at oracle.com  Mon Jan 14 15:17:31 2019
From: erik.osterlund at oracle.com (=?UTF-8?Q?Erik_=c3=96sterlund?=)
Date: Mon, 14 Jan 2019 16:17:31 +0100
Subject: 8216987: ciMethodData::load_data() unpacks MDOs with non-atomic copy
Message-ID: <4ed95ecc-91ec-07e8-4adc-1a48be644f1c@oracle.com>

Hi,

The ciMethodData::load_data() member function copies a raw MDO to the 
compiler mirror of said MDO. However, the copy is performed using a 
non-atomic copy function, despite being updated concurrently. This could 
potentially cause word tearing when reading metadata pointers, causing 
the VM to crash... in theory.

While this is not a problem when unpacking the extra data section, 
because it is done under a lock, the same can not be said about the rest 
of the MDO. So it should either be protected by a lock, or use an atomic 
copy function instead.

This patch adds an extra seat belt by performing atomic heap word copy 
instead.

Webrev:
http://cr.openjdk.java.net/~eosterlund/8216987/webrev.00/

Bug:
https://bugs.openjdk.java.net/browse/JDK-8216987

Thanks,
/Erik

From martin.doerr at sap.com  Mon Jan 14 15:30:09 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Mon, 14 Jan 2019 15:30:09 +0000
Subject: 8216987: ciMethodData::load_data() unpacks MDOs with non-atomic
 copy
In-Reply-To: <4ed95ecc-91ec-07e8-4adc-1a48be644f1c@oracle.com>
References: <4ed95ecc-91ec-07e8-4adc-1a48be644f1c@oracle.com>
Message-ID: <eaf6400dd48e412fa3a0e6ed90f71e07@sap.com>

Hi Erik,

this looks good.

Best regards,
Martin


-----Original Message-----
From: hotspot-compiler-dev <hotspot-compiler-dev-bounces at openjdk.java.net> On Behalf Of Erik ?sterlund
Sent: Montag, 14. Januar 2019 16:18
To: hotspot compiler <hotspot-compiler-dev at openjdk.java.net>
Subject: 8216987: ciMethodData::load_data() unpacks MDOs with non-atomic copy

Hi,

The ciMethodData::load_data() member function copies a raw MDO to the 
compiler mirror of said MDO. However, the copy is performed using a 
non-atomic copy function, despite being updated concurrently. This could 
potentially cause word tearing when reading metadata pointers, causing 
the VM to crash... in theory.

While this is not a problem when unpacking the extra data section, 
because it is done under a lock, the same can not be said about the rest 
of the MDO. So it should either be protected by a lock, or use an atomic 
copy function instead.

This patch adds an extra seat belt by performing atomic heap word copy 
instead.

Webrev:
http://cr.openjdk.java.net/~eosterlund/8216987/webrev.00/

Bug:
https://bugs.openjdk.java.net/browse/JDK-8216987

Thanks,
/Erik

From erik.osterlund at oracle.com  Mon Jan 14 15:32:05 2019
From: erik.osterlund at oracle.com (=?UTF-8?Q?Erik_=c3=96sterlund?=)
Date: Mon, 14 Jan 2019 16:32:05 +0100
Subject: 8216987: ciMethodData::load_data() unpacks MDOs with non-atomic
 copy
In-Reply-To: <eaf6400dd48e412fa3a0e6ed90f71e07@sap.com>
References: <4ed95ecc-91ec-07e8-4adc-1a48be644f1c@oracle.com>
 <eaf6400dd48e412fa3a0e6ed90f71e07@sap.com>
Message-ID: <e4eb0e40-bb6d-46da-848f-6edc34901024@oracle.com>

Hi Martin,

Thanks for the review.

/Erik

On 2019-01-14 16:30, Doerr, Martin wrote:
> Hi Erik,
>
> this looks good.
>
> Best regards,
> Martin
>
>
> -----Original Message-----
> From: hotspot-compiler-dev <hotspot-compiler-dev-bounces at openjdk.java.net> On Behalf Of Erik ?sterlund
> Sent: Montag, 14. Januar 2019 16:18
> To: hotspot compiler <hotspot-compiler-dev at openjdk.java.net>
> Subject: 8216987: ciMethodData::load_data() unpacks MDOs with non-atomic copy
>
> Hi,
>
> The ciMethodData::load_data() member function copies a raw MDO to the
> compiler mirror of said MDO. However, the copy is performed using a
> non-atomic copy function, despite being updated concurrently. This could
> potentially cause word tearing when reading metadata pointers, causing
> the VM to crash... in theory.
>
> While this is not a problem when unpacking the extra data section,
> because it is done under a lock, the same can not be said about the rest
> of the MDO. So it should either be protected by a lock, or use an atomic
> copy function instead.
>
> This patch adds an extra seat belt by performing atomic heap word copy
> instead.
>
> Webrev:
> http://cr.openjdk.java.net/~eosterlund/8216987/webrev.00/
>
> Bug:
> https://bugs.openjdk.java.net/browse/JDK-8216987
>
> Thanks,
> /Erik


From tobias.hartmann at oracle.com  Mon Jan 14 16:12:18 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 14 Jan 2019 17:12:18 +0100
Subject: 8216987: ciMethodData::load_data() unpacks MDOs with non-atomic
 copy
In-Reply-To: <4ed95ecc-91ec-07e8-4adc-1a48be644f1c@oracle.com>
References: <4ed95ecc-91ec-07e8-4adc-1a48be644f1c@oracle.com>
Message-ID: <124bdce0-dce0-0ab2-6b2c-514d336c54a7@oracle.com>

Hi Erik,

looks good.

Best regards,
Tobias

On 14.01.19 16:17, Erik ?sterlund wrote:
> Hi,
> 
> The ciMethodData::load_data() member function copies a raw MDO to the compiler mirror of said MDO.
> However, the copy is performed using a non-atomic copy function, despite being updated concurrently.
> This could potentially cause word tearing when reading metadata pointers, causing the VM to crash...
> in theory.
> 
> While this is not a problem when unpacking the extra data section, because it is done under a lock,
> the same can not be said about the rest of the MDO. So it should either be protected by a lock, or
> use an atomic copy function instead.
> 
> This patch adds an extra seat belt by performing atomic heap word copy instead.
> 
> Webrev:
> http://cr.openjdk.java.net/~eosterlund/8216987/webrev.00/
> 
> Bug:
> https://bugs.openjdk.java.net/browse/JDK-8216987
> 
> Thanks,
> /Erik

From erik.osterlund at oracle.com  Mon Jan 14 16:21:20 2019
From: erik.osterlund at oracle.com (=?UTF-8?Q?Erik_=c3=96sterlund?=)
Date: Mon, 14 Jan 2019 17:21:20 +0100
Subject: 8216987: ciMethodData::load_data() unpacks MDOs with non-atomic
 copy
In-Reply-To: <124bdce0-dce0-0ab2-6b2c-514d336c54a7@oracle.com>
References: <4ed95ecc-91ec-07e8-4adc-1a48be644f1c@oracle.com>
 <124bdce0-dce0-0ab2-6b2c-514d336c54a7@oracle.com>
Message-ID: <0337f39d-6bf4-0c08-91bc-e0a040ea6646@oracle.com>

Hi Tobias,

Thanks for the review.

/Erik

On 2019-01-14 17:12, Tobias Hartmann wrote:
> Hi Erik,
>
> looks good.
>
> Best regards,
> Tobias
>
> On 14.01.19 16:17, Erik ?sterlund wrote:
>> Hi,
>>
>> The ciMethodData::load_data() member function copies a raw MDO to the compiler mirror of said MDO.
>> However, the copy is performed using a non-atomic copy function, despite being updated concurrently.
>> This could potentially cause word tearing when reading metadata pointers, causing the VM to crash...
>> in theory.
>>
>> While this is not a problem when unpacking the extra data section, because it is done under a lock,
>> the same can not be said about the rest of the MDO. So it should either be protected by a lock, or
>> use an atomic copy function instead.
>>
>> This patch adds an extra seat belt by performing atomic heap word copy instead.
>>
>> Webrev:
>> http://cr.openjdk.java.net/~eosterlund/8216987/webrev.00/
>>
>> Bug:
>> https://bugs.openjdk.java.net/browse/JDK-8216987
>>
>> Thanks,
>> /Erik


From patric.hedlin at oracle.com  Mon Jan 14 16:47:35 2019
From: patric.hedlin at oracle.com (Patric Hedlin)
Date: Mon, 14 Jan 2019 17:47:35 +0100
Subject: RFR(S): 8210392: assert(Compile::current()->live_nodes() <
 Compile::current()->max_node_limit()) failed: Live Node limit exceeded limit
In-Reply-To: <2f4e12e4-459d-b96b-6cf2-50d6dba098d9@oracle.com>
References: <28011331-bd43-2c32-dba4-e41879ffe28a@oracle.com>
 <99f3f410-7200-5fb1-fccd-c39e35c20288@oracle.com>
 <2f4e12e4-459d-b96b-6cf2-50d6dba098d9@oracle.com>
Message-ID: <0aece297-3929-7db5-7054-190163fe65fd@oracle.com>

Thanks for reviewing Tobias,

On 12/18/18 1:37 PM, Tobias Hartmann wrote:
> Hi Patric,
>
> were you able to reproduce this with a test (I see that one is attached to the bug)? If so, please
> add it to the webrev. Please also remove the extra newlines (for example, in line 1146).
>
> The comment in line 1027 says "Use same limit as split_if_with_blocks_post". I think this is
> outdated right?

Updated webrev with test-case.

Fixed #?%#.

Best regards,
Patric

> Best regards,
> Tobias
>
> On 18.12.18 12:48, Patric Hedlin wrote:
>> Dear all,
>>
>> I would like to ask for help to review the following change/update:
>>
>> Issue:? https://bugs.openjdk.java.net/browse/JDK-8210392
>>
>> Webrev: http://cr.openjdk.java.net/~phedlin/tr8210392/
>>
>>
>> 8210392: assert(Compile::current()->live_nodes() < Compile::current()->max_node_limit()) failed:
>> Live Node limit exceeded limit
>>
>>  ??? Avoid excessive split-if through a crude throttling approach.
>>
>>
>> Testing: hs-tier1-4, hs-precheckin-comp
>>
>>
>> Best regards,
>> Patric

From tobias.hartmann at oracle.com  Mon Jan 14 16:52:39 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 14 Jan 2019 17:52:39 +0100
Subject: RFR(S): 8210392: assert(Compile::current()->live_nodes() <
 Compile::current()->max_node_limit()) failed: Live Node limit exceeded limit
In-Reply-To: <0aece297-3929-7db5-7054-190163fe65fd@oracle.com>
References: <28011331-bd43-2c32-dba4-e41879ffe28a@oracle.com>
 <99f3f410-7200-5fb1-fccd-c39e35c20288@oracle.com>
 <2f4e12e4-459d-b96b-6cf2-50d6dba098d9@oracle.com>
 <0aece297-3929-7db5-7054-190163fe65fd@oracle.com>
Message-ID: <e153b5d2-777b-c04b-ad36-9d9a1032d01d@oracle.com>

Hi Patric,

thanks for adding the test. This looks good to me.

Best regards,
Tobias


On 14.01.19 17:47, Patric Hedlin wrote:
> Thanks for reviewing Tobias,
> 
> On 12/18/18 1:37 PM, Tobias Hartmann wrote:
>> Hi Patric,
>>
>> were you able to reproduce this with a test (I see that one is attached to the bug)? If so, please
>> add it to the webrev. Please also remove the extra newlines (for example, in line 1146).
>>
>> The comment in line 1027 says "Use same limit as split_if_with_blocks_post". I think this is
>> outdated right?
> 
> Updated webrev with test-case.
> 
> Fixed #?%#.
> 
> Best regards,
> Patric
> 
>> Best regards,
>> Tobias
>>
>> On 18.12.18 12:48, Patric Hedlin wrote:
>>> Dear all,
>>>
>>> I would like to ask for help to review the following change/update:
>>>
>>> Issue:? https://bugs.openjdk.java.net/browse/JDK-8210392
>>>
>>> Webrev: http://cr.openjdk.java.net/~phedlin/tr8210392/
>>>
>>>
>>> 8210392: assert(Compile::current()->live_nodes() < Compile::current()->max_node_limit()) failed:
>>> Live Node limit exceeded limit
>>>
>>> ???? Avoid excessive split-if through a crude throttling approach.
>>>
>>>
>>> Testing: hs-tier1-4, hs-precheckin-comp
>>>
>>>
>>> Best regards,
>>> Patric

From vladimir.kozlov at oracle.com  Mon Jan 14 17:03:16 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 14 Jan 2019 09:03:16 -0800
Subject: 8216987: ciMethodData::load_data() unpacks MDOs with non-atomic
 copy
In-Reply-To: <4ed95ecc-91ec-07e8-4adc-1a48be644f1c@oracle.com>
References: <4ed95ecc-91ec-07e8-4adc-1a48be644f1c@oracle.com>
Message-ID: <428e592e-fe7c-80e5-21a0-122774baccf7@oracle.com>

Good.

Thanks,
Vladimir

On 1/14/19 7:17 AM, Erik ?sterlund wrote:
> Hi,
> 
> The ciMethodData::load_data() member function copies a raw MDO to the compiler mirror of said MDO. However, the copy is 
> performed using a non-atomic copy function, despite being updated concurrently. This could potentially cause word 
> tearing when reading metadata pointers, causing the VM to crash... in theory.
> 
> While this is not a problem when unpacking the extra data section, because it is done under a lock, the same can not be 
> said about the rest of the MDO. So it should either be protected by a lock, or use an atomic copy function instead.
> 
> This patch adds an extra seat belt by performing atomic heap word copy instead.
> 
> Webrev:
> http://cr.openjdk.java.net/~eosterlund/8216987/webrev.00/
> 
> Bug:
> https://bugs.openjdk.java.net/browse/JDK-8216987
> 
> Thanks,
> /Erik

From erik.osterlund at oracle.com  Mon Jan 14 17:04:24 2019
From: erik.osterlund at oracle.com (Erik Osterlund)
Date: Mon, 14 Jan 2019 18:04:24 +0100
Subject: 8216987: ciMethodData::load_data() unpacks MDOs with non-atomic
 copy
In-Reply-To: <428e592e-fe7c-80e5-21a0-122774baccf7@oracle.com>
References: <4ed95ecc-91ec-07e8-4adc-1a48be644f1c@oracle.com>
 <428e592e-fe7c-80e5-21a0-122774baccf7@oracle.com>
Message-ID: <E6BDB133-0977-4284-AB0C-22FC03A06377@oracle.com>

Hi Vladimir,

Thanks for the review.

/Erik

> On 14 Jan 2019, at 18:03, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
> 
> Good.
> 
> Thanks,
> Vladimir
> 
>> On 1/14/19 7:17 AM, Erik ?sterlund wrote:
>> Hi,
>> The ciMethodData::load_data() member function copies a raw MDO to the compiler mirror of said MDO. However, the copy is performed using a non-atomic copy function, despite being updated concurrently. This could potentially cause word tearing when reading metadata pointers, causing the VM to crash... in theory.
>> While this is not a problem when unpacking the extra data section, because it is done under a lock, the same can not be said about the rest of the MDO. So it should either be protected by a lock, or use an atomic copy function instead.
>> This patch adds an extra seat belt by performing atomic heap word copy instead.
>> Webrev:
>> http://cr.openjdk.java.net/~eosterlund/8216987/webrev.00/
>> Bug:
>> https://bugs.openjdk.java.net/browse/JDK-8216987
>> Thanks,
>> /Erik


From vladimir.kozlov at oracle.com  Mon Jan 14 17:06:42 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 14 Jan 2019 09:06:42 -0800
Subject: [12] RFR(S) 8216151: [Graal] Module
 jdk.internal.vm.compiler.management has not been granted
 accessClassInPackage.org.graalvm.compiler.debug
In-Reply-To: <cfb4f0ab-3751-6aac-475a-23e5464a8a62@oracle.com>
References: <077fcff1-28fc-acbf-6a8b-c299978ae0a2@oracle.com>
 <cfb4f0ab-3751-6aac-475a-23e5464a8a62@oracle.com>
Message-ID: <138390b2-7542-619c-eef1-b775e5bb2064@oracle.com>

Thank you, Tobias

Vladimir

On 1/14/19 2:04 AM, Tobias Hartmann wrote:
> Hi Vladimir,
> 
> looks good to me.
> 
> Best regards,
> Tobias
> 
> 
> On 13.01.19 03:46, Vladimir Kozlov wrote:
>> http://cr.openjdk.java.net/~kvn/8216151/webrev.00/
>> https://bugs.openjdk.java.net/browse/JDK-8216151
>>
>> Have to update default.policy after changes in jdk.internal.vm.compiler.management files done by
>> JDK-8199755: "Update Graal".
>>
>> Ran CheckAccessClassInPackagePermissions.java test.
>>

From vladimir.kozlov at oracle.com  Mon Jan 14 17:39:10 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 14 Jan 2019 09:39:10 -0800
Subject: [12] RFR(S) 8216151: [Graal] Module
 jdk.internal.vm.compiler.management has not been granted
 accessClassInPackage.org.graalvm.compiler.debug
In-Reply-To: <d2c592ab-93c6-200d-bb80-094874528be4@oracle.com>
References: <077fcff1-28fc-acbf-6a8b-c299978ae0a2@oracle.com>
 <d2c592ab-93c6-200d-bb80-094874528be4@oracle.com>
Message-ID: <e158a7b4-dd42-8d20-b5f1-aa7a3d4c014e@oracle.com>

Thank you, Alan

On 1/14/19 2:27 AM, Alan Bateman wrote:
> On 13/01/2019 02:46, Vladimir Kozlov wrote:
>> http://cr.openjdk.java.net/~kvn/8216151/webrev.00/
>> https://bugs.openjdk.java.net/browse/JDK-8216151
>>
>> Have to update default.policy after changes in jdk.internal.vm.compiler.management files done by JDK-8199755: "Update 
>> Graal".
>>
>> Ran CheckAccessClassInPackagePermissions.java test.
>>
> cc'ing security-dev as that is where is the security policy file is maintained.
> 
> One thing is double check is that code in jdk.internal.vm.compiler.management really needs to access members of classes 
> in the listed packages. I ask because the module definition doesn't export some of these packages to 
> jdk.internal.vm.compiler.management so they aren't accessible even when not running with a security manager.

I verified that all listed packages are used by compiler.management and I listed only needed in default.policy. I used 
CheckAccessClassInPackagePermissions.java test to find which permissions are needed.

Thanks,
Vladimir

> 
> -Alan

From mandy.chung at oracle.com  Mon Jan 14 18:29:39 2019
From: mandy.chung at oracle.com (Mandy Chung)
Date: Mon, 14 Jan 2019 10:29:39 -0800
Subject: [12] RFR(S) 8216151: [Graal] Module
 jdk.internal.vm.compiler.management has not been granted
 accessClassInPackage.org.graalvm.compiler.debug
In-Reply-To: <e158a7b4-dd42-8d20-b5f1-aa7a3d4c014e@oracle.com>
References: <077fcff1-28fc-acbf-6a8b-c299978ae0a2@oracle.com>
 <d2c592ab-93c6-200d-bb80-094874528be4@oracle.com>
 <e158a7b4-dd42-8d20-b5f1-aa7a3d4c014e@oracle.com>
Message-ID: <f7d1d0e6-1765-9d40-2551-1a02715902b0@oracle.com>


On 1/14/19 9:39 AM, Vladimir Kozlov wrote:
> Thank you, Alan
>
> On 1/14/19 2:27 AM, Alan Bateman wrote:
>> On 13/01/2019 02:46, Vladimir Kozlov wrote:
>>> http://cr.openjdk.java.net/~kvn/8216151/webrev.00/
>>> https://bugs.openjdk.java.net/browse/JDK-8216151
>>>
>>> Have to update default.policy after changes in 
>>> jdk.internal.vm.compiler.management files done by JDK-8199755: 
>>> "Update Graal".
>>>
>>> Ran CheckAccessClassInPackagePermissions.java test.
>>>
>> cc'ing security-dev as that is where is the security policy file is 
>> maintained.
>>
>> One thing is double check is that code in 
>> jdk.internal.vm.compiler.management really needs to access members of 
>> classes in the listed packages. I ask because the module definition 
>> doesn't export some of these packages to 
>> jdk.internal.vm.compiler.management so they aren't accessible even 
>> when not running with a security manager.
>
> I verified that all listed packages are used by compiler.management 
> and I listed only needed in default.policy. I used 
> CheckAccessClassInPackagePermissions.java test to find which 
> permissions are needed.
>

I reviewed the change and the list matches the list of qualified exports 
from jdk.internal.vm.compiler to jdk.internal.vm.compiler.management.

The security team has been looking into removing the private VM call out 
to ClassLoader::checkPackageAccess.? When that's removed, we would not 
need to maintain these accessClassInPackage permission to access any new 
qualified exports.

Mandy
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190114/7619503c/attachment.html>

From vladimir.kozlov at oracle.com  Mon Jan 14 18:31:07 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 14 Jan 2019 10:31:07 -0800
Subject: [12] RFR(S) 8216151: [Graal] Module
 jdk.internal.vm.compiler.management has not been granted
 accessClassInPackage.org.graalvm.compiler.debug
In-Reply-To: <f7d1d0e6-1765-9d40-2551-1a02715902b0@oracle.com>
References: <077fcff1-28fc-acbf-6a8b-c299978ae0a2@oracle.com>
 <d2c592ab-93c6-200d-bb80-094874528be4@oracle.com>
 <e158a7b4-dd42-8d20-b5f1-aa7a3d4c014e@oracle.com>
 <f7d1d0e6-1765-9d40-2551-1a02715902b0@oracle.com>
Message-ID: <2ad1f183-f3fd-eb0e-f56b-64c5746c6c08@oracle.com>

Thank you Mandy for review.

Vladimir

On 1/14/19 10:29 AM, Mandy Chung wrote:
> 
> 
> On 1/14/19 9:39 AM, Vladimir Kozlov wrote:
>> Thank you, Alan
>>
>> On 1/14/19 2:27 AM, Alan Bateman wrote:
>>> On 13/01/2019 02:46, Vladimir Kozlov wrote:
>>>> http://cr.openjdk.java.net/~kvn/8216151/webrev.00/
>>>> https://bugs.openjdk.java.net/browse/JDK-8216151
>>>>
>>>> Have to update default.policy after changes in jdk.internal.vm.compiler.management files done by JDK-8199755: 
>>>> "Update Graal".
>>>>
>>>> Ran CheckAccessClassInPackagePermissions.java test.
>>>>
>>> cc'ing security-dev as that is where is the security policy file is maintained.
>>>
>>> One thing is double check is that code in jdk.internal.vm.compiler.management really needs to access members of 
>>> classes in the listed packages. I ask because the module definition doesn't export some of these packages to 
>>> jdk.internal.vm.compiler.management so they aren't accessible even when not running with a security manager.
>>
>> I verified that all listed packages are used by compiler.management and I listed only needed in default.policy. I used 
>> CheckAccessClassInPackagePermissions.java test to find which permissions are needed.
>>
> 
> I reviewed the change and the list matches the list of qualified exports from jdk.internal.vm.compiler to 
> jdk.internal.vm.compiler.management.
> 
> The security team has been looking into removing the private VM call out to ClassLoader::checkPackageAccess.? When 
> that's removed, we would not need to maintain these accessClassInPackage permission to access any new qualified exports.
> 
> Mandy

From vivek.r.deshpande at intel.com  Mon Jan 14 19:40:10 2019
From: vivek.r.deshpande at intel.com (Deshpande, Vivek R)
Date: Mon, 14 Jan 2019 19:40:10 +0000
Subject: RFR(S):8216050:X86: Fix for Superword optimization fails with
 assert(0 <= i && i < _len) failed: illegal index
In-Reply-To: <87483987-e5dd-a42b-ff94-da645e041019@oracle.com>
References: <53E8E64DB2403849AFD89B7D4DAC8B2A9A148B9C@ORSMSX106.amr.corp.intel.com>
 <045def80-a536-ae87-7384-09b30e5a8d78@oracle.com>
 <53E8E64DB2403849AFD89B7D4DAC8B2A9A14A45C@ORSMSX106.amr.corp.intel.com>
 <87483987-e5dd-a42b-ff94-da645e041019@oracle.com>
Message-ID: <53E8E64DB2403849AFD89B7D4DAC8B2A9A14C5B9@ORSMSX106.amr.corp.intel.com>

Thanks Tobias for reviewing it.

Regards,
Vivek

-----Original Message-----
From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com] 
Sent: Monday, January 14, 2019 2:40 AM
To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-dev at openjdk.java.net>
Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; Raj, Guru <guru.raj at intel.com>
Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails with assert(0 <= i && i < _len) failed: illegal index

Hi Vivek,

thanks for making these changes. Looks good to me!

A second review would be good.

Best regards,
Tobias

On 11.01.19 20:38, Deshpande, Vivek R wrote:
> Hi Tobias
> 
> Thanks for reviewing the patch.
> I have made the changes according to your suggestion.
> In this webrev: 
> http://cr.openjdk.java.net/~vdeshpande/8216050/webrev.01/
> I have fix for the crash reported in the 8216050.
> 
> The lower cost is needed for generation of vpdpwssd instruction, by combining AddVI and MulAddVS2VI.
> For other instructions pmaddwd and vpmaddwd, they get generated on platforms upto skylake with default cost.
> 
> I have updated the bug also with the link to webrev.
> 
> I have created a different bug JDK-8216580 for
>  3) Fix generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have different control RangeCheck nodes
>      for a[i] and a[i+1] accesses in same MulAddS2I node
> 
> Thank you.
> Regards,
> Vivek
> 
> -----Original Message-----
> From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com]
> Sent: Friday, January 11, 2019 4:49 AM
> To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; 
> hotspot-compiler-dev at openjdk.java.net compiler 
> <hotspot-compiler-dev at openjdk.java.net>
> Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, Sandhya 
> <sandhya.viswanathan at intel.com>; Raj, Guru <guru.raj at intel.com>
> Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails 
> with assert(0 <= i && i < _len) failed: illegal index
> 
> Hi Vivek,
> 
> On 11.01.19 07:58, Deshpande, Vivek R wrote:
>> 1) Fix for the crash by matching the operand by swapping to right positions. 
> 
> Looks good but the change to loopopts.cpp:530 screwed up the indentation around the ifs, please fix.
> 
>> 2) Cost based generation of vpdpwssd instruction. 
> 
> Other instructions added by JDK-8214751 still miss a cost definition, for example:
> http://hg.openjdk.java.net/jdk/jdk/rev/4bb6e0871bf7#l5.20
> 
>> 3) Fix generation of vector code by allowing adjacent LoadS nodes to 
>> be isomorphic when they have different control RangeCheck nodes
>> ????for a[i] and a[i+1] accesses in same MulAddS2I node
> 
> This is unrelated to the original bug, right? If so, this should be integrated with a separate RFE.
> 
> Thanks,
> Tobias
> 

From tom.rodriguez at oracle.com  Mon Jan 14 20:14:11 2019
From: tom.rodriguez at oracle.com (Tom Rodriguez)
Date: Mon, 14 Jan 2019 12:14:11 -0800
Subject: [12] RFR(S): 8215313: [AOT] java/lang/String/Split.java fails
 with AOTed java.base
In-Reply-To: <E101F893-ED66-418B-9CA8-64FBD0334EA3@oracle.com>
References: <dab7abd3-1abd-1652-a328-6a81960f934e@khg.jku.at>
 <88068DB1-E235-448C-A7D8-B1208143CCCF@oracle.com>
 <E101F893-ED66-418B-9CA8-64FBD0334EA3@oracle.com>
Message-ID: <8fcf202b-7115-7dac-87b1-d2be550ad8c1@oracle.com>

Looks good.

tom

Doug Simon wrote on 1/12/19 4:57 AM:
> 
> 
>> On 11 Jan 2019, at 10:02, Doug Simon <doug.simon at oracle.com 
>> <mailto:doug.simon at oracle.com>> wrote:
>>
>> Hi Josef,
>>
>>> On 10 Jan 2019, at 22:52, Josef Haider <josef.haider at khg.jku.at 
>>> <mailto:josef.haider at khg.jku.at>> wrote:
>>>
>>> Agreed, cmpw/cmpb would make more sense here, i just wanted
>>> to keep the changeset minimal, since the entire method may soon be
>>> changed again, anyway.
>>>
>> Can you please say more about this? Would you recommend applying your 
>> current patch as is to fix the crash or will you have the changes you 
>> mention ready soon?
> 
> Josef has updated his fix to use cmpw/cmpb:
> 
> http://cr.openjdk.java.net/~dnsimon/8215313/
> 
> Previous webrev is now at 
> http://cr.openjdk.java.net/~dnsimon/8215313.old/20190112_1342
> 
> Dean, can you please re-review.
> 
> -Doug
> 
>>
>> -Doug
>>>> Taking another look, it seems like cmpl could be replaced with the
>>>> size-appropriate cmpb, cmpw, or cmpl based on byteMode(kind) and
>>>> findTwoCharPrefix, but I guess AMD64Assembler only supports cmpl and
>>>> cmpq right now.
>>>>
>>>> dl
>>>>
>>>> On 1/10/19 10:04 AM,dean.long at oracle.com 
>>>> <https://mail.openjdk.java.net/mailman/listinfo/hotspot-compiler-dev>  wrote:
>>>> >/Is it OK to modify the values of searchValue[i]?? If the search value />/is already sign-extended, how about sign-extending cmpResult instead />/of zero-extending searchValue? />//>/dl />//>/On 1/10/19 7:09 AM, Doug Simon wrote: />>/Please review this fix supplied by Josef Haider for an incorrect />>/compilation of String.split. />>//>>/When the String.indexOf intrinsic on AMD64 reaches the end of a />>/string, it tries to vectorize the last compare operations by reading />>/past the bounds of the character/byte array. This is not safe if the />>/out-of-bounds read would cross a page boundary, so in that case />>/characters are compared one-by-one. This is done with a />>/`cmpl`-instruction, which only works as long as the bytes/chars are />>/not sign extended. />>//>>/The fix is to simply `and` the characters we are searching for with />>/`0xff`/`0xffff` in order to eliminate any erroneous sign extensions. />>//>>/http://cr.openjdk.java.net/~dnsimon/8215313 />>/https://bugs.openjdk.java.net/browse/JDK-8215313 />>//>>/-Doug />//
>>>>
>>>
>>>
>>
> 

From vladimir.kozlov at oracle.com  Mon Jan 14 20:25:55 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 14 Jan 2019 12:25:55 -0800
Subject: [12] RFR(S): 8215313: [AOT] java/lang/String/Split.java fails
 with AOTed java.base
In-Reply-To: <8fcf202b-7115-7dac-87b1-d2be550ad8c1@oracle.com>
References: <dab7abd3-1abd-1652-a328-6a81960f934e@khg.jku.at>
 <88068DB1-E235-448C-A7D8-B1208143CCCF@oracle.com>
 <E101F893-ED66-418B-9CA8-64FBD0334EA3@oracle.com>
 <8fcf202b-7115-7dac-87b1-d2be550ad8c1@oracle.com>
Message-ID: <112920cf-46d5-0082-e2ca-79e21fdb7ce6@oracle.com>

+1

Thanks,
Vladimir

On 1/14/19 12:14 PM, Tom Rodriguez wrote:
> Looks good.
> 
> tom
> 
> Doug Simon wrote on 1/12/19 4:57 AM:
>>
>>
>>> On 11 Jan 2019, at 10:02, Doug Simon <doug.simon at oracle.com <mailto:doug.simon at oracle.com>> wrote:
>>>
>>> Hi Josef,
>>>
>>>> On 10 Jan 2019, at 22:52, Josef Haider <josef.haider at khg.jku.at <mailto:josef.haider at khg.jku.at>> wrote:
>>>>
>>>> Agreed, cmpw/cmpb would make more sense here, i just wanted
>>>> to keep the changeset minimal, since the entire method may soon be
>>>> changed again, anyway.
>>>>
>>> Can you please say more about this? Would you recommend applying your current patch as is to fix the crash or will 
>>> you have the changes you mention ready soon?
>>
>> Josef has updated his fix to use cmpw/cmpb:
>>
>> http://cr.openjdk.java.net/~dnsimon/8215313/
>>
>> Previous webrev is now at http://cr.openjdk.java.net/~dnsimon/8215313.old/20190112_1342
>>
>> Dean, can you please re-review.
>>
>> -Doug
>>
>>>
>>> -Doug
>>>>> Taking another look, it seems like cmpl could be replaced with the
>>>>> size-appropriate cmpb, cmpw, or cmpl based on byteMode(kind) and
>>>>> findTwoCharPrefix, but I guess AMD64Assembler only supports cmpl and
>>>>> cmpq right now.
>>>>>
>>>>> dl
>>>>>
>>>>> On 1/10/19 10:04 AM,dean.long at oracle.com <https://mail.openjdk.java.net/mailman/listinfo/hotspot-compiler-dev>  
>>>>> wrote:
>>>>> >/Is it OK to modify the values of searchValue[i]?? If the search value />/is already sign-extended, how about 
>>>>> sign-extending cmpResult instead />/of zero-extending searchValue? />//>/dl />//>/On 1/10/19 7:09 AM, Doug Simon 
>>>>> wrote: />>/Please review this fix supplied by Josef Haider for an incorrect />>/compilation of String.split. 
>>>>> />>//>>/When the String.indexOf intrinsic on AMD64 reaches the end of a />>/string, it tries to vectorize the last 
>>>>> compare operations by reading />>/past the bounds of the character/byte array. This is not safe if the 
>>>>> />>/out-of-bounds read would cross a page boundary, so in that case />>/characters are compared one-by-one. This is 
>>>>> done with a />>/`cmpl`-instruction, which only works as long as the bytes/chars are />>/not sign extended. 
>>>>> />>//>>/The fix is to simply `and` the characters we are searching for with />>/`0xff`/`0xffff` in order to 
>>>>> eliminate any erroneous sign extensions. />>//>>/http://cr.openjdk.java.net/~dnsimon/8215313 
>>>>> />>/https://bugs.openjdk.java.net/browse/JDK-8215313 />>//>>/-Doug />//
>>>>>
>>>>
>>>>
>>>
>>

From vladimir.x.ivanov at oracle.com  Mon Jan 14 21:55:37 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Mon, 14 Jan 2019 13:55:37 -0800
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes
 wrong post-dominating point
In-Reply-To: <14b4d2c8-0cf4-4c49-309b-1838da8536a0@oracle.com>
References: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
 <874lafzfiy.fsf@redhat.com> <09abefa3-37b4-5657-22f9-e06f144a1867@oracle.com>
 <21fc0b3a-87d5-0c31-0d63-75eca3c05e5b@oracle.com>
 <14b4d2c8-0cf4-4c49-309b-1838da8536a0@oracle.com>
Message-ID: <6ebc3b71-bc58-0a14-20c9-aaac3a705f91@oracle.com>


>>> If it were the case, then PhaseIdealLoop::handle_use()/spinup() would 
>>> reliably crash on all users of Phi 1790. There are 2 other Regions 
>>> (R1710 and R1716) which keep their IDOM (I1511) intact and the 
>>> transformation works fine for them.
>>>
>>> R1722 is changed during strip mining transformation and its IDOM is 
>>> recomputed (I1511 => R1784). 
>>
>> To elaborate a bit more on that: the only reason IDOM changes is due 
>> to the way it is computed:
>> ???? // rgn = R1722, new_iff = I1854
>> ???? Node* ridom = idom(rgn); // ridom = I1522 = IDOM(R1722)
> 
> Is it typo? Should it be I1511? I don't see I1522 in graph's picture.

Yes, it should be I1511. Sorry for the confusion.

>> ???? Node* nrdom = dom_lca(ridom, new_iff); // nrdom = R1784
>> ???? set_idom(rgn, nrdom, dom_depth(rgn));
>>
>> ???? Node *dom_lca( Node *n1, Node *n2 ) const {
>> ?????? return find_non_split_ctrl(dom_lca_internal(n1, n2));
>> ???? }
>>
>> ???? dom_lca_internal(I1522, I1854) = I1522
> 
> I assume it is 1511.
> 
>> ???? find_non_split_ctrl(I1522) = R1784
>>
>> If IDOM info is recomputed from scratch, IDOM(R1722) remains I1511.
> 
> Can you explain more this point? Why result is different if it is from 
> scratch?

PhaseIdealLoop::Dominators() doesn't adjust IDOM for Regions. So, 
initial IDOM values are and that's the same dom_lca_internal() computes 
for them:
   IDOM(R1710) = IDOM(R1716) = IDOM(R1722) = I1511

Then IdealLoopTree::counted_loop() strip mines some of the loops and it 
causes a change in R1722 which causes recomputation of IDOM using 
dom_lca() which does normalize the IDOM.

If IDOM is rebuilt from scratch at this point, initial IDOM will stay 
the same (because no strip mining takes place):
   IDOM(R1710) = IDOM(R1716) = IDOM(R1722) = I1511


And that's the other way to fix the crash: initiate new PhaseIdealLoop 
iteration right away if any strip mined loops are introduced.

But it looks more like a workaround and I decided to go with the fix in 
PhaseIdealLoop::spinup() because I don't see a reason why IDOM 
recomputation can't be triggered from other places.

Best regards,
Vladimir Ivanov

From vladimir.kozlov at oracle.com  Mon Jan 14 22:36:48 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 14 Jan 2019 14:36:48 -0800
Subject: RFR(S):8216050:X86: Fix for Superword optimization fails with
 assert(0 <= i && i < _len) failed: illegal index
In-Reply-To: <53E8E64DB2403849AFD89B7D4DAC8B2A9A14A45C@ORSMSX106.amr.corp.intel.com>
References: <53E8E64DB2403849AFD89B7D4DAC8B2A9A148B9C@ORSMSX106.amr.corp.intel.com>
 <045def80-a536-ae87-7384-09b30e5a8d78@oracle.com>
 <53E8E64DB2403849AFD89B7D4DAC8B2A9A14A45C@ORSMSX106.amr.corp.intel.com>
Message-ID: <14067057-ec56-8c07-8f79-d1a29c7e20b7@oracle.com>

Hi Vivek,

I do not understand changes in superword.cpp.

muladds2i will never be packed in follow_def_uses() since you return 'false' for muladds2i in all cases when u1 != u2 
(even when i1 == i2). Is it intentional?

Thanks,
Vladimir

On 1/11/19 11:38 AM, Deshpande, Vivek R wrote:
> Hi Tobias
> 
> Thanks for reviewing the patch.
> I have made the changes according to your suggestion.
> In this webrev: http://cr.openjdk.java.net/~vdeshpande/8216050/webrev.01/
> I have fix for the crash reported in the 8216050.
> 
> The lower cost is needed for generation of vpdpwssd instruction, by combining AddVI and MulAddVS2VI.
> For other instructions pmaddwd and vpmaddwd, they get generated on platforms upto skylake with default cost.
> 
> I have updated the bug also with the link to webrev.
> 
> I have created a different bug JDK-8216580 for
>   3) Fix generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have different control RangeCheck nodes
>       for a[i] and a[i+1] accesses in same MulAddS2I node
> 
> Thank you.
> Regards,
> Vivek
> 
> -----Original Message-----
> From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com]
> Sent: Friday, January 11, 2019 4:49 AM
> To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-dev at openjdk.java.net>
> Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; Raj, Guru <guru.raj at intel.com>
> Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails with assert(0 <= i && i < _len) failed: illegal index
> 
> Hi Vivek,
> 
> On 11.01.19 07:58, Deshpande, Vivek R wrote:
>> 1) Fix for the crash by matching the operand by swapping to right positions.
> 
> Looks good but the change to loopopts.cpp:530 screwed up the indentation around the ifs, please fix.
> 
>> 2) Cost based generation of vpdpwssd instruction.
> 
> Other instructions added by JDK-8214751 still miss a cost definition, for example:
> http://hg.openjdk.java.net/jdk/jdk/rev/4bb6e0871bf7#l5.20
> 
>> 3) Fix generation of vector code by allowing adjacent LoadS nodes to
>> be isomorphic when they have different control RangeCheck nodes
>>  ????for a[i] and a[i+1] accesses in same MulAddS2I node
> 
> This is unrelated to the original bug, right? If so, this should be integrated with a separate RFE.
> 
> Thanks,
> Tobias
> 

From vladimir.kozlov at oracle.com  Mon Jan 14 23:25:40 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 14 Jan 2019 15:25:40 -0800
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes
 wrong post-dominating point
In-Reply-To: <6ebc3b71-bc58-0a14-20c9-aaac3a705f91@oracle.com>
References: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
 <874lafzfiy.fsf@redhat.com> <09abefa3-37b4-5657-22f9-e06f144a1867@oracle.com>
 <21fc0b3a-87d5-0c31-0d63-75eca3c05e5b@oracle.com>
 <14b4d2c8-0cf4-4c49-309b-1838da8536a0@oracle.com>
 <6ebc3b71-bc58-0a14-20c9-aaac3a705f91@oracle.com>
Message-ID: <5e4f7d3b-6d7d-3017-2926-13e932820205@oracle.com>

On 1/14/19 1:55 PM, Vladimir Ivanov wrote:
> 
>>>> R1722 is changed during strip mining transformation and its IDOM is recomputed (I1511 => R1784). 
>>>
>>> If IDOM info is recomputed from scratch, IDOM(R1722) remains I1511.
>>
>> Can you explain more this point? Why result is different if it is from scratch?
> 
> PhaseIdealLoop::Dominators() doesn't adjust IDOM for Regions. So, initial IDOM values are and that's the same 
> dom_lca_internal() computes for them:
>  ? IDOM(R1710) = IDOM(R1716) = IDOM(R1722) = I1511
> 
> Then IdealLoopTree::counted_loop() strip mines some of the loops and it causes a change in R1722 which causes 
> recomputation of IDOM using dom_lca() which does normalize the IDOM.
> 
> If IDOM is rebuilt from scratch at this point, initial IDOM will stay the same (because no strip mining takes place):
>  ? IDOM(R1710) = IDOM(R1716) = IDOM(R1722) = I1511
> 
> 
> And that's the other way to fix the crash: initiate new PhaseIdealLoop iteration right away if any strip mined loops are 
> introduced.

Got it. So the issue is that strip mining invalidated IDOM information generated at the beginning of 
PhaseIdealLoop::build_and_optimize().

> 
> But it looks more like a workaround and I decided to go with the fix in PhaseIdealLoop::spinup() because I don't see a 
> reason why IDOM recomputation can't be triggered from other places.

I am not sure your changes help to all cases.  It may indeed helps to split_if optimization but dominator information is 
used before it too. I see Shenandoah's optimize_loops() uses information before split_if.

Can we correctly recalculate IDOM after counted_loop() if strip mining loop was inserted? My be we can simplify strip 
mining code if we know that IDOM will be recalculated.

Would be nice to hear Roland's opinion too.


On other hand I think your point fix is good for JDK 12. May be do what I suggest in JDK 13 later if it is too complex.

Thanks,
Vladimir

> 
> Best regards,
> Vladimir Ivanov

From vladimir.x.ivanov at oracle.com  Mon Jan 14 23:49:15 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Mon, 14 Jan 2019 15:49:15 -0800
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes
 wrong post-dominating point
In-Reply-To: <5e4f7d3b-6d7d-3017-2926-13e932820205@oracle.com>
References: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
 <874lafzfiy.fsf@redhat.com> <09abefa3-37b4-5657-22f9-e06f144a1867@oracle.com>
 <21fc0b3a-87d5-0c31-0d63-75eca3c05e5b@oracle.com>
 <14b4d2c8-0cf4-4c49-309b-1838da8536a0@oracle.com>
 <6ebc3b71-bc58-0a14-20c9-aaac3a705f91@oracle.com>
 <5e4f7d3b-6d7d-3017-2926-13e932820205@oracle.com>
Message-ID: <5eb3f544-7080-b6cc-6ebe-d9c87db19717@oracle.com>


>>
>> But it looks more like a workaround and I decided to go with the fix 
>> in PhaseIdealLoop::spinup() because I don't see a reason why IDOM 
>> recomputation can't be triggered from other places.
> 
> I am not sure your changes help to all cases.? It may indeed helps to 
> split_if optimization but dominator information is used before it too. I 
> see Shenandoah's optimize_loops() uses information before split_if.

I try to address only PhaseIdealLoop::spinup() case. There may be other 
bugs lurking in other places.

> Can we correctly recalculate IDOM after counted_loop() if strip mining 
> loop was inserted? My be we can simplify strip mining code if we know 
> that IDOM will be recalculated.

The simplest fix I can come up with (and most reliable IMO w.r.t. other 
possible bugs which aren't uncovered yet) is to set C->major_progress() 
if strip mining happened and return early to initiate the next round of 
PhaseIdealLoop and recompute IDOM info. In that case, transformations 
will see only IDOM computed by Dominators(), but it means repeated IDOM 
& loop info computations when strip mining happens.

> Would be nice to hear Roland's opinion too.

Yes, same here.

As for me:

  * I find it ugly that Dominators() and dom_lca() aren't consistent;

  * I'm in favor of normalized info (dom_lca() variant) to be computed 
from the very beginning;

  * I still believe PhaseIdealLoop::spinup() has a bug which should be 
fixed (irrespective of whether IDOM is normalized or not);

Best regards,
Vladimir Ivanov

From vladimir.kozlov at oracle.com  Tue Jan 15 00:08:37 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 14 Jan 2019 16:08:37 -0800
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes
 wrong post-dominating point
In-Reply-To: <5eb3f544-7080-b6cc-6ebe-d9c87db19717@oracle.com>
References: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
 <874lafzfiy.fsf@redhat.com> <09abefa3-37b4-5657-22f9-e06f144a1867@oracle.com>
 <21fc0b3a-87d5-0c31-0d63-75eca3c05e5b@oracle.com>
 <14b4d2c8-0cf4-4c49-309b-1838da8536a0@oracle.com>
 <6ebc3b71-bc58-0a14-20c9-aaac3a705f91@oracle.com>
 <5e4f7d3b-6d7d-3017-2926-13e932820205@oracle.com>
 <5eb3f544-7080-b6cc-6ebe-d9c87db19717@oracle.com>
Message-ID: <4748e941-763b-a38e-8686-26a0e59da581@oracle.com>

On 1/14/19 3:49 PM, Vladimir Ivanov wrote:
> 
>>>
>>> But it looks more like a workaround and I decided to go with the fix in PhaseIdealLoop::spinup() because I don't see 
>>> a reason why IDOM recomputation can't be triggered from other places.
>>
>> I am not sure your changes help to all cases.? It may indeed helps to split_if optimization but dominator information 
>> is used before it too. I see Shenandoah's optimize_loops() uses information before split_if.
> 
> I try to address only PhaseIdealLoop::spinup() case. There may be other bugs lurking in other places.
> 
>> Can we correctly recalculate IDOM after counted_loop() if strip mining loop was inserted? My be we can simplify strip 
>> mining code if we know that IDOM will be recalculated.
> 
> The simplest fix I can come up with (and most reliable IMO w.r.t. other possible bugs which aren't uncovered yet) is to 
> set C->major_progress() if strip mining happened and return early to initiate the next round of PhaseIdealLoop and 
> recompute IDOM info. In that case, transformations will see only IDOM computed by Dominators(), but it means repeated 
> IDOM & loop info computations when strip mining happens.

Yes. It is safest/conservative solution. The only issue is that we have several targeted individual calls to 
PhaseIdealLoop before we go into optimize_loops() which calls PhaseIdealLoop in loop. So initial PhaseIdealLoop call 
sequence could be altered if we bailout too soon.

> 
>> Would be nice to hear Roland's opinion too.
> 
> Yes, same here.
> 
> As for me:
> 
>  ?* I find it ugly that Dominators() and dom_lca() aren't consistent;

Agree. Should be fixed (but not urgent for 12).

> 
>  ?* I'm in favor of normalized info (dom_lca() variant) to be computed from the very beginning;

File RFE.

> 
>  ?* I still believe PhaseIdealLoop::spinup() has a bug which should be fixed (irrespective of whether IDOM is normalized 
> or not);

I agree with that too.

Again, I agree with your fix for jdk 12. Lets clean up this mess after that.

Thanks,
Vladimir

> 
> Best regards,
> Vladimir Ivanov

From vivek.r.deshpande at intel.com  Tue Jan 15 00:17:05 2019
From: vivek.r.deshpande at intel.com (Deshpande, Vivek R)
Date: Tue, 15 Jan 2019 00:17:05 +0000
Subject: RFR(S):8216050:X86: Fix for Superword optimization fails with
 assert(0 <= i && i < _len) failed: illegal index
In-Reply-To: <14067057-ec56-8c07-8f79-d1a29c7e20b7@oracle.com>
References: <53E8E64DB2403849AFD89B7D4DAC8B2A9A148B9C@ORSMSX106.amr.corp.intel.com>
 <045def80-a536-ae87-7384-09b30e5a8d78@oracle.com>
 <53E8E64DB2403849AFD89B7D4DAC8B2A9A14A45C@ORSMSX106.amr.corp.intel.com>
 <14067057-ec56-8c07-8f79-d1a29c7e20b7@oracle.com>
Message-ID: <53E8E64DB2403849AFD89B7D4DAC8B2A9A14CD5E@ORSMSX106.amr.corp.intel.com>

Hi Vladimir

Thanks for looking at the patch. 
The MulAddS2I node gets packed in follow_use_defs() with this approach in which we just perform swaps in follow_def_uses and return false.
This way MulAddS2I nodes gets the right alignment of multiple of 4 from its outs.
If we return true after the swaps in follow_def_uses(), it gets alignment  as multiple of 2(from LoadS) for packing, instead of multiple of 4.

Regards,
Vivek

-----Original Message-----
From: Vladimir Kozlov [mailto:vladimir.kozlov at oracle.com] 
Sent: Monday, January 14, 2019 2:37 PM
To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; Tobias Hartmann <tobias.hartmann at oracle.com>; hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-dev at openjdk.java.net>
Cc: Raj, Guru <guru.raj at intel.com>
Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails with assert(0 <= i && i < _len) failed: illegal index

Hi Vivek,

I do not understand changes in superword.cpp.

muladds2i will never be packed in follow_def_uses() since you return 'false' for muladds2i in all cases when u1 != u2 (even when i1 == i2). Is it intentional?

Thanks,
Vladimir

On 1/11/19 11:38 AM, Deshpande, Vivek R wrote:
> Hi Tobias
> 
> Thanks for reviewing the patch.
> I have made the changes according to your suggestion.
> In this webrev: 
> http://cr.openjdk.java.net/~vdeshpande/8216050/webrev.01/
> I have fix for the crash reported in the 8216050.
> 
> The lower cost is needed for generation of vpdpwssd instruction, by combining AddVI and MulAddVS2VI.
> For other instructions pmaddwd and vpmaddwd, they get generated on platforms upto skylake with default cost.
> 
> I have updated the bug also with the link to webrev.
> 
> I have created a different bug JDK-8216580 for
>   3) Fix generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have different control RangeCheck nodes
>       for a[i] and a[i+1] accesses in same MulAddS2I node
> 
> Thank you.
> Regards,
> Vivek
> 
> -----Original Message-----
> From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com]
> Sent: Friday, January 11, 2019 4:49 AM
> To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; 
> hotspot-compiler-dev at openjdk.java.net compiler 
> <hotspot-compiler-dev at openjdk.java.net>
> Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, Sandhya 
> <sandhya.viswanathan at intel.com>; Raj, Guru <guru.raj at intel.com>
> Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails 
> with assert(0 <= i && i < _len) failed: illegal index
> 
> Hi Vivek,
> 
> On 11.01.19 07:58, Deshpande, Vivek R wrote:
>> 1) Fix for the crash by matching the operand by swapping to right positions.
> 
> Looks good but the change to loopopts.cpp:530 screwed up the indentation around the ifs, please fix.
> 
>> 2) Cost based generation of vpdpwssd instruction.
> 
> Other instructions added by JDK-8214751 still miss a cost definition, for example:
> http://hg.openjdk.java.net/jdk/jdk/rev/4bb6e0871bf7#l5.20
> 
>> 3) Fix generation of vector code by allowing adjacent LoadS nodes to 
>> be isomorphic when they have different control RangeCheck nodes
>>  ????for a[i] and a[i+1] accesses in same MulAddS2I node
> 
> This is unrelated to the original bug, right? If so, this should be integrated with a separate RFE.
> 
> Thanks,
> Tobias
> 

From vladimir.kozlov at oracle.com  Tue Jan 15 00:26:15 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 14 Jan 2019 16:26:15 -0800
Subject: RFR(S):8216050:X86: Fix for Superword optimization fails with
 assert(0 <= i && i < _len) failed: illegal index
In-Reply-To: <53E8E64DB2403849AFD89B7D4DAC8B2A9A14CD5E@ORSMSX106.amr.corp.intel.com>
References: <53E8E64DB2403849AFD89B7D4DAC8B2A9A148B9C@ORSMSX106.amr.corp.intel.com>
 <045def80-a536-ae87-7384-09b30e5a8d78@oracle.com>
 <53E8E64DB2403849AFD89B7D4DAC8B2A9A14A45C@ORSMSX106.amr.corp.intel.com>
 <14067057-ec56-8c07-8f79-d1a29c7e20b7@oracle.com>
 <53E8E64DB2403849AFD89B7D4DAC8B2A9A14CD5E@ORSMSX106.amr.corp.intel.com>
Message-ID: <2a9920a1-0ec3-87cf-2c71-7bdcfcb796be@oracle.com>

On 1/14/19 4:17 PM, Deshpande, Vivek R wrote:
> Hi Vladimir
> 
> Thanks for looking at the patch.
> The MulAddS2I node gets packed in follow_use_defs() with this approach in which we just perform swaps in follow_def_uses and return false.

Got it. I confused follow_use_defs() with follow_def_uses().

Changes are good.

Vladimir

> This way MulAddS2I nodes gets the right alignment of multiple of 4 from its outs.
> If we return true after the swaps in follow_def_uses(), it gets alignment  as multiple of 2(from LoadS) for packing, instead of multiple of 4.
> 
> Regards,
> Vivek
> 
> -----Original Message-----
> From: Vladimir Kozlov [mailto:vladimir.kozlov at oracle.com]
> Sent: Monday, January 14, 2019 2:37 PM
> To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; Tobias Hartmann <tobias.hartmann at oracle.com>; hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-dev at openjdk.java.net>
> Cc: Raj, Guru <guru.raj at intel.com>
> Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails with assert(0 <= i && i < _len) failed: illegal index
> 
> Hi Vivek,
> 
> I do not understand changes in superword.cpp.
> 
> muladds2i will never be packed in follow_def_uses() since you return 'false' for muladds2i in all cases when u1 != u2 (even when i1 == i2). Is it intentional?
> 
> Thanks,
> Vladimir
> 
> On 1/11/19 11:38 AM, Deshpande, Vivek R wrote:
>> Hi Tobias
>>
>> Thanks for reviewing the patch.
>> I have made the changes according to your suggestion.
>> In this webrev:
>> http://cr.openjdk.java.net/~vdeshpande/8216050/webrev.01/
>> I have fix for the crash reported in the 8216050.
>>
>> The lower cost is needed for generation of vpdpwssd instruction, by combining AddVI and MulAddVS2VI.
>> For other instructions pmaddwd and vpmaddwd, they get generated on platforms upto skylake with default cost.
>>
>> I have updated the bug also with the link to webrev.
>>
>> I have created a different bug JDK-8216580 for
>>    3) Fix generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have different control RangeCheck nodes
>>        for a[i] and a[i+1] accesses in same MulAddS2I node
>>
>> Thank you.
>> Regards,
>> Vivek
>>
>> -----Original Message-----
>> From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com]
>> Sent: Friday, January 11, 2019 4:49 AM
>> To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>;
>> hotspot-compiler-dev at openjdk.java.net compiler
>> <hotspot-compiler-dev at openjdk.java.net>
>> Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, Sandhya
>> <sandhya.viswanathan at intel.com>; Raj, Guru <guru.raj at intel.com>
>> Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails
>> with assert(0 <= i && i < _len) failed: illegal index
>>
>> Hi Vivek,
>>
>> On 11.01.19 07:58, Deshpande, Vivek R wrote:
>>> 1) Fix for the crash by matching the operand by swapping to right positions.
>>
>> Looks good but the change to loopopts.cpp:530 screwed up the indentation around the ifs, please fix.
>>
>>> 2) Cost based generation of vpdpwssd instruction.
>>
>> Other instructions added by JDK-8214751 still miss a cost definition, for example:
>> http://hg.openjdk.java.net/jdk/jdk/rev/4bb6e0871bf7#l5.20
>>
>>> 3) Fix generation of vector code by allowing adjacent LoadS nodes to
>>> be isomorphic when they have different control RangeCheck nodes
>>>   ????for a[i] and a[i+1] accesses in same MulAddS2I node
>>
>> This is unrelated to the original bug, right? If so, this should be integrated with a separate RFE.
>>
>> Thanks,
>> Tobias
>>

From tom.rodriguez at oracle.com  Tue Jan 15 07:09:12 2019
From: tom.rodriguez at oracle.com (Tom Rodriguez)
Date: Mon, 14 Jan 2019 23:09:12 -0800
Subject: [12] RFR(XS) 8215748: Application fails when executed with Graal
Message-ID: <63a38944-2d0a-ef61-bea5-e709b4623692@oracle.com>

http://cr.openjdk.java.net/~never/8215748/webrev
https://bugs.openjdk.java.net/browse/JDK-8215748

If an interface method attempts to invoke an array clone method, JVMCI 
doesn't let you resolve the invoke properly which can result in 
performance problems or unexpected NullPointerExceptions.  clone is 
publicly visible on arrays but is protected in Object.  HotSpot doesn't 
have an actual Method* for the array clone operations, it just reuses 
Object.clone.  This is accomplished with some trickery in the 
linkResolver.cpp that adjusts the visibility during resolution if an 
array class is involved.  JVMCI only deals with concrete methods so when 
a call site is resolved you get back the real Object.clone.  If you try 
to use resolveMethod on it then it will resolve it relative to Object 
instead of using the array type.  This works ok when the accessing class 
is an class but for interface types it fails.  In benign cases Graal 
just ends up falling back to a regular call which is slower than normal. 
  In this case we were attempting to resolve an invoke for a profiled 
call site and got back null which shouldn't happen.  The fix is the use 
the array class as the method type in this particular case which mirrors 
the logic in the linkResolver.cpp that adjusts the visibility check. 
Tested with Spark and the new unit test.  mach5 testing is ongoing.

From rwestrel at redhat.com  Tue Jan 15 09:03:21 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Tue, 15 Jan 2019 10:03:21 +0100
Subject: RFR(S): 8217042: Shenandoah: write barrier on backedge of strip mined
 loop causes c2 crash at expansion time
Message-ID: <874laaxpgm.fsf@redhat.com>


http://cr.openjdk.java.net/~roland/8217042/webrev.00/

If a write barrier is in the body of the outer strip mined loop,
expanding it causes loop strip mining verification code to fail. This is
worked around by turning the strip mined loop nest into a regular
counted loop nest so verification code doesn't trigger. The logic that
takes care of that breaks when the write barrier is on the backedge of
the strip mined loop because it is applied after the barrier is
expanded. The fix I propose is to move that logic before barrier
expansion.

Roland.

From rwestrel at redhat.com  Tue Jan 15 09:19:20 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Tue, 15 Jan 2019 10:19:20 +0100
Subject: RFR(S): 8217043: Shenandoah: SIGSEGV in Type::meet_helper() at
 barrier expansion time
Message-ID: <87y37mwa5j.fsf@redhat.com>


http://cr.openjdk.java.net/~roland/8217043/webrev.00/

The ShenandoahBarrierNode::needs_barrier_impl() encounters a
CallLeafNode (from a write barrier) and tries to get the type of n which
is a tuple, not a pointer and this causes a null pointer
dereference. The write barrier runtime call should anyway prevent an
optimization of the barrier and to be on the safe side, any call should.

Roland.

From shade at redhat.com  Tue Jan 15 09:26:44 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Tue, 15 Jan 2019 10:26:44 +0100
Subject: RFR(S): 8217042: Shenandoah: write barrier on backedge of strip
 mined loop causes c2 crash at expansion time
In-Reply-To: <874laaxpgm.fsf@redhat.com>
References: <874laaxpgm.fsf@redhat.com>
Message-ID: <0b1a05ce-5cd6-4791-fad4-d70c7ac9be24@redhat.com>

On 1/15/19 10:03 AM, Roland Westrelin wrote:
> http://cr.openjdk.java.net/~roland/8217042/webrev.00/

Cannot comment on the patch itself, it looks fine to my untrained eye.

You might want to fix indents in two places before pushing, these should be inside the if-s?

Here:

2666     if (loop->_head->is_OuterStripMinedLoop()) {
2667     // Expanding a barrier here will break loop strip mining
2668     // verification. Transform the loop so the loop nest doesn't
2669     // appear as strip mined.

and here:

2680     if (loop->_head->is_OuterStripMinedLoop()) {
2681     // Expanding a barrier here will break loop strip mining
2682     // verification. Transform the loop so the loop nest doesn't
2683     // appear as strip mined.

-Aleksey

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190115/c5871ab6/signature-0001.asc>

From shade at redhat.com  Tue Jan 15 10:01:40 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Tue, 15 Jan 2019 11:01:40 +0100
Subject: RFR(S): 8217043: Shenandoah: SIGSEGV in Type::meet_helper() at
 barrier expansion time
In-Reply-To: <87y37mwa5j.fsf@redhat.com>
References: <87y37mwa5j.fsf@redhat.com>
Message-ID: <c07786f7-c810-8491-1555-674cf95bd512@redhat.com>

On 1/15/19 10:19 AM, Roland Westrelin wrote:
> http://cr.openjdk.java.net/~roland/8217043/webrev.00/
> 
> The ShenandoahBarrierNode::needs_barrier_impl() encounters a
> CallLeafNode (from a write barrier) and tries to get the type of n which
> is a tuple, not a pointer and this causes a null pointer
> dereference. The write barrier runtime call should anyway prevent an
> optimization of the barrier and to be on the safe side, any call should.

Looks good to me.

-Aleksey

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190115/3859d718/signature.asc>

From aph at redhat.com  Tue Jan 15 10:14:54 2019
From: aph at redhat.com (Andrew Haley)
Date: Tue, 15 Jan 2019 10:14:54 +0000
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <CAEgw74DXHF4cmVzOkHtvGuO937QqaypRFNvuB5sfDw3h8HOUPQ@mail.gmail.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
 <ba49e1b8-b7c8-b804-f973-6e88cb2bcd8a@redhat.com>
 <CAEgw74DXHF4cmVzOkHtvGuO937QqaypRFNvuB5sfDw3h8HOUPQ@mail.gmail.com>
Message-ID: <e5697cd2-49d6-724e-1b63-41de0e771965@redhat.com>

On 1/13/19 5:10 PM, B. Blaser wrote:
> On Thu, 10 Jan 2019 at 10:19, Andrew Haley <aph at redhat.com> wrote:
>>
>> On 1/9/19 12:13 PM, Roman Kennke wrote:
>>> I cannot say if if this has performance implication. I suspect not. If
>>> it has, it's probably miniscule improvement. I can't see how it could be
>>> worse though.
>>
>> I can. x86 can have some very weird performance characteristics. It'd be
>> helpful to do some measurement.
> 
> I'm not sure we are really able to conclude anything from performance
> measurement on highly implementation-dependent instructions unless we
> make an average on a significant number of different x86_64 processors
> which might well change with future generations...
> 
> Shouldn't we follow a more pragmatic direction considering that less
> instructions/registers and a better/smaller encoding is generally
> preferable, as Roman suggested, which is the purpose of complex
> instruction sets?

I'm not sure that CISC has a purpose, as such.

See the analysis of GCC performance in
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=56309 :


Quick summary: Conditional moves on Intel Core/Xeon and AMD Bulldozer
architectures should probably be avoided "as a rule."

History: Conditional moves were beneficial for the Intel Pentium 4, and also
(but less-so) for AMD Athlon/Phenom chips.  In the AMD Athlon/Phenom case the
performance of cmov vs cmp+branch is determined more by the alignment of the
target of the branch, than by the prediction rate of the branch.  The
instruction decoders would incur penalties on certain types of unaligned branch
targets (when taken), or when decoding sequences of instructions that contained
multiple branches within a 16byte "fetch" window (taken or not).  cmov was
sometimes handy for avoiding those.

With regard to more current Intel Core and AMD Bulldozer/Bobcat architecture:

I have found that use of conditional moves (cmov) is only beneficial if the
branch that the move is replacing is badly mis-predicted.  In my tests, the
cmov only became clearly "optimal" when the branch was predicted correctly less
than 92% of the time, which is abysmal by modern branch predictor standards and
rarely occurs in practice.  Above 97% prediction rates, cmov is typically
slower than cmp+branch. Inside loops that contain branches with prediction
rates approaching 100% (as is the case presented by the OP), cmov becomes a
severe performance bottleneck.  This holds true for both Core and Bulldozer.
Bulldozer has less efficient branching than the i7, but is also severely
bottlenecked by its limited fetch/decode.  Cmov requires executing more total
instructions, and that makes Bulldozer very unhappy.

Note that my tests involved relatively simple loops that did not suffer from
the added register pressure that cmov introduces.  In practice, the prognosis
for cmov being "optimal" is even worse than what I've observed in a controlled
environment.  Furthermore, to my knowledge the status of cmov vs. branch
performance on x86 will not be changing anytime soon.  cmov will continue to be
a liability well into the next couple architecture releases from Intel and AMD.
 Piledriver will have added fetch/decode resources but should also have a
smaller mispredict penalty, so its doubtful cmov will gain much advantages
there either.

Therefore I would recommend setting -fno-tree-loop-if-convert for all -march
matching Intel Core and AMD Bulldozer/Bobcat families.


-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From rkennke at redhat.com  Tue Jan 15 10:17:19 2019
From: rkennke at redhat.com (Roman Kennke)
Date: Tue, 15 Jan 2019 11:17:19 +0100
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <e5697cd2-49d6-724e-1b63-41de0e771965@redhat.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
 <ba49e1b8-b7c8-b804-f973-6e88cb2bcd8a@redhat.com>
 <CAEgw74DXHF4cmVzOkHtvGuO937QqaypRFNvuB5sfDw3h8HOUPQ@mail.gmail.com>
 <e5697cd2-49d6-724e-1b63-41de0e771965@redhat.com>
Message-ID: <b16d13b4-40f4-1503-765a-82799acf1527@redhat.com>

>>>> I cannot say if if this has performance implication. I suspect not. If
>>>> it has, it's probably miniscule improvement. I can't see how it could be
>>>> worse though.
>>>
>>> I can. x86 can have some very weird performance characteristics. It'd be
>>> helpful to do some measurement.
>>
>> I'm not sure we are really able to conclude anything from performance
>> measurement on highly implementation-dependent instructions unless we
>> make an average on a significant number of different x86_64 processors
>> which might well change with future generations...
>>
>> Shouldn't we follow a more pragmatic direction considering that less
>> instructions/registers and a better/smaller encoding is generally
>> preferable, as Roman suggested, which is the purpose of complex
>> instruction sets?
> 
> I'm not sure that CISC has a purpose, as such.
> 
> See the analysis of GCC performance in
> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=56309 :
> 
> 
> Quick summary: Conditional moves on Intel Core/Xeon and AMD Bulldozer
> architectures should probably be avoided "as a rule."
> 
> History: Conditional moves were beneficial for the Intel Pentium 4, and also
> (but less-so) for AMD Athlon/Phenom chips.  In the AMD Athlon/Phenom case the
> performance of cmov vs cmp+branch is determined more by the alignment of the
> target of the branch, than by the prediction rate of the branch.  The
> instruction decoders would incur penalties on certain types of unaligned branch
> targets (when taken), or when decoding sequences of instructions that contained
> multiple branches within a 16byte "fetch" window (taken or not).  cmov was
> sometimes handy for avoiding those.
> 
> With regard to more current Intel Core and AMD Bulldozer/Bobcat architecture:
> 
> I have found that use of conditional moves (cmov) is only beneficial if the
> branch that the move is replacing is badly mis-predicted.  In my tests, the
> cmov only became clearly "optimal" when the branch was predicted correctly less
> than 92% of the time, which is abysmal by modern branch predictor standards and
> rarely occurs in practice.  Above 97% prediction rates, cmov is typically
> slower than cmp+branch. Inside loops that contain branches with prediction
> rates approaching 100% (as is the case presented by the OP), cmov becomes a
> severe performance bottleneck.  This holds true for both Core and Bulldozer.
> Bulldozer has less efficient branching than the i7, but is also severely
> bottlenecked by its limited fetch/decode.  Cmov requires executing more total
> instructions, and that makes Bulldozer very unhappy.
> 
> Note that my tests involved relatively simple loops that did not suffer from
> the added register pressure that cmov introduces.  In practice, the prognosis
> for cmov being "optimal" is even worse than what I've observed in a controlled
> environment.  Furthermore, to my knowledge the status of cmov vs. branch
> performance on x86 will not be changing anytime soon.  cmov will continue to be
> a liability well into the next couple architecture releases from Intel and AMD.
>  Piledriver will have added fetch/decode resources but should also have a
> smaller mispredict penalty, so its doubtful cmov will gain much advantages
> there either.
> 
> Therefore I would recommend setting -fno-tree-loop-if-convert for all -march
> matching Intel Core and AMD Bulldozer/Bobcat families.
> 

I agree with that. However, note that this is not about using cmov vs.
branches. This is about generating a load followed by a cmov on the
resulting register vs generating a cmov that also does the load and
avoids the register. It's pretty much the same data-dependency-wise,
except that it avoids using the extra register and encodes smaller.

Roman


-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190115/bc7b439a/signature.asc>

From tobias.hartmann at oracle.com  Tue Jan 15 10:32:15 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 15 Jan 2019 11:32:15 +0100
Subject: [12] RFR(XS) 8215748: Application fails when executed with Graal
In-Reply-To: <63a38944-2d0a-ef61-bea5-e709b4623692@oracle.com>
References: <63a38944-2d0a-ef61-bea5-e709b4623692@oracle.com>
Message-ID: <a3fbae06-4428-7580-c649-410265bd1f5a@oracle.com>

Hi Tom,

this looks good to me. You might want to reference the related code in
LinkResolver::check_method_accessability in your comment(no new webrev required).

Best regards,
Tobias

On 15.01.19 08:09, Tom Rodriguez wrote:
> http://cr.openjdk.java.net/~never/8215748/webrev
> https://bugs.openjdk.java.net/browse/JDK-8215748
> 
> If an interface method attempts to invoke an array clone method, JVMCI doesn't let you resolve the
> invoke properly which can result in performance problems or unexpected NullPointerExceptions.? clone
> is publicly visible on arrays but is protected in Object.? HotSpot doesn't have an actual Method*
> for the array clone operations, it just reuses Object.clone.? This is accomplished with some
> trickery in the linkResolver.cpp that adjusts the visibility during resolution if an array class is
> involved.? JVMCI only deals with concrete methods so when a call site is resolved you get back the
> real Object.clone.? If you try to use resolveMethod on it then it will resolve it relative to Object
> instead of using the array type.? This works ok when the accessing class is an class but for
> interface types it fails.? In benign cases Graal just ends up falling back to a regular call which
> is slower than normal. ?In this case we were attempting to resolve an invoke for a profiled call
> site and got back null which shouldn't happen.? The fix is the use the array class as the method
> type in this particular case which mirrors the logic in the linkResolver.cpp that adjusts the
> visibility check. Tested with Spark and the new unit test.? mach5 testing is ongoing.

From aph at redhat.com  Tue Jan 15 10:44:33 2019
From: aph at redhat.com (Andrew Haley)
Date: Tue, 15 Jan 2019 10:44:33 +0000
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <b16d13b4-40f4-1503-765a-82799acf1527@redhat.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
 <ba49e1b8-b7c8-b804-f973-6e88cb2bcd8a@redhat.com>
 <CAEgw74DXHF4cmVzOkHtvGuO937QqaypRFNvuB5sfDw3h8HOUPQ@mail.gmail.com>
 <e5697cd2-49d6-724e-1b63-41de0e771965@redhat.com>
 <b16d13b4-40f4-1503-765a-82799acf1527@redhat.com>
Message-ID: <a4e64632-1187-1b3a-8c88-ac4b039acc71@redhat.com>

On 1/15/19 10:17 AM, Roman Kennke wrote:
> I agree with that. However, note that this is not about using cmov vs.
> branches. This is about generating a load followed by a cmov on the
> resulting register vs generating a cmov that also does the load and
> avoids the register. It's pretty much the same data-dependency-wise,
> except that it avoids using the extra register and encodes smaller.

Sure, I get that. But, for the reasons given, CMOV is a rather dusty
corner of the ISA. Intel themselves recommend not using it unless you
know that the branch is always unpredictable. They say "Use the SETCC
and CMOV instructions to eliminate unpredictable conditional branches
where possible. Do not do this for predictable branches." It really
couldn't be clearer.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From tobias.hartmann at oracle.com  Tue Jan 15 10:56:38 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 15 Jan 2019 11:56:38 +0100
Subject: RFR(XS):8216580:X86: Fix generation of VNNI vector code by
 allowing adjacent LoadS nodes to be isomorphic
In-Reply-To: <53E8E64DB2403849AFD89B7D4DAC8B2A9A14A6DA@ORSMSX106.amr.corp.intel.com>
References: <53E8E64DB2403849AFD89B7D4DAC8B2A9A14A6DA@ORSMSX106.amr.corp.intel.com>
Message-ID: <abb3e9b2-15c5-20a5-46c8-e6cc01ba4a62@oracle.com>

Hi Vivek,

please add parentheses around the == comparison in lines 1225,1226.

Otherwise this looks reasonable to me but I'm not too familiar with that code.

Best regards,
Tobias

On 12.01.19 01:03, Deshpande, Vivek R wrote:
> Hi Tobias
> 
> The webrev for the bug JDK-821650 is here:
> http://cr.openjdk.java.net/~vdeshpande/8216580/webrev.00/
> This fixes generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have different control RangeCheck nodes for a[i] and a[i+1] accesses in same MulAddS2I node.
> Could you please review it.
> 
> Regards,
> Vivek
> 
> -----Original Message-----
> From: Deshpande, Vivek R 
> Sent: Friday, January 11, 2019 11:38 AM
> To: 'Tobias Hartmann' <tobias.hartmann at oracle.com>; hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-dev at openjdk.java.net>
> Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; Raj, Guru <guru.raj at intel.com>
> Subject: RE: RFR(S):8216050:X86: Fix for Superword optimization fails with assert(0 <= i && i < _len) failed: illegal index
> 
> Hi Tobias
> 
> Thanks for reviewing the patch.
> I have made the changes according to your suggestion.
> In this webrev: http://cr.openjdk.java.net/~vdeshpande/8216050/webrev.01/
> I have fix for the crash reported in the 8216050.
> 
> The lower cost is needed for generation of vpdpwssd instruction, by combining AddVI and MulAddVS2VI.
> For other instructions pmaddwd and vpmaddwd, they get generated on platforms upto skylake with default cost.
> 
> I have updated the bug also with the link to webrev.
> 
> I have created a different bug JDK-8216580 for
>  3) Fix generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have different control RangeCheck nodes
>      for a[i] and a[i+1] accesses in same MulAddS2I node
> 
> Thank you.
> Regards,
> Vivek
> 
> -----Original Message-----
> From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com]
> Sent: Friday, January 11, 2019 4:49 AM
> To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-dev at openjdk.java.net>
> Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; Raj, Guru <guru.raj at intel.com>
> Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails with assert(0 <= i && i < _len) failed: illegal index
> 
> Hi Vivek,
> 
> On 11.01.19 07:58, Deshpande, Vivek R wrote:
>> 1) Fix for the crash by matching the operand by swapping to right positions. 
> 
> Looks good but the change to loopopts.cpp:530 screwed up the indentation around the ifs, please fix.
> 
>> 2) Cost based generation of vpdpwssd instruction. 
> 
> Other instructions added by JDK-8214751 still miss a cost definition, for example:
> http://hg.openjdk.java.net/jdk/jdk/rev/4bb6e0871bf7#l5.20
> 
>> 3) Fix generation of vector code by allowing adjacent LoadS nodes to 
>> be isomorphic when they have different control RangeCheck nodes
>> ????for a[i] and a[i+1] accesses in same MulAddS2I node
> 
> This is unrelated to the original bug, right? If so, this should be integrated with a separate RFE.
> 
> Thanks,
> Tobias
> 

From martin.doerr at sap.com  Tue Jan 15 11:05:44 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Tue, 15 Jan 2019 11:05:44 +0000
Subject: RFR(S): 8216556: Unnecessary liveness computation with JVMTI
In-Reply-To: <d6a8ad3b4cb94e18ad2de7f05fd1c1dd@sap.com>
References: <88842ba1a169406d9628ab06665bd787@sap.com>
 <9c7afb40-cc2b-9ae8-fb70-4ac3bacb72da@oracle.com>
 <3a600790198e4bbbb6f253daf0af8ff0@sap.com>
 <d6a8ad3b4cb94e18ad2de7f05fd1c1dd@sap.com>
Message-ID: <01121e2319ea44bf8aee088ffb32a617@sap.com>

Hi Vladimir, Dean and Claes,

thank you for reviewing.
I assume the version which moves the implementation of should_retain_local_variables() to the hpp file (as suggested by Claes) is fine.
I'll push this version if there are no objections.

Best regards,
Martin


-----Original Message-----
From: hotspot-compiler-dev <hotspot-compiler-dev-bounces at openjdk.java.net> On Behalf Of Doerr, Martin
Sent: Montag, 14. Januar 2019 09:31
To: Claes Redestad <claes.redestad at oracle.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
Subject: [CAUTION] RE: RFR(S): 8216556: Unnecessary liveness computation with JVMTI

Hi Claes,

excellent proposal. Thanks. I had not noticed that it currently is in a cpp file.

New webrev:
http://cr.openjdk.java.net/~mdoerr/8216556_JVMTI_liveness/webrev.01/

What I still don't really like is that we're passing MethodLivenessResult objects on stack via 3 compilation units.
But I don't know if it's worth refactoring the code.

Best regards,
Martin


-----Original Message-----
From: Claes Redestad <claes.redestad at oracle.com> 
Sent: Freitag, 11. Januar 2019 16:45
To: Doerr, Martin <martin.doerr at sap.com>
Subject: Re: RFR(S): 8216556: Unnecessary liveness computation with JVMTI

Hi,

  just a random thought, but if you're optimizing this and got some
measure where it matters(?), maybe you should also try inlining
ciEnv::should_retain_local_variables(), i.e., move definition to
ciEnv.hpp. If it doesn't bloat static binary size it seems like it won't
hurt, at least.

/Claes

On 2019-01-11 13:55, Doerr, Martin wrote:
> Hi,
> 
> I'd like to contribute a small JIT improvement for JVMTI to avoid 
> calling raw_liveness_at_bci when its result is not needed.
> 
> Bug with description:
> 
> https://bugs.openjdk.java.net/browse/JDK-8216556
> 
> Webrev:
> 
> http://cr.openjdk.java.net/~mdoerr/8216556_JVMTI_liveness/webrev.00/
> 
> Please review.
> 
> Best regards,
> 
> Martin
> 

From rkennke at redhat.com  Tue Jan 15 11:16:42 2019
From: rkennke at redhat.com (Roman Kennke)
Date: Tue, 15 Jan 2019 12:16:42 +0100
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <a4e64632-1187-1b3a-8c88-ac4b039acc71@redhat.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
 <ba49e1b8-b7c8-b804-f973-6e88cb2bcd8a@redhat.com>
 <CAEgw74DXHF4cmVzOkHtvGuO937QqaypRFNvuB5sfDw3h8HOUPQ@mail.gmail.com>
 <e5697cd2-49d6-724e-1b63-41de0e771965@redhat.com>
 <b16d13b4-40f4-1503-765a-82799acf1527@redhat.com>
 <a4e64632-1187-1b3a-8c88-ac4b039acc71@redhat.com>
Message-ID: <81dc5407-4985-883c-83fc-2e1fa2b77e66@redhat.com>

>> I agree with that. However, note that this is not about using cmov vs.
>> branches. This is about generating a load followed by a cmov on the
>> resulting register vs generating a cmov that also does the load and
>> avoids the register. It's pretty much the same data-dependency-wise,
>> except that it avoids using the extra register and encodes smaller.
> 
> Sure, I get that. But, for the reasons given, CMOV is a rather dusty
> corner of the ISA. Intel themselves recommend not using it unless you
> know that the branch is always unpredictable. They say "Use the SETCC
> and CMOV instructions to eliminate unpredictable conditional branches
> where possible. Do not do this for predictable branches." It really
> couldn't be clearer.

Well yeah, but again, this patch isn't about generating cmov or not, it
only changes that a cmov preceded by a load (mov) is generated as single
instruction rather than two instructions for object loads, pretty much
as it's done for all the other types. However, it's not very important
to me, and probably anybody else, otherwise this wouldn't have been
commented-out. I'd withdraw the patch unless somebody steps up and
really wants it.

Roman


-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190115/f3dff3a1/signature.asc>

From lutz.schmidt at sap.com  Tue Jan 15 11:26:55 2019
From: lutz.schmidt at sap.com (Schmidt, Lutz)
Date: Tue, 15 Jan 2019 11:26:55 +0000
Subject: RFR (M): 8216314: SIGILL in CodeHeapState::print_names()
Message-ID: <98FFB93F-674D-4994-953F-B35572E316A2@sap.com>

Dear all,

may I please request reviews for this fix, hardening CodeHeap Analytics to not fail when used in high-load (stress) scenarios. There was quite a bit of preliminary discussion, all documented in the "Comments" section of the bug.

Bug:    https://bugs.openjdk.java.net/browse/JDK-8216314 
Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8216314.01/ 

Thank you!
Lutz
 

From tobias.hartmann at oracle.com  Tue Jan 15 11:45:51 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 15 Jan 2019 12:45:51 +0100
Subject: RFR (M): 8216314: SIGILL in CodeHeapState::print_names()
In-Reply-To: <98FFB93F-674D-4994-953F-B35572E316A2@sap.com>
References: <98FFB93F-674D-4994-953F-B35572E316A2@sap.com>
Message-ID: <1c39e69b-2830-0ced-bbb2-8b5003972695@oracle.com>

Hi Lutz,

thanks for the discussions and making these changes. The fix looks good to me.

Minor style issue (no new webrev required) in codeHeapState.cpp:1289/1290/1305/1305/1306: Please add
a newline after '{' (and before '}') or at least a whitespace.

Best regards,
Tobias

On 15.01.19 12:26, Schmidt, Lutz wrote:
> Dear all,
> 
> may I please request reviews for this fix, hardening CodeHeap Analytics to not fail when used in high-load (stress) scenarios. There was quite a bit of preliminary discussion, all documented in the "Comments" section of the bug.
> 
> Bug:    https://bugs.openjdk.java.net/browse/JDK-8216314 
> Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8216314.01/ 
> 
> Thank you!
> Lutz
>  
> 

From lutz.schmidt at sap.com  Tue Jan 15 12:57:25 2019
From: lutz.schmidt at sap.com (Schmidt, Lutz)
Date: Tue, 15 Jan 2019 12:57:25 +0000
Subject: RFR (M): 8216314: SIGILL in CodeHeapState::print_names()
In-Reply-To: <1c39e69b-2830-0ced-bbb2-8b5003972695@oracle.com>
References: <98FFB93F-674D-4994-953F-B35572E316A2@sap.com>
 <1c39e69b-2830-0ced-bbb2-8b5003972695@oracle.com>
Message-ID: <E44FB0B4-6F83-4511-8C40-2A3411B6A182@sap.com>

Thanks for the review, Tobias!

The discussions were very helpful to zero in on a good solution.
The single-line if statement are now three-liners.

Regards,
Lutz

?On 15.01.19, 12:45, "Tobias Hartmann" <tobias.hartmann at oracle.com> wrote:

    Hi Lutz,
    
    thanks for the discussions and making these changes. The fix looks good to me.
    
    Minor style issue (no new webrev required) in codeHeapState.cpp:1289/1290/1305/1305/1306: Please add
    a newline after '{' (and before '}') or at least a whitespace.
    
    Best regards,
    Tobias
    
    On 15.01.19 12:26, Schmidt, Lutz wrote:
    > Dear all,
    > 
    > may I please request reviews for this fix, hardening CodeHeap Analytics to not fail when used in high-load (stress) scenarios. There was quite a bit of preliminary discussion, all documented in the "Comments" section of the bug.
    > 
    > Bug:    https://bugs.openjdk.java.net/browse/JDK-8216314 
    > Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8216314.01/ 
    > 
    > Thank you!
    > Lutz
    >  
    > 
    

From rwestrel at redhat.com  Tue Jan 15 13:38:11 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Tue, 15 Jan 2019 14:38:11 +0100
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes
 wrong post-dominating point
In-Reply-To: <5e4f7d3b-6d7d-3017-2926-13e932820205@oracle.com>
References: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
 <874lafzfiy.fsf@redhat.com> <09abefa3-37b4-5657-22f9-e06f144a1867@oracle.com>
 <21fc0b3a-87d5-0c31-0d63-75eca3c05e5b@oracle.com>
 <14b4d2c8-0cf4-4c49-309b-1838da8536a0@oracle.com>
 <6ebc3b71-bc58-0a14-20c9-aaac3a705f91@oracle.com>
 <5e4f7d3b-6d7d-3017-2926-13e932820205@oracle.com>
Message-ID: <87va2qvy64.fsf@redhat.com>


Hi Vladimir K & Vladimir I,

>> And that's the other way to fix the crash: initiate new PhaseIdealLoop iteration right away if any strip mined loops are 
>> introduced.
>
> Got it. So the issue is that strip mining invalidated IDOM information generated at the beginning of 
> PhaseIdealLoop::build_and_optimize().

I don't think that's accurate. The idom is changed when a loop limit
check is inserted (so that's unrelated to strip mining AFAICT). As
Vladimir said, when the loop limit check is inserted, the idom of the
region is fixed by:

    Node* nrdom = dom_lca(ridom, new_iff);
    set_idom(rgn, nrdom, dom_depth(rgn));

which does:

  Node *dom_lca( Node *n1, Node *n2 ) const {
    return find_non_split_ctrl(dom_lca_internal(n1, n2));
  }

and because of the find_non_split_ctrl(), the idom is set to a region
rather than an if.

That's broken and I'm confused as to why a straightforward change of the
logic above:

diff --git a/src/hotspot/share/opto/loopPredicate.cpp b/src/hotspot/share/opto/loopPredicate.cpp
--- a/src/hotspot/share/opto/loopPredicate.cpp
+++ b/src/hotspot/share/opto/loopPredicate.cpp
@@ -160,7 +160,7 @@
   // When called from beautify_loops() idom is not constructed yet.
   if (_idom != NULL) {
     Node* ridom = idom(rgn);
-    Node* nrdom = dom_lca(ridom, new_iff);
+    Node* nrdom = dom_lca_internal(ridom, new_iff);
     set_idom(rgn, nrdom, dom_depth(rgn));
   }
 
is not good enough.

Roland.

From nils.eliasson at oracle.com  Tue Jan 15 13:38:11 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Tue, 15 Jan 2019 14:38:11 +0100
Subject: RFR(S): 8210392: assert(Compile::current()->live_nodes() <
 Compile::current()->max_node_limit()) failed: Live Node limit exceeded limit
In-Reply-To: <e153b5d2-777b-c04b-ad36-9d9a1032d01d@oracle.com>
References: <28011331-bd43-2c32-dba4-e41879ffe28a@oracle.com>
 <99f3f410-7200-5fb1-fccd-c39e35c20288@oracle.com>
 <2f4e12e4-459d-b96b-6cf2-50d6dba098d9@oracle.com>
 <0aece297-3929-7db5-7054-190163fe65fd@oracle.com>
 <e153b5d2-777b-c04b-ad36-9d9a1032d01d@oracle.com>
Message-ID: <7dd1865d-d29a-26cb-46cf-a818c0b0f305@oracle.com>

+1

Looks good!

// Nils

On 2019-01-14 17:52, Tobias Hartmann wrote:
> Hi Patric,
>
> thanks for adding the test. This looks good to me.
>
> Best regards,
> Tobias
>
>
> On 14.01.19 17:47, Patric Hedlin wrote:
>> Thanks for reviewing Tobias,
>>
>> On 12/18/18 1:37 PM, Tobias Hartmann wrote:
>>> Hi Patric,
>>>
>>> were you able to reproduce this with a test (I see that one is attached to the bug)? If so, please
>>> add it to the webrev. Please also remove the extra newlines (for example, in line 1146).
>>>
>>> The comment in line 1027 says "Use same limit as split_if_with_blocks_post". I think this is
>>> outdated right?
>> Updated webrev with test-case.
>>
>> Fixed #?%#.
>>
>> Best regards,
>> Patric
>>
>>> Best regards,
>>> Tobias
>>>
>>> On 18.12.18 12:48, Patric Hedlin wrote:
>>>> Dear all,
>>>>
>>>> I would like to ask for help to review the following change/update:
>>>>
>>>> Issue:? https://bugs.openjdk.java.net/browse/JDK-8210392
>>>>
>>>> Webrev: http://cr.openjdk.java.net/~phedlin/tr8210392/
>>>>
>>>>
>>>> 8210392: assert(Compile::current()->live_nodes() < Compile::current()->max_node_limit()) failed:
>>>> Live Node limit exceeded limit
>>>>
>>>>  ???? Avoid excessive split-if through a crude throttling approach.
>>>>
>>>>
>>>> Testing: hs-tier1-4, hs-precheckin-comp
>>>>
>>>>
>>>> Best regards,
>>>> Patric

From magnus.ihse.bursie at oracle.com  Tue Jan 15 14:05:23 2019
From: magnus.ihse.bursie at oracle.com (Magnus Ihse Bursie)
Date: Tue, 15 Jan 2019 15:05:23 +0100
Subject: RFR(M): 8215902: Add support for SoftFloat-3e library
In-Reply-To: <4497ca084b9f48dbb8f6de1aa35c83653fd7acfb.camel@gmail.com>
References: <4497ca084b9f48dbb8f6de1aa35c83653fd7acfb.camel@gmail.com>
Message-ID: <7f69fc73-1c10-6b68-d657-c9e758d4bf1d@oracle.com>

On 2018-12-25 16:19, Jakub Van?k wrote:
> Hi,
>
> please review this webrev. It is a successor of the softfloat-3 [patch]
> thread (first email
> http://mail.openjdk.java.net/pipermail/hotspot-runtime-dev/2018-November/031311.html
> )
>
> Changes since the last patch (v6):
>
> - renamed --with-softloat* to --with-sflt* (it is more compact and it
>    corresponds to the old --with-sflt-lib=... option)
>
> - license is now obtained via --with-sflt-license switch (so it is not
>    included in OpenJDK source tree)
>
> - updated documentation (slight rewording, added the license option)
>
> - checks for default --with/--without behavior are in place again
>    (I forgot them when I changed the way the library is detected)
>
> - added a simple testcase - I found a disrepancy between softfloat and
>    system function behavior. When a float with bits 0x003FFFFF is
>    added to 0x00000001, the correct result is 0x00400000, but the
>    default software floating point implementation returns 0x00000000.
>    However I'm not sure where to put this test - now it is in
>    test/hotspot/jtreg/compiler/floatingpoint.
>
> - comments in code refer to CR 6757269 and newly JDK-8215902 too.
>
> I have created a repository with SoftFloat-3e with build configuration
> specifically for OpenJDK on armel:
> https://github.com/ev3dev-lang-java/softfloat-openjdk
>
> I can add a link to it to the documentation.
>
> Bug: https://bugs.openjdk.java.net/browse/JDK-8215902
> Webrev: http://cr.openjdk.java.net/~jakvanek/8215902/webrev.02/

Hi Jakub,

In general this looks good.

Some comments:

I agree with Erik that you can add a link to your github project; 
compiling SoftFloat is outside the scope of the OpenJDK build 
instructions, but it can sure be helpful to lower the bar for users 
wanting to do that. Just one question: any particular reason you didn't 
create your github repo by forking the official 
https://github.com/ucb-bar/berkeley-softfloat-3? That way, it would have 
been easy for users to see that you were not adding any malicious or 
suspicious code to the original SoftFloat distribution.

On the other hand, I think the link to 
http://mail.openjdk.java.net/pipermail/aarch32-port-dev/2016-November/000611.html 
is unnecessary and just creates clutter in the documentation. Please 
remove it.

/Magnus
> CI build: https://ci.adoptopenjdk.net/view/ev3dev/job/openjdk12_build_ev3_linux/67/
>
> Cheers,
>
> Jakub
>


From linuxtardis at gmail.com  Tue Jan 15 16:31:52 2019
From: linuxtardis at gmail.com (Jakub =?UTF-8?Q?Van=C4=9Bk?=)
Date: Tue, 15 Jan 2019 17:31:52 +0100
Subject: RFR(M)(round 2): 8215902: Add support for SoftFloat-3e library
In-Reply-To: <7f69fc73-1c10-6b68-d657-c9e758d4bf1d@oracle.com>
References: <4497ca084b9f48dbb8f6de1aa35c83653fd7acfb.camel@gmail.com>
 <7f69fc73-1c10-6b68-d657-c9e758d4bf1d@oracle.com>
Message-ID: <de5bb24804a0c5b66f0412382f338e415de6b1ed.camel@gmail.com>

Hi Magnus and Erik,

I have added the link to the repository to README and I have removed
the link to the mailing list thread. I have also recreated the GitHub
repository. Now it is a fork of the mentioned repository with two extra
commits containing README and the build scripts.

New webrev URL: http://cr.openjdk.java.net/~jakvanek/8215902/webrev.04/
Bug: https://bugs.openjdk.java.net/browse/JDK-8215902

Regards,

Jakub

On 2019-01-15 at 15:05 +0100, Magnus Ihse Bursie wrote:
> On 2018-12-25 16:19, Jakub Van?k wrote:
> > Hi,
> > 
> > please review this webrev. It is a successor of the softfloat-3
> > [patch]
> > thread (first email
> > 
http://mail.openjdk.java.net/pipermail/hotspot-runtime-dev/2018-November/031311.html
> > )
> > 
> > Changes since the last patch (v6):
> > 
> > - renamed --with-softloat* to --with-sflt* (it is more compact and
> > it
> >    corresponds to the old --with-sflt-lib=... option)
> > 
> > - license is now obtained via --with-sflt-license switch (so it is
> > not
> >    included in OpenJDK source tree)
> > 
> > - updated documentation (slight rewording, added the license
> > option)
> > 
> > - checks for default --with/--without behavior are in place again
> >    (I forgot them when I changed the way the library is detected)
> > 
> > - added a simple testcase - I found a disrepancy between softfloat
> > and
> >    system function behavior. When a float with bits 0x003FFFFF is
> >    added to 0x00000001, the correct result is 0x00400000, but the
> >    default software floating point implementation returns
> > 0x00000000.
> >    However I'm not sure where to put this test - now it is in
> >    test/hotspot/jtreg/compiler/floatingpoint.
> > 
> > - comments in code refer to CR 6757269 and newly JDK-8215902 too.
> > 
> > I have created a repository with SoftFloat-3e with build
> > configuration
> > specifically for OpenJDK on armel:
> > https://github.com/ev3dev-lang-java/softfloat-openjdk
> > 
> > I can add a link to it to the documentation.
> > 
> > Bug: https://bugs.openjdk.java.net/browse/JDK-8215902
> > Webrev: http://cr.openjdk.java.net/~jakvanek/8215902/webrev.02/
> 
> Hi Jakub,
> 
> In general this looks good.
> 
> Some comments:
> 
> I agree with Erik that you can add a link to your github project; 
> compiling SoftFloat is outside the scope of the OpenJDK build 
> instructions, but it can sure be helpful to lower the bar for users 
> wanting to do that. Just one question: any particular reason you
> didn't 
> create your github repo by forking the official 
> https://github.com/ucb-bar/berkeley-softfloat-3? That way, it would
> have 
> been easy for users to see that you were not adding any malicious or 
> suspicious code to the original SoftFloat distribution.
> 
> On the other hand, I think the link to 
> 
http://mail.openjdk.java.net/pipermail/aarch32-port-dev/2016-November/000611.html
>  
> is unnecessary and just creates clutter in the documentation. Please 
> remove it.
> 
> /Magnus
> > CI build: 
> > https://ci.adoptopenjdk.net/view/ev3dev/job/openjdk12_build_ev3_linux/67/
> > 
> > Cheers,
> > 
> > Jakub
> > 
> 
> 


From john.r.rose at oracle.com  Tue Jan 15 16:43:56 2019
From: john.r.rose at oracle.com (John Rose)
Date: Tue, 15 Jan 2019 08:43:56 -0800
Subject: RFR(XS): 8216549: Mismatched unsafe access to non escaping object
 fails
In-Reply-To: <877efbzh8a.fsf@redhat.com>
References: <877efbzh8a.fsf@redhat.com>
Message-ID: <35639127-2BC2-4005-BDDE-8324C366F0E3@oracle.com>

On Jan 11, 2019, at 1:16 AM, Roland Westrelin <rwestrel at redhat.com> wrote:
> 
> I simply propose to make non escaping allocations with mismatched
> accesses to be non scalar replaceable.

That's a good fix for now.

At some point, we may want to make the JIT more lenient in Valhalla,
at least for value type buffers[1].  The reason is that there are legitimate
reasons to process a small value type of multiple small fields in terms of
a larger primitive type.  The reason I'm thinking of is vectorizing operations
like comparison and hash-code on the small value type.  When we get
Java-level support for vectors (Panama Vector API) some value type
operations can be handled in an operation or two on a single vector.

Example:  For `__ByValue class IntPair { int x, y; }`, the comparison operator
can perhaps be optimized (by a platform-specific binder) as a `long`
comparison, or an MMX comparison.

It's not a concern yet, but here's a little bookmark FTR, showing Mandy's
work on buffers?

? John

[1]: http://cr.openjdk.java.net/~mchung/valhalla/webrevs/unsafe/private-buffer.00/ <http://cr.openjdk.java.net/~mchung/valhalla/webrevs/unsafe/private-buffer.00/>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190115/b7843ba7/attachment-0001.html>

From igor.veresov at oracle.com  Tue Jan 15 16:59:07 2019
From: igor.veresov at oracle.com (Igor Veresov)
Date: Tue, 15 Jan 2019 08:59:07 -0800
Subject: [12] RFR(S) 8196568: [Graal] LongMulOverflowTest.java fails with
 "runTestOverflow() did not overflow"
Message-ID: <4FBF36AA-ACF6-4FFA-899B-D9A1E91EA828@oracle.com>

Exact math instrinsics in Graal are failing TCK. Since we?re out of time for 12, in order to make Graal compliant, I?d like to turn off emission of the floating exact math nodes. This fix is only for 12, and is not going upstream. For 13 I?ll work on a proper fix.

Webrev: http://cr.openjdk.java.net/~iveresov/8196568/webrev.00/
JBS: https://bugs.openjdk.java.net/browse/JDK-8196568


Thanks,
Igor

From vladimir.kozlov at oracle.com  Tue Jan 15 17:07:34 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 15 Jan 2019 09:07:34 -0800
Subject: [12] RFR(S) 8196568: [Graal] LongMulOverflowTest.java fails with
 "runTestOverflow() did not overflow"
In-Reply-To: <4FBF36AA-ACF6-4FFA-899B-D9A1E91EA828@oracle.com>
References: <4FBF36AA-ACF6-4FFA-899B-D9A1E91EA828@oracle.com>
Message-ID: <4b81189c-4e40-7b9f-60f2-2b9b0f731230@oracle.com>

Looks good.

Thanks,
Vladimir

On 1/15/19 8:59 AM, Igor Veresov wrote:
> Exact math instrinsics in Graal are failing TCK. Since we?re out of time for 12, in order to make Graal compliant, I?d like to turn off emission of the floating exact math nodes. This fix is only for 12, and is not going upstream. For 13 I?ll work on a proper fix.
> 
> Webrev: http://cr.openjdk.java.net/~iveresov/8196568/webrev.00/
> JBS: https://bugs.openjdk.java.net/browse/JDK-8196568
> 
> 
> Thanks,
> Igor
> 

From dean.long at oracle.com  Tue Jan 15 17:10:04 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Tue, 15 Jan 2019 09:10:04 -0800
Subject: [12] RFR(S) 8196568: [Graal] LongMulOverflowTest.java fails with
 "runTestOverflow() did not overflow"
In-Reply-To: <4b81189c-4e40-7b9f-60f2-2b9b0f731230@oracle.com>
References: <4FBF36AA-ACF6-4FFA-899B-D9A1E91EA828@oracle.com>
 <4b81189c-4e40-7b9f-60f2-2b9b0f731230@oracle.com>
Message-ID: <7a228f09-a32d-3c46-efb5-f0d3ad144281@oracle.com>

+1

dl

On 1/15/19 9:07 AM, Vladimir Kozlov wrote:
> Looks good.
>
> Thanks,
> Vladimir
>
> On 1/15/19 8:59 AM, Igor Veresov wrote:
>> Exact math instrinsics in Graal are failing TCK. Since we?re out of 
>> time for 12, in order to make Graal compliant, I?d like to turn off 
>> emission of the floating exact math nodes. This fix is only for 12, 
>> and is not going upstream. For 13 I?ll work on a proper fix.
>>
>> Webrev: http://cr.openjdk.java.net/~iveresov/8196568/webrev.00/
>> JBS: https://bugs.openjdk.java.net/browse/JDK-8196568
>>
>>
>> Thanks,
>> Igor
>>


From tom.rodriguez at oracle.com  Tue Jan 15 17:34:43 2019
From: tom.rodriguez at oracle.com (Tom Rodriguez)
Date: Tue, 15 Jan 2019 09:34:43 -0800
Subject: [12] RFR(S) 8196568: [Graal] LongMulOverflowTest.java fails with
 "runTestOverflow() did not overflow"
In-Reply-To: <4FBF36AA-ACF6-4FFA-899B-D9A1E91EA828@oracle.com>
References: <4FBF36AA-ACF6-4FFA-899B-D9A1E91EA828@oracle.com>
Message-ID: <b599d877-fedb-28ec-d9dd-1a7cf195dd00@oracle.com>

Looks good.

tom

Igor Veresov wrote on 1/15/19 8:59 AM:
> Exact math instrinsics in Graal are failing TCK. Since we?re out of time for 12, in order to make Graal compliant, I?d like to turn off emission of the floating exact math nodes. This fix is only for 12, and is not going upstream. For 13 I?ll work on a proper fix.
> 
> Webrev: http://cr.openjdk.java.net/~iveresov/8196568/webrev.00/
> JBS: https://bugs.openjdk.java.net/browse/JDK-8196568
> 
> 
> Thanks,
> Igor
> 

From vladimir.x.ivanov at oracle.com  Tue Jan 15 17:46:02 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Tue, 15 Jan 2019 09:46:02 -0800
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes
 wrong post-dominating point
In-Reply-To: <87va2qvy64.fsf@redhat.com>
References: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
 <874lafzfiy.fsf@redhat.com> <09abefa3-37b4-5657-22f9-e06f144a1867@oracle.com>
 <21fc0b3a-87d5-0c31-0d63-75eca3c05e5b@oracle.com>
 <14b4d2c8-0cf4-4c49-309b-1838da8536a0@oracle.com>
 <6ebc3b71-bc58-0a14-20c9-aaac3a705f91@oracle.com>
 <5e4f7d3b-6d7d-3017-2926-13e932820205@oracle.com> <87va2qvy64.fsf@redhat.com>
Message-ID: <cdc71b53-e0f6-71ac-f039-3d8408efc7e2@oracle.com>


On 15/01/2019 05:38, Roland Westrelin wrote:
> 
> Hi Vladimir K & Vladimir I,
> 
>>> And that's the other way to fix the crash: initiate new PhaseIdealLoop iteration right away if any strip mined loops are
>>> introduced.
>>
>> Got it. So the issue is that strip mining invalidated IDOM information generated at the beginning of
>> PhaseIdealLoop::build_and_optimize().
> 
> I don't think that's accurate. The idom is changed when a loop limit
> check is inserted (so that's unrelated to strip mining AFAICT). As
> Vladimir said, when the loop limit check is inserted, the idom of the
> region is fixed by:
> 
>      Node* nrdom = dom_lca(ridom, new_iff);
>      set_idom(rgn, nrdom, dom_depth(rgn));
> 
> which does:
> 
>    Node *dom_lca( Node *n1, Node *n2 ) const {
>      return find_non_split_ctrl(dom_lca_internal(n1, n2));
>    }
> 
> and because of the find_non_split_ctrl(), the idom is set to a region
> rather than an if.
> 
> That's broken and I'm confused as to why a straightforward change of the
> logic above:
> 
> diff --git a/src/hotspot/share/opto/loopPredicate.cpp b/src/hotspot/share/opto/loopPredicate.cpp
> --- a/src/hotspot/share/opto/loopPredicate.cpp
> +++ b/src/hotspot/share/opto/loopPredicate.cpp
> @@ -160,7 +160,7 @@
>     // When called from beautify_loops() idom is not constructed yet.
>     if (_idom != NULL) {
>       Node* ridom = idom(rgn);
> -    Node* nrdom = dom_lca(ridom, new_iff);
> +    Node* nrdom = dom_lca_internal(ridom, new_iff);
>       set_idom(rgn, nrdom, dom_depth(rgn));
>     }
>   
> is not good enough.

Fair point. So you're saying that dom_lca()/find_non_split_ctrl() should 
never be used to set IDOM, right? And all the places which require 
non-split point for an IDOM should explicitly normalize it?

I checked the codebase and all places, where 
dom_lca()/find_non_split_ctrl() are used, IDOM is left intact except 
(PhaseIdealLoop::create_new_if_for_predicate).

So, I'm fine with the fix you propose (though I'm still not happy about 
the distinction between IDOM & dom_lca()/find_non_split_ctrl()).

Best regards,
Vladimir Ivanov

From vladimir.kozlov at oracle.com  Tue Jan 15 17:56:09 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 15 Jan 2019 09:56:09 -0800
Subject: RFR (M): 8216314: SIGILL in CodeHeapState::print_names()
In-Reply-To: <1c39e69b-2830-0ced-bbb2-8b5003972695@oracle.com>
References: <98FFB93F-674D-4994-953F-B35572E316A2@sap.com>
 <1c39e69b-2830-0ced-bbb2-8b5003972695@oracle.com>
Message-ID: <3719e6a2-54a5-283d-ee2b-fedb0c0110a2@oracle.com>

+1. Looks good.

Thanks,
Vladimir

On 1/15/19 3:45 AM, Tobias Hartmann wrote:
> Hi Lutz,
> 
> thanks for the discussions and making these changes. The fix looks good to me.
> 
> Minor style issue (no new webrev required) in codeHeapState.cpp:1289/1290/1305/1305/1306: Please add
> a newline after '{' (and before '}') or at least a whitespace.
> 
> Best regards,
> Tobias
> 
> On 15.01.19 12:26, Schmidt, Lutz wrote:
>> Dear all,
>>
>> may I please request reviews for this fix, hardening CodeHeap Analytics to not fail when used in high-load (stress) scenarios. There was quite a bit of preliminary discussion, all documented in the "Comments" section of the bug.
>>
>> Bug:    https://bugs.openjdk.java.net/browse/JDK-8216314
>> Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8216314.01/
>>
>> Thank you!
>> Lutz
>>   
>>

From vladimir.kozlov at oracle.com  Tue Jan 15 18:31:05 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 15 Jan 2019 10:31:05 -0800
Subject: [12] RFR(XS) 8215748: Application fails when executed with Graal
In-Reply-To: <63a38944-2d0a-ef61-bea5-e709b4623692@oracle.com>
References: <63a38944-2d0a-ef61-bea5-e709b4623692@oracle.com>
Message-ID: <6b7700a3-c85a-1b9d-d314-4cd57c58c74e@oracle.com>

Looks good.

Thanks,
Vladimir

On 1/14/19 11:09 PM, Tom Rodriguez wrote:
> http://cr.openjdk.java.net/~never/8215748/webrev
> https://bugs.openjdk.java.net/browse/JDK-8215748
> 
> If an interface method attempts to invoke an array clone method, JVMCI doesn't let you resolve the 
> invoke properly which can result in performance problems or unexpected NullPointerExceptions.? clone 
> is publicly visible on arrays but is protected in Object.? HotSpot doesn't have an actual Method* 
> for the array clone operations, it just reuses Object.clone.? This is accomplished with some 
> trickery in the linkResolver.cpp that adjusts the visibility during resolution if an array class is 
> involved.? JVMCI only deals with concrete methods so when a call site is resolved you get back the 
> real Object.clone.? If you try to use resolveMethod on it then it will resolve it relative to Object 
> instead of using the array type.? This works ok when the accessing class is an class but for 
> interface types it fails.? In benign cases Graal just ends up falling back to a regular call which 
> is slower than normal. ?In this case we were attempting to resolve an invoke for a profiled call 
> site and got back null which shouldn't happen.? The fix is the use the array class as the method 
> type in this particular case which mirrors the logic in the linkResolver.cpp that adjusts the 
> visibility check. Tested with Spark and the new unit test.? mach5 testing is ongoing.

From igor.veresov at oracle.com  Tue Jan 15 18:37:53 2019
From: igor.veresov at oracle.com (Igor Veresov)
Date: Tue, 15 Jan 2019 10:37:53 -0800
Subject: [12] RFR(S) 8196568: [Graal] LongMulOverflowTest.java fails with
 "runTestOverflow() did not overflow"
In-Reply-To: <b599d877-fedb-28ec-d9dd-1a7cf195dd00@oracle.com>
References: <4FBF36AA-ACF6-4FFA-899B-D9A1E91EA828@oracle.com>
 <b599d877-fedb-28ec-d9dd-1a7cf195dd00@oracle.com>
Message-ID: <C6BAEED1-5A2D-4A05-9455-4442CFF245D8@oracle.com>

Vladimir, Dean, and Tom,

Thanks for the reviews!

Igor

> On Jan 15, 2019, at 9:34 AM, Tom Rodriguez <tom.rodriguez at oracle.com> wrote:
> 
> Looks good.
> 
> tom
> 
> Igor Veresov wrote on 1/15/19 8:59 AM:
>> Exact math instrinsics in Graal are failing TCK. Since we?re out of time for 12, in order to make Graal compliant, I?d like to turn off emission of the floating exact math nodes. This fix is only for 12, and is not going upstream. For 13 I?ll work on a proper fix.
>> Webrev: http://cr.openjdk.java.net/~iveresov/8196568/webrev.00/
>> JBS: https://bugs.openjdk.java.net/browse/JDK-8196568
>> Thanks,
>> Igor


From igor.veresov at oracle.com  Tue Jan 15 18:43:27 2019
From: igor.veresov at oracle.com (Igor Veresov)
Date: Tue, 15 Jan 2019 10:43:27 -0800
Subject: [12] RFR(XS) 8215748: Application fails when executed with Graal
In-Reply-To: <63a38944-2d0a-ef61-bea5-e709b4623692@oracle.com>
References: <63a38944-2d0a-ef61-bea5-e709b4623692@oracle.com>
Message-ID: <97D08F9B-06A9-40FE-8FCB-48D3F36C12DE@oracle.com>

Looks good.

Igor

> On Jan 14, 2019, at 11:09 PM, Tom Rodriguez <tom.rodriguez at oracle.com> wrote:
> 
> http://cr.openjdk.java.net/~never/8215748/webrev
> https://bugs.openjdk.java.net/browse/JDK-8215748
> 
> If an interface method attempts to invoke an array clone method, JVMCI doesn't let you resolve the invoke properly which can result in performance problems or unexpected NullPointerExceptions.  clone is publicly visible on arrays but is protected in Object.  HotSpot doesn't have an actual Method* for the array clone operations, it just reuses Object.clone.  This is accomplished with some trickery in the linkResolver.cpp that adjusts the visibility during resolution if an array class is involved.  JVMCI only deals with concrete methods so when a call site is resolved you get back the real Object.clone.  If you try to use resolveMethod on it then it will resolve it relative to Object instead of using the array type.  This works ok when the accessing class is an class but for interface types it fails.  In benign cases Graal just ends up falling back to a regular call which is slower than normal.  In this case we were attempting to resolve an invoke for a profiled call site and got back null which shouldn't happen.  The fix is the use the array class as the method type in this particular case which mirrors the logic in the linkResolver.cpp that adjusts the visibility check. Tested with Spark and the new unit test.  mach5 testing is ongoing.


From vivek.r.deshpande at intel.com  Tue Jan 15 19:27:48 2019
From: vivek.r.deshpande at intel.com (Deshpande, Vivek R)
Date: Tue, 15 Jan 2019 19:27:48 +0000
Subject: RFR(S):8216050:X86: Fix for Superword optimization fails with
 assert(0 <= i && i < _len) failed: illegal index
In-Reply-To: <2a9920a1-0ec3-87cf-2c71-7bdcfcb796be@oracle.com>
References: <53E8E64DB2403849AFD89B7D4DAC8B2A9A148B9C@ORSMSX106.amr.corp.intel.com>
 <045def80-a536-ae87-7384-09b30e5a8d78@oracle.com>
 <53E8E64DB2403849AFD89B7D4DAC8B2A9A14A45C@ORSMSX106.amr.corp.intel.com>
 <14067057-ec56-8c07-8f79-d1a29c7e20b7@oracle.com>
 <53E8E64DB2403849AFD89B7D4DAC8B2A9A14CD5E@ORSMSX106.amr.corp.intel.com>
 <2a9920a1-0ec3-87cf-2c71-7bdcfcb796be@oracle.com>
Message-ID: <53E8E64DB2403849AFD89B7D4DAC8B2A9A14DF41@ORSMSX106.amr.corp.intel.com>

Thanks Vladimir and Tobias for the review.
I have pushed the change.

Regards,
Vivek

-----Original Message-----
From: Vladimir Kozlov [mailto:vladimir.kozlov at oracle.com] 
Sent: Monday, January 14, 2019 4:26 PM
To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; Tobias Hartmann <tobias.hartmann at oracle.com>; hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-dev at openjdk.java.net>
Cc: Raj, Guru <guru.raj at intel.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>
Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails with assert(0 <= i && i < _len) failed: illegal index

On 1/14/19 4:17 PM, Deshpande, Vivek R wrote:
> Hi Vladimir
> 
> Thanks for looking at the patch.
> The MulAddS2I node gets packed in follow_use_defs() with this approach in which we just perform swaps in follow_def_uses and return false.

Got it. I confused follow_use_defs() with follow_def_uses().

Changes are good.

Vladimir

> This way MulAddS2I nodes gets the right alignment of multiple of 4 from its outs.
> If we return true after the swaps in follow_def_uses(), it gets alignment  as multiple of 2(from LoadS) for packing, instead of multiple of 4.
> 
> Regards,
> Vivek
> 
> -----Original Message-----
> From: Vladimir Kozlov [mailto:vladimir.kozlov at oracle.com]
> Sent: Monday, January 14, 2019 2:37 PM
> To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; Tobias Hartmann 
> <tobias.hartmann at oracle.com>; hotspot-compiler-dev at openjdk.java.net 
> compiler <hotspot-compiler-dev at openjdk.java.net>
> Cc: Raj, Guru <guru.raj at intel.com>
> Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails 
> with assert(0 <= i && i < _len) failed: illegal index
> 
> Hi Vivek,
> 
> I do not understand changes in superword.cpp.
> 
> muladds2i will never be packed in follow_def_uses() since you return 'false' for muladds2i in all cases when u1 != u2 (even when i1 == i2). Is it intentional?
> 
> Thanks,
> Vladimir
> 
> On 1/11/19 11:38 AM, Deshpande, Vivek R wrote:
>> Hi Tobias
>>
>> Thanks for reviewing the patch.
>> I have made the changes according to your suggestion.
>> In this webrev:
>> http://cr.openjdk.java.net/~vdeshpande/8216050/webrev.01/
>> I have fix for the crash reported in the 8216050.
>>
>> The lower cost is needed for generation of vpdpwssd instruction, by combining AddVI and MulAddVS2VI.
>> For other instructions pmaddwd and vpmaddwd, they get generated on platforms upto skylake with default cost.
>>
>> I have updated the bug also with the link to webrev.
>>
>> I have created a different bug JDK-8216580 for
>>    3) Fix generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have different control RangeCheck nodes
>>        for a[i] and a[i+1] accesses in same MulAddS2I node
>>
>> Thank you.
>> Regards,
>> Vivek
>>
>> -----Original Message-----
>> From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com]
>> Sent: Friday, January 11, 2019 4:49 AM
>> To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; 
>> hotspot-compiler-dev at openjdk.java.net compiler 
>> <hotspot-compiler-dev at openjdk.java.net>
>> Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, 
>> Sandhya <sandhya.viswanathan at intel.com>; Raj, Guru 
>> <guru.raj at intel.com>
>> Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails 
>> with assert(0 <= i && i < _len) failed: illegal index
>>
>> Hi Vivek,
>>
>> On 11.01.19 07:58, Deshpande, Vivek R wrote:
>>> 1) Fix for the crash by matching the operand by swapping to right positions.
>>
>> Looks good but the change to loopopts.cpp:530 screwed up the indentation around the ifs, please fix.
>>
>>> 2) Cost based generation of vpdpwssd instruction.
>>
>> Other instructions added by JDK-8214751 still miss a cost definition, for example:
>> http://hg.openjdk.java.net/jdk/jdk/rev/4bb6e0871bf7#l5.20
>>
>>> 3) Fix generation of vector code by allowing adjacent LoadS nodes to 
>>> be isomorphic when they have different control RangeCheck nodes
>>>   ????for a[i] and a[i+1] accesses in same MulAddS2I node
>>
>> This is unrelated to the original bug, right? If so, this should be integrated with a separate RFE.
>>
>> Thanks,
>> Tobias
>>

From dean.long at oracle.com  Tue Jan 15 20:07:51 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Tue, 15 Jan 2019 12:07:51 -0800
Subject: RFR(S): 8216556: Unnecessary liveness computation with JVMTI
In-Reply-To: <01121e2319ea44bf8aee088ffb32a617@sap.com>
References: <88842ba1a169406d9628ab06665bd787@sap.com>
 <9c7afb40-cc2b-9ae8-fb70-4ac3bacb72da@oracle.com>
 <3a600790198e4bbbb6f253daf0af8ff0@sap.com>
 <d6a8ad3b4cb94e18ad2de7f05fd1c1dd@sap.com>
 <01121e2319ea44bf8aee088ffb32a617@sap.com>
Message-ID: <3e7b089a-82dc-1123-7b91-7af1a741033d@oracle.com>

+1

dl

On 1/15/19 3:05 AM, Doerr, Martin wrote:
> Hi Vladimir, Dean and Claes,
>
> thank you for reviewing.
> I assume the version which moves the implementation of should_retain_local_variables() to the hpp file (as suggested by Claes) is fine.
> I'll push this version if there are no objections.
>
> Best regards,
> Martin
>
>
> -----Original Message-----
> From: hotspot-compiler-dev <hotspot-compiler-dev-bounces at openjdk.java.net> On Behalf Of Doerr, Martin
> Sent: Montag, 14. Januar 2019 09:31
> To: Claes Redestad <claes.redestad at oracle.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> Subject: [CAUTION] RE: RFR(S): 8216556: Unnecessary liveness computation with JVMTI
>
> Hi Claes,
>
> excellent proposal. Thanks. I had not noticed that it currently is in a cpp file.
>
> New webrev:
> http://cr.openjdk.java.net/~mdoerr/8216556_JVMTI_liveness/webrev.01/
>
> What I still don't really like is that we're passing MethodLivenessResult objects on stack via 3 compilation units.
> But I don't know if it's worth refactoring the code.
>
> Best regards,
> Martin
>
>
> -----Original Message-----
> From: Claes Redestad <claes.redestad at oracle.com>
> Sent: Freitag, 11. Januar 2019 16:45
> To: Doerr, Martin <martin.doerr at sap.com>
> Subject: Re: RFR(S): 8216556: Unnecessary liveness computation with JVMTI
>
> Hi,
>
>    just a random thought, but if you're optimizing this and got some
> measure where it matters(?), maybe you should also try inlining
> ciEnv::should_retain_local_variables(), i.e., move definition to
> ciEnv.hpp. If it doesn't bloat static binary size it seems like it won't
> hurt, at least.
>
> /Claes
>
> On 2019-01-11 13:55, Doerr, Martin wrote:
>> Hi,
>>
>> I'd like to contribute a small JIT improvement for JVMTI to avoid
>> calling raw_liveness_at_bci when its result is not needed.
>>
>> Bug with description:
>>
>> https://bugs.openjdk.java.net/browse/JDK-8216556
>>
>> Webrev:
>>
>> http://cr.openjdk.java.net/~mdoerr/8216556_JVMTI_liveness/webrev.00/
>>
>> Please review.
>>
>> Best regards,
>>
>> Martin
>>


From vladimir.x.ivanov at oracle.com  Wed Jan 16 01:21:04 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Tue, 15 Jan 2019 17:21:04 -0800
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes
 wrong post-dominating point
In-Reply-To: <cdc71b53-e0f6-71ac-f039-3d8408efc7e2@oracle.com>
References: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
 <874lafzfiy.fsf@redhat.com> <09abefa3-37b4-5657-22f9-e06f144a1867@oracle.com>
 <21fc0b3a-87d5-0c31-0d63-75eca3c05e5b@oracle.com>
 <14b4d2c8-0cf4-4c49-309b-1838da8536a0@oracle.com>
 <6ebc3b71-bc58-0a14-20c9-aaac3a705f91@oracle.com>
 <5e4f7d3b-6d7d-3017-2926-13e932820205@oracle.com> <87va2qvy64.fsf@redhat.com>
 <cdc71b53-e0f6-71ac-f039-3d8408efc7e2@oracle.com>
Message-ID: <07fe96e2-72fd-48c4-32e8-af43840aef4c@oracle.com>

Updated webrev (with Roland's proposal):
   http://cr.openjdk.java.net/~vlivanov/8215757/webrev.01/

Testing: failing test (replay), hs-precheckin-comp, hs-tier1, hs-tier2 
(in progress)

Best regards,
Vladimir Ivanov

On 15/01/2019 09:46, Vladimir Ivanov wrote:
> 
> 
> On 15/01/2019 05:38, Roland Westrelin wrote:
>>
>> Hi Vladimir K & Vladimir I,
>>
>>>> And that's the other way to fix the crash: initiate new 
>>>> PhaseIdealLoop iteration right away if any strip mined loops are
>>>> introduced.
>>>
>>> Got it. So the issue is that strip mining invalidated IDOM 
>>> information generated at the beginning of
>>> PhaseIdealLoop::build_and_optimize().
>>
>> I don't think that's accurate. The idom is changed when a loop limit
>> check is inserted (so that's unrelated to strip mining AFAICT). As
>> Vladimir said, when the loop limit check is inserted, the idom of the
>> region is fixed by:
>>
>> ???? Node* nrdom = dom_lca(ridom, new_iff);
>> ???? set_idom(rgn, nrdom, dom_depth(rgn));
>>
>> which does:
>>
>> ?? Node *dom_lca( Node *n1, Node *n2 ) const {
>> ???? return find_non_split_ctrl(dom_lca_internal(n1, n2));
>> ?? }
>>
>> and because of the find_non_split_ctrl(), the idom is set to a region
>> rather than an if.
>>
>> That's broken and I'm confused as to why a straightforward change of the
>> logic above:
>>
>> diff --git a/src/hotspot/share/opto/loopPredicate.cpp 
>> b/src/hotspot/share/opto/loopPredicate.cpp
>> --- a/src/hotspot/share/opto/loopPredicate.cpp
>> +++ b/src/hotspot/share/opto/loopPredicate.cpp
>> @@ -160,7 +160,7 @@
>> ??? // When called from beautify_loops() idom is not constructed yet.
>> ??? if (_idom != NULL) {
>> ????? Node* ridom = idom(rgn);
>> -??? Node* nrdom = dom_lca(ridom, new_iff);
>> +??? Node* nrdom = dom_lca_internal(ridom, new_iff);
>> ????? set_idom(rgn, nrdom, dom_depth(rgn));
>> ??? }
>> is not good enough.
> 
> Fair point. So you're saying that dom_lca()/find_non_split_ctrl() should 
> never be used to set IDOM, right? And all the places which require 
> non-split point for an IDOM should explicitly normalize it?
> 
> I checked the codebase and all places, where 
> dom_lca()/find_non_split_ctrl() are used, IDOM is left intact except 
> (PhaseIdealLoop::create_new_if_for_predicate).
> 
> So, I'm fine with the fix you propose (though I'm still not happy about 
> the distinction between IDOM & dom_lca()/find_non_split_ctrl()).
> 
> Best regards,
> Vladimir Ivanov

From vladimir.kozlov at oracle.com  Wed Jan 16 01:56:48 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 15 Jan 2019 17:56:48 -0800
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes
 wrong post-dominating point
In-Reply-To: <07fe96e2-72fd-48c4-32e8-af43840aef4c@oracle.com>
References: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
 <874lafzfiy.fsf@redhat.com> <09abefa3-37b4-5657-22f9-e06f144a1867@oracle.com>
 <21fc0b3a-87d5-0c31-0d63-75eca3c05e5b@oracle.com>
 <14b4d2c8-0cf4-4c49-309b-1838da8536a0@oracle.com>
 <6ebc3b71-bc58-0a14-20c9-aaac3a705f91@oracle.com>
 <5e4f7d3b-6d7d-3017-2926-13e932820205@oracle.com> <87va2qvy64.fsf@redhat.com>
 <cdc71b53-e0f6-71ac-f039-3d8408efc7e2@oracle.com>
 <07fe96e2-72fd-48c4-32e8-af43840aef4c@oracle.com>
Message-ID: <a3789117-e5e1-8d79-d7c4-3fb0d5e73570@oracle.com>

Looks good.

I tried to look on history of this code and it was from the first day of loop predicates implementation:

http://hg.openjdk.java.net/jdk9/hs/hotspot/rev/b2b6a9bf6238#l5.184

Thanks,
Vladimir

On 1/15/19 5:21 PM, Vladimir Ivanov wrote:
> Updated webrev (with Roland's proposal):
>  ? http://cr.openjdk.java.net/~vlivanov/8215757/webrev.01/
> 
> Testing: failing test (replay), hs-precheckin-comp, hs-tier1, hs-tier2 (in progress)
> 
> Best regards,
> Vladimir Ivanov
> 
> On 15/01/2019 09:46, Vladimir Ivanov wrote:
>>
>>
>> On 15/01/2019 05:38, Roland Westrelin wrote:
>>>
>>> Hi Vladimir K & Vladimir I,
>>>
>>>>> And that's the other way to fix the crash: initiate new PhaseIdealLoop iteration right away if 
>>>>> any strip mined loops are
>>>>> introduced.
>>>>
>>>> Got it. So the issue is that strip mining invalidated IDOM information generated at the 
>>>> beginning of
>>>> PhaseIdealLoop::build_and_optimize().
>>>
>>> I don't think that's accurate. The idom is changed when a loop limit
>>> check is inserted (so that's unrelated to strip mining AFAICT). As
>>> Vladimir said, when the loop limit check is inserted, the idom of the
>>> region is fixed by:
>>>
>>> ???? Node* nrdom = dom_lca(ridom, new_iff);
>>> ???? set_idom(rgn, nrdom, dom_depth(rgn));
>>>
>>> which does:
>>>
>>> ?? Node *dom_lca( Node *n1, Node *n2 ) const {
>>> ???? return find_non_split_ctrl(dom_lca_internal(n1, n2));
>>> ?? }
>>>
>>> and because of the find_non_split_ctrl(), the idom is set to a region
>>> rather than an if.
>>>
>>> That's broken and I'm confused as to why a straightforward change of the
>>> logic above:
>>>
>>> diff --git a/src/hotspot/share/opto/loopPredicate.cpp b/src/hotspot/share/opto/loopPredicate.cpp
>>> --- a/src/hotspot/share/opto/loopPredicate.cpp
>>> +++ b/src/hotspot/share/opto/loopPredicate.cpp
>>> @@ -160,7 +160,7 @@
>>> ??? // When called from beautify_loops() idom is not constructed yet.
>>> ??? if (_idom != NULL) {
>>> ????? Node* ridom = idom(rgn);
>>> -??? Node* nrdom = dom_lca(ridom, new_iff);
>>> +??? Node* nrdom = dom_lca_internal(ridom, new_iff);
>>> ????? set_idom(rgn, nrdom, dom_depth(rgn));
>>> ??? }
>>> is not good enough.
>>
>> Fair point. So you're saying that dom_lca()/find_non_split_ctrl() should never be used to set 
>> IDOM, right? And all the places which require non-split point for an IDOM should explicitly 
>> normalize it?
>>
>> I checked the codebase and all places, where dom_lca()/find_non_split_ctrl() are used, IDOM is 
>> left intact except (PhaseIdealLoop::create_new_if_for_predicate).
>>
>> So, I'm fine with the fix you propose (though I'm still not happy about the distinction between 
>> IDOM & dom_lca()/find_non_split_ctrl()).
>>
>> Best regards,
>> Vladimir Ivanov

From tom.rodriguez at oracle.com  Wed Jan 16 07:00:35 2019
From: tom.rodriguez at oracle.com (Tom Rodriguez)
Date: Tue, 15 Jan 2019 23:00:35 -0800
Subject: [12] RFR(XS) 8215748: Application fails when executed with Graal
In-Reply-To: <a3fbae06-4428-7580-c649-410265bd1f5a@oracle.com>
References: <63a38944-2d0a-ef61-bea5-e709b4623692@oracle.com>
 <a3fbae06-4428-7580-c649-410265bd1f5a@oracle.com>
Message-ID: <9ad421e8-3075-16ff-6aaf-80517a24c4d1@oracle.com>


Tobias Hartmann wrote on 1/15/19 2:32 AM:
> Hi Tom,
> 
> this looks good to me. You might want to reference the related code in
> LinkResolver::check_method_accessability in your comment(no new webrev required).

Good suggestion.  I added a mention of that at the end of the comment. 
Thanks!

tom

> 
> Best regards,
> Tobias
> 
> On 15.01.19 08:09, Tom Rodriguez wrote:
>> http://cr.openjdk.java.net/~never/8215748/webrev
>> https://bugs.openjdk.java.net/browse/JDK-8215748
>>
>> If an interface method attempts to invoke an array clone method, JVMCI doesn't let you resolve the
>> invoke properly which can result in performance problems or unexpected NullPointerExceptions.? clone
>> is publicly visible on arrays but is protected in Object.? HotSpot doesn't have an actual Method*
>> for the array clone operations, it just reuses Object.clone.? This is accomplished with some
>> trickery in the linkResolver.cpp that adjusts the visibility during resolution if an array class is
>> involved.? JVMCI only deals with concrete methods so when a call site is resolved you get back the
>> real Object.clone.? If you try to use resolveMethod on it then it will resolve it relative to Object
>> instead of using the array type.? This works ok when the accessing class is an class but for
>> interface types it fails.? In benign cases Graal just ends up falling back to a regular call which
>> is slower than normal. ?In this case we were attempting to resolve an invoke for a profiled call
>> site and got back null which shouldn't happen.? The fix is the use the array class as the method
>> type in this particular case which mirrors the logic in the linkResolver.cpp that adjusts the
>> visibility check. Tested with Spark and the new unit test.? mach5 testing is ongoing.

From lutz.schmidt at sap.com  Wed Jan 16 08:30:29 2019
From: lutz.schmidt at sap.com (Schmidt, Lutz)
Date: Wed, 16 Jan 2019 08:30:29 +0000
Subject: RFR (M): 8216314: SIGILL in CodeHeapState::print_names()
In-Reply-To: <3719e6a2-54a5-283d-ee2b-fedb0c0110a2@oracle.com>
References: <98FFB93F-674D-4994-953F-B35572E316A2@sap.com>
 <1c39e69b-2830-0ced-bbb2-8b5003972695@oracle.com>
 <3719e6a2-54a5-283d-ee2b-fedb0c0110a2@oracle.com>
Message-ID: <4E570A00-3567-4333-A268-3C0100DF0417@sap.com>

Thank you, Vladimir!
I'll go ahead and push.
Regards,
Lutz

?On 15.01.19, 18:56, "Vladimir Kozlov" <vladimir.kozlov at oracle.com> wrote:

    +1. Looks good.
    
    Thanks,
    Vladimir
    
    On 1/15/19 3:45 AM, Tobias Hartmann wrote:
    > Hi Lutz,
    > 
    > thanks for the discussions and making these changes. The fix looks good to me.
    > 
    > Minor style issue (no new webrev required) in codeHeapState.cpp:1289/1290/1305/1305/1306: Please add
    > a newline after '{' (and before '}') or at least a whitespace.
    > 
    > Best regards,
    > Tobias
    > 
    > On 15.01.19 12:26, Schmidt, Lutz wrote:
    >> Dear all,
    >>
    >> may I please request reviews for this fix, hardening CodeHeap Analytics to not fail when used in high-load (stress) scenarios. There was quite a bit of preliminary discussion, all documented in the "Comments" section of the bug.
    >>
    >> Bug:    https://bugs.openjdk.java.net/browse/JDK-8216314
    >> Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8216314.01/
    >>
    >> Thank you!
    >> Lutz
    >>   
    >>
    

From rwestrel at redhat.com  Wed Jan 16 08:43:28 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Wed, 16 Jan 2019 09:43:28 +0100
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes
 wrong post-dominating point
In-Reply-To: <07fe96e2-72fd-48c4-32e8-af43840aef4c@oracle.com>
References: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
 <874lafzfiy.fsf@redhat.com> <09abefa3-37b4-5657-22f9-e06f144a1867@oracle.com>
 <21fc0b3a-87d5-0c31-0d63-75eca3c05e5b@oracle.com>
 <14b4d2c8-0cf4-4c49-309b-1838da8536a0@oracle.com>
 <6ebc3b71-bc58-0a14-20c9-aaac3a705f91@oracle.com>
 <5e4f7d3b-6d7d-3017-2926-13e932820205@oracle.com> <87va2qvy64.fsf@redhat.com>
 <cdc71b53-e0f6-71ac-f039-3d8408efc7e2@oracle.com>
 <07fe96e2-72fd-48c4-32e8-af43840aef4c@oracle.com>
Message-ID: <87pnsxvvpr.fsf@redhat.com>


>    http://cr.openjdk.java.net/~vlivanov/8215757/webrev.01/

Looks good to me.

Roland.

From rkennke at redhat.com  Wed Jan 16 09:42:24 2019
From: rkennke at redhat.com (Roman Kennke)
Date: Wed, 16 Jan 2019 10:42:24 +0100
Subject: RFR(S): 8217042: Shenandoah: write barrier on backedge of strip
 mined loop causes c2 crash at expansion time
In-Reply-To: <874laaxpgm.fsf@redhat.com>
References: <874laaxpgm.fsf@redhat.com>
Message-ID: <fdc70707-c365-9ac8-24dd-4ec03be877dd@redhat.com>

Looks good to me. Fix the comments as Aleksey already noted.

Thanks,
Roman

> http://cr.openjdk.java.net/~roland/8217042/webrev.00/
> 
> If a write barrier is in the body of the outer strip mined loop,
> expanding it causes loop strip mining verification code to fail. This is
> worked around by turning the strip mined loop nest into a regular
> counted loop nest so verification code doesn't trigger. The logic that
> takes care of that breaks when the write barrier is on the backedge of
> the strip mined loop because it is applied after the barrier is
> expanded. The fix I propose is to move that logic before barrier
> expansion.
> 
> Roland.
> 

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190116/36a938fe/signature-0001.asc>

From rkennke at redhat.com  Wed Jan 16 09:43:05 2019
From: rkennke at redhat.com (Roman Kennke)
Date: Wed, 16 Jan 2019 10:43:05 +0100
Subject: RFR(S): 8217043: Shenandoah: SIGSEGV in Type::meet_helper() at
 barrier expansion time
In-Reply-To: <87y37mwa5j.fsf@redhat.com>
References: <87y37mwa5j.fsf@redhat.com>
Message-ID: <98bfac69-a4ed-efa3-e00d-be267b15b932@redhat.com>

Ok.

Thanks,
Roman

> http://cr.openjdk.java.net/~roland/8217043/webrev.00/
> 
> The ShenandoahBarrierNode::needs_barrier_impl() encounters a
> CallLeafNode (from a write barrier) and tries to get the type of n which
> is a tuple, not a pointer and this causes a null pointer
> dereference. The write barrier runtime call should anyway prevent an
> optimization of the barrier and to be on the safe side, any call should.
> 
> Roland.
> 

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190116/1f142fb6/signature.asc>

From jatin.bhateja at intel.com  Wed Jan 16 09:53:04 2019
From: jatin.bhateja at intel.com (Bhateja, Jatin)
Date: Wed, 16 Jan 2019 09:53:04 +0000
Subject: [aarch64-port-dev ] RFR(M): 8212043: Add floating-point
 Math.min/max intrinsics
In-Reply-To: <CAMi_1tV6=A9F2mkBGYSQQTssNkkJS+1E5Qkhvs=yv0vL=Rtjkw@mail.gmail.com>
References: <DB7PR08MB31155E7EBF83657CB1C17F9996F90@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <fa0af30b-5512-97a2-555d-7885b4ce6a6d@redhat.com>
 <DB7PR08MB3115E97D35A9812A164F1E0596A80@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <5bf1c593-2e96-8a10-88c6-98afdd9a04f2@redhat.com>
 <DB7PR08MB31158A711E73D7D37BB3F62F96A50@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <0c7de175-17d8-f3f5-a47b-2b9b3f45af71@redhat.com>
 <AM6PR08MB311141B58BB955B94A8E143C96A70@AM6PR08MB3111.eurprd08.prod.outlook.com>
 <d42679f1-8696-3011-b23f-0f8f4d962f1c@redhat.com>
 <DB7PR08MB31156B42C681921D8E75B6AC96A00@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <e5e8574e-60ce-8837-54a3-eb97dc121306@redhat.com>
 <1e7af2c4-8610-2ee9-9955-298ffb715fa7@redhat.com>
 <DB7PR08MB3115D088A4A8B4B34EF723A696BD0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <06048878-effe-7d24-bb87-b140e662aeb8@redhat.com>
 <7c97719b-e83a-ba40-43a3-8cec8273df1c@redhat.com>
 <3df16666-a10b-41bb-7439-b967e1d76735@redhat.com>
 <4a10fa17-197b-2da9-7890-9544a407832f@redhat.com>
 <c2b74b56-8da5-6da1-8680-a65f749469fe@redhat.com>
 <DB7PR08MB3115666749428943CBB7B1A196BE0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <CAMi_1tV6=A9F2mkBGYSQQTssNkkJS+1E5Qkhvs=yv0vL=Rtjkw@mail.gmail.com>
Message-ID: <A66BBE673E08E1428E3A918AE4D5B32CED1492@BGSMSX106.gar.corp.intel.com>

Hi Pengfei,

Your final patch (http://cr.openjdk.java.net/~pli/rfr/8212043/webrev.04/)
to support floating point scalar max/min intrinsic also included following test case which is not up streamed to jdk repository.

test/hotspot/jtreg/compiler/intrinsics/math/TestFpMinMaxIntrinsics.java

Can you kindly add this test case, I?m working on supporting these new intrinsics for X86 platform and will like to use the test case you created.

Thanks and Regards,
Jatin Bhateja

From: Pengfei Li (Arm Technology China) <Pengfei.Li at arm.com<mailto:Pengfei.Li at arm.com>>
---------- Forwarded message ---------
Date: Wed, Dec 19, 2018 at 6:38 PM
Subject: RE: [aarch64-port-dev ] RFR(M): 8212043: Add floating-point Math.min/max intrinsics
To: Andrew Dinn <adinn at redhat.com<mailto:adinn at redhat.com>>, Andrew Haley <aph at redhat.com<mailto:aph at redhat.com>>, hotspot-compiler-dev at openjdk.java.net<mailto:hotspot-compiler-dev at openjdk.java.net> <hotspot-compiler-dev at openjdk.java.net<mailto:hotspot-compiler-dev at openjdk.java.net>>, aarch64-port-dev at openjdk.java.net<mailto:aarch64-port-dev at openjdk.java.net> <aarch64-port-dev at openjdk.java.net<mailto:aarch64-port-dev at openjdk.java.net>>
Cc: nd <nd at arm.com<mailto:nd at arm.com>>


Hi

> Pengfei, I am sure you will be pleased to know this has finally been pushed to
> the dev repo.

Thanks a lot Andrew Dinn and Andrew Haley! This could not happen without your help.

And for the next step, the follow-up patch for vectorization is almost ready. I will post it in another new thread soon later.

--
Thanks,
Pengfei

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190116/bd902219/attachment.html>

From adinn at redhat.com  Wed Jan 16 11:23:10 2019
From: adinn at redhat.com (Andrew Dinn)
Date: Wed, 16 Jan 2019 11:23:10 +0000
Subject: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over
 non-volatile memory
In-Reply-To: <69510788-52e6-815b-1ed7-a6f4886d0398@oracle.com>
References: <07d4d0fe-3a2a-414c-74ac-69e8527785d9@redhat.com>
 <ba59cf3a-1b07-1530-79d1-d7f22a1cb900@redhat.com>
 <600e8c67-80a4-fef5-b441-72c51c6ccddb@oracle.com>
 <ad813d34-3ebd-6f66-332a-06b9446367c0@redhat.com>
 <c4fdca2f-7c6a-7429-6b1a-efdff2994bf6@oracle.com>
 <3df3b5cd-dbc7-7fdd-bfc5-2a54d11127da@redhat.com>
 <27c9458d-7257-378a-4e3a-bd03402794be@oracle.com>
 <df03ca64-d68a-ca8b-5309-395672ac7dcb@redhat.com>
 <09c92b1a-c6da-e16b-bb68-553876e8a6ea@oracle.com>
 <a68f718e-9f87-f7e5-515c-f63d827fedaa@redhat.com>
 <ed8fd92d-fd4e-dc30-af8c-8d30da37560c@oracle.com>
 <b19ba189-0141-3e25-8301-48f2d3ce3851@redhat.com>
 <b3f402a3-65ba-b776-cc9e-40ceaaeafefd@redhat.com>
 <f21dac0b-0043-7e66-72a4-c64a45fb59d3@redhat.com>
 <2a0a385d-81ee-df66-f147-e4dd9aa5b72e@oracle.com>
 <8b2ab749-20f1-8dd3-3cc7-64db5d45bc7d@redhat.com>
 <0aae37aa-7797-fde5-63d5-96c8eb961183@oracle.com>
 <86a1988a-a8d2-b6af-0985-11a94d6d76a5@redhat.com>
 <69510788-52e6-815b-1ed7-a6f4886d0398@oracle.com>
Message-ID: <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>

Hi Alan/Brian,

I have finally been able to shelve other commitments and return to this
JEP (apologies for the hiatus).

  https://openjdk.java.net/jeps/8207851

The JEP has been reviewed positively by Stuart Marks (core libs) and
Vladimir Kozlov (intrinsics). It has also been warmly welcomed by
several potential users in Red Hat and Intel (including, respectively,
Jonathan Halliday and Sandya Viswanathan both in cc).

I believe I have addressed all outstanding comments on the JEP per se,
including those made by Alan. Is it now possible for one of you to
endorse the JEP so it can be submitted?

I am aware that I still need to address a few details in the draft
implementation that are not present in the latest webrev. I believe
there are two changes requested by Vladimir:

  1. correct the type of cache writeback memory nodes to generic memory
  2. use the JVM to inject a flag setting which enables/disables mapping
of persistent buffers

and also one change requested by Alan:

  make method MappedByteBuffer.isPersistent private rather than public

Is there any other impediment to submitting the JEP and proceeding to
code review?

regards,


Andrew Dinn
-----------
Senior Principal Software Engineer
Red Hat UK Ltd
Registered in England and Wales under Company Registration No. 03798903
Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From tobias.hartmann at oracle.com  Wed Jan 16 11:25:16 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Wed, 16 Jan 2019 12:25:16 +0100
Subject: RFR(S): 8217043: Shenandoah: SIGSEGV in Type::meet_helper() at
 barrier expansion time
In-Reply-To: <87y37mwa5j.fsf@redhat.com>
References: <87y37mwa5j.fsf@redhat.com>
Message-ID: <a74f1828-a1d9-37e8-016f-0020941aebd3@oracle.com>

Hi Roland,

looks good to me.

Best regards,
Tobias

On 15.01.19 10:19, Roland Westrelin wrote:
> 
> http://cr.openjdk.java.net/~roland/8217043/webrev.00/
> 
> The ShenandoahBarrierNode::needs_barrier_impl() encounters a
> CallLeafNode (from a write barrier) and tries to get the type of n which
> is a tuple, not a pointer and this causes a null pointer
> dereference. The write barrier runtime call should anyway prevent an
> optimization of the barrier and to be on the safe side, any call should.
> 
> Roland.
> 

From tobias.hartmann at oracle.com  Wed Jan 16 11:26:05 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Wed, 16 Jan 2019 12:26:05 +0100
Subject: RFR(S): 8217042: Shenandoah: write barrier on backedge of strip
 mined loop causes c2 crash at expansion time
In-Reply-To: <874laaxpgm.fsf@redhat.com>
References: <874laaxpgm.fsf@redhat.com>
Message-ID: <f37db87b-810e-481f-fa47-fffd5db2338c@oracle.com>

Hi Roland,

looks reasonable to me.

Best regards,
Tobias

On 15.01.19 10:03, Roland Westrelin wrote:
> 
> http://cr.openjdk.java.net/~roland/8217042/webrev.00/
> 
> If a write barrier is in the body of the outer strip mined loop,
> expanding it causes loop strip mining verification code to fail. This is
> worked around by turning the strip mined loop nest into a regular
> counted loop nest so verification code doesn't trigger. The logic that
> takes care of that breaks when the write barrier is on the backedge of
> the strip mined loop because it is applied after the barrier is
> expanded. The fix I propose is to move that logic before barrier
> expansion.
> 
> Roland.
> 

From rwestrel at redhat.com  Wed Jan 16 12:34:57 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Wed, 16 Jan 2019 13:34:57 +0100
Subject: RFR(S): 8217043: Shenandoah: SIGSEGV in Type::meet_helper() at
 barrier expansion time
In-Reply-To: <a74f1828-a1d9-37e8-016f-0020941aebd3@oracle.com>
References: <87y37mwa5j.fsf@redhat.com>
 <a74f1828-a1d9-37e8-016f-0020941aebd3@oracle.com>
Message-ID: <87imyowzke.fsf@redhat.com>


Thanks for the reviews Aleksey, Roman & Tobias.

Roland.

From rwestrel at redhat.com  Wed Jan 16 12:35:58 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Wed, 16 Jan 2019 13:35:58 +0100
Subject: RFR(S): 8217042: Shenandoah: write barrier on backedge of strip
 mined loop causes c2 crash at expansion time
In-Reply-To: <f37db87b-810e-481f-fa47-fffd5db2338c@oracle.com>
References: <874laaxpgm.fsf@redhat.com>
 <f37db87b-810e-481f-fa47-fffd5db2338c@oracle.com>
Message-ID: <87fttswzip.fsf@redhat.com>


Thanks for the reviews Roman & Tobias, and for the comments, Aleksey.

Roland.

From lutz.schmidt at sap.com  Wed Jan 16 14:52:38 2019
From: lutz.schmidt at sap.com (Schmidt, Lutz)
Date: Wed, 16 Jan 2019 14:52:38 +0000
Subject: RFR(S, tedious): 8217250: Optimize CodeHeap Analytics
Message-ID: <D481567B-EA9D-4BD2-A55C-A0AF157F0EDD@sap.com>

Dear all, 

may I please have reviews for this (semantically) small change. Its purpose is to reduce the bufferedStream buffer flushes while printing CodeHeap Analytics.

Bug:    https://bugs.openjdk.java.net/browse/JDK-8217250
Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217250.00/ 

Thank you!
Lutz
 

From vladimir.x.ivanov at oracle.com  Wed Jan 16 17:48:59 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Wed, 16 Jan 2019 09:48:59 -0800
Subject: [12] RFR (S): 8215757: C2: PhaseIdealLoop::spinup() computes
 wrong post-dominating point
In-Reply-To: <87pnsxvvpr.fsf@redhat.com>
References: <13b5bc19-75a3-2692-92a1-8ac731ebf671@oracle.com>
 <874lafzfiy.fsf@redhat.com> <09abefa3-37b4-5657-22f9-e06f144a1867@oracle.com>
 <21fc0b3a-87d5-0c31-0d63-75eca3c05e5b@oracle.com>
 <14b4d2c8-0cf4-4c49-309b-1838da8536a0@oracle.com>
 <6ebc3b71-bc58-0a14-20c9-aaac3a705f91@oracle.com>
 <5e4f7d3b-6d7d-3017-2926-13e932820205@oracle.com> <87va2qvy64.fsf@redhat.com>
 <cdc71b53-e0f6-71ac-f039-3d8408efc7e2@oracle.com>
 <07fe96e2-72fd-48c4-32e8-af43840aef4c@oracle.com> <87pnsxvvpr.fsf@redhat.com>
Message-ID: <fc92e093-4f70-14d5-9923-3d9f9f555fac@oracle.com>

Thanks, Vladimir & Roland.

Best regards,
Vladimir Ivanov

On 16/01/2019 00:43, Roland Westrelin wrote:
> 
>>     http://cr.openjdk.java.net/~vlivanov/8215757/webrev.01/
> 
> Looks good to me.
> 
> Roland.
> 

From vladimir.kozlov at oracle.com  Wed Jan 16 18:10:21 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 16 Jan 2019 10:10:21 -0800
Subject: RFR(S, tedious): 8217250: Optimize CodeHeap Analytics
In-Reply-To: <D481567B-EA9D-4BD2-A55C-A0AF157F0EDD@sap.com>
References: <D481567B-EA9D-4BD2-A55C-A0AF157F0EDD@sap.com>
Message-ID: <c178e47c-279b-47f8-6876-2ad28593d630@oracle.com>

Hi Lutz,

I see that you have only one usage in all cases for:
BUFFEREDSTREAM_FLUSH_IF("", 512)

Can you simple declare simplified macro for this?

Otherwise looks good.

Thanks,
Vladimir

On 1/16/19 6:52 AM, Schmidt, Lutz wrote:
> Dear all,
> 
> may I please have reviews for this (semantically) small change. Its purpose is to reduce the bufferedStream buffer flushes while printing CodeHeap Analytics.
> 
> Bug:    https://bugs.openjdk.java.net/browse/JDK-8217250
> Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217250.00/
> 
> Thank you!
> Lutz
>   
> 

From derekw at marvell.com  Wed Jan 16 19:44:33 2019
From: derekw at marvell.com (Derek White)
Date: Wed, 16 Jan 2019 19:44:33 +0000
Subject: [aarch64-port-dev ] RFR: 8216350: AArch64: monitor unlock fast
 path not called
In-Reply-To: <a0b8de53-c4ca-98da-7d82-bea0e563fb31@arm.com>
References: <MN2PR18MB2733F0D50D0E61B4EFE1D178D28B0@MN2PR18MB2733.namprd18.prod.outlook.com>
 <680089e7-ec26-a4cd-6143-4d36182e971a@arm.com>
 <51023960-0e8f-56aa-20a4-279017251585@redhat.com>
 <a0b8de53-c4ca-98da-7d82-bea0e563fb31@arm.com>
Message-ID: <MN2PR18MB2733C3A6FA4BC2512D3E3927D2820@MN2PR18MB2733.namprd18.prod.outlook.com>

Hi Nick,

Looks good to me!

 - Derek

> -----Original Message-----
> From: Nick Gasson (Arm Technology China) <Nick.Gasson at arm.com>
> Sent: Thursday, January 10, 2019 9:37 PM
> To: Andrew Haley <aph at redhat.com>; Derek White
> <derekw at marvell.com>; hotspot-compiler-dev at openjdk.java.net compiler
> <hotspot-compiler-dev at openjdk.java.net>
> Cc: nd <nd at arm.com>; aarch64-port-dev at openjdk.java.net
> Subject: [EXT] Re: [aarch64-port-dev ] RFR: 8216350: AArch64: monitor
> unlock fast path not called
> 
> External Email
> 
> ----------------------------------------------------------------------
> Hi all,
> 
> On 09/01/2019 17:23, Andrew Haley wrote:
> >
> > HotSpot policy is that we can do minor cleanups as we go along:
> > experience has shown that unless you do so, cruft tends to accumulate.
> > These cleanups are OK for this patch.
> >
> 
> Please see the updated webrev here:
> 
> http://cr.openjdk.java.net/~ngasson/8216350/webrev.1/
> 
> Includes cleanups according to Derek's comments and updated the copyright
> year (thanks Felix).
> 
> > 4)  Slightly better comment for last instruction of fast_unlock (and
> explicitly use zr).
> >     __ stlr(zr, tmp); // set unowned
> 
> Note I needed to change the definition of load_store_exclusive to allow ZR
> here. I've checked that this is OK for the other instructions that use this.
> 
> Thanks,
> Nick

From lutz.schmidt at sap.com  Wed Jan 16 20:37:23 2019
From: lutz.schmidt at sap.com (Schmidt, Lutz)
Date: Wed, 16 Jan 2019 20:37:23 +0000
Subject: RFR(S, tedious): 8217250: Optimize CodeHeap Analytics
In-Reply-To: <c178e47c-279b-47f8-6876-2ad28593d630@oracle.com>
References: <D481567B-EA9D-4BD2-A55C-A0AF157F0EDD@sap.com>
 <c178e47c-279b-47f8-6876-2ad28593d630@oracle.com>
Message-ID: <A2738EC3-3F36-4898-87F9-EC786EA19D46@sap.com>

Hi Vladimir, 

thanks a lot for looking at this so quickly. 

Sure, I could declare a specialized "BUFFEREDSTREAM_FLUSH_512" for this. The "512" originated from the thought "its large enough for a well-behaved line and small enough to save some flushes".

I was also thinking about a "BUFFEREDSTREAM_FLUSH_AUTO", where the spare space is derived from the buffer capacity, maybe something like 10 percent of the capacity, 256 bytes minimum. I wasn't sure if that could be categorized as over-engineered. 

Your thoughts?

Thanks,
Lutz

?On 16.01.19, 19:10, "hotspot-compiler-dev on behalf of Vladimir Kozlov" <hotspot-compiler-dev-bounces at openjdk.java.net on behalf of vladimir.kozlov at oracle.com> wrote:

    Hi Lutz,
    
    I see that you have only one usage in all cases for:
    BUFFEREDSTREAM_FLUSH_IF("", 512)
    
    Can you simple declare simplified macro for this?
    
    Otherwise looks good.
    
    Thanks,
    Vladimir
    
    On 1/16/19 6:52 AM, Schmidt, Lutz wrote:
    > Dear all,
    > 
    > may I please have reviews for this (semantically) small change. Its purpose is to reduce the bufferedStream buffer flushes while printing CodeHeap Analytics.
    > 
    > Bug:    https://bugs.openjdk.java.net/browse/JDK-8217250
    > Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217250.00/
    > 
    > Thank you!
    > Lutz
    >   
    > 
    

From bsrbnd at gmail.com  Wed Jan 16 20:46:32 2019
From: bsrbnd at gmail.com (B. Blaser)
Date: Wed, 16 Jan 2019 21:46:32 +0100
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <81dc5407-4985-883c-83fc-2e1fa2b77e66@redhat.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
 <ba49e1b8-b7c8-b804-f973-6e88cb2bcd8a@redhat.com>
 <CAEgw74DXHF4cmVzOkHtvGuO937QqaypRFNvuB5sfDw3h8HOUPQ@mail.gmail.com>
 <e5697cd2-49d6-724e-1b63-41de0e771965@redhat.com>
 <b16d13b4-40f4-1503-765a-82799acf1527@redhat.com>
 <a4e64632-1187-1b3a-8c88-ac4b039acc71@redhat.com>
 <81dc5407-4985-883c-83fc-2e1fa2b77e66@redhat.com>
Message-ID: <CAEgw74ANZ2CKpd=ivKRn5mE61EG_YqpfBbPoMpV=e=3G_ABYpw@mail.gmail.com>

On Tue, 15 Jan 2019 at 12:16, Roman Kennke <rkennke at redhat.com> wrote:
>
> >> I agree with that. However, note that this is not about using cmov vs.
> >> branches. This is about generating a load followed by a cmov on the
> >> resulting register vs generating a cmov that also does the load and
> >> avoids the register. It's pretty much the same data-dependency-wise,
> >> except that it avoids using the extra register and encodes smaller.
> >
> > Sure, I get that. But, for the reasons given, CMOV is a rather dusty
> > corner of the ISA. Intel themselves recommend not using it unless you
> > know that the branch is always unpredictable. They say "Use the SETCC
> > and CMOV instructions to eliminate unpredictable conditional branches
> > where possible. Do not do this for predictable branches." It really
> > couldn't be clearer.
>
> Well yeah, but again, this patch isn't about generating cmov or not, it
> only changes that a cmov preceded by a load (mov) is generated as single
> instruction rather than two instructions for object loads, pretty much
> as it's done for all the other types. However, it's not very important
> to me, and probably anybody else, otherwise this wouldn't have been
> commented-out. I'd withdraw the patch unless somebody steps up and
> really wants it.

To answer Andrew Haley, one of the major difference between CISC and
RISC is specifically the load/store architecture of the latter which
is part of most instructions of the former; I don't see many good
reasons to generate RISC-like load/store code using only a subset of
instructions and to juggle with registers. Note also that if a
'mov+cmov' would now appear to be faster than a sole 'cmov' on some
processors, there is a high probability to see the opposite behavior
in future generations.

Of course, if you use another idiom like 'cmp+branch' which isn't the
purpose of this fix, you might have benefits for predictable branches
or not for unpredictable branches.

At my mind, It'd be unfortunate to withdraw this patch as the current
policy seems to go in Roman's direction, see 'cmovL_mem' which uses an
adequate prefix:

http://hg.openjdk.java.net/jdk/jdk/file/d3aa93570779/src/hotspot/cpu/x86/x86_64.ad#l7083

I'm not skilled enough in the pointer type area to answer Andrew
Dinn's question but maybe adding predicates to validate them if
Roman's prefix correction isn't sufficient would be a possible
solution?

In any case, explanations about the initial 'cmovP_mem' intention and
the reason for disabling it would be helpful.

Bernard

From vladimir.kozlov at oracle.com  Wed Jan 16 21:53:48 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 16 Jan 2019 13:53:48 -0800
Subject: RFR(S, tedious): 8217250: Optimize CodeHeap Analytics
In-Reply-To: <A2738EC3-3F36-4898-87F9-EC786EA19D46@sap.com>
References: <D481567B-EA9D-4BD2-A55C-A0AF157F0EDD@sap.com>
 <c178e47c-279b-47f8-6876-2ad28593d630@oracle.com>
 <A2738EC3-3F36-4898-87F9-EC786EA19D46@sap.com>
Message-ID: <2d7b7963-61be-95b8-017b-956f2752c8f3@oracle.com>

On 1/16/19 12:37 PM, Schmidt, Lutz wrote:
> Hi Vladimir,
> 
> thanks a lot for looking at this so quickly.
> 
> Sure, I could declare a specialized "BUFFEREDSTREAM_FLUSH_512" for this. The "512" originated from the thought "its large enough for a well-behaved line and small enough to save some flushes".
> 
> I was also thinking about a "BUFFEREDSTREAM_FLUSH_AUTO", where the spare space is derived from the buffer capacity, maybe something like 10 percent of the capacity, 256 bytes minimum. I wasn't sure if that could be categorized as over-engineered.

Yes, I think BUFFEREDSTREAM_FLUSH_AUTO is better than fixed size.

Vladimir

> 
> Your thoughts?
> 
> Thanks,
> Lutz
> 
> ?On 16.01.19, 19:10, "hotspot-compiler-dev on behalf of Vladimir Kozlov" <hotspot-compiler-dev-bounces at openjdk.java.net on behalf of vladimir.kozlov at oracle.com> wrote:
> 
>      Hi Lutz,
>      
>      I see that you have only one usage in all cases for:
>      BUFFEREDSTREAM_FLUSH_IF("", 512)
>      
>      Can you simple declare simplified macro for this?
>      
>      Otherwise looks good.
>      
>      Thanks,
>      Vladimir
>      
>      On 1/16/19 6:52 AM, Schmidt, Lutz wrote:
>      > Dear all,
>      >
>      > may I please have reviews for this (semantically) small change. Its purpose is to reduce the bufferedStream buffer flushes while printing CodeHeap Analytics.
>      >
>      > Bug:    https://bugs.openjdk.java.net/browse/JDK-8217250
>      > Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217250.00/
>      >
>      > Thank you!
>      > Lutz
>      >
>      >
>      
> 

From gromero at linux.vnet.ibm.com  Wed Jan 16 21:57:37 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Wed, 16 Jan 2019 19:57:37 -0200
Subject: [12] RFR(S) 8215317: [GRAAL] unit test CheckGraalIntrinsics
 failed after 8213754
In-Reply-To: <5df6d8d0-4fba-4515-ffee-870cf8cff9d3@oracle.com>
References: <5df6d8d0-4fba-4515-ffee-870cf8cff9d3@oracle.com>
Message-ID: <23eb2db4-fc81-e5e4-4c8f-67a617426353@linux.vnet.ibm.com>

Hi Vladimir,

I would like to request the approval to backport the change:

8213754: PPC64: Add Intrinsics for isDigit/isLowerCase/isUpperCase/isWhitespace
https://bugs.openjdk.java.net/browse/JDK-8213754

to jdk11u, but if it gets integrated before 8215687 it will break Graal
test HotspotTest.java/CheckGraalIntrinsics.java again, as expected.

Are you fine if I request the approval to backport first this change, i.e.
8215687?

Actually I'll have to tweak a bit and s/isJDK12OrHigher/isJDK11OrHigher/,
right?

Thank you.

Best regards,
Gustavo

On 12/13/2018 02:10 AM, Vladimir Kozlov wrote:
> https://bugs.openjdk.java.net/browse/JDK-8215317
> 
> JDK-8213754 added new intrinsics which cause Graal's unit test failure.
> 
> CheckGraalIntrinsics test is adjusted for new intrinsics:
> 
> src/jdk.internal.vm.compiler/share/classes/org.graalvm.compiler.hotspot.test/src/org/graalvm/compiler/hotspot/test/CheckGraalIntrinsics.java
> @@ -376,6 +376,14 @@
>                               "jdk/jfr/internal/JVM.getEventWriter()Ljava/lang/Object;");
>           }
> 
> +        if (isJDK12OrHigher()) {
> +            add(toBeInvestigated,
> +                            "java/lang/CharacterDataLatin1.isDigit(I)Z",
> +                            "java/lang/CharacterDataLatin1.isLowerCase(I)Z",
> +                            "java/lang/CharacterDataLatin1.isUpperCase(I)Z",
> +                            "java/lang/CharacterDataLatin1.isWhitespace(I)Z");
> +        }
> +
>           if (!config.inlineNotify()) {
>               add(ignore, "java/lang/Object.notify()V");
>           }
> 
> Tested tier1 and tier3-graal (where test is run).
> 
> I also pushed changes into Lab's Graal repo so this test will be updated during next sync.
> But I want to push fix into JDK because JDK 12 will be forked very soon.
> 


From xxinliu at amazon.com  Wed Jan 16 22:04:27 2019
From: xxinliu at amazon.com (Liu, Xin)
Date: Wed, 16 Jan 2019 22:04:27 +0000
Subject: Why does call_site_target keep changing for a Nashorn method?
Message-ID: <616C8E42-4B18-405B-B28A-C9F062EC9B6C@amazon.com>

In one of our applications, C1/C2 keeps compiling a Javascript method generated by Nashorn but the code fails a dependency check right before installing in the code cache. This is with JDK tip.

It can?t pass ?Dependencies::check_call_site_target_value?.
[C2 Parsing]
<bc code='182' bci='1'/>
<dependency type='unique_concrete_method' ctxk='1131' x='1141'/>
<call method='1141' count='878838' prof_factor='0.024122' inline='1'/>
<inline_success reason='accessor'/>
<parse method='1141' uses='21249.000000' stamp='1112.538'>
<bc code='180' bci='1'/>
<unknown id='1556'/>
<unknown id='1866'/>
<dependency type='call_site_target_value' x0='1556' x='1866'/>
<parse_done nodes='41636' live='12226' memory='12130984' stamp='1112.538'/>
</parse>

[Validating compilation dependencies]
<dependency type='call_site_target_value' x0='1132' x='1143'/>
<dependency type='call_site_target_value' x0='1334' x='1337'/>
<dependency type='call_site_target_value' x0='1424' x='1425'/>
<dependency type='call_site_target_value' x0='1437' x='1438'/>
<dependency type='call_site_target_value' x0='1454' x='1455'/>
<dependency type='call_site_target_value' x0='1465' x='1466'/>
<dependency type='call_site_target_value' x0='1482' x='1483'/>
<dependency type='call_site_target_value' x0='1498' x='1499'/>
<dependency type='call_site_target_value' x0='1509' x='1510'/>
<dependency type='call_site_target_value' x0='1526' x='1576'/>
<dependency type='call_site_target_value' x0='1528' x='1667'/>
<dependency type='call_site_target_value' x0='1536' x='1692'/>
<dependency type='call_site_target_value' x0='1537' x='1707'/>
<dependency type='call_site_target_value' x0='1538' x='1730'/>
<dependency type='call_site_target_value' x0='1539' x='1746'/>
<dependency type='call_site_target_value' x0='1540' x='1787'/>
<dependency type='call_site_target_value' x0='1550' x='1804'/>
<dependency type='call_site_target_value' x0='1553' x='1820'/>
<dependency type='call_site_target_value' x0='1554' x='1836'/>
<dependency type='call_site_target_value' x0='1555' x='1849'/>
<dependency type='call_site_target_value' x0='1556' x='1866'/>
<dependency_failed type='call_site_target_value' x0='1556' x='1866' witness='jdk/nashorn/internal/runtime/linker/LinkerCallSite' stamp='1113.578'/>

It?s related to the GWT methodHandle.  The 2 mismatched methodhandles are very similar except for argL3, which is an int[2].
Even though arg0-2 are not identical objects, their contents are same.

(gdb) call java_lang_invoke_CallSite::target(call_site)->print()
java.lang.invoke.BoundMethodHandle$Species_LLLL
{0x00000000f586ca98} - klass: 'java/lang/invoke/BoundMethodHandle$Species_LLLL'
- ---- fields (total size 6 words):
- 'customizationCount' 'B' @12  0
- private final 'type' 'Ljava/lang/invoke/MethodType;' @16  a 'java/lang/invoke/MethodType'{0x00000000e21e2878} = (Ljava/lang/Object;Ljdk/nashorn/internal/runtime/Undefined;Ljava/lang/Object;)Ljava/lang/Object; (e21e2878)
- final 'form' 'Ljava/lang/invoke/LambdaForm;' @20  a 'java/lang/invoke/LambdaForm'{0x00000000e1e4a670} => a 'java/lang/invoke/MemberName'{0x00000000e1e4a938} = {method} {0x00007fffa512cb68} 'guard''(Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;)Ljava/lang/Object;' in 'java/lang/invoke/LambdaForm$MH' (e1e4a670)
- 'asTypeCache' 'Ljava/lang/invoke/MethodHandle;' @24  NULL (0)
- final 'argL0' 'Ljava/lang/Object;' @28  a 'java/lang/invoke/BoundMethodHandle$Species_LL'{0x00000000f586c9e8} (f586c9e8)
- final 'argL1' 'Ljava/lang/Object;' @32  a 'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f586ca28} (f586ca28)
- final 'argL2' 'Ljava/lang/Object;' @36  a 'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f586ca60} (f586ca60)
- final 'argL3' 'Ljava/lang/Object;' @40  [I{0x00000000f586ca10} (f586ca10)

(gdb) call method_handle->print()
java.lang.invoke.BoundMethodHandle$Species_LLLL
{0x00000000f6b18500} - klass: 'java/lang/invoke/BoundMethodHandle$Species_LLLL'
- ---- fields (total size 6 words):
- 'customizationCount' 'B' @12  0
- private final 'type' 'Ljava/lang/invoke/MethodType;' @16  a 'java/lang/invoke/MethodType'{0x00000000e21e2878} = (Ljava/lang/Object;Ljdk/nashorn/internal/runtime/Undefined;Ljava/lang/Object;)Ljava/lang/Object; (e21e2878)
- final 'form' 'Ljava/lang/invoke/LambdaForm;' @20  a 'java/lang/invoke/LambdaForm'{0x00000000e1e4a670} => a 'java/lang/invoke/MemberName'{0x00000000e1e4a938} = {method} {0x00007fffa512cb68} 'guard''(Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;)Ljava/lang/Object;' in 'java/lang/invoke/LambdaForm$MH' (e1e4a670)
- 'asTypeCache' 'Ljava/lang/invoke/MethodHandle;' @24  NULL (0)
- final 'argL0' 'Ljava/lang/Object;' @28  a 'java/lang/invoke/BoundMethodHandle$Species_LL'{0x00000000f6b18450} (f6b18450)
- final 'argL1' 'Ljava/lang/Object;' @32  a 'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f6b18490} (f6b18490)
- final 'argL2' 'Ljava/lang/Object;' @36  a 'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f6b184c8} (f6b184c8)
- final 'argL3' 'Ljava/lang/Object;' @40  [I{0x00000000f6b18478} (f6b18478)

My guess is argL3 is counters in Java.lang.invoke.MethodHandleImpl.


// Intrinsified by C2. Counters are used during parsing to calculate branch frequencies.
@LambdaForm.Hidden
@jdk.internal.HotSpotIntrinsicCandidate
static
boolean profileBoolean(boolean result, int[] counters) {
    // Profile is int[2] where [0] and [1] correspond to false and true occurrences respectively.
    int idx = result ? 1 : 0;
    try {
        counters[idx] = Math.addExact(counters[idx], 1);
    } catch (ArithmeticException e) {
        // Avoid continuous overflow by halving the problematic count.
        counters[idx] = counters[idx] / 2;
    }
    return result;
}


I am still struggling to understand the source code in java.lang.invoke.*.  Could anybody enlighten me why the target of the callsite changes every time here?  it is relative to this profiling thing?
In validation log, it has validated the dep ?dependency type='call_site_target_value' x0='1556' x='1866'? above. Why it can?t pass it after then? My guess is one MH object has been changed by another Java thread.
One interesting fact that compiler thread can?t pass 22th dep.  My tuition is it goes over an unknown threshold.

The 2nd question is about ciEnv:: validate_compile_task_dependencies.  Why does failure of call_site_target_value_changed not count as a deopt?
The flag  _inc_decompile_count_on_failure =false stops MDO to mark this method ?not_compileable?.  C2 doesn?t set the flag, so C2 ends up compiling it over and over, which makes C2 a cpu hog. Here?s the code in validate_compile_task_dependencies

  bool counter_changed = system_dictionary_modification_counter_changed();
  Dependencies::DepType result = dependencies()->validate_dependencies(_task, counter_changed);
  if (result != Dependencies::end_marker) {
    if (result == Dependencies::call_site_target_value) {
      _inc_decompile_count_on_failure = false;
      record_failure("call site target change");

Maybe the right thing to do is to count this as a deopt and change the deopt limit computation to take into account the size of the method in nodes, just as done for abandoning compilation if the graph is too big.

Thanks,
--lx

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190116/328346a5/attachment-0001.html>

From sandhya.viswanathan at intel.com  Thu Jan 17 00:00:31 2019
From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya)
Date: Thu, 17 Jan 2019 00:00:31 +0000
Subject: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over
 non-volatile memory
In-Reply-To: <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>
References: <07d4d0fe-3a2a-414c-74ac-69e8527785d9@redhat.com>
 <ba59cf3a-1b07-1530-79d1-d7f22a1cb900@redhat.com>
 <600e8c67-80a4-fef5-b441-72c51c6ccddb@oracle.com>
 <ad813d34-3ebd-6f66-332a-06b9446367c0@redhat.com>
 <c4fdca2f-7c6a-7429-6b1a-efdff2994bf6@oracle.com>
 <3df3b5cd-dbc7-7fdd-bfc5-2a54d11127da@redhat.com>
 <27c9458d-7257-378a-4e3a-bd03402794be@oracle.com>
 <df03ca64-d68a-ca8b-5309-395672ac7dcb@redhat.com>
 <09c92b1a-c6da-e16b-bb68-553876e8a6ea@oracle.com>
 <a68f718e-9f87-f7e5-515c-f63d827fedaa@redhat.com>
 <ed8fd92d-fd4e-dc30-af8c-8d30da37560c@oracle.com>
 <b19ba189-0141-3e25-8301-48f2d3ce3851@redhat.com>
 <b3f402a3-65ba-b776-cc9e-40ceaaeafefd@redhat.com>
 <f21dac0b-0043-7e66-72a4-c64a45fb59d3@redhat.com>
 <2a0a385d-81ee-df66-f147-e4dd9aa5b72e@oracle.com>
 <8b2ab749-20f1-8dd3-3cc7-64db5d45bc7d@redhat.com>
 <0aae37aa-7797-fde5-63d5-96c8eb961183@oracle.com>
 <86a1988a-a8d2-b6af-0985-11a94d6d76a5@redhat.com>
 <69510788-52e6-815b-1ed7-a6f4886d0398@oracle.com>
 <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>
Message-ID: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A50345@FMSMSX126.amr.corp.intel.com>

It will be wonderful to have persistent MappedByteBuffer feature proposed by Andrew Dinn in JDK 13. To us it looks to be a seamless extension to the existing API, provides a very good building block for persistent memory support in Java in the current Java paradigm and is directly applicable to a class of workloads. Many Big Data frameworks like Apache HBASE use FileChannel map and MappedByteBuffer as the underlying mechanism and so can use the proposed feature to utilize non-volatile memory. 

We have also reviewed and provided initial feedback to Andrew on the implementation. 

Best Regards,
Sandhya
 

-----Original Message-----
From: Andrew Dinn [mailto:adinn at redhat.com] 
Sent: Wednesday, January 16, 2019 3:23 AM
To: Alan Bateman <Alan.Bateman at oracle.com>; Brian Goetz <brian.goetz at oracle.com>
Cc: core-libs-dev at openjdk.java.net; hotspot compiler <hotspot-compiler-dev at openjdk.java.net>; Jonathan Halliday <jonathan.halliday at redhat.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>
Subject: Re: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over non-volatile memory

Hi Alan/Brian,

I have finally been able to shelve other commitments and return to this JEP (apologies for the hiatus).

  https://openjdk.java.net/jeps/8207851

The JEP has been reviewed positively by Stuart Marks (core libs) and Vladimir Kozlov (intrinsics). It has also been warmly welcomed by several potential users in Red Hat and Intel (including, respectively, Jonathan Halliday and Sandya Viswanathan both in cc).

I believe I have addressed all outstanding comments on the JEP per se, including those made by Alan. Is it now possible for one of you to endorse the JEP so it can be submitted?

I am aware that I still need to address a few details in the draft implementation that are not present in the latest webrev. I believe there are two changes requested by Vladimir:

  1. correct the type of cache writeback memory nodes to generic memory
  2. use the JVM to inject a flag setting which enables/disables mapping of persistent buffers

and also one change requested by Alan:

  make method MappedByteBuffer.isPersistent private rather than public

Is there any other impediment to submitting the JEP and proceeding to code review?

regards,


Andrew Dinn
-----------
Senior Principal Software Engineer
Red Hat UK Ltd
Registered in England and Wales under Company Registration No. 03798903
Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From vladimir.kozlov at oracle.com  Thu Jan 17 01:40:19 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 16 Jan 2019 17:40:19 -0800
Subject: [12] RFR(S) 8215317: [GRAAL] unit test CheckGraalIntrinsics
 failed after 8213754
In-Reply-To: <23eb2db4-fc81-e5e4-4c8f-67a617426353@linux.vnet.ibm.com>
References: <5df6d8d0-4fba-4515-ffee-870cf8cff9d3@oracle.com>
 <23eb2db4-fc81-e5e4-4c8f-67a617426353@linux.vnet.ibm.com>
Message-ID: <7de3a2e1-5dc2-2d23-aec9-92085d7d7cff@oracle.com>

Hi Gustavo,

You should combine both changes and request them at the same time. Changeset will have to list both 
changes. Originally your changes should include CheckGraalIntrinsics.java fix.

Note, for 8213754 corresponding Graal test fix is 8215317 (and not 8215687, that one is for 8212043 
Math.min/max).

Yes, Graal fix in jdk11u is different - new intrinsics should be listed under existing condition 
isJDK11OrHigher():

http://hg.openjdk.java.net/jdk-updates/jdk11u/file/5fc74655f16d/src/jdk.internal.vm.compiler/share/classes/org.graalvm.compiler.hotspot.test/src/org/graalvm/compiler/hotspot/test/CheckGraalIntrinsics.java#l371

Regards,
Vladimir

On 1/16/19 1:57 PM, Gustavo Romero wrote:
> Hi Vladimir,
> 
> I would like to request the approval to backport the change:
> 
> 8213754: PPC64: Add Intrinsics for isDigit/isLowerCase/isUpperCase/isWhitespace
> https://bugs.openjdk.java.net/browse/JDK-8213754
> 
> to jdk11u, but if it gets integrated before 8215687 it will break Graal
> test HotspotTest.java/CheckGraalIntrinsics.java again, as expected.
> 
> Are you fine if I request the approval to backport first this change, i.e.
> 8215687?
> 
> Actually I'll have to tweak a bit and s/isJDK12OrHigher/isJDK11OrHigher/,
> right?
> 
> Thank you.
> 
> Best regards,
> Gustavo
> 
> On 12/13/2018 02:10 AM, Vladimir Kozlov wrote:
>> https://bugs.openjdk.java.net/browse/JDK-8215317
>>
>> JDK-8213754 added new intrinsics which cause Graal's unit test failure.
>>
>> CheckGraalIntrinsics test is adjusted for new intrinsics:
>>
>> src/jdk.internal.vm.compiler/share/classes/org.graalvm.compiler.hotspot.test/src/org/graalvm/compiler/hotspot/test/CheckGraalIntrinsics.java 
>>
>> @@ -376,6 +376,14 @@
>> ????????????????????????????? "jdk/jfr/internal/JVM.getEventWriter()Ljava/lang/Object;");
>> ????????? }
>>
>> +??????? if (isJDK12OrHigher()) {
>> +??????????? add(toBeInvestigated,
>> +??????????????????????????? "java/lang/CharacterDataLatin1.isDigit(I)Z",
>> +??????????????????????????? "java/lang/CharacterDataLatin1.isLowerCase(I)Z",
>> +??????????????????????????? "java/lang/CharacterDataLatin1.isUpperCase(I)Z",
>> +??????????????????????????? "java/lang/CharacterDataLatin1.isWhitespace(I)Z");
>> +??????? }
>> +
>> ????????? if (!config.inlineNotify()) {
>> ????????????? add(ignore, "java/lang/Object.notify()V");
>> ????????? }
>>
>> Tested tier1 and tier3-graal (where test is run).
>>
>> I also pushed changes into Lab's Graal repo so this test will be updated during next sync.
>> But I want to push fix into JDK because JDK 12 will be forked very soon.
>>
> 

From Pengfei.Li at arm.com  Thu Jan 17 02:06:59 2019
From: Pengfei.Li at arm.com (Pengfei Li (Arm Technology China))
Date: Thu, 17 Jan 2019 02:06:59 +0000
Subject: [aarch64-port-dev ] RFR(M): 8212043: Add floating-point
 Math.min/max intrinsics
In-Reply-To: <A66BBE673E08E1428E3A918AE4D5B32CED1492@BGSMSX106.gar.corp.intel.com>
References: <DB7PR08MB31155E7EBF83657CB1C17F9996F90@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <fa0af30b-5512-97a2-555d-7885b4ce6a6d@redhat.com>
 <DB7PR08MB3115E97D35A9812A164F1E0596A80@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <5bf1c593-2e96-8a10-88c6-98afdd9a04f2@redhat.com>
 <DB7PR08MB31158A711E73D7D37BB3F62F96A50@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <0c7de175-17d8-f3f5-a47b-2b9b3f45af71@redhat.com>
 <AM6PR08MB311141B58BB955B94A8E143C96A70@AM6PR08MB3111.eurprd08.prod.outlook.com>
 <d42679f1-8696-3011-b23f-0f8f4d962f1c@redhat.com>
 <DB7PR08MB31156B42C681921D8E75B6AC96A00@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <e5e8574e-60ce-8837-54a3-eb97dc121306@redhat.com>
 <1e7af2c4-8610-2ee9-9955-298ffb715fa7@redhat.com>
 <DB7PR08MB3115D088A4A8B4B34EF723A696BD0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <06048878-effe-7d24-bb87-b140e662aeb8@redhat.com>
 <7c97719b-e83a-ba40-43a3-8cec8273df1c@redhat.com>
 <3df16666-a10b-41bb-7439-b967e1d76735@redhat.com>
 <4a10fa17-197b-2da9-7890-9544a407832f@redhat.com>
 <c2b74b56-8da5-6da1-8680-a65f749469fe@redhat.com>
 <DB7PR08MB3115666749428943CBB7B1A196BE0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <CAMi_1tV6=A9F2mkBGYSQQTssNkkJS+1E5Qkhvs=yv0vL=Rtjkw@mail.gmail.com>
 <A66BBE673E08E1428E3A918AE4D5B32CED1492@BGSMSX106.gar.corp.intel.com>
Message-ID: <DB7PR08MB3115294A670B040968E3D44296830@DB7PR08MB3115.eurprd08.prod.outlook.com>

Hi Jatin,

> test/hotspot/jtreg/compiler/intrinsics/math/TestFpMinMaxIntrinsics.java
> 
> Can you kindly add this test case, I?m working on supporting these new intrinsics for X86 platform and will like to use the test case you created. 

Thanks for pointing out. That newly created test file is really missing when pushed. But I'm NOT a committer so I can't do it either.

Perhaps you could just use the file to test and upstream it together with your code change. I think each file I uploaded to my cr.openjdk.java.net is authorized to use.

--
Thanks,
Pengfei


From mikael.vidstedt at oracle.com  Thu Jan 17 02:22:43 2019
From: mikael.vidstedt at oracle.com (Mikael Vidstedt)
Date: Wed, 16 Jan 2019 18:22:43 -0800
Subject: RFR (XS): 8217266: Remove dead LIR_List::compare_to and
 LIR_Code::lir_compare_to
Message-ID: <39E5F5F4-598E-46C8-8BAD-C95D16439EDA@oracle.com>


Please review this small change which removes some long since dead code:

bug: https://bugs.openjdk.java.net/browse/JDK-8217266
webrev: http://cr.openjdk.java.net/~mikael/webrevs/8217266/webrev.00/open/webrev/ <http://cr.openjdk.java.net/~mikael/webrevs/8217266/webrev.00/open/webrev/>


I went back to see when the method was last actively used, but it?s at least 10 years ago (it was already dead in hsx14 from 2009), so suffice to say it?s been a while.

Running tier1 now for good luck.

Cheers,
Mikael

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190116/9ced9756/attachment-0001.html>

From Nick.Gasson at arm.com  Thu Jan 17 06:51:10 2019
From: Nick.Gasson at arm.com (Nick Gasson (Arm Technology China))
Date: Thu, 17 Jan 2019 06:51:10 +0000
Subject: [aarch64-port-dev ] RFR: 8216350: AArch64: monitor unlock fast
 path not called
In-Reply-To: <MN2PR18MB2733C3A6FA4BC2512D3E3927D2820@MN2PR18MB2733.namprd18.prod.outlook.com>
References: <MN2PR18MB2733F0D50D0E61B4EFE1D178D28B0@MN2PR18MB2733.namprd18.prod.outlook.com>
 <680089e7-ec26-a4cd-6143-4d36182e971a@arm.com>
 <51023960-0e8f-56aa-20a4-279017251585@redhat.com>
 <a0b8de53-c4ca-98da-7d82-bea0e563fb31@arm.com>
 <MN2PR18MB2733C3A6FA4BC2512D3E3927D2820@MN2PR18MB2733.namprd18.prod.outlook.com>
Message-ID: <2c32a14b-151e-5d05-5de0-07a984727f20@arm.com>

Thanks Derek! Is there anyone who can help me push this? (BTW in the 
last webrev I removed the Contributed-by line and added my hg username, 
hope this is correct...)

Nick

On 17/01/2019 03:44, Derek White wrote:
> Hi Nick,
> 
> Looks good to me!
> 
>   - Derek
> 
>> -----Original Message-----
>> From: Nick Gasson (Arm Technology China) <Nick.Gasson at arm.com>
>> Sent: Thursday, January 10, 2019 9:37 PM
>> To: Andrew Haley <aph at redhat.com>; Derek White
>> <derekw at marvell.com>; hotspot-compiler-dev at openjdk.java.net compiler
>> <hotspot-compiler-dev at openjdk.java.net>
>> Cc: nd <nd at arm.com>; aarch64-port-dev at openjdk.java.net
>> Subject: [EXT] Re: [aarch64-port-dev ] RFR: 8216350: AArch64: monitor
>> unlock fast path not called
>>
>> External Email
>>
>> ----------------------------------------------------------------------
>> Hi all,
>>
>> On 09/01/2019 17:23, Andrew Haley wrote:
>>>
>>> HotSpot policy is that we can do minor cleanups as we go along:
>>> experience has shown that unless you do so, cruft tends to accumulate.
>>> These cleanups are OK for this patch.
>>>
>>
>> Please see the updated webrev here:
>>
>> http://cr.openjdk.java.net/~ngasson/8216350/webrev.1/
>>
>> Includes cleanups according to Derek's comments and updated the copyright
>> year (thanks Felix).
>>
>>> 4)  Slightly better comment for last instruction of fast_unlock (and
>> explicitly use zr).
>>>      __ stlr(zr, tmp); // set unowned
>>
>> Note I needed to change the definition of load_store_exclusive to allow ZR
>> here. I've checked that this is OK for the other instructions that use this.
>>
>> Thanks,
>> Nick

From rwestrel at redhat.com  Thu Jan 17 08:33:08 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Thu, 17 Jan 2019 09:33:08 +0100
Subject: RFR (XS): 8217266: Remove dead LIR_List::compare_to and
 LIR_Code::lir_compare_to
In-Reply-To: <39E5F5F4-598E-46C8-8BAD-C95D16439EDA@oracle.com>
References: <39E5F5F4-598E-46C8-8BAD-C95D16439EDA@oracle.com>
Message-ID: <877ef3wunv.fsf@redhat.com>


> webrev: http://cr.openjdk.java.net/~mikael/webrevs/8217266/webrev.00/open/webrev/ <http://cr.openjdk.java.net/~mikael/webrevs/8217266/webrev.00/open/webrev/>

That looks good to me.

Roland.

From aph at redhat.com  Thu Jan 17 09:16:57 2019
From: aph at redhat.com (Andrew Haley)
Date: Thu, 17 Jan 2019 09:16:57 +0000
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <CAEgw74ANZ2CKpd=ivKRn5mE61EG_YqpfBbPoMpV=e=3G_ABYpw@mail.gmail.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
 <ba49e1b8-b7c8-b804-f973-6e88cb2bcd8a@redhat.com>
 <CAEgw74DXHF4cmVzOkHtvGuO937QqaypRFNvuB5sfDw3h8HOUPQ@mail.gmail.com>
 <e5697cd2-49d6-724e-1b63-41de0e771965@redhat.com>
 <b16d13b4-40f4-1503-765a-82799acf1527@redhat.com>
 <a4e64632-1187-1b3a-8c88-ac4b039acc71@redhat.com>
 <81dc5407-4985-883c-83fc-2e1fa2b77e66@redhat.com>
 <CAEgw74ANZ2CKpd=ivKRn5mE61EG_YqpfBbPoMpV=e=3G_ABYpw@mail.gmail.com>
Message-ID: <c64262ae-8d2f-f773-3688-f1c40ad23f10@redhat.com>

On 1/16/19 8:46 PM, B. Blaser wrote:

> To answer Andrew Haley, one of the major difference between CISC and
> RISC is specifically the load/store architecture of the latter which
> is part of most instructions of the former; I don't see many good
> reasons to generate RISC-like load/store code using only a subset of
> instructions and to juggle with registers.

Well, yes, but the question remains: does this change actually help
anything. And if it does, by how much? All we have now is

> I cannot say if if this has performance implication. I suspect
> not. If it has, it's probably miniscule improvement. I can't see how
> it could be worse though.

We can measure, and we should.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From aph at redhat.com  Thu Jan 17 09:36:01 2019
From: aph at redhat.com (Andrew Haley)
Date: Thu, 17 Jan 2019 09:36:01 +0000
Subject: [aarch64-port-dev ] RFR: 8216350: AArch64: monitor unlock fast
 path not called
In-Reply-To: <2c32a14b-151e-5d05-5de0-07a984727f20@arm.com>
References: <MN2PR18MB2733F0D50D0E61B4EFE1D178D28B0@MN2PR18MB2733.namprd18.prod.outlook.com>
 <680089e7-ec26-a4cd-6143-4d36182e971a@arm.com>
 <51023960-0e8f-56aa-20a4-279017251585@redhat.com>
 <a0b8de53-c4ca-98da-7d82-bea0e563fb31@arm.com>
 <MN2PR18MB2733C3A6FA4BC2512D3E3927D2820@MN2PR18MB2733.namprd18.prod.outlook.com>
 <2c32a14b-151e-5d05-5de0-07a984727f20@arm.com>
Message-ID: <4f470d09-bc37-55b6-f42d-373934d5aba4@redhat.com>

On 1/17/19 6:51 AM, Nick Gasson (Arm Technology China) wrote:
> Thanks Derek! Is there anyone who can help me push this? (BTW in the 
> last webrev I removed the Contributed-by line and added my hg username, 
> hope this is correct...)

We need more committers. These people have contributed to the AArch64 port:

adinn aph avoitylov bulasevich coleenp dchuyko dholmes dlong dpochepk
dsamersoff egahlin eosterlund erikj fyang gdub goetz gziemski hseigel
ihse iveresov jcbeyler jcm jwilhelm kbarrett kvn lana lfoltan lucy
mbaesken mdoerr mikael njian pli pliden prr rehn rkennke roland
rraghavan shade smonteith stefank stuefe thartmann tschatzl vlivanov
yzhang zyao

I see that we have quite a few authors without committer access. njian
has 16 committed patches by now, and should surely be a committer.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From felix.yang at huawei.com  Thu Jan 17 12:21:32 2019
From: felix.yang at huawei.com (Yangfei (Felix))
Date: Thu, 17 Jan 2019 12:21:32 +0000
Subject: [aarch64-port-dev ] RFR: 8216350: AArch64: monitor unlock fast
 path not called
In-Reply-To: <2c32a14b-151e-5d05-5de0-07a984727f20@arm.com>
References: <MN2PR18MB2733F0D50D0E61B4EFE1D178D28B0@MN2PR18MB2733.namprd18.prod.outlook.com>
 <680089e7-ec26-a4cd-6143-4d36182e971a@arm.com>
 <51023960-0e8f-56aa-20a4-279017251585@redhat.com>
 <a0b8de53-c4ca-98da-7d82-bea0e563fb31@arm.com>
 <MN2PR18MB2733C3A6FA4BC2512D3E3927D2820@MN2PR18MB2733.namprd18.prod.outlook.com>
 <2c32a14b-151e-5d05-5de0-07a984727f20@arm.com>
Message-ID: <DA41BE1DDCA941489001C7FBD7A8820ED5F3F763@dggeml527-mbx.china.huawei.com>

Hi,

    As this patch changes one testcase, to be conservative, I submitted the patch to the submit repo last week: http://hg.openjdk.java.net/jdk/submit/rev/7dfc2583c8b9
    The Email I got shows that it passed all the oracle internal tests.  Will push the patch. 

Thanks,
Felix

> 
> Thanks Derek! Is there anyone who can help me push this? (BTW in the
> last webrev I removed the Contributed-by line and added my hg username,
> hope this is correct...)
> 

From Alan.Bateman at oracle.com  Thu Jan 17 12:53:54 2019
From: Alan.Bateman at oracle.com (Alan Bateman)
Date: Thu, 17 Jan 2019 12:53:54 +0000
Subject: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over
 non-volatile memory
In-Reply-To: <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>
References: <07d4d0fe-3a2a-414c-74ac-69e8527785d9@redhat.com>
 <600e8c67-80a4-fef5-b441-72c51c6ccddb@oracle.com>
 <ad813d34-3ebd-6f66-332a-06b9446367c0@redhat.com>
 <c4fdca2f-7c6a-7429-6b1a-efdff2994bf6@oracle.com>
 <3df3b5cd-dbc7-7fdd-bfc5-2a54d11127da@redhat.com>
 <27c9458d-7257-378a-4e3a-bd03402794be@oracle.com>
 <df03ca64-d68a-ca8b-5309-395672ac7dcb@redhat.com>
 <09c92b1a-c6da-e16b-bb68-553876e8a6ea@oracle.com>
 <a68f718e-9f87-f7e5-515c-f63d827fedaa@redhat.com>
 <ed8fd92d-fd4e-dc30-af8c-8d30da37560c@oracle.com>
 <b19ba189-0141-3e25-8301-48f2d3ce3851@redhat.com>
 <b3f402a3-65ba-b776-cc9e-40ceaaeafefd@redhat.com>
 <f21dac0b-0043-7e66-72a4-c64a45fb59d3@redhat.com>
 <2a0a385d-81ee-df66-f147-e4dd9aa5b72e@oracle.com>
 <8b2ab749-20f1-8dd3-3cc7-64db5d45bc7d@redhat.com>
 <0aae37aa-7797-fde5-63d5-96c8eb961183@oracle.com>
 <86a1988a-a8d2-b6af-0985-11a94d6d76a5@redhat.com>
 <69510788-52e6-815b-1ed7-a6f4886d0398@oracle.com>
 <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>
Message-ID: <34cfc530-8517-ac1a-0c04-446dc3dc2436@oracle.com>

On 16/01/2019 11:23, Andrew Dinn wrote:
> Hi Alan/Brian,
>
> I have finally been able to shelve other commitments and return to this
> JEP (apologies for the hiatus).
>
>    https://openjdk.java.net/jeps/8207851
>
> The JEP has been reviewed positively by Stuart Marks (core libs) and
> Vladimir Kozlov (intrinsics). It has also been warmly welcomed by
> several potential users in Red Hat and Intel (including, respectively,
> Jonathan Halliday and Sandya Viswanathan both in cc).
I think the proposal is good as a short term/tactical solution, 
especially as you were able to reduce the API surface down to new 
FileChannel map modes. I think it can be looked at again once Project 
Panama is further along and there is some notion of "memory region" that 
is backed by NVM.

I skimmed through the current draft. In the most recent discussion then 
I think we had converged on "SYNC" rather than "PERSISTENT", the 
reasoning being that there is persistence already with regular file 
mapped files, also it aligns with the MAP_SYNC flag to mmap. I don't 
recall if the discussion on isPersistent concluded, that was more of a 
naming issue and whether you include an isXXX method or not is not 
critical to the proposal. The overload of the force method to specify a 
range is a good addition, irrespective of the JEP.

One thing to clarify is the heading "Proposed Restricted Public JDK API 
Changes". The proposal (and the early webrevs) exposed writebackMemory 
in the internal Unsafe, not sun.misc.Unsafe, which I think is right. 
This makes it a JDK internal API so it doesn't need to be in JEP.

Did you get any feedback on the Testing section? Given that the feature 
needs special hardware then it will need commitment to test is on a 
regular basis. It's a similar issue to the draft "JEP 337: RDMA Network 
Sockets" where special hardware is needed to full test the feature. In 
the case of JEP 337 then some testing with emulation is possible.

Vladimir and I have reviewed the JEP, it will need an area lead to 
endorse, I think it can be Brian or Mikael in this case.

-Alan


From martin.doerr at sap.com  Thu Jan 17 13:18:13 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Thu, 17 Jan 2019 13:18:13 +0000
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used
 by interpreter and be faster for short arrays
In-Reply-To: <301fd43a-e5b5-d970-7a1a-2458dbaeec36@linux.vnet.ibm.com>
References: <1c4646d554954551b73c077fa40f983d@sap.com>
 <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>
 <9863276de30643338249ead2a6ac7fe9@sap.com>
 <452dcb69-189e-700d-5995-582ba13669b9@linux.vnet.ibm.com>
 <37d9e6f3b2b4400d8963f54d2fe7767f@sap.com>
 <406db3e3-2ac3-dc16-f384-99a314e62a42@linux.vnet.ibm.com>
 <beff3d359c954a29962be71c40bc235b@sap.com>
 <180a6c0b-7abe-9d5c-51e6-dffbb23570d3@linux.vnet.ibm.com>
 <d898c13929a44afb82d477fd732d23e7@sap.com>
 <301fd43a-e5b5-d970-7a1a-2458dbaeec36@linux.vnet.ibm.com>
Message-ID: <AM6PR02MB47887C5C3082A52540C5AB8B9A830@AM6PR02MB4788.eurprd02.prod.outlook.com>

Hi,

the rebased webrev.01 applies on jdk/jdk, now (after JDK-8216376). So the issue Gustavo had observed does not longer exist.
http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/

I have updated copyrights and retested it.

Best regards,
Martin


-----Original Message-----
From: Gustavo Romero <gromero at linux.vnet.ibm.com> 
Sent: Montag, 7. Januar 2019 14:52
To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays

Hi Martin,

On 01/07/2019 11:49 AM, Doerr, Martin wrote:
> I want to check all places where we use "mr(R1_SP, R21_sender_SP)". There may be more issues with that. I'll probably handle that in a separate change and push this CRC change afterwards.

I see. Thanks for letting me know.

Best regards,
Gustavo

> Best regards,
> Martin
> 
> 
> -----Original Message-----
> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> Sent: Freitag, 4. Januar 2019 19:55
> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
> 
> Hi Martin,
> 
> On 01/04/2019 02:13 PM, Doerr, Martin wrote:
>> Hi Gustavo,
>>
>> when called from the interpreter (the scenario you observed), R21 is set before resizing the frame to avoid wasted stack space (InterpreterMacroAssembler::call_from_interpreter).
> 
> Got it. Thanks a lot for the explanations.
> 
> I think it doesn't currently matter in practice, but I'm wondering if to be
> consistent we should cut back the stack back earlier also in
> TemplateInterpreterGenerator::generate_CRC32_update_entry()?
> 
> diff -r a35f8c35d8c9 src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan 04 10:09:00 2019 +0100
> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan 04 13:44:37 2019 -0500
> @@ -1840,11 +1840,12 @@
>    #endif
>        __ lwz(crc,  2*wordSize, argP);    // Current crc state, zero extend to 64 bit to have a clean register.
>    
> +    // Restore caller sp for c2i case and return.
> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
> +
>        StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
>        __ kernel_crc32_singleByte(crc, data, dataLen, table, tmp, true);
>    
> -    // Restore caller sp for c2i case and return.
> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>        __ blr();
>    
>        // Generate a vanilla native entry as the slow path.
> 
> Currently there is no issue probably because generated code is simpler and does
> no spills.
> 
> Best regards,
> Gustavo
> 
>> When called from compiled methods, R21 is set by a c2i adapter which extends the compiled frame by space for arguments (gen_c2i_adapter).
>>
>> "mr(R1_SP, R21_sender_SP)" is more error-prone than "resize_frame_absolute" so I think the latter would be better (though it takes more registers and instructions), but I don't want to replace that as part of this CRC change.
>>
>> Best regards,
>> Martin
>>
>>
>> -----Original Message-----
>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>> Sent: Freitag, 4. Januar 2019 14:44
>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>
>> Hi Martin,
>>
>> On 01/04/2019 07:30 AM, Doerr, Martin wrote:
>>> thank you very much for confirming. This makes sense. We use different frame headers depending on whether the frame is the top Java frame or not (and on whether it's a debug build or not). Setting R1_SP to sender_SP is a shortcut for leaf calls which relies on having an unmodified stack until this point. So the patch fixes the issue.
>>
>> Glad to help! Thanks for the additional information, I was not aware that the
>> selection of different frame headers could be done at compile time. One last
>> question only for my education: what exactly advanced (incremented) R1_SP so it
>> has to be cut back using sender_SP value, i.e. sender_SP tracks the frame for
>> which function exactly or "who" is the caller exactly here?
>>
>> Thank you.
>>
>> Best regards,
>> Gustavo
>>
>>> New webrev:
>>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/
>>>
>>> Best regards,
>>> Martin
>>>
>>>
>>> -----Original Message-----
>>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>>> Sent: Donnerstag, 3. Januar 2019 19:36
>>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>>
>>> Hi Martin,
>>>
>>> On 01/03/2019 03:34 PM, Doerr, Martin wrote:
>>>> Unfortunately, I can't reproduce the crash. TestCRC32C works stable on our machine (with fastdbg build).
>>>> I guess that the frameless spills mess up the stack. Can you check if the patch below helps?
>>>
>>> Thanks for providing a fix so I can try it.
>>> Yes, I confirm the patch below indeed fixes the sigsegv crash when CRC32C update() method is used.
>>> I also confirm that I don't observe the crash on the fastdebug build, only on the release build.
>>> It also only affects the Interpreter mode, so passing -Xcomp avoids the crash on the release build.
>>>
>>> Just as reference, I can reproduce it on the release build with the following trivial code:
>>>
>>> import java.util.zip.CRC32C;
>>>
>>> class CRC32C_v1 {
>>>       public static void main(String[] arg) {
>>>         byte[] b = new byte[1024];
>>>       
>>>         CRC32C crc32c = new CRC32C();
>>>         crc32c.update(b, 0, b.length);
>>>
>>>         System.out.println(crc32c.getValue());
>>>       }
>>> }
>>>
>>> Thanks for fixing the typos.
>>>
>>>
>>> Best regards,
>>> Gustavo
>>>      
>>>> Best regards,
>>>> Martin
>>>>
>>>>
>>>> diff -r a33f49d5998c src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
>>>> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 17:30:03 2019 +0100
>>>> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 18:33:16 2019 +0100
>>>> @@ -1924,6 +1924,9 @@
>>>>            __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>>>>          }
>>>>
>>>> +    // Restore caller sp for c2i case.
>>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>> +
>>>>          StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
>>>>
>>>>          if (!VM_Version::has_vpmsumb()) {
>>>> @@ -1933,8 +1936,6 @@
>>>>            __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, true);
>>>>          }
>>>>
>>>> -    // Restore caller sp for c2i case and return.
>>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>>          __ blr();
>>>>
>>>>          // Generate a vanilla native entry as the slow path.
>>>> @@ -2014,6 +2015,9 @@
>>>>            __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>>>>          }
>>>>
>>>> +    // Restore caller sp for c2i case.
>>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>> +
>>>>          StubRoutines::ppc64::generate_load_crc32c_table_addr(_masm, table);
>>>>
>>>>          if (!VM_Version::has_vpmsumb()) {
>>>> @@ -2023,8 +2027,6 @@
>>>>            __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, false);
>>>>          }
>>>>
>>>> -    // Restore caller sp for c2i case and return.
>>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>>          __ blr();
>>>>
>>>>          BLOCK_COMMENT("} CRC32C_update{Bytes|DirectByteBuffer}");
>>>>
>>>>
>>>> -----Original Message-----
>>>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>>>> Sent: Donnerstag, 3. Januar 2019 17:13
>>>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>>>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>>>
>>>> Hi Martin,
>>>>
>>>> oh that's nice. You removed the 512-byte block constraint and also wired it up to the Interpreter :)
>>>>
>>>> For the worst case, unaligned 512 byte array, I see the gap to aligned 512 byte array reduced by about ~5.7x.
>>>>
>>>> On the Interpreter I see an improvement of at least 50% for 1024 bytes.
>>>>
>>>> This is all for the CRC32 class.
>>>>
>>>> On CRC32C I'm getting a SIGSEV that can be reproduced running against ./test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java.
>>>>
>>>> I've upload a full log into http://cr.openjdk.java.net/~gromero/logs/crc32c_sigsegv/
>>>>
>>>> I'm leaving for the lunch and I'll take a closer look when back. But probably you will figure it out before I sit to appreciate the meal :)
>>>>
>>>> Finally, since the change does some cleanup, I wonder if it would be worth fixing the following typos:
>>>>
>>>> I think it's Barrett const., not Barret. Probably 'barret' is used in the code as a short version
>>>> for Barrett but it should be changed in
>>>>
>>>> +  // Point to Barret constants
>>>> +  add_const_optimized(cur_const, constants, outer_consts_size + inner_consts_size);
>>>> +
>>>>
>>>> ?
>>>>
>>>> s/not/note/ in:
>>>> cpu/ppc/macroAssembler_ppc.cpp:3977:// A not on the lookup table address(es):
>>>>
>>>> d/lives/ in:
>>>> cpu/ppc/macroAssembler_ppc.cpp:4265:  mtvrwz(VCRC, crc); // crc lives lives in VCRC, now
>>>>
>>>> Best regards,
>>>> Gustavo
>>>>
>>>> On 01/03/2019 12:17 PM, Doerr, Martin wrote:
>>>>> Hi,
>>>>>
>>>>> the JVM on PPC64 currently misses usage of the fast vector implementation in the interpreter code.
>>>>>
>>>>> In addition, performance is not good for short arrays (unaligned 512 byte arrays or shorter arrays) because the current vector implementation needs at least 512 bytes.
>>>>>
>>>>> Bug:
>>>>>
>>>>> https://bugs.openjdk.java.net/browse/JDK-8216060
>>>>>
>>>>> I have addressed these 2 issues + some cleanup with the following webrev:
>>>>>
>>>>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8216060_PPC64_CRC/webrev.00/>
>>>>>
>>>>> Please review.
>>>>>
>>>>> Best regards,
>>>>>
>>>>> Martin
>>>>>
>>>>
>>>
>>
> 


From gromero at linux.vnet.ibm.com  Thu Jan 17 14:07:58 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Thu, 17 Jan 2019 12:07:58 -0200
Subject: [12] RFR(S) 8215317: [GRAAL] unit test CheckGraalIntrinsics
 failed after 8213754
In-Reply-To: <7de3a2e1-5dc2-2d23-aec9-92085d7d7cff@oracle.com>
References: <5df6d8d0-4fba-4515-ffee-870cf8cff9d3@oracle.com>
 <23eb2db4-fc81-e5e4-4c8f-67a617426353@linux.vnet.ibm.com>
 <7de3a2e1-5dc2-2d23-aec9-92085d7d7cff@oracle.com>
Message-ID: <0af27551-f6f6-28b9-b4b6-cae6427fefad@linux.vnet.ibm.com>

Hi Vladimir,

On 01/16/2019 11:40 PM, Vladimir Kozlov wrote:
> You should combine both changes and request them at the same time. Changeset will have to list both changes. Originally your changes should include CheckGraalIntrinsics.java fix.

Thanks a lot for advising. Just one question: by "Changeset will have to
list both changes" you mean the commit title (or body) must say explicitly
the change is a combination of 8213754 + 821531?
  

> Note, for 8213754 corresponding Graal test fix is 8215317 (and not 8215687, that one is for 8212043 Math.min/max).

Thanks for the detailed note. I commented on the right thread but pasted
the wrong bug!

Thank you and best regards,
Gustavo

> Yes, Graal fix in jdk11u is different - new intrinsics should be listed under existing condition isJDK11OrHigher():
> 
> http://hg.openjdk.java.net/jdk-updates/jdk11u/file/5fc74655f16d/src/jdk.internal.vm.compiler/share/classes/org.graalvm.compiler.hotspot.test/src/org/graalvm/compiler/hotspot/test/CheckGraalIntrinsics.java#l371
> 
> Regards,
> Vladimir
> 
> On 1/16/19 1:57 PM, Gustavo Romero wrote:
>> Hi Vladimir,
>>
>> I would like to request the approval to backport the change:
>>
>> 8213754: PPC64: Add Intrinsics for isDigit/isLowerCase/isUpperCase/isWhitespace
>> https://bugs.openjdk.java.net/browse/JDK-8213754
>>
>> to jdk11u, but if it gets integrated before 8215687 it will break Graal
>> test HotspotTest.java/CheckGraalIntrinsics.java again, as expected.
>>
>> Are you fine if I request the approval to backport first this change, i.e.
>> 8215687?
>>
>> Actually I'll have to tweak a bit and s/isJDK12OrHigher/isJDK11OrHigher/,
>> right?
>>
>> Thank you.
>>
>> Best regards,
>> Gustavo
>>
>> On 12/13/2018 02:10 AM, Vladimir Kozlov wrote:
>>> https://bugs.openjdk.java.net/browse/JDK-8215317
>>>
>>> JDK-8213754 added new intrinsics which cause Graal's unit test failure.
>>>
>>> CheckGraalIntrinsics test is adjusted for new intrinsics:
>>>
>>> src/jdk.internal.vm.compiler/share/classes/org.graalvm.compiler.hotspot.test/src/org/graalvm/compiler/hotspot/test/CheckGraalIntrinsics.java
>>> @@ -376,6 +376,14 @@
>>>                               "jdk/jfr/internal/JVM.getEventWriter()Ljava/lang/Object;");
>>>           }
>>>
>>> +        if (isJDK12OrHigher()) {
>>> +            add(toBeInvestigated,
>>> +                            "java/lang/CharacterDataLatin1.isDigit(I)Z",
>>> +                            "java/lang/CharacterDataLatin1.isLowerCase(I)Z",
>>> +                            "java/lang/CharacterDataLatin1.isUpperCase(I)Z",
>>> +                            "java/lang/CharacterDataLatin1.isWhitespace(I)Z");
>>> +        }
>>> +
>>>           if (!config.inlineNotify()) {
>>>               add(ignore, "java/lang/Object.notify()V");
>>>           }
>>>
>>> Tested tier1 and tier3-graal (where test is run).
>>>
>>> I also pushed changes into Lab's Graal repo so this test will be updated during next sync.
>>> But I want to push fix into JDK because JDK 12 will be forked very soon.
>>>
>>
> 


From adinn at redhat.com  Thu Jan 17 14:27:44 2019
From: adinn at redhat.com (Andrew Dinn)
Date: Thu, 17 Jan 2019 14:27:44 +0000
Subject: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over
 non-volatile memory
In-Reply-To: <34cfc530-8517-ac1a-0c04-446dc3dc2436@oracle.com>
References: <07d4d0fe-3a2a-414c-74ac-69e8527785d9@redhat.com>
 <ad813d34-3ebd-6f66-332a-06b9446367c0@redhat.com>
 <c4fdca2f-7c6a-7429-6b1a-efdff2994bf6@oracle.com>
 <3df3b5cd-dbc7-7fdd-bfc5-2a54d11127da@redhat.com>
 <27c9458d-7257-378a-4e3a-bd03402794be@oracle.com>
 <df03ca64-d68a-ca8b-5309-395672ac7dcb@redhat.com>
 <09c92b1a-c6da-e16b-bb68-553876e8a6ea@oracle.com>
 <a68f718e-9f87-f7e5-515c-f63d827fedaa@redhat.com>
 <ed8fd92d-fd4e-dc30-af8c-8d30da37560c@oracle.com>
 <b19ba189-0141-3e25-8301-48f2d3ce3851@redhat.com>
 <b3f402a3-65ba-b776-cc9e-40ceaaeafefd@redhat.com>
 <f21dac0b-0043-7e66-72a4-c64a45fb59d3@redhat.com>
 <2a0a385d-81ee-df66-f147-e4dd9aa5b72e@oracle.com>
 <8b2ab749-20f1-8dd3-3cc7-64db5d45bc7d@redhat.com>
 <0aae37aa-7797-fde5-63d5-96c8eb961183@oracle.com>
 <86a1988a-a8d2-b6af-0985-11a94d6d76a5@redhat.com>
 <69510788-52e6-815b-1ed7-a6f4886d0398@oracle.com>
 <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>
 <34cfc530-8517-ac1a-0c04-446dc3dc2436@oracle.com>
Message-ID: <21ef0e11-3f3d-e9a4-5dc6-898d4ac18efa@redhat.com>

Hi Alan,

Thanks for your response.

On 17/01/2019 12:53, Alan Bateman wrote:
> I skimmed through the current draft. In the most recent discussion then
> I think we had converged on "SYNC" rather than "PERSISTENT", the
> reasoning being that there is persistence already with regular file
> mapped files, also it aligns with the MAP_SYNC flag to mmap. I don't
> recall if the discussion on isPersistent concluded, that was more of a
> naming issue and whether you include an isXXX method or not is not
> critical to the proposal. The overload of the force method to specify a
> range is a good addition, irrespective of the JEP.

Ok, thanks. At least sync is now being used consistently in the public
API. I will look at renaming internal vars/methods to use sync when I
publish the next webrev.

> One thing to clarify is the heading "Proposed Restricted Public JDK API
> Changes". The proposal (and the early webrevs) exposed writebackMemory
> in the internal Unsafe, not sun.misc.Unsafe, which I think is right.
> This makes it a JDK internal API so it doesn't need to be in JEP.


I am happy to remove it from the JEP if needed. Does it do any harm to
leave it?

> Did you get any feedback on the Testing section? Given that the feature
> needs special hardware then it will need commitment to test is on a
> regular basis. It's a similar issue to the draft "JEP 337: RDMA Network
> Sockets" where special hardware is needed to full test the feature. In
> the case of JEP 337 then some testing with emulation is possible.

I believe I received no specific feedback on that topic.

Some of the other Red Hat dev teams (i.e. not OpenJDK) and also dev
staff at Intel are very keen to base some of their future work on this
feature. So, it will certainly get tested /after/ JDK release :-)

Red Hat does have the Intel hardware needed to test this feature but, so
far, nothing that can be used to test on AArch64. Our OpenJDK team can
access this kit for one-off testing but it is not currently available
for continuous integration testing.

I will propose to my manager that we acquire the relevant kit and ensure
that all JDKs which implement this JEP are tested prior to release. We
should also be able to test AArch64 using volatile memory to simulate a
non-volatile memory device up to the point where the requisite
AArch64-based NVM hardware becomes available. I am fairly confident this
plan will be agreeable to the overlords whom I humbly serve.

Perhaps Intel also could provide help with testing? [Sadhya, is this an
option?]

My bigger concern was that crash recovery tests may never be 100%
reliable. A 100% guarantee requires the ability to engineer a machine
crash at a precisely defined critical point of execution and some of the
relevant critical locations will be embedded in the middle of JITted
code making it hard to provoke the crash. So, the situations where a
crash /can/ be engineered may not fully reflect those that occur in live
deployments. That said, a dash of artificiality in test code is,
perhaps, not so worthy of remark . . .

> Vladimir and I have reviewed the JEP, it will need an area lead to
> endorse, I think it can be Brian or Mikael in this case.
Ok, thanks for the above answers. Looking forward to hearing further
from Brian and/or Mikael (Vidstedt, I assume? :-).

regards,


Andrew Dinn
-----------


From lutz.schmidt at sap.com  Thu Jan 17 15:47:00 2019
From: lutz.schmidt at sap.com (Schmidt, Lutz)
Date: Thu, 17 Jan 2019 15:47:00 +0000
Subject: RFR(S, tedious): 8217250: Optimize CodeHeap Analytics
In-Reply-To: <2d7b7963-61be-95b8-017b-956f2752c8f3@oracle.com>
References: <D481567B-EA9D-4BD2-A55C-A0AF157F0EDD@sap.com>
 <c178e47c-279b-47f8-6876-2ad28593d630@oracle.com>
 <A2738EC3-3F36-4898-87F9-EC786EA19D46@sap.com>
 <2d7b7963-61be-95b8-017b-956f2752c8f3@oracle.com>
Message-ID: <9AD26B9E-015A-4BD4-A44F-12DDE2793ED0@sap.com>

Hi Vladimir & all,
there is a new webrev available: http://cr.openjdk.java.net/~lucy/webrevs/8217250.01/
What's new (in addition to some comments) is the macro 

  // Flush the buffer contents if the remaining capacity is less
  // than the calculated threshold (256 bytes + capacity/16)
  // That should suffice for all reasonably sized output lines.
  #define BUFFEREDSTREAM_FLUSH_AUTO(_termString)                \
      BUFFEREDSTREAM_FLUSH_IF(_termString, 256+(_capacity>>4))

It replaced the previous BUFFEREDSTREAM_FLUSH_IF("string", 512) occurrences. 
Regards,
Lutz

?On 16.01.19, 22:53, "Vladimir Kozlov" <vladimir.kozlov at oracle.com> wrote:

    On 1/16/19 12:37 PM, Schmidt, Lutz wrote:
    > Hi Vladimir,
    > 
    > thanks a lot for looking at this so quickly.
    > 
    > Sure, I could declare a specialized "BUFFEREDSTREAM_FLUSH_512" for this. The "512" originated from the thought "its large enough for a well-behaved line and small enough to save some flushes".
    > 
    > I was also thinking about a "BUFFEREDSTREAM_FLUSH_AUTO", where the spare space is derived from the buffer capacity, maybe something like 10 percent of the capacity, 256 bytes minimum. I wasn't sure if that could be categorized as over-engineered.
    
    Yes, I think BUFFEREDSTREAM_FLUSH_AUTO is better than fixed size.
    
    Vladimir
    
    > 
    > Your thoughts?
    > 
    > Thanks,
    > Lutz
    > 
    > On 16.01.19, 19:10, "hotspot-compiler-dev on behalf of Vladimir Kozlov" <hotspot-compiler-dev-bounces at openjdk.java.net on behalf of vladimir.kozlov at oracle.com> wrote:
    > 
    >      Hi Lutz,
    >      
    >      I see that you have only one usage in all cases for:
    >      BUFFEREDSTREAM_FLUSH_IF("", 512)
    >      
    >      Can you simple declare simplified macro for this?
    >      
    >      Otherwise looks good.
    >      
    >      Thanks,
    >      Vladimir
    >      
    >      On 1/16/19 6:52 AM, Schmidt, Lutz wrote:
    >      > Dear all,
    >      >
    >      > may I please have reviews for this (semantically) small change. Its purpose is to reduce the bufferedStream buffer flushes while printing CodeHeap Analytics.
    >      >
    >      > Bug:    https://bugs.openjdk.java.net/browse/JDK-8217250
    >      > Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217250.00/
    >      >
    >      > Thank you!
    >      > Lutz
    >      >
    >      >
    >      
    > 
    

From gromero at linux.vnet.ibm.com  Thu Jan 17 16:18:39 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Thu, 17 Jan 2019 14:18:39 -0200
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used
 by interpreter and be faster for short arrays
In-Reply-To: <AM6PR02MB47887C5C3082A52540C5AB8B9A830@AM6PR02MB4788.eurprd02.prod.outlook.com>
References: <1c4646d554954551b73c077fa40f983d@sap.com>
 <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>
 <9863276de30643338249ead2a6ac7fe9@sap.com>
 <452dcb69-189e-700d-5995-582ba13669b9@linux.vnet.ibm.com>
 <37d9e6f3b2b4400d8963f54d2fe7767f@sap.com>
 <406db3e3-2ac3-dc16-f384-99a314e62a42@linux.vnet.ibm.com>
 <beff3d359c954a29962be71c40bc235b@sap.com>
 <180a6c0b-7abe-9d5c-51e6-dffbb23570d3@linux.vnet.ibm.com>
 <d898c13929a44afb82d477fd732d23e7@sap.com>
 <301fd43a-e5b5-d970-7a1a-2458dbaeec36@linux.vnet.ibm.com>
 <AM6PR02MB47887C5C3082A52540C5AB8B9A830@AM6PR02MB4788.eurprd02.prod.outlook.com>
Message-ID: <efcb97e0-8920-50f7-80d7-e72d7403a4fe@linux.vnet.ibm.com>

Hi Martin,

On 01/17/2019 11:18 AM, Doerr, Martin wrote:
> Hi,
> 
> the rebased webrev.01 applies on jdk/jdk, now (after JDK-8216376). So the issue Gustavo had observed does not longer exist.
> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/
> 
> I have updated copyrights and retested it.

I tested it when JDK-8216376 was submitted for review, but I retested
this rebase from webrev.01 again on top of the most recent changes
again (just in case) and all looks fine to me.

Thank you.

Best regards,
Gustavo

> Best regards,
> Martin
> 
> 
> -----Original Message-----
> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> Sent: Montag, 7. Januar 2019 14:52
> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
> 
> Hi Martin,
> 
> On 01/07/2019 11:49 AM, Doerr, Martin wrote:
>> I want to check all places where we use "mr(R1_SP, R21_sender_SP)". There may be more issues with that. I'll probably handle that in a separate change and push this CRC change afterwards.
> 
> I see. Thanks for letting me know.
> 
> Best regards,
> Gustavo
> 
>> Best regards,
>> Martin
>>
>>
>> -----Original Message-----
>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>> Sent: Freitag, 4. Januar 2019 19:55
>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>
>> Hi Martin,
>>
>> On 01/04/2019 02:13 PM, Doerr, Martin wrote:
>>> Hi Gustavo,
>>>
>>> when called from the interpreter (the scenario you observed), R21 is set before resizing the frame to avoid wasted stack space (InterpreterMacroAssembler::call_from_interpreter).
>>
>> Got it. Thanks a lot for the explanations.
>>
>> I think it doesn't currently matter in practice, but I'm wondering if to be
>> consistent we should cut back the stack back earlier also in
>> TemplateInterpreterGenerator::generate_CRC32_update_entry()?
>>
>> diff -r a35f8c35d8c9 src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
>> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan 04 10:09:00 2019 +0100
>> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan 04 13:44:37 2019 -0500
>> @@ -1840,11 +1840,12 @@
>>     #endif
>>         __ lwz(crc,  2*wordSize, argP);    // Current crc state, zero extend to 64 bit to have a clean register.
>>     
>> +    // Restore caller sp for c2i case and return.
>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>> +
>>         StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
>>         __ kernel_crc32_singleByte(crc, data, dataLen, table, tmp, true);
>>     
>> -    // Restore caller sp for c2i case and return.
>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>         __ blr();
>>     
>>         // Generate a vanilla native entry as the slow path.
>>
>> Currently there is no issue probably because generated code is simpler and does
>> no spills.
>>
>> Best regards,
>> Gustavo
>>
>>> When called from compiled methods, R21 is set by a c2i adapter which extends the compiled frame by space for arguments (gen_c2i_adapter).
>>>
>>> "mr(R1_SP, R21_sender_SP)" is more error-prone than "resize_frame_absolute" so I think the latter would be better (though it takes more registers and instructions), but I don't want to replace that as part of this CRC change.
>>>
>>> Best regards,
>>> Martin
>>>
>>>
>>> -----Original Message-----
>>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>>> Sent: Freitag, 4. Januar 2019 14:44
>>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>>
>>> Hi Martin,
>>>
>>> On 01/04/2019 07:30 AM, Doerr, Martin wrote:
>>>> thank you very much for confirming. This makes sense. We use different frame headers depending on whether the frame is the top Java frame or not (and on whether it's a debug build or not). Setting R1_SP to sender_SP is a shortcut for leaf calls which relies on having an unmodified stack until this point. So the patch fixes the issue.
>>>
>>> Glad to help! Thanks for the additional information, I was not aware that the
>>> selection of different frame headers could be done at compile time. One last
>>> question only for my education: what exactly advanced (incremented) R1_SP so it
>>> has to be cut back using sender_SP value, i.e. sender_SP tracks the frame for
>>> which function exactly or "who" is the caller exactly here?
>>>
>>> Thank you.
>>>
>>> Best regards,
>>> Gustavo
>>>
>>>> New webrev:
>>>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/
>>>>
>>>> Best regards,
>>>> Martin
>>>>
>>>>
>>>> -----Original Message-----
>>>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>>>> Sent: Donnerstag, 3. Januar 2019 19:36
>>>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>>>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>>>
>>>> Hi Martin,
>>>>
>>>> On 01/03/2019 03:34 PM, Doerr, Martin wrote:
>>>>> Unfortunately, I can't reproduce the crash. TestCRC32C works stable on our machine (with fastdbg build).
>>>>> I guess that the frameless spills mess up the stack. Can you check if the patch below helps?
>>>>
>>>> Thanks for providing a fix so I can try it.
>>>> Yes, I confirm the patch below indeed fixes the sigsegv crash when CRC32C update() method is used.
>>>> I also confirm that I don't observe the crash on the fastdebug build, only on the release build.
>>>> It also only affects the Interpreter mode, so passing -Xcomp avoids the crash on the release build.
>>>>
>>>> Just as reference, I can reproduce it on the release build with the following trivial code:
>>>>
>>>> import java.util.zip.CRC32C;
>>>>
>>>> class CRC32C_v1 {
>>>>        public static void main(String[] arg) {
>>>>          byte[] b = new byte[1024];
>>>>        
>>>>          CRC32C crc32c = new CRC32C();
>>>>          crc32c.update(b, 0, b.length);
>>>>
>>>>          System.out.println(crc32c.getValue());
>>>>        }
>>>> }
>>>>
>>>> Thanks for fixing the typos.
>>>>
>>>>
>>>> Best regards,
>>>> Gustavo
>>>>       
>>>>> Best regards,
>>>>> Martin
>>>>>
>>>>>
>>>>> diff -r a33f49d5998c src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
>>>>> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 17:30:03 2019 +0100
>>>>> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu Jan 03 18:33:16 2019 +0100
>>>>> @@ -1924,6 +1924,9 @@
>>>>>             __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>>>>>           }
>>>>>
>>>>> +    // Restore caller sp for c2i case.
>>>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>>> +
>>>>>           StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
>>>>>
>>>>>           if (!VM_Version::has_vpmsumb()) {
>>>>> @@ -1933,8 +1936,6 @@
>>>>>             __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, true);
>>>>>           }
>>>>>
>>>>> -    // Restore caller sp for c2i case and return.
>>>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>>>           __ blr();
>>>>>
>>>>>           // Generate a vanilla native entry as the slow path.
>>>>> @@ -2014,6 +2015,9 @@
>>>>>             __ addi(data, data, arrayOopDesc::base_offset_in_bytes(T_BYTE));
>>>>>           }
>>>>>
>>>>> +    // Restore caller sp for c2i case.
>>>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>>> +
>>>>>           StubRoutines::ppc64::generate_load_crc32c_table_addr(_masm, table);
>>>>>
>>>>>           if (!VM_Version::has_vpmsumb()) {
>>>>> @@ -2023,8 +2027,6 @@
>>>>>             __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3, tc0, tc1, tc2, false);
>>>>>           }
>>>>>
>>>>> -    // Restore caller sp for c2i case and return.
>>>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller started.
>>>>>           __ blr();
>>>>>
>>>>>           BLOCK_COMMENT("} CRC32C_update{Bytes|DirectByteBuffer}");
>>>>>
>>>>>
>>>>> -----Original Message-----
>>>>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>>>>> Sent: Donnerstag, 3. Januar 2019 17:13
>>>>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>>>>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays
>>>>>
>>>>> Hi Martin,
>>>>>
>>>>> oh that's nice. You removed the 512-byte block constraint and also wired it up to the Interpreter :)
>>>>>
>>>>> For the worst case, unaligned 512 byte array, I see the gap to aligned 512 byte array reduced by about ~5.7x.
>>>>>
>>>>> On the Interpreter I see an improvement of at least 50% for 1024 bytes.
>>>>>
>>>>> This is all for the CRC32 class.
>>>>>
>>>>> On CRC32C I'm getting a SIGSEV that can be reproduced running against ./test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java.
>>>>>
>>>>> I've upload a full log into http://cr.openjdk.java.net/~gromero/logs/crc32c_sigsegv/
>>>>>
>>>>> I'm leaving for the lunch and I'll take a closer look when back. But probably you will figure it out before I sit to appreciate the meal :)
>>>>>
>>>>> Finally, since the change does some cleanup, I wonder if it would be worth fixing the following typos:
>>>>>
>>>>> I think it's Barrett const., not Barret. Probably 'barret' is used in the code as a short version
>>>>> for Barrett but it should be changed in
>>>>>
>>>>> +  // Point to Barret constants
>>>>> +  add_const_optimized(cur_const, constants, outer_consts_size + inner_consts_size);
>>>>> +
>>>>>
>>>>> ?
>>>>>
>>>>> s/not/note/ in:
>>>>> cpu/ppc/macroAssembler_ppc.cpp:3977:// A not on the lookup table address(es):
>>>>>
>>>>> d/lives/ in:
>>>>> cpu/ppc/macroAssembler_ppc.cpp:4265:  mtvrwz(VCRC, crc); // crc lives lives in VCRC, now
>>>>>
>>>>> Best regards,
>>>>> Gustavo
>>>>>
>>>>> On 01/03/2019 12:17 PM, Doerr, Martin wrote:
>>>>>> Hi,
>>>>>>
>>>>>> the JVM on PPC64 currently misses usage of the fast vector implementation in the interpreter code.
>>>>>>
>>>>>> In addition, performance is not good for short arrays (unaligned 512 byte arrays or shorter arrays) because the current vector implementation needs at least 512 bytes.
>>>>>>
>>>>>> Bug:
>>>>>>
>>>>>> https://bugs.openjdk.java.net/browse/JDK-8216060
>>>>>>
>>>>>> I have addressed these 2 issues + some cleanup with the following webrev:
>>>>>>
>>>>>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8216060_PPC64_CRC/webrev.00/>
>>>>>>
>>>>>> Please review.
>>>>>>
>>>>>> Best regards,
>>>>>>
>>>>>> Martin
>>>>>>
>>>>>
>>>>
>>>
>>
> 


From vladimir.kozlov at oracle.com  Thu Jan 17 18:12:42 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Thu, 17 Jan 2019 10:12:42 -0800
Subject: RFR (XS): 8217266: Remove dead LIR_List::compare_to and
 LIR_Code::lir_compare_to
In-Reply-To: <877ef3wunv.fsf@redhat.com>
References: <39E5F5F4-598E-46C8-8BAD-C95D16439EDA@oracle.com>
 <877ef3wunv.fsf@redhat.com>
Message-ID: <c92240e6-569d-d655-787c-6979b4e3698d@oracle.com>

+1

Vladimir

On 1/17/19 12:33 AM, Roland Westrelin wrote:
> 
>> webrev: http://cr.openjdk.java.net/~mikael/webrevs/8217266/webrev.00/open/webrev/ <http://cr.openjdk.java.net/~mikael/webrevs/8217266/webrev.00/open/webrev/>
> 
> That looks good to me.
> 
> Roland.
> 

From vladimir.kozlov at oracle.com  Thu Jan 17 18:37:13 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Thu, 17 Jan 2019 10:37:13 -0800
Subject: [12] RFR(S) 8215317: [GRAAL] unit test CheckGraalIntrinsics
 failed after 8213754
In-Reply-To: <0af27551-f6f6-28b9-b4b6-cae6427fefad@linux.vnet.ibm.com>
References: <5df6d8d0-4fba-4515-ffee-870cf8cff9d3@oracle.com>
 <23eb2db4-fc81-e5e4-4c8f-67a617426353@linux.vnet.ibm.com>
 <7de3a2e1-5dc2-2d23-aec9-92085d7d7cff@oracle.com>
 <0af27551-f6f6-28b9-b4b6-cae6427fefad@linux.vnet.ibm.com>
Message-ID: <a8d73bdb-b589-fe80-902e-9e21c504d2de@oracle.com>

On 1/17/19 6:07 AM, Gustavo Romero wrote:
> Hi Vladimir,
> 
> On 01/16/2019 11:40 PM, Vladimir Kozlov wrote:
>> You should combine both changes and request them at the same time. Changeset will have to list 
>> both changes. Originally your changes should include CheckGraalIntrinsics.java fix.
> 
> Thanks a lot for advising. Just one question: by "Changeset will have to
> list both changes" you mean the commit title (or body) must say explicitly
> the change is a combination of 8213754 + 821531?

Separate lines per bug subject. Here is example:

http://hg.openjdk.java.net/jdk/jdk/rev/c139884bd80e

8213348: jdk.internal.vm.compiler.management service providers missing in module descriptor
8211781: re-building fails after changing Graal sources
Reviewed-by: erikj, mchung

Vladimir

> 
> 
>> Note, for 8213754 corresponding Graal test fix is 8215317 (and not 8215687, that one is for 
>> 8212043 Math.min/max).
> 
> Thanks for the detailed note. I commented on the right thread but pasted
> the wrong bug!
> 
> Thank you and best regards,
> Gustavo
> 
>> Yes, Graal fix in jdk11u is different - new intrinsics should be listed under existing condition 
>> isJDK11OrHigher():
>>
>> http://hg.openjdk.java.net/jdk-updates/jdk11u/file/5fc74655f16d/src/jdk.internal.vm.compiler/share/classes/org.graalvm.compiler.hotspot.test/src/org/graalvm/compiler/hotspot/test/CheckGraalIntrinsics.java#l371 
>>
>>
>> Regards,
>> Vladimir
>>
>> On 1/16/19 1:57 PM, Gustavo Romero wrote:
>>> Hi Vladimir,
>>>
>>> I would like to request the approval to backport the change:
>>>
>>> 8213754: PPC64: Add Intrinsics for isDigit/isLowerCase/isUpperCase/isWhitespace
>>> https://bugs.openjdk.java.net/browse/JDK-8213754
>>>
>>> to jdk11u, but if it gets integrated before 8215687 it will break Graal
>>> test HotspotTest.java/CheckGraalIntrinsics.java again, as expected.
>>>
>>> Are you fine if I request the approval to backport first this change, i.e.
>>> 8215687?
>>>
>>> Actually I'll have to tweak a bit and s/isJDK12OrHigher/isJDK11OrHigher/,
>>> right?
>>>
>>> Thank you.
>>>
>>> Best regards,
>>> Gustavo
>>>
>>> On 12/13/2018 02:10 AM, Vladimir Kozlov wrote:
>>>> https://bugs.openjdk.java.net/browse/JDK-8215317
>>>>
>>>> JDK-8213754 added new intrinsics which cause Graal's unit test failure.
>>>>
>>>> CheckGraalIntrinsics test is adjusted for new intrinsics:
>>>>
>>>> src/jdk.internal.vm.compiler/share/classes/org.graalvm.compiler.hotspot.test/src/org/graalvm/compiler/hotspot/test/CheckGraalIntrinsics.java 
>>>>
>>>> @@ -376,6 +376,14 @@
>>>> ????????????????????????????? "jdk/jfr/internal/JVM.getEventWriter()Ljava/lang/Object;");
>>>> ????????? }
>>>>
>>>> +??????? if (isJDK12OrHigher()) {
>>>> +??????????? add(toBeInvestigated,
>>>> +??????????????????????????? "java/lang/CharacterDataLatin1.isDigit(I)Z",
>>>> +??????????????????????????? "java/lang/CharacterDataLatin1.isLowerCase(I)Z",
>>>> +??????????????????????????? "java/lang/CharacterDataLatin1.isUpperCase(I)Z",
>>>> +??????????????????????????? "java/lang/CharacterDataLatin1.isWhitespace(I)Z");
>>>> +??????? }
>>>> +
>>>> ????????? if (!config.inlineNotify()) {
>>>> ????????????? add(ignore, "java/lang/Object.notify()V");
>>>> ????????? }
>>>>
>>>> Tested tier1 and tier3-graal (where test is run).
>>>>
>>>> I also pushed changes into Lab's Graal repo so this test will be updated during next sync.
>>>> But I want to push fix into JDK because JDK 12 will be forked very soon.
>>>>
>>>
>>
> 

From vladimir.kozlov at oracle.com  Thu Jan 17 18:39:34 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Thu, 17 Jan 2019 10:39:34 -0800
Subject: RFR(S, tedious): 8217250: Optimize CodeHeap Analytics
In-Reply-To: <9AD26B9E-015A-4BD4-A44F-12DDE2793ED0@sap.com>
References: <D481567B-EA9D-4BD2-A55C-A0AF157F0EDD@sap.com>
 <c178e47c-279b-47f8-6876-2ad28593d630@oracle.com>
 <A2738EC3-3F36-4898-87F9-EC786EA19D46@sap.com>
 <2d7b7963-61be-95b8-017b-956f2752c8f3@oracle.com>
 <9AD26B9E-015A-4BD4-A44F-12DDE2793ED0@sap.com>
Message-ID: <4e61a8a5-3c6e-3c4f-0a2c-68d4b8bc2f9f@oracle.com>

Looks good

Thanks,
Vladimir

On 1/17/19 7:47 AM, Schmidt, Lutz wrote:
> Hi Vladimir & all,
> there is a new webrev available: http://cr.openjdk.java.net/~lucy/webrevs/8217250.01/
> What's new (in addition to some comments) is the macro
> 
>    // Flush the buffer contents if the remaining capacity is less
>    // than the calculated threshold (256 bytes + capacity/16)
>    // That should suffice for all reasonably sized output lines.
>    #define BUFFEREDSTREAM_FLUSH_AUTO(_termString)                \
>        BUFFEREDSTREAM_FLUSH_IF(_termString, 256+(_capacity>>4))
> 
> It replaced the previous BUFFEREDSTREAM_FLUSH_IF("string", 512) occurrences.
> Regards,
> Lutz
> 
> ?On 16.01.19, 22:53, "Vladimir Kozlov" <vladimir.kozlov at oracle.com> wrote:
> 
>      On 1/16/19 12:37 PM, Schmidt, Lutz wrote:
>      > Hi Vladimir,
>      >
>      > thanks a lot for looking at this so quickly.
>      >
>      > Sure, I could declare a specialized "BUFFEREDSTREAM_FLUSH_512" for this. The "512" originated from the thought "its large enough for a well-behaved line and small enough to save some flushes".
>      >
>      > I was also thinking about a "BUFFEREDSTREAM_FLUSH_AUTO", where the spare space is derived from the buffer capacity, maybe something like 10 percent of the capacity, 256 bytes minimum. I wasn't sure if that could be categorized as over-engineered.
>      
>      Yes, I think BUFFEREDSTREAM_FLUSH_AUTO is better than fixed size.
>      
>      Vladimir
>      
>      >
>      > Your thoughts?
>      >
>      > Thanks,
>      > Lutz
>      >
>      > On 16.01.19, 19:10, "hotspot-compiler-dev on behalf of Vladimir Kozlov" <hotspot-compiler-dev-bounces at openjdk.java.net on behalf of vladimir.kozlov at oracle.com> wrote:
>      >
>      >      Hi Lutz,
>      >
>      >      I see that you have only one usage in all cases for:
>      >      BUFFEREDSTREAM_FLUSH_IF("", 512)
>      >
>      >      Can you simple declare simplified macro for this?
>      >
>      >      Otherwise looks good.
>      >
>      >      Thanks,
>      >      Vladimir
>      >
>      >      On 1/16/19 6:52 AM, Schmidt, Lutz wrote:
>      >      > Dear all,
>      >      >
>      >      > may I please have reviews for this (semantically) small change. Its purpose is to reduce the bufferedStream buffer flushes while printing CodeHeap Analytics.
>      >      >
>      >      > Bug:    https://bugs.openjdk.java.net/browse/JDK-8217250
>      >      > Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217250.00/
>      >      >
>      >      > Thank you!
>      >      > Lutz
>      >      >
>      >      >
>      >
>      >
>      
> 

From gromero at linux.vnet.ibm.com  Thu Jan 17 18:44:01 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Thu, 17 Jan 2019 16:44:01 -0200
Subject: [12] RFR(S) 8215317: [GRAAL] unit test CheckGraalIntrinsics
 failed after 8213754
In-Reply-To: <a8d73bdb-b589-fe80-902e-9e21c504d2de@oracle.com>
References: <5df6d8d0-4fba-4515-ffee-870cf8cff9d3@oracle.com>
 <23eb2db4-fc81-e5e4-4c8f-67a617426353@linux.vnet.ibm.com>
 <7de3a2e1-5dc2-2d23-aec9-92085d7d7cff@oracle.com>
 <0af27551-f6f6-28b9-b4b6-cae6427fefad@linux.vnet.ibm.com>
 <a8d73bdb-b589-fe80-902e-9e21c504d2de@oracle.com>
Message-ID: <a5199bbb-6b6e-f2ca-cffc-2c12ae45852c@linux.vnet.ibm.com>

On 01/17/2019 04:37 PM, Vladimir Kozlov wrote:
> On 1/17/19 6:07 AM, Gustavo Romero wrote:
>> Hi Vladimir,
>>
>> On 01/16/2019 11:40 PM, Vladimir Kozlov wrote:
>>> You should combine both changes and request them at the same time. Changeset will have to list both changes. Originally your changes should include CheckGraalIntrinsics.java fix.
>>
>> Thanks a lot for advising. Just one question: by "Changeset will have to
>> list both changes" you mean the commit title (or body) must say explicitly
>> the change is a combination of 8213754 + 821531?
> 
> Separate lines per bug subject. Here is example:
> 
> http://hg.openjdk.java.net/jdk/jdk/rev/c139884bd80e
> 
> 8213348: jdk.internal.vm.compiler.management service providers missing in module descriptor
> 8211781: re-building fails after changing Graal sources
> Reviewed-by: erikj, mchung

Got it. I'll send the change to jdk-updates for review before
tagging the bugs with "jdk11u-fix-request" so.

Thanks, Vladimir.

Regards,
Gustavo
  
> Vladimir
> 
>>
>>
>>> Note, for 8213754 corresponding Graal test fix is 8215317 (and not 8215687, that one is for 8212043 Math.min/max).
>>
>> Thanks for the detailed note. I commented on the right thread but pasted
>> the wrong bug!
>>
>> Thank you and best regards,
>> Gustavo
>>
>>> Yes, Graal fix in jdk11u is different - new intrinsics should be listed under existing condition isJDK11OrHigher():
>>>
>>> http://hg.openjdk.java.net/jdk-updates/jdk11u/file/5fc74655f16d/src/jdk.internal.vm.compiler/share/classes/org.graalvm.compiler.hotspot.test/src/org/graalvm/compiler/hotspot/test/CheckGraalIntrinsics.java#l371
>>>
>>> Regards,
>>> Vladimir
>>>
>>> On 1/16/19 1:57 PM, Gustavo Romero wrote:
>>>> Hi Vladimir,
>>>>
>>>> I would like to request the approval to backport the change:
>>>>
>>>> 8213754: PPC64: Add Intrinsics for isDigit/isLowerCase/isUpperCase/isWhitespace
>>>> https://bugs.openjdk.java.net/browse/JDK-8213754
>>>>
>>>> to jdk11u, but if it gets integrated before 8215687 it will break Graal
>>>> test HotspotTest.java/CheckGraalIntrinsics.java again, as expected.
>>>>
>>>> Are you fine if I request the approval to backport first this change, i.e.
>>>> 8215687?
>>>>
>>>> Actually I'll have to tweak a bit and s/isJDK12OrHigher/isJDK11OrHigher/,
>>>> right?
>>>>
>>>> Thank you.
>>>>
>>>> Best regards,
>>>> Gustavo
>>>>
>>>> On 12/13/2018 02:10 AM, Vladimir Kozlov wrote:
>>>>> https://bugs.openjdk.java.net/browse/JDK-8215317
>>>>>
>>>>> JDK-8213754 added new intrinsics which cause Graal's unit test failure.
>>>>>
>>>>> CheckGraalIntrinsics test is adjusted for new intrinsics:
>>>>>
>>>>> src/jdk.internal.vm.compiler/share/classes/org.graalvm.compiler.hotspot.test/src/org/graalvm/compiler/hotspot/test/CheckGraalIntrinsics.java
>>>>> @@ -376,6 +376,14 @@
>>>>> ????????????????????????????? "jdk/jfr/internal/JVM.getEventWriter()Ljava/lang/Object;");
>>>>> ????????? }
>>>>>
>>>>> +??????? if (isJDK12OrHigher()) {
>>>>> +??????????? add(toBeInvestigated,
>>>>> +??????????????????????????? "java/lang/CharacterDataLatin1.isDigit(I)Z",
>>>>> +??????????????????????????? "java/lang/CharacterDataLatin1.isLowerCase(I)Z",
>>>>> +??????????????????????????? "java/lang/CharacterDataLatin1.isUpperCase(I)Z",
>>>>> +??????????????????????????? "java/lang/CharacterDataLatin1.isWhitespace(I)Z");
>>>>> +??????? }
>>>>> +
>>>>> ????????? if (!config.inlineNotify()) {
>>>>> ????????????? add(ignore, "java/lang/Object.notify()V");
>>>>> ????????? }
>>>>>
>>>>> Tested tier1 and tier3-graal (where test is run).
>>>>>
>>>>> I also pushed changes into Lab's Graal repo so this test will be updated during next sync.
>>>>> But I want to push fix into JDK because JDK 12 will be forked very soon.
>>>>>
>>>>
>>>
>>
> 


From lutz.schmidt at sap.com  Thu Jan 17 19:45:13 2019
From: lutz.schmidt at sap.com (Schmidt, Lutz)
Date: Thu, 17 Jan 2019 19:45:13 +0000
Subject: RFR(S, tedious): 8217250: Optimize CodeHeap Analytics
In-Reply-To: <4e61a8a5-3c6e-3c4f-0a2c-68d4b8bc2f9f@oracle.com>
References: <D481567B-EA9D-4BD2-A55C-A0AF157F0EDD@sap.com>
 <c178e47c-279b-47f8-6876-2ad28593d630@oracle.com>
 <A2738EC3-3F36-4898-87F9-EC786EA19D46@sap.com>
 <2d7b7963-61be-95b8-017b-956f2752c8f3@oracle.com>
 <9AD26B9E-015A-4BD4-A44F-12DDE2793ED0@sap.com>
 <4e61a8a5-3c6e-3c4f-0a2c-68d4b8bc2f9f@oracle.com>
Message-ID: <FDEAF982-5BE7-4036-9838-2154C98F85A2@sap.com>

Thank you, Vladimir!
Have a great day!
Lutz

?On 17.01.19, 19:39, "Vladimir Kozlov" <vladimir.kozlov at oracle.com> wrote:

    Looks good
    
    Thanks,
    Vladimir
    
    On 1/17/19 7:47 AM, Schmidt, Lutz wrote:
    > Hi Vladimir & all,
    > there is a new webrev available: http://cr.openjdk.java.net/~lucy/webrevs/8217250.01/
    > What's new (in addition to some comments) is the macro
    > 
    >    // Flush the buffer contents if the remaining capacity is less
    >    // than the calculated threshold (256 bytes + capacity/16)
    >    // That should suffice for all reasonably sized output lines.
    >    #define BUFFEREDSTREAM_FLUSH_AUTO(_termString)                \
    >        BUFFEREDSTREAM_FLUSH_IF(_termString, 256+(_capacity>>4))
    > 
    > It replaced the previous BUFFEREDSTREAM_FLUSH_IF("string", 512) occurrences.
    > Regards,
    > Lutz
    > 
    > On 16.01.19, 22:53, "Vladimir Kozlov" <vladimir.kozlov at oracle.com> wrote:
    > 
    >      On 1/16/19 12:37 PM, Schmidt, Lutz wrote:
    >      > Hi Vladimir,
    >      >
    >      > thanks a lot for looking at this so quickly.
    >      >
    >      > Sure, I could declare a specialized "BUFFEREDSTREAM_FLUSH_512" for this. The "512" originated from the thought "its large enough for a well-behaved line and small enough to save some flushes".
    >      >
    >      > I was also thinking about a "BUFFEREDSTREAM_FLUSH_AUTO", where the spare space is derived from the buffer capacity, maybe something like 10 percent of the capacity, 256 bytes minimum. I wasn't sure if that could be categorized as over-engineered.
    >      
    >      Yes, I think BUFFEREDSTREAM_FLUSH_AUTO is better than fixed size.
    >      
    >      Vladimir
    >      
    >      >
    >      > Your thoughts?
    >      >
    >      > Thanks,
    >      > Lutz
    >      >
    >      > On 16.01.19, 19:10, "hotspot-compiler-dev on behalf of Vladimir Kozlov" <hotspot-compiler-dev-bounces at openjdk.java.net on behalf of vladimir.kozlov at oracle.com> wrote:
    >      >
    >      >      Hi Lutz,
    >      >
    >      >      I see that you have only one usage in all cases for:
    >      >      BUFFEREDSTREAM_FLUSH_IF("", 512)
    >      >
    >      >      Can you simple declare simplified macro for this?
    >      >
    >      >      Otherwise looks good.
    >      >
    >      >      Thanks,
    >      >      Vladimir
    >      >
    >      >      On 1/16/19 6:52 AM, Schmidt, Lutz wrote:
    >      >      > Dear all,
    >      >      >
    >      >      > may I please have reviews for this (semantically) small change. Its purpose is to reduce the bufferedStream buffer flushes while printing CodeHeap Analytics.
    >      >      >
    >      >      > Bug:    https://bugs.openjdk.java.net/browse/JDK-8217250
    >      >      > Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217250.00/
    >      >      >
    >      >      > Thank you!
    >      >      > Lutz
    >      >      >
    >      >      >
    >      >
    >      >
    >      
    > 
    

From bsrbnd at gmail.com  Thu Jan 17 19:51:06 2019
From: bsrbnd at gmail.com (B. Blaser)
Date: Thu, 17 Jan 2019 20:51:06 +0100
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <c64262ae-8d2f-f773-3688-f1c40ad23f10@redhat.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
 <ba49e1b8-b7c8-b804-f973-6e88cb2bcd8a@redhat.com>
 <CAEgw74DXHF4cmVzOkHtvGuO937QqaypRFNvuB5sfDw3h8HOUPQ@mail.gmail.com>
 <e5697cd2-49d6-724e-1b63-41de0e771965@redhat.com>
 <b16d13b4-40f4-1503-765a-82799acf1527@redhat.com>
 <a4e64632-1187-1b3a-8c88-ac4b039acc71@redhat.com>
 <81dc5407-4985-883c-83fc-2e1fa2b77e66@redhat.com>
 <CAEgw74ANZ2CKpd=ivKRn5mE61EG_YqpfBbPoMpV=e=3G_ABYpw@mail.gmail.com>
 <c64262ae-8d2f-f773-3688-f1c40ad23f10@redhat.com>
Message-ID: <CAEgw74DR0mw1avCmyG+k=q=X9gGCsJYSQtp+Tki4nUfcsMKF0Q@mail.gmail.com>

On Thu, 17 Jan 2019 at 10:17, Andrew Haley <aph at redhat.com> wrote:
>
> On 1/16/19 8:46 PM, B. Blaser wrote:
>
> > To answer Andrew Haley, one of the major difference between CISC and
> > RISC is specifically the load/store architecture of the latter which
> > is part of most instructions of the former; I don't see many good
> > reasons to generate RISC-like load/store code using only a subset of
> > instructions and to juggle with registers.
>
> Well, yes, but the question remains: does this change actually help
> anything. And if it does, by how much?

Here it is on intel xeon with 5*10e9 iterations:
* mov+cmov = 10.94s
* cmov = 10.15s

Thoughts?

Thanks,
Bernard

$ cat cmov.c
// $ gcc -S cmov.c
// $ cat cmov.s
// $ gcc cmov.s
// $ time ./a.out

#include<time.h>
#include<stdio.h>

void main() {
    struct timespec start, stop;
    clock_gettime(CLOCK_THREAD_CPUTIME_ID, &start);

    for (long i=0; i<5000000000L; i++) {
        asm ("clc");

        asm ("movq -8(%rbp), %rbx");
        asm ("cmovncq %rbx, %rax");

//        asm ("cmovncq -8(%rbp), %rax");
    }

    clock_gettime(CLOCK_THREAD_CPUTIME_ID, &stop);

    long t = ((long)stop.tv_sec) * 1000000000L + stop.tv_nsec;
    t -= ((long)start.tv_sec) * 1000000000L + start.tv_nsec;

    printf("nsec: %ld\n", t);
}
$ gcc -S cmov.c
$ gcc cmov.s
$ time ./a.out
nsec: 10942890857

real    0m10.951s
user    0m10.941s
sys    0m0.003s
$ cat cmov.c
[...]
    for (long i=0; i<5000000000L; i++) {
        asm ("clc");

//        asm ("movq -8(%rbp), %rbx");
//        asm ("cmovncq %rbx, %rax");

        asm ("cmovncq -8(%rbp), %rax");
    }
[...]
$ gcc -S cmov.c
$ gcc cmov.s
$ time ./a.out
nsec: 10149026430

real    0m10.157s
user    0m10.150s
sys    0m0.001s

From mikael.vidstedt at oracle.com  Thu Jan 17 21:54:11 2019
From: mikael.vidstedt at oracle.com (Mikael Vidstedt)
Date: Thu, 17 Jan 2019 13:54:11 -0800
Subject: RFR (XS): 8217266: Remove dead LIR_List::compare_to and
 LIR_Code::lir_compare_to
In-Reply-To: <c92240e6-569d-d655-787c-6979b4e3698d@oracle.com>
References: <39E5F5F4-598E-46C8-8BAD-C95D16439EDA@oracle.com>
 <877ef3wunv.fsf@redhat.com> <c92240e6-569d-d655-787c-6979b4e3698d@oracle.com>
Message-ID: <A8548856-537E-4777-BF55-FBDA06907039@oracle.com>


Roland/Vladimir, thanks for the reviews. Change pushed.

Cheers,
Mikael

> On Jan 17, 2019, at 10:12 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
> 
> +1
> 
> Vladimir
> 
> On 1/17/19 12:33 AM, Roland Westrelin wrote:
>>> webrev: http://cr.openjdk.java.net/~mikael/webrevs/8217266/webrev.00/open/webrev/ <http://cr.openjdk.java.net/~mikael/webrevs/8217266/webrev.00/open/webrev/>
>> That looks good to me.
>> Roland.


From vladimir.x.ivanov at oracle.com  Thu Jan 17 22:35:15 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Thu, 17 Jan 2019 14:35:15 -0800
Subject: Why does call_site_target keep changing for a Nashorn method?
In-Reply-To: <616C8E42-4B18-405B-B28A-C9F062EC9B6C@amazon.com>
References: <616C8E42-4B18-405B-B28A-C9F062EC9B6C@amazon.com>
Message-ID: <186d49e7-daa9-a0db-b0c6-1b9d4ff2adda@oracle.com>

C1/C2 optimistically inline through CallSite instances even if those are 
mutable (MutableCallSite/VolatileCallSite). It requires a nmethod 
dependency and once CallSite target changes, all dependent nmethods 
should be invalidated. If such change happens during compilation, 
nmethod installation fails.

That's exactly what you observe: the dependency is recorded during 
inlining, but failed verification during installation.

Regarding the observed behavior, it is well-known [1] [2] and was a 
deliberate choice. As JDK-7087838 [1] states:

"The consensus among language runtime implementors is that they want 
control over switch points (and thus call sites) and so it's their 
responsibility to handle extensive invalidation of such."

So, such pathological behavior is treated as a bug in user code (Nashorn 
in this particular case).

There's an RFE filed [3] to consider alternative options for unstable 
calls.

Best regards,
Vladimir Ivanov

[1] https://bugs.openjdk.java.net/browse/JDK-7087838
[2] https://bugs.openjdk.java.net/browse/JDK-7177745
[3] https://bugs.openjdk.java.net/browse/JDK-8147550

On 16/01/2019 14:04, Liu, Xin wrote:
> In one of our applications, C1/C2 keeps compiling a Javascript method 
> generated by Nashorn but the code fails a dependency check right before 
> installing in the code cache. This is with JDK tip.
> 
> It can?t pass ?Dependencies::check_call_site_target_value?.
> 
> [C2 Parsing]
> 
> <bc code='182' bci='1'/>
> 
> <dependency type='unique_concrete_method' ctxk='1131' x='1141'/>
> 
> <call method='1141' count='878838' prof_factor='0.024122' inline='1'/>
> 
> <inline_success reason='accessor'/>
> 
> <parse method='1141' uses='21249.000000' stamp='1112.538'>
> 
> <bc code='180' bci='1'/>
> 
> <unknown id='1556'/>
> 
> <unknown id='1866'/>
> 
> <dependency type='call_site_target_value' x0='1556' x='1866'/>
> 
> <parse_done nodes='41636' live='12226' memory='12130984' stamp='1112.538'/>
> 
> </parse>
> 
> [Validating compilation dependencies]
> 
> <dependency type='call_site_target_value' x0='1132' x='1143'/>
> 
> <dependency type='call_site_target_value' x0='1334' x='1337'/>
> 
> <dependency type='call_site_target_value' x0='1424' x='1425'/>
> 
> <dependency type='call_site_target_value' x0='1437' x='1438'/>
> 
> <dependency type='call_site_target_value' x0='1454' x='1455'/>
> 
> <dependency type='call_site_target_value' x0='1465' x='1466'/>
> 
> <dependency type='call_site_target_value' x0='1482' x='1483'/>
> 
> <dependency type='call_site_target_value' x0='1498' x='1499'/>
> 
> <dependency type='call_site_target_value' x0='1509' x='1510'/>
> 
> <dependency type='call_site_target_value' x0='1526' x='1576'/>
> 
> <dependency type='call_site_target_value' x0='1528' x='1667'/>
> 
> <dependency type='call_site_target_value' x0='1536' x='1692'/>
> 
> <dependency type='call_site_target_value' x0='1537' x='1707'/>
> 
> <dependency type='call_site_target_value' x0='1538' x='1730'/>
> 
> <dependency type='call_site_target_value' x0='1539' x='1746'/>
> 
> <dependency type='call_site_target_value' x0='1540' x='1787'/>
> 
> <dependency type='call_site_target_value' x0='1550' x='1804'/>
> 
> <dependency type='call_site_target_value' x0='1553' x='1820'/>
> 
> <dependency type='call_site_target_value' x0='1554' x='1836'/>
> 
> <dependency type='call_site_target_value' x0='1555' x='1849'/>
> 
> <dependency type='call_site_target_value' x0='1556' x='1866'/>
> 
> <dependency_failed type='call_site_target_value' x0='1556' x='1866' 
> witness='jdk/nashorn/internal/runtime/linker/LinkerCallSite' 
> stamp='1113.578'/>
> 
> It?s related to the GWT methodHandle. ?The 2 mismatched methodhandles 
> are very similar except for argL3, which is an int[2].
> 
> Even though arg0-2 are not identical objects, their contents are same.
> 
> (gdb)call java_lang_invoke_CallSite::target(call_site)->print()
> 
> java.lang.invoke.BoundMethodHandle$Species_LLLL
> 
> {0x00000000f586ca98}- 
> klass:'java/lang/invoke/BoundMethodHandle$Species_LLLL'
> 
> - ---- fields(total size 6 words):
> 
> -'customizationCount''B'@12 0
> 
> - private final'type''Ljava/lang/invoke/MethodType;'@16 
> a'java/lang/invoke/MethodType'{0x00000000e21e2878}=(Ljava/lang/Object;Ljdk/nashorn/internal/runtime/Undefined;Ljava/lang/Object;)Ljava/lang/Object;(e21e2878)
> 
> - final'form''Ljava/lang/invoke/LambdaForm;'@20 
> a'java/lang/invoke/LambdaForm'{0x00000000e1e4a670}=>a'java/lang/invoke/MemberName'{0x00000000e1e4a938}={method}{0x00007fffa512cb68}'guard''(Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;)Ljava/lang/Object;'in'java/lang/invoke/LambdaForm$MH'(e1e4a670)
> 
> -'asTypeCache''Ljava/lang/invoke/MethodHandle;'@24 NULL(0)
> 
> - final'argL0''Ljava/lang/Object;'@28 
> a'java/lang/invoke/BoundMethodHandle$Species_LL'{0x00000000f586c9e8}(f586c9e8)
> 
> - final'argL1''Ljava/lang/Object;'@32 
> a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f586ca28}(f586ca28)
> 
> - final'argL2''Ljava/lang/Object;'@36 
> a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f586ca60}(f586ca60)
> 
> - final'argL3''Ljava/lang/Object;'@40 [I{0x00000000f586ca10}(f586ca10)
> 
> (gdb)call method_handle->print()
> 
> java.lang.invoke.BoundMethodHandle$Species_LLLL
> 
> {0x00000000f6b18500}- 
> klass:'java/lang/invoke/BoundMethodHandle$Species_LLLL'
> 
> - ---- fields(total size 6 words):
> 
> -'customizationCount''B'@12 0
> 
> - private final'type''Ljava/lang/invoke/MethodType;'@16 
> a'java/lang/invoke/MethodType'{0x00000000e21e2878}=(Ljava/lang/Object;Ljdk/nashorn/internal/runtime/Undefined;Ljava/lang/Object;)Ljava/lang/Object;(e21e2878)
> 
> - final'form''Ljava/lang/invoke/LambdaForm;'@20 
> a'java/lang/invoke/LambdaForm'{0x00000000e1e4a670}=>a'java/lang/invoke/MemberName'{0x00000000e1e4a938}={method}{0x00007fffa512cb68}'guard''(Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;)Ljava/lang/Object;'in'java/lang/invoke/LambdaForm$MH'(e1e4a670)
> 
> -'asTypeCache''Ljava/lang/invoke/MethodHandle;'@24 NULL(0)
> 
> - final'argL0''Ljava/lang/Object;'@28 
> a'java/lang/invoke/BoundMethodHandle$Species_LL'{0x00000000f6b18450}(f6b18450)
> 
> - final'argL1''Ljava/lang/Object;'@32 
> a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f6b18490}(f6b18490)
> 
> - final'argL2''Ljava/lang/Object;'@36 
> a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f6b184c8}(f6b184c8)
> 
> - final'argL3''Ljava/lang/Object;'@40 [I{0x00000000f6b18478}(f6b18478)
> 
> My guess is argL3 is counters in Java.lang.invoke.MethodHandleImpl.
> 
> // Intrinsified by C2. Counters are used during parsing to calculate 
> branch frequencies.
> @LambdaForm.Hidden
> @jdk.internal.HotSpotIntrinsicCandidate
> static
> boolean profileBoolean(boolean result, int[] counters) {
> // Profile is int[2] where [0] and [1] correspond to false and true 
> occurrences respectively.
> int idx = result ? 1 : 0;
>  ??? try {
>  ??????? counters[idx] = Math./addExact/(counters[idx], 1);
> } catch (ArithmeticException e) {
> // Avoid continuous overflow by halving the problematic count.
> counters[idx] = counters[idx] / 2;
> }
> return result;
> }
> 
> I am still struggling to understand the source code in 
> java.lang.invoke.*. ?Could anybody enlighten me why the target of the 
> callsite changes every time here? ?it is relative to this profiling thing?
> 
> In validation log, it has validated the dep ?dependency 
> type='call_site_target_value' x0='1556' x='1866'? above. Why it can?t 
> pass it after then? My guess is one MH object has been changed by 
> another Java thread.
> 
> One interesting fact that compiler thread can?t pass 22^th dep.? My 
> tuition is it goes over an unknown threshold.
> 
> The 2nd question is about ciEnv:: validate_compile_task_dependencies. 
>  ?Why does failure of call_site_target_value_changed not count as a deopt?
> 
> The flag??_inc_decompile_count_on_failure =false stops MDO to mark this 
> method ?not_compileable?. ?C2 doesn?t set the flag, so C2 ends up 
> compiling it over and over, which makes C2 a cpu hog. Here?s the code in 
> validate_compile_task_dependencies
> 
>  ? bool counter_changed = system_dictionary_modification_counter_changed();
> 
>  ? Dependencies::DepType result = 
> dependencies()->validate_dependencies(_task, counter_changed);
> 
>  ? if (result != Dependencies::end_marker) {
> 
>  ??? if (result == Dependencies::call_site_target_value) {
> 
>  ????? _inc_decompile_count_on_failure = false;
> 
>  ????? record_failure("call site target change");
> 
> Maybe the right thing to do is to count this as a deopt and change the 
> deopt limit computation to take into account the size of the method in 
> nodes, just as done for abandoning compilation if the graph is too big.
> 
> Thanks,
> 
> --lx
> 

From sandhya.viswanathan at intel.com  Fri Jan 18 01:13:35 2019
From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya)
Date: Fri, 18 Jan 2019 01:13:35 +0000
Subject: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over
 non-volatile memory
In-Reply-To: <21ef0e11-3f3d-e9a4-5dc6-898d4ac18efa@redhat.com>
References: <07d4d0fe-3a2a-414c-74ac-69e8527785d9@redhat.com>
 <ad813d34-3ebd-6f66-332a-06b9446367c0@redhat.com>
 <c4fdca2f-7c6a-7429-6b1a-efdff2994bf6@oracle.com>
 <3df3b5cd-dbc7-7fdd-bfc5-2a54d11127da@redhat.com>
 <27c9458d-7257-378a-4e3a-bd03402794be@oracle.com>
 <df03ca64-d68a-ca8b-5309-395672ac7dcb@redhat.com>
 <09c92b1a-c6da-e16b-bb68-553876e8a6ea@oracle.com>
 <a68f718e-9f87-f7e5-515c-f63d827fedaa@redhat.com>
 <ed8fd92d-fd4e-dc30-af8c-8d30da37560c@oracle.com>
 <b19ba189-0141-3e25-8301-48f2d3ce3851@redhat.com>
 <b3f402a3-65ba-b776-cc9e-40ceaaeafefd@redhat.com>
 <f21dac0b-0043-7e66-72a4-c64a45fb59d3@redhat.com>
 <2a0a385d-81ee-df66-f147-e4dd9aa5b72e@oracle.com>
 <8b2ab749-20f1-8dd3-3cc7-64db5d45bc7d@redhat.com>
 <0aae37aa-7797-fde5-63d5-96c8eb961183@oracle.com>
 <86a1988a-a8d2-b6af-0985-11a94d6d76a5@redhat.com>
 <69510788-52e6-815b-1ed7-a6f4886d0398@oracle.com>
 <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>
 <34cfc530-8517-ac1a-0c04-446dc3dc2436@oracle.com>
 <21ef0e11-3f3d-e9a4-5dc6-898d4ac18efa@redhat.com>
Message-ID: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A50E3B@FMSMSX126.amr.corp.intel.com>

Hi Andrew,

>>> Perhaps Intel also could provide help with testing? [Sadhya, is this an option?]

Yes, we can help with testing this feature as needed.

Best Regards,
Sandhya

-----Original Message-----
From: Andrew Dinn [mailto:adinn at redhat.com] 
Sent: Thursday, January 17, 2019 6:28 AM
To: Alan Bateman <Alan.Bateman at oracle.com>; Brian Goetz <brian.goetz at oracle.com>
Cc: core-libs-dev at openjdk.java.net; hotspot compiler <hotspot-compiler-dev at openjdk.java.net>; Jonathan Halliday <jonathan.halliday at redhat.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; Mikael Vidstedt <mikael.vidstedt at oracle.com>
Subject: Re: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over non-volatile memory

Hi Alan,

Thanks for your response.

On 17/01/2019 12:53, Alan Bateman wrote:
> I skimmed through the current draft. In the most recent discussion 
> then I think we had converged on "SYNC" rather than "PERSISTENT", the 
> reasoning being that there is persistence already with regular file 
> mapped files, also it aligns with the MAP_SYNC flag to mmap. I don't 
> recall if the discussion on isPersistent concluded, that was more of a 
> naming issue and whether you include an isXXX method or not is not 
> critical to the proposal. The overload of the force method to specify 
> a range is a good addition, irrespective of the JEP.

Ok, thanks. At least sync is now being used consistently in the public API. I will look at renaming internal vars/methods to use sync when I publish the next webrev.

> One thing to clarify is the heading "Proposed Restricted Public JDK 
> API Changes". The proposal (and the early webrevs) exposed 
> writebackMemory in the internal Unsafe, not sun.misc.Unsafe, which I think is right.
> This makes it a JDK internal API so it doesn't need to be in JEP.


I am happy to remove it from the JEP if needed. Does it do any harm to leave it?

> Did you get any feedback on the Testing section? Given that the 
> feature needs special hardware then it will need commitment to test is 
> on a regular basis. It's a similar issue to the draft "JEP 337: RDMA 
> Network Sockets" where special hardware is needed to full test the 
> feature. In the case of JEP 337 then some testing with emulation is possible.

I believe I received no specific feedback on that topic.

Some of the other Red Hat dev teams (i.e. not OpenJDK) and also dev staff at Intel are very keen to base some of their future work on this feature. So, it will certainly get tested /after/ JDK release :-)

Red Hat does have the Intel hardware needed to test this feature but, so far, nothing that can be used to test on AArch64. Our OpenJDK team can access this kit for one-off testing but it is not currently available for continuous integration testing.

I will propose to my manager that we acquire the relevant kit and ensure that all JDKs which implement this JEP are tested prior to release. We should also be able to test AArch64 using volatile memory to simulate a non-volatile memory device up to the point where the requisite AArch64-based NVM hardware becomes available. I am fairly confident this plan will be agreeable to the overlords whom I humbly serve.

Perhaps Intel also could provide help with testing? [Sadhya, is this an option?]

My bigger concern was that crash recovery tests may never be 100% reliable. A 100% guarantee requires the ability to engineer a machine crash at a precisely defined critical point of execution and some of the relevant critical locations will be embedded in the middle of JITted code making it hard to provoke the crash. So, the situations where a crash /can/ be engineered may not fully reflect those that occur in live deployments. That said, a dash of artificiality in test code is, perhaps, not so worthy of remark . . .

> Vladimir and I have reviewed the JEP, it will need an area lead to 
> endorse, I think it can be Brian or Mikael in this case.
Ok, thanks for the above answers. Looking forward to hearing further from Brian and/or Mikael (Vidstedt, I assume? :-).

regards,


Andrew Dinn
-----------


From felix.yang at huawei.com  Fri Jan 18 05:36:11 2019
From: felix.yang at huawei.com (Yangfei (Felix))
Date: Fri, 18 Jan 2019 05:36:11 +0000
Subject: [RFR] 8217359: C2 compiler triggers SIGSEGV after tranformation in
 ConvI2LNode::Ideal
Message-ID: <DA41BE1DDCA941489001C7FBD7A8820ED5F3F998@dggeml527-mbx.china.huawei.com>

Hi,

    Can someone help review this change to the C2 compiler? 

    Bug: https://bugs.openjdk.java.net/browse/JDK-8217359
    Webrev: http://cr.openjdk.java.net/~fyang/8217359/webrev.00/

    The bug triggers when C2 compiler does the following transformation in function ConvI2LNode::Ideal: 
    // Convert ConvI2L(AddI(x, y)) to AddL(ConvI2L(x), ConvI2L(y))
    ......
    395     Node* cx = phase->C->constrained_convI2L(phase, x, TypeInt::make(rxlo, rxhi, widen), NULL);
    396     Node* cy = phase->C->constrained_convI2L(phase, y, TypeInt::make(rylo, ryhi, widen), NULL);
    ....

    Here is the process of how it triggers:

// =========================================================
// Before line 395, x is an AddINode (id: 202). y is also an AddINode (id: 553) and x is a subtree of y.
// The ideal graph looks like:
//
//       ...  ...  ...  ...
//         \   |  /     |
//          86_Phi   33_ConI
//             |     /
//         \   |    /
//          202_AddI
//
//            ...  ... ... ...
//             |     \  |  /
//         27_ConI  202_AddI --------- (node x)
//             |    /
//        \    |   /
//         549_SubI
//             |    ...
//         \   |    /
//          553_AddI ---------- (node y)
//
// ==========================================================
// After line 395, x is converted to cx and cx is an AddLNode (id: 1274).
// At this point, everything looks fine.
//         ...     ...  ...
//           \     /    /
//      1271_ConvI2L  1273_ConL
//              |     /
//         \    |    /
//          1274_AddL
//
// ==========================================================
// In line 396, y will be converted to cy.  In this progress, y
// and its subnode will all be converted recursively.  This is
// a rather long progress.  The convertion of y is like this:
//
// Node 27_ConI will be converted to node 1278_ConL.
//
// Since x(202) is the input edge of node 549, it will be
// converted again.  And the result cx_2 is node 1282_AddL.
// The structure of cx_2 is the same as cx.  After GVN(hash_find_insert()),
// 1282_AddL is replaced with 1274_AddL.
//
// Then 549_SubI will be converted to 1283_SubL and the ideal graph looks like: 
//                    ...    ...  ...
//                      \    /    /
//                1271_ConvI2L  1273_ConL
//            ...         |     /
//             |     \    |    /
//         1278_ConL  1274_AddL
//             |     /
//        \    |    /
//         1283_SubL
//
// After that, C2 will do the following transformation to node 1283_SubL: 
//      x - (y + cons) ==> (x - y) - cons
//
// When this is done, node 1283_SubL is converted to node 1286_AddL: 
//                        ...      ...   ...
//                         |        |    /
//                   1278_ConL  1271_ConI2L
//                         |    /     ...
//                   \     |   /      /
//                    1284_SubL   1285_ConL
//                         |     /
//                    \    |    /
//                     1286_AddL
//
// Then in function subsume_node(), 1283_SubL is replaced with 1286_AddL. 
// During this progress, following operations will be carried out:
//  | In function subsume_node(), 1283_SubL will be regarded as a
//  | dead_node since it is replaced by 1286_AddL. The same inspection
//  | of dead node will be carried out to the subnodes of 1283
//  | recursively.  And remove_dead_node() function will be called
//  | by subsume_node() to replace the input edges of dead node to NULL.
//  | 1274_AddL is node cx. At this moment, 1274_AddL has only one
//  | output edge, that is 1283. Since 1283 is a dead node, 1274 will
//  | also be regarded as a dead node. Then input edges of 1274_AddL
//  | will be set to NULL. After that, cx will be an isolated node which
//  | has neither input edge nor output edge.
//
// ==========================================
// After all of this, program continues and cx->in(2) is used in addnode.cpp:163. 
// Since now cx has no input edges, the program crashes.

    The proposed solution is fairly straight-forward: 
    After the conversion of x, build a hook node add a use to cx to prevent it from dying. 
    When conversion of y is finished, this new output of cx is removed. 

    JTreg tested with both x86_64 fastdebug & release build.  Is it OK? 

Thanks,
Felix

From Nick.Gasson at arm.com  Fri Jan 18 08:40:25 2019
From: Nick.Gasson at arm.com (Nick Gasson (Arm Technology China))
Date: Fri, 18 Jan 2019 08:40:25 +0000
Subject: RFR: 8217368: AArch64: C2 recursive stack locking optimisation not
 triggered
Message-ID: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>

Hi,

While I was cleaning up the patch for 8216350 I noticed an issue in the 
implementation of recursive locking in aarch64_enc_fast_lock:

Bug: https://bugs.openjdk.java.net/browse/JDK-8217368
Webrev: http://cr.openjdk.java.net/~ngasson/8217368/webrev.0/

First we load the markOop of the object we want to lock and OR it with 
markOopDesc::unlocked_value (1). Then we do a CAS to exchange the 
address of the box on our thread's stack with the object's header word 
iff it's equal to the (markOop | 1) we just computed. If this fails, 
then we should check for a recursive lock by comparing

   (~(page size - 1) | 3) & (markOop - SP) == 0

Where "markOop" is the current object header word loaded by the failed 
CAS. This checks that the lock bits are zero (locked) and the stack 
address of the displaced header is within one page of the current SP. 
But on AArch64 we actually do this:

   (~(page size - 1) | 3) & ((old markOop | 1) - SP) == 0

Where "old markOop | 1" is the compare-to value used for the CAS. This 
is always false as the result has at least bit #0 set. This only affects 
C2, the C1_MacroAssembler version has the correct test.

The diff looks big but all it does is swap the usage of registers `tmp' 
and `disp_hdr' in the first section so the markOop loaded by the CAS 
ends up in disp_hdr and tmp holds the (markOop | 1) compare-to value.

Ran jtreg, plus jcstress with -XX:+UseLSE and -XX:-UseLSE. Also added 
another microbenchmark to 
micro/org/openjdk/bench/vm/lang/LockUnlock.java as I couldn't find an 
existing JMH case that triggered this.

Without patch:

Result 
"org.openjdk.bench.vm.lang.LockUnlock.testRecursiveSynchronizationNoBias":
  510.781 ?(99.9%) 1.196 ns/op [Average]
  (min, avg, max) = (508.769, 510.781, 513.854), stdev = 1.597
  CI (99.9%): [509.585, 511.977] (assumes normal distribution)

With patch:

Result 
"org.openjdk.bench.vm.lang.LockUnlock.testRecursiveSynchronizationNoBias": 

  197.038 ?(99.9%) 0.096 ns/op [Average]
  (min, avg, max) = (196.886, 197.038, 197.296), stdev = 0.128
  CI (99.9%): [196.942, 197.134] (assumes normal distribution)

Two other minor things:

* Does anyone know what the comment "// Load Compare Value application 
register." means? It's present in the PPC and S390 ports too.

* The x86 port #ifdef LP64 uses "7 - os::vm_page_size()" as the mask in 
the recursive lock test. I think the "7" here is 
markOopDesc::biased_lock_mask and is presumably there to prevent a 
silent mutual exclusion failure if a markOop with the bias locking bits 
set ends up the fast_lock path (although this should never happen). 
Should we change markOopDesc::lock_mask_in_place to 
markOopDesc::biased_lock_mask_in_place in the AArch64 port too?

Thanks,
Nick

From aph at redhat.com  Fri Jan 18 09:36:31 2019
From: aph at redhat.com (Andrew Haley)
Date: Fri, 18 Jan 2019 09:36:31 +0000
Subject: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
 locking optimisation not triggered
In-Reply-To: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
References: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
Message-ID: <f1d6de8b-a308-a119-57a5-c62fbacb4e88@redhat.com>

On 1/18/19 8:40 AM, Nick Gasson (Arm Technology China) wrote:
> Hi,
> 
> While I was cleaning up the patch for 8216350 I noticed an issue in the 
> implementation of recursive locking in aarch64_enc_fast_lock:
> 
> Bug: https://bugs.openjdk.java.net/browse/JDK-8217368
> Webrev: http://cr.openjdk.java.net/~ngasson/8217368/webrev.0/
> 
> First we load the markOop of the object we want to lock and OR it with 
> markOopDesc::unlocked_value (1). Then we do a CAS to exchange the 
> address of the box on our thread's stack with the object's header word 
> iff it's equal to the (markOop | 1) we just computed. If this fails, 
> then we should check for a recursive lock by comparing
> 
>    (~(page size - 1) | 3) & (markOop - SP) == 0
> 
> Where "markOop" is the current object header word loaded by the failed 
> CAS. This checks that the lock bits are zero (locked) and the stack 
> address of the displaced header is within one page of the current SP. 
> But on AArch64 we actually do this:
> 
>    (~(page size - 1) | 3) & ((old markOop | 1) - SP) == 0
> 
> Where "old markOop | 1" is the compare-to value used for the CAS. This 
> is always false as the result has at least bit #0 set. This only affects 
> C2, the C1_MacroAssembler version has the correct test.
> 
> The diff looks big but all it does is swap the usage of registers `tmp' 
> and `disp_hdr' in the first section so the markOop loaded by the CAS 
> ends up in disp_hdr and tmp holds the (markOop | 1) compare-to value.

The patch looks good. However, I don't understand why we aren't using
MacroAssembler::cmpxchgptr here. It looks like we should be, and you'd
end up with a less complex result.

> Two other minor things:
> 
> * Does anyone know what the comment "// Load Compare Value application 
> register." means? It's present in the PPC and S390 ports too.

Probably no-one can remember. We'll have inherited it from x86.

> * The x86 port #ifdef LP64 uses "7 - os::vm_page_size()" as the mask in 
> the recursive lock test. I think the "7" here is 
> markOopDesc::biased_lock_mask and is presumably there to prevent a 
> silent mutual exclusion failure if a markOop with the bias locking bits 
> set ends up the fast_lock path (although this should never happen). 
> Should we change markOopDesc::lock_mask_in_place to 
> markOopDesc::biased_lock_mask_in_place in the AArch64 port too?

I wouldn't think so. You're describing a change that by definition we
can't test.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From tobias.hartmann at oracle.com  Fri Jan 18 09:49:14 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Fri, 18 Jan 2019 10:49:14 +0100
Subject: RFR(S, tedious): 8217250: Optimize CodeHeap Analytics
In-Reply-To: <4e61a8a5-3c6e-3c4f-0a2c-68d4b8bc2f9f@oracle.com>
References: <D481567B-EA9D-4BD2-A55C-A0AF157F0EDD@sap.com>
 <c178e47c-279b-47f8-6876-2ad28593d630@oracle.com>
 <A2738EC3-3F36-4898-87F9-EC786EA19D46@sap.com>
 <2d7b7963-61be-95b8-017b-956f2752c8f3@oracle.com>
 <9AD26B9E-015A-4BD4-A44F-12DDE2793ED0@sap.com>
 <4e61a8a5-3c6e-3c4f-0a2c-68d4b8bc2f9f@oracle.com>
Message-ID: <52136751-929b-4976-477d-93282ce0a0d7@oracle.com>

Hi Lutz,

looks good to me too.

Best regards,
Tobias

On 17.01.19 19:39, Vladimir Kozlov wrote:
> Looks good
> 
> Thanks,
> Vladimir
> 
> On 1/17/19 7:47 AM, Schmidt, Lutz wrote:
>> Hi Vladimir & all,
>> there is a new webrev available: http://cr.openjdk.java.net/~lucy/webrevs/8217250.01/
>> What's new (in addition to some comments) is the macro
>>
>> ?? // Flush the buffer contents if the remaining capacity is less
>> ?? // than the calculated threshold (256 bytes + capacity/16)
>> ?? // That should suffice for all reasonably sized output lines.
>> ?? #define BUFFEREDSTREAM_FLUSH_AUTO(_termString)??????????????? \
>> ?????? BUFFEREDSTREAM_FLUSH_IF(_termString, 256+(_capacity>>4))
>>
>> It replaced the previous BUFFEREDSTREAM_FLUSH_IF("string", 512) occurrences.
>> Regards,
>> Lutz
>>
>> ?On 16.01.19, 22:53, "Vladimir Kozlov" <vladimir.kozlov at oracle.com> wrote:
>>
>> ???? On 1/16/19 12:37 PM, Schmidt, Lutz wrote:
>> ???? > Hi Vladimir,
>> ???? >
>> ???? > thanks a lot for looking at this so quickly.
>> ???? >
>> ???? > Sure, I could declare a specialized "BUFFEREDSTREAM_FLUSH_512" for this. The "512"
>> originated from the thought "its large enough for a well-behaved line and small enough to save
>> some flushes".
>> ???? >
>> ???? > I was also thinking about a "BUFFEREDSTREAM_FLUSH_AUTO", where the spare space is derived
>> from the buffer capacity, maybe something like 10 percent of the capacity, 256 bytes minimum. I
>> wasn't sure if that could be categorized as over-engineered.
>> ???? ???? Yes, I think BUFFEREDSTREAM_FLUSH_AUTO is better than fixed size.
>> ???? ???? Vladimir
>> ???? ???? >
>> ???? > Your thoughts?
>> ???? >
>> ???? > Thanks,
>> ???? > Lutz
>> ???? >
>> ???? > On 16.01.19, 19:10, "hotspot-compiler-dev on behalf of Vladimir Kozlov"
>> <hotspot-compiler-dev-bounces at openjdk.java.net on behalf of vladimir.kozlov at oracle.com> wrote:
>> ???? >
>> ???? >????? Hi Lutz,
>> ???? >
>> ???? >????? I see that you have only one usage in all cases for:
>> ???? >????? BUFFEREDSTREAM_FLUSH_IF("", 512)
>> ???? >
>> ???? >????? Can you simple declare simplified macro for this?
>> ???? >
>> ???? >????? Otherwise looks good.
>> ???? >
>> ???? >????? Thanks,
>> ???? >????? Vladimir
>> ???? >
>> ???? >????? On 1/16/19 6:52 AM, Schmidt, Lutz wrote:
>> ???? >????? > Dear all,
>> ???? >????? >
>> ???? >????? > may I please have reviews for this (semantically) small change. Its purpose is to
>> reduce the bufferedStream buffer flushes while printing CodeHeap Analytics.
>> ???? >????? >
>> ???? >????? > Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217250
>> ???? >????? > Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217250.00/
>> ???? >????? >
>> ???? >????? > Thank you!
>> ???? >????? > Lutz
>> ???? >????? >
>> ???? >????? >
>> ???? >
>> ???? >
>> ????

From Nick.Gasson at arm.com  Fri Jan 18 09:52:50 2019
From: Nick.Gasson at arm.com (Nick Gasson (Arm Technology China))
Date: Fri, 18 Jan 2019 09:52:50 +0000
Subject: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
 locking optimisation not triggered
In-Reply-To: <f1d6de8b-a308-a119-57a5-c62fbacb4e88@redhat.com>
References: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
 <f1d6de8b-a308-a119-57a5-c62fbacb4e88@redhat.com>
Message-ID: <4a09e8b7-9990-aa66-0afb-bf4e41cab831@arm.com>

Hi Andrew,

On 18/01/2019 17:36, Andrew Haley wrote:
> 
> The patch looks good. However, I don't understand why we aren't using
> MacroAssembler::cmpxchgptr here. It looks like we should be, and you'd
> end up with a less complex result.
> 

It's not exactly the same though: MacroAssembler::cmpxchgptr adds a "dmb 
ish" to the failure path which I don't think is required here.

>> * Does anyone know what the comment "// Load Compare Value application
>> register." means? It's present in the PPC and S390 ports too.
> 
> Probably no-one can remember. We'll have inherited it from x86.

Let's delete it then.

Thanks,
Nick

From aph at redhat.com  Fri Jan 18 09:54:52 2019
From: aph at redhat.com (Andrew Haley)
Date: Fri, 18 Jan 2019 09:54:52 +0000
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <CAEgw74DR0mw1avCmyG+k=q=X9gGCsJYSQtp+Tki4nUfcsMKF0Q@mail.gmail.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
 <ba49e1b8-b7c8-b804-f973-6e88cb2bcd8a@redhat.com>
 <CAEgw74DXHF4cmVzOkHtvGuO937QqaypRFNvuB5sfDw3h8HOUPQ@mail.gmail.com>
 <e5697cd2-49d6-724e-1b63-41de0e771965@redhat.com>
 <b16d13b4-40f4-1503-765a-82799acf1527@redhat.com>
 <a4e64632-1187-1b3a-8c88-ac4b039acc71@redhat.com>
 <81dc5407-4985-883c-83fc-2e1fa2b77e66@redhat.com>
 <CAEgw74ANZ2CKpd=ivKRn5mE61EG_YqpfBbPoMpV=e=3G_ABYpw@mail.gmail.com>
 <c64262ae-8d2f-f773-3688-f1c40ad23f10@redhat.com>
 <CAEgw74DR0mw1avCmyG+k=q=X9gGCsJYSQtp+Tki4nUfcsMKF0Q@mail.gmail.com>
Message-ID: <3dd85d2c-f4d8-e360-21a2-68254b3c5e2b@redhat.com>

On 1/17/19 7:51 PM, B. Blaser wrote:
> Here it is on intel xeon with 5*10e9 iterations:
> * mov+cmov = 10.94s
> * cmov = 10.15s
> 
> Thoughts?

It looks like there's not much of a performance difference, but it might
help by freeing a register. OTOH, we'd still need to be sure we weren't
introducing a regression. We'd have to make sure that implicit null checks
work.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From tobias.hartmann at oracle.com  Fri Jan 18 10:23:23 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Fri, 18 Jan 2019 11:23:23 +0100
Subject: [RFR] 8217359: C2 compiler triggers SIGSEGV after tranformation
 in ConvI2LNode::Ideal
In-Reply-To: <DA41BE1DDCA941489001C7FBD7A8820ED5F3F998@dggeml527-mbx.china.huawei.com>
References: <DA41BE1DDCA941489001C7FBD7A8820ED5F3F998@dggeml527-mbx.china.huawei.com>
Message-ID: <32982b31-3a91-58fb-a6b8-b1cd9f7cdb41@oracle.com>

Hi Felix,

Could you please add the regression test as jtreg test?

Otherwise, the fix looks reasonable to me. Nice analysis!

Thanks,
Tobias


On 18.01.19 06:36, Yangfei (Felix) wrote:
> Hi,
> 
>     Can someone help review this change to the C2 compiler? 
> 
>     Bug: https://bugs.openjdk.java.net/browse/JDK-8217359
>     Webrev: http://cr.openjdk.java.net/~fyang/8217359/webrev.00/
> 
>     The bug triggers when C2 compiler does the following transformation in function ConvI2LNode::Ideal: 
>     // Convert ConvI2L(AddI(x, y)) to AddL(ConvI2L(x), ConvI2L(y))
>     ......
>     395     Node* cx = phase->C->constrained_convI2L(phase, x, TypeInt::make(rxlo, rxhi, widen), NULL);
>     396     Node* cy = phase->C->constrained_convI2L(phase, y, TypeInt::make(rylo, ryhi, widen), NULL);
>     ....
> 
>     Here is the process of how it triggers:
> 
> // =========================================================
> // Before line 395, x is an AddINode (id: 202). y is also an AddINode (id: 553) and x is a subtree of y.
> // The ideal graph looks like:
> //
> //       ...  ...  ...  ...
> //         \   |  /     |
> //          86_Phi   33_ConI
> //             |     /
> //         \   |    /
> //          202_AddI
> //
> //            ...  ... ... ...
> //             |     \  |  /
> //         27_ConI  202_AddI --------- (node x)
> //             |    /
> //        \    |   /
> //         549_SubI
> //             |    ...
> //         \   |    /
> //          553_AddI ---------- (node y)
> //
> // ==========================================================
> // After line 395, x is converted to cx and cx is an AddLNode (id: 1274).
> // At this point, everything looks fine.
> //         ...     ...  ...
> //           \     /    /
> //      1271_ConvI2L  1273_ConL
> //              |     /
> //         \    |    /
> //          1274_AddL
> //
> // ==========================================================
> // In line 396, y will be converted to cy.  In this progress, y
> // and its subnode will all be converted recursively.  This is
> // a rather long progress.  The convertion of y is like this:
> //
> // Node 27_ConI will be converted to node 1278_ConL.
> //
> // Since x(202) is the input edge of node 549, it will be
> // converted again.  And the result cx_2 is node 1282_AddL.
> // The structure of cx_2 is the same as cx.  After GVN(hash_find_insert()),
> // 1282_AddL is replaced with 1274_AddL.
> //
> // Then 549_SubI will be converted to 1283_SubL and the ideal graph looks like: 
> //                    ...    ...  ...
> //                      \    /    /
> //                1271_ConvI2L  1273_ConL
> //            ...         |     /
> //             |     \    |    /
> //         1278_ConL  1274_AddL
> //             |     /
> //        \    |    /
> //         1283_SubL
> //
> // After that, C2 will do the following transformation to node 1283_SubL: 
> //      x - (y + cons) ==> (x - y) - cons
> //
> // When this is done, node 1283_SubL is converted to node 1286_AddL: 
> //                        ...      ...   ...
> //                         |        |    /
> //                   1278_ConL  1271_ConI2L
> //                         |    /     ...
> //                   \     |   /      /
> //                    1284_SubL   1285_ConL
> //                         |     /
> //                    \    |    /
> //                     1286_AddL
> //
> // Then in function subsume_node(), 1283_SubL is replaced with 1286_AddL. 
> // During this progress, following operations will be carried out:
> //  | In function subsume_node(), 1283_SubL will be regarded as a
> //  | dead_node since it is replaced by 1286_AddL. The same inspection
> //  | of dead node will be carried out to the subnodes of 1283
> //  | recursively.  And remove_dead_node() function will be called
> //  | by subsume_node() to replace the input edges of dead node to NULL.
> //  | 1274_AddL is node cx. At this moment, 1274_AddL has only one
> //  | output edge, that is 1283. Since 1283 is a dead node, 1274 will
> //  | also be regarded as a dead node. Then input edges of 1274_AddL
> //  | will be set to NULL. After that, cx will be an isolated node which
> //  | has neither input edge nor output edge.
> //
> // ==========================================
> // After all of this, program continues and cx->in(2) is used in addnode.cpp:163. 
> // Since now cx has no input edges, the program crashes.
> 
>     The proposed solution is fairly straight-forward: 
>     After the conversion of x, build a hook node add a use to cx to prevent it from dying. 
>     When conversion of y is finished, this new output of cx is removed. 
> 
>     JTreg tested with both x86_64 fastdebug & release build.  Is it OK? 
> 
> Thanks,
> Felix
> 

From goetz.lindenmaier at sap.com  Fri Jan 18 11:03:15 2019
From: goetz.lindenmaier at sap.com (Lindenmaier, Goetz)
Date: Fri, 18 Jan 2019 11:03:15 +0000
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used
 by interpreter and be faster for short arrays
In-Reply-To: <AM6PR02MB47887C5C3082A52540C5AB8B9A830@AM6PR02MB4788.eurprd02.prod.outlook.com>
References: <1c4646d554954551b73c077fa40f983d@sap.com>
 <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>
 <9863276de30643338249ead2a6ac7fe9@sap.com>
 <452dcb69-189e-700d-5995-582ba13669b9@linux.vnet.ibm.com>
 <37d9e6f3b2b4400d8963f54d2fe7767f@sap.com>
 <406db3e3-2ac3-dc16-f384-99a314e62a42@linux.vnet.ibm.com>
 <beff3d359c954a29962be71c40bc235b@sap.com>
 <180a6c0b-7abe-9d5c-51e6-dffbb23570d3@linux.vnet.ibm.com>
 <d898c13929a44afb82d477fd732d23e7@sap.com>
 <301fd43a-e5b5-d970-7a1a-2458dbaeec36@linux.vnet.ibm.com>
 <AM6PR02MB47887C5C3082A52540C5AB8B9A830@AM6PR02MB4788.eurprd02.prod.outlook.com>
Message-ID: <8b1ca2bdba334f42a3c2b044a557dd8c@sap.com>

Hi Martin, 

I had a look at your change. 
Overall looks good. According to Gustavos mail a nice improvement!

I think though that the way to select the algorithm is quite
messy:
In templateInterpreter vpmsumb is checked and the methods are
called directly.
In stubGenerator, generate_CRC32...()
  vpmsumb is tested to decide on vector_constants = R2.
  and generic generate_CRC_updateBytes is called, which 
again checks whether verctor_constants == R2.

I think generate_CRC_updateBytes() or some other generic
function should be located in macroAssembler_ppc and
be called from both locations.

What do you think?

Best regards,
  Goetz


> -----Original Message-----
> From: Doerr, Martin
> Sent: Donnerstag, 17. Januar 2019 14:18
> To: Gustavo Romero <gromero at linux.vnet.ibm.com>; 'hotspot-compiler-
> dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>;
> Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
> Subject: RE: RFR(M): 8216060: [PPC64] Vector CRC implementation should be
> used by interpreter and be faster for short arrays
> 
> Hi,
> 
> the rebased webrev.01 applies on jdk/jdk, now (after JDK-8216376). So the
> issue Gustavo had observed does not longer exist.
> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/
> 
> I have updated copyrights and retested it.
> 
> Best regards,
> Martin
> 
> 
> -----Original Message-----
> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> Sent: Montag, 7. Januar 2019 14:52
> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be
> used by interpreter and be faster for short arrays
> 
> Hi Martin,
> 
> On 01/07/2019 11:49 AM, Doerr, Martin wrote:
> > I want to check all places where we use "mr(R1_SP, R21_sender_SP)".
> There may be more issues with that. I'll probably handle that in a separate
> change and push this CRC change afterwards.
> 
> I see. Thanks for letting me know.
> 
> Best regards,
> Gustavo
> 
> > Best regards,
> > Martin
> >
> >
> > -----Original Message-----
> > From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> > Sent: Freitag, 4. Januar 2019 19:55
> > To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> > Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should
> be used by interpreter and be faster for short arrays
> >
> > Hi Martin,
> >
> > On 01/04/2019 02:13 PM, Doerr, Martin wrote:
> >> Hi Gustavo,
> >>
> >> when called from the interpreter (the scenario you observed), R21 is set
> before resizing the frame to avoid wasted stack space
> (InterpreterMacroAssembler::call_from_interpreter).
> >
> > Got it. Thanks a lot for the explanations.
> >
> > I think it doesn't currently matter in practice, but I'm wondering if to be
> > consistent we should cut back the stack back earlier also in
> > TemplateInterpreterGenerator::generate_CRC32_update_entry()?
> >
> > diff -r a35f8c35d8c9
> src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
> > --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan
> 04 10:09:00 2019 +0100
> > +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan
> 04 13:44:37 2019 -0500
> > @@ -1840,11 +1840,12 @@
> >    #endif
> >        __ lwz(crc,  2*wordSize, argP);    // Current crc state, zero extend to 64
> bit to have a clean register.
> >
> > +    // Restore caller sp for c2i case and return.
> > +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller
> started.
> > +
> >        StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
> >        __ kernel_crc32_singleByte(crc, data, dataLen, table, tmp, true);
> >
> > -    // Restore caller sp for c2i case and return.
> > -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller
> started.
> >        __ blr();
> >
> >        // Generate a vanilla native entry as the slow path.
> >
> > Currently there is no issue probably because generated code is simpler and
> does
> > no spills.
> >
> > Best regards,
> > Gustavo
> >
> >> When called from compiled methods, R21 is set by a c2i adapter which
> extends the compiled frame by space for arguments (gen_c2i_adapter).
> >>
> >> "mr(R1_SP, R21_sender_SP)" is more error-prone than
> "resize_frame_absolute" so I think the latter would be better (though it takes
> more registers and instructions), but I don't want to replace that as part of
> this CRC change.
> >>
> >> Best regards,
> >> Martin
> >>
> >>
> >> -----Original Message-----
> >> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> >> Sent: Freitag, 4. Januar 2019 14:44
> >> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> >> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should
> be used by interpreter and be faster for short arrays
> >>
> >> Hi Martin,
> >>
> >> On 01/04/2019 07:30 AM, Doerr, Martin wrote:
> >>> thank you very much for confirming. This makes sense. We use different
> frame headers depending on whether the frame is the top Java frame or not
> (and on whether it's a debug build or not). Setting R1_SP to sender_SP is a
> shortcut for leaf calls which relies on having an unmodified stack until this
> point. So the patch fixes the issue.
> >>
> >> Glad to help! Thanks for the additional information, I was not aware that
> the
> >> selection of different frame headers could be done at compile time. One
> last
> >> question only for my education: what exactly advanced (incremented)
> R1_SP so it
> >> has to be cut back using sender_SP value, i.e. sender_SP tracks the frame
> for
> >> which function exactly or "who" is the caller exactly here?
> >>
> >> Thank you.
> >>
> >> Best regards,
> >> Gustavo
> >>
> >>> New webrev:
> >>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/
> >>>
> >>> Best regards,
> >>> Martin
> >>>
> >>>
> >>> -----Original Message-----
> >>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> >>> Sent: Donnerstag, 3. Januar 2019 19:36
> >>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> >>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation
> should be used by interpreter and be faster for short arrays
> >>>
> >>> Hi Martin,
> >>>
> >>> On 01/03/2019 03:34 PM, Doerr, Martin wrote:
> >>>> Unfortunately, I can't reproduce the crash. TestCRC32C works stable on
> our machine (with fastdbg build).
> >>>> I guess that the frameless spills mess up the stack. Can you check if the
> patch below helps?
> >>>
> >>> Thanks for providing a fix so I can try it.
> >>> Yes, I confirm the patch below indeed fixes the sigsegv crash when
> CRC32C update() method is used.
> >>> I also confirm that I don't observe the crash on the fastdebug build, only
> on the release build.
> >>> It also only affects the Interpreter mode, so passing -Xcomp avoids the
> crash on the release build.
> >>>
> >>> Just as reference, I can reproduce it on the release build with the
> following trivial code:
> >>>
> >>> import java.util.zip.CRC32C;
> >>>
> >>> class CRC32C_v1 {
> >>>       public static void main(String[] arg) {
> >>>         byte[] b = new byte[1024];
> >>>
> >>>         CRC32C crc32c = new CRC32C();
> >>>         crc32c.update(b, 0, b.length);
> >>>
> >>>         System.out.println(crc32c.getValue());
> >>>       }
> >>> }
> >>>
> >>> Thanks for fixing the typos.
> >>>
> >>>
> >>> Best regards,
> >>> Gustavo
> >>>
> >>>> Best regards,
> >>>> Martin
> >>>>
> >>>>
> >>>> diff -r a33f49d5998c
> src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
> >>>> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu
> Jan 03 17:30:03 2019 +0100
> >>>> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
> Thu Jan 03 18:33:16 2019 +0100
> >>>> @@ -1924,6 +1924,9 @@
> >>>>            __ addi(data, data,
> arrayOopDesc::base_offset_in_bytes(T_BYTE));
> >>>>          }
> >>>>
> >>>> +    // Restore caller sp for c2i case.
> >>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the
> caller started.
> >>>> +
> >>>>          StubRoutines::ppc64::generate_load_crc_table_addr(_masm,
> table);
> >>>>
> >>>>          if (!VM_Version::has_vpmsumb()) {
> >>>> @@ -1933,8 +1936,6 @@
> >>>>            __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3,
> tc0, tc1, tc2, true);
> >>>>          }
> >>>>
> >>>> -    // Restore caller sp for c2i case and return.
> >>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the
> caller started.
> >>>>          __ blr();
> >>>>
> >>>>          // Generate a vanilla native entry as the slow path.
> >>>> @@ -2014,6 +2015,9 @@
> >>>>            __ addi(data, data,
> arrayOopDesc::base_offset_in_bytes(T_BYTE));
> >>>>          }
> >>>>
> >>>> +    // Restore caller sp for c2i case.
> >>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the
> caller started.
> >>>> +
> >>>>          StubRoutines::ppc64::generate_load_crc32c_table_addr(_masm,
> table);
> >>>>
> >>>>          if (!VM_Version::has_vpmsumb()) {
> >>>> @@ -2023,8 +2027,6 @@
> >>>>            __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3,
> tc0, tc1, tc2, false);
> >>>>          }
> >>>>
> >>>> -    // Restore caller sp for c2i case and return.
> >>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the
> caller started.
> >>>>          __ blr();
> >>>>
> >>>>          BLOCK_COMMENT("} CRC32C_update{Bytes|DirectByteBuffer}");
> >>>>
> >>>>
> >>>> -----Original Message-----
> >>>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> >>>> Sent: Donnerstag, 3. Januar 2019 17:13
> >>>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> >>>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation
> should be used by interpreter and be faster for short arrays
> >>>>
> >>>> Hi Martin,
> >>>>
> >>>> oh that's nice. You removed the 512-byte block constraint and also
> wired it up to the Interpreter :)
> >>>>
> >>>> For the worst case, unaligned 512 byte array, I see the gap to aligned
> 512 byte array reduced by about ~5.7x.
> >>>>
> >>>> On the Interpreter I see an improvement of at least 50% for 1024 bytes.
> >>>>
> >>>> This is all for the CRC32 class.
> >>>>
> >>>> On CRC32C I'm getting a SIGSEV that can be reproduced running against
> ./test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java.
> >>>>
> >>>> I've upload a full log into
> http://cr.openjdk.java.net/~gromero/logs/crc32c_sigsegv/
> >>>>
> >>>> I'm leaving for the lunch and I'll take a closer look when back. But
> probably you will figure it out before I sit to appreciate the meal :)
> >>>>
> >>>> Finally, since the change does some cleanup, I wonder if it would be
> worth fixing the following typos:
> >>>>
> >>>> I think it's Barrett const., not Barret. Probably 'barret' is used in the
> code as a short version
> >>>> for Barrett but it should be changed in
> >>>>
> >>>> +  // Point to Barret constants
> >>>> +  add_const_optimized(cur_const, constants, outer_consts_size +
> inner_consts_size);
> >>>> +
> >>>>
> >>>> ?
> >>>>
> >>>> s/not/note/ in:
> >>>> cpu/ppc/macroAssembler_ppc.cpp:3977:// A not on the lookup table
> address(es):
> >>>>
> >>>> d/lives/ in:
> >>>> cpu/ppc/macroAssembler_ppc.cpp:4265:  mtvrwz(VCRC, crc); // crc
> lives lives in VCRC, now
> >>>>
> >>>> Best regards,
> >>>> Gustavo
> >>>>
> >>>> On 01/03/2019 12:17 PM, Doerr, Martin wrote:
> >>>>> Hi,
> >>>>>
> >>>>> the JVM on PPC64 currently misses usage of the fast vector
> implementation in the interpreter code.
> >>>>>
> >>>>> In addition, performance is not good for short arrays (unaligned 512
> byte arrays or shorter arrays) because the current vector implementation
> needs at least 512 bytes.
> >>>>>
> >>>>> Bug:
> >>>>>
> >>>>> https://bugs.openjdk.java.net/browse/JDK-8216060
> >>>>>
> >>>>> I have addressed these 2 issues + some cleanup with the following
> webrev:
> >>>>>
> >>>>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/
> <http://cr.openjdk.java.net/%7Emdoerr/8216060_PPC64_CRC/webrev.00/>
> >>>>>
> >>>>> Please review.
> >>>>>
> >>>>> Best regards,
> >>>>>
> >>>>> Martin
> >>>>>
> >>>>
> >>>
> >>
> >
> 


From Alan.Bateman at oracle.com  Fri Jan 18 13:32:37 2019
From: Alan.Bateman at oracle.com (Alan Bateman)
Date: Fri, 18 Jan 2019 13:32:37 +0000
Subject: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over
 non-volatile memory
In-Reply-To: <21ef0e11-3f3d-e9a4-5dc6-898d4ac18efa@redhat.com>
References: <07d4d0fe-3a2a-414c-74ac-69e8527785d9@redhat.com>
 <c4fdca2f-7c6a-7429-6b1a-efdff2994bf6@oracle.com>
 <3df3b5cd-dbc7-7fdd-bfc5-2a54d11127da@redhat.com>
 <27c9458d-7257-378a-4e3a-bd03402794be@oracle.com>
 <df03ca64-d68a-ca8b-5309-395672ac7dcb@redhat.com>
 <09c92b1a-c6da-e16b-bb68-553876e8a6ea@oracle.com>
 <a68f718e-9f87-f7e5-515c-f63d827fedaa@redhat.com>
 <ed8fd92d-fd4e-dc30-af8c-8d30da37560c@oracle.com>
 <b19ba189-0141-3e25-8301-48f2d3ce3851@redhat.com>
 <b3f402a3-65ba-b776-cc9e-40ceaaeafefd@redhat.com>
 <f21dac0b-0043-7e66-72a4-c64a45fb59d3@redhat.com>
 <2a0a385d-81ee-df66-f147-e4dd9aa5b72e@oracle.com>
 <8b2ab749-20f1-8dd3-3cc7-64db5d45bc7d@redhat.com>
 <0aae37aa-7797-fde5-63d5-96c8eb961183@oracle.com>
 <86a1988a-a8d2-b6af-0985-11a94d6d76a5@redhat.com>
 <69510788-52e6-815b-1ed7-a6f4886d0398@oracle.com>
 <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>
 <34cfc530-8517-ac1a-0c04-446dc3dc2436@oracle.com>
 <21ef0e11-3f3d-e9a4-5dc6-898d4ac18efa@redhat.com>
Message-ID: <a377e3da-549a-1ee1-a416-ef243545f92b@oracle.com>

On 17/01/2019 14:27, Andrew Dinn wrote:
> :
>> Vladimir and I have reviewed the JEP, it will need an area lead to
>> endorse, I think it can be Brian or Mikael in this case.
> Ok, thanks for the above answers. Looking forward to hearing further
> from Brian and/or Mikael (Vidstedt, I assume? :-).
I had a brief discussion with Brian about this yesterday. He brought up 
the same concern about using MBB as it's not the right API for this in 
the longer term.? So this JEP is very much about a short term/tactical 
solution as we've already concluded here. This leads to the question as 
to whether this JEP needs to evolve the standard/Java SE API or not. 
It's convenient for the implementation of course but we should at least 
explore doing this as a JDK-specific feature.

To that end, one approach to explore is allowing the FC.map method 
accept map modes beyond those defined by MapMode. There is precedence 
for extensibility in this area already, e.g. FC.open allows you to 
specify options beyond the standard options specified by the method. It 
would require MapMode to define a protected constructor and would 
require a bit of plumbing to support MapMode defined in a JDK-specific 
module but there are examples to point to. Another approach is aanother 
class in a JDK-specific module to define the map method. It would 
require the same plumbing under the covers but would avoid touch the FC 
spec.

-Alan


From rkennke at redhat.com  Fri Jan 18 13:37:46 2019
From: rkennke at redhat.com (Roman Kennke)
Date: Fri, 18 Jan 2019 14:37:46 +0100
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <3dd85d2c-f4d8-e360-21a2-68254b3c5e2b@redhat.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
 <ba49e1b8-b7c8-b804-f973-6e88cb2bcd8a@redhat.com>
 <CAEgw74DXHF4cmVzOkHtvGuO937QqaypRFNvuB5sfDw3h8HOUPQ@mail.gmail.com>
 <e5697cd2-49d6-724e-1b63-41de0e771965@redhat.com>
 <b16d13b4-40f4-1503-765a-82799acf1527@redhat.com>
 <a4e64632-1187-1b3a-8c88-ac4b039acc71@redhat.com>
 <81dc5407-4985-883c-83fc-2e1fa2b77e66@redhat.com>
 <CAEgw74ANZ2CKpd=ivKRn5mE61EG_YqpfBbPoMpV=e=3G_ABYpw@mail.gmail.com>
 <c64262ae-8d2f-f773-3688-f1c40ad23f10@redhat.com>
 <CAEgw74DR0mw1avCmyG+k=q=X9gGCsJYSQtp+Tki4nUfcsMKF0Q@mail.gmail.com>
 <3dd85d2c-f4d8-e360-21a2-68254b3c5e2b@redhat.com>
Message-ID: <2f209ec9-e7f9-8da3-64a2-20ac909b4931@redhat.com>


> On 1/17/19 7:51 PM, B. Blaser wrote:
>> Here it is on intel xeon with 5*10e9 iterations:
>> * mov+cmov = 10.94s
>> * cmov = 10.15s
>>
>> Thoughts?
> 
> It looks like there's not much of a performance difference, but it might
> help by freeing a register. OTOH, we'd still need to be sure we weren't
> introducing a regression. We'd have to make sure that implicit null checks
> work.

I'm pretty sure that null-checks work, in general. I used the cmov
instructions in an experiment that I did with Shenandoah barriers of
which I'm pretty sure would have blown up badly if it wouldn't. One
thing I'm not sure of is: does cmov generate a SIGSEGV on a bad address,
even if the condition is not true? I doubt it, because then we couldn't
use this for other types (long, int, etc).

I'm more worried about the bottom-type issue that is mentioned in the
comment and by Andrew Dinn, and it would be very helpful if anybody
knows about it and could clarify. Failing that we could dig deeper
and/or do extensive testing?

Roman

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190118/cac2173b/signature.asc>

From peter.levart at gmail.com  Fri Jan 18 14:11:57 2019
From: peter.levart at gmail.com (Peter Levart)
Date: Fri, 18 Jan 2019 15:11:57 +0100
Subject: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over
 non-volatile memory
In-Reply-To: <a377e3da-549a-1ee1-a416-ef243545f92b@oracle.com>
References: <07d4d0fe-3a2a-414c-74ac-69e8527785d9@redhat.com>
 <3df3b5cd-dbc7-7fdd-bfc5-2a54d11127da@redhat.com>
 <27c9458d-7257-378a-4e3a-bd03402794be@oracle.com>
 <df03ca64-d68a-ca8b-5309-395672ac7dcb@redhat.com>
 <09c92b1a-c6da-e16b-bb68-553876e8a6ea@oracle.com>
 <a68f718e-9f87-f7e5-515c-f63d827fedaa@redhat.com>
 <ed8fd92d-fd4e-dc30-af8c-8d30da37560c@oracle.com>
 <b19ba189-0141-3e25-8301-48f2d3ce3851@redhat.com>
 <b3f402a3-65ba-b776-cc9e-40ceaaeafefd@redhat.com>
 <f21dac0b-0043-7e66-72a4-c64a45fb59d3@redhat.com>
 <2a0a385d-81ee-df66-f147-e4dd9aa5b72e@oracle.com>
 <8b2ab749-20f1-8dd3-3cc7-64db5d45bc7d@redhat.com>
 <0aae37aa-7797-fde5-63d5-96c8eb961183@oracle.com>
 <86a1988a-a8d2-b6af-0985-11a94d6d76a5@redhat.com>
 <69510788-52e6-815b-1ed7-a6f4886d0398@oracle.com>
 <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>
 <34cfc530-8517-ac1a-0c04-446dc3dc2436@oracle.com>
 <21ef0e11-3f3d-e9a4-5dc6-898d4ac18efa@redhat.com>
 <a377e3da-549a-1ee1-a416-ef243545f92b@oracle.com>
Message-ID: <aae5418e-388c-eb8e-6b7d-f9a513219a75@gmail.com>

Hi Alan,

On 1/18/19 2:32 PM, Alan Bateman wrote:
> On 17/01/2019 14:27, Andrew Dinn wrote:
>> :
>>> Vladimir and I have reviewed the JEP, it will need an area lead to
>>> endorse, I think it can be Brian or Mikael in this case.
>> Ok, thanks for the above answers. Looking forward to hearing further
>> from Brian and/or Mikael (Vidstedt, I assume? :-).
> I had a brief discussion with Brian about this yesterday. He brought 
> up the same concern about using MBB as it's not the right API for this 
> in the longer term.? So this JEP is very much about a short 
> term/tactical solution as we've already concluded here. This leads to 
> the question as to whether this JEP needs to evolve the standard/Java 
> SE API or not. It's convenient for the implementation of course but we 
> should at least explore doing this as a JDK-specific feature.
>
> To that end, one approach to explore is allowing the FC.map method 
> accept map modes beyond those defined by MapMode. There is precedence 
> for extensibility in this area already, e.g. FC.open allows you to 
> specify options beyond the standard options specified by the method. 
> It would require MapMode to define a protected constructor and would 
> require a bit of plumbing to support MapMode defined in a JDK-specific 
> module but there are examples to point to.

You meant package-private constructor, right? Protected constructor 
would allow subclassing MapMode by arbitrary user class which is not 
what would be desirable. So perhaps all that is needed is to declare the 
static final field in the MapMode class as package-private. That would 
allow referenceing it in the java.nio.channels package. Then add 
SharedSecrets mechanism to expose it's value to other needed java.base 
packages and to the additional module that would expose it publicly...

Regards, Peter


From peter.levart at gmail.com  Fri Jan 18 14:28:34 2019
From: peter.levart at gmail.com (Peter Levart)
Date: Fri, 18 Jan 2019 15:28:34 +0100
Subject: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over
 non-volatile memory
In-Reply-To: <aae5418e-388c-eb8e-6b7d-f9a513219a75@gmail.com>
References: <07d4d0fe-3a2a-414c-74ac-69e8527785d9@redhat.com>
 <27c9458d-7257-378a-4e3a-bd03402794be@oracle.com>
 <df03ca64-d68a-ca8b-5309-395672ac7dcb@redhat.com>
 <09c92b1a-c6da-e16b-bb68-553876e8a6ea@oracle.com>
 <a68f718e-9f87-f7e5-515c-f63d827fedaa@redhat.com>
 <ed8fd92d-fd4e-dc30-af8c-8d30da37560c@oracle.com>
 <b19ba189-0141-3e25-8301-48f2d3ce3851@redhat.com>
 <b3f402a3-65ba-b776-cc9e-40ceaaeafefd@redhat.com>
 <f21dac0b-0043-7e66-72a4-c64a45fb59d3@redhat.com>
 <2a0a385d-81ee-df66-f147-e4dd9aa5b72e@oracle.com>
 <8b2ab749-20f1-8dd3-3cc7-64db5d45bc7d@redhat.com>
 <0aae37aa-7797-fde5-63d5-96c8eb961183@oracle.com>
 <86a1988a-a8d2-b6af-0985-11a94d6d76a5@redhat.com>
 <69510788-52e6-815b-1ed7-a6f4886d0398@oracle.com>
 <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>
 <34cfc530-8517-ac1a-0c04-446dc3dc2436@oracle.com>
 <21ef0e11-3f3d-e9a4-5dc6-898d4ac18efa@redhat.com>
 <a377e3da-549a-1ee1-a416-ef243545f92b@oracle.com>
 <aae5418e-388c-eb8e-6b7d-f9a513219a75@gmail.com>
Message-ID: <708555d0-d3e5-2d2c-f69d-16f76a83f66a@gmail.com>


On 1/18/19 3:11 PM, Peter Levart wrote:
> You meant package-private constructor, right? Protected constructor 
> would allow subclassing MapMode by arbitrary user class which is not 
> what would be desirable.

...unless you actually want users to construct their own MapMode(s), 
like you mentioned is the case with FileChannel.open() and FileAttribute 
interface. But there this makes sense because the backend (FileSystem) 
is also pluggable, so users can define their own FileSystem 
implementations that consume their own FileAttribute(s)...

Are you proposing to add an spi for MappedByteBuffer's here? That would 
be an overkill for this feature, I think...

Regards, Peter


From martin.doerr at sap.com  Fri Jan 18 14:32:45 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Fri, 18 Jan 2019 14:32:45 +0000
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used
 by interpreter and be faster for short arrays
In-Reply-To: <8b1ca2bdba334f42a3c2b044a557dd8c@sap.com>
References: <1c4646d554954551b73c077fa40f983d@sap.com>
 <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>
 <9863276de30643338249ead2a6ac7fe9@sap.com>
 <452dcb69-189e-700d-5995-582ba13669b9@linux.vnet.ibm.com>
 <37d9e6f3b2b4400d8963f54d2fe7767f@sap.com>
 <406db3e3-2ac3-dc16-f384-99a314e62a42@linux.vnet.ibm.com>
 <beff3d359c954a29962be71c40bc235b@sap.com>
 <180a6c0b-7abe-9d5c-51e6-dffbb23570d3@linux.vnet.ibm.com>
 <d898c13929a44afb82d477fd732d23e7@sap.com>
 <301fd43a-e5b5-d970-7a1a-2458dbaeec36@linux.vnet.ibm.com>
 <AM6PR02MB47887C5C3082A52540C5AB8B9A830@AM6PR02MB4788.eurprd02.prod.outlook.com>
 <8b1ca2bdba334f42a3c2b044a557dd8c@sap.com>
Message-ID: <AM6PR02MB478818B58E268930D16AEDE59A9C0@AM6PR02MB4788.eurprd02.prod.outlook.com>

Hi G?tz,

that's a good proposal. I've moved the common functionality into macroAssembler_ppc. This makes interpreter and stubGenerator code shorter.

I've also moved the vector constants computation to stubGenerator such that we only do it when the intrinsics are enabled and the vector version is supported by the processor.

New webrev:
http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.02/

@Gustavo: Thanks for testing and confirming the issue (JDK-8216376) is fixed.

Best regards,
Martin


-----Original Message-----
From: Lindenmaier, Goetz 
Sent: Freitag, 18. Januar 2019 12:03
To: Doerr, Martin <martin.doerr at sap.com>; Gustavo Romero <gromero at linux.vnet.ibm.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
Subject: RE: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used by interpreter and be faster for short arrays

Hi Martin, 

I had a look at your change. 
Overall looks good. According to Gustavos mail a nice improvement!

I think though that the way to select the algorithm is quite
messy:
In templateInterpreter vpmsumb is checked and the methods are
called directly.
In stubGenerator, generate_CRC32...()
  vpmsumb is tested to decide on vector_constants = R2.
  and generic generate_CRC_updateBytes is called, which 
again checks whether verctor_constants == R2.

I think generate_CRC_updateBytes() or some other generic
function should be located in macroAssembler_ppc and
be called from both locations.

What do you think?

Best regards,
  Goetz


> -----Original Message-----
> From: Doerr, Martin
> Sent: Donnerstag, 17. Januar 2019 14:18
> To: Gustavo Romero <gromero at linux.vnet.ibm.com>; 'hotspot-compiler-
> dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>;
> Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
> Subject: RE: RFR(M): 8216060: [PPC64] Vector CRC implementation should be
> used by interpreter and be faster for short arrays
> 
> Hi,
> 
> the rebased webrev.01 applies on jdk/jdk, now (after JDK-8216376). So the
> issue Gustavo had observed does not longer exist.
> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/
> 
> I have updated copyrights and retested it.
> 
> Best regards,
> Martin
> 
> 
> -----Original Message-----
> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> Sent: Montag, 7. Januar 2019 14:52
> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should be
> used by interpreter and be faster for short arrays
> 
> Hi Martin,
> 
> On 01/07/2019 11:49 AM, Doerr, Martin wrote:
> > I want to check all places where we use "mr(R1_SP, R21_sender_SP)".
> There may be more issues with that. I'll probably handle that in a separate
> change and push this CRC change afterwards.
> 
> I see. Thanks for letting me know.
> 
> Best regards,
> Gustavo
> 
> > Best regards,
> > Martin
> >
> >
> > -----Original Message-----
> > From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> > Sent: Freitag, 4. Januar 2019 19:55
> > To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> > Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should
> be used by interpreter and be faster for short arrays
> >
> > Hi Martin,
> >
> > On 01/04/2019 02:13 PM, Doerr, Martin wrote:
> >> Hi Gustavo,
> >>
> >> when called from the interpreter (the scenario you observed), R21 is set
> before resizing the frame to avoid wasted stack space
> (InterpreterMacroAssembler::call_from_interpreter).
> >
> > Got it. Thanks a lot for the explanations.
> >
> > I think it doesn't currently matter in practice, but I'm wondering if to be
> > consistent we should cut back the stack back earlier also in
> > TemplateInterpreterGenerator::generate_CRC32_update_entry()?
> >
> > diff -r a35f8c35d8c9
> src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
> > --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan
> 04 10:09:00 2019 +0100
> > +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan
> 04 13:44:37 2019 -0500
> > @@ -1840,11 +1840,12 @@
> >    #endif
> >        __ lwz(crc,  2*wordSize, argP);    // Current crc state, zero extend to 64
> bit to have a clean register.
> >
> > +    // Restore caller sp for c2i case and return.
> > +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller
> started.
> > +
> >        StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
> >        __ kernel_crc32_singleByte(crc, data, dataLen, table, tmp, true);
> >
> > -    // Restore caller sp for c2i case and return.
> > -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller
> started.
> >        __ blr();
> >
> >        // Generate a vanilla native entry as the slow path.
> >
> > Currently there is no issue probably because generated code is simpler and
> does
> > no spills.
> >
> > Best regards,
> > Gustavo
> >
> >> When called from compiled methods, R21 is set by a c2i adapter which
> extends the compiled frame by space for arguments (gen_c2i_adapter).
> >>
> >> "mr(R1_SP, R21_sender_SP)" is more error-prone than
> "resize_frame_absolute" so I think the latter would be better (though it takes
> more registers and instructions), but I don't want to replace that as part of
> this CRC change.
> >>
> >> Best regards,
> >> Martin
> >>
> >>
> >> -----Original Message-----
> >> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> >> Sent: Freitag, 4. Januar 2019 14:44
> >> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> >> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should
> be used by interpreter and be faster for short arrays
> >>
> >> Hi Martin,
> >>
> >> On 01/04/2019 07:30 AM, Doerr, Martin wrote:
> >>> thank you very much for confirming. This makes sense. We use different
> frame headers depending on whether the frame is the top Java frame or not
> (and on whether it's a debug build or not). Setting R1_SP to sender_SP is a
> shortcut for leaf calls which relies on having an unmodified stack until this
> point. So the patch fixes the issue.
> >>
> >> Glad to help! Thanks for the additional information, I was not aware that
> the
> >> selection of different frame headers could be done at compile time. One
> last
> >> question only for my education: what exactly advanced (incremented)
> R1_SP so it
> >> has to be cut back using sender_SP value, i.e. sender_SP tracks the frame
> for
> >> which function exactly or "who" is the caller exactly here?
> >>
> >> Thank you.
> >>
> >> Best regards,
> >> Gustavo
> >>
> >>> New webrev:
> >>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/
> >>>
> >>> Best regards,
> >>> Martin
> >>>
> >>>
> >>> -----Original Message-----
> >>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> >>> Sent: Donnerstag, 3. Januar 2019 19:36
> >>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> >>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation
> should be used by interpreter and be faster for short arrays
> >>>
> >>> Hi Martin,
> >>>
> >>> On 01/03/2019 03:34 PM, Doerr, Martin wrote:
> >>>> Unfortunately, I can't reproduce the crash. TestCRC32C works stable on
> our machine (with fastdbg build).
> >>>> I guess that the frameless spills mess up the stack. Can you check if the
> patch below helps?
> >>>
> >>> Thanks for providing a fix so I can try it.
> >>> Yes, I confirm the patch below indeed fixes the sigsegv crash when
> CRC32C update() method is used.
> >>> I also confirm that I don't observe the crash on the fastdebug build, only
> on the release build.
> >>> It also only affects the Interpreter mode, so passing -Xcomp avoids the
> crash on the release build.
> >>>
> >>> Just as reference, I can reproduce it on the release build with the
> following trivial code:
> >>>
> >>> import java.util.zip.CRC32C;
> >>>
> >>> class CRC32C_v1 {
> >>>       public static void main(String[] arg) {
> >>>         byte[] b = new byte[1024];
> >>>
> >>>         CRC32C crc32c = new CRC32C();
> >>>         crc32c.update(b, 0, b.length);
> >>>
> >>>         System.out.println(crc32c.getValue());
> >>>       }
> >>> }
> >>>
> >>> Thanks for fixing the typos.
> >>>
> >>>
> >>> Best regards,
> >>> Gustavo
> >>>
> >>>> Best regards,
> >>>> Martin
> >>>>
> >>>>
> >>>> diff -r a33f49d5998c
> src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
> >>>> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu
> Jan 03 17:30:03 2019 +0100
> >>>> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
> Thu Jan 03 18:33:16 2019 +0100
> >>>> @@ -1924,6 +1924,9 @@
> >>>>            __ addi(data, data,
> arrayOopDesc::base_offset_in_bytes(T_BYTE));
> >>>>          }
> >>>>
> >>>> +    // Restore caller sp for c2i case.
> >>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the
> caller started.
> >>>> +
> >>>>          StubRoutines::ppc64::generate_load_crc_table_addr(_masm,
> table);
> >>>>
> >>>>          if (!VM_Version::has_vpmsumb()) {
> >>>> @@ -1933,8 +1936,6 @@
> >>>>            __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3,
> tc0, tc1, tc2, true);
> >>>>          }
> >>>>
> >>>> -    // Restore caller sp for c2i case and return.
> >>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the
> caller started.
> >>>>          __ blr();
> >>>>
> >>>>          // Generate a vanilla native entry as the slow path.
> >>>> @@ -2014,6 +2015,9 @@
> >>>>            __ addi(data, data,
> arrayOopDesc::base_offset_in_bytes(T_BYTE));
> >>>>          }
> >>>>
> >>>> +    // Restore caller sp for c2i case.
> >>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the
> caller started.
> >>>> +
> >>>>          StubRoutines::ppc64::generate_load_crc32c_table_addr(_masm,
> table);
> >>>>
> >>>>          if (!VM_Version::has_vpmsumb()) {
> >>>> @@ -2023,8 +2027,6 @@
> >>>>            __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3,
> tc0, tc1, tc2, false);
> >>>>          }
> >>>>
> >>>> -    // Restore caller sp for c2i case and return.
> >>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the
> caller started.
> >>>>          __ blr();
> >>>>
> >>>>          BLOCK_COMMENT("} CRC32C_update{Bytes|DirectByteBuffer}");
> >>>>
> >>>>
> >>>> -----Original Message-----
> >>>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> >>>> Sent: Donnerstag, 3. Januar 2019 17:13
> >>>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> >>>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation
> should be used by interpreter and be faster for short arrays
> >>>>
> >>>> Hi Martin,
> >>>>
> >>>> oh that's nice. You removed the 512-byte block constraint and also
> wired it up to the Interpreter :)
> >>>>
> >>>> For the worst case, unaligned 512 byte array, I see the gap to aligned
> 512 byte array reduced by about ~5.7x.
> >>>>
> >>>> On the Interpreter I see an improvement of at least 50% for 1024 bytes.
> >>>>
> >>>> This is all for the CRC32 class.
> >>>>
> >>>> On CRC32C I'm getting a SIGSEV that can be reproduced running against
> ./test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java.
> >>>>
> >>>> I've upload a full log into
> http://cr.openjdk.java.net/~gromero/logs/crc32c_sigsegv/
> >>>>
> >>>> I'm leaving for the lunch and I'll take a closer look when back. But
> probably you will figure it out before I sit to appreciate the meal :)
> >>>>
> >>>> Finally, since the change does some cleanup, I wonder if it would be
> worth fixing the following typos:
> >>>>
> >>>> I think it's Barrett const., not Barret. Probably 'barret' is used in the
> code as a short version
> >>>> for Barrett but it should be changed in
> >>>>
> >>>> +  // Point to Barret constants
> >>>> +  add_const_optimized(cur_const, constants, outer_consts_size +
> inner_consts_size);
> >>>> +
> >>>>
> >>>> ?
> >>>>
> >>>> s/not/note/ in:
> >>>> cpu/ppc/macroAssembler_ppc.cpp:3977:// A not on the lookup table
> address(es):
> >>>>
> >>>> d/lives/ in:
> >>>> cpu/ppc/macroAssembler_ppc.cpp:4265:  mtvrwz(VCRC, crc); // crc
> lives lives in VCRC, now
> >>>>
> >>>> Best regards,
> >>>> Gustavo
> >>>>
> >>>> On 01/03/2019 12:17 PM, Doerr, Martin wrote:
> >>>>> Hi,
> >>>>>
> >>>>> the JVM on PPC64 currently misses usage of the fast vector
> implementation in the interpreter code.
> >>>>>
> >>>>> In addition, performance is not good for short arrays (unaligned 512
> byte arrays or shorter arrays) because the current vector implementation
> needs at least 512 bytes.
> >>>>>
> >>>>> Bug:
> >>>>>
> >>>>> https://bugs.openjdk.java.net/browse/JDK-8216060
> >>>>>
> >>>>> I have addressed these 2 issues + some cleanup with the following
> webrev:
> >>>>>
> >>>>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/
> <http://cr.openjdk.java.net/%7Emdoerr/8216060_PPC64_CRC/webrev.00/>
> >>>>>
> >>>>> Please review.
> >>>>>
> >>>>> Best regards,
> >>>>>
> >>>>> Martin
> >>>>>
> >>>>
> >>>
> >>
> >
> 


From goetz.lindenmaier at sap.com  Fri Jan 18 14:42:14 2019
From: goetz.lindenmaier at sap.com (Lindenmaier, Goetz)
Date: Fri, 18 Jan 2019 14:42:14 +0000
Subject: RFR(M): 8216060: [PPC64] Vector CRC implementation should be used
 by interpreter and be faster for short arrays
In-Reply-To: <AM6PR02MB478818B58E268930D16AEDE59A9C0@AM6PR02MB4788.eurprd02.prod.outlook.com>
References: <1c4646d554954551b73c077fa40f983d@sap.com>
 <771a457e-6b4d-f73b-e072-703490c9ced5@linux.vnet.ibm.com>
 <9863276de30643338249ead2a6ac7fe9@sap.com>
 <452dcb69-189e-700d-5995-582ba13669b9@linux.vnet.ibm.com>
 <37d9e6f3b2b4400d8963f54d2fe7767f@sap.com>
 <406db3e3-2ac3-dc16-f384-99a314e62a42@linux.vnet.ibm.com>
 <beff3d359c954a29962be71c40bc235b@sap.com>
 <180a6c0b-7abe-9d5c-51e6-dffbb23570d3@linux.vnet.ibm.com>
 <d898c13929a44afb82d477fd732d23e7@sap.com>
 <301fd43a-e5b5-d970-7a1a-2458dbaeec36@linux.vnet.ibm.com>
 <AM6PR02MB47887C5C3082A52540C5AB8B9A830@AM6PR02MB4788.eurprd02.prod.outlook.com>
 <8b1ca2bdba334f42a3c2b044a557dd8c@sap.com>
 <AM6PR02MB478818B58E268930D16AEDE59A9C0@AM6PR02MB4788.eurprd02.prod.outlook.com>
Message-ID: <3a71eaf686bb4cf48946d668c6cb3868@sap.com>

Hi Martin, 

thanks for improving this, looks good now!
Actually, this is much more cleanup than I expected :)

Best regards,
  Goetz.

> -----Original Message-----
> From: Doerr, Martin
> Sent: Freitag, 18. Januar 2019 15:33
> To: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>; Gustavo Romero
> <gromero at linux.vnet.ibm.com>; 'hotspot-compiler-dev at openjdk.java.net'
> <hotspot-compiler-dev at openjdk.java.net>
> Subject: RE: RFR(M): 8216060: [PPC64] Vector CRC implementation should be
> used by interpreter and be faster for short arrays
> 
> Hi G?tz,
> 
> that's a good proposal. I've moved the common functionality into
> macroAssembler_ppc. This makes interpreter and stubGenerator code shorter.
> 
> I've also moved the vector constants computation to stubGenerator such that
> we only do it when the intrinsics are enabled and the vector version is
> supported by the processor.
> 
> New webrev:
> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.02/
> 
> @Gustavo: Thanks for testing and confirming the issue (JDK-8216376) is fixed.
> 
> Best regards,
> Martin
> 
> 
> -----Original Message-----
> From: Lindenmaier, Goetz
> Sent: Freitag, 18. Januar 2019 12:03
> To: Doerr, Martin <martin.doerr at sap.com>; Gustavo Romero
> <gromero at linux.vnet.ibm.com>; 'hotspot-compiler-dev at openjdk.java.net'
> <hotspot-compiler-dev at openjdk.java.net>
> Subject: RE: RFR(M): 8216060: [PPC64] Vector CRC implementation should be
> used by interpreter and be faster for short arrays
> 
> Hi Martin,
> 
> I had a look at your change.
> Overall looks good. According to Gustavos mail a nice improvement!
> 
> I think though that the way to select the algorithm is quite
> messy:
> In templateInterpreter vpmsumb is checked and the methods are
> called directly.
> In stubGenerator, generate_CRC32...()
>   vpmsumb is tested to decide on vector_constants = R2.
>   and generic generate_CRC_updateBytes is called, which
> again checks whether verctor_constants == R2.
> 
> I think generate_CRC_updateBytes() or some other generic
> function should be located in macroAssembler_ppc and
> be called from both locations.
> 
> What do you think?
> 
> Best regards,
>   Goetz
> 
> 
> 
> > -----Original Message-----
> > From: Doerr, Martin
> > Sent: Donnerstag, 17. Januar 2019 14:18
> > To: Gustavo Romero <gromero at linux.vnet.ibm.com>; 'hotspot-compiler-
> > dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>;
> > Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
> > Subject: RE: RFR(M): 8216060: [PPC64] Vector CRC implementation should
> be
> > used by interpreter and be faster for short arrays
> >
> > Hi,
> >
> > the rebased webrev.01 applies on jdk/jdk, now (after JDK-8216376). So the
> > issue Gustavo had observed does not longer exist.
> > http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/
> >
> > I have updated copyrights and retested it.
> >
> > Best regards,
> > Martin
> >
> >
> > -----Original Message-----
> > From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> > Sent: Montag, 7. Januar 2019 14:52
> > To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> > dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> > Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should
> be
> > used by interpreter and be faster for short arrays
> >
> > Hi Martin,
> >
> > On 01/07/2019 11:49 AM, Doerr, Martin wrote:
> > > I want to check all places where we use "mr(R1_SP, R21_sender_SP)".
> > There may be more issues with that. I'll probably handle that in a separate
> > change and push this CRC change afterwards.
> >
> > I see. Thanks for letting me know.
> >
> > Best regards,
> > Gustavo
> >
> > > Best regards,
> > > Martin
> > >
> > >
> > > -----Original Message-----
> > > From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> > > Sent: Freitag, 4. Januar 2019 19:55
> > > To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> > dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> > > Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation should
> > be used by interpreter and be faster for short arrays
> > >
> > > Hi Martin,
> > >
> > > On 01/04/2019 02:13 PM, Doerr, Martin wrote:
> > >> Hi Gustavo,
> > >>
> > >> when called from the interpreter (the scenario you observed), R21 is set
> > before resizing the frame to avoid wasted stack space
> > (InterpreterMacroAssembler::call_from_interpreter).
> > >
> > > Got it. Thanks a lot for the explanations.
> > >
> > > I think it doesn't currently matter in practice, but I'm wondering if to be
> > > consistent we should cut back the stack back earlier also in
> > > TemplateInterpreterGenerator::generate_CRC32_update_entry()?
> > >
> > > diff -r a35f8c35d8c9
> > src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
> > > --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan
> > 04 10:09:00 2019 +0100
> > > +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Fri Jan
> > 04 13:44:37 2019 -0500
> > > @@ -1840,11 +1840,12 @@
> > >    #endif
> > >        __ lwz(crc,  2*wordSize, argP);    // Current crc state, zero extend to 64
> > bit to have a clean register.
> > >
> > > +    // Restore caller sp for c2i case and return.
> > > +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller
> > started.
> > > +
> > >        StubRoutines::ppc64::generate_load_crc_table_addr(_masm, table);
> > >        __ kernel_crc32_singleByte(crc, data, dataLen, table, tmp, true);
> > >
> > > -    // Restore caller sp for c2i case and return.
> > > -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the caller
> > started.
> > >        __ blr();
> > >
> > >        // Generate a vanilla native entry as the slow path.
> > >
> > > Currently there is no issue probably because generated code is simpler and
> > does
> > > no spills.
> > >
> > > Best regards,
> > > Gustavo
> > >
> > >> When called from compiled methods, R21 is set by a c2i adapter which
> > extends the compiled frame by space for arguments (gen_c2i_adapter).
> > >>
> > >> "mr(R1_SP, R21_sender_SP)" is more error-prone than
> > "resize_frame_absolute" so I think the latter would be better (though it takes
> > more registers and instructions), but I don't want to replace that as part of
> > this CRC change.
> > >>
> > >> Best regards,
> > >> Martin
> > >>
> > >>
> > >> -----Original Message-----
> > >> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> > >> Sent: Freitag, 4. Januar 2019 14:44
> > >> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> > dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> > >> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation
> should
> > be used by interpreter and be faster for short arrays
> > >>
> > >> Hi Martin,
> > >>
> > >> On 01/04/2019 07:30 AM, Doerr, Martin wrote:
> > >>> thank you very much for confirming. This makes sense. We use different
> > frame headers depending on whether the frame is the top Java frame or not
> > (and on whether it's a debug build or not). Setting R1_SP to sender_SP is a
> > shortcut for leaf calls which relies on having an unmodified stack until this
> > point. So the patch fixes the issue.
> > >>
> > >> Glad to help! Thanks for the additional information, I was not aware that
> > the
> > >> selection of different frame headers could be done at compile time. One
> > last
> > >> question only for my education: what exactly advanced (incremented)
> > R1_SP so it
> > >> has to be cut back using sender_SP value, i.e. sender_SP tracks the frame
> > for
> > >> which function exactly or "who" is the caller exactly here?
> > >>
> > >> Thank you.
> > >>
> > >> Best regards,
> > >> Gustavo
> > >>
> > >>> New webrev:
> > >>> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.01/
> > >>>
> > >>> Best regards,
> > >>> Martin
> > >>>
> > >>>
> > >>> -----Original Message-----
> > >>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> > >>> Sent: Donnerstag, 3. Januar 2019 19:36
> > >>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> > dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> > >>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation
> > should be used by interpreter and be faster for short arrays
> > >>>
> > >>> Hi Martin,
> > >>>
> > >>> On 01/03/2019 03:34 PM, Doerr, Martin wrote:
> > >>>> Unfortunately, I can't reproduce the crash. TestCRC32C works stable on
> > our machine (with fastdbg build).
> > >>>> I guess that the frameless spills mess up the stack. Can you check if the
> > patch below helps?
> > >>>
> > >>> Thanks for providing a fix so I can try it.
> > >>> Yes, I confirm the patch below indeed fixes the sigsegv crash when
> > CRC32C update() method is used.
> > >>> I also confirm that I don't observe the crash on the fastdebug build, only
> > on the release build.
> > >>> It also only affects the Interpreter mode, so passing -Xcomp avoids the
> > crash on the release build.
> > >>>
> > >>> Just as reference, I can reproduce it on the release build with the
> > following trivial code:
> > >>>
> > >>> import java.util.zip.CRC32C;
> > >>>
> > >>> class CRC32C_v1 {
> > >>>       public static void main(String[] arg) {
> > >>>         byte[] b = new byte[1024];
> > >>>
> > >>>         CRC32C crc32c = new CRC32C();
> > >>>         crc32c.update(b, 0, b.length);
> > >>>
> > >>>         System.out.println(crc32c.getValue());
> > >>>       }
> > >>> }
> > >>>
> > >>> Thanks for fixing the typos.
> > >>>
> > >>>
> > >>> Best regards,
> > >>> Gustavo
> > >>>
> > >>>> Best regards,
> > >>>> Martin
> > >>>>
> > >>>>
> > >>>> diff -r a33f49d5998c
> > src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
> > >>>> --- a/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp  Thu
> > Jan 03 17:30:03 2019 +0100
> > >>>> +++ b/src/hotspot/cpu/ppc/templateInterpreterGenerator_ppc.cpp
> > Thu Jan 03 18:33:16 2019 +0100
> > >>>> @@ -1924,6 +1924,9 @@
> > >>>>            __ addi(data, data,
> > arrayOopDesc::base_offset_in_bytes(T_BYTE));
> > >>>>          }
> > >>>>
> > >>>> +    // Restore caller sp for c2i case.
> > >>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the
> > caller started.
> > >>>> +
> > >>>>          StubRoutines::ppc64::generate_load_crc_table_addr(_masm,
> > table);
> > >>>>
> > >>>>          if (!VM_Version::has_vpmsumb()) {
> > >>>> @@ -1933,8 +1936,6 @@
> > >>>>            __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3,
> > tc0, tc1, tc2, true);
> > >>>>          }
> > >>>>
> > >>>> -    // Restore caller sp for c2i case and return.
> > >>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the
> > caller started.
> > >>>>          __ blr();
> > >>>>
> > >>>>          // Generate a vanilla native entry as the slow path.
> > >>>> @@ -2014,6 +2015,9 @@
> > >>>>            __ addi(data, data,
> > arrayOopDesc::base_offset_in_bytes(T_BYTE));
> > >>>>          }
> > >>>>
> > >>>> +    // Restore caller sp for c2i case.
> > >>>> +    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the
> > caller started.
> > >>>> +
> > >>>>          StubRoutines::ppc64::generate_load_crc32c_table_addr(_masm,
> > table);
> > >>>>
> > >>>>          if (!VM_Version::has_vpmsumb()) {
> > >>>> @@ -2023,8 +2027,6 @@
> > >>>>            __ kernel_crc32_vpmsum(crc, data, dataLen, table, t0, t1, t2, t3,
> > tc0, tc1, tc2, false);
> > >>>>          }
> > >>>>
> > >>>> -    // Restore caller sp for c2i case and return.
> > >>>> -    __ mr(R1_SP, R21_sender_SP); // Cut the stack back to where the
> > caller started.
> > >>>>          __ blr();
> > >>>>
> > >>>>          BLOCK_COMMENT("} CRC32C_update{Bytes|DirectByteBuffer}");
> > >>>>
> > >>>>
> > >>>> -----Original Message-----
> > >>>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> > >>>> Sent: Donnerstag, 3. Januar 2019 17:13
> > >>>> To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> > dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> > >>>> Subject: Re: RFR(M): 8216060: [PPC64] Vector CRC implementation
> > should be used by interpreter and be faster for short arrays
> > >>>>
> > >>>> Hi Martin,
> > >>>>
> > >>>> oh that's nice. You removed the 512-byte block constraint and also
> > wired it up to the Interpreter :)
> > >>>>
> > >>>> For the worst case, unaligned 512 byte array, I see the gap to aligned
> > 512 byte array reduced by about ~5.7x.
> > >>>>
> > >>>> On the Interpreter I see an improvement of at least 50% for 1024 bytes.
> > >>>>
> > >>>> This is all for the CRC32 class.
> > >>>>
> > >>>> On CRC32C I'm getting a SIGSEV that can be reproduced running
> against
> > ./test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java.
> > >>>>
> > >>>> I've upload a full log into
> > http://cr.openjdk.java.net/~gromero/logs/crc32c_sigsegv/
> > >>>>
> > >>>> I'm leaving for the lunch and I'll take a closer look when back. But
> > probably you will figure it out before I sit to appreciate the meal :)
> > >>>>
> > >>>> Finally, since the change does some cleanup, I wonder if it would be
> > worth fixing the following typos:
> > >>>>
> > >>>> I think it's Barrett const., not Barret. Probably 'barret' is used in the
> > code as a short version
> > >>>> for Barrett but it should be changed in
> > >>>>
> > >>>> +  // Point to Barret constants
> > >>>> +  add_const_optimized(cur_const, constants, outer_consts_size +
> > inner_consts_size);
> > >>>> +
> > >>>>
> > >>>> ?
> > >>>>
> > >>>> s/not/note/ in:
> > >>>> cpu/ppc/macroAssembler_ppc.cpp:3977:// A not on the lookup table
> > address(es):
> > >>>>
> > >>>> d/lives/ in:
> > >>>> cpu/ppc/macroAssembler_ppc.cpp:4265:  mtvrwz(VCRC, crc); // crc
> > lives lives in VCRC, now
> > >>>>
> > >>>> Best regards,
> > >>>> Gustavo
> > >>>>
> > >>>> On 01/03/2019 12:17 PM, Doerr, Martin wrote:
> > >>>>> Hi,
> > >>>>>
> > >>>>> the JVM on PPC64 currently misses usage of the fast vector
> > implementation in the interpreter code.
> > >>>>>
> > >>>>> In addition, performance is not good for short arrays (unaligned 512
> > byte arrays or shorter arrays) because the current vector implementation
> > needs at least 512 bytes.
> > >>>>>
> > >>>>> Bug:
> > >>>>>
> > >>>>> https://bugs.openjdk.java.net/browse/JDK-8216060
> > >>>>>
> > >>>>> I have addressed these 2 issues + some cleanup with the following
> > webrev:
> > >>>>>
> > >>>>>
> http://cr.openjdk.java.net/~mdoerr/8216060_PPC64_CRC/webrev.00/
> > <http://cr.openjdk.java.net/%7Emdoerr/8216060_PPC64_CRC/webrev.00/>
> > >>>>>
> > >>>>> Please review.
> > >>>>>
> > >>>>> Best regards,
> > >>>>>
> > >>>>> Martin
> > >>>>>
> > >>>>
> > >>>
> > >>
> > >
> >
> 
> 


From aph at redhat.com  Fri Jan 18 14:56:07 2019
From: aph at redhat.com (Andrew Haley)
Date: Fri, 18 Jan 2019 14:56:07 +0000
Subject: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
 locking optimisation not triggered
In-Reply-To: <4a09e8b7-9990-aa66-0afb-bf4e41cab831@arm.com>
References: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
 <f1d6de8b-a308-a119-57a5-c62fbacb4e88@redhat.com>
 <4a09e8b7-9990-aa66-0afb-bf4e41cab831@arm.com>
Message-ID: <a3fb9ae8-42f4-826c-ad22-32be5e9de24d@redhat.com>

Hi,

On 1/18/19 9:52 AM, Nick Gasson (Arm Technology China) wrote:

> On 18/01/2019 17:36, Andrew Haley wrote:
>>
>> The patch looks good. However, I don't understand why we aren't using
>> MacroAssembler::cmpxchgptr here. It looks like we should be, and you'd
>> end up with a less complex result.
> 
> It's not exactly the same though: MacroAssembler::cmpxchgptr adds a "dmb 
> ish" to the failure path which I don't think is required here.

Oh, sorry. I should have said MacroAssembler::cmpxchg, with a
br.eq(cont) afterward.

>>> * Does anyone know what the comment "// Load Compare Value application
>>> register." means? It's present in the PPC and S390 ports too.
>>
>> Probably no-one can remember. We'll have inherited it from x86.
> 
> Let's delete it then.

OK.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From gromero at linux.vnet.ibm.com  Fri Jan 18 14:57:13 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Fri, 18 Jan 2019 12:57:13 -0200
Subject: [11u backport] RFR(S): 8215317: [GRAAL] unit test
 CheckGraalIntrinsics failed after 8213754
Message-ID: <2ed804b4-85d5-feb7-edab-85d6dee66c74@linux.vnet.ibm.com>

Hi,

Could the following backport to 11u be reviewed, please?

Bug     : https://bugs.openjdk.java.net/browse/JDK-8215317
Change  : http://hg.openjdk.java.net/jdk/jdk/rev/108a161aed93
Backport: http://cr.openjdk.java.net/~gromero/8215317_jdk11u/v1/

It adds 4 intrinsics to the Graal test CheckGraalIntrinsics.java list so
JDK 11u becomes aware of them. Otherwise that test will break once change
8213754 [0] lands 11u (which will effectively add the 4 intrinsics to
PPC64/Hotspot and adapt the correlated methods to be intrinsified).

The backport changed the inclusion of the intrinsics for JDK 11 or higher,
instead for JDK 12 or higher (original patch).

This backport was tested on x86_64 with
./test/hotspot/jtreg/compiler/{c1,c2,intrinsics} plus
./test/hotspot/jtreg/compiler/graalunit (with Graal compiler enabled)
and no regressions were observed too.

Thank you.

Best regards,
Gustavo

[0] https://bugs.openjdk.java.net/browse/JDK-8213754


From gromero at linux.vnet.ibm.com  Fri Jan 18 15:07:20 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Fri, 18 Jan 2019 13:07:20 -0200
Subject: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
 isDigit/isLowerCase/isUpperCase/isWhitespace
Message-ID: <2d4d1747-a83d-5f65-eea3-d982969ae4fd@linux.vnet.ibm.com>

Hi,

Could the following backport to 11u be reviewed, please?

Bug     : https://bugs.openjdk.java.net/browse/JDK-8213754
Change  : http://hg.openjdk.java.net/jdk/jdk/rev/7384e00d5860
Backport: http://cr.openjdk.java.net/~gromero/8213754_jdk11u/v1/

It adds 4 intrinsics that use instructions introduced by POWER9 in order to
speed up methods isDigit, isLowerCase, isUpperCase, and isWhitespace.

The change is mostly PPC64-only but it does touch shared code, for
instance, in order to adapt the methods in question to be properly
intrinsified. It also needs an additional change [0], since one Graal
test has to be adapted (a separated RFR to backport [0] was sent to [1]).

The change applies almost cleanly: only a small tweak is necessary because
the hunk for ppc.ad file relies on some absent text in the 11u code around
the change to be applied. That absent text is related to the Superword
feature (a non-related feature), which is not backported yet to 11u.

This backport was tested on POWER8 and POWER9 and no regressions were
observed.

This backport was also tested on x86_64 with
./test/hotspot/jtreg/compiler/{c1,c2,intrinsics} plus
./test/hotspot/jtreg/compiler/graalunit (with Graal compiler enabled) with
change 8215317 [0] applied and no regressions were observed too.

Thank you.

Best regards,
Gustavo

[0] http://cr.openjdk.java.net/~gromero/8215317_jdk11u/v1/
[1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2019-January/032266.html


From Roger.Riggs at oracle.com  Fri Jan 18 15:35:24 2019
From: Roger.Riggs at oracle.com (Roger Riggs)
Date: Fri, 18 Jan 2019 10:35:24 -0500
Subject: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
 isDigit/isLowerCase/isUpperCase/isWhitespace
In-Reply-To: <2d4d1747-a83d-5f65-eea3-d982969ae4fd@linux.vnet.ibm.com>
References: <2d4d1747-a83d-5f65-eea3-d982969ae4fd@linux.vnet.ibm.com>
Message-ID: <c8e419d0-e326-5ca4-be4a-303ea94c9d31@oracle.com>

Looks good for the jdk files.

Regards, Roger

On 01/18/2019 10:07 AM, Gustavo Romero wrote:
> Hi,
>
> Could the following backport to 11u be reviewed, please?
>
> Bug???? : https://bugs.openjdk.java.net/browse/JDK-8213754
> Change? : http://hg.openjdk.java.net/jdk/jdk/rev/7384e00d5860
> Backport: http://cr.openjdk.java.net/~gromero/8213754_jdk11u/v1/
>
> It adds 4 intrinsics that use instructions introduced by POWER9 in 
> order to
> speed up methods isDigit, isLowerCase, isUpperCase, and isWhitespace.
>
> The change is mostly PPC64-only but it does touch shared code, for
> instance, in order to adapt the methods in question to be properly
> intrinsified. It also needs an additional change [0], since one Graal
> test has to be adapted (a separated RFR to backport [0] was sent to [1]).
>
> The change applies almost cleanly: only a small tweak is necessary 
> because
> the hunk for ppc.ad file relies on some absent text in the 11u code 
> around
> the change to be applied. That absent text is related to the Superword
> feature (a non-related feature), which is not backported yet to 11u.
>
> This backport was tested on POWER8 and POWER9 and no regressions were
> observed.
>
> This backport was also tested on x86_64 with
> ./test/hotspot/jtreg/compiler/{c1,c2,intrinsics} plus
> ./test/hotspot/jtreg/compiler/graalunit (with Graal compiler enabled) 
> with
> change 8215317 [0] applied and no regressions were observed too.
>
> Thank you.
>
> Best regards,
> Gustavo
>
> [0] http://cr.openjdk.java.net/~gromero/8215317_jdk11u/v1/
> [1] 
> https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2019-January/032266.html
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190118/1004dc72/attachment.html>

From claes.redestad at oracle.com  Fri Jan 18 16:03:45 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Fri, 18 Jan 2019 17:03:45 +0100
Subject: RFR: 8217387: Remove dead develop flag CIFireOOMAt
Message-ID: <d55718df-cec5-ecc4-4830-b2aff9e16895@oracle.com>

Hi,

the develop flag CIFireOOMAt is effectively dead and should be removed.

Webrev: http://cr.openjdk.java.net/~redestad/8217387/open.00/
Bug:    https://bugs.openjdk.java.net/browse/JDK-8217387

Thanks!

/Claes

From shade at redhat.com  Fri Jan 18 16:01:04 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Fri, 18 Jan 2019 17:01:04 +0100
Subject: RFR: 8217387: Remove dead develop flag CIFireOOMAt
In-Reply-To: <d55718df-cec5-ecc4-4830-b2aff9e16895@oracle.com>
References: <d55718df-cec5-ecc4-4830-b2aff9e16895@oracle.com>
Message-ID: <03f5e7bf-bb4c-a4bd-c959-1ad3c754f130@redhat.com>

On 1/18/19 5:03 PM, Claes Redestad wrote:
> the develop flag CIFireOOMAt is effectively dead and should be removed.
> 
> Webrev: http://cr.openjdk.java.net/~redestad/8217387/open.00/
> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217387

Looks good. There are indeed no "write" usages.

-Aleksey

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190118/05944802/signature.asc>

From claes.redestad at oracle.com  Fri Jan 18 16:10:08 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Fri, 18 Jan 2019 17:10:08 +0100
Subject: RFR: 8217387: Remove dead develop flag CIFireOOMAt
In-Reply-To: <03f5e7bf-bb4c-a4bd-c959-1ad3c754f130@redhat.com>
References: <d55718df-cec5-ecc4-4830-b2aff9e16895@oracle.com>
 <03f5e7bf-bb4c-a4bd-c959-1ad3c754f130@redhat.com>
Message-ID: <dd21bb67-5cdc-b70a-5c18-267fc220b1ea@oracle.com>

On 2019-01-18 17:01, Aleksey Shipilev wrote:
> Looks good. There are indeed no "write" usages.

Thanks!

/Claes

From lutz.schmidt at sap.com  Fri Jan 18 16:05:36 2019
From: lutz.schmidt at sap.com (Schmidt, Lutz)
Date: Fri, 18 Jan 2019 16:05:36 +0000
Subject: RFR(S, tedious): 8217250: Optimize CodeHeap Analytics
In-Reply-To: <52136751-929b-4976-477d-93282ce0a0d7@oracle.com>
References: <D481567B-EA9D-4BD2-A55C-A0AF157F0EDD@sap.com>
 <c178e47c-279b-47f8-6876-2ad28593d630@oracle.com>
 <A2738EC3-3F36-4898-87F9-EC786EA19D46@sap.com>
 <2d7b7963-61be-95b8-017b-956f2752c8f3@oracle.com>
 <9AD26B9E-015A-4BD4-A44F-12DDE2793ED0@sap.com>
 <4e61a8a5-3c6e-3c4f-0a2c-68d4b8bc2f9f@oracle.com>
 <52136751-929b-4976-477d-93282ce0a0d7@oracle.com>
Message-ID: <CE7CC46E-F9D9-4B52-B040-46BC1B25CA49@sap.com>

Thank you, Tobias!

As this enhancement will not make it into jdk12, I'll rebase it to jdk/jdk. I expect no conflicts and assume I can then push without further webrev/review. 

Thanks,
Lutz

?On 18.01.19, 10:49, "Tobias Hartmann" <tobias.hartmann at oracle.com> wrote:

    Hi Lutz,
    
    looks good to me too.
    
    Best regards,
    Tobias
    
    On 17.01.19 19:39, Vladimir Kozlov wrote:
    > Looks good
    > 
    > Thanks,
    > Vladimir
    > 
    > On 1/17/19 7:47 AM, Schmidt, Lutz wrote:
    >> Hi Vladimir & all,
    >> there is a new webrev available: http://cr.openjdk.java.net/~lucy/webrevs/8217250.01/
    >> What's new (in addition to some comments) is the macro
    >>
    >>    // Flush the buffer contents if the remaining capacity is less
    >>    // than the calculated threshold (256 bytes + capacity/16)
    >>    // That should suffice for all reasonably sized output lines.
    >>    #define BUFFEREDSTREAM_FLUSH_AUTO(_termString)                \
    >>        BUFFEREDSTREAM_FLUSH_IF(_termString, 256+(_capacity>>4))
    >>
    >> It replaced the previous BUFFEREDSTREAM_FLUSH_IF("string", 512) occurrences.
    >> Regards,
    >> Lutz
    >>
    >> On 16.01.19, 22:53, "Vladimir Kozlov" <vladimir.kozlov at oracle.com> wrote:
    >>
    >>      On 1/16/19 12:37 PM, Schmidt, Lutz wrote:
    >>      > Hi Vladimir,
    >>      >
    >>      > thanks a lot for looking at this so quickly.
    >>      >
    >>      > Sure, I could declare a specialized "BUFFEREDSTREAM_FLUSH_512" for this. The "512"
    >> originated from the thought "its large enough for a well-behaved line and small enough to save
    >> some flushes".
    >>      >
    >>      > I was also thinking about a "BUFFEREDSTREAM_FLUSH_AUTO", where the spare space is derived
    >> from the buffer capacity, maybe something like 10 percent of the capacity, 256 bytes minimum. I
    >> wasn't sure if that could be categorized as over-engineered.
    >>           Yes, I think BUFFEREDSTREAM_FLUSH_AUTO is better than fixed size.
    >>           Vladimir
    >>           >
    >>      > Your thoughts?
    >>      >
    >>      > Thanks,
    >>      > Lutz
    >>      >
    >>      > On 16.01.19, 19:10, "hotspot-compiler-dev on behalf of Vladimir Kozlov"
    >> <hotspot-compiler-dev-bounces at openjdk.java.net on behalf of vladimir.kozlov at oracle.com> wrote:
    >>      >
    >>      >      Hi Lutz,
    >>      >
    >>      >      I see that you have only one usage in all cases for:
    >>      >      BUFFEREDSTREAM_FLUSH_IF("", 512)
    >>      >
    >>      >      Can you simple declare simplified macro for this?
    >>      >
    >>      >      Otherwise looks good.
    >>      >
    >>      >      Thanks,
    >>      >      Vladimir
    >>      >
    >>      >      On 1/16/19 6:52 AM, Schmidt, Lutz wrote:
    >>      >      > Dear all,
    >>      >      >
    >>      >      > may I please have reviews for this (semantically) small change. Its purpose is to
    >> reduce the bufferedStream buffer flushes while printing CodeHeap Analytics.
    >>      >      >
    >>      >      > Bug:    https://bugs.openjdk.java.net/browse/JDK-8217250
    >>      >      > Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217250.00/
    >>      >      >
    >>      >      > Thank you!
    >>      >      > Lutz
    >>      >      >
    >>      >      >
    >>      >
    >>      >
    >>     
    

From martin.doerr at sap.com  Fri Jan 18 16:07:53 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Fri, 18 Jan 2019 16:07:53 +0000
Subject: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
 isDigit/isLowerCase/isUpperCase/isWhitespace
In-Reply-To: <c8e419d0-e326-5ca4-be4a-303ea94c9d31@oracle.com>
References: <2d4d1747-a83d-5f65-eea3-d982969ae4fd@linux.vnet.ibm.com>
 <c8e419d0-e326-5ca4-be4a-303ea94c9d31@oracle.com>
Message-ID: <AM6PR02MB4788BC20E6C7E3423BFF41069A9C0@AM6PR02MB4788.eurprd02.prod.outlook.com>

Hi Gustavo,

hotspot part looks good, too.

Best regards,
Martin


From: Roger Riggs <Roger.Riggs at oracle.com>
Sent: Freitag, 18. Januar 2019 16:35
To: Gustavo Romero <gromero at linux.vnet.ibm.com>; hotspot-compiler-dev at openjdk.java.net; Lindenmaier, Goetz <goetz.lindenmaier at sap.com>; Doerr, Martin <martin.doerr at sap.com>; vladimir.kozlov at oracle.com
Cc: Michihiro Horie <HORIE at jp.ibm.com>
Subject: Re: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for isDigit/isLowerCase/isUpperCase/isWhitespace

Looks good for the jdk files.

Regards, Roger
On 01/18/2019 10:07 AM, Gustavo Romero wrote:
Hi,

Could the following backport to 11u be reviewed, please?

Bug     : https://bugs.openjdk.java.net/browse/JDK-8213754
Change  : http://hg.openjdk.java.net/jdk/jdk/rev/7384e00d5860
Backport: http://cr.openjdk.java.net/~gromero/8213754_jdk11u/v1/

It adds 4 intrinsics that use instructions introduced by POWER9 in order to
speed up methods isDigit, isLowerCase, isUpperCase, and isWhitespace.

The change is mostly PPC64-only but it does touch shared code, for
instance, in order to adapt the methods in question to be properly
intrinsified. It also needs an additional change [0], since one Graal
test has to be adapted (a separated RFR to backport [0] was sent to [1]).

The change applies almost cleanly: only a small tweak is necessary because
the hunk for ppc.ad file relies on some absent text in the 11u code around
the change to be applied. That absent text is related to the Superword
feature (a non-related feature), which is not backported yet to 11u.

This backport was tested on POWER8 and POWER9 and no regressions were
observed.

This backport was also tested on x86_64 with
./test/hotspot/jtreg/compiler/{c1,c2,intrinsics} plus
./test/hotspot/jtreg/compiler/graalunit (with Graal compiler enabled) with
change 8215317 [0] applied and no regressions were observed too.

Thank you.

Best regards,
Gustavo

[0] http://cr.openjdk.java.net/~gromero/8215317_jdk11u/v1/
[1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2019-January/032266.html

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190118/2c054212/attachment.html>

From claes.redestad at oracle.com  Fri Jan 18 16:15:04 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Fri, 18 Jan 2019 17:15:04 +0100
Subject: RFR (trivial): 8217388: Remove develop flag ProfilerPCTickThreshold
Message-ID: <d03688ca-ac3f-23ea-c9d9-50080ac84b55@oracle.com>

Hi,

this flag does not spark joy.

Bug: https://bugs.openjdk.java.net/browse/JDK-8217388
Patch:
diff -r 0a48b128e3d4 src/hotspot/share/runtime/globals.hpp
--- a/src/hotspot/share/runtime/globals.hpp	Fri Jan 18 16:49:35 2019 +0100
+++ b/src/hotspot/share/runtime/globals.hpp	Fri Jan 18 17:07:38 2019 +0100
@@ -1670,9 +1670,6 @@
    develop(intx, DontYieldALotInterval,    10, 
      \
            "Interval between which yields will be dropped 
(milliseconds)")   \
 
      \
-  develop(intx, ProfilerPCTickThreshold,    15, 
     \
-          "Number of ticks in a PC buckets to be a hotspot") 
     \
- 
     \
    notproduct(intx, DeoptimizeALotInterval,     5, 
      \
            "Number of exits until DeoptimizeALot kicks in") 
      \
 
      \

/Claes

From shade at redhat.com  Fri Jan 18 16:11:03 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Fri, 18 Jan 2019 17:11:03 +0100
Subject: RFR (trivial): 8217388: Remove develop flag
 ProfilerPCTickThreshold
In-Reply-To: <d03688ca-ac3f-23ea-c9d9-50080ac84b55@oracle.com>
References: <d03688ca-ac3f-23ea-c9d9-50080ac84b55@oracle.com>
Message-ID: <181d4585-9d4f-c5ec-8cac-7cf44636fea4@redhat.com>

On 1/18/19 5:15 PM, Claes Redestad wrote:
> this flag does not spark joy.

Which means "there are no uses anywhere at all".

> Bug: https://bugs.openjdk.java.net/browse/JDK-8217388
> Patch:
> diff -r 0a48b128e3d4 src/hotspot/share/runtime/globals.hpp
> --- a/src/hotspot/share/runtime/globals.hpp??? Fri Jan 18 16:49:35 2019 +0100
> +++ b/src/hotspot/share/runtime/globals.hpp??? Fri Jan 18 17:07:38 2019 +0100
> @@ -1670,9 +1670,6 @@
> ?? develop(intx, DontYieldALotInterval,??? 10, ???? \
> ?????????? "Interval between which yields will be dropped (milliseconds)")?? \
> 
> ???? \
> -? develop(intx, ProfilerPCTickThreshold,??? 15, ??? \
> -????????? "Number of ticks in a PC buckets to be a hotspot") ??? \
> - ??? \
> ?? notproduct(intx, DeoptimizeALotInterval,???? 5, ???? \
> ?????????? "Number of exits until DeoptimizeALot kicks in") ???? \
> 
> ???? \

Looks good to me.

-Aleksey

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190118/58cdb6c1/signature.asc>

From claes.redestad at oracle.com  Fri Jan 18 16:21:47 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Fri, 18 Jan 2019 17:21:47 +0100
Subject: RFR (trivial): 8217388: Remove develop flag
 ProfilerPCTickThreshold
In-Reply-To: <181d4585-9d4f-c5ec-8cac-7cf44636fea4@redhat.com>
References: <d03688ca-ac3f-23ea-c9d9-50080ac84b55@oracle.com>
 <181d4585-9d4f-c5ec-8cac-7cf44636fea4@redhat.com>
Message-ID: <4b078f61-39d5-42d6-b8a1-1627b0b8608a@oracle.com>


On 2019-01-18 17:11, Aleksey Shipilev wrote:
> Looks good to me.

Thanks!

/Claes

From derekw at marvell.com  Fri Jan 18 17:29:02 2019
From: derekw at marvell.com (Derek White)
Date: Fri, 18 Jan 2019 17:29:02 +0000
Subject: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
 locking optimisation not triggered
Message-ID: <MN2PR18MB2733BD918F17E4626D00D1ABD29C0@MN2PR18MB2733.namprd18.prod.outlook.com>


> -----Original Message-----
> From: aarch64-port-dev <aarch64-port-dev-bounces at openjdk.java.net> On
> Behalf Of Andrew Haley
> Sent: Friday, January 18, 2019 4:37 AM
> To: Nick Gasson (Arm Technology China) <Nick.Gasson at arm.com>; hotspot-
> compiler-dev at openjdk.java.net compiler <hotspot-compiler-
> dev at openjdk.java.net>
> Cc: nd <nd at arm.com>; aarch64-port-dev at openjdk.java.net
> Subject: [EXT] Re: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive
> stack locking optimisation not triggered
> 
...
> The patch looks good. However, I don't understand why we aren't using
> MacroAssembler::cmpxchgptr here. It looks like we should be, and you'd end
> up with a less complex result.

Uh oh ??
The original code used cmpxchgptr, but it introduced too many unnecessary branches. So you or Ed changed it to this code, with a (7-8 line) comment "Formerly: __ cmpxchgptr" etc, etc. I thought that comment didn't add much for all that bulk so I asked Nick to rip the comment out!

The function now fits on one screen (of sufficient size) though.

Getting cmpxchgptr to work without the extra branches would be a better solution if someone has any thoughts in that direction.

 - Derek

From derekw at marvell.com  Fri Jan 18 18:14:18 2019
From: derekw at marvell.com (Derek White)
Date: Fri, 18 Jan 2019 18:14:18 +0000
Subject: 8217368: AArch64: C2 recursive stack locking optimisation not
 triggered
In-Reply-To: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
References: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
Message-ID: <MN2PR18MB27338A15CA5AF6BC0621A07FD29C0@MN2PR18MB2733.namprd18.prod.outlook.com>

Hi Nick,

Your changes look good to me.

Once again some cleanup suggestions to pre-existing code:

Line 3420:
     "// Handle existing monitor" -> "// Check for existing monitor"

Line 3471:    "// Handle existing monitor."
   Move to line 3473.

Lines 3437, 3445, 3468, 3485, 3493:
  Add comment to lines: "// sets result"

This set contains actual code changes, but should be clearer code:
Lines 3483, 3485:
   "disp_hdr" -> "zr"

Line 3493:
    cmp(disp_hdr, rscratch1) -> cmp(rscratch1, zr)
 Note that having the "sets result" comment here is important, because it's so tempting to merge CMP+BNE -> CBNZ. But that doesn't set the condition flags.

Line 3480: delete mov.

Thanks!
 - Derek

> -----Original Message-----
> From: aarch64-port-dev <aarch64-port-dev-bounces at openjdk.java.net> On
> Behalf Of Nick Gasson (Arm Technology China)
> Sent: Friday, January 18, 2019 3:40 AM
> To: hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-
> dev at openjdk.java.net>
> Cc: nd <nd at arm.com>; aarch64-port-dev at openjdk.java.net
> Subject: [EXT] [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
> locking optimisation not triggered
> 
> External Email
> 
> ----------------------------------------------------------------------
> Hi,
> 
> While I was cleaning up the patch for 8216350 I noticed an issue in the
> implementation of recursive locking in aarch64_enc_fast_lock:
> 
> Bug: https://bugs.openjdk.java.net/browse/JDK-8217368
> Webrev: http://cr.openjdk.java.net/~ngasson/8217368/webrev.0/
> 
> First we load the markOop of the object we want to lock and OR it with
> markOopDesc::unlocked_value (1). Then we do a CAS to exchange the
> address of the box on our thread's stack with the object's header word iff it's
> equal to the (markOop | 1) we just computed. If this fails, then we should
> check for a recursive lock by comparing
> 
>    (~(page size - 1) | 3) & (markOop - SP) == 0
> 
> Where "markOop" is the current object header word loaded by the failed
> CAS. This checks that the lock bits are zero (locked) and the stack address of
> the displaced header is within one page of the current SP.
> But on AArch64 we actually do this:
> 
>    (~(page size - 1) | 3) & ((old markOop | 1) - SP) == 0
> 
> Where "old markOop | 1" is the compare-to value used for the CAS. This is
> always false as the result has at least bit #0 set. This only affects C2, the
> C1_MacroAssembler version has the correct test.
> 
> The diff looks big but all it does is swap the usage of registers `tmp'
> and `disp_hdr' in the first section so the markOop loaded by the CAS ends up
> in disp_hdr and tmp holds the (markOop | 1) compare-to value.
> 
> Ran jtreg, plus jcstress with -XX:+UseLSE and -XX:-UseLSE. Also added
> another microbenchmark to
> micro/org/openjdk/bench/vm/lang/LockUnlock.java as I couldn't find an
> existing JMH case that triggered this.
> 
> Without patch:
> 
> Result
> "org.openjdk.bench.vm.lang.LockUnlock.testRecursiveSynchronizationNoBia
> s":
>   510.781 ?(99.9%) 1.196 ns/op [Average]
>   (min, avg, max) = (508.769, 510.781, 513.854), stdev = 1.597
>   CI (99.9%): [509.585, 511.977] (assumes normal distribution)
> 
> With patch:
> 
> Result
> "org.openjdk.bench.vm.lang.LockUnlock.testRecursiveSynchronizationNoBia
> s":
> 
>   197.038 ?(99.9%) 0.096 ns/op [Average]
>   (min, avg, max) = (196.886, 197.038, 197.296), stdev = 0.128
>   CI (99.9%): [196.942, 197.134] (assumes normal distribution)
> 
> Two other minor things:
> 
> * Does anyone know what the comment "// Load Compare Value application
> register." means? It's present in the PPC and S390 ports too.
> 
> * The x86 port #ifdef LP64 uses "7 - os::vm_page_size()" as the mask in the
> recursive lock test. I think the "7" here is markOopDesc::biased_lock_mask
> and is presumably there to prevent a silent mutual exclusion failure if a
> markOop with the bias locking bits set ends up the fast_lock path (although
> this should never happen).
> Should we change markOopDesc::lock_mask_in_place to
> markOopDesc::biased_lock_mask_in_place in the AArch64 port too?
> 
> Thanks,
> Nick

From aph at redhat.com  Fri Jan 18 18:15:37 2019
From: aph at redhat.com (Andrew Haley)
Date: Fri, 18 Jan 2019 18:15:37 +0000
Subject: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
 locking optimisation not triggered
In-Reply-To: <MN2PR18MB2733BD918F17E4626D00D1ABD29C0@MN2PR18MB2733.namprd18.prod.outlook.com>
References: <MN2PR18MB2733BD918F17E4626D00D1ABD29C0@MN2PR18MB2733.namprd18.prod.outlook.com>
Message-ID: <9e7eee2c-7b8d-49b1-d1e1-897346e9b1b8@redhat.com>

On 1/18/19 5:29 PM, Derek White wrote:

> The original code used cmpxchgptr, but it introduced too many
> unnecessary branches. So you

Me, I think.

> or Ed changed it to this code, with a (7-8 line) comment "Formerly:
> __ cmpxchgptr" etc, etc. I thought that comment didn't add much for
> all that bulk so I asked Nick to rip the comment out!
> 
> The function now fits on one screen (of sufficient size) though.
> 
> Getting cmpxchgptr to work without the extra branches would be a
> better solution if someone has any thoughts in that direction.

There aren't any extra branches if you use MacroAssembler::cmpxchg.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From gromero at linux.vnet.ibm.com  Fri Jan 18 18:16:05 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Fri, 18 Jan 2019 16:16:05 -0200
Subject: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
 isDigit/isLowerCase/isUpperCase/isWhitespace
In-Reply-To: <AM6PR02MB4788BC20E6C7E3423BFF41069A9C0@AM6PR02MB4788.eurprd02.prod.outlook.com>
References: <2d4d1747-a83d-5f65-eea3-d982969ae4fd@linux.vnet.ibm.com>
 <c8e419d0-e326-5ca4-be4a-303ea94c9d31@oracle.com>
 <AM6PR02MB4788BC20E6C7E3423BFF41069A9C0@AM6PR02MB4788.eurprd02.prod.outlook.com>
Message-ID: <fd94aea7-6600-75bd-e164-577cf890da67@linux.vnet.ibm.com>

Hi Roger and Martin,

Thanks a lot for the quick Reviews.

I'll wait the Review for 8215317 and then request the approval to push for both 8215317 and this change.

Goetz will kindly sponsor both then.

Thank you.

Best regards,
Gustavo

On 01/18/2019 02:07 PM, Doerr, Martin wrote:
> Hi Gustavo,
> 
> hotspot part looks good, too.
> 
> Best regards,
> 
> Martin
> 
> *From:*Roger Riggs <Roger.Riggs at oracle.com>
> *Sent:* Freitag, 18. Januar 2019 16:35
> *To:* Gustavo Romero <gromero at linux.vnet.ibm.com>; hotspot-compiler-dev at openjdk.java.net; Lindenmaier, Goetz <goetz.lindenmaier at sap.com>; Doerr, Martin <martin.doerr at sap.com>; vladimir.kozlov at oracle.com
> *Cc:* Michihiro Horie <HORIE at jp.ibm.com>
> *Subject:* Re: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for isDigit/isLowerCase/isUpperCase/isWhitespace
> 
> Looks good for the jdk files.
> 
> Regards, Roger
> 
> On 01/18/2019 10:07 AM, Gustavo Romero wrote:
> 
>     Hi,
> 
>     Could the following backport to 11u be reviewed, please?
> 
>     Bug???? : https://bugs.openjdk.java.net/browse/JDK-8213754
>     Change? : http://hg.openjdk.java.net/jdk/jdk/rev/7384e00d5860
>     Backport: http://cr.openjdk.java.net/~gromero/8213754_jdk11u/v1/ <http://cr.openjdk.java.net/%7Egromero/8213754_jdk11u/v1/>
> 
>     It adds 4 intrinsics that use instructions introduced by POWER9 in order to
>     speed up methods isDigit, isLowerCase, isUpperCase, and isWhitespace.
> 
>     The change is mostly PPC64-only but it does touch shared code, for
>     instance, in order to adapt the methods in question to be properly
>     intrinsified. It also needs an additional change [0], since one Graal
>     test has to be adapted (a separated RFR to backport [0] was sent to [1]).
> 
>     The change applies almost cleanly: only a small tweak is necessary because
>     the hunk for ppc.ad file relies on some absent text in the 11u code around
>     the change to be applied. That absent text is related to the Superword
>     feature (a non-related feature), which is not backported yet to 11u.
> 
>     This backport was tested on POWER8 and POWER9 and no regressions were
>     observed.
> 
>     This backport was also tested on x86_64 with
>     ./test/hotspot/jtreg/compiler/{c1,c2,intrinsics} plus
>     ./test/hotspot/jtreg/compiler/graalunit (with Graal compiler enabled) with
>     change 8215317 [0] applied and no regressions were observed too.
> 
>     Thank you.
> 
>     Best regards,
>     Gustavo
> 
>     [0] http://cr.openjdk.java.net/~gromero/8215317_jdk11u/v1/ <http://cr.openjdk.java.net/%7Egromero/8215317_jdk11u/v1/>
>     [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2019-January/032266.html
> 


From vladimir.kozlov at oracle.com  Fri Jan 18 20:26:19 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Fri, 18 Jan 2019 12:26:19 -0800
Subject: [11u backport] RFR(S): 8215317: [GRAAL] unit test
 CheckGraalIntrinsics failed after 8213754
In-Reply-To: <2ed804b4-85d5-feb7-edab-85d6dee66c74@linux.vnet.ibm.com>
References: <2ed804b4-85d5-feb7-edab-85d6dee66c74@linux.vnet.ibm.com>
Message-ID: <1cb884f8-34c0-638b-768a-fe5eebd89c49@oracle.com>

Looks good.

Thanks,
Vladimir

On 1/18/19 6:57 AM, Gustavo Romero wrote:
> Hi,
> 
> Could the following backport to 11u be reviewed, please?
> 
> Bug???? : https://bugs.openjdk.java.net/browse/JDK-8215317
> Change? : http://hg.openjdk.java.net/jdk/jdk/rev/108a161aed93
> Backport: http://cr.openjdk.java.net/~gromero/8215317_jdk11u/v1/
> 
> It adds 4 intrinsics to the Graal test CheckGraalIntrinsics.java list so
> JDK 11u becomes aware of them. Otherwise that test will break once change
> 8213754 [0] lands 11u (which will effectively add the 4 intrinsics to
> PPC64/Hotspot and adapt the correlated methods to be intrinsified).
> 
> The backport changed the inclusion of the intrinsics for JDK 11 or higher,
> instead for JDK 12 or higher (original patch).
> 
> This backport was tested on x86_64 with
> ./test/hotspot/jtreg/compiler/{c1,c2,intrinsics} plus
> ./test/hotspot/jtreg/compiler/graalunit (with Graal compiler enabled)
> and no regressions were observed too.
> 
> Thank you.
> 
> Best regards,
> Gustavo
> 
> [0] https://bugs.openjdk.java.net/browse/JDK-8213754
> 

From andrewluotechnologies at outlook.com  Fri Jan 18 22:16:51 2019
From: andrewluotechnologies at outlook.com (Andrew Luo)
Date: Fri, 18 Jan 2019 22:16:51 +0000
Subject: Enhancing jaotc to automatically find VS2017 linker
Message-ID: <MWHPR13MB1696DB273135D1BEFBE7DA3EA19C0@MWHPR13MB1696.namprd13.prod.outlook.com>

Hi,

Has there been any plans to enhance jaotc to support automatically finding the link.exe in VS2017?  If not, I am interested in contributing some work to support this.

I see that in Linker.java (src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/Linker.java) we find link.exe using the environment variables VS...COMNTOOLS, but since in VS2017 and forward, this is not defined, it seems another approach is necessary.  Microsoft suggests that you use vswhere (https://github.com/Microsoft/vswhere, BSD licensed, included with Visual Studio 2017 15.2 and forward) or their COM API to find the latest VS2017 toolset.

Anyways, if everyone agrees we should add VS2017 support, there are a few ways to do this (in order of simplest/easiest to most complex):


1.       Check that vswhere exists on the system, if it does, call vswhere (out of process - not sure this is acceptable...) and use that to find the VS2017 link.exe

2.       Ship vswhere with the JDK and call it out of process

3.       Statically link a copy of vswhere (BSD licensed - is this okay?) into our code and add a JNI stub to call it

4.       Call the COM API in a JNI function to get the latest version of VS2017

Personally I prefer (1), but if out-of-process isn't acceptable I'm fine with doing (4) or (3).

Let me know if you have any comments/feedback on this proposal.

Thanks,

-Andrew

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190118/bdf7d4ed/attachment-0001.html>

From doug.simon at oracle.com  Fri Jan 18 22:20:06 2019
From: doug.simon at oracle.com (Doug Simon)
Date: Fri, 18 Jan 2019 23:20:06 +0100
Subject: [12] RFR 8215375: [Graal] jck:vm/jvmti/Exception/excp001/excp00101
 fails in Graal as JIT mode and -Xcomp mode
Message-ID: <66BBADCE-3072-414F-AA08-3B19D5BC9B55@oracle.com>

Please review this fix that makes Graal compiled code post a JVMTI event when throwing an exception.
The code to post the event is only compiled in if the relevant JVMTI capabilities are enabled at compile time. The event posting code performs a dynamic check to see if the current thread is interested in exception events before posting an event.

Testing: hs-tier6-graal

https://bugs.openjdk.java.net/browse/JDK-8215375
http://cr.openjdk.java.net/~dnsimon/8215375

-Doug

From vladimir.kozlov at oracle.com  Fri Jan 18 22:54:11 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Fri, 18 Jan 2019 14:54:11 -0800
Subject: [12] RFR 8215375: [Graal]
 jck:vm/jvmti/Exception/excp001/excp00101 fails in Graal as JIT mode and
 -Xcomp mode
In-Reply-To: <66BBADCE-3072-414F-AA08-3B19D5BC9B55@oracle.com>
References: <66BBADCE-3072-414F-AA08-3B19D5BC9B55@oracle.com>
Message-ID: <e6b5add0-d185-8750-eba3-fba0fb27ee8a@oracle.com>

Seems fine.

Thanks,
Vladimir

On 1/18/19 2:20 PM, Doug Simon wrote:
> Please review this fix that makes Graal compiled code post a JVMTI event when throwing an exception.
> The code to post the event is only compiled in if the relevant JVMTI capabilities are enabled at compile time. The event posting code performs a dynamic check to see if the current thread is interested in exception events before posting an event.
> 
> Testing: hs-tier6-graal
> 
> https://bugs.openjdk.java.net/browse/JDK-8215375
> http://cr.openjdk.java.net/~dnsimon/8215375
> 
> -Doug
> 

From dean.long at oracle.com  Fri Jan 18 23:53:29 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Fri, 18 Jan 2019 15:53:29 -0800
Subject: 12 RFR(XXS) 8217394: Remove
 org.graalvm.compiler.debug.test.TimerKeyTest from problem list
Message-ID: <e80fde04-dfb5-d8ed-00d3-a5371f096f04@oracle.com>

https://bugs.openjdk.java.net/browse/JDK-8217394
http://cr.openjdk.java.net/~dlong/8217394/webrev/

This should have been included with JDK-8210777, but it was missed. Trivial?

dl

From dean.long at oracle.com  Sat Jan 19 00:10:38 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Fri, 18 Jan 2019 16:10:38 -0800
Subject: [12] RFR 8215375: [Graal]
 jck:vm/jvmti/Exception/excp001/excp00101 fails in Graal as JIT mode and
 -Xcomp mode
In-Reply-To: <e6b5add0-d185-8750-eba3-fba0fb27ee8a@oracle.com>
References: <66BBADCE-3072-414F-AA08-3B19D5BC9B55@oracle.com>
 <e6b5add0-d185-8750-eba3-fba0fb27ee8a@oracle.com>
Message-ID: <50cdbe1c-a966-e2b5-a3ba-4391838a26fb@oracle.com>

Looks good.

dl

On 1/18/19 2:54 PM, Vladimir Kozlov wrote:
> Seems fine.
>
> Thanks,
> Vladimir
>
> On 1/18/19 2:20 PM, Doug Simon wrote:
>> Please review this fix that makes Graal compiled code post a JVMTI 
>> event when throwing an exception.
>> The code to post the event is only compiled in if the relevant JVMTI 
>> capabilities are enabled at compile time. The event posting code 
>> performs a dynamic check to see if the current thread is interested 
>> in exception events before posting an event.
>>
>> Testing: hs-tier6-graal
>>
>> https://bugs.openjdk.java.net/browse/JDK-8215375
>> http://cr.openjdk.java.net/~dnsimon/8215375
>>
>> -Doug
>>


From vladimir.kozlov at oracle.com  Sat Jan 19 00:20:22 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Fri, 18 Jan 2019 16:20:22 -0800
Subject: 12 RFR(XXS) 8217394: Remove
 org.graalvm.compiler.debug.test.TimerKeyTest from problem list
In-Reply-To: <e80fde04-dfb5-d8ed-00d3-a5371f096f04@oracle.com>
References: <e80fde04-dfb5-d8ed-00d3-a5371f096f04@oracle.com>
Message-ID: <7f3c685a-7805-6eff-0ee1-20184e0b54bb@oracle.com>

Good. Trivial.

Vladimir

On 1/18/19 3:53 PM, dean.long at oracle.com wrote:
> https://bugs.openjdk.java.net/browse/JDK-8217394
> http://cr.openjdk.java.net/~dlong/8217394/webrev/
> 
> This should have been included with JDK-8210777, but it was missed. Trivial?
> 
> dl

From dean.long at oracle.com  Sat Jan 19 00:23:03 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Fri, 18 Jan 2019 16:23:03 -0800
Subject: 12 RFR(XXS) 8217394: Remove
 org.graalvm.compiler.debug.test.TimerKeyTest from problem list
In-Reply-To: <7f3c685a-7805-6eff-0ee1-20184e0b54bb@oracle.com>
References: <e80fde04-dfb5-d8ed-00d3-a5371f096f04@oracle.com>
 <7f3c685a-7805-6eff-0ee1-20184e0b54bb@oracle.com>
Message-ID: <ed2b8f61-36d7-1b58-2046-e7194e40e444@oracle.com>

Thanks Vladimir.

dl

On 1/18/19 4:20 PM, Vladimir Kozlov wrote:
> Good. Trivial.
>
> Vladimir
>
> On 1/18/19 3:53 PM, dean.long at oracle.com wrote:
>> https://bugs.openjdk.java.net/browse/JDK-8217394
>> http://cr.openjdk.java.net/~dlong/8217394/webrev/
>>
>> This should have been included with JDK-8210777, but it was missed. 
>> Trivial?
>>
>> dl


From xxinliu at amazon.com  Sat Jan 19 00:39:47 2019
From: xxinliu at amazon.com (Liu, Xin)
Date: Sat, 19 Jan 2019 00:39:47 +0000
Subject: Why does call_site_target keep changing for a Nashorn method?
In-Reply-To: <186d49e7-daa9-a0db-b0c6-1b9d4ff2adda@oracle.com>
References: <616C8E42-4B18-405B-B28A-C9F062EC9B6C@amazon.com>
 <186d49e7-daa9-a0db-b0c6-1b9d4ff2adda@oracle.com>
Message-ID: <837C4B07-9A3F-4459-A625-12F82C9E604F@amazon.com>

Hi, Vladimir, 

Thank you for the response. After reading your email and associated RFEs,  now I got the background story. 
I understand the design decision in hotspot. 

In my case, compiler thread crowds out the app thread because we run application in docker with 1 CPU. 
Is it good idea that we decay the invocation counts of the methods if they fail due to 'call_site_target value change?'

Thanks, 
--lx


?On 1/17/19, 2:36 PM, "Vladimir Ivanov" <vladimir.x.ivanov at oracle.com> wrote:

    C1/C2 optimistically inline through CallSite instances even if those are 
    mutable (MutableCallSite/VolatileCallSite). It requires a nmethod 
    dependency and once CallSite target changes, all dependent nmethods 
    should be invalidated. If such change happens during compilation, 
    nmethod installation fails.
    
    That's exactly what you observe: the dependency is recorded during 
    inlining, but failed verification during installation.
    
    Regarding the observed behavior, it is well-known [1] [2] and was a 
    deliberate choice. As JDK-7087838 [1] states:
    
    "The consensus among language runtime implementors is that they want 
    control over switch points (and thus call sites) and so it's their 
    responsibility to handle extensive invalidation of such."
    
    So, such pathological behavior is treated as a bug in user code (Nashorn 
    in this particular case).
    
    There's an RFE filed [3] to consider alternative options for unstable 
    calls.
    
    Best regards,
    Vladimir Ivanov
    
    [1] https://bugs.openjdk.java.net/browse/JDK-7087838
    [2] https://bugs.openjdk.java.net/browse/JDK-7177745
    [3] https://bugs.openjdk.java.net/browse/JDK-8147550
    
    On 16/01/2019 14:04, Liu, Xin wrote:
    > In one of our applications, C1/C2 keeps compiling a Javascript method 
    > generated by Nashorn but the code fails a dependency check right before 
    > installing in the code cache. This is with JDK tip.
    > 
    > It can?t pass ?Dependencies::check_call_site_target_value?.
    > 
    > [C2 Parsing]
    > 
    > <bc code='182' bci='1'/>
    > 
    > <dependency type='unique_concrete_method' ctxk='1131' x='1141'/>
    > 
    > <call method='1141' count='878838' prof_factor='0.024122' inline='1'/>
    > 
    > <inline_success reason='accessor'/>
    > 
    > <parse method='1141' uses='21249.000000' stamp='1112.538'>
    > 
    > <bc code='180' bci='1'/>
    > 
    > <unknown id='1556'/>
    > 
    > <unknown id='1866'/>
    > 
    > <dependency type='call_site_target_value' x0='1556' x='1866'/>
    > 
    > <parse_done nodes='41636' live='12226' memory='12130984' stamp='1112.538'/>
    > 
    > </parse>
    > 
    > [Validating compilation dependencies]
    > 
    > <dependency type='call_site_target_value' x0='1132' x='1143'/>
    > 
    > <dependency type='call_site_target_value' x0='1334' x='1337'/>
    > 
    > <dependency type='call_site_target_value' x0='1424' x='1425'/>
    > 
    > <dependency type='call_site_target_value' x0='1437' x='1438'/>
    > 
    > <dependency type='call_site_target_value' x0='1454' x='1455'/>
    > 
    > <dependency type='call_site_target_value' x0='1465' x='1466'/>
    > 
    > <dependency type='call_site_target_value' x0='1482' x='1483'/>
    > 
    > <dependency type='call_site_target_value' x0='1498' x='1499'/>
    > 
    > <dependency type='call_site_target_value' x0='1509' x='1510'/>
    > 
    > <dependency type='call_site_target_value' x0='1526' x='1576'/>
    > 
    > <dependency type='call_site_target_value' x0='1528' x='1667'/>
    > 
    > <dependency type='call_site_target_value' x0='1536' x='1692'/>
    > 
    > <dependency type='call_site_target_value' x0='1537' x='1707'/>
    > 
    > <dependency type='call_site_target_value' x0='1538' x='1730'/>
    > 
    > <dependency type='call_site_target_value' x0='1539' x='1746'/>
    > 
    > <dependency type='call_site_target_value' x0='1540' x='1787'/>
    > 
    > <dependency type='call_site_target_value' x0='1550' x='1804'/>
    > 
    > <dependency type='call_site_target_value' x0='1553' x='1820'/>
    > 
    > <dependency type='call_site_target_value' x0='1554' x='1836'/>
    > 
    > <dependency type='call_site_target_value' x0='1555' x='1849'/>
    > 
    > <dependency type='call_site_target_value' x0='1556' x='1866'/>
    > 
    > <dependency_failed type='call_site_target_value' x0='1556' x='1866' 
    > witness='jdk/nashorn/internal/runtime/linker/LinkerCallSite' 
    > stamp='1113.578'/>
    > 
    > It?s related to the GWT methodHandle.  The 2 mismatched methodhandles 
    > are very similar except for argL3, which is an int[2].
    > 
    > Even though arg0-2 are not identical objects, their contents are same.
    > 
    > (gdb)call java_lang_invoke_CallSite::target(call_site)->print()
    > 
    > java.lang.invoke.BoundMethodHandle$Species_LLLL
    > 
    > {0x00000000f586ca98}- 
    > klass:'java/lang/invoke/BoundMethodHandle$Species_LLLL'
    > 
    > - ---- fields(total size 6 words):
    > 
    > -'customizationCount''B'@12 0
    > 
    > - private final'type''Ljava/lang/invoke/MethodType;'@16 
    > a'java/lang/invoke/MethodType'{0x00000000e21e2878}=(Ljava/lang/Object;Ljdk/nashorn/internal/runtime/Undefined;Ljava/lang/Object;)Ljava/lang/Object;(e21e2878)
    > 
    > - final'form''Ljava/lang/invoke/LambdaForm;'@20 
    > a'java/lang/invoke/LambdaForm'{0x00000000e1e4a670}=>a'java/lang/invoke/MemberName'{0x00000000e1e4a938}={method}{0x00007fffa512cb68}'guard''(Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;)Ljava/lang/Object;'in'java/lang/invoke/LambdaForm$MH'(e1e4a670)
    > 
    > -'asTypeCache''Ljava/lang/invoke/MethodHandle;'@24 NULL(0)
    > 
    > - final'argL0''Ljava/lang/Object;'@28 
    > a'java/lang/invoke/BoundMethodHandle$Species_LL'{0x00000000f586c9e8}(f586c9e8)
    > 
    > - final'argL1''Ljava/lang/Object;'@32 
    > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f586ca28}(f586ca28)
    > 
    > - final'argL2''Ljava/lang/Object;'@36 
    > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f586ca60}(f586ca60)
    > 
    > - final'argL3''Ljava/lang/Object;'@40 [I{0x00000000f586ca10}(f586ca10)
    > 
    > (gdb)call method_handle->print()
    > 
    > java.lang.invoke.BoundMethodHandle$Species_LLLL
    > 
    > {0x00000000f6b18500}- 
    > klass:'java/lang/invoke/BoundMethodHandle$Species_LLLL'
    > 
    > - ---- fields(total size 6 words):
    > 
    > -'customizationCount''B'@12 0
    > 
    > - private final'type''Ljava/lang/invoke/MethodType;'@16 
    > a'java/lang/invoke/MethodType'{0x00000000e21e2878}=(Ljava/lang/Object;Ljdk/nashorn/internal/runtime/Undefined;Ljava/lang/Object;)Ljava/lang/Object;(e21e2878)
    > 
    > - final'form''Ljava/lang/invoke/LambdaForm;'@20 
    > a'java/lang/invoke/LambdaForm'{0x00000000e1e4a670}=>a'java/lang/invoke/MemberName'{0x00000000e1e4a938}={method}{0x00007fffa512cb68}'guard''(Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;)Ljava/lang/Object;'in'java/lang/invoke/LambdaForm$MH'(e1e4a670)
    > 
    > -'asTypeCache''Ljava/lang/invoke/MethodHandle;'@24 NULL(0)
    > 
    > - final'argL0''Ljava/lang/Object;'@28 
    > a'java/lang/invoke/BoundMethodHandle$Species_LL'{0x00000000f6b18450}(f6b18450)
    > 
    > - final'argL1''Ljava/lang/Object;'@32 
    > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f6b18490}(f6b18490)
    > 
    > - final'argL2''Ljava/lang/Object;'@36 
    > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f6b184c8}(f6b184c8)
    > 
    > - final'argL3''Ljava/lang/Object;'@40 [I{0x00000000f6b18478}(f6b18478)
    > 
    > My guess is argL3 is counters in Java.lang.invoke.MethodHandleImpl.
    > 
    > // Intrinsified by C2. Counters are used during parsing to calculate 
    > branch frequencies.
    > @LambdaForm.Hidden
    > @jdk.internal.HotSpotIntrinsicCandidate
    > static
    > boolean profileBoolean(boolean result, int[] counters) {
    > // Profile is int[2] where [0] and [1] correspond to false and true 
    > occurrences respectively.
    > int idx = result ? 1 : 0;
    >      try {
    >          counters[idx] = Math./addExact/(counters[idx], 1);
    > } catch (ArithmeticException e) {
    > // Avoid continuous overflow by halving the problematic count.
    > counters[idx] = counters[idx] / 2;
    > }
    > return result;
    > }
    > 
    > I am still struggling to understand the source code in 
    > java.lang.invoke.*.  Could anybody enlighten me why the target of the 
    > callsite changes every time here?  it is relative to this profiling thing?
    > 
    > In validation log, it has validated the dep ?dependency 
    > type='call_site_target_value' x0='1556' x='1866'? above. Why it can?t 
    > pass it after then? My guess is one MH object has been changed by 
    > another Java thread.
    > 
    > One interesting fact that compiler thread can?t pass 22^th dep.  My 
    > tuition is it goes over an unknown threshold.
    > 
    > The 2nd question is about ciEnv:: validate_compile_task_dependencies. 
    >   Why does failure of call_site_target_value_changed not count as a deopt?
    > 
    > The flag  _inc_decompile_count_on_failure =false stops MDO to mark this 
    > method ?not_compileable?.  C2 doesn?t set the flag, so C2 ends up 
    > compiling it over and over, which makes C2 a cpu hog. Here?s the code in 
    > validate_compile_task_dependencies
    > 
    >    bool counter_changed = system_dictionary_modification_counter_changed();
    > 
    >    Dependencies::DepType result = 
    > dependencies()->validate_dependencies(_task, counter_changed);
    > 
    >    if (result != Dependencies::end_marker) {
    > 
    >      if (result == Dependencies::call_site_target_value) {
    > 
    >        _inc_decompile_count_on_failure = false;
    > 
    >        record_failure("call site target change");
    > 
    > Maybe the right thing to do is to count this as a deopt and change the 
    > deopt limit computation to take into account the size of the method in 
    > nodes, just as done for abandoning compilation if the graph is too big.
    > 
    > Thanks,
    > 
    > --lx
    > 
    

From vladimir.x.ivanov at oracle.com  Sat Jan 19 01:05:44 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Fri, 18 Jan 2019 17:05:44 -0800
Subject: Why does call_site_target keep changing for a Nashorn method?
In-Reply-To: <837C4B07-9A3F-4459-A625-12F82C9E604F@amazon.com>
References: <616C8E42-4B18-405B-B28A-C9F062EC9B6C@amazon.com>
 <186d49e7-daa9-a0db-b0c6-1b9d4ff2adda@oracle.com>
 <837C4B07-9A3F-4459-A625-12F82C9E604F@amazon.com>
Message-ID: <30a97290-71c5-c445-cfaf-f8eda14fdfba@oracle.com>


> Thank you for the response. After reading your email and associated RFEs,  now I got the background story.
> I understand the design decision in hotspot.
> 
> In my case, compiler thread crowds out the app thread because we run application in docker with 1 CPU.
> Is it good idea that we decay the invocation counts of the methods if they fail due to 'call_site_target value change?'

Yes, sounds reasonable. I believe compilation bailed out due to 
invalidated call_site_target dependency should be treated as if it were 
a deoptimization with Action_reinterpret, but resetting invocation 
counts may be too much. So, decaying counters instead sounds reasonable.

Also, it's hard to tell what method to act on: problematic CallSite may 
be located somewhere deep in inline tree, but only root method is known.

Best regards,
Vladimir Ivanov

> ?On 1/17/19, 2:36 PM, "Vladimir Ivanov" <vladimir.x.ivanov at oracle.com> wrote:
> 
>      C1/C2 optimistically inline through CallSite instances even if those are
>      mutable (MutableCallSite/VolatileCallSite). It requires a nmethod
>      dependency and once CallSite target changes, all dependent nmethods
>      should be invalidated. If such change happens during compilation,
>      nmethod installation fails.
>      
>      That's exactly what you observe: the dependency is recorded during
>      inlining, but failed verification during installation.
>      
>      Regarding the observed behavior, it is well-known [1] [2] and was a
>      deliberate choice. As JDK-7087838 [1] states:
>      
>      "The consensus among language runtime implementors is that they want
>      control over switch points (and thus call sites) and so it's their
>      responsibility to handle extensive invalidation of such."
>      
>      So, such pathological behavior is treated as a bug in user code (Nashorn
>      in this particular case).
>      
>      There's an RFE filed [3] to consider alternative options for unstable
>      calls.
>      
>      Best regards,
>      Vladimir Ivanov
>      
>      [1] https://bugs.openjdk.java.net/browse/JDK-7087838
>      [2] https://bugs.openjdk.java.net/browse/JDK-7177745
>      [3] https://bugs.openjdk.java.net/browse/JDK-8147550
>      
>      On 16/01/2019 14:04, Liu, Xin wrote:
>      > In one of our applications, C1/C2 keeps compiling a Javascript method
>      > generated by Nashorn but the code fails a dependency check right before
>      > installing in the code cache. This is with JDK tip.
>      >
>      > It can?t pass ?Dependencies::check_call_site_target_value?.
>      >
>      > [C2 Parsing]
>      >
>      > <bc code='182' bci='1'/>
>      >
>      > <dependency type='unique_concrete_method' ctxk='1131' x='1141'/>
>      >
>      > <call method='1141' count='878838' prof_factor='0.024122' inline='1'/>
>      >
>      > <inline_success reason='accessor'/>
>      >
>      > <parse method='1141' uses='21249.000000' stamp='1112.538'>
>      >
>      > <bc code='180' bci='1'/>
>      >
>      > <unknown id='1556'/>
>      >
>      > <unknown id='1866'/>
>      >
>      > <dependency type='call_site_target_value' x0='1556' x='1866'/>
>      >
>      > <parse_done nodes='41636' live='12226' memory='12130984' stamp='1112.538'/>
>      >
>      > </parse>
>      >
>      > [Validating compilation dependencies]
>      >
>      > <dependency type='call_site_target_value' x0='1132' x='1143'/>
>      >
>      > <dependency type='call_site_target_value' x0='1334' x='1337'/>
>      >
>      > <dependency type='call_site_target_value' x0='1424' x='1425'/>
>      >
>      > <dependency type='call_site_target_value' x0='1437' x='1438'/>
>      >
>      > <dependency type='call_site_target_value' x0='1454' x='1455'/>
>      >
>      > <dependency type='call_site_target_value' x0='1465' x='1466'/>
>      >
>      > <dependency type='call_site_target_value' x0='1482' x='1483'/>
>      >
>      > <dependency type='call_site_target_value' x0='1498' x='1499'/>
>      >
>      > <dependency type='call_site_target_value' x0='1509' x='1510'/>
>      >
>      > <dependency type='call_site_target_value' x0='1526' x='1576'/>
>      >
>      > <dependency type='call_site_target_value' x0='1528' x='1667'/>
>      >
>      > <dependency type='call_site_target_value' x0='1536' x='1692'/>
>      >
>      > <dependency type='call_site_target_value' x0='1537' x='1707'/>
>      >
>      > <dependency type='call_site_target_value' x0='1538' x='1730'/>
>      >
>      > <dependency type='call_site_target_value' x0='1539' x='1746'/>
>      >
>      > <dependency type='call_site_target_value' x0='1540' x='1787'/>
>      >
>      > <dependency type='call_site_target_value' x0='1550' x='1804'/>
>      >
>      > <dependency type='call_site_target_value' x0='1553' x='1820'/>
>      >
>      > <dependency type='call_site_target_value' x0='1554' x='1836'/>
>      >
>      > <dependency type='call_site_target_value' x0='1555' x='1849'/>
>      >
>      > <dependency type='call_site_target_value' x0='1556' x='1866'/>
>      >
>      > <dependency_failed type='call_site_target_value' x0='1556' x='1866'
>      > witness='jdk/nashorn/internal/runtime/linker/LinkerCallSite'
>      > stamp='1113.578'/>
>      >
>      > It?s related to the GWT methodHandle.  The 2 mismatched methodhandles
>      > are very similar except for argL3, which is an int[2].
>      >
>      > Even though arg0-2 are not identical objects, their contents are same.
>      >
>      > (gdb)call java_lang_invoke_CallSite::target(call_site)->print()
>      >
>      > java.lang.invoke.BoundMethodHandle$Species_LLLL
>      >
>      > {0x00000000f586ca98}-
>      > klass:'java/lang/invoke/BoundMethodHandle$Species_LLLL'
>      >
>      > - ---- fields(total size 6 words):
>      >
>      > -'customizationCount''B'@12 0
>      >
>      > - private final'type''Ljava/lang/invoke/MethodType;'@16
>      > a'java/lang/invoke/MethodType'{0x00000000e21e2878}=(Ljava/lang/Object;Ljdk/nashorn/internal/runtime/Undefined;Ljava/lang/Object;)Ljava/lang/Object;(e21e2878)
>      >
>      > - final'form''Ljava/lang/invoke/LambdaForm;'@20
>      > a'java/lang/invoke/LambdaForm'{0x00000000e1e4a670}=>a'java/lang/invoke/MemberName'{0x00000000e1e4a938}={method}{0x00007fffa512cb68}'guard''(Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;)Ljava/lang/Object;'in'java/lang/invoke/LambdaForm$MH'(e1e4a670)
>      >
>      > -'asTypeCache''Ljava/lang/invoke/MethodHandle;'@24 NULL(0)
>      >
>      > - final'argL0''Ljava/lang/Object;'@28
>      > a'java/lang/invoke/BoundMethodHandle$Species_LL'{0x00000000f586c9e8}(f586c9e8)
>      >
>      > - final'argL1''Ljava/lang/Object;'@32
>      > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f586ca28}(f586ca28)
>      >
>      > - final'argL2''Ljava/lang/Object;'@36
>      > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f586ca60}(f586ca60)
>      >
>      > - final'argL3''Ljava/lang/Object;'@40 [I{0x00000000f586ca10}(f586ca10)
>      >
>      > (gdb)call method_handle->print()
>      >
>      > java.lang.invoke.BoundMethodHandle$Species_LLLL
>      >
>      > {0x00000000f6b18500}-
>      > klass:'java/lang/invoke/BoundMethodHandle$Species_LLLL'
>      >
>      > - ---- fields(total size 6 words):
>      >
>      > -'customizationCount''B'@12 0
>      >
>      > - private final'type''Ljava/lang/invoke/MethodType;'@16
>      > a'java/lang/invoke/MethodType'{0x00000000e21e2878}=(Ljava/lang/Object;Ljdk/nashorn/internal/runtime/Undefined;Ljava/lang/Object;)Ljava/lang/Object;(e21e2878)
>      >
>      > - final'form''Ljava/lang/invoke/LambdaForm;'@20
>      > a'java/lang/invoke/LambdaForm'{0x00000000e1e4a670}=>a'java/lang/invoke/MemberName'{0x00000000e1e4a938}={method}{0x00007fffa512cb68}'guard''(Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;)Ljava/lang/Object;'in'java/lang/invoke/LambdaForm$MH'(e1e4a670)
>      >
>      > -'asTypeCache''Ljava/lang/invoke/MethodHandle;'@24 NULL(0)
>      >
>      > - final'argL0''Ljava/lang/Object;'@28
>      > a'java/lang/invoke/BoundMethodHandle$Species_LL'{0x00000000f6b18450}(f6b18450)
>      >
>      > - final'argL1''Ljava/lang/Object;'@32
>      > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f6b18490}(f6b18490)
>      >
>      > - final'argL2''Ljava/lang/Object;'@36
>      > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f6b184c8}(f6b184c8)
>      >
>      > - final'argL3''Ljava/lang/Object;'@40 [I{0x00000000f6b18478}(f6b18478)
>      >
>      > My guess is argL3 is counters in Java.lang.invoke.MethodHandleImpl.
>      >
>      > // Intrinsified by C2. Counters are used during parsing to calculate
>      > branch frequencies.
>      > @LambdaForm.Hidden
>      > @jdk.internal.HotSpotIntrinsicCandidate
>      > static
>      > boolean profileBoolean(boolean result, int[] counters) {
>      > // Profile is int[2] where [0] and [1] correspond to false and true
>      > occurrences respectively.
>      > int idx = result ? 1 : 0;
>      >      try {
>      >          counters[idx] = Math./addExact/(counters[idx], 1);
>      > } catch (ArithmeticException e) {
>      > // Avoid continuous overflow by halving the problematic count.
>      > counters[idx] = counters[idx] / 2;
>      > }
>      > return result;
>      > }
>      >
>      > I am still struggling to understand the source code in
>      > java.lang.invoke.*.  Could anybody enlighten me why the target of the
>      > callsite changes every time here?  it is relative to this profiling thing?
>      >
>      > In validation log, it has validated the dep ?dependency
>      > type='call_site_target_value' x0='1556' x='1866'? above. Why it can?t
>      > pass it after then? My guess is one MH object has been changed by
>      > another Java thread.
>      >
>      > One interesting fact that compiler thread can?t pass 22^th dep.  My
>      > tuition is it goes over an unknown threshold.
>      >
>      > The 2nd question is about ciEnv:: validate_compile_task_dependencies.
>      >   Why does failure of call_site_target_value_changed not count as a deopt?
>      >
>      > The flag  _inc_decompile_count_on_failure =false stops MDO to mark this
>      > method ?not_compileable?.  C2 doesn?t set the flag, so C2 ends up
>      > compiling it over and over, which makes C2 a cpu hog. Here?s the code in
>      > validate_compile_task_dependencies
>      >
>      >    bool counter_changed = system_dictionary_modification_counter_changed();
>      >
>      >    Dependencies::DepType result =
>      > dependencies()->validate_dependencies(_task, counter_changed);
>      >
>      >    if (result != Dependencies::end_marker) {
>      >
>      >      if (result == Dependencies::call_site_target_value) {
>      >
>      >        _inc_decompile_count_on_failure = false;
>      >
>      >        record_failure("call site target change");
>      >
>      > Maybe the right thing to do is to count this as a deopt and change the
>      > deopt limit computation to take into account the size of the method in
>      > nodes, just as done for abandoning compilation if the graph is too big.
>      >
>      > Thanks,
>      >
>      > --lx
>      >
>      
> 

From bsrbnd at gmail.com  Sat Jan 19 13:42:24 2019
From: bsrbnd at gmail.com (B. Blaser)
Date: Sat, 19 Jan 2019 14:42:24 +0100
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <2f209ec9-e7f9-8da3-64a2-20ac909b4931@redhat.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
 <ba49e1b8-b7c8-b804-f973-6e88cb2bcd8a@redhat.com>
 <CAEgw74DXHF4cmVzOkHtvGuO937QqaypRFNvuB5sfDw3h8HOUPQ@mail.gmail.com>
 <e5697cd2-49d6-724e-1b63-41de0e771965@redhat.com>
 <b16d13b4-40f4-1503-765a-82799acf1527@redhat.com>
 <a4e64632-1187-1b3a-8c88-ac4b039acc71@redhat.com>
 <81dc5407-4985-883c-83fc-2e1fa2b77e66@redhat.com>
 <CAEgw74ANZ2CKpd=ivKRn5mE61EG_YqpfBbPoMpV=e=3G_ABYpw@mail.gmail.com>
 <c64262ae-8d2f-f773-3688-f1c40ad23f10@redhat.com>
 <CAEgw74DR0mw1avCmyG+k=q=X9gGCsJYSQtp+Tki4nUfcsMKF0Q@mail.gmail.com>
 <3dd85d2c-f4d8-e360-21a2-68254b3c5e2b@redhat.com>
 <2f209ec9-e7f9-8da3-64a2-20ac909b4931@redhat.com>
Message-ID: <CAEgw74C0H1-tj8+yr-oBKvOa4DVw0QEaMoTZF5fw-EZ-E4tf=A@mail.gmail.com>

On Fri, 18 Jan 2019 at 14:37, Roman Kennke <rkennke at redhat.com> wrote:
>
> > On 1/17/19 7:51 PM, B. Blaser wrote:
> >> Here it is on intel xeon with 5*10e9 iterations:
> >> * mov+cmov = 10.94s
> >> * cmov = 10.15s
> >>
> >> Thoughts?
> >
> > It looks like there's not much of a performance difference, but it might
> > help by freeing a register. OTOH, we'd still need to be sure we weren't
> > introducing a regression. We'd have to make sure that implicit null checks
> > work.
>
> I'm pretty sure that null-checks work, in general. I used the cmov
> instructions in an experiment that I did with Shenandoah barriers of
> which I'm pretty sure would have blown up badly if it wouldn't. One
> thing I'm not sure of is: does cmov generate a SIGSEGV on a bad address,
> even if the condition is not true? I doubt it, because then we couldn't
> use this for other types (long, int, etc).
>
> I'm more worried about the bottom-type issue that is mentioned in the
> comment and by Andrew Dinn, and it would be very helpful if anybody
> knows about it and could clarify. Failing that we could dig deeper
> and/or do extensive testing?

I'm definitely not an expert in this area but does ADLC treat this
really differently from a single LoadP / mov?

http://hg.openjdk.java.net/jdk/jdk/file/683a112e0e1e/src/hotspot/cpu/x86/x86_64.ad#l5349

Bernard

From kim.barrett at oracle.com  Sun Jan 20 00:37:43 2019
From: kim.barrett at oracle.com (Kim Barrett)
Date: Sat, 19 Jan 2019 19:37:43 -0500
Subject: RFR: 8217387: Remove dead develop flag CIFireOOMAt
In-Reply-To: <d55718df-cec5-ecc4-4830-b2aff9e16895@oracle.com>
References: <d55718df-cec5-ecc4-4830-b2aff9e16895@oracle.com>
Message-ID: <F69A1CA8-370A-40EA-A34A-FD59BB21D28B@oracle.com>

> On Jan 18, 2019, at 11:03 AM, Claes Redestad <claes.redestad at oracle.com> wrote:
> 
> Hi,
> 
> the develop flag CIFireOOMAt is effectively dead and should be removed.
> 
> Webrev: http://cr.openjdk.java.net/~redestad/8217387/open.00/
> Bug:    https://bugs.openjdk.java.net/browse/JDK-8217387
> 
> Thanks!
> 
> /Claes

Looks good.


From claes.redestad at oracle.com  Sun Jan 20 00:41:58 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Sun, 20 Jan 2019 01:41:58 +0100
Subject: RFR: 8217387: Remove dead develop flag CIFireOOMAt
In-Reply-To: <F69A1CA8-370A-40EA-A34A-FD59BB21D28B@oracle.com>
References: <d55718df-cec5-ecc4-4830-b2aff9e16895@oracle.com>
 <F69A1CA8-370A-40EA-A34A-FD59BB21D28B@oracle.com>
Message-ID: <ed3d1fc6-9de9-95e7-f8a9-5f9aeb5e7118@oracle.com>

On 2019-01-20 01:37, Kim Barrett wrote:
> Looks good.

Thanks, Kim!

/Claes

From doug.simon at oracle.com  Sun Jan 20 13:54:57 2019
From: doug.simon at oracle.com (Doug Simon)
Date: Sun, 20 Jan 2019 14:54:57 +0100
Subject: [12] RFR 8215375: [Graal]
 jck:vm/jvmti/Exception/excp001/excp00101 fails in Graal as JIT mode and
 -Xcomp mode
In-Reply-To: <50cdbe1c-a966-e2b5-a3ba-4391838a26fb@oracle.com>
References: <66BBADCE-3072-414F-AA08-3B19D5BC9B55@oracle.com>
 <e6b5add0-d185-8750-eba3-fba0fb27ee8a@oracle.com>
 <50cdbe1c-a966-e2b5-a3ba-4391838a26fb@oracle.com>
Message-ID: <27D7FC5E-2D18-43B9-B68C-5ED92860E83B@oracle.com>

Thanks Dean and Vladimir for the review.

> On 19 Jan 2019, at 01:10, dean.long at oracle.com wrote:
> 
> Looks good.
> 
> dl
> 
> On 1/18/19 2:54 PM, Vladimir Kozlov wrote:
>> Seems fine.
>> 
>> Thanks,
>> Vladimir
>> 
>> On 1/18/19 2:20 PM, Doug Simon wrote:
>>> Please review this fix that makes Graal compiled code post a JVMTI event when throwing an exception.
>>> The code to post the event is only compiled in if the relevant JVMTI capabilities are enabled at compile time. The event posting code performs a dynamic check to see if the current thread is interested in exception events before posting an event.
>>> 
>>> Testing: hs-tier6-graal
>>> 
>>> https://bugs.openjdk.java.net/browse/JDK-8215375
>>> http://cr.openjdk.java.net/~dnsimon/8215375
>>> 
>>> -Doug
>>> 
> 


From Nick.Gasson at arm.com  Mon Jan 21 06:01:01 2019
From: Nick.Gasson at arm.com (Nick Gasson (Arm Technology China))
Date: Mon, 21 Jan 2019 06:01:01 +0000
Subject: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
 locking optimisation not triggered
In-Reply-To: <a3fb9ae8-42f4-826c-ad22-32be5e9de24d@redhat.com>
References: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
 <f1d6de8b-a308-a119-57a5-c62fbacb4e88@redhat.com>
 <4a09e8b7-9990-aa66-0afb-bf4e41cab831@arm.com>
 <a3fb9ae8-42f4-826c-ad22-32be5e9de24d@redhat.com>
Message-ID: <79118967-c5b6-ca5c-7c6b-4adb80a4ed60@arm.com>

Hi Andrew,

On 18/01/2019 22:56, Andrew Haley wrote:
>>> The patch looks good. However, I don't understand why we aren't using
>>> MacroAssembler::cmpxchgptr here. It looks like we should be, and you'd
>>> end up with a less complex result.
>>
>> It's not exactly the same though: MacroAssembler::cmpxchgptr adds a "dmb
>> ish" to the failure path which I don't think is required here.
> 
> Oh, sorry. I should have said MacroAssembler::cmpxchg, with a
> br.eq(cont) afterward.
> 

OK I'll change all three places in aarch64_enc_fast_lock/unlock that do 
a compare-exchange to use MacroAssembler::cmpxchg.

Thanks,
Nick

From tobias.hartmann at oracle.com  Mon Jan 21 07:16:30 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 21 Jan 2019 08:16:30 +0100
Subject: RFR (trivial): 8217388: Remove develop flag
 ProfilerPCTickThreshold
In-Reply-To: <d03688ca-ac3f-23ea-c9d9-50080ac84b55@oracle.com>
References: <d03688ca-ac3f-23ea-c9d9-50080ac84b55@oracle.com>
Message-ID: <4cea098d-687f-bcb1-9eed-632b45bd355c@oracle.com>

Hi Claes,

looks good to me.

Best regards,
Tobias

On 18.01.19 17:15, Claes Redestad wrote:
> Hi,
> 
> this flag does not spark joy.
> 
> Bug: https://bugs.openjdk.java.net/browse/JDK-8217388
> Patch:
> diff -r 0a48b128e3d4 src/hotspot/share/runtime/globals.hpp
> --- a/src/hotspot/share/runtime/globals.hpp??? Fri Jan 18 16:49:35 2019 +0100
> +++ b/src/hotspot/share/runtime/globals.hpp??? Fri Jan 18 17:07:38 2019 +0100
> @@ -1670,9 +1670,6 @@
> ?? develop(intx, DontYieldALotInterval,??? 10, ???? \
> ?????????? "Interval between which yields will be dropped (milliseconds)")?? \
> 
> ???? \
> -? develop(intx, ProfilerPCTickThreshold,??? 15, ??? \
> -????????? "Number of ticks in a PC buckets to be a hotspot") ??? \
> - ??? \
> ?? notproduct(intx, DeoptimizeALotInterval,???? 5, ???? \
> ?????????? "Number of exits until DeoptimizeALot kicks in") ???? \
> 
> ???? \
> 
> /Claes

From tobias.hartmann at oracle.com  Mon Jan 21 08:21:24 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 21 Jan 2019 09:21:24 +0100
Subject: [12] RFR(S): 8217230: assert(t == t_no_spec) failure in
 NodeHash::check_no_speculative_types()
Message-ID: <a360446b-c921-dd1b-4576-6c8059637af4@oracle.com>

Hi,

please review the following patch:
https://bugs.openjdk.java.net/browse/JDK-8217230
http://cr.openjdk.java.net/~thartmann/8217230/webrev.00/

A SafePointNode becomes dead when being cut off from root in Compile::remove_root_to_sfpts_edges()
but is not processed by IGVN and therefore remains in the graph. Since it is not reachable by root
anymore, it is not processed by Compile::remove_speculative_types and we hit the assert.

The problem was introduced by the fix for JDK-8214862 [1] in JDK 12 b27.

Thanks,
Tobias

[1] https://bugs.openjdk.java.net/browse/JDK-8214862

From aph at redhat.com  Mon Jan 21 09:10:31 2019
From: aph at redhat.com (Andrew Haley)
Date: Mon, 21 Jan 2019 09:10:31 +0000
Subject: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
 locking optimisation not triggered
In-Reply-To: <79118967-c5b6-ca5c-7c6b-4adb80a4ed60@arm.com>
References: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
 <f1d6de8b-a308-a119-57a5-c62fbacb4e88@redhat.com>
 <4a09e8b7-9990-aa66-0afb-bf4e41cab831@arm.com>
 <a3fb9ae8-42f4-826c-ad22-32be5e9de24d@redhat.com>
 <79118967-c5b6-ca5c-7c6b-4adb80a4ed60@arm.com>
Message-ID: <f115fa0c-e3a5-549c-4ec4-4fbfc2d47f6a@redhat.com>

On 1/21/19 6:01 AM, Nick Gasson (Arm Technology China) wrote:
> OK I'll change all three places in aarch64_enc_fast_lock/unlock that do 
> a compare-exchange to use MacroAssembler::cmpxchg.

If you wish: be aware that if you change anything other than this place there'll
be a lot more testing to do, and review will take longer.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From Nick.Gasson at arm.com  Mon Jan 21 09:27:47 2019
From: Nick.Gasson at arm.com (Nick Gasson (Arm Technology China))
Date: Mon, 21 Jan 2019 09:27:47 +0000
Subject: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
 locking optimisation not triggered
In-Reply-To: <f115fa0c-e3a5-549c-4ec4-4fbfc2d47f6a@redhat.com>
References: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
 <f1d6de8b-a308-a119-57a5-c62fbacb4e88@redhat.com>
 <4a09e8b7-9990-aa66-0afb-bf4e41cab831@arm.com>
 <a3fb9ae8-42f4-826c-ad22-32be5e9de24d@redhat.com>
 <79118967-c5b6-ca5c-7c6b-4adb80a4ed60@arm.com>
 <f115fa0c-e3a5-549c-4ec4-4fbfc2d47f6a@redhat.com>
Message-ID: <a20904db-429c-0739-68fa-bc24f2b13972@arm.com>

Hi Andrew,

On 21/01/2019 17:10, Andrew Haley wrote:
> On 1/21/19 6:01 AM, Nick Gasson (Arm Technology China) wrote:
>> OK I'll change all three places in aarch64_enc_fast_lock/unlock that do
>> a compare-exchange to use MacroAssembler::cmpxchg.
> 
> If you wish: be aware that if you change anything other than this place there'll
> be a lot more testing to do, and review will take longer.
> 

I think it will be confusing for anyone looking at these functions in 
the future to have one call to cmpxhg and then two copies of essentially 
the same code inlined a few lines afterwards. IMO we should either 
change all three for consistency, or stick with the original minimal 
patch (+ Derek's cleanup suggestions) which should be easier to review.

Thanks,
Nick

From tobias.hartmann at oracle.com  Mon Jan 21 09:47:22 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 21 Jan 2019 10:47:22 +0100
Subject: [13] RFR(S): 8217291: Failure of ::realloc() should be handled
 correctly in adlc/forms.cpp
Message-ID: <984d33e8-1aab-6fd5-9f45-64b4b08421f2@oracle.com>

Hi,

please review the following patch:
https://bugs.openjdk.java.net/browse/JDK-8217291
http://cr.openjdk.java.net/~thartmann/8217291/webrev.00/

Similar to the fix for JDK-8212779 [1], I've introduced a wrapper method for re-allocation that
handles failures by printing a message and exiting.

Thanks,
Tobias

[1] http://hg.openjdk.java.net/jdk/jdk/rev/a3aa8d5380d9

From Pengfei.Li at arm.com  Mon Jan 21 10:53:47 2019
From: Pengfei.Li at arm.com (Pengfei Li (Arm Technology China))
Date: Mon, 21 Jan 2019 10:53:47 +0000
Subject: RFR(S): 8216259: AArch64: Vectorize Adler32 intrinsics
Message-ID: <DB7PR08MB3115B20F823E5EAF0BA9C327969F0@DB7PR08MB3115.eurprd08.prod.outlook.com>

Hi Reviewers,

Webrev: http://cr.openjdk.java.net/~pli/rfr/8216259/webrev.00/
JBS: https://bugs.openjdk.java.net/browse/JDK-8216259

This is a vectorization optimization of AArch64 intrinsic code of Adler-32 checksum. An Adler-32 checksum is obtained by calculating two 16-bit checksums s1 and s2, and then concatenating their bits into a 32-bit integer. Details of the algorithm could be found on Wikipedia at https://en.wikipedia.org/wiki/Adler-32 .

In previous Adler-32 intrinsic code written by Edward Nevill, we accumulate the lower and upper halves of the checksum value, s1 and s2, for every 16 bytes in the nmax_loop and by16_loop. In this patch, these accumulation operations are vectorized with NEON instructions in these 2 loops.

I tested the correctness of my patch by comparing the checksum results of 5000 byte arrays of 1MB size. Test code and script can be found at [1].

I also tested the performance with and without my patch by a JMH case [2]. The JMH result shows that the performance gets ~2.5x optimized by this.

[1] http://cr.openjdk.java.net/~pli/rfr/8216259/Adler32Test.java
[2] http://cr.openjdk.java.net/~pli/rfr/8216259/TestAdler32.java

--
Thanks,
Pengfei


From goetz.lindenmaier at sap.com  Mon Jan 21 10:54:21 2019
From: goetz.lindenmaier at sap.com (Lindenmaier, Goetz)
Date: Mon, 21 Jan 2019 10:54:21 +0000
Subject: [11u backport] RFR(S): 8215317: [GRAAL] unit test
 CheckGraalIntrinsics failed after 8213754
In-Reply-To: <2ed804b4-85d5-feb7-edab-85d6dee66c74@linux.vnet.ibm.com>
References: <2ed804b4-85d5-feb7-edab-85d6dee66c74@linux.vnet.ibm.com>
Message-ID: <0094c5f18c034632bba0123a2bf6cf02@sap.com>

Hi,

looks good to me.

Best regards,
  Goetz.

> -----Original Message-----
> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> Sent: Freitag, 18. Januar 2019 15:57
> To: hotspot-compiler-dev at openjdk.java.net; Lindenmaier, Goetz
> <goetz.lindenmaier at sap.com>; vladimir.kozlov at oracle.com
> Subject: [11u backport] RFR(S): 8215317: [GRAAL] unit test
> CheckGraalIntrinsics failed after 8213754
> 
> Hi,
> 
> Could the following backport to 11u be reviewed, please?
> 
> Bug     : https://bugs.openjdk.java.net/browse/JDK-8215317
> Change  : http://hg.openjdk.java.net/jdk/jdk/rev/108a161aed93
> Backport: http://cr.openjdk.java.net/~gromero/8215317_jdk11u/v1/
> 
> It adds 4 intrinsics to the Graal test CheckGraalIntrinsics.java list so
> JDK 11u becomes aware of them. Otherwise that test will break once change
> 8213754 [0] lands 11u (which will effectively add the 4 intrinsics to
> PPC64/Hotspot and adapt the correlated methods to be intrinsified).
> 
> The backport changed the inclusion of the intrinsics for JDK 11 or higher,
> instead for JDK 12 or higher (original patch).
> 
> This backport was tested on x86_64 with
> ./test/hotspot/jtreg/compiler/{c1,c2,intrinsics} plus
> ./test/hotspot/jtreg/compiler/graalunit (with Graal compiler enabled)
> and no regressions were observed too.
> 
> Thank you.
> 
> Best regards,
> Gustavo
> 
> [0] https://bugs.openjdk.java.net/browse/JDK-8213754


From goetz.lindenmaier at sap.com  Mon Jan 21 11:10:20 2019
From: goetz.lindenmaier at sap.com (Lindenmaier, Goetz)
Date: Mon, 21 Jan 2019 11:10:20 +0000
Subject: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
 isDigit/isLowerCase/isUpperCase/isWhitespace
In-Reply-To: <2d4d1747-a83d-5f65-eea3-d982969ae4fd@linux.vnet.ibm.com>
References: <2d4d1747-a83d-5f65-eea3-d982969ae4fd@linux.vnet.ibm.com>
Message-ID: <2ac3e91da61b43dcb2d4e45325202264@sap.com>

Hi Gustavo, 

also this change looks good. 

Best regards,
  Goetz.

> -----Original Message-----
> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> Sent: Freitag, 18. Januar 2019 16:07
> To: hotspot-compiler-dev at openjdk.java.net; Lindenmaier, Goetz
> <goetz.lindenmaier at sap.com>; Doerr, Martin <martin.doerr at sap.com>;
> vladimir.kozlov at oracle.com; Roger Riggs <Roger.Riggs at oracle.com>
> Cc: Michihiro Horie <HORIE at jp.ibm.com>
> Subject: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
> isDigit/isLowerCase/isUpperCase/isWhitespace
> 
> Hi,
> 
> Could the following backport to 11u be reviewed, please?
> 
> Bug     : https://bugs.openjdk.java.net/browse/JDK-8213754
> Change  : http://hg.openjdk.java.net/jdk/jdk/rev/7384e00d5860
> Backport: http://cr.openjdk.java.net/~gromero/8213754_jdk11u/v1/
> 
> It adds 4 intrinsics that use instructions introduced by POWER9 in order to
> speed up methods isDigit, isLowerCase, isUpperCase, and isWhitespace.
> 
> The change is mostly PPC64-only but it does touch shared code, for
> instance, in order to adapt the methods in question to be properly
> intrinsified. It also needs an additional change [0], since one Graal
> test has to be adapted (a separated RFR to backport [0] was sent to [1]).
> 
> The change applies almost cleanly: only a small tweak is necessary because
> the hunk for ppc.ad file relies on some absent text in the 11u code around
> the change to be applied. That absent text is related to the Superword
> feature (a non-related feature), which is not backported yet to 11u.
> 
> This backport was tested on POWER8 and POWER9 and no regressions were
> observed.
> 
> This backport was also tested on x86_64 with
> ./test/hotspot/jtreg/compiler/{c1,c2,intrinsics} plus
> ./test/hotspot/jtreg/compiler/graalunit (with Graal compiler enabled) with
> change 8215317 [0] applied and no regressions were observed too.
> 
> Thank you.
> 
> Best regards,
> Gustavo
> 
> [0] http://cr.openjdk.java.net/~gromero/8215317_jdk11u/v1/
> [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2019-
> January/032266.html


From gromero at linux.vnet.ibm.com  Mon Jan 21 11:39:44 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Mon, 21 Jan 2019 09:39:44 -0200
Subject: [11u backport] RFR(S): 8215317: [GRAAL] unit test
 CheckGraalIntrinsics failed after 8213754
In-Reply-To: <1cb884f8-34c0-638b-768a-fe5eebd89c49@oracle.com>
References: <2ed804b4-85d5-feb7-edab-85d6dee66c74@linux.vnet.ibm.com>
 <1cb884f8-34c0-638b-768a-fe5eebd89c49@oracle.com>
Message-ID: <b2fe289a-eaeb-2840-6c33-833d1575ad7c@linux.vnet.ibm.com>

On 01/18/2019 06:26 PM, Vladimir Kozlov wrote:
> Looks good.

Thanks for the review, Vladimir!

Regards,
Gustavo

> Thanks,
> Vladimir
> 
> On 1/18/19 6:57 AM, Gustavo Romero wrote:
>> Hi,
>>
>> Could the following backport to 11u be reviewed, please?
>>
>> Bug???? : https://bugs.openjdk.java.net/browse/JDK-8215317
>> Change? : http://hg.openjdk.java.net/jdk/jdk/rev/108a161aed93
>> Backport: http://cr.openjdk.java.net/~gromero/8215317_jdk11u/v1/
>>
>> It adds 4 intrinsics to the Graal test CheckGraalIntrinsics.java list so
>> JDK 11u becomes aware of them. Otherwise that test will break once change
>> 8213754 [0] lands 11u (which will effectively add the 4 intrinsics to
>> PPC64/Hotspot and adapt the correlated methods to be intrinsified).
>>
>> The backport changed the inclusion of the intrinsics for JDK 11 or higher,
>> instead for JDK 12 or higher (original patch).
>>
>> This backport was tested on x86_64 with
>> ./test/hotspot/jtreg/compiler/{c1,c2,intrinsics} plus
>> ./test/hotspot/jtreg/compiler/graalunit (with Graal compiler enabled)
>> and no regressions were observed too.
>>
>> Thank you.
>>
>> Best regards,
>> Gustavo
>>
>> [0] https://bugs.openjdk.java.net/browse/JDK-8213754
>>
> 


From gromero at linux.vnet.ibm.com  Mon Jan 21 11:41:23 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Mon, 21 Jan 2019 09:41:23 -0200
Subject: [11u backport] RFR(S): 8215317: [GRAAL] unit test
 CheckGraalIntrinsics failed after 8213754
In-Reply-To: <0094c5f18c034632bba0123a2bf6cf02@sap.com>
References: <2ed804b4-85d5-feb7-edab-85d6dee66c74@linux.vnet.ibm.com>
 <0094c5f18c034632bba0123a2bf6cf02@sap.com>
Message-ID: <a8777fc9-6d74-78e1-f9b8-06f45313e0d8@linux.vnet.ibm.com>

On 01/21/2019 08:54 AM, Lindenmaier, Goetz wrote:
> looks good to me.

Thank for the review, Goetz!

Regards,
Gustavo
  
> Best regards,
>    Goetz.
> 
>> -----Original Message-----
>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>> Sent: Freitag, 18. Januar 2019 15:57
>> To: hotspot-compiler-dev at openjdk.java.net; Lindenmaier, Goetz
>> <goetz.lindenmaier at sap.com>; vladimir.kozlov at oracle.com
>> Subject: [11u backport] RFR(S): 8215317: [GRAAL] unit test
>> CheckGraalIntrinsics failed after 8213754
>>
>> Hi,
>>
>> Could the following backport to 11u be reviewed, please?
>>
>> Bug     : https://bugs.openjdk.java.net/browse/JDK-8215317
>> Change  : http://hg.openjdk.java.net/jdk/jdk/rev/108a161aed93
>> Backport: http://cr.openjdk.java.net/~gromero/8215317_jdk11u/v1/
>>
>> It adds 4 intrinsics to the Graal test CheckGraalIntrinsics.java list so
>> JDK 11u becomes aware of them. Otherwise that test will break once change
>> 8213754 [0] lands 11u (which will effectively add the 4 intrinsics to
>> PPC64/Hotspot and adapt the correlated methods to be intrinsified).
>>
>> The backport changed the inclusion of the intrinsics for JDK 11 or higher,
>> instead for JDK 12 or higher (original patch).
>>
>> This backport was tested on x86_64 with
>> ./test/hotspot/jtreg/compiler/{c1,c2,intrinsics} plus
>> ./test/hotspot/jtreg/compiler/graalunit (with Graal compiler enabled)
>> and no regressions were observed too.
>>
>> Thank you.
>>
>> Best regards,
>> Gustavo
>>
>> [0] https://bugs.openjdk.java.net/browse/JDK-8213754
> 


From gromero at linux.vnet.ibm.com  Mon Jan 21 11:45:44 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Mon, 21 Jan 2019 09:45:44 -0200
Subject: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
 isDigit/isLowerCase/isUpperCase/isWhitespace
In-Reply-To: <2ac3e91da61b43dcb2d4e45325202264@sap.com>
References: <2d4d1747-a83d-5f65-eea3-d982969ae4fd@linux.vnet.ibm.com>
 <2ac3e91da61b43dcb2d4e45325202264@sap.com>
Message-ID: <8083b8db-c546-29e8-c83a-f06ebd4e624e@linux.vnet.ibm.com>

On 01/21/2019 09:10 AM, Lindenmaier, Goetz wrote:
> also this change looks good.

Thanks for reviewing it, Goetz!

I'll ping once the approvals are ok.

Thank you.

Regards,
Gustavo
  
> Best regards,
>    Goetz.
> 
>> -----Original Message-----
>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>> Sent: Freitag, 18. Januar 2019 16:07
>> To: hotspot-compiler-dev at openjdk.java.net; Lindenmaier, Goetz
>> <goetz.lindenmaier at sap.com>; Doerr, Martin <martin.doerr at sap.com>;
>> vladimir.kozlov at oracle.com; Roger Riggs <Roger.Riggs at oracle.com>
>> Cc: Michihiro Horie <HORIE at jp.ibm.com>
>> Subject: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
>> isDigit/isLowerCase/isUpperCase/isWhitespace
>>
>> Hi,
>>
>> Could the following backport to 11u be reviewed, please?
>>
>> Bug     : https://bugs.openjdk.java.net/browse/JDK-8213754
>> Change  : http://hg.openjdk.java.net/jdk/jdk/rev/7384e00d5860
>> Backport: http://cr.openjdk.java.net/~gromero/8213754_jdk11u/v1/
>>
>> It adds 4 intrinsics that use instructions introduced by POWER9 in order to
>> speed up methods isDigit, isLowerCase, isUpperCase, and isWhitespace.
>>
>> The change is mostly PPC64-only but it does touch shared code, for
>> instance, in order to adapt the methods in question to be properly
>> intrinsified. It also needs an additional change [0], since one Graal
>> test has to be adapted (a separated RFR to backport [0] was sent to [1]).
>>
>> The change applies almost cleanly: only a small tweak is necessary because
>> the hunk for ppc.ad file relies on some absent text in the 11u code around
>> the change to be applied. That absent text is related to the Superword
>> feature (a non-related feature), which is not backported yet to 11u.
>>
>> This backport was tested on POWER8 and POWER9 and no regressions were
>> observed.
>>
>> This backport was also tested on x86_64 with
>> ./test/hotspot/jtreg/compiler/{c1,c2,intrinsics} plus
>> ./test/hotspot/jtreg/compiler/graalunit (with Graal compiler enabled) with
>> change 8215317 [0] applied and no regressions were observed too.
>>
>> Thank you.
>>
>> Best regards,
>> Gustavo
>>
>> [0] http://cr.openjdk.java.net/~gromero/8215317_jdk11u/v1/
>> [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2019-
>> January/032266.html
> 


From adinn at redhat.com  Mon Jan 21 11:55:10 2019
From: adinn at redhat.com (Andrew Dinn)
Date: Mon, 21 Jan 2019 11:55:10 +0000
Subject: RFR: 8216392: Enable cmovP_mem and cmovP_memU instructions
In-Reply-To: <CAEgw74C0H1-tj8+yr-oBKvOa4DVw0QEaMoTZF5fw-EZ-E4tf=A@mail.gmail.com>
References: <239a5ec9-170d-9d5c-c624-7bd9a6aed699@redhat.com>
 <ba49e1b8-b7c8-b804-f973-6e88cb2bcd8a@redhat.com>
 <CAEgw74DXHF4cmVzOkHtvGuO937QqaypRFNvuB5sfDw3h8HOUPQ@mail.gmail.com>
 <e5697cd2-49d6-724e-1b63-41de0e771965@redhat.com>
 <b16d13b4-40f4-1503-765a-82799acf1527@redhat.com>
 <a4e64632-1187-1b3a-8c88-ac4b039acc71@redhat.com>
 <81dc5407-4985-883c-83fc-2e1fa2b77e66@redhat.com>
 <CAEgw74ANZ2CKpd=ivKRn5mE61EG_YqpfBbPoMpV=e=3G_ABYpw@mail.gmail.com>
 <c64262ae-8d2f-f773-3688-f1c40ad23f10@redhat.com>
 <CAEgw74DR0mw1avCmyG+k=q=X9gGCsJYSQtp+Tki4nUfcsMKF0Q@mail.gmail.com>
 <3dd85d2c-f4d8-e360-21a2-68254b3c5e2b@redhat.com>
 <2f209ec9-e7f9-8da3-64a2-20ac909b4931@redhat.com>
 <CAEgw74C0H1-tj8+yr-oBKvOa4DVw0QEaMoTZF5fw-EZ-E4tf=A@mail.gmail.com>
Message-ID: <46696e98-6519-58cb-f517-1aca8ea0ebd5@redhat.com>

On 19/01/2019 13:42, B. Blaser wrote:
> I'm definitely not an expert in this area but does ADLC treat this
> really differently from a single LoadP / mov?
> 
> http://hg.openjdk.java.net/jdk/jdk/file/683a112e0e1e/src/hotspot/cpu/x86/x86_64.ad#l5349
You are looking in the wrong place to answer that question. The place
where handling of cases might differ is not in the rule file but in the
implementation of the bottom_type method in the node classes associated
with those rules. That's a tad more difficult to check than meets they
eye at first glance.

The implementation of bottom_type for built-in nodes is in the relevant
classes defined in the opto tree headers. However for machine node
classes it is determined by the code which gets generated when the adlc
preprocessor consumes the rules included in the ad file. So, although
the rules in question here look uniform the derivation of the relevant
bottom types is not guaranteed to be the same. Indeed, that is how it
looks to me. The case handling for rules which match CMove does not
appear (to me) to be able to deal with memory inputs correctly.
Different case handling applies for rules which match LoadP or LoadN and
I very much hope (and expect) it generates code which does compute
bottom types correctly but I have not checked it. I could probably work
out what the differences between the two cases are if I spent the time
studying the code but I'm assuming (hoping :-) someone here knows how it
works and can avoid the need for me to put in that effort.

I would strongly advise against employing these rules without a
guarantee -- from someone who understands the code -- that the comment
about miscomputation of bottom types is not (no longer?) valid. The
rules may have generated valid code in all the cases they have matched
against so far. However, it is the nature of pattern-based programming
models that unexpected matches can turn up at some point and screw the
pooch. Even with existing uses there /might/ be as yet untested compile
contexts where re-ordering of instructions based on miscomputed bottom
types could manifest. It's vital to correctness that the compiler knows
which memory slices instructions are operating on, meaning it's equally
as important that these bottom types are computed correctly.

regards,


Andrew Dinn
-----------
Senior Principal Software Engineer
Red Hat UK Ltd
Registered in England and Wales under Company Registration No. 03798903
Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From aph at redhat.com  Mon Jan 21 12:21:13 2019
From: aph at redhat.com (Andrew Haley)
Date: Mon, 21 Jan 2019 12:21:13 +0000
Subject: [aarch64-port-dev ] RFR(S): 8216259: AArch64: Vectorize Adler32
 intrinsics
In-Reply-To: <DB7PR08MB3115B20F823E5EAF0BA9C327969F0@DB7PR08MB3115.eurprd08.prod.outlook.com>
References: <DB7PR08MB3115B20F823E5EAF0BA9C327969F0@DB7PR08MB3115.eurprd08.prod.outlook.com>
Message-ID: <e17eea64-a72b-468a-8aa0-abc602bef51f@redhat.com>

On 1/21/19 10:53 AM, Pengfei Li (Arm Technology China) wrote:

> I also tested the performance with and without my patch by a JMH
> case [2]. The JMH result shows that the performance gets ~2.5x
> optimized by this.

Fair enough; it does look like an improvement. However, please show us
the actual numbers, especially at small sizes. Also, how much is the
Adler32 checksum actually used? Is it something we care about?

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From rwestrel at redhat.com  Mon Jan 21 12:24:02 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Mon, 21 Jan 2019 13:24:02 +0100
Subject: [12] RFR(S): 8217230: assert(t == t_no_spec) failure in
 NodeHash::check_no_speculative_types()
In-Reply-To: <a360446b-c921-dd1b-4576-6c8059637af4@oracle.com>
References: <a360446b-c921-dd1b-4576-6c8059637af4@oracle.com>
Message-ID: <87lg3e8ahp.fsf@redhat.com>


> http://cr.openjdk.java.net/~thartmann/8217230/webrev.00/

Looks good to me.

Roland.

From nils.eliasson at oracle.com  Mon Jan 21 12:17:50 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Mon, 21 Jan 2019 13:17:50 +0100
Subject: [12] RFR(S): 8217230: assert(t == t_no_spec) failure in
 NodeHash::check_no_speculative_types()
In-Reply-To: <a360446b-c921-dd1b-4576-6c8059637af4@oracle.com>
References: <a360446b-c921-dd1b-4576-6c8059637af4@oracle.com>
Message-ID: <dd400e63-f949-7c7f-e0c5-bca61081815e@oracle.com>

Looks good!

// Nils

On 2019-01-21 09:21, Tobias Hartmann wrote:
> Hi,
>
> please review the following patch:
> https://bugs.openjdk.java.net/browse/JDK-8217230
> http://cr.openjdk.java.net/~thartmann/8217230/webrev.00/
>
> A SafePointNode becomes dead when being cut off from root in Compile::remove_root_to_sfpts_edges()
> but is not processed by IGVN and therefore remains in the graph. Since it is not reachable by root
> anymore, it is not processed by Compile::remove_speculative_types and we hit the assert.
>
> The problem was introduced by the fix for JDK-8214862 [1] in JDK 12 b27.
>
> Thanks,
> Tobias
>
> [1] https://bugs.openjdk.java.net/browse/JDK-8214862

From tobias.hartmann at oracle.com  Mon Jan 21 12:25:48 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 21 Jan 2019 13:25:48 +0100
Subject: [12] RFR(S): 8217230: assert(t == t_no_spec) failure in
 NodeHash::check_no_speculative_types()
In-Reply-To: <87lg3e8ahp.fsf@redhat.com>
References: <a360446b-c921-dd1b-4576-6c8059637af4@oracle.com>
 <87lg3e8ahp.fsf@redhat.com>
Message-ID: <0aba9d71-ce73-049c-5d10-af89d4446b24@oracle.com>

Thanks Roland.

Best regards,
Tobias

On 21.01.19 13:24, Roland Westrelin wrote:
> 
>> http://cr.openjdk.java.net/~thartmann/8217230/webrev.00/
> 
> Looks good to me.
> 
> Roland.
> 

From aph at redhat.com  Mon Jan 21 12:27:56 2019
From: aph at redhat.com (Andrew Haley)
Date: Mon, 21 Jan 2019 12:27:56 +0000
Subject: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
 locking optimisation not triggered
In-Reply-To: <a20904db-429c-0739-68fa-bc24f2b13972@arm.com>
References: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
 <f1d6de8b-a308-a119-57a5-c62fbacb4e88@redhat.com>
 <4a09e8b7-9990-aa66-0afb-bf4e41cab831@arm.com>
 <a3fb9ae8-42f4-826c-ad22-32be5e9de24d@redhat.com>
 <79118967-c5b6-ca5c-7c6b-4adb80a4ed60@arm.com>
 <f115fa0c-e3a5-549c-4ec4-4fbfc2d47f6a@redhat.com>
 <a20904db-429c-0739-68fa-bc24f2b13972@arm.com>
Message-ID: <ad79d164-6c82-34bd-4a98-3584792958e6@redhat.com>

Hi,

On 1/21/19 9:27 AM, Nick Gasson (Arm Technology China) wrote:

> On 21/01/2019 17:10, Andrew Haley wrote:
>> On 1/21/19 6:01 AM, Nick Gasson (Arm Technology China) wrote:
>>> OK I'll change all three places in aarch64_enc_fast_lock/unlock that do
>>> a compare-exchange to use MacroAssembler::cmpxchg.
>>
>> If you wish: be aware that if you change anything other than this place there'll
>> be a lot more testing to do, and review will take longer.
> 
> I think it will be confusing for anyone looking at these functions in 
> the future to have one call to cmpxhg and then two copies of essentially 
> the same code inlined a few lines afterwards. IMO we should either 
> change all three for consistency, or stick with the original minimal 
> patch (+ Derek's cleanup suggestions) which should be easier to review.

OK, if that's your position: you're writing the patch. Using cmpxhg
everywhere will make that rather twisted code much easier to read.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From tobias.hartmann at oracle.com  Mon Jan 21 12:27:33 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 21 Jan 2019 13:27:33 +0100
Subject: [12] RFR(S): 8217230: assert(t == t_no_spec) failure in
 NodeHash::check_no_speculative_types()
In-Reply-To: <dd400e63-f949-7c7f-e0c5-bca61081815e@oracle.com>
References: <a360446b-c921-dd1b-4576-6c8059637af4@oracle.com>
 <dd400e63-f949-7c7f-e0c5-bca61081815e@oracle.com>
Message-ID: <717ae8b0-35d3-cdb9-c6d4-23f3e5bbadde@oracle.com>

Thanks Nils.

Best regards,
Tobias

On 21.01.19 13:17, Nils Eliasson wrote:
> Looks good!
> 
> // Nils
> 
> On 2019-01-21 09:21, Tobias Hartmann wrote:
>> Hi,
>>
>> please review the following patch:
>> https://bugs.openjdk.java.net/browse/JDK-8217230
>> http://cr.openjdk.java.net/~thartmann/8217230/webrev.00/
>>
>> A SafePointNode becomes dead when being cut off from root in Compile::remove_root_to_sfpts_edges()
>> but is not processed by IGVN and therefore remains in the graph. Since it is not reachable by root
>> anymore, it is not processed by Compile::remove_speculative_types and we hit the assert.
>>
>> The problem was introduced by the fix for JDK-8214862 [1] in JDK 12 b27.
>>
>> Thanks,
>> Tobias
>>
>> [1] https://bugs.openjdk.java.net/browse/JDK-8214862

From nils.eliasson at oracle.com  Mon Jan 21 12:20:22 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Mon, 21 Jan 2019 13:20:22 +0100
Subject: [13] RFR(S): 8217291: Failure of ::realloc() should be handled
 correctly in adlc/forms.cpp
In-Reply-To: <984d33e8-1aab-6fd5-9f45-64b4b08421f2@oracle.com>
References: <984d33e8-1aab-6fd5-9f45-64b4b08421f2@oracle.com>
Message-ID: <8bf33a6e-3d53-ae39-d301-ea097d14088d@oracle.com>

Looks good!

// Nils

On 2019-01-21 10:47, Tobias Hartmann wrote:
> Hi,
>
> please review the following patch:
> https://bugs.openjdk.java.net/browse/JDK-8217291
> http://cr.openjdk.java.net/~thartmann/8217291/webrev.00/
>
> Similar to the fix for JDK-8212779 [1], I've introduced a wrapper method for re-allocation that
> handles failures by printing a message and exiting.
>
> Thanks,
> Tobias
>
> [1] http://hg.openjdk.java.net/jdk/jdk/rev/a3aa8d5380d9

From tobias.hartmann at oracle.com  Mon Jan 21 12:36:09 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 21 Jan 2019 13:36:09 +0100
Subject: [13] RFR(S): 8217291: Failure of ::realloc() should be handled
 correctly in adlc/forms.cpp
In-Reply-To: <8bf33a6e-3d53-ae39-d301-ea097d14088d@oracle.com>
References: <984d33e8-1aab-6fd5-9f45-64b4b08421f2@oracle.com>
 <8bf33a6e-3d53-ae39-d301-ea097d14088d@oracle.com>
Message-ID: <bf26cf72-d5cd-579e-2ef0-c92f01de79c1@oracle.com>

Thanks Nils!

Best regards,
Tobias

On 21.01.19 13:20, Nils Eliasson wrote:
> Looks good!
> 
> // Nils
> 
> On 2019-01-21 10:47, Tobias Hartmann wrote:
>> Hi,
>>
>> please review the following patch:
>> https://bugs.openjdk.java.net/browse/JDK-8217291
>> http://cr.openjdk.java.net/~thartmann/8217291/webrev.00/
>>
>> Similar to the fix for JDK-8212779 [1], I've introduced a wrapper method for re-allocation that
>> handles failures by printing a message and exiting.
>>
>> Thanks,
>> Tobias
>>
>> [1] http://hg.openjdk.java.net/jdk/jdk/rev/a3aa8d5380d9

From doug.simon at oracle.com  Mon Jan 21 12:57:02 2019
From: doug.simon at oracle.com (Doug Simon)
Date: Mon, 21 Jan 2019 13:57:02 +0100
Subject: RFR: 8217445: [JVMCI] incorrect management of JVMCI compilation
 failure reason string
Message-ID: <6E1B238A-8546-4163-A3E5-D155AF18EB47@oracle.com>

The CompileTask::_failure_reason field assumes it is only ever assigned a compile-time constant string value (i.e. never needs to be freed). This is not the case when the value is derived from a JVMCI exception message. This patch adds support for managing a C heap allocated value in this field.

https://bugs.openjdk.java.net/browse/JDK-8217445
http://cr.openjdk.java.net/~dnsimon/8217445

-Doug


From tobias.hartmann at oracle.com  Mon Jan 21 13:02:28 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 21 Jan 2019 14:02:28 +0100
Subject: [13] RFR(S): 8217447: Develop flag TraceICs is broken
Message-ID: <1690f02c-7452-07ac-4055-94760ea3609c@oracle.com>

Hi,

please review the following patch:
https://bugs.openjdk.java.net/browse/JDK-8217447
http://cr.openjdk.java.net/~thartmann/8217447/webrev.00/

While working on the value type calling convention, I've noticed that -XX:+TraceICs is broken. The
problem is that info.cached_metadata() can be NULL for optimized calls (the assert right before even
verifies that).

I've also removed the ":" from the output.

Before:
IC at 0x00007f8020ae948b: monomorphic to compiled (rcvr klass) NULL:

After:
IC at 0x00007f8020ae948b: monomorphic to compiled (rcvr klass = NULL)

Thanks,
Tobias

From aph at redhat.com  Mon Jan 21 13:12:29 2019
From: aph at redhat.com (Andrew Haley)
Date: Mon, 21 Jan 2019 13:12:29 +0000
Subject: [aarch64-port-dev ] RFR(S): 8216259: AArch64: Vectorize Adler32
 intrinsics
In-Reply-To: <e17eea64-a72b-468a-8aa0-abc602bef51f@redhat.com>
References: <DB7PR08MB3115B20F823E5EAF0BA9C327969F0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <e17eea64-a72b-468a-8aa0-abc602bef51f@redhat.com>
Message-ID: <8b95459f-4acd-729b-5174-670460b76c58@redhat.com>

On 1/21/19 12:21 PM, Andrew Haley wrote:

> Also, how much is the Adler32 checksum actually used? Is it
> something we care about?

... the ZIP file format uses Adler32, but as far as I remember we're
using zlib, an external library, for our zipfile handling (i.e. our
jar files.) If we are using an external library then the performance
of our intrinsicmight not matter at all, Please check.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From dmitry.chuyko at bell-sw.com  Mon Jan 21 14:11:12 2019
From: dmitry.chuyko at bell-sw.com (Dmitry Chuyko)
Date: Mon, 21 Jan 2019 17:11:12 +0300
Subject: [aarch64-port-dev ] RFR(S): 8216259: AArch64: Vectorize Adler32
 intrinsics
In-Reply-To: <8b95459f-4acd-729b-5174-670460b76c58@redhat.com>
References: <DB7PR08MB3115B20F823E5EAF0BA9C327969F0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <e17eea64-a72b-468a-8aa0-abc602bef51f@redhat.com>
 <8b95459f-4acd-729b-5174-670460b76c58@redhat.com>
Message-ID: <7b071ae1-7bf5-9d9a-f5ef-2b5d26d57de3@bell-sw.com>

Adler32 may be chosen as HDFS checksum. Hadoop uses 512 byte blocks by 
default.

I see some speedups on Cavium Thunder X (1st gen, TX2 data later) with 
provided patch:

64 B. 8%
512 B. 10%
1 MB. 10%.


We considered following improvements without using vector instructions. 
Just split loads and break some data dependencies like:

 ??? __ ldr(temp0, Address(__ post(buff, 8)));
 ??? __ ldr(temp1, Address(__ post(buff, 8)));

 ??? __ add(s1, s1, temp0, ext::uxtb);
 ??? __ ubfx(temp2, temp0, 8, 8);
 ??? __ add(s2, s2, s1);
 ??? __ add(s1, s1, temp2);
 ??? __ ubfx(temp3, temp0, 16, 8);
 ??? __ add(s2, s2, s1);
 ??? __ add(s1, s1, temp3);
 ??? __ ubfx(temp2, temp0, 24, 8);
 ??? __ add(s2, s2, s1);
 ??? __ add(s1, s1, temp2);
 ??? __ ubfx(temp3, temp0, 32, 8);
 ??? __ add(s2, s2, s1);
 ??? __ add(s1, s1, temp3);
 ??? __ ubfx(temp2, temp0, 40, 8);
 ??? __ add(s2, s2, s1);
 ??? __ add(s1, s1, temp2);
 ??? __ ubfx(temp3, temp0, 48, 8);
 ??? __ add(s2, s2, s1);
 ??? __ add(s1, s1, temp3);

It shows 23% improvement on TX1 for size=512 but relatively the same 
performance as baseline on TX2.

-Dmitry

On 1/21/19 4:12 PM, Andrew Haley wrote:
> On 1/21/19 12:21 PM, Andrew Haley wrote:
>
>> Also, how much is the Adler32 checksum actually used? Is it
>> something we care about?
> ... the ZIP file format uses Adler32, but as far as I remember we're
> using zlib, an external library, for our zipfile handling (i.e. our
> jar files.) If we are using an external library then the performance
> of our intrinsicmight not matter at all, Please check.
>

From adinn at redhat.com  Mon Jan 21 14:58:50 2019
From: adinn at redhat.com (Andrew Dinn)
Date: Mon, 21 Jan 2019 14:58:50 +0000
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <75d28ca7-9e80-4bd4-11a6-c858048e4380@bell-sw.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
 <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <efb075f1-5815-b2e0-31eb-3dfb8a841528@bell-sw.com>
 <07582b62-ccdf-97c8-5bd9-f441b488fa03@bell-sw.com>
 <79558b49-6375-f0d4-1278-f66a4f470b13@redhat.com>
 <75d28ca7-9e80-4bd4-11a6-c858048e4380@bell-sw.com>
Message-ID: <37a39d6a-6b1d-2d96-9808-9141359114c0@redhat.com>

Hello Dmitrij,

On 10/01/2019 15:10, Dmitrij Pochepko wrote:

> I?ll focus on addressing your technical questions about testing this
> patch and intrinsic first.
> . . .
> I referenced this test in initial review request for this intrinsic. It
> takes a long time to run, so I did not include it in the webrev. I'm
> going to update the webrev to include a subset of this test as jtreg.

Ok, thank you for providing full details of the testing regime. If you
add the test as a jtreg test then I'm happy for it and your one line fix
to be pushed.

> Even brute force tests with 100% code coverage don't guarantee 100%
> correctness. The search-garbage-after-string test case for "algorithm G"
> and StringBuilder::setLength usage is a good catch by Stefan and
> Pengfei. And recent webrev addresses this case. I also tested a case
> symmetric to Pengfei's case checking that no "garbage" is read before
> specified source string [4]. I also am going to include it in the webrev.

I am aware of the limits of brute force methods. However, note that in
my previous post I set the bar at tests that would inspire confidence in
the code not ones that would guarantee correctness. God forbid that we
go down the route of formal verification, Grails are hard to come by.

The second, extra jtreg test is good.

> Indeed it is hard to review complex algorithms. The Boyer-Moore comments
> you referenced were updated as part of the original webrev to describe
> changes in algorithm E, which is in macroAssembler_aarch64.cpp. I once
> asked to validate the level of comments with you during pow function
> review [3]. If this is the level of comments you find reasonable, I?ll
> be happy to improve it here and elsewhere to this level.

Yes, I believe the code generated in the stub needs more documentation.
However, it is important to fix what is currently broken quickly. Please
raise a separate JIRA for the doc fixes and then submit an algorithm
and/or comments in the generator code that explain what the stub is doing.

> Once again, this is to address your question around testing for this
> intrinsic and patch. We are working on testing and review complex
> intrinsics to handle the wider problem of ensuring better quality of
> AArch64 intrinsics. We?ll follow up in a different email on that.
Well, one thing that needs to form part of that discussion is the
potential benefit of these patches vs the cost of producing, reviewing
and maintaining them. Included in the equation for the benefits is the
number of users it will help and the criticality of the problem they
face without the patch. On the costs side we need to factor in the
effort needed to clearly document complex code compared with the
potential cost of someone having to pick it up later and also the
potential, even with good documentation, of the resulting code becoming
a fly trap for developer and/or maintainer time.

regards,


Andrew Dinn
-----------
Senior Principal Software Engineer
Red Hat UK Ltd
Registered in England and Wales under Company Registration No. 03798903
Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From adinn at redhat.com  Mon Jan 21 16:14:37 2019
From: adinn at redhat.com (Andrew Dinn)
Date: Mon, 21 Jan 2019 16:14:37 +0000
Subject: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over
 non-volatile memory
In-Reply-To: <a377e3da-549a-1ee1-a416-ef243545f92b@oracle.com>
References: <07d4d0fe-3a2a-414c-74ac-69e8527785d9@redhat.com>
 <3df3b5cd-dbc7-7fdd-bfc5-2a54d11127da@redhat.com>
 <27c9458d-7257-378a-4e3a-bd03402794be@oracle.com>
 <df03ca64-d68a-ca8b-5309-395672ac7dcb@redhat.com>
 <09c92b1a-c6da-e16b-bb68-553876e8a6ea@oracle.com>
 <a68f718e-9f87-f7e5-515c-f63d827fedaa@redhat.com>
 <ed8fd92d-fd4e-dc30-af8c-8d30da37560c@oracle.com>
 <b19ba189-0141-3e25-8301-48f2d3ce3851@redhat.com>
 <b3f402a3-65ba-b776-cc9e-40ceaaeafefd@redhat.com>
 <f21dac0b-0043-7e66-72a4-c64a45fb59d3@redhat.com>
 <2a0a385d-81ee-df66-f147-e4dd9aa5b72e@oracle.com>
 <8b2ab749-20f1-8dd3-3cc7-64db5d45bc7d@redhat.com>
 <0aae37aa-7797-fde5-63d5-96c8eb961183@oracle.com>
 <86a1988a-a8d2-b6af-0985-11a94d6d76a5@redhat.com>
 <69510788-52e6-815b-1ed7-a6f4886d0398@oracle.com>
 <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>
 <34cfc530-8517-ac1a-0c04-446dc3dc2436@oracle.com>
 <21ef0e11-3f3d-e9a4-5dc6-898d4ac18efa@redhat.com>
 <a377e3da-549a-1ee1-a416-ef243545f92b@oracle.com>
Message-ID: <b55e9b08-297a-c28f-9e65-ea551129ecd2@redhat.com>

Hi Alan,

On 18/01/2019 13:32, Alan Bateman wrote:

> I had a brief discussion with Brian about this yesterday. He brought up
> the same concern about using MBB as it's not the right API for this in
> the longer term.? So this JEP is very much about a short term/tactical
> solution as we've already concluded here. This leads to the question as
> to whether this JEP needs to evolve the standard/Java SE API or not.
> It's convenient for the implementation of course but we should at least
> explore doing this as a JDK-specific feature.

I disagree with your characterization of use of MBB as a short term/
tactical solution. Despite not being entirely suitable for the task MBB
is a de facto standard way for many applications to gain direct access
to files of data located on persistent storage. The current proposal is
not, as you characterize it, a quick fix to use MBB as a temporary way
to access NVM storage until something better comes along. The intention
is rather to ensure that the current API caters for a new addition to
the persistent memory tier. The imperative is to allow existing code to
employ it now.

Of course, a better API may come along for accessing persistent storage,
whether that be NVM, flash disk or spinning platter. However, I would
hazard that in many cases existing application code and libraries will
still want/need to continue to use the MBB API, including cases where
that storage can most usefully be NVM. Rewriting application code to use
a new API will not always be feasible or cost-effective. Yet, the
improved speed of NVM suggests that an API encompassing this new case
will be very welcome and may well be cost-effective to adopt.

In sum, far from being a stop-gap this proposal should be seen as a step
towards completing and maintaining the existing MBB API for emergent tech.

> To that end, one approach to explore is allowing the FC.map method
> accept map modes beyond those defined by MapMode. There is precedence
> for extensibility in this area already, e.g. FC.open allows you to
> specify options beyond the standard options specified by the method. It
> would require MapMode to define a protected constructor and would
> require a bit of plumbing to support MapMode defined in a JDK-specific
> module but there are examples to point to. Another approach is aanother
> class in a JDK-specific module to define the map method. It would
> require the same plumbing under the covers but would avoid touch the FC
> spec.
I'm not sure what this side-step is supposed to achieve nor how that
relates to the concerns over use of MBB (perhaps it doesn't). I'm not
really clear what problem you are trying to avoid here by allowing the
MapMode enum to be extensible via a protected constructor.

If your desire is to avoid adding extra API surface to FileChannel then
where would you consider it appropriate to add such a surface. Something
is going to have to create and employ the extra enum tags that are
currently proposed for addition to MapMode. How is a client application
going to reach that something?

Perhaps we might benefit form looking at a simple example? Currently, my
most basic test program drives the API to create an MBB as follows:

        . . .
        String dir = "/mnt/pmem/test"; // mapSync should work, since fs
mount is -o dax

        Path path = new File(dir, "pmemtest").toPath();

        FileChannel fileChannel = (FileChannel) Files
                .newByteChannel(path, EnumSet.of(
                        StandardOpenOption.READ,
                        StandardOpenOption.WRITE,
                        StandardOpenOption.CREATE));

        MappedByteBuffer mappedByteBuffer =
fileChannel.map(FileChannel.MapMode.READ_WRITE_PERSISTENT, 0, 1024);
        . . .

Could you give a sketch of an alternative way that you see a client
operating?

One thing I did wonder about was whether we could insert the relevant
behavioural switch in the call to Files.newByteChannel rather than the
map call?

If we passed an ExtendedOpenOption (e.g. ExtendedOpenOption.SYNC) to
this call then method newByteChannel could create and return a
corresponding variant of FleChannelImpl, say an instance of a subclass
called SyncFileChannelImpl. This could behave as per a normal
FileChannelImpl apart from adding the MAP_SYNC flag to the mmap call
(well, also rejecting PRIVATE maps).

Would that be a better way to drive this? Would it address the concerns
you raised above?

regards,


Andrew Dinn
-----------
Senior Principal Software Engineer
Red Hat UK Ltd
Registered in England and Wales under Company Registration No. 03798903
Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From lutz.schmidt at sap.com  Mon Jan 21 17:15:52 2019
From: lutz.schmidt at sap.com (Schmidt, Lutz)
Date: Mon, 21 Jan 2019 17:15:52 +0000
Subject: RFR(S, tedious): 8217250: Optimize CodeHeap Analytics
In-Reply-To: <CE7CC46E-F9D9-4B52-B040-46BC1B25CA49@sap.com>
References: <D481567B-EA9D-4BD2-A55C-A0AF157F0EDD@sap.com>
 <c178e47c-279b-47f8-6876-2ad28593d630@oracle.com>
 <A2738EC3-3F36-4898-87F9-EC786EA19D46@sap.com>
 <2d7b7963-61be-95b8-017b-956f2752c8f3@oracle.com>
 <9AD26B9E-015A-4BD4-A44F-12DDE2793ED0@sap.com>
 <4e61a8a5-3c6e-3c4f-0a2c-68d4b8bc2f9f@oracle.com>
 <52136751-929b-4976-477d-93282ce0a0d7@oracle.com>
 <CE7CC46E-F9D9-4B52-B040-46BC1B25CA49@sap.com>
Message-ID: <ECFEB55F-D401-4EC9-9B35-A721786CBEE6@sap.com>

Hi all, 
as said on Friday, I rebased the changeset to jdk/jdk and pushed it. The pushed version can be found at
  http://cr.openjdk.java.net/~lucy/webrevs/8217250.02/
It is identical to version 01 which was based on jdk12.
Thanks, 
Lutz 

?On 18.01.19, 17:05, "Schmidt, Lutz" <lutz.schmidt at sap.com> wrote:

    Thank you, Tobias!
    
    As this enhancement will not make it into jdk12, I'll rebase it to jdk/jdk. I expect no conflicts and assume I can then push without further webrev/review. 
    
    Thanks,
    Lutz
    
    On 18.01.19, 10:49, "Tobias Hartmann" <tobias.hartmann at oracle.com> wrote:
    
        Hi Lutz,
        
        looks good to me too.
        
        Best regards,
        Tobias
        
        On 17.01.19 19:39, Vladimir Kozlov wrote:
        > Looks good
        > 
        > Thanks,
        > Vladimir
        > 
        > On 1/17/19 7:47 AM, Schmidt, Lutz wrote:
        >> Hi Vladimir & all,
        >> there is a new webrev available: http://cr.openjdk.java.net/~lucy/webrevs/8217250.01/
        >> What's new (in addition to some comments) is the macro
        >>
        >>    // Flush the buffer contents if the remaining capacity is less
        >>    // than the calculated threshold (256 bytes + capacity/16)
        >>    // That should suffice for all reasonably sized output lines.
        >>    #define BUFFEREDSTREAM_FLUSH_AUTO(_termString)                \
        >>        BUFFEREDSTREAM_FLUSH_IF(_termString, 256+(_capacity>>4))
        >>
        >> It replaced the previous BUFFEREDSTREAM_FLUSH_IF("string", 512) occurrences.
        >> Regards,
        >> Lutz
        >>
        >> On 16.01.19, 22:53, "Vladimir Kozlov" <vladimir.kozlov at oracle.com> wrote:
        >>
        >>      On 1/16/19 12:37 PM, Schmidt, Lutz wrote:
        >>      > Hi Vladimir,
        >>      >
        >>      > thanks a lot for looking at this so quickly.
        >>      >
        >>      > Sure, I could declare a specialized "BUFFEREDSTREAM_FLUSH_512" for this. The "512"
        >> originated from the thought "its large enough for a well-behaved line and small enough to save
        >> some flushes".
        >>      >
        >>      > I was also thinking about a "BUFFEREDSTREAM_FLUSH_AUTO", where the spare space is derived
        >> from the buffer capacity, maybe something like 10 percent of the capacity, 256 bytes minimum. I
        >> wasn't sure if that could be categorized as over-engineered.
        >>           Yes, I think BUFFEREDSTREAM_FLUSH_AUTO is better than fixed size.
        >>           Vladimir
        >>           >
        >>      > Your thoughts?
        >>      >
        >>      > Thanks,
        >>      > Lutz
        >>      >
        >>      > On 16.01.19, 19:10, "hotspot-compiler-dev on behalf of Vladimir Kozlov"
        >> <hotspot-compiler-dev-bounces at openjdk.java.net on behalf of vladimir.kozlov at oracle.com> wrote:
        >>      >
        >>      >      Hi Lutz,
        >>      >
        >>      >      I see that you have only one usage in all cases for:
        >>      >      BUFFEREDSTREAM_FLUSH_IF("", 512)
        >>      >
        >>      >      Can you simple declare simplified macro for this?
        >>      >
        >>      >      Otherwise looks good.
        >>      >
        >>      >      Thanks,
        >>      >      Vladimir
        >>      >
        >>      >      On 1/16/19 6:52 AM, Schmidt, Lutz wrote:
        >>      >      > Dear all,
        >>      >      >
        >>      >      > may I please have reviews for this (semantically) small change. Its purpose is to
        >> reduce the bufferedStream buffer flushes while printing CodeHeap Analytics.
        >>      >      >
        >>      >      > Bug:    https://bugs.openjdk.java.net/browse/JDK-8217250
        >>      >      > Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217250.00/
        >>      >      >
        >>      >      > Thank you!
        >>      >      > Lutz
        >>      >      >
        >>      >      >
        >>      >
        >>      >
        >>     
        
    
From vladimir.kozlov at oracle.com  Mon Jan 21 17:44:55 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 21 Jan 2019 09:44:55 -0800
Subject: RFR: 8217445: [JVMCI] incorrect management of JVMCI compilation
 failure reason string
In-Reply-To: <6E1B238A-8546-4163-A3E5-D155AF18EB47@oracle.com>
References: <6E1B238A-8546-4163-A3E5-D155AF18EB47@oracle.com>
Message-ID: <65d4fb89-4f30-8ace-f455-ba2393d4832f@oracle.com>

Hi Doug,

Looks good. Thank you for fixing it.

Vladimir

On 1/21/19 4:57 AM, Doug Simon wrote:
> The CompileTask::_failure_reason field assumes it is only ever assigned a compile-time constant string value (i.e. never needs to be freed). This is not the case when the value is derived from a JVMCI exception message. This patch adds support for managing a C heap allocated value in this field.
> 
> https://bugs.openjdk.java.net/browse/JDK-8217445
> http://cr.openjdk.java.net/~dnsimon/8217445
> 
> -Doug
> 

From vladimir.kozlov at oracle.com  Mon Jan 21 17:48:36 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 21 Jan 2019 09:48:36 -0800
Subject: [13] RFR(S): 8217447: Develop flag TraceICs is broken
In-Reply-To: <1690f02c-7452-07ac-4055-94760ea3609c@oracle.com>
References: <1690f02c-7452-07ac-4055-94760ea3609c@oracle.com>
Message-ID: <6d9dfbea-c600-1717-bcad-85f8a9462b32@oracle.com>

Looks good.

Thanks,
Vladimir

On 1/21/19 5:02 AM, Tobias Hartmann wrote:
> Hi,
> 
> please review the following patch:
> https://bugs.openjdk.java.net/browse/JDK-8217447
> http://cr.openjdk.java.net/~thartmann/8217447/webrev.00/
> 
> While working on the value type calling convention, I've noticed that -XX:+TraceICs is broken. The
> problem is that info.cached_metadata() can be NULL for optimized calls (the assert right before even
> verifies that).
> 
> I've also removed the ":" from the output.
> 
> Before:
> IC at 0x00007f8020ae948b: monomorphic to compiled (rcvr klass) NULL:
> 
> After:
> IC at 0x00007f8020ae948b: monomorphic to compiled (rcvr klass = NULL)
> 
> Thanks,
> Tobias
> 

From aph at redhat.com  Mon Jan 21 17:51:38 2019
From: aph at redhat.com (Andrew Haley)
Date: Mon, 21 Jan 2019 17:51:38 +0000
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <37a39d6a-6b1d-2d96-9808-9141359114c0@redhat.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
 <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <efb075f1-5815-b2e0-31eb-3dfb8a841528@bell-sw.com>
 <07582b62-ccdf-97c8-5bd9-f441b488fa03@bell-sw.com>
 <79558b49-6375-f0d4-1278-f66a4f470b13@redhat.com>
 <75d28ca7-9e80-4bd4-11a6-c858048e4380@bell-sw.com>
 <37a39d6a-6b1d-2d96-9808-9141359114c0@redhat.com>
Message-ID: <0ff14be6-f98f-89af-2eea-6eb635d8bd14@redhat.com>

On 1/21/19 2:58 PM, Andrew Dinn wrote:
>> Once again, this is to address your question around testing for this
>> intrinsic and patch. We are working on testing and review complex
>> intrinsics to handle the wider problem of ensuring better quality of
>> AArch64 intrinsics. We?ll follow up in a different email on that.

> Well, one thing that needs to form part of that discussion is the
> potential benefit of these patches vs the cost of producing, reviewing
> and maintaining them. Included in the equation for the benefits is the
> number of users it will help and the criticality of the problem they
> face without the patch. On the costs side we need to factor in the
> effort needed to clearly document complex code compared with the
> potential cost of someone having to pick it up later and also the
> potential, even with good documentation, of the resulting code becoming
> a fly trap for developer and/or maintainer time.

We do. I was concerned about the complexity of the Boyer-Moore-
Horspool algorithm at the time but was persuaded to admit it. These
days I'd push back more: the last year or two of the AArch64 project
has hardened my attitude.


Rob Pike's 5 Rules of Programming

    Rule 1. You can't tell where a program is going to spend its
    time. Bottlenecks occur in surprising places, so don't try to
    second guess and put in a speed hack until you've proven that's
    where the bottleneck is.

    Rule 2. Measure. Don't tune for speed until you've measured, and
    even then don't unless one part of the code overwhelms the rest.

    Rule 3. Fancy algorithms are slow when n is small, and n is
    usually small. Fancy algorithms have big constants. Until you know
    that n is frequently going to be big, don't get fancy. (Even if n
    does get big, use Rule 2 first.)

    Rule 4. Fancy algorithms are buggier than simple ones, and they're
    much harder to implement. Use simple algorithms as well as simple
    data structures.

    ...

    More at https://users.ece.utexas.edu/~adnan/pike.html

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From vladimir.kozlov at oracle.com  Mon Jan 21 17:51:18 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 21 Jan 2019 09:51:18 -0800
Subject: [13] RFR(S): 8217291: Failure of ::realloc() should be handled
 correctly in adlc/forms.cpp
In-Reply-To: <8bf33a6e-3d53-ae39-d301-ea097d14088d@oracle.com>
References: <984d33e8-1aab-6fd5-9f45-64b4b08421f2@oracle.com>
 <8bf33a6e-3d53-ae39-d301-ea097d14088d@oracle.com>
Message-ID: <19ce3914-23c8-e588-6dc9-36bad3cc2f69@oracle.com>

+1

Thanks,
Vladimir

On 1/21/19 4:20 AM, Nils Eliasson wrote:
> Looks good!
> 
> // Nils
> 
> On 2019-01-21 10:47, Tobias Hartmann wrote:
>> Hi,
>>
>> please review the following patch:
>> https://bugs.openjdk.java.net/browse/JDK-8217291
>> http://cr.openjdk.java.net/~thartmann/8217291/webrev.00/
>>
>> Similar to the fix for JDK-8212779 [1], I've introduced a wrapper method for re-allocation that
>> handles failures by printing a message and exiting.
>>
>> Thanks,
>> Tobias
>>
>> [1] http://hg.openjdk.java.net/jdk/jdk/rev/a3aa8d5380d9

From martin.doerr at sap.com  Mon Jan 21 18:07:13 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Mon, 21 Jan 2019 18:07:13 +0000
Subject: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
Message-ID: <AM6PR02MB47889B74BC2F81B06ACEC28C9A9F0@AM6PR02MB4788.eurprd02.prod.outlook.com>

Hi,

PPC64 currently contains static tables for CRC32/CRC32C calculations. We only need some of them depending on Endianess and on whether vector instructions are available or not.
We can get rid of quite some code when we generate these constants at startup as we already do for the vector version.

In addition, we can save one register in the vector case because we can use one constants pointer for all related constants.

Webrev:
http://cr.openjdk.java.net/~mdoerr/8217459_ppc64_crc_consts/webrev.00/

Please review.

Best regards,
Martin

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190121/8d71f665/attachment.html>

From tobias.hartmann at oracle.com  Mon Jan 21 18:15:38 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 21 Jan 2019 19:15:38 +0100
Subject: [13] RFR(S): 8217291: Failure of ::realloc() should be handled
 correctly in adlc/forms.cpp
In-Reply-To: <19ce3914-23c8-e588-6dc9-36bad3cc2f69@oracle.com>
References: <984d33e8-1aab-6fd5-9f45-64b4b08421f2@oracle.com>
 <8bf33a6e-3d53-ae39-d301-ea097d14088d@oracle.com>
 <19ce3914-23c8-e588-6dc9-36bad3cc2f69@oracle.com>
Message-ID: <aa3fbec1-fa01-8b99-8268-ec84e6ba0eb1@oracle.com>

Thanks Vladimir.

Best regards,
Tobias

On 21.01.19 18:51, Vladimir Kozlov wrote:
> +1
> 
> Thanks,
> Vladimir
> 
> On 1/21/19 4:20 AM, Nils Eliasson wrote:
>> Looks good!
>>
>> // Nils
>>
>> On 2019-01-21 10:47, Tobias Hartmann wrote:
>>> Hi,
>>>
>>> please review the following patch:
>>> https://bugs.openjdk.java.net/browse/JDK-8217291
>>> http://cr.openjdk.java.net/~thartmann/8217291/webrev.00/
>>>
>>> Similar to the fix for JDK-8212779 [1], I've introduced a wrapper method for re-allocation that
>>> handles failures by printing a message and exiting.
>>>
>>> Thanks,
>>> Tobias
>>>
>>> [1] http://hg.openjdk.java.net/jdk/jdk/rev/a3aa8d5380d9

From tobias.hartmann at oracle.com  Mon Jan 21 18:15:21 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 21 Jan 2019 19:15:21 +0100
Subject: [13] RFR(S): 8217447: Develop flag TraceICs is broken
In-Reply-To: <6d9dfbea-c600-1717-bcad-85f8a9462b32@oracle.com>
References: <1690f02c-7452-07ac-4055-94760ea3609c@oracle.com>
 <6d9dfbea-c600-1717-bcad-85f8a9462b32@oracle.com>
Message-ID: <d2ea3d59-7b94-8e3f-dd5f-c1f13a5e175f@oracle.com>

Thanks Vladimir.

Best regards,
Tobias

On 21.01.19 18:48, Vladimir Kozlov wrote:
> Looks good.
> 
> Thanks,
> Vladimir
> 
> On 1/21/19 5:02 AM, Tobias Hartmann wrote:
>> Hi,
>>
>> please review the following patch:
>> https://bugs.openjdk.java.net/browse/JDK-8217447
>> http://cr.openjdk.java.net/~thartmann/8217447/webrev.00/
>>
>> While working on the value type calling convention, I've noticed that -XX:+TraceICs is broken. The
>> problem is that info.cached_metadata() can be NULL for optimized calls (the assert right before even
>> verifies that).
>>
>> I've also removed the ":" from the output.
>>
>> Before:
>> IC at 0x00007f8020ae948b: monomorphic to compiled (rcvr klass) NULL:
>>
>> After:
>> IC at 0x00007f8020ae948b: monomorphic to compiled (rcvr klass = NULL)
>>
>> Thanks,
>> Tobias
>>

From felix.yang at huawei.com  Tue Jan 22 01:17:47 2019
From: felix.yang at huawei.com (Yangfei (Felix))
Date: Tue, 22 Jan 2019 01:17:47 +0000
Subject: [RFR] 8217359: C2 compiler triggers SIGSEGV after tranformation
 in ConvI2LNode::Ideal
In-Reply-To: <32982b31-3a91-58fb-a6b8-b1cd9f7cdb41@oracle.com>
References: <DA41BE1DDCA941489001C7FBD7A8820ED5F3F998@dggeml527-mbx.china.huawei.com>
 <32982b31-3a91-58fb-a6b8-b1cd9f7cdb41@oracle.com>
Message-ID: <DA41BE1DDCA941489001C7FBD7A8820ED5F495BF@dggeml527-mbx.china.huawei.com>

Hi,

    Thanks for reviewing.  The regression test is added. 
    New webrev: http://cr.openjdk.java.net/~fyang/8217359/webrev.01/ 
    This is committed to the submit repo: http://hg.openjdk.java.net/jdk/submit/rev/7345adfbc913

    The email I got shows that it passed the Oralce internal tests:
    =================================================
    Build Details: 2019-01-21-1210078.felix.yang.source
    0 Failed Tests
    Mach5 Tasks Results Summary
    ?	EXECUTED_WITH_FAILURE: 0
    ?	NA: 0
    ?	KILLED: 0
    ?	UNABLE_TO_RUN: 0
    ?	PASSED: 76
    ?	FAILED: 0
    =================================================

    OK to push?

Thanks for your help,
Felix

> 
> Hi Felix,
> 
> Could you please add the regression test as jtreg test?
> 
> Otherwise, the fix looks reasonable to me. Nice analysis!
> 
> Thanks,
> Tobias


From fairoz.matte at oracle.com  Tue Jan 22 03:35:16 2019
From: fairoz.matte at oracle.com (Fairoz Matte)
Date: Mon, 21 Jan 2019 19:35:16 -0800 (PST)
Subject: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
 com.sun.crypto.provider.CipherBlockChaining
Message-ID: <be6b6aaa-d3e9-4ee4-a139-93d036c47ae8@default>

Hi,

Please review the following patch,
JBS bug - https://bugs.openjdk.java.net/browse/JDK-8209951 
Webrev - http://cr.openjdk.java.net/~fmatte/8209951/webrev.00/

During the call to assembled stub code generate_cipherBlockChaining_decryptAESCrypt_Parallel() 
there was reference to G6 register used for temporary storage of F50, 
as G6 is not saved on stack it was resulting in garbage during retrieval.

Solution is to use unused local register (L6) for temporary storage and retrieval of F50.

Thanks,
Fairoz

From tobias.hartmann at oracle.com  Tue Jan 22 08:00:21 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 22 Jan 2019 09:00:21 +0100
Subject: RFR: 8217445: [JVMCI] incorrect management of JVMCI compilation
 failure reason string
In-Reply-To: <6E1B238A-8546-4163-A3E5-D155AF18EB47@oracle.com>
References: <6E1B238A-8546-4163-A3E5-D155AF18EB47@oracle.com>
Message-ID: <0998110b-082b-82e3-521b-555af94d5827@oracle.com>

Hi Doug,

looks good to me too.

Best regards,
Tobias

On 21.01.19 13:57, Doug Simon wrote:
> The CompileTask::_failure_reason field assumes it is only ever assigned a compile-time constant string value (i.e. never needs to be freed). This is not the case when the value is derived from a JVMCI exception message. This patch adds support for managing a C heap allocated value in this field.
> 
> https://bugs.openjdk.java.net/browse/JDK-8217445
> http://cr.openjdk.java.net/~dnsimon/8217445
> 
> -Doug
> 

From tobias.hartmann at oracle.com  Tue Jan 22 08:04:10 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 22 Jan 2019 09:04:10 +0100
Subject: [RFR] 8217359: C2 compiler triggers SIGSEGV after tranformation
 in ConvI2LNode::Ideal
In-Reply-To: <DA41BE1DDCA941489001C7FBD7A8820ED5F495BF@dggeml527-mbx.china.huawei.com>
References: <DA41BE1DDCA941489001C7FBD7A8820ED5F3F998@dggeml527-mbx.china.huawei.com>
 <32982b31-3a91-58fb-a6b8-b1cd9f7cdb41@oracle.com>
 <DA41BE1DDCA941489001C7FBD7A8820ED5F495BF@dggeml527-mbx.china.huawei.com>
Message-ID: <d5a75c56-b8c5-e215-1485-4274c95236b6@oracle.com>

Hi Felix,

this looks good to me, thanks for adding the test!

A second review would be good. In the meantime, please request approval for integration into JDK 12
according to:
http://openjdk.java.net/jeps/3#Fix-Request-Process

Thanks,
Tobias

On 22.01.19 02:17, Yangfei (Felix) wrote:
> Hi,
> 
>     Thanks for reviewing.  The regression test is added. 
>     New webrev: http://cr.openjdk.java.net/~fyang/8217359/webrev.01/ 
>     This is committed to the submit repo: http://hg.openjdk.java.net/jdk/submit/rev/7345adfbc913
> 
>     The email I got shows that it passed the Oralce internal tests:
>     =================================================
>     Build Details: 2019-01-21-1210078.felix.yang.source
>     0 Failed Tests
>     Mach5 Tasks Results Summary
>     ?	EXECUTED_WITH_FAILURE: 0
>     ?	NA: 0
>     ?	KILLED: 0
>     ?	UNABLE_TO_RUN: 0
>     ?	PASSED: 76
>     ?	FAILED: 0
>     =================================================
> 
>     OK to push?
> 
> Thanks for your help,
> Felix
> 
>>
>> Hi Felix,
>>
>> Could you please add the regression test as jtreg test?
>>
>> Otherwise, the fix looks reasonable to me. Nice analysis!
>>
>> Thanks,
>> Tobias
> 

From tobias.hartmann at oracle.com  Tue Jan 22 08:22:16 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 22 Jan 2019 09:22:16 +0100
Subject: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
 com.sun.crypto.provider.CipherBlockChaining
In-Reply-To: <be6b6aaa-d3e9-4ee4-a139-93d036c47ae8@default>
References: <be6b6aaa-d3e9-4ee4-a139-93d036c47ae8@default>
Message-ID: <9d49a648-50e6-30cd-5705-2997f658d8e8@oracle.com>

Hi Fairoz,

this looks good to me.

Thanks,
Tobias

On 22.01.19 04:35, Fairoz Matte wrote:
> Hi,
> 
> Please review the following patch,
> JBS bug - https://bugs.openjdk.java.net/browse/JDK-8209951 
> Webrev - http://cr.openjdk.java.net/~fmatte/8209951/webrev.00/
> 
> During the call to assembled stub code generate_cipherBlockChaining_decryptAESCrypt_Parallel() 
> there was reference to G6 register used for temporary storage of F50, 
> as G6 is not saved on stack it was resulting in garbage during retrieval.
> 
> Solution is to use unused local register (L6) for temporary storage and retrieval of F50.
> 
> Thanks,
> Fairoz
> 

From Nick.Gasson at arm.com  Tue Jan 22 09:10:15 2019
From: Nick.Gasson at arm.com (Nick Gasson (Arm Technology China))
Date: Tue, 22 Jan 2019 09:10:15 +0000
Subject: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
 locking optimisation not triggered
In-Reply-To: <ad79d164-6c82-34bd-4a98-3584792958e6@redhat.com>
References: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
 <f1d6de8b-a308-a119-57a5-c62fbacb4e88@redhat.com>
 <4a09e8b7-9990-aa66-0afb-bf4e41cab831@arm.com>
 <a3fb9ae8-42f4-826c-ad22-32be5e9de24d@redhat.com>
 <79118967-c5b6-ca5c-7c6b-4adb80a4ed60@arm.com>
 <f115fa0c-e3a5-549c-4ec4-4fbfc2d47f6a@redhat.com>
 <a20904db-429c-0739-68fa-bc24f2b13972@arm.com>
 <ad79d164-6c82-34bd-4a98-3584792958e6@redhat.com>
Message-ID: <62b9e1c3-7c76-c3a2-0a8e-4e3ce4f79d9b@arm.com>

Hi,

On 21/01/2019 20:27, Andrew Haley wrote:
> 
> OK, if that's your position: you're writing the patch. Using cmpxhg
> everywhere will make that rather twisted code much easier to read.
> 

Please see the updated webrev to use cmpxchg in both the lock and unlock 
functions:

http://cr.openjdk.java.net/~ngasson/8217368/webrev.1/

Also includes Derek's cleanup suggestions (although some of them are not 
applicable now).

Testing I've done on this:

* Ran jtreg with assertions enabled (+UseLSE)

* Ran jcstress with both +UseLSE and -UseLSE

* Ran the JMH LockUnlock benchmarks with -UseBiasedLocking to check for 
performance regressions.

The directory below contains the the generated assembly from each webrev 
and current hg tip for this simple method:

http://cr.openjdk.java.net/~ngasson/8217368/generated/

     private Object obj = new Object();
     public int x;

     private void incX() {
         synchronized (obj) {
             x++;
         }
     }

The output of webrev.1 looks OK to me. Any other suggestions of things 
to test?

Thanks,
Nick

From dmitry.chuyko at bell-sw.com  Tue Jan 22 09:31:26 2019
From: dmitry.chuyko at bell-sw.com (Dmitry Chuyko)
Date: Tue, 22 Jan 2019 12:31:26 +0300
Subject: [aarch64-port-dev ] RFR(S): 8216259: AArch64: Vectorize Adler32
 intrinsics
In-Reply-To: <7b071ae1-7bf5-9d9a-f5ef-2b5d26d57de3@bell-sw.com>
References: <DB7PR08MB3115B20F823E5EAF0BA9C327969F0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <e17eea64-a72b-468a-8aa0-abc602bef51f@redhat.com>
 <8b95459f-4acd-729b-5174-670460b76c58@redhat.com>
 <7b071ae1-7bf5-9d9a-f5ef-2b5d26d57de3@bell-sw.com>
Message-ID: <5da5933d-aa7f-ad0a-2c60-f3c0e500465a@bell-sw.com>

TX2 data for the patch:

64 B. 1.5x speedup
512 B. 2x speedup
1 MB. 2.2x speedup!

-Dmitry

On 1/21/19 5:11 PM, Dmitry Chuyko wrote:
> Adler32 may be chosen as HDFS checksum. Hadoop uses 512 byte blocks by 
> default.
>
> I see some speedups on Cavium Thunder X (1st gen, TX2 data later) with 
> provided patch:
>
> 64 B. 8%
> 512 B. 10%
> 1 MB. 10%.
>
>
> .........................

From aph at redhat.com  Tue Jan 22 09:36:13 2019
From: aph at redhat.com (Andrew Haley)
Date: Tue, 22 Jan 2019 09:36:13 +0000
Subject: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
 locking optimisation not triggered
In-Reply-To: <62b9e1c3-7c76-c3a2-0a8e-4e3ce4f79d9b@arm.com>
References: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
 <f1d6de8b-a308-a119-57a5-c62fbacb4e88@redhat.com>
 <4a09e8b7-9990-aa66-0afb-bf4e41cab831@arm.com>
 <a3fb9ae8-42f4-826c-ad22-32be5e9de24d@redhat.com>
 <79118967-c5b6-ca5c-7c6b-4adb80a4ed60@arm.com>
 <f115fa0c-e3a5-549c-4ec4-4fbfc2d47f6a@redhat.com>
 <a20904db-429c-0739-68fa-bc24f2b13972@arm.com>
 <ad79d164-6c82-34bd-4a98-3584792958e6@redhat.com>
 <62b9e1c3-7c76-c3a2-0a8e-4e3ce4f79d9b@arm.com>
Message-ID: <fe2945df-24ee-71db-4591-2159a112fe48@redhat.com>

Hi,

On 1/22/19 9:10 AM, Nick Gasson (Arm Technology China) wrote:
> 
> Please see the updated webrev to use cmpxchg in both the lock and unlock 
> functions:
> 
> http://cr.openjdk.java.net/~ngasson/8217368/webrev.1/
> 
> Also includes Derek's cleanup suggestions (although some of them are not 
> applicable now).
> 
> Testing I've done on this:
> 
> * Ran jtreg with assertions enabled (+UseLSE)
> 
> * Ran jcstress with both +UseLSE and -UseLSE
> 
> * Ran the JMH LockUnlock benchmarks with -UseBiasedLocking to check for 
> performance regressions.
> 
> The directory below contains the the generated assembly from each webrev 
> and current hg tip for this simple method:
> 
> http://cr.openjdk.java.net/~ngasson/8217368/generated/

Excellent, thanks for that. Otherwise I'd have had to generate these myself.

>      private Object obj = new Object();
>      public int x;
> 
>      private void incX() {
>          synchronized (obj) {
>              x++;
>          }
>      }
> 
> The output of webrev.1 looks OK to me. Any other suggestions of things 
> to test?

That looks right, thanks. It's extremely difficult to test this stuff in practice.
Does any of the above stress test  recursive locking in the presence of many threads?

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From Nick.Gasson at arm.com  Tue Jan 22 10:15:34 2019
From: Nick.Gasson at arm.com (Nick Gasson (Arm Technology China))
Date: Tue, 22 Jan 2019 10:15:34 +0000
Subject: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
 locking optimisation not triggered
In-Reply-To: <fe2945df-24ee-71db-4591-2159a112fe48@redhat.com>
References: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
 <f1d6de8b-a308-a119-57a5-c62fbacb4e88@redhat.com>
 <4a09e8b7-9990-aa66-0afb-bf4e41cab831@arm.com>
 <a3fb9ae8-42f4-826c-ad22-32be5e9de24d@redhat.com>
 <79118967-c5b6-ca5c-7c6b-4adb80a4ed60@arm.com>
 <f115fa0c-e3a5-549c-4ec4-4fbfc2d47f6a@redhat.com>
 <a20904db-429c-0739-68fa-bc24f2b13972@arm.com>
 <ad79d164-6c82-34bd-4a98-3584792958e6@redhat.com>
 <62b9e1c3-7c76-c3a2-0a8e-4e3ce4f79d9b@arm.com>
 <fe2945df-24ee-71db-4591-2159a112fe48@redhat.com>
Message-ID: <f9b13e71-3f4f-1539-2c55-73d26a0be153@arm.com>

Hi Andrew

On 22/01/2019 17:36, Andrew Haley wrote:
> Does any of the above stress test  recursive locking in the presence of many threads?
> 

I can't immediately find anything in jcstress that does this (although I 
might not be looking in the right place).

If you do:

make test TEST="micro:LockUnlock.testRecursiveSynchronizationNoBias" 
MICRO="OPTIONS=-t 10"

It will run that recursive JMH case with 10 threads. In this case the 
lock will be inflated and as we don't have a fast-path for 
recursive-monitor we will call into the runtime for each recursive 
monitorenter/exit. The JMH test isn't checking for correctness but we at 
least don't hit any assertions in a fastdebug build.

Thanks,
Nick

From shade at redhat.com  Tue Jan 22 10:27:34 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Tue, 22 Jan 2019 11:27:34 +0100
Subject: RFR [12] (XS): http://cr.openjdk.java.net/~shade/8217467/webrev.01/
Message-ID: <2cf6bd9e-d73f-c4b3-725d-aba3f5ed08c3@redhat.com>

Bug:
  https://bugs.openjdk.java.net/browse/JDK-8217467

Fix:
  http://cr.openjdk.java.net/~shade/8217467/webrev.01/

This is found and verified by Shenandoah CTW tests that verifies barrier placement. Base64 intrinsic
is new, and only enabled on modern hardware (I think you need AVX512). I'd like to push this fix to
jdk12.

Testing: Shenandoah CTW tests, hotspot tier1 (includes compiler/intrinsics/base64), jdk-submit12
(running)

Thanks,
-Aleksey

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190122/6dd17955/signature.asc>

From shade at redhat.com  Tue Jan 22 10:28:45 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Tue, 22 Jan 2019 11:28:45 +0100
Subject: RFR [12] 8217467 (XS): Access barriers are missing in C2
 intrinsic for Base64
In-Reply-To: <2cf6bd9e-d73f-c4b3-725d-aba3f5ed08c3@redhat.com>
References: <2cf6bd9e-d73f-c4b3-725d-aba3f5ed08c3@redhat.com>
Message-ID: <99f5f7c8-0747-2cae-f8e5-e7d05358efcf@redhat.com>

(correct title)

On 1/22/19 11:27 AM, Aleksey Shipilev wrote:
> Bug:
>   https://bugs.openjdk.java.net/browse/JDK-8217467
> 
> Fix:
>   http://cr.openjdk.java.net/~shade/8217467/webrev.01/
> 
> This is found and verified by Shenandoah CTW tests that verifies barrier placement. Base64 intrinsic
> is new, and only enabled on modern hardware (I think you need AVX512). I'd like to push this fix to
> jdk12.
> 
> Testing: Shenandoah CTW tests, hotspot tier1 (includes compiler/intrinsics/base64), jdk-submit12
> (running)
> 
> Thanks,
> -Aleksey
> 


-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190122/a950983d/signature.asc>

From Pengfei.Li at arm.com  Tue Jan 22 10:32:00 2019
From: Pengfei.Li at arm.com (Pengfei Li (Arm Technology China))
Date: Tue, 22 Jan 2019 10:32:00 +0000
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <75d28ca7-9e80-4bd4-11a6-c858048e4380@bell-sw.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
 <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <efb075f1-5815-b2e0-31eb-3dfb8a841528@bell-sw.com>
 <07582b62-ccdf-97c8-5bd9-f441b488fa03@bell-sw.com>
 <79558b49-6375-f0d4-1278-f66a4f470b13@redhat.com>
 <75d28ca7-9e80-4bd4-11a6-c858048e4380@bell-sw.com>
Message-ID: <DB7PR08MB3115899A06AB5EE77C89887896980@DB7PR08MB3115.eurprd08.prod.outlook.com>

Hi Dmitrij,

I (not a reviewer) tested your single line fix and it looks ok to me.

Also I bump the priority of the JBS (https://bugs.openjdk.java.net/browse/JDK-8215792) to P2 and hope the fix could be in JDK 12. (The door of JDK 12 will be closed soon)

> Indeed it is hard to review complex algorithms. The Boyer-Moore comments
> you referenced were updated as part of the original webrev to describe
> changes in algorithm E, which is in macroAssembler_aarch64.cpp. I once
> asked to validate the level of comments with you during pow function review
> [3]. If this is the level of comments you find reasonable, I?ll be happy to
> improve it here and elsewhere to this level.

When I was trying to fix this bug, I found it pretty easy to get lost among branches in the code. And other engineers from Arm looking at the AArch64 intrinsics have the similar feeling. So I'd also strongly recommend you write more explanations in comments. In this String.indexOf(str) intrinsic, as there are a lot of paths for different length conditions, I have another suggestion of adding the conditions of path A to G you wrote in your last email into comments.

--
Thanks,
Pengfei


From tobias.hartmann at oracle.com  Tue Jan 22 10:33:10 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 22 Jan 2019 11:33:10 +0100
Subject: RFR [12] 8217467 (XS): Access barriers are missing in C2
 intrinsic for Base64
In-Reply-To: <99f5f7c8-0747-2cae-f8e5-e7d05358efcf@redhat.com>
References: <2cf6bd9e-d73f-c4b3-725d-aba3f5ed08c3@redhat.com>
 <99f5f7c8-0747-2cae-f8e5-e7d05358efcf@redhat.com>
Message-ID: <f878efe5-4cd2-2d98-8e36-9bab6c552653@oracle.com>

Hi Aleksey,

looks good to me.

Best regards,
Tobias

On 22.01.19 11:28, Aleksey Shipilev wrote:
> (correct title)
> 
> On 1/22/19 11:27 AM, Aleksey Shipilev wrote:
>> Bug:
>>   https://bugs.openjdk.java.net/browse/JDK-8217467
>>
>> Fix:
>>   http://cr.openjdk.java.net/~shade/8217467/webrev.01/
>>
>> This is found and verified by Shenandoah CTW tests that verifies barrier placement. Base64 intrinsic
>> is new, and only enabled on modern hardware (I think you need AVX512). I'd like to push this fix to
>> jdk12.
>>
>> Testing: Shenandoah CTW tests, hotspot tier1 (includes compiler/intrinsics/base64), jdk-submit12
>> (running)
>>
>> Thanks,
>> -Aleksey
>>
> 
> 

From adinn at redhat.com  Tue Jan 22 10:44:56 2019
From: adinn at redhat.com (Andrew Dinn)
Date: Tue, 22 Jan 2019 10:44:56 +0000
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <DB7PR08MB3115899A06AB5EE77C89887896980@DB7PR08MB3115.eurprd08.prod.outlook.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
 <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <efb075f1-5815-b2e0-31eb-3dfb8a841528@bell-sw.com>
 <07582b62-ccdf-97c8-5bd9-f441b488fa03@bell-sw.com>
 <79558b49-6375-f0d4-1278-f66a4f470b13@redhat.com>
 <75d28ca7-9e80-4bd4-11a6-c858048e4380@bell-sw.com>
 <DB7PR08MB3115899A06AB5EE77C89887896980@DB7PR08MB3115.eurprd08.prod.outlook.com>
Message-ID: <3950542d-bc5a-3937-27e8-8b48d6f6e875@redhat.com>

On 22/01/2019 10:32, Pengfei Li (Arm Technology China) wrote:
> I (not a reviewer) tested your single line fix and it looks ok to
> me.
> 
> Also I bump the priority of the JBS
> (https://bugs.openjdk.java.net/browse/JDK-8215792) to P2 and hope the
> fix could be in JDK 12. (The door of JDK 12 will be closed soon)

That's not really needed while we are in Rampdown Phase 1. However, I
agree that P2 is actually appropriate for this bug.

The fix can be pushed to the jdk12 repo. However, the bug needs to have
its fix version set accordingly (which I have just done).

regards,


Andrew Dinn
-----------
Senior Principal Software Engineer
Red Hat UK Ltd
Registered in England and Wales under Company Registration No. 03798903
Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From rkennke at redhat.com  Tue Jan 22 10:48:50 2019
From: rkennke at redhat.com (Roman Kennke)
Date: Tue, 22 Jan 2019 11:48:50 +0100
Subject: RFR [12] 8217467 (XS): Access barriers are missing in C2
 intrinsic for Base64
In-Reply-To: <99f5f7c8-0747-2cae-f8e5-e7d05358efcf@redhat.com>
References: <2cf6bd9e-d73f-c4b3-725d-aba3f5ed08c3@redhat.com>
 <99f5f7c8-0747-2cae-f8e5-e7d05358efcf@redhat.com>
Message-ID: <ae3f8faf-5634-b562-a0a9-44a2744ca526@redhat.com>

Looks good. Thanks!

Roman


> (correct title)
> 
> On 1/22/19 11:27 AM, Aleksey Shipilev wrote:
>> Bug:
>>   https://bugs.openjdk.java.net/browse/JDK-8217467
>>
>> Fix:
>>   http://cr.openjdk.java.net/~shade/8217467/webrev.01/
>>
>> This is found and verified by Shenandoah CTW tests that verifies barrier placement. Base64 intrinsic
>> is new, and only enabled on modern hardware (I think you need AVX512). I'd like to push this fix to
>> jdk12.
>>
>> Testing: Shenandoah CTW tests, hotspot tier1 (includes compiler/intrinsics/base64), jdk-submit12
>> (running)
>>
>> Thanks,
>> -Aleksey
>>
> 
> 

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190122/036a2863/signature.asc>

From tobias.hartmann at oracle.com  Tue Jan 22 10:54:19 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 22 Jan 2019 11:54:19 +0100
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <3950542d-bc5a-3937-27e8-8b48d6f6e875@redhat.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
 <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <efb075f1-5815-b2e0-31eb-3dfb8a841528@bell-sw.com>
 <07582b62-ccdf-97c8-5bd9-f441b488fa03@bell-sw.com>
 <79558b49-6375-f0d4-1278-f66a4f470b13@redhat.com>
 <75d28ca7-9e80-4bd4-11a6-c858048e4380@bell-sw.com>
 <DB7PR08MB3115899A06AB5EE77C89887896980@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <3950542d-bc5a-3937-27e8-8b48d6f6e875@redhat.com>
Message-ID: <59adabc9-3b9f-c96b-3220-bed4998c4e7b@oracle.com>

Hi,

On 22.01.19 11:44, Andrew Dinn wrote:
> That's not really needed while we are in Rampdown Phase 1. However, I
> agree that P2 is actually appropriate for this bug.

Actually, it *is* required because we are in Rampdown Phase 2 now:
https://mail.openjdk.java.net/pipermail/jdk-dev/2019-January/002537.html

and therefore only P1 and P2 bugs with approval can be integrated:
http://openjdk.java.net/jeps/3

> The fix can be pushed to the jdk12 repo. However, the bug needs to have
> its fix version set accordingly (which I have just done).

Yes and approval is required!
http://openjdk.java.net/jeps/3#Fix-Request-Process

Thanks,
Tobias

From Pengfei.Li at arm.com  Tue Jan 22 11:03:20 2019
From: Pengfei.Li at arm.com (Pengfei Li (Arm Technology China))
Date: Tue, 22 Jan 2019 11:03:20 +0000
Subject: [aarch64-port-dev ] RFR(S): 8216259: AArch64: Vectorize Adler32
 intrinsics
In-Reply-To: <e17eea64-a72b-468a-8aa0-abc602bef51f@redhat.com>
References: <DB7PR08MB3115B20F823E5EAF0BA9C327969F0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <e17eea64-a72b-468a-8aa0-abc602bef51f@redhat.com>
Message-ID: <DB7PR08MB31154BFFBE55B84C0780172B96980@DB7PR08MB3115.eurprd08.prod.outlook.com>

Hi Andrew Haley,

> Fair enough; it does look like an improvement. However, please show us the
> actual numbers, especially at small sizes. Also, how much is the
> Adler32 checksum actually used? Is it something we care about?

I updated my JMH case (http://cr.openjdk.java.net/~pli/rfr/8216259/TestAdler32.java) with some small sizes added. Please see the results below.

Before patch:
Benchmark                      (count)  Mode  Cnt   Score    Error  Units
TestAdler32.testAdler32Update       64  avgt   15   0.047 ?  0.001  us/op
TestAdler32.testAdler32Update      128  avgt   15   0.084 ?  0.001  us/op
TestAdler32.testAdler32Update      256  avgt   15   0.157 ?  0.001  us/op
TestAdler32.testAdler32Update      512  avgt   15   0.313 ?  0.001  us/op
TestAdler32.testAdler32Update     1024  avgt   15   0.607 ?  0.002  us/op
TestAdler32.testAdler32Update     2048  avgt   15   1.195 ?  0.003  us/op
TestAdler32.testAdler32Update     4096  avgt   15   2.371 ?  0.005  us/op
TestAdler32.testAdler32Update     8192  avgt   15   4.936 ?  0.018  us/op
TestAdler32.testAdler32Update    16384  avgt   15   9.729 ?  0.116  us/op
TestAdler32.testAdler32Update    32768  avgt   15  19.332 ?  0.081  us/op
TestAdler32.testAdler32Update    65536  avgt   15  38.180 ?  0.098  us/op

After patch:
Benchmark                      (count)  Mode  Cnt   Score    Error  Units
TestAdler32.testAdler32Update       64  avgt   15   0.026 ?  0.001  us/op
TestAdler32.testAdler32Update      128  avgt   15   0.039 ?  0.001  us/op
TestAdler32.testAdler32Update      256  avgt   15   0.067 ?  0.001  us/op
TestAdler32.testAdler32Update      512  avgt   15   0.124 ?  0.001  us/op
TestAdler32.testAdler32Update     1024  avgt   15   0.232 ?  0.001  us/op
TestAdler32.testAdler32Update     2048  avgt   15   0.445 ?  0.001  us/op
TestAdler32.testAdler32Update     4096  avgt   15   0.873 ?  0.002  us/op
TestAdler32.testAdler32Update     8192  avgt   15   1.770 ?  0.010  us/op
TestAdler32.testAdler32Update    16384  avgt   15   3.658 ?  0.101  us/op
TestAdler32.testAdler32Update    32768  avgt   15   7.221 ?  0.043  us/op
TestAdler32.testAdler32Update    65536  avgt   15  14.353 ?  0.035  us/op

Dmitry Chuyko has just said it's used in Hadoop HDFS. I either don't know if any other applications, besides zlib, are using Adler-32.

--
Thanks,
Pengfei


From adinn at redhat.com  Tue Jan 22 11:10:36 2019
From: adinn at redhat.com (Andrew Dinn)
Date: Tue, 22 Jan 2019 11:10:36 +0000
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <59adabc9-3b9f-c96b-3220-bed4998c4e7b@oracle.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
 <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <efb075f1-5815-b2e0-31eb-3dfb8a841528@bell-sw.com>
 <07582b62-ccdf-97c8-5bd9-f441b488fa03@bell-sw.com>
 <79558b49-6375-f0d4-1278-f66a4f470b13@redhat.com>
 <75d28ca7-9e80-4bd4-11a6-c858048e4380@bell-sw.com>
 <DB7PR08MB3115899A06AB5EE77C89887896980@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <3950542d-bc5a-3937-27e8-8b48d6f6e875@redhat.com>
 <59adabc9-3b9f-c96b-3220-bed4998c4e7b@oracle.com>
Message-ID: <d725be89-5f53-7dc7-5a62-1b5b48aca2b3@redhat.com>

On 22/01/2019 10:54, Tobias Hartmann wrote:
> On 22.01.19 11:44, Andrew Dinn wrote:
>> That's not really needed while we are in Rampdown Phase 1. However, I
>> agree that P2 is actually appropriate for this bug.
> 
> Actually, it *is* required because we are in Rampdown Phase 2 now:
> https://mail.openjdk.java.net/pipermail/jdk-dev/2019-January/002537.html

Oops, yes. Sorry. I just found that post in my Trash folder!

> and therefore only P1 and P2 bugs with approval can be integrated:
> http://openjdk.java.net/jeps/3
> 
>> The fix can be pushed to the jdk12 repo. However, the bug needs to have
>> its fix version set accordingly (which I have just done).
> 
> Yes and approval is required!
> http://openjdk.java.net/jeps/3#Fix-Request-Process
Hmm, ok. Well, although this is definitely a bug I don't think it is
critical as it happens in relatively rare circumstances. So, I think it
needs pushing to jdk13 and then backporting to jdk12 after initial
release. I have reset the fix version to jdk13.

regards,


Andrew Dinn
-----------
Senior Principal Software Engineer
Red Hat UK Ltd
Registered in England and Wales under Company Registration No. 03798903
Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From dmitrij.pochepko at bell-sw.com  Tue Jan 22 11:35:28 2019
From: dmitrij.pochepko at bell-sw.com (Dmitrij Pochepko)
Date: Tue, 22 Jan 2019 14:35:28 +0300
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <d725be89-5f53-7dc7-5a62-1b5b48aca2b3@redhat.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
 <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <efb075f1-5815-b2e0-31eb-3dfb8a841528@bell-sw.com>
 <07582b62-ccdf-97c8-5bd9-f441b488fa03@bell-sw.com>
 <79558b49-6375-f0d4-1278-f66a4f470b13@redhat.com>
 <75d28ca7-9e80-4bd4-11a6-c858048e4380@bell-sw.com>
 <DB7PR08MB3115899A06AB5EE77C89887896980@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <3950542d-bc5a-3937-27e8-8b48d6f6e875@redhat.com>
 <59adabc9-3b9f-c96b-3220-bed4998c4e7b@oracle.com>
 <d725be89-5f53-7dc7-5a62-1b5b48aca2b3@redhat.com>
Message-ID: <39337100-90d6-9f8f-24f3-1a57c3b50cee@bell-sw.com>


On 22/01/2019 2:10 PM, Andrew Dinn wrote:
> On 22/01/2019 10:54, Tobias Hartmann wrote:
>> On 22.01.19 11:44, Andrew Dinn wrote:
>>> That's not really needed while we are in Rampdown Phase 1. However, I
>>> agree that P2 is actually appropriate for this bug.
>> Actually, it *is* required because we are in Rampdown Phase 2 now:
>> https://mail.openjdk.java.net/pipermail/jdk-dev/2019-January/002537.html
> Oops, yes. Sorry. I just found that post in my Trash folder!
>
>> and therefore only P1 and P2 bugs with approval can be integrated:
>> http://openjdk.java.net/jeps/3
>>
>>> The fix can be pushed to the jdk12 repo. However, the bug needs to have
>>> its fix version set accordingly (which I have just done).
>> Yes and approval is required!
>> http://openjdk.java.net/jeps/3#Fix-Request-Process
> Hmm, ok. Well, although this is definitely a bug I don't think it is
> critical as it happens in relatively rare circumstances. So, I think it
> needs pushing to jdk13 and then backporting to jdk12 after initial
> release. I have reset the fix version to jdk13.

I'll send updated webrev with tests and updated documentation (since I 
already has it and it doesn't affect code) hopefully in a few hours 
after final polishing.


Thanks,

Dmitrij

>
> regards,
>
>
> Andrew Dinn
> -----------
> Senior Principal Software Engineer
> Red Hat UK Ltd
> Registered in England and Wales under Company Registration No. 03798903
> Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From felix.yang at huawei.com  Tue Jan 22 12:03:14 2019
From: felix.yang at huawei.com (Yangfei (Felix))
Date: Tue, 22 Jan 2019 12:03:14 +0000
Subject: [RFR] 8217359: C2 compiler triggers SIGSEGV after tranformation
 in ConvI2LNode::Ideal
In-Reply-To: <d5a75c56-b8c5-e215-1485-4274c95236b6@oracle.com>
References: <DA41BE1DDCA941489001C7FBD7A8820ED5F3F998@dggeml527-mbx.china.huawei.com>
 <32982b31-3a91-58fb-a6b8-b1cd9f7cdb41@oracle.com>
 <DA41BE1DDCA941489001C7FBD7A8820ED5F495BF@dggeml527-mbx.china.huawei.com>
 <d5a75c56-b8c5-e215-1485-4274c95236b6@oracle.com>
Message-ID: <DA41BE1DDCA941489001C7FBD7A8820ED5F49892@dggeml527-mbx.china.huawei.com>

Hi,

    I have updated the JBS accordingly, requesting approval for integration into JDK 12. 
    May I have another reviewer please? 

Thanks for your help,
Felix


> Hi Felix,
> 
> this looks good to me, thanks for adding the test!
> 
> A second review would be good. In the meantime, please request approval for
> integration into JDK 12
> according to:
> http://openjdk.java.net/jeps/3#Fix-Request-Process
> 
> Thanks,
> Tobias
> 
> On 22.01.19 02:17, Yangfei (Felix) wrote:
> > Hi,
> >
> >     Thanks for reviewing.  The regression test is added.
> >     New webrev: http://cr.openjdk.java.net/~fyang/8217359/webrev.01/
> >     This is committed to the submit repo:
> http://hg.openjdk.java.net/jdk/submit/rev/7345adfbc913
> >
> >     The email I got shows that it passed the Oralce internal tests:
> >     =================================================
> >     Build Details: 2019-01-21-1210078.felix.yang.source
> >     0 Failed Tests
> >     Mach5 Tasks Results Summary
> >     ?	EXECUTED_WITH_FAILURE: 0
> >     ?	NA: 0
> >     ?	KILLED: 0
> >     ?	UNABLE_TO_RUN: 0
> >     ?	PASSED: 76
> >     ?	FAILED: 0
> >     =================================================
> >
> >     OK to push?
> >
> > Thanks for your help,
> > Felix
> >
> >>
> >> Hi Felix,
> >>
> >> Could you please add the regression test as jtreg test?
> >>
> >> Otherwise, the fix looks reasonable to me. Nice analysis!
> >>
> >> Thanks,
> >> Tobias
> >

From rwestrel at redhat.com  Tue Jan 22 14:01:05 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Tue, 22 Jan 2019 15:01:05 +0100
Subject: RFR [12] 8217467 (XS): Access barriers are missing in C2
 intrinsic for Base64
In-Reply-To: <99f5f7c8-0747-2cae-f8e5-e7d05358efcf@redhat.com>
References: <2cf6bd9e-d73f-c4b3-725d-aba3f5ed08c3@redhat.com>
 <99f5f7c8-0747-2cae-f8e5-e7d05358efcf@redhat.com>
Message-ID: <874la094gu.fsf@redhat.com>


>>   http://cr.openjdk.java.net/~shade/8217467/webrev.01/

Looks good to me too.

Roland.

From claes.redestad at oracle.com  Tue Jan 22 16:06:33 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Tue, 22 Jan 2019 17:06:33 +0100
Subject: RFR: 8217519: Improve RegMask population count calculation
Message-ID: <d4f2f4c9-fbe9-0859-e579-52b6e810d6d9@oracle.com>

Hi,

this patch extract the population count used in RegMask::Size() to a
utility method in share/utilities/population_count.hpp, as well as
adds a test that verifies this produces the same results as the existing
lookup table implementation.

Bug:    https://bugs.openjdk.java.net/browse/JDK-8217519
Webrev: http://cr.openjdk.java.net/~redestad/8217519/open.00/

This reduces instructions retired in RegMask::Size() by 50-60% in some
tests and profiles, which equates to a speedup of C2 by ~5% total. This
improves startup marginally in my tests.

Compiler intrinsics (such as gcc's __builtin_popcount()) would be
appealing, but that actually gives worse performance than this patch (on
current build configurations/setups available to me).

Testing: tier1-3 (ongoing, previous increments of the patch without
the gtest has been thoroughly tested)

Thanks!

/Claes

From Alan.Bateman at oracle.com  Tue Jan 22 16:12:22 2019
From: Alan.Bateman at oracle.com (Alan Bateman)
Date: Tue, 22 Jan 2019 16:12:22 +0000
Subject: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over
 non-volatile memory
In-Reply-To: <708555d0-d3e5-2d2c-f69d-16f76a83f66a@gmail.com>
References: <07d4d0fe-3a2a-414c-74ac-69e8527785d9@redhat.com>
 <df03ca64-d68a-ca8b-5309-395672ac7dcb@redhat.com>
 <09c92b1a-c6da-e16b-bb68-553876e8a6ea@oracle.com>
 <a68f718e-9f87-f7e5-515c-f63d827fedaa@redhat.com>
 <ed8fd92d-fd4e-dc30-af8c-8d30da37560c@oracle.com>
 <b19ba189-0141-3e25-8301-48f2d3ce3851@redhat.com>
 <b3f402a3-65ba-b776-cc9e-40ceaaeafefd@redhat.com>
 <f21dac0b-0043-7e66-72a4-c64a45fb59d3@redhat.com>
 <2a0a385d-81ee-df66-f147-e4dd9aa5b72e@oracle.com>
 <8b2ab749-20f1-8dd3-3cc7-64db5d45bc7d@redhat.com>
 <0aae37aa-7797-fde5-63d5-96c8eb961183@oracle.com>
 <86a1988a-a8d2-b6af-0985-11a94d6d76a5@redhat.com>
 <69510788-52e6-815b-1ed7-a6f4886d0398@oracle.com>
 <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>
 <34cfc530-8517-ac1a-0c04-446dc3dc2436@oracle.com>
 <21ef0e11-3f3d-e9a4-5dc6-898d4ac18efa@redhat.com>
 <a377e3da-549a-1ee1-a416-ef243545f92b@oracle.com>
 <aae5418e-388c-eb8e-6b7d-f9a513219a75@gmail.com>
 <708555d0-d3e5-2d2c-f69d-16f76a83f66a@gmail.com>
Message-ID: <5c8a7e85-bdb4-61f8-54ed-75689d0fcf16@oracle.com>

On 18/01/2019 14:28, Peter Levart wrote:
>
> ...unless you actually want users to construct their own MapMode(s), 
> like you mentioned is the case with FileChannel.open() and 
> FileAttribute interface. But there this makes sense because the 
> backend (FileSystem) is also pluggable, so users can define their own 
> FileSystem implementations that consume their own FileAttribute(s)...
>
> Are you proposing to add an spi for MappedByteBuffer's here? That 
> would be an overkill for this feature, I think...
No, we definitely don't want to go there as buffers are closed 
abstraction. Instead, this is just about allowing the JDK to support 
additional map modes beyond those specified by FileChannel.map. If you 
create your own MapMode and call the map method with it then it will be 
rejected, probably UOE. With the suggestion, a JDK-specific module would 
define READ_WRITE_SYNC and you could pass that mode to FileChannel.map. 
There's a bit of plumbing needed make that work but there are examples 
of this already (e.g. socket options and file open options).

-Alan.

From tobias.hartmann at oracle.com  Tue Jan 22 16:23:37 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 22 Jan 2019 17:23:37 +0100
Subject: RFR: 8217519: Improve RegMask population count calculation
In-Reply-To: <d4f2f4c9-fbe9-0859-e579-52b6e810d6d9@oracle.com>
References: <d4f2f4c9-fbe9-0859-e579-52b6e810d6d9@oracle.com>
Message-ID: <31198307-0db5-dd60-ac55-c0a79c35b064@oracle.com>

Hi Claes,

this looks good to me.

Best regards,
Tobias

On 22.01.19 17:06, Claes Redestad wrote:
> Hi,
> 
> this patch extract the population count used in RegMask::Size() to a
> utility method in share/utilities/population_count.hpp, as well as
> adds a test that verifies this produces the same results as the existing
> lookup table implementation.
> 
> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217519
> Webrev: http://cr.openjdk.java.net/~redestad/8217519/open.00/
> 
> This reduces instructions retired in RegMask::Size() by 50-60% in some
> tests and profiles, which equates to a speedup of C2 by ~5% total. This
> improves startup marginally in my tests.
> 
> Compiler intrinsics (such as gcc's __builtin_popcount()) would be
> appealing, but that actually gives worse performance than this patch (on
> current build configurations/setups available to me).
> 
> Testing: tier1-3 (ongoing, previous increments of the patch without
> the gtest has been thoroughly tested)
> 
> Thanks!
> 
> /Claes

From claes.redestad at oracle.com  Tue Jan 22 16:28:49 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Tue, 22 Jan 2019 17:28:49 +0100
Subject: RFR: 8217519: Improve RegMask population count calculation
In-Reply-To: <31198307-0db5-dd60-ac55-c0a79c35b064@oracle.com>
References: <d4f2f4c9-fbe9-0859-e579-52b6e810d6d9@oracle.com>
 <31198307-0db5-dd60-ac55-c0a79c35b064@oracle.com>
Message-ID: <5772e468-e607-24a8-895c-58c222bd2b11@oracle.com>

Tobias, thanks for reviewing (and sanity checking this and a few earlier
versions)!

On 2019-01-22 17:23, Tobias Hartmann wrote:
> Hi Claes,
> 
> this looks good to me

From vladimir.kozlov at oracle.com  Tue Jan 22 16:57:11 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 22 Jan 2019 08:57:11 -0800
Subject: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
 com.sun.crypto.provider.CipherBlockChaining
In-Reply-To: <9d49a648-50e6-30cd-5705-2997f658d8e8@oracle.com>
References: <be6b6aaa-d3e9-4ee4-a139-93d036c47ae8@default>
 <9d49a648-50e6-30cd-5705-2997f658d8e8@oracle.com>
Message-ID: <794dcbd2-60e0-e9f4-f9b5-f789e45d373a@oracle.com>

Yes, it is good.

Thanks,
Vladimir

On 1/22/19 12:22 AM, Tobias Hartmann wrote:
> Hi Fairoz,
> 
> this looks good to me.
> 
> Thanks,
> Tobias
> 
> On 22.01.19 04:35, Fairoz Matte wrote:
>> Hi,
>>
>> Please review the following patch,
>> JBS bug - https://bugs.openjdk.java.net/browse/JDK-8209951
>> Webrev - http://cr.openjdk.java.net/~fmatte/8209951/webrev.00/
>>
>> During the call to assembled stub code generate_cipherBlockChaining_decryptAESCrypt_Parallel()
>> there was reference to G6 register used for temporary storage of F50,
>> as G6 is not saved on stack it was resulting in garbage during retrieval.
>>
>> Solution is to use unused local register (L6) for temporary storage and retrieval of F50.
>>
>> Thanks,
>> Fairoz
>>

From aph at redhat.com  Tue Jan 22 16:58:52 2019
From: aph at redhat.com (Andrew Haley)
Date: Tue, 22 Jan 2019 16:58:52 +0000
Subject: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
 locking optimisation not triggered
In-Reply-To: <fe2945df-24ee-71db-4591-2159a112fe48@redhat.com>
References: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
 <f1d6de8b-a308-a119-57a5-c62fbacb4e88@redhat.com>
 <4a09e8b7-9990-aa66-0afb-bf4e41cab831@arm.com>
 <a3fb9ae8-42f4-826c-ad22-32be5e9de24d@redhat.com>
 <79118967-c5b6-ca5c-7c6b-4adb80a4ed60@arm.com>
 <f115fa0c-e3a5-549c-4ec4-4fbfc2d47f6a@redhat.com>
 <a20904db-429c-0739-68fa-bc24f2b13972@arm.com>
 <ad79d164-6c82-34bd-4a98-3584792958e6@redhat.com>
 <62b9e1c3-7c76-c3a2-0a8e-4e3ce4f79d9b@arm.com>
 <fe2945df-24ee-71db-4591-2159a112fe48@redhat.com>
Message-ID: <83dc55db-5e4e-5510-a172-efaeac351593@redhat.com>

On 1/22/19 9:36 AM, Andrew Haley wrote:
> Please see the updated webrev to use cmpxchg in both the lock and unlock 
> functions:
> 
> http://cr.openjdk.java.net/~ngasson/8217368/webrev.1/

OK.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From aph at redhat.com  Tue Jan 22 17:03:23 2019
From: aph at redhat.com (Andrew Haley)
Date: Tue, 22 Jan 2019 17:03:23 +0000
Subject: [aarch64-port-dev ] RFR(S): 8216259: AArch64: Vectorize Adler32
 intrinsics
In-Reply-To: <DB7PR08MB3115B20F823E5EAF0BA9C327969F0@DB7PR08MB3115.eurprd08.prod.outlook.com>
References: <DB7PR08MB3115B20F823E5EAF0BA9C327969F0@DB7PR08MB3115.eurprd08.prod.outlook.com>
Message-ID: <771c5094-aacb-d52c-437f-29aaf5f8f01a@redhat.com>

On 1/21/19 10:53 AM, Pengfei Li (Arm Technology China) wrote:
> Webrev: http://cr.openjdk.java.net/~pli/rfr/8216259/webrev.00/
> JBS: https://bugs.openjdk.java.net/browse/JDK-8216259

The patch checks out fine, but there's one thing I'd like you to do. Please don't
repeat this block of code:

3317     // Below is a vectorized implementation of updating s1 and s2 for 16 bytes.
3318     // We use b1, b2, ..., b16 to denote the 16 bytes loaded in each iteration.
3319     // In non-vectorized code, we update s1 and s2 as:
3320     //   s1 <- s1 + b1
3321     //   s2 <- s2 + s1
3322     //   s1 <- s1 + b2
3323     //   s2 <- s2 + b1
3324     //   ...
3325     //   s1 <- s1 + b16
3326     //   s2 <- s2 + s1
3327     // Putting above assignments together, we have:
3328     //   s1_new = s1 + b1 + b2 + ... + b16
3329     //   s2_new = s2 + (s1 + b1) + (s1 + b1 + b2) + ... + (s1 + b1 + b2 + ... + b16)
3330     //          = s2 + s1 * 16 + (b1 * 16 + b2 * 15 + ... + b16 * 1)
3331     //          = s2 + s1 * 16 + (b1, b2, ... b16) dot (16, 15, ... 1)
3332     __ ld1(vbytes, __ T16B, Address(__ post(buff, 16)));
3333
3334     // s2 = s2 + s1 * 16
3335     __ add(s2, s2, s1, Assembler::LSL, 4);
3336
3337     // vs1acc = b1 + b2 + b3 + ... + b16
3338     // vs2acc = (b1 * 16) + (b2 * 15) + (b3 * 14) + ... + (b16 * 1)
3339     __ umullv(vs2acc, __ T8B, vtable, vbytes);
3340     __ umlalv(vs2acc, __ T16B, vtable, vbytes);
3341     __ uaddlv(vs1acc, __ T16B, vbytes);
3342     __ uaddlv(vs2acc, __ T8H, vs2acc);
3343
3344     // s1 = s1 + vs1acc, s2 = s2 + vs2acc
3345     __ fmovd(temp0, vs1acc);
3346     __ fmovd(temp1, vs2acc);
3347     __ add(s1, s1, temp0);
3348     __ add(s2, s2, temp1);
3349
3350     __ subs(count, count, 16);
3351     __ br(Assembler::HS, L_nmax_loop);

Instead, please put it into a function (e.g. updateBytesCRC32C_inner)
and call it from updateBytesCRC32C. There's no point writing all this
stuff out twice.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From vladimir.kozlov at oracle.com  Tue Jan 22 17:04:08 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 22 Jan 2019 09:04:08 -0800
Subject: [RFR] 8217359: C2 compiler triggers SIGSEGV after tranformation
 in ConvI2LNode::Ideal
In-Reply-To: <DA41BE1DDCA941489001C7FBD7A8820ED5F49892@dggeml527-mbx.china.huawei.com>
References: <DA41BE1DDCA941489001C7FBD7A8820ED5F3F998@dggeml527-mbx.china.huawei.com>
 <32982b31-3a91-58fb-a6b8-b1cd9f7cdb41@oracle.com>
 <DA41BE1DDCA941489001C7FBD7A8820ED5F495BF@dggeml527-mbx.china.huawei.com>
 <d5a75c56-b8c5-e215-1485-4274c95236b6@oracle.com>
 <DA41BE1DDCA941489001C7FBD7A8820ED5F49892@dggeml527-mbx.china.huawei.com>
Message-ID: <35e45132-2187-16c8-22fb-17e61a117941@oracle.com>

Changes are good.

I approved the fix for jdk12 as HotSpot group lead.

Thanks,
Vladimir


On 1/22/19 4:03 AM, Yangfei (Felix) wrote:
> Hi,
> 
>      I have updated the JBS accordingly, requesting approval for integration into JDK 12.
>      May I have another reviewer please?
> 
> Thanks for your help,
> Felix
> 
> 
>> Hi Felix,
>>
>> this looks good to me, thanks for adding the test!
>>
>> A second review would be good. In the meantime, please request approval for
>> integration into JDK 12
>> according to:
>> http://openjdk.java.net/jeps/3#Fix-Request-Process
>>
>> Thanks,
>> Tobias
>>
>> On 22.01.19 02:17, Yangfei (Felix) wrote:
>>> Hi,
>>>
>>>      Thanks for reviewing.  The regression test is added.
>>>      New webrev: http://cr.openjdk.java.net/~fyang/8217359/webrev.01/
>>>      This is committed to the submit repo:
>> http://hg.openjdk.java.net/jdk/submit/rev/7345adfbc913
>>>
>>>      The email I got shows that it passed the Oralce internal tests:
>>>      =================================================
>>>      Build Details: 2019-01-21-1210078.felix.yang.source
>>>      0 Failed Tests
>>>      Mach5 Tasks Results Summary
>>>      ?	EXECUTED_WITH_FAILURE: 0
>>>      ?	NA: 0
>>>      ?	KILLED: 0
>>>      ?	UNABLE_TO_RUN: 0
>>>      ?	PASSED: 76
>>>      ?	FAILED: 0
>>>      =================================================
>>>
>>>      OK to push?
>>>
>>> Thanks for your help,
>>> Felix
>>>
>>>>
>>>> Hi Felix,
>>>>
>>>> Could you please add the regression test as jtreg test?
>>>>
>>>> Otherwise, the fix looks reasonable to me. Nice analysis!
>>>>
>>>> Thanks,
>>>> Tobias
>>>

From vladimir.kozlov at oracle.com  Tue Jan 22 17:30:49 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 22 Jan 2019 09:30:49 -0800
Subject: RFR: 8217519: Improve RegMask population count calculation
In-Reply-To: <31198307-0db5-dd60-ac55-c0a79c35b064@oracle.com>
References: <d4f2f4c9-fbe9-0859-e579-52b6e810d6d9@oracle.com>
 <31198307-0db5-dd60-ac55-c0a79c35b064@oracle.com>
Message-ID: <16301529-9bf3-9c92-a15a-251a4cbaa553@oracle.com>

Yes, this is good.

Thanks,
Vladimir

On 1/22/19 8:23 AM, Tobias Hartmann wrote:
> Hi Claes,
> 
> this looks good to me.
> 
> Best regards,
> Tobias
> 
> On 22.01.19 17:06, Claes Redestad wrote:
>> Hi,
>>
>> this patch extract the population count used in RegMask::Size() to a
>> utility method in share/utilities/population_count.hpp, as well as
>> adds a test that verifies this produces the same results as the existing
>> lookup table implementation.
>>
>> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217519
>> Webrev: http://cr.openjdk.java.net/~redestad/8217519/open.00/
>>
>> This reduces instructions retired in RegMask::Size() by 50-60% in some
>> tests and profiles, which equates to a speedup of C2 by ~5% total. This
>> improves startup marginally in my tests.
>>
>> Compiler intrinsics (such as gcc's __builtin_popcount()) would be
>> appealing, but that actually gives worse performance than this patch (on
>> current build configurations/setups available to me).
>>
>> Testing: tier1-3 (ongoing, previous increments of the patch without
>> the gtest has been thoroughly tested)
>>
>> Thanks!
>>
>> /Claes

From dmitrij.pochepko at bell-sw.com  Tue Jan 22 18:35:12 2019
From: dmitrij.pochepko at bell-sw.com (Dmitrij Pochepko)
Date: Tue, 22 Jan 2019 21:35:12 +0300
Subject: RFR(S): 8215792: AArch64: String.indexOf generates incorrect
 result
In-Reply-To: <39337100-90d6-9f8f-24f3-1a57c3b50cee@bell-sw.com>
References: <DB7PR08MB311571B5E49523B4BE60A7F1968D0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <32345571546521566@sas2-ce04c18c415c.qloud-c.yandex.net>
 <DB7PR08MB3115069837D808BC9688D49C968E0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <efb075f1-5815-b2e0-31eb-3dfb8a841528@bell-sw.com>
 <07582b62-ccdf-97c8-5bd9-f441b488fa03@bell-sw.com>
 <79558b49-6375-f0d4-1278-f66a4f470b13@redhat.com>
 <75d28ca7-9e80-4bd4-11a6-c858048e4380@bell-sw.com>
 <DB7PR08MB3115899A06AB5EE77C89887896980@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <3950542d-bc5a-3937-27e8-8b48d6f6e875@redhat.com>
 <59adabc9-3b9f-c96b-3220-bed4998c4e7b@oracle.com>
 <d725be89-5f53-7dc7-5a62-1b5b48aca2b3@redhat.com>
 <39337100-90d6-9f8f-24f3-1a57c3b50cee@bell-sw.com>
Message-ID: <56d93deb-252b-38f7-0cda-5b365dc3751e@bell-sw.com>

Hi,

please take a look at webrev.02: 
http://cr.openjdk.java.net/~dpochepk/8215792/webrev.02/

webrev.02 has more aarch64 tests and documentation added. Since tests 
are specifically for aarch64 implementation I've set requires tag to run 
it on aarch64 only. I ran these tests on linux-aarch64 machine to 
ensure? everything is fine and on linux-amd64 to ensure these tests are 
filtered out there.

I'm going to add such documentation and tests for other intrinsics as 
well as separate issues.

This patch is for jdk_jdk. I think it should be backported then into 
jdk12 and jdk11u

Thanks,

Dmitrij

On 22.01.2019 14:35, Dmitrij Pochepko wrote:
>
> On 22/01/2019 2:10 PM, Andrew Dinn wrote:
>> On 22/01/2019 10:54, Tobias Hartmann wrote:
>>> On 22.01.19 11:44, Andrew Dinn wrote:
>>>> That's not really needed while we are in Rampdown Phase 1. However, I
>>>> agree that P2 is actually appropriate for this bug.
>>> Actually, it *is* required because we are in Rampdown Phase 2 now:
>>> https://mail.openjdk.java.net/pipermail/jdk-dev/2019-January/002537.html 
>>>
>> Oops, yes. Sorry. I just found that post in my Trash folder!
>>
>>> and therefore only P1 and P2 bugs with approval can be integrated:
>>> http://openjdk.java.net/jeps/3
>>>
>>>> The fix can be pushed to the jdk12 repo. However, the bug needs to 
>>>> have
>>>> its fix version set accordingly (which I have just done).
>>> Yes and approval is required!
>>> http://openjdk.java.net/jeps/3#Fix-Request-Process
>> Hmm, ok. Well, although this is definitely a bug I don't think it is
>> critical as it happens in relatively rare circumstances. So, I think it
>> needs pushing to jdk13 and then backporting to jdk12 after initial
>> release. I have reset the fix version to jdk13.
>
> I'll send updated webrev with tests and updated documentation (since I 
> already has it and it doesn't affect code) hopefully in a few hours 
> after final polishing.
>
>
> Thanks,
>
> Dmitrij
>
>>
>> regards,
>>
>>
>> Andrew Dinn
>> -----------
>> Senior Principal Software Engineer
>> Red Hat UK Ltd
>> Registered in England and Wales under Company Registration No. 03798903
>> Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander


From derekw at marvell.com  Tue Jan 22 18:34:54 2019
From: derekw at marvell.com (Derek White)
Date: Tue, 22 Jan 2019 18:34:54 +0000
Subject: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive stack
 locking optimisation not triggered
In-Reply-To: <62b9e1c3-7c76-c3a2-0a8e-4e3ce4f79d9b@arm.com>
References: <895ba862-6c8e-486a-2eff-99057d692074@arm.com>
 <f1d6de8b-a308-a119-57a5-c62fbacb4e88@redhat.com>
 <4a09e8b7-9990-aa66-0afb-bf4e41cab831@arm.com>
 <a3fb9ae8-42f4-826c-ad22-32be5e9de24d@redhat.com>
 <79118967-c5b6-ca5c-7c6b-4adb80a4ed60@arm.com>
 <f115fa0c-e3a5-549c-4ec4-4fbfc2d47f6a@redhat.com>
 <a20904db-429c-0739-68fa-bc24f2b13972@arm.com>
 <ad79d164-6c82-34bd-4a98-3584792958e6@redhat.com>
 <62b9e1c3-7c76-c3a2-0a8e-4e3ce4f79d9b@arm.com>
Message-ID: <MN2PR18MB2733A77C9810C8B22662873CD2980@MN2PR18MB2733.namprd18.prod.outlook.com>

Looks great!

Thanks Nick,
 - Derek 

> -----Original Message-----
> From: aarch64-port-dev <aarch64-port-dev-bounces at openjdk.java.net> On
> Behalf Of Nick Gasson (Arm Technology China)
> Sent: Tuesday, January 22, 2019 4:10 AM
> To: Andrew Haley <aph at redhat.com>; hotspot-compiler-
> dev at openjdk.java.net compiler <hotspot-compiler-dev at openjdk.java.net>
> Cc: nd <nd at arm.com>; aarch64-port-dev at openjdk.java.net
> Subject: [EXT] Re: [aarch64-port-dev ] RFR: 8217368: AArch64: C2 recursive
> stack locking optimisation not triggered
> 
> External Email
> 
> ----------------------------------------------------------------------
> Hi,
> 
> On 21/01/2019 20:27, Andrew Haley wrote:
> >
> > OK, if that's your position: you're writing the patch. Using cmpxhg
> > everywhere will make that rather twisted code much easier to read.
> >
> 
> Please see the updated webrev to use cmpxchg in both the lock and unlock
> functions:
> 
> http://cr.openjdk.java.net/~ngasson/8217368/webrev.1/
> 
> Also includes Derek's cleanup suggestions (although some of them are not
> applicable now).
> 
> Testing I've done on this:
> 
> * Ran jtreg with assertions enabled (+UseLSE)
> 
> * Ran jcstress with both +UseLSE and -UseLSE
> 
> * Ran the JMH LockUnlock benchmarks with -UseBiasedLocking to check for
> performance regressions.
> 
> The directory below contains the the generated assembly from each webrev
> and current hg tip for this simple method:
> 
> http://cr.openjdk.java.net/~ngasson/8217368/generated/
> 
>      private Object obj = new Object();
>      public int x;
> 
>      private void incX() {
>          synchronized (obj) {
>              x++;
>          }
>      }
> 
> The output of webrev.1 looks OK to me. Any other suggestions of things to
> test?
> 
> Thanks,
> Nick

From vladimir.x.ivanov at oracle.com  Tue Jan 22 19:05:46 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Tue, 22 Jan 2019 11:05:46 -0800
Subject: [13] RFR (XS): 8202952: C2: Unexpected dead nodes after matching
Message-ID: <e6fc835a-904e-ab45-28d6-5b5ae82c53de@oracle.com>

http://cr.openjdk.java.net/~vlivanov/8202952/webrev.00/
https://bugs.openjdk.java.net/browse/JDK-8202952

The crash happens when PhaseCFG encounters a dead MachNode in the graph.
The problematic node is a leftover from matching of an instruction with 
a duplicated memory operand (sarI_mem_CL [1] in that particular case).

Address has the following shape [2]:
   AddP (AddP DecodeN (LShiftL ConvI2L ConI)) ConL

It could be subsumed into complex addressing expression, but the 
constant is too large (doesn't fit into immL32). So, matcher has to 
compute inner address expression separately and put it into a register.

Since memory operand is duplicated, 2 copies are materialized during 
matching, but as part of ::Expand() one of the copies is eliminated, 
thus leaving a dead mach node in the IR (for the address expression).

The fix is to adjust Matcher::clone_address_expressions() to avoid 
cloning inner AddP when constant offset is too large.

Testing: hs-precheckin-comp, hs-tier1, hs-tier2

Best regards,
Vladimir Ivanov

[1] instruct sarI_mem_CL(memory dst, rcx_RegI shift, rFlagsReg cr)
%{
   match(Set dst (StoreI dst (RShiftI (LoadI dst) shift)));


[2]
  o347 AddP  === _ o2181 o1768 o1769  [[o349 o371 ]]
     o1768 AddP  === _ o2181 o2181 o1765  [[o347 ]]
         o2181 DecodeN === _ o287  [[o1768 o1768 o327 o347 o327 ]] 
#int[int:>=0]:NotNull:exact *
         o1765 LShiftL === _ o1761 o60  [[o1768 ]]
             o1761 ConvI2L === _ o1741  [[o1765 ]] 
#long:maxint-51..maxint-48
             o60   ConI  === o0  [[o61 o1765 o1434 o2013 o1631 o2017 
o1808  60 ]]  #int:2
     o1769 ConL  === o0  [[o347 ]]  #long:-8589932784

From vladimir.kozlov at oracle.com  Tue Jan 22 19:56:03 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 22 Jan 2019 11:56:03 -0800
Subject: RFR [12] 8217467 (XS): Access barriers are missing in C2
 intrinsic for Base64
In-Reply-To: <ae3f8faf-5634-b562-a0a9-44a2744ca526@redhat.com>
References: <2cf6bd9e-d73f-c4b3-725d-aba3f5ed08c3@redhat.com>
 <99f5f7c8-0747-2cae-f8e5-e7d05358efcf@redhat.com>
 <ae3f8faf-5634-b562-a0a9-44a2744ca526@redhat.com>
Message-ID: <42b2473e-50d4-a438-ad47-f6c3e216ae07@oracle.com>

Yes, changes are good. I approved it for push into JDK 12.

Thanks,
Vladimir

On 1/22/19 2:48 AM, Roman Kennke wrote:
> Looks good. Thanks!
> 
> Roman
> 
> 
>> (correct title)
>>
>> On 1/22/19 11:27 AM, Aleksey Shipilev wrote:
>>> Bug:
>>>    https://bugs.openjdk.java.net/browse/JDK-8217467
>>>
>>> Fix:
>>>    http://cr.openjdk.java.net/~shade/8217467/webrev.01/
>>>
>>> This is found and verified by Shenandoah CTW tests that verifies barrier placement. Base64 intrinsic
>>> is new, and only enabled on modern hardware (I think you need AVX512). I'd like to push this fix to
>>> jdk12.
>>>
>>> Testing: Shenandoah CTW tests, hotspot tier1 (includes compiler/intrinsics/base64), jdk-submit12
>>> (running)
>>>
>>> Thanks,
>>> -Aleksey
>>>
>>
>>
> 

From vladimir.kozlov at oracle.com  Tue Jan 22 19:54:11 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 22 Jan 2019 11:54:11 -0800
Subject: [13] RFR (XS): 8202952: C2: Unexpected dead nodes after matching
In-Reply-To: <e6fc835a-904e-ab45-28d6-5b5ae82c53de@oracle.com>
References: <e6fc835a-904e-ab45-28d6-5b5ae82c53de@oracle.com>
Message-ID: <9bf3ac2c-5881-576c-cc64-917cec246f8f@oracle.com>

The fix is different from what we discussed.
Can you explain how it helps?

Thanks,
Vladimir K

On 1/22/19 11:05 AM, Vladimir Ivanov wrote:
> http://cr.openjdk.java.net/~vlivanov/8202952/webrev.00/
> https://bugs.openjdk.java.net/browse/JDK-8202952
> 
> The crash happens when PhaseCFG encounters a dead MachNode in the graph.
> The problematic node is a leftover from matching of an instruction with a duplicated memory operand (sarI_mem_CL [1] in 
> that particular case).
> 
> Address has the following shape [2]:
>  ? AddP (AddP DecodeN (LShiftL ConvI2L ConI)) ConL
> 
> It could be subsumed into complex addressing expression, but the constant is too large (doesn't fit into immL32). So, 
> matcher has to compute inner address expression separately and put it into a register.
> 
> Since memory operand is duplicated, 2 copies are materialized during matching, but as part of ::Expand() one of the 
> copies is eliminated, thus leaving a dead mach node in the IR (for the address expression).
> 
> The fix is to adjust Matcher::clone_address_expressions() to avoid cloning inner AddP when constant offset is too large.
> 
> Testing: hs-precheckin-comp, hs-tier1, hs-tier2
> 
> Best regards,
> Vladimir Ivanov
> 
> [1] instruct sarI_mem_CL(memory dst, rcx_RegI shift, rFlagsReg cr)
> %{
>  ? match(Set dst (StoreI dst (RShiftI (LoadI dst) shift)));
> 
> 
> [2]
>  ?o347 AddP? === _ o2181 o1768 o1769? [[o349 o371 ]]
>  ??? o1768 AddP? === _ o2181 o2181 o1765? [[o347 ]]
>  ??????? o2181 DecodeN === _ o287? [[o1768 o1768 o327 o347 o327 ]] #int[int:>=0]:NotNull:exact *
>  ??????? o1765 LShiftL === _ o1761 o60? [[o1768 ]]
>  ??????????? o1761 ConvI2L === _ o1741? [[o1765 ]] #long:maxint-51..maxint-48
>  ??????????? o60?? ConI? === o0? [[o61 o1765 o1434 o2013 o1631 o2017 o1808? 60 ]]? #int:2
>  ??? o1769 ConL? === o0? [[o347 ]]? #long:-8589932784

From vladimir.x.ivanov at oracle.com  Tue Jan 22 20:08:22 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Tue, 22 Jan 2019 12:08:22 -0800
Subject: [13] RFR (XS): 8202952: C2: Unexpected dead nodes after matching
In-Reply-To: <9bf3ac2c-5881-576c-cc64-917cec246f8f@oracle.com>
References: <e6fc835a-904e-ab45-28d6-5b5ae82c53de@oracle.com>
 <9bf3ac2c-5881-576c-cc64-917cec246f8f@oracle.com>
Message-ID: <242b6a41-3db6-2911-1045-1e5eb63ba862@oracle.com>


On 22/01/2019 11:54, Vladimir Kozlov wrote:
> The fix is different from what we discussed.
> Can you explain how it helps?

We discussed adding AddP case to _shared_nodes.

Proposed fix achieves similar result with a different approach:

   * Matcher::clone_address_expressions() marks problematic AddP as 
shared (based on constant value);

   * DFA() doesn't construct duplicated State for inner AddP (since it's 
marked as shared);

   * Matcher doesn't need to materialize duplicated mach nodes, since it 
matches inner AddP separately;

Best regards,
Vladimir Ivanov

> On 1/22/19 11:05 AM, Vladimir Ivanov wrote:
>> http://cr.openjdk.java.net/~vlivanov/8202952/webrev.00/
>> https://bugs.openjdk.java.net/browse/JDK-8202952
>>
>> The crash happens when PhaseCFG encounters a dead MachNode in the graph.
>> The problematic node is a leftover from matching of an instruction 
>> with a duplicated memory operand (sarI_mem_CL [1] in that particular 
>> case).
>>
>> Address has the following shape [2]:
>> ?? AddP (AddP DecodeN (LShiftL ConvI2L ConI)) ConL
>>
>> It could be subsumed into complex addressing expression, but the 
>> constant is too large (doesn't fit into immL32). So, matcher has to 
>> compute inner address expression separately and put it into a register.
>>
>> Since memory operand is duplicated, 2 copies are materialized during 
>> matching, but as part of ::Expand() one of the copies is eliminated, 
>> thus leaving a dead mach node in the IR (for the address expression).
>>
>> The fix is to adjust Matcher::clone_address_expressions() to avoid 
>> cloning inner AddP when constant offset is too large.
>>
>> Testing: hs-precheckin-comp, hs-tier1, hs-tier2
>>
>> Best regards,
>> Vladimir Ivanov
>>
>> [1] instruct sarI_mem_CL(memory dst, rcx_RegI shift, rFlagsReg cr)
>> %{
>> ?? match(Set dst (StoreI dst (RShiftI (LoadI dst) shift)));
>>
>>
>> [2]
>> ??o347 AddP? === _ o2181 o1768 o1769? [[o349 o371 ]]
>> ???? o1768 AddP? === _ o2181 o2181 o1765? [[o347 ]]
>> ???????? o2181 DecodeN === _ o287? [[o1768 o1768 o327 o347 o327 ]] 
>> #int[int:>=0]:NotNull:exact *
>> ???????? o1765 LShiftL === _ o1761 o60? [[o1768 ]]
>> ???????????? o1761 ConvI2L === _ o1741? [[o1765 ]] 
>> #long:maxint-51..maxint-48
>> ???????????? o60?? ConI? === o0? [[o61 o1765 o1434 o2013 o1631 o2017 
>> o1808? 60 ]]? #int:2
>> ???? o1769 ConL? === o0? [[o347 ]]? #long:-8589932784

From shade at redhat.com  Tue Jan 22 20:29:46 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Tue, 22 Jan 2019 21:29:46 +0100
Subject: RFR [12] 8217467 (XS): Access barriers are missing in C2
 intrinsic for Base64
In-Reply-To: <42b2473e-50d4-a438-ad47-f6c3e216ae07@oracle.com>
References: <2cf6bd9e-d73f-c4b3-725d-aba3f5ed08c3@redhat.com>
 <99f5f7c8-0747-2cae-f8e5-e7d05358efcf@redhat.com>
 <ae3f8faf-5634-b562-a0a9-44a2744ca526@redhat.com>
 <42b2473e-50d4-a438-ad47-f6c3e216ae07@oracle.com>
Message-ID: <7b98400f-d363-6c06-4214-4ad934bd9488@redhat.com>

Thank you, pushed to jdk/jdk12.

-Aleksey

On 1/22/19 8:56 PM, Vladimir Kozlov wrote:
> Yes, changes are good. I approved it for push into JDK 12.
> 
> Thanks,
> Vladimir
> 
> On 1/22/19 2:48 AM, Roman Kennke wrote:
>> Looks good. Thanks!
>>
>> Roman
>>
>>
>>> (correct title)
>>>
>>> On 1/22/19 11:27 AM, Aleksey Shipilev wrote:
>>>> Bug:
>>>> ?? https://bugs.openjdk.java.net/browse/JDK-8217467
>>>>
>>>> Fix:
>>>> ?? http://cr.openjdk.java.net/~shade/8217467/webrev.01/
>>>>
>>>> This is found and verified by Shenandoah CTW tests that verifies barrier placement. Base64
>>>> intrinsic
>>>> is new, and only enabled on modern hardware (I think you need AVX512). I'd like to push this fix to
>>>> jdk12.
>>>>
>>>> Testing: Shenandoah CTW tests, hotspot tier1 (includes compiler/intrinsics/base64), jdk-submit12
>>>> (running)
>>>>
>>>> Thanks,
>>>> -Aleksey
>>>>
>>>
>>>
>>


-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190122/d6b3ed2a/signature-0001.asc>

From vladimir.kozlov at oracle.com  Tue Jan 22 20:42:11 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 22 Jan 2019 12:42:11 -0800
Subject: [13] RFR (XS): 8202952: C2: Unexpected dead nodes after matching
In-Reply-To: <242b6a41-3db6-2911-1045-1e5eb63ba862@oracle.com>
References: <e6fc835a-904e-ab45-28d6-5b5ae82c53de@oracle.com>
 <9bf3ac2c-5881-576c-cc64-917cec246f8f@oracle.com>
 <242b6a41-3db6-2911-1045-1e5eb63ba862@oracle.com>
Message-ID: <ab8be7ca-a3d0-69dc-ca10-ded169da7d60@oracle.com>

Got it. Good.

thanks,
Vladimir

On 1/22/19 12:08 PM, Vladimir Ivanov wrote:
> 
> On 22/01/2019 11:54, Vladimir Kozlov wrote:
>> The fix is different from what we discussed.
>> Can you explain how it helps?
> 
> We discussed adding AddP case to _shared_nodes.
> 
> Proposed fix achieves similar result with a different approach:
> 
>  ? * Matcher::clone_address_expressions() marks problematic AddP as shared (based on constant value);
> 
>  ? * DFA() doesn't construct duplicated State for inner AddP (since it's marked as shared);
> 
>  ? * Matcher doesn't need to materialize duplicated mach nodes, since it matches inner AddP separately;
> 
> Best regards,
> Vladimir Ivanov
> 
>> On 1/22/19 11:05 AM, Vladimir Ivanov wrote:
>>> http://cr.openjdk.java.net/~vlivanov/8202952/webrev.00/
>>> https://bugs.openjdk.java.net/browse/JDK-8202952
>>>
>>> The crash happens when PhaseCFG encounters a dead MachNode in the graph.
>>> The problematic node is a leftover from matching of an instruction with a duplicated memory operand (sarI_mem_CL [1] 
>>> in that particular case).
>>>
>>> Address has the following shape [2]:
>>> ?? AddP (AddP DecodeN (LShiftL ConvI2L ConI)) ConL
>>>
>>> It could be subsumed into complex addressing expression, but the constant is too large (doesn't fit into immL32). So, 
>>> matcher has to compute inner address expression separately and put it into a register.
>>>
>>> Since memory operand is duplicated, 2 copies are materialized during matching, but as part of ::Expand() one of the 
>>> copies is eliminated, thus leaving a dead mach node in the IR (for the address expression).
>>>
>>> The fix is to adjust Matcher::clone_address_expressions() to avoid cloning inner AddP when constant offset is too large.
>>>
>>> Testing: hs-precheckin-comp, hs-tier1, hs-tier2
>>>
>>> Best regards,
>>> Vladimir Ivanov
>>>
>>> [1] instruct sarI_mem_CL(memory dst, rcx_RegI shift, rFlagsReg cr)
>>> %{
>>> ?? match(Set dst (StoreI dst (RShiftI (LoadI dst) shift)));
>>>
>>>
>>> [2]
>>> ??o347 AddP? === _ o2181 o1768 o1769? [[o349 o371 ]]
>>> ???? o1768 AddP? === _ o2181 o2181 o1765? [[o347 ]]
>>> ???????? o2181 DecodeN === _ o287? [[o1768 o1768 o327 o347 o327 ]] #int[int:>=0]:NotNull:exact *
>>> ???????? o1765 LShiftL === _ o1761 o60? [[o1768 ]]
>>> ???????????? o1761 ConvI2L === _ o1741? [[o1765 ]] #long:maxint-51..maxint-48
>>> ???????????? o60?? ConI? === o0? [[o61 o1765 o1434 o2013 o1631 o2017 o1808? 60 ]]? #int:2
>>> ???? o1769 ConL? === o0? [[o347 ]]? #long:-8589932784

From gromero at linux.vnet.ibm.com  Tue Jan 22 22:53:48 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Tue, 22 Jan 2019 20:53:48 -0200
Subject: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
 isDigit/isLowerCase/isUpperCase/isWhitespace
In-Reply-To: <8083b8db-c546-29e8-c83a-f06ebd4e624e@linux.vnet.ibm.com>
References: <2d4d1747-a83d-5f65-eea3-d982969ae4fd@linux.vnet.ibm.com>
 <2ac3e91da61b43dcb2d4e45325202264@sap.com>
 <8083b8db-c546-29e8-c83a-f06ebd4e624e@linux.vnet.ibm.com>
Message-ID: <89eeb1bc-950c-9c9f-f49f-aabae7b6637f@linux.vnet.ibm.com>

Hi Goetz,

On 01/21/2019 09:45 AM, Gustavo Romero wrote:
> On 01/21/2019 09:10 AM, Lindenmaier, Goetz wrote:
>> also this change looks good.
> 
> Thanks for reviewing it, Goetz!
> 
> I'll ping once the approvals are ok.

This change and JDK-8215317 are approved to be pushed to 11u:

[0] https://bugs.openjdk.java.net/browse/JDK-8215317
[1] https://bugs.openjdk.java.net/browse/JDK-8213754

Could you please push them at the same time to 11u?

Thank you!

Best regards,
Gustavo

> Thank you.
> 
> Regards,
> Gustavo
> 
>> Best regards,
>> ?? Goetz.
>>
>>> -----Original Message-----
>>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>>> Sent: Freitag, 18. Januar 2019 16:07
>>> To: hotspot-compiler-dev at openjdk.java.net; Lindenmaier, Goetz
>>> <goetz.lindenmaier at sap.com>; Doerr, Martin <martin.doerr at sap.com>;
>>> vladimir.kozlov at oracle.com; Roger Riggs <Roger.Riggs at oracle.com>
>>> Cc: Michihiro Horie <HORIE at jp.ibm.com>
>>> Subject: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
>>> isDigit/isLowerCase/isUpperCase/isWhitespace
>>>
>>> Hi,
>>>
>>> Could the following backport to 11u be reviewed, please?
>>>
>>> Bug???? : https://bugs.openjdk.java.net/browse/JDK-8213754
>>> Change? : http://hg.openjdk.java.net/jdk/jdk/rev/7384e00d5860
>>> Backport: http://cr.openjdk.java.net/~gromero/8213754_jdk11u/v1/
>>>
>>> It adds 4 intrinsics that use instructions introduced by POWER9 in order to
>>> speed up methods isDigit, isLowerCase, isUpperCase, and isWhitespace.
>>>
>>> The change is mostly PPC64-only but it does touch shared code, for
>>> instance, in order to adapt the methods in question to be properly
>>> intrinsified. It also needs an additional change [0], since one Graal
>>> test has to be adapted (a separated RFR to backport [0] was sent to [1]).
>>>
>>> The change applies almost cleanly: only a small tweak is necessary because
>>> the hunk for ppc.ad file relies on some absent text in the 11u code around
>>> the change to be applied. That absent text is related to the Superword
>>> feature (a non-related feature), which is not backported yet to 11u.
>>>
>>> This backport was tested on POWER8 and POWER9 and no regressions were
>>> observed.
>>>
>>> This backport was also tested on x86_64 with
>>> ./test/hotspot/jtreg/compiler/{c1,c2,intrinsics} plus
>>> ./test/hotspot/jtreg/compiler/graalunit (with Graal compiler enabled) with
>>> change 8215317 [0] applied and no regressions were observed too.
>>>
>>> Thank you.
>>>
>>> Best regards,
>>> Gustavo
>>>
>>> [0] http://cr.openjdk.java.net/~gromero/8215317_jdk11u/v1/
>>> [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2019-
>>> January/032266.html
>>
> 


From vladimir.x.ivanov at oracle.com  Wed Jan 23 02:14:32 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Tue, 22 Jan 2019 18:14:32 -0800
Subject: [13] RFR (XS): 8202952: C2: Unexpected dead nodes after matching
In-Reply-To: <ab8be7ca-a3d0-69dc-ca10-ded169da7d60@oracle.com>
References: <e6fc835a-904e-ab45-28d6-5b5ae82c53de@oracle.com>
 <9bf3ac2c-5881-576c-cc64-917cec246f8f@oracle.com>
 <242b6a41-3db6-2911-1045-1e5eb63ba862@oracle.com>
 <ab8be7ca-a3d0-69dc-ca10-ded169da7d60@oracle.com>
Message-ID: <a5cd9102-ee57-26db-0ef4-8331064bd935@oracle.com>

Thanks, Vladimir.

Best regards,
Vladimir Ivanov

On 22/01/2019 12:42, Vladimir Kozlov wrote:
> Got it. Good.
> 
> thanks,
> Vladimir
> 
> On 1/22/19 12:08 PM, Vladimir Ivanov wrote:
>>
>> On 22/01/2019 11:54, Vladimir Kozlov wrote:
>>> The fix is different from what we discussed.
>>> Can you explain how it helps?
>>
>> We discussed adding AddP case to _shared_nodes.
>>
>> Proposed fix achieves similar result with a different approach:
>>
>> ?? * Matcher::clone_address_expressions() marks problematic AddP as 
>> shared (based on constant value);
>>
>> ?? * DFA() doesn't construct duplicated State for inner AddP (since 
>> it's marked as shared);
>>
>> ?? * Matcher doesn't need to materialize duplicated mach nodes, since 
>> it matches inner AddP separately;
>>
>> Best regards,
>> Vladimir Ivanov
>>
>>> On 1/22/19 11:05 AM, Vladimir Ivanov wrote:
>>>> http://cr.openjdk.java.net/~vlivanov/8202952/webrev.00/
>>>> https://bugs.openjdk.java.net/browse/JDK-8202952
>>>>
>>>> The crash happens when PhaseCFG encounters a dead MachNode in the 
>>>> graph.
>>>> The problematic node is a leftover from matching of an instruction 
>>>> with a duplicated memory operand (sarI_mem_CL [1] in that particular 
>>>> case).
>>>>
>>>> Address has the following shape [2]:
>>>> ?? AddP (AddP DecodeN (LShiftL ConvI2L ConI)) ConL
>>>>
>>>> It could be subsumed into complex addressing expression, but the 
>>>> constant is too large (doesn't fit into immL32). So, matcher has to 
>>>> compute inner address expression separately and put it into a register.
>>>>
>>>> Since memory operand is duplicated, 2 copies are materialized during 
>>>> matching, but as part of ::Expand() one of the copies is eliminated, 
>>>> thus leaving a dead mach node in the IR (for the address expression).
>>>>
>>>> The fix is to adjust Matcher::clone_address_expressions() to avoid 
>>>> cloning inner AddP when constant offset is too large.
>>>>
>>>> Testing: hs-precheckin-comp, hs-tier1, hs-tier2
>>>>
>>>> Best regards,
>>>> Vladimir Ivanov
>>>>
>>>> [1] instruct sarI_mem_CL(memory dst, rcx_RegI shift, rFlagsReg cr)
>>>> %{
>>>> ?? match(Set dst (StoreI dst (RShiftI (LoadI dst) shift)));
>>>>
>>>>
>>>> [2]
>>>> ??o347 AddP? === _ o2181 o1768 o1769? [[o349 o371 ]]
>>>> ???? o1768 AddP? === _ o2181 o2181 o1765? [[o347 ]]
>>>> ???????? o2181 DecodeN === _ o287? [[o1768 o1768 o327 o347 o327 ]] 
>>>> #int[int:>=0]:NotNull:exact *
>>>> ???????? o1765 LShiftL === _ o1761 o60? [[o1768 ]]
>>>> ???????????? o1761 ConvI2L === _ o1741? [[o1765 ]] 
>>>> #long:maxint-51..maxint-48
>>>> ???????????? o60?? ConI? === o0? [[o61 o1765 o1434 o2013 o1631 o2017 
>>>> o1808? 60 ]]? #int:2
>>>> ???? o1769 ConL? === o0? [[o347 ]]? #long:-8589932784

From igor.ignatyev at oracle.com  Wed Jan 23 02:26:02 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Tue, 22 Jan 2019 18:26:02 -0800
Subject: RFR(S) [12] : 8158646 : [jittester] generated tests may not compile
 by javac
Message-ID: <6D91688A-01A0-46E0-A304-9F39E16F574E@oracle.com>

http://cr.openjdk.java.net/~iignatyev//8158646/webrev.00/index.html
> 64 lines changed: 23 ins; 6 del; 35 mod;

Hi all,

could you please review this small fix for jit-tester?

the bug was caused by TypeList not being fully cleared b/w generation. we only remove classes which starts w/ "Test_", so we don't remove "basic" classes, e.g. Runnable, and don't clean their 'children'. in most cases, this is fine, as each generation will use only its own Test_N_* classes so having Test_M_* (M != N) classes as Runnable's children has no impact besides garbage in memory, however, if we get an error during Test_N generation we will redo generation for the same N, and in such cases, previous children of "basic" classes (read Runnable) cause incompatible types. the fix is to remove "Test_" classes from the children.

besides the fix for the bug, the patch also include the following small clean ups:
 - use DIST_JAR var value instead of 'JAR' string constant in makefile
 - change default target testbase dir
 - make sure jaotc is always run w/ X-mixed regardless of "external" vm flags
 - add -Xcomp to all the generator tests
 - use tmp directory for class files
 - check javac error code
 - optimize getAllParents/getAllChildren to call getAllParents/getAllChildren only if a class hasn't been added yet 

webrev: http://cr.openjdk.java.net/~iignatyev//8158646/webrev.00/index.html
JBS: https://bugs.openjdk.java.net/browse/JDK-8158646
testing: generated 1000 tests, all can be compiled and work fine

Thanks,
-- Igor  

From fairoz.matte at oracle.com  Wed Jan 23 03:20:09 2019
From: fairoz.matte at oracle.com (Fairoz Matte)
Date: Tue, 22 Jan 2019 19:20:09 -0800 (PST)
Subject: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
 com.sun.crypto.provider.CipherBlockChaining
In-Reply-To: <794dcbd2-60e0-e9f4-f9b5-f789e45d373a@oracle.com>
References: <be6b6aaa-d3e9-4ee4-a139-93d036c47ae8@default>
 <9d49a648-50e6-30cd-5705-2997f658d8e8@oracle.com>
 <794dcbd2-60e0-e9f4-f9b5-f789e45d373a@oracle.com>
Message-ID: <323b7338-d507-4850-ab53-4a5295d7b62f@default>

Thanks Tobias and Vladimir for review.

Thanks,
Fairoz

> -----Original Message-----
> From: Vladimir Kozlov
> Sent: Tuesday, January 22, 2019 10:27 PM
> To: Fairoz Matte <fairoz.matte at oracle.com>; hotspot-compiler-
> dev at openjdk.java.net
> Subject: Re: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
> com.sun.crypto.provider.CipherBlockChaining
> 
> Yes, it is good.
> 
> Thanks,
> Vladimir
> 
> On 1/22/19 12:22 AM, Tobias Hartmann wrote:
> > Hi Fairoz,
> >
> > this looks good to me.
> >
> > Thanks,
> > Tobias
> >
> > On 22.01.19 04:35, Fairoz Matte wrote:
> >> Hi,
> >>
> >> Please review the following patch,
> >> JBS bug - https://bugs.openjdk.java.net/browse/JDK-8209951
> >> Webrev - http://cr.openjdk.java.net/~fmatte/8209951/webrev.00/
> >>
> >> During the call to assembled stub code
> >> generate_cipherBlockChaining_decryptAESCrypt_Parallel()
> >> there was reference to G6 register used for temporary storage of F50,
> >> as G6 is not saved on stack it was resulting in garbage during retrieval.
> >>
> >> Solution is to use unused local register (L6) for temporary storage and
> retrieval of F50.
> >>
> >> Thanks,
> >> Fairoz
> >>

From goetz.lindenmaier at sap.com  Wed Jan 23 07:19:22 2019
From: goetz.lindenmaier at sap.com (Lindenmaier, Goetz)
Date: Wed, 23 Jan 2019 07:19:22 +0000
Subject: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
 isDigit/isLowerCase/isUpperCase/isWhitespace
In-Reply-To: <89eeb1bc-950c-9c9f-f49f-aabae7b6637f@linux.vnet.ibm.com>
References: <2d4d1747-a83d-5f65-eea3-d982969ae4fd@linux.vnet.ibm.com>
 <2ac3e91da61b43dcb2d4e45325202264@sap.com>
 <8083b8db-c546-29e8-c83a-f06ebd4e624e@linux.vnet.ibm.com>
 <89eeb1bc-950c-9c9f-f49f-aabae7b6637f@linux.vnet.ibm.com>
Message-ID: <e75afc7a58d0496aab4025a0a6d22737@sap.com>

Done ...

Best regards,
  Goetz.

> -----Original Message-----
> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> Sent: Dienstag, 22. Januar 2019 23:54
> To: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>; hotspot-compiler-
> dev at openjdk.java.net
> Subject: Re: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
> isDigit/isLowerCase/isUpperCase/isWhitespace
> 
> Hi Goetz,
> 
> On 01/21/2019 09:45 AM, Gustavo Romero wrote:
> > On 01/21/2019 09:10 AM, Lindenmaier, Goetz wrote:
> >> also this change looks good.
> >
> > Thanks for reviewing it, Goetz!
> >
> > I'll ping once the approvals are ok.
> 
> This change and JDK-8215317 are approved to be pushed to 11u:
> 
> [0] https://bugs.openjdk.java.net/browse/JDK-8215317
> [1] https://bugs.openjdk.java.net/browse/JDK-8213754
> 
> Could you please push them at the same time to 11u?
> 
> Thank you!
> 
> Best regards,
> Gustavo
> 
> > Thank you.
> >
> > Regards,
> > Gustavo
> >
> >> Best regards,
> >> ?? Goetz.
> >>
> >>> -----Original Message-----
> >>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
> >>> Sent: Freitag, 18. Januar 2019 16:07
> >>> To: hotspot-compiler-dev at openjdk.java.net; Lindenmaier, Goetz
> >>> <goetz.lindenmaier at sap.com>; Doerr, Martin <martin.doerr at sap.com>;
> >>> vladimir.kozlov at oracle.com; Roger Riggs <Roger.Riggs at oracle.com>
> >>> Cc: Michihiro Horie <HORIE at jp.ibm.com>
> >>> Subject: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
> >>> isDigit/isLowerCase/isUpperCase/isWhitespace
> >>>
> >>> Hi,
> >>>
> >>> Could the following backport to 11u be reviewed, please?
> >>>
> >>> Bug???? : https://bugs.openjdk.java.net/browse/JDK-8213754
> >>> Change? : http://hg.openjdk.java.net/jdk/jdk/rev/7384e00d5860
> >>> Backport: http://cr.openjdk.java.net/~gromero/8213754_jdk11u/v1/
> >>>
> >>> It adds 4 intrinsics that use instructions introduced by POWER9 in order to
> >>> speed up methods isDigit, isLowerCase, isUpperCase, and isWhitespace.
> >>>
> >>> The change is mostly PPC64-only but it does touch shared code, for
> >>> instance, in order to adapt the methods in question to be properly
> >>> intrinsified. It also needs an additional change [0], since one Graal
> >>> test has to be adapted (a separated RFR to backport [0] was sent to [1]).
> >>>
> >>> The change applies almost cleanly: only a small tweak is necessary because
> >>> the hunk for ppc.ad file relies on some absent text in the 11u code around
> >>> the change to be applied. That absent text is related to the Superword
> >>> feature (a non-related feature), which is not backported yet to 11u.
> >>>
> >>> This backport was tested on POWER8 and POWER9 and no regressions
> were
> >>> observed.
> >>>
> >>> This backport was also tested on x86_64 with
> >>> ./test/hotspot/jtreg/compiler/{c1,c2,intrinsics} plus
> >>> ./test/hotspot/jtreg/compiler/graalunit (with Graal compiler enabled) with
> >>> change 8215317 [0] applied and no regressions were observed too.
> >>>
> >>> Thank you.
> >>>
> >>> Best regards,
> >>> Gustavo
> >>>
> >>> [0] http://cr.openjdk.java.net/~gromero/8215317_jdk11u/v1/
> >>> [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2019-
> >>> January/032266.html
> >>
> >


From nils.eliasson at oracle.com  Wed Jan 23 09:23:54 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Wed, 23 Jan 2019 10:23:54 +0100
Subject: RFR: 8217519: Improve RegMask population count calculation
In-Reply-To: <d4f2f4c9-fbe9-0859-e579-52b6e810d6d9@oracle.com>
References: <d4f2f4c9-fbe9-0859-e579-52b6e810d6d9@oracle.com>
Message-ID: <2abb40ba-bfc4-6719-e418-1f1d016c57ec@oracle.com>

Excellent!

Thanks for fixing!

// Nils

On 2019-01-22 17:06, Claes Redestad wrote:
> Hi,
>
> this patch extract the population count used in RegMask::Size() to a
> utility method in share/utilities/population_count.hpp, as well as
> adds a test that verifies this produces the same results as the existing
> lookup table implementation.
>
> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217519
> Webrev: http://cr.openjdk.java.net/~redestad/8217519/open.00/
>
> This reduces instructions retired in RegMask::Size() by 50-60% in some
> tests and profiles, which equates to a speedup of C2 by ~5% total. This
> improves startup marginally in my tests.
>
> Compiler intrinsics (such as gcc's __builtin_popcount()) would be
> appealing, but that actually gives worse performance than this patch (on
> current build configurations/setups available to me).
>
> Testing: tier1-3 (ongoing, previous increments of the patch without
> the gtest has been thoroughly tested)
>
> Thanks!
>
> /Claes

From claes.redestad at oracle.com  Wed Jan 23 09:36:24 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Wed, 23 Jan 2019 10:36:24 +0100
Subject: RFR: 8217519: Improve RegMask population count calculation
In-Reply-To: <2abb40ba-bfc4-6719-e418-1f1d016c57ec@oracle.com>
References: <d4f2f4c9-fbe9-0859-e579-52b6e810d6d9@oracle.com>
 <2abb40ba-bfc4-6719-e418-1f1d016c57ec@oracle.com>
Message-ID: <7b45044d-84d5-74ec-22c2-f5e697582264@oracle.com>

Nils, Vladimir, Tobias,

thanks for reviewing - pushed.

/Claes

On 2019-01-23 10:23, Nils Eliasson wrote:
> Excellent!
> 
> Thanks for fixing!
> 
> // Nils
> 
> On 2019-01-22 17:06, Claes Redestad wrote:
>> Hi,
>>
>> this patch extract the population count used in RegMask::Size() to a
>> utility method in share/utilities/population_count.hpp, as well as
>> adds a test that verifies this produces the same results as the existing
>> lookup table implementation.
>>
>> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217519
>> Webrev: http://cr.openjdk.java.net/~redestad/8217519/open.00/
>>
>> This reduces instructions retired in RegMask::Size() by 50-60% in some
>> tests and profiles, which equates to a speedup of C2 by ~5% total. This
>> improves startup marginally in my tests.
>>
>> Compiler intrinsics (such as gcc's __builtin_popcount()) would be
>> appealing, but that actually gives worse performance than this patch (on
>> current build configurations/setups available to me).
>>
>> Testing: tier1-3 (ongoing, previous increments of the patch without
>> the gtest has been thoroughly tested)
>>
>> Thanks!
>>
>> /Claes

From claes.redestad at oracle.com  Wed Jan 23 12:00:52 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Wed, 23 Jan 2019 13:00:52 +0100
Subject: RFR: 8217629: RegMask::find_lowest_bit can reuse count_trailing_zeros
 utility
Message-ID: <673cad2b-7414-393e-3f2d-c44ea68e47d5@oracle.com>

Hi,

reusing the count_trailing_zeros utility from RegMask is a simple
cleanup which may enable optimizations on many platforms, like tzcnt
on Intel/AMD, and improves inlining.

Bug:    https://bugs.openjdk.java.net/browse/JDK-8217629
Webrev: http://cr.openjdk.java.net/~redestad/8217629/open.00/

On my startup tests and profiles this reduces instructions spent in C2s
register allocator by ~4%, and ~2% on the total.

Testing: tier1-3

Thanks!

/Claes

From gromero at linux.vnet.ibm.com  Wed Jan 23 12:11:21 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Wed, 23 Jan 2019 10:11:21 -0200
Subject: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
 isDigit/isLowerCase/isUpperCase/isWhitespace
In-Reply-To: <e75afc7a58d0496aab4025a0a6d22737@sap.com>
References: <2d4d1747-a83d-5f65-eea3-d982969ae4fd@linux.vnet.ibm.com>
 <2ac3e91da61b43dcb2d4e45325202264@sap.com>
 <8083b8db-c546-29e8-c83a-f06ebd4e624e@linux.vnet.ibm.com>
 <89eeb1bc-950c-9c9f-f49f-aabae7b6637f@linux.vnet.ibm.com>
 <e75afc7a58d0496aab4025a0a6d22737@sap.com>
Message-ID: <fe8de715-f5ca-5ff3-ce6b-76c7fe2a7d9c@linux.vnet.ibm.com>

On 01/23/2019 05:19 AM, Lindenmaier, Goetz wrote:
> Done ...

Thanks a lot, Goetz!

Regards,
Gustavo

> Best regards,
>    Goetz.
> 
>> -----Original Message-----
>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>> Sent: Dienstag, 22. Januar 2019 23:54
>> To: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>; hotspot-compiler-
>> dev at openjdk.java.net
>> Subject: Re: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
>> isDigit/isLowerCase/isUpperCase/isWhitespace
>>
>> Hi Goetz,
>>
>> On 01/21/2019 09:45 AM, Gustavo Romero wrote:
>>> On 01/21/2019 09:10 AM, Lindenmaier, Goetz wrote:
>>>> also this change looks good.
>>>
>>> Thanks for reviewing it, Goetz!
>>>
>>> I'll ping once the approvals are ok.
>>
>> This change and JDK-8215317 are approved to be pushed to 11u:
>>
>> [0] https://bugs.openjdk.java.net/browse/JDK-8215317
>> [1] https://bugs.openjdk.java.net/browse/JDK-8213754
>>
>> Could you please push them at the same time to 11u?
>>
>> Thank you!
>>
>> Best regards,
>> Gustavo
>>
>>> Thank you.
>>>
>>> Regards,
>>> Gustavo
>>>
>>>> Best regards,
>>>>  ?? Goetz.
>>>>
>>>>> -----Original Message-----
>>>>> From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>>>>> Sent: Freitag, 18. Januar 2019 16:07
>>>>> To: hotspot-compiler-dev at openjdk.java.net; Lindenmaier, Goetz
>>>>> <goetz.lindenmaier at sap.com>; Doerr, Martin <martin.doerr at sap.com>;
>>>>> vladimir.kozlov at oracle.com; Roger Riggs <Roger.Riggs at oracle.com>
>>>>> Cc: Michihiro Horie <HORIE at jp.ibm.com>
>>>>> Subject: [11u backport] RFR(M): 8213754: PPC64: Add Intrinsics for
>>>>> isDigit/isLowerCase/isUpperCase/isWhitespace
>>>>>
>>>>> Hi,
>>>>>
>>>>> Could the following backport to 11u be reviewed, please?
>>>>>
>>>>> Bug???? : https://bugs.openjdk.java.net/browse/JDK-8213754
>>>>> Change? : http://hg.openjdk.java.net/jdk/jdk/rev/7384e00d5860
>>>>> Backport: http://cr.openjdk.java.net/~gromero/8213754_jdk11u/v1/
>>>>>
>>>>> It adds 4 intrinsics that use instructions introduced by POWER9 in order to
>>>>> speed up methods isDigit, isLowerCase, isUpperCase, and isWhitespace.
>>>>>
>>>>> The change is mostly PPC64-only but it does touch shared code, for
>>>>> instance, in order to adapt the methods in question to be properly
>>>>> intrinsified. It also needs an additional change [0], since one Graal
>>>>> test has to be adapted (a separated RFR to backport [0] was sent to [1]).
>>>>>
>>>>> The change applies almost cleanly: only a small tweak is necessary because
>>>>> the hunk for ppc.ad file relies on some absent text in the 11u code around
>>>>> the change to be applied. That absent text is related to the Superword
>>>>> feature (a non-related feature), which is not backported yet to 11u.
>>>>>
>>>>> This backport was tested on POWER8 and POWER9 and no regressions
>> were
>>>>> observed.
>>>>>
>>>>> This backport was also tested on x86_64 with
>>>>> ./test/hotspot/jtreg/compiler/{c1,c2,intrinsics} plus
>>>>> ./test/hotspot/jtreg/compiler/graalunit (with Graal compiler enabled) with
>>>>> change 8215317 [0] applied and no regressions were observed too.
>>>>>
>>>>> Thank you.
>>>>>
>>>>> Best regards,
>>>>> Gustavo
>>>>>
>>>>> [0] http://cr.openjdk.java.net/~gromero/8215317_jdk11u/v1/
>>>>> [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2019-
>>>>> January/032266.html
>>>>
>>>
> 


From magnus.ihse.bursie at oracle.com  Wed Jan 23 12:55:58 2019
From: magnus.ihse.bursie at oracle.com (Magnus Ihse Bursie)
Date: Wed, 23 Jan 2019 13:55:58 +0100
Subject: RFR(M)(round 2): 8215902: Add support for SoftFloat-3e library
In-Reply-To: <de5bb24804a0c5b66f0412382f338e415de6b1ed.camel@gmail.com>
References: <4497ca084b9f48dbb8f6de1aa35c83653fd7acfb.camel@gmail.com>
 <7f69fc73-1c10-6b68-d657-c9e758d4bf1d@oracle.com>
 <de5bb24804a0c5b66f0412382f338e415de6b1ed.camel@gmail.com>
Message-ID: <3f62f15e-ac5f-94d4-9744-c9cef796a3fa@oracle.com>

Hi Jakub,

On 2019-01-15 17:31, Jakub Van?k wrote:
> Hi Magnus and Erik,
>
> I have added the link to the repository to README and I have removed
> the link to the mailing list thread. I have also recreated the GitHub
> repository. Now it is a fork of the mentioned repository with two extra
> commits containing README and the build scripts.
>
> New webrev URL: http://cr.openjdk.java.net/~jakvanek/8215902/webrev.04/
> Bug: https://bugs.openjdk.java.net/browse/JDK-8215902

Sorry for the late reply.

This looks very good! Thank you for fixing this, including rebasing the 
github repo.

I'm not sure if you've gotten reviews from the hotspot team for the 
hotspot source changes, but from a build perspective, this is good to go.

/Magnus
>
> Regards,
>
> Jakub
>
> On 2019-01-15 at 15:05 +0100, Magnus Ihse Bursie wrote:
>> On 2018-12-25 16:19, Jakub Van?k wrote:
>>> Hi,
>>>
>>> please review this webrev. It is a successor of the softfloat-3
>>> [patch]
>>> thread (first email
>>>
> http://mail.openjdk.java.net/pipermail/hotspot-runtime-dev/2018-November/031311.html
>>> )
>>>
>>> Changes since the last patch (v6):
>>>
>>> - renamed --with-softloat* to --with-sflt* (it is more compact and
>>> it
>>>     corresponds to the old --with-sflt-lib=... option)
>>>
>>> - license is now obtained via --with-sflt-license switch (so it is
>>> not
>>>     included in OpenJDK source tree)
>>>
>>> - updated documentation (slight rewording, added the license
>>> option)
>>>
>>> - checks for default --with/--without behavior are in place again
>>>     (I forgot them when I changed the way the library is detected)
>>>
>>> - added a simple testcase - I found a disrepancy between softfloat
>>> and
>>>     system function behavior. When a float with bits 0x003FFFFF is
>>>     added to 0x00000001, the correct result is 0x00400000, but the
>>>     default software floating point implementation returns
>>> 0x00000000.
>>>     However I'm not sure where to put this test - now it is in
>>>     test/hotspot/jtreg/compiler/floatingpoint.
>>>
>>> - comments in code refer to CR 6757269 and newly JDK-8215902 too.
>>>
>>> I have created a repository with SoftFloat-3e with build
>>> configuration
>>> specifically for OpenJDK on armel:
>>> https://github.com/ev3dev-lang-java/softfloat-openjdk
>>>
>>> I can add a link to it to the documentation.
>>>
>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8215902
>>> Webrev: http://cr.openjdk.java.net/~jakvanek/8215902/webrev.02/
>> Hi Jakub,
>>
>> In general this looks good.
>>
>> Some comments:
>>
>> I agree with Erik that you can add a link to your github project;
>> compiling SoftFloat is outside the scope of the OpenJDK build
>> instructions, but it can sure be helpful to lower the bar for users
>> wanting to do that. Just one question: any particular reason you
>> didn't
>> create your github repo by forking the official
>> https://github.com/ucb-bar/berkeley-softfloat-3? That way, it would
>> have
>> been easy for users to see that you were not adding any malicious or
>> suspicious code to the original SoftFloat distribution.
>>
>> On the other hand, I think the link to
>>
> http://mail.openjdk.java.net/pipermail/aarch32-port-dev/2016-November/000611.html
>>   
>> is unnecessary and just creates clutter in the documentation. Please
>> remove it.
>>
>> /Magnus
>>> CI build:
>>> https://ci.adoptopenjdk.net/view/ev3dev/job/openjdk12_build_ev3_linux/67/
>>>
>>> Cheers,
>>>
>>> Jakub
>>>
>>


From jamsheed.c.m at oracle.com  Wed Jan 23 14:08:57 2019
From: jamsheed.c.m at oracle.com (Jamsheed)
Date: Wed, 23 Jan 2019 19:38:57 +0530
Subject: [12] RFR: 8213825: assert(false) failed: Non-balanced monitor
 enter/exit! Likely JNI locking
Message-ID: <b3648bb2-0907-453c-c9c8-3e79da38dc00@oracle.com>

Hi,

Request for review

bug: https://bugs.openjdk.java.net/browse/JDK-8213825

webrev: http://cr.openjdk.java.net/~jcm/8213825/webrev.00/index.html

Bug & Fix Desc:

if markword load has sfpt as control i/p(i.e synchronizations near a 
safepoint), it skips sfpt assuming sfptOp wouldn't write to markword memory
fix: not to skip sfpt for markword loads.

tests: hs-tier1-5,? hs-precheckin-comp

Best regards,

Jamsheed


From nils.eliasson at oracle.com  Wed Jan 23 15:16:00 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Wed, 23 Jan 2019 16:16:00 +0100
Subject: RFR: 8217629: RegMask::find_lowest_bit can reuse
 count_trailing_zeros utility
In-Reply-To: <673cad2b-7414-393e-3f2d-c44ea68e47d5@oracle.com>
References: <673cad2b-7414-393e-3f2d-c44ea68e47d5@oracle.com>
Message-ID: <db63aed8-1db7-3271-3081-f86308598696@oracle.com>

Hi Claes,

Looks great!

Consider it trivial.

/ Nils

On 2019-01-23 13:00, Claes Redestad wrote:
> Hi,
>
> reusing the count_trailing_zeros utility from RegMask is a simple
> cleanup which may enable optimizations on many platforms, like tzcnt
> on Intel/AMD, and improves inlining.
>
> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217629
> Webrev: http://cr.openjdk.java.net/~redestad/8217629/open.00/
>
> On my startup tests and profiles this reduces instructions spent in C2s
> register allocator by ~4%, and ~2% on the total.
>
> Testing: tier1-3
>
> Thanks!
>
> /Claes

From tobias.hartmann at oracle.com  Wed Jan 23 15:28:23 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Wed, 23 Jan 2019 16:28:23 +0100
Subject: RFR: 8217629: RegMask::find_lowest_bit can reuse
 count_trailing_zeros utility
In-Reply-To: <673cad2b-7414-393e-3f2d-c44ea68e47d5@oracle.com>
References: <673cad2b-7414-393e-3f2d-c44ea68e47d5@oracle.com>
Message-ID: <c7003971-865e-4f56-2cfd-a3950335405d@oracle.com>

Hi Claes,

looks good to me too.

Best regards,
Tobias

On 23.01.19 13:00, Claes Redestad wrote:
> Hi,
> 
> reusing the count_trailing_zeros utility from RegMask is a simple
> cleanup which may enable optimizations on many platforms, like tzcnt
> on Intel/AMD, and improves inlining.
> 
> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217629
> Webrev: http://cr.openjdk.java.net/~redestad/8217629/open.00/
> 
> On my startup tests and profiles this reduces instructions spent in C2s
> register allocator by ~4%, and ~2% on the total.
> 
> Testing: tier1-3
> 
> Thanks!
> 
> /Claes

From shade at redhat.com  Wed Jan 23 16:50:00 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Wed, 23 Jan 2019 17:50:00 +0100
Subject: RFR (S) 8217639: Minimal and Zero builds fail after JDK-8217519
 (Improve RegMask population count calculation)
Message-ID: <3bde3396-02a4-4b16-4fc5-257f67a34211@redhat.com>

Bug:
  https://bugs.openjdk.java.net/browse/JDK-8217639

Reason: New test references "extern uint8_t byte[] bitsInByte", and that is defined in
libadt/vectset.cpp, which is not compiled when C2 is disabled in Minimal and Zero VM builds. I was
first considering to enabled libadt build when C2 is disabled, but the more straight-forward fix
would be to give the test its own golden data to test against. This would also implicitly test for
accidental bugs in bitsInByte matrix in production code.

Fix:

diff -r c96f9aa1f3d8 -r 29037fc5194d test/hotspot/gtest/utilities/test_population_count.cpp
--- a/test/hotspot/gtest/utilities/test_population_count.cpp    Wed Jan 23 13:16:16 2019 +0000
+++ b/test/hotspot/gtest/utilities/test_population_count.cpp    Wed Jan 23 17:04:25 2019 +0100
@@ -29,18 +29,35 @@
 #include "utilities/globalDefinitions.hpp"
 #include "unittest.hpp"

+uint8_t test_popcnt_bitsInByte[BITS_IN_BYTE_ARRAY_SIZE] = {
+        0, 1, 1, 2, 1, 2, 2, 3, 1, 2, 2, 3, 2, 3, 3, 4,
+        1, 2, 2, 3, 2, 3, 3, 4, 2, 3, 3, 4, 3, 4, 4, 5,
+        1, 2, 2, 3, 2, 3, 3, 4, 2, 3, 3, 4, 3, 4, 4, 5,
+        2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6,
+        1, 2, 2, 3, 2, 3, 3, 4, 2, 3, 3, 4, 3, 4, 4, 5,
+        2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6,
+        2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6,
+        3, 4, 4, 5, 4, 5, 5, 6, 4, 5, 5, 6, 5, 6, 6, 7,
+        1, 2, 2, 3, 2, 3, 3, 4, 2, 3, 3, 4, 3, 4, 4, 5,
+        2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6,
+        2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6,
+        3, 4, 4, 5, 4, 5, 5, 6, 4, 5, 5, 6, 5, 6, 6, 7,
+        2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6,
+        3, 4, 4, 5, 4, 5, 5, 6, 4, 5, 5, 6, 5, 6, 6, 7,
+        3, 4, 4, 5, 4, 5, 5, 6, 4, 5, 5, 6, 5, 6, 6, 7,
+        4, 5, 5, 6, 5, 6, 6, 7, 5, 6, 6, 7, 6, 7, 7, 8
+};

 TEST(population_count, sparse) {
-  extern uint8_t bitsInByte[BITS_IN_BYTE_ARRAY_SIZE];
   // Step through the entire input range from a random starting point,
   // verify population_count return values against the lookup table
   // approach used historically
   uint32_t step = 4711;
   for (uint32_t value = os::random() % step; value < UINT_MAX - step; value += step) {
-    uint32_t lookup = bitsInByte[(value >> 24) & 0xff] +
-                      bitsInByte[(value >> 16) & 0xff] +
-                      bitsInByte[(value >> 8)  & 0xff] +
-                      bitsInByte[ value        & 0xff];
+    uint32_t lookup = test_popcnt_bitsInByte[(value >> 24) & 0xff] +
+                      test_popcnt_bitsInByte[(value >> 16) & 0xff] +
+                      test_popcnt_bitsInByte[(value >> 8)  & 0xff] +
+                      test_popcnt_bitsInByte[ value        & 0xff];

     EXPECT_EQ(lookup, population_count(value))
         << "value = " << value;

Testing: Linux x86_64 {server,zero,minimal} build and gtest:population_count

-Aleksey

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190123/17675e21/signature.asc>

From vladimir.kozlov at oracle.com  Wed Jan 23 16:57:41 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 23 Jan 2019 08:57:41 -0800
Subject: RFR (S) 8217639: Minimal and Zero builds fail after JDK-8217519
 (Improve RegMask population count calculation)
In-Reply-To: <3bde3396-02a4-4b16-4fc5-257f67a34211@redhat.com>
References: <3bde3396-02a4-4b16-4fc5-257f67a34211@redhat.com>
Message-ID: <925d42a1-460d-7a64-d872-603f42375337@oracle.com>

Good. I think it is trivial.

thanks,
Vladimir

On 1/23/19 8:50 AM, Aleksey Shipilev wrote:
> Bug:
>    https://bugs.openjdk.java.net/browse/JDK-8217639
> 
> Reason: New test references "extern uint8_t byte[] bitsInByte", and that is defined in
> libadt/vectset.cpp, which is not compiled when C2 is disabled in Minimal and Zero VM builds. I was
> first considering to enabled libadt build when C2 is disabled, but the more straight-forward fix
> would be to give the test its own golden data to test against. This would also implicitly test for
> accidental bugs in bitsInByte matrix in production code.
> 
> Fix:
> 
> diff -r c96f9aa1f3d8 -r 29037fc5194d test/hotspot/gtest/utilities/test_population_count.cpp
> --- a/test/hotspot/gtest/utilities/test_population_count.cpp    Wed Jan 23 13:16:16 2019 +0000
> +++ b/test/hotspot/gtest/utilities/test_population_count.cpp    Wed Jan 23 17:04:25 2019 +0100
> @@ -29,18 +29,35 @@
>   #include "utilities/globalDefinitions.hpp"
>   #include "unittest.hpp"
> 
> +uint8_t test_popcnt_bitsInByte[BITS_IN_BYTE_ARRAY_SIZE] = {
> +        0, 1, 1, 2, 1, 2, 2, 3, 1, 2, 2, 3, 2, 3, 3, 4,
> +        1, 2, 2, 3, 2, 3, 3, 4, 2, 3, 3, 4, 3, 4, 4, 5,
> +        1, 2, 2, 3, 2, 3, 3, 4, 2, 3, 3, 4, 3, 4, 4, 5,
> +        2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6,
> +        1, 2, 2, 3, 2, 3, 3, 4, 2, 3, 3, 4, 3, 4, 4, 5,
> +        2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6,
> +        2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6,
> +        3, 4, 4, 5, 4, 5, 5, 6, 4, 5, 5, 6, 5, 6, 6, 7,
> +        1, 2, 2, 3, 2, 3, 3, 4, 2, 3, 3, 4, 3, 4, 4, 5,
> +        2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6,
> +        2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6,
> +        3, 4, 4, 5, 4, 5, 5, 6, 4, 5, 5, 6, 5, 6, 6, 7,
> +        2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6,
> +        3, 4, 4, 5, 4, 5, 5, 6, 4, 5, 5, 6, 5, 6, 6, 7,
> +        3, 4, 4, 5, 4, 5, 5, 6, 4, 5, 5, 6, 5, 6, 6, 7,
> +        4, 5, 5, 6, 5, 6, 6, 7, 5, 6, 6, 7, 6, 7, 7, 8
> +};
> 
>   TEST(population_count, sparse) {
> -  extern uint8_t bitsInByte[BITS_IN_BYTE_ARRAY_SIZE];
>     // Step through the entire input range from a random starting point,
>     // verify population_count return values against the lookup table
>     // approach used historically
>     uint32_t step = 4711;
>     for (uint32_t value = os::random() % step; value < UINT_MAX - step; value += step) {
> -    uint32_t lookup = bitsInByte[(value >> 24) & 0xff] +
> -                      bitsInByte[(value >> 16) & 0xff] +
> -                      bitsInByte[(value >> 8)  & 0xff] +
> -                      bitsInByte[ value        & 0xff];
> +    uint32_t lookup = test_popcnt_bitsInByte[(value >> 24) & 0xff] +
> +                      test_popcnt_bitsInByte[(value >> 16) & 0xff] +
> +                      test_popcnt_bitsInByte[(value >> 8)  & 0xff] +
> +                      test_popcnt_bitsInByte[ value        & 0xff];
> 
>       EXPECT_EQ(lookup, population_count(value))
>           << "value = " << value;
> 
> Testing: Linux x86_64 {server,zero,minimal} build and gtest:population_count
> 
> -Aleksey
> 

From vladimir.kozlov at oracle.com  Wed Jan 23 17:11:18 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 23 Jan 2019 09:11:18 -0800
Subject: [12] RFR: 8213825: assert(false) failed: Non-balanced monitor
 enter/exit! Likely JNI locking
In-Reply-To: <b3648bb2-0907-453c-c9c8-3e79da38dc00@oracle.com>
References: <b3648bb2-0907-453c-c9c8-3e79da38dc00@oracle.com>
Message-ID: <a2f5c24e-4a09-1122-2cb8-d26aebc42a1e@oracle.com>

Hi Jamsheed,

Fix is good. I approved it for JDK 12 push.

Thanks,
Vladimir

On 1/23/19 6:08 AM, Jamsheed wrote:
> Hi,
> 
> Request for review
> 
> bug: https://bugs.openjdk.java.net/browse/JDK-8213825
> 
> webrev: http://cr.openjdk.java.net/~jcm/8213825/webrev.00/index.html
> 
> Bug & Fix Desc:
> 
> if markword load has sfpt as control i/p(i.e synchronizations near a safepoint), it skips sfpt assuming sfptOp wouldn't 
> write to markword memory
> fix: not to skip sfpt for markword loads.
> 
> tests: hs-tier1-5,? hs-precheckin-comp
> 
> Best regards,
> 
> Jamsheed
> 

From vladimir.kozlov at oracle.com  Wed Jan 23 17:24:45 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 23 Jan 2019 09:24:45 -0800
Subject: RFR(S) [12] : 8158646 : [jittester] generated tests may not
 compile by javac
In-Reply-To: <6D91688A-01A0-46E0-A304-9F39E16F574E@oracle.com>
References: <6D91688A-01A0-46E0-A304-9F39E16F574E@oracle.com>
Message-ID: <87177e44-0b60-c385-afb7-eeedd3d29829@oracle.com>

On 1/22/19 6:26 PM, Igor Ignatyev wrote:
> http://cr.openjdk.java.net/~iignatyev//8158646/webrev.00/index.html
>> 64 lines changed: 23 ins; 6 del; 35 mod;
> 
> Hi all,
> 
> could you please review this small fix for jit-tester?
> 
> the bug was caused by TypeList not being fully cleared b/w generation. we only remove classes which starts w/ "Test_", so we don't remove "basic" classes, e.g. Runnable, and don't clean their 'children'. in most cases, this is fine, as each generation will use only its own Test_N_* classes so having Test_M_* (M != N) classes as Runnable's children has no impact besides garbage in memory, however, if we get an error during Test_N generation we will redo generation for the same N, and in such cases, previous children of "basic" classes (read Runnable) cause incompatible types. the fix is to remove "Test_" classes from the children.

ok

> 
> besides the fix for the bug, the patch also include the following small clean ups:
>   - use DIST_JAR var value instead of 'JAR' string constant in makefile

ok

>   - change default target testbase dir

ok

>   - make sure jaotc is always run w/ X-mixed regardless of "external" vm flags

Also would be nice to add -ea -esa too if they are not used yet.

>   - add -Xcomp to all the generator tests

why you need -Xcomp?

>   - use tmp directory for class files

Will it work on Windows which has issues with tmp dir? There was discussion about it recently.

>   - check javac error code

ok

>   - optimize getAllParents/getAllChildren to call getAllParents/getAllChildren only if a class hasn't been added yet

ok

Thanks,
Vladimir

> 
> webrev: http://cr.openjdk.java.net/~iignatyev//8158646/webrev.00/index.html
> JBS: https://bugs.openjdk.java.net/browse/JDK-8158646
> testing: generated 1000 tests, all can be compiled and work fine
> 
> Thanks,
> -- Igor
> 

From derekw at marvell.com  Wed Jan 23 17:27:34 2019
From: derekw at marvell.com (Derek White)
Date: Wed, 23 Jan 2019 17:27:34 +0000
Subject: Changes to Bellsoft/Marvell method of developing intrinsics
Message-ID: <MN2PR18MB2733D9E9AFCEA1B90D6E92FAD2990@MN2PR18MB2733.namprd18.prod.outlook.com>

AArch64 Community,

First I should describe the relationship between myself, Marvell, and Bellsoft. I'm the JVM team lead at Marvell/Cavium, and we work as a virtual team with Bellsoft to help port, analyze, and optimize the aarch64 port of OpenJDK (as well as Hadoop, etc). Bellsoft also contributes to OpenJDK independently.

Andrew Dinn has brought up several good points on testing, code quality, and when and where code complexity should be spent in the aarch64 port. I'll describe my general thoughts on code complexity, what Bellsoft does generally for testing before check-ins, as well as describe what we will be doing for new and existing complex intrinsics code.

Intrinsics are a category of code that can handle more complexity than usual because the complexity is quite local. A developer can generally ignore the details hiding in the implementation unless actively reviewing or enhancing the intrinsic. But while pockets of complexity are OK, black holes of complexity are not. The effort to understand the intrinsics must be substantially less than then effort to develop it. The nature of intrinsics also make them easier to test in isolation, but the testing has to be sufficient. And I agree that the performance gain of each intrinsic has to justify the work developing and supporting it.

Bellsoft's current testing process, before sending a patch for review, is developing testing specific to the patch itself and testing for regressions with JCK and relevant jtreg tests. If the patch is in shared code, it undergoes testing on Linux x86, ARM, AARCH64, Windows, Mac, Solaris x86 and SPARC.

Obviously this has not been sufficient to prevent bugs in the more complex intrinsics we've implemented for aarch64 - even with the stellar code review provided by the community. And the effort required to review the intrinsics has been too high.

Because of this we will change how we develop patches for complex intrinsics. Before sending the code out for public review, we intend to:

  *   Use an additional "red-team" developer to focus on finding the weak points in the code and develop tests that ensure code coverage testing, test case coverage, etc. This is in addition to the normal testing and test development that the initiating developer is expected to do.
  *   The "red-team" developer will also suggest changes for code clarity and code documentation, and will document the test strategy (what cases are tested, what tests cover what code, how to run tests).
  *   We will include all tests developed as part of the patch, even if some modes may not be practical to run regularly as jtreg tests (for example if some tests take excessive time). This will allow later enhancements or fixes to the intrinsic to go through at least as thorough testing as the original.
By breaking the patch development task into two roles we expect to end up with better code quality and make the reviewing task easier.

Note that this is the process that we will be using. We don't expect the rest of the community to adopt this, or if they did, agree on exactly how complex a "complex intrinsic" needs to be to warrant this approach.

We will also begin back-reviewing existing complex intrinsics. If other members of the community are interested in working on this we can coordinate to ensure coverage.

Please let me know if you have any comments on this plan. Thanks,

  *   Derek

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190123/ea2be25b/attachment-0001.html>

From igor.ignatyev at oracle.com  Wed Jan 23 17:36:55 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 23 Jan 2019 09:36:55 -0800
Subject: RFR(S) [12] : 8216180 : [AOT]
 compiler/intrinsics/bigInteger/TestMulAdd.java crashed with AOT enabled
Message-ID: <28BD4C78-3A3F-4B36-8D93-B9F520B08E34@oracle.com>

http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
> 32 lines changed: 32 ins; 0 del; 0 mod;

Hi all,

could you please review this small patch which exclude TestMulAdd test from execution if java.base is AOT'ed compiled?

the test disables some intrinsics, and if it's run w/ AOT'ed java.base there these intrinsics are enabled (which is the most common, if not the only, case) we get crash. the fix introduces new @requires value -- vm.aot.modules which contains comma-separated list of AOT'ed modules and use it to skip this test if java.base is one of them.

webrev: http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
JBS: https://bugs.openjdk.java.net/browse/JDK-8216180
testing: compiler/intrinsics/bigInteger tests on linux-x64 w/ JTREG=AOT_MODULES=java.base, TEST_OPTS_AOT_MODULES=java.base and w/o any extra make args

Thanks,
-- Igor


From igor.ignatyev at oracle.com  Wed Jan 23 17:46:24 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 23 Jan 2019 09:46:24 -0800
Subject: RFR(S) [12] : 8158646 : [jittester] generated tests may not
 compile by javac
In-Reply-To: <87177e44-0b60-c385-afb7-eeedd3d29829@oracle.com>
References: <6D91688A-01A0-46E0-A304-9F39E16F574E@oracle.com>
 <87177e44-0b60-c385-afb7-eeedd3d29829@oracle.com>
Message-ID: <3A45E5A7-1351-47FD-8633-065CF639BF2A@oracle.com>

Hi Vladimir,

thanks for your review!

>>  - make sure jaotc is always run w/ X-mixed regardless of "external" vm flags
> Also would be nice to add -ea -esa too if they are not used yet.
-Xmixed was added to "speed-up" compilation in case external flags has Xcomp. from my point of view, it's better if '-ea -esa' are provided during test runs, as in some cases you might want to run jaotc w/o them. 

>>  - add -Xcomp to all the generator tests
> why you need -Xcomp?
b/c jit-tester is supposed to compare results of interpreted execution (saved in .gold.* files) w/ the result from compilers, so generated tests must be run w/ Xcomp, otherwise we will comparing one results from interpreter w/ the results from interpreter.

>>  - use tmp directory for class files
> Will it work on Windows which has issues with tmp dir? There was discussion about it recently.
we use tmp dir only in the test generator which can be run on any platform, it doesn't have to be run on the same host/platform as actual test execution. in fact the preferred usage model of jit-tester is to pre-generate test corpus and reuse it.

Thanks,
-- Igor

> On Jan 23, 2019, at 9:24 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
> 
> On 1/22/19 6:26 PM, Igor Ignatyev wrote:
>> http://cr.openjdk.java.net/~iignatyev//8158646/webrev.00/index.html
>>> 64 lines changed: 23 ins; 6 del; 35 mod;
>> Hi all,
>> could you please review this small fix for jit-tester?
>> the bug was caused by TypeList not being fully cleared b/w generation. we only remove classes which starts w/ "Test_", so we don't remove "basic" classes, e.g. Runnable, and don't clean their 'children'. in most cases, this is fine, as each generation will use only its own Test_N_* classes so having Test_M_* (M != N) classes as Runnable's children has no impact besides garbage in memory, however, if we get an error during Test_N generation we will redo generation for the same N, and in such cases, previous children of "basic" classes (read Runnable) cause incompatible types. the fix is to remove "Test_" classes from the children.
> 
> ok
> 
>> besides the fix for the bug, the patch also include the following small clean ups:
>>  - use DIST_JAR var value instead of 'JAR' string constant in makefile
> 
> ok
> 
>>  - change default target testbase dir
> 
> ok
> 
>>  - make sure jaotc is always run w/ X-mixed regardless of "external" vm flags
> 
> Also would be nice to add -ea -esa too if they are not used yet.
> 
>>  - add -Xcomp to all the generator tests
> 
> why you need -Xcomp?
> 
>>  - use tmp directory for class files
> 
> Will it work on Windows which has issues with tmp dir? There was discussion about it recently.
> 
>>  - check javac error code
> 
> ok
> 
>>  - optimize getAllParents/getAllChildren to call getAllParents/getAllChildren only if a class hasn't been added yet
> 
> ok
> 
> Thanks,
> Vladimir
> 
>> webrev: http://cr.openjdk.java.net/~iignatyev//8158646/webrev.00/index.html
>> JBS: https://bugs.openjdk.java.net/browse/JDK-8158646
>> testing: generated 1000 tests, all can be compiled and work fine
>> Thanks,
>> -- Igor


From vladimir.kozlov at oracle.com  Wed Jan 23 18:28:33 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 23 Jan 2019 10:28:33 -0800
Subject: RFR(S) [12] : 8216180 : [AOT]
 compiler/intrinsics/bigInteger/TestMulAdd.java crashed with AOT enabled
In-Reply-To: <28BD4C78-3A3F-4B36-8D93-B9F520B08E34@oracle.com>
References: <28BD4C78-3A3F-4B36-8D93-B9F520B08E34@oracle.com>
Message-ID: <c337d8af-8dfc-f0e3-93e1-2185d622561f@oracle.com>

I assume tests don't use -XX:AOTLibrary= flag but load them from default location in JDK. Right?
Can we instead skip such tests if any AOT library is loaded? We can check it with PrintAOT or new ouptu or new WB API.
Relying on env variable is not robust I think.

Thanks,
Vladimir

On 1/23/19 9:36 AM, Igor Ignatyev wrote:
> http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>> 32 lines changed: 32 ins; 0 del; 0 mod;
> 
> Hi all,
> 
> could you please review this small patch which exclude TestMulAdd test from execution if java.base is AOT'ed compiled?
> 
> the test disables some intrinsics, and if it's run w/ AOT'ed java.base there these intrinsics are enabled (which is the most common, if not the only, case) we get crash. the fix introduces new @requires value -- vm.aot.modules which contains comma-separated list of AOT'ed modules and use it to skip this test if java.base is one of them.
> 
> webrev: http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
> JBS: https://bugs.openjdk.java.net/browse/JDK-8216180
> testing: compiler/intrinsics/bigInteger tests on linux-x64 w/ JTREG=AOT_MODULES=java.base, TEST_OPTS_AOT_MODULES=java.base and w/o any extra make args
> 
> Thanks,
> -- Igor
> 

From shade at redhat.com  Wed Jan 23 18:32:06 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Wed, 23 Jan 2019 19:32:06 +0100
Subject: RFR (S) 8217639: Minimal and Zero builds fail after JDK-8217519
 (Improve RegMask population count calculation)
In-Reply-To: <925d42a1-460d-7a64-d872-603f42375337@oracle.com>
References: <3bde3396-02a4-4b16-4fc5-257f67a34211@redhat.com>
 <925d42a1-460d-7a64-d872-603f42375337@oracle.com>
Message-ID: <1ea00c5e-b01f-76a2-8034-3d9c59058e90@redhat.com>

On 1/23/19 5:57 PM, Vladimir Kozlov wrote:
> Good. I think it is trivial.

Thanks, I think so too. Pushed.

-Aleksey

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190123/dd834384/signature.asc>

From vladimir.kozlov at oracle.com  Wed Jan 23 18:36:19 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 23 Jan 2019 10:36:19 -0800
Subject: RFR(S) [12] : 8158646 : [jittester] generated tests may not
 compile by javac
In-Reply-To: <3A45E5A7-1351-47FD-8633-065CF639BF2A@oracle.com>
References: <6D91688A-01A0-46E0-A304-9F39E16F574E@oracle.com>
 <87177e44-0b60-c385-afb7-eeedd3d29829@oracle.com>
 <3A45E5A7-1351-47FD-8633-065CF639BF2A@oracle.com>
Message-ID: <8e89ff21-d3d4-3f95-ef54-1aeb55e2aeac@oracle.com>

On 1/23/19 9:46 AM, Igor Ignatyev wrote:
> Hi Vladimir,
> 
> thanks for your review!
> 
>>>   - make sure jaotc is always run w/ X-mixed regardless of "external" vm flags
>> Also would be nice to add -ea -esa too if they are not used yet.
> -Xmixed was added to "speed-up" compilation in case external flags has Xcomp. from my point of view, it's better if '-ea -esa' are provided during test runs, as in some cases you might want to run jaotc w/o them.

I got it about -Xmixed. I put aot tests to noxcomp group for our CI testing.

To always use '-ea -esa' with jaotc during testing is good I think. We have them by default in our hs-comp testing tasks 
definitions. I thought to have them here is also good if this testing does not use flags from task definitions.
> 
>>>   - add -Xcomp to all the generator tests
>> why you need -Xcomp?
> b/c jit-tester is supposed to compare results of interpreted execution (saved in .gold.* files) w/ the result from compilers, so generated tests must be run w/ Xcomp, otherwise we will comparing one results from interpreter w/ the results from interpreter.

Got it.

> 
>>>   - use tmp directory for class files
>> Will it work on Windows which has issues with tmp dir? There was discussion about it recently.
> we use tmp dir only in the test generator which can be run on any platform, it doesn't have to be run on the same host/platform as actual test execution. in fact the preferred usage model of jit-tester is to pre-generate test corpus and reuse it.

Okay.

thanks,
Vladimir

> 
> Thanks,
> -- Igor
> 
>> On Jan 23, 2019, at 9:24 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
>>
>> On 1/22/19 6:26 PM, Igor Ignatyev wrote:
>>> http://cr.openjdk.java.net/~iignatyev//8158646/webrev.00/index.html
>>>> 64 lines changed: 23 ins; 6 del; 35 mod;
>>> Hi all,
>>> could you please review this small fix for jit-tester?
>>> the bug was caused by TypeList not being fully cleared b/w generation. we only remove classes which starts w/ "Test_", so we don't remove "basic" classes, e.g. Runnable, and don't clean their 'children'. in most cases, this is fine, as each generation will use only its own Test_N_* classes so having Test_M_* (M != N) classes as Runnable's children has no impact besides garbage in memory, however, if we get an error during Test_N generation we will redo generation for the same N, and in such cases, previous children of "basic" classes (read Runnable) cause incompatible types. the fix is to remove "Test_" classes from the children.
>>
>> ok
>>
>>> besides the fix for the bug, the patch also include the following small clean ups:
>>>   - use DIST_JAR var value instead of 'JAR' string constant in makefile
>>
>> ok
>>
>>>   - change default target testbase dir
>>
>> ok
>>
>>>   - make sure jaotc is always run w/ X-mixed regardless of "external" vm flags
>>
>> Also would be nice to add -ea -esa too if they are not used yet.
>>
>>>   - add -Xcomp to all the generator tests
>>
>> why you need -Xcomp?
>>
>>>   - use tmp directory for class files
>>
>> Will it work on Windows which has issues with tmp dir? There was discussion about it recently.
>>
>>>   - check javac error code
>>
>> ok
>>
>>>   - optimize getAllParents/getAllChildren to call getAllParents/getAllChildren only if a class hasn't been added yet
>>
>> ok
>>
>> Thanks,
>> Vladimir
>>
>>> webrev: http://cr.openjdk.java.net/~iignatyev//8158646/webrev.00/index.html
>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8158646
>>> testing: generated 1000 tests, all can be compiled and work fine
>>> Thanks,
>>> -- Igor
> 

From igor.ignatyev at oracle.com  Wed Jan 23 18:34:59 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 23 Jan 2019 10:34:59 -0800
Subject: RFR(S) [12] : 8216180 : [AOT]
 compiler/intrinsics/bigInteger/TestMulAdd.java crashed with AOT enabled
In-Reply-To: <c337d8af-8dfc-f0e3-93e1-2185d622561f@oracle.com>
References: <28BD4C78-3A3F-4B36-8D93-B9F520B08E34@oracle.com>
 <c337d8af-8dfc-f0e3-93e1-2185d622561f@oracle.com>
Message-ID: <E257AB68-E95D-4E79-BBB7-0BCEE9CCEE39@oracle.com>


> On Jan 23, 2019, at 10:28 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
> 
> I assume tests don't use -XX:AOTLibrary= flag but load them from default location in JDK. Right?
that's correct, the runs where the test fails used libraries from the default location.

> Can we instead skip such tests if any AOT library is loaded? We can check it with PrintAOT or new ouptu or new WB API.
> Relying on env variable is not robust I think.

these env variables are part of run-test "official" contract, so I believe it's safe to use them. the only problem I see w/ such approach is runs w/ jdk-images which include AOT'ed modules in them, but there are no such images, and w/ current state of AOT, they aren't actually possible however if you have strong objections, I can look into other ways to retrieve this information.

-- Igor
> 
> Thanks,
> Vladimir
> 
> On 1/23/19 9:36 AM, Igor Ignatyev wrote:
>> http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>>> 32 lines changed: 32 ins; 0 del; 0 mod;
>> Hi all,
>> could you please review this small patch which exclude TestMulAdd test from execution if java.base is AOT'ed compiled?
>> the test disables some intrinsics, and if it's run w/ AOT'ed java.base there these intrinsics are enabled (which is the most common, if not the only, case) we get crash. the fix introduces new @requires value -- vm.aot.modules which contains comma-separated list of AOT'ed modules and use it to skip this test if java.base is one of them.
>> webrev: http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>> JBS: https://bugs.openjdk.java.net/browse/JDK-8216180
>> testing: compiler/intrinsics/bigInteger tests on linux-x64 w/ JTREG=AOT_MODULES=java.base, TEST_OPTS_AOT_MODULES=java.base and w/o any extra make args
>> Thanks,
>> -- Igor


From dean.long at oracle.com  Wed Jan 23 20:24:30 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Wed, 23 Jan 2019 12:24:30 -0800
Subject: [12] RFR: 8213825: assert(false) failed: Non-balanced monitor
 enter/exit! Likely JNI locking
In-Reply-To: <a2f5c24e-4a09-1122-2cb8-d26aebc42a1e@oracle.com>
References: <b3648bb2-0907-453c-c9c8-3e79da38dc00@oracle.com>
 <a2f5c24e-4a09-1122-2cb8-d26aebc42a1e@oracle.com>
Message-ID: <257d62c0-c0f9-394e-1cbb-0f33b3a1d365@oracle.com>

Looks good to me too.? Nice job tracking this down, Jamsheed!

dl

On 1/23/19 9:11 AM, Vladimir Kozlov wrote:
> Hi Jamsheed,
>
> Fix is good. I approved it for JDK 12 push.
>
> Thanks,
> Vladimir
>
> On 1/23/19 6:08 AM, Jamsheed wrote:
>> Hi,
>>
>> Request for review
>>
>> bug: https://bugs.openjdk.java.net/browse/JDK-8213825
>>
>> webrev: http://cr.openjdk.java.net/~jcm/8213825/webrev.00/index.html
>>
>> Bug & Fix Desc:
>>
>> if markword load has sfpt as control i/p(i.e synchronizations near a 
>> safepoint), it skips sfpt assuming sfptOp wouldn't write to markword 
>> memory
>> fix: not to skip sfpt for markword loads.
>>
>> tests: hs-tier1-5,? hs-precheckin-comp
>>
>> Best regards,
>>
>> Jamsheed
>>


From igor.ignatyev at oracle.com  Wed Jan 23 22:12:49 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 23 Jan 2019 14:12:49 -0800
Subject: RFR(S) [12] : 8158646 : [jittester] generated tests may not
 compile by javac
In-Reply-To: <8e89ff21-d3d4-3f95-ef54-1aeb55e2aeac@oracle.com>
References: <6D91688A-01A0-46E0-A304-9F39E16F574E@oracle.com>
 <87177e44-0b60-c385-afb7-eeedd3d29829@oracle.com>
 <3A45E5A7-1351-47FD-8633-065CF639BF2A@oracle.com>
 <8e89ff21-d3d4-3f95-ef54-1aeb55e2aeac@oracle.com>
Message-ID: <BAA2B878-33C3-47D5-914B-C34E6F259B75@oracle.com>

Vladimir,

we can always specify '-ea -esa' in our task definitions if we want, but baking them into generated tests will affect all executions of these tests, and seems to be inadequate. as testing jaotc tool isn't the goal of these tests, I'd prefer not to add more jaotc-specific than necessary. what do you think?

-- Igor

> On Jan 23, 2019, at 10:36 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
> 
> On 1/23/19 9:46 AM, Igor Ignatyev wrote:
>> Hi Vladimir,
>> thanks for your review!
>>>>  - make sure jaotc is always run w/ X-mixed regardless of "external" vm flags
>>> Also would be nice to add -ea -esa too if they are not used yet.
>> -Xmixed was added to "speed-up" compilation in case external flags has Xcomp. from my point of view, it's better if '-ea -esa' are provided during test runs, as in some cases you might want to run jaotc w/o them.
> 
> I got it about -Xmixed. I put aot tests to noxcomp group for our CI testing.
> 
> To always use '-ea -esa' with jaotc during testing is good I think. We have them by default in our hs-comp testing tasks definitions. I thought to have them here is also good if this testing does not use flags from task definitions.
>>>>  - add -Xcomp to all the generator tests
>>> why you need -Xcomp?
>> b/c jit-tester is supposed to compare results of interpreted execution (saved in .gold.* files) w/ the result from compilers, so generated tests must be run w/ Xcomp, otherwise we will comparing one results from interpreter w/ the results from interpreter.
> 
> Got it.
> 
>>>>  - use tmp directory for class files
>>> Will it work on Windows which has issues with tmp dir? There was discussion about it recently.
>> we use tmp dir only in the test generator which can be run on any platform, it doesn't have to be run on the same host/platform as actual test execution. in fact the preferred usage model of jit-tester is to pre-generate test corpus and reuse it.
> 
> Okay.
> 
> thanks,
> Vladimir
> 
>> Thanks,
>> -- Igor
>>> On Jan 23, 2019, at 9:24 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
>>> 
>>> On 1/22/19 6:26 PM, Igor Ignatyev wrote:
>>>> http://cr.openjdk.java.net/~iignatyev//8158646/webrev.00/index.html
>>>>> 64 lines changed: 23 ins; 6 del; 35 mod;
>>>> Hi all,
>>>> could you please review this small fix for jit-tester?
>>>> the bug was caused by TypeList not being fully cleared b/w generation. we only remove classes which starts w/ "Test_", so we don't remove "basic" classes, e.g. Runnable, and don't clean their 'children'. in most cases, this is fine, as each generation will use only its own Test_N_* classes so having Test_M_* (M != N) classes as Runnable's children has no impact besides garbage in memory, however, if we get an error during Test_N generation we will redo generation for the same N, and in such cases, previous children of "basic" classes (read Runnable) cause incompatible types. the fix is to remove "Test_" classes from the children.
>>> 
>>> ok
>>> 
>>>> besides the fix for the bug, the patch also include the following small clean ups:
>>>>  - use DIST_JAR var value instead of 'JAR' string constant in makefile
>>> 
>>> ok
>>> 
>>>>  - change default target testbase dir
>>> 
>>> ok
>>> 
>>>>  - make sure jaotc is always run w/ X-mixed regardless of "external" vm flags
>>> 
>>> Also would be nice to add -ea -esa too if they are not used yet.
>>> 
>>>>  - add -Xcomp to all the generator tests
>>> 
>>> why you need -Xcomp?
>>> 
>>>>  - use tmp directory for class files
>>> 
>>> Will it work on Windows which has issues with tmp dir? There was discussion about it recently.
>>> 
>>>>  - check javac error code
>>> 
>>> ok
>>> 
>>>>  - optimize getAllParents/getAllChildren to call getAllParents/getAllChildren only if a class hasn't been added yet
>>> 
>>> ok
>>> 
>>> Thanks,
>>> Vladimir
>>> 
>>>> webrev: http://cr.openjdk.java.net/~iignatyev//8158646/webrev.00/index.html
>>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8158646
>>>> testing: generated 1000 tests, all can be compiled and work fine
>>>> Thanks,
>>>> -- Igor


From igor.ignatyev at oracle.com  Wed Jan 23 22:13:18 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 23 Jan 2019 14:13:18 -0800
Subject: RFR(S) [12] : 8216180 : [AOT]
 compiler/intrinsics/bigInteger/TestMulAdd.java crashed with AOT enabled
In-Reply-To: <E257AB68-E95D-4E79-BBB7-0BCEE9CCEE39@oracle.com>
References: <28BD4C78-3A3F-4B36-8D93-B9F520B08E34@oracle.com>
 <c337d8af-8dfc-f0e3-93e1-2185d622561f@oracle.com>
 <E257AB68-E95D-4E79-BBB7-0BCEE9CCEE39@oracle.com>
Message-ID: <74C74200-60AD-4C99-913B-A06752EBE965@oracle.com>

Vladimir,

I gave it a bit more thoughts, and am inclining to agree that replying on env. variables is indeed fragile. so I've decided to go w/ a new WB method --http://cr.openjdk.java.net/~iignatyev//8216180/webrev.01/index.html <http://cr.openjdk.java.net/~iignatyev//8216180/webrev.01/index.html>

(testing is in-progress)

Thanks,
-- Igor

> On Jan 23, 2019, at 10:34 AM, Igor Ignatyev <igor.ignatyev at oracle.com> wrote:
> 
> 
> 
>> On Jan 23, 2019, at 10:28 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
>> 
>> I assume tests don't use -XX:AOTLibrary= flag but load them from default location in JDK. Right?
> that's correct, the runs where the test fails used libraries from the default location.
> 
>> Can we instead skip such tests if any AOT library is loaded? We can check it with PrintAOT or new ouptu or new WB API.
>> Relying on env variable is not robust I think.
> 
> these env variables are part of run-test "official" contract, so I believe it's safe to use them. the only problem I see w/ such approach is runs w/ jdk-images which include AOT'ed modules in them, but there are no such images, and w/ current state of AOT, they aren't actually possible however if you have strong objections, I can look into other ways to retrieve this information.
> 
> -- Igor
>> 
>> Thanks,
>> Vladimir
>> 
>> On 1/23/19 9:36 AM, Igor Ignatyev wrote:
>>> http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>>>> 32 lines changed: 32 ins; 0 del; 0 mod;
>>> Hi all,
>>> could you please review this small patch which exclude TestMulAdd test from execution if java.base is AOT'ed compiled?
>>> the test disables some intrinsics, and if it's run w/ AOT'ed java.base there these intrinsics are enabled (which is the most common, if not the only, case) we get crash. the fix introduces new @requires value -- vm.aot.modules which contains comma-separated list of AOT'ed modules and use it to skip this test if java.base is one of them.
>>> webrev: http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8216180
>>> testing: compiler/intrinsics/bigInteger tests on linux-x64 w/ JTREG=AOT_MODULES=java.base, TEST_OPTS_AOT_MODULES=java.base and w/o any extra make args
>>> Thanks,
>>> -- Igor
> 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190123/f0da03de/attachment.html>

From gromero at linux.vnet.ibm.com  Wed Jan 23 22:17:59 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Wed, 23 Jan 2019 20:17:59 -0200
Subject: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
In-Reply-To: <AM6PR02MB47889B74BC2F81B06ACEC28C9A9F0@AM6PR02MB4788.eurprd02.prod.outlook.com>
References: <AM6PR02MB47889B74BC2F81B06ACEC28C9A9F0@AM6PR02MB4788.eurprd02.prod.outlook.com>
Message-ID: <243b17be-e1a3-7b68-1e72-9a114552860c@linux.vnet.ibm.com>

Hi Martin,

On 01/21/2019 04:07 PM, Doerr, Martin wrote:
> PPC64 currently contains static tables for CRC32/CRC32C calculations. We only need some of them depending on Endianess and on whether vector instructions are available or not.
> We can get rid of quite some code when we generate these constants at startup as we already do for the vector version.
> In addition, we can save one register in the vector case because we can use one constants pointer for all related constants.
> Webrev:
> http://cr.openjdk.java.net/~mdoerr/8217459_ppc64_crc_consts/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8217459_ppc64_crc_consts/webrev.00/>

Thanks for the clean-up. Change looks good!

It's good to see fold_8bit_crc32 and kernel_crc32_1byte going away (I just
noted them recently so I missed both in my previous clean-up). And also
the static table simplification.

I tested the change with different array sizes and byte values with and
without vpmsum in the CPU, i.e. has_vpmsumb() = false, and found no issues.

Only a nit: should we update the following comment and replace 'timesXtoThe32'
by something better, maybe 'table'? That name doesn't look much meaningful in the
current context and seems taken from the native code for java.util.zip.CRC32:

3902 /**
3903  * uint32_t crc;
3904  * timesXtoThe32[crc & 0xFF] ^ (crc >> 8);
3905  */
3906 void MacroAssembler::fold_byte_crc32(Register crc, Register val, Register table, Register tmp) {


Best regards,
Gustavo


From vladimir.kozlov at oracle.com  Wed Jan 23 22:32:52 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 23 Jan 2019 14:32:52 -0800
Subject: RFR(S) [12] : 8158646 : [jittester] generated tests may not
 compile by javac
In-Reply-To: <BAA2B878-33C3-47D5-914B-C34E6F259B75@oracle.com>
References: <6D91688A-01A0-46E0-A304-9F39E16F574E@oracle.com>
 <87177e44-0b60-c385-afb7-eeedd3d29829@oracle.com>
 <3A45E5A7-1351-47FD-8633-065CF639BF2A@oracle.com>
 <8e89ff21-d3d4-3f95-ef54-1aeb55e2aeac@oracle.com>
 <BAA2B878-33C3-47D5-914B-C34E6F259B75@oracle.com>
Message-ID: <29ab8f7c-9301-d001-bb6f-7537322abda4@oracle.com>

On 1/23/19 2:12 PM, Igor Ignatyev wrote:
> Vladimir,
> 
> we can always specify '-ea -esa' in our task definitions if we want, but baking them into generated tests will affect all executions of these tests, and seems to be inadequate. as testing jaotc tool isn't the goal of these tests, I'd prefer not to add more jaotc-specific than necessary. what do you think?

I only suggested to add these flags to command line in AotTestGeneratorsFactory.java where jaotc is used.
It may help debug intermittent failures if there are issues with AOTed code.

But I am fine if these tests run in Mach5 with these flags set in task definition when jaotc is used.

Vladimir

> 
> -- Igor
> 
>> On Jan 23, 2019, at 10:36 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
>>
>> On 1/23/19 9:46 AM, Igor Ignatyev wrote:
>>> Hi Vladimir,
>>> thanks for your review!
>>>>>   - make sure jaotc is always run w/ X-mixed regardless of "external" vm flags
>>>> Also would be nice to add -ea -esa too if they are not used yet.
>>> -Xmixed was added to "speed-up" compilation in case external flags has Xcomp. from my point of view, it's better if '-ea -esa' are provided during test runs, as in some cases you might want to run jaotc w/o them.
>>
>> I got it about -Xmixed. I put aot tests to noxcomp group for our CI testing.
>>
>> To always use '-ea -esa' with jaotc during testing is good I think. We have them by default in our hs-comp testing tasks definitions. I thought to have them here is also good if this testing does not use flags from task definitions.
>>>>>   - add -Xcomp to all the generator tests
>>>> why you need -Xcomp?
>>> b/c jit-tester is supposed to compare results of interpreted execution (saved in .gold.* files) w/ the result from compilers, so generated tests must be run w/ Xcomp, otherwise we will comparing one results from interpreter w/ the results from interpreter.
>>
>> Got it.
>>
>>>>>   - use tmp directory for class files
>>>> Will it work on Windows which has issues with tmp dir? There was discussion about it recently.
>>> we use tmp dir only in the test generator which can be run on any platform, it doesn't have to be run on the same host/platform as actual test execution. in fact the preferred usage model of jit-tester is to pre-generate test corpus and reuse it.
>>
>> Okay.
>>
>> thanks,
>> Vladimir
>>
>>> Thanks,
>>> -- Igor
>>>> On Jan 23, 2019, at 9:24 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
>>>>
>>>> On 1/22/19 6:26 PM, Igor Ignatyev wrote:
>>>>> http://cr.openjdk.java.net/~iignatyev//8158646/webrev.00/index.html
>>>>>> 64 lines changed: 23 ins; 6 del; 35 mod;
>>>>> Hi all,
>>>>> could you please review this small fix for jit-tester?
>>>>> the bug was caused by TypeList not being fully cleared b/w generation. we only remove classes which starts w/ "Test_", so we don't remove "basic" classes, e.g. Runnable, and don't clean their 'children'. in most cases, this is fine, as each generation will use only its own Test_N_* classes so having Test_M_* (M != N) classes as Runnable's children has no impact besides garbage in memory, however, if we get an error during Test_N generation we will redo generation for the same N, and in such cases, previous children of "basic" classes (read Runnable) cause incompatible types. the fix is to remove "Test_" classes from the children.
>>>>
>>>> ok
>>>>
>>>>> besides the fix for the bug, the patch also include the following small clean ups:
>>>>>   - use DIST_JAR var value instead of 'JAR' string constant in makefile
>>>>
>>>> ok
>>>>
>>>>>   - change default target testbase dir
>>>>
>>>> ok
>>>>
>>>>>   - make sure jaotc is always run w/ X-mixed regardless of "external" vm flags
>>>>
>>>> Also would be nice to add -ea -esa too if they are not used yet.
>>>>
>>>>>   - add -Xcomp to all the generator tests
>>>>
>>>> why you need -Xcomp?
>>>>
>>>>>   - use tmp directory for class files
>>>>
>>>> Will it work on Windows which has issues with tmp dir? There was discussion about it recently.
>>>>
>>>>>   - check javac error code
>>>>
>>>> ok
>>>>
>>>>>   - optimize getAllParents/getAllChildren to call getAllParents/getAllChildren only if a class hasn't been added yet
>>>>
>>>> ok
>>>>
>>>> Thanks,
>>>> Vladimir
>>>>
>>>>> webrev: http://cr.openjdk.java.net/~iignatyev//8158646/webrev.00/index.html
>>>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8158646
>>>>> testing: generated 1000 tests, all can be compiled and work fine
>>>>> Thanks,
>>>>> -- Igor
> 

From vladimir.kozlov at oracle.com  Wed Jan 23 22:41:10 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 23 Jan 2019 14:41:10 -0800
Subject: RFR(S) [12] : 8216180 : [AOT]
 compiler/intrinsics/bigInteger/TestMulAdd.java crashed with AOT enabled
In-Reply-To: <74C74200-60AD-4C99-913B-A06752EBE965@oracle.com>
References: <28BD4C78-3A3F-4B36-8D93-B9F520B08E34@oracle.com>
 <c337d8af-8dfc-f0e3-93e1-2185d622561f@oracle.com>
 <E257AB68-E95D-4E79-BBB7-0BCEE9CCEE39@oracle.com>
 <74C74200-60AD-4C99-913B-A06752EBE965@oracle.com>
Message-ID: <793023c7-61d9-8bf1-09f9-f046ea7c4d36@oracle.com>

It should be AOTLoader::heaps_count(). Otherwise it is very good.

Thanks,
Vladimir

On 1/23/19 2:13 PM, Igor Ignatyev wrote:
> Vladimir,
> 
> I gave it a bit more thoughts, and am inclining to agree that replying on env. variables is indeed fragile. so I've 
> decided to go w/ a new WB method --http://cr.openjdk.java.net/~iignatyev//8216180/webrev.01/index.html
> 
> (testing is in-progress)
> 
> Thanks,
> -- Igor
> 
>> On Jan 23, 2019, at 10:34 AM, Igor Ignatyev <igor.ignatyev at oracle.com <mailto:igor.ignatyev at oracle.com>> wrote:
>>
>>
>>
>>> On Jan 23, 2019, at 10:28 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com <mailto:vladimir.kozlov at oracle.com>> wrote:
>>>
>>> I assume tests don't use -XX:AOTLibrary= flag but load them from default location in JDK. Right?
>> that's correct, the runs where the test fails used libraries from the default location.
>>
>>> Can we instead skip such tests if any AOT library is loaded? We can check it with PrintAOT or new ouptu or new WB API.
>>> Relying on env variable is not robust I think.
>>
>> these env variables are part of run-test "official" contract, so I believe it's safe to use them. the only problem I 
>> see w/ such approach is runs w/ jdk-images which include AOT'ed modules in them, but there are no such images, and w/ 
>> current state of AOT, they aren't actually possible however if you have strong objections, I can look into other ways 
>> to retrieve this information.
>>
>> -- Igor
>>>
>>> Thanks,
>>> Vladimir
>>>
>>> On 1/23/19 9:36 AM, Igor Ignatyev wrote:
>>>> http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>>>>> 32 lines changed: 32 ins; 0 del; 0 mod;
>>>> Hi all,
>>>> could you please review this small patch which exclude TestMulAdd test from execution if java.base is AOT'ed compiled?
>>>> the test disables some intrinsics, and if it's run w/ AOT'ed java.base there these intrinsics are enabled (which is 
>>>> the most common, if not the only, case) we get crash. the fix introduces new @requires value -- vm.aot.modules which 
>>>> contains comma-separated list of AOT'ed modules and use it to skip this test if java.base is one of them.
>>>> webrev: http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8216180
>>>> testing: compiler/intrinsics/bigInteger tests on linux-x64 w/ JTREG=AOT_MODULES=java.base, 
>>>> TEST_OPTS_AOT_MODULES=java.base and w/o any extra make args
>>>> Thanks,
>>>> -- Igor
>>
> 

From igor.ignatyev at oracle.com  Wed Jan 23 22:44:27 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 23 Jan 2019 14:44:27 -0800
Subject: RFR(S) [12] : 8216180 : [AOT]
 compiler/intrinsics/bigInteger/TestMulAdd.java crashed with AOT enabled
In-Reply-To: <793023c7-61d9-8bf1-09f9-f046ea7c4d36@oracle.com>
References: <28BD4C78-3A3F-4B36-8D93-B9F520B08E34@oracle.com>
 <c337d8af-8dfc-f0e3-93e1-2185d622561f@oracle.com>
 <E257AB68-E95D-4E79-BBB7-0BCEE9CCEE39@oracle.com>
 <74C74200-60AD-4C99-913B-A06752EBE965@oracle.com>
 <793023c7-61d9-8bf1-09f9-f046ea7c4d36@oracle.com>
Message-ID: <C2A817B7-6F7C-446F-A0A6-1D54FA4BFB7E@oracle.com>

you meant libraries_count, right?

-- Igor

> On Jan 23, 2019, at 2:41 PM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
> 
> It should be AOTLoader::heaps_count(). Otherwise it is very good.
> 
> Thanks,
> Vladimir
> 
> On 1/23/19 2:13 PM, Igor Ignatyev wrote:
>> Vladimir,
>> I gave it a bit more thoughts, and am inclining to agree that replying on env. variables is indeed fragile. so I've decided to go w/ a new WB method --http://cr.openjdk.java.net/~iignatyev//8216180/webrev.01/index.html
>> (testing is in-progress)
>> Thanks,
>> -- Igor
>>> On Jan 23, 2019, at 10:34 AM, Igor Ignatyev <igor.ignatyev at oracle.com <mailto:igor.ignatyev at oracle.com>> wrote:
>>> 
>>> 
>>> 
>>>> On Jan 23, 2019, at 10:28 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com <mailto:vladimir.kozlov at oracle.com>> wrote:
>>>> 
>>>> I assume tests don't use -XX:AOTLibrary= flag but load them from default location in JDK. Right?
>>> that's correct, the runs where the test fails used libraries from the default location.
>>> 
>>>> Can we instead skip such tests if any AOT library is loaded? We can check it with PrintAOT or new ouptu or new WB API.
>>>> Relying on env variable is not robust I think.
>>> 
>>> these env variables are part of run-test "official" contract, so I believe it's safe to use them. the only problem I see w/ such approach is runs w/ jdk-images which include AOT'ed modules in them, but there are no such images, and w/ current state of AOT, they aren't actually possible however if you have strong objections, I can look into other ways to retrieve this information.
>>> 
>>> -- Igor
>>>> 
>>>> Thanks,
>>>> Vladimir
>>>> 
>>>> On 1/23/19 9:36 AM, Igor Ignatyev wrote:
>>>>> http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>>>>>> 32 lines changed: 32 ins; 0 del; 0 mod;
>>>>> Hi all,
>>>>> could you please review this small patch which exclude TestMulAdd test from execution if java.base is AOT'ed compiled?
>>>>> the test disables some intrinsics, and if it's run w/ AOT'ed java.base there these intrinsics are enabled (which is the most common, if not the only, case) we get crash. the fix introduces new @requires value -- vm.aot.modules which contains comma-separated list of AOT'ed modules and use it to skip this test if java.base is one of them.
>>>>> webrev: http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>>>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8216180
>>>>> testing: compiler/intrinsics/bigInteger tests on linux-x64 w/ JTREG=AOT_MODULES=java.base, TEST_OPTS_AOT_MODULES=java.base and w/o any extra make args
>>>>> Thanks,
>>>>> -- Igor
>>> 


From vladimir.kozlov at oracle.com  Wed Jan 23 22:51:19 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 23 Jan 2019 14:51:19 -0800
Subject: RFR(S) [12] : 8216180 : [AOT]
 compiler/intrinsics/bigInteger/TestMulAdd.java crashed with AOT enabled
In-Reply-To: <C2A817B7-6F7C-446F-A0A6-1D54FA4BFB7E@oracle.com>
References: <28BD4C78-3A3F-4B36-8D93-B9F520B08E34@oracle.com>
 <c337d8af-8dfc-f0e3-93e1-2185d622561f@oracle.com>
 <E257AB68-E95D-4E79-BBB7-0BCEE9CCEE39@oracle.com>
 <74C74200-60AD-4C99-913B-A06752EBE965@oracle.com>
 <793023c7-61d9-8bf1-09f9-f046ea7c4d36@oracle.com>
 <C2A817B7-6F7C-446F-A0A6-1D54FA4BFB7E@oracle.com>
Message-ID: <666b08d3-6074-833f-4f6f-e3db6d488f5b@oracle.com>

No, heaps_count(). Some libraries could be invalid (AOT compilation config was different, for example) and are not used:

http://hg.openjdk.java.net/jdk/jdk/file/e3ed96060992/src/hotspot/share/aot/aotLoader.cpp#l190

May be we should just check UseAOT flag? If no AOT libraries are loaded ot they are invalid UseAOT will be set to false.

Vladimir

On 1/23/19 2:44 PM, Igor Ignatyev wrote:
> you meant libraries_count, right?
> 
> -- Igor
> 
>> On Jan 23, 2019, at 2:41 PM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
>>
>> It should be AOTLoader::heaps_count(). Otherwise it is very good.
>>
>> Thanks,
>> Vladimir
>>
>> On 1/23/19 2:13 PM, Igor Ignatyev wrote:
>>> Vladimir,
>>> I gave it a bit more thoughts, and am inclining to agree that replying on env. variables is indeed fragile. so I've decided to go w/ a new WB method --http://cr.openjdk.java.net/~iignatyev//8216180/webrev.01/index.html
>>> (testing is in-progress)
>>> Thanks,
>>> -- Igor
>>>> On Jan 23, 2019, at 10:34 AM, Igor Ignatyev <igor.ignatyev at oracle.com <mailto:igor.ignatyev at oracle.com>> wrote:
>>>>
>>>>
>>>>
>>>>> On Jan 23, 2019, at 10:28 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com <mailto:vladimir.kozlov at oracle.com>> wrote:
>>>>>
>>>>> I assume tests don't use -XX:AOTLibrary= flag but load them from default location in JDK. Right?
>>>> that's correct, the runs where the test fails used libraries from the default location.
>>>>
>>>>> Can we instead skip such tests if any AOT library is loaded? We can check it with PrintAOT or new ouptu or new WB API.
>>>>> Relying on env variable is not robust I think.
>>>>
>>>> these env variables are part of run-test "official" contract, so I believe it's safe to use them. the only problem I see w/ such approach is runs w/ jdk-images which include AOT'ed modules in them, but there are no such images, and w/ current state of AOT, they aren't actually possible however if you have strong objections, I can look into other ways to retrieve this information.
>>>>
>>>> -- Igor
>>>>>
>>>>> Thanks,
>>>>> Vladimir
>>>>>
>>>>> On 1/23/19 9:36 AM, Igor Ignatyev wrote:
>>>>>> http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>>>>>>> 32 lines changed: 32 ins; 0 del; 0 mod;
>>>>>> Hi all,
>>>>>> could you please review this small patch which exclude TestMulAdd test from execution if java.base is AOT'ed compiled?
>>>>>> the test disables some intrinsics, and if it's run w/ AOT'ed java.base there these intrinsics are enabled (which is the most common, if not the only, case) we get crash. the fix introduces new @requires value -- vm.aot.modules which contains comma-separated list of AOT'ed modules and use it to skip this test if java.base is one of them.
>>>>>> webrev: http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>>>>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8216180
>>>>>> testing: compiler/intrinsics/bigInteger tests on linux-x64 w/ JTREG=AOT_MODULES=java.base, TEST_OPTS_AOT_MODULES=java.base and w/o any extra make args
>>>>>> Thanks,
>>>>>> -- Igor
>>>>
> 

From igor.ignatyev at oracle.com  Thu Jan 24 00:07:46 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 23 Jan 2019 16:07:46 -0800
Subject: RFR(S) [12] : 8216180 : [AOT]
 compiler/intrinsics/bigInteger/TestMulAdd.java crashed with AOT enabled
In-Reply-To: <666b08d3-6074-833f-4f6f-e3db6d488f5b@oracle.com>
References: <28BD4C78-3A3F-4B36-8D93-B9F520B08E34@oracle.com>
 <c337d8af-8dfc-f0e3-93e1-2185d622561f@oracle.com>
 <E257AB68-E95D-4E79-BBB7-0BCEE9CCEE39@oracle.com>
 <74C74200-60AD-4C99-913B-A06752EBE965@oracle.com>
 <793023c7-61d9-8bf1-09f9-f046ea7c4d36@oracle.com>
 <C2A817B7-6F7C-446F-A0A6-1D54FA4BFB7E@oracle.com>
 <666b08d3-6074-833f-4f6f-e3db6d488f5b@oracle.com>
Message-ID: <6E46BBB8-2DE4-43E7-A698-7F67F564A27E@oracle.com>

UseAOT will be changed to false, only if UseAOT wasn't specified in the command line, so we can't use it reliably to determine if there are any loaded AOT libraries. 

I've changed WB_AotLibrariesCount to use AOTLoader::heaps_count, retested the fix locally, it works fine. testing it in mach5.

Thanks,
-- Igor

> On Jan 23, 2019, at 2:51 PM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
> 
> No, heaps_count(). Some libraries could be invalid (AOT compilation config was different, for example) and are not used:
> 
> http://hg.openjdk.java.net/jdk/jdk/file/e3ed96060992/src/hotspot/share/aot/aotLoader.cpp#l190
> 
> May be we should just check UseAOT flag? If no AOT libraries are loaded ot they are invalid UseAOT will be set to false.
> 
> Vladimir
> 
> On 1/23/19 2:44 PM, Igor Ignatyev wrote:
>> you meant libraries_count, right?
>> -- Igor
>>> On Jan 23, 2019, at 2:41 PM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
>>> 
>>> It should be AOTLoader::heaps_count(). Otherwise it is very good.
>>> 
>>> Thanks,
>>> Vladimir
>>> 
>>> On 1/23/19 2:13 PM, Igor Ignatyev wrote:
>>>> Vladimir,
>>>> I gave it a bit more thoughts, and am inclining to agree that replying on env. variables is indeed fragile. so I've decided to go w/ a new WB method --http://cr.openjdk.java.net/~iignatyev//8216180/webrev.01/index.html
>>>> (testing is in-progress)
>>>> Thanks,
>>>> -- Igor
>>>>> On Jan 23, 2019, at 10:34 AM, Igor Ignatyev <igor.ignatyev at oracle.com <mailto:igor.ignatyev at oracle.com>> wrote:
>>>>> 
>>>>> 
>>>>> 
>>>>>> On Jan 23, 2019, at 10:28 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com <mailto:vladimir.kozlov at oracle.com>> wrote:
>>>>>> 
>>>>>> I assume tests don't use -XX:AOTLibrary= flag but load them from default location in JDK. Right?
>>>>> that's correct, the runs where the test fails used libraries from the default location.
>>>>> 
>>>>>> Can we instead skip such tests if any AOT library is loaded? We can check it with PrintAOT or new ouptu or new WB API.
>>>>>> Relying on env variable is not robust I think.
>>>>> 
>>>>> these env variables are part of run-test "official" contract, so I believe it's safe to use them. the only problem I see w/ such approach is runs w/ jdk-images which include AOT'ed modules in them, but there are no such images, and w/ current state of AOT, they aren't actually possible however if you have strong objections, I can look into other ways to retrieve this information.
>>>>> 
>>>>> -- Igor
>>>>>> 
>>>>>> Thanks,
>>>>>> Vladimir
>>>>>> 
>>>>>> On 1/23/19 9:36 AM, Igor Ignatyev wrote:
>>>>>>> http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>>>>>>>> 32 lines changed: 32 ins; 0 del; 0 mod;
>>>>>>> Hi all,
>>>>>>> could you please review this small patch which exclude TestMulAdd test from execution if java.base is AOT'ed compiled?
>>>>>>> the test disables some intrinsics, and if it's run w/ AOT'ed java.base there these intrinsics are enabled (which is the most common, if not the only, case) we get crash. the fix introduces new @requires value -- vm.aot.modules which contains comma-separated list of AOT'ed modules and use it to skip this test if java.base is one of them.
>>>>>>> webrev: http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>>>>>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8216180
>>>>>>> testing: compiler/intrinsics/bigInteger tests on linux-x64 w/ JTREG=AOT_MODULES=java.base, TEST_OPTS_AOT_MODULES=java.base and w/o any extra make args
>>>>>>> Thanks,
>>>>>>> -- Igor
>>>>> 


From igor.veresov at oracle.com  Thu Jan 24 00:15:02 2019
From: igor.veresov at oracle.com (Igor Veresov)
Date: Wed, 23 Jan 2019 16:15:02 -0800
Subject: [12] RFR(XS) 8217678: [AOT] jck Math/IncrementExact and
 Math/DecrementExact tests fail when test classes are AOTed
Message-ID: <0C581623-E5D1-4009-8B1B-E21023DE408A@oracle.com>

When fixing JDK-8196568  I must?ve thought that a folding of an exact math node would produce a deopt. But obviously it doesn?t.
Webrev: http://cr.openjdk.java.net/~iveresov/8217678/webrev.00/
JBS: https://bugs.openjdk.java.net/browse/JDK-8217678


Please review and approve.

Thanks!
igor


From vladimir.kozlov at oracle.com  Thu Jan 24 00:16:29 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 23 Jan 2019 16:16:29 -0800
Subject: RFR(S) [12] : 8216180 : [AOT]
 compiler/intrinsics/bigInteger/TestMulAdd.java crashed with AOT enabled
In-Reply-To: <6E46BBB8-2DE4-43E7-A698-7F67F564A27E@oracle.com>
References: <28BD4C78-3A3F-4B36-8D93-B9F520B08E34@oracle.com>
 <c337d8af-8dfc-f0e3-93e1-2185d622561f@oracle.com>
 <E257AB68-E95D-4E79-BBB7-0BCEE9CCEE39@oracle.com>
 <74C74200-60AD-4C99-913B-A06752EBE965@oracle.com>
 <793023c7-61d9-8bf1-09f9-f046ea7c4d36@oracle.com>
 <C2A817B7-6F7C-446F-A0A6-1D54FA4BFB7E@oracle.com>
 <666b08d3-6074-833f-4f6f-e3db6d488f5b@oracle.com>
 <6E46BBB8-2DE4-43E7-A698-7F67F564A27E@oracle.com>
Message-ID: <be2f9548-8783-1b02-ac6a-e87c8d063f29@oracle.com>

On 1/23/19 4:07 PM, Igor Ignatyev wrote:
> UseAOT will be changed to false, only if UseAOT wasn't specified in the command line, so we can't use it reliably to determine if there are any loaded AOT libraries.

Okay.

> 
> I've changed WB_AotLibrariesCount to use AOTLoader::heaps_count, retested the fix locally, it works fine. testing it in mach5.

Good.

Thanks,
Vladimir

> 
> Thanks,
> -- Igor
> 
>> On Jan 23, 2019, at 2:51 PM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
>>
>> No, heaps_count(). Some libraries could be invalid (AOT compilation config was different, for example) and are not used:
>>
>> http://hg.openjdk.java.net/jdk/jdk/file/e3ed96060992/src/hotspot/share/aot/aotLoader.cpp#l190
>>
>> May be we should just check UseAOT flag? If no AOT libraries are loaded ot they are invalid UseAOT will be set to false.
>>
>> Vladimir
>>
>> On 1/23/19 2:44 PM, Igor Ignatyev wrote:
>>> you meant libraries_count, right?
>>> -- Igor
>>>> On Jan 23, 2019, at 2:41 PM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
>>>>
>>>> It should be AOTLoader::heaps_count(). Otherwise it is very good.
>>>>
>>>> Thanks,
>>>> Vladimir
>>>>
>>>> On 1/23/19 2:13 PM, Igor Ignatyev wrote:
>>>>> Vladimir,
>>>>> I gave it a bit more thoughts, and am inclining to agree that replying on env. variables is indeed fragile. so I've decided to go w/ a new WB method --http://cr.openjdk.java.net/~iignatyev//8216180/webrev.01/index.html
>>>>> (testing is in-progress)
>>>>> Thanks,
>>>>> -- Igor
>>>>>> On Jan 23, 2019, at 10:34 AM, Igor Ignatyev <igor.ignatyev at oracle.com <mailto:igor.ignatyev at oracle.com>> wrote:
>>>>>>
>>>>>>
>>>>>>
>>>>>>> On Jan 23, 2019, at 10:28 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com <mailto:vladimir.kozlov at oracle.com>> wrote:
>>>>>>>
>>>>>>> I assume tests don't use -XX:AOTLibrary= flag but load them from default location in JDK. Right?
>>>>>> that's correct, the runs where the test fails used libraries from the default location.
>>>>>>
>>>>>>> Can we instead skip such tests if any AOT library is loaded? We can check it with PrintAOT or new ouptu or new WB API.
>>>>>>> Relying on env variable is not robust I think.
>>>>>>
>>>>>> these env variables are part of run-test "official" contract, so I believe it's safe to use them. the only problem I see w/ such approach is runs w/ jdk-images which include AOT'ed modules in them, but there are no such images, and w/ current state of AOT, they aren't actually possible however if you have strong objections, I can look into other ways to retrieve this information.
>>>>>>
>>>>>> -- Igor
>>>>>>>
>>>>>>> Thanks,
>>>>>>> Vladimir
>>>>>>>
>>>>>>> On 1/23/19 9:36 AM, Igor Ignatyev wrote:
>>>>>>>> http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>>>>>>>>> 32 lines changed: 32 ins; 0 del; 0 mod;
>>>>>>>> Hi all,
>>>>>>>> could you please review this small patch which exclude TestMulAdd test from execution if java.base is AOT'ed compiled?
>>>>>>>> the test disables some intrinsics, and if it's run w/ AOT'ed java.base there these intrinsics are enabled (which is the most common, if not the only, case) we get crash. the fix introduces new @requires value -- vm.aot.modules which contains comma-separated list of AOT'ed modules and use it to skip this test if java.base is one of them.
>>>>>>>> webrev: http://cr.openjdk.java.net/~iignatyev//8216180/webrev.00/index.html
>>>>>>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8216180
>>>>>>>> testing: compiler/intrinsics/bigInteger tests on linux-x64 w/ JTREG=AOT_MODULES=java.base, TEST_OPTS_AOT_MODULES=java.base and w/o any extra make args
>>>>>>>> Thanks,
>>>>>>>> -- Igor
>>>>>>
> 

From vladimir.kozlov at oracle.com  Thu Jan 24 00:20:04 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 23 Jan 2019 16:20:04 -0800
Subject: [12] RFR(XS) 8217678: [AOT] jck Math/IncrementExact and
 Math/DecrementExact tests fail when test classes are AOTed
In-Reply-To: <0C581623-E5D1-4009-8B1B-E21023DE408A@oracle.com>
References: <0C581623-E5D1-4009-8B1B-E21023DE408A@oracle.com>
Message-ID: <779a9eb8-a5d8-bd52-43e7-fa3382c58faf@oracle.com>

Good.

Please file push request since it has to be pushed into JDK 12:
http://openjdk.java.net/jeps/3#Fix-Request-Process

Thanks,
Vladimir

On 1/23/19 4:15 PM, Igor Veresov wrote:
> When fixing JDK-8196568  I must?ve thought that a folding of an exact math node would produce a deopt. But obviously it doesn?t.
> Webrev: http://cr.openjdk.java.net/~iveresov/8217678/webrev.00/
> JBS: https://bugs.openjdk.java.net/browse/JDK-8217678
> 
> 
> Please review and approve.
> 
> Thanks!
> igor
> 
> 
> 

From igor.veresov at oracle.com  Thu Jan 24 00:33:06 2019
From: igor.veresov at oracle.com (Igor Veresov)
Date: Wed, 23 Jan 2019 16:33:06 -0800
Subject: [12] RFR(XS) 8217678: [AOT] jck Math/IncrementExact and
 Math/DecrementExact tests fail when test classes are AOTed
In-Reply-To: <779a9eb8-a5d8-bd52-43e7-fa3382c58faf@oracle.com>
References: <0C581623-E5D1-4009-8B1B-E21023DE408A@oracle.com>
 <779a9eb8-a5d8-bd52-43e7-fa3382c58faf@oracle.com>
Message-ID: <8BA6416C-9FED-4D57-A757-6D4B681BC986@oracle.com>

Thanks for the review. I?ve added the ?Fix Request? to the JBS issue.

igor


> On Jan 23, 2019, at 4:20 PM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
> 
> Good.
> 
> Please file push request since it has to be pushed into JDK 12:
> http://openjdk.java.net/jeps/3#Fix-Request-Process
> 
> Thanks,
> Vladimir
> 
> On 1/23/19 4:15 PM, Igor Veresov wrote:
>> When fixing JDK-8196568  I must?ve thought that a folding of an exact math node would produce a deopt. But obviously it doesn?t.
>> Webrev: http://cr.openjdk.java.net/~iveresov/8217678/webrev.00/
>> JBS: https://bugs.openjdk.java.net/browse/JDK-8217678
>> Please review and approve.
>> Thanks!
>> igor

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190123/14185aea/attachment.html>

From vladimir.kozlov at oracle.com  Thu Jan 24 00:35:17 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 23 Jan 2019 16:35:17 -0800
Subject: [12] RFR(XS) 8217678: [AOT] jck Math/IncrementExact and
 Math/DecrementExact tests fail when test classes are AOTed
In-Reply-To: <8BA6416C-9FED-4D57-A757-6D4B681BC986@oracle.com>
References: <0C581623-E5D1-4009-8B1B-E21023DE408A@oracle.com>
 <779a9eb8-a5d8-bd52-43e7-fa3382c58faf@oracle.com>
 <8BA6416C-9FED-4D57-A757-6D4B681BC986@oracle.com>
Message-ID: <d0a09408-f012-68c1-5fd9-c2c371b4c4bc@oracle.com>

Approved.

Vladimir

On 1/23/19 4:33 PM, Igor Veresov wrote:
> Thanks for the review. I?ve added the ?Fix Request? to the JBS issue.
> 
> igor
> 
> 
> 
>> On Jan 23, 2019, at 4:20 PM, Vladimir Kozlov <vladimir.kozlov at oracle.com <mailto:vladimir.kozlov at oracle.com>> wrote:
>>
>> Good.
>>
>> Please file push request since it has to be pushed into JDK 12:
>> http://openjdk.java.net/jeps/3#Fix-Request-Process
>>
>> Thanks,
>> Vladimir
>>
>> On 1/23/19 4:15 PM, Igor Veresov wrote:
>>> When fixing JDK-8196568 ?I must?ve thought that a folding of an exact math node would produce a deopt. But obviously 
>>> it doesn?t.
>>> Webrev: http://cr.openjdk.java.net/~iveresov/8217678/webrev.00/
>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8217678
>>> Please review and approve.
>>> Thanks!
>>> igor
> 

From igor.ignatyev at oracle.com  Thu Jan 24 01:08:19 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 23 Jan 2019 17:08:19 -0800
Subject: RFR(T) [12] : 8167276 :
 jvmci/compilerToVM/MaterializeVirtualObjectTest.java fails with
 -XX:-EliminateAllocations
Message-ID: <C53D4E04-9222-40A5-9799-908402B604F1@oracle.com>

http://cr.openjdk.java.net/~iignatyev//8167276/webrev.02/index.html
> 8 lines changed: 5 ins; 0 del; 3 mod; 

Hi all,

could you please review this tiny patch which excludes MaterializeVirtualObjectTest test from runs w/ disabled EliminateAllocations?

webrev: http://cr.openjdk.java.net/~iignatyev//8167276/webrev.02/index.html
JBS: https://bugs.openjdk.java.net/browse/JDK-8167276
testing: the test w/ -XX:-EliminateAllocations, XX:+EliminateAllocations and w/o any extra flags 

Thanks,
-- Igor

From igor.ignatyev at oracle.com  Thu Jan 24 01:10:36 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 23 Jan 2019 17:10:36 -0800
Subject: RFR(T)[12]: 8150757 : [TESTBUG] compiler/ciReplay/TestVM.sh and
 compiler/ciReplay/TestVM_no_comp_level.sh fail when no compilations are
 happening
Message-ID: <CD92F250-8D72-411E-A60A-8DACD4CDF84B@oracle.com>

http://cr.openjdk.java.net/~iignatyev//8150757/webrev.00/index.html
> 7 lines changed: 5 ins; 0 del; 2 mod;

Hi all,

could you please review this tiny fix for compiler/ciReplay/ tests? these tests try to crash JVM by running '-Xcomp -XX:CICrashAt=1 -version', but if they are run w/ AOT'ed java.base, there is nothing else to compile in '-version', so JVM doesn't crash and the tests fail. the fix replaces usage of -version w/ a class w/ empty main method, so crashes will happen w/ or w/o AOT'ed java.base.

webrev: http://cr.openjdk.java.net/~iignatyev//8150757/webrev.00/index.html
JBS: https://bugs.openjdk.java.net/browse/JDK-8150757
testing: compiler/ciReplay/ tests w/ and w/o AOT'ed java.base

Thanks,
-- Igor

From felix.yang at huawei.com  Thu Jan 24 01:22:47 2019
From: felix.yang at huawei.com (Yangfei (Felix))
Date: Thu, 24 Jan 2019 01:22:47 +0000
Subject: [RFR] 8217359: C2 compiler triggers SIGSEGV after tranformation
 in ConvI2LNode::Ideal
In-Reply-To: <35e45132-2187-16c8-22fb-17e61a117941@oracle.com>
References: <DA41BE1DDCA941489001C7FBD7A8820ED5F3F998@dggeml527-mbx.china.huawei.com>
 <32982b31-3a91-58fb-a6b8-b1cd9f7cdb41@oracle.com>
 <DA41BE1DDCA941489001C7FBD7A8820ED5F495BF@dggeml527-mbx.china.huawei.com>
 <d5a75c56-b8c5-e215-1485-4274c95236b6@oracle.com>
 <DA41BE1DDCA941489001C7FBD7A8820ED5F49892@dggeml527-mbx.china.huawei.com>
 <35e45132-2187-16c8-22fb-17e61a117941@oracle.com>
Message-ID: <DA41BE1DDCA941489001C7FBD7A8820ED5F49E80@dggeml527-mbx.china.huawei.com>

Thanks Tobias and Vladimir.

This is pushed as : 
http://hg.openjdk.java.net/jdk/jdk/rev/44f41693631f
http://hg.openjdk.java.net/jdk/jdk12/rev/44f41693631f

Felix


> 
> Changes are good.
> 
> I approved the fix for jdk12 as HotSpot group lead.
> 
> Thanks,
> Vladimir
> 
> 
> On 1/22/19 4:03 AM, Yangfei (Felix) wrote:
> > Hi,
> >
> >      I have updated the JBS accordingly, requesting approval for integration
> into JDK 12.
> >      May I have another reviewer please?
> >
> > Thanks for your help,
> > Felix
> >
> >
> >> Hi Felix,
> >>
> >> this looks good to me, thanks for adding the test!
> >>
> >> A second review would be good. In the meantime, please request approval
> for
> >> integration into JDK 12
> >> according to:
> >> http://openjdk.java.net/jeps/3#Fix-Request-Process
> >>
> >> Thanks,
> >> Tobias
> >>
> >> On 22.01.19 02:17, Yangfei (Felix) wrote:
> >>> Hi,
> >>>
> >>>      Thanks for reviewing.  The regression test is added.
> >>>      New webrev: http://cr.openjdk.java.net/~fyang/8217359/webrev.01/
> >>>      This is committed to the submit repo:
> >> http://hg.openjdk.java.net/jdk/submit/rev/7345adfbc913
> >>>
> >>>      The email I got shows that it passed the Oralce internal tests:
> >>>      =================================================
> >>>      Build Details: 2019-01-21-1210078.felix.yang.source
> >>>      0 Failed Tests
> >>>      Mach5 Tasks Results Summary
> >>>      ?	EXECUTED_WITH_FAILURE: 0
> >>>      ?	NA: 0
> >>>      ?	KILLED: 0
> >>>      ?	UNABLE_TO_RUN: 0
> >>>      ?	PASSED: 76
> >>>      ?	FAILED: 0
> >>>      =================================================
> >>>
> >>>      OK to push?
> >>>
> >>> Thanks for your help,
> >>> Felix
> >>>
> >>>>
> >>>> Hi Felix,
> >>>>
> >>>> Could you please add the regression test as jtreg test?
> >>>>
> >>>> Otherwise, the fix looks reasonable to me. Nice analysis!
> >>>>
> >>>> Thanks,
> >>>> Tobias
> >>>

From vladimir.kozlov at oracle.com  Thu Jan 24 01:39:13 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 23 Jan 2019 17:39:13 -0800
Subject: RFR(T) [12] : 8167276 :
 jvmci/compilerToVM/MaterializeVirtualObjectTest.java fails with
 -XX:-EliminateAllocations
In-Reply-To: <C53D4E04-9222-40A5-9799-908402B604F1@oracle.com>
References: <C53D4E04-9222-40A5-9799-908402B604F1@oracle.com>
Message-ID: <305920e9-f487-fe22-4955-2142cc6c5430@oracle.com>

Good.

Thanks,
Vladimir

On 1/23/19 5:08 PM, Igor Ignatyev wrote:
> http://cr.openjdk.java.net/~iignatyev//8167276/webrev.02/index.html
>> 8 lines changed: 5 ins; 0 del; 3 mod;
> 
> Hi all,
> 
> could you please review this tiny patch which excludes MaterializeVirtualObjectTest test from runs w/ disabled EliminateAllocations?
> 
> webrev: http://cr.openjdk.java.net/~iignatyev//8167276/webrev.02/index.html
> JBS: https://bugs.openjdk.java.net/browse/JDK-8167276
> testing: the test w/ -XX:-EliminateAllocations, XX:+EliminateAllocations and w/o any extra flags
> 
> Thanks,
> -- Igor
> 

From vladimir.kozlov at oracle.com  Thu Jan 24 01:41:51 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 23 Jan 2019 17:41:51 -0800
Subject: RFR(T)[12]: 8150757 : [TESTBUG] compiler/ciReplay/TestVM.sh and
 compiler/ciReplay/TestVM_no_comp_level.sh fail when no compilations are
 happening
In-Reply-To: <CD92F250-8D72-411E-A60A-8DACD4CDF84B@oracle.com>
References: <CD92F250-8D72-411E-A60A-8DACD4CDF84B@oracle.com>
Message-ID: <96ea1b63-0bca-3fef-3cd6-89a749dddbbd@oracle.com>

Looks good.

Thanks,
Vladimir

On 1/23/19 5:10 PM, Igor Ignatyev wrote:
> http://cr.openjdk.java.net/~iignatyev//8150757/webrev.00/index.html
>> 7 lines changed: 5 ins; 0 del; 2 mod;
> 
> Hi all,
> 
> could you please review this tiny fix for compiler/ciReplay/ tests? these tests try to crash JVM by running '-Xcomp -XX:CICrashAt=1 -version', but if they are run w/ AOT'ed java.base, there is nothing else to compile in '-version', so JVM doesn't crash and the tests fail. the fix replaces usage of -version w/ a class w/ empty main method, so crashes will happen w/ or w/o AOT'ed java.base.
> 
> webrev: http://cr.openjdk.java.net/~iignatyev//8150757/webrev.00/index.html
> JBS: https://bugs.openjdk.java.net/browse/JDK-8150757
> testing: compiler/ciReplay/ tests w/ and w/o AOT'ed java.base
> 
> Thanks,
> -- Igor
> 

From felix.yang at huawei.com  Thu Jan 24 01:57:16 2019
From: felix.yang at huawei.com (Yangfei (Felix))
Date: Thu, 24 Jan 2019 01:57:16 +0000
Subject: [13] RFR (XS): 8202952: C2: Unexpected dead nodes after matching
In-Reply-To: <a5cd9102-ee57-26db-0ef4-8331064bd935@oracle.com>
References: <e6fc835a-904e-ab45-28d6-5b5ae82c53de@oracle.com>
 <9bf3ac2c-5881-576c-cc64-917cec246f8f@oracle.com>
 <242b6a41-3db6-2911-1045-1e5eb63ba862@oracle.com>
 <ab8be7ca-a3d0-69dc-ca10-ded169da7d60@oracle.com>
 <a5cd9102-ee57-26db-0ef4-8331064bd935@oracle.com>
Message-ID: <DA41BE1DDCA941489001C7FBD7A8820ED5F4AF69@dggeml527-mbx.china.huawei.com>

Hi,

    Since JDK 12 has the same issue, will this fix be integrated into this repo? 
    BTW: I have another simple test case that also triggers the bug.  I have put the test on the JBS. 

Thanks,
Felix


> 
> Thanks, Vladimir.
> 
> Best regards,
> Vladimir Ivanov
> 
> On 22/01/2019 12:42, Vladimir Kozlov wrote:
> > Got it. Good.
> >
> > thanks,
> > Vladimir
> >
> > On 1/22/19 12:08 PM, Vladimir Ivanov wrote:
> >>
> >> On 22/01/2019 11:54, Vladimir Kozlov wrote:
> >>> The fix is different from what we discussed.
> >>> Can you explain how it helps?
> >>
> >> We discussed adding AddP case to _shared_nodes.
> >>
> >> Proposed fix achieves similar result with a different approach:
> >>
> >> ?? * Matcher::clone_address_expressions() marks problematic AddP as
> >> shared (based on constant value);
> >>
> >> ?? * DFA() doesn't construct duplicated State for inner AddP (since
> >> it's marked as shared);
> >>
> >> ?? * Matcher doesn't need to materialize duplicated mach nodes, since
> >> it matches inner AddP separately;
> >>
> >> Best regards,
> >> Vladimir Ivanov
> >>
> >>> On 1/22/19 11:05 AM, Vladimir Ivanov wrote:
> >>>> http://cr.openjdk.java.net/~vlivanov/8202952/webrev.00/
> >>>> https://bugs.openjdk.java.net/browse/JDK-8202952
> >>>>
> >>>> The crash happens when PhaseCFG encounters a dead MachNode in the
> >>>> graph.
> >>>> The problematic node is a leftover from matching of an instruction
> >>>> with a duplicated memory operand (sarI_mem_CL [1] in that particular
> >>>> case).
> >>>>
> >>>> Address has the following shape [2]:
> >>>> ?? AddP (AddP DecodeN (LShiftL ConvI2L ConI)) ConL
> >>>>
> >>>> It could be subsumed into complex addressing expression, but the
> >>>> constant is too large (doesn't fit into immL32). So, matcher has to
> >>>> compute inner address expression separately and put it into a register.
> >>>>
> >>>> Since memory operand is duplicated, 2 copies are materialized during
> >>>> matching, but as part of ::Expand() one of the copies is eliminated,
> >>>> thus leaving a dead mach node in the IR (for the address expression).
> >>>>
> >>>> The fix is to adjust Matcher::clone_address_expressions() to avoid
> >>>> cloning inner AddP when constant offset is too large.
> >>>>
> >>>> Testing: hs-precheckin-comp, hs-tier1, hs-tier2
> >>>>
> >>>> Best regards,
> >>>> Vladimir Ivanov
> >>>>
> >>>> [1] instruct sarI_mem_CL(memory dst, rcx_RegI shift, rFlagsReg cr)
> >>>> %{
> >>>> ?? match(Set dst (StoreI dst (RShiftI (LoadI dst) shift)));
> >>>>
> >>>>
> >>>> [2]
> >>>> ??o347 AddP? === _ o2181 o1768 o1769? [[o349 o371 ]]
> >>>> ???? o1768 AddP? === _ o2181 o2181 o1765? [[o347 ]]
> >>>> ???????? o2181 DecodeN === _ o287? [[o1768 o1768 o327 o347 o327 ]]
> >>>> #int[int:>=0]:NotNull:exact *
> >>>> ???????? o1765 LShiftL === _ o1761 o60? [[o1768 ]]
> >>>> ???????????? o1761 ConvI2L === _ o1741? [[o1765 ]]
> >>>> #long:maxint-51..maxint-48
> >>>> ???????????? o60?? ConI? === o0? [[o61 o1765 o1434 o2013 o1631
> o2017
> >>>> o1808? 60 ]]? #int:2
> >>>> ???? o1769 ConL? === o0? [[o347 ]]? #long:-8589932784

From vladimir.x.ivanov at oracle.com  Thu Jan 24 02:58:50 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Wed, 23 Jan 2019 18:58:50 -0800
Subject: [13] RFR (XS): 8202952: C2: Unexpected dead nodes after matching
In-Reply-To: <DA41BE1DDCA941489001C7FBD7A8820ED5F4AF69@dggeml527-mbx.china.huawei.com>
References: <e6fc835a-904e-ab45-28d6-5b5ae82c53de@oracle.com>
 <9bf3ac2c-5881-576c-cc64-917cec246f8f@oracle.com>
 <242b6a41-3db6-2911-1045-1e5eb63ba862@oracle.com>
 <ab8be7ca-a3d0-69dc-ca10-ded169da7d60@oracle.com>
 <a5cd9102-ee57-26db-0ef4-8331064bd935@oracle.com>
 <DA41BE1DDCA941489001C7FBD7A8820ED5F4AF69@dggeml527-mbx.china.huawei.com>
Message-ID: <85ed4b30-c83f-53b4-3f9d-59f53f0d71e2@oracle.com>


>      Since JDK 12 has the same issue, will this fix be integrated into this repo?

I don't plan to integrate it into jdk12.

My reasoning is:

   (1) it's a long-standing bug (from day 1 on x64?) with very low 
likelihood of exposure
         * was found only recently using fuzzers
         * no similar crashes reported before

   (2) JDK 12 is in RDP2 phase and is open only for P1?P2 bug fixes

Though the bug technically meets RDP2 criteria, I don't see it as a 
critical issue for the release in a late development phase.

Best regards,
Vladimir Ivanov

>> On 22/01/2019 12:42, Vladimir Kozlov wrote:
>>> Got it. Good.
>>>
>>> thanks,
>>> Vladimir
>>>
>>> On 1/22/19 12:08 PM, Vladimir Ivanov wrote:
>>>>
>>>> On 22/01/2019 11:54, Vladimir Kozlov wrote:
>>>>> The fix is different from what we discussed.
>>>>> Can you explain how it helps?
>>>>
>>>> We discussed adding AddP case to _shared_nodes.
>>>>
>>>> Proposed fix achieves similar result with a different approach:
>>>>
>>>>  ?? * Matcher::clone_address_expressions() marks problematic AddP as
>>>> shared (based on constant value);
>>>>
>>>>  ?? * DFA() doesn't construct duplicated State for inner AddP (since
>>>> it's marked as shared);
>>>>
>>>>  ?? * Matcher doesn't need to materialize duplicated mach nodes, since
>>>> it matches inner AddP separately;
>>>>
>>>> Best regards,
>>>> Vladimir Ivanov
>>>>
>>>>> On 1/22/19 11:05 AM, Vladimir Ivanov wrote:
>>>>>> http://cr.openjdk.java.net/~vlivanov/8202952/webrev.00/
>>>>>> https://bugs.openjdk.java.net/browse/JDK-8202952
>>>>>>
>>>>>> The crash happens when PhaseCFG encounters a dead MachNode in the
>>>>>> graph.
>>>>>> The problematic node is a leftover from matching of an instruction
>>>>>> with a duplicated memory operand (sarI_mem_CL [1] in that particular
>>>>>> case).
>>>>>>
>>>>>> Address has the following shape [2]:
>>>>>>  ?? AddP (AddP DecodeN (LShiftL ConvI2L ConI)) ConL
>>>>>>
>>>>>> It could be subsumed into complex addressing expression, but the
>>>>>> constant is too large (doesn't fit into immL32). So, matcher has to
>>>>>> compute inner address expression separately and put it into a register.
>>>>>>
>>>>>> Since memory operand is duplicated, 2 copies are materialized during
>>>>>> matching, but as part of ::Expand() one of the copies is eliminated,
>>>>>> thus leaving a dead mach node in the IR (for the address expression).
>>>>>>
>>>>>> The fix is to adjust Matcher::clone_address_expressions() to avoid
>>>>>> cloning inner AddP when constant offset is too large.
>>>>>>
>>>>>> Testing: hs-precheckin-comp, hs-tier1, hs-tier2
>>>>>>
>>>>>> Best regards,
>>>>>> Vladimir Ivanov
>>>>>>
>>>>>> [1] instruct sarI_mem_CL(memory dst, rcx_RegI shift, rFlagsReg cr)
>>>>>> %{
>>>>>>  ?? match(Set dst (StoreI dst (RShiftI (LoadI dst) shift)));
>>>>>>
>>>>>>
>>>>>> [2]
>>>>>>  ??o347 AddP? === _ o2181 o1768 o1769? [[o349 o371 ]]
>>>>>>  ???? o1768 AddP? === _ o2181 o2181 o1765? [[o347 ]]
>>>>>>  ???????? o2181 DecodeN === _ o287? [[o1768 o1768 o327 o347 o327 ]]
>>>>>> #int[int:>=0]:NotNull:exact *
>>>>>>  ???????? o1765 LShiftL === _ o1761 o60? [[o1768 ]]
>>>>>>  ???????????? o1761 ConvI2L === _ o1741? [[o1765 ]]
>>>>>> #long:maxint-51..maxint-48
>>>>>>  ???????????? o60?? ConI? === o0? [[o61 o1765 o1434 o2013 o1631
>> o2017
>>>>>> o1808? 60 ]]? #int:2
>>>>>>  ???? o1769 ConL? === o0? [[o347 ]]? #long:-8589932784

From felix.yang at huawei.com  Thu Jan 24 03:21:07 2019
From: felix.yang at huawei.com (Yangfei (Felix))
Date: Thu, 24 Jan 2019 03:21:07 +0000
Subject: [13] RFR (XS): 8202952: C2: Unexpected dead nodes after matching
In-Reply-To: <85ed4b30-c83f-53b4-3f9d-59f53f0d71e2@oracle.com>
References: <e6fc835a-904e-ab45-28d6-5b5ae82c53de@oracle.com>
 <9bf3ac2c-5881-576c-cc64-917cec246f8f@oracle.com>
 <242b6a41-3db6-2911-1045-1e5eb63ba862@oracle.com>
 <ab8be7ca-a3d0-69dc-ca10-ded169da7d60@oracle.com>
 <a5cd9102-ee57-26db-0ef4-8331064bd935@oracle.com>
 <DA41BE1DDCA941489001C7FBD7A8820ED5F4AF69@dggeml527-mbx.china.huawei.com>
 <85ed4b30-c83f-53b4-3f9d-59f53f0d71e2@oracle.com>
Message-ID: <DA41BE1DDCA941489001C7FBD7A8820ED5F4AFF2@dggeml527-mbx.china.huawei.com>

That sounds reasonable to me.  

BTW: The test I updated on the JBS is also reduced from a fuzzer test.  

Thanks,
Felix

> 
> 
> >      Since JDK 12 has the same issue, will this fix be integrated into this repo?
> 
> I don't plan to integrate it into jdk12.
> 
> My reasoning is:
> 
>    (1) it's a long-standing bug (from day 1 on x64?) with very low
> likelihood of exposure
>          * was found only recently using fuzzers
>          * no similar crashes reported before
> 
>    (2) JDK 12 is in RDP2 phase and is open only for P1?P2 bug fixes
> 
> Though the bug technically meets RDP2 criteria, I don't see it as a
> critical issue for the release in a late development phase.
> 
> Best regards,
> Vladimir Ivanov


From igor.ignatyev at oracle.com  Thu Jan 24 04:03:59 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 23 Jan 2019 20:03:59 -0800
Subject: RFR(T) [12] : 8217699 : add
 java/util/concurrent/CountDownLatch/Basic.java to ProblemList-Xcomp
Message-ID: <8A057C09-9FDB-49E8-A319-DD29FE98174E@oracle.com>

http://cr.openjdk.java.net/~iignatyev//8217699/webrev.00/index.html
> 2 lines changed: 1 ins; 0 del; 1 mod;

Hi all,
could you please review this trivial fix which puts java/util/concurrent/CountDownLatch/Basic.java tests into ProblemList-Xcomp?

webrev: http://cr.openjdk.java.net/~iignatyev//8217699/webrev.00/index.html
JBS: https://bugs.openjdk.java.net/browse/JDK-8217699

Thanks,
-- Igor

From vladimir.kozlov at oracle.com  Thu Jan 24 04:23:18 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 23 Jan 2019 20:23:18 -0800
Subject: RFR(T) [12] : 8217699 : add
 java/util/concurrent/CountDownLatch/Basic.java to ProblemList-Xcomp
In-Reply-To: <8A057C09-9FDB-49E8-A319-DD29FE98174E@oracle.com>
References: <8A057C09-9FDB-49E8-A319-DD29FE98174E@oracle.com>
Message-ID: <54EBFBCF-1CD1-4921-A314-E59C9C7FC009@oracle.com>

Good.

Thanks 
Vladimir 

> On Jan 23, 2019, at 8:03 PM, Igor Ignatyev <igor.ignatyev at oracle.com> wrote:
> 
> http://cr.openjdk.java.net/~iignatyev//8217699/webrev.00/index.html
>> 2 lines changed: 1 ins; 0 del; 1 mod;
> 
> Hi all,
> could you please review this trivial fix which puts java/util/concurrent/CountDownLatch/Basic.java tests into ProblemList-Xcomp?
> 
> webrev: http://cr.openjdk.java.net/~iignatyev//8217699/webrev.00/index.html
> JBS: https://bugs.openjdk.java.net/browse/JDK-8217699
> 
> Thanks,
> -- Igor


From dean.long at oracle.com  Thu Jan 24 05:13:07 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Wed, 23 Jan 2019 21:13:07 -0800
Subject: RFR(T) [12] : 8167276 :
 jvmci/compilerToVM/MaterializeVirtualObjectTest.java fails with
 -XX:-EliminateAllocations
In-Reply-To: <C53D4E04-9222-40A5-9799-908402B604F1@oracle.com>
References: <C53D4E04-9222-40A5-9799-908402B604F1@oracle.com>
Message-ID: <d3bb4a8e-0292-45ca-bcbd-e8a6786075b0@oracle.com>

Looks good.

dl

On 1/23/19 5:08 PM, Igor Ignatyev wrote:
> http://cr.openjdk.java.net/~iignatyev//8167276/webrev.02/index.html
>> 8 lines changed: 5 ins; 0 del; 3 mod;
> Hi all,
>
> could you please review this tiny patch which excludes MaterializeVirtualObjectTest test from runs w/ disabled EliminateAllocations?
>
> webrev: http://cr.openjdk.java.net/~iignatyev//8167276/webrev.02/index.html
> JBS: https://bugs.openjdk.java.net/browse/JDK-8167276
> testing: the test w/ -XX:-EliminateAllocations, XX:+EliminateAllocations and w/o any extra flags
>
> Thanks,
> -- Igor


From dean.long at oracle.com  Thu Jan 24 05:14:32 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Wed, 23 Jan 2019 21:14:32 -0800
Subject: RFR(T)[12]: 8150757 : [TESTBUG] compiler/ciReplay/TestVM.sh and
 compiler/ciReplay/TestVM_no_comp_level.sh fail when no compilations are
 happening
In-Reply-To: <96ea1b63-0bca-3fef-3cd6-89a749dddbbd@oracle.com>
References: <CD92F250-8D72-411E-A60A-8DACD4CDF84B@oracle.com>
 <96ea1b63-0bca-3fef-3cd6-89a749dddbbd@oracle.com>
Message-ID: <d41eb308-ff2d-c841-7e57-e033fd78c3a2@oracle.com>

+1

dl

On 1/23/19 5:41 PM, Vladimir Kozlov wrote:
> Looks good.
>
> Thanks,
> Vladimir
>
> On 1/23/19 5:10 PM, Igor Ignatyev wrote:
>> http://cr.openjdk.java.net/~iignatyev//8150757/webrev.00/index.html
>>> 7 lines changed: 5 ins; 0 del; 2 mod;
>>
>> Hi all,
>>
>> could you please review this tiny fix for compiler/ciReplay/ tests? 
>> these tests try to crash JVM by running '-Xcomp -XX:CICrashAt=1 
>> -version', but if they are run w/ AOT'ed java.base, there is nothing 
>> else to compile in '-version', so JVM doesn't crash and the tests 
>> fail. the fix replaces usage of -version w/ a class w/ empty main 
>> method, so crashes will happen w/ or w/o AOT'ed java.base.
>>
>> webrev: 
>> http://cr.openjdk.java.net/~iignatyev//8150757/webrev.00/index.html
>> JBS: https://bugs.openjdk.java.net/browse/JDK-8150757
>> testing: compiler/ciReplay/ tests w/ and w/o AOT'ed java.base
>>
>> Thanks,
>> -- Igor
>>


From jamsheed.c.m at oracle.com  Thu Jan 24 06:09:25 2019
From: jamsheed.c.m at oracle.com (Jamsheed)
Date: Thu, 24 Jan 2019 11:39:25 +0530
Subject: [12] RFR: 8213825: assert(false) failed: Non-balanced monitor
 enter/exit! Likely JNI locking
In-Reply-To: <a2f5c24e-4a09-1122-2cb8-d26aebc42a1e@oracle.com>
References: <b3648bb2-0907-453c-c9c8-3e79da38dc00@oracle.com>
 <a2f5c24e-4a09-1122-2cb8-d26aebc42a1e@oracle.com>
Message-ID: <d76258d3-a530-bb79-b4bc-cefb9cce756e@oracle.com>

Hi Vladimir,

Thanks a lot for the review and approval to push in 12.

Best regards,

Jamsheed

On 1/23/19 10:41 PM, Vladimir Kozlov wrote:
> Hi Jamsheed,
>
> Fix is good. I approved it for JDK 12 push.
>
> Thanks,
> Vladimir
>
> On 1/23/19 6:08 AM, Jamsheed wrote:
>> Hi,
>>
>> Request for review
>>
>> bug: https://bugs.openjdk.java.net/browse/JDK-8213825
>>
>> webrev: http://cr.openjdk.java.net/~jcm/8213825/webrev.00/index.html
>>
>> Bug & Fix Desc:
>>
>> if markword load has sfpt as control i/p(i.e synchronizations near a 
>> safepoint), it skips sfpt assuming sfptOp wouldn't write to markword 
>> memory
>> fix: not to skip sfpt for markword loads.
>>
>> tests: hs-tier1-5,? hs-precheckin-comp
>>
>> Best regards,
>>
>> Jamsheed
>>

From jamsheed.c.m at oracle.com  Thu Jan 24 06:15:26 2019
From: jamsheed.c.m at oracle.com (Jamsheed)
Date: Thu, 24 Jan 2019 11:45:26 +0530
Subject: [12] RFR: 8213825: assert(false) failed: Non-balanced monitor
 enter/exit! Likely JNI locking
In-Reply-To: <257d62c0-c0f9-394e-1cbb-0f33b3a1d365@oracle.com>
References: <b3648bb2-0907-453c-c9c8-3e79da38dc00@oracle.com>
 <a2f5c24e-4a09-1122-2cb8-d26aebc42a1e@oracle.com>
 <257d62c0-c0f9-394e-1cbb-0f33b3a1d365@oracle.com>
Message-ID: <c6472aa3-ac2d-ae0d-9d5b-02bb42f17f09@oracle.com>

Thanks a lot for the review, Dean.

Best regards,

Jamsheed

On 1/24/19 1:54 AM, dean.long at oracle.com wrote:
> Looks good to me too.? Nice job tracking this down, Jamsheed!
>
> dl
>
> On 1/23/19 9:11 AM, Vladimir Kozlov wrote:
>> Hi Jamsheed,
>>
>> Fix is good. I approved it for JDK 12 push.
>>
>> Thanks,
>> Vladimir
>>
>> On 1/23/19 6:08 AM, Jamsheed wrote:
>>> Hi,
>>>
>>> Request for review
>>>
>>> bug: https://bugs.openjdk.java.net/browse/JDK-8213825
>>>
>>> webrev: http://cr.openjdk.java.net/~jcm/8213825/webrev.00/index.html
>>>
>>> Bug & Fix Desc:
>>>
>>> if markword load has sfpt as control i/p(i.e synchronizations near a 
>>> safepoint), it skips sfpt assuming sfptOp wouldn't write to markword 
>>> memory
>>> fix: not to skip sfpt for markword loads.
>>>
>>> tests: hs-tier1-5,? hs-precheckin-comp
>>>
>>> Best regards,
>>>
>>> Jamsheed
>>>
>

From fairoz.matte at oracle.com  Thu Jan 24 07:14:03 2019
From: fairoz.matte at oracle.com (Fairoz Matte)
Date: Wed, 23 Jan 2019 23:14:03 -0800 (PST)
Subject: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
 com.sun.crypto.provider.CipherBlockChaining
In-Reply-To: <323b7338-d507-4850-ab53-4a5295d7b62f@default>
References: <be6b6aaa-d3e9-4ee4-a139-93d036c47ae8@default>
 <9d49a648-50e6-30cd-5705-2997f658d8e8@oracle.com>
 <794dcbd2-60e0-e9f4-f9b5-f789e45d373a@oracle.com>
 <323b7338-d507-4850-ab53-4a5295d7b62f@default>
Message-ID: <d570143b-30cf-4d6d-92b6-fb6b47056fdb@default>

Hi,

This crash is very random and to exercise AES stability adding a unit testcase.
Thanks Sean Coffey for bringing this into my notice.

I have updated webrev and kindly review
http://cr.openjdk.java.net/~fmatte/8209951/webrev.01/

Note: Crash is only observed on JDK 8 with Sparc Solaris 10 machine after 3_000+ iterations.
In the test case there is loop for 5_000 iterations and running in -Xbatch making it more
predictable.

Thanks,
Fairoz

> -----Original Message-----
> From: Fairoz Matte
> Sent: Wednesday, January 23, 2019 8:50 AM
> To: Vladimir Kozlov <vladimir.kozlov at oracle.com>; hotspot-compiler-
> dev at openjdk.java.net
> Subject: RE: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
> com.sun.crypto.provider.CipherBlockChaining
> 
> Thanks Tobias and Vladimir for review.
> 
> Thanks,
> Fairoz
> 
> > -----Original Message-----
> > From: Vladimir Kozlov
> > Sent: Tuesday, January 22, 2019 10:27 PM
> > To: Fairoz Matte <fairoz.matte at oracle.com>; hotspot-compiler-
> > dev at openjdk.java.net
> > Subject: Re: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
> > com.sun.crypto.provider.CipherBlockChaining
> >
> > Yes, it is good.
> >
> > Thanks,
> > Vladimir
> >
> > On 1/22/19 12:22 AM, Tobias Hartmann wrote:
> > > Hi Fairoz,
> > >
> > > this looks good to me.
> > >
> > > Thanks,
> > > Tobias
> > >
> > > On 22.01.19 04:35, Fairoz Matte wrote:
> > >> Hi,
> > >>
> > >> Please review the following patch,
> > >> JBS bug - https://bugs.openjdk.java.net/browse/JDK-8209951
> > >> Webrev - http://cr.openjdk.java.net/~fmatte/8209951/webrev.00/
> > >>
> > >> During the call to assembled stub code
> > >> generate_cipherBlockChaining_decryptAESCrypt_Parallel()
> > >> there was reference to G6 register used for temporary storage of
> > >> F50, as G6 is not saved on stack it was resulting in garbage during
> retrieval.
> > >>
> > >> Solution is to use unused local register (L6) for temporary storage
> > >> and
> > retrieval of F50.
> > >>
> > >> Thanks,
> > >> Fairoz
> > >>

From tobias.hartmann at oracle.com  Thu Jan 24 08:15:07 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Thu, 24 Jan 2019 09:15:07 +0100
Subject: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
 com.sun.crypto.provider.CipherBlockChaining
In-Reply-To: <d570143b-30cf-4d6d-92b6-fb6b47056fdb@default>
References: <be6b6aaa-d3e9-4ee4-a139-93d036c47ae8@default>
 <9d49a648-50e6-30cd-5705-2997f658d8e8@oracle.com>
 <794dcbd2-60e0-e9f4-f9b5-f789e45d373a@oracle.com>
 <323b7338-d507-4850-ab53-4a5295d7b62f@default>
 <d570143b-30cf-4d6d-92b6-fb6b47056fdb@default>
Message-ID: <a13b45c0-e28b-6009-dc6c-d49a039c251d@oracle.com>

Hi Fairoz,

still looks good to me but please fix the indentation in the test (lines 56-60, 122).
No new webrev required.

Thanks,
Tobias

On 24.01.19 08:14, Fairoz Matte wrote:
> Hi,
> 
> This crash is very random and to exercise AES stability adding a unit testcase.
> Thanks Sean Coffey for bringing this into my notice.
> 
> I have updated webrev and kindly review
> http://cr.openjdk.java.net/~fmatte/8209951/webrev.01/
> 
> Note: Crash is only observed on JDK 8 with Sparc Solaris 10 machine after 3_000+ iterations.
> In the test case there is loop for 5_000 iterations and running in -Xbatch making it more
> predictable.
> 
> Thanks,
> Fairoz
> 
>> -----Original Message-----
>> From: Fairoz Matte
>> Sent: Wednesday, January 23, 2019 8:50 AM
>> To: Vladimir Kozlov <vladimir.kozlov at oracle.com>; hotspot-compiler-
>> dev at openjdk.java.net
>> Subject: RE: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
>> com.sun.crypto.provider.CipherBlockChaining
>>
>> Thanks Tobias and Vladimir for review.
>>
>> Thanks,
>> Fairoz
>>
>>> -----Original Message-----
>>> From: Vladimir Kozlov
>>> Sent: Tuesday, January 22, 2019 10:27 PM
>>> To: Fairoz Matte <fairoz.matte at oracle.com>; hotspot-compiler-
>>> dev at openjdk.java.net
>>> Subject: Re: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
>>> com.sun.crypto.provider.CipherBlockChaining
>>>
>>> Yes, it is good.
>>>
>>> Thanks,
>>> Vladimir
>>>
>>> On 1/22/19 12:22 AM, Tobias Hartmann wrote:
>>>> Hi Fairoz,
>>>>
>>>> this looks good to me.
>>>>
>>>> Thanks,
>>>> Tobias
>>>>
>>>> On 22.01.19 04:35, Fairoz Matte wrote:
>>>>> Hi,
>>>>>
>>>>> Please review the following patch,
>>>>> JBS bug - https://bugs.openjdk.java.net/browse/JDK-8209951
>>>>> Webrev - http://cr.openjdk.java.net/~fmatte/8209951/webrev.00/
>>>>>
>>>>> During the call to assembled stub code
>>>>> generate_cipherBlockChaining_decryptAESCrypt_Parallel()
>>>>> there was reference to G6 register used for temporary storage of
>>>>> F50, as G6 is not saved on stack it was resulting in garbage during
>> retrieval.
>>>>>
>>>>> Solution is to use unused local register (L6) for temporary storage
>>>>> and
>>> retrieval of F50.
>>>>>
>>>>> Thanks,
>>>>> Fairoz
>>>>>

From fairoz.matte at oracle.com  Thu Jan 24 08:26:32 2019
From: fairoz.matte at oracle.com (Fairoz Matte)
Date: Thu, 24 Jan 2019 00:26:32 -0800 (PST)
Subject: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
 com.sun.crypto.provider.CipherBlockChaining
In-Reply-To: <a13b45c0-e28b-6009-dc6c-d49a039c251d@oracle.com>
References: <be6b6aaa-d3e9-4ee4-a139-93d036c47ae8@default>
 <9d49a648-50e6-30cd-5705-2997f658d8e8@oracle.com>
 <794dcbd2-60e0-e9f4-f9b5-f789e45d373a@oracle.com>
 <323b7338-d507-4850-ab53-4a5295d7b62f@default>
 <d570143b-30cf-4d6d-92b6-fb6b47056fdb@default>
 <a13b45c0-e28b-6009-dc6c-d49a039c251d@oracle.com>
Message-ID: <5c604932-29e1-4d34-bf34-1dae31a6c6c4@default>

Thanks Tobias,

I have adjusted indentation.

Thanks,
Fairoz

> -----Original Message-----
> From: Tobias Hartmann
> Sent: Thursday, January 24, 2019 1:45 PM
> To: Fairoz Matte <fairoz.matte at oracle.com>; Vladimir Kozlov
> <vladimir.kozlov at oracle.com>; hotspot-compiler-dev at openjdk.java.net
> Subject: Re: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
> com.sun.crypto.provider.CipherBlockChaining
> 
> Hi Fairoz,
> 
> still looks good to me but please fix the indentation in the test (lines 56-60,
> 122).
> No new webrev required.
> 
> Thanks,
> Tobias
> 
> On 24.01.19 08:14, Fairoz Matte wrote:
> > Hi,
> >
> > This crash is very random and to exercise AES stability adding a unit
> testcase.
> > Thanks Sean Coffey for bringing this into my notice.
> >
> > I have updated webrev and kindly review
> > http://cr.openjdk.java.net/~fmatte/8209951/webrev.01/
> >
> > Note: Crash is only observed on JDK 8 with Sparc Solaris 10 machine after
> 3_000+ iterations.
> > In the test case there is loop for 5_000 iterations and running in
> > -Xbatch making it more predictable.
> >
> > Thanks,
> > Fairoz
> >
> >> -----Original Message-----
> >> From: Fairoz Matte
> >> Sent: Wednesday, January 23, 2019 8:50 AM
> >> To: Vladimir Kozlov <vladimir.kozlov at oracle.com>; hotspot-compiler-
> >> dev at openjdk.java.net
> >> Subject: RE: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
> >> com.sun.crypto.provider.CipherBlockChaining
> >>
> >> Thanks Tobias and Vladimir for review.
> >>
> >> Thanks,
> >> Fairoz
> >>
> >>> -----Original Message-----
> >>> From: Vladimir Kozlov
> >>> Sent: Tuesday, January 22, 2019 10:27 PM
> >>> To: Fairoz Matte <fairoz.matte at oracle.com>; hotspot-compiler-
> >>> dev at openjdk.java.net
> >>> Subject: Re: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
> >>> com.sun.crypto.provider.CipherBlockChaining
> >>>
> >>> Yes, it is good.
> >>>
> >>> Thanks,
> >>> Vladimir
> >>>
> >>> On 1/22/19 12:22 AM, Tobias Hartmann wrote:
> >>>> Hi Fairoz,
> >>>>
> >>>> this looks good to me.
> >>>>
> >>>> Thanks,
> >>>> Tobias
> >>>>
> >>>> On 22.01.19 04:35, Fairoz Matte wrote:
> >>>>> Hi,
> >>>>>
> >>>>> Please review the following patch, JBS bug -
> >>>>> https://bugs.openjdk.java.net/browse/JDK-8209951
> >>>>> Webrev - http://cr.openjdk.java.net/~fmatte/8209951/webrev.00/
> >>>>>
> >>>>> During the call to assembled stub code
> >>>>> generate_cipherBlockChaining_decryptAESCrypt_Parallel()
> >>>>> there was reference to G6 register used for temporary storage of
> >>>>> F50, as G6 is not saved on stack it was resulting in garbage
> >>>>> during
> >> retrieval.
> >>>>>
> >>>>> Solution is to use unused local register (L6) for temporary
> >>>>> storage and
> >>> retrieval of F50.
> >>>>>
> >>>>> Thanks,
> >>>>> Fairoz
> >>>>>

From rwestrel at redhat.com  Thu Jan 24 09:43:12 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Thu, 24 Jan 2019 10:43:12 +0100
Subject: RFR(S): 8215483: Off heap memory accesses should be vectorized
In-Reply-To: <877eg6gaqk.fsf@redhat.com>
References: <877eg6gaqk.fsf@redhat.com>
Message-ID: <878sza75n3.fsf@redhat.com>


> http://cr.openjdk.java.net/~roland/8215483/webrev.00/

Anyone for that one?

Roland.

From claes.redestad at oracle.com  Thu Jan 24 09:58:37 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Thu, 24 Jan 2019 10:58:37 +0100
Subject: RFR(T): 8217716: Remove dead code in PhaseChaitin
Message-ID: <9437526b-41b2-a486-f3bc-fcfd6aaf082f@oracle.com>

Hi,

various methods and fields in PhaseChaitin and friends are unused and
should be removed.

Bug:    https://bugs.openjdk.java.net/browse/JDK-8217716
Webrev: http://cr.openjdk.java.net/~redestad/8217716/open.00/

At least one of the unused methods (PhaseChaitin::Pre_Simplify)
are linger in product build, so cleaning this up marginally improves
static footprint (-4Kb).

Testing: tier1+2

Thanks!

/Claes

From martin.doerr at sap.com  Thu Jan 24 10:02:37 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Thu, 24 Jan 2019 10:02:37 +0000
Subject: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
In-Reply-To: <243b17be-e1a3-7b68-1e72-9a114552860c@linux.vnet.ibm.com>
References: <AM6PR02MB47889B74BC2F81B06ACEC28C9A9F0@AM6PR02MB4788.eurprd02.prod.outlook.com>
 <243b17be-e1a3-7b68-1e72-9a114552860c@linux.vnet.ibm.com>
Message-ID: <AM6PR02MB4788E79F0764D15A3376376D9A9A0@AM6PR02MB4788.eurprd02.prod.outlook.com>

Hi Gustavo,

thank you for reviewing and testing.

Seems like many comments were taken from java.util.zip.CRC32. I guess it was intended to refer to it.
I think it's not bad to have it this way because it makes it easier to compare both implementations.
Maybe Lutz can comment on this and if he would like to keep it this way.

Best regards,
Martin


-----Original Message-----
From: Gustavo Romero <gromero at linux.vnet.ibm.com> 
Sent: Mittwoch, 23. Januar 2019 23:18
To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
Cc: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
Subject: Re: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32

Hi Martin,

On 01/21/2019 04:07 PM, Doerr, Martin wrote:
> PPC64 currently contains static tables for CRC32/CRC32C calculations. We only need some of them depending on Endianess and on whether vector instructions are available or not.
> We can get rid of quite some code when we generate these constants at startup as we already do for the vector version.
> In addition, we can save one register in the vector case because we can use one constants pointer for all related constants.
> Webrev:
> http://cr.openjdk.java.net/~mdoerr/8217459_ppc64_crc_consts/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8217459_ppc64_crc_consts/webrev.00/>

Thanks for the clean-up. Change looks good!

It's good to see fold_8bit_crc32 and kernel_crc32_1byte going away (I just
noted them recently so I missed both in my previous clean-up). And also
the static table simplification.

I tested the change with different array sizes and byte values with and
without vpmsum in the CPU, i.e. has_vpmsumb() = false, and found no issues.

Only a nit: should we update the following comment and replace 'timesXtoThe32'
by something better, maybe 'table'? That name doesn't look much meaningful in the
current context and seems taken from the native code for java.util.zip.CRC32:

3902 /**
3903  * uint32_t crc;
3904  * timesXtoThe32[crc & 0xFF] ^ (crc >> 8);
3905  */
3906 void MacroAssembler::fold_byte_crc32(Register crc, Register val, Register table, Register tmp) {


Best regards,
Gustavo


From tobias.hartmann at oracle.com  Thu Jan 24 10:03:41 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Thu, 24 Jan 2019 11:03:41 +0100
Subject: RFR(T): 8217716: Remove dead code in PhaseChaitin
In-Reply-To: <9437526b-41b2-a486-f3bc-fcfd6aaf082f@oracle.com>
References: <9437526b-41b2-a486-f3bc-fcfd6aaf082f@oracle.com>
Message-ID: <1daad406-ec37-f852-a08b-fd5c96349152@oracle.com>

Hi Claes,

looks good and trivial.

Best regards,
Tobias

On 24.01.19 10:58, Claes Redestad wrote:
> Hi,
> 
> various methods and fields in PhaseChaitin and friends are unused and
> should be removed.
> 
> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217716
> Webrev: http://cr.openjdk.java.net/~redestad/8217716/open.00/
> 
> At least one of the unused methods (PhaseChaitin::Pre_Simplify)
> are linger in product build, so cleaning this up marginally improves
> static footprint (-4Kb).
> 
> Testing: tier1+2
> 
> Thanks!
> 
> /Claes

From claes.redestad at oracle.com  Thu Jan 24 10:04:56 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Thu, 24 Jan 2019 11:04:56 +0100
Subject: RFR(T): 8217716: Remove dead code in PhaseChaitin
In-Reply-To: <1daad406-ec37-f852-a08b-fd5c96349152@oracle.com>
References: <9437526b-41b2-a486-f3bc-fcfd6aaf082f@oracle.com>
 <1daad406-ec37-f852-a08b-fd5c96349152@oracle.com>
Message-ID: <7f7c5785-319f-4700-c248-6f681393fa47@oracle.com>

On 2019-01-24 11:03, Tobias Hartmann wrote:
> Hi Claes,
> 
> looks good and trivial.

Thanks, Tobias!

/Claes

From Pengfei.Li at arm.com  Thu Jan 24 10:29:50 2019
From: Pengfei.Li at arm.com (Pengfei Li (Arm Technology China))
Date: Thu, 24 Jan 2019 10:29:50 +0000
Subject: [aarch64-port-dev ] RFR(S): 8216259: AArch64: Vectorize Adler32
 intrinsics
In-Reply-To: <771c5094-aacb-d52c-437f-29aaf5f8f01a@redhat.com>
References: <DB7PR08MB3115B20F823E5EAF0BA9C327969F0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <771c5094-aacb-d52c-437f-29aaf5f8f01a@redhat.com>
Message-ID: <DB7PR08MB31154B111D6349161EAE5647969A0@DB7PR08MB3115.eurprd08.prod.outlook.com>

Hi Andrew Haley,

> Instead, please put it into a function (e.g. updateBytesCRC32C_inner) and call
> it from updateBytesCRC32C. There's no point writing all this stuff out twice.

I uploaded a new webrev. Is it what you want?
http://cr.openjdk.java.net/~pli/rfr/8216259/webrev.01/

--
Thanks,
Pengfei


From lutz.schmidt at sap.com  Thu Jan 24 11:11:37 2019
From: lutz.schmidt at sap.com (Schmidt, Lutz)
Date: Thu, 24 Jan 2019 11:11:37 +0000
Subject: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
In-Reply-To: <AM6PR02MB4788E79F0764D15A3376376D9A9A0@AM6PR02MB4788.eurprd02.prod.outlook.com>
References: <AM6PR02MB47889B74BC2F81B06ACEC28C9A9F0@AM6PR02MB4788.eurprd02.prod.outlook.com>
 <243b17be-e1a3-7b68-1e72-9a114552860c@linux.vnet.ibm.com>
 <AM6PR02MB4788E79F0764D15A3376376D9A9A0@AM6PR02MB4788.eurprd02.prod.outlook.com>
Message-ID: <51A38D8D-15B8-47FA-AF07-5F4F8D1E0C94@sap.com>

Gustavo, Martin,

I agree, that comment appears somewhat disconnected from the code.
I'm really not sure if it will help a lot in the future to have a 
grep string that helps finding the related code in java.util.zip.CRC32.

In short: change it to something meaningful in the local context. 

Thanks,
Lutz

?On 24.01.19, 11:02, "Doerr, Martin" <martin.doerr at sap.com> wrote:

    Hi Gustavo,
    
    thank you for reviewing and testing.
    
    Seems like many comments were taken from java.util.zip.CRC32. I guess it was intended to refer to it.
    I think it's not bad to have it this way because it makes it easier to compare both implementations.
    Maybe Lutz can comment on this and if he would like to keep it this way.
    
    Best regards,
    Martin
    
    
    -----Original Message-----
    From: Gustavo Romero <gromero at linux.vnet.ibm.com>
    Sent: Mittwoch, 23. Januar 2019 23:18
    To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
    Cc: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
    Subject: Re: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
    
    Hi Martin,
    
    On 01/21/2019 04:07 PM, Doerr, Martin wrote:
    > PPC64 currently contains static tables for CRC32/CRC32C calculations. We only need some of them depending on Endianess and on whether vector instructions are available or not.
    > We can get rid of quite some code when we generate these constants at startup as we already do for the vector version.
    > In addition, we can save one register in the vector case because we can use one constants pointer for all related constants.
    > Webrev:
    > http://cr.openjdk.java.net/~mdoerr/8217459_ppc64_crc_consts/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8217459_ppc64_crc_consts/webrev.00/>
    
    Thanks for the clean-up. Change looks good!
    
    It's good to see fold_8bit_crc32 and kernel_crc32_1byte going away (I just
    noted them recently so I missed both in my previous clean-up). And also
    the static table simplification.
    
    I tested the change with different array sizes and byte values with and
    without vpmsum in the CPU, i.e. has_vpmsumb() = false, and found no issues.
    
    Only a nit: should we update the following comment and replace 'timesXtoThe32'
    by something better, maybe 'table'? That name doesn't look much meaningful in the
    current context and seems taken from the native code for java.util.zip.CRC32:
    
    3902 /**
    3903  * uint32_t crc;
    3904  * timesXtoThe32[crc & 0xFF] ^ (crc >> 8);
    3905  */
    3906 void MacroAssembler::fold_byte_crc32(Register crc, Register val, Register table, Register tmp) {
    
    
    Best regards,
    Gustavo
    
    
From martin.doerr at sap.com  Thu Jan 24 12:11:37 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Thu, 24 Jan 2019 12:11:37 +0000
Subject: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
In-Reply-To: <51A38D8D-15B8-47FA-AF07-5F4F8D1E0C94@sap.com>
References: <AM6PR02MB47889B74BC2F81B06ACEC28C9A9F0@AM6PR02MB4788.eurprd02.prod.outlook.com>
 <243b17be-e1a3-7b68-1e72-9a114552860c@linux.vnet.ibm.com>
 <AM6PR02MB4788E79F0764D15A3376376D9A9A0@AM6PR02MB4788.eurprd02.prod.outlook.com>
 <51A38D8D-15B8-47FA-AF07-5F4F8D1E0C94@sap.com>
Message-ID: <AM6PR02MB47888C89ECE27D7135121F179A9A0@AM6PR02MB4788.eurprd02.prod.outlook.com>

Hi Lutz and Gustavo,

that's fine. Removed the comments which refer to java.util.zip.CRC32 stuff.

And while reading through the comments, I found out that kernel_crc32_singleByte is not useful (since we have the ...Reg version). So I just removed it and replaced its only usage by better code (TemplateInterpreterGenerator::generate_CRC32_update_entry).

New webrev:
http://cr.openjdk.java.net/~mdoerr/8217459_ppc64_crc_consts/webrev.01/

Best regards,
Martin


-----Original Message-----
From: Schmidt, Lutz 
Sent: Donnerstag, 24. Januar 2019 12:12
To: Doerr, Martin <martin.doerr at sap.com>; Gustavo Romero <gromero at linux.vnet.ibm.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
Cc: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
Subject: Re: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32

Gustavo, Martin,

I agree, that comment appears somewhat disconnected from the code.
I'm really not sure if it will help a lot in the future to have a 
grep string that helps finding the related code in java.util.zip.CRC32.

In short: change it to something meaningful in the local context. 

Thanks,
Lutz

?On 24.01.19, 11:02, "Doerr, Martin" <martin.doerr at sap.com> wrote:

    Hi Gustavo,
    
    thank you for reviewing and testing.
    
    Seems like many comments were taken from java.util.zip.CRC32. I guess it was intended to refer to it.
    I think it's not bad to have it this way because it makes it easier to compare both implementations.
    Maybe Lutz can comment on this and if he would like to keep it this way.
    
    Best regards,
    Martin
    
    
    -----Original Message-----
    From: Gustavo Romero <gromero at linux.vnet.ibm.com>
    Sent: Mittwoch, 23. Januar 2019 23:18
    To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
    Cc: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
    Subject: Re: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
    
    Hi Martin,
    
    On 01/21/2019 04:07 PM, Doerr, Martin wrote:
    > PPC64 currently contains static tables for CRC32/CRC32C calculations. We only need some of them depending on Endianess and on whether vector instructions are available or not.
    > We can get rid of quite some code when we generate these constants at startup as we already do for the vector version.
    > In addition, we can save one register in the vector case because we can use one constants pointer for all related constants.
    > Webrev:
    > http://cr.openjdk.java.net/~mdoerr/8217459_ppc64_crc_consts/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8217459_ppc64_crc_consts/webrev.00/>
    
    Thanks for the clean-up. Change looks good!
    
    It's good to see fold_8bit_crc32 and kernel_crc32_1byte going away (I just
    noted them recently so I missed both in my previous clean-up). And also
    the static table simplification.
    
    I tested the change with different array sizes and byte values with and
    without vpmsum in the CPU, i.e. has_vpmsumb() = false, and found no issues.
    
    Only a nit: should we update the following comment and replace 'timesXtoThe32'
    by something better, maybe 'table'? That name doesn't look much meaningful in the
    current context and seems taken from the native code for java.util.zip.CRC32:
    
    3902 /**
    3903  * uint32_t crc;
    3904  * timesXtoThe32[crc & 0xFF] ^ (crc >> 8);
    3905  */
    3906 void MacroAssembler::fold_byte_crc32(Register crc, Register val, Register table, Register tmp) {
    
    
    Best regards,
    Gustavo
    
    
From aph at redhat.com  Thu Jan 24 12:32:55 2019
From: aph at redhat.com (Andrew Haley)
Date: Thu, 24 Jan 2019 12:32:55 +0000
Subject: [aarch64-port-dev ] RFR(S): 8216259: AArch64: Vectorize Adler32
 intrinsics
In-Reply-To: <DB7PR08MB31154B111D6349161EAE5647969A0@DB7PR08MB3115.eurprd08.prod.outlook.com>
References: <DB7PR08MB3115B20F823E5EAF0BA9C327969F0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <771c5094-aacb-d52c-437f-29aaf5f8f01a@redhat.com>
 <DB7PR08MB31154B111D6349161EAE5647969A0@DB7PR08MB3115.eurprd08.prod.outlook.com>
Message-ID: <7e8244c4-ba51-b434-425e-4db9f92fa500@redhat.com>

On 1/24/19 10:29 AM, Pengfei Li (Arm Technology China) wrote:
> Hi Andrew Haley,
> 
>> Instead, please put it into a function (e.g. updateBytesCRC32C_inner) and call
>> it from updateBytesCRC32C. There's no point writing all this stuff out twice.
> 
> I uploaded a new webrev. Is it what you want?
> http://cr.openjdk.java.net/~pli/rfr/8216259/webrev.01/

Yes, thank you. Ningsheng, once you have commit access to OpenJDK, will
you please push this?

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From nils.eliasson at oracle.com  Thu Jan 24 12:39:20 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Thu, 24 Jan 2019 13:39:20 +0100
Subject: RFR(S): 8215483: Off heap memory accesses should be vectorized
In-Reply-To: <878sza75n3.fsf@redhat.com>
References: <877eg6gaqk.fsf@redhat.com> <878sza75n3.fsf@redhat.com>
Message-ID: <e3ab3fc6-e425-944c-a450-024038349e49@oracle.com>

Hi Roland,

Looks good!

Thanks for fixing!

// Nils

On 2019-01-24 10:43, Roland Westrelin wrote:
>> http://cr.openjdk.java.net/~roland/8215483/webrev.00/
> Anyone for that one?
>
> Roland.

From gromero at linux.vnet.ibm.com  Thu Jan 24 14:17:04 2019
From: gromero at linux.vnet.ibm.com (Gustavo Romero)
Date: Thu, 24 Jan 2019 12:17:04 -0200
Subject: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
In-Reply-To: <AM6PR02MB47888C89ECE27D7135121F179A9A0@AM6PR02MB4788.eurprd02.prod.outlook.com>
References: <AM6PR02MB47889B74BC2F81B06ACEC28C9A9F0@AM6PR02MB4788.eurprd02.prod.outlook.com>
 <243b17be-e1a3-7b68-1e72-9a114552860c@linux.vnet.ibm.com>
 <AM6PR02MB4788E79F0764D15A3376376D9A9A0@AM6PR02MB4788.eurprd02.prod.outlook.com>
 <51A38D8D-15B8-47FA-AF07-5F4F8D1E0C94@sap.com>
 <AM6PR02MB47888C89ECE27D7135121F179A9A0@AM6PR02MB4788.eurprd02.prod.outlook.com>
Message-ID: <ca7913e3-136c-d30e-5b1a-9b46b9e82a3d@linux.vnet.ibm.com>

Hi Martin,

On 01/24/2019 10:11 AM, Doerr, Martin wrote:
> Hi Lutz and Gustavo,
> 
> that's fine. Removed the comments which refer to java.util.zip.CRC32 stuff.
> 
> And while reading through the comments, I found out that kernel_crc32_singleByte is not useful (since we have the ...Reg version). So I just removed it and replaced its only usage by better code (TemplateInterpreterGenerator::generate_CRC32_update_entry).
> 
> New webrev:
> http://cr.openjdk.java.net/~mdoerr/8217459_ppc64_crc_consts/webrev.01/

Thanks for the updated webrev.

The additional clean-up makes the code easier to read/follow too.

LGTM.

Best regards,
Gustavo

> Best regards,
> Martin
> 
> 
> -----Original Message-----
> From: Schmidt, Lutz
> Sent: Donnerstag, 24. Januar 2019 12:12
> To: Doerr, Martin <martin.doerr at sap.com>; Gustavo Romero <gromero at linux.vnet.ibm.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
> Cc: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
> Subject: Re: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
> 
> Gustavo, Martin,
> 
> I agree, that comment appears somewhat disconnected from the code.
> I'm really not sure if it will help a lot in the future to have a
> grep string that helps finding the related code in java.util.zip.CRC32.
> 
> In short: change it to something meaningful in the local context.
> 
> Thanks,
> Lutz
> 
> ?On 24.01.19, 11:02, "Doerr, Martin" <martin.doerr at sap.com> wrote:
> 
>      Hi Gustavo,
>      
>      thank you for reviewing and testing.
>      
>      Seems like many comments were taken from java.util.zip.CRC32. I guess it was intended to refer to it.
>      I think it's not bad to have it this way because it makes it easier to compare both implementations.
>      Maybe Lutz can comment on this and if he would like to keep it this way.
>      
>      Best regards,
>      Martin
>      
>      
>      -----Original Message-----
>      From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>      Sent: Mittwoch, 23. Januar 2019 23:18
>      To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>      Cc: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
>      Subject: Re: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
>      
>      Hi Martin,
>      
>      On 01/21/2019 04:07 PM, Doerr, Martin wrote:
>      > PPC64 currently contains static tables for CRC32/CRC32C calculations. We only need some of them depending on Endianess and on whether vector instructions are available or not.
>      > We can get rid of quite some code when we generate these constants at startup as we already do for the vector version.
>      > In addition, we can save one register in the vector case because we can use one constants pointer for all related constants.
>      > Webrev:
>      > http://cr.openjdk.java.net/~mdoerr/8217459_ppc64_crc_consts/webrev.00/ <http://cr.openjdk.java.net/%7Emdoerr/8217459_ppc64_crc_consts/webrev.00/>
>      
>      Thanks for the clean-up. Change looks good!
>      
>      It's good to see fold_8bit_crc32 and kernel_crc32_1byte going away (I just
>      noted them recently so I missed both in my previous clean-up). And also
>      the static table simplification.
>      
>      I tested the change with different array sizes and byte values with and
>      without vpmsum in the CPU, i.e. has_vpmsumb() = false, and found no issues.
>      
>      Only a nit: should we update the following comment and replace 'timesXtoThe32'
>      by something better, maybe 'table'? That name doesn't look much meaningful in the
>      current context and seems taken from the native code for java.util.zip.CRC32:
>      
>      3902 /**
>      3903  * uint32_t crc;
>      3904  * timesXtoThe32[crc & 0xFF] ^ (crc >> 8);
>      3905  */
>      3906 void MacroAssembler::fold_byte_crc32(Register crc, Register val, Register table, Register tmp) {
>      
>      
>      Best regards,
>      Gustavo
>      
>      
> 
> 


From Ningsheng.Jian at arm.com  Thu Jan 24 09:34:22 2019
From: Ningsheng.Jian at arm.com (Ningsheng Jian (Arm Technology China))
Date: Thu, 24 Jan 2019 09:34:22 +0000
Subject: [aarch64-port-dev ] Changes to Bellsoft/Marvell method of
 developing intrinsics
In-Reply-To: <MN2PR18MB2733D9E9AFCEA1B90D6E92FAD2990@MN2PR18MB2733.namprd18.prod.outlook.com>
References: <MN2PR18MB2733D9E9AFCEA1B90D6E92FAD2990@MN2PR18MB2733.namprd18.prod.outlook.com>
Message-ID: <AM0PR08MB35238E8B4C2925F00586A0C7909A0@AM0PR08MB3523.eurprd08.prod.outlook.com>

Hi Derek,

> 
> We will also begin back-reviewing existing complex intrinsics. If other members
> of the community are interested in working on this we can coordinate to ensure
> coverage.
> 

We (Arm) are happy to co-work on this and Pengfei has just started to investigate some existing complex string intrinsics.

Thanks,
Ningsheng

From aph at redhat.com  Thu Jan 24 16:51:45 2019
From: aph at redhat.com (Andrew Haley)
Date: Thu, 24 Jan 2019 16:51:45 +0000
Subject: Changes to Bellsoft/Marvell method of developing intrinsics
In-Reply-To: <MN2PR18MB2733D9E9AFCEA1B90D6E92FAD2990@MN2PR18MB2733.namprd18.prod.outlook.com>
References: <MN2PR18MB2733D9E9AFCEA1B90D6E92FAD2990@MN2PR18MB2733.namprd18.prod.outlook.com>
Message-ID: <f701775e-a511-e40b-0803-cec9fbfbdd53@redhat.com>

On 1/23/19 5:27 PM, Derek White wrote:

> Because of this we will change how we develop patches for complex
> intrinsics. Before sending the code out for public review, we intend
> to:
> 
>   * Use an additional ?red-team? developer to focus on finding the
> weak points in the code and develop tests that ensure code coverage
> testing, test case coverage, etc. This is in addition to the normal
> testing and test development that the initiating developer is
> expected to do.

>   * The ?red-team? developer will also suggest changes for code
> clarity and code documentation, and will document the test strategy
> (what cases are tested, what tests cover what code, how to run
> tests).

>   * We will include all tests developed as part of the patch, even
> if some modes may not be practical to run regularly as jtreg tests
> (for example if some tests take excessive time). This will allow
> later enhancements or fixes to the intrinsic to go through at least
> as thorough testing as the original.

Thank you for that. I would like to add one thing: before doing
anything you should openly discuss whether a change should be made
at all. We need to know the potential gains, the maintenance costs, and
what the alternatives are.

For example, it may well be possible to write intrinsics in C++ with a
little vector code that will perform nearly as well as hand-carved
assembly language. These will be much cheaper to write, easier to
maintain, and more reliable, for all the usual reasons to do with
high-level languages.

We may decide that we're not going to do an optimization even though
it will make some operation 10% faster because it's too risky. It's
only worth making changes if they really are justified by a significant
improvement on real-world workloads.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From vladimir.kozlov at oracle.com  Thu Jan 24 17:40:24 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Thu, 24 Jan 2019 09:40:24 -0800
Subject: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
 com.sun.crypto.provider.CipherBlockChaining
In-Reply-To: <d570143b-30cf-4d6d-92b6-fb6b47056fdb@default>
References: <be6b6aaa-d3e9-4ee4-a139-93d036c47ae8@default>
 <9d49a648-50e6-30cd-5705-2997f658d8e8@oracle.com>
 <794dcbd2-60e0-e9f4-f9b5-f789e45d373a@oracle.com>
 <323b7338-d507-4850-ab53-4a5295d7b62f@default>
 <d570143b-30cf-4d6d-92b6-fb6b47056fdb@default>
Message-ID: <5cd79149-c15c-b6de-7feb-9640b7b455f6@oracle.com>

Good.  This intrinsic is used only after method become hot and compiled by C2 JIT. You need a lot of iteration to 
trigger C2 compilation - C2 compiling threshold is 10_000. The test should iterate at least that much (not 5_000).
How long test run with 5_000 iterations? May be it is not practical to run 10_000 if it takes 30 min :(

Thanks,
Vladimir

On 1/23/19 11:14 PM, Fairoz Matte wrote:
> Hi,
> 
> This crash is very random and to exercise AES stability adding a unit testcase.
> Thanks Sean Coffey for bringing this into my notice.
> 
> I have updated webrev and kindly review
> http://cr.openjdk.java.net/~fmatte/8209951/webrev.01/
> 
> Note: Crash is only observed on JDK 8 with Sparc Solaris 10 machine after 3_000+ iterations.
> In the test case there is loop for 5_000 iterations and running in -Xbatch making it more
> predictable.
> 
> Thanks,
> Fairoz
> 
>> -----Original Message-----
>> From: Fairoz Matte
>> Sent: Wednesday, January 23, 2019 8:50 AM
>> To: Vladimir Kozlov <vladimir.kozlov at oracle.com>; hotspot-compiler-
>> dev at openjdk.java.net
>> Subject: RE: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
>> com.sun.crypto.provider.CipherBlockChaining
>>
>> Thanks Tobias and Vladimir for review.
>>
>> Thanks,
>> Fairoz
>>
>>> -----Original Message-----
>>> From: Vladimir Kozlov
>>> Sent: Tuesday, January 22, 2019 10:27 PM
>>> To: Fairoz Matte <fairoz.matte at oracle.com>; hotspot-compiler-
>>> dev at openjdk.java.net
>>> Subject: Re: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
>>> com.sun.crypto.provider.CipherBlockChaining
>>>
>>> Yes, it is good.
>>>
>>> Thanks,
>>> Vladimir
>>>
>>> On 1/22/19 12:22 AM, Tobias Hartmann wrote:
>>>> Hi Fairoz,
>>>>
>>>> this looks good to me.
>>>>
>>>> Thanks,
>>>> Tobias
>>>>
>>>> On 22.01.19 04:35, Fairoz Matte wrote:
>>>>> Hi,
>>>>>
>>>>> Please review the following patch,
>>>>> JBS bug - https://bugs.openjdk.java.net/browse/JDK-8209951
>>>>> Webrev - http://cr.openjdk.java.net/~fmatte/8209951/webrev.00/
>>>>>
>>>>> During the call to assembled stub code
>>>>> generate_cipherBlockChaining_decryptAESCrypt_Parallel()
>>>>> there was reference to G6 register used for temporary storage of
>>>>> F50, as G6 is not saved on stack it was resulting in garbage during
>> retrieval.
>>>>>
>>>>> Solution is to use unused local register (L6) for temporary storage
>>>>> and
>>> retrieval of F50.
>>>>>
>>>>> Thanks,
>>>>> Fairoz
>>>>>

From vladimir.kozlov at oracle.com  Thu Jan 24 17:50:53 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Thu, 24 Jan 2019 09:50:53 -0800
Subject: RFR(S): 8215483: Off heap memory accesses should be vectorized
In-Reply-To: <e3ab3fc6-e425-944c-a450-024038349e49@oracle.com>
References: <877eg6gaqk.fsf@redhat.com> <878sza75n3.fsf@redhat.com>
 <e3ab3fc6-e425-944c-a450-024038349e49@oracle.com>
Message-ID: <18b6c29f-3b1d-7970-c14c-e020f8d86c98@oracle.com>

Looks good to me too. But it would be nice to have changes explanation in RFE. Why it helps vectorize off heap memory 
accesses?

Thanks,
Vladimir

On 1/24/19 4:39 AM, Nils Eliasson wrote:
> Hi Roland,
> 
> Looks good!
> 
> Thanks for fixing!
> 
> // Nils
> 
> On 2019-01-24 10:43, Roland Westrelin wrote:
>>> http://cr.openjdk.java.net/~roland/8215483/webrev.00/
>> Anyone for that one?
>>
>> Roland.

From andrewluotechnologies at outlook.com  Thu Jan 24 19:48:01 2019
From: andrewluotechnologies at outlook.com (Andrew Luo)
Date: Thu, 24 Jan 2019 19:48:01 +0000
Subject: Enhancing jaotc to automatically find VS2017 linker
In-Reply-To: <MWHPR13MB1696DB273135D1BEFBE7DA3EA19C0@MWHPR13MB1696.namprd13.prod.outlook.com>
References: <MWHPR13MB1696DB273135D1BEFBE7DA3EA19C0@MWHPR13MB1696.namprd13.prod.outlook.com>
Message-ID: <MWHPR13MB1696B00D9C15ABD61491EAA8A19A0@MWHPR13MB1696.namprd13.prod.outlook.com>

Just wanted to check in again on this in case my email got missed over the long weekend (in the US).  Let me know if I've sent this to the wrong mailing list...

Anyways, after looking into it more myself though, it seems like out-of-process isn't that unusual given that we execute link.exe out of process anyways.

Thanks,

-Andrew

From: hotspot-compiler-dev <hotspot-compiler-dev-bounces at openjdk.java.net> On Behalf Of Andrew Luo
Sent: Friday, January 18, 2019 2:17 PM
To: hotspot-compiler-dev at openjdk.java.net
Subject: Enhancing jaotc to automatically find VS2017 linker

Hi,

Has there been any plans to enhance jaotc to support automatically finding the link.exe in VS2017?  If not, I am interested in contributing some work to support this.

I see that in Linker.java (src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/Linker.java) we find link.exe using the environment variables VS...COMNTOOLS, but since in VS2017 and forward, this is not defined, it seems another approach is necessary.  Microsoft suggests that you use vswhere (https://github.com/Microsoft/vswhere, BSD licensed, included with Visual Studio 2017 15.2 and forward) or their COM API to find the latest VS2017 toolset.

Anyways, if everyone agrees we should add VS2017 support, there are a few ways to do this (in order of simplest/easiest to most complex):


1.       Check that vswhere exists on the system, if it does, call vswhere (out of process - not sure this is acceptable...) and use that to find the VS2017 link.exe

2.       Ship vswhere with the JDK and call it out of process

3.       Statically link a copy of vswhere (BSD licensed - is this okay?) into our code and add a JNI stub to call it

4.       Call the COM API in a JNI function to get the latest version of VS2017

Personally I prefer (1), but if out-of-process isn't acceptable I'm fine with doing (4) or (3).

Let me know if you have any comments/feedback on this proposal.

Thanks,

-Andrew

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190124/fbc15379/attachment-0001.html>

From linuxtardis at gmail.com  Thu Jan 24 23:17:38 2019
From: linuxtardis at gmail.com (Jakub =?UTF-8?Q?Van=C4=9Bk?=)
Date: Fri, 25 Jan 2019 00:17:38 +0100
Subject: RFR(M)(round 2): 8215902: Add support for SoftFloat-3e library
In-Reply-To: <3f62f15e-ac5f-94d4-9744-c9cef796a3fa@oracle.com>
References: <4497ca084b9f48dbb8f6de1aa35c83653fd7acfb.camel@gmail.com>
 <7f69fc73-1c10-6b68-d657-c9e758d4bf1d@oracle.com>
 <de5bb24804a0c5b66f0412382f338e415de6b1ed.camel@gmail.com>
 <3f62f15e-ac5f-94d4-9744-c9cef796a3fa@oracle.com>
Message-ID: <12e1cb109842f145edf23b4ea5ef591395188de9.camel@gmail.com>

Hi Magnus,

thanks for the review!

I haven't received a review for the hotspot source changes yet, so I
will have to wait.

Regards,

Jakub

On 2019-01-23 at 13:55 +0100, Magnus Ihse Bursie wrote:
> Hi Jakub,
> 
> On 2019-01-15 17:31, Jakub Van?k wrote:
> > Hi Magnus and Erik,
> > 
> > I have added the link to the repository to README and I have
> > removed
> > the link to the mailing list thread. I have also recreated the
> > GitHub
> > repository. Now it is a fork of the mentioned repository with two
> > extra
> > commits containing README and the build scripts.
> > 
> > New webrev URL: 
> > http://cr.openjdk.java.net/~jakvanek/8215902/webrev.04/
> > Bug: https://bugs.openjdk.java.net/browse/JDK-8215902
> 
> Sorry for the late reply.
> 
> This looks very good! Thank you for fixing this, including rebasing
> the 
> github repo.
> 
> I'm not sure if you've gotten reviews from the hotspot team for the 
> hotspot source changes, but from a build perspective, this is good to
> go.
> 
> /Magnus
> > 
> > Regards,
> > 
> > Jakub
> > 
> > On 2019-01-15 at 15:05 +0100, Magnus Ihse Bursie wrote:
> > > On 2018-12-25 16:19, Jakub Van?k wrote:
> > > > Hi,
> > > > 
> > > > please review this webrev. It is a successor of the softfloat-3
> > > > [patch]
> > > > thread (first email
> > > > 
> > 
> > 
http://mail.openjdk.java.net/pipermail/hotspot-runtime-dev/2018-November/031311.html
> > > > )
> > > > 
> > > > Changes since the last patch (v6):
> > > > 
> > > > - renamed --with-softloat* to --with-sflt* (it is more compact
> > > > and
> > > > it
> > > >     corresponds to the old --with-sflt-lib=... option)
> > > > 
> > > > - license is now obtained via --with-sflt-license switch (so it
> > > > is
> > > > not
> > > >     included in OpenJDK source tree)
> > > > 
> > > > - updated documentation (slight rewording, added the license
> > > > option)
> > > > 
> > > > - checks for default --with/--without behavior are in place
> > > > again
> > > >     (I forgot them when I changed the way the library is
> > > > detected)
> > > > 
> > > > - added a simple testcase - I found a disrepancy between
> > > > softfloat
> > > > and
> > > >     system function behavior. When a float with bits 0x003FFFFF
> > > > is
> > > >     added to 0x00000001, the correct result is 0x00400000, but
> > > > the
> > > >     default software floating point implementation returns
> > > > 0x00000000.
> > > >     However I'm not sure where to put this test - now it is in
> > > >     test/hotspot/jtreg/compiler/floatingpoint.
> > > > 
> > > > - comments in code refer to CR 6757269 and newly JDK-8215902
> > > > too.
> > > > 
> > > > I have created a repository with SoftFloat-3e with build
> > > > configuration
> > > > specifically for OpenJDK on armel:
> > > > https://github.com/ev3dev-lang-java/softfloat-openjdk
> > > > 
> > > > I can add a link to it to the documentation.
> > > > 
> > > > Bug: https://bugs.openjdk.java.net/browse/JDK-8215902
> > > > Webrev: http://cr.openjdk.java.net/~jakvanek/8215902/webrev.02/
> > > 
> > > Hi Jakub,
> > > 
> > > In general this looks good.
> > > 
> > > Some comments:
> > > 
> > > I agree with Erik that you can add a link to your github project;
> > > compiling SoftFloat is outside the scope of the OpenJDK build
> > > instructions, but it can sure be helpful to lower the bar for
> > > users
> > > wanting to do that. Just one question: any particular reason you
> > > didn't
> > > create your github repo by forking the official
> > > https://github.com/ucb-bar/berkeley-softfloat-3? That way, it
> > > would
> > > have
> > > been easy for users to see that you were not adding any malicious
> > > or
> > > suspicious code to the original SoftFloat distribution.
> > > 
> > > On the other hand, I think the link to
> > > 
> > 
> > 
http://mail.openjdk.java.net/pipermail/aarch32-port-dev/2016-November/000611.html
> > >   
> > > is unnecessary and just creates clutter in the documentation.
> > > Please
> > > remove it.
> > > 
> > > /Magnus
> > > > CI build:
> > > > 
https://ci.adoptopenjdk.net/view/ev3dev/job/openjdk12_build_ev3_linux/67/
> > > > 
> > > > Cheers,
> > > > 
> > > > Jakub
> > > > 
> 
> 


From vladimir.x.ivanov at oracle.com  Fri Jan 25 01:24:15 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Thu, 24 Jan 2019 17:24:15 -0800
Subject: [13] RFR (S): 8217760: C2: Missing symbolic info on a call from
 intrinsics when invoked through MethodHandle
Message-ID: <7b65363c-25cf-9153-8606-1618241ad50b@oracle.com>

http://cr.openjdk.java.net/~vlivanov/8217760/webrev.00/
https://bugs.openjdk.java.net/browse/JDK-8217760

If an intrinsic is called through MethodHandle and it contains a call, 
then it crashes at the call site during resolution due to inconsistent 
symbolic info: bytecode refers to method handle linker (MH::linkTo*), 
but the call invokes some concrete method (result of inlining through 
the linker).

The fix is to explicitly attach symbolic info to the call using the 
machinery introduced by JDK-8072008 [1].

Testing: hs-precheckin-comp, hs-tier1, hs-tier2, hs-tier3

Best regards,
Vladimir Ivanov

[1] https://bugs.openjdk.java.net/browse/JDK-8072008

From Ningsheng.Jian at arm.com  Fri Jan 25 01:24:16 2019
From: Ningsheng.Jian at arm.com (Ningsheng Jian (Arm Technology China))
Date: Fri, 25 Jan 2019 01:24:16 +0000
Subject: [aarch64-port-dev ] RFR(S): 8216259: AArch64: Vectorize Adler32
 intrinsics
In-Reply-To: <7e8244c4-ba51-b434-425e-4db9f92fa500@redhat.com>
References: <DB7PR08MB3115B20F823E5EAF0BA9C327969F0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <771c5094-aacb-d52c-437f-29aaf5f8f01a@redhat.com>
 <DB7PR08MB31154B111D6349161EAE5647969A0@DB7PR08MB3115.eurprd08.prod.outlook.com>
 <7e8244c4-ba51-b434-425e-4db9f92fa500@redhat.com>
Message-ID: <73dec208-e876-570c-446d-bf9b12303d37@arm.com>


Hi Andrew,

On 01/24/2019 08:32 PM, Andrew Haley wrote:
> On 1/24/19 10:29 AM, Pengfei Li (Arm Technology China) wrote:
>> Hi Andrew Haley,
>>
>>> Instead, please put it into a function (e.g. updateBytesCRC32C_inner) and call
>>> it from updateBytesCRC32C. There's no point writing all this stuff out twice.
>>
>> I uploaded a new webrev. Is it what you want?
>> http://cr.openjdk.java.net/~pli/rfr/8216259/webrev.01/
>
> Yes, thank you. Ningsheng, once you have commit access to OpenJDK, will
> you please push this?
>

Sure! Thank you!

Regards,
Ningsheng

From igor.veresov at oracle.com  Fri Jan 25 01:26:37 2019
From: igor.veresov at oracle.com (Igor Veresov)
Date: Thu, 24 Jan 2019 17:26:37 -0800
Subject: Enhancing jaotc to automatically find VS2017 linker
In-Reply-To: <MWHPR13MB1696B00D9C15ABD61491EAA8A19A0@MWHPR13MB1696.namprd13.prod.outlook.com>
References: <MWHPR13MB1696DB273135D1BEFBE7DA3EA19C0@MWHPR13MB1696.namprd13.prod.outlook.com>
 <MWHPR13MB1696B00D9C15ABD61491EAA8A19A0@MWHPR13MB1696.namprd13.prod.outlook.com>
Message-ID: <BF3AFE9A-7118-4E8B-99D2-4B2BFCCBD908@oracle.com>

I think (1) sounds reasonable. Bob, what do you think?

igor


> On Jan 24, 2019, at 11:48 AM, Andrew Luo <andrewluotechnologies at outlook.com> wrote:
> 
> Just wanted to check in again on this in case my email got missed over the long weekend (in the US).  Let me know if I?ve sent this to the wrong mailing list?
>  
> Anyways, after looking into it more myself though, it seems like out-of-process isn?t that unusual given that we execute link.exe out of process anyways.
>  
> Thanks,
>  
> -Andrew
>  
> From: hotspot-compiler-dev <hotspot-compiler-dev-bounces at openjdk.java.net> On Behalf Of Andrew Luo
> Sent: Friday, January 18, 2019 2:17 PM
> To: hotspot-compiler-dev at openjdk.java.net
> Subject: Enhancing jaotc to automatically find VS2017 linker
>  
> Hi,
>  
> Has there been any plans to enhance jaotc to support automatically finding the link.exe in VS2017?  If not, I am interested in contributing some work to support this.
>  
> I see that in Linker.java (src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/Linker.java) we find link.exe using the environment variables VS?COMNTOOLS, but since in VS2017 and forward, this is not defined, it seems another approach is necessary.  Microsoft suggests that you use vswhere (https://github.com/Microsoft/vswhere <https://github.com/Microsoft/vswhere>, BSD licensed, included with Visual Studio 2017 15.2 and forward) or their COM API to find the latest VS2017 toolset.
>  
> Anyways, if everyone agrees we should add VS2017 support, there are a few ways to do this (in order of simplest/easiest to most complex):
>  
> 1.       Check that vswhere exists on the system, if it does, call vswhere (out of process ? not sure this is acceptable?) and use that to find the VS2017 link.exe
> 2.       Ship vswhere with the JDK and call it out of process
> 3.       Statically link a copy of vswhere (BSD licensed ? is this okay?) into our code and add a JNI stub to call it
> 4.       Call the COM API in a JNI function to get the latest version of VS2017
>  
> Personally I prefer (1), but if out-of-process isn?t acceptable I?m fine with doing (4) or (3).
>  
> Let me know if you have any comments/feedback on this proposal.
>  
> Thanks,
>  
> -Andrew

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190124/36fd63f9/attachment-0001.html>

From vladimir.x.ivanov at oracle.com  Fri Jan 25 01:34:03 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Thu, 24 Jan 2019 17:34:03 -0800
Subject: [13] RFR (XS): 8191998: C2: inlining through MH linkers drops
 speculative part of argument types
Message-ID: <2c266be3-d068-86ad-a521-3682faa17043@oracle.com>

http://cr.openjdk.java.net/~vlivanov/8191998/webrev.00/
https://bugs.openjdk.java.net/browse/JDK-8191998

CallGenerator::for_method_handle_inline() casts MH linker (MH::linkTo*) 
arguments before attempting inlining. If any argument has a speculative 
type attached, it is lost and can't be used later.

The patch preserves speculative part while sharpening the type (if 
needed) based on static information from the MemberName instance.

Testing: hs-precheckin-comp, hs-tier1, hs-tier2.

Best regards,
Vladimir Ivanov

From vladimir.x.ivanov at oracle.com  Fri Jan 25 01:56:39 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Thu, 24 Jan 2019 17:56:39 -0800
Subject: [13] RFR (S): 8192001: C2: inlining through dispatching MH linkers
 ignores speculative type of the receiver
Message-ID: <f40eb31e-b376-8e61-7cad-abad0ec97626@oracle.com>

http://cr.openjdk.java.net/~vlivanov/8192001/webrev.00/
https://bugs.openjdk.java.net/browse/JDK-8192001

When inlining through MethodHandle calls, C2 can improve inlining 
decisions by taking speculative types into account (availability of type 
information is addressed by JDK-8191998 [1]).

There's no profiling performed at method handle linker call sites 
(MethodHandle::linkTo*), but type info can flow from other sources.

As an example, consider the following case:

   class A           { void m() { ... } }
   class B extends A { void m() { ... } }

   MH = LOOKUP.findVirtual(A.class, "m", ...);

   void test(A o) throws Throwable {
     MH.invokeExact(o);
   }

   test(new B());

Before (no inlining):
251   12   !b        TestMH::test (21 bytes)
   ...
   @ 16   TestMH1$A::m (1 bytes)   virtual call

After (guarded inlining):
251   12   !b        TestMH::test (21 bytes)
   ...
   @ 16   TestMH1$B1::m (1 bytes)   inline (hot)
      \-> TypeProfile (-1/6701 counts) = TestMH1$B1

Testing: hs-precheckin-comp, hs-tier1, hs-tier2.

Best regards,
Vladimir Ivanov

[1] https://bugs.openjdk.java.net/browse/JDK-8192001

From fairoz.matte at oracle.com  Fri Jan 25 02:31:09 2019
From: fairoz.matte at oracle.com (Fairoz Matte)
Date: Thu, 24 Jan 2019 18:31:09 -0800 (PST)
Subject: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
 com.sun.crypto.provider.CipherBlockChaining
In-Reply-To: <5cd79149-c15c-b6de-7feb-9640b7b455f6@oracle.com>
References: <be6b6aaa-d3e9-4ee4-a139-93d036c47ae8@default>
 <9d49a648-50e6-30cd-5705-2997f658d8e8@oracle.com>
 <794dcbd2-60e0-e9f4-f9b5-f789e45d373a@oracle.com>
 <323b7338-d507-4850-ab53-4a5295d7b62f@default>
 <d570143b-30cf-4d6d-92b6-fb6b47056fdb@default>
 <5cd79149-c15c-b6de-7feb-9640b7b455f6@oracle.com>
Message-ID: <2befeec4-9ef3-4a7d-991d-2bfa2a0052de@default>

HI Vladimir,

Yes, it is difficult to run for 10_000 iterations in this case as it takes 15 to ~16mins.
Test case takes 8mins to run 5_000 iterations.
Crash is observed between 2_900 to 3_400 iterations, 
So I am running for 5_000 iteration with 10mins timeout.
Do let me know, if this works out for push?

Thanks,
Fairoz

> -----Original Message-----
> From: Vladimir Kozlov
> Sent: Thursday, January 24, 2019 11:10 PM
> To: Fairoz Matte <fairoz.matte at oracle.com>; Tobias Hartmann
> <tobias.hartmann at oracle.com>; hotspot-compiler-dev at openjdk.java.net
> Subject: Re: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
> com.sun.crypto.provider.CipherBlockChaining
> 
> Good.  This intrinsic is used only after method become hot and compiled by
> C2 JIT. You need a lot of iteration to trigger C2 compilation - C2 compiling
> threshold is 10_000. The test should iterate at least that much (not 5_000).
> How long test run with 5_000 iterations? May be it is not practical to run
> 10_000 if it takes 30 min :(
> 
> Thanks,
> Vladimir
> 
> On 1/23/19 11:14 PM, Fairoz Matte wrote:
> > Hi,
> >
> > This crash is very random and to exercise AES stability adding a unit
> testcase.
> > Thanks Sean Coffey for bringing this into my notice.
> >
> > I have updated webrev and kindly review
> > http://cr.openjdk.java.net/~fmatte/8209951/webrev.01/
> >
> > Note: Crash is only observed on JDK 8 with Sparc Solaris 10 machine after
> 3_000+ iterations.
> > In the test case there is loop for 5_000 iterations and running in
> > -Xbatch making it more predictable.
> >
> > Thanks,
> > Fairoz
> >
> >> -----Original Message-----
> >> From: Fairoz Matte
> >> Sent: Wednesday, January 23, 2019 8:50 AM
> >> To: Vladimir Kozlov <vladimir.kozlov at oracle.com>; hotspot-compiler-
> >> dev at openjdk.java.net
> >> Subject: RE: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
> >> com.sun.crypto.provider.CipherBlockChaining
> >>
> >> Thanks Tobias and Vladimir for review.
> >>
> >> Thanks,
> >> Fairoz
> >>
> >>> -----Original Message-----
> >>> From: Vladimir Kozlov
> >>> Sent: Tuesday, January 22, 2019 10:27 PM
> >>> To: Fairoz Matte <fairoz.matte at oracle.com>; hotspot-compiler-
> >>> dev at openjdk.java.net
> >>> Subject: Re: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
> >>> com.sun.crypto.provider.CipherBlockChaining
> >>>
> >>> Yes, it is good.
> >>>
> >>> Thanks,
> >>> Vladimir
> >>>
> >>> On 1/22/19 12:22 AM, Tobias Hartmann wrote:
> >>>> Hi Fairoz,
> >>>>
> >>>> this looks good to me.
> >>>>
> >>>> Thanks,
> >>>> Tobias
> >>>>
> >>>> On 22.01.19 04:35, Fairoz Matte wrote:
> >>>>> Hi,
> >>>>>
> >>>>> Please review the following patch, JBS bug -
> >>>>> https://bugs.openjdk.java.net/browse/JDK-8209951
> >>>>> Webrev - http://cr.openjdk.java.net/~fmatte/8209951/webrev.00/
> >>>>>
> >>>>> During the call to assembled stub code
> >>>>> generate_cipherBlockChaining_decryptAESCrypt_Parallel()
> >>>>> there was reference to G6 register used for temporary storage of
> >>>>> F50, as G6 is not saved on stack it was resulting in garbage
> >>>>> during
> >> retrieval.
> >>>>>
> >>>>> Solution is to use unused local register (L6) for temporary
> >>>>> storage and
> >>> retrieval of F50.
> >>>>>
> >>>>> Thanks,
> >>>>> Fairoz
> >>>>>

From vladimir.kozlov at oracle.com  Fri Jan 25 03:00:28 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Thu, 24 Jan 2019 19:00:28 -0800
Subject: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
 com.sun.crypto.provider.CipherBlockChaining
In-Reply-To: <2befeec4-9ef3-4a7d-991d-2bfa2a0052de@default>
References: <be6b6aaa-d3e9-4ee4-a139-93d036c47ae8@default>
 <9d49a648-50e6-30cd-5705-2997f658d8e8@oracle.com>
 <794dcbd2-60e0-e9f4-f9b5-f789e45d373a@oracle.com>
 <323b7338-d507-4850-ab53-4a5295d7b62f@default>
 <d570143b-30cf-4d6d-92b6-fb6b47056fdb@default>
 <5cd79149-c15c-b6de-7feb-9640b7b455f6@oracle.com>
 <2befeec4-9ef3-4a7d-991d-2bfa2a0052de@default>
Message-ID: <0F90FB89-4F15-4759-9AB4-E82ACE912963@oracle.com>

Yes, it works for push. Thank you for information.

Vladimir 

> On Jan 24, 2019, at 6:31 PM, Fairoz Matte <fairoz.matte at oracle.com> wrote:
> 
> HI Vladimir,
> 
> Yes, it is difficult to run for 10_000 iterations in this case as it takes 15 to ~16mins.
> Test case takes 8mins to run 5_000 iterations.
> Crash is observed between 2_900 to 3_400 iterations, 
> So I am running for 5_000 iteration with 10mins timeout.
> Do let me know, if this works out for push?
> 
> Thanks,
> Fairoz
> 
>> -----Original Message-----
>> From: Vladimir Kozlov
>> Sent: Thursday, January 24, 2019 11:10 PM
>> To: Fairoz Matte <fairoz.matte at oracle.com>; Tobias Hartmann
>> <tobias.hartmann at oracle.com>; hotspot-compiler-dev at openjdk.java.net
>> Subject: Re: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
>> com.sun.crypto.provider.CipherBlockChaining
>> 
>> Good.  This intrinsic is used only after method become hot and compiled by
>> C2 JIT. You need a lot of iteration to trigger C2 compilation - C2 compiling
>> threshold is 10_000. The test should iterate at least that much (not 5_000).
>> How long test run with 5_000 iterations? May be it is not practical to run
>> 10_000 if it takes 30 min :(
>> 
>> Thanks,
>> Vladimir
>> 
>>> On 1/23/19 11:14 PM, Fairoz Matte wrote:
>>> Hi,
>>> 
>>> This crash is very random and to exercise AES stability adding a unit
>> testcase.
>>> Thanks Sean Coffey for bringing this into my notice.
>>> 
>>> I have updated webrev and kindly review
>>> http://cr.openjdk.java.net/~fmatte/8209951/webrev.01/
>>> 
>>> Note: Crash is only observed on JDK 8 with Sparc Solaris 10 machine after
>> 3_000+ iterations.
>>> In the test case there is loop for 5_000 iterations and running in
>>> -Xbatch making it more predictable.
>>> 
>>> Thanks,
>>> Fairoz
>>> 
>>>> -----Original Message-----
>>>> From: Fairoz Matte
>>>> Sent: Wednesday, January 23, 2019 8:50 AM
>>>> To: Vladimir Kozlov <vladimir.kozlov at oracle.com>; hotspot-compiler-
>>>> dev at openjdk.java.net
>>>> Subject: RE: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
>>>> com.sun.crypto.provider.CipherBlockChaining
>>>> 
>>>> Thanks Tobias and Vladimir for review.
>>>> 
>>>> Thanks,
>>>> Fairoz
>>>> 
>>>>> -----Original Message-----
>>>>> From: Vladimir Kozlov
>>>>> Sent: Tuesday, January 22, 2019 10:27 PM
>>>>> To: Fairoz Matte <fairoz.matte at oracle.com>; hotspot-compiler-
>>>>> dev at openjdk.java.net
>>>>> Subject: Re: [13] RFR(S): 8209951 : Problematic sparc intrinsic:
>>>>> com.sun.crypto.provider.CipherBlockChaining
>>>>> 
>>>>> Yes, it is good.
>>>>> 
>>>>> Thanks,
>>>>> Vladimir
>>>>> 
>>>>>> On 1/22/19 12:22 AM, Tobias Hartmann wrote:
>>>>>> Hi Fairoz,
>>>>>> 
>>>>>> this looks good to me.
>>>>>> 
>>>>>> Thanks,
>>>>>> Tobias
>>>>>> 
>>>>>>> On 22.01.19 04:35, Fairoz Matte wrote:
>>>>>>> Hi,
>>>>>>> 
>>>>>>> Please review the following patch, JBS bug -
>>>>>>> https://bugs.openjdk.java.net/browse/JDK-8209951
>>>>>>> Webrev - http://cr.openjdk.java.net/~fmatte/8209951/webrev.00/
>>>>>>> 
>>>>>>> During the call to assembled stub code
>>>>>>> generate_cipherBlockChaining_decryptAESCrypt_Parallel()
>>>>>>> there was reference to G6 register used for temporary storage of
>>>>>>> F50, as G6 is not saved on stack it was resulting in garbage
>>>>>>> during
>>>> retrieval.
>>>>>>> 
>>>>>>> Solution is to use unused local register (L6) for temporary
>>>>>>> storage and
>>>>> retrieval of F50.
>>>>>>> 
>>>>>>> Thanks,
>>>>>>> Fairoz
>>>>>>> 


From vladimir.x.ivanov at oracle.com  Fri Jan 25 03:18:18 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Thu, 24 Jan 2019 19:18:18 -0800
Subject: [13] RFR (S): 8059241: C2: Excessive RemoveUseless passes during
 incremental inlining
Message-ID: <00177ee9-7d3d-f3da-e2e8-1f9b76a2bdc8@oracle.com>

http://cr.openjdk.java.net/~vlivanov/8059241/webrev.03/
https://bugs.openjdk.java.net/browse/JDK-8059241

I'd like to revive the fix for 8059241 [1].

Quoting original review request:

   "According to -XX:+CITime, C2 spends too much time in incremental
inlining (see the bug for the numbers).

...

The fix is two-fold:

   (1) Reduce PhaseRemoveUseless frequency: inline in larger chunks until
IR size LiveNodeCountInliningCutoff, then eliminate dead nodes.

   (2) Have a relatively small (10%) gap between
LiveNodeCountInliningCutoff and actual limit when inlining step is
finished to give the algorithm some space to "breath" (hence smallest
inlining chunk produce at least 10%*LiveNodeCountInliningCutoff nodes)."


At that time, the blocker issue was that skipping RemoveUseless/IGVN 
exposed dead nodes during consequent parsing [2] and I didn't find a 
good solution without sacrifice too much improvement.

I believe I have it now: parsing is good at handling local dead code, 
but the problem arise when inlining is resumed from a call node on 
consequent iterations (e.g., uncommon trap dominating the call being 
inlined). So, when return path is dead after inlining, cleanup is 
performed before initiating next iteration. It allows to purge any 
effectively dead call sites pending inlining.

Also, there's a fix for unrelated problem exposed during testing [3]:

It's unsafe to repeatedly call PhaseGVN::transform() on TypeNode because 
it may cause a type change and then hitting assert due to changed hash 
in TypeNode::set_type() (called from Node::raise_bottom_type()) because 
type node hash depends on its type.


Testing: hs-precheckin-comp, tier1-5 \w -XX:+AlwaysIncrementalInline

I ran Octane w/ -XX:+CITime and observed significant reduction in 
"Incremental Inline" times (~50% off on IGVN, 80-95% off on "Prune 
Useless").

Best regards,
Vladimir Ivanov

[1] 
https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2015-April/017752.html

[2] 
https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2015-April/017801.html

[3] 
http://cr.openjdk.java.net/~vlivanov/8059241/webrev.03/src/hotspot/share/opto/castnode.cpp.udiff.html

static inline Node* addP_of_X2P(PhaseGVN *phase,
                                  Node* base,
                                  Node* dispX,
                                  bool negate = false) {
    if (negate) {
-    dispX = new SubXNode(phase->MakeConX(0), phase->transform(dispX));
+    dispX = phase->transform(new SubXNode(phase->MakeConX(0), dispX));
    }
    return new AddPNode(phase->C->top(),
                        phase->transform(new CastX2PNode(base)),
-                      phase->transform(dispX));
+                      dispX);
  }

From igor.ignatyev at oracle.com  Fri Jan 25 05:22:15 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Thu, 24 Jan 2019 21:22:15 -0800
Subject: RFR(S)[12] : 8067250 : [mlvm]
 vm/mlvm/mixed/stress/regression/b6969574 fails and perf regression
Message-ID: <EEAF9BB5-8441-4E6E-B063-2C8FBFFABF49@oracle.com>

http://cr.openjdk.java.net/~iignatyev//8067250/webrev.00/index.html
> 21 lines changed: 0 ins; 17 del; 4 mod;

Hi all,

could you please review this small fix for b6969574 test? 

the tests compares performance of different invocation modes, long time ago it started to report that MH.invokeWithArguments() is slower than Method.invoke(). 8078511[1] was filed to consider possible improvement for invokeWithArguments. however, since MH.invokeWithArguments performance isn't expected to  be better thanMethod.invoke, it was decided to remove it from the test.

webrev: http://cr.openjdk.java.net/~iignatyev//8067250/webrev.00/index.html
JBS: https://bugs.openjdk.java.net/browse/JDK-8067250
testing: the affected tests in the configurations known to provoke failures 

Thanks,
-- Igor

From david.holmes at oracle.com  Fri Jan 25 06:45:05 2019
From: david.holmes at oracle.com (David Holmes)
Date: Fri, 25 Jan 2019 16:45:05 +1000
Subject: RFR(M)(round 2): 8215902: Add support for SoftFloat-3e library
In-Reply-To: <12e1cb109842f145edf23b4ea5ef591395188de9.camel@gmail.com>
References: <4497ca084b9f48dbb8f6de1aa35c83653fd7acfb.camel@gmail.com>
 <7f69fc73-1c10-6b68-d657-c9e758d4bf1d@oracle.com>
 <de5bb24804a0c5b66f0412382f338e415de6b1ed.camel@gmail.com>
 <3f62f15e-ac5f-94d4-9744-c9cef796a3fa@oracle.com>
 <12e1cb109842f145edf23b4ea5ef591395188de9.camel@gmail.com>
Message-ID: <b36fabae-a384-4d94-ec92-a5fbcc34300b@oracle.com>

Hi Jakub,

On 25/01/2019 9:17 am, Jakub Van?k wrote:
> Hi Magnus,
> 
> thanks for the review!
> 
> I haven't received a review for the hotspot source changes yet, so I
> will have to wait.

Not an expert on the details of the FP code but the wrapper layer 
appears okay to me.

One nit with the test is that we don't name tests using bug numbers any 
more, so please rename the test to something more descriptive ... 
perhaps FloatPrecisionTest.java ?

Also on the test, given this wraps a number of FP functions does the 
test need to be more elaborate to ensure they have all been covered?

Thanks,
David
-----

> Regards,
> 
> Jakub
> 
> On 2019-01-23 at 13:55 +0100, Magnus Ihse Bursie wrote:
>> Hi Jakub,
>>
>> On 2019-01-15 17:31, Jakub Van?k wrote:
>>> Hi Magnus and Erik,
>>>
>>> I have added the link to the repository to README and I have
>>> removed
>>> the link to the mailing list thread. I have also recreated the
>>> GitHub
>>> repository. Now it is a fork of the mentioned repository with two
>>> extra
>>> commits containing README and the build scripts.
>>>
>>> New webrev URL:
>>> http://cr.openjdk.java.net/~jakvanek/8215902/webrev.04/
>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8215902
>>
>> Sorry for the late reply.
>>
>> This looks very good! Thank you for fixing this, including rebasing
>> the
>> github repo.
>>
>> I'm not sure if you've gotten reviews from the hotspot team for the
>> hotspot source changes, but from a build perspective, this is good to
>> go.
>>
>> /Magnus
>>>
>>> Regards,
>>>
>>> Jakub
>>>
>>> On 2019-01-15 at 15:05 +0100, Magnus Ihse Bursie wrote:
>>>> On 2018-12-25 16:19, Jakub Van?k wrote:
>>>>> Hi,
>>>>>
>>>>> please review this webrev. It is a successor of the softfloat-3
>>>>> [patch]
>>>>> thread (first email
>>>>>
>>>
>>>
> http://mail.openjdk.java.net/pipermail/hotspot-runtime-dev/2018-November/031311.html
>>>>> )
>>>>>
>>>>> Changes since the last patch (v6):
>>>>>
>>>>> - renamed --with-softloat* to --with-sflt* (it is more compact
>>>>> and
>>>>> it
>>>>>      corresponds to the old --with-sflt-lib=... option)
>>>>>
>>>>> - license is now obtained via --with-sflt-license switch (so it
>>>>> is
>>>>> not
>>>>>      included in OpenJDK source tree)
>>>>>
>>>>> - updated documentation (slight rewording, added the license
>>>>> option)
>>>>>
>>>>> - checks for default --with/--without behavior are in place
>>>>> again
>>>>>      (I forgot them when I changed the way the library is
>>>>> detected)
>>>>>
>>>>> - added a simple testcase - I found a disrepancy between
>>>>> softfloat
>>>>> and
>>>>>      system function behavior. When a float with bits 0x003FFFFF
>>>>> is
>>>>>      added to 0x00000001, the correct result is 0x00400000, but
>>>>> the
>>>>>      default software floating point implementation returns
>>>>> 0x00000000.
>>>>>      However I'm not sure where to put this test - now it is in
>>>>>      test/hotspot/jtreg/compiler/floatingpoint.
>>>>>
>>>>> - comments in code refer to CR 6757269 and newly JDK-8215902
>>>>> too.
>>>>>
>>>>> I have created a repository with SoftFloat-3e with build
>>>>> configuration
>>>>> specifically for OpenJDK on armel:
>>>>> https://github.com/ev3dev-lang-java/softfloat-openjdk
>>>>>
>>>>> I can add a link to it to the documentation.
>>>>>
>>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8215902
>>>>> Webrev: http://cr.openjdk.java.net/~jakvanek/8215902/webrev.02/
>>>>
>>>> Hi Jakub,
>>>>
>>>> In general this looks good.
>>>>
>>>> Some comments:
>>>>
>>>> I agree with Erik that you can add a link to your github project;
>>>> compiling SoftFloat is outside the scope of the OpenJDK build
>>>> instructions, but it can sure be helpful to lower the bar for
>>>> users
>>>> wanting to do that. Just one question: any particular reason you
>>>> didn't
>>>> create your github repo by forking the official
>>>> https://github.com/ucb-bar/berkeley-softfloat-3? That way, it
>>>> would
>>>> have
>>>> been easy for users to see that you were not adding any malicious
>>>> or
>>>> suspicious code to the original SoftFloat distribution.
>>>>
>>>> On the other hand, I think the link to
>>>>
>>>
>>>
> http://mail.openjdk.java.net/pipermail/aarch32-port-dev/2016-November/000611.html
>>>>    
>>>> is unnecessary and just creates clutter in the documentation.
>>>> Please
>>>> remove it.
>>>>
>>>> /Magnus
>>>>> CI build:
>>>>>
> https://ci.adoptopenjdk.net/view/ev3dev/job/openjdk12_build_ev3_linux/67/
>>>>>
>>>>> Cheers,
>>>>>
>>>>> Jakub
>>>>>
>>
>>
> 

From igor.ignatyev at oracle.com  Fri Jan 25 07:46:48 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Thu, 24 Jan 2019 23:46:48 -0800
Subject: RFR(T) [12] : 8217770 : problem list
 org.graalvm.compiler.debug.test.DebugContextTest
Message-ID: <88E96595-0B8D-48F3-B7A1-B48A2A7922C4@oracle.com>

http://cr.openjdk.java.net/~iignatyev//8217770/webrev.00/index.html
> 1 line changed: 1 ins; 0 del; 0 mod;

> diff -r 6533b2b34593 test/hotspot/jtreg/ProblemList-graal.txt
>  org.graalvm.compiler.core.test.OptionsVerifierTest                               8205081
>  org.graalvm.compiler.hotspot.test.CompilationWrapperTest                         8205081
>  org.graalvm.compiler.replacements.test.classfile.ClassfileBytecodeProviderTest   8205081
> +org.graalvm.compiler.debug.test.DebugContextTest                                 8205081
>  
>  org.graalvm.compiler.core.test.deopt.CompiledMethodTest          8202955

Hi all,

could you please review this tiny and trivial patch which puts org.graalvm.compiler.debug.test.DebugContextTest back into the problem list?

this graal unit test was prematurely removed from the problem list by 8217580[1-2]. this test is/was known to fail not only because of 8203504[3], but also because of 8205081[4]. this patch put DebugContextTest test to the problem list w/ 8205081 as the reason.

JBS: https://bugs.openjdk.java.net/browse/JDK-8217770
webrev: http://cr.openjdk.java.net/~iignatyev//8217770/webrev.00/index.html
testing: compiler/graalunit/DebugTest.java (which includes this unit test)

[1] https://bugs.openjdk.java.net/browse/JDK-8217580
[2] http://hg.openjdk.java.net/jdk/jdk12/rev/6533b2b34593#l2.44
[3] https://bugs.openjdk.java.net/browse/JDK-8203504
[4] https://bugs.openjdk.java.net/browse/JDK-8205081

Thanks,
-- Igor

From david.holmes at oracle.com  Fri Jan 25 07:50:28 2019
From: david.holmes at oracle.com (David Holmes)
Date: Fri, 25 Jan 2019 17:50:28 +1000
Subject: RFR(T) [12] : 8217770 : problem list
 org.graalvm.compiler.debug.test.DebugContextTest
In-Reply-To: <88E96595-0B8D-48F3-B7A1-B48A2A7922C4@oracle.com>
References: <88E96595-0B8D-48F3-B7A1-B48A2A7922C4@oracle.com>
Message-ID: <e7296bb2-2257-2f28-2ca4-fe7bbc8e2e12@oracle.com>

Looks good.

Thanks,
David

On 25/01/2019 5:46 pm, Igor Ignatyev wrote:
> http://cr.openjdk.java.net/~iignatyev//8217770/webrev.00/index.html
>> 1 line changed: 1 ins; 0 del; 0 mod;
> 
>> diff -r 6533b2b34593 test/hotspot/jtreg/ProblemList-graal.txt
>>   org.graalvm.compiler.core.test.OptionsVerifierTest                               8205081
>>   org.graalvm.compiler.hotspot.test.CompilationWrapperTest                         8205081
>>   org.graalvm.compiler.replacements.test.classfile.ClassfileBytecodeProviderTest   8205081
>> +org.graalvm.compiler.debug.test.DebugContextTest                                 8205081
>>   
>>   org.graalvm.compiler.core.test.deopt.CompiledMethodTest          8202955
> 
> Hi all,
> 
> could you please review this tiny and trivial patch which puts org.graalvm.compiler.debug.test.DebugContextTest back into the problem list?
> 
> this graal unit test was prematurely removed from the problem list by 8217580[1-2]. this test is/was known to fail not only because of 8203504[3], but also because of 8205081[4]. this patch put DebugContextTest test to the problem list w/ 8205081 as the reason.
> 
> JBS: https://bugs.openjdk.java.net/browse/JDK-8217770
> webrev: http://cr.openjdk.java.net/~iignatyev//8217770/webrev.00/index.html
> testing: compiler/graalunit/DebugTest.java (which includes this unit test)
> 
> [1] https://bugs.openjdk.java.net/browse/JDK-8217580
> [2] http://hg.openjdk.java.net/jdk/jdk12/rev/6533b2b34593#l2.44
> [3] https://bugs.openjdk.java.net/browse/JDK-8203504
> [4] https://bugs.openjdk.java.net/browse/JDK-8205081
> 
> Thanks,
> -- Igor
> 

From igor.ignatyev at oracle.com  Fri Jan 25 07:53:38 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Thu, 24 Jan 2019 23:53:38 -0800
Subject: RFR(T) [12] : 8217770 : problem list
 org.graalvm.compiler.debug.test.DebugContextTest
In-Reply-To: <e7296bb2-2257-2f28-2ca4-fe7bbc8e2e12@oracle.com>
References: <88E96595-0B8D-48F3-B7A1-B48A2A7922C4@oracle.com>
 <e7296bb2-2257-2f28-2ca4-fe7bbc8e2e12@oracle.com>
Message-ID: <E6860494-8637-405E-9FFD-B6DEB37051A2@oracle.com>

that was fast! thanks David.

-- Igor

> On Jan 24, 2019, at 11:50 PM, David Holmes <david.holmes at oracle.com> wrote:
> 
> Looks good.
> 
> Thanks,
> David
> 
> On 25/01/2019 5:46 pm, Igor Ignatyev wrote:
>> http://cr.openjdk.java.net/~iignatyev//8217770/webrev.00/index.html
>>> 1 line changed: 1 ins; 0 del; 0 mod;
>>> diff -r 6533b2b34593 test/hotspot/jtreg/ProblemList-graal.txt
>>>  org.graalvm.compiler.core.test.OptionsVerifierTest                               8205081
>>>  org.graalvm.compiler.hotspot.test.CompilationWrapperTest                         8205081
>>>  org.graalvm.compiler.replacements.test.classfile.ClassfileBytecodeProviderTest   8205081
>>> +org.graalvm.compiler.debug.test.DebugContextTest                                 8205081
>>>    org.graalvm.compiler.core.test.deopt.CompiledMethodTest          8202955
>> Hi all,
>> could you please review this tiny and trivial patch which puts org.graalvm.compiler.debug.test.DebugContextTest back into the problem list?
>> this graal unit test was prematurely removed from the problem list by 8217580[1-2]. this test is/was known to fail not only because of 8203504[3], but also because of 8205081[4]. this patch put DebugContextTest test to the problem list w/ 8205081 as the reason.
>> JBS: https://bugs.openjdk.java.net/browse/JDK-8217770
>> webrev: http://cr.openjdk.java.net/~iignatyev//8217770/webrev.00/index.html
>> testing: compiler/graalunit/DebugTest.java (which includes this unit test)
>> [1] https://bugs.openjdk.java.net/browse/JDK-8217580
>> [2] http://hg.openjdk.java.net/jdk/jdk12/rev/6533b2b34593#l2.44
>> [3] https://bugs.openjdk.java.net/browse/JDK-8203504
>> [4] https://bugs.openjdk.java.net/browse/JDK-8205081
>> Thanks,
>> -- Igor


From tobias.hartmann at oracle.com  Fri Jan 25 08:07:55 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Fri, 25 Jan 2019 09:07:55 +0100
Subject: [13] RFR (S): 8217760: C2: Missing symbolic info on a call from
 intrinsics when invoked through MethodHandle
In-Reply-To: <7b65363c-25cf-9153-8606-1618241ad50b@oracle.com>
References: <7b65363c-25cf-9153-8606-1618241ad50b@oracle.com>
Message-ID: <b067cd07-9699-964a-1e57-2f7876b0abf0@oracle.com>

Hi Vladimir,

this looks good to me.

Best regards,
Tobias

On 25.01.19 02:24, Vladimir Ivanov wrote:
> http://cr.openjdk.java.net/~vlivanov/8217760/webrev.00/
> https://bugs.openjdk.java.net/browse/JDK-8217760
> 
> If an intrinsic is called through MethodHandle and it contains a call, then it crashes at the call
> site during resolution due to inconsistent symbolic info: bytecode refers to method handle linker
> (MH::linkTo*), but the call invokes some concrete method (result of inlining through the linker).
> 
> The fix is to explicitly attach symbolic info to the call using the machinery introduced by
> JDK-8072008 [1].
> 
> Testing: hs-precheckin-comp, hs-tier1, hs-tier2, hs-tier3
> 
> Best regards,
> Vladimir Ivanov
> 
> [1] https://bugs.openjdk.java.net/browse/JDK-8072008

From tobias.hartmann at oracle.com  Fri Jan 25 08:13:43 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Fri, 25 Jan 2019 09:13:43 +0100
Subject: [13] RFR (XS): 8191998: C2: inlining through MH linkers drops
 speculative part of argument types
In-Reply-To: <2c266be3-d068-86ad-a521-3682faa17043@oracle.com>
References: <2c266be3-d068-86ad-a521-3682faa17043@oracle.com>
Message-ID: <b31aa68a-06d9-8449-6a89-30b4ae2e7a98@oracle.com>

Hi Vladimir,

looks good.

Best regards,
Tobias

On 25.01.19 02:34, Vladimir Ivanov wrote:
> http://cr.openjdk.java.net/~vlivanov/8191998/webrev.00/
> https://bugs.openjdk.java.net/browse/JDK-8191998
> 
> CallGenerator::for_method_handle_inline() casts MH linker (MH::linkTo*) arguments before attempting
> inlining. If any argument has a speculative type attached, it is lost and can't be used later.
> 
> The patch preserves speculative part while sharpening the type (if needed) based on static
> information from the MemberName instance.
> 
> Testing: hs-precheckin-comp, hs-tier1, hs-tier2.
> 
> Best regards,
> Vladimir Ivanov

From goetz.lindenmaier at sap.com  Fri Jan 25 08:14:53 2019
From: goetz.lindenmaier at sap.com (Lindenmaier, Goetz)
Date: Fri, 25 Jan 2019 08:14:53 +0000
Subject: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
In-Reply-To: <AM6PR02MB47888C89ECE27D7135121F179A9A0@AM6PR02MB4788.eurprd02.prod.outlook.com>
References: <AM6PR02MB47889B74BC2F81B06ACEC28C9A9F0@AM6PR02MB4788.eurprd02.prod.outlook.com>
 <243b17be-e1a3-7b68-1e72-9a114552860c@linux.vnet.ibm.com>
 <AM6PR02MB4788E79F0764D15A3376376D9A9A0@AM6PR02MB4788.eurprd02.prod.outlook.com>
 <51A38D8D-15B8-47FA-AF07-5F4F8D1E0C94@sap.com>
 <AM6PR02MB47888C89ECE27D7135121F179A9A0@AM6PR02MB4788.eurprd02.prod.outlook.com>
Message-ID: <ec71c98f21934e7ab1b2d5343c2bca6a@sap.com>

Hi Martin,

The change looks good to me. 

Best regards,
  Goetz.

> -----Original Message-----
> From: Doerr, Martin
> Sent: Thursday, January 24, 2019 1:12 PM
> To: Schmidt, Lutz <lutz.schmidt at sap.com>; Gustavo Romero
> <gromero at linux.vnet.ibm.com>; 'hotspot-compiler-dev at openjdk.java.net'
> <hotspot-compiler-dev at openjdk.java.net>
> Cc: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
> Subject: RE: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
> 
> Hi Lutz and Gustavo,
> 
> that's fine. Removed the comments which refer to java.util.zip.CRC32 stuff.
> 
> And while reading through the comments, I found out that
> kernel_crc32_singleByte is not useful (since we have the ...Reg version). So I
> just removed it and replaced its only usage by better code
> (TemplateInterpreterGenerator::generate_CRC32_update_entry).
> 
> New webrev:
> http://cr.openjdk.java.net/~mdoerr/8217459_ppc64_crc_consts/webrev.01/
> 
> Best regards,
> Martin
> 
> 
> -----Original Message-----
> From: Schmidt, Lutz
> Sent: Donnerstag, 24. Januar 2019 12:12
> To: Doerr, Martin <martin.doerr at sap.com>; Gustavo Romero
> <gromero at linux.vnet.ibm.com>; 'hotspot-compiler-dev at openjdk.java.net'
> <hotspot-compiler-dev at openjdk.java.net>
> Cc: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
> Subject: Re: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
> 
> Gustavo, Martin,
> 
> I agree, that comment appears somewhat disconnected from the code.
> I'm really not sure if it will help a lot in the future to have a
> grep string that helps finding the related code in java.util.zip.CRC32.
> 
> In short: change it to something meaningful in the local context.
> 
> Thanks,
> Lutz
> 
> ?On 24.01.19, 11:02, "Doerr, Martin" <martin.doerr at sap.com> wrote:
> 
>     Hi Gustavo,
> 
>     thank you for reviewing and testing.
> 
>     Seems like many comments were taken from java.util.zip.CRC32. I guess it
> was intended to refer to it.
>     I think it's not bad to have it this way because it makes it easier to compare
> both implementations.
>     Maybe Lutz can comment on this and if he would like to keep it this way.
> 
>     Best regards,
>     Martin
> 
> 
>     -----Original Message-----
>     From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>     Sent: Mittwoch, 23. Januar 2019 23:18
>     To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>     Cc: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
>     Subject: Re: RFR(M): 8217459: [PPC64] Cleanup non-vector version of
> CRC32
> 
>     Hi Martin,
> 
>     On 01/21/2019 04:07 PM, Doerr, Martin wrote:
>     > PPC64 currently contains static tables for CRC32/CRC32C calculations. We
> only need some of them depending on Endianess and on whether vector
> instructions are available or not.
>     > We can get rid of quite some code when we generate these constants at
> startup as we already do for the vector version.
>     > In addition, we can save one register in the vector case because we can
> use one constants pointer for all related constants.
>     > Webrev:
>     >
> http://cr.openjdk.java.net/~mdoerr/8217459_ppc64_crc_consts/webrev.00/
> <http://cr.openjdk.java.net/%7Emdoerr/8217459_ppc64_crc_consts/webre
> v.00/>
> 
>     Thanks for the clean-up. Change looks good!
> 
>     It's good to see fold_8bit_crc32 and kernel_crc32_1byte going away (I just
>     noted them recently so I missed both in my previous clean-up). And also
>     the static table simplification.
> 
>     I tested the change with different array sizes and byte values with and
>     without vpmsum in the CPU, i.e. has_vpmsumb() = false, and found no
> issues.
> 
>     Only a nit: should we update the following comment and replace
> 'timesXtoThe32'
>     by something better, maybe 'table'? That name doesn't look much
> meaningful in the
>     current context and seems taken from the native code for
> java.util.zip.CRC32:
> 
>     3902 /**
>     3903  * uint32_t crc;
>     3904  * timesXtoThe32[crc & 0xFF] ^ (crc >> 8);
>     3905  */
>     3906 void MacroAssembler::fold_byte_crc32(Register crc, Register val,
> Register table, Register tmp) {
> 
> 
>     Best regards,
>     Gustavo
> 
> 
> 
> 


From rwestrel at redhat.com  Fri Jan 25 08:18:20 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Fri, 25 Jan 2019 09:18:20 +0100
Subject: [13] RFR (XS): 8191998: C2: inlining through MH linkers drops
 speculative part of argument types
In-Reply-To: <2c266be3-d068-86ad-a521-3682faa17043@oracle.com>
References: <2c266be3-d068-86ad-a521-3682faa17043@oracle.com>
Message-ID: <87sgxh40c3.fsf@redhat.com>


> http://cr.openjdk.java.net/~vlivanov/8191998/webrev.00/

That looks good to me.

Roland.

From rwestrel at redhat.com  Fri Jan 25 08:17:27 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Fri, 25 Jan 2019 09:17:27 +0100
Subject: [13] RFR (S): 8217760: C2: Missing symbolic info on a call from
 intrinsics when invoked through MethodHandle
In-Reply-To: <7b65363c-25cf-9153-8606-1618241ad50b@oracle.com>
References: <7b65363c-25cf-9153-8606-1618241ad50b@oracle.com>
Message-ID: <87va2d40dk.fsf@redhat.com>


> http://cr.openjdk.java.net/~vlivanov/8217760/webrev.00/

Looks good to me.

Roland.

From rwestrel at redhat.com  Fri Jan 25 08:21:46 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Fri, 25 Jan 2019 09:21:46 +0100
Subject: [13] RFR (S): 8192001: C2: inlining through dispatching MH
 linkers ignores speculative type of the receiver
In-Reply-To: <f40eb31e-b376-8e61-7cad-abad0ec97626@oracle.com>
References: <f40eb31e-b376-8e61-7cad-abad0ec97626@oracle.com>
Message-ID: <87pnsl406d.fsf@redhat.com>


> http://cr.openjdk.java.net/~vlivanov/8192001/webrev.00/

That looks good to me.

Roland.

From tobias.hartmann at oracle.com  Fri Jan 25 08:25:55 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Fri, 25 Jan 2019 09:25:55 +0100
Subject: [13] RFR (S): 8192001: C2: inlining through dispatching MH
 linkers ignores speculative type of the receiver
In-Reply-To: <f40eb31e-b376-8e61-7cad-abad0ec97626@oracle.com>
References: <f40eb31e-b376-8e61-7cad-abad0ec97626@oracle.com>
Message-ID: <f4f9af4a-a9e8-e0b1-3b6e-c0d17e7893e7@oracle.com>

Hi Vladimir,

this looks good to me too.

Best regards,
Tobias

On 25.01.19 02:56, Vladimir Ivanov wrote:
> http://cr.openjdk.java.net/~vlivanov/8192001/webrev.00/
> https://bugs.openjdk.java.net/browse/JDK-8192001
> 
> When inlining through MethodHandle calls, C2 can improve inlining decisions by taking speculative
> types into account (availability of type information is addressed by JDK-8191998 [1]).
> 
> There's no profiling performed at method handle linker call sites (MethodHandle::linkTo*), but type
> info can flow from other sources.
> 
> As an example, consider the following case:
> 
> ? class A?????????? { void m() { ... } }
> ? class B extends A { void m() { ... } }
> 
> ? MH = LOOKUP.findVirtual(A.class, "m", ...);
> 
> ? void test(A o) throws Throwable {
> ??? MH.invokeExact(o);
> ? }
> 
> ? test(new B());
> 
> Before (no inlining):
> 251?? 12?? !b??????? TestMH::test (21 bytes)
> ? ...
> ? @ 16?? TestMH1$A::m (1 bytes)?? virtual call
> 
> After (guarded inlining):
> 251?? 12?? !b??????? TestMH::test (21 bytes)
> ? ...
> ? @ 16?? TestMH1$B1::m (1 bytes)?? inline (hot)
> ???? \-> TypeProfile (-1/6701 counts) = TestMH1$B1
> 
> Testing: hs-precheckin-comp, hs-tier1, hs-tier2.
> 
> Best regards,
> Vladimir Ivanov
> 
> [1] https://bugs.openjdk.java.net/browse/JDK-8192001

From rwestrel at redhat.com  Fri Jan 25 08:35:37 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Fri, 25 Jan 2019 09:35:37 +0100
Subject: [13] RFR (S): 8059241: C2: Excessive RemoveUseless passes during
 incremental inlining
In-Reply-To: <00177ee9-7d3d-f3da-e2e8-1f9b76a2bdc8@oracle.com>
References: <00177ee9-7d3d-f3da-e2e8-1f9b76a2bdc8@oracle.com>
Message-ID: <87munp3zja.fsf@redhat.com>


> http://cr.openjdk.java.net/~vlivanov/8059241/webrev.03/

Isn't there a bug here:

2090   set_inlining_progress(false);
2091   set_do_cleanup(false);
2092   return inlining_progress() && !needs_cleanup;

Can inlining_progress() be anything but false at line 2092?

Roland.

From rwestrel at redhat.com  Fri Jan 25 08:36:43 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Fri, 25 Jan 2019 09:36:43 +0100
Subject: RFR(S)[12] : 8067250 : [mlvm]
 vm/mlvm/mixed/stress/regression/b6969574 fails and perf regression
In-Reply-To: <EEAF9BB5-8441-4E6E-B063-2C8FBFFABF49@oracle.com>
References: <EEAF9BB5-8441-4E6E-B063-2C8FBFFABF49@oracle.com>
Message-ID: <87k1it3zhg.fsf@redhat.com>


> http://cr.openjdk.java.net/~iignatyev//8067250/webrev.00/index.html

Looks good to me.

Roland.

From vladimir.x.ivanov at oracle.com  Fri Jan 25 08:50:19 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Fri, 25 Jan 2019 00:50:19 -0800
Subject: [13] RFR (S): 8059241: C2: Excessive RemoveUseless passes during
 incremental inlining
In-Reply-To: <87munp3zja.fsf@redhat.com>
References: <00177ee9-7d3d-f3da-e2e8-1f9b76a2bdc8@oracle.com>
 <87munp3zja.fsf@redhat.com>
Message-ID: <ac426ae9-0570-1781-3958-218d8599d381@oracle.com>


> 
>> http://cr.openjdk.java.net/~vlivanov/8059241/webrev.03/
> 
> Isn't there a bug here:
> 
> 2090   set_inlining_progress(false);
> 2091   set_do_cleanup(false);
> 2092   return inlining_progress() && !needs_cleanup;
> 
> Can inlining_progress() be anything but false at line 2092?

Yes, you are right. Updated webrev in-place.

diff --git a/src/hotspot/share/opto/compile.cpp 
b/src/hotspot/share/opto/compile.cpp
--- a/src/hotspot/share/opto/compile.cpp
+++ b/src/hotspot/share/opto/compile.cpp
@@ -2089,7 +2089,7 @@

    set_inlining_progress(false);
    set_do_cleanup(false);
-  return inlining_progress() && !needs_cleanup;
+  return (_late_inlines.length() > 0) && !needs_cleanup;
  }

Testing results are still valid because I extensively tested earlier 
version without that particular change, but resubmitted testing just in 
case.

Best regards,
Vladimir Ivanov

From martin.doerr at sap.com  Fri Jan 25 08:54:53 2019
From: martin.doerr at sap.com (Doerr, Martin)
Date: Fri, 25 Jan 2019 08:54:53 +0000
Subject: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
In-Reply-To: <ec71c98f21934e7ab1b2d5343c2bca6a@sap.com>
References: <AM6PR02MB47889B74BC2F81B06ACEC28C9A9F0@AM6PR02MB4788.eurprd02.prod.outlook.com>
 <243b17be-e1a3-7b68-1e72-9a114552860c@linux.vnet.ibm.com>
 <AM6PR02MB4788E79F0764D15A3376376D9A9A0@AM6PR02MB4788.eurprd02.prod.outlook.com>
 <51A38D8D-15B8-47FA-AF07-5F4F8D1E0C94@sap.com>
 <AM6PR02MB47888C89ECE27D7135121F179A9A0@AM6PR02MB4788.eurprd02.prod.outlook.com>
 <ec71c98f21934e7ab1b2d5343c2bca6a@sap.com>
Message-ID: <AM6PR02MB47885B0F9189292D1B5076819A9B0@AM6PR02MB4788.eurprd02.prod.outlook.com>

Hi G?tz and Gustavo,

thanks for the reviews. Pushed.

Best regards,
Martin


-----Original Message-----
From: Lindenmaier, Goetz 
Sent: Freitag, 25. Januar 2019 09:15
To: Doerr, Martin <martin.doerr at sap.com>; Schmidt, Lutz <lutz.schmidt at sap.com>; Gustavo Romero <gromero at linux.vnet.ibm.com>; 'hotspot-compiler-dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
Subject: RE: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32

Hi Martin,

The change looks good to me. 

Best regards,
  Goetz.

> -----Original Message-----
> From: Doerr, Martin
> Sent: Thursday, January 24, 2019 1:12 PM
> To: Schmidt, Lutz <lutz.schmidt at sap.com>; Gustavo Romero
> <gromero at linux.vnet.ibm.com>; 'hotspot-compiler-dev at openjdk.java.net'
> <hotspot-compiler-dev at openjdk.java.net>
> Cc: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
> Subject: RE: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
> 
> Hi Lutz and Gustavo,
> 
> that's fine. Removed the comments which refer to java.util.zip.CRC32 stuff.
> 
> And while reading through the comments, I found out that
> kernel_crc32_singleByte is not useful (since we have the ...Reg version). So I
> just removed it and replaced its only usage by better code
> (TemplateInterpreterGenerator::generate_CRC32_update_entry).
> 
> New webrev:
> http://cr.openjdk.java.net/~mdoerr/8217459_ppc64_crc_consts/webrev.01/
> 
> Best regards,
> Martin
> 
> 
> -----Original Message-----
> From: Schmidt, Lutz
> Sent: Donnerstag, 24. Januar 2019 12:12
> To: Doerr, Martin <martin.doerr at sap.com>; Gustavo Romero
> <gromero at linux.vnet.ibm.com>; 'hotspot-compiler-dev at openjdk.java.net'
> <hotspot-compiler-dev at openjdk.java.net>
> Cc: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
> Subject: Re: RFR(M): 8217459: [PPC64] Cleanup non-vector version of CRC32
> 
> Gustavo, Martin,
> 
> I agree, that comment appears somewhat disconnected from the code.
> I'm really not sure if it will help a lot in the future to have a
> grep string that helps finding the related code in java.util.zip.CRC32.
> 
> In short: change it to something meaningful in the local context.
> 
> Thanks,
> Lutz
> 
> ?On 24.01.19, 11:02, "Doerr, Martin" <martin.doerr at sap.com> wrote:
> 
>     Hi Gustavo,
> 
>     thank you for reviewing and testing.
> 
>     Seems like many comments were taken from java.util.zip.CRC32. I guess it
> was intended to refer to it.
>     I think it's not bad to have it this way because it makes it easier to compare
> both implementations.
>     Maybe Lutz can comment on this and if he would like to keep it this way.
> 
>     Best regards,
>     Martin
> 
> 
>     -----Original Message-----
>     From: Gustavo Romero <gromero at linux.vnet.ibm.com>
>     Sent: Mittwoch, 23. Januar 2019 23:18
>     To: Doerr, Martin <martin.doerr at sap.com>; 'hotspot-compiler-
> dev at openjdk.java.net' <hotspot-compiler-dev at openjdk.java.net>
>     Cc: Lindenmaier, Goetz <goetz.lindenmaier at sap.com>
>     Subject: Re: RFR(M): 8217459: [PPC64] Cleanup non-vector version of
> CRC32
> 
>     Hi Martin,
> 
>     On 01/21/2019 04:07 PM, Doerr, Martin wrote:
>     > PPC64 currently contains static tables for CRC32/CRC32C calculations. We
> only need some of them depending on Endianess and on whether vector
> instructions are available or not.
>     > We can get rid of quite some code when we generate these constants at
> startup as we already do for the vector version.
>     > In addition, we can save one register in the vector case because we can
> use one constants pointer for all related constants.
>     > Webrev:
>     >
> http://cr.openjdk.java.net/~mdoerr/8217459_ppc64_crc_consts/webrev.00/
> <http://cr.openjdk.java.net/%7Emdoerr/8217459_ppc64_crc_consts/webre
> v.00/>
> 
>     Thanks for the clean-up. Change looks good!
> 
>     It's good to see fold_8bit_crc32 and kernel_crc32_1byte going away (I just
>     noted them recently so I missed both in my previous clean-up). And also
>     the static table simplification.
> 
>     I tested the change with different array sizes and byte values with and
>     without vpmsum in the CPU, i.e. has_vpmsumb() = false, and found no
> issues.
> 
>     Only a nit: should we update the following comment and replace
> 'timesXtoThe32'
>     by something better, maybe 'table'? That name doesn't look much
> meaningful in the
>     current context and seems taken from the native code for
> java.util.zip.CRC32:
> 
>     3902 /**
>     3903  * uint32_t crc;
>     3904  * timesXtoThe32[crc & 0xFF] ^ (crc >> 8);
>     3905  */
>     3906 void MacroAssembler::fold_byte_crc32(Register crc, Register val,
> Register table, Register tmp) {
> 
> 
>     Best regards,
>     Gustavo
> 
> 
> 
> 


From rwestrel at redhat.com  Fri Jan 25 09:09:00 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Fri, 25 Jan 2019 10:09:00 +0100
Subject: [13] RFR (S): 8059241: C2: Excessive RemoveUseless passes during
 incremental inlining
In-Reply-To: <ac426ae9-0570-1781-3958-218d8599d381@oracle.com>
References: <00177ee9-7d3d-f3da-e2e8-1f9b76a2bdc8@oracle.com>
 <87munp3zja.fsf@redhat.com> <ac426ae9-0570-1781-3958-218d8599d381@oracle.com>
Message-ID: <87bm453xzn.fsf@redhat.com>


> diff --git a/src/hotspot/share/opto/compile.cpp 
> b/src/hotspot/share/opto/compile.cpp
> --- a/src/hotspot/share/opto/compile.cpp
> +++ b/src/hotspot/share/opto/compile.cpp
> @@ -2089,7 +2089,7 @@
>
>     set_inlining_progress(false);
>     set_do_cleanup(false);
> -  return inlining_progress() && !needs_cleanup;
> +  return (_late_inlines.length() > 0) && !needs_cleanup;
>   }
>

That looks good to me.

Roland.

From forax at univ-mlv.fr  Fri Jan 25 09:28:50 2019
From: forax at univ-mlv.fr (Remi Forax)
Date: Fri, 25 Jan 2019 10:28:50 +0100 (CET)
Subject: [13] RFR (S): 8192001: C2: inlining through dispatching MH
 linkers ignores speculative type of the receiver
In-Reply-To: <f40eb31e-b376-8e61-7cad-abad0ec97626@oracle.com>
References: <f40eb31e-b376-8e61-7cad-abad0ec97626@oracle.com>
Message-ID: <2129244478.1151671.1548408530315.JavaMail.zimbra@u-pem.fr>

Hi Vladimir,
thanks for fixing that,
it was a blocker in my attempt to implement a Stream like API using method handles :)

R?mi 

----- Mail original -----
> De: "Vladimir Ivanov" <vladimir.x.ivanov at oracle.com>
> ?: "hotspot compiler" <hotspot-compiler-dev at openjdk.java.net>
> Envoy?: Vendredi 25 Janvier 2019 02:56:39
> Objet: [13] RFR (S): 8192001: C2: inlining through dispatching MH linkers ignores speculative type of the receiver

> http://cr.openjdk.java.net/~vlivanov/8192001/webrev.00/
> https://bugs.openjdk.java.net/browse/JDK-8192001
> 
> When inlining through MethodHandle calls, C2 can improve inlining
> decisions by taking speculative types into account (availability of type
> information is addressed by JDK-8191998 [1]).
> 
> There's no profiling performed at method handle linker call sites
> (MethodHandle::linkTo*), but type info can flow from other sources.
> 
> As an example, consider the following case:
> 
>   class A           { void m() { ... } }
>   class B extends A { void m() { ... } }
> 
>   MH = LOOKUP.findVirtual(A.class, "m", ...);
> 
>   void test(A o) throws Throwable {
>     MH.invokeExact(o);
>   }
> 
>   test(new B());
> 
> Before (no inlining):
> 251   12   !b        TestMH::test (21 bytes)
>   ...
>   @ 16   TestMH1$A::m (1 bytes)   virtual call
> 
> After (guarded inlining):
> 251   12   !b        TestMH::test (21 bytes)
>   ...
>   @ 16   TestMH1$B1::m (1 bytes)   inline (hot)
>      \-> TypeProfile (-1/6701 counts) = TestMH1$B1
> 
> Testing: hs-precheckin-comp, hs-tier1, hs-tier2.
> 
> Best regards,
> Vladimir Ivanov
> 
> [1] https://bugs.openjdk.java.net/browse/JDK-8192001

From aph at redhat.com  Fri Jan 25 09:33:51 2019
From: aph at redhat.com (Andrew Haley)
Date: Fri, 25 Jan 2019 09:33:51 +0000
Subject: RFR(S): 8215483: Off heap memory accesses should be vectorized
In-Reply-To: <18b6c29f-3b1d-7970-c14c-e020f8d86c98@oracle.com>
References: <877eg6gaqk.fsf@redhat.com> <878sza75n3.fsf@redhat.com>
 <e3ab3fc6-e425-944c-a450-024038349e49@oracle.com>
 <18b6c29f-3b1d-7970-c14c-e020f8d86c98@oracle.com>
Message-ID: <ad7e2dad-fc24-30a3-cc2f-1504b6151ba5@redhat.com>

On 1/24/19 5:50 PM, Vladimir Kozlov wrote:

> Looks good to me too. But it would be nice to have changes
> explanation in RFE. Why it helps vectorize off heap memory accesses?

I don't quite understand what you're asking here, but I guess it's
about what applications need vertorized MappedByteBuffers. Sometimes
people use allocateDirect to get a chunk of memory rather than simply
allocate, and it would be surprising that their programs ran more
slowly as a result. Apache Lucene, for example, is a large-scale user
of ByteBuffers, and they need all the performance they can get.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From rwestrel at redhat.com  Fri Jan 25 09:48:18 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Fri, 25 Jan 2019 10:48:18 +0100
Subject: RFR(S): 8215483: Off heap memory accesses should be vectorized
In-Reply-To: <18b6c29f-3b1d-7970-c14c-e020f8d86c98@oracle.com>
References: <877eg6gaqk.fsf@redhat.com> <878sza75n3.fsf@redhat.com>
 <e3ab3fc6-e425-944c-a450-024038349e49@oracle.com>
 <18b6c29f-3b1d-7970-c14c-e020f8d86c98@oracle.com>
Message-ID: <878sz93w65.fsf@redhat.com>


Thanks for the review.

> But it would be nice to have changes explanation in RFE.

I copy-pasted the content of the RFR email as a comment in the CR. Is
that good enough?

Roland.

From rwestrel at redhat.com  Fri Jan 25 09:48:37 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Fri, 25 Jan 2019 10:48:37 +0100
Subject: RFR(S): 8215483: Off heap memory accesses should be vectorized
In-Reply-To: <e3ab3fc6-e425-944c-a450-024038349e49@oracle.com>
References: <877eg6gaqk.fsf@redhat.com> <878sza75n3.fsf@redhat.com>
 <e3ab3fc6-e425-944c-a450-024038349e49@oracle.com>
Message-ID: <875zud3w5m.fsf@redhat.com>


Thanks for the review, Nils.

Roland.

From shade at redhat.com  Fri Jan 25 12:17:19 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Fri, 25 Jan 2019 13:17:19 +0100
Subject: [13] RFR (S): 8059241: C2: Excessive RemoveUseless passes during
 incremental inlining
In-Reply-To: <00177ee9-7d3d-f3da-e2e8-1f9b76a2bdc8@oracle.com>
References: <00177ee9-7d3d-f3da-e2e8-1f9b76a2bdc8@oracle.com>
Message-ID: <753e605c-d1c2-da73-a54f-1db15c5fc253@redhat.com>

On 1/25/19 4:18 AM, Vladimir Ivanov wrote:
> http://cr.openjdk.java.net/~vlivanov/8059241/webrev.03/
> 
> I ran Octane w/ -XX:+CITime and observed significant reduction in "Incremental Inline" times (~50%
> off on IGVN, 80-95% off on "Prune Useless").

Yes, I see "Prune Useless" going down with -XX:+CITime with this patch. However, I suspect there are
performance regressions, and I am not able to verify them until this one is fixed:
  https://bugs.openjdk.java.net/browse/JDK-8217782

-Aleksey

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190125/9040002e/signature.asc>

From claes.redestad at oracle.com  Fri Jan 25 12:51:58 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Fri, 25 Jan 2019 13:51:58 +0100
Subject: RFR(T): 8217782: Spill detection broken after JDK-8217716
Message-ID: <d37fcfd1-b45a-c74e-168e-bac945e168a1@oracle.com>

Hi,

in a recent code cleanup I accidentally removed setting of
lrg._was_spilled1/2, missing a few cases where they were actually in
use. This cause a severe performance degradation in some benchmarks like
Octane Richards.

Restore calculation of lrg._was_spilled1/2.

Bug:    https://bugs.openjdk.java.net/browse/JDK-8217716
Webrev: http://cr.openjdk.java.net/~redestad/8217782/open.00/

Testing: verified locally that performance in Octane Richards is back to
normal levels, running tier1+2

Thanks!

/Claes


From tobias.hartmann at oracle.com  Fri Jan 25 12:54:43 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Fri, 25 Jan 2019 13:54:43 +0100
Subject: RFR(T): 8217782: Spill detection broken after JDK-8217716
In-Reply-To: <d37fcfd1-b45a-c74e-168e-bac945e168a1@oracle.com>
References: <d37fcfd1-b45a-c74e-168e-bac945e168a1@oracle.com>
Message-ID: <7ef18d46-fafd-ad4f-bde4-f2f05c2c1609@oracle.com>

Hi Claes,

looks good.

Best regards,
Tobias

On 25.01.19 13:51, Claes Redestad wrote:
> Hi,
> 
> in a recent code cleanup I accidentally removed setting of
> lrg._was_spilled1/2, missing a few cases where they were actually in
> use. This cause a severe performance degradation in some benchmarks like
> Octane Richards.
> 
> Restore calculation of lrg._was_spilled1/2.
> 
> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217716
> Webrev: http://cr.openjdk.java.net/~redestad/8217782/open.00/
> 
> Testing: verified locally that performance in Octane Richards is back to
> normal levels, running tier1+2
> 
> Thanks!
> 
> /Claes
> 

From claes.redestad at oracle.com  Fri Jan 25 13:02:27 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Fri, 25 Jan 2019 14:02:27 +0100
Subject: RFR(T): 8217782: Spill detection broken after JDK-8217716
In-Reply-To: <7ef18d46-fafd-ad4f-bde4-f2f05c2c1609@oracle.com>
References: <d37fcfd1-b45a-c74e-168e-bac945e168a1@oracle.com>
 <7ef18d46-fafd-ad4f-bde4-f2f05c2c1609@oracle.com>
Message-ID: <03f192a2-889d-72dc-9950-723b9544e8d1@oracle.com>


On 2019-01-25 13:54, Tobias Hartmann wrote:
> Hi Claes,
> 
> looks good.

Thanks, Tobias!

/Claes

From shade at redhat.com  Fri Jan 25 13:05:32 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Fri, 25 Jan 2019 14:05:32 +0100
Subject: RFR(T): 8217782: Spill detection broken after JDK-8217716
In-Reply-To: <d37fcfd1-b45a-c74e-168e-bac945e168a1@oracle.com>
References: <d37fcfd1-b45a-c74e-168e-bac945e168a1@oracle.com>
Message-ID: <cf95b953-31dc-e933-b816-3fc9bf071fd4@redhat.com>

On 1/25/19 1:51 PM, Claes Redestad wrote:
> Restore calculation of lrg._was_spilled1/2.

D'oh.

> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217716
> Webrev: http://cr.openjdk.java.net/~redestad/8217782/open.00/

Looks good. The rest of Octane seems to be improving back too.

-Aleksey


-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190125/fd5ed38e/signature-0001.asc>

From claes.redestad at oracle.com  Fri Jan 25 13:09:33 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Fri, 25 Jan 2019 14:09:33 +0100
Subject: RFR(T): 8217782: Spill detection broken after JDK-8217716
In-Reply-To: <cf95b953-31dc-e933-b816-3fc9bf071fd4@redhat.com>
References: <d37fcfd1-b45a-c74e-168e-bac945e168a1@oracle.com>
 <cf95b953-31dc-e933-b816-3fc9bf071fd4@redhat.com>
Message-ID: <11aaeed2-ac61-b9cc-3956-62a8c5c68056@oracle.com>


On 2019-01-25 14:05, Aleksey Shipilev wrote:
>> Bug:https://bugs.openjdk.java.net/browse/JDK-8217716
>> Webrev:http://cr.openjdk.java.net/~redestad/8217782/open.00/
> Looks good. The rest of Octane seems to be improving back too.

Thanks for reviewing and verifying!

I'll give functional testing a few more cycles before push.

/Claes

From xxinliu at amazon.com  Fri Jan 25 08:08:23 2019
From: xxinliu at amazon.com (Liu, Xin)
Date: Fri, 25 Jan 2019 08:08:23 +0000
Subject: Why does call_site_target keep changing for a Nashorn method?
In-Reply-To: <30a97290-71c5-c445-cfaf-f8eda14fdfba@oracle.com>
References: <616C8E42-4B18-405B-B28A-C9F062EC9B6C@amazon.com>
 <186d49e7-daa9-a0db-b0c6-1b9d4ff2adda@oracle.com>
 <837C4B07-9A3F-4459-A625-12F82C9E604F@amazon.com>
 <30a97290-71c5-c445-cfaf-f8eda14fdfba@oracle.com>
Message-ID: <0B7E6CB4-0953-4C48-8F36-823D43E354D0@amazon.com>

Hello, Vladimir, 

What's about this change? It can pass hotspot-tier1 tests. 
From my  observation, my application sticks with C1 because it's MDO::_invocation_counter can't increment. 
 
diff -r d02f1f4ff3a6 src/hotspot/share/ci/ciEnv.cpp
--- a/src/hotspot/share/ci/ciEnv.cpp	Thu Jan 24 14:22:50 2019 -0800
+++ b/src/hotspot/share/ci/ciEnv.cpp	Thu Jan 24 23:35:52 2019 -0800
@@ -939,6 +939,11 @@
   if (result != Dependencies::end_marker) {
     if (result == Dependencies::call_site_target_value) {
       _inc_decompile_count_on_failure = false;
+
+      MethodData* mdo = target->get_Method()->method_data();
+      if (mdo != NULL) {
+        mdo->invocation_counter()->decay();
+      }
       record_failure("call site target change");
     } else if (Dependencies::is_klass_type(result)) {
       record_failure("concurrent class loading");


I am not sure if I should update ciMethodData::_invocation_counter as well. There's no mutator function for it. 
It looks that nobody consumes ciMethodDate::_invocation_counter. 

I believe I can shake off the unstable dynamic functions in inliner.  Do you think it can solve  JDK-8147550?
In Dependencies::check_call_site_target_value, we have the exact callsite oop.
 
We can add a new node type called 'DynamicCallData" in MethodData.hpp. Each dynamic call node  attaches a profiling node. It contains information like "call site target changes".
Inliner uses the profiling data to determine inline or not.

Btw, this patch in 2011 seems to abort inlining based on the profiling data of callsite.  my idea is same.  
http://cr.openjdk.java.net/~twisti/7087838/src/share/vm/opto/callGenerator.cpp.udiff.html

thanks,
--lx

 
?On 1/18/19, 5:06 PM, "Vladimir Ivanov" <vladimir.x.ivanov at oracle.com> wrote:

    
    > Thank you for the response. After reading your email and associated RFEs,  now I got the background story.
    > I understand the design decision in hotspot.
    > 
    > In my case, compiler thread crowds out the app thread because we run application in docker with 1 CPU.
    > Is it good idea that we decay the invocation counts of the methods if they fail due to 'call_site_target value change?'
    
    Yes, sounds reasonable. I believe compilation bailed out due to 
    invalidated call_site_target dependency should be treated as if it were 
    a deoptimization with Action_reinterpret, but resetting invocation 
    counts may be too much. So, decaying counters instead sounds reasonable.
    
    Also, it's hard to tell what method to act on: problematic CallSite may 
    be located somewhere deep in inline tree, but only root method is known.
    
    Best regards,
    Vladimir Ivanov
    
    > On 1/17/19, 2:36 PM, "Vladimir Ivanov" <vladimir.x.ivanov at oracle.com> wrote:
    > 
    >      C1/C2 optimistically inline through CallSite instances even if those are
    >      mutable (MutableCallSite/VolatileCallSite). It requires a nmethod
    >      dependency and once CallSite target changes, all dependent nmethods
    >      should be invalidated. If such change happens during compilation,
    >      nmethod installation fails.
    >      
    >      That's exactly what you observe: the dependency is recorded during
    >      inlining, but failed verification during installation.
    >      
    >      Regarding the observed behavior, it is well-known [1] [2] and was a
    >      deliberate choice. As JDK-7087838 [1] states:
    >      
    >      "The consensus among language runtime implementors is that they want
    >      control over switch points (and thus call sites) and so it's their
    >      responsibility to handle extensive invalidation of such."
    >      
    >      So, such pathological behavior is treated as a bug in user code (Nashorn
    >      in this particular case).
    >      
    >      There's an RFE filed [3] to consider alternative options for unstable
    >      calls.
    >      
    >      Best regards,
    >      Vladimir Ivanov
    >      
    >      [1] https://bugs.openjdk.java.net/browse/JDK-7087838
    >      [2] https://bugs.openjdk.java.net/browse/JDK-7177745
    >      [3] https://bugs.openjdk.java.net/browse/JDK-8147550
    >      
    >      On 16/01/2019 14:04, Liu, Xin wrote:
    >      > In one of our applications, C1/C2 keeps compiling a Javascript method
    >      > generated by Nashorn but the code fails a dependency check right before
    >      > installing in the code cache. This is with JDK tip.
    >      >
    >      > It can?t pass ?Dependencies::check_call_site_target_value?.
    >      >
    >      > [C2 Parsing]
    >      >
    >      > <bc code='182' bci='1'/>
    >      >
    >      > <dependency type='unique_concrete_method' ctxk='1131' x='1141'/>
    >      >
    >      > <call method='1141' count='878838' prof_factor='0.024122' inline='1'/>
    >      >
    >      > <inline_success reason='accessor'/>
    >      >
    >      > <parse method='1141' uses='21249.000000' stamp='1112.538'>
    >      >
    >      > <bc code='180' bci='1'/>
    >      >
    >      > <unknown id='1556'/>
    >      >
    >      > <unknown id='1866'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1556' x='1866'/>
    >      >
    >      > <parse_done nodes='41636' live='12226' memory='12130984' stamp='1112.538'/>
    >      >
    >      > </parse>
    >      >
    >      > [Validating compilation dependencies]
    >      >
    >      > <dependency type='call_site_target_value' x0='1132' x='1143'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1334' x='1337'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1424' x='1425'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1437' x='1438'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1454' x='1455'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1465' x='1466'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1482' x='1483'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1498' x='1499'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1509' x='1510'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1526' x='1576'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1528' x='1667'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1536' x='1692'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1537' x='1707'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1538' x='1730'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1539' x='1746'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1540' x='1787'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1550' x='1804'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1553' x='1820'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1554' x='1836'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1555' x='1849'/>
    >      >
    >      > <dependency type='call_site_target_value' x0='1556' x='1866'/>
    >      >
    >      > <dependency_failed type='call_site_target_value' x0='1556' x='1866'
    >      > witness='jdk/nashorn/internal/runtime/linker/LinkerCallSite'
    >      > stamp='1113.578'/>
    >      >
    >      > It?s related to the GWT methodHandle.  The 2 mismatched methodhandles
    >      > are very similar except for argL3, which is an int[2].
    >      >
    >      > Even though arg0-2 are not identical objects, their contents are same.
    >      >
    >      > (gdb)call java_lang_invoke_CallSite::target(call_site)->print()
    >      >
    >      > java.lang.invoke.BoundMethodHandle$Species_LLLL
    >      >
    >      > {0x00000000f586ca98}-
    >      > klass:'java/lang/invoke/BoundMethodHandle$Species_LLLL'
    >      >
    >      > - ---- fields(total size 6 words):
    >      >
    >      > -'customizationCount''B'@12 0
    >      >
    >      > - private final'type''Ljava/lang/invoke/MethodType;'@16
    >      > a'java/lang/invoke/MethodType'{0x00000000e21e2878}=(Ljava/lang/Object;Ljdk/nashorn/internal/runtime/Undefined;Ljava/lang/Object;)Ljava/lang/Object;(e21e2878)
    >      >
    >      > - final'form''Ljava/lang/invoke/LambdaForm;'@20
    >      > a'java/lang/invoke/LambdaForm'{0x00000000e1e4a670}=>a'java/lang/invoke/MemberName'{0x00000000e1e4a938}={method}{0x00007fffa512cb68}'guard''(Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;)Ljava/lang/Object;'in'java/lang/invoke/LambdaForm$MH'(e1e4a670)
    >      >
    >      > -'asTypeCache''Ljava/lang/invoke/MethodHandle;'@24 NULL(0)
    >      >
    >      > - final'argL0''Ljava/lang/Object;'@28
    >      > a'java/lang/invoke/BoundMethodHandle$Species_LL'{0x00000000f586c9e8}(f586c9e8)
    >      >
    >      > - final'argL1''Ljava/lang/Object;'@32
    >      > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f586ca28}(f586ca28)
    >      >
    >      > - final'argL2''Ljava/lang/Object;'@36
    >      > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f586ca60}(f586ca60)
    >      >
    >      > - final'argL3''Ljava/lang/Object;'@40 [I{0x00000000f586ca10}(f586ca10)
    >      >
    >      > (gdb)call method_handle->print()
    >      >
    >      > java.lang.invoke.BoundMethodHandle$Species_LLLL
    >      >
    >      > {0x00000000f6b18500}-
    >      > klass:'java/lang/invoke/BoundMethodHandle$Species_LLLL'
    >      >
    >      > - ---- fields(total size 6 words):
    >      >
    >      > -'customizationCount''B'@12 0
    >      >
    >      > - private final'type''Ljava/lang/invoke/MethodType;'@16
    >      > a'java/lang/invoke/MethodType'{0x00000000e21e2878}=(Ljava/lang/Object;Ljdk/nashorn/internal/runtime/Undefined;Ljava/lang/Object;)Ljava/lang/Object;(e21e2878)
    >      >
    >      > - final'form''Ljava/lang/invoke/LambdaForm;'@20
    >      > a'java/lang/invoke/LambdaForm'{0x00000000e1e4a670}=>a'java/lang/invoke/MemberName'{0x00000000e1e4a938}={method}{0x00007fffa512cb68}'guard''(Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;)Ljava/lang/Object;'in'java/lang/invoke/LambdaForm$MH'(e1e4a670)
    >      >
    >      > -'asTypeCache''Ljava/lang/invoke/MethodHandle;'@24 NULL(0)
    >      >
    >      > - final'argL0''Ljava/lang/Object;'@28
    >      > a'java/lang/invoke/BoundMethodHandle$Species_LL'{0x00000000f6b18450}(f6b18450)
    >      >
    >      > - final'argL1''Ljava/lang/Object;'@32
    >      > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f6b18490}(f6b18490)
    >      >
    >      > - final'argL2''Ljava/lang/Object;'@36
    >      > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f6b184c8}(f6b184c8)
    >      >
    >      > - final'argL3''Ljava/lang/Object;'@40 [I{0x00000000f6b18478}(f6b18478)
    >      >
    >      > My guess is argL3 is counters in Java.lang.invoke.MethodHandleImpl.
    >      >
    >      > // Intrinsified by C2. Counters are used during parsing to calculate
    >      > branch frequencies.
    >      > @LambdaForm.Hidden
    >      > @jdk.internal.HotSpotIntrinsicCandidate
    >      > static
    >      > boolean profileBoolean(boolean result, int[] counters) {
    >      > // Profile is int[2] where [0] and [1] correspond to false and true
    >      > occurrences respectively.
    >      > int idx = result ? 1 : 0;
    >      >      try {
    >      >          counters[idx] = Math./addExact/(counters[idx], 1);
    >      > } catch (ArithmeticException e) {
    >      > // Avoid continuous overflow by halving the problematic count.
    >      > counters[idx] = counters[idx] / 2;
    >      > }
    >      > return result;
    >      > }
    >      >
    >      > I am still struggling to understand the source code in
    >      > java.lang.invoke.*.  Could anybody enlighten me why the target of the
    >      > callsite changes every time here?  it is relative to this profiling thing?
    >      >
    >      > In validation log, it has validated the dep ?dependency
    >      > type='call_site_target_value' x0='1556' x='1866'? above. Why it can?t
    >      > pass it after then? My guess is one MH object has been changed by
    >      > another Java thread.
    >      >
    >      > One interesting fact that compiler thread can?t pass 22^th dep.  My
    >      > tuition is it goes over an unknown threshold.
    >      >
    >      > The 2nd question is about ciEnv:: validate_compile_task_dependencies.
    >      >   Why does failure of call_site_target_value_changed not count as a deopt?
    >      >
    >      > The flag  _inc_decompile_count_on_failure =false stops MDO to mark this
    >      > method ?not_compileable?.  C2 doesn?t set the flag, so C2 ends up
    >      > compiling it over and over, which makes C2 a cpu hog. Here?s the code in
    >      > validate_compile_task_dependencies
    >      >
    >      >    bool counter_changed = system_dictionary_modification_counter_changed();
    >      >
    >      >    Dependencies::DepType result =
    >      > dependencies()->validate_dependencies(_task, counter_changed);
    >      >
    >      >    if (result != Dependencies::end_marker) {
    >      >
    >      >      if (result == Dependencies::call_site_target_value) {
    >      >
    >      >        _inc_decompile_count_on_failure = false;
    >      >
    >      >        record_failure("call site target change");
    >      >
    >      > Maybe the right thing to do is to count this as a deopt and change the
    >      > deopt limit computation to take into account the size of the method in
    >      > nodes, just as done for abandoning compilation if the graph is too big.
    >      >
    >      > Thanks,
    >      >
    >      > --lx
    >      >
    >      
    > 
    

From igor.ignatyev at oracle.com  Fri Jan 25 16:53:24 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Fri, 25 Jan 2019 08:53:24 -0800
Subject: RFR(S)[12] : 8067250 : [mlvm]
 vm/mlvm/mixed/stress/regression/b6969574 fails and perf regression
In-Reply-To: <87k1it3zhg.fsf@redhat.com>
References: <EEAF9BB5-8441-4E6E-B063-2C8FBFFABF49@oracle.com>
 <87k1it3zhg.fsf@redhat.com>
Message-ID: <030E98B0-05C8-4F07-954E-55C467C32AF3@oracle.com>

thanks Roland.

-- Igor

> On Jan 25, 2019, at 12:36 AM, Roland Westrelin <rwestrel at redhat.com> wrote:
> 
> 
>> http://cr.openjdk.java.net/~iignatyev//8067250/webrev.00/index.html
> 
> Looks good to me.
> 
> Roland.


From vladimir.kozlov at oracle.com  Fri Jan 25 18:21:02 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Fri, 25 Jan 2019 10:21:02 -0800
Subject: [13] RFR (XS): 8191998: C2: inlining through MH linkers drops
 speculative part of argument types
In-Reply-To: <b31aa68a-06d9-8449-6a89-30b4ae2e7a98@oracle.com>
References: <2c266be3-d068-86ad-a521-3682faa17043@oracle.com>
 <b31aa68a-06d9-8449-6a89-30b4ae2e7a98@oracle.com>
Message-ID: <317ddb36-f2af-2141-62a7-9f2bdc83f500@oracle.com>

+1

Vladimir K

On 1/25/19 12:13 AM, Tobias Hartmann wrote:
> Hi Vladimir,
> 
> looks good.
> 
> Best regards,
> Tobias
> 
> On 25.01.19 02:34, Vladimir Ivanov wrote:
>> http://cr.openjdk.java.net/~vlivanov/8191998/webrev.00/
>> https://bugs.openjdk.java.net/browse/JDK-8191998
>>
>> CallGenerator::for_method_handle_inline() casts MH linker (MH::linkTo*) arguments before attempting
>> inlining. If any argument has a speculative type attached, it is lost and can't be used later.
>>
>> The patch preserves speculative part while sharpening the type (if needed) based on static
>> information from the MemberName instance.
>>
>> Testing: hs-precheckin-comp, hs-tier1, hs-tier2.
>>
>> Best regards,
>> Vladimir Ivanov

From vladimir.kozlov at oracle.com  Fri Jan 25 18:24:42 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Fri, 25 Jan 2019 10:24:42 -0800
Subject: RFR(S): 8215483: Off heap memory accesses should be vectorized
In-Reply-To: <878sz93w65.fsf@redhat.com>
References: <877eg6gaqk.fsf@redhat.com> <878sza75n3.fsf@redhat.com>
 <e3ab3fc6-e425-944c-a450-024038349e49@oracle.com>
 <18b6c29f-3b1d-7970-c14c-e020f8d86c98@oracle.com> <878sz93w65.fsf@redhat.com>
Message-ID: <07f57bf4-2320-47ba-197e-4c934f4ae645@oracle.com>

Thank you, Roland

Yes, this information is good.

Vladimir

On 1/25/19 1:48 AM, Roland Westrelin wrote:
> 
> Thanks for the review.
> 
>> But it would be nice to have changes explanation in RFE.
> 
> I copy-pasted the content of the RFR email as a comment in the CR. Is
> that good enough?
> 
> Roland.
> 

From vladimir.kozlov at oracle.com  Fri Jan 25 18:29:16 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Fri, 25 Jan 2019 10:29:16 -0800
Subject: RFR(S): 8215483: Off heap memory accesses should be vectorized
In-Reply-To: <ad7e2dad-fc24-30a3-cc2f-1504b6151ba5@redhat.com>
References: <877eg6gaqk.fsf@redhat.com> <878sza75n3.fsf@redhat.com>
 <e3ab3fc6-e425-944c-a450-024038349e49@oracle.com>
 <18b6c29f-3b1d-7970-c14c-e020f8d86c98@oracle.com>
 <ad7e2dad-fc24-30a3-cc2f-1504b6151ba5@redhat.com>
Message-ID: <a6c83559-9a41-2d32-5c21-950c5cb577ba@oracle.com>

Hi Andrew,

I missed original RFR e-mail and asked to add explanation for code change in RFE which had only code sample before and 
very short statement. Roland added that information now and I am satisfied.
Yes, we should vectorize such cases.

Thanks,
Vladimir

On 1/25/19 1:33 AM, Andrew Haley wrote:
> On 1/24/19 5:50 PM, Vladimir Kozlov wrote:
> 
>> Looks good to me too. But it would be nice to have changes
>> explanation in RFE. Why it helps vectorize off heap memory accesses?
> 
> I don't quite understand what you're asking here, but I guess it's
> about what applications need vertorized MappedByteBuffers. Sometimes
> people use allocateDirect to get a chunk of memory rather than simply
> allocate, and it would be surprising that their programs ran more
> slowly as a result. Apache Lucene, for example, is a large-scale user
> of ByteBuffers, and they need all the performance they can get.
> 

From igor.veresov at oracle.com  Fri Jan 25 20:06:06 2019
From: igor.veresov at oracle.com (Igor Veresov)
Date: Fri, 25 Jan 2019 12:06:06 -0800
Subject: [12] RFR(T) 8217828: Un-ProblemList LongMulOverflowTest.java
Message-ID: <F08B53F1-24BA-42E6-A136-0AE4676FF11C@oracle.com>

Per Igor Ignatiev a change to a test does not require approval. So, I just need a review. Thanks!

Webrev: http://cr.openjdk.java.net/~iveresov/8217828/webrev.00/ <http://cr.openjdk.java.net/~iveresov/8217828/webrev.00/>

igor


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190125/38702ad8/attachment.html>

From igor.ignatyev at oracle.com  Fri Jan 25 20:44:55 2019
From: igor.ignatyev at oracle.com (Igor Ignatev)
Date: Fri, 25 Jan 2019 12:44:55 -0800
Subject: [12] RFR(T) 8217828: Un-ProblemList LongMulOverflowTest.java
In-Reply-To: <F08B53F1-24BA-42E6-A136-0AE4676FF11C@oracle.com>
References: <F08B53F1-24BA-42E6-A136-0AE4676FF11C@oracle.com>
Message-ID: <824B62EC-36BE-49CE-9CD5-A8D7C65D6626@oracle.com>

Looks good and trivial. 

? Igor

> On Jan 25, 2019, at 12:06 PM, Igor Veresov <igor.veresov at oracle.com> wrote:
> 
> Per Igor Ignatiev a change to a test does not require approval. So, I just need a review. Thanks!
> 
> Webrev: http://cr.openjdk.java.net/~iveresov/8217828/webrev.00/
> 
> igor
> 
> 
> 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190125/d435519c/attachment.html>

From vladimir.x.ivanov at oracle.com  Fri Jan 25 21:27:40 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Fri, 25 Jan 2019 13:27:40 -0800
Subject: [13] RFR (M): 6986483: CHA: optimize calls through interfaces
Message-ID: <77261b19-d2d2-683c-dde4-6f261e2b78a0@oracle.com>

http://cr.openjdk.java.net/~vlivanov/6986483/webrev.01/
https://bugs.openjdk.java.net/browse/JDK-6986483

Another candidate for revival. At that time it was reviewed, but 
integration was blocked pending another bug fix. Now the fix is in.

Quote from original review request [1]:

"Proposed change adds CHA support in C2 for interface calls.

Consider the following hierarchy:

    interface Intf { m(); }
    class C implements Intf { public m() { ... } }
    class C1 extends C { /* doesn't override m() */ }
    ...
    class Cn extends C { /* doesn't override m() */ }

Call site: invokeinterface Intf.m() ...

If Intf were an abstract class, CHA could deduce that Intf::m() can be
replaced with C::m(), but it doesn't work for interfaces. Verifier
doesn't check interface types in bytecode, so CHA can't assume the
receiver implements Intf.

CHA in C1 handles such call sites for interfaces with a single
implementor. It replaces invokeinterface Intf.m() with invokevirtual
C.m() guarded by a subtype check (instanceof C). C2 doesn't do that and
this request is about adding that. Type profiling doesn't help here (the
call site is usually megamorphic), so C2 can't inline it.

The proposed implementation is similar to C1, except that the code
deoptimizes when subtype check fails and ICCE is thrown from the
interpreter.

While working on it, I spotted and fixed a couple of inefficiencies in
C1 implementation:

    (1) dependency context being used was broader than necessary -
resolved instead of declared interface (hence, possibility of
unnecessary invalidations);

    (2) didn't work for interfaces w/ any default methods: CHA doesn't
support default methods at the moment, so what matters is whether
Intf::m() is default or not and not whether Intf has *any* concrete 
methods."


Testing: hs-precheckin-comp, hs-tier1, hs-tier2

Best regards,
Vladimir Ivanov

[1] 
https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2017-February/025630.html

From vladimir.x.ivanov at oracle.com  Fri Jan 25 21:33:34 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Fri, 25 Jan 2019 13:33:34 -0800
Subject: [13] RFR (S): 8192001: C2: inlining through dispatching MH
 linkers ignores speculative type of the receiver
In-Reply-To: <2129244478.1151671.1548408530315.JavaMail.zimbra@u-pem.fr>
References: <f40eb31e-b376-8e61-7cad-abad0ec97626@oracle.com>
 <2129244478.1151671.1548408530315.JavaMail.zimbra@u-pem.fr>
Message-ID: <e3b5ff9b-b475-aadf-78b6-6d3e4ba17141@oracle.com>

Thanks for reviews, Roland & Tobias.

On 25/01/2019 01:28, Remi Forax wrote:
> thanks for fixing that,
> it was a blocker in my attempt to implement a Stream like API using method handles :)
Let me know about your experience. Completely forgot where the request 
came from :-)

Best regards,
Vladimir Ivanov

> ----- Mail original -----
>> De: "Vladimir Ivanov" <vladimir.x.ivanov at oracle.com>
>> ?: "hotspot compiler" <hotspot-compiler-dev at openjdk.java.net>
>> Envoy?: Vendredi 25 Janvier 2019 02:56:39
>> Objet: [13] RFR (S): 8192001: C2: inlining through dispatching MH linkers ignores speculative type of the receiver
> 
>> http://cr.openjdk.java.net/~vlivanov/8192001/webrev.00/
>> https://bugs.openjdk.java.net/browse/JDK-8192001
>>
>> When inlining through MethodHandle calls, C2 can improve inlining
>> decisions by taking speculative types into account (availability of type
>> information is addressed by JDK-8191998 [1]).
>>
>> There's no profiling performed at method handle linker call sites
>> (MethodHandle::linkTo*), but type info can flow from other sources.
>>
>> As an example, consider the following case:
>>
>>    class A           { void m() { ... } }
>>    class B extends A { void m() { ... } }
>>
>>    MH = LOOKUP.findVirtual(A.class, "m", ...);
>>
>>    void test(A o) throws Throwable {
>>      MH.invokeExact(o);
>>    }
>>
>>    test(new B());
>>
>> Before (no inlining):
>> 251   12   !b        TestMH::test (21 bytes)
>>    ...
>>    @ 16   TestMH1$A::m (1 bytes)   virtual call
>>
>> After (guarded inlining):
>> 251   12   !b        TestMH::test (21 bytes)
>>    ...
>>    @ 16   TestMH1$B1::m (1 bytes)   inline (hot)
>>       \-> TypeProfile (-1/6701 counts) = TestMH1$B1
>>
>> Testing: hs-precheckin-comp, hs-tier1, hs-tier2.
>>
>> Best regards,
>> Vladimir Ivanov
>>
>> [1] https://bugs.openjdk.java.net/browse/JDK-8192001

From vladimir.x.ivanov at oracle.com  Fri Jan 25 21:34:05 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Fri, 25 Jan 2019 13:34:05 -0800
Subject: [13] RFR (XS): 8191998: C2: inlining through MH linkers drops
 speculative part of argument types
In-Reply-To: <2c266be3-d068-86ad-a521-3682faa17043@oracle.com>
References: <2c266be3-d068-86ad-a521-3682faa17043@oracle.com>
Message-ID: <f6cd80c1-fe45-5258-c9e7-70d1a716a5b5@oracle.com>

Thanks for reviews, Roland, Tobias, and Vladimir K.

Best regards,
Vladimir Ivanov

On 24/01/2019 17:34, Vladimir Ivanov wrote:
> http://cr.openjdk.java.net/~vlivanov/8191998/webrev.00/
> https://bugs.openjdk.java.net/browse/JDK-8191998
> 
> CallGenerator::for_method_handle_inline() casts MH linker (MH::linkTo*) 
> arguments before attempting inlining. If any argument has a speculative 
> type attached, it is lost and can't be used later.
> 
> The patch preserves speculative part while sharpening the type (if 
> needed) based on static information from the MemberName instance.
> 
> Testing: hs-precheckin-comp, hs-tier1, hs-tier2.
> 
> Best regards,
> Vladimir Ivanov

From vladimir.x.ivanov at oracle.com  Fri Jan 25 21:34:41 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Fri, 25 Jan 2019 13:34:41 -0800
Subject: [13] RFR (S): 8217760: C2: Missing symbolic info on a call from
 intrinsics when invoked through MethodHandle
In-Reply-To: <7b65363c-25cf-9153-8606-1618241ad50b@oracle.com>
References: <7b65363c-25cf-9153-8606-1618241ad50b@oracle.com>
Message-ID: <0e3e4c2e-c0ea-7479-d145-630f0f70aba3@oracle.com>

Thanks for reviews, Tobias & Roland.

Best regards,
Vladimir Ivanov

On 24/01/2019 17:24, Vladimir Ivanov wrote:
> http://cr.openjdk.java.net/~vlivanov/8217760/webrev.00/
> https://bugs.openjdk.java.net/browse/JDK-8217760
> 
> If an intrinsic is called through MethodHandle and it contains a call, 
> then it crashes at the call site during resolution due to inconsistent 
> symbolic info: bytecode refers to method handle linker (MH::linkTo*), 
> but the call invokes some concrete method (result of inlining through 
> the linker).
> 
> The fix is to explicitly attach symbolic info to the call using the 
> machinery introduced by JDK-8072008 [1].
> 
> Testing: hs-precheckin-comp, hs-tier1, hs-tier2, hs-tier3
> 
> Best regards,
> Vladimir Ivanov
> 
> [1] https://bugs.openjdk.java.net/browse/JDK-8072008

From igor.veresov at oracle.com  Fri Jan 25 22:49:05 2019
From: igor.veresov at oracle.com (Igor Veresov)
Date: Fri, 25 Jan 2019 14:49:05 -0800
Subject: [12] RFR(T) 8217828: Un-ProblemList LongMulOverflowTest.java
In-Reply-To: <824B62EC-36BE-49CE-9CD5-A8D7C65D6626@oracle.com>
References: <F08B53F1-24BA-42E6-A136-0AE4676FF11C@oracle.com>
 <824B62EC-36BE-49CE-9CD5-A8D7C65D6626@oracle.com>
Message-ID: <47918555-7C90-4183-A830-3C83335E6BA6@oracle.com>

Thanks, Igor!

igor


> On Jan 25, 2019, at 12:44 PM, Igor Ignatev <igor.ignatyev at oracle.com> wrote:
> 
> Looks good and trivial. 
> 
> ? Igor
> 
> On Jan 25, 2019, at 12:06 PM, Igor Veresov <igor.veresov at oracle.com <mailto:igor.veresov at oracle.com>> wrote:
> 
>> Per Igor Ignatiev a change to a test does not require approval. So, I just need a review. Thanks!
>> 
>> Webrev: http://cr.openjdk.java.net/~iveresov/8217828/webrev.00/ <http://cr.openjdk.java.net/~iveresov/8217828/webrev.00/>
>> 
>> igor
>> 
>> 
>> 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190125/da4964b3/attachment.html>

From vladimir.x.ivanov at oracle.com  Fri Jan 25 23:34:54 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Fri, 25 Jan 2019 15:34:54 -0800
Subject: [13] RFR (S): 8059241: C2: Excessive RemoveUseless passes during
 incremental inlining
In-Reply-To: <753e605c-d1c2-da73-a54f-1db15c5fc253@redhat.com>
References: <00177ee9-7d3d-f3da-e2e8-1f9b76a2bdc8@oracle.com>
 <753e605c-d1c2-da73-a54f-1db15c5fc253@redhat.com>
Message-ID: <55a9e6ab-1f9f-d11d-873b-43a835be233b@oracle.com>

Thanks, Aleksey.

I did some measurements with earlier version [1] and didn't notice any 
regressions:
                           BEFORE                 AFTER
Box2DBench            50.439 ?  5.644       43.549 ?  2.520  ms/op
CryptoBench            6.972 ?  0.137        7.019 ?  0.163  ms/op
DeltaBlueBench       630.607 ?  6.784      644.160 ? 11.296  us/op
EarleyBoyerBench      20.284 ?  0.415       20.735 ?  0.378  ms/op
GbemuBench            58.731 ?  2.852       59.245 ?  3.936  ms/op
NavierStokesBench      6.606 ?  0.042        6.628 ?  0.057  ms/op
PdfJSBench            83.063 ?  4.480       78.909 ?  2.268  ms/op
RaytraceBench       3583.622 ? 40.646     3578.562 ? 45.962  us/op
RegexpBench           70.060 ?  2.538       72.189 ?  2.651  ms/op
RichardsBench        233.414 ?  2.990      233.116 ?  5.641  us/op
SplayBench           956.145 ? 37.547      917.133 ? 37.245  us/op

Let me know about your findings.

Best regards,
Vladimir Ivanov

[1] http://hg.openjdk.java.net/jdk/jdk/rev/80b55cf3a804

On 25/01/2019 04:17, Aleksey Shipilev wrote:
> On 1/25/19 4:18 AM, Vladimir Ivanov wrote:
>> http://cr.openjdk.java.net/~vlivanov/8059241/webrev.03/
>>
>> I ran Octane w/ -XX:+CITime and observed significant reduction in "Incremental Inline" times (~50%
>> off on IGVN, 80-95% off on "Prune Useless").
> 
> Yes, I see "Prune Useless" going down with -XX:+CITime with this patch. However, I suspect there are
> performance regressions, and I am not able to verify them until this one is fixed:
>    https://bugs.openjdk.java.net/browse/JDK-8217782
> 
> -Aleksey
> 

From andrewluotechnologies at outlook.com  Fri Jan 25 23:55:09 2019
From: andrewluotechnologies at outlook.com (Andrew Luo)
Date: Fri, 25 Jan 2019 23:55:09 +0000
Subject: [PATCH] Enhance jaotc to automatically find VS2017+ linker
Message-ID: <MWHPR13MB16964D36E03D218A7AD12FC3A19B0@MWHPR13MB1696.namprd13.prod.outlook.com>

See attached patch.  Any feedback is welcome.

Tested on a system with only VS2017 installed, just ran jaotc with a simple class file, and got the expected .dll output with no errors...

Thanks,

-Andrew

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190125/8627e313/attachment-0001.html>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: jaotcdiff.txt
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190125/8627e313/jaotcdiff-0001.txt>

From andrewluotechnologies at outlook.com  Fri Jan 25 23:56:50 2019
From: andrewluotechnologies at outlook.com (Andrew Luo)
Date: Fri, 25 Jan 2019 23:56:50 +0000
Subject: [PATCH] Enhance jaotc to automatically find VS2017+ linker
In-Reply-To: <MWHPR13MB16964D36E03D218A7AD12FC3A19B0@MWHPR13MB1696.namprd13.prod.outlook.com>
References: <MWHPR13MB16964D36E03D218A7AD12FC3A19B0@MWHPR13MB1696.namprd13.prod.outlook.com>
Message-ID: <MWHPR13MB1696FD3691C0A0080506666EA19B0@MWHPR13MB1696.namprd13.prod.outlook.com>

Minor public -> private visibility fix.  Just noticed right after I sent it out...

Thanks,

-Andrew

From: hotspot-compiler-dev <hotspot-compiler-dev-bounces at openjdk.java.net> On Behalf Of Andrew Luo
Sent: Friday, January 25, 2019 3:55 PM
To: hotspot-compiler-dev at openjdk.java.net
Subject: [PATCH] Enhance jaotc to automatically find VS2017+ linker

See attached patch.  Any feedback is welcome.

Tested on a system with only VS2017 installed, just ran jaotc with a simple class file, and got the expected .dll output with no errors...

Thanks,

-Andrew

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190125/01e381c8/attachment.html>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: jaotcdiff2.txt
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190125/01e381c8/jaotcdiff2.txt>

From igor.veresov at oracle.com  Sat Jan 26 00:19:41 2019
From: igor.veresov at oracle.com (Igor Veresov)
Date: Fri, 25 Jan 2019 16:19:41 -0800
Subject: [PATCH] Enhance jaotc to automatically find VS2017+ linker
In-Reply-To: <MWHPR13MB1696FD3691C0A0080506666EA19B0@MWHPR13MB1696.namprd13.prod.outlook.com>
References: <MWHPR13MB16964D36E03D218A7AD12FC3A19B0@MWHPR13MB1696.namprd13.prod.outlook.com>
 <MWHPR13MB1696FD3691C0A0080506666EA19B0@MWHPR13MB1696.namprd13.prod.outlook.com>
Message-ID: <06B51E32-14DD-49B7-9DBC-79A677EC70AC@oracle.com>

Just checking, have you signed the OCA?

igor


> On Jan 25, 2019, at 3:56 PM, Andrew Luo <andrewluotechnologies at outlook.com> wrote:
> 
> Minor public -> private visibility fix.  Just noticed right after I sent it out?
>  
> Thanks,
>  
> -Andrew
>  
> From: hotspot-compiler-dev <hotspot-compiler-dev-bounces at openjdk.java.net> On Behalf Of Andrew Luo
> Sent: Friday, January 25, 2019 3:55 PM
> To: hotspot-compiler-dev at openjdk.java.net
> Subject: [PATCH] Enhance jaotc to automatically find VS2017+ linker
>  
> See attached patch.  Any feedback is welcome.
>  
> Tested on a system with only VS2017 installed, just ran jaotc with a simple class file, and got the expected .dll output with no errors?
>  
> Thanks,
>  
> -Andrew
>  
> <jaotcdiff2.txt>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190125/8c9bc57a/attachment.html>

From andrewluotechnologies at outlook.com  Sat Jan 26 00:53:04 2019
From: andrewluotechnologies at outlook.com (Andrew Luo)
Date: Sat, 26 Jan 2019 00:53:04 +0000
Subject: [PATCH] Enhance jaotc to automatically find VS2017+ linker
In-Reply-To: <06B51E32-14DD-49B7-9DBC-79A677EC70AC@oracle.com>
References: <MWHPR13MB16964D36E03D218A7AD12FC3A19B0@MWHPR13MB1696.namprd13.prod.outlook.com>
 <MWHPR13MB1696FD3691C0A0080506666EA19B0@MWHPR13MB1696.namprd13.prod.outlook.com>
 <06B51E32-14DD-49B7-9DBC-79A677EC70AC@oracle.com>
Message-ID: <MWHPR13MB1696CCE77E8FB27FF3301A8BA1940@MWHPR13MB1696.namprd13.prod.outlook.com>

Hi Igor,

Yes, I?ve signed an OCA.  I?ve contributed to OpenJDK before, just not on this mailing list.

Thanks,

-Andrew

From: Igor Veresov <igor.veresov at oracle.com>
Sent: Friday, January 25, 2019 4:20 PM
To: Andrew Luo <andrewluotechnologies at outlook.com>
Cc: hotspot-compiler-dev at openjdk.java.net
Subject: Re: [PATCH] Enhance jaotc to automatically find VS2017+ linker

Just checking, have you signed the OCA?

igor


On Jan 25, 2019, at 3:56 PM, Andrew Luo <andrewluotechnologies at outlook.com<mailto:andrewluotechnologies at outlook.com>> wrote:

Minor public -> private visibility fix.  Just noticed right after I sent it out?

Thanks,

-Andrew

From: hotspot-compiler-dev <hotspot-compiler-dev-bounces at openjdk.java.net<mailto:hotspot-compiler-dev-bounces at openjdk.java.net>> On Behalf Of Andrew Luo
Sent: Friday, January 25, 2019 3:55 PM
To: hotspot-compiler-dev at openjdk.java.net<mailto:hotspot-compiler-dev at openjdk.java.net>
Subject: [PATCH] Enhance jaotc to automatically find VS2017+ linker

See attached patch.  Any feedback is welcome.

Tested on a system with only VS2017 installed, just ran jaotc with a simple class file, and got the expected .dll output with no errors?

Thanks,

-Andrew

<jaotcdiff2.txt>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190126/f09fe6aa/attachment-0001.html>

From shade at redhat.com  Sat Jan 26 10:08:53 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Sat, 26 Jan 2019 11:08:53 +0100
Subject: [13] RFR (S): 8059241: C2: Excessive RemoveUseless passes during
 incremental inlining
In-Reply-To: <55a9e6ab-1f9f-d11d-873b-43a835be233b@oracle.com>
References: <00177ee9-7d3d-f3da-e2e8-1f9b76a2bdc8@oracle.com>
 <753e605c-d1c2-da73-a54f-1db15c5fc253@redhat.com>
 <55a9e6ab-1f9f-d11d-873b-43a835be233b@oracle.com>
Message-ID: <05770af2-6fe0-7bca-84eb-54f1c8878ff6@redhat.com>

On 1/26/19 12:34 AM, Vladimir Ivanov wrote:
> I did some measurements with earlier version [1] and didn't notice any regressions:
> ????????????????????????? BEFORE???????????????? AFTER
> Box2DBench??????????? 50.439 ?? 5.644?????? 43.549 ?? 2.520? ms/op
> CryptoBench??????????? 6.972 ?? 0.137??????? 7.019 ?? 0.163? ms/op
> DeltaBlueBench?????? 630.607 ?? 6.784????? 644.160 ? 11.296? us/op
> EarleyBoyerBench????? 20.284 ?? 0.415?????? 20.735 ?? 0.378? ms/op
> GbemuBench??????????? 58.731 ?? 2.852?????? 59.245 ?? 3.936? ms/op
> NavierStokesBench????? 6.606 ?? 0.042??????? 6.628 ?? 0.057? ms/op
> PdfJSBench??????????? 83.063 ?? 4.480?????? 78.909 ?? 2.268? ms/op
> RaytraceBench?????? 3583.622 ? 40.646???? 3578.562 ? 45.962? us/op
> RegexpBench?????????? 70.060 ?? 2.538?????? 72.189 ?? 2.651? ms/op
> RichardsBench??????? 233.414 ?? 2.990????? 233.116 ?? 5.641? us/op
> SplayBench?????????? 956.145 ? 37.547????? 917.133 ? 37.245? us/op
> 
> Let me know about your findings.

Oh no, if you did Octane/JMH runs, this is already okay. I would struggle with plain Octane runner
separately.

-Aleksey

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190126/65965d8c/signature.asc>

From igor.ignatyev at oracle.com  Sat Jan 26 16:32:21 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Sat, 26 Jan 2019 08:32:21 -0800
Subject: RFR(T)[12] : 8217852 : problem-list ctw of jdk.jconsole and
 java.desktop on windows
Message-ID: <CCC4B7DB-F16B-4EF1-9C00-D28C97A3800E@oracle.com>

http://cr.openjdk.java.net/~iignatyev//8217852/webrev.00/index.html
> 4 lines changed: 4 ins; 0 del; 0 mod;

Hi all,

could you please review this ting and trivial patch which problem lists jdk.jconsole and java.desktop* ctw tests on windows? the tests were un-problem listed by 8217580[1] as 8189604[2] was resolved, but it appears we still get similar problems in the same tests.

JBS: https://bugs.openjdk.java.net/browse/JDK-8217852
webrev: http://cr.openjdk.java.net/~iignatyev//8217852/webrev.00/index.html

Thanks,
-- Igor

From vladimir.kozlov at oracle.com  Sat Jan 26 19:53:24 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Sat, 26 Jan 2019 11:53:24 -0800
Subject: RFR(T)[12] : 8217852 : problem-list ctw of jdk.jconsole and
 java.desktop on windows
In-Reply-To: <CCC4B7DB-F16B-4EF1-9C00-D28C97A3800E@oracle.com>
References: <CCC4B7DB-F16B-4EF1-9C00-D28C97A3800E@oracle.com>
Message-ID: <1992B74B-6EF8-459A-BCB2-28A683FA0545@oracle.com>

Good.

Thanks
Vladimir 

> On Jan 26, 2019, at 8:32 AM, Igor Ignatyev <igor.ignatyev at oracle.com> wrote:
> 
> http://cr.openjdk.java.net/~iignatyev//8217852/webrev.00/index.html
>> 4 lines changed: 4 ins; 0 del; 0 mod;
> 
> Hi all,
> 
> could you please review this ting and trivial patch which problem lists jdk.jconsole and java.desktop* ctw tests on windows? the tests were un-problem listed by 8217580[1] as 8189604[2] was resolved, but it appears we still get similar problems in the same tests.
> 
> JBS: https://bugs.openjdk.java.net/browse/JDK-8217852
> webrev: http://cr.openjdk.java.net/~iignatyev//8217852/webrev.00/index.html
> 
> Thanks,
> -- Igor


From igor.ignatyev at oracle.com  Sat Jan 26 20:51:51 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Sat, 26 Jan 2019 12:51:51 -0800
Subject: RFR(T)[12] : 8217852 : problem-list ctw of jdk.jconsole and
 java.desktop on windows
In-Reply-To: <1992B74B-6EF8-459A-BCB2-28A683FA0545@oracle.com>
References: <CCC4B7DB-F16B-4EF1-9C00-D28C97A3800E@oracle.com>
 <1992B74B-6EF8-459A-BCB2-28A683FA0545@oracle.com>
Message-ID: <990315BB-103A-498A-A98E-E20A7A89EDB7@oracle.com>

thanks for your review Vladimir. pushed.

-- Igor

> On Jan 26, 2019, at 11:53 AM, Vladimir Kozlov <vladimir.kozlov at oracle.com> wrote:
> 
> Good.
> 
> Thanks
> Vladimir 
> 
>> On Jan 26, 2019, at 8:32 AM, Igor Ignatyev <igor.ignatyev at oracle.com> wrote:
>> 
>> http://cr.openjdk.java.net/~iignatyev//8217852/webrev.00/index.html
>>> 4 lines changed: 4 ins; 0 del; 0 mod;
>> 
>> Hi all,
>> 
>> could you please review this ting and trivial patch which problem lists jdk.jconsole and java.desktop* ctw tests on windows? the tests were un-problem listed by 8217580[1] as 8189604[2] was resolved, but it appears we still get similar problems in the same tests.
>> 
>> JBS: https://bugs.openjdk.java.net/browse/JDK-8217852
>> webrev: http://cr.openjdk.java.net/~iignatyev//8217852/webrev.00/index.html
>> 
>> Thanks,
>> -- Igor
> 


From sandhya.viswanathan at intel.com  Sun Jan 27 03:47:51 2019
From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya)
Date: Sun, 27 Jan 2019 03:47:51 +0000
Subject: RFR (XS): 8217371: C2: Incorrect LP64 guard in x86.ad after
 JDK-8210764 (Update avx512 implementation)
Message-ID: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A5421D@FMSMSX126.amr.corp.intel.com>

Hi All,

Bug: https://bugs.openjdk.java.net/browse/JDK-8217371
Webrev: http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/

The above webrev fixes the incorrect LP64 guard issue in x86.ad file.
I have corrected the guard to _LP64 and updated the spill/fill instructions.  This bug only affected the Knights family where AVX512VL is not supported.

I have tested it on SKX and Knights family with compiler jtreg tests.

Please review.

Best Regards,
Sandhya


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190127/dda1ba86/attachment.html>

From nils.eliasson at oracle.com  Mon Jan 28 08:40:18 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Mon, 28 Jan 2019 09:40:18 +0100
Subject: RFR (XS): 8217371: C2: Incorrect LP64 guard in x86.ad after
 JDK-8210764 (Update avx512 implementation)
In-Reply-To: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A5421D@FMSMSX126.amr.corp.intel.com>
References: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A5421D@FMSMSX126.amr.corp.intel.com>
Message-ID: <a7deaae8-da75-0068-26cf-e0e40f45f8d2@oracle.com>

Hi Sandhya,

Looks good,

Regards,

Nils

On 2019-01-27 04:47, Viswanathan, Sandhya wrote:
>
> Hi All,
>
> Bug: https://bugs.openjdk.java.net/browse/JDK-8217371 
> <https://bugs.openjdk.java.net/browse/JDK-8217371>
>
> Webrev: http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/ 
> <http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/>
>
> The above webrev fixes the incorrect LP64 guard issue in x86.ad file.
>
> I have corrected the guard to _LP64 and updated the spill/fill 
> instructions.? This bug only affected the Knights family where 
> AVX512VL is not supported.
>
> I have tested it on SKX and Knights family with compiler jtreg tests.
>
> Please review.
>
> Best Regards,
>
> Sandhya
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190128/f52d4a16/attachment.html>

From claes.redestad at oracle.com  Mon Jan 28 09:34:58 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Mon, 28 Jan 2019 10:34:58 +0100
Subject: RFR: 8217869: Add count_leading_zeros utility
Message-ID: <b2f9b972-98be-f990-36ce-349428369cd5@oracle.com>

Hi,

adding a count_leading_zeros implementation using compiler intrinsics
means a straightforward optimization of RegMask::find_highest_bit. For
platforms lacking intrinsic support a more efficient algorithm is
implemented.

Adding a 64-bit version would be straightforward, but let's do that
when we need it.

Bug:    https://bugs.openjdk.java.net/browse/JDK-8217869
Webrev: http://cr.openjdk.java.net/~redestad/8217869/open.00/

Testing: tier1-3

The xlc changes are untested, so it'd be much appreciated if someone
can run the test on that platform.

Thanks!

/Claes

From nils.eliasson at oracle.com  Mon Jan 28 09:36:19 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Mon, 28 Jan 2019 10:36:19 +0100
Subject: RFR: 8217869: Add count_leading_zeros utility
In-Reply-To: <b2f9b972-98be-f990-36ce-349428369cd5@oracle.com>
References: <b2f9b972-98be-f990-36ce-349428369cd5@oracle.com>
Message-ID: <8a5f0388-3969-0d24-53ec-4c72f3e50df0@oracle.com>

A very welcome improvement.

Thank you Claes!

Reviewed,

// Nils

On 2019-01-28 10:34, Claes Redestad wrote:
> Hi,
>
> adding a count_leading_zeros implementation using compiler intrinsics
> means a straightforward optimization of RegMask::find_highest_bit. For
> platforms lacking intrinsic support a more efficient algorithm is
> implemented.
>
> Adding a 64-bit version would be straightforward, but let's do that
> when we need it.
>
> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217869
> Webrev: http://cr.openjdk.java.net/~redestad/8217869/open.00/
>
> Testing: tier1-3
>
> The xlc changes are untested, so it'd be much appreciated if someone
> can run the test on that platform.
>
> Thanks!
>
> /Claes

From claes.redestad at oracle.com  Mon Jan 28 09:50:38 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Mon, 28 Jan 2019 10:50:38 +0100
Subject: RFR: 8217869: Add count_leading_zeros utility
In-Reply-To: <8a5f0388-3969-0d24-53ec-4c72f3e50df0@oracle.com>
References: <b2f9b972-98be-f990-36ce-349428369cd5@oracle.com>
 <8a5f0388-3969-0d24-53ec-4c72f3e50df0@oracle.com>
Message-ID: <50f965e9-460d-cf65-6b9a-e7134adea30c@oracle.com>

Thanks, Nils!

/Claes

On 2019-01-28 10:36, Nils Eliasson wrote:
> A very welcome improvement.
> 
> Thank you Claes!
> 
> Reviewed,
> 
> // Nils

From tobias.hartmann at oracle.com  Mon Jan 28 09:54:35 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 28 Jan 2019 10:54:35 +0100
Subject: RFR: 8217869: Add count_leading_zeros utility
In-Reply-To: <8a5f0388-3969-0d24-53ec-4c72f3e50df0@oracle.com>
References: <b2f9b972-98be-f990-36ce-349428369cd5@oracle.com>
 <8a5f0388-3969-0d24-53ec-4c72f3e50df0@oracle.com>
Message-ID: <039d1b8c-03b5-7a35-e7fd-5263a2436e3d@oracle.com>

Hi Claes,

looks good to me too.

Best regards,
Tobias

On 28.01.19 10:36, Nils Eliasson wrote:
> A very welcome improvement.
> 
> Thank you Claes!
> 
> Reviewed,
> 
> // Nils
> 
> On 2019-01-28 10:34, Claes Redestad wrote:
>> Hi,
>>
>> adding a count_leading_zeros implementation using compiler intrinsics
>> means a straightforward optimization of RegMask::find_highest_bit. For
>> platforms lacking intrinsic support a more efficient algorithm is
>> implemented.
>>
>> Adding a 64-bit version would be straightforward, but let's do that
>> when we need it.
>>
>> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217869
>> Webrev: http://cr.openjdk.java.net/~redestad/8217869/open.00/
>>
>> Testing: tier1-3
>>
>> The xlc changes are untested, so it'd be much appreciated if someone
>> can run the test on that platform.
>>
>> Thanks!
>>
>> /Claes

From tobias.hartmann at oracle.com  Mon Jan 28 10:00:49 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Mon, 28 Jan 2019 11:00:49 +0100
Subject: RFR (XS): 8217371: C2: Incorrect LP64 guard in x86.ad after
 JDK-8210764 (Update avx512 implementation)
In-Reply-To: <a7deaae8-da75-0068-26cf-e0e40f45f8d2@oracle.com>
References: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A5421D@FMSMSX126.amr.corp.intel.com>
 <a7deaae8-da75-0068-26cf-e0e40f45f8d2@oracle.com>
Message-ID: <c263026d-6c94-eec3-2605-63a8c4a09c4e@oracle.com>

Hi Sandhya,

looks good to me too.

Best regards,
Tobias

On 28.01.19 09:40, Nils Eliasson wrote:
> Hi Sandhya,
> 
> Looks good,
> 
> Regards,
> 
> Nils
> 
> On 2019-01-27 04:47, Viswanathan, Sandhya wrote:
>>
>> Hi All,
>>
>> ?
>>
>> Bug: https://bugs.openjdk.java.net/browse/JDK-8217371
>> <https://bugs.openjdk.java.net/browse/JDK-8217371>
>>
>> Webrev: http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/
>> <http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/>
>>
>> ?
>>
>> The above webrev fixes the incorrect LP64 guard issue in x86.ad file.
>>
>> I have corrected the guard to _LP64 and updated the spill/fill instructions.? This bug only
>> affected the Knights family where AVX512VL is not supported.
>>
>> ?
>>
>> I have tested it on SKX and Knights family with compiler jtreg tests.
>>
>> ?
>>
>> Please review.
>>
>> ?
>>
>> Best Regards,
>>
>> Sandhya
>>
>> ?
>>
>> ?
>>
>> ?
>>

From rkennke at redhat.com  Mon Jan 28 12:46:06 2019
From: rkennke at redhat.com (Roman Kennke)
Date: Mon, 28 Jan 2019 13:46:06 +0100
Subject: RFR: 8217874: Shenandoah: AArch64: Clobbered register in
 ShenandoahBarrierSetAssembler::cmpxchg_oop()
Message-ID: <b66618f5-62dd-9255-8ce5-bb70be556afc@redhat.com>

In AArch64, when called from C2, in
ShenandoahBarrierSetAssembler::cmpxchg_oop() the result register may
overlap with other input argument registers and thus fail the leading
assert, and lead to clobbered registers. In the body of the code block,
a temporary register should be used instead, and result should only get
filled in at the end.

Bug:
https://bugs.openjdk.java.net/browse/JDK-8217874
Webrev:
http://cr.openjdk.java.net/~rkennke/JDK-8217874/webrev.00/

Testing: Some tests failed before (e.g. TestVerifyJCStress.java), those
are good now. No regressions in hotspot_gc_shenandoah either.

Can I get a review please?

Thanks, Roman

From lutz.schmidt at sap.com  Mon Jan 28 13:13:19 2019
From: lutz.schmidt at sap.com (Schmidt, Lutz)
Date: Mon, 28 Jan 2019 13:13:19 +0000
Subject: 8217465: RFR(S): [REDO] - Optimize CodeHeap Analytics
Message-ID: <B2E29569-592F-4963-9D91-7E774501129A@sap.com>

Hi all,

may I please request reviews for this REDO of JDK-8217250. The only relevant difference of this REDO is that I moved the 
  #define USE_BUFFEREDSTREAM
line further down. It is now located after all the #include statements. 

The changeset is included in our inhouse tests since Jan 23rd with no issues detected. It was submitted to jdk/submit on Jan 25th with one failure reported on windows-x64 (see attachment). I cannot relate the failure to my changes. Could someone please have a look at the logs? If the reported failure is a false positive, here are the bug and webrev links for your reviews:

Bug:    https://bugs.openjdk.java.net/browse/JDK-8217465
Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217465.00/

Thanks a lot!
Lutz

 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: JDK-8217465_mach5.pdf
Type: application/pdf
Size: 52053 bytes
Desc: JDK-8217465_mach5.pdf
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190128/1ca0c4b3/JDK-8217465_mach5-0001.pdf>

From aph at redhat.com  Mon Jan 28 14:53:28 2019
From: aph at redhat.com (Andrew Haley)
Date: Mon, 28 Jan 2019 14:53:28 +0000
Subject: RFR: 8217874: Shenandoah: AArch64: Clobbered register in
 ShenandoahBarrierSetAssembler::cmpxchg_oop()
In-Reply-To: <b66618f5-62dd-9255-8ce5-bb70be556afc@redhat.com>
References: <b66618f5-62dd-9255-8ce5-bb70be556afc@redhat.com>
Message-ID: <1e348974-7e8c-28eb-ca07-51b4cd54b816@redhat.com>

On 1/28/19 12:46 PM, Roman Kennke wrote:
> Can I get a review please?

Isn't this a bug in the C2 pattern?

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From nils.eliasson at oracle.com  Mon Jan 28 15:15:17 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Mon, 28 Jan 2019 16:15:17 +0100
Subject: RFR(S): C2: Disallow definition split on MachCopySpill nodes
Message-ID: <2f287f46-0bfb-b489-83b9-ba8e4e54199e@oracle.com>

Hi,

We have a problem that we sometimes hit an assert in reg_split.cpp.

https://bugs.openjdk.java.net/browse/JDK-8087128

http://cr.openjdk.java.net/~neliasso/8087128/webrev.01/

We have a block that looks like this:

1262: #??????? B1264 B1263 <- N7283? Freq: 0,0927179
 ?7282?? Region? ===? 7282? 1704? [[ 7282? 1702? 1715 ]]
 ?11500? MemToRegSpillCopy?????? === _? 11190? [[ 9184 ]] 
Oop:com/sun/tools/javac/$
 ?9184?? DefinitionSpillCopy???? === _? 11500? [[ 9185? 1702 11503? 
11505 ]]?? Oop:$
 ?11501? MemToRegSpillCopy?????? === _? 9169? [[ 11665? 11506 11504? 
11502 ]]?? Oop$
 ?11645? MemToRegSpillCopy?????? === _? 9167? [[ 9182 ]] 
Oop:com/sun/tools/javac/c$
 ?9182?? DefinitionSpillCopy???? === _? 11645? [[ 1702? 11646 11647? 
11648 ]]?? Oop$
 ?1715?? checkCastPP???? ===? 7282? 1716? [[ 1702 ]] 
java/util/HashMap$TreeNode:NotN$
 ?9185?? BoundSpillCopy? === _? 9184? [[ 1702 ]] 
Oop:com/sun/tools/javac/code/Symb$
 ?11665? RegToMemSpillCopy?????? === _? 11501? [[ 1702 ]] 
Oop:com/sun/tools/javac/$
 ?1702?? CallStaticJavaDirect??? ===? 7282? 185? 182? 16? 0? 1715 187? 
9185? 11665 $
 ?1703?? MachProj??????? ===? 1702? [[]] #10006/fat


11501 "MemToRegSpillCopy" has one use in this block, "11665 
RegToMemSpillCopy", and three uses in other blocks. The use "11665 
RegToMemSpillCopy" is used by "1702 CallStaticJavaDirect".

We hit the assert when processing 11501 in PhaseChaitin::Split.

We are in the "Handle DEFS" section of the split routine. We only get 
here if the live range has been marked as spilled when the coloring have 
ran out of colors. There are two code paths, one default, where we just 
record the def and updates the side tables, and one where we do a 
definition split on the live range. This split is guarded by several 
conditions. In this case we get here by having a register mask that is 
only regs (UP) and by being in a high pressure region. Everything seems 
ok, so why don't we end up with MachSpillCopies here more often?

Also - one of the 4 uses is in this block (11655) and it's a reg-to-mem 
already. It doesn't make much sense to add even more spills here. Why 
does this happen?

UseFPUForSpilling added a restriction to coalesing - it skips coalescing 
when the two live ranges have different pressure. The reason for this is 
that with FPU spilling, the possible extra spilling is for "free". (I 
can't find any documentation on benchmarks where this is beneficial 
though.) The downside is that we get longer spill chains like: 
DefSpill-memToReg-RegToMem-MemToReg, that doesn't collapse, because of 
pressure changes. This may cause the live ranges defined by 
memToReg-nodes to become spilled, and if we are in a high-pressure 
region - we hit the assert.

So my conclusion is that nothing is really wrong. Everything still works 
without the assert. The spill-chains are unnecessary long, but only 
because we have chosen to restrict the coalescing. But we shouldn't 
split the spill-nodes even more. In the next iteration the coalescing 
within the block will have reduced the chains, and later a proper 
coloring will be found.

My solution is that we prevent the MachSpillCopies (only Mem-To-Regs can 
end up here) from being split again. This is ok - because this is 
exactly what would have happened if we would have been in a low pressure 
region.

I have done some measurements and it doesn't increase the number of 
spill-iterations.

Regards,

Nils


From rkennke at redhat.com  Mon Jan 28 15:59:11 2019
From: rkennke at redhat.com (Roman Kennke)
Date: Mon, 28 Jan 2019 16:59:11 +0100
Subject: RFR: 8217874: Shenandoah: AArch64: Clobbered register in
 ShenandoahBarrierSetAssembler::cmpxchg_oop()
In-Reply-To: <1e348974-7e8c-28eb-ca07-51b4cd54b816@redhat.com>
References: <b66618f5-62dd-9255-8ce5-bb70be556afc@redhat.com>
 <1e348974-7e8c-28eb-ca07-51b4cd54b816@redhat.com>
Message-ID: <b3dce1fd-73e7-ddb7-b995-5650dc82e3f2@redhat.com>

> On 1/28/19 12:46 PM, Roman Kennke wrote:
>> Can I get a review please?
> 
> Isn't this a bug in the C2 pattern?

No. The declaration in C2 means that the result argument is supposed to
be live starting from the (end of) the instruction, and input arguments
are live up to the instruction. Overlap within an instruction need to be
taken care of by the implementation. (Doesn't usually happen, because
instructions are usually lowered to a single instruction.)

Roman

From aph at redhat.com  Mon Jan 28 16:54:22 2019
From: aph at redhat.com (Andrew Haley)
Date: Mon, 28 Jan 2019 16:54:22 +0000
Subject: RFR: 8217874: Shenandoah: AArch64: Clobbered register in
 ShenandoahBarrierSetAssembler::cmpxchg_oop()
In-Reply-To: <b3dce1fd-73e7-ddb7-b995-5650dc82e3f2@redhat.com>
References: <b66618f5-62dd-9255-8ce5-bb70be556afc@redhat.com>
 <1e348974-7e8c-28eb-ca07-51b4cd54b816@redhat.com>
 <b3dce1fd-73e7-ddb7-b995-5650dc82e3f2@redhat.com>
Message-ID: <ea9c6789-5cec-e54b-c7c4-606a5459b8cf@redhat.com>

On 1/28/19 3:59 PM, Roman Kennke wrote:
>> On 1/28/19 12:46 PM, Roman Kennke wrote:
>>> Can I get a review please?
>>
>> Isn't this a bug in the C2 pattern?
> 
> No. The declaration in C2 means that the result argument is supposed to
> be live starting from the (end of) the instruction, and input arguments
> are live up to the instruction. Overlap within an instruction need to be
> taken care of by the implementation. (Doesn't usually happen, because
> instructions are usually lowered to a single instruction.)

Oh yeah. It's all rather grim, but I guess it's not really worth
worrying about.

-- 
Andrew Haley
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671

From vladimir.kozlov at oracle.com  Mon Jan 28 17:16:48 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 28 Jan 2019 09:16:48 -0800
Subject: RFR (XS): 8217371: C2: Incorrect LP64 guard in x86.ad after
 JDK-8210764 (Update avx512 implementation)
In-Reply-To: <c263026d-6c94-eec3-2605-63a8c4a09c4e@oracle.com>
References: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A5421D@FMSMSX126.amr.corp.intel.com>
 <a7deaae8-da75-0068-26cf-e0e40f45f8d2@oracle.com>
 <c263026d-6c94-eec3-2605-63a8c4a09c4e@oracle.com>
Message-ID: <8d70f0c6-4caa-ab2c-d5dd-608c564037c5@oracle.com>

+1

Thanks,
Vladimir

On 1/28/19 2:00 AM, Tobias Hartmann wrote:
> Hi Sandhya,
> 
> looks good to me too.
> 
> Best regards,
> Tobias
> 
> On 28.01.19 09:40, Nils Eliasson wrote:
>> Hi Sandhya,
>>
>> Looks good,
>>
>> Regards,
>>
>> Nils
>>
>> On 2019-01-27 04:47, Viswanathan, Sandhya wrote:
>>>
>>> Hi All,
>>>
>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8217371
>>> <https://bugs.openjdk.java.net/browse/JDK-8217371>
>>>
>>> Webrev: http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/
>>> <http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/>
>>>
>>> The above webrev fixes the incorrect LP64 guard issue in x86.ad file.
>>>
>>> I have corrected the guard to _LP64 and updated the spill/fill instructions.? This bug only
>>> affected the Knights family where AVX512VL is not supported.
>>>   
>>> I have tested it on SKX and Knights family with compiler jtreg tests.   
>>> Please review.
>>>
>>>
>>> Best Regards,
>>> Sandhya
>>>

From sandhya.viswanathan at intel.com  Mon Jan 28 17:22:15 2019
From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya)
Date: Mon, 28 Jan 2019 17:22:15 +0000
Subject: RFR (XS): 8217371: C2: Incorrect LP64 guard in x86.ad after
 JDK-8210764 (Update avx512 implementation)
In-Reply-To: <8d70f0c6-4caa-ab2c-d5dd-608c564037c5@oracle.com>
References: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A5421D@FMSMSX126.amr.corp.intel.com>
 <a7deaae8-da75-0068-26cf-e0e40f45f8d2@oracle.com>
 <c263026d-6c94-eec3-2605-63a8c4a09c4e@oracle.com>
 <8d70f0c6-4caa-ab2c-d5dd-608c564037c5@oracle.com>
Message-ID: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A547B0@FMSMSX126.amr.corp.intel.com>

Thanks Vladimir, Tobias and Nils.

Could Vivek go ahead and push it?

Best Regards,
Sandhya


-----Original Message-----
From: Vladimir Kozlov [mailto:vladimir.kozlov at oracle.com] 
Sent: Monday, January 28, 2019 9:17 AM
To: hotspot-compiler-dev at openjdk.java.net; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>
Subject: Re: RFR (XS): 8217371: C2: Incorrect LP64 guard in x86.ad after JDK-8210764 (Update avx512 implementation)

+1

Thanks,
Vladimir

On 1/28/19 2:00 AM, Tobias Hartmann wrote:
> Hi Sandhya,
> 
> looks good to me too.
> 
> Best regards,
> Tobias
> 
> On 28.01.19 09:40, Nils Eliasson wrote:
>> Hi Sandhya,
>>
>> Looks good,
>>
>> Regards,
>>
>> Nils
>>
>> On 2019-01-27 04:47, Viswanathan, Sandhya wrote:
>>>
>>> Hi All,
>>>
>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8217371
>>> <https://bugs.openjdk.java.net/browse/JDK-8217371>
>>>
>>> Webrev: http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/
>>> <http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/>
>>>
>>> The above webrev fixes the incorrect LP64 guard issue in x86.ad file.
>>>
>>> I have corrected the guard to _LP64 and updated the spill/fill 
>>> instructions.? This bug only affected the Knights family where AVX512VL is not supported.
>>>   
>>> I have tested it on SKX and Knights family with compiler jtreg tests.   
>>> Please review.
>>>
>>>
>>> Best Regards,
>>> Sandhya
>>>

From vladimir.kozlov at oracle.com  Mon Jan 28 17:38:36 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Mon, 28 Jan 2019 09:38:36 -0800
Subject: RFR (XS): 8217371: C2: Incorrect LP64 guard in x86.ad after
 JDK-8210764 (Update avx512 implementation)
In-Reply-To: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A547B0@FMSMSX126.amr.corp.intel.com>
References: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A5421D@FMSMSX126.amr.corp.intel.com>
 <a7deaae8-da75-0068-26cf-e0e40f45f8d2@oracle.com>
 <c263026d-6c94-eec3-2605-63a8c4a09c4e@oracle.com>
 <8d70f0c6-4caa-ab2c-d5dd-608c564037c5@oracle.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A547B0@FMSMSX126.amr.corp.intel.com>
Message-ID: <3e712082-1def-28ce-f45d-3ef54864e89e@oracle.com>

Hi Sandhya,

Can you also run Lucene tests which hit previous avx512 issues on SKX and Knights?
This is spilling code and it is used when a lot of xmm registers are used and our jtreg tests may not use this code.

You can push after that.

Thanks,
Vladimir

On 1/28/19 9:22 AM, Viswanathan, Sandhya wrote:
> Thanks Vladimir, Tobias and Nils.
> 
> Could Vivek go ahead and push it?
> 
> Best Regards,
> Sandhya
> 
> 
> -----Original Message-----
> From: Vladimir Kozlov [mailto:vladimir.kozlov at oracle.com]
> Sent: Monday, January 28, 2019 9:17 AM
> To: hotspot-compiler-dev at openjdk.java.net; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>
> Subject: Re: RFR (XS): 8217371: C2: Incorrect LP64 guard in x86.ad after JDK-8210764 (Update avx512 implementation)
> 
> +1
> 
> Thanks,
> Vladimir
> 
> On 1/28/19 2:00 AM, Tobias Hartmann wrote:
>> Hi Sandhya,
>>
>> looks good to me too.
>>
>> Best regards,
>> Tobias
>>
>> On 28.01.19 09:40, Nils Eliasson wrote:
>>> Hi Sandhya,
>>>
>>> Looks good,
>>>
>>> Regards,
>>>
>>> Nils
>>>
>>> On 2019-01-27 04:47, Viswanathan, Sandhya wrote:
>>>>
>>>> Hi All,
>>>>
>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8217371
>>>> <https://bugs.openjdk.java.net/browse/JDK-8217371>
>>>>
>>>> Webrev: http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/
>>>> <http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/>
>>>>
>>>> The above webrev fixes the incorrect LP64 guard issue in x86.ad file.
>>>>
>>>> I have corrected the guard to _LP64 and updated the spill/fill
>>>> instructions.? This bug only affected the Knights family where AVX512VL is not supported.
>>>>    
>>>> I have tested it on SKX and Knights family with compiler jtreg tests.
>>>> Please review.
>>>>
>>>>
>>>> Best Regards,
>>>> Sandhya
>>>>

From vivek.r.deshpande at intel.com  Mon Jan 28 17:44:36 2019
From: vivek.r.deshpande at intel.com (Deshpande, Vivek R)
Date: Mon, 28 Jan 2019 17:44:36 +0000
Subject: RFR(XS):8216580:X86: Fix generation of VNNI vector code by
 allowing adjacent LoadS nodes to be isomorphic
In-Reply-To: <abb3e9b2-15c5-20a5-46c8-e6cc01ba4a62@oracle.com>
References: <53E8E64DB2403849AFD89B7D4DAC8B2A9A14A6DA@ORSMSX106.amr.corp.intel.com>
 <abb3e9b2-15c5-20a5-46c8-e6cc01ba4a62@oracle.com>
Message-ID: <53E8E64DB2403849AFD89B7D4DAC8B2A9A15F100@ORSMSX106.amr.corp.intel.com>

Hi Vladimir

Would you please take a look at the patch. 
The Adjacent LoadS have different control RangeCheck node for accesses of type a[2i] and a[2i+1].
This patch allows those nodes to be isomorphic as they belong same counted loop and MulAddS2I nodes.
 
Webrev:
http://cr.openjdk.java.net/~vdeshpande/8216580/webrev.01/

Regards,
Vivek

-----Original Message-----
From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com] 
Sent: Tuesday, January 15, 2019 2:57 AM
To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; hotspot-compiler-dev at openjdk.java.net compiler <hotspot-compiler-dev at openjdk.java.net>
Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; Raj, Guru <guru.raj at intel.com>
Subject: Re: RFR(XS):8216580:X86: Fix generation of VNNI vector code by allowing adjacent LoadS nodes to be isomorphic

Hi Vivek,

please add parentheses around the == comparison in lines 1225,1226.

Otherwise this looks reasonable to me but I'm not too familiar with that code.

Best regards,
Tobias

On 12.01.19 01:03, Deshpande, Vivek R wrote:
> Hi Tobias
> 
> The webrev for the bug JDK-821650 is here:
> http://cr.openjdk.java.net/~vdeshpande/8216580/webrev.00/
> This fixes generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have different control RangeCheck nodes for a[i] and a[i+1] accesses in same MulAddS2I node.
> Could you please review it.
> 
> Regards,
> Vivek
> 
> -----Original Message-----
> From: Deshpande, Vivek R
> Sent: Friday, January 11, 2019 11:38 AM
> To: 'Tobias Hartmann' <tobias.hartmann at oracle.com>; 
> hotspot-compiler-dev at openjdk.java.net compiler 
> <hotspot-compiler-dev at openjdk.java.net>
> Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, Sandhya 
> <sandhya.viswanathan at intel.com>; Raj, Guru <guru.raj at intel.com>
> Subject: RE: RFR(S):8216050:X86: Fix for Superword optimization fails 
> with assert(0 <= i && i < _len) failed: illegal index
> 
> Hi Tobias
> 
> Thanks for reviewing the patch.
> I have made the changes according to your suggestion.
> In this webrev: 
> http://cr.openjdk.java.net/~vdeshpande/8216050/webrev.01/
> I have fix for the crash reported in the 8216050.
> 
> The lower cost is needed for generation of vpdpwssd instruction, by combining AddVI and MulAddVS2VI.
> For other instructions pmaddwd and vpmaddwd, they get generated on platforms upto skylake with default cost.
> 
> I have updated the bug also with the link to webrev.
> 
> I have created a different bug JDK-8216580 for
>  3) Fix generation of vector code by allowing adjacent LoadS nodes to be isomorphic when they have different control RangeCheck nodes
>      for a[i] and a[i+1] accesses in same MulAddS2I node
> 
> Thank you.
> Regards,
> Vivek
> 
> -----Original Message-----
> From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com]
> Sent: Friday, January 11, 2019 4:49 AM
> To: Deshpande, Vivek R <vivek.r.deshpande at intel.com>; 
> hotspot-compiler-dev at openjdk.java.net compiler 
> <hotspot-compiler-dev at openjdk.java.net>
> Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; Viswanathan, Sandhya 
> <sandhya.viswanathan at intel.com>; Raj, Guru <guru.raj at intel.com>
> Subject: Re: RFR(S):8216050:X86: Fix for Superword optimization fails 
> with assert(0 <= i && i < _len) failed: illegal index
> 
> Hi Vivek,
> 
> On 11.01.19 07:58, Deshpande, Vivek R wrote:
>> 1) Fix for the crash by matching the operand by swapping to right positions. 
> 
> Looks good but the change to loopopts.cpp:530 screwed up the indentation around the ifs, please fix.
> 
>> 2) Cost based generation of vpdpwssd instruction. 
> 
> Other instructions added by JDK-8214751 still miss a cost definition, for example:
> http://hg.openjdk.java.net/jdk/jdk/rev/4bb6e0871bf7#l5.20
> 
>> 3) Fix generation of vector code by allowing adjacent LoadS nodes to 
>> be isomorphic when they have different control RangeCheck nodes
>> ????for a[i] and a[i+1] accesses in same MulAddS2I node
> 
> This is unrelated to the original bug, right? If so, this should be integrated with a separate RFE.
> 
> Thanks,
> Tobias
> 

From sandhya.viswanathan at intel.com  Mon Jan 28 18:39:47 2019
From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya)
Date: Mon, 28 Jan 2019 18:39:47 +0000
Subject: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over
 non-volatile memory
In-Reply-To: <a377e3da-549a-1ee1-a416-ef243545f92b@oracle.com>
References: <07d4d0fe-3a2a-414c-74ac-69e8527785d9@redhat.com>
 <c4fdca2f-7c6a-7429-6b1a-efdff2994bf6@oracle.com>
 <3df3b5cd-dbc7-7fdd-bfc5-2a54d11127da@redhat.com>
 <27c9458d-7257-378a-4e3a-bd03402794be@oracle.com>
 <df03ca64-d68a-ca8b-5309-395672ac7dcb@redhat.com>
 <09c92b1a-c6da-e16b-bb68-553876e8a6ea@oracle.com>
 <a68f718e-9f87-f7e5-515c-f63d827fedaa@redhat.com>
 <ed8fd92d-fd4e-dc30-af8c-8d30da37560c@oracle.com>
 <b19ba189-0141-3e25-8301-48f2d3ce3851@redhat.com>
 <b3f402a3-65ba-b776-cc9e-40ceaaeafefd@redhat.com>
 <f21dac0b-0043-7e66-72a4-c64a45fb59d3@redhat.com>
 <2a0a385d-81ee-df66-f147-e4dd9aa5b72e@oracle.com>
 <8b2ab749-20f1-8dd3-3cc7-64db5d45bc7d@redhat.com>
 <0aae37aa-7797-fde5-63d5-96c8eb961183@oracle.com>
 <86a1988a-a8d2-b6af-0985-11a94d6d76a5@redhat.com>
 <69510788-52e6-815b-1ed7-a6f4886d0398@oracle.com>
 <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>
 <34cfc530-8517-ac1a-0c04-446dc3dc2436@oracle.com>
 <21ef0e11-3f3d-e9a4-5dc6-898d4ac18efa@redhat.com>
 <a377e3da-549a-1ee1-a416-ef243545f92b@oracle.com>
Message-ID: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A5489E@FMSMSX126.amr.corp.intel.com>

Hi Alan,

Could you please let us know more on what does it mean to be a jdk-specific feature? How it is to be implemented? An example would be very helpful. 
ByteBuffer is a widely used API and deprecating ByteBuffer any time would make it difficult for more and more Java software frameworks to move up to the latest JDK.  

Best Regards,
Sandhya


-----Original Message-----
From: Alan Bateman [mailto:Alan.Bateman at oracle.com] 
Sent: Friday, January 18, 2019 5:33 AM
To: Andrew Dinn <adinn at redhat.com>; Brian Goetz <brian.goetz at oracle.com>
Cc: core-libs-dev at openjdk.java.net; hotspot compiler <hotspot-compiler-dev at openjdk.java.net>; Jonathan Halliday <jonathan.halliday at redhat.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>
Subject: Re: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over non-volatile memory

On 17/01/2019 14:27, Andrew Dinn wrote:
> :
>> Vladimir and I have reviewed the JEP, it will need an area lead to 
>> endorse, I think it can be Brian or Mikael in this case.
> Ok, thanks for the above answers. Looking forward to hearing further 
> from Brian and/or Mikael (Vidstedt, I assume? :-).
I had a brief discussion with Brian about this yesterday. He brought up the same concern about using MBB as it's not the right API for this in the longer term.? So this JEP is very much about a short term/tactical solution as we've already concluded here. This leads to the question as to whether this JEP needs to evolve the standard/Java SE API or not. 
It's convenient for the implementation of course but we should at least explore doing this as a JDK-specific feature.

To that end, one approach to explore is allowing the FC.map method accept map modes beyond those defined by MapMode. There is precedence for extensibility in this area already, e.g. FC.open allows you to specify options beyond the standard options specified by the method. It would require MapMode to define a protected constructor and would require a bit of plumbing to support MapMode defined in a JDK-specific module but there are examples to point to. Another approach is aanother class in a JDK-specific module to define the map method. It would require the same plumbing under the covers but would avoid touch the FC spec.

-Alan


From Alan.Bateman at oracle.com  Mon Jan 28 19:45:26 2019
From: Alan.Bateman at oracle.com (Alan Bateman)
Date: Mon, 28 Jan 2019 19:45:26 +0000
Subject: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over
 non-volatile memory
In-Reply-To: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A5489E@FMSMSX126.amr.corp.intel.com>
References: <07d4d0fe-3a2a-414c-74ac-69e8527785d9@redhat.com>
 <27c9458d-7257-378a-4e3a-bd03402794be@oracle.com>
 <df03ca64-d68a-ca8b-5309-395672ac7dcb@redhat.com>
 <09c92b1a-c6da-e16b-bb68-553876e8a6ea@oracle.com>
 <a68f718e-9f87-f7e5-515c-f63d827fedaa@redhat.com>
 <ed8fd92d-fd4e-dc30-af8c-8d30da37560c@oracle.com>
 <b19ba189-0141-3e25-8301-48f2d3ce3851@redhat.com>
 <b3f402a3-65ba-b776-cc9e-40ceaaeafefd@redhat.com>
 <f21dac0b-0043-7e66-72a4-c64a45fb59d3@redhat.com>
 <2a0a385d-81ee-df66-f147-e4dd9aa5b72e@oracle.com>
 <8b2ab749-20f1-8dd3-3cc7-64db5d45bc7d@redhat.com>
 <0aae37aa-7797-fde5-63d5-96c8eb961183@oracle.com>
 <86a1988a-a8d2-b6af-0985-11a94d6d76a5@redhat.com>
 <69510788-52e6-815b-1ed7-a6f4886d0398@oracle.com>
 <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>
 <34cfc530-8517-ac1a-0c04-446dc3dc2436@oracle.com>
 <21ef0e11-3f3d-e9a4-5dc6-898d4ac18efa@redhat.com>
 <a377e3da-549a-1ee1-a416-ef243545f92b@oracle.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A5489E@FMSMSX126.amr.corp.intel.com>
Message-ID: <2bfc26aa-ec5e-e36c-5de9-c44853038d99@oracle.com>

On 28/01/2019 18:39, Viswanathan, Sandhya wrote:
> Hi Alan,
>
> Could you please let us know more on what does it mean to be a jdk-specific feature? How it is to be implemented? An example would be very helpful.
> ByteBuffer is a widely used API and deprecating ByteBuffer any time would make it difficult for more and more Java software frameworks to move up to the latest JDK.
>
In the API docs, you'll see a number of JDK-specific modules with names 
that start with "jdk." instead of "java.". Many of these modules export 
JDK-specific APIs. The jdk.attach module exports the JDK-specific 
com.sun.tools.attach API. The jdk.management module exports the 
com.sun.management API which has defined JDK-specific extensions to the 
management API since JDK 5.

Closer to the feature under discussion are APIs that are extensible to 
allow for support beyond what the Java SE API specifies. The Direct I/O 
feature in JDK 10 defined a JDK-specific OpenOption that you can specify 
to FileChannel.open, e.g.:

 ? var channel = FileChannel.open(file, StandardOpenOption.WRITE, 
ExtendedOpenOption.DIRECT);

Another example is socket options. Java SE defines? "standard" socket 
options in java.net.StandardSocketOptions but an implementation can 
support many others. The JDK has the jdk.net.ExtendedSocketOption to 
define additional socket options so you can do things like this:

 ?? Socket s = ...
 ?? s.setOption(ExtendedSocketOption.TCP_KEEPIDLE, 5);

The suggestion on the table is to see if we can do the same for file 
mapping modes so that the platform specific MAP_SYNC mode can be used 
with the existing API. This would allow for code like this:

 ?? MappedByteBuffer mbb = fc.map(ExtendedMapMode.READ_WRITE_SYNC, 
position, size);

There's plumbing needed to make this work as the underlying 
implementation would be in java.base but the platform specific map mode 
defined in a JDK-specific module. There are several advantages to the 
approach, the main one is that it doesn't commit Java SE to supporting 
this mode. I'm hoping to meet up with Andrew Dinn at FOSDEM to discuss 
this approach in a bit more detail.

You asked about deprecating ByteBuffer but I don't think there is any 
suggestion to do that here. Once Panama is further along, specifically 
the memory region or scope/pointer API, then interop with ByteBuffer 
will need to be worked out.

-Alan

From sandhya.viswanathan at intel.com  Mon Jan 28 19:54:20 2019
From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya)
Date: Mon, 28 Jan 2019 19:54:20 +0000
Subject: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over
 non-volatile memory
In-Reply-To: <2bfc26aa-ec5e-e36c-5de9-c44853038d99@oracle.com>
References: <07d4d0fe-3a2a-414c-74ac-69e8527785d9@redhat.com>
 <27c9458d-7257-378a-4e3a-bd03402794be@oracle.com>
 <df03ca64-d68a-ca8b-5309-395672ac7dcb@redhat.com>
 <09c92b1a-c6da-e16b-bb68-553876e8a6ea@oracle.com>
 <a68f718e-9f87-f7e5-515c-f63d827fedaa@redhat.com>
 <ed8fd92d-fd4e-dc30-af8c-8d30da37560c@oracle.com>
 <b19ba189-0141-3e25-8301-48f2d3ce3851@redhat.com>
 <b3f402a3-65ba-b776-cc9e-40ceaaeafefd@redhat.com>
 <f21dac0b-0043-7e66-72a4-c64a45fb59d3@redhat.com>
 <2a0a385d-81ee-df66-f147-e4dd9aa5b72e@oracle.com>
 <8b2ab749-20f1-8dd3-3cc7-64db5d45bc7d@redhat.com>
 <0aae37aa-7797-fde5-63d5-96c8eb961183@oracle.com>
 <86a1988a-a8d2-b6af-0985-11a94d6d76a5@redhat.com>
 <69510788-52e6-815b-1ed7-a6f4886d0398@oracle.com>
 <3e3c4f7d-049e-4aec-c165-f2664e7c98ef@redhat.com>
 <34cfc530-8517-ac1a-0c04-446dc3dc2436@oracle.com>
 <21ef0e11-3f3d-e9a4-5dc6-898d4ac18efa@redhat.com>
 <a377e3da-549a-1ee1-a416-ef243545f92b@oracle.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A5489E@FMSMSX126.amr.corp.intel.com>
 <2bfc26aa-ec5e-e36c-5de9-c44853038d99@oracle.com>
Message-ID: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A5494F@FMSMSX126.amr.corp.intel.com>

Thanks a lot Alan! This is very helpful.

Best Regards,
Sandhya


-----Original Message-----
From: Alan Bateman [mailto:Alan.Bateman at oracle.com] 
Sent: Monday, January 28, 2019 11:45 AM
To: Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; Andrew Dinn <adinn at redhat.com>; Brian Goetz <brian.goetz at oracle.com>
Cc: core-libs-dev at openjdk.java.net; hotspot compiler <hotspot-compiler-dev at openjdk.java.net>; Jonathan Halliday <jonathan.halliday at redhat.com>
Subject: Re: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over non-volatile memory

On 28/01/2019 18:39, Viswanathan, Sandhya wrote:
> Hi Alan,
>
> Could you please let us know more on what does it mean to be a jdk-specific feature? How it is to be implemented? An example would be very helpful.
> ByteBuffer is a widely used API and deprecating ByteBuffer any time would make it difficult for more and more Java software frameworks to move up to the latest JDK.
>
In the API docs, you'll see a number of JDK-specific modules with names that start with "jdk." instead of "java.". Many of these modules export JDK-specific APIs. The jdk.attach module exports the JDK-specific com.sun.tools.attach API. The jdk.management module exports the com.sun.management API which has defined JDK-specific extensions to the management API since JDK 5.

Closer to the feature under discussion are APIs that are extensible to allow for support beyond what the Java SE API specifies. The Direct I/O feature in JDK 10 defined a JDK-specific OpenOption that you can specify to FileChannel.open, e.g.:

 ? var channel = FileChannel.open(file, StandardOpenOption.WRITE, ExtendedOpenOption.DIRECT);

Another example is socket options. Java SE defines? "standard" socket options in java.net.StandardSocketOptions but an implementation can support many others. The JDK has the jdk.net.ExtendedSocketOption to define additional socket options so you can do things like this:

 ?? Socket s = ...
 ?? s.setOption(ExtendedSocketOption.TCP_KEEPIDLE, 5);

The suggestion on the table is to see if we can do the same for file mapping modes so that the platform specific MAP_SYNC mode can be used with the existing API. This would allow for code like this:

 ?? MappedByteBuffer mbb = fc.map(ExtendedMapMode.READ_WRITE_SYNC,
position, size);

There's plumbing needed to make this work as the underlying implementation would be in java.base but the platform specific map mode defined in a JDK-specific module. There are several advantages to the approach, the main one is that it doesn't commit Java SE to supporting this mode. I'm hoping to meet up with Andrew Dinn at FOSDEM to discuss this approach in a bit more detail.

You asked about deprecating ByteBuffer but I don't think there is any suggestion to do that here. Once Panama is further along, specifically the memory region or scope/pointer API, then interop with ByteBuffer will need to be worked out.

-Alan

From igor.ignatyev at oracle.com  Mon Jan 28 21:23:52 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Mon, 28 Jan 2019 13:23:52 -0800
Subject: RFR(T)[12] : 8207922  : ctw of jdk.security.auth failed with
 "Unexpected zero exit codebefore finishing all compilations"
Message-ID: <70A5BA4C-CDB3-4C74-ACEA-2FC692047691@oracle.com>

http://cr.openjdk.java.net/~iignatyev//8207922/webrev.00/index.html
> 2 lines changed: 2 ins; 0 del; 0 mod;

Hi all,

could you please review this tiny patch for ctw tests? ctw tests failed if the line which has the last classname got collided with JVM output (from WhiteBox class). the fix redirects all JVM output to stderr, so it will not mix up w/ ctw-library output.

webrev: http://cr.openjdk.java.net/~iignatyev//8207922/webrev.00/index.html
JBS: https://bugs.openjdk.java.net/browse/JDK-8207922

Thanks,
-- Igor

From claes.redestad at oracle.com  Mon Jan 28 21:32:01 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Mon, 28 Jan 2019 22:32:01 +0100
Subject: RFR: 8217922: Compiler dead code removal
Message-ID: <dbe69316-c290-3463-949a-02aca8702be4@oracle.com>

Hi,

please review this patch to remove dead code in the compiler area.

Bug:    https://bugs.openjdk.java.net/browse/JDK-8217922
Webrev: http://cr.openjdk.java.net/~redestad/8217922/open.00/

Testing: tier1+2

I've taken care to check every change to ensure it's not in use by some
platform specific code on a platform that's not built/supported by
Oracle, but help verifying this would be greatly appreciated.

I've also checked the patch applies cleanly to a few valhalla and amber
branches to avoid excessive merge issues.

Thanks!

/Claes

From per.liden at oracle.com  Mon Jan 28 21:50:41 2019
From: per.liden at oracle.com (Per Liden)
Date: Mon, 28 Jan 2019 22:50:41 +0100
Subject: RFR: 8217922: Compiler dead code removal
In-Reply-To: <dbe69316-c290-3463-949a-02aca8702be4@oracle.com>
References: <dbe69316-c290-3463-949a-02aca8702be4@oracle.com>
Message-ID: <8218b322-1046-b0a0-7f7e-a0132da9cb27@oracle.com>

On 01/28/2019 10:32 PM, Claes Redestad wrote:
> Hi,
> 
> please review this patch to remove dead code in the compiler area.
> 
> Bug:    https://bugs.openjdk.java.net/browse/JDK-8217922
> Webrev: http://cr.openjdk.java.net/~redestad/8217922/open.00/

Do you have a GC version of this in the works too? I noticed that this 
patch touches src/hotspot/share/gc/shared/collectorPolicy.hpp, which in 
that case could move to that patch.

/Per

> 
> Testing: tier1+2
> 
> I've taken care to check every change to ensure it's not in use by some
> platform specific code on a platform that's not built/supported by
> Oracle, but help verifying this would be greatly appreciated.
> 
> I've also checked the patch applies cleanly to a few valhalla and amber
> branches to avoid excessive merge issues.
> 
> Thanks!
> 
> /Claes

From claes.redestad at oracle.com  Mon Jan 28 21:59:40 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Mon, 28 Jan 2019 22:59:40 +0100
Subject: RFR: 8217922: Compiler dead code removal
In-Reply-To: <8218b322-1046-b0a0-7f7e-a0132da9cb27@oracle.com>
References: <dbe69316-c290-3463-949a-02aca8702be4@oracle.com>
 <8218b322-1046-b0a0-7f7e-a0132da9cb27@oracle.com>
Message-ID: <677dcce2-40f4-2f1e-30a0-7cb2aee82ff8@oracle.com>


On 2019-01-28 22:50, Per Liden wrote:
>> Webrev: http://cr.openjdk.java.net/~redestad/8217922/open.00/
> 
> Do you have a GC version of this in the works too? I noticed that this 
> patch touches src/hotspot/share/gc/shared/collectorPolicy.hpp, which in 
> that case could move to that patch.

Oops, seems I filtered this mentally as "compilationPolicy". I don't
have a focused GC cleanup ready, but I'll pull that change and save
it for later.

/Claes

From igor.veresov at oracle.com  Mon Jan 28 23:14:13 2019
From: igor.veresov at oracle.com (Igor Veresov)
Date: Mon, 28 Jan 2019 15:14:13 -0800
Subject: [PATCH] Enhance jaotc to automatically find VS2017+ linker
In-Reply-To: <MWHPR13MB1696CCE77E8FB27FF3301A8BA1940@MWHPR13MB1696.namprd13.prod.outlook.com>
References: <MWHPR13MB16964D36E03D218A7AD12FC3A19B0@MWHPR13MB1696.namprd13.prod.outlook.com>
 <MWHPR13MB1696FD3691C0A0080506666EA19B0@MWHPR13MB1696.namprd13.prod.outlook.com>
 <06B51E32-14DD-49B7-9DBC-79A677EC70AC@oracle.com>
 <MWHPR13MB1696CCE77E8FB27FF3301A8BA1940@MWHPR13MB1696.namprd13.prod.outlook.com>
Message-ID: <F2E366FE-E768-4BDD-A9C8-DFD5487C452A@oracle.com>

Alright, the code seems ok to me. I ran through our testing and there are no issues. Let?s get you a second review and then I can push your change upstream to the Graal repo.

igor


> On Jan 25, 2019, at 4:53 PM, Andrew Luo <andrewluotechnologies at outlook.com> wrote:
> 
> Hi Igor,
>  
> Yes, I?ve signed an OCA.  I?ve contributed to OpenJDK before, just not on this mailing list.
>  
> Thanks,
>  
> -Andrew
>  
> From: Igor Veresov <igor.veresov at oracle.com> 
> Sent: Friday, January 25, 2019 4:20 PM
> To: Andrew Luo <andrewluotechnologies at outlook.com>
> Cc: hotspot-compiler-dev at openjdk.java.net
> Subject: Re: [PATCH] Enhance jaotc to automatically find VS2017+ linker
>  
> Just checking, have you signed the OCA?
>  
> igor
> 
> 
> 
> 
> 
> On Jan 25, 2019, at 3:56 PM, Andrew Luo <andrewluotechnologies at outlook.com <mailto:andrewluotechnologies at outlook.com>> wrote:
>  
> Minor public -> private visibility fix.  Just noticed right after I sent it out?
>  
> Thanks,
>  
> -Andrew
>  
> From: hotspot-compiler-dev <hotspot-compiler-dev-bounces at openjdk.java.net <mailto:hotspot-compiler-dev-bounces at openjdk.java.net>> On Behalf Of Andrew Luo
> Sent: Friday, January 25, 2019 3:55 PM
> To: hotspot-compiler-dev at openjdk.java.net <mailto:hotspot-compiler-dev at openjdk.java.net>
> Subject: [PATCH] Enhance jaotc to automatically find VS2017+ linker
>  
> See attached patch.  Any feedback is welcome.
>  
> Tested on a system with only VS2017 installed, just ran jaotc with a simple class file, and got the expected .dll output with no errors?
>  
> Thanks,
>  
> -Andrew
>  
> <jaotcdiff2.txt>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190128/c978d7d6/attachment-0001.html>

From jatin.bhateja at intel.com  Tue Jan 29 04:18:22 2019
From: jatin.bhateja at intel.com (Bhateja, Jatin)
Date: Tue, 29 Jan 2019 04:18:22 +0000
Subject: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
Message-ID: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>

Hi All,

Please find attached a patch for intrinsification for java library methods Math.max and Math.min  for scalar floating point types (float and double).

New intrinsics match the semantics of java library function and takes care of NaN and signed zero operands.

Webrev : http://cr.openjdk.java.net/~sviswanathan/8217561/webrev.01/

Kindly review.

Regards,
Jatin


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190129/6d412ded/attachment-0001.html>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: jdk.patch.txt
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190129/6d412ded/jdk.patch-0001.txt>

From tobias.hartmann at oracle.com  Tue Jan 29 08:18:49 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 29 Jan 2019 09:18:49 +0100
Subject: RFR(T)[12] : 8207922 : ctw of jdk.security.auth failed with
 "Unexpected zero exit codebefore finishing all compilations"
In-Reply-To: <70A5BA4C-CDB3-4C74-ACEA-2FC692047691@oracle.com>
References: <70A5BA4C-CDB3-4C74-ACEA-2FC692047691@oracle.com>
Message-ID: <99a65a68-6950-c685-8d3e-3d6160b6a8fb@oracle.com>

Hi Igor,

looks good to me.

Best regards,
Tobias

On 28.01.19 22:23, Igor Ignatyev wrote:
> http://cr.openjdk.java.net/~iignatyev//8207922/webrev.00/index.html
>> 2 lines changed: 2 ins; 0 del; 0 mod;
> 
> Hi all,
> 
> could you please review this tiny patch for ctw tests? ctw tests failed if the line which has the last classname got collided with JVM output (from WhiteBox class). the fix redirects all JVM output to stderr, so it will not mix up w/ ctw-library output.
> 
> webrev: http://cr.openjdk.java.net/~iignatyev//8207922/webrev.00/index.html
> JBS: https://bugs.openjdk.java.net/browse/JDK-8207922
> 
> Thanks,
> -- Igor
> 

From rwestrel at redhat.com  Tue Jan 29 08:23:40 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Tue, 29 Jan 2019 09:23:40 +0100
Subject: RFR(T)[12] : 8207922 : ctw of jdk.security.auth failed with
 "Unexpected zero exit codebefore finishing all compilations"
In-Reply-To: <70A5BA4C-CDB3-4C74-ACEA-2FC692047691@oracle.com>
References: <70A5BA4C-CDB3-4C74-ACEA-2FC692047691@oracle.com>
Message-ID: <87d0of3m9f.fsf@redhat.com>


> webrev: http://cr.openjdk.java.net/~iignatyev//8207922/webrev.00/index.html

Looks good to me but there's a typo in the comment (cerr?).

Roland.

From tobias.hartmann at oracle.com  Tue Jan 29 08:28:00 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Tue, 29 Jan 2019 09:28:00 +0100
Subject: RFR: 8217922: Compiler dead code removal
In-Reply-To: <dbe69316-c290-3463-949a-02aca8702be4@oracle.com>
References: <dbe69316-c290-3463-949a-02aca8702be4@oracle.com>
Message-ID: <9676ae17-92fa-e57a-4a61-68feeeb4b027@oracle.com>

Hi Claes,

nice cleanup, looks good to me.

Best regards,
Tobias

On 28.01.19 22:32, Claes Redestad wrote:
> Hi,
> 
> please review this patch to remove dead code in the compiler area.
> 
> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217922
> Webrev: http://cr.openjdk.java.net/~redestad/8217922/open.00/
> 
> Testing: tier1+2
> 
> I've taken care to check every change to ensure it's not in use by some
> platform specific code on a platform that's not built/supported by
> Oracle, but help verifying this would be greatly appreciated.
> 
> I've also checked the patch applies cleanly to a few valhalla and amber
> branches to avoid excessive merge issues.
> 
> Thanks!
> 
> /Claes

From adinn at redhat.com  Tue Jan 29 09:55:00 2019
From: adinn at redhat.com (Andrew Dinn)
Date: Tue, 29 Jan 2019 09:55:00 +0000
Subject: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
In-Reply-To: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>
References: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>
Message-ID: <a02302f8-40c1-9758-eb1d-3598f5dd1f3d@redhat.com>

Hi Jatin,

On 29/01/2019 04:18, Bhateja, Jatin wrote:

> Please find attached a patch for intrinsification for java library
> methods Math.max and Math.min ?for scalar floating point types (float
> and double).
> 
> New intrinsics match the semantics of java library function and takes
> care of NaN and signed zero operands.
> 
> Webrev : http://cr.openjdk.java.net/~sviswanathan/8217561/webrev.01/
The patch file attachment came through all right. However, I am getting
"403 - Forbidden" when I try to read the files linked in the webrev.
Does Sandya need to change the permissions on the files?

regards,


Andrew Dinn
-----------
Senior Principal Software Engineer
Red Hat UK Ltd
Registered in England and Wales under Company Registration No. 03798903
Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From nils.eliasson at oracle.com  Tue Jan 29 10:02:59 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Tue, 29 Jan 2019 11:02:59 +0100
Subject: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
In-Reply-To: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>
References: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>
Message-ID: <00e98c3a-403e-215e-8b9b-6b8a57668286@oracle.com>

Hi Jatin,

The diffs in your webrev have the wrong permissions - they return "403 - 
Forbidden".

The patch is readable though, so I will have a look.

Regards,

Nils

On 2019-01-29 05:18, Bhateja, Jatin wrote:
>
> Hi All,
>
> Please find attached a patch for intrinsification for java library 
> methods Math.max and Math.min ?for scalar floating point types (float 
> and double).
>
> New intrinsics match the semantics of java library function and takes 
> care of NaN and signed zero operands.
>
> Webrev : http://cr.openjdk.java.net/~sviswanathan/8217561/webrev.01/ 
> <http://cr.openjdk.java.net/~sviswanathan/8217561/webrev.01/>
>
> Kindly review.
>
> Regards,
>
> Jatin
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190129/7ff9b61f/attachment.html>

From jatin.bhateja at intel.com  Tue Jan 29 10:20:26 2019
From: jatin.bhateja at intel.com (Bhateja, Jatin)
Date: Tue, 29 Jan 2019 10:20:26 +0000
Subject: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
In-Reply-To: <a02302f8-40c1-9758-eb1d-3598f5dd1f3d@redhat.com>
References: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>
 <a02302f8-40c1-9758-eb1d-3598f5dd1f3d@redhat.com>
Message-ID: <A66BBE673E08E1428E3A918AE4D5B32CED49B6@BGSMSX106.gar.corp.intel.com>

Hi Andrew,

Some permission issue seem to crop up in webrev, which is why sent the patch diff along. 

Hi Sandhya,

Kindly help in resolving this.

Regards,
Jatin

-----Original Message-----
From: Andrew Dinn [mailto:adinn at redhat.com] 
Sent: Tuesday, January 29, 2019 3:25 PM
To: Bhateja, Jatin <jatin.bhateja at intel.com>; hotspot-compiler-dev at openjdk.java.net
Subject: Re: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics

Hi Jatin,

On 29/01/2019 04:18, Bhateja, Jatin wrote:

> Please find attached a patch for intrinsification for java library 
> methods Math.max and Math.min ?for scalar floating point types (float 
> and double).
> 
> New intrinsics match the semantics of java library function and takes 
> care of NaN and signed zero operands.
> 
> Webrev : http://cr.openjdk.java.net/~sviswanathan/8217561/webrev.01/
The patch file attachment came through all right. However, I am getting
"403 - Forbidden" when I try to read the files linked in the webrev.
Does Sandya need to change the permissions on the files?

regards,


Andrew Dinn
-----------
Senior Principal Software Engineer
Red Hat UK Ltd
Registered in England and Wales under Company Registration No. 03798903
Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From nils.eliasson at oracle.com  Tue Jan 29 10:43:33 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Tue, 29 Jan 2019 11:43:33 +0100
Subject: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
In-Reply-To: <00e98c3a-403e-215e-8b9b-6b8a57668286@oracle.com>
References: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>
 <00e98c3a-403e-215e-8b9b-6b8a57668286@oracle.com>
Message-ID: <9384ed27-a508-b616-dcec-fe3a57b97ea3@oracle.com>

Hi again,

I can get the test to fail in the -XX:-TieredCompilation case:

java.lang.AssertionError: Unexpected result of double min/maxa = 4.823909266625017E17, b = -3.3333333333333331E17, result = (-3.333334338197927E17, 4.823908261760423E17), expected = (-3.3333333333333331E17, 4.823909266625017E17)
	at compiler.intrinsics.math.TestFpMinMaxIntrinsics.dTest(TestFpMinMaxIntrinsics.java:113)
	at java.base/java.util.Spliterators$ArraySpliterator.forEachRemaining(Spliterators.java:948)
	at java.base/java.util.stream.ReferencePipeline$Head.forEach(ReferencePipeline.java:658)
	at compiler.intrinsics.math.TestFpMinMaxIntrinsics.main(TestFpMinMaxIntrinsics.java:124)
	at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
	at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.base/java.lang.reflect.Method.invoke(Method.java:567)
	at com.sun.javatest.regtest.agent.MainWrapper$MainThread.run(MainWrapper.java:127)
	at java.base/java.lang.Thread.run(Thread.java:835)

Allow compilation of the methods in the test:

"-XX:CompileCommand=compileonly,compiler.intrinsics.math.TestFpMinMaxIntrinsics*"

And add a loop in the main method, so the test case is run multiple times.

Regards,

Nils


On 2019-01-29 11:02, Nils Eliasson wrote:
>
> Hi Jatin,
>
> The diffs in your webrev have the wrong permissions - they return "403 
> - Forbidden".
>
> The patch is readable though, so I will have a look.
>
> Regards,
>
> Nils
>
> On 2019-01-29 05:18, Bhateja, Jatin wrote:
>>
>> Hi All,
>>
>> Please find attached a patch for intrinsification for java library 
>> methods Math.max and Math.min ?for scalar floating point types (float 
>> and double).
>>
>> New intrinsics match the semantics of java library function and takes 
>> care of NaN and signed zero operands.
>>
>> Webrev : http://cr.openjdk.java.net/~sviswanathan/8217561/webrev.01/ 
>> <http://cr.openjdk.java.net/~sviswanathan/8217561/webrev.01/>
>>
>> Kindly review.
>>
>> Regards,
>>
>> Jatin
>>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190129/06d5a98e/attachment.html>

From jatin.bhateja at intel.com  Tue Jan 29 13:11:02 2019
From: jatin.bhateja at intel.com (Bhateja, Jatin)
Date: Tue, 29 Jan 2019 13:11:02 +0000
Subject: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
In-Reply-To: <9384ed27-a508-b616-dcec-fe3a57b97ea3@oracle.com>
References: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>
 <00e98c3a-403e-215e-8b9b-6b8a57668286@oracle.com>
 <9384ed27-a508-b616-dcec-fe3a57b97ea3@oracle.com>
Message-ID: <A66BBE673E08E1428E3A918AE4D5B32CED4A32@BGSMSX106.gar.corp.intel.com>

Hi Nils,

This failure will occur even when intrinsification for Max[FD]/Min[FD] which this patch supported is disabled.

Try passing another option -XX:DisableIntrinsic=_maxD  and test will still assert with non-intrinsified jitt'ed code, test case is not correct since its doing
an assert over direct floating point comparison.

Will update the patch.

Thanks,
Jatin

From: hotspot-compiler-dev [mailto:hotspot-compiler-dev-bounces at openjdk.java.net] On Behalf Of Nils Eliasson
Sent: Tuesday, January 29, 2019 4:14 PM
To: hotspot-compiler-dev at openjdk.java.net
Subject: Re: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics


Hi again,

I can get the test to fail in the -XX:-TieredCompilation case:

java.lang.AssertionError: Unexpected result of double min/maxa = 4.823909266625017E17, b = -3.3333333333333331E17, result = (-3.333334338197927E17, 4.823908261760423E17), expected = (-3.3333333333333331E17, 4.823909266625017E17)

        at compiler.intrinsics.math.TestFpMinMaxIntrinsics.dTest(TestFpMinMaxIntrinsics.java:113)

        at java.base/java.util.Spliterators$ArraySpliterator.forEachRemaining(Spliterators.java:948)

        at java.base/java.util.stream.ReferencePipeline$Head.forEach(ReferencePipeline.java:658)

        at compiler.intrinsics.math.TestFpMinMaxIntrinsics.main(TestFpMinMaxIntrinsics.java:124)

        at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

        at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)

        at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)

        at java.base/java.lang.reflect.Method.invoke(Method.java:567)

        at com.sun.javatest.regtest.agent.MainWrapper$MainThread.run(MainWrapper.java:127)

        at java.base/java.lang.Thread.run(Thread.java:835)

Allow compilation of the methods in the test:

"-XX:CompileCommand=compileonly,compiler.intrinsics.math.TestFpMinMaxIntrinsics*"

And add a loop in the main method, so the test case is run multiple times.

Regards,

Nils


On 2019-01-29 11:02, Nils Eliasson wrote:

Hi Jatin,

The diffs in your webrev have the wrong permissions - they return "403 - Forbidden".

The patch is readable though, so I will have a look.

Regards,

Nils
On 2019-01-29 05:18, Bhateja, Jatin wrote:
Hi All,

Please find attached a patch for intrinsification for java library methods Math.max and Math.min  for scalar floating point types (float and double).

New intrinsics match the semantics of java library function and takes care of NaN and signed zero operands.

Webrev : http://cr.openjdk.java.net/~sviswanathan/8217561/webrev.01/

Kindly review.

Regards,
Jatin


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190129/07695dc4/attachment.html>

From nils.eliasson at oracle.com  Tue Jan 29 13:25:21 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Tue, 29 Jan 2019 14:25:21 +0100
Subject: RFR: 8217922: Compiler dead code removal
In-Reply-To: <dbe69316-c290-3463-949a-02aca8702be4@oracle.com>
References: <dbe69316-c290-3463-949a-02aca8702be4@oracle.com>
Message-ID: <ae125087-1f44-5905-f08e-ebe5626bdcf7@oracle.com>

Hi Claes,

Nice clean up!

Reviewed,

// Nils

On 2019-01-28 22:32, Claes Redestad wrote:
> Hi,
>
> please review this patch to remove dead code in the compiler area.
>
> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217922
> Webrev: http://cr.openjdk.java.net/~redestad/8217922/open.00/
>
> Testing: tier1+2
>
> I've taken care to check every change to ensure it's not in use by some
> platform specific code on a platform that's not built/supported by
> Oracle, but help verifying this would be greatly appreciated.
>
> I've also checked the patch applies cleanly to a few valhalla and amber
> branches to avoid excessive merge issues.
>
> Thanks!
>
> /Claes

From claes.redestad at oracle.com  Tue Jan 29 13:37:12 2019
From: claes.redestad at oracle.com (Claes Redestad)
Date: Tue, 29 Jan 2019 14:37:12 +0100
Subject: RFR: 8217922: Compiler dead code removal
In-Reply-To: <ae125087-1f44-5905-f08e-ebe5626bdcf7@oracle.com>
References: <dbe69316-c290-3463-949a-02aca8702be4@oracle.com>
 <ae125087-1f44-5905-f08e-ebe5626bdcf7@oracle.com>
Message-ID: <556776fd-e244-fd45-0bc6-ccfdb62541cb@oracle.com>

Tobias, Nils,

thanks for reviewing!

/Claes

On 2019-01-29 14:25, Nils Eliasson wrote:
> Hi Claes,
> 
> Nice clean up!
> 
> Reviewed,
> 
> // Nils
> 
> On 2019-01-28 22:32, Claes Redestad wrote:
>> Hi,
>>
>> please review this patch to remove dead code in the compiler area.
>>
>> Bug:??? https://bugs.openjdk.java.net/browse/JDK-8217922
>> Webrev: http://cr.openjdk.java.net/~redestad/8217922/open.00/
>>
>> Testing: tier1+2
>>
>> I've taken care to check every change to ensure it's not in use by some
>> platform specific code on a platform that's not built/supported by
>> Oracle, but help verifying this would be greatly appreciated.
>>
>> I've also checked the patch applies cleanly to a few valhalla and amber
>> branches to avoid excessive merge issues.
>>
>> Thanks!
>>
>> /Claes

From nils.eliasson at oracle.com  Tue Jan 29 14:03:25 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Tue, 29 Jan 2019 15:03:25 +0100
Subject: [13] RFR (M): 6986483: CHA: optimize calls through interfaces
In-Reply-To: <77261b19-d2d2-683c-dde4-6f261e2b78a0@oracle.com>
References: <77261b19-d2d2-683c-dde4-6f261e2b78a0@oracle.com>
Message-ID: <10d3bbdf-c110-cb89-fdb2-1760776b486b@oracle.com>

Hi Vladimir,

A really good improvement, and I really like the test, excellent coverage.

Reviewed,

// Nils


On 2019-01-25 22:27, Vladimir Ivanov wrote:
> http://cr.openjdk.java.net/~vlivanov/6986483/webrev.01/
> https://bugs.openjdk.java.net/browse/JDK-6986483
>
> Another candidate for revival. At that time it was reviewed, but 
> integration was blocked pending another bug fix. Now the fix is in.
>
> Quote from original review request [1]:
>
> "Proposed change adds CHA support in C2 for interface calls.
>
> Consider the following hierarchy:
>
> ?? interface Intf { m(); }
> ?? class C implements Intf { public m() { ... } }
> ?? class C1 extends C { /* doesn't override m() */ }
> ?? ...
> ?? class Cn extends C { /* doesn't override m() */ }
>
> Call site: invokeinterface Intf.m() ...
>
> If Intf were an abstract class, CHA could deduce that Intf::m() can be
> replaced with C::m(), but it doesn't work for interfaces. Verifier
> doesn't check interface types in bytecode, so CHA can't assume the
> receiver implements Intf.
>
> CHA in C1 handles such call sites for interfaces with a single
> implementor. It replaces invokeinterface Intf.m() with invokevirtual
> C.m() guarded by a subtype check (instanceof C). C2 doesn't do that and
> this request is about adding that. Type profiling doesn't help here (the
> call site is usually megamorphic), so C2 can't inline it.
>
> The proposed implementation is similar to C1, except that the code
> deoptimizes when subtype check fails and ICCE is thrown from the
> interpreter.
>
> While working on it, I spotted and fixed a couple of inefficiencies in
> C1 implementation:
>
> ?? (1) dependency context being used was broader than necessary -
> resolved instead of declared interface (hence, possibility of
> unnecessary invalidations);
>
> ?? (2) didn't work for interfaces w/ any default methods: CHA doesn't
> support default methods at the moment, so what matters is whether
> Intf::m() is default or not and not whether Intf has *any* concrete 
> methods."
>
>
> Testing: hs-precheckin-comp, hs-tier1, hs-tier2
>
> Best regards,
> Vladimir Ivanov
>
> [1] 
> https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2017-February/025630.html

From sandhya.viswanathan at intel.com  Tue Jan 29 15:45:36 2019
From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya)
Date: Tue, 29 Jan 2019 15:45:36 +0000
Subject: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
In-Reply-To: <A66BBE673E08E1428E3A918AE4D5B32CED49B6@BGSMSX106.gar.corp.intel.com>
References: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>
 <a02302f8-40c1-9758-eb1d-3598f5dd1f3d@redhat.com>
 <A66BBE673E08E1428E3A918AE4D5B32CED49B6@BGSMSX106.gar.corp.intel.com>
Message-ID: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A54F39@FMSMSX126.amr.corp.intel.com>

Please use the following updated link to webrev for review:

http://cr.openjdk.java.net/~sviswanathan/Jatin/8217561/webrev.00/

Best Regards,
Sandhya


-----Original Message-----
From: Bhateja, Jatin 
Sent: Tuesday, January 29, 2019 2:20 AM
To: Andrew Dinn <adinn at redhat.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>
Cc: hotspot-compiler-dev at openjdk.java.net
Subject: RE: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics

Hi Andrew,

Some permission issue seem to crop up in webrev, which is why sent the patch diff along. 

Hi Sandhya,

Kindly help in resolving this.

Regards,
Jatin

-----Original Message-----
From: Andrew Dinn [mailto:adinn at redhat.com]
Sent: Tuesday, January 29, 2019 3:25 PM
To: Bhateja, Jatin <jatin.bhateja at intel.com>; hotspot-compiler-dev at openjdk.java.net
Subject: Re: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics

Hi Jatin,

On 29/01/2019 04:18, Bhateja, Jatin wrote:

> Please find attached a patch for intrinsification for java library 
> methods Math.max and Math.min ?for scalar floating point types (float 
> and double).
> 
> New intrinsics match the semantics of java library function and takes 
> care of NaN and signed zero operands.
> 
> Webrev : http://cr.openjdk.java.net/~sviswanathan/8217561/webrev.01/
The patch file attachment came through all right. However, I am getting
"403 - Forbidden" when I try to read the files linked in the webrev.
Does Sandya need to change the permissions on the files?

regards,


Andrew Dinn
-----------
Senior Principal Software Engineer
Red Hat UK Ltd
Registered in England and Wales under Company Registration No. 03798903
Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander

From vladimir.kozlov at oracle.com  Tue Jan 29 17:17:09 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 29 Jan 2019 09:17:09 -0800
Subject: RFR: 8217874: Shenandoah: AArch64: Clobbered register in
 ShenandoahBarrierSetAssembler::cmpxchg_oop()
In-Reply-To: <b66618f5-62dd-9255-8ce5-bb70be556afc@redhat.com>
References: <b66618f5-62dd-9255-8ce5-bb70be556afc@redhat.com>
Message-ID: <95675ba2-9697-ed4a-216b-61eebf5e36d8@oracle.com>

Hi Roman

Since this bug has fix version 12 you need to follow fix request procedure:
http://openjdk.java.net/jeps/3#Fix-Request-Process

Vladimir

On 1/28/19 4:46 AM, Roman Kennke wrote:
> In AArch64, when called from C2, in
> ShenandoahBarrierSetAssembler::cmpxchg_oop() the result register may
> overlap with other input argument registers and thus fail the leading
> assert, and lead to clobbered registers. In the body of the code block,
> a temporary register should be used instead, and result should only get
> filled in at the end.
> 
> Bug:
> https://bugs.openjdk.java.net/browse/JDK-8217874
> Webrev:
> http://cr.openjdk.java.net/~rkennke/JDK-8217874/webrev.00/
> 
> Testing: Some tests failed before (e.g. TestVerifyJCStress.java), those
> are good now. No regressions in hotspot_gc_shenandoah either.
> 
> Can I get a review please?
> 
> Thanks, Roman
> 

From igor.ignatyev at oracle.com  Tue Jan 29 18:10:43 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Tue, 29 Jan 2019 10:10:43 -0800
Subject: RFR(T)[12] : 8207922  : ctw of jdk.security.auth failed with
 "Unexpected zero exit codebefore finishing all compilations"
In-Reply-To: <87d0of3m9f.fsf@redhat.com>
References: <70A5BA4C-CDB3-4C74-ACEA-2FC692047691@oracle.com>
 <87d0of3m9f.fsf@redhat.com>
Message-ID: <28582EAA-F6F4-4DA2-A112-F81DDAC537BD@oracle.com>

Hi Roland,

thanks for your review, cerr isn't a typo, it's std::cerr, although if you think it's confusing I'll replace it w/ err.

-- Igor

> On Jan 29, 2019, at 12:23 AM, Roland Westrelin <rwestrel at redhat.com> wrote:
> 
> 
>> webrev: http://cr.openjdk.java.net/~iignatyev//8207922/webrev.00/index.html
> 
> Looks good to me but there's a typo in the comment (cerr?).
> 
> Roland.


From vladimir.kozlov at oracle.com  Tue Jan 29 18:26:10 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 29 Jan 2019 10:26:10 -0800
Subject: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
In-Reply-To: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A54F39@FMSMSX126.amr.corp.intel.com>
References: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>
 <a02302f8-40c1-9758-eb1d-3598f5dd1f3d@redhat.com>
 <A66BBE673E08E1428E3A918AE4D5B32CED49B6@BGSMSX106.gar.corp.intel.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A54F39@FMSMSX126.amr.corp.intel.com>
Message-ID: <fecf3d19-4d30-6a67-a4ce-d90f6df08f10@oracle.com>

Code change seems fine.

For -XX:TieredStopAtLevel=1 test command add explicit -XX:+TieredCompilation
Also both JIT (not -Xint) @run command add -XX:+IgnoreUnrecognizedVMOptions flag to make sure test works with all VM 
build variants.

Was the test fixed to solve issue found by Nils?

Thanks,
Vladimir

On 1/29/19 7:45 AM, Viswanathan, Sandhya wrote:
> Please use the following updated link to webrev for review:
> 
> http://cr.openjdk.java.net/~sviswanathan/Jatin/8217561/webrev.00/
> 
> Best Regards,
> Sandhya
> 
> 
> -----Original Message-----
> From: Bhateja, Jatin
> Sent: Tuesday, January 29, 2019 2:20 AM
> To: Andrew Dinn <adinn at redhat.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>
> Cc: hotspot-compiler-dev at openjdk.java.net
> Subject: RE: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
> 
> Hi Andrew,
> 
> Some permission issue seem to crop up in webrev, which is why sent the patch diff along.
> 
> Hi Sandhya,
> 
> Kindly help in resolving this.
> 
> Regards,
> Jatin
> 
> -----Original Message-----
> From: Andrew Dinn [mailto:adinn at redhat.com]
> Sent: Tuesday, January 29, 2019 3:25 PM
> To: Bhateja, Jatin <jatin.bhateja at intel.com>; hotspot-compiler-dev at openjdk.java.net
> Subject: Re: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
> 
> Hi Jatin,
> 
> On 29/01/2019 04:18, Bhateja, Jatin wrote:
> 
>> Please find attached a patch for intrinsification for java library
>> methods Math.max and Math.min ?for scalar floating point types (float
>> and double).
>>
>> New intrinsics match the semantics of java library function and takes
>> care of NaN and signed zero operands.
>>
>> Webrev : http://cr.openjdk.java.net/~sviswanathan/8217561/webrev.01/
> The patch file attachment came through all right. However, I am getting
> "403 - Forbidden" when I try to read the files linked in the webrev.
> Does Sandya need to change the permissions on the files?
> 
> regards,
> 
> 
> Andrew Dinn
> -----------
> Senior Principal Software Engineer
> Red Hat UK Ltd
> Registered in England and Wales under Company Registration No. 03798903
> Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander
> 

From sandhya.viswanathan at intel.com  Tue Jan 29 19:18:45 2019
From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya)
Date: Tue, 29 Jan 2019 19:18:45 +0000
Subject: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
In-Reply-To: <fecf3d19-4d30-6a67-a4ce-d90f6df08f10@oracle.com>
References: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>
 <a02302f8-40c1-9758-eb1d-3598f5dd1f3d@redhat.com>
 <A66BBE673E08E1428E3A918AE4D5B32CED49B6@BGSMSX106.gar.corp.intel.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A54F39@FMSMSX126.amr.corp.intel.com>
 <fecf3d19-4d30-6a67-a4ce-d90f6df08f10@oracle.com>
Message-ID: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A550D5@FMSMSX126.amr.corp.intel.com>

Hi Vladimir,

From what I understand the test is correct and the implementation needs to be fixed (the effect on registers a and b should be USE_KILL). 
Jatin plans to send an updated webrev after fixing the issue.

Best Regards,
Sandhya


-----Original Message-----
From: hotspot-compiler-dev [mailto:hotspot-compiler-dev-bounces at openjdk.java.net] On Behalf Of Vladimir Kozlov
Sent: Tuesday, January 29, 2019 10:26 AM
To: hotspot-compiler-dev at openjdk.java.net
Subject: Re: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics

Code change seems fine.

For -XX:TieredStopAtLevel=1 test command add explicit -XX:+TieredCompilation
Also both JIT (not -Xint) @run command add -XX:+IgnoreUnrecognizedVMOptions flag to make sure test works with all VM 
build variants.

Was the test fixed to solve issue found by Nils?

Thanks,
Vladimir

On 1/29/19 7:45 AM, Viswanathan, Sandhya wrote:
> Please use the following updated link to webrev for review:
> 
> http://cr.openjdk.java.net/~sviswanathan/Jatin/8217561/webrev.00/
> 
> Best Regards,
> Sandhya
> 
> 
> -----Original Message-----
> From: Bhateja, Jatin
> Sent: Tuesday, January 29, 2019 2:20 AM
> To: Andrew Dinn <adinn at redhat.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>
> Cc: hotspot-compiler-dev at openjdk.java.net
> Subject: RE: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
> 
> Hi Andrew,
> 
> Some permission issue seem to crop up in webrev, which is why sent the patch diff along.
> 
> Hi Sandhya,
> 
> Kindly help in resolving this.
> 
> Regards,
> Jatin
> 
> -----Original Message-----
> From: Andrew Dinn [mailto:adinn at redhat.com]
> Sent: Tuesday, January 29, 2019 3:25 PM
> To: Bhateja, Jatin <jatin.bhateja at intel.com>; hotspot-compiler-dev at openjdk.java.net
> Subject: Re: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
> 
> Hi Jatin,
> 
> On 29/01/2019 04:18, Bhateja, Jatin wrote:
> 
>> Please find attached a patch for intrinsification for java library
>> methods Math.max and Math.min ?for scalar floating point types (float
>> and double).
>>
>> New intrinsics match the semantics of java library function and takes
>> care of NaN and signed zero operands.
>>
>> Webrev : http://cr.openjdk.java.net/~sviswanathan/8217561/webrev.01/
> The patch file attachment came through all right. However, I am getting
> "403 - Forbidden" when I try to read the files linked in the webrev.
> Does Sandya need to change the permissions on the files?
> 
> regards,
> 
> 
> Andrew Dinn
> -----------
> Senior Principal Software Engineer
> Red Hat UK Ltd
> Registered in England and Wales under Company Registration No. 03798903
> Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander
> 

From vladimir.kozlov at oracle.com  Tue Jan 29 20:56:17 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 29 Jan 2019 12:56:17 -0800
Subject: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
In-Reply-To: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A550D5@FMSMSX126.amr.corp.intel.com>
References: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>
 <a02302f8-40c1-9758-eb1d-3598f5dd1f3d@redhat.com>
 <A66BBE673E08E1428E3A918AE4D5B32CED49B6@BGSMSX106.gar.corp.intel.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A54F39@FMSMSX126.amr.corp.intel.com>
 <fecf3d19-4d30-6a67-a4ce-d90f6df08f10@oracle.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A550D5@FMSMSX126.amr.corp.intel.com>
Message-ID: <bab2a04f-cb55-66d3-4f97-0ecce69c3e3f@oracle.com>

Okay. I will wait next version.

Thanks,
Vladimir

On 1/29/19 11:18 AM, Viswanathan, Sandhya wrote:
> Hi Vladimir,
> 
>  From what I understand the test is correct and the implementation needs to be fixed (the effect on registers a and b should be USE_KILL).
> Jatin plans to send an updated webrev after fixing the issue.
> 
> Best Regards,
> Sandhya
> 
> 
> -----Original Message-----
> From: hotspot-compiler-dev [mailto:hotspot-compiler-dev-bounces at openjdk.java.net] On Behalf Of Vladimir Kozlov
> Sent: Tuesday, January 29, 2019 10:26 AM
> To: hotspot-compiler-dev at openjdk.java.net
> Subject: Re: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
> 
> Code change seems fine.
> 
> For -XX:TieredStopAtLevel=1 test command add explicit -XX:+TieredCompilation
> Also both JIT (not -Xint) @run command add -XX:+IgnoreUnrecognizedVMOptions flag to make sure test works with all VM
> build variants.
> 
> Was the test fixed to solve issue found by Nils?
> 
> Thanks,
> Vladimir
> 
> On 1/29/19 7:45 AM, Viswanathan, Sandhya wrote:
>> Please use the following updated link to webrev for review:
>>
>> http://cr.openjdk.java.net/~sviswanathan/Jatin/8217561/webrev.00/
>>
>> Best Regards,
>> Sandhya
>>
>>
>> -----Original Message-----
>> From: Bhateja, Jatin
>> Sent: Tuesday, January 29, 2019 2:20 AM
>> To: Andrew Dinn <adinn at redhat.com>; Viswanathan, Sandhya <sandhya.viswanathan at intel.com>
>> Cc: hotspot-compiler-dev at openjdk.java.net
>> Subject: RE: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
>>
>> Hi Andrew,
>>
>> Some permission issue seem to crop up in webrev, which is why sent the patch diff along.
>>
>> Hi Sandhya,
>>
>> Kindly help in resolving this.
>>
>> Regards,
>> Jatin
>>
>> -----Original Message-----
>> From: Andrew Dinn [mailto:adinn at redhat.com]
>> Sent: Tuesday, January 29, 2019 3:25 PM
>> To: Bhateja, Jatin <jatin.bhateja at intel.com>; hotspot-compiler-dev at openjdk.java.net
>> Subject: Re: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
>>
>> Hi Jatin,
>>
>> On 29/01/2019 04:18, Bhateja, Jatin wrote:
>>
>>> Please find attached a patch for intrinsification for java library
>>> methods Math.max and Math.min ?for scalar floating point types (float
>>> and double).
>>>
>>> New intrinsics match the semantics of java library function and takes
>>> care of NaN and signed zero operands.
>>>
>>> Webrev : http://cr.openjdk.java.net/~sviswanathan/8217561/webrev.01/
>> The patch file attachment came through all right. However, I am getting
>> "403 - Forbidden" when I try to read the files linked in the webrev.
>> Does Sandya need to change the permissions on the files?
>>
>> regards,
>>
>>
>> Andrew Dinn
>> -----------
>> Senior Principal Software Engineer
>> Red Hat UK Ltd
>> Registered in England and Wales under Company Registration No. 03798903
>> Directors: Michael Cunningham, Michael ("Mike") O'Neill, Eric Shander
>>

From vladimir.x.ivanov at oracle.com  Tue Jan 29 22:09:17 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Tue, 29 Jan 2019 14:09:17 -0800
Subject: [13] RFR (M): 6986483: CHA: optimize calls through interfaces
In-Reply-To: <10d3bbdf-c110-cb89-fdb2-1760776b486b@oracle.com>
References: <77261b19-d2d2-683c-dde4-6f261e2b78a0@oracle.com>
 <10d3bbdf-c110-cb89-fdb2-1760776b486b@oracle.com>
Message-ID: <2d6eb23a-877b-6938-2460-f58b36c8e3ac@oracle.com>

Thanks, Nils.

Additional testing revealed one problematic corner case - Object methods 
on interfaces. The transformation is not valid for them, because it 
eliminates proper receiver subtype check: subtype check against Object 
is a no-op.

Updated version:
   http://cr.openjdk.java.net/~vlivanov/6986483/webrev.02/

The changes in c1_GraphBuilder.cpp and doCall.cpp are trivial 
(cha_monomorphic_target->holder() != Object_klass()), but I extended the 
test with more cases.

Testing: hs-precheckin-comp, tier1-5

Best regards,
Vladimir Ivanov

On 29/01/2019 06:03, Nils Eliasson wrote:
> Hi Vladimir,
> 
> A really good improvement, and I really like the test, excellent coverage.
> 
> Reviewed,
> 
> // Nils
> 
> 
> On 2019-01-25 22:27, Vladimir Ivanov wrote:
>> http://cr.openjdk.java.net/~vlivanov/6986483/webrev.01/
>> https://bugs.openjdk.java.net/browse/JDK-6986483
>>
>> Another candidate for revival. At that time it was reviewed, but 
>> integration was blocked pending another bug fix. Now the fix is in.
>>
>> Quote from original review request [1]:
>>
>> "Proposed change adds CHA support in C2 for interface calls.
>>
>> Consider the following hierarchy:
>>
>> ?? interface Intf { m(); }
>> ?? class C implements Intf { public m() { ... } }
>> ?? class C1 extends C { /* doesn't override m() */ }
>> ?? ...
>> ?? class Cn extends C { /* doesn't override m() */ }
>>
>> Call site: invokeinterface Intf.m() ...
>>
>> If Intf were an abstract class, CHA could deduce that Intf::m() can be
>> replaced with C::m(), but it doesn't work for interfaces. Verifier
>> doesn't check interface types in bytecode, so CHA can't assume the
>> receiver implements Intf.
>>
>> CHA in C1 handles such call sites for interfaces with a single
>> implementor. It replaces invokeinterface Intf.m() with invokevirtual
>> C.m() guarded by a subtype check (instanceof C). C2 doesn't do that and
>> this request is about adding that. Type profiling doesn't help here (the
>> call site is usually megamorphic), so C2 can't inline it.
>>
>> The proposed implementation is similar to C1, except that the code
>> deoptimizes when subtype check fails and ICCE is thrown from the
>> interpreter.
>>
>> While working on it, I spotted and fixed a couple of inefficiencies in
>> C1 implementation:
>>
>> ?? (1) dependency context being used was broader than necessary -
>> resolved instead of declared interface (hence, possibility of
>> unnecessary invalidations);
>>
>> ?? (2) didn't work for interfaces w/ any default methods: CHA doesn't
>> support default methods at the moment, so what matters is whether
>> Intf::m() is default or not and not whether Intf has *any* concrete 
>> methods."
>>
>>
>> Testing: hs-precheckin-comp, hs-tier1, hs-tier2
>>
>> Best regards,
>> Vladimir Ivanov
>>
>> [1] 
>> https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2017-February/025630.html 
>>

From vladimir.kozlov at oracle.com  Tue Jan 29 22:33:59 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 29 Jan 2019 14:33:59 -0800
Subject: [PATCH] Enhance jaotc to automatically find VS2017+ linker
In-Reply-To: <MWHPR13MB1696FD3691C0A0080506666EA19B0@MWHPR13MB1696.namprd13.prod.outlook.com>
References: <MWHPR13MB16964D36E03D218A7AD12FC3A19B0@MWHPR13MB1696.namprd13.prod.outlook.com>
 <MWHPR13MB1696FD3691C0A0080506666EA19B0@MWHPR13MB1696.namprd13.prod.outlook.com>
Message-ID: <d9d8e59b-2cc9-3fd7-349f-33a45539fefe@oracle.com>

Looks good.

Thanks,
Vladimir

On 1/25/19 3:56 PM, Andrew Luo wrote:
> Minor public -> private visibility fix.? Just noticed right after I sent it out?
> 
> Thanks,
> 
> -Andrew
> 
> *From:* hotspot-compiler-dev <hotspot-compiler-dev-bounces at openjdk.java.net> *On Behalf Of *Andrew Luo
> *Sent:* Friday, January 25, 2019 3:55 PM
> *To:* hotspot-compiler-dev at openjdk.java.net
> *Subject:* [PATCH] Enhance jaotc to automatically find VS2017+ linker
> 
> See attached patch.? Any feedback is welcome.
> 
> Tested on a system with only VS2017 installed, just ran jaotc with a simple class file, and got the expected .dll output 
> with no errors?
> 
> Thanks,
> 
> -Andrew
> 

From vladimir.kozlov at oracle.com  Tue Jan 29 22:48:43 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 29 Jan 2019 14:48:43 -0800
Subject: RFR(S): 8087128 C2: Disallow definition split on MachCopySpill
 nodes
In-Reply-To: <2f287f46-0bfb-b489-83b9-ba8e4e54199e@oracle.com>
References: <2f287f46-0bfb-b489-83b9-ba8e4e54199e@oracle.com>
Message-ID: <8858054d-b05f-b82e-cdc7-b336eb5df0d5@oracle.com>

Thank you, Nils

Looks good to me.

Please run hs-precheckin-comp testing too.

Vladimir

On 1/28/19 7:15 AM, Nils Eliasson wrote:
> Hi,
> 
> We have a problem that we sometimes hit an assert in reg_split.cpp.
> 
> https://bugs.openjdk.java.net/browse/JDK-8087128
> 
> http://cr.openjdk.java.net/~neliasso/8087128/webrev.01/
> 
> We have a block that looks like this:
> 
> 1262: #??????? B1264 B1263 <- N7283? Freq: 0,0927179
>  ?7282?? Region? ===? 7282? 1704? [[ 7282? 1702? 1715 ]]
>  ?11500? MemToRegSpillCopy?????? === _? 11190? [[ 9184 ]] Oop:com/sun/tools/javac/$
>  ?9184?? DefinitionSpillCopy???? === _? 11500? [[ 9185? 1702 11503 11505 ]]?? Oop:$
>  ?11501? MemToRegSpillCopy?????? === _? 9169? [[ 11665? 11506 11504 11502 ]]?? Oop$
>  ?11645? MemToRegSpillCopy?????? === _? 9167? [[ 9182 ]] Oop:com/sun/tools/javac/c$
>  ?9182?? DefinitionSpillCopy???? === _? 11645? [[ 1702? 11646 11647 11648 ]]?? Oop$
>  ?1715?? checkCastPP???? ===? 7282? 1716? [[ 1702 ]] java/util/HashMap$TreeNode:NotN$
>  ?9185?? BoundSpillCopy? === _? 9184? [[ 1702 ]] Oop:com/sun/tools/javac/code/Symb$
>  ?11665? RegToMemSpillCopy?????? === _? 11501? [[ 1702 ]] Oop:com/sun/tools/javac/$
>  ?1702?? CallStaticJavaDirect??? ===? 7282? 185? 182? 16? 0? 1715 187 9185? 11665 $
>  ?1703?? MachProj??????? ===? 1702? [[]] #10006/fat
> 
> 
> 11501 "MemToRegSpillCopy" has one use in this block, "11665 RegToMemSpillCopy", and three uses in other blocks. The use 
> "11665 RegToMemSpillCopy" is used by "1702 CallStaticJavaDirect".
> 
> We hit the assert when processing 11501 in PhaseChaitin::Split.
> 
> We are in the "Handle DEFS" section of the split routine. We only get here if the live range has been marked as spilled 
> when the coloring have ran out of colors. There are two code paths, one default, where we just record the def and 
> updates the side tables, and one where we do a definition split on the live range. This split is guarded by several 
> conditions. In this case we get here by having a register mask that is only regs (UP) and by being in a high pressure 
> region. Everything seems ok, so why don't we end up with MachSpillCopies here more often?
> 
> Also - one of the 4 uses is in this block (11655) and it's a reg-to-mem already. It doesn't make much sense to add even 
> more spills here. Why does this happen?
> 
> UseFPUForSpilling added a restriction to coalesing - it skips coalescing when the two live ranges have different 
> pressure. The reason for this is that with FPU spilling, the possible extra spilling is for "free". (I can't find any 
> documentation on benchmarks where this is beneficial though.) The downside is that we get longer spill chains like: 
> DefSpill-memToReg-RegToMem-MemToReg, that doesn't collapse, because of pressure changes. This may cause the live ranges 
> defined by memToReg-nodes to become spilled, and if we are in a high-pressure region - we hit the assert.
> 
> So my conclusion is that nothing is really wrong. Everything still works without the assert. The spill-chains are 
> unnecessary long, but only because we have chosen to restrict the coalescing. But we shouldn't split the spill-nodes 
> even more. In the next iteration the coalescing within the block will have reduced the chains, and later a proper 
> coloring will be found.
> 
> My solution is that we prevent the MachSpillCopies (only Mem-To-Regs can end up here) from being split again. This is ok 
> - because this is exactly what would have happened if we would have been in a low pressure region.
> 
> I have done some measurements and it doesn't increase the number of spill-iterations.
> 
> Regards,
> 
> Nils
> 
> 
> 

From vladimir.kozlov at oracle.com  Wed Jan 30 00:19:21 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Tue, 29 Jan 2019 16:19:21 -0800
Subject: 8217465: RFR(S): [REDO] - Optimize CodeHeap Analytics
In-Reply-To: <B2E29569-592F-4963-9D91-7E774501129A@sap.com>
References: <B2E29569-592F-4963-9D91-7E774501129A@sap.com>
Message-ID: <71c4e476-ebf4-d739-f53b-ecaf9cd06cd0@oracle.com>

Looks good.

It passed tier1,hstier2 testing. It builds on all our platforms.

Thanks,
Vladimir

On 1/28/19 5:13 AM, Schmidt, Lutz wrote:
> Hi all,
> 
> may I please request reviews for this REDO of JDK-8217250. The only relevant difference of this REDO is that I moved the
>    #define USE_BUFFEREDSTREAM
> line further down. It is now located after all the #include statements.
> 
> The changeset is included in our inhouse tests since Jan 23rd with no issues detected. It was submitted to jdk/submit on Jan 25th with one failure reported on windows-x64 (see attachment). I cannot relate the failure to my changes. Could someone please have a look at the logs? If the reported failure is a false positive, here are the bug and webrev links for your reviews:
> 
> Bug:    https://bugs.openjdk.java.net/browse/JDK-8217465
> Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217465.00/
> 
> Thanks a lot!
> Lutz
> 
>   
> 

From vladimir.x.ivanov at oracle.com  Wed Jan 30 02:19:29 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Tue, 29 Jan 2019 18:19:29 -0800
Subject: Why does call_site_target keep changing for a Nashorn method?
In-Reply-To: <0B7E6CB4-0953-4C48-8F36-823D43E354D0@amazon.com>
References: <616C8E42-4B18-405B-B28A-C9F062EC9B6C@amazon.com>
 <186d49e7-daa9-a0db-b0c6-1b9d4ff2adda@oracle.com>
 <837C4B07-9A3F-4459-A625-12F82C9E604F@amazon.com>
 <30a97290-71c5-c445-cfaf-f8eda14fdfba@oracle.com>
 <0B7E6CB4-0953-4C48-8F36-823D43E354D0@amazon.com>
Message-ID: <7991d700-1825-6b97-088a-5d92df1e0972@oracle.com>

Thanks for experimenting with a fix!

> What's about this change? It can pass hotspot-tier1 tests.
>  From my  observation, my application sticks with C1 because it's MDO::_invocation_counter can't increment.

Can you elaborate a bit here? Is it because invocation counter decays 
faster than C2 gets a change to compile it?

> diff -r d02f1f4ff3a6 src/hotspot/share/ci/ciEnv.cpp
> --- a/src/hotspot/share/ci/ciEnv.cpp	Thu Jan 24 14:22:50 2019 -0800
> +++ b/src/hotspot/share/ci/ciEnv.cpp	Thu Jan 24 23:35:52 2019 -0800
> @@ -939,6 +939,11 @@
>     if (result != Dependencies::end_marker) {
>       if (result == Dependencies::call_site_target_value) {
>         _inc_decompile_count_on_failure = false;
> +
> +      MethodData* mdo = target->get_Method()->method_data();
> +      if (mdo != NULL) {
> +        mdo->invocation_counter()->decay();
> +      }
>         record_failure("call site target change");
>       } else if (Dependencies::is_klass_type(result)) {
>         record_failure("concurrent class loading");
> 

I believe MethodHandles::flush_dependent_nmethods() should do something 
similar.

> I am not sure if I should update ciMethodData::_invocation_counter as well. There's no mutator function for it.
> It looks that nobody consumes ciMethodDate::_invocation_counter.

It should be updated by ciMethodData::load_data().

> I believe I can shake off the unstable dynamic functions in inliner.  Do you think it can solve  JDK-8147550?
> In Dependencies::check_call_site_target_value, we have the exact callsite oop.

> We can add a new node type called 'DynamicCallData" in MethodData.hpp. Each dynamic call node  attaches a profiling node. It contains information like "call site target changes".
> Inliner uses the profiling data to determine inline or not.

I'm not too optimistic about solving JDK-8147550 in such vein. As 
comment from John says:

"There is a serious design tension here, though: Some users apparently 
are willing to endure an infinite series of recompilations as part of 
the cost of doing business; JDK-7177745 addresses this need by turning 
off the fail-safe against (accidental, buggy) infinite recompilation for 
unstable CSs. Other users might find that having a percentage of machine 
time devoted to recompilation is a problem."

Tweaking inlining will definitely make some users unhappy.

The only viable option I see is to reduce the overhead of excessive 
recompilation by delaying recompilations and thus increasing warmup period.

For example, in tiered mode:
   * tier 2-3 can conservatively avoid inlining through unstable CallSites;
   * tier 4 always inline through CallSites, but use exponential (?) 
backoff to guide recompilations (it's worth to consider lower & upper 
limits for periods between recompilations);

With such design, execution always stays in tier2-3 (C1) in the worst 
case, but there's always a chance left for the code to jump into tier4 
once execution stabilizes.

> Btw, this patch in 2011 seems to abort inlining based on the profiling data of callsite.  my idea is same.
> http://cr.openjdk.java.net/~twisti/7087838/src/share/vm/opto/callGenerator.cpp.udiff.html
Unfortunately, it doesn't fit some important use cases as stated in 
JDK-7087838:

"The consensus among language runtime implementors is that they want 
control over switch points (and thus call sites) and so it's their 
responsibility to handle extensive invalidation of such."

Best regards,
Vladimir Ivanov

> ?On 1/18/19, 5:06 PM, "Vladimir Ivanov" <vladimir.x.ivanov at oracle.com> wrote:
> 
>      
>      > Thank you for the response. After reading your email and associated RFEs,  now I got the background story.
>      > I understand the design decision in hotspot.
>      >
>      > In my case, compiler thread crowds out the app thread because we run application in docker with 1 CPU.
>      > Is it good idea that we decay the invocation counts of the methods if they fail due to 'call_site_target value change?'
>      
>      Yes, sounds reasonable. I believe compilation bailed out due to
>      invalidated call_site_target dependency should be treated as if it were
>      a deoptimization with Action_reinterpret, but resetting invocation
>      counts may be too much. So, decaying counters instead sounds reasonable.
>      
>      Also, it's hard to tell what method to act on: problematic CallSite may
>      be located somewhere deep in inline tree, but only root method is known.
>      
>      Best regards,
>      Vladimir Ivanov
>      
>      > On 1/17/19, 2:36 PM, "Vladimir Ivanov" <vladimir.x.ivanov at oracle.com> wrote:
>      >
>      >      C1/C2 optimistically inline through CallSite instances even if those are
>      >      mutable (MutableCallSite/VolatileCallSite). It requires a nmethod
>      >      dependency and once CallSite target changes, all dependent nmethods
>      >      should be invalidated. If such change happens during compilation,
>      >      nmethod installation fails.
>      >
>      >      That's exactly what you observe: the dependency is recorded during
>      >      inlining, but failed verification during installation.
>      >
>      >      Regarding the observed behavior, it is well-known [1] [2] and was a
>      >      deliberate choice. As JDK-7087838 [1] states:
>      >
>      >      "The consensus among language runtime implementors is that they want
>      >      control over switch points (and thus call sites) and so it's their
>      >      responsibility to handle extensive invalidation of such."
>      >
>      >      So, such pathological behavior is treated as a bug in user code (Nashorn
>      >      in this particular case).
>      >
>      >      There's an RFE filed [3] to consider alternative options for unstable
>      >      calls.
>      >
>      >      Best regards,
>      >      Vladimir Ivanov
>      >
>      >      [1] https://bugs.openjdk.java.net/browse/JDK-7087838
>      >      [2] https://bugs.openjdk.java.net/browse/JDK-7177745
>      >      [3] https://bugs.openjdk.java.net/browse/JDK-8147550
>      >
>      >      On 16/01/2019 14:04, Liu, Xin wrote:
>      >      > In one of our applications, C1/C2 keeps compiling a Javascript method
>      >      > generated by Nashorn but the code fails a dependency check right before
>      >      > installing in the code cache. This is with JDK tip.
>      >      >
>      >      > It can?t pass ?Dependencies::check_call_site_target_value?.
>      >      >
>      >      > [C2 Parsing]
>      >      >
>      >      > <bc code='182' bci='1'/>
>      >      >
>      >      > <dependency type='unique_concrete_method' ctxk='1131' x='1141'/>
>      >      >
>      >      > <call method='1141' count='878838' prof_factor='0.024122' inline='1'/>
>      >      >
>      >      > <inline_success reason='accessor'/>
>      >      >
>      >      > <parse method='1141' uses='21249.000000' stamp='1112.538'>
>      >      >
>      >      > <bc code='180' bci='1'/>
>      >      >
>      >      > <unknown id='1556'/>
>      >      >
>      >      > <unknown id='1866'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1556' x='1866'/>
>      >      >
>      >      > <parse_done nodes='41636' live='12226' memory='12130984' stamp='1112.538'/>
>      >      >
>      >      > </parse>
>      >      >
>      >      > [Validating compilation dependencies]
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1132' x='1143'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1334' x='1337'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1424' x='1425'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1437' x='1438'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1454' x='1455'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1465' x='1466'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1482' x='1483'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1498' x='1499'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1509' x='1510'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1526' x='1576'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1528' x='1667'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1536' x='1692'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1537' x='1707'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1538' x='1730'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1539' x='1746'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1540' x='1787'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1550' x='1804'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1553' x='1820'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1554' x='1836'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1555' x='1849'/>
>      >      >
>      >      > <dependency type='call_site_target_value' x0='1556' x='1866'/>
>      >      >
>      >      > <dependency_failed type='call_site_target_value' x0='1556' x='1866'
>      >      > witness='jdk/nashorn/internal/runtime/linker/LinkerCallSite'
>      >      > stamp='1113.578'/>
>      >      >
>      >      > It?s related to the GWT methodHandle.  The 2 mismatched methodhandles
>      >      > are very similar except for argL3, which is an int[2].
>      >      >
>      >      > Even though arg0-2 are not identical objects, their contents are same.
>      >      >
>      >      > (gdb)call java_lang_invoke_CallSite::target(call_site)->print()
>      >      >
>      >      > java.lang.invoke.BoundMethodHandle$Species_LLLL
>      >      >
>      >      > {0x00000000f586ca98}-
>      >      > klass:'java/lang/invoke/BoundMethodHandle$Species_LLLL'
>      >      >
>      >      > - ---- fields(total size 6 words):
>      >      >
>      >      > -'customizationCount''B'@12 0
>      >      >
>      >      > - private final'type''Ljava/lang/invoke/MethodType;'@16
>      >      > a'java/lang/invoke/MethodType'{0x00000000e21e2878}=(Ljava/lang/Object;Ljdk/nashorn/internal/runtime/Undefined;Ljava/lang/Object;)Ljava/lang/Object;(e21e2878)
>      >      >
>      >      > - final'form''Ljava/lang/invoke/LambdaForm;'@20
>      >      > a'java/lang/invoke/LambdaForm'{0x00000000e1e4a670}=>a'java/lang/invoke/MemberName'{0x00000000e1e4a938}={method}{0x00007fffa512cb68}'guard''(Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;)Ljava/lang/Object;'in'java/lang/invoke/LambdaForm$MH'(e1e4a670)
>      >      >
>      >      > -'asTypeCache''Ljava/lang/invoke/MethodHandle;'@24 NULL(0)
>      >      >
>      >      > - final'argL0''Ljava/lang/Object;'@28
>      >      > a'java/lang/invoke/BoundMethodHandle$Species_LL'{0x00000000f586c9e8}(f586c9e8)
>      >      >
>      >      > - final'argL1''Ljava/lang/Object;'@32
>      >      > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f586ca28}(f586ca28)
>      >      >
>      >      > - final'argL2''Ljava/lang/Object;'@36
>      >      > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f586ca60}(f586ca60)
>      >      >
>      >      > - final'argL3''Ljava/lang/Object;'@40 [I{0x00000000f586ca10}(f586ca10)
>      >      >
>      >      > (gdb)call method_handle->print()
>      >      >
>      >      > java.lang.invoke.BoundMethodHandle$Species_LLLL
>      >      >
>      >      > {0x00000000f6b18500}-
>      >      > klass:'java/lang/invoke/BoundMethodHandle$Species_LLLL'
>      >      >
>      >      > - ---- fields(total size 6 words):
>      >      >
>      >      > -'customizationCount''B'@12 0
>      >      >
>      >      > - private final'type''Ljava/lang/invoke/MethodType;'@16
>      >      > a'java/lang/invoke/MethodType'{0x00000000e21e2878}=(Ljava/lang/Object;Ljdk/nashorn/internal/runtime/Undefined;Ljava/lang/Object;)Ljava/lang/Object;(e21e2878)
>      >      >
>      >      > - final'form''Ljava/lang/invoke/LambdaForm;'@20
>      >      > a'java/lang/invoke/LambdaForm'{0x00000000e1e4a670}=>a'java/lang/invoke/MemberName'{0x00000000e1e4a938}={method}{0x00007fffa512cb68}'guard''(Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;Ljava/lang/Object;)Ljava/lang/Object;'in'java/lang/invoke/LambdaForm$MH'(e1e4a670)
>      >      >
>      >      > -'asTypeCache''Ljava/lang/invoke/MethodHandle;'@24 NULL(0)
>      >      >
>      >      > - final'argL0''Ljava/lang/Object;'@28
>      >      > a'java/lang/invoke/BoundMethodHandle$Species_LL'{0x00000000f6b18450}(f6b18450)
>      >      >
>      >      > - final'argL1''Ljava/lang/Object;'@32
>      >      > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f6b18490}(f6b18490)
>      >      >
>      >      > - final'argL2''Ljava/lang/Object;'@36
>      >      > a'java/lang/invoke/MethodHandleImpl$CountingWrapper'{0x00000000f6b184c8}(f6b184c8)
>      >      >
>      >      > - final'argL3''Ljava/lang/Object;'@40 [I{0x00000000f6b18478}(f6b18478)
>      >      >
>      >      > My guess is argL3 is counters in Java.lang.invoke.MethodHandleImpl.
>      >      >
>      >      > // Intrinsified by C2. Counters are used during parsing to calculate
>      >      > branch frequencies.
>      >      > @LambdaForm.Hidden
>      >      > @jdk.internal.HotSpotIntrinsicCandidate
>      >      > static
>      >      > boolean profileBoolean(boolean result, int[] counters) {
>      >      > // Profile is int[2] where [0] and [1] correspond to false and true
>      >      > occurrences respectively.
>      >      > int idx = result ? 1 : 0;
>      >      >      try {
>      >      >          counters[idx] = Math./addExact/(counters[idx], 1);
>      >      > } catch (ArithmeticException e) {
>      >      > // Avoid continuous overflow by halving the problematic count.
>      >      > counters[idx] = counters[idx] / 2;
>      >      > }
>      >      > return result;
>      >      > }
>      >      >
>      >      > I am still struggling to understand the source code in
>      >      > java.lang.invoke.*.  Could anybody enlighten me why the target of the
>      >      > callsite changes every time here?  it is relative to this profiling thing?
>      >      >
>      >      > In validation log, it has validated the dep ?dependency
>      >      > type='call_site_target_value' x0='1556' x='1866'? above. Why it can?t
>      >      > pass it after then? My guess is one MH object has been changed by
>      >      > another Java thread.
>      >      >
>      >      > One interesting fact that compiler thread can?t pass 22^th dep.  My
>      >      > tuition is it goes over an unknown threshold.
>      >      >
>      >      > The 2nd question is about ciEnv:: validate_compile_task_dependencies.
>      >      >   Why does failure of call_site_target_value_changed not count as a deopt?
>      >      >
>      >      > The flag  _inc_decompile_count_on_failure =false stops MDO to mark this
>      >      > method ?not_compileable?.  C2 doesn?t set the flag, so C2 ends up
>      >      > compiling it over and over, which makes C2 a cpu hog. Here?s the code in
>      >      > validate_compile_task_dependencies
>      >      >
>      >      >    bool counter_changed = system_dictionary_modification_counter_changed();
>      >      >
>      >      >    Dependencies::DepType result =
>      >      > dependencies()->validate_dependencies(_task, counter_changed);
>      >      >
>      >      >    if (result != Dependencies::end_marker) {
>      >      >
>      >      >      if (result == Dependencies::call_site_target_value) {
>      >      >
>      >      >        _inc_decompile_count_on_failure = false;
>      >      >
>      >      >        record_failure("call site target change");
>      >      >
>      >      > Maybe the right thing to do is to count this as a deopt and change the
>      >      > deopt limit computation to take into account the size of the method in
>      >      > nodes, just as done for abandoning compilation if the graph is too big.
>      >      >
>      >      > Thanks,
>      >      >
>      >      > --lx
>      >      >
>      >
>      >
>      
> 

From andrewluotechnologies at outlook.com  Wed Jan 30 03:57:24 2019
From: andrewluotechnologies at outlook.com (Andrew Luo)
Date: Wed, 30 Jan 2019 03:57:24 +0000
Subject: [PATCH] Enhance jaotc to automatically find VS2017+ linker
In-Reply-To: <F2E366FE-E768-4BDD-A9C8-DFD5487C452A@oracle.com>
References: <MWHPR13MB16964D36E03D218A7AD12FC3A19B0@MWHPR13MB1696.namprd13.prod.outlook.com>
 <MWHPR13MB1696FD3691C0A0080506666EA19B0@MWHPR13MB1696.namprd13.prod.outlook.com>
 <06B51E32-14DD-49B7-9DBC-79A677EC70AC@oracle.com>
 <MWHPR13MB1696CCE77E8FB27FF3301A8BA1940@MWHPR13MB1696.namprd13.prod.outlook.com>
 <F2E366FE-E768-4BDD-A9C8-DFD5487C452A@oracle.com>
Message-ID: <MWHPR13MB16968B26F52187F8C8BA3E1AA1900@MWHPR13MB1696.namprd13.prod.outlook.com>

Thanks for the reviews.

Quick question ? for future reference ? what is the correct process for Graal contributions?  After seeing your email, I realized that Graal is on Github as well ? and has pull requests enabled ? should we be creating pull requests on Github or following the same process as other OpenJDK-related changes (mailing lists)?

Thanks,

-Andrew

From: Igor Veresov <igor.veresov at oracle.com>
Sent: Monday, January 28, 2019 3:14 PM
To: Andrew Luo <andrewluotechnologies at outlook.com>
Cc: hotspot-compiler-dev at openjdk.java.net
Subject: Re: [PATCH] Enhance jaotc to automatically find VS2017+ linker

Alright, the code seems ok to me. I ran through our testing and there are no issues. Let?s get you a second review and then I can push your change upstream to the Graal repo.

igor


On Jan 25, 2019, at 4:53 PM, Andrew Luo <andrewluotechnologies at outlook.com<mailto:andrewluotechnologies at outlook.com>> wrote:

Hi Igor,

Yes, I?ve signed an OCA.  I?ve contributed to OpenJDK before, just not on this mailing list.

Thanks,

-Andrew

From: Igor Veresov <igor.veresov at oracle.com<mailto:igor.veresov at oracle.com>>
Sent: Friday, January 25, 2019 4:20 PM
To: Andrew Luo <andrewluotechnologies at outlook.com<mailto:andrewluotechnologies at outlook.com>>
Cc: hotspot-compiler-dev at openjdk.java.net<mailto:hotspot-compiler-dev at openjdk.java.net>
Subject: Re: [PATCH] Enhance jaotc to automatically find VS2017+ linker

Just checking, have you signed the OCA?

igor


On Jan 25, 2019, at 3:56 PM, Andrew Luo <andrewluotechnologies at outlook.com<mailto:andrewluotechnologies at outlook.com>> wrote:

Minor public -> private visibility fix.  Just noticed right after I sent it out?

Thanks,

-Andrew

From: hotspot-compiler-dev <hotspot-compiler-dev-bounces at openjdk.java.net<mailto:hotspot-compiler-dev-bounces at openjdk.java.net>> On Behalf Of Andrew Luo
Sent: Friday, January 25, 2019 3:55 PM
To: hotspot-compiler-dev at openjdk.java.net<mailto:hotspot-compiler-dev at openjdk.java.net>
Subject: [PATCH] Enhance jaotc to automatically find VS2017+ linker

See attached patch.  Any feedback is welcome.

Tested on a system with only VS2017 installed, just ran jaotc with a simple class file, and got the expected .dll output with no errors?

Thanks,

-Andrew

<jaotcdiff2.txt>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190130/fbecb371/attachment.html>

From aoqi at loongson.cn  Wed Jan 30 06:34:22 2019
From: aoqi at loongson.cn (Ao Qi)
Date: Wed, 30 Jan 2019 14:34:22 +0800
Subject: RFR 8218031: Zero build broken
Message-ID: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>

Hi,

It seems that zero build is broken. InterpreterInvocationLimit,
InterpreterBackwardBranchLimit and InterpreterProfileLimit were
removed in 8217922, but they are still used when CC_INTERP is true.

Please review the following webrev which fixes broken zero build.

Bugid: https://bugs.openjdk.java.net/browse/JDK-8218031
Webrev: http://cr.openjdk.java.net/~aoqi/8218031/webrev.00

Thanks,
Ao Qi

From rwestrel at redhat.com  Wed Jan 30 08:13:26 2019
From: rwestrel at redhat.com (Roland Westrelin)
Date: Wed, 30 Jan 2019 09:13:26 +0100
Subject: RFR(T)[12] : 8207922 : ctw of jdk.security.auth failed with
 "Unexpected zero exit codebefore finishing all compilations"
In-Reply-To: <28582EAA-F6F4-4DA2-A112-F81DDAC537BD@oracle.com>
References: <70A5BA4C-CDB3-4C74-ACEA-2FC692047691@oracle.com>
 <87d0of3m9f.fsf@redhat.com> <28582EAA-F6F4-4DA2-A112-F81DDAC537BD@oracle.com>
Message-ID: <871s4u36mx.fsf@redhat.com>


Hi Igor,

> thanks for your review, cerr isn't a typo, it's std::cerr, although if
> you think it's confusing I'll replace it w/ err.

Fine then. Thanks for the explanation.

Roland.

From tobias.hartmann at oracle.com  Wed Jan 30 08:32:23 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Wed, 30 Jan 2019 09:32:23 +0100
Subject: RFR 8218031: Zero build broken
In-Reply-To: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>
References: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>
Message-ID: <45f16dcf-24bf-c6f2-90d0-db5e44078825@oracle.com>

Hi,

this looks good to me.

Thanks,
Tobias

On 30.01.19 07:34, Ao Qi wrote:
> Hi,
> 
> It seems that zero build is broken. InterpreterInvocationLimit,
> InterpreterBackwardBranchLimit and InterpreterProfileLimit were
> removed in 8217922, but they are still used when CC_INTERP is true.
> 
> Please review the following webrev which fixes broken zero build.
> 
> Bugid: https://bugs.openjdk.java.net/browse/JDK-8218031
> Webrev: http://cr.openjdk.java.net/~aoqi/8218031/webrev.00
> 
> Thanks,
> Ao Qi
> 

From sgehwolf at redhat.com  Wed Jan 30 09:12:51 2019
From: sgehwolf at redhat.com (Severin Gehwolf)
Date: Wed, 30 Jan 2019 10:12:51 +0100
Subject: RFR 8218031: Zero build broken
In-Reply-To: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>
References: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>
Message-ID: <9254139ee639a36315a8d07f063ff04aef6e6756.camel@redhat.com>

Hi,

On Wed, 2019-01-30 at 14:34 +0800, Ao Qi wrote:
> Hi,
> 
> It seems that zero build is broken. InterpreterInvocationLimit,
> InterpreterBackwardBranchLimit and InterpreterProfileLimit were
> removed in 8217922, but they are still used when CC_INTERP is true.
> 
> Please review the following webrev which fixes broken zero build.
> 
> Bugid: https://bugs.openjdk.java.net/browse/JDK-8218031
> Webrev: http://cr.openjdk.java.net/~aoqi/8218031/webrev.00

I've verified Zero builds again with this on Linux x86_64. Patch seems
to match what was removed in 8217922 related to the C++ interpreter.
Looks OK to me. I'm not a Reviewer, though.

Thanks,
Severin


From tobias.hartmann at oracle.com  Wed Jan 30 09:21:46 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Wed, 30 Jan 2019 10:21:46 +0100
Subject: [13] RFR (M): 6986483: CHA: optimize calls through interfaces
In-Reply-To: <2d6eb23a-877b-6938-2460-f58b36c8e3ac@oracle.com>
References: <77261b19-d2d2-683c-dde4-6f261e2b78a0@oracle.com>
 <10d3bbdf-c110-cb89-fdb2-1760776b486b@oracle.com>
 <2d6eb23a-877b-6938-2460-f58b36c8e3ac@oracle.com>
Message-ID: <955b414e-6c87-0eb8-3b01-452085c8b8e2@oracle.com>

Hi Vladimir,

this looks good to me!

Just spotted some typos in the comments:
- ciInstanceKlass.hpp: "than one implementors" -> "than one implementor"
- doCall.cpp: "may be able bind" -> "may be able to bind"
  "is the it's supposed" -> "is that it's supposed"
- The copyright date in the test is wrong

Best regards,
Tobias


On 29.01.19 23:09, Vladimir Ivanov wrote:
> Thanks, Nils.
> 
> Additional testing revealed one problematic corner case - Object methods on interfaces. The
> transformation is not valid for them, because it eliminates proper receiver subtype check: subtype
> check against Object is a no-op.
> 
> Updated version:
> ? http://cr.openjdk.java.net/~vlivanov/6986483/webrev.02/
> 
> The changes in c1_GraphBuilder.cpp and doCall.cpp are trivial (cha_monomorphic_target->holder() !=
> Object_klass()), but I extended the test with more cases.
> 
> Testing: hs-precheckin-comp, tier1-5
> 
> Best regards,
> Vladimir Ivanov
> 
> On 29/01/2019 06:03, Nils Eliasson wrote:
>> Hi Vladimir,
>>
>> A really good improvement, and I really like the test, excellent coverage.
>>
>> Reviewed,
>>
>> // Nils
>>
>>
>> On 2019-01-25 22:27, Vladimir Ivanov wrote:
>>> http://cr.openjdk.java.net/~vlivanov/6986483/webrev.01/
>>> https://bugs.openjdk.java.net/browse/JDK-6986483
>>>
>>> Another candidate for revival. At that time it was reviewed, but integration was blocked pending
>>> another bug fix. Now the fix is in.
>>>
>>> Quote from original review request [1]:
>>>
>>> "Proposed change adds CHA support in C2 for interface calls.
>>>
>>> Consider the following hierarchy:
>>>
>>> ?? interface Intf { m(); }
>>> ?? class C implements Intf { public m() { ... } }
>>> ?? class C1 extends C { /* doesn't override m() */ }
>>> ?? ...
>>> ?? class Cn extends C { /* doesn't override m() */ }
>>>
>>> Call site: invokeinterface Intf.m() ...
>>>
>>> If Intf were an abstract class, CHA could deduce that Intf::m() can be
>>> replaced with C::m(), but it doesn't work for interfaces. Verifier
>>> doesn't check interface types in bytecode, so CHA can't assume the
>>> receiver implements Intf.
>>>
>>> CHA in C1 handles such call sites for interfaces with a single
>>> implementor. It replaces invokeinterface Intf.m() with invokevirtual
>>> C.m() guarded by a subtype check (instanceof C). C2 doesn't do that and
>>> this request is about adding that. Type profiling doesn't help here (the
>>> call site is usually megamorphic), so C2 can't inline it.
>>>
>>> The proposed implementation is similar to C1, except that the code
>>> deoptimizes when subtype check fails and ICCE is thrown from the
>>> interpreter.
>>>
>>> While working on it, I spotted and fixed a couple of inefficiencies in
>>> C1 implementation:
>>>
>>> ?? (1) dependency context being used was broader than necessary -
>>> resolved instead of declared interface (hence, possibility of
>>> unnecessary invalidations);
>>>
>>> ?? (2) didn't work for interfaces w/ any default methods: CHA doesn't
>>> support default methods at the moment, so what matters is whether
>>> Intf::m() is default or not and not whether Intf has *any* concrete methods."
>>>
>>>
>>> Testing: hs-precheckin-comp, hs-tier1, hs-tier2
>>>
>>> Best regards,
>>> Vladimir Ivanov
>>>
>>> [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2017-February/025630.html

From nils.eliasson at oracle.com  Wed Jan 30 09:23:18 2019
From: nils.eliasson at oracle.com (Nils Eliasson)
Date: Wed, 30 Jan 2019 10:23:18 +0100
Subject: RFR(S): 8087128 C2: Disallow definition split on MachCopySpill
 nodes
In-Reply-To: <8858054d-b05f-b82e-cdc7-b336eb5df0d5@oracle.com>
References: <2f287f46-0bfb-b489-83b9-ba8e4e54199e@oracle.com>
 <8858054d-b05f-b82e-cdc7-b336eb5df0d5@oracle.com>
Message-ID: <ca8e33ac-3c87-241e-e29b-9ba3fd733b0c@oracle.com>

Thank you Vladimir,

// Nils

On 2019-01-29 23:48, Vladimir Kozlov wrote:
> Thank you, Nils
>
> Looks good to me.
>
> Please run hs-precheckin-comp testing too.
>
> Vladimir
>
> On 1/28/19 7:15 AM, Nils Eliasson wrote:
>> Hi,
>>
>> We have a problem that we sometimes hit an assert in reg_split.cpp.
>>
>> https://bugs.openjdk.java.net/browse/JDK-8087128
>>
>> http://cr.openjdk.java.net/~neliasso/8087128/webrev.01/
>>
>> We have a block that looks like this:
>>
>> 1262: #??????? B1264 B1263 <- N7283? Freq: 0,0927179
>> ??7282?? Region? ===? 7282? 1704? [[ 7282? 1702? 1715 ]]
>> ??11500? MemToRegSpillCopy?????? === _? 11190? [[ 9184 ]] 
>> Oop:com/sun/tools/javac/$
>> ??9184?? DefinitionSpillCopy???? === _? 11500? [[ 9185? 1702 11503 
>> 11505 ]]?? Oop:$
>> ??11501? MemToRegSpillCopy?????? === _? 9169? [[ 11665? 11506 11504 
>> 11502 ]]?? Oop$
>> ??11645? MemToRegSpillCopy?????? === _? 9167? [[ 9182 ]] 
>> Oop:com/sun/tools/javac/c$
>> ??9182?? DefinitionSpillCopy???? === _? 11645? [[ 1702? 11646 11647 
>> 11648 ]]?? Oop$
>> ??1715?? checkCastPP???? ===? 7282? 1716? [[ 1702 ]] 
>> java/util/HashMap$TreeNode:NotN$
>> ??9185?? BoundSpillCopy? === _? 9184? [[ 1702 ]] 
>> Oop:com/sun/tools/javac/code/Symb$
>> ??11665? RegToMemSpillCopy?????? === _? 11501? [[ 1702 ]] 
>> Oop:com/sun/tools/javac/$
>> ??1702?? CallStaticJavaDirect??? ===? 7282? 185? 182? 16? 0 1715 187 
>> 9185? 11665 $
>> ??1703?? MachProj??????? ===? 1702? [[]] #10006/fat
>>
>>
>> 11501 "MemToRegSpillCopy" has one use in this block, "11665 
>> RegToMemSpillCopy", and three uses in other blocks. The use "11665 
>> RegToMemSpillCopy" is used by "1702 CallStaticJavaDirect".
>>
>> We hit the assert when processing 11501 in PhaseChaitin::Split.
>>
>> We are in the "Handle DEFS" section of the split routine. We only get 
>> here if the live range has been marked as spilled when the coloring 
>> have ran out of colors. There are two code paths, one default, where 
>> we just record the def and updates the side tables, and one where we 
>> do a definition split on the live range. This split is guarded by 
>> several conditions. In this case we get here by having a register 
>> mask that is only regs (UP) and by being in a high pressure region. 
>> Everything seems ok, so why don't we end up with MachSpillCopies here 
>> more often?
>>
>> Also - one of the 4 uses is in this block (11655) and it's a 
>> reg-to-mem already. It doesn't make much sense to add even more 
>> spills here. Why does this happen?
>>
>> UseFPUForSpilling added a restriction to coalesing - it skips 
>> coalescing when the two live ranges have different pressure. The 
>> reason for this is that with FPU spilling, the possible extra 
>> spilling is for "free". (I can't find any documentation on benchmarks 
>> where this is beneficial though.) The downside is that we get longer 
>> spill chains like: DefSpill-memToReg-RegToMem-MemToReg, that doesn't 
>> collapse, because of pressure changes. This may cause the live ranges 
>> defined by memToReg-nodes to become spilled, and if we are in a 
>> high-pressure region - we hit the assert.
>>
>> So my conclusion is that nothing is really wrong. Everything still 
>> works without the assert. The spill-chains are unnecessary long, but 
>> only because we have chosen to restrict the coalescing. But we 
>> shouldn't split the spill-nodes even more. In the next iteration the 
>> coalescing within the block will have reduced the chains, and later a 
>> proper coloring will be found.
>>
>> My solution is that we prevent the MachSpillCopies (only Mem-To-Regs 
>> can end up here) from being split again. This is ok - because this is 
>> exactly what would have happened if we would have been in a low 
>> pressure region.
>>
>> I have done some measurements and it doesn't increase the number of 
>> spill-iterations.
>>
>> Regards,
>>
>> Nils
>>
>>
>>

From tobias.hartmann at oracle.com  Wed Jan 30 09:36:07 2019
From: tobias.hartmann at oracle.com (Tobias Hartmann)
Date: Wed, 30 Jan 2019 10:36:07 +0100
Subject: 8217465: RFR(S): [REDO] - Optimize CodeHeap Analytics
In-Reply-To: <B2E29569-592F-4963-9D91-7E774501129A@sap.com>
References: <B2E29569-592F-4963-9D91-7E774501129A@sap.com>
Message-ID: <f3276201-341d-6767-cf1f-90020ff24e65@oracle.com>

Hi Lutz,

looks good to me too.

Best regards,
Tobias

On 28.01.19 14:13, Schmidt, Lutz wrote:
> Hi all,
> 
> may I please request reviews for this REDO of JDK-8217250. The only relevant difference of this REDO is that I moved the 
>   #define USE_BUFFEREDSTREAM
> line further down. It is now located after all the #include statements. 
> 
> The changeset is included in our inhouse tests since Jan 23rd with no issues detected. It was submitted to jdk/submit on Jan 25th with one failure reported on windows-x64 (see attachment). I cannot relate the failure to my changes. Could someone please have a look at the logs? If the reported failure is a false positive, here are the bug and webrev links for your reviews:
> 
> Bug:    https://bugs.openjdk.java.net/browse/JDK-8217465
> Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217465.00/
> 
> Thanks a lot!
> Lutz
> 
>  
> 

From lutz.schmidt at sap.com  Wed Jan 30 10:30:28 2019
From: lutz.schmidt at sap.com (Schmidt, Lutz)
Date: Wed, 30 Jan 2019 10:30:28 +0000
Subject: 8217465: RFR(S): [REDO] - Optimize CodeHeap Analytics
In-Reply-To: <71c4e476-ebf4-d739-f53b-ecaf9cd06cd0@oracle.com>
References: <B2E29569-592F-4963-9D91-7E774501129A@sap.com>
 <71c4e476-ebf4-d739-f53b-ecaf9cd06cd0@oracle.com>
Message-ID: <17B00A1A-5E99-4A71-A1D0-DFF8D7EA3447@sap.com>

Thank you, Vladimir,

for reviewing and running all the tests. So the error report I got from the submit repo was unrelated noise. The REDO-patch builds and runs on all "our" platforms (s390x, ppc64, ppc64le, AIX) as well. 

Best Regards,
Lutz

?On 30.01.19, 01:19, "Vladimir Kozlov" <vladimir.kozlov at oracle.com> wrote:

    Looks good.
    
    It passed tier1,hstier2 testing. It builds on all our platforms.
    
    Thanks,
    Vladimir
    
    On 1/28/19 5:13 AM, Schmidt, Lutz wrote:
    > Hi all,
    > 
    > may I please request reviews for this REDO of JDK-8217250. The only relevant difference of this REDO is that I moved the
    >    #define USE_BUFFEREDSTREAM
    > line further down. It is now located after all the #include statements.
    > 
    > The changeset is included in our inhouse tests since Jan 23rd with no issues detected. It was submitted to jdk/submit on Jan 25th with one failure reported on windows-x64 (see attachment). I cannot relate the failure to my changes. Could someone please have a look at the logs? If the reported failure is a false positive, here are the bug and webrev links for your reviews:
    > 
    > Bug:    https://bugs.openjdk.java.net/browse/JDK-8217465
    > Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217465.00/
    > 
    > Thanks a lot!
    > Lutz
    > 
    >   
    > 
    

From lutz.schmidt at sap.com  Wed Jan 30 10:31:20 2019
From: lutz.schmidt at sap.com (Schmidt, Lutz)
Date: Wed, 30 Jan 2019 10:31:20 +0000
Subject: 8217465: RFR(S): [REDO] - Optimize CodeHeap Analytics
In-Reply-To: <f3276201-341d-6767-cf1f-90020ff24e65@oracle.com>
References: <B2E29569-592F-4963-9D91-7E774501129A@sap.com>
 <f3276201-341d-6767-cf1f-90020ff24e65@oracle.com>
Message-ID: <F4B58F2E-9CBE-4209-9886-DEBE4FD9F8AE@sap.com>

Thanks for reviewing, Tobias!
Best, Lutz

?On 30.01.19, 10:36, "Tobias Hartmann" <tobias.hartmann at oracle.com> wrote:

    Hi Lutz,
    
    looks good to me too.
    
    Best regards,
    Tobias
    
    On 28.01.19 14:13, Schmidt, Lutz wrote:
    > Hi all,
    > 
    > may I please request reviews for this REDO of JDK-8217250. The only relevant difference of this REDO is that I moved the 
    >   #define USE_BUFFEREDSTREAM
    > line further down. It is now located after all the #include statements. 
    > 
    > The changeset is included in our inhouse tests since Jan 23rd with no issues detected. It was submitted to jdk/submit on Jan 25th with one failure reported on windows-x64 (see attachment). I cannot relate the failure to my changes. Could someone please have a look at the logs? If the reported failure is a false positive, here are the bug and webrev links for your reviews:
    > 
    > Bug:    https://bugs.openjdk.java.net/browse/JDK-8217465
    > Webrev: http://cr.openjdk.java.net/~lucy/webrevs/8217465.00/
    > 
    > Thanks a lot!
    > Lutz
    > 
    >  
    > 
    

From aoqi at loongson.cn  Wed Jan 30 16:13:29 2019
From: aoqi at loongson.cn (Ao Qi)
Date: Thu, 31 Jan 2019 00:13:29 +0800
Subject: RFR 8218031: Zero build broken
In-Reply-To: <9254139ee639a36315a8d07f063ff04aef6e6756.camel@redhat.com>
References: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>
 <9254139ee639a36315a8d07f063ff04aef6e6756.camel@redhat.com>
Message-ID: <CALjzQn4rKWL52U2vJ2WdW_Kq136P9+FKK9yvS=JLeskRSYJsxQ@mail.gmail.com>

On Wed, Jan 30, 2019 at 5:12 PM Severin Gehwolf <sgehwolf at redhat.com> wrote:
>
> Hi,
>
> On Wed, 2019-01-30 at 14:34 +0800, Ao Qi wrote:
> > Hi,
> >
> > It seems that zero build is broken. InterpreterInvocationLimit,
> > InterpreterBackwardBranchLimit and InterpreterProfileLimit were
> > removed in 8217922, but they are still used when CC_INTERP is true.
> >
> > Please review the following webrev which fixes broken zero build.
> >
> > Bugid: https://bugs.openjdk.java.net/browse/JDK-8218031
> > Webrev: http://cr.openjdk.java.net/~aoqi/8218031/webrev.00
>
> I've verified Zero builds again with this on Linux x86_64. Patch seems
> to match what was removed in 8217922 related to the C++ interpreter.
> Looks OK to me. I'm not a Reviewer, though.

Thanks, Severin!

>
> Thanks,
> Severin
>

From aoqi at loongson.cn  Wed Jan 30 16:17:20 2019
From: aoqi at loongson.cn (Ao Qi)
Date: Thu, 31 Jan 2019 00:17:20 +0800
Subject: RFR 8218031: Zero build broken
In-Reply-To: <45f16dcf-24bf-c6f2-90d0-db5e44078825@oracle.com>
References: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>
 <45f16dcf-24bf-c6f2-90d0-db5e44078825@oracle.com>
Message-ID: <CALjzQn7XbGdj0QY9qyS8FW19TD3Xgemao45CjKM5bvZDadZEYQ@mail.gmail.com>

Hi Tobias,

Thanks! What should be the next? Is it ok that someone help me to push?

On Wed, Jan 30, 2019 at 4:32 PM Tobias Hartmann
<tobias.hartmann at oracle.com> wrote:
>
> Hi,
>
> this looks good to me.
>
> Thanks,
> Tobias
>
> On 30.01.19 07:34, Ao Qi wrote:
> > Hi,
> >
> > It seems that zero build is broken. InterpreterInvocationLimit,
> > InterpreterBackwardBranchLimit and InterpreterProfileLimit were
> > removed in 8217922, but they are still used when CC_INTERP is true.
> >
> > Please review the following webrev which fixes broken zero build.
> >
> > Bugid: https://bugs.openjdk.java.net/browse/JDK-8218031
> > Webrev: http://cr.openjdk.java.net/~aoqi/8218031/webrev.00
> >
> > Thanks,
> > Ao Qi
> >

From shade at redhat.com  Wed Jan 30 16:26:17 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Wed, 30 Jan 2019 17:26:17 +0100
Subject: RFR 8218031: Zero build broken
In-Reply-To: <CALjzQn7XbGdj0QY9qyS8FW19TD3Xgemao45CjKM5bvZDadZEYQ@mail.gmail.com>
References: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>
 <45f16dcf-24bf-c6f2-90d0-db5e44078825@oracle.com>
 <CALjzQn7XbGdj0QY9qyS8FW19TD3Xgemao45CjKM5bvZDadZEYQ@mail.gmail.com>
Message-ID: <68d42bc9-ef60-1237-5c85-c7a9384113ed@redhat.com>

On 1/30/19 5:17 PM, Ao Qi wrote:
> Webrev: http://cr.openjdk.java.net/~aoqi/8218031/webrev.00

This looks good to me.

> Thanks! What should be the next? Is it ok that someone help me to push?

Yes, you need a sponsor to push. I can be your sponsor.

Let me rename the synopsis a bit:
 "Zero broken after JDK-8217922 (Compiler dead code removal)"

-Aleksey

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190130/f3174376/signature.asc>

From aoqi at loongson.cn  Wed Jan 30 16:38:51 2019
From: aoqi at loongson.cn (Ao Qi)
Date: Thu, 31 Jan 2019 00:38:51 +0800
Subject: RFR 8218031: Zero build broken
In-Reply-To: <68d42bc9-ef60-1237-5c85-c7a9384113ed@redhat.com>
References: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>
 <45f16dcf-24bf-c6f2-90d0-db5e44078825@oracle.com>
 <CALjzQn7XbGdj0QY9qyS8FW19TD3Xgemao45CjKM5bvZDadZEYQ@mail.gmail.com>
 <68d42bc9-ef60-1237-5c85-c7a9384113ed@redhat.com>
Message-ID: <CALjzQn5bxr_b901-7jWTPqJpnomTc6ENT+8AymZwabLZYEG_xg@mail.gmail.com>

On Thu, Jan 31, 2019 at 12:26 AM Aleksey Shipilev <shade at redhat.com> wrote:
>
> On 1/30/19 5:17 PM, Ao Qi wrote:
> > Webrev: http://cr.openjdk.java.net/~aoqi/8218031/webrev.00
>
> This looks good to me.
>
> > Thanks! What should be the next? Is it ok that someone help me to push?
>
> Yes, you need a sponsor to push. I can be your sponsor.
>

Thanks! Is there something I need to do?

> Let me rename the synopsis a bit:
>  "Zero broken after JDK-8217922 (Compiler dead code removal)"
>

That's ok.

> -Aleksey
>

From shade at redhat.com  Wed Jan 30 16:44:44 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Wed, 30 Jan 2019 17:44:44 +0100
Subject: RFR 8218031: Zero build broken
In-Reply-To: <CALjzQn5bxr_b901-7jWTPqJpnomTc6ENT+8AymZwabLZYEG_xg@mail.gmail.com>
References: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>
 <45f16dcf-24bf-c6f2-90d0-db5e44078825@oracle.com>
 <CALjzQn7XbGdj0QY9qyS8FW19TD3Xgemao45CjKM5bvZDadZEYQ@mail.gmail.com>
 <68d42bc9-ef60-1237-5c85-c7a9384113ed@redhat.com>
 <CALjzQn5bxr_b901-7jWTPqJpnomTc6ENT+8AymZwabLZYEG_xg@mail.gmail.com>
Message-ID: <a7b4a43a-99c6-d0c1-c0b1-8c4c8100f769@redhat.com>

On 1/30/19 5:38 PM, Ao Qi wrote:
> On Thu, Jan 31, 2019 at 12:26 AM Aleksey Shipilev <shade at redhat.com> wrote:
>>
>> On 1/30/19 5:17 PM, Ao Qi wrote:
>>> Webrev: http://cr.openjdk.java.net/~aoqi/8218031/webrev.00
>>
>> This looks good to me.


Let me ask a question, though, don't we want these asserts back too?

  assert(0 <= InterpreterBackwardBranchLimit,
         "OSR threshold should be non-negative");
  assert(0 <= InterpreterProfileLimit &&
         InterpreterProfileLimit <= InterpreterInvocationLimit,
         "profile threshold should be less than the compilation threshold "
         "and non-negative");

Also, why these are removed? Do they mess with buildability?

 102   bool reached_BackwardBranchLimit(InvocationCounter *back_edge_count) const {
 103     return (_counter & count_mask) + (back_edge_count->_counter & count_mask) >=
 104            (unsigned int) InterpreterBackwardBranchLimit;
 105   }
 106   // Do this just like asm interpreter does for max speed.
 107   bool reached_ProfileLimit(InvocationCounter *back_edge_count) const {
 108     return (_counter & count_mask) + (back_edge_count->_counter & count_mask) >=
 109            (unsigned int) InterpreterProfileLimit;


I see that InterpreterProfileLimit is gone, but InterpreterBackwardBranchLimit is still there,
should we keep the assert and reached_BackwardBranchLimit then?

>> Yes, you need a sponsor to push. I can be your sponsor.
> 
> Thanks! Is there something I need to do?

Nothing new, just answer a few questions above.

-Aleksey

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190130/77648c1b/signature-0001.asc>

From shade at redhat.com  Wed Jan 30 17:18:15 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Wed, 30 Jan 2019 18:18:15 +0100
Subject: RFR 8218031: Zero build broken
In-Reply-To: <a7b4a43a-99c6-d0c1-c0b1-8c4c8100f769@redhat.com>
References: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>
 <45f16dcf-24bf-c6f2-90d0-db5e44078825@oracle.com>
 <CALjzQn7XbGdj0QY9qyS8FW19TD3Xgemao45CjKM5bvZDadZEYQ@mail.gmail.com>
 <68d42bc9-ef60-1237-5c85-c7a9384113ed@redhat.com>
 <CALjzQn5bxr_b901-7jWTPqJpnomTc6ENT+8AymZwabLZYEG_xg@mail.gmail.com>
 <a7b4a43a-99c6-d0c1-c0b1-8c4c8100f769@redhat.com>
Message-ID: <e89d38cb-f96b-a901-837a-6a78e29b2370@redhat.com>

On 1/30/19 5:44 PM, Aleksey Shipilev wrote:
> On 1/30/19 5:38 PM, Ao Qi wrote:
>> On Thu, Jan 31, 2019 at 12:26 AM Aleksey Shipilev <shade at redhat.com> wrote:
>>>
>>> On 1/30/19 5:17 PM, Ao Qi wrote:
>>>> Webrev: http://cr.openjdk.java.net/~aoqi/8218031/webrev.00
>>>
>>> This looks good to me.
> 
> 
> Let me ask a question, though, don't we want these asserts back too?
> 
>   assert(0 <= InterpreterBackwardBranchLimit,
>          "OSR threshold should be non-negative");
>   assert(0 <= InterpreterProfileLimit &&
>          InterpreterProfileLimit <= InterpreterInvocationLimit,
>          "profile threshold should be less than the compilation threshold "
>          "and non-negative");
> 
> Also, why these are removed? Do they mess with buildability?
> 
>  102   bool reached_BackwardBranchLimit(InvocationCounter *back_edge_count) const {
>  103     return (_counter & count_mask) + (back_edge_count->_counter & count_mask) >=
>  104            (unsigned int) InterpreterBackwardBranchLimit;
>  105   }
>  106   // Do this just like asm interpreter does for max speed.
>  107   bool reached_ProfileLimit(InvocationCounter *back_edge_count) const {
>  108     return (_counter & count_mask) + (back_edge_count->_counter & count_mask) >=
>  109            (unsigned int) InterpreterProfileLimit;
> 
> 
> I see that InterpreterProfileLimit is gone, but InterpreterBackwardBranchLimit is still there,
> should we keep the assert and reached_BackwardBranchLimit then?

I am thinking this:
  http://cr.openjdk.java.net/~shade/8218031/webrev.01/

(also note the Contributed-by line)

This passes Linux x86_64 {zero, server} builds.

-Aleksey


-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190130/d49ed20c/signature.asc>

From aoqi at loongson.cn  Wed Jan 30 17:30:52 2019
From: aoqi at loongson.cn (Ao Qi)
Date: Thu, 31 Jan 2019 01:30:52 +0800
Subject: RFR 8218031: Zero build broken
In-Reply-To: <a7b4a43a-99c6-d0c1-c0b1-8c4c8100f769@redhat.com>
References: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>
 <45f16dcf-24bf-c6f2-90d0-db5e44078825@oracle.com>
 <CALjzQn7XbGdj0QY9qyS8FW19TD3Xgemao45CjKM5bvZDadZEYQ@mail.gmail.com>
 <68d42bc9-ef60-1237-5c85-c7a9384113ed@redhat.com>
 <CALjzQn5bxr_b901-7jWTPqJpnomTc6ENT+8AymZwabLZYEG_xg@mail.gmail.com>
 <a7b4a43a-99c6-d0c1-c0b1-8c4c8100f769@redhat.com>
Message-ID: <CALjzQn4VYcN3o3Zj5BJGqnDmNiSyUOa8bdb1OSfhnCorTypqfw@mail.gmail.com>

On Thu, Jan 31, 2019 at 12:44 AM Aleksey Shipilev <shade at redhat.com> wrote:
>
> On 1/30/19 5:38 PM, Ao Qi wrote:
> > On Thu, Jan 31, 2019 at 12:26 AM Aleksey Shipilev <shade at redhat.com> wrote:
> >>
> >> On 1/30/19 5:17 PM, Ao Qi wrote:
> >>> Webrev: http://cr.openjdk.java.net/~aoqi/8218031/webrev.00
> >>
> >> This looks good to me.
>
>
> Let me ask a question, though, don't we want these asserts back too?
>
>   assert(0 <= InterpreterBackwardBranchLimit,
>          "OSR threshold should be non-negative");

I think this assert does not mess with buildability...

>   assert(0 <= InterpreterProfileLimit &&
>          InterpreterProfileLimit <= InterpreterInvocationLimit,
>          "profile threshold should be less than the compilation threshold "
>          "and non-negative");
>

I grep InterpreterProfileLimit, and it is used only once in
reached_ProfileLimit. reached_ProfileLimit is never called.

> Also, why these are removed? Do they mess with buildability?

Actually they are not removed by me. I think they were removed because
they were thought not to be used anymore.
InterpreterBackwardBranchLimit and InterpreterInvocationLimit are
still used when CC_INTERP defined, but InterpreterProfileLimit is not.

>
>  102   bool reached_BackwardBranchLimit(InvocationCounter *back_edge_count) const {
>  103     return (_counter & count_mask) + (back_edge_count->_counter & count_mask) >=
>  104            (unsigned int) InterpreterBackwardBranchLimit;
>  105   }
>  106   // Do this just like asm interpreter does for max speed.
>  107   bool reached_ProfileLimit(InvocationCounter *back_edge_count) const {
>  108     return (_counter & count_mask) + (back_edge_count->_counter & count_mask) >=
>  109            (unsigned int) InterpreterProfileLimit;
>
>
> I see that InterpreterProfileLimit is gone, but InterpreterBackwardBranchLimit is still there,
> should we keep the assert and reached_BackwardBranchLimit then?
>

reached_BackwardBranchLimit is never called too, it can be removed in
my opinion.

> >> Yes, you need a sponsor to push. I can be your sponsor.
> >
> > Thanks! Is there something I need to do?
>
> Nothing new, just answer a few questions above.
>
> -Aleksey
>

From aoqi at loongson.cn  Wed Jan 30 17:32:45 2019
From: aoqi at loongson.cn (Ao Qi)
Date: Thu, 31 Jan 2019 01:32:45 +0800
Subject: RFR 8218031: Zero build broken
In-Reply-To: <e89d38cb-f96b-a901-837a-6a78e29b2370@redhat.com>
References: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>
 <45f16dcf-24bf-c6f2-90d0-db5e44078825@oracle.com>
 <CALjzQn7XbGdj0QY9qyS8FW19TD3Xgemao45CjKM5bvZDadZEYQ@mail.gmail.com>
 <68d42bc9-ef60-1237-5c85-c7a9384113ed@redhat.com>
 <CALjzQn5bxr_b901-7jWTPqJpnomTc6ENT+8AymZwabLZYEG_xg@mail.gmail.com>
 <a7b4a43a-99c6-d0c1-c0b1-8c4c8100f769@redhat.com>
 <e89d38cb-f96b-a901-837a-6a78e29b2370@redhat.com>
Message-ID: <CALjzQn6NGPXJa195W3nOyHRkVo_WCti5sOj2kpHoiMAckjmEdA@mail.gmail.com>

On Thu, Jan 31, 2019 at 1:18 AM Aleksey Shipilev <shade at redhat.com> wrote:
>
> On 1/30/19 5:44 PM, Aleksey Shipilev wrote:
> > On 1/30/19 5:38 PM, Ao Qi wrote:
> >> On Thu, Jan 31, 2019 at 12:26 AM Aleksey Shipilev <shade at redhat.com> wrote:
> >>>
> >>> On 1/30/19 5:17 PM, Ao Qi wrote:
> >>>> Webrev: http://cr.openjdk.java.net/~aoqi/8218031/webrev.00
> >>>
> >>> This looks good to me.
> >
> >
> > Let me ask a question, though, don't we want these asserts back too?
> >
> >   assert(0 <= InterpreterBackwardBranchLimit,
> >          "OSR threshold should be non-negative");
> >   assert(0 <= InterpreterProfileLimit &&
> >          InterpreterProfileLimit <= InterpreterInvocationLimit,
> >          "profile threshold should be less than the compilation threshold "
> >          "and non-negative");
> >
> > Also, why these are removed? Do they mess with buildability?
> >
> >  102   bool reached_BackwardBranchLimit(InvocationCounter *back_edge_count) const {
> >  103     return (_counter & count_mask) + (back_edge_count->_counter & count_mask) >=
> >  104            (unsigned int) InterpreterBackwardBranchLimit;
> >  105   }
> >  106   // Do this just like asm interpreter does for max speed.
> >  107   bool reached_ProfileLimit(InvocationCounter *back_edge_count) const {
> >  108     return (_counter & count_mask) + (back_edge_count->_counter & count_mask) >=
> >  109            (unsigned int) InterpreterProfileLimit;
> >
> >
> > I see that InterpreterProfileLimit is gone, but InterpreterBackwardBranchLimit is still there,
> > should we keep the assert and reached_BackwardBranchLimit then?
>
> I am thinking this:
>   http://cr.openjdk.java.net/~shade/8218031/webrev.01/
>

I am ok with this.

> (also note the Contributed-by line)
>
> This passes Linux x86_64 {zero, server} builds.
>
> -Aleksey
>
>
>

From shade at redhat.com  Wed Jan 30 17:37:10 2019
From: shade at redhat.com (Aleksey Shipilev)
Date: Wed, 30 Jan 2019 18:37:10 +0100
Subject: RFR 8218031: Zero build broken
In-Reply-To: <CALjzQn6NGPXJa195W3nOyHRkVo_WCti5sOj2kpHoiMAckjmEdA@mail.gmail.com>
References: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>
 <45f16dcf-24bf-c6f2-90d0-db5e44078825@oracle.com>
 <CALjzQn7XbGdj0QY9qyS8FW19TD3Xgemao45CjKM5bvZDadZEYQ@mail.gmail.com>
 <68d42bc9-ef60-1237-5c85-c7a9384113ed@redhat.com>
 <CALjzQn5bxr_b901-7jWTPqJpnomTc6ENT+8AymZwabLZYEG_xg@mail.gmail.com>
 <a7b4a43a-99c6-d0c1-c0b1-8c4c8100f769@redhat.com>
 <e89d38cb-f96b-a901-837a-6a78e29b2370@redhat.com>
 <CALjzQn6NGPXJa195W3nOyHRkVo_WCti5sOj2kpHoiMAckjmEdA@mail.gmail.com>
Message-ID: <dc01a625-bb36-dc67-8e1f-06ed64ec8565@redhat.com>

On 1/30/19 6:32 PM, Ao Qi wrote:
>>> I see that InterpreterProfileLimit is gone, but InterpreterBackwardBranchLimit is still there,
>>> should we keep the assert and reached_BackwardBranchLimit then?
>>
>> I am thinking this:
>>   http://cr.openjdk.java.net/~shade/8218031/webrev.01/
>>
> 
> I am ok with this.

Good, I pushed the version above. That should be it!

-Aleksey


-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190130/0a555eee/signature-0001.asc>

From aoqi at loongson.cn  Wed Jan 30 17:40:26 2019
From: aoqi at loongson.cn (Ao Qi)
Date: Thu, 31 Jan 2019 01:40:26 +0800
Subject: RFR 8218031: Zero build broken
In-Reply-To: <dc01a625-bb36-dc67-8e1f-06ed64ec8565@redhat.com>
References: <CALjzQn43tEHnGMmA2NjfkLGncYRNvwDSQb8014yS0NHaHF3xmg@mail.gmail.com>
 <45f16dcf-24bf-c6f2-90d0-db5e44078825@oracle.com>
 <CALjzQn7XbGdj0QY9qyS8FW19TD3Xgemao45CjKM5bvZDadZEYQ@mail.gmail.com>
 <68d42bc9-ef60-1237-5c85-c7a9384113ed@redhat.com>
 <CALjzQn5bxr_b901-7jWTPqJpnomTc6ENT+8AymZwabLZYEG_xg@mail.gmail.com>
 <a7b4a43a-99c6-d0c1-c0b1-8c4c8100f769@redhat.com>
 <e89d38cb-f96b-a901-837a-6a78e29b2370@redhat.com>
 <CALjzQn6NGPXJa195W3nOyHRkVo_WCti5sOj2kpHoiMAckjmEdA@mail.gmail.com>
 <dc01a625-bb36-dc67-8e1f-06ed64ec8565@redhat.com>
Message-ID: <CALjzQn5KH7V8bjVwR+iGNooDRYFrnSsBauRR1T=e-7k+puXMGA@mail.gmail.com>

On Thu, Jan 31, 2019 at 1:37 AM Aleksey Shipilev <shade at redhat.com> wrote:
>
> On 1/30/19 6:32 PM, Ao Qi wrote:
> >>> I see that InterpreterProfileLimit is gone, but InterpreterBackwardBranchLimit is still there,
> >>> should we keep the assert and reached_BackwardBranchLimit then?
> >>
> >> I am thinking this:
> >>   http://cr.openjdk.java.net/~shade/8218031/webrev.01/
> >>
> >
> > I am ok with this.
>
> Good, I pushed the version above. That should be it!
>

Thanks:)

> -Aleksey
>
>

From bsrbnd at gmail.com  Wed Jan 30 19:35:09 2019
From: bsrbnd at gmail.com (B. Blaser)
Date: Wed, 30 Jan 2019 20:35:09 +0100
Subject: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
In-Reply-To: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A550D5@FMSMSX126.amr.corp.intel.com>
References: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>
 <a02302f8-40c1-9758-eb1d-3598f5dd1f3d@redhat.com>
 <A66BBE673E08E1428E3A918AE4D5B32CED49B6@BGSMSX106.gar.corp.intel.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A54F39@FMSMSX126.amr.corp.intel.com>
 <fecf3d19-4d30-6a67-a4ce-d90f6df08f10@oracle.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A550D5@FMSMSX126.amr.corp.intel.com>
Message-ID: <CAEgw74CVwJEqxO9KVFR9KetFgYK+wkDS4UOam7sOMnc5Kk-0Ww@mail.gmail.com>

Hi Sandhya,

On Tue, 29 Jan 2019 at 20:19, Viswanathan, Sandhya
<sandhya.viswanathan at intel.com> wrote:
>
> Hi Vladimir,
>
> From what I understand the test is correct and the implementation needs to be fixed (the effect on registers a and b should be USE_KILL).
> Jatin plans to send an updated webrev after fixing the issue.
>
> Best Regards,
> Sandhya

Just a few questions as I looked at this intrinsic some time ago and I
wasn't convinced we could easily improve the current impl on x86
because MIN/MAX instructions don't conform to the Java doc for 0.0 and
NaN.

Here is the current impl and how its code is currently generated with
a sequence of 'ucomisd+jcc':

http://hg.openjdk.java.net/jdk/jdk/file/3997614d4834/src/java.base/share/classes/java/lang/Math.java#l1491

    public static double max(double a, double b) {
        if (a != a)
// ucomisd XMM0, XMM0
// jp / je
            return a;   // a is NaN

        if ((a == 0.0d) &&
// ucomisd XMM0, [constant table base + #0]    # load from constant
table: double=#0.000000
// jp / je
            (b == 0.0d) &&
// ucomisd XMM1, [constant table base + #0]    # load from constant
table: double=#0.000000
// jp / jne
            (Double.doubleToRawLongBits(a) == negativeZeroDoubleBits)
// movd    R10,XMM0    # MoveD2L
// movq    R11, #-9223372036854775808    # long
// cmpq    R10, R11
// jne
        ) {
            // Raw conversion ok since NaN can't map to -0.0.
            return b;
// movapd  XMM0, XMM1    # spill
        }

        return (a >= b) ? a : b;
// ucomisd XMM0, XMM1 test
// jb
// movapd  XMM1, XMM0    # spill
// movapd  XMM0, XMM1    # spill
    }

It isn't obvious from your suggested fix that all paths are really
faster than the current impl:

http://cr.openjdk.java.net/~sviswanathan/Jatin/8217561/webrev.00/src/hotspot/cpu/x86/x86.ad.frames.html

2851 // Following pseudo code describes the algorithm for max[FD]/min[FD]:
2852 //  if ( b < 0 )
2853 //    swap(a, b)
2854 //  Tmp  = Max_Float( a , b)
2855 //  Mask = a == NaN ? 1 : 0
2856 //  Res  = Mask ? a : Tmp

2881 // max = java.lang.Max(double a , double b)
2882 instruct maxD_reg(legRegD dst, legRegD a, legRegD b, legRegD tmp,
legRegD mask) %{
2883   predicate(UseAVX > 0);
2884   match(Set dst (MaxD a b));
2885   effect(USE a, USE b, TEMP tmp, TEMP mask);
2886   format %{
2887      "blendvpd         $tmp,$b,$a,$b   \n\t"
2888      "blendvpd         $a,$a,$b,$b     \n\t"
2889      "movapd           $b,$tmp         \n\t"
2890      "vmaxpd           $tmp,$a,$b      \n\t"
2891      "cmppd.unordered  $mask, $a, $a   \n\t"
2892      "blendvpd         $dst,$tmp,$a,$mask  \n\t"
2893   %}

We've recently seen that branches are sometimes faster than
conditional moves/copies, would it be possible to quantify the
improvement?

Also, I note that your code uses packed instructions, is this really
better than using 'ucomisd' acting on scalar data types?
Or maybe, 'vmaxsd' and 'cmpsd would be more appropriate?

Thanks,
Bernard

From sandhya.viswanathan at intel.com  Wed Jan 30 20:26:08 2019
From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya)
Date: Wed, 30 Jan 2019 20:26:08 +0000
Subject: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
In-Reply-To: <CAEgw74CVwJEqxO9KVFR9KetFgYK+wkDS4UOam7sOMnc5Kk-0Ww@mail.gmail.com>
References: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>
 <a02302f8-40c1-9758-eb1d-3598f5dd1f3d@redhat.com>
 <A66BBE673E08E1428E3A918AE4D5B32CED49B6@BGSMSX106.gar.corp.intel.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A54F39@FMSMSX126.amr.corp.intel.com>
 <fecf3d19-4d30-6a67-a4ce-d90f6df08f10@oracle.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A550D5@FMSMSX126.amr.corp.intel.com>
 <CAEgw74CVwJEqxO9KVFR9KetFgYK+wkDS4UOam7sOMnc5Kk-0Ww@mail.gmail.com>
Message-ID: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A55B2B@FMSMSX126.amr.corp.intel.com>

Hi Bernard,

Thanks a lot for your feedback. Let me try to answer your questions below. 

We also started with the same assumption that we may not be able to easily improve the current implementation on x86 because MIN/MAX instructions don't conform to the Java doc for 0.0 and NaN. 
Jatin took this as a challenge and came up with a sequence that does show benefit.
Our performance run shows about 30% gain with this patch vs the ucomisd sequence generated by the jitted code. 
As you suggest, we could use scalar instructions for max, min and cmp instead of using the packed variant.
But the blend has to be the packed variant as there is no scalar flavor for that and so changing the others to scalar flavor may not show much perf change, we will confirm both ways.
The path that won't show much benefit or may show some regression is when both the operands are NaN which is not frequently occurring case I would think.

Jatin is going to send updated patch fixing the issue reported by Nils. We will include performance numbers along with the updated patch.

Best Regards,
Sandhya


-----Original Message-----
From: B. Blaser [mailto:bsrbnd at gmail.com] 
Sent: Wednesday, January 30, 2019 11:35 AM
To: Viswanathan, Sandhya <sandhya.viswanathan at intel.com>
Cc: Vladimir Kozlov <vladimir.kozlov at oracle.com>; hotspot-compiler-dev at openjdk.java.net
Subject: Re: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics

Hi Sandhya,

On Tue, 29 Jan 2019 at 20:19, Viswanathan, Sandhya <sandhya.viswanathan at intel.com> wrote:
>
> Hi Vladimir,
>
> From what I understand the test is correct and the implementation needs to be fixed (the effect on registers a and b should be USE_KILL).
> Jatin plans to send an updated webrev after fixing the issue.
>
> Best Regards,
> Sandhya

Just a few questions as I looked at this intrinsic some time ago and I wasn't convinced we could easily improve the current impl on x86 because MIN/MAX instructions don't conform to the Java doc for 0.0 and NaN.

Here is the current impl and how its code is currently generated with a sequence of 'ucomisd+jcc':

http://hg.openjdk.java.net/jdk/jdk/file/3997614d4834/src/java.base/share/classes/java/lang/Math.java#l1491

    public static double max(double a, double b) {
        if (a != a)
// ucomisd XMM0, XMM0
// jp / je
            return a;   // a is NaN

        if ((a == 0.0d) &&
// ucomisd XMM0, [constant table base + #0]    # load from constant
table: double=#0.000000
// jp / je
            (b == 0.0d) &&
// ucomisd XMM1, [constant table base + #0]    # load from constant
table: double=#0.000000
// jp / jne
            (Double.doubleToRawLongBits(a) == negativeZeroDoubleBits)
// movd    R10,XMM0    # MoveD2L
// movq    R11, #-9223372036854775808    # long
// cmpq    R10, R11
// jne
        ) {
            // Raw conversion ok since NaN can't map to -0.0.
            return b;
// movapd  XMM0, XMM1    # spill
        }

        return (a >= b) ? a : b;
// ucomisd XMM0, XMM1 test
// jb
// movapd  XMM1, XMM0    # spill
// movapd  XMM0, XMM1    # spill
    }

It isn't obvious from your suggested fix that all paths are really faster than the current impl:

http://cr.openjdk.java.net/~sviswanathan/Jatin/8217561/webrev.00/src/hotspot/cpu/x86/x86.ad.frames.html

2851 // Following pseudo code describes the algorithm for max[FD]/min[FD]:
2852 //  if ( b < 0 )
2853 //    swap(a, b)
2854 //  Tmp  = Max_Float( a , b)
2855 //  Mask = a == NaN ? 1 : 0
2856 //  Res  = Mask ? a : Tmp

2881 // max = java.lang.Max(double a , double b)
2882 instruct maxD_reg(legRegD dst, legRegD a, legRegD b, legRegD tmp, legRegD mask) %{
2883   predicate(UseAVX > 0);
2884   match(Set dst (MaxD a b));
2885   effect(USE a, USE b, TEMP tmp, TEMP mask);
2886   format %{
2887      "blendvpd         $tmp,$b,$a,$b   \n\t"
2888      "blendvpd         $a,$a,$b,$b     \n\t"
2889      "movapd           $b,$tmp         \n\t"
2890      "vmaxpd           $tmp,$a,$b      \n\t"
2891      "cmppd.unordered  $mask, $a, $a   \n\t"
2892      "blendvpd         $dst,$tmp,$a,$mask  \n\t"
2893   %}

We've recently seen that branches are sometimes faster than conditional moves/copies, would it be possible to quantify the improvement?

Also, I note that your code uses packed instructions, is this really better than using 'ucomisd' acting on scalar data types?
Or maybe, 'vmaxsd' and 'cmpsd would be more appropriate?

Thanks,
Bernard

From vladimir.x.ivanov at oracle.com  Wed Jan 30 21:52:16 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Wed, 30 Jan 2019 13:52:16 -0800
Subject: [13] RFR (M): 6986483: CHA: optimize calls through interfaces
In-Reply-To: <955b414e-6c87-0eb8-3b01-452085c8b8e2@oracle.com>
References: <77261b19-d2d2-683c-dde4-6f261e2b78a0@oracle.com>
 <10d3bbdf-c110-cb89-fdb2-1760776b486b@oracle.com>
 <2d6eb23a-877b-6938-2460-f58b36c8e3ac@oracle.com>
 <955b414e-6c87-0eb8-3b01-452085c8b8e2@oracle.com>
Message-ID: <0d0afee8-bfab-c0e2-d67e-b8eb0b9e35f3@oracle.com>

Thanks, Tobias.

Updated webrev in-place.

Best regards,
Vladimir Ivanov

On 30/01/2019 01:21, Tobias Hartmann wrote:
> Hi Vladimir,
> 
> this looks good to me!
> 
> Just spotted some typos in the comments:
> - ciInstanceKlass.hpp: "than one implementors" -> "than one implementor"
> - doCall.cpp: "may be able bind" -> "may be able to bind"
>    "is the it's supposed" -> "is that it's supposed"
> - The copyright date in the test is wrong
> 
> Best regards,
> Tobias
> 
> 
> On 29.01.19 23:09, Vladimir Ivanov wrote:
>> Thanks, Nils.
>>
>> Additional testing revealed one problematic corner case - Object methods on interfaces. The
>> transformation is not valid for them, because it eliminates proper receiver subtype check: subtype
>> check against Object is a no-op.
>>
>> Updated version:
>>  ? http://cr.openjdk.java.net/~vlivanov/6986483/webrev.02/
>>
>> The changes in c1_GraphBuilder.cpp and doCall.cpp are trivial (cha_monomorphic_target->holder() !=
>> Object_klass()), but I extended the test with more cases.
>>
>> Testing: hs-precheckin-comp, tier1-5
>>
>> Best regards,
>> Vladimir Ivanov
>>
>> On 29/01/2019 06:03, Nils Eliasson wrote:
>>> Hi Vladimir,
>>>
>>> A really good improvement, and I really like the test, excellent coverage.
>>>
>>> Reviewed,
>>>
>>> // Nils
>>>
>>>
>>> On 2019-01-25 22:27, Vladimir Ivanov wrote:
>>>> http://cr.openjdk.java.net/~vlivanov/6986483/webrev.01/
>>>> https://bugs.openjdk.java.net/browse/JDK-6986483
>>>>
>>>> Another candidate for revival. At that time it was reviewed, but integration was blocked pending
>>>> another bug fix. Now the fix is in.
>>>>
>>>> Quote from original review request [1]:
>>>>
>>>> "Proposed change adds CHA support in C2 for interface calls.
>>>>
>>>> Consider the following hierarchy:
>>>>
>>>>  ?? interface Intf { m(); }
>>>>  ?? class C implements Intf { public m() { ... } }
>>>>  ?? class C1 extends C { /* doesn't override m() */ }
>>>>  ?? ...
>>>>  ?? class Cn extends C { /* doesn't override m() */ }
>>>>
>>>> Call site: invokeinterface Intf.m() ...
>>>>
>>>> If Intf were an abstract class, CHA could deduce that Intf::m() can be
>>>> replaced with C::m(), but it doesn't work for interfaces. Verifier
>>>> doesn't check interface types in bytecode, so CHA can't assume the
>>>> receiver implements Intf.
>>>>
>>>> CHA in C1 handles such call sites for interfaces with a single
>>>> implementor. It replaces invokeinterface Intf.m() with invokevirtual
>>>> C.m() guarded by a subtype check (instanceof C). C2 doesn't do that and
>>>> this request is about adding that. Type profiling doesn't help here (the
>>>> call site is usually megamorphic), so C2 can't inline it.
>>>>
>>>> The proposed implementation is similar to C1, except that the code
>>>> deoptimizes when subtype check fails and ICCE is thrown from the
>>>> interpreter.
>>>>
>>>> While working on it, I spotted and fixed a couple of inefficiencies in
>>>> C1 implementation:
>>>>
>>>>  ?? (1) dependency context being used was broader than necessary -
>>>> resolved instead of declared interface (hence, possibility of
>>>> unnecessary invalidations);
>>>>
>>>>  ?? (2) didn't work for interfaces w/ any default methods: CHA doesn't
>>>> support default methods at the moment, so what matters is whether
>>>> Intf::m() is default or not and not whether Intf has *any* concrete methods."
>>>>
>>>>
>>>> Testing: hs-precheckin-comp, hs-tier1, hs-tier2
>>>>
>>>> Best regards,
>>>> Vladimir Ivanov
>>>>
>>>> [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2017-February/025630.html

From vivek.r.deshpande at intel.com  Wed Jan 30 21:52:53 2019
From: vivek.r.deshpande at intel.com (Deshpande, Vivek R)
Date: Wed, 30 Jan 2019 21:52:53 +0000
Subject: RFR (XS): 8217371: C2: Incorrect LP64 guard in x86.ad after
 JDK-8210764 (Update avx512 implementation)
In-Reply-To: <3e712082-1def-28ce-f45d-3ef54864e89e@oracle.com>
References: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A5421D@FMSMSX126.amr.corp.intel.com>
 <a7deaae8-da75-0068-26cf-e0e40f45f8d2@oracle.com>
 <c263026d-6c94-eec3-2605-63a8c4a09c4e@oracle.com>
 <8d70f0c6-4caa-ab2c-d5dd-608c564037c5@oracle.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A547B0@FMSMSX126.amr.corp.intel.com>
 <3e712082-1def-28ce-f45d-3ef54864e89e@oracle.com>
Message-ID: <53E8E64DB2403849AFD89B7D4DAC8B2A9A16DBAD@FMSMSX152.amr.corp.intel.com>

Hi Vladimir

I tested the patch with Lucene tests.
Then I have pushed the patch:
http://hg.openjdk.java.net/jdk/jdk/rev/6121eee15c23
Thanks.

Regards,
Vivek

-----Original Message-----
From: Vladimir Kozlov [mailto:vladimir.kozlov at oracle.com] 
Sent: Monday, January 28, 2019 9:39 AM
To: Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; hotspot-compiler-dev at openjdk.java.net
Cc: Deshpande, Vivek R <vivek.r.deshpande at intel.com>
Subject: Re: RFR (XS): 8217371: C2: Incorrect LP64 guard in x86.ad after JDK-8210764 (Update avx512 implementation)

Hi Sandhya,

Can you also run Lucene tests which hit previous avx512 issues on SKX and Knights?
This is spilling code and it is used when a lot of xmm registers are used and our jtreg tests may not use this code.

You can push after that.

Thanks,
Vladimir

On 1/28/19 9:22 AM, Viswanathan, Sandhya wrote:
> Thanks Vladimir, Tobias and Nils.
> 
> Could Vivek go ahead and push it?
> 
> Best Regards,
> Sandhya
> 
> 
> -----Original Message-----
> From: Vladimir Kozlov [mailto:vladimir.kozlov at oracle.com]
> Sent: Monday, January 28, 2019 9:17 AM
> To: hotspot-compiler-dev at openjdk.java.net; Viswanathan, Sandhya 
> <sandhya.viswanathan at intel.com>
> Subject: Re: RFR (XS): 8217371: C2: Incorrect LP64 guard in x86.ad 
> after JDK-8210764 (Update avx512 implementation)
> 
> +1
> 
> Thanks,
> Vladimir
> 
> On 1/28/19 2:00 AM, Tobias Hartmann wrote:
>> Hi Sandhya,
>>
>> looks good to me too.
>>
>> Best regards,
>> Tobias
>>
>> On 28.01.19 09:40, Nils Eliasson wrote:
>>> Hi Sandhya,
>>>
>>> Looks good,
>>>
>>> Regards,
>>>
>>> Nils
>>>
>>> On 2019-01-27 04:47, Viswanathan, Sandhya wrote:
>>>>
>>>> Hi All,
>>>>
>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8217371
>>>> <https://bugs.openjdk.java.net/browse/JDK-8217371>
>>>>
>>>> Webrev: http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/
>>>> <http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/>
>>>>
>>>> The above webrev fixes the incorrect LP64 guard issue in x86.ad file.
>>>>
>>>> I have corrected the guard to _LP64 and updated the spill/fill 
>>>> instructions.? This bug only affected the Knights family where AVX512VL is not supported.
>>>>    
>>>> I have tested it on SKX and Knights family with compiler jtreg tests.
>>>> Please review.
>>>>
>>>>
>>>> Best Regards,
>>>> Sandhya
>>>>

From vladimir.kozlov at oracle.com  Wed Jan 30 23:30:19 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 30 Jan 2019 15:30:19 -0800
Subject: RFR (XS): 8217371: C2: Incorrect LP64 guard in x86.ad after
 JDK-8210764 (Update avx512 implementation)
In-Reply-To: <53E8E64DB2403849AFD89B7D4DAC8B2A9A16DBAD@FMSMSX152.amr.corp.intel.com>
References: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A5421D@FMSMSX126.amr.corp.intel.com>
 <a7deaae8-da75-0068-26cf-e0e40f45f8d2@oracle.com>
 <c263026d-6c94-eec3-2605-63a8c4a09c4e@oracle.com>
 <8d70f0c6-4caa-ab2c-d5dd-608c564037c5@oracle.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A547B0@FMSMSX126.amr.corp.intel.com>
 <3e712082-1def-28ce-f45d-3ef54864e89e@oracle.com>
 <53E8E64DB2403849AFD89B7D4DAC8B2A9A16DBAD@FMSMSX152.amr.corp.intel.com>
Message-ID: <3057263a-2b52-9edb-16f6-a58acb0c01d9@oracle.com>

Thank you, Vivek

Vladimir

On 1/30/19 1:52 PM, Deshpande, Vivek R wrote:
> Hi Vladimir
> 
> I tested the patch with Lucene tests.
> Then I have pushed the patch:
> http://hg.openjdk.java.net/jdk/jdk/rev/6121eee15c23
> Thanks.
> 
> Regards,
> Vivek
> 
> -----Original Message-----
> From: Vladimir Kozlov [mailto:vladimir.kozlov at oracle.com]
> Sent: Monday, January 28, 2019 9:39 AM
> To: Viswanathan, Sandhya <sandhya.viswanathan at intel.com>; hotspot-compiler-dev at openjdk.java.net
> Cc: Deshpande, Vivek R <vivek.r.deshpande at intel.com>
> Subject: Re: RFR (XS): 8217371: C2: Incorrect LP64 guard in x86.ad after JDK-8210764 (Update avx512 implementation)
> 
> Hi Sandhya,
> 
> Can you also run Lucene tests which hit previous avx512 issues on SKX and Knights?
> This is spilling code and it is used when a lot of xmm registers are used and our jtreg tests may not use this code.
> 
> You can push after that.
> 
> Thanks,
> Vladimir
> 
> On 1/28/19 9:22 AM, Viswanathan, Sandhya wrote:
>> Thanks Vladimir, Tobias and Nils.
>>
>> Could Vivek go ahead and push it?
>>
>> Best Regards,
>> Sandhya
>>
>>
>> -----Original Message-----
>> From: Vladimir Kozlov [mailto:vladimir.kozlov at oracle.com]
>> Sent: Monday, January 28, 2019 9:17 AM
>> To: hotspot-compiler-dev at openjdk.java.net; Viswanathan, Sandhya
>> <sandhya.viswanathan at intel.com>
>> Subject: Re: RFR (XS): 8217371: C2: Incorrect LP64 guard in x86.ad
>> after JDK-8210764 (Update avx512 implementation)
>>
>> +1
>>
>> Thanks,
>> Vladimir
>>
>> On 1/28/19 2:00 AM, Tobias Hartmann wrote:
>>> Hi Sandhya,
>>>
>>> looks good to me too.
>>>
>>> Best regards,
>>> Tobias
>>>
>>> On 28.01.19 09:40, Nils Eliasson wrote:
>>>> Hi Sandhya,
>>>>
>>>> Looks good,
>>>>
>>>> Regards,
>>>>
>>>> Nils
>>>>
>>>> On 2019-01-27 04:47, Viswanathan, Sandhya wrote:
>>>>>
>>>>> Hi All,
>>>>>
>>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8217371
>>>>> <https://bugs.openjdk.java.net/browse/JDK-8217371>
>>>>>
>>>>> Webrev: http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/
>>>>> <http://cr.openjdk.java.net/~sviswanathan/8217371/webrev.00/>
>>>>>
>>>>> The above webrev fixes the incorrect LP64 guard issue in x86.ad file.
>>>>>
>>>>> I have corrected the guard to _LP64 and updated the spill/fill
>>>>> instructions.? This bug only affected the Knights family where AVX512VL is not supported.
>>>>>     
>>>>> I have tested it on SKX and Knights family with compiler jtreg tests.
>>>>> Please review.
>>>>>
>>>>>
>>>>> Best Regards,
>>>>> Sandhya
>>>>>

From igor.ignatyev at oracle.com  Thu Jan 31 00:58:45 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 30 Jan 2019 16:58:45 -0800
Subject: RFR(T)[12] : 8178798 : Two compiler/aot/verification/vmflags tests
 fail by timeout with UseAVX=3
Message-ID: <DC48D196-47B2-4027-93DC-FC432F8CC934@oracle.com>

http://cr.openjdk.java.net/~iignatyev//8178798/webrev.00/index.html
> 3 lines changed: 2 ins; 0 del; 1 mod; 

Hi all,

could you please review this small and trivial patch for aot jtreg tests? aot tests timeout when they are run w/ Xcomp as jaotc execution becomes too slow, the patch forces jaotc to be always run w/ Xmixed.

webrev: http://cr.openjdk.java.net/~iignatyev//8178798/webrev.00/index.html
JBS: https://bugs.openjdk.java.net/browse/JDK-8178798
testing: test/hotspot/jtreg/compiler/aot w/ and w/o Xcomp

Thanks,
-- Igor
 

From vladimir.kozlov at oracle.com  Thu Jan 31 01:12:50 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 30 Jan 2019 17:12:50 -0800
Subject: RFR(T)[12] : 8178798 : Two compiler/aot/verification/vmflags
 tests fail by timeout with UseAVX=3
In-Reply-To: <9771b139-9367-6d85-5fd2-ca2211b3b394@oracle.com>
References: <DC48D196-47B2-4027-93DC-FC432F8CC934@oracle.com>
 <9771b139-9367-6d85-5fd2-ca2211b3b394@oracle.com>
Message-ID: <c1665c84-852e-9eb9-a78c-c82ef938a80c@oracle.com>

On 1/30/19 5:10 PM, Vladimir Kozlov wrote:
> Yes, we should not run jaotc with Xcomp (until we have libgraal.so).
> compiler/aot tests are explicitly listed in not_xcopm group [1].
> Unfortunately it does not prevent running these tests with -Xcomp in testing infrastructures which does not use this info.
> Note, this bug 8178798 was filed (2017-04-14) before aot tested were moved to not_xcomp group [2] (2018-03-22).
> 
> I am fine with these changes which guarantee that we would not run compiler/aot regardless testing environment.

would not run *with -Xcomp* compiler/aot tests

> Reviewed.
> 
> Thanks,
> Vladimir
> 
> [1] http://hg.openjdk.java.net/jdk/jdk/file/dfacdb971494/test/hotspot/jtreg/TEST.groups#l143
> [2] https://bugs.openjdk.java.net/browse/JDK-8199212
> 
> On 1/30/19 4:58 PM, Igor Ignatyev wrote:
>> http://cr.openjdk.java.net/~iignatyev//8178798/webrev.00/index.html
>>> 3 lines changed: 2 ins; 0 del; 1 mod;
>>
>> Hi all,
>>
>> could you please review this small and trivial patch for aot jtreg tests? aot tests timeout when they are run w/ Xcomp 
>> as jaotc execution becomes too slow, the patch forces jaotc to be always run w/ Xmixed.
>>
>> webrev: http://cr.openjdk.java.net/~iignatyev//8178798/webrev.00/index.html
>> JBS: https://bugs.openjdk.java.net/browse/JDK-8178798
>> testing: test/hotspot/jtreg/compiler/aot w/ and w/o Xcomp
>>
>> Thanks,
>> -- Igor
>>

From vladimir.kozlov at oracle.com  Thu Jan 31 01:10:57 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Wed, 30 Jan 2019 17:10:57 -0800
Subject: RFR(T)[12] : 8178798 : Two compiler/aot/verification/vmflags
 tests fail by timeout with UseAVX=3
In-Reply-To: <DC48D196-47B2-4027-93DC-FC432F8CC934@oracle.com>
References: <DC48D196-47B2-4027-93DC-FC432F8CC934@oracle.com>
Message-ID: <9771b139-9367-6d85-5fd2-ca2211b3b394@oracle.com>

Yes, we should not run jaotc with Xcomp (until we have libgraal.so).
compiler/aot tests are explicitly listed in not_xcopm group [1].
Unfortunately it does not prevent running these tests with -Xcomp in testing infrastructures which does not use this info.
Note, this bug 8178798 was filed (2017-04-14) before aot tested were moved to not_xcomp group [2] (2018-03-22).

I am fine with these changes which guarantee that we would not run compiler/aot regardless testing environment.
Reviewed.

Thanks,
Vladimir

[1] http://hg.openjdk.java.net/jdk/jdk/file/dfacdb971494/test/hotspot/jtreg/TEST.groups#l143
[2] https://bugs.openjdk.java.net/browse/JDK-8199212

On 1/30/19 4:58 PM, Igor Ignatyev wrote:
> http://cr.openjdk.java.net/~iignatyev//8178798/webrev.00/index.html
>> 3 lines changed: 2 ins; 0 del; 1 mod;
> 
> Hi all,
> 
> could you please review this small and trivial patch for aot jtreg tests? aot tests timeout when they are run w/ Xcomp as jaotc execution becomes too slow, the patch forces jaotc to be always run w/ Xmixed.
> 
> webrev: http://cr.openjdk.java.net/~iignatyev//8178798/webrev.00/index.html
> JBS: https://bugs.openjdk.java.net/browse/JDK-8178798
> testing: test/hotspot/jtreg/compiler/aot w/ and w/o Xcomp
> 
> Thanks,
> -- Igor
>   
> 

From igor.ignatyev at oracle.com  Thu Jan 31 01:34:37 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 30 Jan 2019 17:34:37 -0800
Subject: RFR (12) JDK-8218025: disable pop_frame and force_early_return
 caps for Graal
In-Reply-To: <6daa70b3-d10c-afea-b775-12e52d7e2d58@oracle.com>
References: <6daa70b3-d10c-afea-b775-12e52d7e2d58@oracle.com>
Message-ID: <919AB047-1F67-45BB-80DD-205672FFE49F@oracle.com>

Hi Alex,

UseJVMCICompiler is declared only if INCLUDE_JVMCI is defined, so your current patch will break builds where it's undefined. the rest (esp. problem list ;) ) looks good to me.

adding hotspot-compiler alias.

Thanks,
-- Igor

> On Jan 30, 2019, at 5:27 PM, Alex Menkov <alexey.menkov at oracle.com> wrote:
> 
> Hi all,
> 
> Please review a fix for tck-red bug:
> https://bugs.openjdk.java.net/browse/JDK-8218025
> webrev:
> http://cr.openjdk.java.net/~amenkov/tck_red_disable_caps/webrev/
> 
> ForceEarlyReturn and PopFrame JCK tests intermittently fail with Graal.
> Real fix for the issue is too risky for jdk12, so we have to disable pop_frame and force_early_return capabilities running with Graal (the capabilities are optional).
> Currently Graal is the only compiler, so the fix checks if JVMCI compiler is enabled.
> JCK test passes with disabled capabilities.
> A number of hotspot tests do not check if the capabilities are enabled (as hotspot is expected to support all capabilities) and fail with Graal - they are problem-listed until the real problem (JDK-8195635) is resolved.
> 
> --alex


From serguei.spitsyn at oracle.com  Thu Jan 31 04:46:53 2019
From: serguei.spitsyn at oracle.com (serguei.spitsyn at oracle.com)
Date: Wed, 30 Jan 2019 20:46:53 -0800
Subject: RFR (12) JDK-8218025: disable pop_frame and force_early_return
 caps for Graal
In-Reply-To: <919AB047-1F67-45BB-80DD-205672FFE49F@oracle.com>
References: <6daa70b3-d10c-afea-b775-12e52d7e2d58@oracle.com>
 <919AB047-1F67-45BB-80DD-205672FFE49F@oracle.com>
Message-ID: <9069d34b-ad8e-17ce-c5f4-8765730d6497@oracle.com>

Hi Alex,

Vladimir I. also mentioned the same as Igor.

One example from the runtime/thread.cpp:
#if INCLUDE_JVMCI
 ? bool force_JVMCI_intialization = false;
 ? if (EnableJVMCI) {
 ??? // Initialize JVMCI eagerly when it is explicitly requested.
 ??? // Or when JVMCIPrintProperties is enabled.
 ??? // The JVMCI Java initialization code will read this flag and
 ??? // do the printing if it's set.
 ??? force_JVMCI_intialization = EagerJVMCI || JVMCIPrintProperties;

 ??? if (!force_JVMCI_intialization) {
 ????? // 8145270: Force initialization of JVMCI runtime otherwise 
requests for blocking
 ????? // compilations via JVMCI will not actually block until JVMCI is 
initialized.
 ????? force_JVMCI_intialization = UseJVMCICompiler && (!UseInterpreter 
|| !BackgroundCompilation);
 ??? }
 ? }
#endif

One more example from prims/jni.cpp:

#if INCLUDE_JVMCI
 ??? if (EnableJVMCI) {
 ????? if (UseJVMCICompiler) {
 ??????? . . .
 ????? }
 ??? }
#endif


On 1/30/19 17:34, Igor Ignatyev wrote:
> Hi Alex,
>
> UseJVMCICompiler is declared only if INCLUDE_JVMCI is defined, so your current patch will break builds where it's undefined. the rest (esp. problem list ;) ) looks good to me.
>
> adding hotspot-compiler alias.
>
> Thanks,
> -- Igor
>
>> On Jan 30, 2019, at 5:27 PM, Alex Menkov <alexey.menkov at oracle.com> wrote:
>>
>> Hi all,
>>
>> Please review a fix for tck-red bug:
>> https://bugs.openjdk.java.net/browse/JDK-8218025
>> webrev:
>> http://cr.openjdk.java.net/~amenkov/tck_red_disable_caps/webrev/
>>
>> ForceEarlyReturn and PopFrame JCK tests intermittently fail with Graal.
>> Real fix for the issue is too risky for jdk12, so we have to disable pop_frame and force_early_return capabilities running with Graal (the capabilities are optional).
>> Currently Graal is the only compiler, so the fix checks if JVMCI compiler is enabled.
>> JCK test passes with disabled capabilities.
>> A number of hotspot tests do not check if the capabilities are enabled (as hotspot is expected to support all capabilities) and fail with Graal - they are problem-listed until the real problem (JDK-8195635) is resolved.
>>
>> --alex


From serguei.spitsyn at oracle.com  Thu Jan 31 04:59:55 2019
From: serguei.spitsyn at oracle.com (serguei.spitsyn at oracle.com)
Date: Wed, 30 Jan 2019 20:59:55 -0800
Subject: RFR (12) JDK-8218025: disable pop_frame and force_early_return
 caps for Graal
In-Reply-To: <9069d34b-ad8e-17ce-c5f4-8765730d6497@oracle.com>
References: <6daa70b3-d10c-afea-b775-12e52d7e2d58@oracle.com>
 <919AB047-1F67-45BB-80DD-205672FFE49F@oracle.com>
 <9069d34b-ad8e-17ce-c5f4-8765730d6497@oracle.com>
Message-ID: <7afec9a8-602c-9d5b-f899-b806595ad859@oracle.com>

An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190130/aaae98c2/attachment.html>

From dean.long at oracle.com  Thu Jan 31 05:18:05 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Wed, 30 Jan 2019 21:18:05 -0800
Subject: RFR (12) JDK-8218025: disable pop_frame and force_early_return
 caps for Graal
In-Reply-To: <7afec9a8-602c-9d5b-f899-b806595ad859@oracle.com>
References: <6daa70b3-d10c-afea-b775-12e52d7e2d58@oracle.com>
 <919AB047-1F67-45BB-80DD-205672FFE49F@oracle.com>
 <9069d34b-ad8e-17ce-c5f4-8765730d6497@oracle.com>
 <7afec9a8-602c-9d5b-f899-b806595ad859@oracle.com>
Message-ID: <6e609bd6-ef09-3323-9225-255a0e750aa3@oracle.com>

On 1/30/19 8:59 PM, serguei.spitsyn at oracle.com wrote:
> So, the fix needs to be more like this:
> + // Workaround for 8195635:
> + // disable pop_frame and force_early_return capabilities with Graal
> + #if INCLUDE_JVMCI
> + if (!(EnableJVMCI && UseJVMCICompiler)) {
>     jc.can_pop_frame = 1;
>     jc.can_force_early_return = 1;
> + } + #endif Not sure, if the check for EnableJVMCI can be removed above.

We still need it to work when INCLUDE_JVMCI is not defined.
How about

JVMCI_ONLY(if (UseJVMCICompiler)) {
...
}

or

if (JVMCI_ONLY(UseJVMCICompiler) NOT_JVMCI(true)) {
...
}

dl

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190130/bc6b16cb/attachment.html>

From igor.ignatyev at oracle.com  Thu Jan 31 05:22:45 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 30 Jan 2019 21:22:45 -0800
Subject: RFR (12) JDK-8218025: disable pop_frame and force_early_return
 caps for Graal
In-Reply-To: <6e609bd6-ef09-3323-9225-255a0e750aa3@oracle.com>
References: <6daa70b3-d10c-afea-b775-12e52d7e2d58@oracle.com>
 <919AB047-1F67-45BB-80DD-205672FFE49F@oracle.com>
 <9069d34b-ad8e-17ce-c5f4-8765730d6497@oracle.com>
 <7afec9a8-602c-9d5b-f899-b806595ad859@oracle.com>
 <6e609bd6-ef09-3323-9225-255a0e750aa3@oracle.com>
Message-ID: <690A8E3B-1B24-4193-9B93-F21B687FE57B@oracle.com>

from my point of view, having the following code at the end of JvmtiManageCapabilities::init_onload_capabilities is much clear and easier to understand:

> // Workaround for 8195635: disable pop_frame and force_early_return capabilities
> #if INCLUDE_JVMCI
>   if (UseJVMCICompiler) {
>     jc.can_pop_frame = 0;
>     jc.can_force_early_return = 0;
>   }
> #endif // INCLUDE_JVMCI


Thanks,
-- Igor

> On Jan 30, 2019, at 9:18 PM, dean.long at oracle.com wrote:
> 
> On 1/30/19 8:59 PM, serguei.spitsyn at oracle.com <mailto:serguei.spitsyn at oracle.com> wrote:
>> So, the fix needs to be more like this:
>> +  // Workaround for 8195635:
>> +  // disable pop_frame and force_early_return capabilities with Graal
>> + #if INCLUDE_JVMCI
>> +  if (!(EnableJVMCI && UseJVMCICompiler)) {
>>    jc.can_pop_frame = 1;
>>    jc.can_force_early_return = 1;
>> +  }
>> + #endif
>> 
>> Not sure, if the check for EnableJVMCI can be removed above.
> 
> We still need it to work when INCLUDE_JVMCI is not defined.
> How about
> 
> JVMCI_ONLY(if (UseJVMCICompiler)) {
> ...
> }
> 
> or
> 
> if (JVMCI_ONLY(UseJVMCICompiler) NOT_JVMCI(true)) {
> ...
> }
> 
> dl
> 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190130/e37df6ae/attachment.html>

From david.holmes at oracle.com  Thu Jan 31 05:24:37 2019
From: david.holmes at oracle.com (David Holmes)
Date: Thu, 31 Jan 2019 15:24:37 +1000
Subject: RFR (12) JDK-8218025: disable pop_frame and force_early_return
 caps for Graal
In-Reply-To: <6e609bd6-ef09-3323-9225-255a0e750aa3@oracle.com>
References: <6daa70b3-d10c-afea-b775-12e52d7e2d58@oracle.com>
 <919AB047-1F67-45BB-80DD-205672FFE49F@oracle.com>
 <9069d34b-ad8e-17ce-c5f4-8765730d6497@oracle.com>
 <7afec9a8-602c-9d5b-f899-b806595ad859@oracle.com>
 <6e609bd6-ef09-3323-9225-255a0e750aa3@oracle.com>
Message-ID: <95ed0aa0-7d8c-b448-fd0d-c25b930cafcb@oracle.com>

On 31/01/2019 3:18 pm, dean.long at oracle.com wrote:
> On 1/30/19 8:59 PM, serguei.spitsyn at oracle.com wrote:
>> So, the fix needs to be more like this:
>> + // Workaround for 8195635:
>> + // disable pop_frame and force_early_return capabilities with Graal
>> + #if INCLUDE_JVMCI
>> + if (!(EnableJVMCI && UseJVMCICompiler)) {
>>     jc.can_pop_frame = 1;
>>     jc.can_force_early_return = 1;
>> + } + #endif Not sure, if the check for EnableJVMCI can be removed above.
> 
> We still need it to work when INCLUDE_JVMCI is not defined.
> How about
> 
> JVMCI_ONLY(if (UseJVMCICompiler)) {
> ...
> }
> 
> or
> 
> if (JVMCI_ONLY(UseJVMCICompiler) NOT_JVMCI(true)) {
> ...
> }

Or just turn them on unconditionally first and turn off explicitly for 
JVMCI:

  jc.can_pop_frame = 1;
  jc.can_force_early_return = 1;
+ #if INCLUDE_JVMCI
+  // Workaround for 8195635:
+  // disable pop_frame and force_early_return capabilities with Graal
+ if (EnableJVMCI && UseJVMCICompiler) {
+     jc.can_pop_frame = 0;
+     jc.can_force_early_return = 0;
+ }
+ #endif

David

> dl
> 

From serguei.spitsyn at oracle.com  Thu Jan 31 05:31:39 2019
From: serguei.spitsyn at oracle.com (serguei.spitsyn at oracle.com)
Date: Wed, 30 Jan 2019 21:31:39 -0800
Subject: RFR (12) JDK-8218025: disable pop_frame and force_early_return
 caps for Graal
In-Reply-To: <95ed0aa0-7d8c-b448-fd0d-c25b930cafcb@oracle.com>
References: <6daa70b3-d10c-afea-b775-12e52d7e2d58@oracle.com>
 <919AB047-1F67-45BB-80DD-205672FFE49F@oracle.com>
 <9069d34b-ad8e-17ce-c5f4-8765730d6497@oracle.com>
 <7afec9a8-602c-9d5b-f899-b806595ad859@oracle.com>
 <6e609bd6-ef09-3323-9225-255a0e750aa3@oracle.com>
 <95ed0aa0-7d8c-b448-fd0d-c25b930cafcb@oracle.com>
Message-ID: <148c8531-6f2b-142d-cae0-a4d5bc2de970@oracle.com>

On 1/30/19 21:24, David Holmes wrote:
> On 31/01/2019 3:18 pm, dean.long at oracle.com wrote:
>> On 1/30/19 8:59 PM, serguei.spitsyn at oracle.com wrote:
>>> So, the fix needs to be more like this:
>>> + // Workaround for 8195635:
>>> + // disable pop_frame and force_early_return capabilities with Graal
>>> + #if INCLUDE_JVMCI
>>> + if (!(EnableJVMCI && UseJVMCICompiler)) {
>>> ??? jc.can_pop_frame = 1;
>>> ??? jc.can_force_early_return = 1;
>>> + } + #endif Not sure, if the check for EnableJVMCI can be removed 
>>> above.
>>
>> We still need it to work when INCLUDE_JVMCI is not defined.
>> How about
>>
>> JVMCI_ONLY(if (UseJVMCICompiler)) {
>> ...
>> }
>>
>> or
>>
>> if (JVMCI_ONLY(UseJVMCICompiler) NOT_JVMCI(true)) {
>> ...
>> }
>
> Or just turn them on unconditionally first and turn off explicitly for 
> JVMCI:
>
> ?jc.can_pop_frame = 1;
> ?jc.can_force_early_return = 1;
> + #if INCLUDE_JVMCI
> +? // Workaround for 8195635:
> +? // disable pop_frame and force_early_return capabilities with Graal
> + if (EnableJVMCI && UseJVMCICompiler) {
> +???? jc.can_pop_frame = 0;
> +???? jc.can_force_early_return = 0;
> + }
> + #endif
>
Oh, Dean is right.
We need these caps initialized even if the macro INCLUDE_JVMCI is undefined.
Then I like variant from David above.

Thanks,
Serguei


> David
>
>> dl
>>


From serguei.spitsyn at oracle.com  Thu Jan 31 05:52:51 2019
From: serguei.spitsyn at oracle.com (serguei.spitsyn at oracle.com)
Date: Wed, 30 Jan 2019 21:52:51 -0800
Subject: RFR (12) JDK-8218025: disable pop_frame and force_early_return
 caps for Graal
In-Reply-To: <690A8E3B-1B24-4193-9B93-F21B687FE57B@oracle.com>
References: <6daa70b3-d10c-afea-b775-12e52d7e2d58@oracle.com>
 <919AB047-1F67-45BB-80DD-205672FFE49F@oracle.com>
 <9069d34b-ad8e-17ce-c5f4-8765730d6497@oracle.com>
 <7afec9a8-602c-9d5b-f899-b806595ad859@oracle.com>
 <6e609bd6-ef09-3323-9225-255a0e750aa3@oracle.com>
 <690A8E3B-1B24-4193-9B93-F21B687FE57B@oracle.com>
Message-ID: <9abeb866-43a5-cec9-27be-cae0642d5835@oracle.com>

An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190130/9ef4d8c3/attachment.html>

From igor.ignatyev at oracle.com  Thu Jan 31 05:53:01 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Wed, 30 Jan 2019 21:53:01 -0800
Subject: RFR(S) [13] : 8217848 : [Graal]
 vmTestbase/nsk/jvmti/ResourceExhausted/resexhausted003/TestDescription.java
 fails
Message-ID: <A3D86F3A-AEEA-421E-810F-2CF730713C58@oracle.com>

http://cr.openjdk.java.net/~iignatyev//8217848/webrev.00/index.html
> 2 lines changed: 0 ins; 0 del; 2 mod;

Hi all,

could you please review this small fix? the test fails w/ Graal b/c it sets MaxMetaspaceSize=9m, but when we run w/ JVMCI compiler we increase default value of MetaspaceSize. the fix makes sure we don't set MetaspaceSize greater than MaxMetaspaceSize.

webrev: http://cr.openjdk.java.net/~iignatyev//8217848/webrev.00/index.html
JBS: https://bugs.openjdk.java.net/browse/JDK-8217848
testing: 
 - vmTestbase/nsk/jvmti/ResourceExhausted tests w/ enabled and disabled Graal
 - java -XX:MaxMetaspaceSize=9m -version w/ enabled and disabled Graal

Thanks,
-- Igor

From bsrbnd at gmail.com  Thu Jan 31 11:46:13 2019
From: bsrbnd at gmail.com (B. Blaser)
Date: Thu, 31 Jan 2019 12:46:13 +0100
Subject: [PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics
In-Reply-To: <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A55B2B@FMSMSX126.amr.corp.intel.com>
References: <A66BBE673E08E1428E3A918AE4D5B32CED489E@BGSMSX106.gar.corp.intel.com>
 <a02302f8-40c1-9758-eb1d-3598f5dd1f3d@redhat.com>
 <A66BBE673E08E1428E3A918AE4D5B32CED49B6@BGSMSX106.gar.corp.intel.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A54F39@FMSMSX126.amr.corp.intel.com>
 <fecf3d19-4d30-6a67-a4ce-d90f6df08f10@oracle.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A550D5@FMSMSX126.amr.corp.intel.com>
 <CAEgw74CVwJEqxO9KVFR9KetFgYK+wkDS4UOam7sOMnc5Kk-0Ww@mail.gmail.com>
 <02FCFB8477C4EF43A2AD8E0C60F3DA2BB1A55B2B@FMSMSX126.amr.corp.intel.com>
Message-ID: <CAEgw74Cm9O4Lv9ZFGJYPwzfMRfGHYrBbqpegLKFJbnYR9M7dUg@mail.gmail.com>

On Wed, 30 Jan 2019 at 21:26, Viswanathan, Sandhya
<sandhya.viswanathan at intel.com> wrote:
>
> Hi Bernard,
>
> Thanks a lot for your feedback. Let me try to answer your questions below.
>
> We also started with the same assumption that we may not be able to easily improve the current implementation on x86 because MIN/MAX instructions don't conform to the Java doc for 0.0 and NaN.
> Jatin took this as a challenge and came up with a sequence that does show benefit.
> Our performance run shows about 30% gain with this patch vs the ucomisd sequence generated by the jitted code.
> As you suggest, we could use scalar instructions for max, min and cmp instead of using the packed variant.
> But the blend has to be the packed variant as there is no scalar flavor for that and so changing the others to scalar flavor may not show much perf change, we will confirm both ways.
> The path that won't show much benefit or may show some regression is when both the operands are NaN which is not frequently occurring case I would think.
>
> Jatin is going to send updated patch fixing the issue reported by Nils. We will include performance numbers along with the updated patch.
>
> Best Regards,
> Sandhya

Thanks for your answers, we'll wait for Jatin's fixes and measures.
I'll check the updated patch once more but I like this idea and I hope
most paths will be faster.
Please let me know if you need help to push the patch providing that
you get a Reviewer approval.

Regards,
Bernard

From vladimir.kozlov at oracle.com  Thu Jan 31 18:25:10 2019
From: vladimir.kozlov at oracle.com (Vladimir Kozlov)
Date: Thu, 31 Jan 2019 10:25:10 -0800
Subject: RFR(S) [13] : 8217848 : [Graal]
 vmTestbase/nsk/jvmti/ResourceExhausted/resexhausted003/TestDescription.java
 fails
In-Reply-To: <A3D86F3A-AEEA-421E-810F-2CF730713C58@oracle.com>
References: <A3D86F3A-AEEA-421E-810F-2CF730713C58@oracle.com>
Message-ID: <5026b741-78be-72bb-062a-59216e6f903c@oracle.com>

Yes, this is correct fix. Later if a test set MaxMetaspaceSize too small to run with Graal we will update test.

thanks,
Vladimir

On 1/30/19 9:53 PM, Igor Ignatyev wrote:
> http://cr.openjdk.java.net/~iignatyev//8217848/webrev.00/index.html
>> 2 lines changed: 0 ins; 0 del; 2 mod;
> 
> Hi all,
> 
> could you please review this small fix? the test fails w/ Graal b/c it sets MaxMetaspaceSize=9m, but when we run w/ JVMCI compiler we increase default value of MetaspaceSize. the fix makes sure we don't set MetaspaceSize greater than MaxMetaspaceSize.
> 
> webrev: http://cr.openjdk.java.net/~iignatyev//8217848/webrev.00/index.html
> JBS: https://bugs.openjdk.java.net/browse/JDK-8217848
> testing:
>   - vmTestbase/nsk/jvmti/ResourceExhausted tests w/ enabled and disabled Graal
>   - java -XX:MaxMetaspaceSize=9m -version w/ enabled and disabled Graal
> 
> Thanks,
> -- Igor
> 

From alexey.menkov at oracle.com  Thu Jan 31 18:38:20 2019
From: alexey.menkov at oracle.com (Alex Menkov)
Date: Thu, 31 Jan 2019 10:38:20 -0800
Subject: RFR (12) JDK-8218025: disable pop_frame and force_early_return
 caps for Graal
In-Reply-To: <148c8531-6f2b-142d-cae0-a4d5bc2de970@oracle.com>
References: <6daa70b3-d10c-afea-b775-12e52d7e2d58@oracle.com>
 <919AB047-1F67-45BB-80DD-205672FFE49F@oracle.com>
 <9069d34b-ad8e-17ce-c5f4-8765730d6497@oracle.com>
 <7afec9a8-602c-9d5b-f899-b806595ad859@oracle.com>
 <6e609bd6-ef09-3323-9225-255a0e750aa3@oracle.com>
 <95ed0aa0-7d8c-b448-fd0d-c25b930cafcb@oracle.com>
 <148c8531-6f2b-142d-cae0-a4d5bc2de970@oracle.com>
Message-ID: <f866b956-d40c-5689-49d8-afbbf5a69b07@oracle.com>

Hi guys,

thank you for the feedback.

updated webrev (used the way suggested by David & Igor):
http://cr.openjdk.java.net/~amenkov/tck_red_disable_caps/webrev.02/

--alex

On 01/30/2019 21:31, serguei.spitsyn at oracle.com wrote:
> On 1/30/19 21:24, David Holmes wrote:
>> On 31/01/2019 3:18 pm, dean.long at oracle.com wrote:
>>> On 1/30/19 8:59 PM, serguei.spitsyn at oracle.com wrote:
>>>> So, the fix needs to be more like this:
>>>> + // Workaround for 8195635:
>>>> + // disable pop_frame and force_early_return capabilities with Graal
>>>> + #if INCLUDE_JVMCI
>>>> + if (!(EnableJVMCI && UseJVMCICompiler)) {
>>>> ??? jc.can_pop_frame = 1;
>>>> ??? jc.can_force_early_return = 1;
>>>> + } + #endif Not sure, if the check for EnableJVMCI can be removed 
>>>> above.
>>>
>>> We still need it to work when INCLUDE_JVMCI is not defined.
>>> How about
>>>
>>> JVMCI_ONLY(if (UseJVMCICompiler)) {
>>> ...
>>> }
>>>
>>> or
>>>
>>> if (JVMCI_ONLY(UseJVMCICompiler) NOT_JVMCI(true)) {
>>> ...
>>> }
>>
>> Or just turn them on unconditionally first and turn off explicitly for 
>> JVMCI:
>>
>> ?jc.can_pop_frame = 1;
>> ?jc.can_force_early_return = 1;
>> + #if INCLUDE_JVMCI
>> +? // Workaround for 8195635:
>> +? // disable pop_frame and force_early_return capabilities with Graal
>> + if (EnableJVMCI && UseJVMCICompiler) {
>> +???? jc.can_pop_frame = 0;
>> +???? jc.can_force_early_return = 0;
>> + }
>> + #endif
>>
> Oh, Dean is right.
> We need these caps initialized even if the macro INCLUDE_JVMCI is 
> undefined.
> Then I like variant from David above.
> 
> Thanks,
> Serguei
> 
> 
>> David
>>
>>> dl
>>>
> 

From igor.ignatyev at oracle.com  Thu Jan 31 18:52:05 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Thu, 31 Jan 2019 10:52:05 -0800
Subject: RFR (12) JDK-8218025: disable pop_frame and force_early_return
 caps for Graal
In-Reply-To: <f866b956-d40c-5689-49d8-afbbf5a69b07@oracle.com>
References: <6daa70b3-d10c-afea-b775-12e52d7e2d58@oracle.com>
 <919AB047-1F67-45BB-80DD-205672FFE49F@oracle.com>
 <9069d34b-ad8e-17ce-c5f4-8765730d6497@oracle.com>
 <7afec9a8-602c-9d5b-f899-b806595ad859@oracle.com>
 <6e609bd6-ef09-3323-9225-255a0e750aa3@oracle.com>
 <95ed0aa0-7d8c-b448-fd0d-c25b930cafcb@oracle.com>
 <148c8531-6f2b-142d-cae0-a4d5bc2de970@oracle.com>
 <f866b956-d40c-5689-49d8-afbbf5a69b07@oracle.com>
Message-ID: <3CD7BD6E-5FEE-4A0E-9A60-993B69DFA695@oracle.com>

Hi Alex,

you have 'if INCLUDE_JVMCI' inside 'ifndef ZERO', although we currently don't have zero builds w/ JVMCI (and I don't think such builds would make much sense), it's better not to rely on that.  not a blocker from my point of view though.

Thanks,
-- Igor

> On Jan 31, 2019, at 10:38 AM, Alex Menkov <alexey.menkov at oracle.com> wrote:
> 
> Hi guys,
> 
> thank you for the feedback.
> 
> updated webrev (used the way suggested by David & Igor):
> http://cr.openjdk.java.net/~amenkov/tck_red_disable_caps/webrev.02/
> 
> --alex
> 
> On 01/30/2019 21:31, serguei.spitsyn at oracle.com wrote:
>> On 1/30/19 21:24, David Holmes wrote:
>>> On 31/01/2019 3:18 pm, dean.long at oracle.com wrote:
>>>> On 1/30/19 8:59 PM, serguei.spitsyn at oracle.com wrote:
>>>>> So, the fix needs to be more like this:
>>>>> + // Workaround for 8195635:
>>>>> + // disable pop_frame and force_early_return capabilities with Graal
>>>>> + #if INCLUDE_JVMCI
>>>>> + if (!(EnableJVMCI && UseJVMCICompiler)) {
>>>>>     jc.can_pop_frame = 1;
>>>>>     jc.can_force_early_return = 1;
>>>>> + } + #endif Not sure, if the check for EnableJVMCI can be removed above.
>>>> 
>>>> We still need it to work when INCLUDE_JVMCI is not defined.
>>>> How about
>>>> 
>>>> JVMCI_ONLY(if (UseJVMCICompiler)) {
>>>> ...
>>>> }
>>>> 
>>>> or
>>>> 
>>>> if (JVMCI_ONLY(UseJVMCICompiler) NOT_JVMCI(true)) {
>>>> ...
>>>> }
>>> 
>>> Or just turn them on unconditionally first and turn off explicitly for JVMCI:
>>> 
>>>  jc.can_pop_frame = 1;
>>>  jc.can_force_early_return = 1;
>>> + #if INCLUDE_JVMCI
>>> +  // Workaround for 8195635:
>>> +  // disable pop_frame and force_early_return capabilities with Graal
>>> + if (EnableJVMCI && UseJVMCICompiler) {
>>> +     jc.can_pop_frame = 0;
>>> +     jc.can_force_early_return = 0;
>>> + }
>>> + #endif
>>> 
>> Oh, Dean is right.
>> We need these caps initialized even if the macro INCLUDE_JVMCI is undefined.
>> Then I like variant from David above.
>> Thanks,
>> Serguei
>>> David
>>> 
>>>> dl
>>>> 


From alexey.menkov at oracle.com  Thu Jan 31 18:59:46 2019
From: alexey.menkov at oracle.com (Alex Menkov)
Date: Thu, 31 Jan 2019 10:59:46 -0800
Subject: RFR (12) JDK-8218025: disable pop_frame and force_early_return
 caps for Graal
In-Reply-To: <3CD7BD6E-5FEE-4A0E-9A60-993B69DFA695@oracle.com>
References: <6daa70b3-d10c-afea-b775-12e52d7e2d58@oracle.com>
 <919AB047-1F67-45BB-80DD-205672FFE49F@oracle.com>
 <9069d34b-ad8e-17ce-c5f4-8765730d6497@oracle.com>
 <7afec9a8-602c-9d5b-f899-b806595ad859@oracle.com>
 <6e609bd6-ef09-3323-9225-255a0e750aa3@oracle.com>
 <95ed0aa0-7d8c-b448-fd0d-c25b930cafcb@oracle.com>
 <148c8531-6f2b-142d-cae0-a4d5bc2de970@oracle.com>
 <f866b956-d40c-5689-49d8-afbbf5a69b07@oracle.com>
 <3CD7BD6E-5FEE-4A0E-9A60-993B69DFA695@oracle.com>
Message-ID: <5cf35843-1781-4b16-0309-bf2545a11208@oracle.com>


On 01/31/2019 10:52, Igor Ignatyev wrote:
> Hi Alex,
> 
> you have 'if INCLUDE_JVMCI' inside 'ifndef ZERO', although we currently don't have zero builds w/ JVMCI (and I don't think such builds would make much sense), it's better not to rely on that.  not a blocker from my point of view though.

can_pop_frame & can_force_early_return are set only if ZERO is not 
defined (for zero builds the capabilities are not supported), so I added 
disabling logic in the same block.

--alex

> 
> Thanks,
> -- Igor
> 
>> On Jan 31, 2019, at 10:38 AM, Alex Menkov <alexey.menkov at oracle.com> wrote:
>>
>> Hi guys,
>>
>> thank you for the feedback.
>>
>> updated webrev (used the way suggested by David & Igor):
>> http://cr.openjdk.java.net/~amenkov/tck_red_disable_caps/webrev.02/
>>
>> --alex
>>
>> On 01/30/2019 21:31, serguei.spitsyn at oracle.com wrote:
>>> On 1/30/19 21:24, David Holmes wrote:
>>>> On 31/01/2019 3:18 pm, dean.long at oracle.com wrote:
>>>>> On 1/30/19 8:59 PM, serguei.spitsyn at oracle.com wrote:
>>>>>> So, the fix needs to be more like this:
>>>>>> + // Workaround for 8195635:
>>>>>> + // disable pop_frame and force_early_return capabilities with Graal
>>>>>> + #if INCLUDE_JVMCI
>>>>>> + if (!(EnableJVMCI && UseJVMCICompiler)) {
>>>>>>      jc.can_pop_frame = 1;
>>>>>>      jc.can_force_early_return = 1;
>>>>>> + } + #endif Not sure, if the check for EnableJVMCI can be removed above.
>>>>>
>>>>> We still need it to work when INCLUDE_JVMCI is not defined.
>>>>> How about
>>>>>
>>>>> JVMCI_ONLY(if (UseJVMCICompiler)) {
>>>>> ...
>>>>> }
>>>>>
>>>>> or
>>>>>
>>>>> if (JVMCI_ONLY(UseJVMCICompiler) NOT_JVMCI(true)) {
>>>>> ...
>>>>> }
>>>>
>>>> Or just turn them on unconditionally first and turn off explicitly for JVMCI:
>>>>
>>>>   jc.can_pop_frame = 1;
>>>>   jc.can_force_early_return = 1;
>>>> + #if INCLUDE_JVMCI
>>>> +  // Workaround for 8195635:
>>>> +  // disable pop_frame and force_early_return capabilities with Graal
>>>> + if (EnableJVMCI && UseJVMCICompiler) {
>>>> +     jc.can_pop_frame = 0;
>>>> +     jc.can_force_early_return = 0;
>>>> + }
>>>> + #endif
>>>>
>>> Oh, Dean is right.
>>> We need these caps initialized even if the macro INCLUDE_JVMCI is undefined.
>>> Then I like variant from David above.
>>> Thanks,
>>> Serguei
>>>> David
>>>>
>>>>> dl
>>>>>
> 

From vladimir.x.ivanov at oracle.com  Thu Jan 31 19:10:22 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Thu, 31 Jan 2019 11:10:22 -0800
Subject: [13] RFR (S): 8217918: C2: -XX:+AggressiveUnboxing is broken
Message-ID: <3aada868-68e0-627e-1519-535bb8a8f8dd@oracle.com>

https://bugs.openjdk.java.net/browse/JDK-8217918
http://cr.openjdk.java.net/~vlivanov/8217918/webrev.00/

When -XX:+AggressiveUnboxing is enabled, LoadNode::split_through_phi() 
produces Phi nodes with non-negative _inst_mem_id & _inst_id early 
enough, so it breaks PhaseRenumber pass which doesn't support nodes with 
embedded IDs.

Proposed fix tracks nodes with embedded IDs and updates them once 
renumbering pass over the graph is over.

Testing: hs-precheckin-comp, tier1-5

Best regards,
Vladimir Ivanov

From igor.ignatyev at oracle.com  Thu Jan 31 19:09:04 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Thu, 31 Jan 2019 11:09:04 -0800
Subject: RFR (12) JDK-8218025: disable pop_frame and force_early_return
 caps for Graal
In-Reply-To: <5cf35843-1781-4b16-0309-bf2545a11208@oracle.com>
References: <6daa70b3-d10c-afea-b775-12e52d7e2d58@oracle.com>
 <919AB047-1F67-45BB-80DD-205672FFE49F@oracle.com>
 <9069d34b-ad8e-17ce-c5f4-8765730d6497@oracle.com>
 <7afec9a8-602c-9d5b-f899-b806595ad859@oracle.com>
 <6e609bd6-ef09-3323-9225-255a0e750aa3@oracle.com>
 <95ed0aa0-7d8c-b448-fd0d-c25b930cafcb@oracle.com>
 <148c8531-6f2b-142d-cae0-a4d5bc2de970@oracle.com>
 <f866b956-d40c-5689-49d8-afbbf5a69b07@oracle.com>
 <3CD7BD6E-5FEE-4A0E-9A60-993B69DFA695@oracle.com>
 <5cf35843-1781-4b16-0309-bf2545a11208@oracle.com>
Message-ID: <9A426E4F-99BD-4B77-9C99-39783B64135B@oracle.com>

Hi Alex,

sure I understand that, but let's say we decide to add these capabilities to ZERO or remove all ZERO-specific code. I am *not* saying we are going to do that, just speculating. In such cases, your code[1] will require more effort to figure that should be changed comparing to the almost identical code[2]. but as I said I'm fine w/ your current code.

Thanks,
-- Igor


[1]
>  #ifndef ZERO
>    jc.can_pop_frame = 1;
>    jc.can_force_early_return = 1;
> +  // Workaround for 8195635:
> +  // disable pop_frame and force_early_return capabilities with Graal
> +#if INCLUDE_JVMCI
> +  if (UseJVMCICompiler) {
> +    jc.can_pop_frame = 0;
> +    jc.can_force_early_return = 0;
> +  }
> +#endif // INCLUDE_JVMCI
>  #endif // !ZERO

[2]
>  #ifndef ZERO
>    jc.can_pop_frame = 1;
>    jc.can_force_early_return = 1;
>  #endif // !ZERO
> +  // Workaround for 8195635:
> +  // disable pop_frame and force_early_return capabilities with Graal
> +#if INCLUDE_JVMCI
> +  if (UseJVMCICompiler) {
> +    jc.can_pop_frame = 0;
> +    jc.can_force_early_return = 0;
> +  }
> +#endif // INCLUDE_JVMCI


> On Jan 31, 2019, at 10:59 AM, Alex Menkov <alexey.menkov at oracle.com> wrote:
> 
> 
> 
> On 01/31/2019 10:52, Igor Ignatyev wrote:
>> Hi Alex,
>> you have 'if INCLUDE_JVMCI' inside 'ifndef ZERO', although we currently don't have zero builds w/ JVMCI (and I don't think such builds would make much sense), it's better not to rely on that.  not a blocker from my point of view though.
> 
> can_pop_frame & can_force_early_return are set only if ZERO is not defined (for zero builds the capabilities are not supported), so I added disabling logic in the same block.
> 
> --alex
> 
>> Thanks,
>> -- Igor
>>> On Jan 31, 2019, at 10:38 AM, Alex Menkov <alexey.menkov at oracle.com> wrote:
>>> 
>>> Hi guys,
>>> 
>>> thank you for the feedback.
>>> 
>>> updated webrev (used the way suggested by David & Igor):
>>> http://cr.openjdk.java.net/~amenkov/tck_red_disable_caps/webrev.02/
>>> 
>>> --alex
>>> 
>>> On 01/30/2019 21:31, serguei.spitsyn at oracle.com wrote:
>>>> On 1/30/19 21:24, David Holmes wrote:
>>>>> On 31/01/2019 3:18 pm, dean.long at oracle.com wrote:
>>>>>> On 1/30/19 8:59 PM, serguei.spitsyn at oracle.com wrote:
>>>>>>> So, the fix needs to be more like this:
>>>>>>> + // Workaround for 8195635:
>>>>>>> + // disable pop_frame and force_early_return capabilities with Graal
>>>>>>> + #if INCLUDE_JVMCI
>>>>>>> + if (!(EnableJVMCI && UseJVMCICompiler)) {
>>>>>>>     jc.can_pop_frame = 1;
>>>>>>>     jc.can_force_early_return = 1;
>>>>>>> + } + #endif Not sure, if the check for EnableJVMCI can be removed above.
>>>>>> 
>>>>>> We still need it to work when INCLUDE_JVMCI is not defined.
>>>>>> How about
>>>>>> 
>>>>>> JVMCI_ONLY(if (UseJVMCICompiler)) {
>>>>>> ...
>>>>>> }
>>>>>> 
>>>>>> or
>>>>>> 
>>>>>> if (JVMCI_ONLY(UseJVMCICompiler) NOT_JVMCI(true)) {
>>>>>> ...
>>>>>> }
>>>>> 
>>>>> Or just turn them on unconditionally first and turn off explicitly for JVMCI:
>>>>> 
>>>>>  jc.can_pop_frame = 1;
>>>>>  jc.can_force_early_return = 1;
>>>>> + #if INCLUDE_JVMCI
>>>>> +  // Workaround for 8195635:
>>>>> +  // disable pop_frame and force_early_return capabilities with Graal
>>>>> + if (EnableJVMCI && UseJVMCICompiler) {
>>>>> +     jc.can_pop_frame = 0;
>>>>> +     jc.can_force_early_return = 0;
>>>>> + }
>>>>> + #endif
>>>>> 
>>>> Oh, Dean is right.
>>>> We need these caps initialized even if the macro INCLUDE_JVMCI is undefined.
>>>> Then I like variant from David above.
>>>> Thanks,
>>>> Serguei
>>>>> David
>>>>> 
>>>>>> dl
>>>>>> 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20190131/c15dd24d/attachment-0001.html>

From vladimir.x.ivanov at oracle.com  Thu Jan 31 19:16:14 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Thu, 31 Jan 2019 11:16:14 -0800
Subject: [13] RFR (T): 8217919: C2: Enable -XX:+AggressiveUnboxing by default
Message-ID: <0622ef31-ec73-9c1e-dcea-8027bf555cc4@oracle.com>

http://cr.openjdk.java.net/~vlivanov/8217919/webrev.00/
https://bugs.openjdk.java.net/browse/JDK-8217919

Once JDK-8217918 [1] is fixed, I propose to enable 
-XX:+AggressiveUnboxing by default.

The optimization noticeably improves EA for primitive boxes (e.g., 
JDK-8055340 [2]), so it's worth having it turned on. It'll also ensure 
the code won't rot again.

Testing: hs-precheckin-comp, tier1-5

Best regards,
Vladimir Ivanov

[1] https://bugs.openjdk.java.net/browse/JDK-8217918
[2] https://bugs.openjdk.java.net/browse/JDK-8055340

From vladimir.x.ivanov at oracle.com  Thu Jan 31 19:29:59 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Thu, 31 Jan 2019 11:29:59 -0800
Subject: [13] RFR (S): 8188133: C2: Static field accesses in clinit can
 trigger deoptimizations
Message-ID: <fc542cae-4682-4ad9-f0b4-1843ece2f789@oracle.com>

http://cr.openjdk.java.net/~vlivanov/8188133/webrev.00
https://bugs.openjdk.java.net/browse/JDK-8188133

The test case in the bug demonstrates a pathological case with 
long-running static initializer: though it's allowed to access static 
fields from the thread performing the initialization, C2 can't prove 
that in general.

While solving the general problem doesn't seem worth the effort 
(requires barriers on method entries), I propose to extend the logic to 
cover simple cases: when static initializer is the root of the 
compilation, all accesses to static fields of the corresponding class 
are allowed. It extends the coverage to all inlinees from OSR 
compilation of clinit.

Testing: hs-precheckin-comp, tier1-5

Best regards,
Vladimir Ivanov

From serguei.spitsyn at oracle.com  Thu Jan 31 19:39:35 2019
From: serguei.spitsyn at oracle.com (serguei.spitsyn at oracle.com)
Date: Thu, 31 Jan 2019 11:39:35 -0800
Subject: RFR (12) JDK-8218025: disable pop_frame and force_early_return
 caps for Graal
In-Reply-To: <f866b956-d40c-5689-49d8-afbbf5a69b07@oracle.com>
References: <6daa70b3-d10c-afea-b775-12e52d7e2d58@oracle.com>
 <919AB047-1F67-45BB-80DD-205672FFE49F@oracle.com>
 <9069d34b-ad8e-17ce-c5f4-8765730d6497@oracle.com>
 <7afec9a8-602c-9d5b-f899-b806595ad859@oracle.com>
 <6e609bd6-ef09-3323-9225-255a0e750aa3@oracle.com>
 <95ed0aa0-7d8c-b448-fd0d-c25b930cafcb@oracle.com>
 <148c8531-6f2b-142d-cae0-a4d5bc2de970@oracle.com>
 <f866b956-d40c-5689-49d8-afbbf5a69b07@oracle.com>
Message-ID: <79d984ba-f3bd-b329-4d5f-4ba2083e9628@oracle.com>

Hi Alex,

Looks fine to me.

Thanks,
Serguei


On 1/31/19 10:38, Alex Menkov wrote:
> Hi guys,
>
> thank you for the feedback.
>
> updated webrev (used the way suggested by David & Igor):
> http://cr.openjdk.java.net/~amenkov/tck_red_disable_caps/webrev.02/
>
> --alex
>
> On 01/30/2019 21:31, serguei.spitsyn at oracle.com wrote:
>> On 1/30/19 21:24, David Holmes wrote:
>>> On 31/01/2019 3:18 pm, dean.long at oracle.com wrote:
>>>> On 1/30/19 8:59 PM, serguei.spitsyn at oracle.com wrote:
>>>>> So, the fix needs to be more like this:
>>>>> + // Workaround for 8195635:
>>>>> + // disable pop_frame and force_early_return capabilities with Graal
>>>>> + #if INCLUDE_JVMCI
>>>>> + if (!(EnableJVMCI && UseJVMCICompiler)) {
>>>>> ??? jc.can_pop_frame = 1;
>>>>> ??? jc.can_force_early_return = 1;
>>>>> + } + #endif Not sure, if the check for EnableJVMCI can be removed 
>>>>> above.
>>>>
>>>> We still need it to work when INCLUDE_JVMCI is not defined.
>>>> How about
>>>>
>>>> JVMCI_ONLY(if (UseJVMCICompiler)) {
>>>> ...
>>>> }
>>>>
>>>> or
>>>>
>>>> if (JVMCI_ONLY(UseJVMCICompiler) NOT_JVMCI(true)) {
>>>> ...
>>>> }
>>>
>>> Or just turn them on unconditionally first and turn off explicitly 
>>> for JVMCI:
>>>
>>> ?jc.can_pop_frame = 1;
>>> ?jc.can_force_early_return = 1;
>>> + #if INCLUDE_JVMCI
>>> +? // Workaround for 8195635:
>>> +? // disable pop_frame and force_early_return capabilities with Graal
>>> + if (EnableJVMCI && UseJVMCICompiler) {
>>> +???? jc.can_pop_frame = 0;
>>> +???? jc.can_force_early_return = 0;
>>> + }
>>> + #endif
>>>
>> Oh, Dean is right.
>> We need these caps initialized even if the macro INCLUDE_JVMCI is 
>> undefined.
>> Then I like variant from David above.
>>
>> Thanks,
>> Serguei
>>
>>
>>> David
>>>
>>>> dl
>>>>
>>


From vladimir.x.ivanov at oracle.com  Thu Jan 31 19:46:08 2019
From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov)
Date: Thu, 31 Jan 2019 11:46:08 -0800
Subject: [13] RFR (S): 8218163: C2: Continuous deoptimization w/
 Reason_speculate_class_check and Action_none
Message-ID: <34442179-7541-226b-065f-061d45465212@oracle.com>

http://cr.openjdk.java.net/~vlivanov/8218163/webrev.00/
https://bugs.openjdk.java.net/browse/JDK-8218163

Guarded inlining doesn't take recompilation count into account when 
decided whether to issue an uncommon trap or virtual call on slow path.
But it may end up as uncommon trap with Action_none when recompilation 
count is too high.

The fix is to consider both trap & recompilation counts when making the 
decision.

Testing: hs-precheckin-comp, tier1-5

Best regards,
Vladimir Ivanov

From dean.long at oracle.com  Thu Jan 31 19:47:49 2019
From: dean.long at oracle.com (dean.long at oracle.com)
Date: Thu, 31 Jan 2019 11:47:49 -0800
Subject: RFR(S) [13] : 8217848 : [Graal]
 vmTestbase/nsk/jvmti/ResourceExhausted/resexhausted003/TestDescription.java
 fails
In-Reply-To: <5026b741-78be-72bb-062a-59216e6f903c@oracle.com>
References: <A3D86F3A-AEEA-421E-810F-2CF730713C58@oracle.com>
 <5026b741-78be-72bb-062a-59216e6f903c@oracle.com>
Message-ID: <3c31826a-c00b-520b-9dc9-46f402cbb749@oracle.com>

Looks good.

dl

On 1/31/19 10:25 AM, Vladimir Kozlov wrote:
> Yes, this is correct fix. Later if a test set MaxMetaspaceSize too 
> small to run with Graal we will update test.
>
> thanks,
> Vladimir
>
> On 1/30/19 9:53 PM, Igor Ignatyev wrote:
>> http://cr.openjdk.java.net/~iignatyev//8217848/webrev.00/index.html
>>> 2 lines changed: 0 ins; 0 del; 2 mod;
>>
>> Hi all,
>>
>> could you please review this small fix? the test fails w/ Graal b/c 
>> it sets MaxMetaspaceSize=9m, but when we run w/ JVMCI compiler we 
>> increase default value of MetaspaceSize. the fix makes sure we don't 
>> set MetaspaceSize greater than MaxMetaspaceSize.
>>
>> webrev: 
>> http://cr.openjdk.java.net/~iignatyev//8217848/webrev.00/index.html
>> JBS: https://bugs.openjdk.java.net/browse/JDK-8217848
>> testing:
>> ? - vmTestbase/nsk/jvmti/ResourceExhausted tests w/ enabled and 
>> disabled Graal
>> ? - java -XX:MaxMetaspaceSize=9m -version w/ enabled and disabled Graal
>>
>> Thanks,
>> -- Igor
>>


From igor.ignatyev at oracle.com  Thu Jan 31 20:50:04 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Thu, 31 Jan 2019 12:50:04 -0800
Subject: RFR(T) [12] : 8218162 : problem list
 j/u/s/t/o/o/t/java/util/stream/StreamLinkTest.java on solaris w/ Xcomp
Message-ID: <7693124D-9D5E-4F9E-85C9-CDD8700DD920@oracle.com>

http://cr.openjdk.java.net/~iignatyev//8218162/webrev.00/index.html
> 1 line changed: 1 ins; 0 del; 0 mod;

Hi all,

could you please review this one-liner which problem list java/util/stream/test/org/openjdk/tests/java/util/stream/StreamLinkTest.java in Xcomp mode on solaris?

webrev: http://cr.openjdk.java.net/~iignatyev//8218162/webrev.00/index.html
JBS: https://bugs.openjdk.java.net/browse/JDK-8218162

Thanks,
-- Igor

From igor.ignatyev at oracle.com  Thu Jan 31 20:53:50 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Thu, 31 Jan 2019 12:53:50 -0800
Subject: RFR(S) [13] : 8217848 : [Graal]
 vmTestbase/nsk/jvmti/ResourceExhausted/resexhausted003/TestDescription.java
 fails
In-Reply-To: <3c31826a-c00b-520b-9dc9-46f402cbb749@oracle.com>
References: <A3D86F3A-AEEA-421E-810F-2CF730713C58@oracle.com>
 <5026b741-78be-72bb-062a-59216e6f903c@oracle.com>
 <3c31826a-c00b-520b-9dc9-46f402cbb749@oracle.com>
Message-ID: <493B1835-1FD7-4149-BF08-6FB1323C713F@oracle.com>

Dean, Vladimir,

thanks for your review.

-- Igor

> On Jan 31, 2019, at 11:47 AM, dean.long at oracle.com wrote:
> 
> Looks good.
> 
> dl
> 
> On 1/31/19 10:25 AM, Vladimir Kozlov wrote:
>> Yes, this is correct fix. Later if a test set MaxMetaspaceSize too small to run with Graal we will update test.
>> 
>> thanks,
>> Vladimir
>> 
>> On 1/30/19 9:53 PM, Igor Ignatyev wrote:
>>> http://cr.openjdk.java.net/~iignatyev//8217848/webrev.00/index.html
>>>> 2 lines changed: 0 ins; 0 del; 2 mod;
>>> 
>>> Hi all,
>>> 
>>> could you please review this small fix? the test fails w/ Graal b/c it sets MaxMetaspaceSize=9m, but when we run w/ JVMCI compiler we increase default value of MetaspaceSize. the fix makes sure we don't set MetaspaceSize greater than MaxMetaspaceSize.
>>> 
>>> webrev: http://cr.openjdk.java.net/~iignatyev//8217848/webrev.00/index.html
>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8217848
>>> testing:
>>>   - vmTestbase/nsk/jvmti/ResourceExhausted tests w/ enabled and disabled Graal
>>>   - java -XX:MaxMetaspaceSize=9m -version w/ enabled and disabled Graal
>>> 
>>> Thanks,
>>> -- Igor
>>> 
> 


From igor.ignatyev at oracle.com  Thu Jan 31 21:42:53 2019
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Thu, 31 Jan 2019 13:42:53 -0800
Subject: RFR(T)[12] : 8218168 : clean up hotspot ProblemList
Message-ID: <D901A413-07DA-4C1B-A1DD-A0C5FFF2DA1B@oracle.com>

http://cr.openjdk.java.net/~iignatyev//8218168/webrev.00/index.html
> 10 lines changed: 0 ins; 0 del; 10 mod; 

Hi all,

JDK-8208255 and JDK-8208235 got closed as a duplicate of JDK-8058176 but still referenced in the problem list, this trivial patch is to fix that.

webrev: http://cr.openjdk.java.net/~iignatyev//8218168/webrev.00/index.html
JBS: https://bugs.openjdk.java.net/browse/JDK-8218168

Thanks,
-- Igor