RFR: JDK-8270147: Increase stride size allowing unrolling more loops [v2]
Radoslaw Smogura
github.com+7535718+rsmogura at openjdk.java.net
Sat Jul 10 15:08:49 UTC 2021
On Sat, 10 Jul 2021 15:05:09 GMT, Radoslaw Smogura <github.com+7535718+rsmogura at openjdk.org> wrote:
>> # Description
>>
>> This increase allowed stride size for loop unrolling to almost maximum possible
>> value which is around `max_jint / 2 - 2`,
>> so the value which will prevent overflow when stride is doubled in C2.
>>
>> The motivation of this change is discussion and research about unrolling
>> vector (SIMD) loops. In such a case stride size depends on elements size
>> and machine size of vector for AVX256 and int stride size is 8,
>> and loop unroll happens. However short vectors will not cause loop unroll.
>>
>>
>> for (int i = 0; i < SPECIES.loopBound(longSize); i += SPECIES.length() /* 8 for int, 16 for short */ ) {
>> var v = ShortVector.fromByteBuffer(SPECIES, srcBufferHeap, i << 1, ByteOrder.nativeOrder());
>> v.intoByteBuffer(dstBufferHeap, i << 1, ByteOrder.nativeOrder());
>> }
>>
>>
>> # Notes
>> Stride size was decreased some time ago https://github.com/openjdk/panama-foreign/commit/2683d5390bd58683ae13bdd8582127c308d8fd04
>>
>> The exact reasons for this are not known for me (over unroll of some loops?).
>>
>> Original thread https://mail.openjdk.java.net/pipermail/panama-dev/2021-June/014310.html
>
> Radoslaw Smogura has updated the pull request incrementally with two additional commits since the last revision:
>
> - Drop whitespace, again.
> - Set maximum stride size to be 1 << 2 or max vector size
>
> Keep overflow test if stride can overflow when doubling
> Drop diagnostic flag
Hi Vladimir & thank you for comments.
I really don't understand the limit `1<< 2` for initial stride size, as loop unroll count is determined by LoopMaxUnroll.
I adjusted code to your suggestions, right now `initial_stride_sz = MAX2(1<<2, Matcher::max_vector_size(T_BYTE) / 2)`
I dropped the JVM flag controlling what procedure should be used to determine max stride size, is it ok?
-------------
PR: https://git.openjdk.java.net/jdk/pull/4658
More information about the hotspot-compiler-dev
mailing list