RFR: 8274243: Implement fast-path for ASCII-compatible CharsetEncoders on aarch64 [v3]

Patric Hedlin phedlin at openjdk.java.net
Wed Jan 12 15:47:28 UTC 2022


On Tue, 11 Jan 2022 09:04:12 GMT, Patric Hedlin <phedlin at openjdk.org> wrote:

>> Implementation of ISO/ASCII char set encoding, extending current implementation with ASCII encoding support.
>> 
>> The motivation is found in the original x86 issue ([JDK-8274242](https://bugs.openjdk.java.net/browse/JDK-8274242)).
>> 
>> Implementation with some focus on balance between footprint and efficiency, trying to utilise a dual SIMD path (e.g. Neoverse N1) for the additional ASCII-check and avoid performance loss in the ISO-only case.
>> 
>> - Interleaved ISO and ASCII check code.
>> - Avoid 'umaxv' in the ISO main flow.
>> - Using post inc in main loop.
>> - Retain 8-char loop.
>> - Removing conditional prefetch (no upside).
>> - Adding ISO-8859-1 to encode-decode benchmark.
>> 
>> Testing (Linux): tier1-6
>> 
>> The revised version compares like this (master vs. update).
>> 
>> Benchmark                   (size)       (type)  Mode  Cnt    Score   Error  Units
>> CharsetEncodeDecode.encode   16384        UTF-8  avgt   30   17.920 ± 0.229  us/op
>> CharsetEncodeDecode.encode   16384         BIG5  avgt   30   18.867 ± 0.356  us/op
>> CharsetEncodeDecode.encode   16384  ISO-8859-15  avgt   30   17.419 ± 0.220  us/op
>> CharsetEncodeDecode.encode   16384   ISO-8859-1  avgt   30    6.200 ± 0.134  us/op
>> CharsetEncodeDecode.encode   16384        ASCII  avgt   30   17.149 ± 0.219  us/op
>> CharsetEncodeDecode.encode   16384       UTF-16  avgt   30  135.115 ± 1.440  us/op
>> 
>> 
>> Benchmark                   (size)       (type)  Mode  Cnt    Score   Error  Units
>> CharsetEncodeDecode.encode   16384        UTF-8  avgt   30    9.018 ± 0.179  us/op
>> CharsetEncodeDecode.encode   16384         BIG5  avgt   30   10.550 ± 0.470  us/op
>> CharsetEncodeDecode.encode   16384  ISO-8859-15  avgt   30    8.843 ± 0.187  us/op
>> CharsetEncodeDecode.encode   16384   ISO-8859-1  avgt   30    6.406 ± 0.155  us/op
>> CharsetEncodeDecode.encode   16384        ASCII  avgt   30    8.822 ± 0.173  us/op
>> CharsetEncodeDecode.encode   16384       UTF-16  avgt   30  135.195 ± 1.432  us/op
>
> Patric Hedlin has updated the pull request incrementally with one additional commit since the last revision:
> 
>   Update src/hotspot/cpu/aarch64/macroAssembler_aarch64.cpp
>   
>   Co-authored-by: Andrew Haley <aph-open at littlepinkcloud.com>

Thanks for reviewing.

-------------

PR: https://git.openjdk.java.net/jdk/pull/6945


More information about the hotspot-compiler-dev mailing list