RFR: 8196331: Optimize Character.digit for latin1 input
Paul Sandoz
paul.sandoz at oracle.com
Tue Jan 30 16:29:13 UTC 2018
OK! A surprising increase in code size. Thanks for checking.
Paul.
> On Jan 30, 2018, at 2:40 AM, Claes Redestad <claes.redestad at oracle.com> wrote:
>
> The ASM is harder than usual to follow and compare since everything is
> inlined aggressively, but it seems that since CharacterDataLatin1 is only
> invoked for 0 <= ch < 256 (invariant established in CharacterData.of(int ch))
> then the compiler is able to elide bounds check entirely when the byte[] is
> also 256 elements.
>
> Shrinking the array adds more branches and grows the compiled code size
> for UUID.fromString from 751 to 1341 bytes, so it seems that even from a
> footprint perspective then keeping the array at 256 elements is a win. :-)
>
> /Claes
>
> On 2018-01-29 22:04, Claes Redestad wrote:
>> Right, I can't really explain why, but the effect is very visible and
>> reproducible in microbenchmarks. Differences in generated ASM might
>> be indicating that the inlining behavior changes enough to shift the
>> result around; maybe a job for @ForceInline?
>>
>> I'll experiment and analyze a bit more tomorrow to see if I can find a
>> good explanation for the observed difference and/or a solution that
>> allows us to slim down the lookup array.
>>
>> /Claes
>>
>> On 2018-01-29 20:38, Paul Sandoz wrote:
>>> Smaller in only the upper bound? I would an explicit upper bounds check would dominate that of the bounds check for the array itself.
>>>
>>> Paul.
>>>
>>>> On Jan 29, 2018, at 11:37 AM, Claes Redestad <claes.redestad at oracle.com <mailto:claes.redestad at oracle.com>> wrote:
>>>>
>>>> I ran with a smaller byte[] initially and saw about a 10% improvement from removing the branch, which is why I felt the superfluous bytes were motivated.
>>>>
>>>> /Claes
>>>>
>>>> Paul Sandoz <paul.sandoz at oracle.com <mailto:paul.sandoz at oracle.com>> skrev: (29 januari 2018 19:14:44 CET)
>>>>
>>>>
>>>> On Jan 29, 2018, at 9:44 AM, Martin Buchholz
>>>> <martinrb at google.com <mailto:martinrb at google.com>> wrote:
>>>> Thanks. I might try to shrink the size of the static array,
>>>> by only storing values between '0' and 'z', at the cost of
>>>> slightly greater lookup costs for each char.
>>>>
>>>> I was wondering the same, or just clip the end of the array to’z'
>>>>
>>>> if (ch <= ‘z’ && radix …) { // Might even fold the upper bounds check for DIGITS
>>>> value = DIGITS[ch];
>>>> ...
>>>> }
>>>>
>>>> Paul.
>>>>
>>>> On Mon, Jan 29, 2018 at 3:15 AM, Claes Redestad
>>>> <claes.redestad at oracle.com
>>>> <mailto:claes.redestad at oracle.com>> wrote:
>>>>
>>>> Hi, for the latin1 block of CharacterData we can improve
>>>> the digit(int, int) implementation by providing an
>>>> optimized lookup table. This improves microbenchmarks
>>>> exercising Integer.parseInt, Long.parseLong and
>>>> UUID.fromString etc by around 50%for typical inputs.
>>>> Webrev:
>>>> http://cr.openjdk.java.net/~redestad/8196331/open.00/
>>>> <http://cr.openjdk.java.net/%7Eredestad/8196331/open.00/>
>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8196331 The
>>>> lookup array is pre-calculated to minimize startup impact
>>>> (adds 1,027 bytecodes executed during initialization) /Claes
>>>>
>>>>
>>>> --
>>>> Sent from my Android device with K-9 Mail. Please excuse my brevity.
>>>
>>
>
More information about the core-libs-dev
mailing list