RFR: 8365205: C2: Optimize popcount value computation using knownbits [v3]
Jatin Bhateja
jbhateja at openjdk.org
Tue Sep 9 02:12:11 UTC 2025
On Fri, 5 Sep 2025 17:17:52 GMT, Jatin Bhateja <jbhateja at openjdk.org> wrote:
>> This patch optimizes PopCount value transforms using KnownBits information.
>> Following are the results of the micro-benchmark included with the patch
>>
>>
>>
>> System: 13th Gen Intel(R) Core(TM) i3-1315U
>>
>> Baseline:
>> Benchmark Mode Cnt Score Error Units
>> PopCountValueTransform.LogicFoldingKerenLong thrpt 2 215460.670 ops/s
>> PopCountValueTransform.LogicFoldingKerenlInt thrpt 2 294014.826 ops/s
>> PopCountValueTransform.StockKernelInt thrpt 2 409295.875 ops/s
>> PopCountValueTransform.StockKernelLong thrpt 2 368025.608 ops/s
>>
>> Withopt:
>> Benchmark Mode Cnt Score Error Units
>> PopCountValueTransform.LogicFoldingKerenLong thrpt 2 389978.082 ops/s
>> PopCountValueTransform.LogicFoldingKerenlInt thrpt 2 417261.583 ops/s
>> PopCountValueTransform.StockKernelInt thrpt 2 418649.269 ops/s
>> PopCountValueTransform.StockKernelLong thrpt 2 381330.221 ops/s
>>
>>
>> Kindly review and share your feedback.
>>
>> Best Regards,
>> Jatin
>
> Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision:
>
> Update countbitsnode.cpp
Hi @TobiHartmann , @SirYwell , @eme64 , can you kindly verify the changes in the latest patch?
-------------
PR Comment: https://git.openjdk.org/jdk/pull/27075#issuecomment-3268608172
More information about the hotspot-compiler-dev
mailing list