RFR: 8310843: Reimplement ByteArray and ByteArrayLittleEndian with Unsafe [v10]

Maurizio Cimadamore mcimadamore at openjdk.org
Thu Jul 20 16:42:45 UTC 2023


On Thu, 20 Jul 2023 16:39:31 GMT, Glavo <duke at openjdk.org> wrote:

>> Here's with the same parameters as the one you are using:
>> 
>> Benchmark                      Mode  Cnt       Score       Error   Units
>> ByteArray.readByte            thrpt    5  268722.378 ±  5979.787  ops/ms
>> ByteArray.readByteFromBuffer  thrpt    5  254333.844 ±  6975.034  ops/ms
>> ByteArray.readInt             thrpt    5  271849.915 ±  4746.869  ops/ms
>> ByteArray.readIntFromBuffer   thrpt    5  263109.760 ±  2764.372  ops/ms
>> ByteArray.readLong            thrpt    5  271635.129 ± 14705.673  ops/ms
>> ByteArray.readLongFromBuffer  thrpt    5  252633.826 ±  2950.184  ops/ms
>> 
>> (Java 22/Ubuntu 20.04 on Ryzen 3900X here).
>
>> Here's with the same parameters as the one you are using:
> 
> I don't understand why there is such a difference. I have replicated similar results on several other devices:
> 
> Apple M1:
> 
> Benchmark                      Mode  Cnt        Score       Error   Units
> ByteArray.readByte            thrpt    5  1418606.665 ±  2308.711  ops/ms
> ByteArray.readByteFromBuffer  thrpt    5   879084.678 ± 20981.303  ops/ms
> ByteArray.readInt             thrpt    5  1404164.063 ±  4845.412  ops/ms
> ByteArray.readIntFromBuffer   thrpt    5   829768.843 ±  8730.467  ops/ms
> ByteArray.readLong            thrpt    5  1400713.660 ±  1924.752  ops/ms
> ByteArray.readLongFromBuffer  thrpt    5   825721.508 ±  4146.961  ops/ms
> 
> 
> 
> Intel Celeron N5105:
> 
> Benchmark                      Mode  Cnt       Score        Error   Units
> ByteArray.readByte            thrpt    5  927248.807 ±  29630.442  ops/ms
> ByteArray.readByteFromBuffer  thrpt    5  448374.359 ±   3210.571  ops/ms
> ByteArray.readInt             thrpt    5  719143.935 ± 183584.747  ops/ms
> ByteArray.readIntFromBuffer   thrpt    5  398578.488 ±    100.058  ops/ms
> ByteArray.readLong            thrpt    5  742555.147 ±  13006.710  ops/ms
> ByteArray.readLongFromBuffer  thrpt    5  398771.660 ±   1673.524  ops/ms
> 
> 
> AMD Ryzen 7 5800X:
> 
> Benchmark                      Mode  Cnt        Score       Error   Units
> ByteArray.readByte            thrpt    5  1784096.905 ± 47611.535  ops/ms
> ByteArray.readByteFromBuffer  thrpt    5   992974.747 ± 19389.011  ops/ms
> ByteArray.readInt             thrpt    5  1719368.129 ± 70606.284  ops/ms
> ByteArray.readIntFromBuffer   thrpt    5   919654.075 ±  9292.283  ops/ms
> ByteArray.readLong            thrpt    5  1719873.280 ± 18441.122  ops/ms
> ByteArray.readLongFromBuffer  thrpt    5   921992.992 ±  6439.747  ops/ms

On a newer processor/OS (Alder Lake/Ubuntu 22.04) I see a bit more difference:


Benchmark                      Mode  Cnt       Score       Error   Units
ByteArray.readByte            thrpt    5  680397.046 ± 27504.022  ops/ms
ByteArray.readByteFromBuffer  thrpt    5  576449.569 ±  3633.135  ops/ms
ByteArray.readInt             thrpt    5  685419.089 ±  6740.268  ops/ms
ByteArray.readIntFromBuffer   thrpt    5  542887.418 ±  2863.907  ops/ms
ByteArray.readLong            thrpt    5  687949.037 ±  3510.613  ops/ms
ByteArray.readLongFromBuffer  thrpt    5  548120.950 ±  5461.145  ops/ms


But still far from 2x.

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/14636#discussion_r1269715099


More information about the core-libs-dev mailing list