RFR: 8358032: Use crypto pmull for CRC32/CRC32C intrinsics on Ampere CPU [v4]
    Andrew Haley 
    aph at openjdk.org
       
    Fri Jun  6 09:49:58 UTC 2025
    
    
  
On Thu, 5 Jun 2025 05:16:25 GMT, Liming Liu <lliu at openjdk.org> wrote:
> According to perf, post-increment ops help to reduce the access to TLB on Ampere1 in this case.
Hmm, but it's code in a rather odd style in shared code. And from what I see, the intrinsic is only 22% of the runtime (for 128 bytes) anyway, and you're making the code larger. I certainly don't want to see this sort of thing proliferating in the intrinsics.
In general, it's up to CPU designers to make simple, straightforward code work well.
How important is this?
-------------
PR Review Comment: https://git.openjdk.org/jdk/pull/25609#discussion_r2131885760
    
    
More information about the hotspot-dev
mailing list