RFR: 8355216: Accelerate P-256 arithmetic on aarch64 [v4]

Thu Feb 5 21:36:10 UTC 2026

On Mon, 26 Jan 2026 13:57:06 GMT, Andrew Dinn <adinn at openjdk.org> wrote:

>> Alternatively, if you need to fold in a fixed initial offset plus a suitable step then use `vs_ldpq_indexed`
>
> n.b. Note that you can use `vs_even(a_vec)` and `vs_odd(a_vec)` to select vector subsequences `a_vec[0]` and `a_vec[2]` or `a_vec[1]` and `a_vec[3]` respectively. Likewise, there is `vs_front` and `vs_back` to select the first and second halves of the vector sequence.

fixed this and in the process found a bug in the `vs_ldpq` method - it increased the address by 64 each iteration instead of 32

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/27946#discussion_r2771243624