[vector] Some fine-tunes for AArch64 reduce_add operations

Yang Zhang (Arm Technology China) Yang.Zhang at arm.com
Thu Nov 21 07:40:49 UTC 2019


Hi

Reduce_add2l can also be optimized by addp. I change it and push the patch as follows.
http://cr.openjdk.java.net/~yzhang/vectorapi/vectorapi.finetune/webrev.01/index.html

Regards
Yang

-----Original Message-----
From: panama-dev <panama-dev-bounces at openjdk.java.net> On Behalf Of Qi Feng (Arm Technology China)
Sent: Tuesday, November 19, 2019 5:32 PM
To: panama-dev at openjdk.java.net
Cc: nd <nd at arm.com>
Subject: [vector] Some fine-tunes for AArch64 reduce_add operations

Hi all,

Could you please help me review this patch?

In this patch, I make some fine-tunes for AArch64 reduce_add operations.
As reduce_add operation can be applied to signed values, we should use sign-extension when possible. And using faddp instead of fadd can reduce the instructions from 3 to 2.

http://cr.openjdk.java.net/~yzhang/vectorapi/vectorapi.finetune/webrev.00/

Thanks,
Qi


More information about the panama-dev mailing list