[aarch64-port-dev ] RFR: aarch64: add support for vectorizing floating-point fabs & fneg
Felix Yang
felix.yang at linaro.org
Fri Sep 25 04:55:56 UTC 2015
Hi,
As the patch is not trivial, I also managed to put it on a http server
for your convenience.
Please review this:
http://people.linaro.org/~felix.yang/aarch64-fabs-fneg-v1.diff
Thanks for your help.
Felix
On 24 September 2015 at 21:22, Felix Yang <felix.yang at linaro.org> wrote:
> Hi,
>
> I would like to contribute support for vectorizing fabs & fneg
> instructions on aarch64 architecture.
> The patch takes a similar way as adding support for vectorizing double
> precision sqrt on Intel architectures. And two new tests are added.
> The performance gain for the "sumReductionImplement" function is: 30%
> for single precision and 10% for double precision with LoopMaxUnroll=16.
> I have tested this with JTreg hotspot+langtools+jdk with no
> regressions. Is it OK?
> As this patch involves changes to shared code, I also need an Oracle
> sponsor for it if approved.
>
> Thanks,
> Felix
>
More information about the aarch64-port-dev
mailing list