VectorAPI: SubAll intrinsics for byte, short, float and double
Paul Sandoz
paul.sandoz at oracle.com
Mon Apr 30 23:51:16 UTC 2018
Hi Jp,
Looks ok. Can we derive subAll from addAll().neg()? The additional negation might be an acceptable cost, but I am uncertain of the FP behavior.
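To make the FP question concrete, here is a minimal scalar sketch in plain Java. It is not the Vector API or the patch itself. It assumes subAll means "start from zero and subtract each lane in order", which the thread does not confirm, and the class and method names are hypothetical. Under that assumption and round-to-nearest, the two forms should agree numerically because rounding is symmetric under negation. The one difference I would expect is the sign of a zero result.

```java
public class SubAllViaNeg {
    // Hypothetical scalar model of subAll: ((0 - a0) - a1) - ... (assumed semantics).
    static float subAllSequential(float[] lanes) {
        float acc = 0.0f;
        for (float v : lanes) {
            acc -= v;
        }
        return acc;
    }

    // Hypothetical scalar model of addAll().neg(): -(((0 + a0) + a1) + ...).
    static float negatedAddAll(float[] lanes) {
        float acc = 0.0f;
        for (float v : lanes) {
            acc += v;
        }
        return -acc;
    }

    public static void main(String[] args) {
        float[] mixed = {1.0e8f, 1.0f, -1.0e8f, 0.5f};
        float[] zeros = {0.0f, 0.0f};

        // Same numeric value for ordinary inputs.
        System.out.println(subAllSequential(mixed) + " vs " + negatedAddAll(mixed));

        // Raw bits expose the sign of zero: +0.0 from subtraction, -0.0 after negation.
        System.out.println(Integer.toHexString(Float.floatToRawIntBits(subAllSequential(zeros)))
                + " vs "
                + Integer.toHexString(Float.floatToRawIntBits(negatedAddAll(zeros))));
    }
}
```

If the sign of zero matters to callers, the derived form would need an extra fix-up. Otherwise the negation looks like the only added cost.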
IIUC, for reductive addition or subtraction, the accumulated value is kept in the first lane of the destination register, and the source lane element to subtract is shuffled down on each iteration. In effect this preserves the sequential order, but I wonder whether there are faster data-parallel approaches if we are relaxed about rounding producing different results.
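As an illustration of the trade-off, here is a minimal scalar sketch comparing a sequential reduction with a pairwise (tree) reduction. It is plain Java modelling the lanes as an array, not the intrinsic from the patch. The class and method names are hypothetical, and the tree version assumes a power-of-two lane count. The tree shape needs only log2(n) dependent steps, but it adds the lanes in a different order, so float results can differ.

```java
public class ReductionOrder {
    // Sequential order: ((a0 + a1) + a2) + a3, one dependent add per lane.
    static float sequentialSum(float[] lanes) {
        float acc = 0.0f;
        for (float v : lanes) {
            acc += v;
        }
        return acc;
    }

    // Pairwise order: fold the upper half onto the lower half until one lane remains.
    // Assumes lanes.length is a power of two.
    static float pairwiseSum(float[] lanes) {
        float[] work = lanes.clone();
        for (int width = work.length / 2; width >= 1; width /= 2) {
            for (int i = 0; i < width; i++) {
                work[i] += work[i + width];
            }
        }
        return work[0];
    }

    public static void main(String[] args) {
        // 1.0e8f + 1.0f rounds back to 1.0e8f in float, so the order of adds matters.
        float[] lanes = {1.0e8f, 1.0f, -1.0e8f, 1.0f};
        System.out.println("sequential = " + sequentialSum(lanes)); // prints 1.0
        System.out.println("pairwise   = " + pairwiseSum(lanes));   // prints 2.0
    }
}
```

For integral types the two orders give identical results, since wrapping addition is associative. The difference only shows up for float and double.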
Thanks,
Paul.
> On Apr 30, 2018, at 10:10 AM, Halimi, Jean-Philippe <jean-philippe.halimi at intel.com> wrote:
>
> Hi all,
>
> I would like to share a patch adding support for subAll intrinsic for byte, short, long, float and double types in VectorAPI.
>
> Could you please review the two following patches?
>
> http://cr.openjdk.java.net/~jphalimi/webrev_subAll_FP_v1.1/
> http://cr.openjdk.java.net/~jphalimi/webrev_subAll_BS_v1.0/
>
> Thank you,
>
> Jp