[vector] Infrastructure for calling math routines in .s files

Vladimir Ivanov vladimir.x.ivanov at oracle.com
Mon May 28 13:00:36 UTC 2018


> Please take a look at the updated version:
> http://cr.openjdk.java.net/~rlupusoru/panama/webrev_callsvml_11/

Thanks, it looks really nice!

Some minor comments:

src/hotspot/share/opto/callnode.cpp:

+void CallLeafVectorNode::calling_convention( BasicType* sig_bt, 
VMRegPair *parm_regs, uint argcnt ) const {
+  uint num_bits = 
tf()->range()->field_at(TypeFunc::Parms)->is_vect()->length_in_bytes() * 
BitsPerByte;
+#ifndef PRODUCT
+  const TypeTuple* d = tf()->domain();
+  for (uint i = TypeFunc::Parms; i < d->cnt(); i++) {
+    Node* arg = in(i);
+    assert(arg->bottom_type()->is_vect()->length_in_bytes() * 
BitsPerByte == num_bits, "vector arguments must match");
+  }
+#endif

For assertion purposes it's preferred to use "#ifdef ASSERT".

"#ifndef PRODUCT" is used primarily for "optimized" build configuration.

Also, what do you think about putting num_bits value into 
CallLeafVectorNode?



What's the story for Windows x64 support? I see some build changes, but 
it seems assembly is tailored specifically for Linux.

src/hotspot/os_cpu/linux_x86/svml_s_expf4_core_l9ha.s:

+#ifdef __VECTOR_API_MATH_INTRINSICS_LINUX
...
+__svml_expf4_ha_l9:


src/hotspot/cpu/x86/stubGenerator_x86_64.cpp:

+#ifdef __VECTOR_API_MATH_INTRINSICS_COMMON
+    if (UseVectorApiIntrinsics) {
+      if (UseAVX >= 2) {
+        StubRoutines::_vector_float64_exp = CAST_FROM_FN_PTR(address, 
__svml_expf4_ha_l9);
+        StubRoutines::_vector_float128_exp = CAST_FROM_FN_PTR(address, 
__svml_expf4_ha_l9);


src/hotspot/share/utilities/globalDefinitions_vecApi.hpp:

+#if defined(_WIN64) && (defined(_MSC_VER) && (_MSC_VER >= 1600))
+__VECTOR_API_MATH_INTRINSICS_COMMON TEXTEQU <>
+#define __VECTOR_API_MATH_INTRINSICS_COMMON
+__VECTOR_API_MATH_INTRINSICS_WINDOWS TEXTEQU <>
+#define __VECTOR_API_MATH_INTRINSICS_WINDOWS
+#endif


> Also, I separated the tests this time and you can review here. I can merge this patch separately:
> http://cr.openjdk.java.net/~rlupusoru/panama/webrev_mathtests_00/

Looks good.

I second Paul - kudos to you and the team at Intel! That's an impressive 
and very valuable contribution. Thanks a lot for pushing the limits of 
Java platform on that front!

Best regards,
Vladimir Ivanov

> 
> Thanks and have a great weekend!
> 
> --Razvan
> 
> -----Original Message-----
> From: Vladimir Ivanov [mailto:vladimir.x.ivanov at oracle.com]
> Sent: Friday, May 18, 2018 7:25 PM
> To: Lupusoru, Razvan A <razvan.a.lupusoru at intel.com>; panama-dev at openjdk.java.net
> Subject: Re: [vector] Infrastructure for calling math routines in .s files
> 
>> So I am sending out the initial patch which adds infrastructure (plus one implementation - exp) in order to be able to call these assembly routines from JIT compiled code. In a subsequent patches, we will contribute the other implementations in form of assembly files.
>> http://cr.openjdk.java.net/~rlupusoru/panama/webrev_callsvml_09/index.
>> html
> 
> Wow, vector calls! Nice! :-)
> 
> My main concern is amount of changes in shared code needed. I'd like to see the logic somehow confined just to those vector stub calls.
> 
> Some suggestions:
> 
>     (1) introduce new node for calling vector stubs - CallLeafVector?
>          - no need in CallLeafNode::has_vector_arguments_or_return()
> 
>     (2) Implement custom calling convention instead of extending c_calling_convention. It seems there is a need for a limited set of shapes (unary, binary, ?) and even having those cases hard-coded should be fine. It can simplify the patch in the following way:
>          - have the logic factored out into platform-specific
> vector_calling_convention() / vector_return_value();
>          - get rid of T_VECTOR* basic types
>          - no need to extend return_value in x86_64.ad for vector types
> 
>> Please take a look at the patch and let me know if you have any questions. I would like to point out that among changes there are also a few which make sure that compiler does not tag with "NO_FP" any calls that may end up calling methods that are using FP/vector code. This is quite important for Vector API since unlike auto-vectorizer, there may be live vector registers at any point in the code.
> 
> Doesn't it cause problems for floats/doubles in xmm registers then? I have some doubts all those cases are problematic, but, anyway, I suggest to handle it separately and focus on vector stub calls alone here.
> 
> Best regards,
> Vladimir Ivanov
> 


More information about the panama-dev mailing list