RFR: 8343685: C2 SuperWord: refactor VPointer with MemPointer [v6]

Emanuel Peter epeter at openjdk.org
Thu Jan 16 08:36:45 UTC 2025


On Thu, 16 Jan 2025 06:42:57 GMT, Emanuel Peter <epeter at openjdk.org> wrote:

>> src/hotspot/share/opto/vectorization.hpp line 1212:
>> 
>>> 1210:     tty->print("m * q(%d) + r(%d)", _q, _r);
>>> 1211:     if (_vpointer.count_invar_summands() > 0) {
>>> 1212:       tty->print(" - invar / (iv_scale(%d) * pre_stride)", _vpointer.iv_scale());
>> 
>> Would it make sense to print all invariant summands here as well?
>
> I think I would rather not do that, because it would be too verbose. With the `TraceAlignVector` `ALIGN_VECTOR` flag we do this printing here, which should be on one line:
> 
> `solution for pack:         m * q(2) + r(0) - invar / (iv_scale(4) * pre_stride) [- init / pre_stride], mem_ref[1047]`
> 
> And the `invar` is already printed earlier, so it is known from the context:
> 
>   invar = SUM(invar_summands), invar_summands:
>    4 * [101 LoadI] ->   101  LoadI  === _ 7 100  [[ 1083 226 235 280 372 458 ]]  @java/lang/Class (java/io/Serializable,java/lang/constant/Constable,java/lang/reflect/AnnotatedElement,java/lang/invoke/TypeDescriptor,java/lang/reflect/GenericDeclaration,java/lang/reflect/Type,java/lang/invoke/TypeDescriptor$OfField):exact+144 *, name=zero, idx=5; #int !orig=[225] !jvms: Test::test00001 @ bci:10 (line 94)
>   invar_factor = 4
> 
> 
> This is the fuller context:
> 
>  vector mem_ref: 1047  StoreI  === 1119 1117 1056 1048  [[ 1021 1023 1029 ]]  @int[int:>=0] (java/lang/Cloneable,java/io/Serializable):exact+any *, idx=6;  Memory: @int[int:>=0] (java/lang/Cloneable,java/io/Serializable):NotNull:exact+any *, idx=6; !orig=902,764,625,185,650 !jvms: Test::test00001 @ bci:24 (line 94)
>   VPointer: VPointer[size:  4, object, base(37 CastPP) + con( 16) + iv_scale(  4) * iv + invar(4 * [101 LoadI])]
>   vector_width = 64
>   aw = alignment_width = min(vector_width(64), ObjectAlignmentInBytes(8)) = 8
>   invar = SUM(invar_summands), invar_summands:
>    4 * [101 LoadI] ->   101  LoadI  === _ 7 100  [[ 1083 226 235 280 372 458 ]]  @java/lang/Class (java/io/Serializable,java/lang/constant/Constable,java/lang/reflect/AnnotatedElement,java/lang/invoke/TypeDescriptor,java/lang/reflect/GenericDeclaration,java/lang/reflect/Type,java/lang/invoke/TypeDescriptor$OfField):exact+144 *, name=zero, idx=5; #int !orig=[225] !jvms: Test::test00001 @ bci:10 (line 94)
>   invar_factor = 4
>   iv = init(   0) + pre_iter * pre_stride(1) + main_iter * main_stride(16)
>   adr = base[37] + con(16) + invar + iv_scale(4) * iv      = base[37] + C_const(16) + C_invar(4) * var_invar + C_init(0) * var_init + C_pre(4) * pre_iter + C_main(64) * main_iter
>   init is constant:
>     C_const_init = 0
>     C_init = 0
>   invariant present:
>     C_invar = invar_factor = 4
>   C_const = con(16) + iv_scale(4) * C_const_init(0) = 16
>   C_pre   = iv_scale(4) * pre_stride(1) = 4
>   C_main  = iv_scale(4) * main_stride(16) = 64
>   EQ(1 ): (C_const(16) + C_invar(4) * var_invar + C_init(0) * var_init + C_pre(4) * pre_iter + C_main(64) * main_i...

Ok, now printing it like this, looks better to me :)

  solution for pack:         m * q(2) + r(0) - invar(4 * [101 LoadI] + 4 * [105 LoadI]) / (iv_scale(4) * pre_stride) [- init / pre_stride], mem_ref[1067]
  intersection with current: m * q(2) + r(0) - invar(4 * [101 LoadI] + 4 * [105 LoadI]) / (iv_scale(4) * pre_stride) [- init / pre_stride], mem_ref[1067]

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/21926#discussion_r1917970809


More information about the hotspot-compiler-dev mailing list