RFR: 8366427: C2 SuperWord: refactor VTransform scalar nodes

Emanuel Peter epeter at openjdk.org
Fri Aug 29 14:44:43 UTC 2025


On Fri, 29 Aug 2025 14:25:14 GMT, Vladimir Kozlov <kvn at openjdk.org> wrote:

>> I'm working on cost-modeling, and am integrating some smaller changes from this proof-of-concept PR:
>> https://github.com/openjdk/jdk/pull/20964
>> 
>> This is a pure refactoring - no change in behaviour. I'm presenting it like this because it will make reviews easier.
>> 
>> The goal is to split up some cases that are currently treated the same, but will later have different behavior. There may be a little bit of code duplication, but the code will soon be made different ;)
>> 
>> We split the `VTransformScalarNode`:
>> - `VTransformMemopScalarNode`
>>   - Uses that only want scalar mem nodes can now directly check for `isa_MemopScalar`.
>>   - We can directly store the `_vpointer` in a field, that way we don't need to do a lookup via `vloop_analyzer`. This could also be helpful later on if we ever do widening (unrolling during auto vectorization): we could then do the necessary modifications to the `vpointer`.
>> - `VTransformLoopPhiNode`
>>   - Later on, they will play a more special role, they will give us easy access to the beginning state of the loop body and the backedges.
>> - `VTransformCFGNode`
>>   - Calling them scalar nodes is not 100% accurate. We'll probably have to further refine them later on. But splitting them off now seems like a reasonable choice. Once we do if-conversion we'll have to do more work on CFG.
>> - `VTransformDataScalarNode`
>>   - These represent all the normal "calculation" nodes in the loop.
>> - `VTransformInputScalarNode` -> `VTransformOuterNode`:
>>   - For now, we are still just tracking input nodes, but soon we will need to track input and output nodes: basically just the 1-hop neighbourhood of nodes outside the loop. I'm already renaming them now, so it will be less noise later.
>> 
>> I decided to rather split up more, and avoid the `VTransformScalarNode` altogether, so that we don't have to override overrides - that can be really confusing (e.g. what I had with `is_load_in_loop`).
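For readers following along, the `isa_MemopScalar` check and the directly stored `_vpointer` described above can be illustrated with a minimal sketch. All class and function names below are hypothetical stand-ins, not the HotSpot classes from the PR; the sketch only assumes the usual "virtual `isa_` query returns `nullptr` by default" pattern.

```cpp
#include <cassert>

// Hypothetical stand-in for a memory-access descriptor; NOT HotSpot's VPointer.
struct VPointerSketch { int offset; };

class MemopScalarSketch;

// Hypothetical base class, standing in for a generic transform node.
class NodeSketch {
public:
  virtual ~NodeSketch() = default;
  // By default a node is not a scalar memop; only one subclass overrides this.
  virtual MemopScalarSketch* isa_MemopScalar() { return nullptr; }
};

// Scalar memory node: keeps its pointer info in a field, so no external lookup.
class MemopScalarSketch : public NodeSketch {
  VPointerSketch _vpointer;
public:
  explicit MemopScalarSketch(VPointerSketch p) : _vpointer(p) {}
  MemopScalarSketch* isa_MemopScalar() override { return this; }
  const VPointerSketch& vpointer() const { return _vpointer; }
};

// Plain "calculation" node: inherits the default nullptr answer.
class DataScalarSketch : public NodeSketch {};

// A use that only cares about scalar memory nodes checks the query directly.
int offset_or_default(NodeSketch* n, int fallback) {
  MemopScalarSketch* m = n->isa_MemopScalar();
  return (m != nullptr) ? m->vpointer().offset : fallback;
}
```

With this shape, each subclass answers exactly one query itself, so there is no chain of overridden overrides to keep track of.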
>
> src/hotspot/share/opto/vtransform.cpp line 1009:
> 
>> 1007:   tty->print("node[%d %s] ", _node->_idx, _node->Name());
>> 1008:   _vpointer.print_on(tty, false);
>> 1009: }
> 
> Consider a separate RFE to use `outputStream*` for all prints. If we go into the UL world we need to collect all outputs in one buffer, as we discussed at a recent meeting.

Good idea! I'll file an RFE.
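
As a rough illustration of the direction (not the actual change, which would be worked out in the RFE), here is a minimal sketch of a print method that takes a stream argument instead of writing to a global. `StreamSketch`, `BufferStreamSketch`, `PrintableNodeSketch` and `print_node` are hypothetical stand-ins, not HotSpot's `outputStream` or any VTransform class.

```cpp
#include <cassert>
#include <cstdarg>
#include <cstdio>
#include <string>

// Hypothetical stand-in for a stream abstraction; NOT HotSpot's outputStream.
class StreamSketch {
public:
  virtual ~StreamSketch() = default;
  virtual void write(const std::string& s) = 0;

  // printf-style helper; formats into a small fixed buffer, then forwards.
  void print(const char* fmt, ...) {
    char buf[256];
    va_list ap;
    va_start(ap, fmt);
    std::vsnprintf(buf, sizeof(buf), fmt, ap);
    va_end(ap);
    write(buf);
  }
};

// Collects everything into one buffer, so the caller can emit it in one piece.
class BufferStreamSketch : public StreamSketch {
  std::string _buf;
public:
  void write(const std::string& s) override { _buf += s; }
  const std::string& contents() const { return _buf; }
};

// Hypothetical node with just the fields needed for printing.
struct PrintableNodeSketch { int idx; const char* name; };

// The print routine no longer decides where output goes; the caller does.
void print_node(StreamSketch* st, const PrintableNodeSketch& n) {
  st->print("node[%d %s] ", n.idx, n.name);
}
```

Because the destination is a parameter, the same print code can target a console-like stream or a buffer that is flushed as a single message.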

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/27002#discussion_r2310365758

