RFR: 8325589: C2 SuperWord refactoring: create VLoopAnalyzer with Submodules [v3]

Christian Hagedorn chagedorn at openjdk.org
Wed Feb 14 14:41:09 UTC 2024


On Tue, 13 Feb 2024 16:26:25 GMT, Emanuel Peter <epeter at openjdk.org> wrote:

>> Subtask of https://github.com/openjdk/jdk/pull/16620
>> 
>> **Goals**:
>> - **modularity**: SuperWord is a nasty large class, and it is difficult to know what impacts what.
>> - **code reuse**: In the future, we may want to reuse some components of SuperWord in other Vectorizers. For example the post-loop-vectorizer ([JDK-8308994](https://bugs.openjdk.org/browse/JDK-8308994)).
>> 
>> Hence, I pull some parts out of SuperWord, and put them in a new class `VLoopAnalyzer`, with submodules:
>> 
>> VLoopAnalyzer
>>   VLoopReductions
>>   VLoopMemorySlices
>>   VLoopBody
>>   VLoopTypes
>> 
>> 
>> **Future Work**
>> In my draft for https://github.com/openjdk/jdk/pull/16620, I also created a submodule `VLoopDependenceGraph`.
>> But that is also a lot of code change.
>> I decided to do this in a separate RFE, to keep this patch here reasonably short.
>> 
>> I also left many functions in `superword.cpp`, even though their classes are in `vectorization.hpp`.
>> I think it is easier to review them in their old palces, with minor changes, for now.
>> In a future RFE, I can then cleanly copy them, without any changes to them.
>
> Emanuel Peter has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains eight commits:
> 
>  - manual merge
>  - move _loop_or_ctrl from ResouceArena, bc ResourceMark in SuperWord::dependence_graph
>  - remove some comments
>  - VLoopTypes
>  - VLoopBody
>  - VLoopMemorySlices
>  - VLoopReductions
>  - 8325589

Nice refactoring into submodules! Here are some first comments. I have to do another iteration looking closer at the interplay of all the new submodules.

src/hotspot/share/opto/superword.cpp line 761:

> 759:   // First, assign a dependence node to each memory node
> 760:   for (int i = 0; i < body().length(); i++ ) {
> 761:     Node *n = body().at(i);

Suggestion:

    Node* n = body().at(i);

src/hotspot/share/opto/superword.cpp line 768:

> 766: 
> 767:   const GrowableArray<PhiNode*> &mem_slice_head = vloop_analyzer().memory_slices().heads();
> 768:   const GrowableArray<MemNode*> &mem_slice_tail = vloop_analyzer().memory_slices().tails();

Suggestion:

  const GrowableArray<PhiNode*>& mem_slice_head = vloop_analyzer().memory_slices().heads();
  const GrowableArray<MemNode*>& mem_slice_tail = vloop_analyzer().memory_slices().tails();

src/hotspot/share/opto/vectorization.cpp line 130:

> 128:     if (state == VLoopAnalyzer::SUCCESS) {
> 129:     return true; // success
> 130:   }

I think it's more common to have the pattern "if (!foo()) print error and return". I therefore suggest to flip this:

  if (state != VLoopAnalyzer::SUCCESS) {
#ifndef PRODUCT
    if (vloop().is_trace_loop_analyzer()) {
      tty->print_cr("\nVLoopAnalyze::setup_submodules: failed: %s", state);
    }
#endif
    return false;
  }
  return true;

src/hotspot/share/opto/vectorization.cpp line 144:

> 142:   // Skip any loop that has not been assigned max unroll by analysis.
> 143:   if (SuperWordLoopUnrollAnalysis && vloop().cl()->slp_max_unroll() == 0) {
> 144:     return VLoopAnalyzer::FAILURE_NO_MAX_UNROLL;

How about directly printing here instead of going over a failure string and returning false? You could, for example, have a method `trace_setup_submodules_failure(failure_string)`, and directly pass the string of `FAILURE_NO_MAX_UNROLL` here. Otherwise, I guess you still need to find this usage of `FAILURE_NO_MAX_UNROLL` here to understand exactly which conditions failed. By directly printing here, you have the entire context of the error.

Doing that would also allow the merge of `setup_submodules()` and `setup_submodules_helper()` into a single method.

It's just a suggestion - I leave it up to you to decide :-)

src/hotspot/share/opto/vectorization.cpp line 153:

> 151:   _memory_slices.find_memory_slices();
> 152: 
> 153:   // If there is no memory slice detected, that means there is no store.

Suggestion:

  // If there is no memory slice detected, it means there is no store.

src/hotspot/share/opto/vectorization.hpp line 116:

> 114: 
> 115:   bool is_trace_loop_analyzer() const {
> 116:     return vtrace().is_trace(TraceAutoVectorizationTag::LOOP_ANALYZER);

Any specific reason to not directly use `_vtrace` here instead of `vtrace()`? You also use the accessor method at other places inside the classes where you could also just use the fields. I think it improves the readability to use the fields directly such that you do not need to worry about side effects or other code inside the accessor methods.

src/hotspot/share/opto/vectorization.hpp line 235:

> 233: 
> 234: public:
> 235:   // Find and mark reductions in a loop. Running mark_reductions() is similar to

I suggest to add new lines between the methods.

src/hotspot/share/opto/vectorization.hpp line 271:

> 269:   GrowableArray<MemNode*> _tails;
> 270: 
> 271:   const VLoop& vloop() const { return _vloop; }

Is this method required? You could directly access the `VLoop` with `_vloop` inside the class.

src/hotspot/share/opto/vectorization.hpp line 463:

> 461:   bool success() const { return _success; }
> 462: 
> 463:   Arena* arena()       { return &_arena; }

Since the `arena` is only for the submodules, do we really need to expose it here?

-------------

PR Review: https://git.openjdk.org/jdk/pull/17800#pullrequestreview-1879805273
PR Review Comment: https://git.openjdk.org/jdk/pull/17800#discussion_r1489554813
PR Review Comment: https://git.openjdk.org/jdk/pull/17800#discussion_r1489555331
PR Review Comment: https://git.openjdk.org/jdk/pull/17800#discussion_r1489184004
PR Review Comment: https://git.openjdk.org/jdk/pull/17800#discussion_r1489196826
PR Review Comment: https://git.openjdk.org/jdk/pull/17800#discussion_r1489200794
PR Review Comment: https://git.openjdk.org/jdk/pull/17800#discussion_r1489198749
PR Review Comment: https://git.openjdk.org/jdk/pull/17800#discussion_r1489557573
PR Review Comment: https://git.openjdk.org/jdk/pull/17800#discussion_r1489560942
PR Review Comment: https://git.openjdk.org/jdk/pull/17800#discussion_r1489207625


More information about the hotspot-compiler-dev mailing list