RFR: 8324890: C2 SuperWord: refactor out VLoop, make unrolling_analysis static, remove init/reset mechanism [v7]
Emanuel Peter
epeter at openjdk.org
Mon Feb 5 14:57:22 UTC 2024
> Subtask of https://github.com/openjdk/jdk/pull/16620
> (The basic goal is to break SuperWord into different modules. This makes the code more maintainable and extensible. And eventually this allows some modules to be reused by other/new vectorizers.)
>
> 1. Move out the shared code between `SuperWord::SLP_extract` (where we do vectorization) and `SuperWord::unrolling_analysis`, and move it to a new class `VLoop`. This allows us to decouple `unrolling_analysis` from the SuperWord object, and we can make it static.
> 2. So far, SuperWord was reused for all loops in a compilation, and then "reset" (with `SuperWord::init`) for every loop. This is a bit of a nasty pattern. I now make a new `VLoop` and a new `SuperWord` object per loop.
> 3. Since we now make more `SuperWord` objects, we allocate the internal data structures more often. Therefore, I now pre-allocate/reserve sufficient space on initialization.
>
> Side-note about https://github.com/openjdk/jdk/pull/17604 (integrated, no need to read any more):
> I would like to remove the use of `SuperWord::is_marked_reduction` from `SuperWord::unrolling_analysis`. For starters: it is not clear what it was ever good for. Second: it requires us to do reduction marking/analysis before `unrolling_analysis`, and hence makes the reduction marking shared between `unrolling_analysis` and vectorization. I could move the reduction marking to `VLoop` now. But the `_loop_reducitons` set would have to be put on an arena, and I would like to avoid creating an arena for the `unrolling_analysis`. Plus, it would just be nicer code, to have reduction analysis together with body analysis, type analysis, etc. and all of them in only in `SLP_extract`.
Emanuel Peter has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 17 commits:
- Merge branch 'master' into JDK-8324890
- add VSharedData class
- manual merge
- timing code from JDK-8325159
- handle AutoVectorizeStatus::TriedAndFailed outside autovectorize
- Merge branch 'master' into JDK-8324890
- _vtrace is moved to VLoop
- comment update
- cosmetics
- rename in preconditions
- ... and 7 more: https://git.openjdk.org/jdk/compare/d395ac28...e2d9deae
-------------
Changes: https://git.openjdk.org/jdk/pull/17624/files
Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=17624&range=06
Stats: 656 lines in 9 files changed: 292 ins; 181 del; 183 mod
Patch: https://git.openjdk.org/jdk/pull/17624.diff
Fetch: git fetch https://git.openjdk.org/jdk.git pull/17624/head:pull/17624
PR: https://git.openjdk.org/jdk/pull/17624
More information about the hotspot-compiler-dev
mailing list