RFR: 8326139: C2 SuperWord: split packs (match use/def packs, implemented, mutual independence) [v2]
Vladimir Kozlov
kvn at openjdk.org
Mon Feb 26 18:22:50 UTC 2024
On Mon, 26 Feb 2024 14:52:08 GMT, Emanuel Peter <epeter at openjdk.org> wrote:
>> After `combine_pairs_to_longer_packs`, we sometimes create packs that are too long and cannot be vectorized.
>> There are multiple reason for that:
>>
>> - A pack does not "match" with a use of def pack, and we need to split it. Example: split Z:
>>
>> X X X X Y Y Y Y
>> Z Z Z Z Z Z Z Z
>>
>>
>> - A pack is not implemented in the current size, but would be implemented for a smaller size. Some operations are not implemented at max vector size, and we just need to split every pack to the smaller size that is implemented. Or we have a pack of a non-power-of-2 size, and need to split it down into smaller sizes that are power-of-2.
>>
>> - Packs can have pack internal dependence. This dependence happens at a certain "distance". If we split a pack to be smaller than that "distance", then the internal dependence disappears, and we have the desired mutual independence. Example:
>> https://github.com/openjdk/jdk/blob/9ee274b0480a6e8e399830fd40c34d99c5621c1b/test/hotspot/jtreg/compiler/loopopts/superword/TestSplitPacks.java#L657-L669
>>
>> Note: Soon I will refactor the packset into a new `PackSet` class, and move the split / filter code there.
>>
>> **Further Work**
>>
>> [JDK-8309267](https://bugs.openjdk.org/browse/JDK-8309267) C2 SuperWord: some tests fail on KNL machines - fail to vectorize
>> The issue on KNL machines is that some operations only support vector widths that are smaller than the maximal vector length. Hence, we must split all other vectors that are uses/defs. This change here does exactly that, and so I will be able to put the `UseKNLSetting` in the IRFramework whitelist. I will do that in a separate RFE.
>
> Emanuel Peter has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 27 commits:
>
> - manual merge
> - more fixes for TestSplitPacks.java
> - fix some IR rules in TestSplitPacks.java
> - fix MulAddS2I
> - Fix data dependency IR rules in LoopArrayIndexComputeTest.java
> - fix populate index cases and test7
> - test7 WIP
> - WIP fix reductions
> - Update test/hotspot/jtreg/compiler/loopopts/superword/TestSplitPacks.java
>
> fix bug number
> - cleanup
> - ... and 17 more: https://git.openjdk.org/jdk/compare/490825fb...c39ace04
Few comments.
src/hotspot/share/opto/superword.cpp line 1008:
> 1006: }
> 1007:
> 1008: // Do we have pattern n1 = (iv + c) and n2 = (iv + c + 1)?
Why `+1` is special case? Does SuperWord handle case when loop's increment is not `1`?
src/hotspot/share/opto/superword.cpp line 1492:
> 1490: // Split pack according to task. Return true if any change was made, else false.
> 1491: SuperWord::SplitStatus
> 1492: SuperWord::split_pack(const char* split_name, Node_List* pack, SplitTask task) {
Style. Please, put return type on the same line as method. Yes, the line will be bigger but I prefer to have arguments on separate line than the return type.
src/hotspot/share/opto/superword.hpp line 436:
> 434: private:
> 435: const uint _split_size;
> 436: const bool _reject;
Should this be enum with 3 states you listed in comment?
src/hotspot/share/opto/superword.hpp line 471:
> 469: Node_List* _first_pack;
> 470: Node_List* _second_pack;
> 471: bool _changed;
Again, should this be enum?
-------------
PR Review: https://git.openjdk.org/jdk/pull/17848#pullrequestreview-1901655261
PR Review Comment: https://git.openjdk.org/jdk/pull/17848#discussion_r1503093102
PR Review Comment: https://git.openjdk.org/jdk/pull/17848#discussion_r1503095224
PR Review Comment: https://git.openjdk.org/jdk/pull/17848#discussion_r1503087670
PR Review Comment: https://git.openjdk.org/jdk/pull/17848#discussion_r1503089095
More information about the hotspot-compiler-dev
mailing list