RFR: 8371603: C2: Missing Ideal optimizations for load and store vectors on SVE [v2]
Xiaohong Gong
xgong at openjdk.org
Mon Dec 8 02:01:06 UTC 2025
On Fri, 5 Dec 2025 09:37:22 GMT, Xiaohong Gong <xgong at openjdk.org> wrote:
>> **Problem:**
>>
>> This issue occurs on a 256-bit SVE machine, caused by the following problematic pattern in `LoadVectorNode::Ideal()`:
>>
>>
>> Node* LoadVectorNode::Ideal(PhaseGVN* phase, bool can_reshape) {
>> const TypeVect* vt = vect_type();
>> if (Matcher::vector_needs_partial_operations(this, vt)) {
>> return VectorNode::try_to_gen_masked_vector(phase, this, vt);
>> }
>> return LoadNode::Ideal(phase, can_reshape);
>> }
>>
>>
>> The condition `Matcher::vector_needs_partial_operations(this, vt)` returns true for `LoadVectorNode` with 256-bit vector size even when the vector size equals the maximum vector size on SVE. In such cases, when `VectorNode::try_to_gen_masked_vector()` returns `nullptr`, the method exits early without calling `LoadNode::Ideal()`. This results in missing crucial optimizations that would normally be applied by the superclass.
>>
>> This code was introduced by https://bugs.openjdk.org/browse/JDK-8286941 to generate vector masks for partial vector operations, but it failed to ensure that the superclass `Ideal()` method is always invoked when no transformation is applied.
>>
>> **Solution:**
>>
>> This patch addresses the issue through two changes:
>>
>> 1. Refine `Matcher::vector_needs_partial_operations()` to return true only when the vector node genuinely represents a partial vector operation that requires masking.
>> 2. Modify `VectorNode::try_to_gen_masked_vector()` to never return `nullptr`, ensuring the superclass `Ideal()` method is always invoked when no transformation is applied.
>>
>> **Testing:**
>>
>> - Verified on different SVE platforms with different vector sizes (128|256|512 bits).
>> - Verified on X86 platforms with different avx options (-XX:UseAVX=1|2|3).
>> - Added two new IR tests to verify 1) previously missing optimizations for `LoadVector/StoreVector` are now applied, and 2) that mask and the correct IR patterns are generated for partial vector operations.
>
> Xiaohong Gong has updated the pull request incrementally with one additional commit since the last revision:
>
> Combine the condition check and IR transformation to a method
Hi @erifan @shqking , could you please help take a look at this PR? Thanks a lot!
-------------
PR Comment: https://git.openjdk.org/jdk/pull/28651#issuecomment-3624159509
More information about the hotspot-compiler-dev
mailing list