RFR: 8278296: Generalize long range check transformation [v5]

John R Rose jrose at openjdk.java.net
Tue Mar 1 23:05:03 UTC 2022


On Wed, 26 Jan 2022 16:04:06 GMT, Roland Westrelin <roland at openjdk.org> wrote:

>> 8259609 (C2: optimize long range checks in long counted loops) only
>> covered the case of a counted loop with a positive stride and a range
>> check with a positive scale. This change generalizes the long range
>> check transformation to all 4 combinations of stride and scale signs.
>> 
>> The stride > 0, scale > 0 case (covered 8259609) was tweaked so it now
>> uses Qmax computed as the inclusive limit of j*K+Q. That helps in
>> generalizing the formulas to other cases.
>> 
>> The addition of PhaseIdealLoop::is_scaled_iv_plus_extra_offset() was
>> required for the case of negative scale in an int loop. The range
>> check then has the shape:
>> 
>> (CmpUL (AddL (ConvI2L (SubI ConI (LshiftI (Phi
>> 
>> with ConI, the zero constant.
>> 
>> This change also addresses this comment from John:
>> 
>> https://github.com/openjdk/jdk/pull/6576#discussion_r765343664
>> 
>> as part of 8276116 (C2: optimize long range checks in int counted loops)
>
> Roland Westrelin has updated the pull request incrementally with one additional commit since the last revision:
> 
>   review

Attached is another bunch of documentation changes, explaining what's going on in great detail.  I think I'm done, actually.  I did this in concert with an executable test harness, to catch logic errors.

[roland-review-0301-patch.txt](https://github.com/openjdk/jdk/files/8165295/roland-review-0301-patch.txt)

In the transform code, I changed the names of some of the variables and some of the order of operations, to correspond better to the documentation (or so I hope; please check).

I made an algorithmic change, replacing an overflowing `Q_max+1` by simply `max_jlong` rather than `R`, since `max_jlong` is a simple rematerializable constant but `R` might need spilling.  I validated this change in my test harness; any value between R and `max_jlong` inclusive will work as `H_clamp`.

Near the conditional moves I made comments suggesting alternative flow-free idioms.  But I can't think of any good reason not to use the conditional moves, even on platforms (like old PPC) which implement them as control flow.  Feel free to toss those comments, though I enjoyed coming up with them.

For the record, I did a series of experiments to validate the subtle logic of this transformation.  Here is the code:
<http://cr.openjdk.java.net/~jrose/jvm/ClampedRangeCheckSimulation.zip>

Attached is another bunch of documentation changes, explaining what's going on in great detail.  I think I'm done, actually.  I did this in concert with an executable test harness, to catch logic errors.

[roland-review-0301-patch.txt](https://github.com/openjdk/jdk/files/8165295/roland-review-0301-patch.txt)

In the transform code, I changed the names of some of the variables and some of the order of operations, to correspond better to the documentation (or so I hope; please check).

I made an algorithmic change, replacing an overflowing `Q_max+1` by simply `max_jlong` rather than `R`, since `max_jlong` is a simple rematerializable constant but `R` might need spilling.  I validated this change in my test harness; any value between R and `max_jlong` inclusive will work as `H_clamp`.

Near the conditional moves I made comments suggesting alternative flow-free idioms.  But I can't think of any good reason not to use the conditional moves, even on platforms (like old PPC) which implement them as control flow.  Feel free to toss those comments, though I enjoyed coming up with them.

For the record, I did a series of experiments to validate the subtle logic of this transformation.  Here is the code:
<http://cr.openjdk.java.net/~jrose/jvm/ClampedRangeCheckSimulation.zip>

-------------

PR: https://git.openjdk.java.net/jdk/pull/6989


More information about the hotspot-compiler-dev mailing list