RFR: 8340602: C2: LoadNode::split_through_phi might exhaust nodes in case of base_is_phi

Daohan Qu dqu at openjdk.org
Mon Sep 23 12:06:12 UTC 2024


# Description 

[JDK-6934604](https://github.com/openjdk/jdk/commit/b4977e887a53c898b96a7d37a3bf94742c7cc194) introduces the flag `AggressiveUnboxing` in jdk8u, and [JDK-8217919](https://github.com/openjdk/jdk/commit/71759e3177fcd6926bb36a30a26f9f3976f2aae8) enables it by default in jdk13u.

But it seems that JDK-6934604 forgets to check duplicate `PhiNode` generated in the function `LoadNode::split_through_phi(PhaseGVN *phase)` (in `memnode.cpp`) in the case that `base` is phi but `mem` is not phi. More exactly, `LoadNode::Identity(PhaseTransform *phase)` doesn't search for `PhiNode` in the correct region in that case.

This might cause infinite split which is similar to the bugs fixed in [JDK-6673473](https://github.com/openjdk/jdk/commit/30dc0edfc877000c0ae20384f228b45ba82807b7) and [JDK-8038348](https://github.com/openjdk/jdk/commit/913622a64157c4c2ce496ecddf7a8c4315e1ff84). The infinite split results in "Out of nodes" and make the method "not compilable".

Since JDK-8217919 (in jdk13u), all the later versions of jdks are affected by this bug when the expected optimization pattern appears in the code. For example, the following three micro-benchmarks running with


make test \
  TEST="micro:org.openjdk.bench.java.util.stream.tasks.IntegerMax.Bulk.bulk_seq_inner micro:org.openjdk.bench.java.util.stream.tasks.IntegerMax.Lambda.bulk_seq_lambda micro:org.openjdk.bench.java.util.stream.tasks.IntegerMax.Lambda.bulk_seq_methodRef" \
  MICRO="FORK=1;WARMUP_ITER=2" \
  TEST_OPTS="VM_OPTIONS=-XX:+UseParallelGC"


 shows performance improvement after this PR applied. (`-XX:+UseParallelGC` is only for reproduce this bug, all the bms in the following table are run with this option.)

|benchmark (throughput, unit: ops/s)| jdk-before-this-patch | jdk-after-this-patch |
|---|---|---|
|org.openjdk.bench.java.util.stream.tasks.IntegerMax.Bulk.bulk_seq_inner | 26.452 ±(99.9%) 0.185 ops/s | 62.060 ±(99.9%) 0.878 ops/s |
|org.openjdk.bench.java.util.stream.tasks.IntegerMax.Lambda.bulk_seq_lambda | 26.922 ±(99.9%) 1.710 ops/s | 67.961 ±(99.9%) 0.850 ops/s |
|org.openjdk.bench.java.util.stream.tasks.IntegerMax.Lambda.bulk_seq_methodRef | 28.382 ±(99.9%) 1.021 ops/s | 67.998 ±(99.9%) 0.751 ops/s |

# Reproduction

Compiled and run the reduced test case `Test.java` in the appendix below using 


java -Xbatch -XX:+UnlockDiagnosticVMOptions -XX:+LogCompilation -XX:LogFile=comp.log -XX:+UseParallelGC Test


and you could find that `Test$Obj.calc` is tagged with `make_not_compilable` and see some output like 


<failure reason='Out of nodes' phase='compile'/>"


And when `-XX:+AbortVMOnCompilationFailure` is used, vm will crash.

# Solutions

You could set `AggressiveUnboxing` to `false` to disable this optimization thus bypassing this bug, but I'm not sure if it will cause other regressions since it has been set to `true` for several years.

This PR tries to fix this bug with minimal side effect.

# Tests
I have run the following tests locally, and only see the `vmTestbase/jit/misctests/fpustack/GraphApplet.java` failure due to `DISPLAY` unset, which seems to be unrelated to this patch.
- [x] `jtreg:test/hotspot/jtreg/compiler`
- [x] `jtreg:test/hotspot/jtreg/vmTestbase`


# Appendix

import java.util.Random;

public class Test {

  static class Obj {
    final Integer[] array;
    final int start;
    final int end;

    Integer max = Integer.MIN_VALUE;

    Obj(Integer[] array, int start, int end) {
      this.array = array;
      this.start = start;
      this.end = end;
    }

    Integer cmp(Integer i, Integer j) {
      return i > j ? i : j;
    }

    void calc() {
      int i = start;
      do {
        max = cmp(max, array[i]);
        i++;
      } while (i < end);
    }
  }

  static final int LEN = 2000;
  static final Integer[] a = new Integer[LEN];
  static {
    Random r = new Random(0x30052012);
    for (int i = 0; i < LEN; i++) {
      a[i] = new Integer(r.nextInt());
    }
  }

  public static void main (String[] args) {
    System.out.println("Start");
    Obj o = new Obj(a, 0, LEN);
    for (int i = 0; i < 1000; i++) {
      o.calc();
    }
    System.out.println(o.max);
  }
}

-------------

Commit messages:
 - Check duplicate split in case of base_is_phi

Changes: https://git.openjdk.org/jdk/pull/21134/files
  Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=21134&range=00
  Issue: https://bugs.openjdk.org/browse/JDK-8340602
  Stats: 100 lines in 3 files changed: 63 ins; 29 del; 8 mod
  Patch: https://git.openjdk.org/jdk/pull/21134.diff
  Fetch: git fetch https://git.openjdk.org/jdk.git pull/21134/head:pull/21134

PR: https://git.openjdk.org/jdk/pull/21134


More information about the hotspot-compiler-dev mailing list