From duke at openjdk.org Wed Feb 1 07:14:55 2023 From: duke at openjdk.org (Viktor Klang) Date: Wed, 1 Feb 2023 07:14:55 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset In-Reply-To: <1su0_EK4Sx_blL9MQLS7147NWLaYH80QCSvM2iyV7h4=.6c053033-45ce-4234-912c-3de05ee0ef46@github.com> References: <1su0_EK4Sx_blL9MQLS7147NWLaYH80QCSvM2iyV7h4=.6c053033-45ce-4234-912c-3de05ee0ef46@github.com> Message-ID: On Tue, 31 Jan 2023 16:02:07 GMT, Per Minborg wrote: >> `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). >> >> Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. > > src/java.base/share/classes/java/time/ZoneOffset.java line 432: > >> 430: if (totalSeconds % (15 * SECONDS_PER_MINUTE) == 0) { >> 431: int slot = cacheSlot(totalSeconds); >> 432: ZoneOffset cached = SECONDS_CACHE.get(slot); > > I miss `AtomicReferenceArray::computeIfNull` that atomically will compute an element if the value at a certain index is `null`. @minborg You could compareAndExchange in a CompletableFuture?if you succeed you can complete it with the computation (bonus points since the computation can be done async) and if you fail you get either a value or a CompletableFuture you can decide if you want to block on it, and for how long? ------------- PR: https://git.openjdk.org/jdk/pull/12346 From stsypanov at openjdk.org Wed Feb 1 07:19:19 2023 From: stsypanov at openjdk.org (Sergey Tsypanov) Date: Wed, 1 Feb 2023 07:19:19 GMT Subject: RFR: 8301492: Modernize equals() method of ResourceBundle.CacheKey and Bundles.CacheKey [v2] In-Reply-To: <5xdnQZ8wLrkyB6U2miAOizqpvHGTFzy7OF_a24CUabc=.db6a65d7-9a29-44dc-94aa-692fbbfc82c7@github.com> References: <5xdnQZ8wLrkyB6U2miAOizqpvHGTFzy7OF_a24CUabc=.db6a65d7-9a29-44dc-94aa-692fbbfc82c7@github.com> Message-ID: > `ResourceBundle.CacheKey.equals()` and `Bundles.CacheKey.equals()` are quire outdated. This simple clean-up modernizes them. Sergey Tsypanov has updated the pull request incrementally with one additional commit since the last revision: Fix logic ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12328/files - new: https://git.openjdk.org/jdk/pull/12328/files/b74164a3..82b03202 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12328&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12328&range=00-01 Stats: 4 lines in 1 file changed: 2 ins; 0 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/12328.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12328/head:pull/12328 PR: https://git.openjdk.org/jdk/pull/12328 From aturbanov at openjdk.org Wed Feb 1 08:00:51 2023 From: aturbanov at openjdk.org (Andrey Turbanov) Date: Wed, 1 Feb 2023 08:00:51 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset In-Reply-To: References: Message-ID: On Tue, 31 Jan 2023 15:57:43 GMT, Per Minborg wrote: > `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). > > Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. src/java.base/share/classes/java/time/ZoneOffset.java line 147: > 145: > 146: /** Cache of time-zone offset by offset in seconds [-18h, +18h] for each even quarter of an hour. */ > 147: private static final AtomicReferenceArray SECONDS_CACHE = new AtomicReferenceArray<>(MAX_SECONDS_CACHE_SLOT * 2 + 1); Can we use regular array instead? ------------- PR: https://git.openjdk.org/jdk/pull/12346 From rgiulietti at openjdk.org Wed Feb 1 10:26:26 2023 From: rgiulietti at openjdk.org (Raffaello Giulietti) Date: Wed, 1 Feb 2023 10:26:26 GMT Subject: RFR: 8300869: Make use of the Double.toString(double) algorithm in java.util.Formatter [v8] In-Reply-To: References: Message-ID: > Align `double` and `float` decimal conversions in `java.util.Formatter` with the algorithm used in `Double.toString(double)`. Raffaello Giulietti has updated the pull request incrementally with one additional commit since the last revision: 8300869: Make use of the Double.toString(double) algorithm in java.util.Formatter ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12259/files - new: https://git.openjdk.org/jdk/pull/12259/files/5e488a70..bf7d9f64 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12259&range=07 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12259&range=06-07 Stats: 16 lines in 1 file changed: 16 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/12259.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12259/head:pull/12259 PR: https://git.openjdk.org/jdk/pull/12259 From rgiulietti at openjdk.org Wed Feb 1 10:26:27 2023 From: rgiulietti at openjdk.org (Raffaello Giulietti) Date: Wed, 1 Feb 2023 10:26:27 GMT Subject: RFR: 8300869: Make use of the Double.toString(double) algorithm in java.util.Formatter [v7] In-Reply-To: References:

Message-ID: <4FDCIv_rq0v1SUez4QWFS_45ja2Wdp6uvud9mv2vElM=.58113abe-beb3-498f-ae8d-cbbeb440780d@github.com> On Tue, 31 Jan 2023 22:19:25 GMT, Joe Darcy wrote: >> Raffaello Giulietti has updated the pull request incrementally with one additional commit since the last revision: >> >> 8300869: Make use of the Double.toString(double) algorithm in java.util.Formatter > > src/java.base/share/classes/jdk/internal/math/FormattedFPDecimal.java line 28: > >> 26: package jdk.internal.math; >> 27: >> 28: public final class FormattedFPDecimal { > > I suggest adding a short explanation of what this class is used for. Addressed. ------------- PR: https://git.openjdk.org/jdk/pull/12259 From stsypanov at openjdk.org Wed Feb 1 10:36:12 2023 From: stsypanov at openjdk.org (Sergey Tsypanov) Date: Wed, 1 Feb 2023 10:36:12 GMT Subject: RFR: 8301492: Modernize equals() method of ResourceBundle.CacheKey and Bundles.CacheKey [v3] In-Reply-To: <5xdnQZ8wLrkyB6U2miAOizqpvHGTFzy7OF_a24CUabc=.db6a65d7-9a29-44dc-94aa-692fbbfc82c7@github.com> References: <5xdnQZ8wLrkyB6U2miAOizqpvHGTFzy7OF_a24CUabc=.db6a65d7-9a29-44dc-94aa-692fbbfc82c7@github.com> Message-ID: > `ResourceBundle.CacheKey.equals()` and `Bundles.CacheKey.equals()` are quire outdated. This simple clean-up modernizes them. Sergey Tsypanov has updated the pull request incrementally with one additional commit since the last revision: Restore logic ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12328/files - new: https://git.openjdk.org/jdk/pull/12328/files/82b03202..2b07e47e Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12328&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12328&range=01-02 Stats: 4 lines in 1 file changed: 0 ins; 2 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/12328.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12328/head:pull/12328 PR: https://git.openjdk.org/jdk/pull/12328 From pminborg at openjdk.org Wed Feb 1 11:41:16 2023 From: pminborg at openjdk.org (Per Minborg) Date: Wed, 1 Feb 2023 11:41:16 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v2] In-Reply-To: References: Message-ID: > `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). > > Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. Per Minborg has updated the pull request incrementally with one additional commit since the last revision: Rework using a regular array and acquire/release semantics ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12346/files - new: https://git.openjdk.org/jdk/pull/12346/files/eedec09d..a99dd083 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12346&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12346&range=00-01 Stats: 43 lines in 1 file changed: 33 ins; 1 del; 9 mod Patch: https://git.openjdk.org/jdk/pull/12346.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12346/head:pull/12346 PR: https://git.openjdk.org/jdk/pull/12346 From pminborg at openjdk.org Wed Feb 1 11:41:18 2023 From: pminborg at openjdk.org (Per Minborg) Date: Wed, 1 Feb 2023 11:41:18 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v2] In-Reply-To: References:

Message-ID: On Wed, 1 Feb 2023 07:58:02 GMT, Andrey Turbanov wrote: >> Per Minborg has updated the pull request incrementally with one additional commit since the last revision: >> >> Rework using a regular array and acquire/release semantics > > src/java.base/share/classes/java/time/ZoneOffset.java line 147: > >> 145: >> 146: /** Cache of time-zone offset by offset in seconds [-18h, +18h] for each even quarter of an hour. */ >> 147: private static final AtomicReferenceArray SECONDS_CACHE = new AtomicReferenceArray<>(MAX_SECONDS_CACHE_SLOT * 2 + 1); > > Can we use regular array instead? We can but that entails special handling to ensure thread-safety. I will provide such a solution. Thanks. ------------- PR: https://git.openjdk.org/jdk/pull/12346 From pminborg at openjdk.org Wed Feb 1 12:06:25 2023 From: pminborg at openjdk.org (Per Minborg) Date: Wed, 1 Feb 2023 12:06:25 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v3] In-Reply-To: References: Message-ID: > `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). > > Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. Per Minborg has updated the pull request incrementally with one additional commit since the last revision: Remove code commented out ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12346/files - new: https://git.openjdk.org/jdk/pull/12346/files/a99dd083..5a8e9720 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12346&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12346&range=01-02 Stats: 13 lines in 1 file changed: 0 ins; 13 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/12346.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12346/head:pull/12346 PR: https://git.openjdk.org/jdk/pull/12346 From pminborg at openjdk.org Wed Feb 1 13:18:31 2023 From: pminborg at openjdk.org (Per Minborg) Date: Wed, 1 Feb 2023 13:18:31 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v4] In-Reply-To: References: Message-ID: > `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). > > Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. Per Minborg has updated the pull request incrementally with two additional commits since the last revision: - Simplify benchmark - Add benchmark ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12346/files - new: https://git.openjdk.org/jdk/pull/12346/files/5a8e9720..562885c7 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12346&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12346&range=02-03 Stats: 70 lines in 1 file changed: 70 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/12346.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12346/head:pull/12346 PR: https://git.openjdk.org/jdk/pull/12346 From pminborg at openjdk.org Wed Feb 1 13:40:52 2023 From: pminborg at openjdk.org (Per Minborg) Date: Wed, 1 Feb 2023 13:40:52 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v4] In-Reply-To: References:

Message-ID: On Wed, 1 Feb 2023 13:18:31 GMT, Per Minborg wrote: >> `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). >> >> Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. > > Per Minborg has updated the pull request incrementally with two additional commits since the last revision: > > - Simplify benchmark > - Add benchmark A > 3X performance increase is obtained via this PR for `ZoneOffset::ofTotalSeconds` for values from the cache (higher is better): Baseline Result "org.openjdk.bench.java.time.ZoneOffsetBench.getFromCache": 1.088 ?(99.9%) 0.019 ops/us [Average] (min, avg, max) = (1.046, 1.088, 1.109), stdev = 0.018 CI (99.9%): [1.069, 1.108] (assumes normal distribution) PR Result "org.openjdk.bench.java.time.ZoneOffsetBench.getFromCache": 3.710 ?(99.9%) 0.031 ops/us [Average] (min, avg, max) = (3.651, 3.710, 3.745), stdev = 0.029 CI (99.9%): [3.680, 3.741] (assumes normal distribution) ![image](https://user-images.githubusercontent.com/7457876/216058153-03c06037-5ddb-40cb-8e2c-5beae0560126.png) ------------- PR: https://git.openjdk.org/jdk/pull/12346 From stsypanov at openjdk.org Wed Feb 1 18:59:51 2023 From: stsypanov at openjdk.org (Sergey Tsypanov) Date: Wed, 1 Feb 2023 18:59:51 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v4] In-Reply-To: References:

Message-ID: On Wed, 1 Feb 2023 13:18:31 GMT, Per Minborg wrote: >> `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). >> >> Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. > > Per Minborg has updated the pull request incrementally with two additional commits since the last revision: > > - Simplify benchmark > - Add benchmark test/micro/org/openjdk/bench/java/time/ZoneOffsetBench.java line 66: > 64: for (int s : CACHED_SECONDS) { > 65: ZoneOffset zo = ZoneOffset.ofTotalSeconds(s); > 66: sum += zo.getTotalSeconds(); I think we should feed the value of `zo.getTotalSeconds()` into Blackhole, see https://github.com/openjdk/jmh/blob/master/jmh-samples/src/main/java/org/openjdk/jmh/samples/JMHSample_34_SafeLooping.java#L128 ------------- PR: https://git.openjdk.org/jdk/pull/12346 From duke at openjdk.org Wed Feb 1 20:22:49 2023 From: duke at openjdk.org (cheenar) Date: Wed, 1 Feb 2023 20:22:49 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v4] In-Reply-To: References:

Message-ID: <05rUH2WPjpBg564PQ1g8t7Sq5Ui4-hZ8mnUAQj3uYqU=.bcc1afda-8172-4e4c-9afe-d5272b60defc@github.com> On Wed, 1 Feb 2023 13:18:31 GMT, Per Minborg wrote: >> `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). >> >> Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. > > Per Minborg has updated the pull request incrementally with two additional commits since the last revision: > > - Simplify benchmark > - Add benchmark test/jdk/java/time/tck/java/time/zone/TCKFixedZoneRules.java line 141: > 139: @Test(dataProvider="rules") > 140: public void test_isValidOffset_LDT_ZO(ZoneRules test, ZoneOffset expectedOffset) { > 141: if (expectedOffset == ZoneOffset.UTC) Extremely minor but why not wrap if with `{}` for improved readability here with the comment ------------- PR: https://git.openjdk.org/jdk/pull/12346 From duke at openjdk.org Wed Feb 1 20:22:51 2023 From: duke at openjdk.org (cheenar) Date: Wed, 1 Feb 2023 20:22:51 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v4] In-Reply-To: <05rUH2WPjpBg564PQ1g8t7Sq5Ui4-hZ8mnUAQj3uYqU=.bcc1afda-8172-4e4c-9afe-d5272b60defc@github.com> References:

<05rUH2WPjpBg564PQ1g8t7Sq5Ui4-hZ8mnUAQj3uYqU=.bcc1afda-8172-4e4c-9afe-d5272b60defc@github.com> Message-ID: <4H30se4qw_TKSWrvInldU08F3KKcHmAPPSSuWc9kXYY=.6e30a782-6617-43f9-83d2-8237d5389b85@github.com> On Wed, 1 Feb 2023 20:16:56 GMT, cheenar wrote: >> Per Minborg has updated the pull request incrementally with two additional commits since the last revision: >> >> - Simplify benchmark >> - Add benchmark > > test/jdk/java/time/tck/java/time/zone/TCKFixedZoneRules.java line 141: > >> 139: @Test(dataProvider="rules") >> 140: public void test_isValidOffset_LDT_ZO(ZoneRules test, ZoneOffset expectedOffset) { >> 141: if (expectedOffset == ZoneOffset.UTC) > > Extremely minor but why not wrap if with `{}` for improved readability here with the comment Same [here](https://github.com/openjdk/jdk/pull/12346/commits/ec49ca3bc03d2e97fa0429c84290923066667871?diff=unified&w=0#diff-9e5aa282dc2d02c31e1d7c5ec8196a1d3d23c06e471d5114d0bd0c78ee4fe5f6R433) although it feels much more dangerous than the test! ------------- PR: https://git.openjdk.org/jdk/pull/12346 From duke at openjdk.org Thu Feb 2 09:29:59 2023 From: duke at openjdk.org (j3graham) Date: Thu, 2 Feb 2023 09:29:59 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v4] In-Reply-To: References:

Message-ID: <1JFNX-OhKD3tX8MPVdvQ6WEdsdrdId1C7VJshazArZU=.52bfd266-dede-4362-b41d-38e535d53aa5@github.com> On Wed, 1 Feb 2023 11:37:16 GMT, Per Minborg wrote: >> src/java.base/share/classes/java/time/ZoneOffset.java line 147: >> >>> 145: >>> 146: /** Cache of time-zone offset by offset in seconds [-18h, +18h] for each even quarter of an hour. */ >>> 147: private static final AtomicReferenceArray SECONDS_CACHE = new AtomicReferenceArray<>(MAX_SECONDS_CACHE_SLOT * 2 + 1); >> >> Can we use regular array instead? > > We can but that entails special handling to ensure thread-safety. I will provide such a solution. Thanks. If you need the thread-safety, perhaps sticking with `AtomicReferenceArray` is a simpler solution than the regular array one. ------------- PR: https://git.openjdk.org/jdk/pull/12346 From rgiulietti at openjdk.org Thu Feb 2 09:46:34 2023 From: rgiulietti at openjdk.org (Raffaello Giulietti) Date: Thu, 2 Feb 2023 09:46:34 GMT Subject: RFR: 8300869: Make use of the Double.toString(double) algorithm in java.util.Formatter [v8] In-Reply-To: References:

<05rUH2WPjpBg564PQ1g8t7Sq5Ui4-hZ8mnUAQj3uYqU=.bcc1afda-8172-4e4c-9afe-d5272b60defc@github.com> <4H30se4qw_TKSWrvInldU08F3KKcHmAPPSSuWc9kXYY=.6e30a782-6617-43f9-83d2-8237d5389b85@github.com> Message-ID: On Wed, 1 Feb 2023 20:19:20 GMT, cheenar wrote: >> test/jdk/java/time/tck/java/time/zone/TCKFixedZoneRules.java line 141: >> >>> 139: @Test(dataProvider="rules") >>> 140: public void test_isValidOffset_LDT_ZO(ZoneRules test, ZoneOffset expectedOffset) { >>> 141: if (expectedOffset == ZoneOffset.UTC) >> >> Extremely minor but why not wrap if with `{}` for improved readability here with the comment > > Same [here](https://github.com/openjdk/jdk/pull/12346/commits/ec49ca3bc03d2e97fa0429c84290923066667871?diff=unified&w=0#diff-9e5aa282dc2d02c31e1d7c5ec8196a1d3d23c06e471d5114d0bd0c78ee4fe5f6R433) although it feels much more dangerous than the test! Yes, please always wrap with `{}` in java.time.* ------------- PR: https://git.openjdk.org/jdk/pull/12346 From pminborg at openjdk.org Thu Feb 2 15:13:08 2023 From: pminborg at openjdk.org (Per Minborg) Date: Thu, 2 Feb 2023 15:13:08 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v5] In-Reply-To: References: Message-ID: > `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). > > Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. Per Minborg has updated the pull request incrementally with one additional commit since the last revision: Add brackets ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12346/files - new: https://git.openjdk.org/jdk/pull/12346/files/562885c7..48438fd8 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12346&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12346&range=03-04 Stats: 2 lines in 1 file changed: 1 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/12346.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12346/head:pull/12346 PR: https://git.openjdk.org/jdk/pull/12346 From pminborg at openjdk.org Thu Feb 2 15:13:11 2023 From: pminborg at openjdk.org (Per Minborg) Date: Thu, 2 Feb 2023 15:13:11 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v5] In-Reply-To: <1JFNX-OhKD3tX8MPVdvQ6WEdsdrdId1C7VJshazArZU=.52bfd266-dede-4362-b41d-38e535d53aa5@github.com> References:

<1JFNX-OhKD3tX8MPVdvQ6WEdsdrdId1C7VJshazArZU=.52bfd266-dede-4362-b41d-38e535d53aa5@github.com> Message-ID: On Thu, 2 Feb 2023 09:26:55 GMT, j3graham wrote: >> We can but that entails special handling to ensure thread-safety. I will provide such a solution. Thanks. > > If you need the thread-safety, perhaps sticking with `AtomicReferenceArray` is a simpler solution than the regular array one. As I used a regular array, I realized I could provide better performance with not much added complexity. See the use of `@Stable` for the components in the array. ------------- PR: https://git.openjdk.org/jdk/pull/12346 From pminborg at openjdk.org Thu Feb 2 15:44:54 2023 From: pminborg at openjdk.org (Per Minborg) Date: Thu, 2 Feb 2023 15:44:54 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v6] In-Reply-To: References: Message-ID: <3yEmX_TScPCLb-W1zwHwEXZXvXWAzemtwCtYTFPsSXg=.d8aaa569-8812-45dd-a3b5-af99b677165b@github.com> > `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). > > Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. Per Minborg has updated the pull request incrementally with one additional commit since the last revision: Fix benchmark ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12346/files - new: https://git.openjdk.org/jdk/pull/12346/files/48438fd8..1cd2b406 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12346&range=05 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12346&range=04-05 Stats: 7 lines in 1 file changed: 1 ins; 2 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/12346.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12346/head:pull/12346 PR: https://git.openjdk.org/jdk/pull/12346 From pminborg at openjdk.org Thu Feb 2 16:23:55 2023 From: pminborg at openjdk.org (Per Minborg) Date: Thu, 2 Feb 2023 16:23:55 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v6] In-Reply-To: <3yEmX_TScPCLb-W1zwHwEXZXvXWAzemtwCtYTFPsSXg=.d8aaa569-8812-45dd-a3b5-af99b677165b@github.com> References: <3yEmX_TScPCLb-W1zwHwEXZXvXWAzemtwCtYTFPsSXg=.d8aaa569-8812-45dd-a3b5-af99b677165b@github.com> Message-ID: On Thu, 2 Feb 2023 15:44:54 GMT, Per Minborg wrote: >> `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). >> >> Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. > > Per Minborg has updated the pull request incrementally with one additional commit since the last revision: > > Fix benchmark Updated figures with the new benchmark: Baseline Result "org.openjdk.bench.java.time.ZoneOffsetBench.getFromCache": 887.478 ?(99.9%) 10.206 ns/op [Average] (min, avg, max) = (876.206, 887.478, 906.754), stdev = 9.547 CI (99.9%): [877.271, 897.684] (assumes normal distribution) Patch Result "org.openjdk.bench.java.time.ZoneOffsetBench.getFromCache": 252.646 ?(99.9%) 2.794 ns/op [Average] (min, avg, max) = (250.451, 252.646, 258.890), stdev = 2.614 CI (99.9%): [249.851, 255.440] (assumes normal distribution) JDK 20 - Baseline (JDK 21 before patch) JDK 21 - Patch ![image](https://user-images.githubusercontent.com/7457876/216381349-38318d43-a9f3-44f8-9019-293ddd7bf3e2.png) ------------- PR: https://git.openjdk.org/jdk/pull/12346 From naoto at openjdk.org Thu Feb 2 17:50:28 2023 From: naoto at openjdk.org (Naoto Sato) Date: Thu, 2 Feb 2023 17:50:28 GMT Subject: RFR: 8300869: Make use of the Double.toString(double) algorithm in java.util.Formatter [v8] In-Reply-To: References:

Message-ID: On Wed, 1 Feb 2023 10:26:26 GMT, Raffaello Giulietti wrote: >> Align `double` and `float` decimal conversions in `java.util.Formatter` with the algorithm used in `Double.toString(double)`. > > Raffaello Giulietti has updated the pull request incrementally with one additional commit since the last revision: > > 8300869: Make use of the Double.toString(double) algorithm in java.util.Formatter I skimmed through the changes, and see no problem wrt the i18n area. ------------- Marked as reviewed by naoto (Reviewer). PR: https://git.openjdk.org/jdk/pull/12259 From rgiulietti at openjdk.org Thu Feb 2 19:14:35 2023 From: rgiulietti at openjdk.org (Raffaello Giulietti) Date: Thu, 2 Feb 2023 19:14:35 GMT Subject: Integrated: 8300869: Make use of the Double.toString(double) algorithm in java.util.Formatter In-Reply-To: References: Message-ID: On Fri, 27 Jan 2023 16:02:38 GMT, Raffaello Giulietti wrote: > Align `double` and `float` decimal conversions in `java.util.Formatter` with the algorithm used in `Double.toString(double)`. This pull request has now been integrated. Changeset: f696785f Author: Raffaello Giulietti URL: https://git.openjdk.org/jdk/commit/f696785fd3bc5b27c06260088a2e0ce520e12142 Stats: 936 lines in 11 files changed: 524 ins; 372 del; 40 mod 8300869: Make use of the Double.toString(double) algorithm in java.util.Formatter Reviewed-by: darcy, naoto ------------- PR: https://git.openjdk.org/jdk/pull/12259 From rriggs at openjdk.org Fri Feb 3 15:47:07 2023 From: rriggs at openjdk.org (Roger Riggs) Date: Fri, 3 Feb 2023 15:47:07 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v6] In-Reply-To: <3yEmX_TScPCLb-W1zwHwEXZXvXWAzemtwCtYTFPsSXg=.d8aaa569-8812-45dd-a3b5-af99b677165b@github.com> References: <3yEmX_TScPCLb-W1zwHwEXZXvXWAzemtwCtYTFPsSXg=.d8aaa569-8812-45dd-a3b5-af99b677165b@github.com> Message-ID: <3NLsh7ML4B9x-YSlpvJjfFPCNIS5ZCpeGisUHgmTF_k=.f6cea935-ee7d-4cac-ac6d-03178f0dfd8b@github.com> On Thu, 2 Feb 2023 15:44:54 GMT, Per Minborg wrote: >> `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). >> >> Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. > > Per Minborg has updated the pull request incrementally with one additional commit since the last revision: > > Fix benchmark Is this added complexity and startup time (VarHandle) worth it? Will normal usage be noticed by typical applications?(Microbenchmark not withstanding). ------------- PR: https://git.openjdk.org/jdk/pull/12346 From psadhukhan at openjdk.org Mon Feb 6 08:54:00 2023 From: psadhukhan at openjdk.org (Prasanta Sadhukhan) Date: Mon, 6 Feb 2023 08:54:00 GMT Subject: Integrated: 4934362: see also refers to self In-Reply-To: References: Message-ID: On Tue, 3 Jan 2023 06:00:37 GMT, Prasanta Sadhukhan wrote: > Some methods and constants has a hyperreference to self in javadoc which is rectified to reference proper methods This pull request has now been integrated. Changeset: ab528ce3 Author: Prasanta Sadhukhan URL: https://git.openjdk.org/jdk/commit/ab528ce3cd4bb75a00f5eaadae1f5e45d26712b5 Stats: 27 lines in 10 files changed: 4 ins; 4 del; 19 mod 4934362: see also refers to self Reviewed-by: prr, serb, aivanov ------------- PR: https://git.openjdk.org/jdk/pull/11820 From pminborg at openjdk.org Thu Feb 9 13:39:20 2023 From: pminborg at openjdk.org (Per Minborg) Date: Thu, 9 Feb 2023 13:39:20 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v7] In-Reply-To: References: Message-ID: > `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). > > Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. Per Minborg has updated the pull request incrementally with one additional commit since the last revision: Add a generic LazyReferenceArray ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12346/files - new: https://git.openjdk.org/jdk/pull/12346/files/1cd2b406..b89a9aae Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12346&range=06 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12346&range=05-06 Stats: 305 lines in 4 files changed: 275 ins; 19 del; 11 mod Patch: https://git.openjdk.org/jdk/pull/12346.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12346/head:pull/12346 PR: https://git.openjdk.org/jdk/pull/12346 From pminborg at openjdk.org Thu Feb 9 13:46:14 2023 From: pminborg at openjdk.org (Per Minborg) Date: Thu, 9 Feb 2023 13:46:14 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v8] In-Reply-To: References: Message-ID: > `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). > > Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. Per Minborg has updated the pull request incrementally with three additional commits since the last revision: - Remove unused setup method - Rename method in test - Add copyright header ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12346/files - new: https://git.openjdk.org/jdk/pull/12346/files/b89a9aae..29674b14 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12346&range=07 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12346&range=06-07 Stats: 32 lines in 3 files changed: 23 ins; 8 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/12346.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12346/head:pull/12346 PR: https://git.openjdk.org/jdk/pull/12346 From jlaskey at openjdk.org Fri Feb 10 13:37:37 2023 From: jlaskey at openjdk.org (Jim Laskey) Date: Fri, 10 Feb 2023 13:37:37 GMT Subject: RFR: JDK-8285932 Implementation of JEP 430 String Templates (Preview) [v35] In-Reply-To: References: Message-ID: > Enhance the Java programming language with string templates, which are similar to string literals but contain embedded expressions. A string template is interpreted at run time by replacing each expression with the result of evaluating that expression, possibly after further validation and transformation. This is a [preview language feature and API](http://openjdk.java.net/jeps/12). Jim Laskey has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 50 commits: - Merge branch 'master' into 8285932 - Update to JDK 21 - Merge branch 'master' into 8285932 - Merge branch 'master' into 8285932 - FormatProcessor changes - Update @since - Requested changes #12 - Seal Digits - Requested changes #11 - Typo - ... and 40 more: https://git.openjdk.org/jdk/compare/c8ace482...264120a9 ------------- Changes: https://git.openjdk.org/jdk/pull/10889/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=10889&range=34 Stats: 9519 lines in 81 files changed: 9356 ins; 28 del; 135 mod Patch: https://git.openjdk.org/jdk/pull/10889.diff Fetch: git fetch https://git.openjdk.org/jdk pull/10889/head:pull/10889 PR: https://git.openjdk.org/jdk/pull/10889 From jlaskey at openjdk.org Fri Feb 10 14:27:17 2023 From: jlaskey at openjdk.org (Jim Laskey) Date: Fri, 10 Feb 2023 14:27:17 GMT Subject: RFR: JDK-8285932 Implementation of JEP 430 String Templates (Preview) [v36] In-Reply-To: References: Message-ID: > Enhance the Java programming language with string templates, which are similar to string literals but contain embedded expressions. A string template is interpreted at run time by replacing each expression with the result of evaluating that expression, possibly after further validation and transformation. This is a [preview language feature and API](http://openjdk.java.net/jeps/12). Jim Laskey has updated the pull request incrementally with one additional commit since the last revision: Bring up to date ------------- Changes: - all: https://git.openjdk.org/jdk/pull/10889/files - new: https://git.openjdk.org/jdk/pull/10889/files/264120a9..665cded9 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=10889&range=35 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=10889&range=34-35 Stats: 2 lines in 1 file changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/10889.diff Fetch: git fetch https://git.openjdk.org/jdk pull/10889/head:pull/10889 PR: https://git.openjdk.org/jdk/pull/10889 From jlaskey at openjdk.org Fri Feb 10 17:07:24 2023 From: jlaskey at openjdk.org (Jim Laskey) Date: Fri, 10 Feb 2023 17:07:24 GMT Subject: RFR: JDK-8285932 Implementation of JEP 430 String Templates (Preview) [v37] In-Reply-To: References: Message-ID: <3Cffq9T2VOO7KsFUbANnyixAkxi4Ztlojk9voEvmF1I=.2ed8c2bf-e704-4661-9185-102ac3f15e7a@github.com> > Enhance the Java programming language with string templates, which are similar to string literals but contain embedded expressions. A string template is interpreted at run time by replacing each expression with the result of evaluating that expression, possibly after further validation and transformation. This is a [preview language feature and API](http://openjdk.java.net/jeps/12). Jim Laskey has updated the pull request incrementally with one additional commit since the last revision: CSR review ------------- Changes: - all: https://git.openjdk.org/jdk/pull/10889/files - new: https://git.openjdk.org/jdk/pull/10889/files/665cded9..8f5ad0a4 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=10889&range=36 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=10889&range=35-36 Stats: 65 lines in 6 files changed: 55 ins; 0 del; 10 mod Patch: https://git.openjdk.org/jdk/pull/10889.diff Fetch: git fetch https://git.openjdk.org/jdk pull/10889/head:pull/10889 PR: https://git.openjdk.org/jdk/pull/10889 From jlaskey at openjdk.org Fri Feb 10 17:09:18 2023 From: jlaskey at openjdk.org (Jim Laskey) Date: Fri, 10 Feb 2023 17:09:18 GMT Subject: RFR: JDK-8285932 Implementation of JEP 430 String Templates (Preview) [v38] In-Reply-To: References: Message-ID: > Enhance the Java programming language with string templates, which are similar to string literals but contain embedded expressions. A string template is interpreted at run time by replacing each expression with the result of evaluating that expression, possibly after further validation and transformation. This is a [preview language feature and API](http://openjdk.java.net/jeps/12). Jim Laskey has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 53 commits: - Merge branch 'master' into 8285932 - CSR review - Bring up to date - Merge branch 'master' into 8285932 - Update to JDK 21 - Merge branch 'master' into 8285932 - Merge branch 'master' into 8285932 - FormatProcessor changes - Update @since - Requested changes #12 - ... and 43 more: https://git.openjdk.org/jdk/compare/4539899c...5fab46c1 ------------- Changes: https://git.openjdk.org/jdk/pull/10889/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=10889&range=37 Stats: 9576 lines in 81 files changed: 9411 ins; 28 del; 137 mod Patch: https://git.openjdk.org/jdk/pull/10889.diff Fetch: git fetch https://git.openjdk.org/jdk pull/10889/head:pull/10889 PR: https://git.openjdk.org/jdk/pull/10889 From jlaskey at openjdk.org Fri Feb 10 17:32:00 2023 From: jlaskey at openjdk.org (Jim Laskey) Date: Fri, 10 Feb 2023 17:32:00 GMT Subject: RFR: JDK-8285932 Implementation of JEP 430 String Templates (Preview) [v39] In-Reply-To: References: Message-ID: > Enhance the Java programming language with string templates, which are similar to string literals but contain embedded expressions. A string template is interpreted at run time by replacing each expression with the result of evaluating that expression, possibly after further validation and transformation. This is a [preview language feature and API](http://openjdk.java.net/jeps/12). Jim Laskey has updated the pull request incrementally with one additional commit since the last revision: Minor correction to javadoc ------------- Changes: - all: https://git.openjdk.org/jdk/pull/10889/files - new: https://git.openjdk.org/jdk/pull/10889/files/5fab46c1..5a031bda Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=10889&range=38 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=10889&range=37-38 Stats: 2 lines in 1 file changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/10889.diff Fetch: git fetch https://git.openjdk.org/jdk pull/10889/head:pull/10889 PR: https://git.openjdk.org/jdk/pull/10889 From rriggs at openjdk.org Fri Feb 10 21:21:29 2023 From: rriggs at openjdk.org (Roger Riggs) Date: Fri, 10 Feb 2023 21:21:29 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v8] In-Reply-To: References:

Message-ID: On Thu, 9 Feb 2023 13:46:14 GMT, Per Minborg wrote: >> `ZoneOffset` instances are cached by the `ZoneOffset` class itself for values in the range [-18h, 18h] for each second that is on an even quarter of an hour (i.e. at most 2*18*4+1 = 145 values). >> >> Instead of using a `ConcurrentHashMap` for caching instanced, we could instead use an `AtomicReferenceArray` with direct slot value access for said even seconds. This will improve performance and reduce the number of object even though the backing array will go from an initial 32 in the CHM to an initial/final 145 in the ARA. The CHM will contain much more objects and array slots for typical numbers of entries in the cache and will compute hash/bucket/collision on the hot code path for each cache access. > > Per Minborg has updated the pull request incrementally with three additional commits since the last revision: > > - Remove unused setup method > - Rename method in test > - Add copyright header Another musing related to ZoneOffset; it would be desirable if ZoneOffset was a value class (forward looking to the Valhalla project) for performance reasons, meaning immutable and without identity. Currently, it is "value-based", intending to be immutable, but it has a mutable field that caches the ZoneRules. At some point, it may be desirable to refactor the implementation to avoid the mutable field. ------------- PR: https://git.openjdk.org/jdk/pull/12346 From jlaskey at openjdk.org Sat Feb 11 17:49:49 2023 From: jlaskey at openjdk.org (Jim Laskey) Date: Sat, 11 Feb 2023 17:49:49 GMT Subject: RFR: JDK-8285932 Implementation of JEP 430 String Templates (Preview) [v40] In-Reply-To: References: Message-ID: > Enhance the Java programming language with string templates, which are similar to string literals but contain embedded expressions. A string template is interpreted at run time by replacing each expression with the result of evaluating that expression, possibly after further validation and transformation. This is a [preview language feature and API](http://openjdk.java.net/jeps/12). Jim Laskey has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 55 commits: - Merge branch 'master' into 8285932 - Minor correction to javadoc - Merge branch 'master' into 8285932 - CSR review - Bring up to date - Merge branch 'master' into 8285932 - Update to JDK 21 - Merge branch 'master' into 8285932 - Merge branch 'master' into 8285932 - FormatProcessor changes - ... and 45 more: https://git.openjdk.org/jdk/compare/6f9f2b5d...95d219af ------------- Changes: https://git.openjdk.org/jdk/pull/10889/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=10889&range=39 Stats: 9492 lines in 81 files changed: 9394 ins; 28 del; 70 mod Patch: https://git.openjdk.org/jdk/pull/10889.diff Fetch: git fetch https://git.openjdk.org/jdk pull/10889/head:pull/10889 PR: https://git.openjdk.org/jdk/pull/10889 From jpai at openjdk.org Fri Feb 17 12:03:54 2023 From: jpai at openjdk.org (Jaikiran Pai) Date: Fri, 17 Feb 2023 12:03:54 GMT Subject: RFR: 8302664: Fix several incorrect usages of Preconditions.checkFromIndexSize Message-ID: Can I please get a review of this change which fixes the usage of `Preconditions.checkFromIndexSize`? This addresses https://bugs.openjdk.org/browse/JDK-8302664. There was an oversight when these changes were introduced in https://github.com/openjdk/jdk/pull/4507. I have now gone through that patch again to make sure the relevant places where this fix is needed has been addressed in this current PR. I have also looked into other changes in that PR, just to be sure that there aren't any similar fixes needed for other method calls that were introduced in it - they all look fine. tier1,tier2 and tier3 testing with this change passed successfully. I thought (and experimented a bit) to add new tests for Deflater/Inflater to catch these byte array indexing issues, but it wasn't straight forward and I would have to write the entire inflate/deflate test just to verify this issue. So I decided to leave out new tests for now in this PR. ------------- Commit messages: - 8302664: Fix several incorrect usages of Preconditions.checkFromIndexSize Changes: https://git.openjdk.org/jdk/pull/12595/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=12595&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8302664 Stats: 19 lines in 8 files changed: 0 ins; 0 del; 19 mod Patch: https://git.openjdk.org/jdk/pull/12595.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12595/head:pull/12595 PR: https://git.openjdk.org/jdk/pull/12595 From djelinski at openjdk.org Fri Feb 17 12:23:24 2023 From: djelinski at openjdk.org (Daniel =?UTF-8?B?SmVsacWEc2tp?=) Date: Fri, 17 Feb 2023 12:23:24 GMT Subject: RFR: 8302664: Fix several incorrect usages of Preconditions.checkFromIndexSize In-Reply-To: References: Message-ID: On Thu, 16 Feb 2023 14:42:52 GMT, Jaikiran Pai wrote: > Can I please get a review of this change which fixes the usage of `Preconditions.checkFromIndexSize`? This addresses https://bugs.openjdk.org/browse/JDK-8302664. > > There was an oversight when these changes were introduced in https://github.com/openjdk/jdk/pull/4507. I have now gone through that patch again to make sure the relevant places where this fix is needed has been addressed in this current PR. > > I have also looked into other changes in that PR, just to be sure that there aren't any similar fixes needed for other method calls that were introduced in it - they all look fine. > > tier1,tier2 and tier3 testing with this change passed successfully. I thought (and experimented a bit) to add new tests for Deflater/Inflater to catch these byte array indexing issues, but it wasn't straight forward and I would have to write the entire inflate/deflate test just to verify this issue. So I decided to leave out new tests for now in this PR. As far as I could tell, this only changes the generated exception message. The condition is (off+len<=size), so the exception will be thrown on the same input as before. LGTM. ------------- Marked as reviewed by djelinski (Committer). PR: https://git.openjdk.org/jdk/pull/12595 From dfuchs at openjdk.org Fri Feb 17 13:12:50 2023 From: dfuchs at openjdk.org (Daniel Fuchs) Date: Fri, 17 Feb 2023 13:12:50 GMT Subject: RFR: 8302664: Fix several incorrect usages of Preconditions.checkFromIndexSize In-Reply-To: References: Message-ID: On Thu, 16 Feb 2023 14:42:52 GMT, Jaikiran Pai wrote: > Can I please get a review of this change which fixes the usage of `Preconditions.checkFromIndexSize`? This addresses https://bugs.openjdk.org/browse/JDK-8302664. > > There was an oversight when these changes were introduced in https://github.com/openjdk/jdk/pull/4507. I have now gone through that patch again to make sure the relevant places where this fix is needed has been addressed in this current PR. > > I have also looked into other changes in that PR, just to be sure that there aren't any similar fixes needed for other method calls that were introduced in it - they all look fine. > > tier1,tier2 and tier3 testing with this change passed successfully. I thought (and experimented a bit) to add new tests for Deflater/Inflater to catch these byte array indexing issues, but it wasn't straight forward and I would have to write the entire inflate/deflate test just to verify this issue. So I decided to leave out new tests for now in this PR. As Daniel noted the computation is symmetrical which explains why it worked. But passing inverted parameters was confusing. LGTM2. ------------- Marked as reviewed by dfuchs (Reviewer). PR: https://git.openjdk.org/jdk/pull/12595 From alanb at openjdk.org Fri Feb 17 13:38:27 2023 From: alanb at openjdk.org (Alan Bateman) Date: Fri, 17 Feb 2023 13:38:27 GMT Subject: RFR: 8302664: Fix several incorrect usages of Preconditions.checkFromIndexSize In-Reply-To: References: Message-ID: On Thu, 16 Feb 2023 14:42:52 GMT, Jaikiran Pai wrote: > Can I please get a review of this change which fixes the usage of `Preconditions.checkFromIndexSize`? This addresses https://bugs.openjdk.org/browse/JDK-8302664. > > There was an oversight when these changes were introduced in https://github.com/openjdk/jdk/pull/4507. I have now gone through that patch again to make sure the relevant places where this fix is needed has been addressed in this current PR. > > I have also looked into other changes in that PR, just to be sure that there aren't any similar fixes needed for other method calls that were introduced in it - they all look fine. > > tier1,tier2 and tier3 testing with this change passed successfully. I thought (and experimented a bit) to add new tests for Deflater/Inflater to catch these byte array indexing issues, but it wasn't straight forward and I would have to write the entire inflate/deflate test just to verify this issue. So I decided to leave out new tests for now in this PR. It's easy to get the parameters wrong to these methods but if the fromIndex and size are mixed up then it should just impact the exception message. ------------- Marked as reviewed by alanb (Reviewer). PR: https://git.openjdk.org/jdk/pull/12595 From jpai at openjdk.org Sat Feb 18 00:51:35 2023 From: jpai at openjdk.org (Jaikiran Pai) Date: Sat, 18 Feb 2023 00:51:35 GMT Subject: RFR: 8302664: Fix several incorrect usages of Preconditions.checkFromIndexSize In-Reply-To: References: Message-ID: On Thu, 16 Feb 2023 14:42:52 GMT, Jaikiran Pai wrote: > Can I please get a review of this change which fixes the usage of `Preconditions.checkFromIndexSize`? This addresses https://bugs.openjdk.org/browse/JDK-8302664. > > There was an oversight when these changes were introduced in https://github.com/openjdk/jdk/pull/4507. I have now gone through that patch again to make sure the relevant places where this fix is needed has been addressed in this current PR. > > I have also looked into other changes in that PR, just to be sure that there aren't any similar fixes needed for other method calls that were introduced in it - they all look fine. > > tier1,tier2 and tier3 testing with this change passed successfully. I thought (and experimented a bit) to add new tests for Deflater/Inflater to catch these byte array indexing issues, but it wasn't straight forward and I would have to write the entire inflate/deflate test just to verify this issue. So I decided to leave out new tests for now in this PR. Thank you all for the reviews. ------------- PR: https://git.openjdk.org/jdk/pull/12595 From jpai at openjdk.org Sat Feb 18 00:51:35 2023 From: jpai at openjdk.org (Jaikiran Pai) Date: Sat, 18 Feb 2023 00:51:35 GMT Subject: Integrated: 8302664: Fix several incorrect usages of Preconditions.checkFromIndexSize In-Reply-To: References: Message-ID: On Thu, 16 Feb 2023 14:42:52 GMT, Jaikiran Pai wrote: > Can I please get a review of this change which fixes the usage of `Preconditions.checkFromIndexSize`? This addresses https://bugs.openjdk.org/browse/JDK-8302664. > > There was an oversight when these changes were introduced in https://github.com/openjdk/jdk/pull/4507. I have now gone through that patch again to make sure the relevant places where this fix is needed has been addressed in this current PR. > > I have also looked into other changes in that PR, just to be sure that there aren't any similar fixes needed for other method calls that were introduced in it - they all look fine. > > tier1,tier2 and tier3 testing with this change passed successfully. I thought (and experimented a bit) to add new tests for Deflater/Inflater to catch these byte array indexing issues, but it wasn't straight forward and I would have to write the entire inflate/deflate test just to verify this issue. So I decided to leave out new tests for now in this PR. This pull request has now been integrated. Changeset: 43cf8b3d Author: Jaikiran Pai URL: https://git.openjdk.org/jdk/commit/43cf8b3d8067bc7128c98f86d5f8b6fa8bbed80e Stats: 19 lines in 8 files changed: 0 ins; 0 del; 19 mod 8302664: Fix several incorrect usages of Preconditions.checkFromIndexSize Reviewed-by: djelinski, dfuchs, alanb ------------- PR: https://git.openjdk.org/jdk/pull/12595 From tvaleev at openjdk.org Sat Feb 18 10:58:10 2023 From: tvaleev at openjdk.org (Tagir F. Valeev) Date: Sat, 18 Feb 2023 10:58:10 GMT Subject: RFR: 8302815 Use new Math.clamp method in core libraries Message-ID: For cleanup and dogfooding the new method, it would be nice to use Math.clamp where possible in java.base. See PR #12428. As Math.clamp performs an additional check that min is not greater than max, I conservatively replaced only those occurrences where I can see that this invariant is always held. There are more occurrences, where clamp can be potentially used but it's unclear whether min <= max is always true. ------------- Commit messages: - 8302815 Use new Math.clamp method in core libraries Changes: https://git.openjdk.org/jdk/pull/12633/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=12633&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8302815 Stats: 41 lines in 12 files changed: 0 ins; 8 del; 33 mod Patch: https://git.openjdk.org/jdk/pull/12633.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12633/head:pull/12633 PR: https://git.openjdk.org/jdk/pull/12633 From tvaleev at openjdk.org Sat Feb 18 21:40:08 2023 From: tvaleev at openjdk.org (Tagir F. Valeev) Date: Sat, 18 Feb 2023 21:40:08 GMT Subject: RFR: 8302815 Use new Math.clamp method in core libraries [v2] In-Reply-To: References: Message-ID: > For cleanup and dogfooding the new method, it would be nice to use Math.clamp where possible in java.base. See PR #12428. > > As Math.clamp performs an additional check that min is not greater than max, I conservatively replaced only those occurrences where I can see that this invariant is always held. There are more occurrences, where clamp can be potentially used but it's unclear whether min <= max is always true. Tagir F. Valeev has updated the pull request incrementally with one additional commit since the last revision: Revert changes in JrtPath, as it seems to be compiled with bootstrap JDK ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12633/files - new: https://git.openjdk.org/jdk/pull/12633/files/3f3618ae..be13683b Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12633&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12633&range=00-01 Stats: 4 lines in 1 file changed: 0 ins; 0 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/12633.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12633/head:pull/12633 PR: https://git.openjdk.org/jdk/pull/12633 From duke at openjdk.org Sat Feb 18 22:25:09 2023 From: duke at openjdk.org (SWinxy) Date: Sat, 18 Feb 2023 22:25:09 GMT Subject: RFR: 8302813: awt.image.incrementaldraw can use Boolean.parseBoolean() to parse the system property Message-ID: <0mHk6u6LUnXAxqrDC8U4-qGYimD4_yH6XEwNCF8eVQ8=.5581ffa5-5c56-408f-9bce-789f8f26f2e4@github.com> Please review this change which moves the parsing of the `awt.image.incrementaldraw` property from the static initializer block into the field itself by invoking `Boolean.parseBoolean()` on the system property getter. Hopefully in the near future we can do away with `AccessController`s and simply go with `System.getProperty` :) ------------- Commit messages: - Make isInc private - awt.image.incrementaldraw can use Boolean.parseBoolean() to parse the system property Changes: https://git.openjdk.org/jdk/pull/12639/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=12639&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8302813 Stats: 7 lines in 1 file changed: 1 ins; 5 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/12639.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12639/head:pull/12639 PR: https://git.openjdk.org/jdk/pull/12639 From alanb at openjdk.org Sun Feb 19 08:57:22 2023 From: alanb at openjdk.org (Alan Bateman) Date: Sun, 19 Feb 2023 08:57:22 GMT Subject: RFR: 8302815 Use new Math.clamp method in core libraries [v2] In-Reply-To: References:

Message-ID: On Sat, 18 Feb 2023 21:40:08 GMT, Tagir F. Valeev wrote: > Revert changes in JrtPath, as it seems to be compiled with bootstrap JDK Yes, the jrt file system provider is compiled --release 8 to create lib/jrt-fs.jar. That's the plumbing needed to allow IDEs/tools running on JDK 8 access the contents of a target run-time image as a file system. ------------- PR: https://git.openjdk.org/jdk/pull/12633 From duke at openjdk.org Sun Feb 19 23:59:13 2023 From: duke at openjdk.org (Archie L. Cobbs) Date: Sun, 19 Feb 2023 23:59:13 GMT Subject: RFR: 8026369: javac potentially ambiguous overload warning needs an improved scheme Message-ID: This bug relates to the "potentially ambiguous overload" warning which is enabled by `-Xlint:overloads`. The warning detects certain ambiguities that can cause problems for lambdas. For example, consider the interface `Spliterator.OfInt`, which declares these two methods: void forEachRemaining(Consumer action); void forEachRemaining(IntConsumer action); Both methods have the same name, same number of parameters, and take a lambda with the same "shape" in the same argument position. This causes an ambiguity in any code that wants to do this: spliterator.forEachRemaining(x -> { ... }); That code won't compile; instead, you'll get this error: Ambiguity.java:4: error: reference to forEachRemaining is ambiguous spliterator.forEachRemaining(x -> { }); ^ both method forEachRemaining(IntConsumer) in OfInt and method forEachRemaining(Consumer) in OfInt match The problem reported by the bug is that the warning fails to detect ambiguities which are created purely by inheritance, for example: interface ConsumerOfInteger { void foo(Consumer c); } interface IntegerConsumer { void foo(IntConsumer c); } // We should get a warning here... interface Test extends ConsumerOfInteger, IntegerConsumer { } The cause of the bug is that ambiguities are detected on a per-method basis, by checking whether a method is part of an ambiguity pair when we visit that method. So if the methods in an ambiguity pair are inherited from two distinct supertypes, we'll miss the ambiguity. To fix the problem, we need to look for ambiguities on a per-class level, checking all pairs of methods. However, it's not that simple - we only want to "blame" a class when that class itself, and not some supertype, is responsible for creating the ambiguity. For example, any interface extending `Spliterator.OfInt` will automatically inherit the two ambiguities mentioned above, but these are not the interface's fault so to speak so no warning should be generated. Making things more complicated is the fact that methods can be overridden and declared in generic classes so they only conflict in some subtypes, etc. So we generate the warning when there are two methods m1 and m2 in a class C such that: * m1 and m2 consitiute a "potentially ambiguous overload" (using the same definition as before) * There is no direct supertype T of C such that m1 and m2, or some methods they override, both exist in T and constitute a "potentially ambiguous overload" as members of T * We haven't already generated a warning for either m1 or m2 in class C If either method is declared in C, we locate the warning there, but when both methods are inherited, there's no method declaration to point at so the warning is instead located at the class declaration. I noticed a couple of other minor bugs; these are also being fixed here: (1) For inherited methods, the method signatures were being reported as they are declared, rather than in the context of the class being visited. As a result, when a methods is inherited from a generic supertype, the ambiguity is less clear. Here's an example: interface Upper { void foo(T c); } interface Lower extends Upper { void foo(Consumer c); } Currently, the error is reported as: warning: [overloads] foo(Consumer) in Lower is potentially ambiguous with foo(T) in Upper Reporting the method signatures in the context of the class being visited makes the ambiguity clearer: warning: [overloads] foo(Consumer) in Lower is potentially ambiguous with foo(IntConsumer) in Upper (2) When a method is identified as part of an ambiguous pair, we were setting a `POTENTIALLY_AMBIGUOUS` flag on it. This caused it to be forever excluded from future warnings. For methods that are declared in the class we're visiting, this makes sense, but it doesn't make sense for inherited methods, because it disqualifies them from participating in the analysis of any other class that also inherits them. As a result, for a class like the one below, the compiler was only generating one warning instead of three: public interface SuperIface { void foo(Consumer c); } public interface I1 extends SuperIface { void foo(IntConsumer c); // warning was generated here } public interface I2 extends SuperIface { void foo(IntConsumer c); // no warning was generated here } public interface I3 extends SuperIface { void foo(IntConsumer c); // no warning was generated here } With this patch the `POTENTIALLY_AMBIGUOUS` flag is no longer needed. I wasn't sure whether to renumber all the subsequent flags, or just leave an empty placeholder, so I chose the latter. Finally, this fix uncovers new warnings in `java.base` and `java.desktop`, so these are now suppressed in the patch. ------------- Commit messages: - Fix incomplete detection of potentially ambiguous method declarations. Changes: https://git.openjdk.org/jdk/pull/12645/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=12645&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8026369 Stats: 356 lines in 18 files changed: 280 ins; 36 del; 40 mod Patch: https://git.openjdk.org/jdk/pull/12645.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12645/head:pull/12645 PR: https://git.openjdk.org/jdk/pull/12645 From pminborg at openjdk.org Mon Feb 20 09:59:27 2023 From: pminborg at openjdk.org (Per Minborg) Date: Mon, 20 Feb 2023 09:59:27 GMT Subject: RFR: 8301552: Use AtomicReferenceArray for caching instead of CHM in ZoneOffset [v8] In-Reply-To: References:

Message-ID: <_mGSxILQqWkEQ4oxqilr8A5FZ2eeu_3A34dZpjduXo0=.55145913-1c2f-4700-9713-81b09d387440@github.com> On Sat, 18 Feb 2023 19:04:24 GMT, David Schlosnagle wrote: >> This PR suggests we can speed up `StringLatin1.regionMatchesCI` by applying 'the oldest ASCII trick in the book'. >> >> The new static method `CharacterDataLatin1.equalsIgnoreCase` compares two latin1 bytes for equality ignoring case. `StringLatin1.regionMatchesCI` is updated to use `equalsIgnoreCase` >> >> To verify the correctness of `equalsIgnoreCase`, a new test is added to `EqualsIgnoreCase` with an exhaustive verification that all 256x256 latin1 code point pairs have an `equalsIgnoreCase` consistent with Character.toUpperCase, Character.toLowerCase. >> >> Performance is tested for matching and mismatching cases of code point pairs picked from the ASCII letter, ASCII number and latin1 letter ranges. Results in the first comment below. > > src/java.base/share/classes/java/lang/CharacterDataLatin1.java.template line 181: > >> 179: return ( U <= 'Z' // In range A-Z >> 180: || (U >= 0xC0 && U <= 0XDE && U != 0xD7)) // ..or A-grave-Thorn, excl. multiplication >> 181: && U == (b2 & 0xDF); // b2 has same uppercase > > I'm curious if the order of comparisons could alter performance to a small degree. For example, it might be interesting to compare various permutations like below to short circuit reject unequal uppercased b2 > > Suggestion: > > // uppercase b1 using 'the oldest ASCII trick in the book' > int U = b1 & 0xDF; > return (U == (b2 & 0xDF)) > && ((U >= 'A' && U <= 'Z') // In range A-Z > || (U >= 0xC0 && U <= 0XDE && U != 0xD7)) // ..or A-grave-Thorn, excl. multiplication Yeah, as you noticed this code is tricky and sensitive to the order of operations. I did some quite extensive exploration before ending on the current structure. This particular one seems to improve rejection somewhat at the cost of matches. Since rejection is relatively speaking already very fast, I think we should favour fast matching here. Results: enchmark (codePoints) (size) Mode Cnt Score Error Units RegionMatchesIC.Latin1.regionMatchesIC ascii-match 1024 avgt 15 917.796 ? 20.285 ns/op RegionMatchesIC.Latin1.regionMatchesIC ascii-mismatch 1024 avgt 15 4.367 ? 0.348 ns/op RegionMatchesIC.Latin1.regionMatchesIC number-match 1024 avgt 15 399.656 ? 10.703 ns/op RegionMatchesIC.Latin1.regionMatchesIC number-mismatch 1024 avgt 15 4.361 ? 0.664 ns/op RegionMatchesIC.Latin1.regionMatchesIC lat1-match 1024 avgt 15 1384.443 ? 22.199 ns/op RegionMatchesIC.Latin1.regionMatchesIC lat1-mismatch 1024 avgt 15 4.119 ? 0.451 ns/op ------------- PR: https://git.openjdk.org/jdk/pull/12632 From duke at openjdk.org Mon Feb 20 13:19:20 2023 From: duke at openjdk.org (David Schlosnagle) Date: Mon, 20 Feb 2023 13:19:20 GMT Subject: RFR: 8302871: Speed up StringLatin1.regionMatchesCI In-Reply-To: References: Message-ID: On Sat, 18 Feb 2023 09:21:25 GMT, Eirik Bjorsnos wrote: > This PR suggests we can speed up `StringLatin1.regionMatchesCI` by applying 'the oldest ASCII trick in the book'. > > The new static method `CharacterDataLatin1.equalsIgnoreCase` compares two latin1 bytes for equality ignoring case. `StringLatin1.regionMatchesCI` is updated to use `equalsIgnoreCase` > > To verify the correctness of `equalsIgnoreCase`, a new test is added to `EqualsIgnoreCase` with an exhaustive verification that all 256x256 latin1 code point pairs have an `equalsIgnoreCase` consistent with Character.toUpperCase, Character.toLowerCase. > > Performance is tested for matching and mismatching cases of code point pairs picked from the ASCII letter, ASCII number and latin1 letter ranges. Results in the first comment below. src/java.base/share/classes/java/lang/CharacterDataLatin1.java.template line 181: > 179: return ( U <= 'Z' // In range A-Z > 180: || (U >= 0xC0 && U <= 0XDE && U != 0xD7)) // ..or A-grave-Thorn, excl. multiplication > 181: && U == (b2 & 0xDF); // b2 has same uppercase I'm curious if the order of comparisons could alter performance to a small degree. For example, it might be interesting to compare various permutations like below to short circuit reject unequal uppercased b2 Suggestion: // uppercase b1 using 'the oldest ASCII trick in the book' int U = b1 & 0xDF; return (U == (b2 & 0xDF)) && ((U >= 'A' && U <= 'Z') // In range A-Z || (U >= 0xC0 && U <= 0XDE && U != 0xD7)) // ..or A-grave-Thorn, excl. multiplication ------------- PR: https://git.openjdk.org/jdk/pull/12632 From duke at openjdk.org Mon Feb 20 13:19:21 2023 From: duke at openjdk.org (David Schlosnagle) Date: Mon, 20 Feb 2023 13:19:21 GMT Subject: RFR: 8302871: Speed up StringLatin1.regionMatchesCI In-Reply-To: <_mGSxILQqWkEQ4oxqilr8A5FZ2eeu_3A34dZpjduXo0=.55145913-1c2f-4700-9713-81b09d387440@github.com> References:

<_mGSxILQqWkEQ4oxqilr8A5FZ2eeu_3A34dZpjduXo0=.55145913-1c2f-4700-9713-81b09d387440@github.com> Message-ID: <3ybXhT9NAtZLE4znb2PntN0OlC_bB78F7hC3bRai3a8=.667ebc8b-556c-4725-8251-0606f9f32952@github.com> On Sat, 18 Feb 2023 19:45:34 GMT, Eirik Bjorsnos wrote: >> src/java.base/share/classes/java/lang/CharacterDataLatin1.java.template line 181: >> >>> 179: return ( U <= 'Z' // In range A-Z >>> 180: || (U >= 0xC0 && U <= 0XDE && U != 0xD7)) // ..or A-grave-Thorn, excl. multiplication >>> 181: && U == (b2 & 0xDF); // b2 has same uppercase >> >> I'm curious if the order of comparisons could alter performance to a small degree. For example, it might be interesting to compare various permutations like below to short circuit reject unequal uppercased b2 >> >> Suggestion: >> >> // uppercase b1 using 'the oldest ASCII trick in the book' >> int U = b1 & 0xDF; >> return (U == (b2 & 0xDF)) >> && ((U >= 'A' && U <= 'Z') // In range A-Z >> || (U >= 0xC0 && U <= 0XDE && U != 0xD7)) // ..or A-grave-Thorn, excl. multiplication > > Yeah, as you noticed this code is tricky and sensitive to the order of operations. I did some quite extensive exploration before ending on the current structure. This particular one seems to improve rejection somewhat at the cost of matches. > > Since rejection is relatively speaking already very fast, I think we should favour fast matching here. > > Results: > > > enchmark (codePoints) (size) Mode Cnt Score Error Units > RegionMatchesIC.Latin1.regionMatchesIC ascii-match 1024 avgt 15 917.796 ? 20.285 ns/op > RegionMatchesIC.Latin1.regionMatchesIC ascii-mismatch 1024 avgt 15 4.367 ? 0.348 ns/op > RegionMatchesIC.Latin1.regionMatchesIC number-match 1024 avgt 15 399.656 ? 10.703 ns/op > RegionMatchesIC.Latin1.regionMatchesIC number-mismatch 1024 avgt 15 4.361 ? 0.664 ns/op > RegionMatchesIC.Latin1.regionMatchesIC lat1-match 1024 avgt 15 1384.443 ? 22.199 ns/op > RegionMatchesIC.Latin1.regionMatchesIC lat1-mismatch 1024 avgt 15 4.119 ? 0.451 ns/op Thanks for confirming ------------- PR: https://git.openjdk.org/jdk/pull/12632 From duke at openjdk.org Mon Feb 20 13:39:09 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Mon, 20 Feb 2023 13:39:09 GMT Subject: RFR: 8302871: Speed up StringLatin1.regionMatchesCI [v2] In-Reply-To: References: Message-ID: > This PR suggests we can speed up `StringLatin1.regionMatchesCI` by applying 'the oldest ASCII trick in the book'. > > The new static method `CharacterDataLatin1.equalsIgnoreCase` compares two latin1 bytes for equality ignoring case. `StringLatin1.regionMatchesCI` is updated to use `equalsIgnoreCase` > > To verify the correctness of `equalsIgnoreCase`, a new test is added to `EqualsIgnoreCase` with an exhaustive verification that all 256x256 latin1 code point pairs have an `equalsIgnoreCase` consistent with Character.toUpperCase, Character.toLowerCase. > > Performance is tested for matching and mismatching cases of code point pairs picked from the ASCII letter, ASCII number and latin1 letter ranges. Results in the first comment below. Eirik Bjorsnos has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains five additional commits since the last revision: - Align whitespace to make example strings easier to read - Merge branch 'master' into regionmatches-latin1-speedup - Exhaustive verification needs to cover the case b1 == b2 - Move multiplication exclusion to the lat1 range branch - Speed up StringLatin1.regionMatchesCI by applying the 'oldest ASCII trick in the book' ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12632/files - new: https://git.openjdk.org/jdk/pull/12632/files/a583bcd6..59c42298 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12632&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12632&range=00-01 Stats: 57495 lines in 1316 files changed: 24512 ins; 15682 del; 17301 mod Patch: https://git.openjdk.org/jdk/pull/12632.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12632/head:pull/12632 PR: https://git.openjdk.org/jdk/pull/12632 From duke at openjdk.org Mon Feb 20 13:52:18 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Mon, 20 Feb 2023 13:52:18 GMT Subject: RFR: 8302871: Speed up StringLatin1.regionMatchesCI [v3] In-Reply-To: References: Message-ID: > This PR suggests we can speed up `StringLatin1.regionMatchesCI` by applying 'the oldest ASCII trick in the book'. > > The new static method `CharacterDataLatin1.equalsIgnoreCase` compares two latin1 bytes for equality ignoring case. `StringLatin1.regionMatchesCI` is updated to use `equalsIgnoreCase` > > To verify the correctness of `equalsIgnoreCase`, a new test is added to `EqualsIgnoreCase` with an exhaustive verification that all 256x256 latin1 code point pairs have an `equalsIgnoreCase` consistent with Character.toUpperCase, Character.toLowerCase. > > Performance is tested for matching and mismatching cases of code point pairs picked from the ASCII letter, ASCII number and latin1 letter ranges. Results in the first comment below. Eirik Bjorsnos has updated the pull request incrementally with one additional commit since the last revision: Add clarifying comments and use more descriptive variable names in the latin1 verification EqualsIgnoreCase test ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12632/files - new: https://git.openjdk.org/jdk/pull/12632/files/59c42298..84517102 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12632&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12632&range=01-02 Stats: 16 lines in 1 file changed: 5 ins; 2 del; 9 mod Patch: https://git.openjdk.org/jdk/pull/12632.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12632/head:pull/12632 PR: https://git.openjdk.org/jdk/pull/12632 From duke at openjdk.org Mon Feb 20 14:45:09 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Mon, 20 Feb 2023 14:45:09 GMT Subject: RFR: 8302871: Speed up StringLatin1.regionMatchesCI [v4] In-Reply-To: References: Message-ID: > This PR suggests we can speed up `StringLatin1.regionMatchesCI` by applying 'the oldest ASCII trick in the book'. > > The new static method `CharacterDataLatin1.equalsIgnoreCase` compares two latin1 bytes for equality ignoring case. `StringLatin1.regionMatchesCI` is updated to use `equalsIgnoreCase` > > To verify the correctness of `equalsIgnoreCase`, a new test is added to `EqualsIgnoreCase` with an exhaustive verification that all 256x256 latin1 code point pairs have an `equalsIgnoreCase` consistent with Character.toUpperCase, Character.toLowerCase. > > Performance is tested for matching and mismatching cases of code point pairs picked from the ASCII letter, ASCII number and latin1 letter ranges. Results in the first comment below. Eirik Bjorsnos has updated the pull request incrementally with two additional commits since the last revision: - Add @bug tag to EqualsIgnoreCase test for correct issue JDK-8302871 - Add @bug tag to EqualsIgnoreCase test for JDK-8302877 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12632/files - new: https://git.openjdk.org/jdk/pull/12632/files/84517102..03d3e2cb Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12632&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12632&range=02-03 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/12632.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12632/head:pull/12632 PR: https://git.openjdk.org/jdk/pull/12632 From duke at openjdk.org Mon Feb 20 14:47:33 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Mon, 20 Feb 2023 14:47:33 GMT Subject: RFR: 8302877: Speed up latin1 case conversions Message-ID: This PR suggests we speed up Character.toUpperCase and Character.toLowerCase for latin1 code points by applying the 'oldest ASCII trick in the book'. This takes advantage of the fact that latin1 uppercase code points are always 0x20 lower than their lowercase (with the exception of two code points which uppercase out of latin1). To verify the correctness of the new implementation, the test `Latin1CaseConversion` is added with an exhaustive verification of toUpperCase/toLowerCase for all latin1 code points. The implementation needs to balance the performance of the various ranges in latin1. An effort has been made to favour operations on ASCII code points, without causing excessive regression for higher code points. Performance is benchmarked for 7 chosen sample code points, each representing a range or a special-case. Results in the first comment. ------------- Commit messages: - Add @bug tag to test - Improved whitespace alignment for Param and switch values - Correct spelling for "exhaustive" - Prefer the term "case conversion" over ""case folding". Refer to 0xB5 as 'Micro Sign' ("Mu" is the Unicode code point it uppercases to) - Improve comments for the two special-cased uppercase code points 'Micro Sign' and 'y with Diaeresis' - Adjust whitespace - Speed up Character.toUpperCase and Character.toLowerCase by applying the 'oldest ASCII trick in the book' Changes: https://git.openjdk.org/jdk/pull/12623/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=12623&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8302877 Stats: 164 lines in 3 files changed: 143 ins; 0 del; 21 mod Patch: https://git.openjdk.org/jdk/pull/12623.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12623/head:pull/12623 PR: https://git.openjdk.org/jdk/pull/12623 From naoto at openjdk.org Mon Feb 20 14:47:36 2023 From: naoto at openjdk.org (Naoto Sato) Date: Mon, 20 Feb 2023 14:47:36 GMT Subject: RFR: 8302877: Speed up latin1 case conversions In-Reply-To: References: Message-ID: On Fri, 17 Feb 2023 17:31:09 GMT, Eirik Bjorsnos wrote: > This PR suggests we speed up Character.toUpperCase and Character.toLowerCase for latin1 code points by applying the 'oldest ASCII trick in the book'. > > This takes advantage of the fact that latin1 uppercase code points are always 0x20 lower than their lowercase (with the exception of two code points which uppercase out of latin1). > > To verify the correctness of the new implementation, the test `Latin1CaseConversion` is added with an exhaustive verification of toUpperCase/toLowerCase for all latin1 code points. > > The implementation needs to balance the performance of the various ranges in latin1. An effort has been made to favour operations on ASCII code points, without causing excessive regression for higher code points. > > Performance is benchmarked for 7 chosen sample code points, each representing a range or a special-case. Results in the first comment. Looks good to me. I'd rather not use "case folding", as to me it implies "normalizing" but this is simply lowercasing/uppercasing. test/jdk/java/lang/Character/Latin1CaseFolding.java line 31: > 29: /** > 30: * @test > 31: * @summary Provides exchaustive verification of Character.toUpperCase and Character.toLowerCase typo: "exhaustive"? ------------- PR: https://git.openjdk.org/jdk/pull/12623 From duke at openjdk.org Mon Feb 20 14:47:39 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Mon, 20 Feb 2023 14:47:39 GMT Subject: RFR: 8302877: Speed up latin1 case conversions In-Reply-To: References:

Message-ID: On Sat, 18 Feb 2023 00:57:14 GMT, Naoto Sato wrote: > I'd rather not use "case folding", as to me it implies "normalizing" but this is simply lowercasing/uppercasing. I guess I was looking for a generic term for uppercase/lowercase. I picked "case conversion" instead. > test/jdk/java/lang/Character/Latin1CaseFolding.java line 31: > >> 29: /** >> 30: * @test >> 31: * @summary Provides exchaustive verification of Character.toUpperCase and Character.toLowerCase > > typo: "exhaustive"? I did an 'exchaustive' search for 'exchaustive' across the code base and found two comments in `LocaleData` and `LocaleData.cldr` in `jdk/test/jdk/sun/text/resources`. Would you like me to update these as well while we're here, or should we avoid getting out scope for this PR? ------------- PR: https://git.openjdk.org/jdk/pull/12623 From duke at openjdk.org Mon Feb 20 14:47:37 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Mon, 20 Feb 2023 14:47:37 GMT Subject: RFR: 8302877: Speed up latin1 case conversions In-Reply-To: References: Message-ID: On Fri, 17 Feb 2023 17:31:09 GMT, Eirik Bjorsnos wrote: > This PR suggests we speed up Character.toUpperCase and Character.toLowerCase for latin1 code points by applying the 'oldest ASCII trick in the book'. > > This takes advantage of the fact that latin1 uppercase code points are always 0x20 lower than their lowercase (with the exception of two code points which uppercase out of latin1). > > To verify the correctness of the new implementation, the test `Latin1CaseConversion` is added with an exhaustive verification of toUpperCase/toLowerCase for all latin1 code points. > > The implementation needs to balance the performance of the various ranges in latin1. An effort has been made to favour operations on ASCII code points, without causing excessive regression for higher code points. > > Performance is benchmarked for 7 chosen sample code points, each representing a range or a special-case. Results in the first comment. Benchmark results: Baseline: Benchmark (codePoint) Mode Cnt Score Error Units Characters.Latin1CaseConversion.toLowerCase low avgt 15 1.267 ? 0.013 ns/op Characters.Latin1CaseConversion.toLowerCase A avgt 15 1.657 ? 0.011 ns/op Characters.Latin1CaseConversion.toLowerCase a avgt 15 1.258 ? 0.005 ns/op Characters.Latin1CaseConversion.toLowerCase A-grave avgt 15 1.656 ? 0.011 ns/op Characters.Latin1CaseConversion.toLowerCase a-grave avgt 15 1.270 ? 0.023 ns/op Characters.Latin1CaseConversion.toLowerCase mu avgt 15 1.261 ? 0.006 ns/op Characters.Latin1CaseConversion.toLowerCase yD avgt 15 1.260 ? 0.005 ns/op Characters.Latin1CaseConversion.toUpperCase low avgt 15 1.284 ? 0.043 ns/op Characters.Latin1CaseConversion.toUpperCase A avgt 15 1.264 ? 0.008 ns/op Characters.Latin1CaseConversion.toUpperCase a avgt 15 1.818 ? 0.016 ns/op Characters.Latin1CaseConversion.toUpperCase A-grave avgt 15 1.261 ? 0.015 ns/op Characters.Latin1CaseConversion.toUpperCase a-grave avgt 15 1.822 ? 0.013 ns/op Characters.Latin1CaseConversion.toUpperCase mu avgt 15 1.823 ? 0.006 ns/op Characters.Latin1CaseConversion.toUpperCase yD avgt 15 1.822 ? 0.008 ns/op PR: Benchmark (codePoint) Mode Cnt Score Error Units Characters.Latin1CaseConversion.toLowerCase low avgt 15 0.878 ? 0.005 ns/op Characters.Latin1CaseConversion.toLowerCase A avgt 15 1.038 ? 0.009 ns/op Characters.Latin1CaseConversion.toLowerCase a avgt 15 1.036 ? 0.007 ns/op Characters.Latin1CaseConversion.toLowerCase A-grave avgt 15 1.357 ? 0.015 ns/op Characters.Latin1CaseConversion.toLowerCase a-grave avgt 15 1.352 ? 0.003 ns/op Characters.Latin1CaseConversion.toLowerCase mu avgt 15 1.273 ? 0.002 ns/op Characters.Latin1CaseConversion.toLowerCase yD avgt 15 1.352 ? 0.004 ns/op Characters.Latin1CaseConversion.toUpperCase low avgt 15 0.880 ? 0.013 ns/op Characters.Latin1CaseConversion.toUpperCase A avgt 15 0.920 ? 0.071 ns/op Characters.Latin1CaseConversion.toUpperCase a avgt 15 1.055 ? 0.013 ns/op Characters.Latin1CaseConversion.toUpperCase A-grave avgt 15 1.394 ? 0.010 ns/op Characters.Latin1CaseConversion.toUpperCase a-grave avgt 15 1.391 ? 0.009 ns/op Characters.Latin1CaseConversion.toUpperCase mu avgt 15 1.597 ? 0.021 ns/op Characters.Latin1CaseConversion.toUpperCase yD avgt 15 1.354 ? 0.003 ns/op ------------- PR: https://git.openjdk.org/jdk/pull/12623 From redestad at openjdk.org Mon Feb 20 15:43:28 2023 From: redestad at openjdk.org (Claes Redestad) Date: Mon, 20 Feb 2023 15:43:28 GMT Subject: RFR: 8302871: Speed up StringLatin1.regionMatchesCI [v4] In-Reply-To: References:

Message-ID: <03PmkNiBgSaqEQHWdzOBIEFh3itEUkhd3PU7ZqNt8UA=.4531baf5-d8cc-4cc5-9767-af0f1458d87f@github.com> On Mon, 20 Feb 2023 14:45:09 GMT, Eirik Bjorsnos wrote: >> This PR suggests we can speed up `StringLatin1.regionMatchesCI` by applying 'the oldest ASCII trick in the book'. >> >> The new static method `CharacterDataLatin1.equalsIgnoreCase` compares two latin1 bytes for equality ignoring case. `StringLatin1.regionMatchesCI` is updated to use `equalsIgnoreCase` >> >> To verify the correctness of `equalsIgnoreCase`, a new test is added to `EqualsIgnoreCase` with an exhaustive verification that all 256x256 latin1 code point pairs have an `equalsIgnoreCase` consistent with Character.toUpperCase, Character.toLowerCase. >> >> Performance is tested for matching and mismatching cases of code point pairs picked from the ASCII letter, ASCII number and latin1 letter ranges. Results in the first comment below. > > Eirik Bjorsnos has updated the pull request incrementally with two additional commits since the last revision: > > - Add @bug tag to EqualsIgnoreCase test for correct issue JDK-8302871 > - Add @bug tag to EqualsIgnoreCase test for JDK-8302877 src/java.base/share/classes/java/lang/CharacterDataLatin1.java.template line 170: > 168: * @return true if the two bytes are considered equals ignoring case in latin1 > 169: */ > 170: static boolean equalsIgnoreCase(byte b1, byte b2) { Perhaps put this in `CharacterDataLatin1`, keeping it close to toLowerCase/toUpperCase that you're changing to use similar logic with #12623 If you apply #12623 first - how much difference does this make on the micro you're adding with this PR? ------------- PR: https://git.openjdk.org/jdk/pull/12632 From duke at openjdk.org Mon Feb 20 16:19:27 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Mon, 20 Feb 2023 16:19:27 GMT Subject: RFR: 8302871: Speed up StringLatin1.regionMatchesCI [v4] In-Reply-To: <03PmkNiBgSaqEQHWdzOBIEFh3itEUkhd3PU7ZqNt8UA=.4531baf5-d8cc-4cc5-9767-af0f1458d87f@github.com> References:

<03PmkNiBgSaqEQHWdzOBIEFh3itEUkhd3PU7ZqNt8UA=.4531baf5-d8cc-4cc5-9767-af0f1458d87f@github.com> Message-ID: On Mon, 20 Feb 2023 15:40:09 GMT, Claes Redestad wrote: >> Eirik Bjorsnos has updated the pull request incrementally with two additional commits since the last revision: >> >> - Add @bug tag to EqualsIgnoreCase test for correct issue JDK-8302871 >> - Add @bug tag to EqualsIgnoreCase test for JDK-8302877 > > src/java.base/share/classes/java/lang/CharacterDataLatin1.java.template line 170: > >> 168: * @return true if the two bytes are considered equals ignoring case in latin1 >> 169: */ >> 170: static boolean equalsIgnoreCase(byte b1, byte b2) { > > Perhaps put this in `CharacterDataLatin1`, keeping it close to toLowerCase/toUpperCase that you're changing to use similar logic with #12623 > > If you apply #12623 first - how much difference does this make on the micro you're adding with this PR? Is it not already in CharacterDataLatin1? Here is a comparison of relying on improvements in `CharacterDataLatin1.toUpperCase/toLowerCase` only vs. using `CharacterDataLatin1.equalsIgnoreCase`: Character.toUpperCase/toLowerCase only: Benchmark (codePoints) (size) Mode Cnt Score Error Units RegionMatchesIC.Latin1.regionMatchesIC ascii-match 1024 avgt 15 1310.582 ? 84.777 ns/op RegionMatchesIC.Latin1.regionMatchesIC ascii-mismatch 1024 avgt 15 4.547 ? 0.545 ns/op RegionMatchesIC.Latin1.regionMatchesIC number-match 1024 avgt 15 686.947 ? 11.850 ns/op RegionMatchesIC.Latin1.regionMatchesIC number-mismatch 1024 avgt 15 3.836 ? 0.634 ns/op RegionMatchesIC.Latin1.regionMatchesIC lat1-match 1024 avgt 15 2107.219 ? 17.662 ns/op RegionMatchesIC.Latin1.regionMatchesIC lat1-mismatch 1024 avgt 15 4.924 ? 0.829 ns/op CharacterDataLatin1.equalsIgnoreCase: Benchmark (codePoints) (size) Mode Cnt Score Error Units RegionMatchesIC.Latin1.regionMatchesIC ascii-match 1024 avgt 15 742.467 ? 34.490 ns/op RegionMatchesIC.Latin1.regionMatchesIC ascii-mismatch 1024 avgt 15 3.960 ? 0.046 ns/op RegionMatchesIC.Latin1.regionMatchesIC number-match 1024 avgt 15 361.158 ? 37.096 ns/op RegionMatchesIC.Latin1.regionMatchesIC number-mismatch 1024 avgt 15 4.039 ? 0.521 ns/op RegionMatchesIC.Latin1.regionMatchesIC lat1-match 1024 avgt 15 1158.091 ? 41.617 ns/op RegionMatchesIC.Latin1.regionMatchesIC lat1-mismatch 1024 avgt 15 4.358 ? 0.123 ns/op ------------- PR: https://git.openjdk.org/jdk/pull/12632 From redestad at openjdk.org Mon Feb 20 16:26:26 2023 From: redestad at openjdk.org (Claes Redestad) Date: Mon, 20 Feb 2023 16:26:26 GMT Subject: RFR: 8302871: Speed up StringLatin1.regionMatchesCI [v4] In-Reply-To: References:

<03PmkNiBgSaqEQHWdzOBIEFh3itEUkhd3PU7ZqNt8UA=.4531baf5-d8cc-4cc5-9767-af0f1458d87f@github.com> Message-ID: On Mon, 20 Feb 2023 16:16:45 GMT, Eirik Bjorsnos wrote: >> src/java.base/share/classes/java/lang/CharacterDataLatin1.java.template line 170: >> >>> 168: * @return true if the two bytes are considered equals ignoring case in latin1 >>> 169: */ >>> 170: static boolean equalsIgnoreCase(byte b1, byte b2) { >> >> Perhaps put this in `CharacterDataLatin1`, keeping it close to toLowerCase/toUpperCase that you're changing to use similar logic with #12623 >> >> If you apply #12623 first - how much difference does this make on the micro you're adding with this PR? > > Is it not already in CharacterDataLatin1? > > Here is a comparison of relying on improvements in `CharacterDataLatin1.toUpperCase/toLowerCase` only vs. using `CharacterDataLatin1.equalsIgnoreCase`: > > Character.toUpperCase/toLowerCase only: > > > Benchmark (codePoints) (size) Mode Cnt Score Error Units > RegionMatchesIC.Latin1.regionMatchesIC ascii-match 1024 avgt 15 1310.582 ? 84.777 ns/op > RegionMatchesIC.Latin1.regionMatchesIC ascii-mismatch 1024 avgt 15 4.547 ? 0.545 ns/op > RegionMatchesIC.Latin1.regionMatchesIC number-match 1024 avgt 15 686.947 ? 11.850 ns/op > RegionMatchesIC.Latin1.regionMatchesIC number-mismatch 1024 avgt 15 3.836 ? 0.634 ns/op > RegionMatchesIC.Latin1.regionMatchesIC lat1-match 1024 avgt 15 2107.219 ? 17.662 ns/op > RegionMatchesIC.Latin1.regionMatchesIC lat1-mismatch 1024 avgt 15 4.924 ? 0.829 ns/op > > > CharacterDataLatin1.equalsIgnoreCase: > > > Benchmark (codePoints) (size) Mode Cnt Score Error Units > RegionMatchesIC.Latin1.regionMatchesIC ascii-match 1024 avgt 15 742.467 ? 34.490 ns/op > RegionMatchesIC.Latin1.regionMatchesIC ascii-mismatch 1024 avgt 15 3.960 ? 0.046 ns/op > RegionMatchesIC.Latin1.regionMatchesIC number-match 1024 avgt 15 361.158 ? 37.096 ns/op > RegionMatchesIC.Latin1.regionMatchesIC number-mismatch 1024 avgt 15 4.039 ? 0.521 ns/op > RegionMatchesIC.Latin1.regionMatchesIC lat1-match 1024 avgt 15 1158.091 ? 41.617 ns/op > RegionMatchesIC.Latin1.regionMatchesIC lat1-mismatch 1024 avgt 15 4.358 ? 0.123 ns/op Oops, I lost context and thought this was in `StringLatin1`. Thanks for running the numbers with #12623. Looks like you're getting big enough of an improvement on top. ------------- PR: https://git.openjdk.org/jdk/pull/12632 From duke at openjdk.org Mon Feb 20 16:30:29 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Mon, 20 Feb 2023 16:30:29 GMT Subject: RFR: 8302871: Speed up StringLatin1.regionMatchesCI [v4] In-Reply-To: References:

<03PmkNiBgSaqEQHWdzOBIEFh3itEUkhd3PU7ZqNt8UA=.4531baf5-d8cc-4cc5-9767-af0f1458d87f@github.com>

Message-ID: On Mon, 20 Feb 2023 16:23:32 GMT, Claes Redestad wrote: >> Is it not already in CharacterDataLatin1? >> >> Here is a comparison of relying on improvements in `CharacterDataLatin1.toUpperCase/toLowerCase` only vs. using `CharacterDataLatin1.equalsIgnoreCase`: >> >> Character.toUpperCase/toLowerCase only: >> >> >> Benchmark (codePoints) (size) Mode Cnt Score Error Units >> RegionMatchesIC.Latin1.regionMatchesIC ascii-match 1024 avgt 15 1310.582 ? 84.777 ns/op >> RegionMatchesIC.Latin1.regionMatchesIC ascii-mismatch 1024 avgt 15 4.547 ? 0.545 ns/op >> RegionMatchesIC.Latin1.regionMatchesIC number-match 1024 avgt 15 686.947 ? 11.850 ns/op >> RegionMatchesIC.Latin1.regionMatchesIC number-mismatch 1024 avgt 15 3.836 ? 0.634 ns/op >> RegionMatchesIC.Latin1.regionMatchesIC lat1-match 1024 avgt 15 2107.219 ? 17.662 ns/op >> RegionMatchesIC.Latin1.regionMatchesIC lat1-mismatch 1024 avgt 15 4.924 ? 0.829 ns/op >> >> >> CharacterDataLatin1.equalsIgnoreCase: >> >> >> Benchmark (codePoints) (size) Mode Cnt Score Error Units >> RegionMatchesIC.Latin1.regionMatchesIC ascii-match 1024 avgt 15 742.467 ? 34.490 ns/op >> RegionMatchesIC.Latin1.regionMatchesIC ascii-mismatch 1024 avgt 15 3.960 ? 0.046 ns/op >> RegionMatchesIC.Latin1.regionMatchesIC number-match 1024 avgt 15 361.158 ? 37.096 ns/op >> RegionMatchesIC.Latin1.regionMatchesIC number-mismatch 1024 avgt 15 4.039 ? 0.521 ns/op >> RegionMatchesIC.Latin1.regionMatchesIC lat1-match 1024 avgt 15 1158.091 ? 41.617 ns/op >> RegionMatchesIC.Latin1.regionMatchesIC lat1-mismatch 1024 avgt 15 4.358 ? 0.123 ns/op > > Oops, I lost context and thought this was in `StringLatin1`. > > Thanks for running the numbers with #12623. Looks like you're getting big enough of an improvement on top. Yes, seems `equalsIgnoreCase` carries its weight. ------------- PR: https://git.openjdk.org/jdk/pull/12632 From alanb at openjdk.org Mon Feb 20 17:40:26 2023 From: alanb at openjdk.org (Alan Bateman) Date: Mon, 20 Feb 2023 17:40:26 GMT Subject: RFR: 8302815 Use new Math.clamp method in core libraries [v2] In-Reply-To: References:

Message-ID: On Sat, 18 Feb 2023 21:40:08 GMT, Tagir F. Valeev wrote: >> For cleanup and dogfooding the new method, it would be nice to use Math.clamp where possible in java.base. See PR #12428. >> >> As Math.clamp performs an additional check that min is not greater than max, I conservatively replaced only those occurrences where I can see that this invariant is always held. There are more occurrences, where clamp can be potentially used but it's unclear whether min <= max is always true. > > Tagir F. Valeev has updated the pull request incrementally with one additional commit since the last revision: > > Revert changes in JrtPath, as it seems to be compiled with bootstrap JDK I skimmed through the usages and they look okay. I didn't spot anywhere that it differs to the existing clamping. This is the first update in 2023 for some of these files so I assume you'll bump the copyright year before integrating. ------------- Marked as reviewed by alanb (Reviewer). PR: https://git.openjdk.org/jdk/pull/12633 From duke at openjdk.org Mon Feb 20 23:55:24 2023 From: duke at openjdk.org (SWinxy) Date: Mon, 20 Feb 2023 23:55:24 GMT Subject: RFR: 8026369: javac potentially ambiguous overload warning needs an improved scheme In-Reply-To: References: Message-ID: On Sun, 19 Feb 2023 23:52:52 GMT, Archie L. Cobbs wrote: > This bug relates to the "potentially ambiguous overload" warning which is enabled by `-Xlint:overloads`. > > The warning detects certain ambiguities that can cause problems for lambdas. For example, consider the interface `Spliterator.OfInt`, which declares these two methods: > > void forEachRemaining(Consumer action); > void forEachRemaining(IntConsumer action); > > Both methods have the same name, same number of parameters, and take a lambda with the same "shape" in the same argument position. This causes an ambiguity in any code that wants to do this: > > spliterator.forEachRemaining(x -> { ... }); > > That code won't compile; instead, you'll get this error: > > Ambiguity.java:4: error: reference to forEachRemaining is ambiguous > spliterator.forEachRemaining(x -> { }); > ^ > both method forEachRemaining(IntConsumer) in OfInt and method forEachRemaining(Consumer) in OfInt match > > > The problem reported by the bug is that the warning fails to detect ambiguities which are created purely by inheritance, for example: > > interface ConsumerOfInteger { > void foo(Consumer c); > } > > interface IntegerConsumer { > void foo(IntConsumer c); > } > > // We should get a warning here... > interface Test extends ConsumerOfInteger, IntegerConsumer { > } > > > The cause of the bug is that ambiguities are detected on a per-method basis, by checking whether a method is part of an ambiguity pair when we visit that method. So if the methods in an ambiguity pair are inherited from two distinct supertypes, we'll miss the ambiguity. > > To fix the problem, we need to look for ambiguities on a per-class level, checking all pairs of methods. However, it's not that simple - we only want to "blame" a class when that class itself, and not some supertype, is responsible for creating the ambiguity. For example, any interface extending `Spliterator.OfInt` will automatically inherit the two ambiguities mentioned above, but these are not the interface's fault so to speak so no warning should be generated. Making things more complicated is the fact that methods can be overridden and declared in generic classes so they only conflict in some subtypes, etc. > > So we generate the warning when there are two methods m1 and m2 in a class C such that: > > * m1 and m2 consitiute a "potentially ambiguous overload" (using the same definition as before) > * There is no direct supertype T of C such that m1 and m2, or some methods they override, both exist in T and constitute a "potentially ambiguous overload" as members of T > * We haven't already generated a warning for either m1 or m2 in class C > > If either method is declared in C, we locate the warning there, but when both methods are inherited, there's no method declaration to point at so the warning is instead located at the class declaration. > > I noticed a couple of other minor bugs; these are also being fixed here: > > (1) For inherited methods, the method signatures were being reported as they are declared, rather than in the context of the class being visited. As a result, when a methods is inherited from a generic supertype, the ambiguity is less clear. Here's an example: > > interface Upper { > void foo(T c); > } > > interface Lower extends Upper { > void foo(Consumer c); > } > > Currently, the error is reported as: > > warning: [overloads] foo(Consumer) in Lower is potentially ambiguous with foo(T) in Upper > > Reporting the method signatures in the context of the class being visited makes the ambiguity clearer: > > warning: [overloads] foo(Consumer) in Lower is potentially ambiguous with foo(IntConsumer) in Upper > > > (2) When a method is identified as part of an ambiguous pair, we were setting a `POTENTIALLY_AMBIGUOUS` flag on it. This caused it to be forever excluded from future warnings. For methods that are declared in the class we're visiting, this makes sense, but it doesn't make sense for inherited methods, because it disqualifies them from participating in the analysis of any other class that also inherits them. > > As a result, for a class like the one below, the compiler was only generating one warning instead of three: > > public interface SuperIface { > void foo(Consumer c); > } > > public interface I1 extends SuperIface { > void foo(IntConsumer c); // warning was generated here > } > > public interface I2 extends SuperIface { > void foo(IntConsumer c); // no warning was generated here > } > > public interface I3 extends SuperIface { > void foo(IntConsumer c); // no warning was generated here > } > > > With this patch the `POTENTIALLY_AMBIGUOUS` flag is no longer needed. I wasn't sure whether to renumber all the subsequent flags, or just leave an empty placeholder, so I chose the latter. > > Finally, this fix uncovers new warnings in `java.base` and `java.desktop`, so these are now suppressed in the patch. In the `AWTEventMulticaster` class(es), which interfaces are causing the ambiguity? ------------- PR: https://git.openjdk.org/jdk/pull/12645 From naoto at openjdk.org Tue Feb 21 00:17:25 2023 From: naoto at openjdk.org (Naoto Sato) Date: Tue, 21 Feb 2023 00:17:25 GMT Subject: RFR: 8302877: Speed up latin1 case conversions In-Reply-To: References:

Message-ID: On Sat, 18 Feb 2023 06:43:27 GMT, Eirik Bjorsnos wrote: >> test/jdk/java/lang/Character/Latin1CaseFolding.java line 31: >> >>> 29: /** >>> 30: * @test >>> 31: * @summary Provides exchaustive verification of Character.toUpperCase and Character.toLowerCase >> >> typo: "exhaustive"? > > I did an 'exchaustive' search for 'exchaustive' across the code base and found two comments in `LocaleData` and `LocaleData.cldr` in `jdk/test/jdk/sun/text/resources`. > > Would you like me to update these as well while we're here, or should we avoid getting out scope for this PR? I'd appreciate it. I don't mind fixing it with this PR. ------------- PR: https://git.openjdk.org/jdk/pull/12623 From duke at openjdk.org Tue Feb 21 06:54:48 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Tue, 21 Feb 2023 06:54:48 GMT Subject: RFR: 8302871: Speed up StringLatin1.regionMatchesCI [v5] In-Reply-To: References: Message-ID: > This PR suggests we can speed up `StringLatin1.regionMatchesCI` by applying 'the oldest ASCII trick in the book'. > > The new static method `CharacterDataLatin1.equalsIgnoreCase` compares two latin1 bytes for equality ignoring case. `StringLatin1.regionMatchesCI` is updated to use `equalsIgnoreCase` > > To verify the correctness of `equalsIgnoreCase`, a new test is added to `EqualsIgnoreCase` with an exhaustive verification that all 256x256 latin1 code point pairs have an `equalsIgnoreCase` consistent with Character.toUpperCase, Character.toLowerCase. > > Performance is tested for matching and mismatching cases of code point pairs picked from the ASCII letter, ASCII number and latin1 letter ranges. Results in the first comment below. Eirik Bjorsnos has updated the pull request incrementally with one additional commit since the last revision: Spell fix for 'exhaustive' in comments in sun/text/resources ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12632/files - new: https://git.openjdk.org/jdk/pull/12632/files/03d3e2cb..5e9927a4 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12632&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12632&range=03-04 Stats: 2 lines in 2 files changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/12632.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12632/head:pull/12632 PR: https://git.openjdk.org/jdk/pull/12632 From duke at openjdk.org Tue Feb 21 06:58:52 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Tue, 21 Feb 2023 06:58:52 GMT Subject: RFR: 8302871: Speed up StringLatin1.regionMatchesCI [v6] In-Reply-To: References: Message-ID: <0axKMeojwnFwgufDJPLLALvqougnoM2d5FRMVCxoHtc=.dca5737d-97b4-4f48-84ee-f120e16eb31b@github.com> > This PR suggests we can speed up `StringLatin1.regionMatchesCI` by applying 'the oldest ASCII trick in the book'. > > The new static method `CharacterDataLatin1.equalsIgnoreCase` compares two latin1 bytes for equality ignoring case. `StringLatin1.regionMatchesCI` is updated to use `equalsIgnoreCase` > > To verify the correctness of `equalsIgnoreCase`, a new test is added to `EqualsIgnoreCase` with an exhaustive verification that all 256x256 latin1 code point pairs have an `equalsIgnoreCase` consistent with Character.toUpperCase, Character.toLowerCase. > > Performance is tested for matching and mismatching cases of code point pairs picked from the ASCII letter, ASCII number and latin1 letter ranges. Results in the first comment below. Eirik Bjorsnos has updated the pull request incrementally with one additional commit since the last revision: Revert "Spell fix for 'exhaustive' in comments in sun/text/resources" This reverts commit 5e9927a4b35e157fd3fa72fd2663c8bfbecf32bb. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12632/files - new: https://git.openjdk.org/jdk/pull/12632/files/5e9927a4..b8139961 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12632&range=05 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12632&range=04-05 Stats: 2 lines in 2 files changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/12632.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12632/head:pull/12632 PR: https://git.openjdk.org/jdk/pull/12632 From duke at openjdk.org Tue Feb 21 06:59:47 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Tue, 21 Feb 2023 06:59:47 GMT Subject: RFR: 8302877: Speed up latin1 case conversions [v2] In-Reply-To: References: Message-ID: > This PR suggests we speed up Character.toUpperCase and Character.toLowerCase for latin1 code points by applying the 'oldest ASCII trick in the book'. > > This takes advantage of the fact that latin1 uppercase code points are always 0x20 lower than their lowercase (with the exception of two code points which uppercase out of latin1). > > To verify the correctness of the new implementation, the test `Latin1CaseConversion` is added with an exhaustive verification of toUpperCase/toLowerCase for all latin1 code points. > > The implementation needs to balance the performance of the various ranges in latin1. An effort has been made to favour operations on ASCII code points, without causing excessive regression for higher code points. > > Performance is benchmarked for 7 chosen sample code points, each representing a range or a special-case. Results in the first comment. Eirik Bjorsnos has updated the pull request incrementally with one additional commit since the last revision: Spell fix for 'exhaustive' in comments in sun/text/resources ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12623/files - new: https://git.openjdk.org/jdk/pull/12623/files/57a27d39..70c624d7 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12623&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12623&range=00-01 Stats: 2 lines in 2 files changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/12623.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12623/head:pull/12623 PR: https://git.openjdk.org/jdk/pull/12623 From duke at openjdk.org Tue Feb 21 08:01:28 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Tue, 21 Feb 2023 08:01:28 GMT Subject: RFR: 8302877: Speed up latin1 case conversions [v2] In-Reply-To: References:

Message-ID: On Tue, 21 Feb 2023 00:14:20 GMT, Naoto Sato wrote: >> I did an 'exchaustive' search for 'exchaustive' across the code base and found two comments in `LocaleData` and `LocaleData.cldr` in `jdk/test/jdk/sun/text/resources`. >> >> Would you like me to update these as well while we're here, or should we avoid getting out scope for this PR? > > I'd appreciate it. I don't mind fixing it with this PR. Thanks Naoto, I have fixed the spelling in these two unrelated files. ------------- PR: https://git.openjdk.org/jdk/pull/12623 From duke at openjdk.org Tue Feb 21 09:37:28 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Tue, 21 Feb 2023 09:37:28 GMT Subject: RFR: 8302877: Speed up latin1 case conversions [v2] In-Reply-To: References:

Message-ID: On Tue, 21 Feb 2023 09:33:07 GMT, Eirik Bjorsnos wrote: > I have the feeling that most case-insensitive comparisons are pretty short, so not sure how useful this is IRL. There seems to be a win from strings of size 32 bytes upwards. (That's probably longer than most keys in TreeMaps using String.CASE_INSENSITIVE_ORDER, such as j.n.h.HttpHeaders) Benchmark (size) Mode Cnt Score Error Units EqualsIgnoreCase.scalar 16 avgt 2 20.608 ns/op EqualsIgnoreCase.scalar 32 avgt 2 36.510 ns/op EqualsIgnoreCase.vectorized 16 avgt 2 18.601 ns/op EqualsIgnoreCase.vectorized 32 avgt 2 12.795 ns/op This is outside scope for this PR, I just wanted to leave a trace of this observation here for future record. ------------- PR: https://git.openjdk.org/jdk/pull/12623 From redestad at openjdk.org Tue Feb 21 10:40:33 2023 From: redestad at openjdk.org (Claes Redestad) Date: Tue, 21 Feb 2023 10:40:33 GMT Subject: RFR: 8302877: Speed up latin1 case conversions [v2] In-Reply-To: References:

Message-ID: On Tue, 21 Feb 2023 06:59:47 GMT, Eirik Bjorsnos wrote: >> This PR suggests we speed up Character.toUpperCase and Character.toLowerCase for latin1 code points by applying the 'oldest ASCII trick in the book'. >> >> This takes advantage of the fact that latin1 uppercase code points are always 0x20 lower than their lowercase (with the exception of two code points which uppercase out of latin1). >> >> To verify the correctness of the new implementation, the test `Latin1CaseConversion` is added with an exhaustive verification of toUpperCase/toLowerCase for all latin1 code points. >> >> The implementation needs to balance the performance of the various ranges in latin1. An effort has been made to favour operations on ASCII code points, without causing excessive regression for higher code points. >> >> Performance is benchmarked for 7 chosen sample code points, each representing a range or a special-case. Results in the first comment. > > Eirik Bjorsnos has updated the pull request incrementally with one additional commit since the last revision: > > Spell fix for 'exhaustive' in comments in sun/text/resources Looks good. Some nits inline src/java.base/share/classes/java/lang/CharacterDataLatin1.java.template line 142: > 140: } > 141: int l = ch | 0x20; // Lowercase using 'oldest ASCII trick in the book' > 142: if ( l <= 'z' // In range a-z Suggestion: if (l <= 'z' // In range a-z test/micro/org/openjdk/bench/java/lang/Characters.java line 92: > 90: @Measurement(iterations = 5, time = 1) > 91: @Fork(3) > 92: public static class Latin1CaseConversions { Not sure if qualifying this as "Latin1" is necessary, even though that's what you've focused on for this PR. We could easily add some codePoints outside of the latin1 range (now or later) without changing the test. While having a switch with some readable names is a nice touch I think we should additionally allow integer codePoint as-is to keep it in line with the outer class, e.g. `default -> Integer.parseInt(codePoint);` ------------- Marked as reviewed by redestad (Reviewer). PR: https://git.openjdk.org/jdk/pull/12623 From duke at openjdk.org Tue Feb 21 11:12:18 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Tue, 21 Feb 2023 11:12:18 GMT Subject: RFR: 8302877: Speed up latin1 case conversions [v3] In-Reply-To: References: Message-ID: > This PR suggests we speed up Character.toUpperCase and Character.toLowerCase for latin1 code points by applying the 'oldest ASCII trick in the book'. > > This takes advantage of the fact that latin1 uppercase code points are always 0x20 lower than their lowercase (with the exception of two code points which uppercase out of latin1). > > To verify the correctness of the new implementation, the test `Latin1CaseConversion` is added with an exhaustive verification of toUpperCase/toLowerCase for all latin1 code points. > > The implementation needs to balance the performance of the various ranges in latin1. An effort has been made to favour operations on ASCII code points, without causing excessive regression for higher code points. > > Performance is benchmarked for 7 chosen sample code points, each representing a range or a special-case. Results in the first comment. Eirik Bjorsnos has updated the pull request incrementally with three additional commits since the last revision: - Allow any integer codePoint by defaulting to Integer.parseInt - Rename Latin1CaseConversions to just CaseConversions - Remove a whitespace following 'if (' ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12623/files - new: https://git.openjdk.org/jdk/pull/12623/files/70c624d7..bff999c4 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12623&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12623&range=01-02 Stats: 3 lines in 2 files changed: 0 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/12623.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12623/head:pull/12623 PR: https://git.openjdk.org/jdk/pull/12623 From duke at openjdk.org Tue Feb 21 11:12:23 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Tue, 21 Feb 2023 11:12:23 GMT Subject: RFR: 8302877: Speed up latin1 case conversions [v2] In-Reply-To: References:

Message-ID: On Tue, 21 Feb 2023 10:29:24 GMT, Claes Redestad wrote: >> Eirik Bjorsnos has updated the pull request incrementally with one additional commit since the last revision: >> >> Spell fix for 'exhaustive' in comments in sun/text/resources > > src/java.base/share/classes/java/lang/CharacterDataLatin1.java.template line 142: > >> 140: } >> 141: int l = ch | 0x20; // Lowercase using 'oldest ASCII trick in the book' >> 142: if ( l <= 'z' // In range a-z > > Suggestion: > > if (l <= 'z' // In range a-z Fixed! (My IDE does not highlight this code, making it a bit harder to spot mistakes like this) > test/micro/org/openjdk/bench/java/lang/Characters.java line 92: > >> 90: @Measurement(iterations = 5, time = 1) >> 91: @Fork(3) >> 92: public static class Latin1CaseConversions { > > Not sure if qualifying this as "Latin1" is necessary, even though that's what you've focused on for this PR. We could easily add some codePoints outside of the latin1 range (now or later) without changing the test. > > While having a switch with some readable names is a nice touch I think we should additionally allow integer codePoint as-is to keep it in line with the outer class, e.g. `default -> Integer.parseInt(codePoint);` You are probably right that Latin1 is a bit narrow here, removing the prefix. I added Integer.parseInt as the default, good idea! ------------- PR: https://git.openjdk.org/jdk/pull/12623 From duke at openjdk.org Tue Feb 21 11:14:13 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Tue, 21 Feb 2023 11:14:13 GMT Subject: RFR: 8302871: Speed up StringLatin1.regionMatchesCI [v7] In-Reply-To: References: Message-ID: > This PR suggests we can speed up `StringLatin1.regionMatchesCI` by applying 'the oldest ASCII trick in the book'. > > The new static method `CharacterDataLatin1.equalsIgnoreCase` compares two latin1 bytes for equality ignoring case. `StringLatin1.regionMatchesCI` is updated to use `equalsIgnoreCase` > > To verify the correctness of `equalsIgnoreCase`, a new test is added to `EqualsIgnoreCase` with an exhaustive verification that all 256x256 latin1 code point pairs have an `equalsIgnoreCase` consistent with Character.toUpperCase, Character.toLowerCase. > > Performance is tested for matching and mismatching cases of code point pairs picked from the ASCII letter, ASCII number and latin1 letter ranges. Results in the first comment below. Eirik Bjorsnos has updated the pull request incrementally with one additional commit since the last revision: Remove whitespace following '(' ------------- Changes: - all: https://git.openjdk.org/jdk/pull/12632/files - new: https://git.openjdk.org/jdk/pull/12632/files/b8139961..d7b1c164 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=12632&range=06 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=12632&range=05-06 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/12632.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12632/head:pull/12632 PR: https://git.openjdk.org/jdk/pull/12632 From duke at openjdk.org Tue Feb 21 11:22:33 2023 From: duke at openjdk.org (Eirik Bjorsnos) Date: Tue, 21 Feb 2023 11:22:33 GMT Subject: RFR: 8302877: Speed up latin1 case conversions [v3] In-Reply-To: References:

Message-ID: On Tue, 21 Feb 2023 11:14:13 GMT, Eirik Bjorsnos wrote: >> This PR suggests we can speed up `StringLatin1.regionMatchesCI` by applying 'the oldest ASCII trick in the book'. >> >> The new static method `CharacterDataLatin1.equalsIgnoreCase` compares two latin1 bytes for equality ignoring case. `StringLatin1.regionMatchesCI` is updated to use `equalsIgnoreCase` >> >> To verify the correctness of `equalsIgnoreCase`, a new test is added to `EqualsIgnoreCase` with an exhaustive verification that all 256x256 latin1 code point pairs have an `equalsIgnoreCase` consistent with Character.toUpperCase, Character.toLowerCase. >> >> Performance is tested for matching and mismatching cases of code point pairs picked from the ASCII letter, ASCII number and latin1 letter ranges. Results in the first comment below. > > Eirik Bjorsnos has updated the pull request incrementally with one additional commit since the last revision: > > Remove whitespace following '(' Marked as reviewed by redestad (Reviewer). ------------- PR: https://git.openjdk.org/jdk/pull/12632 From alanb at openjdk.org Tue Feb 21 14:30:29 2023 From: alanb at openjdk.org (Alan Bateman) Date: Tue, 21 Feb 2023 14:30:29 GMT Subject: RFR: 8302871: Speed up StringLatin1.regionMatchesCI [v7] In-Reply-To: References:

Message-ID: On Tue, 21 Feb 2023 11:14:13 GMT, Eirik Bjorsnos wrote: >> This PR suggests we can speed up `StringLatin1.regionMatchesCI` by applying 'the oldest ASCII trick in the book'. >> >> The new static method `CharacterDataLatin1.equalsIgnoreCase` compares two latin1 bytes for equality ignoring case. `StringLatin1.regionMatchesCI` is updated to use `equalsIgnoreCase` >> >> To verify the correctness of `equalsIgnoreCase`, a new test is added to `EqualsIgnoreCase` with an exhaustive verification that all 256x256 latin1 code point pairs have an `equalsIgnoreCase` consistent with Character.toUpperCase, Character.toLowerCase. >> >> Performance is tested for matching and mismatching cases of code point pairs picked from the ASCII letter, ASCII number and latin1 letter ranges. Results in the first comment below. > > Eirik Bjorsnos has updated the pull request incrementally with one additional commit since the last revision: > > Remove whitespace following '(' src/java.base/share/classes/java/lang/CharacterDataLatin1.java.template line 163: > 161: return mapChar; > 162: } > 163: /** I assume you should insert a blank line between the two methods. src/java.base/share/classes/java/lang/CharacterDataLatin1.java.template line 175: > 173: } > 174: // uppercase b1 using 'the oldest ASCII trick in the book' > 175: int U = b1 & 0xDF; I'm sure some people reading this comment will wonder which book :-) It might be better to drop that bit and if possible, find a better name for "U" as normally variables start with a lower case. ------------- PR: https://git.openjdk.org/jdk/pull/12632 From naoto at openjdk.org Tue Feb 21 17:25:28 2023 From: naoto at openjdk.org (Naoto Sato) Date: Tue, 21 Feb 2023 17:25:28 GMT Subject: RFR: 8302877: Speed up latin1 case conversions [v3] In-Reply-To: References:

Message-ID: On Tue, 21 Feb 2023 11:12:18 GMT, Eirik Bjorsnos wrote: >> This PR suggests we speed up Character.toUpperCase and Character.toLowerCase for latin1 code points by applying the 'oldest ASCII trick in the book'. >> >> This takes advantage of the fact that latin1 uppercase code points are always 0x20 lower than their lowercase (with the exception of two code points which uppercase out of latin1). >> >> To verify the correctness of the new implementation, the test `Latin1CaseConversion` is added with an exhaustive verification of toUpperCase/toLowerCase for all latin1 code points. >> >> The implementation needs to balance the performance of the various ranges in latin1. An effort has been made to favour operations on ASCII code points, without causing excessive regression for higher code points. >> >> Performance is benchmarked for 7 chosen sample code points, each representing a range or a special-case. Results in the first comment. > > Eirik Bjorsnos has updated the pull request incrementally with three additional commits since the last revision: > > - Allow any integer codePoint by defaulting to Integer.parseInt > - Rename Latin1CaseConversions to just CaseConversions > - Remove a whitespace following 'if (' LGTM. Thanks for the fix! ------------- Marked as reviewed by naoto (Reviewer). PR: https://git.openjdk.org/jdk/pull/12623 From duke at openjdk.org Tue Feb 21 18:23:18 2023 From: duke at openjdk.org (Madjosz) Date: Tue, 21 Feb 2023 18:23:18 GMT Subject: RFR: 8302983: ZoneRulesProvider.registerProvider() twice will remove provider Message-ID: Fixes JDK-8302983 (and duplicate JDK-8302898) ------------- Commit messages: - 8302983: ZoneRulesProvider.registerProvider() twice will remove provider Changes: https://git.openjdk.org/jdk/pull/12690/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=12690&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8302983 Stats: 53 lines in 2 files changed: 49 ins; 0 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/12690.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12690/head:pull/12690 PR: https://git.openjdk.org/jdk/pull/12690 From jlu at openjdk.org Tue Feb 21 19:34:11 2023 From: jlu at openjdk.org (Justin Lu) Date: Tue, 21 Feb 2023 19:34:11 GMT Subject: RFR: 8302512: Update IANA Language Subtag Registry to Version 2023-02-14 Message-ID: Incorporate the latest IANA language subtag registry definition (2023-02-14). ------------- Commit messages: - Add missing JBS #s to test header - IANA update 2/14/23 Changes: https://git.openjdk.org/jdk/pull/12699/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=12699&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8302512 Stats: 9 lines in 2 files changed: 6 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/12699.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12699/head:pull/12699 PR: https://git.openjdk.org/jdk/pull/12699 From naoto at openjdk.org Tue Feb 21 19:46:25 2023 From: naoto at openjdk.org (Naoto Sato) Date: Tue, 21 Feb 2023 19:46:25 GMT Subject: RFR: 8301119: Support for GB18030-2022 Message-ID: Upgrading the GB18030 charset in the JDK to the latest 2022 standard. Since this is not a compatible upgrade to the existing mapping, a new system property `jdk.charset.GB18030` is introduced. If it is set to "2000", the mapping falls back to the existing mapping based on the 2000 standard, otherwise, it defaults to 2022 mapping. Refer to the corresponding CSR for more detail. ------------- Commit messages: - Check initPhase and don't call System.getProperty if in phase 1 - Some clean-up - Some more fixes - removed unnecessary method name composition - Removed unnecessary imports - aliases fix - Move GB18030 into standard charsets provider - indentation - Removed unused annotation - Removed null check - ... and 4 more: https://git.openjdk.org/jdk/compare/574b48c6...0f3c25ce Changes: https://git.openjdk.org/jdk/pull/12518/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=12518&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8301119 Stats: 283 lines in 14 files changed: 129 ins; 41 del; 113 mod Patch: https://git.openjdk.org/jdk/pull/12518.diff Fetch: git fetch https://git.openjdk.org/jdk pull/12518/head:pull/12518 PR: https://git.openjdk.org/jdk/pull/12518 From naoto at openjdk.org Tue Feb 21 20:00:26 2023 From: naoto at openjdk.org (Naoto Sato) Date: Tue, 21 Feb 2023 20:00:26 GMT Subject: RFR: 8302512: Update IANA Language Subtag Registry to Version 2023-02-14 In-Reply-To: References: Message-ID: