RFR: 8353013: java.net.URI.create(String) may have low performance to scan the host/domain name from URI string when the hostname starts with number
Alan Bateman
alanb at openjdk.org
Sat Apr 12 06:09:30 UTC 2025
On Fri, 28 Mar 2025 15:19:46 GMT, Rohitash <duke at openjdk.org> wrote:
> `scanByte` throws `NumberFormatException` for URIs that start with numbers, e.g., https://11111111.x.y/
> The current flow is `parseIPv4Address` → `scanIPv4Address` → `scanByte`. `parseIPv4Address` uses `NumberFormatException` for control flow, so it captures the exception, ignores it, and returns -1. This has been reported by AWS customer to cause low performance. Details: [JDK-8353013](https://bugs.openjdk.org/browse/JDK-8353013) & https://github.com/aws/aws-sdk-java-v2/issues/5933
>
> This PR avoids NumberFormatException by skipping calls to `Integer.parseInt` if the number of digits in the octet is > 3.
>
> I benchmarked on local machine for potential regressions.
> https://gist.github.com/rk-kmr/cb1a3d59225c17b180a29cc125ebf887
>
>
> Benchmark Mode Cnt Score Error Units
> URIBenchMark.newImplWithNormalUrl thrpt 10 0.004 ± 0.001 ops/ns
> URIBenchMark.newImplWithNumberlUrl thrpt 10 0.004 ± 0.001 ops/ns
> URIBenchMark.oldImplWithNormalUrl thrpt 10 0.004 ± 0.001 ops/ns
> URIBenchMark.oldImplWithNumUrl thrpt 10 0.001 ± 0.001 ops/ns
> URIBenchMark.newImplWithNormalUrl avgt 10 236.762 ± 8.700 ns/op
> URIBenchMark.newImplWithNumberlUrl avgt 10 264.017 ± 7.274 ns/op
> URIBenchMark.oldImplWithNormalUrl avgt 10 233.853 ± 6.539 ns/op
> URIBenchMark.oldImplWithNumUrl avgt 10 1183.572 ± 29.242 ns/op
>
>
> I ran following tests.
>
> make test-tier1
> make test-tier2
> make test TEST=jdk/java/net
test/jdk/java/net/URI/Test.java line 1791:
> 1789:
> 1790: // 8353013 - java.net.URI.create(String) may have low performance to scan the host/domain name from
> 1791: // URI string when the hostname starts with number
This comment looks a bit out of place in a unit test. I think start with a JMH benchmark and change the comment here to make it clearer that it's provide more test coverage for case the authority component of a hierarchical when the host component starts with a number.
-------------
PR Review Comment: https://git.openjdk.org/jdk/pull/24295#discussion_r2040578232
More information about the net-dev
mailing list