Optimize sun.invoke.util.BytecodeDescriptor.unparse

Thu Aug 20 18:01:49 UTC 2020

Hi Roger,

thanks for taking a look!

> Though I wonder if performs differently than just calling  t.descriptorString()?

Seems pretty much to be the same.
The difference of 1 ns for the primitive case is a bit weird, but I guess at this levels it's also possible to have fluctuations here and there.

Long.class                                                                               
Benchmark                                          Mode  Cnt     Score     Error   Units 
MyBenchmark.unparseChristoph                       avgt   10    36,995 ±   0,748   ns/op 
MyBenchmark.unparseChristoph:·gc.alloc.rate.norm   avgt   10   168,009 ±   0,001    B/op 
MyBenchmark.unparseRoger                           avgt   10    36,857 ±   1,472   ns/op 
MyBenchmark.unparseRoger:·gc.alloc.rate.norm       avgt   10   168,009 ±   0,001    B/op 
MyBenchmark.unparseOld                             avgt   10    53,926 ±   1,991   ns/op 
MyBenchmark.unparseOld:·gc.alloc.rate.norm         avgt   10   256,014 ±   0,001    B/op 
                                                                                         
long.class                                                                               
Benchmark                                          Mode  Cnt     Score     Error   Units 
MyBenchmark.unparseChristoph                       avgt   10     5,184 ±   0,168   ns/op 
MyBenchmark.unparseChristoph:·gc.alloc.rate.norm   avgt   10    ≈ 10⁻⁶              B/op 
MyBenchmark.unparseRoger                           avgt   10     6,149 ±   0,238   ns/op 
MyBenchmark.unparseRoger:·gc.alloc.rate.norm       avgt   10    ≈ 10⁻⁶              B/op 
MyBenchmark.unparseOld                             avgt   10    11,236 ±   0,464   ns/op 
MyBenchmark.unparseOld:·gc.alloc.rate.norm         avgt   10    80,003 ±   0,001    B/op

> It should be equivalent, without extra checks.

It seems to be indeed equivalent, so I would change my proposal to the following.
I would keep the check for Object.class and int.class above as they seem to be the most common.
At least Object.class is good to have to avoid unnecessary String allocations in from descriptorString() imho.

=========== PATCH ===============

--- a/src/java.base/share/classes/sun/invoke/util/BytecodeDescriptor.java       Thu Aug 13 09:33:28 2020 -0700
+++ b/src/java.base/share/classes/sun/invoke/util/BytecodeDescriptor.java       Thu Aug 20 19:44:57 2020 +0200
@@ -110,9 +110,7 @@
         } else if (type == int.class) {
             return "I";
         }
-        StringBuilder sb = new StringBuilder();
-        unparseSig(type, sb);
-        return sb.toString();
+        return type.descriptorString();
     }

What do you think?

Cheers,
Christoph

On 8/13/20 1:31 PM, Christoph Dreis wrote:
> Hi,
>
> I just stumbled upon sun.invoke.util.BytecodeDescriptor.unparse that seems to unnecessarily create a StringBuilder and checks for the given type to be of Object.class twice in certain scenarios.
>
> When I apply the attached patch below with the following isolated benchmark:
>
> @BenchmarkMode(Mode.AverageTime)
> @OutputTimeUnit(TimeUnit.NANOSECONDS)
> @State(Scope.Thread)
> public class MyBenchmark {
>
> 	@State(Scope.Thread)
> 	public static class BenchmarkState {
> 		private Class<?> test = String.class; // long.class;
> 	}
>
> 	@Benchmark
> 	public String unparseNew(BenchmarkState state) {
> 		return BytecodeDescriptor.unparseNew(state.test);
> 	}
>
> 	@Benchmark
> 	public String unparseOld(BenchmarkState state) {
> 		return BytecodeDescriptor.unparseOld(state.test);
> 	}
> }
>
> I get the following results:
> String.class
> Benchmark                                   Mode  Cnt     Score     Error   Units
> MyBenchmark.unparseNew                      avgt   10    47,207 ±   1,918   ns/op
> MyBenchmark.unparseNew:·gc.alloc.rate.norm  avgt   10   232,011 ±   0,002    B/op
> MyBenchmark.unparseOld                      avgt   10    87,197 ±  22,843   ns/op
> MyBenchmark.unparseOld:·gc.alloc.rate.norm  avgt   10   384,020 ±   0,001    B/op
>                                                                                    
> long.class
> Benchmark                                   Mode  Cnt     Score     Error   Units
> MyBenchmark.unparseNew                      avgt   10     4,996 ±    0,022   ns/op
> MyBenchmark.unparseNew:·gc.alloc.rate.norm  avgt   10    ≈ 10⁻⁶               B/op
> MyBenchmark.unparseOld                      avgt   10    13,303 ±    6,305   ns/op
> MyBenchmark.unparseOld:·gc.alloc.rate.norm  avgt   10    80,003 ±    0,001    B/op
>
> As you can see the new way makes things allocation free for primitives and also improves normal classes.
>
> It seems like a relatively trivial improvement. In case you think this is worthwhile, I would appreciate it if someone could sponsor the change.
>
> Cheers,
> Christoph
>
> ======= PATCH =======
> --- a/src/java.base/share/classes/sun/invoke/util/BytecodeDescriptor.java       Thu Aug 13 09:33:28 2020 -0700
> +++ b/src/java.base/share/classes/sun/invoke/util/BytecodeDescriptor.java       Thu Aug 13 19:27:26 2020 +0200
> @@ -110,9 +110,13 @@
>           } else if (type == int.class) {
>               return "I";
>           }
> -        StringBuilder sb = new StringBuilder();
> -        unparseSig(type, sb);
> -        return sb.toString();
> +        Wrapper basicType = Wrapper.forBasicType(type);
> +        char c = basicType.basicTypeChar();
> +        if (c != 'L') {
> +            return basicType.basicTypeString();
> +        } else {
> +            return type.descriptorString();
> +        }
>       }
>
>