Generic type signatures in MethodRepository have many copies of simple class names
Steven Schlansker
stevenschlansker at gmail.com
Mon Aug 18 16:18:39 UTC 2025
> On Aug 11, 2025, at 9:44 AM, Steven Schlansker <stevenschlansker at gmail.com> wrote:
>
>>
>> On Aug 10, 2025, at 2:29 PM, Johannes Döbler <jd at civilian-framework.org> wrote:
>>
>> Hi Steven,
>>
>> I think you are onto something. Given this example
>>
>>> public class TypeParamsDemo {
>>> public static void main(String[] args) throws Exception {
>>> Method method = TypeParamsDemo.class.getMethod("getList", Set.class);
>>> TypeVariable<Method>[] tp = method.getTypeParameters();
>>> }
>>>
>>> public <T extends Number> List<T> getList(Set<T> set) {
>>> return new ArrayList<>(set);
>>> }
>>> }
>>
>> Then:
>> tp[0].bounds[0].path[0].name is "java.lang.Number" and is a newly created String parsed from the type signature "<T:Ljava/lang/Number;>(Ljava/util/Set<TT;>;)Ljava/util/List<TT;>;".
>> The result of method.getTypeParameters() is cached in Method.genericInfo, so once initialized by a caller (e.g. by Jackson/REST easy) it won't go away.
>> Imho interning the class name should be fine since it refers to an existing class.
>>
>> Best
>> Johannes
>>
>
> Thank you for your analysis Johannes.
> I've opened https://github.com/openjdk/jdk/pull/26730 with my proposed change.
>
It seems that I need a bug issue open in order to proceed any further.
Is anyone able to create a bug report and potentially sponsor this change? Thank you!
Steven
>>
>> On 09/08/2025 01:10, Steven Schlansker wrote:
>>> Hello core-libs-dev, happy Friday!
>>>
>>> While diagnosing an out of memory situation in our application, I noticed a surprising source of memory usage.
>>> While this is not so severe to actually cause our OOM, it seems wasteful and I thought to bring it to your attention.
>>>
>>> We use reflection-based technologies like Jackson JSON and RESTEasy that retrieve generic type information from many of our classes.
>>> Our profiler provides diagnostics of wasted space due to duplicate objects, particularly Strings.
>>>
>>> The analysis highlights many thousands of copies of String instances holding the full name of a class, e.g. "java.util.Optional"
>>> or "com.mycompany.Id". The path to GC route looks like:
>>>
>>> String <- sun.reflect.generics.tree.SimpleClassTypeSignature <- Object[] <- ArrayList <- ClassTypeSignature <- MethodTypeSignature <- MethodRepository <- Method
>>>
>>> Seeing how these SimpleClassTypeSignature instances are created, it looks like they come from the sun.reflect.generics.parser.SignatureParser which calls
>>> `input.substring(mark, index)`, possibly with a call to `replace('/', '.')` to munge the package name. In all but the simplest of cases, this will return a new String
>>> for every call.
>>>
>>> Since this String is representing a Class name, the cardinality should by its nature be very low. For each unique type, there will be many methods referring to it.
>>> Additionally, this generic information is lazy-loaded at most once per Method object.
>>>
>>> Therefore, SimpleClassTypeSignature.n seems like a natural place to apply String.intern(), for example changing:
>>>
>>> public static SimpleClassTypeSignature make(String n,
>>> boolean dollar,
>>> TypeArgument[] tas){
>>> return new SimpleClassTypeSignature(n, dollar, tas);
>>> }
>>>
>>> to intern the name:
>>>
>>> public static SimpleClassTypeSignature make(String n,
>>> boolean dollar,
>>> TypeArgument[] tas){
>>> return new SimpleClassTypeSignature(n.intern(), dollar, tas);
>>> }
>>>
>>> With any luck, maybe this can even share the same string instance as the class itself uses.
>>> Am I correct in thinking this would be a moderately nice improvement, for a relatively cheap cost?
>>> Or perhaps there's some reason this isn't a good idea?
>>>
>>> Thank you for your thoughts on the subject,
>>> Steven
More information about the core-libs-dev
mailing list