RFR: 7903613: Bad nested names are sometimes attached to structs [v6]

Maurizio Cimadamore mcimadamore at openjdk.org
Wed Dec 20 16:10:24 UTC 2023


On Wed, 20 Dec 2023 16:07:37 GMT, Maurizio Cimadamore <mcimadamore at openjdk.org> wrote:

>> The `NameMangler` visitor is used to compute the Java name of a jextract declaration. This is implemented as a declaration visitor. Unfortunately, the logic that computes the Java name can be sensitive to the order in which declarations are visited (because this visitor features a "parent" declaration, whose contents affect as to whether a "nested" struct name is generated or not).
>> 
>> In reality, the logic of the name mangler needs to be able to disambiguate between structs that are either anonymous, or already declared somewhere else, and structs that are declared as part of a typedef, variable, function parameter/return declaration. In the former case, we either need no Java name (anonymous struct) or a toplevel Java name. In the latter we need a nested struct name (as the struct class will be nested inside some other class).
>> 
>> This PR introduces a new visitor which tags all struct/union/enum declarations which fall in the latter bucket. This is done with an algorithm which:
>> 
>> 1. visits all declarations in a toplevel header
>> 2. remembers which scoped declarations have been seen *directly* (e.g. as part of the visit)
>> 3. keeps track of which scoped declarations can be seen *indirectly* (e.g. because they are behind some declared type)
>> 4. subtracts the declarations in (2) from the declarations in (3), and visits the declarations in the remaining set
>> 5. keeps performing (2), (3), (4) until there's no declaration in (3)
>> 
>> All scoped declarations that appear exclusively as part of some declared type are augmented with the `NestedDecl` attribute, which is then read when calling `Utils::nestedDeclarationFor`. This ensures that all the jextract visitor only recurse on a scoped declaration attached to a type which is known not to have been seen anywhere else. As a result, the behavior of the name mangler is independent of the order in which declarations are seen.
>> 
>> It should be possible, in principle, to leverage this infrastructure to define a declaration visitor that automatically looks inside "nested declarations" (so that subsequent visitors don't really need to concern with following declared types).
>> 
>> I've tested this change with windows.h, which works as expected.
>
> Maurizio Cimadamore has updated the pull request incrementally with five additional commits since the last revision:
> 
>  - Fix mangling
>    Add mangling test for nested decls
>  - Add more comments
>  - Fix function pointer typedef mangled names for nested struct in param/returns
>  - Better names for function parameter/return structs
>  - Deal with param/return nested decls

src/main/java/org/openjdk/jextract/impl/TreeMaker.java line 453:

> 451:         c.forEach(m -> {
> 452:             if (m.isDefinition()) {
> 453:                 if (m.kind() == CursorKind.ParmDecl && !ignoreNestedParams) {

Note: if we see a param declaration, we need to recursively inspect it, as it might contain more nested declarations. This is useful for function pointer typedefs and variables whose type is a function pointer. In case of a function declaration this logic is disabled, as a function declaration will always point to its parameter declarations anyway.

-------------

PR Review Comment: https://git.openjdk.org/jextract/pull/167#discussion_r1432909802


More information about the jextract-dev mailing list