Java value layout constants

Maurizio Cimadamore maurizio.cimadamore at oracle.com
Thu Nov 25 15:52:29 UTC 2021


Hi,
This is a followup of the disccussion that started in [1]. In the new 
changes slated for Java 18, the set of Java value layout constants are 
all byte-aligned (e.g. alignment constraints are not set). The 
motivation for this is mostly historical (but there's also a performance 
twist, see below): the dereference primitives in MemoryAccess used to 
setup var handles based on non-aligned layouts. So, to preserve 
compatibility with what we had before, we opted to "relax" alignment 
constraints on the JAVA_XYZ layout constants in ValueLayout. During the 
development of the new dereference API, some issues arised around 
alignment checks and memory copy [2]; which also contributed to 
consolidate the feeling that Java layout constants should be unaligned.

Now, while it's always possible, for clients, to go back to the desired 
alignment constraints (e.g. by defining custom layout constants), from 
the discussion it emerged that it can be somewhat confusing/surprising 
having a layout constant called JAVA_INT, whose alignment is not the VM 
alignment for a Java int value.

For this reason, I'd like to propose a small tweak, which would 
essentially revert alignment constraints for Java layout constants to 
what they were in 17. In other words, let's keep the "good" JAVA_XYZ 
names for the _true_ Java layouts (including alignment as seen by VM). 
If clients want to create unaligned constants they can do so, as they 
can also create big-endian constants where needed. In the majority of 
cases, since access will be aligned (for performance reasons), this will 
not really change much for clients. But some of those clients that need 
to pack data structures more (Lucene?) will need to define their own 
packed/unaligned layout constants.

Does that seem like an acceptable compromise?

A patch for these changes is available here:

https://github.com/mcimadamore/jdk/tree/value_layout_align

While testing it, I was reminded (once more) that access with alignment 
constraints is currently slower than access w/o alignment constraints - 
which has to do with C2 not hoisting alignment checks in cases like this:

((segmentBaseAddress + accessedOffset) & alignmentMask) == 0

Here, segmentBaseAddress is a loop invariant, and the accessedOffset 
depends on the loop variable. So, it is in principle possible for the VM 
to hoist the check for baseAddress and to eliminate the alignment check 
for the offset (which would come from BCE analysis). But this is not how 
things work today. The patch works around this, by using different var 
handles for when the accessed offset is provably aligned (e.g. when 
using the getAtIndex/setAtIndex APIs). Even with those workarounds, 
calling getAtIndex/setAtIndex on a MemoryAddress is still slower than on 
a MemorySegment, because of the way in which we try to workaround the 
long loop optimization problem. Luckily a fix for that problem [3] has 
been integrated in JDK 18, which means we will remove these 
implementation workaround, which will help making performance more 
stable across the board.

If the changes in this patch seem good, I'm happy to try and integrate 
this into 18.

Cheers
Maurizio

[1] - 
https://mail.openjdk.java.net/pipermail/panama-dev/2021-November/015805.html
[2] - 
https://github.com/openjdk/panama-foreign/pull/555#issuecomment-865115787
[3] - https://github.com/openjdk/jdk/pull/2045





More information about the panama-dev mailing list