<div dir="ltr"><div dir="ltr"><div class="gmail_default" style="font-family:arial,helvetica,sans-serif"><span style="font-family:Arial,Helvetica,sans-serif">On Wed, Jul 17, 2024 at 4:41 AM Maurizio Cimadamore <<a href="mailto:maurizio.cimadamore@oracle.com">maurizio.cimadamore@oracle.com</a>> wrote:</span><br></div></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><u></u>
<div>
<p>Hi David,<br>
I agree that having to second-guess signedness is not fun.</p>
<p>However, I'd like to understand the problem more. I do not see in
SysV ABI any reference to the need zero/sign-extend arguments. Do
you have an example of an ABI with stricter requirements? The SO
post you show says something about clang zero/sign extending all
arguments that are smaller than 32 bits, but it's not clear to me
whether that's a standard, or just something that clang does.</p></div></blockquote><div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">As far as system V goes, this behavior definitely isn't standard (in fact my understanding is that it is undefined). However, many compiler systems, libraries, and even the linux kernel have seem to have relied on caller-side sign/zero extension for values less than 32 bits in size. I think the system V ABI on most (?) CPU types require callee-side extension for 32-bit values and I think most (?) compilers conform to that. But, having a way to indicate signedness for all types would still be safest and most future-proof I think, even if the information isn't always used on all platforms.</div></div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div>
<p>There are several ways to address this issue that were discussed
in the past:</p>
<p>* add carriers for unsigned types (e.g. Unsigned<byte>) -
this will likely require Valhalla<br>
* add a sign property to value layouts. This is relatively
harmless. And will also allow Linker::canonicalLayouts to expand
the set of canonical layouts it reports (by including the unsigned
ones)<br>
* deal with this like clang does - e.g. as a Linker option that
can be added to function parameter/return types</p>
<p>Of these, I think my preferred option would be to add the
property to value layouts. This will turn out useful if, in the
future, we will allow the memory part of the FFM API to e.g. take
a JAVA_INT and turn it into a `long` (because we could inspect the
sign, and decide whether to zero or sign-extend).</p></div></blockquote><div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">I think any of these options sounds good (though I don't like the idea of waiting for Valhalla).</div></div></div><div><br></div><span class="gmail_signature_prefix">-- </span><br><div dir="ltr" class="gmail_signature"><div dir="ltr">- DML • he/him<br></div></div></div>