[aarch64-port-dev ] RFR (S) 8131682: C1 should use multibyte nops everywhere

Andrew Haley aph at redhat.com
Mon Jul 27 12:07:12 UTC 2015


On 07/27/2015 11:53 AM, Aleksey Shipilev wrote:
> On 07/27/2015 01:21 PM, Andrew Haley wrote:
>> On 27/07/15 10:13, Aleksey Shipilev wrote:
>>> Thanks Goetz! Fixed the assembler_ppc.inline.hpp.
>>>
>>> Andrew/Edward, are you OK with AArch64 part?
>>>   http://cr.openjdk.java.net/~shade/8131682/webrev.02/
>>
>> I agree that it looks good.  Please have a look to see how many NOPs take the
>> same time as a branch.
> 
> Thanks!
> 
> I don't quite believe we should spend time trying branches for nops, at
> least for x86. The change we are discussing follows the Intel
> Optimization Reference Manual 3.5.1.10 "Using NOPs", which
> Assembler::align for x86 seems to implement with some bells and
> whistles. Agner agrees on using multi-byte nops (0F 1F ...) on modern
> x86 chips as well; up to the point he claims 4 insn/clock throughput for
> them.

Sure.  My apologies: I responded to the wrong person.  My interest is
about AArch64.

Andrew.




More information about the aarch64-port-dev mailing list