RFR: 8289743: AArch64: Clean up patching logic [v2]

Tue Jul 12 10:10:43 UTC 2022

On Mon, 11 Jul 2022 16:33:50 GMT, Andrew Haley <aph at openjdk.org> wrote:

>> The current logic for patching is a mess of if-then-elses. By rearranging the logic and using a switch we can make it both easier to understand and faster.
>
> Andrew Haley has updated the pull request incrementally with one additional commit since the last revision:
> 
>   8289743: AArch64: Clean up patching logic

src/hotspot/cpu/aarch64/macroAssembler_aarch64.cpp line 78:

> 76: // Patch any kind of instruction; there may be several instructions.
> 77: // Return the total length (in bytes) of the instructions.
> 78: int MacroAssembler::pd_patch_instruction_size(address branch, address target) {

You really need to document what you are doing up front here by explaining what patchable sequences arise, categorizing them by leading insn and then explaining how that can categorization can be automated by means of a switch on certain insn bits plus, in some cases, auxiliary bit tests. I suggest the following as a lead-in comment:

  // Instruction sequences whose target may need to be retrieved or
  // patched can be distinguished by their leading instruction,
  // sorting them into three main instruction groups and
  // related subgroups.
  //
  // 1) Branch, Exception and System (insn count = 1)
  //    1a) Unconditional branch (immediate):
  //      b/bl imm19
  //    1b) Compare & branch (immediate):
  //      cbz/cbnz Rt imm19
  //    1c) Test & branch (immediate):
  //      tbz/tbnz Rt imm14
  //    1d) Conditional branch (immediate):
  //      b.cond imm19
  //
  // 2) Loads and Stores (insn count = 1)
  //    2a) Load register literal:
  //      ldr Rt imm19
  //
  // 3) Data Processing Immediate (insn count = 2 or 3)
  //    3a) PC-rel. addressing
  //      adr/adrp Rx imm21; ldr/str Ry Rx  #imm12
  //      adr/adrp Rx imm21; add Ry Rx  #imm12
  //      adr/adrp Rx imm21; movk Rx #imm16<<32; ldr/str Ry, [Rx, #offset_in_page]
  //      adr/adrp Rx imm21; movk Rx #imm16<<32; add Ry, Rx, #offset_in_page
  //      adr/adrp Rx imm21; movk Rx #imm16<<32
  //      adr/adrp Rx imm21
  //    3b) Move wide (immediate)
  //      movz Rx #imm16; movk Rx #imm16 << 16; movk Rx #imm16 << 32;
  //
  // A switch on a subset of the instruction's bits provides an efficient
  // dispatch to these subcases.
  //
  // insn[28:26] -> main group ('x' == don't care)
  //   00x -> UNALLOCATED
  //   100 -> Data Processing Immediate
  //   101 -> Branch, Exception and System
  //   x1x -> Loads and Stores
  //
  // insn[30:25] -> subgroup ('_' == group, 'x' == don't care).
  // n.b. in some cases extra bits need to be checked to verify the
  // instruction is as expected
  //
  // 1) ... xx101x Branch, Exception and System
  //   1a)  00___x Unconditional branch (immediate)
  //   1b)  01___0 Compare & branch (immediate)
  //   1c)  01___1 Test & branch (immediate)
  //   1d)  10___0 Conditional branch (immediate)
  //        other  Should not happen
  //
  // 2) ... xxx1x0 Loads and Stores
  //   2a)  xx1_x_ Load/Store register
  //   2aa) x01_x_0 Load register literal (n.b. requires insn[24] == 0)
  //                strictly should be 64 bit non-FP/SIMD i.e.
  //       0101_0_0 (i.e. requires insn[31:24] == 01011000)
  //
  // 3) ... xx100x Data Processing Immediate
  //   3a)  xx___00 PC-rel. addressing (n.b. requires insn[24] == 0)
  //   3b)  xx___101 Move wide (immediate) (n.b. requires insn[24:23] == 01)
  //                 strictly should be 64 bit movz #imm16<<0
  //       110___10100 (i.e. requires insn[31:21] == 11010010100)
  //

This means you no longer need most of the comments that occur inline in `pd_patch_instruction_size` and `target_addr_for_insn`. You can simply refer to cases 1a, 1b, ..., 2a etc.

-------------

PR: https://git.openjdk.org/jdk/pull/9398