RFR: 8283232: x86: Improve vector broadcast operations [v5]

Quan Anh Mai duke at openjdk.java.net
Thu Mar 17 22:39:06 UTC 2022


> Hi,
> 
> This patch improves the generation of broadcasting a scalar in several ways:
> 
> - Avoid potential data bypass delay which can be observed on some platforms by using the correct type of instruction if it does not require extra instructions.
> - As it has been pointed out, dumping the whole vector into the constant table is costly in terms of code size, this patch minimises this overhead for vector replicate of constants. Also, options are available for constants to be generated with more alignment so that vector load can be made efficiently without crossing cache lines.
> - Vector broadcasting should prefer rematerialising to spilling when register pressure is high.
> 
> This patch also removes some redundant code paths and rename some incorrectly named instructions.
> 
> Thank you very much.

Quan Anh Mai has updated the pull request incrementally with one additional commit since the last revision:

  rematerializing input count

-------------

Changes:
  - all: https://git.openjdk.java.net/jdk/pull/7832/files
  - new: https://git.openjdk.java.net/jdk/pull/7832/files/3dbc7432..bb494bc2

Webrevs:
 - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=7832&range=04
 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=7832&range=03-04

  Stats: 4 lines in 1 file changed: 1 ins; 0 del; 3 mod
  Patch: https://git.openjdk.java.net/jdk/pull/7832.diff
  Fetch: git fetch https://git.openjdk.java.net/jdk pull/7832/head:pull/7832

PR: https://git.openjdk.java.net/jdk/pull/7832


More information about the hotspot-compiler-dev mailing list