RFR: 8283775: VM support for graph querying in debugger with BFS traversal and node filtering [v11]
Emanuel Peter
duke at openjdk.java.net
Thu May 26 10:33:46 UTC 2022
> **Note: Refactoring and extension still in progress**
>
> I implemented VM support for BFS traversal of IR nodes, where one can filter for input/output edges, and the node-type (control / data / memory). If one specifies the `target` node, we find a shortest path between `this` and `target`. With the `options` string one can easily select which node types to traverse.
>
> `void Node::print_bfs(const uint max_distance, Node* target, const char* options)`
>
> While a sufficient summary and description is in the comments above the function definition, I want to explain some useful use-cases here.
>
> Please let me know if you would find this helpful, or if you have any feedback to improve it.
> Thanks, Emanuel
>
> **1. Better dump()**
> The function is similar to `dump()`, we can also follow input / output edges up to a certain distance. There are a few improvements/added features:
>
> 1. Display the distance to the specified node. This way I can quickly assess how many nodes I have at a certain distance, and I can traverse edges from one distance to the next. The parent column shows the node one step closer to the BFS root (this).
> 2. Node filtering with types taken from `type->category()` help one limit traversal to control, data or memory flow. The categories are the same as in IGV.
> 3. Probably controversial: coloring of node types in terminal. By default it is off. May not work on all terminals. But I find it very useful to quickly separate memory, data and control - just as in IGV - but in my terminal!
> 4. After matching, we create Mach nodes from the IR nodes. For most Mach nodes, there is an associated old node. Often an assert is triggered for Mach nodes, but the graph already breaks in the IR graph. It is thus very helpful to have the old node idx displayed.
> 5. I also display the head node idx of the block a Mach node is scheduled for. This can be helpful if one gets bad dominance asserts, and needs to see why the blocks were scheduled in a bad way.
>
> Example:
>
> (rr) p find_node(35)->print_bfs(2, 0, "cdmox+")
> No target: perform BFS.
> dis par c dump
> ---------------------------------------------
> 0 35 d 35 CmpP === _ 34 25 [[ 36 ]]
> 1 35 d 34 LoadP === _ 31 33 [[ 35 ]]
> 1 35 d 25 ConP === 0 [[ 26 27 31 35 41 ]] #NULL
> 2 34 m 31 StoreP === 20 27 29 25 [[ 23 34 41 42 ]]
> 2 34 d 33 AddP === _ 1 12 32 [[ 34 ]]
>
>
> Example with Mach nodes:
>
> (rr) p ctrl->print_bfs(4, 0, "cdmox+OB")
> No target: perform BFS.
> dis [head idom d] old par c dump
> ---------------------------------------------
> 0 159 147 6 _ 159 c 159 Region === 159 57 [[ 159 158 59 ]]
> 1 147 148 5 o183 159 c 57 IfTrue === 8 [[ 159 ]]
> 2 147 148 5 o182 57 c 8 jmpConU === 147 9 [[ 7 57 ]]
> 3 147 148 5 _ 8 c 147 Region === 147 14 [[ 147 8 ]]
> 3 147 148 5 o180 8 d 9 compUL_rReg === _ 10 13 [[ 8 ]]
> 4 148 149 4 o174 147 c 14 IfTrue === 15 [[ 147 ]]
> 4 147 148 5 o203 9 d 10 decL_rReg === _ 11 [[ 12 9 ]]
> 4 147 148 5 o179 9 d 13 convI2L_reg_reg === _ 28 [[ 9 ]]
>
>
> **2. Find loop body**
> When I find myself in a loop, I try to localize the loop head and end, and map out at least one path between them.
> `loop_end->print_bfs(20, loop_head, "cox+")`
> This provides us with a shortest path, given this path has a distance of at most 20.
>
> Example:
>
> (rr) p find_node(158)->print_bfs(20, find_node(160), "cox+")
> Find shortest path: 158 -> 160.
>
> Backtrace target.
> dis c dump
> ---------------------------------------------
> 9 c 160 OuterStripMinedLoop === 160 339 159 [[ 160 358 ]]
> 8 c 358 CountedLoop === 358 160 143 [[ 358 362 363 ]]
> 7 c 363 If === 358 351 [[ 364 367 ]]
> 6 c 364 IfTrue === 363 [[ 128 ]]
> 5 c 128 If === 364 127 [[ 129 130 ]]
> 4 c 129 IfTrue === 128 [[ 155 ]]
> 3 c 155 CountedLoopEnd === 129 154 [[ 157 143 ]] [lt]
> 2 c 157 IfFalse === 155 [[ 162 163 ]]
> 1 c 162 SafePoint === 157 1 7 1 1 163 100 1 1 13 27 133 [[ 158 ]]
> 0 c 158 OuterStripMinedLoopEnd === 162 156 [[ 159 227 ]]
>
> Example with Mach nodes:
>
> (rr) p ctrl->print_bfs(10, val, "cdmox-+OB")
> Find shortest path: 159 -> 27.
>
> Backtrace target.
> dis [head idom d] old e c dump
> ---------------------------------------------
> 2 24 1 2 o10 + d 27 MachProj === 24 [[ 19 28 4 59 95 99 118 ]]
> 1 56 159 7 o239 - d 59 loadB === 159 29 27 60 [[ 55 ]]
> 0 159 147 6 _ c 159 Region === 159 57 [[ 159 158 59 ]]
Emanuel Peter has updated the pull request incrementally with five additional commits since the last revision:
- remove Node::related, dump_related, dump_related_compact
- fixing some comments, fix alignment, don't traverse over root or constants
- align (and optionally color) dump
- filtering extended to visit/boundary query pattern
- refactored into pipeline. sort for input/output.
-------------
Changes:
- all: https://git.openjdk.java.net/jdk/pull/8468/files
- new: https://git.openjdk.java.net/jdk/pull/8468/files/5b0f43e6..d78fb82e
Webrevs:
- full: https://webrevs.openjdk.java.net/?repo=jdk&pr=8468&range=10
- incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=8468&range=09-10
Stats: 681 lines in 13 files changed: 239 ins; 323 del; 119 mod
Patch: https://git.openjdk.java.net/jdk/pull/8468.diff
Fetch: git fetch https://git.openjdk.java.net/jdk pull/8468/head:pull/8468
PR: https://git.openjdk.java.net/jdk/pull/8468
More information about the hotspot-compiler-dev
mailing list