RFR(S) 8204267 - Generate comments in -XX:+PrintInterpreter to link to source code

Ioi Lam ioi.lam at oracle.com
Wed Jul 11 22:46:58 UTC 2018


Hi Coleen,


Thanks for the review. I have updated the patch according to your 
comments. Here's the diff from the last webrev:

http://ioilinux.us.oracle.com/webrev/jdk12.open/8204267-print-interpreter-comments.v002.delta/


On 7/11/18 10:44 AM, coleen.phillimore at oracle.com wrote:
>
> http://cr.openjdk.java.net/~iklam/jdk12/8204267-print-interpreter-comments.v001/src/hotspot/share/compiler/disassembler.cpp.udiff.html 
>
>
> + Link *link = new (ResourceObj::C_HEAP, mtInternal) Link(file, line);
>
>
> Can you use mtCode here?  Why not make Link inherit from 
> CHeapObj<mtCode> then just say new?
>
Fixed.

> + if (_cached_src_lines != NULL) {
> + for (int i=0; i<_cached_src_lines->length(); i++) {
> + os::free((void*)_cached_src_lines->at(i));
> + }
> + _cached_src_lines->clear();
> + } else {
> + _cached_src_lines = new (ResourceObj::C_HEAP, 
> mtInternal)GrowableArray<const char*>(0, true);
> + }
>
>
> Growable Array has a delete operator that will clear_and_deallocate 
> all the elements and the array itself (which I don't think this does).

We have this function:

template<class E> void GrowableArray<E>::clear_and_deallocate() {
     assert(on_C_heap(),
            "clear_and_deallocate should only be called when on C heap");
     clear();
     if (_data != NULL) {
       for (int i = 0; i < _max; i++) _data[i].~E();
       free_C_heap(_data);
       _data = NULL;
     }
}

My <E> type is currently <const char*>. To take advantage of 
clear_and_deallocate I have to wrap the const char* into something like this

struct {
    const char* _line;
    LineInfo(const char* l) : _line(l) {}
    ~LineInfo() {os::free(_line);}
} LineInfo;

but this adds a tone of boiler plate code and still won't work, because 
GrowableArray<E>::grow will call ~E(), which messes things up.

So I'll just keep the existing code and won't bother.

>   But it looks like you're reusing _cached_src_lines for later 
> iterations.  It's an odd usage.  But you can't resource allocate this?
>

I am worried that we may have be under nested ResourceMarks, and the 
allocated _cached_src_lines may be freed unexpectedly.

> 302 if (_cached_src == NULL || strcmp(_cached_src, file) != 0) {
>
>
> The indentation is wrong, and this would be a nice place for a comment 
> at line 301.
>
I fixed indent and added this comment

         // _cached_src_lines is a single cache of the lines of a source 
file, and we refill this cache
         // every time we need to print a line from a different source 
file. It's not the fastest,
         // but seems bearable.


> http://cr.openjdk.java.net/~iklam/jdk12/8204267-print-interpreter-comments.v001/src/hotspot/share/compiler/disassembler.hpp.udiff.html 
>
>
> I wonder if checking PrintInterpreter for each macro assembler line 
> will slow down generating the interpreter for the normal case. Maybe 
> there is an EnableIf<PrintInterpreter> sort of template trick to make 
> it evaporate?
>
> PrintInterpreter is an option in product mode.
>
I am hoping that __ is slow enough, so doing an extra load-and-test 
won't slow down it too much. I benchmarked "java -Xint -version" and the 
slow down is 0.11% after 1000 runs, so I guess it's acceptable.

If people start complaining, we can do this later ....

   template<class T> inline static T* hook(const char* file, int line, 
T* masm) {
#ifdef ASSERT
     if (PrintInterpreter) {
       _hook(file, line, masm);
     }
#endif
     return masm;
   }

> http://cr.openjdk.java.net/~iklam/jdk12/8204267-print-interpreter-comments.v001/sample.txt 
>
>
>   0x00007f72c498d527: callq  0x00007f72c498d531             ;; 126:   
> __ call_VM(noreg,
>
>
> Should this say what it's calling?
>
>   0x00007f72c498d572: callq  0x00007f72e3df9ea0 = 
> InterpreterRuntime::slow_signature_handler(JavaThread*, Method*, 
> long*, long*)
>
>
> But this says what it's calling but I don't think this is part of your 
> change (or is it?)
>
Yes. When you see a call_VM, you have to scan down and see what it's 
actually calling. It's no ideal, but I don't know how to improve it 
without writing a lot of code ... which I want to avoid.

Thanks
- Ioi

> That's all the comments I have. I wish you did this many years ago also!
> Thanks,
> Coleen
>
> On 7/6/18 5:33 PM, Ioi Lam wrote:
>> Hi,
>>
>>
>> Since there's interest for this patch, I've finished it for the x86 
>> port:
>>
>> https://bugs.openjdk.java.net/browse/JDK-8204267
>> http://cr.openjdk.java.net/~iklam/jdk12/8204267-print-interpreter-comments.v001/ 
>>
>>
>> You can see a sample at
>>
>> http://cr.openjdk.java.net/~iklam/jdk12/8204267-print-interpreter-comments.v001/sample.txt 
>>
>>
>> Incorporating this to other CPU ports should be pretty straight 
>> forward. You just
>> need to do something like:
>>
>> -#define __ _masm->
>> +#define __ Disassembler::hook<MacroAssembler>(__FILE__, __LINE__, 
>> _masm)->
>>
>> I'll leave it to other CPU port maintainers.
>>
>> I am testing with hs_tier{1,2}.
>>
>> Thanks
>>
>> - Ioi
>>
>>
>> On 6/4/18 12:12 AM, Ioi Lam wrote:
>>> Hi Folks,
>>>
>>> I've found it very hard to understand the assembler code printed by 
>>> -XX:+PrintInterpreter. Since the C++ source code that generates the 
>>> interpreter is fairly well documented, it might be a good idea to 
>>> print out the source code along with the assembler code.
>>>
>>> I've done a quick proof-of-concept by hacking the "__" macro that's 
>>> used throughout the CPU-dependent sources.
>>>
>>> Please let me know if you think this is worth doing. If so, I will 
>>> try to finish it up and post a real RFR.
>>>
>>> Thanks
>>> - Ioi
>>>
>>>
>>> https://bugs.openjdk.java.net/browse/JDK-8204267
>>> http://cr.openjdk.java.net/~iklam/misc/8204267-print-interpreter-comments.v00 
>>>
>>> http://cr.openjdk.java.net/~iklam/misc/8204267-print-interpreter-comments.v00/hs_interp.S 
>>>
>>>
>>> Here are some examples (if the mailer screws up the long lines, 
>>> please click the above link):
>>>
>>> ifeq  153 ifeq  [0x00007f830cc93da0, 0x00007f830cc941c0] 1056 bytes
>>>
>>> mov    (%rsp),%eax
>>> add    $0x8,%rsp
>>> test   %eax,%eax           ;; 2353:   __ testl(rax, rax);
>>> jne    0x00007f830cc94177  ;; 2354:   __ jcc(j_not(cc), not_taken);
>>> mov    -0x18(%rbp),%rcx    ;; 2120:   __ get_method(rcx); // rcx 
>>> holds method
>>> mov    -0x28(%rbp),%rax    ;; 2121:   __ profile_taken_branch(rax, 
>>> rbx); // rax holds updated MDP, rbx
>>> test   %rax,%rax
>>> je     0x00007f830cc93dd8
>>> mov    0x8(%rax),%rbx
>>> add    $0x1,%rbx
>>> sbb    $0x0,%rbx
>>> mov    %rbx,0x8(%rax)
>>> add    0x10(%rax),%rax
>>> mov    %rax,-0x28(%rbp)
>>> movswl 0x1(%r13),%edx      ;; 2133:     __ load_signed_short(rdx, 
>>> at_bcp(1));
>>> bswap  %edx                ;; 2135:   __ bswapl(rdx);
>>> sar    $0x10,%edx          ;; 2138:     __ sarl(rdx, 16);
>>> movslq %edx,%rdx           ;; 2140:   LP64_ONLY(__ movl2ptr(rdx, rdx));
>>> add    %rdx,%r13           ;; 2164:   __ addptr(rbcp, rdx);
>>> test   %edx,%edx           ;; 2179:     __ testl(rdx, 
>>> rdx);             // check if forward or backward branch
>>> jns    0x00007f830cc93eec  ;; 2180:     __ jcc(Assembler::positive, 
>>> dispatch); // count only if backward branch
>>> mov    0x18(%rcx),%rax     ;; 2184:     __ movptr(rax, Address(rcx, 
>>> Method::method_counters_offset()));
>>> test   %rax,%rax           ;; 2185:     __ testptr(rax, rax);
>>> jne    0x00007f830cc93ead  ;; 2186:     __ jcc(Assembler::notZero, 
>>> has_counters);
>>> push   %rdx                ;; 2187:     __ push(rdx);
>>> push   %rcx                ;; 2188:     __ push(rcx);
>>> callq  0x00007f830cc93e09  ;; 2189:     __ call_VM(noreg, 
>>> CAST_FROM_FN_PTR(address, InterpreterRuntime::build_method_counters),
>>>
>>
>



More information about the hotspot-dev mailing list