RFR: JDK-8152952: Allow G1 phase logging to use individual number of threads

Jon Masamitsu jon.masamitsu at oracle.com
Thu Mar 31 14:45:14 UTC 2016


On 03/31/2016 12:29 AM, Bengt Rutisson wrote:
>
> Hi Jon,
>
> Thanks a lot for looking at this!
>
> Comments inline.
>
> Updated webrev:
> http://cr.openjdk.java.net/~brutisso/8152952/webrev.04/
>
> Diff compared to last version:
> http://cr.openjdk.java.net/~brutisso/8152952/webrev.03-04.diff/
>
>
> On 2016-03-30 22:50, Jon Masamitsu wrote:
>>
>>
>> On 03/30/2016 07:27 AM, Bengt Rutisson wrote:
>>>
>>> One comment at the bottom of this email....
>>>
>>> On 2016-03-30 16:11, Bengt Rutisson wrote:
>>>>
>>>> Hi Thomas,
>>>>
>>>> Thanks for looking at this!
>>>>
>>>> On 2016-03-30 11:15, Thomas Schatzl wrote:
>>>>> Hi Bengt,
>>>>>
>>>>> On Tue, 2016-03-29 at 14:03 +0200, Bengt Rutisson wrote:
>>>>>> Hi everyone,
>>>>>>
>>>>>> Could I have a couple of reviews for this change?
>>>>>>
>>>>>> http://cr.openjdk.java.net/~brutisso/8152952/webrev.00/
>>>>>> https://bugs.openjdk.java.net/browse/JDK-8152952
>>>>>>
>>>>>> Currently if you run with UseDynamicNumberOfGCThreads you can
>>>>>> potentially get a different number of worker threads each GC. There
>>>>>> are improvements coming where we want to select a different 
>>>>>> number of
>>>>>> worker threads for individual phases. The G1GCPhaseTimes and
>>>>>> WorkerDataArray structures need to support this.
>>>>>>
>>>>>> The proposed patch sets all slots in the WorkerDataArray to an
>>>>>> uninitialized value and then only print any values that have 
>>>>>> actually
>>>>>> been set for that phase.
>>>>>>
>>>>>> The patch also extends the log message about the number for worker
>>>>>> threads to also say how many it could potentially have used. And it
>>>>>> also fixes a missing space in the level 3 and level 4 indentation.
>>>>>>
>>>>>> After applying this patch and running with -Xlog:gc*,phases*=trace
>>>>>> you get output like:
>>>>>> [0,581s][info][gc,task ] GC(0) GC Workers: using 2 out of 23
>>>>>> [0,588s][info][gc,phases] GC(0)   Evacuate Collection Set: 5,0ms
>>>>>> [0,588s][trace][gc,phases] GC(0)       GC Worker Start (ms):     
>>>>>> Min:
>>>>>> 580,9, Avg: 580,9, Max: 580,9, Diff:  0,0
>>>>> It would be useful to have the information about the number of 
>>>>> threads
>>>>> used for every top-level WorkerDataArray. That might differ for every
>>>>> phase in the future. Now it does not matter, because at the moment
>>>>> every thread at least sets the time spent to zero (i.e. is forced 
>>>>> to),
>>>>> but that will not be the case later.
>>>>>
>>>>> Not working on something is different to taking "zero" time for it.
>>>>>
>>>>> My suggestion is to add a ", Workers: X" column to the summary 
>>>>> output, like
>>>>>
>>>>>> [0,588s][debug][gc,phases     ] GC(0)     Ext Root Scanning (ms):
>>>>>> Min:  1,7, Avg:  1,7, Max:  1,8, Diff:  0,0, Sum:  3,5
>>>>> Min:  1,7, Avg:  1,7, Max:  1,8, Diff:  0,0, Sum:  3,5, Workers: 2
>>>>
>>>> I added that. The log now looks like this:
>>>>
>>>> [0,613s][debug][gc,phases     ] GC(0)     Ext Root Scanning (ms):   
>>>> Min:  2,4, Avg:  2,5, Max:  2,7, Diff:  0,3, Sum: 5,0, Workers: 2
>>>>
>>>>>
>>>>>> [0,589s][trace][gc,phases,task] GC(0)
>>>>>> 1,8  1,7   -   -   -   - -   -   -   -   -   -   -   - - -   -
>>>>>> -
>>>>>
>>>>>> [0,588s][trace][gc,phases,task] GC(0)
>>>>>> 580,9 580,9   -   -   - -   -   -   -   -   -   -   - - -   -   -
>>>>> - g1GCPhaseTimes.cpp:109: typo in ASSERT_PHASE_UNINITILAIZED, 
>>>>> should be
>>>>> ASSERT_PHASE_UNINITIALIZED.
>>>>
>>>> Fixed. Thanks for catching that! :)
>>>>
>>>>>
>>>>> - ignore the following if wanted: in g1GCPhaseTimes.cpp:127 the if
>>>>> -clause is structured to have only the asserts in the if-part. I 
>>>>> would
>>>>> prefer if the code that performs some useful work would be first, 
>>>>> i.e.
>>>>> the condition reversed. As mentioned, ymmv.
>>>>
>>>> Done.
>>>>
>>>>>
>>>>> - I would kind of prefer if ADD_WORKER_KNOWN time were a method 
>>>>> instead
>>>>> of a macro. In case of an uninitialized value that method could just
>>>>> return 0.0, which would be okay for this addition.
>>>>
>>>> Done.
>>>>
>>>>>
>>>>> - in WorkerDataArray::WDAPrinter::details, the "unknown value" imo 
>>>>> does
>>>>> not need to be padded out to five spaces or so. I think the main
>>>>> motivation for this has been to show the values of the different 
>>>>> phases
>>>>> of the evacuation phase in the same column.
>>>>>
>>>>> The details() method for the size_t values is not aligned at all.
>>>>>
>>>>> The main reason is waste of space. Not sure here, in doubt keep it.
>>>>
>>>> Removed the padding for both variants.
>>>>
>>>>>
>>>>> - I would prefer if the #include "memory/resourceArea.hpp" were put
>>>>> next to the other includes, of course guarded by the define.
>>>>
>>>> I prefer to keep it closer to the tests. If you have strong 
>>>> opinions I can move it up, but since it is only test related I 
>>>> prefer to have it there.
>>>>
>>>>>
>>>>> - in workerDataArray.hpp, at the definitions of sum() and 
>>>>> average() it
>>>>> might be useful to mention what impact on sum/average uninitialized
>>>>> values have.
>>>>
>>>> Added a comment.
>>>>
>>>>>
>>>>> - I would prefer, if a phase has no data, that this would be detected
>>>>> automatically and the phase either not printed, or (preferably)
>>>>> indicated that it has not been executed. (I could also live with a
>>>>> solution where the programmer can choose what happens when there 
>>>>> is no
>>>>> data. That would probably also remove the dependencies on a lot of
>>>>> other components, since as of right now there is quite a bit of
>>>>> checking for particular circumstances in G1GCPhaseTimes::print()).
>>>>>
>>>>> I do not really like the solution based on this change presented for
>>>>> JDK-8152428 that the programmer is responsible for explicitly
>>>>> specifying that a phase has not been executed. This will be 
>>>>> forgotten,
>>>>> and only causes unnecessary failures potentially long after a change
>>>>> has been committed.
>>>>>
>>>>> The code to print has to iterate over all elements to sum/average 
>>>>> them
>>>>> up anyway, so it already knows that there is no data for a particular
>>>>> phase.
>>>>>
>>>>> The reason why I prefer an indication that a phase has not been
>>>>> executed is that instead of a missing line that a missing line 
>>>>> easy to
>>>>> overlook, while a "not executed" line is much more visible.
>>>>
>>>> I've changed to do mostly what you suggested. A skipped phase now 
>>>> logs:
>>>>
>>>> [6,339s][trace][gc,phases      ] GC(79)       Parallel Preserve CM 
>>>> Refs (ms):: skipped
>>>
>>> The double ":" should not be there. The log actually looks like:
>>>
>>> [6,458s][trace][gc,phases      ] GC(82)       Parallel Preserve CM 
>>> Refs (ms): skipped
>>>
>>> And I should also have mentioned that this makes my other fix for 
>>> JDK-8152428 void. I will withdraw that fix if this patch gets approved.
>>>
>>> Thanks,
>>> Bengt
>>>
>>>>
>>>> But you can't control that by a configuration on the 
>>>> WorkerDataArray. On the other hand you can control it just like you 
>>>> did before, but guarding the call in the print() menthod. Like for 
>>>> example StringDeduplication does.
>>>>
>>>> Updated webrev:
>>>> http://cr.openjdk.java.net/~brutisso/8152952/webrev.02/
>>
>> Copyright
>
> Fixed.
>
>>
>> http://cr.openjdk.java.net/~brutisso/8152952/webrev.02/src/share/vm/gc/g1/workerDataArray.cpp.frames.html 
>>
>>
>> 157 const char* expected_string = "Test array Min: 5, Avg: 6,0, Max: 
>> 7, Diff: 2, Sum: 12, Workers: 2\n";
>>
>> The "6,0," should be "6.0" in the US.
>
>
> Yes, good catch. I noticed this also and fixed the tests yesterday as 
> you noted in your other email.
>
>
>>
>> Maybe you've already discussed this but the " - " in the details 
>> output doesn't seem to add much
>> for me.
>>
>> 56 void WorkerDataArray<double>::WDAPrinter::details(const 
>> WorkerDataArray<double>* phase, outputStream* out) {
>>   57   out->print("%-25s", "");
>> 58 for (uint i = 0; i < phase->_length; ++i) {
>> 59 double value = phase->get(i);
>> 60 if (value != phase->uninitialized()) {
>>   61       out->print(" %4.1lf", phase->get(i) * 1000.0);
>> 62 } else {
>> 63 out->print(" -");
>> 64 }
>>   65   }
>>
>> [1.345s][trace][gc,phases            ] GC(0)      GC Worker Start 
>> (ms):     Min: 1049.0, Avg: 1004.8, Max: 1056.2, Diff: 7.3
>> [1.345s][trace][gc,phases,task       ] 
>> GC(0)                                1049.0 1049.0 1049.0 1049.0 
>> 1049.0 1049.0 1049.0 1049.0 1049.0 1049.0 1049.0 1049.0 1049.0 1049.0 
>> 1049.0 1049.1 1049.1 1052.2 1052.2 1052.2 1056.2 1056.2 1056.2
>>
>> [0.505s][trace][gc,phases            ] GC(1)       GC Worker Start 
>> (ms):     Min: 501.4, Avg: 501.5, Max: 501.5, Diff:  0.0, Workers: 2
>> [0.505s][trace][gc,phases,task       ] 
>> GC(1)                                 501.4 501.5 - - - - - - - - - - 
>> - - - - - - - - - - -
>
> I think the " -" adds value when you consider that different threads 
> do different work. The out put might look like:
>
> [2,996s][trace][gc,phases      ] GC(26)       Thread Roots 
> (ms):        Min:  0.1, Avg:  0.15, Max:  0.2, Diff:  0.1, Sum: 0.3, 
> Workers: 2
> [2,996s][trace][gc,phases,task ] 
> GC(26)                                  - 0.2 - - - - 0.1 - - - - - - 
> - - - - - - - - - -
> [2,996s][trace][gc,phases      ] GC(26)       StringTable Roots 
> (ms):   Min:  0.1, Avg:  0.2, Max:  0.3, Diff:  0.2, Sum: 0.3, Workers: 3
> [2,996s][trace][gc,phases,task ] 
> GC(26)                                  - - - - - - - 0.3 - - - 0.2  
> 0.1 - - - - - - - - - -
>
> It is not very readable but you look closely it will at least be 
> possible to see which threads did what work.

Ok.  That's reasonable.

>
>>
>> Worker count (e.g. "Workers: 2" above) is only printed sometimes? 
>> Maybe Thomas already
>> commented on that?  I'd like it better if it was always printed.
>
> It was intended that it should always be printed. In what cases is it 
> not printed? From your example it looks like it is missing from the 
> "GC Worker Start" logging, but in my runs that line also get the 
> "Workers" info:
>
> [0,582s][trace][gc,phases] GC(0)       GC Worker Start (ms): Min: 
> 574,2, Avg: 574,3, Max: 574,3, Diff:  0,0, Workers: 2
> [0,582s][trace][gc,phases,task] GC(0)                                 
> 574,2 574,3 - - - - - - - - - - - - - - - - - - - - -
> [0,582s][debug][gc,phases     ] GC(0)     Ext Root Scanning (ms):   
> Min:  1,8, Avg:  1,8, Max:  1,8, Diff:  0,0, Sum:  3,5, Workers: 2
> [0,582s][trace][gc,phases,task] GC(0)                                
> 1,8  1,8 - - - - - - - - - - - - - - - - - - - - -
> [0,582s][trace][gc,phases     ] GC(0)       Thread Roots (ms):        
> Min:  0,0, Avg:  0,8, Max:  1,6, Diff:  1,6, Sum: 1,6, Workers: 2
> [0,582s][trace][gc,phases,task] GC(0)                                  
> 0,0  1,6 - - - - - - - - - - - - - - - - - - - - -

Ok.  I'll let you know if I see it again.  Might have been user error.

>
>
>> http://cr.openjdk.java.net/~brutisso/8152952/webrev.02/src/share/vm/gc/g1/workerDataArray.inline.hpp.frames.html 
>>
>>
>> Use of the variable name "active_threads" is going to confuse at
>> least me since that is used other places to mean the number of
>> threads trying to do work (not just the threads that actually
>> found work to do so had data).  "contributing_threads" is
>> a suggestion.
>>
>> 122 uint active_threads = 0;
>> 123 for (uint i = start; i < _length; ++i) {
>>  124       T value = get(i);
>> 125 if (value != uninitialized()) {
>>  126         max = MAX2(max, value);
>>  127         min = MIN2(min, value);
>>  128         sum += value;
>> 129 active_threads++;
>> 130 }
>>  131     }
>
> Changed to contributing_threads.

All looks good.

Thanks.

Jon

>
>>
>>
>> Rest looks good.
>
> Great. Thanks!
>
> Bengt
>
>>
>> Thanks.
>>
>> Jon
>>
>>>>
>>>> Diff compared to last version:
>>>> http://cr.openjdk.java.net/~brutisso/8152952/webrev.01-02.diff/
>>>>
>>>> Thanks,
>>>> Bengt
>>>>
>>>>>
>>>>> Thanks,
>>>>>    Thomas
>>>>>
>>>>
>>>
>>
>




More information about the hotspot-gc-dev mailing list