RFR: 8201318: Introduce GCThreadLocalData to abstract GC-specific data belonging to a thread

Aleksey Shipilev shade at redhat.com
Tue Apr 10 16:50:40 UTC 2018


On 04/10/2018 05:59 PM, Robbin Ehn wrote:
> On 2018-04-10 17:47, Aleksey Shipilev wrote:
>> On 04/10/2018 05:25 PM, Robbin Ehn wrote:
>>> Had quick look, what I saw looked good. (not a full review)
>>> Is there a reason for moving the gc data to 'zero offset' in Thread?
>>
>> Oh! I missed that, and I fully agree with this move. At least one reason I see, smaller offsets
>> against TLS open up opportunities for denser code-generation when e.g. GC barriers poll thread-local
>> data. Right now SATB barrier generates something like "cmpb $0x0, 0x3d8(%r15)", while it could
>> generate just "cmpb $0x0, 0x0(%r15)" now :)
> 
> Yes, but it pushes down e.g.:
> 333   volatile void* _polling_page;                 // Thread local polling page
> Which may not matter, as long as it's on the first page I suppose.

I think the major concern should be the instruction size. On x86 what matters is what category
immediate offset falls into. Some hand-crafted assembly:

   0:	48 89 42 7f          	mov    %rax,0x7f(%rdx)
   4:	48 89 82 80 00 00 00 	mov    %rax,0x80(%rdx)
   b:	80 7a 7f 00          	cmpb   $0x0,0x7f(%rdx)
   f:	80 ba 80 00 00 00 00 	cmpb   $0x0,0x80(%rdx)
  16:	80 7a 7f 41          	cmpb   $0x41,0x7f(%rdx)
  1a:	80 ba 80 00 00 00 41 	cmpb   $0x41,0x80(%rdx)
  21:	f6 42 7f 00          	testb  $0x0,0x7f(%rdx)
  25:	f6 82 80 00 00 00 00 	testb  $0x0,0x80(%rdx)
  2c:	f6 42 7f 41          	testb  $0x41,0x7f(%rdx)
  30:	f6 82 80 00 00 00 41 	testb  $0x41,0x80(%rdx)


In our case, we want to pack the most used fields under first 128 bytes. Maybe we should put polling
page at offset 0, and trim GCTLD to 96 bytes?

Thanks,
-Aleksey


More information about the hotspot-dev mailing list