From github.com+25214855+casparcwang at openjdk.java.net Mon Feb 1 01:31:49 2021 From: github.com+25214855+casparcwang at openjdk.java.net (=?UTF-8?B?546L6LaF?=) Date: Mon, 1 Feb 2021 01:31:49 GMT Subject: [jdk16] Integrated: 8260473: [vector] ZGC: VectorReshape test produces incorrect results with ZGC enabled In-Reply-To: <5OfnHC5N00VVv3pWcU9gsAHa23RbAAX7ReEw9Ct6eug=.4f095083-7050-487d-94e0-3befce6744c5@github.com> References: <5OfnHC5N00VVv3pWcU9gsAHa23RbAAX7ReEw9Ct6eug=.4f095083-7050-487d-94e0-3befce6744c5@github.com> Message-ID: On Sat, 30 Jan 2021 12:02:25 GMT, ?? wrote: > https://bugs.openjdk.java.net/browse/JDK-8260473 > > Function "PhaseVector::expand_vunbox_node" creates a LoadNode, but forgets to make the LoadNode to pass gc barriers. > > Testing: all Vector API related tests have passed. > > Original pr: https://github.com/openjdk/jdk/pull/2253 This pull request has now been integrated. Changeset: 0fdf9cdd Author: casparcwang Committer: Jie Fu URL: https://git.openjdk.java.net/jdk16/commit/0fdf9cdd Stats: 174 lines in 2 files changed: 165 ins; 0 del; 9 mod 8260473: [vector] ZGC: VectorReshape test produces incorrect results with ZGC enabled Co-authored-by: Stuart Monteith Co-authored-by: Wang Chao Reviewed-by: vlivanov, neliasso ------------- PR: https://git.openjdk.java.net/jdk16/pull/139 From github.com+25214855+casparcwang at openjdk.java.net Mon Feb 1 01:45:46 2021 From: github.com+25214855+casparcwang at openjdk.java.net (=?UTF-8?B?546L6LaF?=) Date: Mon, 1 Feb 2021 01:45:46 GMT Subject: RFR: 8260473: [vector] ZGC: VectorReshape test produces incorrect results with ZGC enabled [v4] In-Reply-To: References: Message-ID: On Fri, 29 Jan 2021 16:47:53 GMT, Vladimir Ivanov wrote: >>> > ArrayCopyNode::load performs the same work as it does here in PhaseVector::optimize_vector_boxes . >>> > Is there a need to provide a similar function in PhaseVector or GraphKit? >>> >>> My point is since PhaseVector effectively enters the parsing phase (by signaling about the possibility of post-parse inlining), technically I don't see why `GraphKit::access_load_at` won't work. But I need to spend more time looking into the details. >>> >>> So far, I took a look at the review thread of 8212243 (which introduced `ArrayCopyNode::load`) and found the following discussion between Roland and Erik: >>> https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2018-October/030971.html >>> >>> ``` >>> > ... Also it beats me that this is strictly speaking a load barrier for loads performed in >>> > arraycopy. Would it be possible to use something like access_load_at instead? ... >>> ... >>> GraphKit is a parse time only thing. So the existing gc interface >>> doesn't offer any way to add barriers once parsing is over. This code >>> runs after parsing in optimization phases. >>> ... >>> ``` >>> >>> Considering `PhaseVector::optimize_vector_boxes()` already has access to a usable `GraphKit` instance, it is possible that `GraphKit::access_load_at` will "just work". >> >> As far as I can see, during the parse phase, GraphKit contains the jvm state info which can be used to get the control and memory for creating new nodes. But during optimization, the jvm state info may be missing like the situation in `PhaseVector::optimize_vector_boxes` or Macro Expansion. So it should use C2OptAccess to create the Load Node directly by providing control and memory nodes. >> >> I think a similar api like `GraphKit::access_load_at ` should be provided for usage during optimization stages, but where should the API be placed? 
GraphKit or PhaseIterGVN or somewhere else? > >> As far as I can see, during the parse phase, GraphKit contains the jvm state info which can be used to get the control and memory for creating new nodes. But during optimization, the jvm state info may be missing like the situation in PhaseVector::optimize_vector_boxes or Macro Expansion. > > JVM state is irrelevant here (otherwise, `VectorUnbox` node would have captured relevant info during construction). What is actually missing is `GraphKit` instance lacks info about control and memory. You need to explicitly set it using `GraphKit::set_control()` and `GraphKit::set_all_memory()`. Thanks @iwanowww @neliasso @pliden @stooart-mo @XiaohongGong @fisk @DamonFool for the reviews and helping. The patch has integrated in jdk16 (https://github.com/openjdk/jdk16/pull/139), and this pr should be closed. ------------- PR: https://git.openjdk.java.net/jdk/pull/2253 From github.com+25214855+casparcwang at openjdk.java.net Mon Feb 1 01:45:45 2021 From: github.com+25214855+casparcwang at openjdk.java.net (=?UTF-8?B?546L6LaF?=) Date: Mon, 1 Feb 2021 01:45:45 GMT Subject: RFR: 8260473: [vector] ZGC: VectorReshape test produces incorrect results with ZGC enabled [v4] In-Reply-To: References: <6nZPJh_IZbeLrS2D1lrwq7NIIry0zGQ8EzAXD6fkSrE=.4b476693-5877-434e-9e97-b26f73870e33@github.com> Message-ID: On Fri, 29 Jan 2021 16:43:54 GMT, Vladimir Ivanov wrote: > > I suggest you keep this CR as it is since 16 is in rampdown and we need to get approval and push it before Feb 4th (and we do want some margin). > > I agree. @casparcwang, please, file an RFE. Jie Fu @DamonFool has helped to create an RFE. https://bugs.openjdk.java.net/browse/JDK-8260682 ------------- PR: https://git.openjdk.java.net/jdk/pull/2253 From github.com+25214855+casparcwang at openjdk.java.net Mon Feb 1 01:45:46 2021 From: github.com+25214855+casparcwang at openjdk.java.net (=?UTF-8?B?546L6LaF?=) Date: Mon, 1 Feb 2021 01:45:46 GMT Subject: Withdrawn: 8260473: [vector] ZGC: VectorReshape test produces incorrect results with ZGC enabled In-Reply-To: References: Message-ID: On Wed, 27 Jan 2021 10:05:56 GMT, ?? wrote: > https://bugs.openjdk.java.net/browse/JDK-8260473 > > Function "PhaseVector::expand_vunbox_node" creates a LoadNode, but forgets to make the LoadNode to pass gc barriers. > > > Testing: all Vector API related tests have passed. This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.java.net/jdk/pull/2253 From shade at openjdk.java.net Mon Feb 1 08:52:41 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Mon, 1 Feb 2021 08:52:41 GMT Subject: Integrated: 8260591: Shenandoah: improve parallelism for concurrent thread root scans In-Reply-To: References: Message-ID: On Thu, 28 Jan 2021 14:04:07 GMT, Aleksey Shipilev wrote: > Following JDK-8256298, there are a few minor performance issues with the implementation. > > First, in the spirit of JDK-8246100, we should be scanning the Java threads the last, as they have the most parallelism. Less parallel, or lightweight roots should be scanned before them to improve overall parallelism. > > Second, claiming each thread dominates the per-thread processing cost. We should really be doing chunked processing. > > Motivating example is SPECjvm2008 serial, which has very fast concurrent cycles, and thread root scan speed is important. 
> > Before: > # Baseline > [56.176s][info][gc,stats] Concurrent Mark Roots = 0.308 s (a = 1452 us) (n = 212) (lvls, us = 305, 398, 457, 719, 11216) > [56.176s][info][gc,stats] CMR: = 1.236 s (a = 5832 us) (n = 212) (lvls, us = 2676, 3535, 4199, 5391, 54522) > [56.176s][info][gc,stats] CMR: Thread Roots = 1.179 s (a = 5563 us) (n = 212) (lvls, us = 2441, 3242, 3945, 5156, 54288) > [56.176s][info][gc,stats] CMR: VM Strong Roots = 0.005 s (a = 23 us) (n = 212) (lvls, us = 12, 19, 21, 23, 204) > [56.176s][info][gc,stats] CMR: CLDG Roots = 0.052 s (a = 247 us) (n = 212) (lvls, us = 73, 203, 252, 293, 562) > > ... > [56.176s][info][gc,stats] Concurrent Stack Processing = 0.124 s (a = 5149 us) (n = 24) (lvls, us = 535, 607, 885, 6387, 27177) > [56.176s][info][gc,stats] Threads = 0.632 s (a = 26345 us) (n = 24) (lvls, us = 6465, 8086, 10742, 39453, 145679) > [56.176s][info][gc,stats] CT: = 0.632 s (a = 26345 us) (n = 24) (lvls, us = 6465, 8086, 10742, 39453, 145679) > > After: > [56.010s][info][gc,stats] Concurrent Mark Roots = 0.116 s (a = 587 us) (n = 198) (lvls, us = 312, 371, 400, 502, 4316) > [56.010s][info][gc,stats] CMR: = 0.931 s (a = 4703 us) (n = 198) (lvls, us = 2402, 3438, 3770, 4453, 62629) > [56.010s][info][gc,stats] CMR: Thread Roots = 0.864 s (a = 4366 us) (n = 198) (lvls, us = 1914, 3125, 3477, 4199, 54075) > [56.010s][info][gc,stats] CMR: VM Strong Roots = 0.015 s (a = 76 us) (n = 198) (lvls, us = 20, 31, 35, 38, 4693) > [56.010s][info][gc,stats] CMR: CLDG Roots = 0.052 s (a = 261 us) (n = 198) (lvls, us = 61, 172, 256, 299, 3861) > ... > [56.010s][info][gc,stats] Concurrent Stack Processing = 0.081 s (a = 3671 us) (n = 22) (lvls, us = 457, 537, 770, 3359, 24003) > [56.010s][info][gc,stats] Threads = 0.469 s (a = 21309 us) (n = 22) (lvls, us = 6016, 6855, 8711, 18945, 103939) > [56.010s][info][gc,stats] CT: = 0.469 s (a = 21309 us) (n = 22) (lvls, us = 6016, 6855, 8711, 18945, 103939) This pull request has now been integrated. Changeset: ab727f0a Author: Aleksey Shipilev URL: https://git.openjdk.java.net/jdk/commit/ab727f0a Stats: 39 lines in 3 files changed: 20 ins; 7 del; 12 mod 8260591: Shenandoah: improve parallelism for concurrent thread root scans Reviewed-by: zgu, rkennke ------------- PR: https://git.openjdk.java.net/jdk/pull/2290 From shade at openjdk.java.net Mon Feb 1 09:14:44 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Mon, 1 Feb 2021 09:14:44 GMT Subject: RFR: 8260309: Shenandoah: Clean up ShenandoahBarrierSet [v2] In-Reply-To: References: <5t_ZDBfj_4BxoJLoWh3R0r6OCh2Q0wc-DNJntvfhW1Q=.925a092e-c1d3-41df-b216-1cbb0b936959@github.com> Message-ID: On Fri, 29 Jan 2021 14:45:50 GMT, Roman Kennke wrote: >> We collected some cruft in ShenandoahBarrierSet. Time to clean it up. >> >> This fixes/removes a number of includes, fixes some comments and it also removes is_a() and is_aligned() which look like leftovers/requirements from earlier incarnations of the superclass BarrierSet. Using the override keyword would be useful for such situations (btw, are we ok to start using override, nullptr, auto etc in Shenandoah, or do we want to keep it C++ for backporting ease?) >> >> One thing I was not sure about is the ShenandoahHeap* _heap field. Making it const will likely help the compiler avoid repeated access (e.g. in a number of perf-critical paths like the LRB impl). However, maybe we should get rid of the field altogether and make it explicitely using ShenandoahHeap::heap() and avoid repeated access instead of helping the compiler and hoping for the best? 
>> >> Testing: >> - [x] hotspot_gc_shenandoah release, fastdebug > > Roman Kennke has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains three commits: > > - Restore some changes that have been lost during merge > - Merge branch 'master' into JDK-8260309 > - 8260309: Shenandoah: Clean up ShenandoahBarrierSet Looks fine, but I have a minor question. src/hotspot/share/gc/shenandoah/shenandoahBarrierSet.inline.hpp line 28: > 26: #define SHARE_GC_SHENANDOAH_SHENANDOAHBARRIERSET_INLINE_HPP > 27: > 28: #include "gc/shared/accessBarrierSupport.hpp" Should it be `accessBarrierSupport.inline.hpp`? Other `*BarrierSet.inline.hpp`-s seem to include that. ------------- Marked as reviewed by shade (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2202 From tschatzl at openjdk.java.net Mon Feb 1 09:46:41 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Mon, 1 Feb 2021 09:46:41 GMT Subject: RFR: 8258508: Merge G1RedirtyCardsQueue into qset In-Reply-To: References: Message-ID: On Sat, 30 Jan 2021 10:14:42 GMT, Kim Barrett wrote: > Please review this change to G1RedirtyCardsLocalQueueSet to directly > incorporate the associated queue, simplifying usage. > > Testing: > mach5 tier1 Lgtm. ------------- Marked as reviewed by tschatzl (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2325 From sjohanss at openjdk.java.net Mon Feb 1 09:54:39 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Mon, 1 Feb 2021 09:54:39 GMT Subject: RFR: 8217327: G1 Post-Cleanup region liveness printing should not print out-of-date efficiency [v4] In-Reply-To: <95B6j1ZSceUGfTTDsZfF3a5ZbggYlBiv9WJkHKkzO0w=.edd53e67-02ae-4c8a-ae0f-3a50c7ac0676@github.com> References: <95B6j1ZSceUGfTTDsZfF3a5ZbggYlBiv9WJkHKkzO0w=.edd53e67-02ae-4c8a-ae0f-3a50c7ac0676@github.com> Message-ID: On Thu, 28 Jan 2021 12:48:55 GMT, Joakim Nordstr?m wrote: >> **Description** >> This fix addresses the issue where gc-efficiency is printed incorrectly when logging post-marking and post-cleanup. The gc-efficiency is calculated in the end of the marking phase, to be logged in the post-cleanup section. It is however not reset, meaning that next phase's post-marking log will show the old value. >> >> - The gc-efficiency is initialized to -1 when it hasn't been calculated. >> - Negative gc-efficiency is displayed as a hyphen "-" in the summary. >> - The gc-efficiency is reset to -1 in `HeapRegion::note_start_of_marking()` >> >> **Note:** there is a sister issue that moves the post-cleanup printing to a later stage. Without this fix, the logging will still be incorrect, so both fixes are needed. See: [JDK-8260042: G1 Post-cleanup liveness printing occurs too early](https://github.com/openjdk/jdk/pull/2168) >> >> This fix has been tested together with the above mentioned fix. >> >> **Example** >> This is what logging like after fix has been applied. 
>> ### PHASE Post-Marking @ 410.303 >> ### HEAP reserved: 0x0ffc00000-0x100000000 region-size: 1048576 >> ### >> ### type address-range used prev-live next-live gc-eff remset state code-roots >> ### (bytes) (bytes) (bytes) (bytes/ms) (bytes) (bytes) >> ### OLD 0x0ffc00000-0x0ffd00000 1048368 1048368 1048368 - 8464 UPDAT 6096 >> ### OLD 0x0ffd00000-0x0ffe00000 132856 132856 132856 - 2544 UPDAT 16 >> ### SURV 0x0ffe00000-0x0fff00000 21368 21368 21368 - 2544 CMPLT 16 >> ### FREE 0x0fff00000-0x100000000 0 0 0 - 2384 UNTRA 16 >> ### >> ### SUMMARY capacity: 4.00 MB used: 1.15 MB / 28.67 % prev-live: 1.15 MB / 28.67 % next-live: 1.15 MB / 28.67 % remset: 0.02 MB code-roots: 0.01 MB >> ### PHASE Post-Cleanup @ 410.305 >> ### HEAP reserved: 0x0ffc00000-0x100000000 region-size: 1048576 >> ### >> ### type address-range used prev-live next-live gc-eff remset state code-roots >> ### (bytes) (bytes) (bytes) (bytes/ms) (bytes) (bytes) >> ### OLD 0x0ffc00000-0x0ffd00000 1048368 1048368 1048368 - 8624 UNTRA 6096 >> ### OLD 0x0ffd00000-0x0ffe00000 132856 132856 132856 1352923.9 2544 CMPLT 16 >> ### SURV 0x0ffe00000-0x0fff00000 21368 21368 21368 - 2544 CMPLT 16 >> ### FREE 0x0fff00000-0x100000000 0 0 0 - 2384 UNTRA 16 >> ### >> ### SUMMARY capacity: 4.00 MB used: 1.15 MB / 28.67 % prev-live: 1.15 MB / 28.67 % next-live: 1.15 MB / 28.67 % remset: 0.02 MB code-roots: 0.01 MB >> ### PHASE Post-Marking @ 450.310 >> ### HEAP reserved: 0x0ffc00000-0x100000000 region-size: 1048576 >> ### >> ### type address-range used prev-live next-live gc-eff remset state code-roots >> ### (bytes) (bytes) (bytes) (bytes/ms) (bytes) (bytes) >> ### OLD 0x0ffc00000-0x0ffd00000 1048368 1048368 1048368 - 8624 UPDAT 6096 >> ### OLD 0x0ffd00000-0x0ffe00000 174456 174456 174456 - 2544 UPDAT 16 >> ### SURV 0x0ffe00000-0x0fff00000 21368 21368 21368 - 2544 CMPLT 16 >> ### FREE 0x0fff00000-0x100000000 0 0 0 - 2384 UNTRA 16 >> ### >> ### SUMMARY capacity: 4.00 MB used: 1.19 MB / 29.66 % prev-live: 1.19 MB / 29.66 % next-live: 1.19 MB / 29.66 % remset: 0.02 MB code-roots: 0.01 MB >> ### PHASE Post-Cleanup @ 450.312 >> ### HEAP reserved: 0x0ffc00000-0x100000000 region-size: 1048576 >> ### >> ### type address-range used prev-live next-live gc-eff remset state code-roots >> ### (bytes) (bytes) (bytes) (bytes/ms) (bytes) (bytes) >> ### OLD 0x0ffc00000-0x0ffd00000 1048368 1048368 1048368 - 8624 UNTRA 6096 >> ### OLD 0x0ffd00000-0x0ffe00000 174456 174456 174456 1266519.2 2544 CMPLT 16 >> ### SURV 0x0ffe00000-0x0fff00000 21368 21368 21368 - 2544 CMPLT 16 >> ### FREE 0x0fff00000-0x100000000 0 0 0 - 2384 UNTRA 16 >> ### >> >> **Testing** >> - Manual testing >> - hs-tier1, hs-tier2 > > Joakim Nordstr?m has updated the pull request incrementally with one additional commit since the last revision: > > Using FormatBuffer instead of snprintf. Changed defines to more descriptive names. Looks good. ------------- Marked as reviewed by sjohanss (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2217 From iwalulya at openjdk.java.net Mon Feb 1 09:57:45 2021 From: iwalulya at openjdk.java.net (Ivan Walulya) Date: Mon, 1 Feb 2021 09:57:45 GMT Subject: RFR: 8260044: Parallel GC: Concurrent allocation after heap expansion may cause unnecessary full gc [v2] In-Reply-To: References: Message-ID: On Sat, 30 Jan 2021 06:09:01 GMT, Kim Barrett wrote: >> Please review this change to ParallelGC to avoid unnecessary full GCs when >> concurrent threads attempt oldgen allocations during evacuation. 
>> >> When a GC thread fails an oldgen allocation it expands the heap and retries >> the allocation. If the second allocation attempt fails then allocation >> failure is reported to the caller, which can lead to a full GC. But the >> retried allocation could fail because, after expansion, some other thread >> allocated enough of the available space that the retry fails. This can >> happen even though there is plenty of space available, if only that retry >> were to perform another expansion. >> >> Rather than trying to combine the allocation retry with the expansion (it's >> not clear there's a way to do so without breaking invariants), we instead >> simply loop on the allocation attempt + expand, until either the allocation >> succeeds or the expand fails. If some other thread "steals" space from the >> expanding thread and causes its next allocation attempt to fail and do >> another expansion, that's functionally no different from the expanding >> thread succeeding and causing the other thread to fail allocation and do the >> expand instead. >> >> This change includes modifying PSOldGen::expand_to_reserved to return false >> when there is no space available, where it previously returned true. It's >> not clear why it returned true; that seems wrong, but was harmless. But it >> must not do so with the new looping behavior for allocation, else it would >> never terminate. >> >> Testing: >> mach5 tier1-3, tier5 (tier2-3, 5 do a lot of ParallelGC testing) > > Kim Barrett has updated the pull request incrementally with one additional commit since the last revision: > > require non-zero expand size Lgtm! ------------- Marked as reviewed by iwalulya (Committer). PR: https://git.openjdk.java.net/jdk/pull/2309 From sjohanss at openjdk.java.net Mon Feb 1 09:57:46 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Mon, 1 Feb 2021 09:57:46 GMT Subject: RFR: 8260044: Parallel GC: Concurrent allocation after heap expansion may cause unnecessary full gc [v2] In-Reply-To: References: Message-ID: On Sat, 30 Jan 2021 06:09:01 GMT, Kim Barrett wrote: >> Please review this change to ParallelGC to avoid unnecessary full GCs when >> concurrent threads attempt oldgen allocations during evacuation. >> >> When a GC thread fails an oldgen allocation it expands the heap and retries >> the allocation. If the second allocation attempt fails then allocation >> failure is reported to the caller, which can lead to a full GC. But the >> retried allocation could fail because, after expansion, some other thread >> allocated enough of the available space that the retry fails. This can >> happen even though there is plenty of space available, if only that retry >> were to perform another expansion. >> >> Rather than trying to combine the allocation retry with the expansion (it's >> not clear there's a way to do so without breaking invariants), we instead >> simply loop on the allocation attempt + expand, until either the allocation >> succeeds or the expand fails. If some other thread "steals" space from the >> expanding thread and causes its next allocation attempt to fail and do >> another expansion, that's functionally no different from the expanding >> thread succeeding and causing the other thread to fail allocation and do the >> expand instead. >> >> This change includes modifying PSOldGen::expand_to_reserved to return false >> when there is no space available, where it previously returned true. It's >> not clear why it returned true; that seems wrong, but was harmless. 
But it >> must not do so with the new looping behavior for allocation, else it would >> never terminate. >> >> Testing: >> mach5 tier1-3, tier5 (tier2-3, 5 do a lot of ParallelGC testing) > > Kim Barrett has updated the pull request incrementally with one additional commit since the last revision: > > require non-zero expand size Looks good! ------------- Marked as reviewed by sjohanss (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2309 From iwalulya at openjdk.java.net Mon Feb 1 10:23:49 2021 From: iwalulya at openjdk.java.net (Ivan Walulya) Date: Mon, 1 Feb 2021 10:23:49 GMT Subject: RFR: 8258508: Merge G1RedirtyCardsQueue into qset In-Reply-To: References: Message-ID: <4HRS-gf7zltSRZ6CmxYhrpDOcqPjXVqHXHlbxQUPJ6M=.b1a935fe-c9f5-4fb1-8102-882904205bfb@github.com> On Sat, 30 Jan 2021 10:14:42 GMT, Kim Barrett wrote: > Please review this change to G1RedirtyCardsLocalQueueSet to directly > incorporate the associated queue, simplifying usage. > > Testing: > mach5 tier1 Looks good! ------------- Marked as reviewed by iwalulya (Committer). PR: https://git.openjdk.java.net/jdk/pull/2325 From rkennke at openjdk.java.net Mon Feb 1 11:00:59 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 1 Feb 2021 11:00:59 GMT Subject: RFR: 8260309: Shenandoah: Clean up ShenandoahBarrierSet [v3] In-Reply-To: <5t_ZDBfj_4BxoJLoWh3R0r6OCh2Q0wc-DNJntvfhW1Q=.925a092e-c1d3-41df-b216-1cbb0b936959@github.com> References: <5t_ZDBfj_4BxoJLoWh3R0r6OCh2Q0wc-DNJntvfhW1Q=.925a092e-c1d3-41df-b216-1cbb0b936959@github.com> Message-ID: <2u_BCcnM4QkcVVj6MVeFDfDgjB789ouIQuBfqY5p6vo=.2a63e65b-4fcd-4c79-9623-ac203c3ba056@github.com> > We collected some cruft in ShenandoahBarrierSet. Time to clean it up. > > This fixes/removes a number of includes, fixes some comments and it also removes is_a() and is_aligned() which look like leftovers/requirements from earlier incarnations of the superclass BarrierSet. Using the override keyword would be useful for such situations (btw, are we ok to start using override, nullptr, auto etc in Shenandoah, or do we want to keep it C++ for backporting ease?) > > One thing I was not sure about is the ShenandoahHeap* _heap field. Making it const will likely help the compiler avoid repeated access (e.g. in a number of perf-critical paths like the LRB impl). However, maybe we should get rid of the field altogether and make it explicitely using ShenandoahHeap::heap() and avoid repeated access instead of helping the compiler and hoping for the best? 
> > Testing: > - [x] hotspot_gc_shenandoah release, fastdebug Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: Include accessBarrierSupport.inline.hpp instead of accessBarrierSupport.hpp ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2202/files - new: https://git.openjdk.java.net/jdk/pull/2202/files/bd7da1e2..5f68a73b Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2202&range=02 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2202&range=01-02 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2202.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2202/head:pull/2202 PR: https://git.openjdk.java.net/jdk/pull/2202 From rkennke at openjdk.java.net Mon Feb 1 11:01:02 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 1 Feb 2021 11:01:02 GMT Subject: RFR: 8260309: Shenandoah: Clean up ShenandoahBarrierSet [v2] In-Reply-To: References: <5t_ZDBfj_4BxoJLoWh3R0r6OCh2Q0wc-DNJntvfhW1Q=.925a092e-c1d3-41df-b216-1cbb0b936959@github.com> Message-ID: On Mon, 1 Feb 2021 09:06:42 GMT, Aleksey Shipilev wrote: >> Roman Kennke has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains three commits: >> >> - Restore some changes that have been lost during merge >> - Merge branch 'master' into JDK-8260309 >> - 8260309: Shenandoah: Clean up ShenandoahBarrierSet > > src/hotspot/share/gc/shenandoah/shenandoahBarrierSet.inline.hpp line 28: > >> 26: #define SHARE_GC_SHENANDOAH_SHENANDOAHBARRIERSET_INLINE_HPP >> 27: >> 28: #include "gc/shared/accessBarrierSupport.hpp" > > Should it be `accessBarrierSupport.inline.hpp`? Other `*BarrierSet.inline.hpp`-s seem to include that. Right. I changed that. Thanks! ------------- PR: https://git.openjdk.java.net/jdk/pull/2202 From vlivanov at openjdk.java.net Mon Feb 1 11:38:49 2021 From: vlivanov at openjdk.java.net (Vladimir Ivanov) Date: Mon, 1 Feb 2021 11:38:49 GMT Subject: [jdk16] RFR: 8260473: [vector] ZGC: VectorReshape test produces incorrect results with ZGC enabled In-Reply-To: References: <5OfnHC5N00VVv3pWcU9gsAHa23RbAAX7ReEw9Ct6eug=.4f095083-7050-487d-94e0-3befce6744c5@github.com> Message-ID: On Sun, 31 Jan 2021 00:41:11 GMT, Jie Fu wrote: > compileonly and compilercount=1 will let the VM run slow enough to wait for a gc to be finished. That's a strange way to provoke the bug. You could just increase the number of iterations instead. But the right way to fix it is to stress ZGC to continuously run in the background while the test case aggressively unboxes vectors in compiled code. `-Xmx256m` helps with that while `-XX:CICompilerCount=1` is irrelevant. ------------- PR: https://git.openjdk.java.net/jdk16/pull/139 From tschatzl at openjdk.java.net Mon Feb 1 11:57:48 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Mon, 1 Feb 2021 11:57:48 GMT Subject: RFR: 8260643: Remove parallel version handling in CardTableRS::younger_refs_in_space_iterate() Message-ID: Hi all, can I have reviews for this change that removes parallel handling in `CardTableRS::younger_refs_in_space_iterate` as it is always called with n_threads <= 1, making the parallel code handling there obsolete. A larger cleanup of `CardTableRS` will follow in JDK-8234534. 
Testing: tier1,2 ------------- Commit messages: - Initial commit Changes: https://git.openjdk.java.net/jdk/pull/2333/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2333&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8260643 Stats: 103 lines in 7 files changed: 3 ins; 72 del; 28 mod Patch: https://git.openjdk.java.net/jdk/pull/2333.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2333/head:pull/2333 PR: https://git.openjdk.java.net/jdk/pull/2333 From github.com+25214855+casparcwang at openjdk.java.net Mon Feb 1 12:10:45 2021 From: github.com+25214855+casparcwang at openjdk.java.net (=?UTF-8?B?546L6LaF?=) Date: Mon, 1 Feb 2021 12:10:45 GMT Subject: [jdk16] RFR: 8260473: [vector] ZGC: VectorReshape test produces incorrect results with ZGC enabled In-Reply-To: References: <5OfnHC5N00VVv3pWcU9gsAHa23RbAAX7ReEw9Ct6eug=.4f095083-7050-487d-94e0-3befce6744c5@github.com> Message-ID: <_Wm-fi9j4TZ41F0G_92f7ioKQeDNgZiOEMmLkZ0lvvE=.0a9beba5-5089-4368-b4bc-73faf9d5e858@github.com> On Mon, 1 Feb 2021 11:35:13 GMT, Vladimir Ivanov wrote: > > compileonly and compilercount=1 will let the VM run slow enough to wait for a gc to be finished. > > That's a strange way to provoke the bug. You could just increase the number of iterations instead. > > But the right way to fix it is to stress ZGC to continuously run in the background while the test case aggressively unboxes vectors in compiled code. `-Xmx256m` helps with that while `-XX:CICompilerCount=1` is irrelevant. Yes, it's very weird to provoke the bug like this. If CICompilerCount=1 is removed, the test failed 60% roughly on my machine. And the iteration has already changed from 100 to 1000, the run time of the test is nearly 30s on release version of jvm. If I add the following patch, the test always fails on my machine, diff --git a/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java b/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java index 1843ec0..959b29a 100644 --- a/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java +++ b/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java @@ -44,7 +44,7 @@ import jdk.internal.vm.annotation.ForceInline; * @modules jdk.incubator.vector * @modules java.base/jdk.internal.vm.annotation * @run testng/othervm -XX:CompileCommand=compileonly,jdk/incubator/vector/ByteVector.fromByteBuffer - * -XX:-TieredCompilation -XX:CICompilerCount=1 -XX:+UseZGC -Xbatch -Xmx256m VectorRebracket128Test + * -XX:-TieredCompilation -XX:+UseZGC -Xmx256m VectorRebracket128Test */ @Test @@ -125,6 +125,14 @@ public class VectorRebracket128Test { @ForceInline static void testVectorRebracket(VectorSpecies a, VectorSpecies b, byte[] input, byte[] output) { + new Thread(() -> { + while (true) { + try { + System.gc(); + Thread.sleep(100); + } catch (Exception e) {} + } + }).start(); Vector av = a.fromByteArray(input, 0, ByteOrder.nativeOrder()); int block; assert(input.length == output.length); ------------- PR: https://git.openjdk.java.net/jdk16/pull/139 From github.com+25214855+casparcwang at openjdk.java.net Mon Feb 1 12:18:44 2021 From: github.com+25214855+casparcwang at openjdk.java.net (=?UTF-8?B?546L6LaF?=) Date: Mon, 1 Feb 2021 12:18:44 GMT Subject: [jdk16] RFR: 8260473: [vector] ZGC: VectorReshape test produces incorrect results with ZGC enabled In-Reply-To: <_Wm-fi9j4TZ41F0G_92f7ioKQeDNgZiOEMmLkZ0lvvE=.0a9beba5-5089-4368-b4bc-73faf9d5e858@github.com> References: 
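For illustration only, the shape of this kind of cleanup is roughly the following simplified sketch; the names are hypothetical and this is not the actual CardTableRS code. When every caller passes n_threads <= 1, the runtime branch that selects a parallel path is dead and only the serial path needs to remain:

#include <cassert>

struct Space {};     // hypothetical stand-ins for the real HotSpot types
struct Closure {};

// Before: a serial and a parallel path, selected by n_threads at runtime.
void younger_refs_iterate_before(Space* sp, Closure* cl, unsigned n_threads) {
  if (n_threads > 1) {
    // ... claim card-table chunks and process them with n_threads workers ...
  } else {
    // ... walk the dirty cards covering 'sp' serially and apply 'cl' ...
  }
}

// After: only the serial path remains; an assert documents the contract.
void younger_refs_iterate_after(Space* sp, Closure* cl, unsigned n_threads) {
  assert(n_threads <= 1 && "only ever called single-threaded");
  // ... walk the dirty cards covering 'sp' serially and apply 'cl' ...
}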
<5OfnHC5N00VVv3pWcU9gsAHa23RbAAX7ReEw9Ct6eug=.4f095083-7050-487d-94e0-3befce6744c5@github.com> <_Wm-fi9j4TZ41F0G_92f7ioKQeDNgZiOEMmLkZ0lvvE=.0a9beba5-5089-4368-b4bc-73faf9d5e858@github.com> Message-ID: On Mon, 1 Feb 2021 12:06:26 GMT, ?? wrote: >>> compileonly and compilercount=1 will let the VM run slow enough to wait for a gc to be finished. >> >> That's a strange way to provoke the bug. You could just increase the number of iterations instead. >> >> But the right way to fix it is to stress ZGC to continuously run in the background while the test case aggressively unboxes vectors in compiled code. `-Xmx256m` helps with that while `-XX:CICompilerCount=1` is irrelevant. > >> > compileonly and compilercount=1 will let the VM run slow enough to wait for a gc to be finished. >> >> That's a strange way to provoke the bug. You could just increase the number of iterations instead. >> >> But the right way to fix it is to stress ZGC to continuously run in the background while the test case aggressively unboxes vectors in compiled code. `-Xmx256m` helps with that while `-XX:CICompilerCount=1` is irrelevant. > > Yes, it's very weird to provoke the bug like this. If CICompilerCount=1 is removed, the test failed 60% roughly on my machine. > And the iteration has already changed from 100 to 1000, the run time of the test is nearly 30s on release version of jvm. > > If I add the following patch, the test always fails on my machine, > > diff --git a/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java b/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java > index 1843ec0..959b29a 100644 > --- a/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java > +++ b/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java > @@ -44,7 +44,7 @@ import jdk.internal.vm.annotation.ForceInline; > * @modules jdk.incubator.vector > * @modules java.base/jdk.internal.vm.annotation > * @run testng/othervm -XX:CompileCommand=compileonly,jdk/incubator/vector/ByteVector.fromByteBuffer > - * -XX:-TieredCompilation -XX:CICompilerCount=1 -XX:+UseZGC -Xbatch -Xmx256m VectorRebracket128Test > + * -XX:-TieredCompilation -XX:+UseZGC -Xmx256m VectorRebracket128Test > */ > > @Test > @@ -125,6 +125,14 @@ public class VectorRebracket128Test { > @ForceInline > static > void testVectorRebracket(VectorSpecies a, VectorSpecies b, byte[] input, byte[] output) { > + new Thread(() -> { > + while (true) { > + try { > + System.gc(); > + Thread.sleep(100); > + } catch (Exception e) {} > + } > + }).start(); > Vector av = a.fromByteArray(input, 0, ByteOrder.nativeOrder()); > int block; > assert(input.length == output.length); sorry for the wrong patch above, the failed reason of the patch above is due to stack creation failure (create 1000 threads). The following is the right stress gc patch. 
diff --git a/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java b/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java index 6b266db..a761ea2 100644 --- a/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java +++ b/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java @@ -44,7 +44,7 @@ import jdk.internal.vm.annotation.ForceInline; * @modules jdk.incubator.vector * @modules java.base/jdk.internal.vm.annotation * @run testng/othervm -XX:CompileCommand=compileonly,jdk/incubator/vector/ByteVector.fromByteBuffer - * -XX:-TieredCompilation -XX:CICompilerCount=1 -XX:+UseZGC -Xbatch -Xmx256m VectorRebracket128Test + * -XX:-TieredCompilation -XX:+UseZGC -Xmx256m VectorRebracket128Test */ @Test @@ -59,6 +59,19 @@ public class VectorRebracket128Test { static final VectorSpecies bspec128 = ByteVector.SPECIES_128; static final VectorSpecies sspec128 = ShortVector.SPECIES_128; + static { + Thread t = new Thread(() -> { + while (true) { + try { + System.gc(); + Thread.sleep(100); + } catch (Exception e) {} + } + }); + t.setDaemon(true); + t.start(); + } + static IntFunction withToString(String s, IntFunction f) { return new IntFunction() { @Override ------------- PR: https://git.openjdk.java.net/jdk16/pull/139 From magnus.ihse.bursie at oracle.com Mon Feb 1 12:29:30 2021 From: magnus.ihse.bursie at oracle.com (Magnus Ihse Bursie) Date: Mon, 1 Feb 2021 13:29:30 +0100 Subject: Build fails when excluding Serial GC In-Reply-To: <2514512e-68d5-868a-5f05-c9d765ae3486@oracle.com> References: <7e2adbed-1b0b-4693-92c0-5c03963b3c55.qingfeng.yy@alibaba-inc.com> <88f8f4b4-941a-5df3-6a89-28741d2f6c7b@oracle.com> <2514512e-68d5-868a-5f05-c9d765ae3486@oracle.com> Message-ID: <3ea7def6-025f-3b63-6598-df001fa8258a@oracle.com> On 2021-01-29 11:19, Stefan Karlsson wrote: > On 2021-01-29 10:49, Magnus Ihse Bursie wrote: >> >> >> On 2021-01-29 09:03, Yang Yi wrote: >>> Hi, >>> >>> It's quite easy to reproduce this problem: >>> ./configure --with-jvm-features=-serialgc ... ; make images >>> >>> I got the following output >>> ``` >>> ... >>> === Output from failing command(s) repeated here === >>> * For target hotspot_variant-server_libjvm_objs_genCollectedHeap.o: >>> /home/qingfeng.yy/openjdk16_so_warning/jdk/src/hotspot/share/gc/shared/genCollectedHeap.cpp: >>> In member function 'virtual void GenCollectedHeap::post_initialize()': >>> /home/qingfeng.yy/openjdk16_so_warning/jdk/src/hotspot/share/gc/shared/genCollectedHeap.cpp:206:3: >>> error: 'MarkSweep' has not been declared >>> ?? 206 |?? MarkSweep::initialize(); >>> ?????? |?? ^~~~~~~~~ >>> * All command lines available in >>> /home/qingfeng.yy/openjdk16_so_warning/jdk/build/linux-x86_64-server-release/make-support/failure-logs. >>> === End of repeated output === >>> ``` >>> I found current JVM features contain the serial gc, but actually I >>> can not >>> build an image that does not contain serial gc. This problem has >>> existed >>> from jdk 11 to jdk head. I am somewhat surprised, so I haven't filed an >>> issue on JBS. Is this really a bug? Or actually we should revise the >>> building >>> document and remove all INCLUDE_SERIALGC macros? >> >> About a year ago I opened >> https://bugs.openjdk.java.net/browse/JDK-8240224, to fix this (and >> other things). This caused quite a heated debate [1], and the result >> was that I closed the bug again. 
>> >> In summary, my understanding is that hotspot developers view the >> serialgc as essential, and that there exists no reason beyond toy >> applications to remove it from compilation. But furthermore the >> INCLUDE_SERIALGC macros should remain, even though they do not really >> work, since they function as markers of intent for the code. I don't? >> agree 100% with this stance, but it's not my code to complain about. :-) > > I think you got push back on some of the changes. To me and many > others the gcConfig.* changes were really controversial. It doesn't > mean that fixes to clean this up won't be accepted. Fair enough. I think the road to make it possible to exclude serial gc is to first fix the hotspot code for e.g. JDK-8234502, and then we can revisit the build changes needed. But I did get the impression from some developers that this was a futile exercise. /Magnus > In that mail thread, there was a reference to this bug '8234502: Merge > GenCollectedHeap and SerialHeap'. Chipping away at that would be good. > Fixing that would not only make it possible to build without Serial > GC, but also help with the maintainability of our code. > > StefanK > >> >> Possibly, the configure script should be changed so it does not look >> like it's possible to exclude the serialgc... >> >> /Magnus >> >> >> [1] >> https://mail.openjdk.java.net/pipermail/hotspot-gc-dev/2020-March/028779.html >> >>> >>> Cheers,Yang Yi >>> >> > From vlivanov at openjdk.java.net Mon Feb 1 12:47:46 2021 From: vlivanov at openjdk.java.net (Vladimir Ivanov) Date: Mon, 1 Feb 2021 12:47:46 GMT Subject: [jdk16] RFR: 8260473: [vector] ZGC: VectorReshape test produces incorrect results with ZGC enabled In-Reply-To: References: <5OfnHC5N00VVv3pWcU9gsAHa23RbAAX7ReEw9Ct6eug=.4f095083-7050-487d-94e0-3befce6744c5@github.com> <_Wm-fi9j4TZ41F0G_92f7ioKQeDNgZiOEMmLkZ0lvvE=.0a9beba5-5089-4368-b4bc-73faf9d5e858@github.com> Message-ID: <226iFOsl1hXrEoSe9uzgBb1Z75wxQEv5azlJIfzCO4k=.69d5ed3a-7337-472d-b106-1ce2e5d361bf@github.com> On Mon, 1 Feb 2021 12:15:38 GMT, ?? wrote: >>> > compileonly and compilercount=1 will let the VM run slow enough to wait for a gc to be finished. >>> >>> That's a strange way to provoke the bug. You could just increase the number of iterations instead. >>> >>> But the right way to fix it is to stress ZGC to continuously run in the background while the test case aggressively unboxes vectors in compiled code. `-Xmx256m` helps with that while `-XX:CICompilerCount=1` is irrelevant. >> >> Yes, it's very weird to provoke the bug like this. If CICompilerCount=1 is removed, the test failed 60% roughly on my machine. >> And the iteration has already changed from 100 to 1000, the run time of the test is nearly 30s on release version of jvm. 
>> >> If I add the following patch, the test always fails on my machine, >> >> diff --git a/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java b/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java >> index 1843ec0..959b29a 100644 >> --- a/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java >> +++ b/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java >> @@ -44,7 +44,7 @@ import jdk.internal.vm.annotation.ForceInline; >> * @modules jdk.incubator.vector >> * @modules java.base/jdk.internal.vm.annotation >> * @run testng/othervm -XX:CompileCommand=compileonly,jdk/incubator/vector/ByteVector.fromByteBuffer >> - * -XX:-TieredCompilation -XX:CICompilerCount=1 -XX:+UseZGC -Xbatch -Xmx256m VectorRebracket128Test >> + * -XX:-TieredCompilation -XX:+UseZGC -Xmx256m VectorRebracket128Test >> */ >> >> @Test >> @@ -125,6 +125,14 @@ public class VectorRebracket128Test { >> @ForceInline >> static >> void testVectorRebracket(VectorSpecies a, VectorSpecies b, byte[] input, byte[] output) { >> + new Thread(() -> { >> + while (true) { >> + try { >> + System.gc(); >> + Thread.sleep(100); >> + } catch (Exception e) {} >> + } >> + }).start(); >> Vector av = a.fromByteArray(input, 0, ByteOrder.nativeOrder()); >> int block; >> assert(input.length == output.length); > > sorry for the wrong patch above, the failed reason of the patch above is due to stack creation failure (create 1000 threads). The following is the right stress gc patch. > > diff --git a/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java b/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java > index 6b266db..a761ea2 100644 > --- a/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java > +++ b/test/hotspot/jtreg/compiler/vectorapi/VectorRebracket128Test.java > @@ -44,7 +44,7 @@ import jdk.internal.vm.annotation.ForceInline; > * @modules jdk.incubator.vector > * @modules java.base/jdk.internal.vm.annotation > * @run testng/othervm -XX:CompileCommand=compileonly,jdk/incubator/vector/ByteVector.fromByteBuffer > - * -XX:-TieredCompilation -XX:CICompilerCount=1 -XX:+UseZGC -Xbatch -Xmx256m VectorRebracket128Test > + * -XX:-TieredCompilation -XX:+UseZGC -Xmx256m VectorRebracket128Test > */ > > @Test > @@ -59,6 +59,19 @@ public class VectorRebracket128Test { > static final VectorSpecies bspec128 = ByteVector.SPECIES_128; > static final VectorSpecies sspec128 = ShortVector.SPECIES_128; > > + static { > + Thread t = new Thread(() -> { > + while (true) { > + try { > + System.gc(); > + Thread.sleep(100); > + } catch (Exception e) {} > + } > + }); > + t.setDaemon(true); > + t.start(); > + } > + > static IntFunction withToString(String s, IntFunction f) { > return new IntFunction() { > @Override Good. Please, file a follow-up RFE to improve the test. 
------------- PR: https://git.openjdk.java.net/jdk16/pull/139 From shade at openjdk.java.net Mon Feb 1 13:16:42 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Mon, 1 Feb 2021 13:16:42 GMT Subject: RFR: 8260309: Shenandoah: Clean up ShenandoahBarrierSet [v3] In-Reply-To: <2u_BCcnM4QkcVVj6MVeFDfDgjB789ouIQuBfqY5p6vo=.2a63e65b-4fcd-4c79-9623-ac203c3ba056@github.com> References: <5t_ZDBfj_4BxoJLoWh3R0r6OCh2Q0wc-DNJntvfhW1Q=.925a092e-c1d3-41df-b216-1cbb0b936959@github.com> <2u_BCcnM4QkcVVj6MVeFDfDgjB789ouIQuBfqY5p6vo=.2a63e65b-4fcd-4c79-9623-ac203c3ba056@github.com> Message-ID: <73Eq3oBXjf_ANTph4ijuK4hfVx-6do89K2w3ChROVlA=.33dfa07d-0a6c-405a-8f65-0671be7cf980@github.com> On Mon, 1 Feb 2021 11:00:59 GMT, Roman Kennke wrote: >> We collected some cruft in ShenandoahBarrierSet. Time to clean it up. >> >> This fixes/removes a number of includes, fixes some comments and it also removes is_a() and is_aligned() which look like leftovers/requirements from earlier incarnations of the superclass BarrierSet. Using the override keyword would be useful for such situations (btw, are we ok to start using override, nullptr, auto etc in Shenandoah, or do we want to keep it C++ for backporting ease?) >> >> One thing I was not sure about is the ShenandoahHeap* _heap field. Making it const will likely help the compiler avoid repeated access (e.g. in a number of perf-critical paths like the LRB impl). However, maybe we should get rid of the field altogether and make it explicitely using ShenandoahHeap::heap() and avoid repeated access instead of helping the compiler and hoping for the best? >> >> Testing: >> - [x] hotspot_gc_shenandoah release, fastdebug > > Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: > > Include accessBarrierSupport.inline.hpp instead of accessBarrierSupport.hpp Marked as reviewed by shade (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2202 From ayang at openjdk.java.net Mon Feb 1 13:56:46 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Mon, 1 Feb 2021 13:56:46 GMT Subject: RFR: 8260643: Remove parallel version handling in CardTableRS::younger_refs_in_space_iterate() In-Reply-To: References: Message-ID: On Mon, 1 Feb 2021 11:53:00 GMT, Thomas Schatzl wrote: > Hi all, > > can I have reviews for this change that removes parallel handling in `CardTableRS::younger_refs_in_space_iterate` as it is always called with n_threads <= 1, making the parallel code handling there obsolete. > > A larger cleanup of `CardTableRS` will follow in JDK-8234534. > > Testing: > tier1,2 Marked as reviewed by ayang (Author). ------------- PR: https://git.openjdk.java.net/jdk/pull/2333 From sjohanss at openjdk.java.net Mon Feb 1 15:35:56 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Mon, 1 Feb 2021 15:35:56 GMT Subject: RFR: 8260643: Remove parallel version handling in CardTableRS::younger_refs_in_space_iterate() In-Reply-To: References: Message-ID: On Mon, 1 Feb 2021 11:53:00 GMT, Thomas Schatzl wrote: > Hi all, > > can I have reviews for this change that removes parallel handling in `CardTableRS::younger_refs_in_space_iterate` as it is always called with n_threads <= 1, making the parallel code handling there obsolete. > > A larger cleanup of `CardTableRS` will follow in JDK-8234534. > > Testing: > tier1,2 Looks good. ------------- Marked as reviewed by sjohanss (Reviewer). 
PR: https://git.openjdk.java.net/jdk/pull/2333 From zgu at openjdk.java.net Mon Feb 1 15:38:09 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Mon, 1 Feb 2021 15:38:09 GMT Subject: RFR: 8260004: Shenandoah: Rename ShenandoahMarkCompact to ShenandoahFullGC [v3] In-Reply-To: References: Message-ID: > Please review this patch that renames ShenandoahMarkCompact to ShenandoahFullGC, to be consistent with other GCs. Zhengyu Gu has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains three commits: - Merge branch 'master' into JDK-8260004-rename-fullgc - Merge master - JDK-8260004-rename-fullgc ------------- Changes: https://git.openjdk.java.net/jdk/pull/2266/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2266&range=02 Stats: 38 lines in 9 files changed: 4 ins; 6 del; 28 mod Patch: https://git.openjdk.java.net/jdk/pull/2266.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2266/head:pull/2266 PR: https://git.openjdk.java.net/jdk/pull/2266 From github.com+779991+jaokim at openjdk.java.net Mon Feb 1 15:44:41 2021 From: github.com+779991+jaokim at openjdk.java.net (Joakim =?UTF-8?B?Tm9yZHN0csO2bQ==?=) Date: Mon, 1 Feb 2021 15:44:41 GMT Subject: RFR: 8217327: G1 Post-Cleanup region liveness printing should not print out-of-date efficiency [v4] In-Reply-To: References: <95B6j1ZSceUGfTTDsZfF3a5ZbggYlBiv9WJkHKkzO0w=.edd53e67-02ae-4c8a-ae0f-3a50c7ac0676@github.com> Message-ID: On Mon, 1 Feb 2021 09:52:19 GMT, Stefan Johansson wrote: >> Joakim Nordstr?m has updated the pull request incrementally with one additional commit since the last revision: >> >> Using FormatBuffer instead of snprintf. Changed defines to more descriptive names. > > Looks good. Thanks for review @kstefanj. ------------- PR: https://git.openjdk.java.net/jdk/pull/2217 From zgu at openjdk.java.net Mon Feb 1 16:07:42 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Mon, 1 Feb 2021 16:07:42 GMT Subject: RFR: 8260309: Shenandoah: Clean up ShenandoahBarrierSet [v3] In-Reply-To: <2u_BCcnM4QkcVVj6MVeFDfDgjB789ouIQuBfqY5p6vo=.2a63e65b-4fcd-4c79-9623-ac203c3ba056@github.com> References: <5t_ZDBfj_4BxoJLoWh3R0r6OCh2Q0wc-DNJntvfhW1Q=.925a092e-c1d3-41df-b216-1cbb0b936959@github.com> <2u_BCcnM4QkcVVj6MVeFDfDgjB789ouIQuBfqY5p6vo=.2a63e65b-4fcd-4c79-9623-ac203c3ba056@github.com> Message-ID: <79a4QDkcjzuW0lJPfAaJawWwBn4pejdpTzzDaZxaFl0=.72084bab-c67f-4559-9ab2-235a0126da5d@github.com> On Mon, 1 Feb 2021 11:00:59 GMT, Roman Kennke wrote: >> We collected some cruft in ShenandoahBarrierSet. Time to clean it up. >> >> This fixes/removes a number of includes, fixes some comments and it also removes is_a() and is_aligned() which look like leftovers/requirements from earlier incarnations of the superclass BarrierSet. Using the override keyword would be useful for such situations (btw, are we ok to start using override, nullptr, auto etc in Shenandoah, or do we want to keep it C++ for backporting ease?) >> >> One thing I was not sure about is the ShenandoahHeap* _heap field. Making it const will likely help the compiler avoid repeated access (e.g. in a number of perf-critical paths like the LRB impl). However, maybe we should get rid of the field altogether and make it explicitely using ShenandoahHeap::heap() and avoid repeated access instead of helping the compiler and hoping for the best? 
>> >> Testing: >> - [x] hotspot_gc_shenandoah release, fastdebug > > Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: > > Include accessBarrierSupport.inline.hpp instead of accessBarrierSupport.hpp Looks good. ------------- Marked as reviewed by zgu (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2202 From rkennke at openjdk.java.net Mon Feb 1 17:32:41 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 1 Feb 2021 17:32:41 GMT Subject: Integrated: 8260309: Shenandoah: Clean up ShenandoahBarrierSet In-Reply-To: <5t_ZDBfj_4BxoJLoWh3R0r6OCh2Q0wc-DNJntvfhW1Q=.925a092e-c1d3-41df-b216-1cbb0b936959@github.com> References: <5t_ZDBfj_4BxoJLoWh3R0r6OCh2Q0wc-DNJntvfhW1Q=.925a092e-c1d3-41df-b216-1cbb0b936959@github.com> Message-ID: On Fri, 22 Jan 2021 19:03:14 GMT, Roman Kennke wrote: > We collected some cruft in ShenandoahBarrierSet. Time to clean it up. > > This fixes/removes a number of includes, fixes some comments and it also removes is_a() and is_aligned() which look like leftovers/requirements from earlier incarnations of the superclass BarrierSet. Using the override keyword would be useful for such situations (btw, are we ok to start using override, nullptr, auto etc in Shenandoah, or do we want to keep it C++ for backporting ease?) > > One thing I was not sure about is the ShenandoahHeap* _heap field. Making it const will likely help the compiler avoid repeated access (e.g. in a number of perf-critical paths like the LRB impl). However, maybe we should get rid of the field altogether and make it explicitely using ShenandoahHeap::heap() and avoid repeated access instead of helping the compiler and hoping for the best? > > Testing: > - [x] hotspot_gc_shenandoah release, fastdebug This pull request has now been integrated. Changeset: df33595e Author: Roman Kennke URL: https://git.openjdk.java.net/jdk/commit/df33595e Stats: 31 lines in 6 files changed: 4 ins; 19 del; 8 mod 8260309: Shenandoah: Clean up ShenandoahBarrierSet Reviewed-by: shade, zgu ------------- PR: https://git.openjdk.java.net/jdk/pull/2202 From zgu at openjdk.java.net Mon Feb 1 18:13:46 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Mon, 1 Feb 2021 18:13:46 GMT Subject: Integrated: 8260004: Shenandoah: Rename ShenandoahMarkCompact to ShenandoahFullGC In-Reply-To: References: Message-ID: <9hC5B8QLUCOrNRRz4LN22Zyv_rPDN50nl3rdG_okC6w=.88d570f7-153a-49f8-b41b-ef0678d776d7@github.com> On Wed, 27 Jan 2021 18:16:09 GMT, Zhengyu Gu wrote: > Please review this patch that renames ShenandoahMarkCompact to ShenandoahFullGC, to be consistent with other GCs. This pull request has now been integrated. Changeset: e963ebd7 Author: Zhengyu Gu URL: https://git.openjdk.java.net/jdk/commit/e963ebd7 Stats: 38 lines in 9 files changed: 4 ins; 6 del; 28 mod 8260004: Shenandoah: Rename ShenandoahMarkCompact to ShenandoahFullGC Reviewed-by: shade, rkennke ------------- PR: https://git.openjdk.java.net/jdk/pull/2266 From github.com+779991+jaokim at openjdk.java.net Mon Feb 1 18:22:42 2021 From: github.com+779991+jaokim at openjdk.java.net (Joakim =?UTF-8?B?Tm9yZHN0csO2bQ==?=) Date: Mon, 1 Feb 2021 18:22:42 GMT Subject: Integrated: 8217327: G1 Post-Cleanup region liveness printing should not print out-of-date efficiency In-Reply-To: References: Message-ID: On Mon, 25 Jan 2021 11:52:26 GMT, Joakim Nordstr?m wrote: > **Description** > This fix addresses the issue where gc-efficiency is printed incorrectly when logging post-marking and post-cleanup. 
The gc-efficiency is calculated in the end of the marking phase, to be logged in the post-cleanup section. It is however not reset, meaning that next phase's post-marking log will show the old value. > > - The gc-efficiency is initialized to -1 when it hasn't been calculated. > - Negative gc-efficiency is displayed as a hyphen "-" in the summary. > - The gc-efficiency is reset to -1 in `HeapRegion::note_start_of_marking()` > > **Note:** there is a sister issue that moves the post-cleanup printing to a later stage. Without this fix, the logging will still be incorrect, so both fixes are needed. See: [JDK-8260042: G1 Post-cleanup liveness printing occurs too early](https://github.com/openjdk/jdk/pull/2168) > > This fix has been tested together with the above mentioned fix. > > **Example** > This is what logging like after fix has been applied. > ### PHASE Post-Marking @ 410.303 > ### HEAP reserved: 0x0ffc00000-0x100000000 region-size: 1048576 > ### > ### type address-range used prev-live next-live gc-eff remset state code-roots > ### (bytes) (bytes) (bytes) (bytes/ms) (bytes) (bytes) > ### OLD 0x0ffc00000-0x0ffd00000 1048368 1048368 1048368 - 8464 UPDAT 6096 > ### OLD 0x0ffd00000-0x0ffe00000 132856 132856 132856 - 2544 UPDAT 16 > ### SURV 0x0ffe00000-0x0fff00000 21368 21368 21368 - 2544 CMPLT 16 > ### FREE 0x0fff00000-0x100000000 0 0 0 - 2384 UNTRA 16 > ### > ### SUMMARY capacity: 4.00 MB used: 1.15 MB / 28.67 % prev-live: 1.15 MB / 28.67 % next-live: 1.15 MB / 28.67 % remset: 0.02 MB code-roots: 0.01 MB > ### PHASE Post-Cleanup @ 410.305 > ### HEAP reserved: 0x0ffc00000-0x100000000 region-size: 1048576 > ### > ### type address-range used prev-live next-live gc-eff remset state code-roots > ### (bytes) (bytes) (bytes) (bytes/ms) (bytes) (bytes) > ### OLD 0x0ffc00000-0x0ffd00000 1048368 1048368 1048368 - 8624 UNTRA 6096 > ### OLD 0x0ffd00000-0x0ffe00000 132856 132856 132856 1352923.9 2544 CMPLT 16 > ### SURV 0x0ffe00000-0x0fff00000 21368 21368 21368 - 2544 CMPLT 16 > ### FREE 0x0fff00000-0x100000000 0 0 0 - 2384 UNTRA 16 > ### > ### SUMMARY capacity: 4.00 MB used: 1.15 MB / 28.67 % prev-live: 1.15 MB / 28.67 % next-live: 1.15 MB / 28.67 % remset: 0.02 MB code-roots: 0.01 MB > ### PHASE Post-Marking @ 450.310 > ### HEAP reserved: 0x0ffc00000-0x100000000 region-size: 1048576 > ### > ### type address-range used prev-live next-live gc-eff remset state code-roots > ### (bytes) (bytes) (bytes) (bytes/ms) (bytes) (bytes) > ### OLD 0x0ffc00000-0x0ffd00000 1048368 1048368 1048368 - 8624 UPDAT 6096 > ### OLD 0x0ffd00000-0x0ffe00000 174456 174456 174456 - 2544 UPDAT 16 > ### SURV 0x0ffe00000-0x0fff00000 21368 21368 21368 - 2544 CMPLT 16 > ### FREE 0x0fff00000-0x100000000 0 0 0 - 2384 UNTRA 16 > ### > ### SUMMARY capacity: 4.00 MB used: 1.19 MB / 29.66 % prev-live: 1.19 MB / 29.66 % next-live: 1.19 MB / 29.66 % remset: 0.02 MB code-roots: 0.01 MB > ### PHASE Post-Cleanup @ 450.312 > ### HEAP reserved: 0x0ffc00000-0x100000000 region-size: 1048576 > ### > ### type address-range used prev-live next-live gc-eff remset state code-roots > ### (bytes) (bytes) (bytes) (bytes/ms) (bytes) (bytes) > ### OLD 0x0ffc00000-0x0ffd00000 1048368 1048368 1048368 - 8624 UNTRA 6096 > ### OLD 0x0ffd00000-0x0ffe00000 174456 174456 174456 1266519.2 2544 CMPLT 16 > ### SURV 0x0ffe00000-0x0fff00000 21368 21368 21368 - 2544 CMPLT 16 > ### FREE 0x0fff00000-0x100000000 0 0 0 - 2384 UNTRA 16 > ### > > **Testing** > - Manual testing > - hs-tier1, hs-tier2 This pull request has now been integrated. 
Changeset: 50f9a70f Author: JSNORDST Committer: Thomas Schatzl URL: https://git.openjdk.java.net/jdk/commit/50f9a70f Stats: 30 lines in 3 files changed: 10 ins; 0 del; 20 mod 8217327: G1 Post-Cleanup region liveness printing should not print out-of-date efficiency Reviewed-by: tschatzl, sjohanss ------------- PR: https://git.openjdk.java.net/jdk/pull/2217 From zgu at openjdk.java.net Mon Feb 1 20:56:49 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Mon, 1 Feb 2021 20:56:49 GMT Subject: RFR: 8260736: Shenandoah: Cleanup includes in ShenandoahGC and families Message-ID: 8260736: Shenandoah: Cleanup includes in ShenandoahGC and families ------------- Commit messages: - update - Merge branch 'master' into JDK-8260736-cleanup-includes-gc - update - init Changes: https://git.openjdk.java.net/jdk/pull/2339/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2339&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8260736 Stats: 37 lines in 10 files changed: 2 ins; 30 del; 5 mod Patch: https://git.openjdk.java.net/jdk/pull/2339.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2339/head:pull/2339 PR: https://git.openjdk.java.net/jdk/pull/2339 From zgu at openjdk.java.net Mon Feb 1 21:25:54 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Mon, 1 Feb 2021 21:25:54 GMT Subject: RFR: 8260736: Shenandoah: Cleanup includes in ShenandoahGC and families [v2] In-Reply-To: References: Message-ID: > 8260736: Shenandoah: Cleanup includes in ShenandoahGC and families Zhengyu Gu has updated the pull request incrementally with one additional commit since the last revision: Added back vmThread.hpp ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2339/files - new: https://git.openjdk.java.net/jdk/pull/2339/files/001c3094..87924b4c Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2339&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2339&range=00-01 Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod Patch: https://git.openjdk.java.net/jdk/pull/2339.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2339/head:pull/2339 PR: https://git.openjdk.java.net/jdk/pull/2339 From kbarrett at openjdk.java.net Mon Feb 1 21:51:54 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Mon, 1 Feb 2021 21:51:54 GMT Subject: [jdk16] RFR: 8260704: ParallelGC: oldgen expansion needs release-store for _end Message-ID: Please review this change that ensures MutableSpace::_end is updated after everything else that is relevant when expanding, by using a release_store to perform the update. With this change the storestore that was added by JDK-8257999 is no longer needed. 
Testing: mach5 tier1-3, tier5 ------------- Commit messages: - Move JDK-8257999 barrier to correct location Changes: https://git.openjdk.java.net/jdk16/pull/141/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk16&pr=141&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8260704 Stats: 11 lines in 2 files changed: 4 ins; 1 del; 6 mod Patch: https://git.openjdk.java.net/jdk16/pull/141.diff Fetch: git fetch https://git.openjdk.java.net/jdk16 pull/141/head:pull/141 PR: https://git.openjdk.java.net/jdk16/pull/141 From jiefu at openjdk.java.net Tue Feb 2 02:01:48 2021 From: jiefu at openjdk.java.net (Jie Fu) Date: Tue, 2 Feb 2021 02:01:48 GMT Subject: [jdk16] RFR: 8260473: [vector] ZGC: VectorReshape test produces incorrect results with ZGC enabled In-Reply-To: <226iFOsl1hXrEoSe9uzgBb1Z75wxQEv5azlJIfzCO4k=.69d5ed3a-7337-472d-b106-1ce2e5d361bf@github.com> References: <5OfnHC5N00VVv3pWcU9gsAHa23RbAAX7ReEw9Ct6eug=.4f095083-7050-487d-94e0-3befce6744c5@github.com> <_Wm-fi9j4TZ41F0G_92f7ioKQeDNgZiOEMmLkZ0lvvE=.0a9beba5-5089-4368-b4bc-73faf9d5e858@github.com> <226iFOsl1hXrEoSe9uzgBb1Z75wxQEv5azlJIfzCO4k=.69d5ed3a-7337-472d-b106-1ce2e5d361bf@github.com> Message-ID: On Mon, 1 Feb 2021 12:44:59 GMT, Vladimir Ivanov wrote: > Good. Please, file a follow-up RFE to improve the test. OK. I will help to file a JBS bug once the fix has been merged into the jdk mainline. It will be only fixed in the jdk17, right? Thanks. ------------- PR: https://git.openjdk.java.net/jdk16/pull/139 From iklam at openjdk.java.net Tue Feb 2 04:34:47 2021 From: iklam at openjdk.java.net (Ioi Lam) Date: Tue, 2 Feb 2021 04:34:47 GMT Subject: RFR: 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp Message-ID: collectedHeap.hpp is included by 477 out of 1000 .o files in HotSpot. This file in turn includes many other complex header files. In many cases, an object file only directly includes this file via: - memAllocator.hpp (which does not actually use collectedHeap.hpp) - oop.inline.hpp and compressedOops.inline.hpp (only use collectedHeap.hpp in asserts via `Universe::heap()->is_in()`). By refactoring the above 3 files, we can reduce the .o files that include collectedHeap.hpp to 242. This RFE also removes the unnecessary inclusion of heapInspection.hpp from collectedHeap.hpp. Build time of HotSpot is reduced for about 1%. Tested with mach5: tier1, builds-tier2, builds-tier3, builds-tier4 and builds-tier5. Also locally: aarch64, arm, ppc64, s390, x86, and zero. 
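The general technique is worth spelling out. Below is a hedged, single-file illustration (fake names, not the actual JDK headers) of how an assert in a widely-included header can be routed through a small out-of-line wrapper so that only one .cpp file pays for the heavy include; the review comments further down show the real patch taking this shape via Universe::is_in_heap().

```
// Single-file illustration of the technique, not the JDK sources.
#include <cassert>
#include <cstddef>

// --- "universe.hpp": cheap header, no collectedHeap.hpp needed --------------
namespace Universe {
  bool is_in_heap(const void* p);          // defined out of line
  bool is_in_heap_or_null(const void* p);
}

// --- "oop.inline.hpp": hot header included by hundreds of .o files ----------
inline void verify_oop(const void* obj) {
  // The assert no longer needs the CollectedHeap definition here.
  assert(Universe::is_in_heap_or_null(obj), "oop must be in the heap or null");
}

// --- "universe.cpp": the only translation unit paying for the heavy header --
// #include "gc/shared/collectedHeap.hpp"  // confined to this one .cpp
struct FakeHeap { bool is_in(const void* p) const { return p != nullptr; } };
static FakeHeap g_heap;

bool Universe::is_in_heap(const void* p)         { return g_heap.is_in(p); }
bool Universe::is_in_heap_or_null(const void* p) { return p == nullptr || is_in_heap(p); }

int main() {
  int x = 0;
  verify_oop(&x);        // passes the (fake) heap check in this sketch
  verify_oop(nullptr);   // null is explicitly allowed
  return 0;
}
```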
------------- Commit messages: - 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp Changes: https://git.openjdk.java.net/jdk/pull/2347/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2347&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8260012 Stats: 110 lines in 60 files changed: 63 ins; 7 del; 40 mod Patch: https://git.openjdk.java.net/jdk/pull/2347.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2347/head:pull/2347 PR: https://git.openjdk.java.net/jdk/pull/2347 From tschatzl at openjdk.java.net Tue Feb 2 07:59:46 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Tue, 2 Feb 2021 07:59:46 GMT Subject: [jdk16] RFR: 8260704: ParallelGC: oldgen expansion needs release-store for _end In-Reply-To: References: Message-ID: <1YY9KmPpKnvdDeecG5Y8Ckb-eCG3vjgnl7O7R1hB1sQ=.435ae6c7-3ed0-4a80-a4ea-22bdefd5811c@github.com> On Mon, 1 Feb 2021 10:10:48 GMT, Kim Barrett wrote: > Please review this change that ensures MutableSpace::_end is updated after > everything else that is relevant when expanding, by using a release_store to > perform the update. With this change the storestore that was added by > JDK-8257999 is no longer needed. > > Testing: > mach5 tier1-3, tier5 Lgtm ------------- Marked as reviewed by tschatzl (Reviewer). PR: https://git.openjdk.java.net/jdk16/pull/141 From sjohanss at openjdk.java.net Tue Feb 2 09:02:47 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Tue, 2 Feb 2021 09:02:47 GMT Subject: [jdk16] RFR: 8260704: ParallelGC: oldgen expansion needs release-store for _end In-Reply-To: References: Message-ID: On Mon, 1 Feb 2021 10:10:48 GMT, Kim Barrett wrote: > Please review this change that ensures MutableSpace::_end is updated after > everything else that is relevant when expanding, by using a release_store to > perform the update. With this change the storestore that was added by > JDK-8257999 is no longer needed. > > Testing: > mach5 tier1-3, tier5 Looks good. ------------- Marked as reviewed by sjohanss (Reviewer). PR: https://git.openjdk.java.net/jdk16/pull/141 From tschatzl at openjdk.java.net Tue Feb 2 11:44:59 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Tue, 2 Feb 2021 11:44:59 GMT Subject: RFR: 8260643: Remove parallel version handling in CardTableRS::younger_refs_in_space_iterate() In-Reply-To: References: Message-ID: <81e4IJaRYyC_fIcd2uyNXwvRXM5rnNytai82oDNQk4w=.51b2a370-66e4-4a2d-9b15-8736ea7a7a30@github.com> On Mon, 1 Feb 2021 15:32:53 GMT, Stefan Johansson wrote: >> Hi all, >> >> can I have reviews for this change that removes parallel handling in `CardTableRS::younger_refs_in_space_iterate` as it is always called with n_threads <= 1, making the parallel code handling there obsolete. >> >> A larger cleanup of `CardTableRS` will follow in JDK-8234534. >> >> Testing: >> tier1,2 > > Looks good. Thanks @kstefanj @albertnetymk for your reviews. ------------- PR: https://git.openjdk.java.net/jdk/pull/2333 From tschatzl at openjdk.java.net Tue Feb 2 11:45:00 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Tue, 2 Feb 2021 11:45:00 GMT Subject: RFR: 8260643: Remove parallel version handling in CardTableRS::younger_refs_in_space_iterate() In-Reply-To: <81e4IJaRYyC_fIcd2uyNXwvRXM5rnNytai82oDNQk4w=.51b2a370-66e4-4a2d-9b15-8736ea7a7a30@github.com> References: <81e4IJaRYyC_fIcd2uyNXwvRXM5rnNytai82oDNQk4w=.51b2a370-66e4-4a2d-9b15-8736ea7a7a30@github.com> Message-ID: On Tue, 2 Feb 2021 11:01:13 GMT, Thomas Schatzl wrote: >> Looks good. 
> > Thanks @kstefanj @albertnetymk for your reviews. Fwiw I re-ran tier1+2 with no issues ------------- PR: https://git.openjdk.java.net/jdk/pull/2333 From tschatzl at openjdk.java.net Tue Feb 2 11:45:02 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Tue, 2 Feb 2021 11:45:02 GMT Subject: Integrated: 8260643: Remove parallel version handling in CardTableRS::younger_refs_in_space_iterate() In-Reply-To: References: Message-ID: On Mon, 1 Feb 2021 11:53:00 GMT, Thomas Schatzl wrote: > Hi all, > > can I have reviews for this change that removes parallel handling in `CardTableRS::younger_refs_in_space_iterate` as it is always called with n_threads <= 1, making the parallel code handling there obsolete. > > A larger cleanup of `CardTableRS` will follow in JDK-8234534. > > Testing: > tier1,2 This pull request has now been integrated. Changeset: 288a4fed Author: Thomas Schatzl URL: https://git.openjdk.java.net/jdk/commit/288a4fed Stats: 103 lines in 7 files changed: 3 ins; 72 del; 28 mod 8260643: Remove parallel version handling in CardTableRS::younger_refs_in_space_iterate() Reviewed-by: ayang, sjohanss ------------- PR: https://git.openjdk.java.net/jdk/pull/2333 From vlivanov at openjdk.java.net Tue Feb 2 11:45:43 2021 From: vlivanov at openjdk.java.net (Vladimir Ivanov) Date: Tue, 2 Feb 2021 11:45:43 GMT Subject: [jdk16] RFR: 8260473: [vector] ZGC: VectorReshape test produces incorrect results with ZGC enabled In-Reply-To: References: <5OfnHC5N00VVv3pWcU9gsAHa23RbAAX7ReEw9Ct6eug=.4f095083-7050-487d-94e0-3befce6744c5@github.com> Message-ID: On Sun, 31 Jan 2021 21:29:40 GMT, Nils Eliasson wrote: >> https://bugs.openjdk.java.net/browse/JDK-8260473 >> >> Function "PhaseVector::expand_vunbox_node" creates a LoadNode, but forgets to make the LoadNode to pass gc barriers. >> >> Testing: all Vector API related tests have passed. >> >> Original pr: https://github.com/openjdk/jdk/pull/2253 > > Approved. > > Now awaiting release team approval. > It will be only fixed in the jdk17, right? Yes, I'm OK with that. ------------- PR: https://git.openjdk.java.net/jdk16/pull/139 From stefank at openjdk.java.net Tue Feb 2 12:14:45 2021 From: stefank at openjdk.java.net (Stefan Karlsson) Date: Tue, 2 Feb 2021 12:14:45 GMT Subject: RFR: 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp In-Reply-To: References: Message-ID: On Tue, 2 Feb 2021 04:18:24 GMT, Ioi Lam wrote: > collectedHeap.hpp is included by 477 out of 1000 .o files in HotSpot. This file in turn includes many other complex header files. > > In many cases, an object file only directly includes this file via: > > - memAllocator.hpp (which does not actually use collectedHeap.hpp) > - oop.inline.hpp and compressedOops.inline.hpp (only use collectedHeap.hpp in asserts via `Universe::heap()->is_in()`). > > By refactoring the above 3 files, we can reduce the .o files that include collectedHeap.hpp to 242. > > This RFE also removes the unnecessary inclusion of heapInspection.hpp from collectedHeap.hpp. > > Build time of HotSpot is reduced for about 1%. > > Tested with mach5: tier1, builds-tier2, builds-tier3, builds-tier4 and builds-tier5. Also locally: aarch64, arm, ppc64, s390, x86, and zero. Looks good. A few things that you might want to consider, but I'm also fine with the patch as it is. 
src/hotspot/share/gc/shared/memAllocator.hpp line 30: > 28: #include "memory/memRegion.hpp" > 29: #include "oops/oopsHierarchy.hpp" > 30: #include "runtime/thread.hpp" If we want to, this could be changed to a forward declaration if we removed the default value (Thread* thread = Thread::current()) of the constructors. Not needed for this RFE though. src/hotspot/cpu/arm/frame_arm.cpp line 518: > 516: obj = *(oop*)res_addr; > 517: } > 518: assert(obj == NULL || Universe::is_in_heap(obj), "sanity check"); Could have been changed to is_in_heap_or_null. src/hotspot/cpu/ppc/frame_ppc.cpp line 308: > 306: case T_ARRAY: { > 307: oop obj = *(oop*)tos_addr; > 308: assert(obj == NULL || Universe::is_in_heap(obj), "sanity check"); Could have been changed to is_in_heap_or_null. src/hotspot/cpu/s390/frame_s390.cpp line 321: > 319: case T_ARRAY: { > 320: oop obj = *(oop*)tos_addr; > 321: assert(obj == NULL || Universe::is_in_heap(obj), "sanity check"); Could have been changed to is_in_heap_or_null. ------------- Marked as reviewed by stefank (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2347 From tschatzl at openjdk.java.net Tue Feb 2 12:33:47 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Tue, 2 Feb 2021 12:33:47 GMT Subject: RFR: 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp In-Reply-To: References: Message-ID: On Tue, 2 Feb 2021 04:18:24 GMT, Ioi Lam wrote: > collectedHeap.hpp is included by 477 out of 1000 .o files in HotSpot. This file in turn includes many other complex header files. > > In many cases, an object file only directly includes this file via: > > - memAllocator.hpp (which does not actually use collectedHeap.hpp) > - oop.inline.hpp and compressedOops.inline.hpp (only use collectedHeap.hpp in asserts via `Universe::heap()->is_in()`). > > By refactoring the above 3 files, we can reduce the .o files that include collectedHeap.hpp to 242. > > This RFE also removes the unnecessary inclusion of heapInspection.hpp from collectedHeap.hpp. > > Build time of HotSpot is reduced for about 1%. > > Tested with mach5: tier1, builds-tier2, builds-tier3, builds-tier4 and builds-tier5. Also locally: aarch64, arm, ppc64, s390, x86, and zero. Checked a few includes for missing ones; obviously they are included transitively so add as you see fit. src/hotspot/share/gc/shared/memAllocator.hpp line 30: > 28: #include "memory/memRegion.hpp" > 29: #include "oops/oopsHierarchy.hpp" > 30: #include "runtime/thread.hpp" `utilities/globalDefinitions.hpp` for `HeapWord` is missing. src/hotspot/share/oops/compressedOops.inline.hpp line 28: > 26: #define SHARE_OOPS_COMPRESSEDOOPS_INLINE_HPP > 27: > 28: #include "gc/shared/collectedHeap.hpp" `utilities/globalDefinitions.hpp` for `*PTR_FORMAT` and others is missing. src/hotspot/share/oops/oop.inline.hpp line 28: > 26: #define SHARE_OOPS_OOP_INLINE_HPP > 27: > 28: #include "gc/shared/collectedHeap.hpp" `utilities/globalDefinitions.hpp` for `HeapWord` is missing. `globals.hpp` for some globals. `oopsHierarchy.hpp` for `narrowKlass` `utilties/debug.hpp` for `assert` ------------- Marked as reviewed by tschatzl (Reviewer). 
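For readers unfamiliar with the trade-off mentioned for memAllocator.hpp, here is a hedged, simplified sketch (stand-in names, not the JDK's MemAllocator or Thread): a default argument of Thread::current() forces the header to include the full Thread definition, whereas passing the thread explicitly lets a forward declaration suffice.

```
// Illustrative only -- simplified stand-ins, not the JDK's MemAllocator/Thread.
#include <cstddef>

// Variant that forces the heavy include: a default argument of
// Thread::current() needs Thread's class definition, so the header would
// have to #include "runtime/thread.hpp":
//
//   class MemAllocator {
//    public:
//     MemAllocator(std::size_t size, Thread* thread = Thread::current());
//   };

// Variant that only needs a forward declaration, because the caller now
// supplies the thread explicitly:
class Thread;                                     // forward declaration suffices

class MemAllocator {
 public:
  MemAllocator(std::size_t size, Thread* thread) : _size(size), _thread(thread) {}
 private:
  std::size_t _size;
  Thread*     _thread;
};

// Only to make this sketch build; in the real code the definition lives in
// runtime/thread.hpp and each call site would pass Thread::current() itself.
class Thread {};

int main() {
  Thread t;
  MemAllocator allocator(64, &t);
  (void)allocator;
  return 0;
}
```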
PR: https://git.openjdk.java.net/jdk/pull/2347 From kbarrett at openjdk.java.net Tue Feb 2 19:23:01 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Tue, 2 Feb 2021 19:23:01 GMT Subject: [jdk16] RFR: 8260704: ParallelGC: oldgen expansion needs release-store for _end [v2] In-Reply-To: References: Message-ID: > Please review this change that ensures MutableSpace::_end is updated after > everything else that is relevant when expanding, by using a release_store to > perform the update. With this change the storestore that was added by > JDK-8257999 is no longer needed. > > Testing: > mach5 tier1-3, tier5 Kim Barrett has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: - Merge branch 'master' into move_barrier - Move JDK-8257999 barrier to correct location ------------- Changes: - all: https://git.openjdk.java.net/jdk16/pull/141/files - new: https://git.openjdk.java.net/jdk16/pull/141/files/91d8be35..3929bb7f Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk16&pr=141&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk16&pr=141&range=00-01 Stats: 242 lines in 29 files changed: 100 ins; 112 del; 30 mod Patch: https://git.openjdk.java.net/jdk16/pull/141.diff Fetch: git fetch https://git.openjdk.java.net/jdk16 pull/141/head:pull/141 PR: https://git.openjdk.java.net/jdk16/pull/141 From kbarrett at openjdk.java.net Tue Feb 2 19:23:02 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Tue, 2 Feb 2021 19:23:02 GMT Subject: [jdk16] RFR: 8260704: ParallelGC: oldgen expansion needs release-store for _end [v2] In-Reply-To: <1YY9KmPpKnvdDeecG5Y8Ckb-eCG3vjgnl7O7R1hB1sQ=.435ae6c7-3ed0-4a80-a4ea-22bdefd5811c@github.com> References: <1YY9KmPpKnvdDeecG5Y8Ckb-eCG3vjgnl7O7R1hB1sQ=.435ae6c7-3ed0-4a80-a4ea-22bdefd5811c@github.com> Message-ID: On Tue, 2 Feb 2021 07:56:56 GMT, Thomas Schatzl wrote: >> Kim Barrett has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: >> >> - Merge branch 'master' into move_barrier >> - Move JDK-8257999 barrier to correct location > > Lgtm Thanks @tschatzl and @kstefanj for reviews. ------------- PR: https://git.openjdk.java.net/jdk16/pull/141 From kbarrett at openjdk.java.net Tue Feb 2 19:23:03 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Tue, 2 Feb 2021 19:23:03 GMT Subject: [jdk16] Integrated: 8260704: ParallelGC: oldgen expansion needs release-store for _end In-Reply-To: References: Message-ID: On Mon, 1 Feb 2021 10:10:48 GMT, Kim Barrett wrote: > Please review this change that ensures MutableSpace::_end is updated after > everything else that is relevant when expanding, by using a release_store to > perform the update. With this change the storestore that was added by > JDK-8257999 is no longer needed. > > Testing: > mach5 tier1-3, tier5 This pull request has now been integrated. Changeset: afd5eefd Author: Kim Barrett URL: https://git.openjdk.java.net/jdk16/commit/afd5eefd Stats: 11 lines in 2 files changed: 4 ins; 1 del; 6 mod 8260704: ParallelGC: oldgen expansion needs release-store for _end Move JDK-8257999 barrier to correct location. 
Reviewed-by: tschatzl, sjohanss ------------- PR: https://git.openjdk.java.net/jdk16/pull/141 From kbarrett at openjdk.java.net Tue Feb 2 19:25:42 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Tue, 2 Feb 2021 19:25:42 GMT Subject: RFR: 8258508: Merge G1RedirtyCardsQueue into qset In-Reply-To: References: Message-ID: <5R8m4TmkUnxjS3PFRv9ZoCwmmmQimtukejn019nBUk8=.ee65a633-9e57-4ab9-92c1-a5c352a1e5ef@github.com> On Mon, 1 Feb 2021 09:44:11 GMT, Thomas Schatzl wrote: >> Please review this change to G1RedirtyCardsLocalQueueSet to directly >> incorporate the associated queue, simplifying usage. >> >> Testing: >> mach5 tier1 > > Lgtm. Thanks @tschatzl and @walulyai for reviews. ------------- PR: https://git.openjdk.java.net/jdk/pull/2325 From cjplummer at openjdk.java.net Tue Feb 2 19:51:50 2021 From: cjplummer at openjdk.java.net (Chris Plummer) Date: Tue, 2 Feb 2021 19:51:50 GMT Subject: RFR: 8247514: Improve clhsdb 'findpc' ability to determine what an address points to by improving PointerFinder and PointerLocation classes In-Reply-To: References: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> Message-ID: On Mon, 25 Jan 2021 20:00:41 GMT, Chris Plummer wrote: >> See the bug for most details. A few notes here about some implementation details: >> >> In the `PointerLocation` class, I added more consistency w.r.t. whether or not a newline is printed. It used to for some address types, but not others. Now it always does. And if you see a comment something like the following: >> >> ` getTLAB().printOn(tty); // includes "\n" ` >> >> That's just clarifying whether or not the `printOn()` method called will include the newline. Some do and some don't, and knowing what the various `printOn()` methods do makes getting the proper inclusion of the newline easier to understand. >> >> I added `verbose` and `printAddress` boolean arguments to `PointerLocation.printOn()`. Currently they are always `true`. The false arguments will be used when I complete [JDK-8250801](https://bugs.openjdk.java.net/browse/JDK-8250801), which will use `PointerFinder/Location` to show what each register points to. >> >> The CR mentions that the main motivation for this work is for eventual replacement of the old clhsdb `whatis` command, which was implemented in javascript. It used to resolve DSO symbols, whereas `findpc` did not. The `whatis` code did this with the following: >> >> var dso = loadObjectContainingPC(addr); >> if (dso == null) { >> return ptrLoc.toString(); >> } >> var sym = dso.closestSymbolToPC(addr); >> if (sym != null) { >> return sym.name + '+' + sym.offset; >> } >> And now you'll see something similar in the PointerFinder code: >> >> loc.loadObject = cdbg.loadObjectContainingPC(a); >> if (loc.loadObject != null) { >> loc.nativeSymbol = loc.loadObject.closestSymbolToPC(a); >> return loc; >> } >> Note that now that `findpc` does everything that `whatis` used to (and more), we don't really need to add a java version of `whatis`, but I'll probably do so anyway just help out people who are used to using the `whatis` command. That will be done using [JDK-8244670](https://bugs.openjdk.java.net/browse/JDK-8244670) > > Ping! Ping again. 
------------- PR: https://git.openjdk.java.net/jdk/pull/2111 From zgu at openjdk.java.net Tue Feb 2 21:35:51 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Tue, 2 Feb 2021 21:35:51 GMT Subject: RFR: 8260998: Shenandoah: Restore reference processing statistics reporting Message-ID: Please review this patch that restores reporting of reference processing statistics after JDK-8254315 ------------- Commit messages: - JDK-8260998-ref-proc-stats Changes: https://git.openjdk.java.net/jdk/pull/2362/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2362&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8260998 Stats: 20 lines in 3 files changed: 18 ins; 1 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2362.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2362/head:pull/2362 PR: https://git.openjdk.java.net/jdk/pull/2362 From ysuenaga at openjdk.java.net Tue Feb 2 23:24:40 2021 From: ysuenaga at openjdk.java.net (Yasumasa Suenaga) Date: Tue, 2 Feb 2021 23:24:40 GMT Subject: RFR: 8247514: Improve clhsdb 'findpc' ability to determine what an address points to by improving PointerFinder and PointerLocation classes In-Reply-To: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> References: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> Message-ID: <-09XRqbxFbZGkzqDVewiXrJjVNjuLMdZqfxjnxJf3Oc=.2da660b7-a5c1-40e1-81af-8dc814e199ca@github.com> On Sun, 17 Jan 2021 03:57:59 GMT, Chris Plummer wrote: > See the bug for most details. A few notes here about some implementation details: > > In the `PointerLocation` class, I added more consistency w.r.t. whether or not a newline is printed. It used to for some address types, but not others. Now it always does. And if you see a comment something like the following: > > ` getTLAB().printOn(tty); // includes "\n" ` > > That's just clarifying whether or not the `printOn()` method called will include the newline. Some do and some don't, and knowing what the various `printOn()` methods do makes getting the proper inclusion of the newline easier to understand. > > I added `verbose` and `printAddress` boolean arguments to `PointerLocation.printOn()`. Currently they are always `true`. The false arguments will be used when I complete [JDK-8250801](https://bugs.openjdk.java.net/browse/JDK-8250801), which will use `PointerFinder/Location` to show what each register points to. > > The CR mentions that the main motivation for this work is for eventual replacement of the old clhsdb `whatis` command, which was implemented in javascript. It used to resolve DSO symbols, whereas `findpc` did not. The `whatis` code did this with the following: > > var dso = loadObjectContainingPC(addr); > if (dso == null) { > return ptrLoc.toString(); > } > var sym = dso.closestSymbolToPC(addr); > if (sym != null) { > return sym.name + '+' + sym.offset; > } > And now you'll see something similar in the PointerFinder code: > > loc.loadObject = cdbg.loadObjectContainingPC(a); > if (loc.loadObject != null) { > loc.nativeSymbol = loc.loadObject.closestSymbolToPC(a); > return loc; > } > Note that now that `findpc` does everything that `whatis` used to (and more), we don't really need to add a java version of `whatis`, but I'll probably do so anyway just help out people who are used to using the `whatis` command. That will be done using [JDK-8244670](https://bugs.openjdk.java.net/browse/JDK-8244670) LGTM ------------- Marked as reviewed by ysuenaga (Reviewer). 
PR: https://git.openjdk.java.net/jdk/pull/2111 From kbarrett at openjdk.java.net Wed Feb 3 00:57:02 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Wed, 3 Feb 2021 00:57:02 GMT Subject: RFR: 8258508: Merge G1RedirtyCardsQueue into qset [v2] In-Reply-To: References: Message-ID: > Please review this change to G1RedirtyCardsLocalQueueSet to directly > incorporate the associated queue, simplifying usage. > > Testing: > mach5 tier1 Kim Barrett has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision: - Merge branch 'master' into merge_redirty_queue - Merge branch 'master' into merge_redirty_queue - merge redirty cards queue into local qset ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2325/files - new: https://git.openjdk.java.net/jdk/pull/2325/files/06057eb0..fbf891ba Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2325&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2325&range=00-01 Stats: 4520 lines in 346 files changed: 2030 ins; 983 del; 1507 mod Patch: https://git.openjdk.java.net/jdk/pull/2325.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2325/head:pull/2325 PR: https://git.openjdk.java.net/jdk/pull/2325 From kbarrett at openjdk.java.net Wed Feb 3 00:57:03 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Wed, 3 Feb 2021 00:57:03 GMT Subject: Integrated: 8258508: Merge G1RedirtyCardsQueue into qset In-Reply-To: References: Message-ID: On Sat, 30 Jan 2021 10:14:42 GMT, Kim Barrett wrote: > Please review this change to G1RedirtyCardsLocalQueueSet to directly > incorporate the associated queue, simplifying usage. > > Testing: > mach5 tier1 This pull request has now been integrated. Changeset: d423d368 Author: Kim Barrett URL: https://git.openjdk.java.net/jdk/commit/d423d368 Stats: 55 lines in 5 files changed: 12 ins; 26 del; 17 mod 8258508: Merge G1RedirtyCardsQueue into qset Reviewed-by: tschatzl, iwalulya ------------- PR: https://git.openjdk.java.net/jdk/pull/2325 From iklam at openjdk.java.net Wed Feb 3 06:40:08 2021 From: iklam at openjdk.java.net (Ioi Lam) Date: Wed, 3 Feb 2021 06:40:08 GMT Subject: RFR: 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp [v2] In-Reply-To: References: Message-ID: On Tue, 2 Feb 2021 12:09:22 GMT, Stefan Karlsson wrote: >> Ioi Lam has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision: >> >> - @tschatzl and @stefank comments >> - Merge branch 'master' into 8260012-reduce-inclue-collectedHeap-heapInspection-hpp >> - 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp > > src/hotspot/share/gc/shared/memAllocator.hpp line 30: > >> 28: #include "memory/memRegion.hpp" >> 29: #include "oops/oopsHierarchy.hpp" >> 30: #include "runtime/thread.hpp" > > If we want to, this could be changed to a forward declaration if we removed the default value (Thread* thread = Thread::current()) of the constructors. Not needed for this RFE though. memAllocator.hpp is not included very often (only 65 out of ~1000 .o files), so I decided to leave it as is. 
> src/hotspot/cpu/arm/frame_arm.cpp line 518: > >> 516: obj = *(oop*)res_addr; >> 517: } >> 518: assert(obj == NULL || Universe::is_in_heap(obj), "sanity check"); > > Could have been changed to is_in_heap_or_null. Fixed > src/hotspot/cpu/ppc/frame_ppc.cpp line 308: > >> 306: case T_ARRAY: { >> 307: oop obj = *(oop*)tos_addr; >> 308: assert(obj == NULL || Universe::is_in_heap(obj), "sanity check"); > > Could have been changed to is_in_heap_or_null. Fixed. I also change other frame_.cpp files to use is_in_heap_or_null. ------------- PR: https://git.openjdk.java.net/jdk/pull/2347 From iklam at openjdk.java.net Wed Feb 3 06:40:04 2021 From: iklam at openjdk.java.net (Ioi Lam) Date: Wed, 3 Feb 2021 06:40:04 GMT Subject: RFR: 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp [v2] In-Reply-To: References: Message-ID: > collectedHeap.hpp is included by 477 out of 1000 .o files in HotSpot. This file in turn includes many other complex header files. > > In many cases, an object file only directly includes this file via: > > - memAllocator.hpp (which does not actually use collectedHeap.hpp) > - oop.inline.hpp and compressedOops.inline.hpp (only use collectedHeap.hpp in asserts via `Universe::heap()->is_in()`). > > By refactoring the above 3 files, we can reduce the .o files that include collectedHeap.hpp to 242. > > This RFE also removes the unnecessary inclusion of heapInspection.hpp from collectedHeap.hpp. > > Build time of HotSpot is reduced for about 1%. > > Tested with mach5: tier1, builds-tier2, builds-tier3, builds-tier4 and builds-tier5. Also locally: aarch64, arm, ppc64, s390, x86, and zero. Ioi Lam has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision: - @tschatzl and @stefank comments - Merge branch 'master' into 8260012-reduce-inclue-collectedHeap-heapInspection-hpp - 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2347/files - new: https://git.openjdk.java.net/jdk/pull/2347/files/a1bdc2f7..529e77e4 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2347&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2347&range=00-01 Stats: 3635 lines in 268 files changed: 1458 ins; 983 del; 1194 mod Patch: https://git.openjdk.java.net/jdk/pull/2347.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2347/head:pull/2347 PR: https://git.openjdk.java.net/jdk/pull/2347 From iklam at openjdk.java.net Wed Feb 3 06:40:11 2021 From: iklam at openjdk.java.net (Ioi Lam) Date: Wed, 3 Feb 2021 06:40:11 GMT Subject: RFR: 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp [v2] In-Reply-To: References: Message-ID: On Tue, 2 Feb 2021 12:22:50 GMT, Thomas Schatzl wrote: >> Ioi Lam has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. 
The pull request contains three additional commits since the last revision: >> >> - @tschatzl and @stefank comments >> - Merge branch 'master' into 8260012-reduce-inclue-collectedHeap-heapInspection-hpp >> - 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp > > src/hotspot/share/gc/shared/memAllocator.hpp line 30: > >> 28: #include "memory/memRegion.hpp" >> 29: #include "oops/oopsHierarchy.hpp" >> 30: #include "runtime/thread.hpp" > > `utilities/globalDefinitions.hpp` for `HeapWord` is missing. Fixed. > src/hotspot/share/oops/compressedOops.inline.hpp line 28: > >> 26: #define SHARE_OOPS_COMPRESSEDOOPS_INLINE_HPP >> 27: >> 28: #include "gc/shared/collectedHeap.hpp" > > `utilities/globalDefinitions.hpp` for `*PTR_FORMAT` and others is missing. Fixed. > src/hotspot/share/oops/oop.inline.hpp line 28: > >> 26: #define SHARE_OOPS_OOP_INLINE_HPP >> 27: >> 28: #include "gc/shared/collectedHeap.hpp" > > `utilities/globalDefinitions.hpp` for `HeapWord` is missing. > `globals.hpp` for some globals. > `oopsHierarchy.hpp` for `narrowKlass` > `utilties/debug.hpp` for `assert` Fixed. Thanks for the review. ------------- PR: https://git.openjdk.java.net/jdk/pull/2347 From shade at openjdk.java.net Wed Feb 3 08:39:42 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Wed, 3 Feb 2021 08:39:42 GMT Subject: RFR: 8260998: Shenandoah: Restore reference processing statistics reporting In-Reply-To: References: Message-ID: On Tue, 2 Feb 2021 20:49:04 GMT, Zhengyu Gu wrote: > Please review this patch that restores reporting of reference processing statistics after JDK-8254315 Looks fine to me. ------------- Marked as reviewed by shade (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2362 From tschatzl at openjdk.java.net Wed Feb 3 09:51:52 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Wed, 3 Feb 2021 09:51:52 GMT Subject: RFR: 8261023: Add comment why memory pretouch must be a store Message-ID: Hi all, may I have reviews for this additional comment that explains why `os::pretouch_memory` needs to use a store and must not use a read which would be more convenient? Basically on some (all?) OSes memory pages are only actually backed with physical memory on a store to that page. Before that a common "zero page" may be used to satisfy reads. This is not what is intended here. A previous comment (that has been removed long ago) seems to have been a bit confused about the actual issue: - // Note the use of a write here; originally we tried just a read, but - // since the value read was unused, the optimizer removed the read. - // If we ever have a concurrent touchahead thread, we'll want to use - // a read, to avoid the potential of overwriting data (if a mutator - // thread beats the touchahead thread to a page). There are various - // ways of making sure this read is not optimized away: for example, - // generating the code for a read procedure at runtime. It indicates that the reason for using a store has been that the compiler would optimize away the reads (which begs the question why a `volatile` read has not been used). Maybe these zero page optimizations came later than that original implementation though. Testing: local compilation - it's adding a comment only, really. 
Thanks, Thomas ------------- Commit messages: - Initial commit Changes: https://git.openjdk.java.net/jdk/pull/2373/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2373&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261023 Stats: 4 lines in 1 file changed: 4 ins; 0 del; 0 mod Patch: https://git.openjdk.java.net/jdk/pull/2373.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2373/head:pull/2373 PR: https://git.openjdk.java.net/jdk/pull/2373 From shade at openjdk.java.net Wed Feb 3 10:06:45 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Wed, 3 Feb 2021 10:06:45 GMT Subject: RFR: 8261023: Add comment why memory pretouch must be a store In-Reply-To: References: Message-ID: On Wed, 3 Feb 2021 09:47:04 GMT, Thomas Schatzl wrote: > Hi all, > > may I have reviews for this additional comment that explains why `os::pretouch_memory` needs to use a store and must not use a read which would be more convenient? > > Basically on some (all?) OSes memory pages are only actually backed with physical memory on a store to that page. Before that a common "zero page" may be used to satisfy reads. This is not what is intended here. > > A previous comment (that has been removed long ago) seems to have been a bit confused about the actual issue: > > - // Note the use of a write here; originally we tried just a read, but > - // since the value read was unused, the optimizer removed the read. > - // If we ever have a concurrent touchahead thread, we'll want to use > - // a read, to avoid the potential of overwriting data (if a mutator > - // thread beats the touchahead thread to a page). There are various > - // ways of making sure this read is not optimized away: for example, > - // generating the code for a read procedure at runtime. > > It indicates that the reason for using a store has been that the compiler would optimize away the reads (which begs the question why a `volatile` read has not been used). > > Maybe these zero page optimizations came later than that original implementation though. > > Testing: local compilation - it's adding a comment only, really. > > Thanks, > Thomas Looks fine. Bikeshedding suggestions below. src/hotspot/share/runtime/os.cpp line 1819: > 1817: // optimization where only writes trigger actual backing of memory. Reads > 1818: // access a single shared zero page at first and so will not achieve the > 1819: // desired effect. Consider: Note: this must be a store, not a load. On many OSes loads from the fresh memory would be satisfied from a single mapped zero page. We need to store something to each page to get them backed by their own memory, which is what we want as the effect here. ------------- Marked as reviewed by shade (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2373 From iwalulya at openjdk.java.net Wed Feb 3 10:06:45 2021 From: iwalulya at openjdk.java.net (Ivan Walulya) Date: Wed, 3 Feb 2021 10:06:45 GMT Subject: RFR: 8261023: Add comment why memory pretouch must be a store In-Reply-To: References: Message-ID: On Wed, 3 Feb 2021 09:47:04 GMT, Thomas Schatzl wrote: > Hi all, > > may I have reviews for this additional comment that explains why `os::pretouch_memory` needs to use a store and must not use a read which would be more convenient? > > Basically on some (all?) OSes memory pages are only actually backed with physical memory on a store to that page. Before that a common "zero page" may be used to satisfy reads. This is not what is intended here. 
> > A previous comment (that has been removed long ago) seems to have been a bit confused about the actual issue: > > - // Note the use of a write here; originally we tried just a read, but > - // since the value read was unused, the optimizer removed the read. > - // If we ever have a concurrent touchahead thread, we'll want to use > - // a read, to avoid the potential of overwriting data (if a mutator > - // thread beats the touchahead thread to a page). There are various > - // ways of making sure this read is not optimized away: for example, > - // generating the code for a read procedure at runtime. > > It indicates that the reason for using a store has been that the compiler would optimize away the reads (which begs the question why a `volatile` read has not been used). > > Maybe these zero page optimizations came later than that original implementation though. > > Testing: local compilation - it's adding a comment only, really. > > Thanks, > Thomas lgtm! ------------- Marked as reviewed by iwalulya (Committer). PR: https://git.openjdk.java.net/jdk/pull/2373 From tschatzl at openjdk.java.net Wed Feb 3 10:09:52 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Wed, 3 Feb 2021 10:09:52 GMT Subject: RFR: 8234534: Simplify CardTable code after CMS removal In-Reply-To: References: Message-ID: <47QCVJeDPnsUak4dH0LXGJxDmqutyQeY91MIcPwyi-Q=.19b4ed7a-750a-47a8-aee0-76427c1752cf@github.com> On Tue, 2 Feb 2021 15:13:38 GMT, Thomas Schatzl wrote: > Hi, > > can I have reviews for this cleanup that removes CMS specific code from `CardTable/CardTableRS`? > > Note that there is still this "conc_scan" parameter passed to the card table that affects barrier code generation, for some reason also G1 barrier code generation although it should not as `G1CardTable::scanned_concurrently()` only used for the "normal" card table. Initial attempts showed that removing this is not straightforward, causing crashes and so I left it out for [JDK-8250941](https://bugs.openjdk.java.net/browse/JDK-8260941) so that this change is solely about removing unused code. > > Testing: tier1-4, some tier1-5 runs earlier (before some removal of hunks for files only containing copyright updates or newline changes) (latest tier1-4 testing still stuck on linux-aarch64, but everything else passed. I think there is no particular aarch64 specific change in there...) ------------- PR: https://git.openjdk.java.net/jdk/pull/2354 From tschatzl at openjdk.java.net Wed Feb 3 10:09:52 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Wed, 3 Feb 2021 10:09:52 GMT Subject: RFR: 8234534: Simplify CardTable code after CMS removal Message-ID: Hi, can I have reviews for this cleanup that removes CMS specific code from `CardTable/CardTableRS`? Note that there is still this "conc_scan" parameter passed to the card table that affects barrier code generation, for some reason also G1 barrier code generation although it should not as `G1CardTable::scanned_concurrently()` only used for the "normal" card table. Initial attempts showed that removing this is not straightforward, causing crashes and so I left it out for [JDK-8250941](https://bugs.openjdk.java.net/browse/JDK-8260941) so that this change is solely about removing unused code. 
Testing: tier1-4, some tier1-5 runs earlier (before some removal of hunks for files only containing copyright updates or newline changes) ------------- Commit messages: - Initial commit Changes: https://git.openjdk.java.net/jdk/pull/2354/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2354&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8234534 Stats: 197 lines in 7 files changed: 0 ins; 185 del; 12 mod Patch: https://git.openjdk.java.net/jdk/pull/2354.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2354/head:pull/2354 PR: https://git.openjdk.java.net/jdk/pull/2354 From tschatzl at openjdk.java.net Wed Feb 3 10:31:44 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Wed, 3 Feb 2021 10:31:44 GMT Subject: RFR: 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp [v2] In-Reply-To: References: Message-ID: On Tue, 2 Feb 2021 12:30:51 GMT, Thomas Schatzl wrote: >> Ioi Lam has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision: >> >> - @tschatzl and @stefank comments >> - Merge branch 'master' into 8260012-reduce-inclue-collectedHeap-heapInspection-hpp >> - 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp > > Checked a few includes for missing ones; obviously they are included transitively so add as you see fit. Still good. ------------- PR: https://git.openjdk.java.net/jdk/pull/2347 From jiefu at openjdk.java.net Wed Feb 3 11:05:56 2021 From: jiefu at openjdk.java.net (Jie Fu) Date: Wed, 3 Feb 2021 11:05:56 GMT Subject: RFR: 8261028: ZGC: SIGFPE when MaxVirtMemFraction=0 Message-ID: Hi all, The SIGFPE was caused by this line [1] when MaxVirtMemFraction=0. But according to this comment [2], 0 should not be allowed for MaxVirtMemFraction. Thanks. Best regards, Jie [1] https://github.com/openjdk/jdk/blob/master/src/hotspot/share/gc/z/zAddressSpaceLimit.cpp#L51 [2] https://github.com/openjdk/jdk/blob/master/src/hotspot/share/gc/shared/gc_globals.hpp#L345 ------------- Commit messages: - 8261028: ZGC: SIGFPE when MaxVirtMemFraction=0 Changes: https://git.openjdk.java.net/jdk/pull/2374/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2374&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261028 Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod Patch: https://git.openjdk.java.net/jdk/pull/2374.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2374/head:pull/2374 PR: https://git.openjdk.java.net/jdk/pull/2374 From tschatzl at openjdk.java.net Wed Feb 3 11:28:57 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Wed, 3 Feb 2021 11:28:57 GMT Subject: RFR: 8261023: Document why memory pretouch must be a store [v2] In-Reply-To: References: Message-ID: > Hi all, > > may I have reviews for this additional comment that explains why `os::pretouch_memory` needs to use a store and must not use a read which would be more convenient? > > Basically on some (all?) OSes memory pages are only actually backed with physical memory on a store to that page. Before that a common "zero page" may be used to satisfy reads. This is not what is intended here. > > A previous comment (that has been removed long ago) seems to have been a bit confused about the actual issue: > > - // Note the use of a write here; originally we tried just a read, but > - // since the value read was unused, the optimizer removed the read. 
> - // If we ever have a concurrent touchahead thread, we'll want to use > - // a read, to avoid the potential of overwriting data (if a mutator > - // thread beats the touchahead thread to a page). There are various > - // ways of making sure this read is not optimized away: for example, > - // generating the code for a read procedure at runtime. > > It indicates that the reason for using a store has been that the compiler would optimize away the reads (which begs the question why a `volatile` read has not been used). > > Maybe these zero page optimizations came later than that original implementation though. > > Testing: local compilation - it's adding a comment only, really. > > Thanks, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: shade review ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2373/files - new: https://git.openjdk.java.net/jdk/pull/2373/files/d30fff80..2009527e Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2373&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2373&range=00-01 Stats: 4 lines in 1 file changed: 0 ins; 0 del; 4 mod Patch: https://git.openjdk.java.net/jdk/pull/2373.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2373/head:pull/2373 PR: https://git.openjdk.java.net/jdk/pull/2373 From shade at openjdk.java.net Wed Feb 3 11:32:43 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Wed, 3 Feb 2021 11:32:43 GMT Subject: RFR: 8261023: Document why memory pretouch must be a store [v2] In-Reply-To: References: Message-ID: On Wed, 3 Feb 2021 11:28:57 GMT, Thomas Schatzl wrote: >> Hi all, >> >> may I have reviews for this additional comment that explains why `os::pretouch_memory` needs to use a store and must not use a read which would be more convenient? >> >> Basically on some (all?) OSes memory pages are only actually backed with physical memory on a store to that page. Before that a common "zero page" may be used to satisfy reads. This is not what is intended here. >> >> A previous comment (that has been removed long ago) seems to have been a bit confused about the actual issue: >> >> - // Note the use of a write here; originally we tried just a read, but >> - // since the value read was unused, the optimizer removed the read. >> - // If we ever have a concurrent touchahead thread, we'll want to use >> - // a read, to avoid the potential of overwriting data (if a mutator >> - // thread beats the touchahead thread to a page). There are various >> - // ways of making sure this read is not optimized away: for example, >> - // generating the code for a read procedure at runtime. >> >> It indicates that the reason for using a store has been that the compiler would optimize away the reads (which begs the question why a `volatile` read has not been used). >> >> Maybe these zero page optimizations came later than that original implementation though. >> >> Testing: local compilation - it's adding a comment only, really. >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > shade review Marked as reviewed by shade (Reviewer). 
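A minimal sketch of the point the new comment documents (this is not HotSpot's actual os::pretouch_memory, and it assumes a page-aligned, writable range): touching each page with a store forces the OS to give it its own backing memory, while a read of untouched anonymous memory can be satisfied from a single shared zero page.

```
// Illustration only -- simplified version of a pretouch loop.
#include <cstddef>

void pretouch_memory(void* start, void* end, std::size_t page_size) {
  for (volatile char* p = static_cast<char*>(start);
       p < static_cast<char*>(end);
       p += page_size) {
    // Store: faults the page in and commits distinct physical memory for it.
    *p = 0;
    // A load (even a volatile one the compiler cannot elide) would not be
    // enough on OSes that satisfy reads of untouched anonymous memory from
    // one shared, copy-on-write zero page -- the caveat the added comment in
    // os::pretouch_memory spells out.
  }
}

int main() {
  const std::size_t page = 4096, bytes = 16 * page;
  char* buf = new char[bytes];
  pretouch_memory(buf, buf + bytes, page);   // usage example
  delete[] buf;
  return 0;
}
```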
------------- PR: https://git.openjdk.java.net/jdk/pull/2373 From stefank at openjdk.java.net Wed Feb 3 11:59:38 2021 From: stefank at openjdk.java.net (Stefan Karlsson) Date: Wed, 3 Feb 2021 11:59:38 GMT Subject: RFR: 8261028: ZGC: SIGFPE when MaxVirtMemFraction=0 In-Reply-To: References: Message-ID: On Wed, 3 Feb 2021 11:01:35 GMT, Jie Fu wrote: > Hi all, > > The SIGFPE was caused by this line [1] when MaxVirtMemFraction=0. > But according to this comment [2], 0 should not be allowed for MaxVirtMemFraction. > > Thanks. > Best regards, > Jie > > [1] https://github.com/openjdk/jdk/blob/master/src/hotspot/share/gc/z/zAddressSpaceLimit.cpp#L51 > [2] https://github.com/openjdk/jdk/blob/master/src/hotspot/share/gc/shared/gc_globals.hpp#L345 Looks good. Thanks for fixing! ------------- Marked as reviewed by stefank (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2374 From pliden at openjdk.java.net Wed Feb 3 12:06:40 2021 From: pliden at openjdk.java.net (Per Liden) Date: Wed, 3 Feb 2021 12:06:40 GMT Subject: RFR: 8261028: ZGC: SIGFPE when MaxVirtMemFraction=0 In-Reply-To: References: Message-ID: On Wed, 3 Feb 2021 11:01:35 GMT, Jie Fu wrote: > Hi all, > > The SIGFPE was caused by this line [1] when MaxVirtMemFraction=0. > But according to this comment [2], 0 should not be allowed for MaxVirtMemFraction. > > Thanks. > Best regards, > Jie > > [1] https://github.com/openjdk/jdk/blob/master/src/hotspot/share/gc/z/zAddressSpaceLimit.cpp#L51 > [2] https://github.com/openjdk/jdk/blob/master/src/hotspot/share/gc/shared/gc_globals.hpp#L345 Looks good! ------------- Marked as reviewed by pliden (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2374 From jiefu at openjdk.java.net Wed Feb 3 12:26:44 2021 From: jiefu at openjdk.java.net (Jie Fu) Date: Wed, 3 Feb 2021 12:26:44 GMT Subject: RFR: 8261028: ZGC: SIGFPE when MaxVirtMemFraction=0 In-Reply-To: References: Message-ID: On Wed, 3 Feb 2021 11:57:16 GMT, Stefan Karlsson wrote: >> Hi all, >> >> The SIGFPE was caused by this line [1] when MaxVirtMemFraction=0. >> But according to this comment [2], 0 should not be allowed for MaxVirtMemFraction. >> >> Thanks. >> Best regards, >> Jie >> >> [1] https://github.com/openjdk/jdk/blob/master/src/hotspot/share/gc/z/zAddressSpaceLimit.cpp#L51 >> [2] https://github.com/openjdk/jdk/blob/master/src/hotspot/share/gc/shared/gc_globals.hpp#L345 > > Looks good. Thanks for fixing! Thanks @stefank and @pliden for your review. Will push it tomorrow. ------------- PR: https://git.openjdk.java.net/jdk/pull/2374 From zgu at openjdk.java.net Wed Feb 3 13:19:53 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Wed, 3 Feb 2021 13:19:53 GMT Subject: Integrated: 8260998: Shenandoah: Restore reference processing statistics reporting In-Reply-To: References: Message-ID: On Tue, 2 Feb 2021 20:49:04 GMT, Zhengyu Gu wrote: > Please review this patch that restores reporting of reference processing statistics after JDK-8254315 This pull request has now been integrated. 
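Looking back at the MaxVirtMemFraction=0 fix (JDK-8261028) reviewed above, a hedged illustration of the failure mode: the flag is used as a divisor when deriving the address-space limit, so a value of 0 turns that computation into an integer division by zero (SIGFPE). The function below is only a sketch with made-up names, not the ZGC source or the actual one-line fix.

```
// Failure-mode illustration only -- not zAddressSpaceLimit.cpp.
#include <cassert>
#include <cstddef>

std::size_t address_space_limit(std::size_t total_virtual_memory,
                                std::size_t max_virt_mem_fraction) {
  // With a fraction of 0 this integer division raises SIGFPE, which is why
  // the flag's documentation says 0 should not be allowed.
  assert(max_virt_mem_fraction >= 1, "MaxVirtMemFraction must be at least 1");
  return total_virtual_memory / max_virt_mem_fraction;
}

int main() {
  // Usage sketch: cap the limit at roughly 1/2 of the available virtual memory.
  (void)address_space_limit(static_cast<std::size_t>(1) << 40, 2);
  return 0;
}
```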
Changeset: 5324b5c5 Author: Zhengyu Gu URL: https://git.openjdk.java.net/jdk/commit/5324b5c5 Stats: 20 lines in 3 files changed: 18 ins; 1 del; 1 mod 8260998: Shenandoah: Restore reference processing statistics reporting Reviewed-by: shade ------------- PR: https://git.openjdk.java.net/jdk/pull/2362 From zgu at openjdk.java.net Wed Feb 3 20:10:53 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Wed, 3 Feb 2021 20:10:53 GMT Subject: RFR: 8259647: Add support for JFR event ObjectCountAfterGC to Shenandoah Message-ID: Please review this patch that adds JFR ObjectCountAfterGC event support. AFAICT, the event is off by default. If it is enabled, it distorts Shenandoah pause characteristics, since it performs heap walk during final mark pause. When event is disabled: `[191.033s][info][gc,stats] Pause Init Mark (G) 454 us` `[191.033s][info][gc,stats] Pause Init Mark (N) 13 us` When event is enabled: `[396.631s][info][gc,stats] Pause Final Mark (G) 43199 us` `[396.631s][info][gc,stats] Pause Final Mark (N) 42982 us` Test: - [x] hotspot_gc_shenandoah ------------- Commit messages: - JDK-8259647-object_count_jfr Changes: https://git.openjdk.java.net/jdk/pull/2386/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2386&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8259647 Stats: 12 lines in 3 files changed: 6 ins; 4 del; 2 mod Patch: https://git.openjdk.java.net/jdk/pull/2386.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2386/head:pull/2386 PR: https://git.openjdk.java.net/jdk/pull/2386 From jiefu at openjdk.java.net Thu Feb 4 00:08:54 2021 From: jiefu at openjdk.java.net (Jie Fu) Date: Thu, 4 Feb 2021 00:08:54 GMT Subject: Integrated: 8261028: ZGC: SIGFPE when MaxVirtMemFraction=0 In-Reply-To: References: Message-ID: On Wed, 3 Feb 2021 11:01:35 GMT, Jie Fu wrote: > Hi all, > > The SIGFPE was caused by this line [1] when MaxVirtMemFraction=0. > But according to this comment [2], 0 should not be allowed for MaxVirtMemFraction. > > Thanks. > Best regards, > Jie > > [1] https://github.com/openjdk/jdk/blob/master/src/hotspot/share/gc/z/zAddressSpaceLimit.cpp#L51 > [2] https://github.com/openjdk/jdk/blob/master/src/hotspot/share/gc/shared/gc_globals.hpp#L345 This pull request has now been integrated. Changeset: e2516e41 Author: Jie Fu URL: https://git.openjdk.java.net/jdk/commit/e2516e41 Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod 8261028: ZGC: SIGFPE when MaxVirtMemFraction=0 Reviewed-by: stefank, pliden ------------- PR: https://git.openjdk.java.net/jdk/pull/2374 From iklam at openjdk.java.net Thu Feb 4 02:00:07 2021 From: iklam at openjdk.java.net (Ioi Lam) Date: Thu, 4 Feb 2021 02:00:07 GMT Subject: RFR: 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp [v3] In-Reply-To: References: Message-ID: > collectedHeap.hpp is included by 477 out of 1000 .o files in HotSpot. This file in turn includes many other complex header files. > > In many cases, an object file only directly includes this file via: > > - memAllocator.hpp (which does not actually use collectedHeap.hpp) > - oop.inline.hpp and compressedOops.inline.hpp (only use collectedHeap.hpp in asserts via `Universe::heap()->is_in()`). > > By refactoring the above 3 files, we can reduce the .o files that include collectedHeap.hpp to 242. > > This RFE also removes the unnecessary inclusion of heapInspection.hpp from collectedHeap.hpp. > > Build time of HotSpot is reduced for about 1%. > > Tested with mach5: tier1, builds-tier2, builds-tier3, builds-tier4 and builds-tier5. 
Also locally: aarch64, arm, ppc64, s390, x86, and zero. Ioi Lam has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: - Merge branch 'master' of https://github.com/openjdk/jdk into 8260012-reduce-inclue-collectedHeap-heapInspection-hpp - @tschatzl and @stefank comments - Merge branch 'master' into 8260012-reduce-inclue-collectedHeap-heapInspection-hpp - 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2347/files - new: https://git.openjdk.java.net/jdk/pull/2347/files/529e77e4..7d9015d2 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2347&range=02 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2347&range=01-02 Stats: 2516 lines in 114 files changed: 1237 ins; 850 del; 429 mod Patch: https://git.openjdk.java.net/jdk/pull/2347.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2347/head:pull/2347 PR: https://git.openjdk.java.net/jdk/pull/2347 From iklam at openjdk.java.net Thu Feb 4 04:09:06 2021 From: iklam at openjdk.java.net (Ioi Lam) Date: Thu, 4 Feb 2021 04:09:06 GMT Subject: RFR: 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp [v4] In-Reply-To: References: Message-ID: > collectedHeap.hpp is included by 477 out of 1000 .o files in HotSpot. This file in turn includes many other complex header files. > > In many cases, an object file only directly includes this file via: > > - memAllocator.hpp (which does not actually use collectedHeap.hpp) > - oop.inline.hpp and compressedOops.inline.hpp (only use collectedHeap.hpp in asserts via `Universe::heap()->is_in()`). > > By refactoring the above 3 files, we can reduce the .o files that include collectedHeap.hpp to 242. > > This RFE also removes the unnecessary inclusion of heapInspection.hpp from collectedHeap.hpp. > > Build time of HotSpot is reduced for about 1%. > > Tested with mach5: tier1, builds-tier2, builds-tier3, builds-tier4 and builds-tier5. Also locally: aarch64, arm, ppc64, s390, x86, and zero. Ioi Lam has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. 
The pull request contains five additional commits since the last revision: - Merge branch 'master' into 8260012-reduce-inclue-collectedHeap-heapInspection-hpp - Merge branch 'master' of https://github.com/openjdk/jdk into 8260012-reduce-inclue-collectedHeap-heapInspection-hpp - @tschatzl and @stefank comments - Merge branch 'master' into 8260012-reduce-inclue-collectedHeap-heapInspection-hpp - 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2347/files - new: https://git.openjdk.java.net/jdk/pull/2347/files/7d9015d2..cfd70b3c Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2347&range=03 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2347&range=02-03 Stats: 2645 lines in 56 files changed: 2497 ins; 69 del; 79 mod Patch: https://git.openjdk.java.net/jdk/pull/2347.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2347/head:pull/2347 PR: https://git.openjdk.java.net/jdk/pull/2347 From iklam at openjdk.java.net Thu Feb 4 04:09:07 2021 From: iklam at openjdk.java.net (Ioi Lam) Date: Thu, 4 Feb 2021 04:09:07 GMT Subject: Integrated: 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp In-Reply-To: References: Message-ID: On Tue, 2 Feb 2021 04:18:24 GMT, Ioi Lam wrote: > collectedHeap.hpp is included by 477 out of 1000 .o files in HotSpot. This file in turn includes many other complex header files. > > In many cases, an object file only directly includes this file via: > > - memAllocator.hpp (which does not actually use collectedHeap.hpp) > - oop.inline.hpp and compressedOops.inline.hpp (only use collectedHeap.hpp in asserts via `Universe::heap()->is_in()`). > > By refactoring the above 3 files, we can reduce the .o files that include collectedHeap.hpp to 242. > > This RFE also removes the unnecessary inclusion of heapInspection.hpp from collectedHeap.hpp. > > Build time of HotSpot is reduced for about 1%. > > Tested with mach5: tier1, builds-tier2, builds-tier3, builds-tier4 and builds-tier5. Also locally: aarch64, arm, ppc64, s390, x86, and zero. This pull request has now been integrated. Changeset: 82028e70 Author: Ioi Lam URL: https://git.openjdk.java.net/jdk/commit/82028e70 Stats: 110 lines in 60 files changed: 69 ins; 7 del; 34 mod 8260012: Reduce inclusion of collectedHeap.hpp and heapInspection.hpp Reviewed-by: stefank, tschatzl ------------- PR: https://git.openjdk.java.net/jdk/pull/2347 From ayang at openjdk.java.net Thu Feb 4 10:15:41 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Thu, 4 Feb 2021 10:15:41 GMT Subject: RFR: 8234534: Simplify CardTable code after CMS removal In-Reply-To: References: Message-ID: On Tue, 2 Feb 2021 15:13:38 GMT, Thomas Schatzl wrote: > Hi, > > can I have reviews for this cleanup that removes CMS specific code from `CardTable/CardTableRS`? > > Note that there is still this "conc_scan" parameter passed to the card table that affects barrier code generation, for some reason also G1 barrier code generation although it should not as `G1CardTable::scanned_concurrently()` only used for the "normal" card table. Initial attempts showed that removing this is not straightforward, causing crashes and so I left it out for [JDK-8250941](https://bugs.openjdk.java.net/browse/JDK-8260941) so that this change is solely about removing unused code. 
> > Testing: tier1-4, some tier1-5 runs earlier (before some removal of hunks for files only containing copyright updates or newline changes) Marked as reviewed by ayang (Author). src/hotspot/share/gc/shared/cardTableRS.cpp line 442: > 440: CardTable(whole_heap, scanned_concurrently) { } > 441: > 442: CardTableRS::~CardTableRS() { } Now that it's empty, is it possible to remove it completely? src/hotspot/share/gc/shared/cardTableRS.hpp line 55: > 53: virtual void verify_used_region_at_save_marks(Space* sp) const NOT_DEBUG_RETURN; > 54: > 55: void inline_write_ref_field_gc(void* field, oop new_val) { It seems that the arg `new_val` is not used. Maybe remove it or add a comment saying it's an intentional omission. ------------- PR: https://git.openjdk.java.net/jdk/pull/2354 From kbarrett at openjdk.java.net Thu Feb 4 10:31:41 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Thu, 4 Feb 2021 10:31:41 GMT Subject: RFR: 8234534: Simplify CardTable code after CMS removal In-Reply-To: References: Message-ID: On Tue, 2 Feb 2021 15:13:38 GMT, Thomas Schatzl wrote: > Hi, > > can I have reviews for this cleanup that removes CMS specific code from `CardTable/CardTableRS`? > > Note that there is still this "conc_scan" parameter passed to the card table that affects barrier code generation, for some reason also G1 barrier code generation although it should not as `G1CardTable::scanned_concurrently()` only used for the "normal" card table. Initial attempts showed that removing this is not straightforward, causing crashes and so I left it out for [JDK-8250941](https://bugs.openjdk.java.net/browse/JDK-8260941) so that this change is solely about removing unused code. > > Testing: tier1-4, some tier1-5 runs earlier (before some removal of hunks for files only containing copyright updates or newline changes) Looks good to me, with the one minor nit I commented on and Albert's suggestions. src/hotspot/share/gc/shared/cardTableRS.cpp line 43: > 41: inline bool ClearNoncleanCardWrapper::clear_card(CardValue* entry) { > 42: CardValue entry_val = *entry; > 43: assert(entry_val == CardTableRS::dirty_card_val(), Consider eliminating `entry_val` - just use `*entry` in the assert. ------------- Marked as reviewed by kbarrett (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2354 From jiefu at openjdk.java.net Thu Feb 4 11:00:49 2021 From: jiefu at openjdk.java.net (Jie Fu) Date: Thu, 4 Feb 2021 11:00:49 GMT Subject: [jdk16] RFR: 8260473: [vector] ZGC: VectorReshape test produces incorrect results with ZGC enabled In-Reply-To: References: <5OfnHC5N00VVv3pWcU9gsAHa23RbAAX7ReEw9Ct6eug=.4f095083-7050-487d-94e0-3befce6744c5@github.com> <_Wm-fi9j4TZ41F0G_92f7ioKQeDNgZiOEMmLkZ0lvvE=.0a9beba5-5089-4368-b4bc-73faf9d5e858@github.com> <226iFOsl1hXrEoSe9uzgBb1Z75wxQEv5azlJIfzCO4k=.69d5ed3a-7337-472d-b106-1ce2e5d361bf@github.com> Message-ID: On Tue, 2 Feb 2021 01:58:56 GMT, Jie Fu wrote: > Good. Please, file a follow-up RFE to improve the test. The RFE has been filed here: https://bugs.openjdk.java.net/browse/JDK-8261152 Thanks. 
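[Editor's note] To make the "conc_scan" remark in the CardTable thread above more concrete: the flag matters because a card table that is scanned concurrently (as CMS used to do during precleaning) needs the reference store to be visible before the card is dirtied, which forces extra ordering into the generated post-barrier. A rough, hedged sketch follows; the names are illustrative and this is not the exact barrier-set code.

// Simplified pseudocode of a card-table post-barrier, showing where
// CardTable::scanned_concurrently() changes the generated code.
void card_table_post_barrier(CardTable* ct, void* field) {
  volatile CardTable::CardValue* card = ct->byte_for(field);
  if (ct->scanned_concurrently()) {
    // A concurrent scanner may act on the dirty card immediately, so the
    // reference store must not be reordered past the card mark.
    OrderAccess::storestore();
  }
  *card = CardTable::dirty_card_val();   // a plain store suffices for STW-only scanning
}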
------------- PR: https://git.openjdk.java.net/jdk16/pull/139 From tschatzl at openjdk.java.net Thu Feb 4 13:50:41 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Thu, 4 Feb 2021 13:50:41 GMT Subject: RFR: 8261023: Document why memory pretouch must be a store [v2] In-Reply-To: References: Message-ID: On Wed, 3 Feb 2021 11:29:40 GMT, Aleksey Shipilev wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> shade review > > Marked as reviewed by shade (Reviewer). Thanks @shipilev @walulyai for your reviews. ------------- PR: https://git.openjdk.java.net/jdk/pull/2373 From tschatzl at openjdk.java.net Thu Feb 4 13:50:43 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Thu, 4 Feb 2021 13:50:43 GMT Subject: Integrated: 8261023: Document why memory pretouch must be a store In-Reply-To: References: Message-ID: On Wed, 3 Feb 2021 09:47:04 GMT, Thomas Schatzl wrote: > Hi all, > > may I have reviews for this additional comment that explains why `os::pretouch_memory` needs to use a store and must not use a read which would be more convenient? > > Basically on some (all?) OSes memory pages are only actually backed with physical memory on a store to that page. Before that a common "zero page" may be used to satisfy reads. This is not what is intended here. > > A previous comment (that has been removed long ago) seems to have been a bit confused about the actual issue: > > - // Note the use of a write here; originally we tried just a read, but > - // since the value read was unused, the optimizer removed the read. > - // If we ever have a concurrent touchahead thread, we'll want to use > - // a read, to avoid the potential of overwriting data (if a mutator > - // thread beats the touchahead thread to a page). There are various > - // ways of making sure this read is not optimized away: for example, > - // generating the code for a read procedure at runtime. > > It indicates that the reason for using a store has been that the compiler would optimize away the reads (which begs the question why a `volatile` read has not been used). > > Maybe these zero page optimizations came later than that original implementation though. > > Testing: local compilation - it's adding a comment only, really. > > Thanks, > Thomas This pull request has now been integrated. Changeset: be772ffa Author: Thomas Schatzl URL: https://git.openjdk.java.net/jdk/commit/be772ffa Stats: 4 lines in 1 file changed: 4 ins; 0 del; 0 mod 8261023: Document why memory pretouch must be a store Reviewed-by: shade, iwalulya ------------- PR: https://git.openjdk.java.net/jdk/pull/2373 From tschatzl at openjdk.java.net Thu Feb 4 13:56:58 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Thu, 4 Feb 2021 13:56:58 GMT Subject: RFR: 8234534: Simplify CardTable code after CMS removal [v2] In-Reply-To: References: Message-ID: > Hi, > > can I have reviews for this cleanup that removes CMS specific code from `CardTable/CardTableRS`? > > Note that there is still this "conc_scan" parameter passed to the card table that affects barrier code generation, for some reason also G1 barrier code generation although it should not as `G1CardTable::scanned_concurrently()` only used for the "normal" card table. Initial attempts showed that removing this is not straightforward, causing crashes and so I left it out for [JDK-8250941](https://bugs.openjdk.java.net/browse/JDK-8260941) so that this change is solely about removing unused code. 
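[Editor's note] Returning to the pretouch discussion above, the point is easiest to see in code. This is a minimal, hedged sketch of the idea only, not the actual os::pretouch_memory, which also handles alignment and platform details.

#include <cstddef>

// Touch one word per page with a *store* so the OS must back every page with
// real memory. A read is not sufficient: the kernel may satisfy reads from a
// shared zero page without committing anything. Assumes the range was freshly
// committed and zero-filled, so storing 0 cannot clobber live data.
static void pretouch(char* start, char* end, size_t page_size) {
  for (volatile char* p = start; p < end; p += page_size) {
    *p = 0;
  }
}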
> > Testing: tier1-4, some tier1-5 runs earlier (before some removal of hunks for files only containing copyright updates or newline changes) Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: kimbarret, albertnetymk review ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2354/files - new: https://git.openjdk.java.net/jdk/pull/2354/files/5aa23d74..849c79bb Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2354&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2354&range=00-01 Stats: 11 lines in 4 files changed: 0 ins; 6 del; 5 mod Patch: https://git.openjdk.java.net/jdk/pull/2354.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2354/head:pull/2354 PR: https://git.openjdk.java.net/jdk/pull/2354 From tschatzl at openjdk.java.net Thu Feb 4 13:56:59 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Thu, 4 Feb 2021 13:56:59 GMT Subject: RFR: 8234534: Simplify CardTable code after CMS removal [v2] In-Reply-To: References: Message-ID: <33XHcZDMFLFqOngnBQUpiuaQ_VlxfZ9HPhinJoDGIYY=.838ade60-1bc8-43c7-98d9-9d8c21ba3d26@github.com> On Thu, 4 Feb 2021 10:29:18 GMT, Kim Barrett wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> kimbarret, albertnetymk review > > Looks good to me, with the one minor nit I commented on and Albert's suggestions. All fixed as suggested. Still compiles. ------------- PR: https://git.openjdk.java.net/jdk/pull/2354 From github.com+71722661+earthling-amzn at openjdk.java.net Thu Feb 4 18:20:40 2021 From: github.com+71722661+earthling-amzn at openjdk.java.net (earthling-amzn) Date: Thu, 4 Feb 2021 18:20:40 GMT Subject: RFR: 8259647: Add support for JFR event ObjectCountAfterGC to Shenandoah In-Reply-To: References: Message-ID: On Wed, 3 Feb 2021 20:05:33 GMT, Zhengyu Gu wrote: > Please review this patch that adds JFR ObjectCountAfterGC event support. > > AFAICT, the event is off by default. If it is enabled, it distorts Shenandoah pause characteristics, since it performs heap walk during final mark pause. > > When event is disabled: > `[191.033s][info][gc,stats] Pause Init Mark (G) 454 us` > `[191.033s][info][gc,stats] Pause Init Mark (N) 13 us` > > When event is enabled: > `[396.631s][info][gc,stats] Pause Final Mark (G) 43199 us` > `[396.631s][info][gc,stats] Pause Final Mark (N) 42982 us` > > Test: > - [x] hotspot_gc_shenandoah That certainly is bad news for pause times. Do you think it'd be feasible to "piggyback" the object count calculation on concurrent marking? Might address https://bugs.openjdk.java.net/browse/JDK-8258431 also. ------------- PR: https://git.openjdk.java.net/jdk/pull/2386 From rkennke at openjdk.java.net Thu Feb 4 18:46:41 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Thu, 4 Feb 2021 18:46:41 GMT Subject: RFR: 8259647: Add support for JFR event ObjectCountAfterGC to Shenandoah In-Reply-To: References: Message-ID: On Thu, 4 Feb 2021 18:17:46 GMT, earthling-amzn wrote: > That certainly is bad news for pause times. Do you think it'd be feasible to "piggyback" the object count calculation on concurrent marking? Might address https://bugs.openjdk.java.net/browse/JDK-8258431 also. This could certainly be done, in a similar fashion as liveness counting. However, it would have to be done such that it only actually counts objects when JFR is requesting it, and otherwise stays out of the way, because this costs marking performance. 
Which means doubling the number of mark-loops, and select the correct loop based on whether or not we need object counts. ------------- PR: https://git.openjdk.java.net/jdk/pull/2386 From zgu at openjdk.java.net Thu Feb 4 19:22:39 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Thu, 4 Feb 2021 19:22:39 GMT Subject: RFR: 8259647: Add support for JFR event ObjectCountAfterGC to Shenandoah In-Reply-To: References: Message-ID: On Thu, 4 Feb 2021 18:44:24 GMT, Roman Kennke wrote: > That certainly is bad news for pause times. Do you think it'd be feasible to "piggyback" the object count calculation on concurrent marking? Might address https://bugs.openjdk.java.net/browse/JDK-8258431 also. It dose not just count number of objects, but number of objects by type, much more than liveness counting. Just add a branch in hot marking loop, I can foresee negative impact on performance. ------------- PR: https://git.openjdk.java.net/jdk/pull/2386 From github.com+71722661+earthling-amzn at openjdk.java.net Thu Feb 4 19:45:41 2021 From: github.com+71722661+earthling-amzn at openjdk.java.net (earthling-amzn) Date: Thu, 4 Feb 2021 19:45:41 GMT Subject: RFR: 8259647: Add support for JFR event ObjectCountAfterGC to Shenandoah In-Reply-To: References: Message-ID: On Thu, 4 Feb 2021 19:19:35 GMT, Zhengyu Gu wrote: >>> That certainly is bad news for pause times. Do you think it'd be feasible to "piggyback" the object count calculation on concurrent marking? Might address https://bugs.openjdk.java.net/browse/JDK-8258431 also. >> >> This could certainly be done, in a similar fashion as liveness counting. However, it would have to be done such that it only actually counts objects when JFR is requesting it, and otherwise stays out of the way, because this costs marking performance. Which means doubling the number of mark-loops, and select the correct loop based on whether or not we need object counts. > >> That certainly is bad news for pause times. Do you think it'd be feasible to "piggyback" the object count calculation on concurrent marking? Might address https://bugs.openjdk.java.net/browse/JDK-8258431 also. > > It dose not just count number of objects, but number of objects by type, much more than liveness counting. Just add a branch in hot marking loop, I can foresee negative impact on performance. Would it be possible to combine the object _counting_ closure and the object _marking_ closure into one aggregate closure and complete both calculations in one pass over the live objects? Of course, only do this when the JFR event is enabled (and even then, perhaps only do it periodically). ------------- PR: https://git.openjdk.java.net/jdk/pull/2386 From rkennke at openjdk.java.net Thu Feb 4 19:45:42 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Thu, 4 Feb 2021 19:45:42 GMT Subject: RFR: 8259647: Add support for JFR event ObjectCountAfterGC to Shenandoah In-Reply-To: References: Message-ID: On Thu, 4 Feb 2021 19:19:35 GMT, Zhengyu Gu wrote: > > That certainly is bad news for pause times. Do you think it'd be feasible to "piggyback" the object count calculation on concurrent marking? Might address https://bugs.openjdk.java.net/browse/JDK-8258431 also. > > It dose not just count number of objects, but number of objects by type, much more than liveness counting. Just add a branch in hot marking loop, I can foresee negative impact on performance. Yes, as I suggested earlier, I'd only turn it on when requested by JFR, and otherwise leave it off. It definitely will impact performance. 
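[Editor's note] A minimal sketch of what such a JFR-gated, template-specialized mark loop could look like; the names and types are hypothetical, not Shenandoah's actual closures.

// Specialize the hot loop on a compile-time flag so the common, non-counting
// path pays nothing; the choice is made once, outside the loop, based on
// whether the ObjectCountAfterGC event is enabled.
template <bool COUNT_OBJECTS>
void mark_loop(MarkStack& stack, KlassCountTable* counts) {
  oop obj;
  while (stack.pop(&obj)) {
    if (COUNT_OBJECTS) {                    // constant-folded away when false
      counts->record(obj->klass(), obj->size());
    }
    mark_and_push_references(obj, stack);   // the normal marking work
  }
}

void run_marking(MarkStack& stack, KlassCountTable* counts, bool count_objects) {
  if (count_objects) {
    mark_loop<true>(stack, counts);
  } else {
    mark_loop<false>(stack, nullptr);
  }
}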
That means another set of mark loops that we need to generate at compile-time. ------------- PR: https://git.openjdk.java.net/jdk/pull/2386 From kbarrett at openjdk.java.net Fri Feb 5 07:07:43 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Fri, 5 Feb 2021 07:07:43 GMT Subject: RFR: 8259862: MutableSpace's end should be atomic In-Reply-To: References: Message-ID: On Sat, 30 Jan 2021 12:37:50 GMT, Albert Mingkun Yang wrote: >> Please review this change to MutableSpace, making its _end member volatile >> and using Atomic operations to access the _top and _end members. Some >> unused accessor functions that would otherwise need updating are removed. >> >> Testing: >> mach5 tier1 > > src/hotspot/share/gc/parallel/mutableSpace.hpp line 62: > >> 60: HeapWord* _bottom; >> 61: HeapWord* volatile _top; >> 62: HeapWord* volatile _end; > > Maybe add some comments explaining how `_top` and `_end` are used in the concurrent setting. I've added some comments describing `_bottom`, `_top`, and `_end`. ------------- PR: https://git.openjdk.java.net/jdk/pull/2323 From kbarrett at openjdk.java.net Fri Feb 5 07:27:58 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Fri, 5 Feb 2021 07:27:58 GMT Subject: RFR: 8259862: MutableSpace's end should be atomic [v2] In-Reply-To: References: Message-ID: <8ATJD3ux-1gG56DPWs2XwN_C-aEpNGMxGS1rQvcN9cA=.aa1f3e41-4943-4b06-8838-a96243f56d8c@github.com> > Please review this change to MutableSpace, making its _end member volatile > and using Atomic operations to access the _top and _end members. Some > unused accessor functions that would otherwise need updating are removed. > > Testing: > mach5 tier1 Kim Barrett has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: - describe _top and _end - reinstate end_addr() after JDK-8259778 - Merge branch 'master' into atomic_end - make _end volatile and use atomic access ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2323/files - new: https://git.openjdk.java.net/jdk/pull/2323/files/a091498c..823879a0 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2323&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2323&range=00-01 Stats: 13119 lines in 648 files changed: 7935 ins; 2932 del; 2252 mod Patch: https://git.openjdk.java.net/jdk/pull/2323.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2323/head:pull/2323 PR: https://git.openjdk.java.net/jdk/pull/2323 From kbarrett at openjdk.java.net Fri Feb 5 07:27:59 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Fri, 5 Feb 2021 07:27:59 GMT Subject: Integrated: 8259862: MutableSpace's end should be atomic In-Reply-To: References: Message-ID: On Sat, 30 Jan 2021 05:51:38 GMT, Kim Barrett wrote: > Please review this change to MutableSpace, making its _end member volatile > and using Atomic operations to access the _top and _end members. Some > unused accessor functions that would otherwise need updating are removed. > > Testing: > mach5 tier1 This pull request has now been integrated. 
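[Editor's note] For context on why MutableSpace's _top and _end want atomic access: the space is bump-allocated by multiple GC threads using a CAS, along the lines of the hedged sketch below. This is simplified and is not the exact MutableSpace code.

// Lock-free bump-pointer allocation over a space whose _top and _end are
// volatile and accessed through Atomic. _end can move under concurrent
// expansion, so it is loaded explicitly rather than read as a plain field.
HeapWord* cas_allocate(size_t word_size) {
  while (true) {
    HeapWord* obj = Atomic::load(&_top);
    HeapWord* end = Atomic::load(&_end);
    if (obj >= end || pointer_delta(end, obj) < word_size) {
      return nullptr;                                  // does not fit
    }
    HeapWord* new_top = obj + word_size;
    if (Atomic::cmpxchg(&_top, obj, new_top) == obj) {
      return obj;                                      // won the race
    }
    // Lost the race to another GC thread; retry with the updated _top.
  }
}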
Changeset: 1e0a1013 Author: Kim Barrett URL: https://git.openjdk.java.net/jdk/commit/1e0a1013 Stats: 27 lines in 4 files changed: 7 ins; 12 del; 8 mod 8259862: MutableSpace's end should be atomic Make _end volatile and use atomic access Reviewed-by: ayang, tschatzl ------------- PR: https://git.openjdk.java.net/jdk/pull/2323 From tschatzl at openjdk.java.net Fri Feb 5 08:36:40 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Fri, 5 Feb 2021 08:36:40 GMT Subject: RFR: 8234534: Simplify CardTable code after CMS removal [v2] In-Reply-To: References: Message-ID: On Thu, 4 Feb 2021 10:29:18 GMT, Kim Barrett wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> kimbarret, albertnetymk review > > Looks good to me, with the one minor nit I commented on and Albert's suggestions. Thanks @kimbarrett @albertnetymk for your reviews. ------------- PR: https://git.openjdk.java.net/jdk/pull/2354 From tschatzl at openjdk.java.net Fri Feb 5 08:36:41 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Fri, 5 Feb 2021 08:36:41 GMT Subject: Integrated: 8234534: Simplify CardTable code after CMS removal In-Reply-To: References: Message-ID: On Tue, 2 Feb 2021 15:13:38 GMT, Thomas Schatzl wrote: > Hi, > > can I have reviews for this cleanup that removes CMS specific code from `CardTable/CardTableRS`? > > Note that there is still this "conc_scan" parameter passed to the card table that affects barrier code generation, for some reason also G1 barrier code generation although it should not as `G1CardTable::scanned_concurrently()` is only used for the "normal" card table. Initial attempts showed that removing this is not straightforward, causing crashes and so I left it out for [JDK-8250941](https://bugs.openjdk.java.net/browse/JDK-8260941) so that this change is solely about removing unused code. > > Testing: tier1-4, some tier1-5 runs earlier (before some removal of hunks for files only containing copyright updates or newline changes) This pull request has now been integrated. Changeset: 78b0d327 Author: Thomas Schatzl URL: https://git.openjdk.java.net/jdk/commit/78b0d327 Stats: 205 lines in 9 files changed: 0 ins; 191 del; 14 mod 8234534: Simplify CardTable code after CMS removal Reviewed-by: ayang, kbarrett ------------- PR: https://git.openjdk.java.net/jdk/pull/2354 From manc at google.com Fri Feb 5 08:47:10 2021 From: manc at google.com (Man Cao) Date: Fri, 5 Feb 2021 00:47:10 -0800 Subject: State of "simplified barriers" for G1 In-Reply-To: References: Message-ID: Hi All, My apology for postponing this. I've been busy rolling out JDK 11 to all our production servers for the last year. The current state is that the OpenJDK GC team and we have determined to implement https://bugs.openjdk.java.net/browse/JDK-8226731 first, before committing the simplified write barrier. We'd like to get rid of the storeload fence even with Conc Refine enabled. Note that JDK-8230187 contains the most up-to-date description for the proposed simplified write barrier; JDK-8226197 is a bit outdated. I am targeting both JDK-8226731 and JDK-8230187 for JDK 17. I'll send a separate email for JDK-8226731, as there are still some challenges there. Yude, thanks for sharing the idea and results! I think it is best to open a new RFE for further improvement after JDK-8230187 is implemented. If I understand correctly, the proposed approach avoids dirtying the cards for old-to-old reference stores in young-only phases. That's a nice idea.
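[Editor's note] For readers following this thread, the code below is roughly the current G1 post-write barrier under discussion, in hedged pseudocode; the real barrier is emitted by the compilers and the names here are illustrative. It shows the filtering, the StoreLoad fence, and the refinement enqueue that the simplified-barrier work aims to cut down.

// Simplified pseudocode of the G1 post-write barrier for "field = new_val".
void g1_post_write_barrier(oop* field, oop new_val, Thread* thread) {
  if (same_region(field, new_val)) return;          // cross-region filter
  if (new_val == nullptr) return;                   // null filter
  CardValue* card = card_for(field);
  if (*card == g1_young_card_val()) return;         // stores into young need no remset
  OrderAccess::storeload();                         // the fence this thread wants to remove
  if (*card != dirty_card_val()) {
    *card = dirty_card_val();
    enqueue_dirty_card(thread, card);               // hand the card to concurrent refinement
  }
}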
Are the results comparing the two types of simplified write barriers? Or is for comparing the default barrier with the storeload fence, vs your simplified write barrier that filters untracked regions? -Man On Tue, Dec 22, 2020 at 2:31 AM ??? wrote: > Hi All, > > We are also interested in any follow-ups on this topic. If I recall > correctly, when this was discussed in JDK-8226197, one of the TODOs was > that the storeload fence can be skipped when Conc Refine is turned off. > Regarding this, I'd like to share an idea we have been experimenting in the > last couple of months. We took "skipping the fence" a little further and > tried to improve the throughput with less harm to pause time. > > This is from the observation that many card dirtying operations can go > away without concurrent refine. More specifically, writes that produce a > reference OldObj1.foo->OldObj2 need not dirty the card corresponding to > OldObj1 during young-gc-only phase. Currently, with Conc Refine, this > operation will dirty that card, then the card will be refined (thrown away) > by the refinement thread, because it discovers that the reference points to > an Old region, which is "untracked" during young-gc-only phase. > > The refinement thread does this concurrently so that GC doesn't have to do > it during a pause. But we (~lmao) realized that we can use a flag to > indicate whether a region is tracked, and discard the card dirtying > operation immediately in the barrier (after testing against the flag). We > can do it without any atomics/fences, just ~5 instructions in the barrier. > This way, we get rid of the storeload mem barrier, with Conc Refine turned > off, while still getting the same pause time guarantee in young-gc-only > phase. But as you can see, Mixed GCs still suffer from having no concurrent > refinement. > > We saw improvements on Alibaba JDK11u across the benchmarks we used > (positive number means better): > Dacapo: cases vary from -3.3% to +5.1%, on average +0.3% > specjbb2015 on 96x2.50GHz, 16 GC threads, 24g mem: critical-jOPS +1.9%, > max-jOPS +2.8% > specjbb2015 on 8x2.50GHz, 8 GC threads, 16g mem (observed more Mixed GCs): > critical-jOPS +0.1%, max-jOPS +5.7% > specjvm2008: cases vary from -0.7% to +23.4%, on average +3.1% > Extremem: cases vary from -2.1% to +7.8%, on average +1.0% > I'd love to hear any feedbacks, comments, what problems you can see in > this approach, conceptually or practically, and back to the topic, whether > this idea can be incorporated into your future work/plan of creating a > simplified barrier. > > Yude Lin > > > ------------------------------------------------------------------ > ????Gerhard Hueller > ?????2020?12?21?(???) 03:19 > ????hotspot-gc-dev at openjdk.java.net > ? ??State of "simplified barriers" for G1 > > Hi, > > I remember a slide deck talking about the improvements to G1 since JDK8/9 > and one bullet point on the todo-list was simplified barriers for G1. > > I wonder what happened to this improvement, has it been already > implemented? Is this the non-concurrent refinement option implemented by > google some time ago? > Improvements in this area would be really great, CMS still provides better > throughput for most workloads - with the only real advantage of G1 does > offer are avoiding those degenerated STW full GCs. 
> > Thanks, Gerhard From yude.lyd at alibaba-inc.com Fri Feb 5 09:39:15 2021 From: yude.lyd at alibaba-inc.com (=?UTF-8?B?5p6X6IKy5b63?=) Date: Fri, 05 Feb 2021 17:39:15 +0800 Subject: =?UTF-8?B?UmU6IFN0YXRlIG9mICJzaW1wbGlmaWVkIGJhcnJpZXJzIiBmb3IgRzE=?= In-Reply-To: References: , Message-ID: <1005b1ba-e5f9-401c-887c-6f607c9db5f6.yude.lyd@alibaba-inc.com> Thanks Man, I'm glad to hear the updates. I will follow JDK-8230187 closely. I think it is best to open a new RFE for further improvement after JDK-8230187 is implemented. I will take this approach. If I understand correctly, the proposed approach avoids dirtying the cards for old-to-old reference stores in young-only phases. That is correct. Are the results comparing the two types of simplified write barriers? Or is for comparing the default barrier with the storeload fence, vs your simplified write barrier that filters untracked regions? We compared the default barrier (with storeload fence, concurrent refine on) vs untracked region filter (with no storeload fence, concurrent refine off). Yude ------------------------------------------------------------------ From:Man Cao Send Time:2021?2?5?(???) 16:47 To:hotspot-gc-dev at openjdk.java.net Cc:???(??) Subject:Re: State of "simplified barriers" for G1 Hi All, My apology for postponing this. I've been busy rolling out JDK 11 to all our production servers for the last year. The current state is that the OpenJDK GC team and us have determined to implement https://bugs.openjdk.java.net/browse/JDK-8226731 first, before committing the simplified write barrier. We'd like to get rid of the storeload fence even with Conc Refine enabled. Note that JDK-8230187 contains the most up-to-date description for the proposed simplified writer barrier, JDK-8226197 is a bit outdated. I target to get both JDK-8226731 and JDK-8230187 in JDK 17. I'll send a separate email for JDK-8226731, as there are still some challenges there. Yude, thanks for sharing the ideal and results! I think it is best to open a new RFE for further improvement after JDK-8230187 is implemented. If I understand correctly, the proposed approach avoids dirtying the cards for old-to-old reference stores in young-only phases. That's a nice idea. Are the results comparing the two types of simplified write barriers? Or is for comparing the default barrier with the storeload fence, vs your simplified write barrier that filters untracked regions? -Man On Tue, Dec 22, 2020 at 2:31 AM ??? wrote: Hi All, We are also interested in any follow-ups on this topic. If I recall correctly, when this was discussed in JDK-8226197, one of the TODOs was that the storeload fence can be skipped when Conc Refine is turned off. Regarding this, I'd like to share an idea we have been experimenting in the last couple of months. We took "skipping the fence" a little further and tried to improve the throughput with less harm to pause time. This is from the observation that many card dirtying operations can go away without concurrent refine. More specifically, writes that produce a reference OldObj1.foo->OldObj2 need not dirty the card corresponding to OldObj1 during young-gc-only phase. Currently, with Conc Refine, this operation will dirty that card, then the card will be refined (thrown away) by the refinement thread, because it discovers that the reference points to an Old region, which is "untracked" during young-gc-only phase. The refinement thread does this concurrently so that GC doesn't have to do it during a pause. 
But we (~lmao) realized that we can use a flag to indicate whether a region is tracked, and discard the card dirtying operation immediately in the barrier (after testing against the flag). We can do it without any atomics/fences, just ~5 instructions in the barrier. This way, we get rid of the storeload mem barrier, with Conc Refine turned off, while still getting the same pause time guarantee in young-gc-only phase. But as you can see, Mixed GCs still suffer from having no concurrent refinement. We saw improvements on Alibaba JDK11u across the benchmarks we used (positive number means better): Dacapo: cases vary from -3.3% to +5.1%, on average +0.3% specjbb2015 on 96x2.50GHz, 16 GC threads, 24g mem: critical-jOPS +1.9%, max-jOPS +2.8% specjbb2015 on 8x2.50GHz, 8 GC threads, 16g mem (observed more Mixed GCs): critical-jOPS +0.1%, max-jOPS +5.7% specjvm2008: cases vary from -0.7% to +23.4%, on average +3.1% Extremem: cases vary from -2.1% to +7.8%, on average +1.0% I'd love to hear any feedbacks, comments, what problems you can see in this approach, conceptually or practically, and back to the topic, whether this idea can be incorporated into your future work/plan of creating a simplified barrier. Yude Lin ------------------------------------------------------------------ ????Gerhard Hueller ?????2020?12?21?(???) 03:19 ????hotspot-gc-dev at openjdk.java.net ????State of "simplified barriers" for G1 Hi, I remember a slide deck talking about the improvements to G1 since JDK8/9 and one bullet point on the todo-list was simplified barriers for G1. I wonder what happened to this improvement, has it been already implemented? Is this the non-concurrent refinement option implemented by google some time ago? Improvements in this area would be really great, CMS still provides better throughput for most workloads - with the only real advantage of G1 does offer are avoiding those degenerated STW full GCs. Thanks, Gerhard From kbarrett at openjdk.java.net Fri Feb 5 10:14:01 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Fri, 5 Feb 2021 10:14:01 GMT Subject: RFR: 8261213: [BACKOUT] MutableSpace's end should be atomic Message-ID: This reverts commit 1e0a1013efcb3983d277134f04f5e38f687e88c5. Please review this backout of JDK-8259862: MutableSpace's end should be atomic With that change: gc/TestVerifyDuringStartup.java with -XX:+UseParallelGC -XX:-UseNUMA fails with: # guarantee(false) failed: inline contiguous allocation not supported Testing: Locally (linux-x64) verified the failure is reproducible. Locally (linux-x64) verified no failure with the backout applied. 
Locally (linux-x64) hotspot:tier1 with -XX:+UseParallelGC -XX:-UseNUMA is fine except for two serviceability tests that always fail locally with UseParallelGC mach5 tier1-2 (in progress) ------------- Commit messages: - Revert "8259862: MutableSpace's end should be atomic" Changes: https://git.openjdk.java.net/jdk/pull/2426/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2426&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261213 Stats: 27 lines in 4 files changed: 12 ins; 7 del; 8 mod Patch: https://git.openjdk.java.net/jdk/pull/2426.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2426/head:pull/2426 PR: https://git.openjdk.java.net/jdk/pull/2426 From tschatzl at openjdk.java.net Fri Feb 5 10:14:01 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Fri, 5 Feb 2021 10:14:01 GMT Subject: RFR: 8261213: [BACKOUT] MutableSpace's end should be atomic In-Reply-To: References: Message-ID: On Fri, 5 Feb 2021 10:03:56 GMT, Kim Barrett wrote: > This reverts commit 1e0a1013efcb3983d277134f04f5e38f687e88c5. > > Please review this backout of > JDK-8259862: MutableSpace's end should be atomic > > With that change: > gc/TestVerifyDuringStartup.java with -XX:+UseParallelGC -XX:-UseNUMA fails with: > # guarantee(false) failed: inline contiguous allocation not supported > > Testing: > Locally (linux-x64) verified the failure is reproducible. > Locally (linux-x64) verified no failure with the backout applied. > Locally (linux-x64) hotspot:tier1 with -XX:+UseParallelGC -XX:-UseNUMA > is fine except for two serviceability tests that always fail locally with > UseParallelGC > mach5 tier1-2 (in progress) Lgtm. ------------- Marked as reviewed by tschatzl (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2426 From ayang at openjdk.java.net Fri Feb 5 10:14:01 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Fri, 5 Feb 2021 10:14:01 GMT Subject: RFR: 8261213: [BACKOUT] MutableSpace's end should be atomic In-Reply-To: References: Message-ID: On Fri, 5 Feb 2021 10:03:56 GMT, Kim Barrett wrote: > This reverts commit 1e0a1013efcb3983d277134f04f5e38f687e88c5. > > Please review this backout of > JDK-8259862: MutableSpace's end should be atomic > > With that change: > gc/TestVerifyDuringStartup.java with -XX:+UseParallelGC -XX:-UseNUMA fails with: > # guarantee(false) failed: inline contiguous allocation not supported > > Testing: > Locally (linux-x64) verified the failure is reproducible. > Locally (linux-x64) verified no failure with the backout applied. > Locally (linux-x64) hotspot:tier1 with -XX:+UseParallelGC -XX:-UseNUMA > is fine except for two serviceability tests that always fail locally with > UseParallelGC > mach5 tier1-2 (in progress) Marked as reviewed by ayang (Author). ------------- PR: https://git.openjdk.java.net/jdk/pull/2426 From tschatzl at openjdk.java.net Fri Feb 5 10:14:02 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Fri, 5 Feb 2021 10:14:02 GMT Subject: RFR: 8261213: [BACKOUT] MutableSpace's end should be atomic In-Reply-To: References: Message-ID: On Fri, 5 Feb 2021 10:10:25 GMT, Albert Mingkun Yang wrote: >> This reverts commit 1e0a1013efcb3983d277134f04f5e38f687e88c5. >> >> Please review this backout of >> JDK-8259862: MutableSpace's end should be atomic >> >> With that change: >> gc/TestVerifyDuringStartup.java with -XX:+UseParallelGC -XX:-UseNUMA fails with: >> # guarantee(false) failed: inline contiguous allocation not supported >> >> Testing: >> Locally (linux-x64) verified the failure is reproducible. 
>> Locally (linux-x64) verified no failure with the backout applied. >> Locally (linux-x64) hotspot:tier1 with -XX:+UseParallelGC -XX:-UseNUMA >> is fine except for two serviceability tests that always fail locally with >> UseParallelGC >> mach5 tier1-2 (in progress) > > Marked as reviewed by ayang (Author). Since this looks like a clean backout, please push asap. ------------- PR: https://git.openjdk.java.net/jdk/pull/2426 From kbarrett at openjdk.java.net Fri Feb 5 10:21:42 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Fri, 5 Feb 2021 10:21:42 GMT Subject: Integrated: 8261213: [BACKOUT] MutableSpace's end should be atomic In-Reply-To: References: Message-ID: On Fri, 5 Feb 2021 10:03:56 GMT, Kim Barrett wrote: > This reverts commit 1e0a1013efcb3983d277134f04f5e38f687e88c5. > > Please review this backout of > JDK-8259862: MutableSpace's end should be atomic > > With that change: > gc/TestVerifyDuringStartup.java with -XX:+UseParallelGC -XX:-UseNUMA fails with: > # guarantee(false) failed: inline contiguous allocation not supported > > Testing: > Locally (linux-x64) verified the failure is reproducible. > Locally (linux-x64) verified no failure with the backout applied. > Locally (linux-x64) hotspot:tier1 with -XX:+UseParallelGC -XX:-UseNUMA > is fine except for two serviceability tests that always fail locally with > UseParallelGC > mach5 tier1-2 (in progress) This pull request has now been integrated. Changeset: 224c166c Author: Kim Barrett URL: https://git.openjdk.java.net/jdk/commit/224c166c Stats: 27 lines in 4 files changed: 12 ins; 7 del; 8 mod 8261213: [BACKOUT] MutableSpace's end should be atomic Reviewed-by: tschatzl, ayang ------------- PR: https://git.openjdk.java.net/jdk/pull/2426 From thomas.schatzl at oracle.com Fri Feb 5 12:18:18 2021 From: thomas.schatzl at oracle.com (Thomas Schatzl) Date: Fri, 5 Feb 2021 13:18:18 +0100 Subject: State of "simplified barriers" for G1 In-Reply-To: References: Message-ID: <156634fe-ae7e-2eda-8fc5-51b288369e9d@oracle.com> Hi all, sorry for chiming in so late answer, due to holidays and email server move that email thread got lost. On 05.02.21 09:47, Man Cao wrote: > Hi All, > > My apology for postponing this. I've been busy rolling out JDK 11 to all > our production servers for the last year. [...] > and JDK-8230187 in JDK 17. I'll send a separate email for JDK-8226731, as > there are still some challenges there. Great to hear! > > Yude, thanks for sharing the ideal and results! I think it is best to open > a new RFE for further improvement after JDK-8230187 is implemented. > If I understand correctly, the proposed approach avoids dirtying the cards > for old-to-old reference stores in young-only phases. That's a nice idea. > Are the results comparing the two types of simplified write barriers? Or is > for comparing the default barrier with the storeload fence, vs your > simplified write barrier that filters untracked regions? > > -Man > > > On Tue, Dec 22, 2020 at 2:31 AM ??? wrote: > >> Hi All, >> >> We are also interested in any follow-ups on this topic. If I recall >> correctly, when this was discussed in JDK-8226197, one of the TODOs was >> that the storeload fence can be skipped when Conc Refine is turned off. >> Regarding this, I'd like to share an idea we have been experimenting in the >> last couple of months. We took "skipping the fence" a little further and >> tried to improve the throughput with less harm to pause time. 
>> >> This is from the observation that many card dirtying operations can go >> away without concurrent refine. More specifically, writes that produce a >> reference OldObj1.foo->OldObj2 need not dirty the card corresponding to >> OldObj1 during young-gc-only phase. Currently, with Conc Refine, this >> operation will dirty that card, then the card will be refined (thrown away) >> by the refinement thread, because it discovers that the reference points to >> an Old region, which is "untracked" during young-gc-only phase. >> >> The refinement thread does this concurrently so that GC doesn't have to do >> it during a pause. But we (~lmao) realized that we can use a flag to >> indicate whether a region is tracked, and discard the card dirtying >> operation immediately in the barrier (after testing against the flag). We >> can do it without any atomics/fences, just ~5 instructions in the barrier. >> This way, we get rid of the storeload mem barrier, with Conc Refine turned >> off, while still getting the same pause time guarantee in young-gc-only >> phase. But as you can see, Mixed GCs still suffer from having no concurrent >> refinement. >> >> We saw improvements on Alibaba JDK11u across the benchmarks we used >> (positive number means better): >> Dacapo: cases vary from -3.3% to +5.1%, on average +0.3% >> specjbb2015 on 96x2.50GHz, 16 GC threads, 24g mem: critical-jOPS +1.9%, >> max-jOPS +2.8% >> specjbb2015 on 8x2.50GHz, 8 GC threads, 16g mem (observed more Mixed GCs): >> critical-jOPS +0.1%, max-jOPS +5.7% >> specjvm2008: cases vary from -0.7% to +23.4%, on average +3.1% >> Extremem: cases vary from -2.1% to +7.8%, on average +1.0% >> I'd love to hear any feedbacks, comments, what problems you can see in >> this approach, conceptually or practically, and back to the topic, whether >> this idea can be incorporated into your future work/plan of creating a >> simplified barrier. Fwiw, this sounds what I was trying when I was working on remembered sets and barriers for something like G1. From what I remember these changes yielded mixed results (for DaCapo and other small benchmarks with contemporary desktop machines) similar to yours so it has been dropped at that time (and the comparison point you gave is not clear, and I do not remember what I compared exactly). Basically there has been a table containing a word whether we track outgoing (i.e. what the "young" marks on the card table currently do) or incoming references (i.e. whether the region needs remembered set updates), which sounds very similar to what you have done. If concurrent refinement is turned off you do not need the storeload - but then it can be advantageous to avoid dirtying cards as much as possible to decrease work during gc, this is correct. Also, as you might have noticed from CRs being filed we are actively thinking about improving the current barriers wrt to code size (e.g. JDK-8256279, JDK-8256282, ... not sure if everything has been filed yet what we thought of) and general footprint (e.g. 
refactoring the PtrQueues, dropping some TLS data to make room for other data to decrease code size) >> >> Yude Lin >> Thanks, Thomas From shade at openjdk.java.net Fri Feb 5 13:27:43 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Fri, 5 Feb 2021 13:27:43 GMT Subject: RFR: 8260736: Shenandoah: Cleanup includes in ShenandoahGC and families [v2] In-Reply-To: References: Message-ID: On Mon, 1 Feb 2021 21:25:54 GMT, Zhengyu Gu wrote: >> 8260736: Shenandoah: Cleanup includes in ShenandoahGC and families > > Zhengyu Gu has updated the pull request incrementally with one additional commit since the last revision: > > Added back vmThread.hpp Looks fine. But please pull from recent master to see if other `#include` work breaks these. ------------- Marked as reviewed by shade (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2339 From zgu at openjdk.java.net Fri Feb 5 14:15:56 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Fri, 5 Feb 2021 14:15:56 GMT Subject: RFR: 8260736: Shenandoah: Cleanup includes in ShenandoahGC and families [v3] In-Reply-To: References: Message-ID: > 8260736: Shenandoah: Cleanup includes in ShenandoahGC and families Zhengyu Gu has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains six commits: - Merge master - Added back vmThread.hpp - update - Merge branch 'master' into JDK-8260736-cleanup-includes-gc - update - init ------------- Changes: https://git.openjdk.java.net/jdk/pull/2339/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2339&range=02 Stats: 35 lines in 10 files changed: 2 ins; 28 del; 5 mod Patch: https://git.openjdk.java.net/jdk/pull/2339.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2339/head:pull/2339 PR: https://git.openjdk.java.net/jdk/pull/2339 From yude.lyd at alibaba-inc.com Fri Feb 5 15:18:59 2021 From: yude.lyd at alibaba-inc.com (=?UTF-8?B?5p6X6IKy5b63?=) Date: Fri, 05 Feb 2021 23:18:59 +0800 Subject: =?UTF-8?B?UmU6IFN0YXRlIG9mICJzaW1wbGlmaWVkIGJhcnJpZXJzIiBmb3IgRzE=?= In-Reply-To: <156634fe-ae7e-2eda-8fc5-51b288369e9d@oracle.com> References: , <156634fe-ae7e-2eda-8fc5-51b288369e9d@oracle.com> Message-ID: Hi Thomas, I think we are talking about very similar idea. Thanks for the feedbacks. I understand completely if we are talking about code size. We recently did an experiment where we found barrier code size is the major reason behind the performance gap between G1 and CMS (Flink on Nexmark just fyi). That's why we use a technique similar to JDK-8245464 to reduce the "filter" code to just ~17 bytes for C2. For C1, it can be put in the stub. But yeah it can still be too much especially when compared to a CMS-style barrier. Yude Lin ------------------------------------------------------------------ From:Thomas Schatzl Send Time:2021?2?5?(???) 20:19 To:hotspot-gc-dev Subject:Re: State of "simplified barriers" for G1 Hi all, sorry for chiming in so late answer, due to holidays and email server move that email thread got lost. On 05.02.21 09:47, Man Cao wrote: > Hi All, > > My apology for postponing this. I've been busy rolling out JDK 11 to all > our production servers for the last year. [...] > and JDK-8230187 in JDK 17. I'll send a separate email for JDK-8226731, as > there are still some challenges there. Great to hear! > > Yude, thanks for sharing the ideal and results! I think it is best to open > a new RFE for further improvement after JDK-8230187 is implemented. 
> If I understand correctly, the proposed approach avoids dirtying the cards > for old-to-old reference stores in young-only phases. That's a nice idea. > Are the results comparing the two types of simplified write barriers? Or is > for comparing the default barrier with the storeload fence, vs your > simplified write barrier that filters untracked regions? > > -Man > > > On Tue, Dec 22, 2020 at 2:31 AM ??? wrote: > >> Hi All, >> >> We are also interested in any follow-ups on this topic. If I recall >> correctly, when this was discussed in JDK-8226197, one of the TODOs was >> that the storeload fence can be skipped when Conc Refine is turned off. >> Regarding this, I'd like to share an idea we have been experimenting in the >> last couple of months. We took "skipping the fence" a little further and >> tried to improve the throughput with less harm to pause time. >> >> This is from the observation that many card dirtying operations can go >> away without concurrent refine. More specifically, writes that produce a >> reference OldObj1.foo->OldObj2 need not dirty the card corresponding to >> OldObj1 during young-gc-only phase. Currently, with Conc Refine, this >> operation will dirty that card, then the card will be refined (thrown away) >> by the refinement thread, because it discovers that the reference points to >> an Old region, which is "untracked" during young-gc-only phase. >> >> The refinement thread does this concurrently so that GC doesn't have to do >> it during a pause. But we (~lmao) realized that we can use a flag to >> indicate whether a region is tracked, and discard the card dirtying >> operation immediately in the barrier (after testing against the flag). We >> can do it without any atomics/fences, just ~5 instructions in the barrier. >> This way, we get rid of the storeload mem barrier, with Conc Refine turned >> off, while still getting the same pause time guarantee in young-gc-only >> phase. But as you can see, Mixed GCs still suffer from having no concurrent >> refinement. >> >> We saw improvements on Alibaba JDK11u across the benchmarks we used >> (positive number means better): >> Dacapo: cases vary from -3.3% to +5.1%, on average +0.3% >> specjbb2015 on 96x2.50GHz, 16 GC threads, 24g mem: critical-jOPS +1.9%, >> max-jOPS +2.8% >> specjbb2015 on 8x2.50GHz, 8 GC threads, 16g mem (observed more Mixed GCs): >> critical-jOPS +0.1%, max-jOPS +5.7% >> specjvm2008: cases vary from -0.7% to +23.4%, on average +3.1% >> Extremem: cases vary from -2.1% to +7.8%, on average +1.0% >> I'd love to hear any feedbacks, comments, what problems you can see in >> this approach, conceptually or practically, and back to the topic, whether >> this idea can be incorporated into your future work/plan of creating a >> simplified barrier. Fwiw, this sounds what I was trying when I was working on remembered sets and barriers for something like G1. From what I remember these changes yielded mixed results (for DaCapo and other small benchmarks with contemporary desktop machines) similar to yours so it has been dropped at that time (and the comparison point you gave is not clear, and I do not remember what I compared exactly). Basically there has been a table containing a word whether we track outgoing (i.e. what the "young" marks on the card table currently do) or incoming references (i.e. whether the region needs remembered set updates), which sounds very similar to what you have done. 
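[Editor's note] A hedged sketch of the "tracked/untracked" filter being described here; the names are hypothetical and details such as where the flag lives differ between the experiments. With concurrent refinement off, the barrier consults a per-region flag for the referenced region and skips the card dirtying entirely when that region collects no remembered-set entries, for example old regions during the young-only phase.

// Illustrative only: post-barrier with an "untracked region" filter and no
// StoreLoad fence (concurrent refinement disabled). Remaining dirty cards are
// scanned during the pause instead of being refined concurrently.
void filtered_post_write_barrier(oop* field, oop new_val) {
  if (new_val == nullptr) return;
  if (same_region(field, new_val)) return;
  if (!region_of(new_val)->rem_set_is_tracked()) {  // e.g. old target in the young-only phase
    return;                                         // roughly a load, a test and a branch
  }
  CardValue* card = card_for(field);
  if (*card != dirty_card_val()) {
    *card = dirty_card_val();                       // no fence, no refinement enqueue
  }
}

The trade-off noted in the thread remains: mixed collections pay for the missing concurrent refinement during the pause.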
If concurrent refinement is turned off you do not need the storeload - but then it can be advantageous to avoid dirtying cards as much as possible to decrease work during gc, this is correct. Also, as you might have noticed from CRs being filed we are actively thinking about improving the current barriers wrt to code size (e.g. JDK-8256279, JDK-8256282, ... not sure if everything has been filed yet what we thought of) and general footprint (e.g. refactoring the PtrQueues, dropping some TLS data to make room for other data to decrease code size) >> >> Yude Lin >> Thanks, Thomas From rkennke at openjdk.java.net Fri Feb 5 15:49:44 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Fri, 5 Feb 2021 15:49:44 GMT Subject: RFR: 8260736: Shenandoah: Cleanup includes in ShenandoahGC and families [v3] In-Reply-To: References: Message-ID: On Fri, 5 Feb 2021 14:15:56 GMT, Zhengyu Gu wrote: >> 8260736: Shenandoah: Cleanup includes in ShenandoahGC and families > > Zhengyu Gu has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains six commits: > > - Merge master > - Added back vmThread.hpp > - update > - Merge branch 'master' into JDK-8260736-cleanup-includes-gc > - update > - init Looks good to me! Thanks! ------------- Marked as reviewed by rkennke (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2339 From rkennke at openjdk.java.net Fri Feb 5 18:26:54 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Fri, 5 Feb 2021 18:26:54 GMT Subject: RFR: 8261251: Shenandoah: Use object size for full GC humongous compaction Message-ID: Currently, copying objects in full GC humongous object comaction copies the full region. We can limit that to copying only the object size and save some wasted cycles. Also, this fixes a test failure with loom where object copy checks that the given size matches the object size. - [x] hotspot_gc_shenandoah - [ ] tier1(+Shenandoah) ------------- Commit messages: - 8261251: Shenandoah: Use object size for full GC humongous compaction Changes: https://git.openjdk.java.net/jdk/pull/2433/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2433&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261251 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2433.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2433/head:pull/2433 PR: https://git.openjdk.java.net/jdk/pull/2433 From zgu at openjdk.java.net Fri Feb 5 19:33:44 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Fri, 5 Feb 2021 19:33:44 GMT Subject: Integrated: 8260736: Shenandoah: Cleanup includes in ShenandoahGC and families In-Reply-To: References: Message-ID: On Mon, 1 Feb 2021 18:55:03 GMT, Zhengyu Gu wrote: > 8260736: Shenandoah: Cleanup includes in ShenandoahGC and families This pull request has now been integrated. 
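[Editor's note] A hedged sketch of the effect of the humongous-compaction change described in the 8261251 request above; this is illustrative, not the actual one-line patch. The point is to copy obj->size() words rather than the full region backing the object.

// Slide a humongous object to its new location during full GC compaction.
// Copying is bounded by the object's own size; the tail of the region that
// the object does not occupy is never touched.
void move_humongous(oop obj, HeapWord* old_start, HeapWord* new_start) {
  size_t copy_words = obj->size();                       // object size, not region size
  Copy::aligned_conjoint_words(old_start, new_start, copy_words);
  // ...followed by the usual post-copy fixups (mark word, region metadata).
}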
Changeset: 7a6c1768 Author: Zhengyu Gu URL: https://git.openjdk.java.net/jdk/commit/7a6c1768 Stats: 35 lines in 10 files changed: 2 ins; 28 del; 5 mod 8260736: Shenandoah: Cleanup includes in ShenandoahGC and families Reviewed-by: shade, rkennke ------------- PR: https://git.openjdk.java.net/jdk/pull/2339 From aph at openjdk.java.net Sat Feb 6 10:37:41 2021 From: aph at openjdk.java.net (Andrew Haley) Date: Sat, 6 Feb 2021 10:37:41 GMT Subject: RFR: 8261251: Shenandoah: Use object size for full GC humongous compaction In-Reply-To: References: Message-ID: On Fri, 5 Feb 2021 18:21:55 GMT, Roman Kennke wrote: > Currently, copying objects in full GC humongous object comaction copies the full region. We can limit that to copying only the object size and save some wasted cycles. Also, this fixes a test failure with loom where object copy checks that the given size matches the object size. > > - [x] hotspot_gc_shenandoah > - [x] tier1(+Shenandoah) Marked as reviewed by aph (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2433 From shade at openjdk.java.net Mon Feb 8 07:31:45 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Mon, 8 Feb 2021 07:31:45 GMT Subject: RFR: 8261251: Shenandoah: Use object size for full GC humongous compaction In-Reply-To: References: Message-ID: On Fri, 5 Feb 2021 18:21:55 GMT, Roman Kennke wrote: > Currently, copying objects in full GC humongous object comaction copies the full region. We can limit that to copying only the object size and save some wasted cycles. Also, this fixes a test failure with loom where object copy checks that the given size matches the object size. > > - [x] hotspot_gc_shenandoah > - [x] tier1(+Shenandoah) Marked as reviewed by shade (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2433 From rkennke at openjdk.java.net Mon Feb 8 08:04:43 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 8 Feb 2021 08:04:43 GMT Subject: Integrated: 8261251: Shenandoah: Use object size for full GC humongous compaction In-Reply-To: References: Message-ID: On Fri, 5 Feb 2021 18:21:55 GMT, Roman Kennke wrote: > Currently, copying objects in full GC humongous object comaction copies the full region. We can limit that to copying only the object size and save some wasted cycles. Also, this fixes a test failure with loom where object copy checks that the given size matches the object size. > > - [x] hotspot_gc_shenandoah > - [x] tier1(+Shenandoah) This pull request has now been integrated. Changeset: deb0544f Author: Roman Kennke URL: https://git.openjdk.java.net/jdk/commit/deb0544f Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod 8261251: Shenandoah: Use object size for full GC humongous compaction Reviewed-by: aph, shade ------------- PR: https://git.openjdk.java.net/jdk/pull/2433 From aph at redhat.com Mon Feb 8 18:14:19 2021 From: aph at redhat.com (Andrew Haley) Date: Mon, 8 Feb 2021 18:14:19 +0000 Subject: Atomic operations: your thoughts are welocme Message-ID: I've been looking at the hottest Atomic operations in HotSpot, with a view to finding out if the default memory_order_conservative (which is very expensive on some architectures) can be weakened to something less. It's impossible to fix all of them, but perhaps we can fix some of the most frequent. These are the hottest compare-and-swap uses in HotSpot, with the count at the end of each line. : :: = 16406757 This one is already memory_order_relaxed, so no problem. 
::Table::oop_oop_iterate(G1CMOopClosure*, oopDesc*, Klass*)+336>: :: = 3903178 This is actually MarkBitMap::par_mark calling BitMap::par_set_bit. Does this need to be memory_order_conservative, or would something weaker do? Even acq_rel or seq_cst would be better. : :: = 2376632 : :: = 2003895 I can't imagine that either of these actually need memory_order_conservative, they're just reference counts. : :: = 1719614 BitMap::par_set_bit again. , (MEMFLAGS)5>*)+432>: :: = 1617659 This one is GenericTaskQueue::pop_global calling cmpxchg_age(). Again, do we need conservative here? There is, I suppose, always a possibility that some code somewhere is taking advantage of the memory serializing properties of adjusting refcounts, I suppose. Thanks, -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From ayang at openjdk.java.net Mon Feb 8 18:45:55 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Mon, 8 Feb 2021 18:45:55 GMT Subject: RFR: 8261356: Clean up enum G1Mark Message-ID: After removing the effectively dead entry in `G1Mark`, the whole enum could be turned into a bool. The call-chain is updated and existing comments are revised. ------------- Commit messages: - bool Changes: https://git.openjdk.java.net/jdk/pull/2461/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2461&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261356 Stats: 29 lines in 4 files changed: 1 ins; 11 del; 17 mod Patch: https://git.openjdk.java.net/jdk/pull/2461.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2461/head:pull/2461 PR: https://git.openjdk.java.net/jdk/pull/2461 From rkennke at redhat.com Mon Feb 8 19:37:09 2021 From: rkennke at redhat.com (Roman Kennke) Date: Mon, 8 Feb 2021 20:37:09 +0100 Subject: Where do obj-array-element-classes get checked for marking? Message-ID: <00ea5816-9af2-9075-bd0f-13cff9a0cbf2@redhat.com> Hello friends, I need your help: We have a testcase that is failing only very rarely, and hard to reproduce, but we have caught a hs_err file: https://bugs.openjdk.java.net/browse/JDK-8261341 as far as we can tell, we see an object which has it's Klass* damaged because the class has been unloaded earlier. The testcase generates lots of Class and then lots of empty (!) arrays of that type. Which means that the only way that the class is referenced is via the element-type of objArrayKlass. array->objArrayKlass->element_type->_java_mirror Now I'm wondering how this is supposed to be found during marking. For example, the ObjArrayKlass::oop_oop_iterate() only checks the array-klass, but not the element-klass: template void ObjArrayKlass::oop_oop_iterate(oop obj, OopClosureType* closure) { assert (obj->is_array(), "obj must be array"); objArrayOop a = objArrayOop(obj); if (Devirtualizer::do_metadata(closure)) { Devirtualizer::do_klass(closure, obj->klass()); } oop_oop_iterate_elements(a, closure); } And lacking any actual elements, the _element_klass seems the only way we could reach those classes. do_klass() only fetches the CLD and marks through that, but that wouldn't reach the element-klass either. 
Do we need something like: template void ObjArrayKlass::oop_oop_iterate(oop obj, OopClosureType* closure) { assert (obj->is_array(), "obj must be array"); objArrayOop a = objArrayOop(obj); if (Devirtualizer::do_metadata(closure)) { Devirtualizer::do_klass(closure, obj->klass()); Devirtualizer::do_klass(closure, a->element_klass()); <-- check element-klass here? } oop_oop_iterate_elements(a, closure); } What do you think? Thanks, Roman From stefan.karlsson at oracle.com Mon Feb 8 20:36:01 2021 From: stefan.karlsson at oracle.com (Stefan Karlsson) Date: Mon, 8 Feb 2021 21:36:01 +0100 Subject: Where do obj-array-element-classes get checked for marking? In-Reply-To: <00ea5816-9af2-9075-bd0f-13cff9a0cbf2@redhat.com> References: <00ea5816-9af2-9075-bd0f-13cff9a0cbf2@redhat.com> Message-ID: Hi Roman, On 2021-02-08 20:37, Roman Kennke wrote: > Hello friends, > > I need your help: > > We have a testcase that is failing only very rarely, and hard to > reproduce, but we have caught a hs_err file: > > https://bugs.openjdk.java.net/browse/JDK-8261341 > > as far as we can tell, we see an object which has it's Klass* damaged > because the class has been unloaded earlier. > > The testcase generates lots of Class and then > lots of empty (!) arrays of that type. Which means that the only way > that the class is referenced is via the element-type of objArrayKlass. > > array->objArrayKlass->element_type->_java_mirror > > Now I'm wondering how this is supposed to be found during marking. For > example, the ObjArrayKlass::oop_oop_iterate() only checks the > array-klass, but not the element-klass: > > template > void ObjArrayKlass::oop_oop_iterate(oop obj, OopClosureType* closure) { > ? assert (obj->is_array(), "obj must be array"); > ? objArrayOop a = objArrayOop(obj); > > ? if (Devirtualizer::do_metadata(closure)) { > ??? Devirtualizer::do_klass(closure, obj->klass()); > ? } > > ? oop_oop_iterate_elements(a, closure); > } > > And lacking any actual elements, the _element_klass seems the only way > we could reach those classes. do_klass() only fetches the CLD and > marks through that, but that wouldn't reach the element-klass either. > > Do we need something like: > > template > void ObjArrayKlass::oop_oop_iterate(oop obj, OopClosureType* closure) { > ? assert (obj->is_array(), "obj must be array"); > ? objArrayOop a = objArrayOop(obj); > > ? if (Devirtualizer::do_metadata(closure)) { > ??? Devirtualizer::do_klass(closure, obj->klass()); > ??? Devirtualizer::do_klass(closure, a->element_klass()); <-- check > element-klass here? > ? } > > ? oop_oop_iterate_elements(a, closure); > } > > What do you think? They are supposed to be found through: array->objArrayKlass->_class_loader_data->_handles The "handles block" contains a bunch of references to objects that are being kept alive by the class loader. On of those oops should be the java mirror of the element klass. There also seems to be a reference from the mirror to its "component mirror" that gets installed here: java_lang_Class::create_mirror ... ????? // Two-way link between the array klass and its component mirror: ????? // (array_klass) k -> mirror -> component_mirror -> array_klass -> k ????? 
set_component_mirror(mirror(), comp_mirror()); I think this should make it possible to also find the component mirror from: array->objArrayKlass->_java_mirror->_component_mirror HTH, StefanK > > Thanks, > Roman > From rkennke at redhat.com Mon Feb 8 22:03:54 2021 From: rkennke at redhat.com (Roman Kennke) Date: Mon, 8 Feb 2021 23:03:54 +0100 Subject: Where do obj-array-element-classes get checked for marking? In-Reply-To: References: <00ea5816-9af2-9075-bd0f-13cff9a0cbf2@redhat.com> Message-ID: >> I need your help: >> >> We have a testcase that is failing only very rarely, and hard to >> reproduce, but we have caught a hs_err file: >> >> https://bugs.openjdk.java.net/browse/JDK-8261341 >> >> as far as we can tell, we see an object which has it's Klass* damaged >> because the class has been unloaded earlier. >> >> The testcase generates lots of Class and then >> lots of empty (!) arrays of that type. Which means that the only way >> that the class is referenced is via the element-type of objArrayKlass. >> >> array->objArrayKlass->element_type->_java_mirror >> >> Now I'm wondering how this is supposed to be found during marking. For >> example, the ObjArrayKlass::oop_oop_iterate() only checks the >> array-klass, but not the element-klass: >> >> template >> void ObjArrayKlass::oop_oop_iterate(oop obj, OopClosureType* closure) { >> ? assert (obj->is_array(), "obj must be array"); >> ? objArrayOop a = objArrayOop(obj); >> >> ? if (Devirtualizer::do_metadata(closure)) { >> ??? Devirtualizer::do_klass(closure, obj->klass()); >> ? } >> >> ? oop_oop_iterate_elements(a, closure); >> } >> >> And lacking any actual elements, the _element_klass seems the only way >> we could reach those classes. do_klass() only fetches the CLD and >> marks through that, but that wouldn't reach the element-klass either. >> >> Do we need something like: >> >> template >> void ObjArrayKlass::oop_oop_iterate(oop obj, OopClosureType* closure) { >> ? assert (obj->is_array(), "obj must be array"); >> ? objArrayOop a = objArrayOop(obj); >> >> ? if (Devirtualizer::do_metadata(closure)) { >> ??? Devirtualizer::do_klass(closure, obj->klass()); >> ??? Devirtualizer::do_klass(closure, a->element_klass()); <-- check >> element-klass here? >> ? } >> >> ? oop_oop_iterate_elements(a, closure); >> } >> >> What do you think? > > They are supposed to be found through: > array->objArrayKlass->_class_loader_data->_handles > > The "handles block" contains a bunch of references to objects that are > being kept alive by the class loader. On of those oops should be the > java mirror of the element klass. > > There also seems to be a reference from the mirror to its "component > mirror" that gets installed here: > java_lang_Class::create_mirror > ... > ????? // Two-way link between the array klass and its component mirror: > ????? // (array_klass) k -> mirror -> component_mirror -> array_klass -> k > ????? set_component_mirror(mirror(), comp_mirror()); > > I think this should make it possible to also find the component mirror > from: > array->objArrayKlass->_java_mirror->_component_mirror Ok, thanks! That explains it. 
Cheers, Roman From kbarrett at openjdk.java.net Mon Feb 8 23:46:01 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Mon, 8 Feb 2021 23:46:01 GMT Subject: RFR: 8260044: Parallel GC: Concurrent allocation after heap expansion may cause unnecessary full gc [v3] In-Reply-To: References: Message-ID: > Please review this change to ParallelGC to avoid unnecessary full GCs when > concurrent threads attempt oldgen allocations during evacuation. > > When a GC thread fails an oldgen allocation it expands the heap and retries > the allocation. If the second allocation attempt fails then allocation > failure is reported to the caller, which can lead to a full GC. But the > retried allocation could fail because, after expansion, some other thread > allocated enough of the available space that the retry fails. This can > happen even though there is plenty of space available, if only that retry > were to perform another expansion. > > Rather than trying to combine the allocation retry with the expansion (it's > not clear there's a way to do so without breaking invariants), we instead > simply loop on the allocation attempt + expand, until either the allocation > succeeds or the expand fails. If some other thread "steals" space from the > expanding thread and causes its next allocation attempt to fail and do > another expansion, that's functionally no different from the expanding > thread succeeding and causing the other thread to fail allocation and do the > expand instead. > > This change includes modifying PSOldGen::expand_to_reserved to return false > when there is no space available, where it previously returned true. It's > not clear why it returned true; that seems wrong, but was harmless. But it > must not do so with the new looping behavior for allocation, else it would > never terminate. > > Testing: > mach5 tier1-3, tier5 (tier2-3, 5 do a lot of ParallelGC testing) Kim Barrett has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains five additional commits since the last revision: - Merge branch 'master' into retry_alloc - avoid expand storms - Merge branch 'master' into retry_alloc - require non-zero expand size - retry failed allocation if expand succeeds ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2309/files - new: https://git.openjdk.java.net/jdk/pull/2309/files/d67d5e20..72431d39 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2309&range=02 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2309&range=01-02 Stats: 30850 lines in 869 files changed: 15950 ins; 10988 del; 3912 mod Patch: https://git.openjdk.java.net/jdk/pull/2309.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2309/head:pull/2309 PR: https://git.openjdk.java.net/jdk/pull/2309 From kbarrett at openjdk.java.net Mon Feb 8 23:46:01 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Mon, 8 Feb 2021 23:46:01 GMT Subject: RFR: 8260044: Parallel GC: Concurrent allocation after heap expansion may cause unnecessary full gc [v2] In-Reply-To: References: Message-ID: <9lEZeYegJZuAopZU3sz-Precwre1lN51Twq220f3-XA=.ebfb6885-652f-41a7-a78a-5b4a3e12e501@github.com> On Mon, 1 Feb 2021 09:55:09 GMT, Stefan Johansson wrote: >> Kim Barrett has updated the pull request incrementally with one additional commit since the last revision: >> >> require non-zero expand size > > Looks good! 
The problem being addressed here is closely related to the "expand storm" problem from JDK-8260045. I thought this one could be addressed separately first, but now think not. Consider if we do an expand with excess here that uses the remainder of the permitted space. If another thread was blocked waiting to expand, its expand attempt will fail. With the old code, there would still be another allocation attempt, but now the failing expand won't do that. Reversing the order of fixes doesn't work very well either, as avoiding the expand storm needs the same sort of infrastructure for retrying a failed allocation after (optional) expansion. New commit "avoid expand storms" adds that fix. It's a little bit kludgy because of JDK-8261284, adding a function to MutableSpace for use only by PSOldGen. It's not the only weird function or behavior in MutableSpace. Testing: mach5 tier1-3, tier5 (tiers with common tests run with ParallelGC) ------------- PR: https://git.openjdk.java.net/jdk/pull/2309 From sjohanss at openjdk.java.net Tue Feb 9 08:19:45 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Tue, 9 Feb 2021 08:19:45 GMT Subject: RFR: 8260044: Parallel GC: Concurrent allocation after heap expansion may cause unnecessary full gc [v3] In-Reply-To: References: Message-ID: On Mon, 8 Feb 2021 23:46:01 GMT, Kim Barrett wrote: >> Please review this change to ParallelGC to avoid unnecessary full GCs when >> concurrent threads attempt oldgen allocations during evacuation. >> >> When a GC thread fails an oldgen allocation it expands the heap and retries >> the allocation. If the second allocation attempt fails then allocation >> failure is reported to the caller, which can lead to a full GC. But the >> retried allocation could fail because, after expansion, some other thread >> allocated enough of the available space that the retry fails. This can >> happen even though there is plenty of space available, if only that retry >> were to perform another expansion. >> >> Rather than trying to combine the allocation retry with the expansion (it's >> not clear there's a way to do so without breaking invariants), we instead >> simply loop on the allocation attempt + expand, until either the allocation >> succeeds or the expand fails. If some other thread "steals" space from the >> expanding thread and causes its next allocation attempt to fail and do >> another expansion, that's functionally no different from the expanding >> thread succeeding and causing the other thread to fail allocation and do the >> expand instead. >> >> This change includes modifying PSOldGen::expand_to_reserved to return false >> when there is no space available, where it previously returned true. It's >> not clear why it returned true; that seems wrong, but was harmless. But it >> must not do so with the new looping behavior for allocation, else it would >> never terminate. >> >> Testing: >> mach5 tier1-3, tier5 (tier2-3, 5 do a lot of ParallelGC testing) > > Kim Barrett has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains five additional commits since the last revision: > > - Merge branch 'master' into retry_alloc > - avoid expand storms > - Merge branch 'master' into retry_alloc > - require non-zero expand size > - retry failed allocation if expand succeeds Marked as reviewed by sjohanss (Reviewer). 
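For readers skimming the thread, the retry shape described in the summary above boils down to the following loop. This is only a standalone sketch of the control flow: try_allocate() and expand_heap() are hypothetical stand-ins, not the actual PSOldGen methods, and locking/invariant details are omitted.

#include <cstddef>

// Hypothetical stand-ins for the real allocation and expansion routines.
void* try_allocate(std::size_t word_size);
bool  expand_heap(std::size_t word_size);

// Loop on "attempt allocation, then expand" until the allocation succeeds
// or expansion itself fails.
void* allocate_with_expansion(std::size_t word_size) {
  while (true) {
    void* result = try_allocate(word_size);
    if (result != nullptr) {
      return result;
    }
    if (!expand_heap(word_size)) {
      return nullptr;  // no room left to expand: report allocation failure
    }
    // Another thread may have consumed the space we just expanded;
    // that is fine, loop and try again.
  }
}

If another thread "steals" the freshly expanded space, this thread simply expands again on the next iteration, which is functionally the same as the other thread having performed the expansion itself.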
------------- PR: https://git.openjdk.java.net/jdk/pull/2309 From sjohanss at openjdk.java.net Tue Feb 9 10:27:30 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Tue, 9 Feb 2021 10:27:30 GMT Subject: RFR: 8261356: Clean up enum G1Mark In-Reply-To: References: Message-ID: On Mon, 8 Feb 2021 18:41:13 GMT, Albert Mingkun Yang wrote: > After removing the effectively dead entry in `G1Mark`, the whole enum could be turned into a bool. The call-chain is updated and existing comments are revised. Nice cleanup. Just a minor nit :) src/hotspot/share/gc/g1/g1RootClosures.cpp line 53: > 51: // The treatment of "weak" roots is selectable through the template parameter, > 52: // this is usually used to control unloading of classes and interned strings. > 53: template I think it could make sense to name this parameter `should_mark_weak` to be more consistent. But if you don't agree just leave it. ------------- Marked as reviewed by sjohanss (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2461 From iwalulya at openjdk.java.net Tue Feb 9 11:03:43 2021 From: iwalulya at openjdk.java.net (Ivan Walulya) Date: Tue, 9 Feb 2021 11:03:43 GMT Subject: RFR: 8260044: Parallel GC: Concurrent allocation after heap expansion may cause unnecessary full gc [v3] In-Reply-To: References: Message-ID: On Mon, 8 Feb 2021 23:46:01 GMT, Kim Barrett wrote: >> Please review this change to ParallelGC to avoid unnecessary full GCs when >> concurrent threads attempt oldgen allocations during evacuation. >> >> When a GC thread fails an oldgen allocation it expands the heap and retries >> the allocation. If the second allocation attempt fails then allocation >> failure is reported to the caller, which can lead to a full GC. But the >> retried allocation could fail because, after expansion, some other thread >> allocated enough of the available space that the retry fails. This can >> happen even though there is plenty of space available, if only that retry >> were to perform another expansion. >> >> Rather than trying to combine the allocation retry with the expansion (it's >> not clear there's a way to do so without breaking invariants), we instead >> simply loop on the allocation attempt + expand, until either the allocation >> succeeds or the expand fails. If some other thread "steals" space from the >> expanding thread and causes its next allocation attempt to fail and do >> another expansion, that's functionally no different from the expanding >> thread succeeding and causing the other thread to fail allocation and do the >> expand instead. >> >> This change includes modifying PSOldGen::expand_to_reserved to return false >> when there is no space available, where it previously returned true. It's >> not clear why it returned true; that seems wrong, but was harmless. But it >> must not do so with the new looping behavior for allocation, else it would >> never terminate. >> >> Testing: >> mach5 tier1-3, tier5 (tier2-3, 5 do a lot of ParallelGC testing) > > Kim Barrett has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains five additional commits since the last revision: > > - Merge branch 'master' into retry_alloc > - avoid expand storms > - Merge branch 'master' into retry_alloc > - require non-zero expand size > - retry failed allocation if expand succeeds Marked as reviewed by iwalulya (Committer). 
------------- PR: https://git.openjdk.java.net/jdk/pull/2309 From rkennke at openjdk.java.net Tue Feb 9 12:03:46 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Tue, 9 Feb 2021 12:03:46 GMT Subject: RFR: 8261413: Shenandoah: Disable class-unloading in I-U mode Message-ID: JDK-8261341 describes a serious problem with I-U mode and class-unloading. Let's disable class-unloading in I-U for now as a workaround. Testing: - [ ] hotspot_gc_shenandoah - [ ] tier1 (+UseShenandoahGC +IU) - [x] runtime/CreateMirror/ArraysNewInstanceBug.java (+UseShenandoahGC +IU +aggressive) many times in a row w/o failure ------------- Commit messages: - 8261413: Shenandoah: Disable class-unloading in I-U mode Changes: https://git.openjdk.java.net/jdk/pull/2477/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2477&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261413 Stats: 6 lines in 1 file changed: 6 ins; 0 del; 0 mod Patch: https://git.openjdk.java.net/jdk/pull/2477.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2477/head:pull/2477 PR: https://git.openjdk.java.net/jdk/pull/2477 From shade at openjdk.java.net Tue Feb 9 12:41:31 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Tue, 9 Feb 2021 12:41:31 GMT Subject: RFR: 8261413: Shenandoah: Disable class-unloading in I-U mode In-Reply-To: References: Message-ID: <8FGC9ajeya9Jh4D4rL7W6kUqiV1m8TBdHvms8JOBLkU=.42a84b4b-08e8-4e6c-8dce-e9660f4506b7@github.com> On Tue, 9 Feb 2021 11:58:58 GMT, Roman Kennke wrote: > JDK-8261341 describes a serious problem with I-U mode and class-unloading. Let's disable class-unloading in I-U for now as a workaround. > > Testing: > - [ ] hotspot_gc_shenandoah > - [ ] tier1 (+UseShenandoahGC +IU) > - [x] runtime/CreateMirror/ArraysNewInstanceBug.java (+UseShenandoahGC +IU +aggressive) many times in a row w/o failure Changes requested by shade (Reviewer). src/hotspot/share/gc/shenandoah/mode/shenandoahIUMode.cpp line 37: > 35: > 36: void ShenandoahIUMode::initialize_flags() const { > 37: // See: https://bugs.openjdk.java.net/browse/JDK-8261341 No need for this comment, as the message prints it out. src/hotspot/share/gc/shenandoah/mode/shenandoahIUMode.cpp line 39: > 37: // See: https://bugs.openjdk.java.net/browse/JDK-8261341 > 38: if (FLAG_IS_CMDLINE(ClassUnloading) && ClassUnloading) { > 39: log_warning(gc)("Shenandoah I-U mode forces -XX:-ClassUnloading, for decails, see https://bugs.openjdk.java.net/browse/JDK-8261341"); "Shenandoah I-U mode sets -XX:-ClassUnloading; see JDK-8261341 for details" ------------- PR: https://git.openjdk.java.net/jdk/pull/2477 From ayang at openjdk.java.net Tue Feb 9 12:45:51 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Tue, 9 Feb 2021 12:45:51 GMT Subject: RFR: 8261356: Clean up enum G1Mark [v2] In-Reply-To: References: Message-ID: > After removing the effectively dead entry in `G1Mark`, the whole enum could be turned into a bool. The call-chain is updated and existing comments are revised. 
Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: review ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2461/files - new: https://git.openjdk.java.net/jdk/pull/2461/files/946660de..1a17ae19 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2461&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2461&range=00-01 Stats: 2 lines in 1 file changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.java.net/jdk/pull/2461.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2461/head:pull/2461 PR: https://git.openjdk.java.net/jdk/pull/2461 From ayang at openjdk.java.net Tue Feb 9 12:45:52 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Tue, 9 Feb 2021 12:45:52 GMT Subject: RFR: 8261356: Clean up enum G1Mark [v2] In-Reply-To: References: Message-ID: On Tue, 9 Feb 2021 10:18:17 GMT, Stefan Johansson wrote: >> Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: >> >> review > > src/hotspot/share/gc/g1/g1RootClosures.cpp line 53: > >> 51: // The treatment of "weak" roots is selectable through the template parameter, >> 52: // this is usually used to control unloading of classes and interned strings. >> 53: template > > I think it could make sense to name this parameter `should_mark_weak` to be more consistent. But if you don't agree just leave it. Fixed. ------------- PR: https://git.openjdk.java.net/jdk/pull/2461 From sjohanss at openjdk.java.net Tue Feb 9 12:58:32 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Tue, 9 Feb 2021 12:58:32 GMT Subject: RFR: 8261356: Clean up enum G1Mark [v2] In-Reply-To: References: Message-ID: On Tue, 9 Feb 2021 12:45:51 GMT, Albert Mingkun Yang wrote: >> After removing the effectively dead entry in `G1Mark`, the whole enum could be turned into a bool. The call-chain is updated and existing comments are revised. > > Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: > > review src/hotspot/share/gc/g1/g1RootClosures.cpp line 56: > 54: class G1ConcurrentStartMarkClosures : public G1EvacuationRootClosures { > 55: G1SharedClosures _strong; > 56: G1SharedClosures _weak; Sorry for being picky, but now the alignment of the variables are off. I would prefer: Suggestion: G1SharedClosures _strong; G1SharedClosures _weak; ------------- PR: https://git.openjdk.java.net/jdk/pull/2461 From rkennke at openjdk.java.net Tue Feb 9 13:13:43 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Tue, 9 Feb 2021 13:13:43 GMT Subject: RFR: 8261413: Shenandoah: Disable class-unloading in I-U mode [v2] In-Reply-To: References: Message-ID: > JDK-8261341 describes a serious problem with I-U mode and class-unloading. Let's disable class-unloading in I-U for now as a workaround. 
> > Testing: > - [ ] hotspot_gc_shenandoah > - [ ] tier1 (+UseShenandoahGC +IU) > - [x] runtime/CreateMirror/ArraysNewInstanceBug.java (+UseShenandoahGC +IU +aggressive) many times in a row w/o failure Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: Some comment and output changes as requested by Aleksey ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2477/files - new: https://git.openjdk.java.net/jdk/pull/2477/files/491c41c1..e3c1b459 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2477&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2477&range=00-01 Stats: 2 lines in 1 file changed: 0 ins; 1 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2477.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2477/head:pull/2477 PR: https://git.openjdk.java.net/jdk/pull/2477 From shade at openjdk.java.net Tue Feb 9 13:13:43 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Tue, 9 Feb 2021 13:13:43 GMT Subject: RFR: 8261413: Shenandoah: Disable class-unloading in I-U mode [v2] In-Reply-To: References: Message-ID: On Tue, 9 Feb 2021 13:10:55 GMT, Roman Kennke wrote: >> JDK-8261341 describes a serious problem with I-U mode and class-unloading. Let's disable class-unloading in I-U for now as a workaround. >> >> Testing: >> - [ ] hotspot_gc_shenandoah >> - [ ] tier1 (+UseShenandoahGC +IU) >> - [x] runtime/CreateMirror/ArraysNewInstanceBug.java (+UseShenandoahGC +IU +aggressive) many times in a row w/o failure > > Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: > > Some comment and output changes as requested by Aleksey Marked as reviewed by shade (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2477 From zgu at openjdk.java.net Tue Feb 9 13:21:31 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Tue, 9 Feb 2021 13:21:31 GMT Subject: RFR: 8261413: Shenandoah: Disable class-unloading in I-U mode [v2] In-Reply-To: References: Message-ID: On Tue, 9 Feb 2021 13:13:43 GMT, Roman Kennke wrote: >> JDK-8261341 describes a serious problem with I-U mode and class-unloading. Let's disable class-unloading in I-U for now as a workaround. >> >> Testing: >> - [ ] hotspot_gc_shenandoah >> - [ ] tier1 (+UseShenandoahGC +IU) >> - [x] runtime/CreateMirror/ArraysNewInstanceBug.java (+UseShenandoahGC +IU +aggressive) many times in a row w/o failure > > Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: > > Some comment and output changes as requested by Aleksey Marked as reviewed by zgu (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2477 From tschatzl at openjdk.java.net Tue Feb 9 13:30:11 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Tue, 9 Feb 2021 13:30:11 GMT Subject: RFR: 8261356: Clean up enum G1Mark [v2] In-Reply-To: References: Message-ID: On Tue, 9 Feb 2021 12:45:51 GMT, Albert Mingkun Yang wrote: >> After removing the effectively dead entry in `G1Mark`, the whole enum could be turned into a bool. The call-chain is updated and existing comments are revised. > > Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: > > review Lgtm after fixing that indentation issue. ------------- Marked as reviewed by tschatzl (Reviewer). 
PR: https://git.openjdk.java.net/jdk/pull/2461 From ayang at openjdk.java.net Tue Feb 9 13:36:58 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Tue, 9 Feb 2021 13:36:58 GMT Subject: RFR: 8261356: Clean up enum G1Mark [v3] In-Reply-To: References: Message-ID: <2Yhz6lRmcuo078pqFPOpUCGEMWTuXtVD0Eae5Rdxfto=.4879e088-71be-49e5-89b0-2364c6e42d2d@github.com> > After removing the effectively dead entry in `G1Mark`, the whole enum could be turned into a bool. The call-chain is updated and existing comments are revised. Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: Update src/hotspot/share/gc/g1/g1RootClosures.cpp Co-authored-by: Stefan Johansson <54407259+kstefanj at users.noreply.github.com> ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2461/files - new: https://git.openjdk.java.net/jdk/pull/2461/files/1a17ae19..bc69d56f Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2461&range=02 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2461&range=01-02 Stats: 2 lines in 1 file changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.java.net/jdk/pull/2461.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2461/head:pull/2461 PR: https://git.openjdk.java.net/jdk/pull/2461 From ayang at openjdk.java.net Tue Feb 9 13:39:40 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Tue, 9 Feb 2021 13:39:40 GMT Subject: RFR: 8261356: Clean up enum G1Mark [v2] In-Reply-To: References: Message-ID: On Tue, 9 Feb 2021 13:27:48 GMT, Thomas Schatzl wrote: >> Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: >> >> review > > Lgtm after fixing that indentation issue. Thank you for the review. PS: Stefan's suggestion comes in a commit-table form that github just offers me a button to click, saving me from edit/commit/push. This is very neat. ------------- PR: https://git.openjdk.java.net/jdk/pull/2461 From rkennke at redhat.com Tue Feb 9 13:54:45 2021 From: rkennke at redhat.com (Roman Kennke) Date: Tue, 9 Feb 2021 14:54:45 +0100 Subject: Conflicting use of StackWatermark in StackWalker vs GC? Message-ID: Hello all, When running StackWalker tests with 'aggressive' Shenandoah mode (i.e. run GCs all the time, even if there is no work), then I observe crashes like this: # Internal Error (/home/rkennke/src/openjdk/jdk/src/hotspot/share/runtime/stackWatermark.cpp:178), pid=549168, tid=549230 # assert(is_frame_safe(f)) failed: Frame must be safe Full hs_err: http://cr.openjdk.java.net/~rkennke/hs_err_pid549168.log I strongly suspect that this is happening because of StackWalker's use of StackWatermark which conflicts with the GC's own use of StackWalker. IOW, it asserts that the frame has been processed, but the GC is still on it. Are we missing some coordination between StackWalker and the GC here? It can be reproduced using: CONF=linux-x86_64-server-fastdebug make run-test TEST=java/lang/StackWalker TEST_VM_OPTS="-XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC -XX:ShenandoahGCHeuristics=aggressive" Thanks, Roman From rkennke at redhat.com Tue Feb 9 14:08:53 2021 From: rkennke at redhat.com (Roman Kennke) Date: Tue, 9 Feb 2021 15:08:53 +0100 Subject: Conflicting use of StackWatermark in StackWalker vs GC? 
In-Reply-To: References: Message-ID: I am getting the same failure with ZGC: CONF=linux-x86_64-server-fastdebug make run-test TEST=java/lang/StackWalker TEST_VM_OPTS="-XX:+UnlockExperimentalVMOptions -XX:+UseZGC -XX:ZCollectionInterval=0.01" > Hello all, > > When running StackWalker tests with 'aggressive' Shenandoah mode (i.e. > run GCs all the time, even if there is no work), then I observe crashes > like this: > > #? Internal Error > (/home/rkennke/src/openjdk/jdk/src/hotspot/share/runtime/stackWatermark.cpp:178), > pid=549168, tid=549230 > #? assert(is_frame_safe(f)) failed: Frame must be safe > > Full hs_err: > http://cr.openjdk.java.net/~rkennke/hs_err_pid549168.log > > I strongly suspect that this is happening because of StackWalker's use > of StackWatermark which conflicts with the GC's own use of StackWalker. > IOW, it asserts that the frame has been processed, but the GC is still > on it. > > Are we missing some coordination between StackWalker and the GC here? > > It can be reproduced using: > CONF=linux-x86_64-server-fastdebug make run-test > TEST=java/lang/StackWalker > TEST_VM_OPTS="-XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC > -XX:ShenandoahGCHeuristics=aggressive" > > Thanks, > Roman From stefan.karlsson at oracle.com Tue Feb 9 14:45:52 2021 From: stefan.karlsson at oracle.com (Stefan Karlsson) Date: Tue, 9 Feb 2021 15:45:52 +0100 Subject: Conflicting use of StackWatermark in StackWalker vs GC? In-Reply-To: References: Message-ID: It's interesting that fetchNextBatch process the entire stack in preparation for filling in the information about the frames: ??? // If we have to get back here for even more frames, then 1) the user did not supply ??? // an accurate hint suggesting the depth of the stack walk, and 2) we are not just ??? // peeking? at a few frames. Take the cost of flushing out any pending deferred GC ??? // processing of the stack. ??? StackWatermarkSet::finish_processing(jt, NULL /* context */, StackWatermarkKind::gc); but further down in fill_in_frames => LiveFrameStream::fill_frame => fill_live_stackframe, we perform object allocation, which could safepoint for a GC that would reset the watermark. After leaving that safepoint we will have processed the top-most frames, but we won't have processed down the the current frame the StackWalker is looking at. This is my guess of what's happening, but I haven't been able to reproduce the problem, so it's a bit hard to verify that this is what's happening. StefanK On 2021-02-09 15:08, Roman Kennke wrote: > I am getting the same failure with ZGC: > > CONF=linux-x86_64-server-fastdebug make run-test > TEST=java/lang/StackWalker > TEST_VM_OPTS="-XX:+UnlockExperimentalVMOptions -XX:+UseZGC > -XX:ZCollectionInterval=0.01" > > >> Hello all, >> >> When running StackWalker tests with 'aggressive' Shenandoah mode >> (i.e. run GCs all the time, even if there is no work), then I observe >> crashes like this: >> >> #? Internal Error >> (/home/rkennke/src/openjdk/jdk/src/hotspot/share/runtime/stackWatermark.cpp:178), >> pid=549168, tid=549230 >> #? assert(is_frame_safe(f)) failed: Frame must be safe >> >> Full hs_err: >> http://cr.openjdk.java.net/~rkennke/hs_err_pid549168.log >> >> I strongly suspect that this is happening because of StackWalker's >> use of StackWatermark which conflicts with the GC's own use of >> StackWalker. IOW, it asserts that the frame has been processed, but >> the GC is still on it. >> >> Are we missing some coordination between StackWalker and the GC here? 
>> >> It can be reproduced using: >> CONF=linux-x86_64-server-fastdebug make run-test >> TEST=java/lang/StackWalker >> TEST_VM_OPTS="-XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC >> -XX:ShenandoahGCHeuristics=aggressive" >> >> Thanks, >> Roman > From rkennke at redhat.com Tue Feb 9 15:08:24 2021 From: rkennke at redhat.com (Roman Kennke) Date: Tue, 9 Feb 2021 16:08:24 +0100 Subject: Conflicting use of StackWatermark in StackWalker vs GC? In-Reply-To: References: Message-ID: Hi Stefan, > It's interesting that fetchNextBatch process the entire stack in > preparation for filling in the information about the frames: > > ??? // If we have to get back here for even more frames, then 1) the > user did not supply > ??? // an accurate hint suggesting the depth of the stack walk, and 2) > we are not just > ??? // peeking? at a few frames. Take the cost of flushing out any > pending deferred GC > ??? // processing of the stack. > ??? StackWatermarkSet::finish_processing(jt, NULL /* context */, > StackWatermarkKind::gc); > > but further down in fill_in_frames => LiveFrameStream::fill_frame => > fill_live_stackframe, we perform object allocation, which could > safepoint for a GC that would reset the watermark. After leaving that > safepoint we will have processed the top-most frames, but we won't have > processed down the the current frame the StackWalker is looking at. This > is my guess of what's happening, but I haven't been able to reproduce > the problem, so it's a bit hard to verify that this is what's happening. That sounds plausible. What would be a way out of this? Scan the stack and collect all relevant information without allocating any Java objects yet, and fill in the Java frames array after the stack scan, maybe? Roman > StefanK > > On 2021-02-09 15:08, Roman Kennke wrote: >> I am getting the same failure with ZGC: >> >> CONF=linux-x86_64-server-fastdebug make run-test >> TEST=java/lang/StackWalker >> TEST_VM_OPTS="-XX:+UnlockExperimentalVMOptions -XX:+UseZGC >> -XX:ZCollectionInterval=0.01" >> >> >>> Hello all, >>> >>> When running StackWalker tests with 'aggressive' Shenandoah mode >>> (i.e. run GCs all the time, even if there is no work), then I observe >>> crashes like this: >>> >>> #? Internal Error >>> (/home/rkennke/src/openjdk/jdk/src/hotspot/share/runtime/stackWatermark.cpp:178), >>> pid=549168, tid=549230 >>> #? assert(is_frame_safe(f)) failed: Frame must be safe >>> >>> Full hs_err: >>> http://cr.openjdk.java.net/~rkennke/hs_err_pid549168.log >>> >>> I strongly suspect that this is happening because of StackWalker's >>> use of StackWatermark which conflicts with the GC's own use of >>> StackWalker. IOW, it asserts that the frame has been processed, but >>> the GC is still on it. >>> >>> Are we missing some coordination between StackWalker and the GC here? >>> >>> It can be reproduced using: >>> CONF=linux-x86_64-server-fastdebug make run-test >>> TEST=java/lang/StackWalker >>> TEST_VM_OPTS="-XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC >>> -XX:ShenandoahGCHeuristics=aggressive" >>> >>> Thanks, >>> Roman >> > From sjohanss at openjdk.java.net Tue Feb 9 15:28:38 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Tue, 9 Feb 2021 15:28:38 GMT Subject: RFR: 8261356: Clean up enum G1Mark [v2] In-Reply-To: References: Message-ID: <2_EEtW9nNiIT_VYSfXDE5UgLy_CsiKT_KomO15ZhX2U=.d79bb726-5267-4710-95bb-abe8cfe499c2@github.com> On Tue, 9 Feb 2021 13:37:07 GMT, Albert Mingkun Yang wrote: > Thank you for the review. 
> > PS: Stefan's suggestion comes in a commit-table form that github just offers me a button to click, saving me from edit/commit/push. This is very neat. I hoped you would make use of that feature =) It really is neat ? Perfect for those small things at the end of a review. ------------- PR: https://git.openjdk.java.net/jdk/pull/2461 From rkennke at redhat.com Tue Feb 9 15:30:23 2021 From: rkennke at redhat.com (Roman Kennke) Date: Tue, 9 Feb 2021 16:30:23 +0100 Subject: Conflicting use of StackWatermark in StackWalker vs GC? In-Reply-To: References: Message-ID: Tracking this here: https://bugs.openjdk.java.net/browse/JDK-8261448 Roman > Hi Stefan, > >> It's interesting that fetchNextBatch process the entire stack in >> preparation for filling in the information about the frames: >> >> ???? // If we have to get back here for even more frames, then 1) the >> user did not supply >> ???? // an accurate hint suggesting the depth of the stack walk, and >> 2) we are not just >> ???? // peeking? at a few frames. Take the cost of flushing out any >> pending deferred GC >> ???? // processing of the stack. >> ???? StackWatermarkSet::finish_processing(jt, NULL /* context */, >> StackWatermarkKind::gc); >> >> but further down in fill_in_frames => LiveFrameStream::fill_frame => >> fill_live_stackframe, we perform object allocation, which could >> safepoint for a GC that would reset the watermark. After leaving that >> safepoint we will have processed the top-most frames, but we won't >> have processed down the the current frame the StackWalker is looking >> at. This is my guess of what's happening, but I haven't been able to >> reproduce the problem, so it's a bit hard to verify that this is >> what's happening. > > That sounds plausible. > > What would be a way out of this? Scan the stack and collect all relevant > information without allocating any Java objects yet, and fill in the > Java frames array after the stack scan, maybe? > > Roman > > >> StefanK >> >> On 2021-02-09 15:08, Roman Kennke wrote: >>> I am getting the same failure with ZGC: >>> >>> CONF=linux-x86_64-server-fastdebug make run-test >>> TEST=java/lang/StackWalker >>> TEST_VM_OPTS="-XX:+UnlockExperimentalVMOptions -XX:+UseZGC >>> -XX:ZCollectionInterval=0.01" >>> >>> >>>> Hello all, >>>> >>>> When running StackWalker tests with 'aggressive' Shenandoah mode >>>> (i.e. run GCs all the time, even if there is no work), then I >>>> observe crashes like this: >>>> >>>> #? Internal Error >>>> (/home/rkennke/src/openjdk/jdk/src/hotspot/share/runtime/stackWatermark.cpp:178), >>>> pid=549168, tid=549230 >>>> #? assert(is_frame_safe(f)) failed: Frame must be safe >>>> >>>> Full hs_err: >>>> http://cr.openjdk.java.net/~rkennke/hs_err_pid549168.log >>>> >>>> I strongly suspect that this is happening because of StackWalker's >>>> use of StackWatermark which conflicts with the GC's own use of >>>> StackWalker. IOW, it asserts that the frame has been processed, but >>>> the GC is still on it. >>>> >>>> Are we missing some coordination between StackWalker and the GC here? 
>>>> >>>> It can be reproduced using: >>>> CONF=linux-x86_64-server-fastdebug make run-test >>>> TEST=java/lang/StackWalker >>>> TEST_VM_OPTS="-XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC >>>> -XX:ShenandoahGCHeuristics=aggressive" >>>> >>>> Thanks, >>>> Roman >>> >> From ayang at openjdk.java.net Tue Feb 9 16:13:35 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Tue, 9 Feb 2021 16:13:35 GMT Subject: RFR: 8261356: Clean up enum G1Mark [v2] In-Reply-To: <2_EEtW9nNiIT_VYSfXDE5UgLy_CsiKT_KomO15ZhX2U=.d79bb726-5267-4710-95bb-abe8cfe499c2@github.com> References: <2_EEtW9nNiIT_VYSfXDE5UgLy_CsiKT_KomO15ZhX2U=.d79bb726-5267-4710-95bb-abe8cfe499c2@github.com> Message-ID: On Tue, 9 Feb 2021 15:26:19 GMT, Stefan Johansson wrote: > I hoped you would make use of that feature I didn't know that feature before. Would try using it in future reviews. The CI was green before the last commit (rename only), so I am integrating it without re-running the CI. ------------- PR: https://git.openjdk.java.net/jdk/pull/2461 From jaroslav.bachorik at datadoghq.com Tue Feb 9 16:31:11 2021 From: jaroslav.bachorik at datadoghq.com (=?UTF-8?Q?Jaroslav_Bachor=C3=ADk?=) Date: Tue, 9 Feb 2021 17:31:11 +0100 Subject: Can GC implementations provide a cheap estimation of live set size? Message-ID: Hello, In connection with https://bugs.openjdk.java.net/browse/JDK-8258431 I am trying to figure out whether providing a cheap estimation of live set size is something actually achievable across various GC implementations. What I am looking at is piggy-backing on a concurrent mark task to get the summary size of live objects - using the 'straight-forward' heap-inspection like approach is prohibitively expensive. Thanks and regards, -JB- From ayang at openjdk.java.net Tue Feb 9 17:43:39 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Tue, 9 Feb 2021 17:43:39 GMT Subject: Integrated: 8261356: Clean up enum G1Mark In-Reply-To: References: Message-ID: On Mon, 8 Feb 2021 18:41:13 GMT, Albert Mingkun Yang wrote: > After removing the effectively dead entry in `G1Mark`, the whole enum could be turned into a bool. The call-chain is updated and existing comments are revised. This pull request has now been integrated. Changeset: a00b1305 Author: Albert Mingkun Yang Committer: Stefan Johansson URL: https://git.openjdk.java.net/jdk/commit/a00b1305 Stats: 29 lines in 4 files changed: 1 ins; 11 del; 17 mod 8261356: Clean up enum G1Mark Reviewed-by: sjohanss, tschatzl ------------- PR: https://git.openjdk.java.net/jdk/pull/2461 From sjohanss at openjdk.java.net Tue Feb 9 19:47:49 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Tue, 9 Feb 2021 19:47:49 GMT Subject: RFR: 8261230: GC tracing of page sizes are wrong in a few places Message-ID: The usage of `os::trace_page_sizes()` and friends are wrongly assuming that we always get the page size requested and needs to be updated. This is done by using the helper `ReservedSpace::actual_reserved_page_size()` instead of blindly trusting we get what we ask for. I have plans for the future to get rid of this helper and instead record the page size used in the `ReservedSpace`, but for now the helper is good enough. In G1 we used the helper but switched the order of the page size and the alignment parameter, which in turn helped the test to pass since the alignment will match the page size we expect in the test. The test had to be improved to recognize mapping failures. 
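Roughly, the shape of the fix is to feed the tracing code the page size the reservation actually ended up with rather than the size that was requested. The sketch below uses stand-in declarations; the names follow the description above, but the types and signatures are illustrative and not copied from the real HotSpot headers.

#include <cstddef>

// Stand-ins for the pieces named above; illustrative only.
struct ReservedSpaceLike { char* base; std::size_t size; };
std::size_t actual_reserved_page_size(const ReservedSpaceLike& rs);  // what we actually got
void trace_pages(const char* what, std::size_t page_size, const ReservedSpaceLike& rs);

void trace_heap_reservation(const ReservedSpaceLike& rs, std::size_t requested_page_size) {
  // The bug: the requested page size was passed to the tracing call, assuming the
  // request is always honored. The fix: ask the reservation what it really uses.
  (void)requested_page_size;
  trace_pages("Heap", actual_reserved_page_size(rs), rs);
}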
------------- Commit messages: - 8261230-test-fix - 8261230: GC tracing of page sizes are wrong in a few places Changes: https://git.openjdk.java.net/jdk/pull/2486/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2486&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261230 Stats: 33 lines in 4 files changed: 26 ins; 1 del; 6 mod Patch: https://git.openjdk.java.net/jdk/pull/2486.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2486/head:pull/2486 PR: https://git.openjdk.java.net/jdk/pull/2486 From stefan.karlsson at oracle.com Tue Feb 9 22:44:24 2021 From: stefan.karlsson at oracle.com (Stefan Karlsson) Date: Tue, 9 Feb 2021 23:44:24 +0100 Subject: Conflicting use of StackWatermark in StackWalker vs GC? In-Reply-To: References: Message-ID: <89b7c7e8-ca73-b4b3-ecde-a084a27645ec@oracle.com> On 2021-02-09 16:08, Roman Kennke wrote: > Hi Stefan, > >> It's interesting that fetchNextBatch process the entire stack in >> preparation for filling in the information about the frames: >> >> ???? // If we have to get back here for even more frames, then 1) the >> user did not supply >> ???? // an accurate hint suggesting the depth of the stack walk, and >> 2) we are not just >> ???? // peeking? at a few frames. Take the cost of flushing out any >> pending deferred GC >> ???? // processing of the stack. >> ???? StackWatermarkSet::finish_processing(jt, NULL /* context */, >> StackWatermarkKind::gc); >> >> but further down in fill_in_frames => LiveFrameStream::fill_frame => >> fill_live_stackframe, we perform object allocation, which could >> safepoint for a GC that would reset the watermark. After leaving that >> safepoint we will have processed the top-most frames, but we won't >> have processed down the the current frame the StackWalker is looking >> at. This is my guess of what's happening, but I haven't been able to >> reproduce the problem, so it's a bit hard to verify that this is >> what's happening. > > That sounds plausible. > > What would be a way out of this? Scan the stack and collect all > relevant information without allocating any Java objects yet, and fill > in the Java frames array after the stack scan, maybe? We have a way to deal with similar situations: // Use this class to mark a remote thread you are currently interested // in examining the entire stack, without it slipping into an unprocessed // state at safepoint polls. class KeepStackGCProcessedMark : public StackObj { It installs a link to the other thread, and whenever we hit a safepoint that entire stack is processed. See: void StackWatermark::on_safepoint() { ? start_processing(); ? StackWatermark* linked_watermark = _linked_watermark; ? if (linked_watermark != NULL) { ??? linked_watermark->finish_processing(NULL /* context */); ? } } KeepStackGCProcessedMark isn't reentrant, so we would have to watch out for that. StefanK > > Roman > > >> StefanK >> >> On 2021-02-09 15:08, Roman Kennke wrote: >>> I am getting the same failure with ZGC: >>> >>> CONF=linux-x86_64-server-fastdebug make run-test >>> TEST=java/lang/StackWalker >>> TEST_VM_OPTS="-XX:+UnlockExperimentalVMOptions -XX:+UseZGC >>> -XX:ZCollectionInterval=0.01" >>> >>> >>>> Hello all, >>>> >>>> When running StackWalker tests with 'aggressive' Shenandoah mode >>>> (i.e. run GCs all the time, even if there is no work), then I >>>> observe crashes like this: >>>> >>>> #? Internal Error >>>> (/home/rkennke/src/openjdk/jdk/src/hotspot/share/runtime/stackWatermark.cpp:178), >>>> pid=549168, tid=549230 >>>> #? 
assert(is_frame_safe(f)) failed: Frame must be safe >>>> >>>> Full hs_err: >>>> http://cr.openjdk.java.net/~rkennke/hs_err_pid549168.log >>>> >>>> I strongly suspect that this is happening because of StackWalker's >>>> use of StackWatermark which conflicts with the GC's own use of >>>> StackWalker. IOW, it asserts that the frame has been processed, but >>>> the GC is still on it. >>>> >>>> Are we missing some coordination between StackWalker and the GC here? >>>> >>>> It can be reproduced using: >>>> CONF=linux-x86_64-server-fastdebug make run-test >>>> TEST=java/lang/StackWalker >>>> TEST_VM_OPTS="-XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC >>>> -XX:ShenandoahGCHeuristics=aggressive" >>>> >>>> Thanks, >>>> Roman >>> >> > From rkennke at redhat.com Tue Feb 9 23:23:13 2021 From: rkennke at redhat.com (Roman Kennke) Date: Wed, 10 Feb 2021 00:23:13 +0100 Subject: Conflicting use of StackWatermark in StackWalker vs GC? In-Reply-To: <89b7c7e8-ca73-b4b3-ecde-a084a27645ec@oracle.com> References: <89b7c7e8-ca73-b4b3-ecde-a084a27645ec@oracle.com> Message-ID: <8095f4e3-254d-a953-c4c7-92372adb6c93@redhat.com> >>> It's interesting that fetchNextBatch process the entire stack in >>> preparation for filling in the information about the frames: >>> >>> ???? // If we have to get back here for even more frames, then 1) the >>> user did not supply >>> ???? // an accurate hint suggesting the depth of the stack walk, and >>> 2) we are not just >>> ???? // peeking? at a few frames. Take the cost of flushing out any >>> pending deferred GC >>> ???? // processing of the stack. >>> ???? StackWatermarkSet::finish_processing(jt, NULL /* context */, >>> StackWatermarkKind::gc); >>> >>> but further down in fill_in_frames => LiveFrameStream::fill_frame => >>> fill_live_stackframe, we perform object allocation, which could >>> safepoint for a GC that would reset the watermark. After leaving that >>> safepoint we will have processed the top-most frames, but we won't >>> have processed down the the current frame the StackWalker is looking >>> at. This is my guess of what's happening, but I haven't been able to >>> reproduce the problem, so it's a bit hard to verify that this is >>> what's happening. >> >> That sounds plausible. >> >> What would be a way out of this? Scan the stack and collect all >> relevant information without allocating any Java objects yet, and fill >> in the Java frames array after the stack scan, maybe? > > We have a way to deal with similar situations: > > // Use this class to mark a remote thread you are currently interested > // in examining the entire stack, without it slipping into an unprocessed > // state at safepoint polls. > class KeepStackGCProcessedMark : public StackObj { > > It installs a link to the other thread, and whenever we hit a safepoint > that entire stack is processed. See: > > void StackWatermark::on_safepoint() { > ? start_processing(); > ? StackWatermark* linked_watermark = _linked_watermark; > ? if (linked_watermark != NULL) { > ??? linked_watermark->finish_processing(NULL /* context */); > ? } > } > > KeepStackGCProcessedMark isn't reentrant, so we would have to watch out > for that. Wow, this is very useful! 
I was almost done with separating stack scanning and setting up the Java stack frame info objects, but using the KeepStackGCProcessedMark it is much simpler: This seems to work perfectly fine and fix the bug: https://gist.github.com/rkennke/553b0ac024d6d094ff0784fa56c85fb0 I'll look at it some more and do more testing, and will file a PR (unless you disagree). Thanks! Roman From ioi.lam at oracle.com Wed Feb 10 06:44:46 2021 From: ioi.lam at oracle.com (Ioi Lam) Date: Tue, 9 Feb 2021 22:44:46 -0800 Subject: Atomic operations: your thoughts are welocme In-Reply-To: References: Message-ID: Just curious, which benchmark is this? Thanks - Ioi On 2/8/21 10:14 AM, Andrew Haley wrote: > I've been looking at the hottest Atomic operations in HotSpot, with a view to > finding out if the default memory_order_conservative (which is very expensive > on some architectures) can be weakened to something less. It's impossible to > fix all of them, but perhaps we can fix some of the most frequent. > > These are the hottest compare-and-swap uses in HotSpot, with the count > at the end of each line. > > : :: = 16406757 > > This one is already memory_order_relaxed, so no problem. > > ::Table::oop_oop_iterate(G1CMOopClosure*, oopDesc*, Klass*)+336>: :: = 3903178 > > This is actually MarkBitMap::par_mark calling BitMap::par_set_bit. Does this > need to be memory_order_conservative, or would something weaker do? Even > acq_rel or seq_cst would be better. > > : :: = 2376632 > : :: = 2003895 > > I can't imagine that either of these actually need memory_order_conservative, > they're just reference counts. > > : :: = 1719614 > > BitMap::par_set_bit again. > > , (MEMFLAGS)5>*)+432>: :: = 1617659 > > This one is GenericTaskQueue::pop_global calling cmpxchg_age(). > Again, do we need conservative here? > > There is, I suppose, always a possibility that some code somewhere is taking > advantage of the memory serializing properties of adjusting refcounts, I suppose. > > Thanks, > From stefan.karlsson at oracle.com Wed Feb 10 08:04:23 2021 From: stefan.karlsson at oracle.com (Stefan Karlsson) Date: Wed, 10 Feb 2021 09:04:23 +0100 Subject: Conflicting use of StackWatermark in StackWalker vs GC? In-Reply-To: <8095f4e3-254d-a953-c4c7-92372adb6c93@redhat.com> References: <89b7c7e8-ca73-b4b3-ecde-a084a27645ec@oracle.com> <8095f4e3-254d-a953-c4c7-92372adb6c93@redhat.com> Message-ID: On 2021-02-10 00:23, Roman Kennke wrote: > > >>>> It's interesting that fetchNextBatch process the entire stack in >>>> preparation for filling in the information about the frames: >>>> >>>> ???? // If we have to get back here for even more frames, then 1) >>>> the user did not supply >>>> ???? // an accurate hint suggesting the depth of the stack walk, >>>> and 2) we are not just >>>> ???? // peeking? at a few frames. Take the cost of flushing out any >>>> pending deferred GC >>>> ???? // processing of the stack. >>>> ???? StackWatermarkSet::finish_processing(jt, NULL /* context */, >>>> StackWatermarkKind::gc); >>>> >>>> but further down in fill_in_frames => LiveFrameStream::fill_frame >>>> => fill_live_stackframe, we perform object allocation, which could >>>> safepoint for a GC that would reset the watermark. After leaving >>>> that safepoint we will have processed the top-most frames, but we >>>> won't have processed down the the current frame the StackWalker is >>>> looking at. This is my guess of what's happening, but I haven't >>>> been able to reproduce the problem, so it's a bit hard to verify >>>> that this is what's happening. 
>>> >>> That sounds plausible. >>> >>> What would be a way out of this? Scan the stack and collect all >>> relevant information without allocating any Java objects yet, and >>> fill in the Java frames array after the stack scan, maybe? >> >> We have a way to deal with similar situations: >> >> // Use this class to mark a remote thread you are currently interested >> // in examining the entire stack, without it slipping into an >> unprocessed >> // state at safepoint polls. >> class KeepStackGCProcessedMark : public StackObj { >> >> It installs a link to the other thread, and whenever we hit a >> safepoint that entire stack is processed. See: >> >> void StackWatermark::on_safepoint() { >> ?? start_processing(); >> ?? StackWatermark* linked_watermark = _linked_watermark; >> ?? if (linked_watermark != NULL) { >> ???? linked_watermark->finish_processing(NULL /* context */); >> ?? } >> } >> >> KeepStackGCProcessedMark isn't reentrant, so we would have to watch >> out for that. > > Wow, this is very useful! I was almost done with separating stack > scanning and setting up the Java stack frame info objects, but using > the KeepStackGCProcessedMark it is much simpler: > > This seems to work perfectly fine and fix the bug: > > https://urldefense.com/v3/__https://gist.github.com/rkennke/553b0ac024d6d094ff0784fa56c85fb0__;!!GqivPVa7Brio!JmXVRlaquMtM5x6DZKv2vBX0ldOTtH_YglZnpL0ogEw1DmsUfW9yl1toV-H2Zfro0ZLj$ > > I'll look at it some more and do more testing, and will file a PR > (unless you disagree). Yes, I think that looks good. I haven't looked too closely at StackWalk::fetchFirstBatch, but that one might have to be handled as well. Would be good to get a second opinion from Erik. Thanks, StefanK > > Thanks! > Roman > From rkennke at openjdk.java.net Wed Feb 10 10:12:49 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Wed, 10 Feb 2021 10:12:49 GMT Subject: RFR: 8261448: Preserve GC stack watermark across safepoints in StackWalk Message-ID: I am observing the following assert: # Internal Error (/home/rkennke/src/openjdk/loom/src/hotspot/share/runtime/stackWatermark.cpp:178), pid=54418, tid=54534 # assert(is_frame_safe(f)) failed: Frame must be safe (see issue for full hs_err) In StackWalk::fetchNextBatch() we prepare the entire stack to be processed by calling StackWatermarkSet::finish_processing(jt, NULL, StackWatermarkKind::gc), but then subsequently, during frames scan, perform allocations to fill in the frame information (fill_in_frames => LiveFrameStream::fill_frame => fill_live_stackframe) at where we could safepoint for GC, which could reset the stack watermark. This is only relevant for GCs that use the StackWatermark, e.g. ZGC and Shenandoah at the moment. Solution is to preserve the stack-watermark across safepoints in StackWalk::fetchNextBatch(). StackWalk::fetchFirstBatch() doesn't look to be affected by this: it is not using the stack-watermark. 
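The shape of the change, per the gist linked earlier in the thread, is sketched below. Treat it as an outline rather than the exact patch; in particular, the KeepStackGCProcessedMark constructor is assumed here to take the target JavaThread.

// Sketch of the relevant part of StackWalk::fetchNextBatch().
// Instead of a one-shot:
//   StackWatermarkSet::finish_processing(jt, NULL /* context */, StackWatermarkKind::gc);
// keep the stack processed for the whole batch:
KeepStackGCProcessedMark keep_stack_gc_processed(jt);
// ... fill_in_frames() may allocate and hit safepoints here; the mark keeps
// jt's stack GC-processed across those safepoints, so the frames being
// walked stay safe to touch ...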
Testing: - [x] StackWalk tests with Shenandoah/aggressive - [x] StackWalk tests with ZGC/aggressive - [ ] tier1 (+Shenandoah/ZGC) - [ ] tier2 (+Shenandoah/ZGC) ------------- Commit messages: - 8261448: Preserve GC stack watermark across safepoints in StackWalk Changes: https://git.openjdk.java.net/jdk/pull/2500/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2500&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261448 Stats: 2 lines in 1 file changed: 1 ins; 0 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2500.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2500/head:pull/2500 PR: https://git.openjdk.java.net/jdk/pull/2500 From shade at openjdk.java.net Wed Feb 10 10:13:54 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Wed, 10 Feb 2021 10:13:54 GMT Subject: RFR: 8261493: Shenandoah: reconsider bitmap access memory ordering Message-ID: Shenandoah currently uses its own marking bitmap (added by JDK-8254315). It accesses the marking bitmap with "acquire" for reads and "conservative" for updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. This seems to be excessive for Shenandoah marking bitmap updates, and "release" is enough. I think both are actually excessive for marking bitmap accesses: we do not piggyback object updates on it, the atomics there are only to guarantee the access atomicity and CAS updates to bits. So we might as well use "relaxed" modes for both loads and updates. Sample run with aggressive (back-to-back cycles) on SPECjvm2008:compiler.compiler on AArch64: # Baseline [135.357s][info][gc,stats] Concurrent Marking = 38.795 s (a = 146951 us) (n = 264) (lvls, us = 172, 1719, 150391, 275391, 348305) # Patched [130.475s][info][gc,stats] Concurrent Marking = 34.874 s (a = 120672 us) (n = 289) (lvls, us = 178, 1777, 132812, 222656, 323957) Average time goes down, the number of GC cycles go up, since the cycles are shorter. Additional testing: - [x] Linux x86_64 `hotspot_gc_shenandoah` - [x] Linux AArch64 `hotspot_gc_shenandoah` - [x] Linux AArch64 `tier1` with Shenandoah ------------- Commit messages: - 8261493: Shenandoah: reconsider bitmap access memory ordering Changes: https://git.openjdk.java.net/jdk/pull/2497/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2497&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261493 Stats: 18 lines in 2 files changed: 0 ins; 14 del; 4 mod Patch: https://git.openjdk.java.net/jdk/pull/2497.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2497/head:pull/2497 PR: https://git.openjdk.java.net/jdk/pull/2497 From rkennke at openjdk.java.net Wed Feb 10 10:27:39 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Wed, 10 Feb 2021 10:27:39 GMT Subject: RFR: 8261493: Shenandoah: reconsider bitmap access memory ordering In-Reply-To: References: Message-ID: On Wed, 10 Feb 2021 09:32:18 GMT, Aleksey Shipilev wrote: > Shenandoah currently uses its own marking bitmap (added by JDK-8254315). It accesses the marking bitmap with "acquire" for reads and "conservative" for updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. This seems to be excessive for Shenandoah marking bitmap updates, and "release" is enough. 
> > I think both are actually excessive for marking bitmap accesses: we do not piggyback object updates on it, the atomics there are only to guarantee the access atomicity and CAS updates to bits. So we might as well use "relaxed" modes for both loads and updates. > > Sample run with aggressive (back-to-back cycles) on SPECjvm2008:compiler.compiler on AArch64: > > # Baseline > [135.357s][info][gc,stats] Concurrent Marking = 38.795 s (a = 146951 us) (n = 264) > (lvls, us = 172, 1719, 150391, 275391, 348305) > > # Patched > [130.475s][info][gc,stats] Concurrent Marking = 34.874 s (a = 120672 us) (n = 289) > (lvls, us = 178, 1777, 132812, 222656, 323957) > > Average time goes down, the number of GC cycles go up, since the cycles are shorter. > > Additional testing: > - [x] Linux x86_64 `hotspot_gc_shenandoah` > - [x] Linux AArch64 `hotspot_gc_shenandoah` > - [x] Linux AArch64 `tier1` with Shenandoah Nice improvement! I think that makes sense. Patch looks good to me! ------------- Marked as reviewed by rkennke (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2497 From rkennke at redhat.com Wed Feb 10 10:34:18 2021 From: rkennke at redhat.com (Roman Kennke) Date: Wed, 10 Feb 2021 11:34:18 +0100 Subject: Can GC implementations provide a cheap estimation of live set size? In-Reply-To: References: Message-ID: Hello Jaroslav, > In connection with https://bugs.openjdk.java.net/browse/JDK-8258431 I > am trying to figure out whether providing a cheap estimation of live > set size is something actually achievable across various GC > implementations. > > What I am looking at is piggy-backing on a concurrent mark task to get > the summary size of live objects - using the 'straight-forward' > heap-inspection like approach is prohibitively expensive. In Shenandoah, this information is already collected during concurrent marking. We currently don't print it directly, but we could certainly do that. I'll look into implementing it. I'll also look into exposing liveness info via JMX. I'm not quite sure about G1: that information would only be collected during mixed or full collections. I am not sure if G1 prints it, though. ZGC prints this under -Xlog:gc+heap: [6,502s][info][gc,heap ] GC(0) Mark Start Mark End Relocate Start Relocate End High Low [6,502s][info][gc,heap ] GC(0) Capacity: 834M (10%) 1076M (13%) 1092M (14%) 1092M (14%) 1092M (14%) 834M (10%) [6,502s][info][gc,heap ] GC(0) Free: 7154M (90%) 6912M (87%) 6916M (87%) 7388M (92%) 7388M (92%) 6896M (86%) [6,502s][info][gc,heap ] GC(0) Used: 834M (10%) 1076M (13%) 1072M (13%) 600M (8%) 1092M (14%) 600M (8%) [6,502s][info][gc,heap ] GC(0) Live: - 195M (2%) 195M (2%) 195M (2%) - - [6,502s][info][gc,heap ] GC(0) Allocated: - 242M (3%) 270M (3%) 380M (5%) - - [6,502s][info][gc,heap ] GC(0) Garbage: - 638M (8%) 606M (8%) 24M (0%) - - [6,502s][info][gc,heap ] GC(0) Reclaimed: - - 32M (0%) 614M (8%) - - I hope that is useful? Thanks, Roman From cgo at openjdk.java.net Wed Feb 10 12:17:43 2021 From: cgo at openjdk.java.net (Christoph =?UTF-8?B?R8O2dHRzY2hrZXM=?=) Date: Wed, 10 Feb 2021 12:17:43 GMT Subject: RFR: 8261505: Test test/hotspot/jtreg/gc/parallel/TestDynShrinkHeap.java killed by Linux OOM Killer Message-ID: <3qpvJUngDwJhM4n1g-8LcHrKmxzS45welhYUokK9u9o=.1b0db34b-c388-4bea-9f59-2e82dc4e36ce@github.com> On memory constrained devices, the test might get killed by the linux kernel OOM Killer. Executing the test with the JTreg test harness makes the test fail and get killed by the OOM Killer. 
Executing the test manually, by using the JTreg provided "rerun" command line, the test succeeds. This happened on a Raspberry PI 2, which has only 1G of memory available. I added an "os.maxMemory" requirement, so the test gets skipped. ------------- Commit messages: - Adds os.maxMemory requirement. Changes: https://git.openjdk.java.net/jdk/pull/2507/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2507&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261505 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2507.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2507/head:pull/2507 PR: https://git.openjdk.java.net/jdk/pull/2507 From rkennke at openjdk.java.net Wed Feb 10 12:40:38 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Wed, 10 Feb 2021 12:40:38 GMT Subject: RFR: 8261448: Preserve GC stack watermark across safepoints in StackWalk In-Reply-To: References: Message-ID: <2X3mb-VkqGf_YYSIeb3n9pxXmocT1GkUYDYI_C8cOZo=.3f2fab17-f8f6-4860-a6b4-0a6bb6a1256f@github.com> On Wed, 10 Feb 2021 10:07:20 GMT, Roman Kennke wrote: > I am observing the following assert: > > # Internal Error (/home/rkennke/src/openjdk/loom/src/hotspot/share/runtime/stackWatermark.cpp:178), pid=54418, tid=54534 > # assert(is_frame_safe(f)) failed: Frame must be safe > > (see issue for full hs_err) > > In StackWalk::fetchNextBatch() we prepare the entire stack to be processed by calling StackWatermarkSet::finish_processing(jt, NULL, StackWatermarkKind::gc), but then subsequently, during frames scan, perform allocations to fill in the frame information (fill_in_frames => LiveFrameStream::fill_frame => fill_live_stackframe) at where we could safepoint for GC, which could reset the stack watermark. > > This is only relevant for GCs that use the StackWatermark, e.g. ZGC and Shenandoah at the moment. > > Solution is to preserve the stack-watermark across safepoints in StackWalk::fetchNextBatch(). StackWalk::fetchFirstBatch() doesn't look to be affected by this: it is not using the stack-watermark. > > Testing: > - [x] StackWalk tests with Shenandoah/aggressive > - [x] StackWalk tests with ZGC/aggressive > - [ ] tier1 (+Shenandoah/ZGC) > - [ ] tier2 (+Shenandoah/ZGC) I'm converting back to draft. The Loom tests (test/jdk/java/lang/Continuation/*) are still failing and it looks like fetchFirstBatch() does indeed require treatment, and it's complicated because fetchFirstBatch() may end up calling fetchNextBatch() and the KeepStackGCProcessedMark is not reentrant. ------------- PR: https://git.openjdk.java.net/jdk/pull/2500 From shade at openjdk.java.net Wed Feb 10 12:42:45 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Wed, 10 Feb 2021 12:42:45 GMT Subject: RFR: 8261495: Shenandoah: reconsider update references memory ordering Message-ID: <4RLKvcdaWu0Cu6owC3yGoVY1KVEsYjBZEFJhfdwnhWg=.65fbeae1-58f6-48d3-a2ed-981858ef7da9@github.com> Shenandoah update heap references code uses default Atomic::cmpxchg to avoid races with mutator updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. This seems to be excessive for Shenandoah update references code, and "relaxed" is enough. We do not seem to piggyback on update-references memory effects anywhere (in fact, if not for mutator, we would not even need a CAS). 
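To make the shape of this code concrete, here is a minimal standalone sketch of such a reference update (plain std::atomic with invented names, not the actual patch); the CAS is only there because a mutator may race on the same slot:

```
#include <atomic>
#include <cstdint>

// Toy heap slot holding a (compressed) reference.
using narrow_ref = uint32_t;

// GC-side update: swing the slot from the old location to the new copy,
// unless a mutator has already stored something else there. 'order' is
// the ordering under discussion: conservative today, weaker proposed.
inline void update_ref_slot(std::atomic<narrow_ref>* slot,
                            narrow_ref from, narrow_ref to,
                            std::memory_order order) {
  narrow_ref expected = from;
  // If the CAS fails, a mutator already wrote a newer value and the GC
  // must not overwrite it; no retry is needed.
  slot->compare_exchange_strong(expected, to, order,
                                std::memory_order_relaxed);
}

int main() {
  std::atomic<narrow_ref> slot{0x1000};   // pretend 0x1000 is the old copy
  update_ref_slot(&slot, 0x1000, 0x2000, std::memory_order_relaxed);
  return slot.load(std::memory_order_relaxed) == 0x2000 ? 0 : 1;
}
```

Under the conservative default, a CAS of this shape is bracketed by two-way fences on AArch64/PPC64; the question is whether any of that ordering is actually needed for this path.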
Sample run with aggressive (back-to-back cycles) on SPECjvm2008:compiler.compiler on AArch64: # Baseline [135.065s][info][gc,stats] Concurrent Update Refs = 73.685 s (a = 295924 us) (n = 249) (lvls, us = 354, 3418, 349609, 564453, 715405) # Patched [127.649s][info][gc,stats] Concurrent Update Refs = 54.389 s (a = 169437 us) (n = 321) (lvls, us = 324, 2188, 183594, 322266, 394495) Average time goes down, the number of GC cycles goes up, since the cycles are shorter. Additional testing: - [x] Linux x86_64 hotspot_gc_shenandoah - [x] Linux AArch64 hotspot_gc_shenandoah - [x] Linux AArch64 tier1 with Shenandoah ------------- Commit messages: - 8261495: Shenandoah: reconsider update references memory ordering Changes: https://git.openjdk.java.net/jdk/pull/2498/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2498&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261495 Stats: 15 lines in 5 files changed: 0 ins; 0 del; 15 mod Patch: https://git.openjdk.java.net/jdk/pull/2498.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2498/head:pull/2498 PR: https://git.openjdk.java.net/jdk/pull/2498 From zgu at openjdk.java.net Wed Feb 10 13:11:52 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Wed, 10 Feb 2021 13:11:52 GMT Subject: RFR: 8261473: Shenandoah: Add breakpoint support Message-ID: Please review this patch that adds breakpoint support for Shenandoah, which allows Shenandoah to run a few tests: gc/TestConcurrentGCBreakpoints.java gc/TestJNIWeak/TestJNIWeak.java gc/TestReferenceClearDuringMarking.java gc/TestReferenceClearDuringReferenceProcessing.java gc/TestReferenceRefersTo.java The drawback is that the above tests cannot run in passive mode, which can cause the tests to hang, as breakpoints only apply to concurrent GC. Test: - [x] hotspot_gc_shenandoah - [x] tier1 with Shenandoah ------------- Commit messages: - update - init update Changes: https://git.openjdk.java.net/jdk/pull/2489/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2489&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261473 Stats: 170 lines in 8 files changed: 158 ins; 2 del; 10 mod Patch: https://git.openjdk.java.net/jdk/pull/2489.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2489/head:pull/2489 PR: https://git.openjdk.java.net/jdk/pull/2489 From zgu at openjdk.java.net Wed Feb 10 13:40:39 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Wed, 10 Feb 2021 13:40:39 GMT Subject: RFR: 8261495: Shenandoah: reconsider update references memory ordering In-Reply-To: <4RLKvcdaWu0Cu6owC3yGoVY1KVEsYjBZEFJhfdwnhWg=.65fbeae1-58f6-48d3-a2ed-981858ef7da9@github.com> References: <4RLKvcdaWu0Cu6owC3yGoVY1KVEsYjBZEFJhfdwnhWg=.65fbeae1-58f6-48d3-a2ed-981858ef7da9@github.com> Message-ID: On Wed, 10 Feb 2021 09:52:11 GMT, Aleksey Shipilev wrote: > Shenandoah update heap references code uses default Atomic::cmpxchg to avoid races with mutator updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > This seems to be excessive for Shenandoah update references code, and "relaxed" is enough.
> > Sample run with aggressive (back-to-back cycles) on SPECjvm2008:compiler.compiler on AArch64: > > # Baseline > [135.065s][info][gc,stats] Concurrent Update Refs = 73.685 s (a = 295924 us) (n = 249) > (lvls, us = 354, 3418, 349609, 564453, 715405) > > # Patched > [127.649s][info][gc,stats] Concurrent Update Refs = 54.389 s (a = 169437 us) (n = 321) > (lvls, us = 324, 2188, 183594, 322266, 394495) > > Average time goes down, the number of GC cycles go up, since the cycles are shorter. > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah src/hotspot/share/gc/shenandoah/shenandoahHeap.inline.hpp line 149: > 147: assert(is_aligned(addr, sizeof(narrowOop)), "Address should be aligned: " PTR_FORMAT, p2i(addr)); > 148: narrowOop val = CompressedOops::encode(n); > 149: return CompressedOops::decode(Atomic::cmpxchg(addr, c, val, memory_order_relaxed)); Are you sure it is sufficient? I would think it needs acq/rel pair, otherwise, read side can see incomplete oop ... ------------- PR: https://git.openjdk.java.net/jdk/pull/2498 From shade at openjdk.java.net Wed Feb 10 15:13:47 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Wed, 10 Feb 2021 15:13:47 GMT Subject: RFR: 8261503: Shenandoah: reconsider verifier memory ordering Message-ID: Shenandoah verifier uses lots of atomic operations. Unfortunately, Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. In most cases, that is excessive for verifier, and "relaxed" would do. Additional testing: - [x] Linux x86_64 hotspot_gc_shenandoah - [x] Linux AArch64 hotspot_gc_shenandoah - [x] Linux AArch64 tier1 with Shenandoah ------------- Commit messages: - 8261503: Shenandoah: reconsider verifier memory ordering Changes: https://git.openjdk.java.net/jdk/pull/2505/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2505&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261503 Stats: 8 lines in 1 file changed: 0 ins; 0 del; 8 mod Patch: https://git.openjdk.java.net/jdk/pull/2505.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2505/head:pull/2505 PR: https://git.openjdk.java.net/jdk/pull/2505 From shade at openjdk.java.net Wed Feb 10 15:14:47 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Wed, 10 Feb 2021 15:14:47 GMT Subject: RFR: 8261496: Shenandoah: reconsider pacing updates memory ordering Message-ID: <_BlnOgWoSTjE1myt9WfuiZpM9hiIP7sGp38IJmzuyYg=.8a578dda-dbf7-4780-bc74-cf3710609005@github.com> Shenandoah pacer uses atomic operations to update budget, progress, allocations seen. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. This is excessive for pacing, as we do not piggyback memory effects on it. All pacing updates can use "relaxed". 
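For reference, the pacing updates are plain counter arithmetic on a shared budget; a minimal standalone sketch of what "relaxed" means here (std::atomic stand-ins and invented names, not the actual ShenandoahPacer code):

```
#include <atomic>
#include <cstdint>

// Toy stand-in for pacer state: a budget that mutators claim from and
// the GC replenishes as it reports progress. Nothing establishes
// happens-before through these counters, so relaxed ordering suffices.
struct ToyPacer {
  std::atomic<int64_t> budget{0};
  std::atomic<int64_t> progress{0};

  // Mutator path: try to claim 'words' from the budget.
  bool claim_for_alloc(int64_t words) {
    int64_t cur = budget.load(std::memory_order_relaxed);
    while (cur >= words) {
      if (budget.compare_exchange_weak(cur, cur - words,
                                       std::memory_order_relaxed)) {
        return true;
      }
      // On failure 'cur' is refreshed and the budget is re-checked.
    }
    return false;   // not enough budget; caller would pace/stall
  }

  // GC path: report progress and replenish the budget.
  void report_progress(int64_t words) {
    progress.fetch_add(words, std::memory_order_relaxed);
    budget.fetch_add(words, std::memory_order_relaxed);
  }
};

int main() {
  ToyPacer p;
  p.report_progress(100);
  return p.claim_for_alloc(40) ? 0 : 1;
}
```

Since nobody piggybacks memory effects on these counters, atomicity of the add/CAS is the only property needed.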
Additional testing: - [x] Linux x86_64 hotspot_gc_shenandoah - [x] Linux AArch64 hotspot_gc_shenandoah - [x] Linux AArch64 tier1 with Shenandoah ------------- Commit messages: - 8261496: Shenandoah: reconsider pacing updates memory ordering Changes: https://git.openjdk.java.net/jdk/pull/2501/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2501&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261496 Stats: 7 lines in 3 files changed: 0 ins; 0 del; 7 mod Patch: https://git.openjdk.java.net/jdk/pull/2501.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2501/head:pull/2501 PR: https://git.openjdk.java.net/jdk/pull/2501 From zgu at openjdk.java.net Wed Feb 10 15:25:39 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Wed, 10 Feb 2021 15:25:39 GMT Subject: RFR: 8261496: Shenandoah: reconsider pacing updates memory ordering In-Reply-To: <_BlnOgWoSTjE1myt9WfuiZpM9hiIP7sGp38IJmzuyYg=.8a578dda-dbf7-4780-bc74-cf3710609005@github.com> References: <_BlnOgWoSTjE1myt9WfuiZpM9hiIP7sGp38IJmzuyYg=.8a578dda-dbf7-4780-bc74-cf3710609005@github.com> Message-ID: On Wed, 10 Feb 2021 10:13:47 GMT, Aleksey Shipilev wrote: > Shenandoah pacer uses atomic operations to update budget, progress, allocations seen. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > This is excessive for pacing, as we do not piggyback memory effects on it. All pacing updates can use "relaxed". > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah Looks good to me ------------- Marked as reviewed by zgu (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2501 From zgu at openjdk.java.net Wed Feb 10 15:27:44 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Wed, 10 Feb 2021 15:27:44 GMT Subject: RFR: 8261503: Shenandoah: reconsider verifier memory ordering In-Reply-To: References: Message-ID: On Wed, 10 Feb 2021 11:41:45 GMT, Aleksey Shipilev wrote: > Shenandoah verifier uses lots of atomic operations. Unfortunately, Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > In most cases, that is excessive for verifier, and "relaxed" would do. > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah Looks good. ------------- Marked as reviewed by zgu (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2505 From shade at openjdk.java.net Wed Feb 10 15:28:39 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Wed, 10 Feb 2021 15:28:39 GMT Subject: RFR: 8261495: Shenandoah: reconsider update references memory ordering In-Reply-To: References: <4RLKvcdaWu0Cu6owC3yGoVY1KVEsYjBZEFJhfdwnhWg=.65fbeae1-58f6-48d3-a2ed-981858ef7da9@github.com> Message-ID: <-XbC4UcEc8lhp2-6w1hq2sOHrX2R-x7nfdgMuUWTxwg=.b38923c7-ca15-4f17-804d-e44942f71621@github.com> On Wed, 10 Feb 2021 13:37:59 GMT, Zhengyu Gu wrote: >> Shenandoah update heap references code uses default Atomic::cmpxchg to avoid races with mutator updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. >> >> This seems to be excessive for Shenandoah update references code, and "relaxed" is enough. 
We do not seem to piggyback on update-references memory effects anywhere (in fact, if not for mutator, we would not even need a CAS). >> >> Sample run with aggressive (back-to-back cycles) on SPECjvm2008:compiler.compiler on AArch64: >> >> # Baseline >> [135.065s][info][gc,stats] Concurrent Update Refs = 73.685 s (a = 295924 us) (n = 249) >> (lvls, us = 354, 3418, 349609, 564453, 715405) >> >> # Patched >> [127.649s][info][gc,stats] Concurrent Update Refs = 54.389 s (a = 169437 us) (n = 321) >> (lvls, us = 324, 2188, 183594, 322266, 394495) >> >> Average time goes down, the number of GC cycles go up, since the cycles are shorter. >> >> Additional testing: >> - [x] Linux x86_64 hotspot_gc_shenandoah >> - [x] Linux AArch64 hotspot_gc_shenandoah >> - [x] Linux AArch64 tier1 with Shenandoah > > src/hotspot/share/gc/shenandoah/shenandoahHeap.inline.hpp line 149: > >> 147: assert(is_aligned(addr, sizeof(narrowOop)), "Address should be aligned: " PTR_FORMAT, p2i(addr)); >> 148: narrowOop val = CompressedOops::encode(n); >> 149: return CompressedOops::decode(Atomic::cmpxchg(addr, c, val, memory_order_relaxed)); > > Are you sure it is sufficient? I would think it needs acq/rel pair, otherwise, read side can see incomplete oop ... Actually, I think you are right: we must ensure the cumulativity of the barriers. Let me think a bit more about it. ------------- PR: https://git.openjdk.java.net/jdk/pull/2498 From aph at redhat.com Wed Feb 10 16:07:16 2021 From: aph at redhat.com (Andrew Haley) Date: Wed, 10 Feb 2021 16:07:16 +0000 Subject: Atomic operations: your thoughts are welocme In-Reply-To: References: Message-ID: Oh, sorry. This is my favourite benchmark, javac all of java.base. I'm mostly using that because it's easy to run without any external dependencies, and it loads a lot of classes. It's no better or worse than any other random program. On 2/10/21 6:44 AM, Ioi Lam wrote: > Just curious, which benchmark is this? > > Thanks > - Ioi > > On 2/8/21 10:14 AM, Andrew Haley wrote: >> I've been looking at the hottest Atomic operations in HotSpot, with a view to >> finding out if the default memory_order_conservative (which is very expensive >> on some architectures) can be weakened to something less. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From shade at openjdk.java.net Wed Feb 10 16:14:45 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Wed, 10 Feb 2021 16:14:45 GMT Subject: RFR: 8261501: Shenandoah: reconsider heap statistics memory ordering Message-ID: <0O1tXXs991770rhrpYioXIWr6m-OhDFMZINDiQ_UXc4=.92460035-468e-4bf5-97cb-bff58d1a2ede@github.com> ShenandoahHeap collects heap-wide statistics (used, committed, etc). It does so by atomically updating them with default CASes. Unfortunately, Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. This is excessive for statistics gathering, and "relaxed" should be just as good. 
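A minimal standalone sketch of the kind of counter updates involved (plain std::atomic, invented names, not the actual ShenandoahHeap code):

```
#include <atomic>
#include <cstddef>

// Toy heap-wide statistics: they feed logging/monitoring only, nothing
// synchronizes on them, so relaxed atomic updates are sufficient.
class ToyHeapStats {
  std::atomic<size_t> _used{0};
  std::atomic<size_t> _committed{0};

public:
  void increase_used(size_t bytes) {
    _used.fetch_add(bytes, std::memory_order_relaxed);
  }
  void decrease_used(size_t bytes) {
    _used.fetch_sub(bytes, std::memory_order_relaxed);
  }
  void set_committed(size_t bytes) {
    _committed.store(bytes, std::memory_order_relaxed);
  }
  size_t used() const {
    return _used.load(std::memory_order_relaxed);
  }
};

int main() {
  ToyHeapStats stats;
  stats.increase_used(1024);
  stats.decrease_used(256);
  return stats.used() == 768 ? 0 : 1;
}
```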
Additional testing: - [x] Linux x86_64 hotspot_gc_shenandoah - [x] Linux AArch64 hotspot_gc_shenandoah - [x] Linux AArch64 tier1 with Shenandoah ------------- Commit messages: - 8261501: Shenandoah: reconsider heap statistics memory ordering Changes: https://git.openjdk.java.net/jdk/pull/2504/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2504&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261501 Stats: 9 lines in 1 file changed: 0 ins; 1 del; 8 mod Patch: https://git.openjdk.java.net/jdk/pull/2504.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2504/head:pull/2504 PR: https://git.openjdk.java.net/jdk/pull/2504 From shade at openjdk.java.net Wed Feb 10 17:43:47 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Wed, 10 Feb 2021 17:43:47 GMT Subject: RFR: 8261500: Shenandoah: reconsider region live data memory ordering Message-ID: Current Shenandoah region live data tracking uses default CAS updates to achieve atomicity of updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. This seems to be excessive for live data tracking, and "relaxed" could be used instead. The only serious user of that data is collection set chooser, which runs at safepoint and so everything should be quiescent when that happens. Additional testing: - [x] Linux x86_64 hotspot_gc_shenandoah - [x] Linux AArch64 hotspot_gc_shenandoah - [x] Linux AArch64 tier1 with Shenandoah ------------- Commit messages: - 8261500: Shenandoah: reconsider region live data memory ordering Changes: https://git.openjdk.java.net/jdk/pull/2503/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2503&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261500 Stats: 3 lines in 1 file changed: 0 ins; 0 del; 3 mod Patch: https://git.openjdk.java.net/jdk/pull/2503.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2503/head:pull/2503 PR: https://git.openjdk.java.net/jdk/pull/2503 From kevinw at openjdk.java.net Wed Feb 10 17:56:39 2021 From: kevinw at openjdk.java.net (Kevin Walls) Date: Wed, 10 Feb 2021 17:56:39 GMT Subject: RFR: 8247514: Improve clhsdb 'findpc' ability to determine what an address points to by improving PointerFinder and PointerLocation classes In-Reply-To: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> References: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> Message-ID: On Sun, 17 Jan 2021 03:57:59 GMT, Chris Plummer wrote: > See the bug for most details. A few notes here about some implementation details: > > In the `PointerLocation` class, I added more consistency w.r.t. whether or not a newline is printed. It used to for some address types, but not others. Now it always does. And if you see a comment something like the following: > > ` getTLAB().printOn(tty); // includes "\n" ` > > That's just clarifying whether or not the `printOn()` method called will include the newline. Some do and some don't, and knowing what the various `printOn()` methods do makes getting the proper inclusion of the newline easier to understand. > > I added `verbose` and `printAddress` boolean arguments to `PointerLocation.printOn()`. Currently they are always `true`. The false arguments will be used when I complete [JDK-8250801](https://bugs.openjdk.java.net/browse/JDK-8250801), which will use `PointerFinder/Location` to show what each register points to. 
> > The CR mentions that the main motivation for this work is for eventual replacement of the old clhsdb `whatis` command, which was implemented in javascript. It used to resolve DSO symbols, whereas `findpc` did not. The `whatis` code did this with the following: > > var dso = loadObjectContainingPC(addr); > if (dso == null) { > return ptrLoc.toString(); > } > var sym = dso.closestSymbolToPC(addr); > if (sym != null) { > return sym.name + '+' + sym.offset; > } > And now you'll see something similar in the PointerFinder code: > > loc.loadObject = cdbg.loadObjectContainingPC(a); > if (loc.loadObject != null) { > loc.nativeSymbol = loc.loadObject.closestSymbolToPC(a); > return loc; > } > Note that now that `findpc` does everything that `whatis` used to (and more), we don't really need to add a java version of `whatis`, but I'll probably do so anyway just help out people who are used to using the `whatis` command. That will be done using [JDK-8244670](https://bugs.openjdk.java.net/browse/JDK-8244670) Looks good, thanks. (Comment in PointerLocation.java, treat as you see fit.) src/jdk.hotspot.agent/share/classes/sun/jvm/hotspot/utilities/PointerLocation.java line 247: > 245: stackThread.getStackBase(), stackThread.lastSPDbg(), > 246: stackThread.getStackBase().addOffsetTo(-stackThread.getStackSize()), > 247: stackThread); When we print a JavaThread, in the verbose block, the final argument to tty.format in line 247, I wonder what that prints? We then call printThreadInfoOn() which will first print the quoted thread name, so maybe we don't need that item. Or maybe we want the JavaThread.toString()? ------------- Marked as reviewed by kevinw (Committer). PR: https://git.openjdk.java.net/jdk/pull/2111 From zgu at openjdk.java.net Wed Feb 10 17:58:39 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Wed, 10 Feb 2021 17:58:39 GMT Subject: RFR: 8261500: Shenandoah: reconsider region live data memory ordering In-Reply-To: References: Message-ID: On Wed, 10 Feb 2021 10:40:26 GMT, Aleksey Shipilev wrote: > Current Shenandoah region live data tracking uses default CAS updates to achieve atomicity of updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > This seems to be excessive for live data tracking, and "relaxed" could be used instead. The only serious user of that data is collection set chooser, which runs at safepoint and so everything should be quiescent when that happens. > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah Looks good ------------- Marked as reviewed by zgu (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2503 From shade at openjdk.java.net Wed Feb 10 19:07:37 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Wed, 10 Feb 2021 19:07:37 GMT Subject: RFR: 8261495: Shenandoah: reconsider update references memory ordering [v2] In-Reply-To: <4RLKvcdaWu0Cu6owC3yGoVY1KVEsYjBZEFJhfdwnhWg=.65fbeae1-58f6-48d3-a2ed-981858ef7da9@github.com> References: <4RLKvcdaWu0Cu6owC3yGoVY1KVEsYjBZEFJhfdwnhWg=.65fbeae1-58f6-48d3-a2ed-981858ef7da9@github.com> Message-ID: > Shenandoah update heap references code uses default Atomic::cmpxchg to avoid races with mutator updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. 
> > This seems to be excessive for Shenandoah update references code, and "acq_rel" is enough. We do not seem to piggyback on update-references memory effects anywhere (in fact, if not for mutator, we would not even need a CAS). But, there is an interplay with concurrent evacuation and updates from self-healing. > > Sample run with aggressive (back-to-back cycles) on SPECjvm2008:compiler.compiler on AArch64: > > # Baseline > [135.065s][info][gc,stats] Concurrent Update Refs = 73.685 s (a = 295924 us) (n = 249) > (lvls, us = 354, 3418, 349609, 564453, 715405) > > # Patched > [127.649s][info][gc,stats] Concurrent Update Refs = 54.389 s (a = 169437 us) (n = 321) > (lvls, us = 324, 2188, 183594, 322266, 394495) > > Average time goes down, the number of GC cycles go up, since the cycles are shorter. > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah Aleksey Shipilev has updated the pull request incrementally with one additional commit since the last revision: Do acq_rel instead ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2498/files - new: https://git.openjdk.java.net/jdk/pull/2498/files/d83b9af4..87a609f4 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2498&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2498&range=00-01 Stats: 41 lines in 1 file changed: 38 ins; 0 del; 3 mod Patch: https://git.openjdk.java.net/jdk/pull/2498.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2498/head:pull/2498 PR: https://git.openjdk.java.net/jdk/pull/2498 From shade at openjdk.java.net Wed Feb 10 19:07:38 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Wed, 10 Feb 2021 19:07:38 GMT Subject: RFR: 8261495: Shenandoah: reconsider update references memory ordering [v2] In-Reply-To: <-XbC4UcEc8lhp2-6w1hq2sOHrX2R-x7nfdgMuUWTxwg=.b38923c7-ca15-4f17-804d-e44942f71621@github.com> References: <4RLKvcdaWu0Cu6owC3yGoVY1KVEsYjBZEFJhfdwnhWg=.65fbeae1-58f6-48d3-a2ed-981858ef7da9@github.com> <-XbC4UcEc8lhp2-6w1hq2sOHrX2R-x7nfdgMuUWTxwg=.b38923c7-ca15-4f17-804d-e44942f71621@github.com> Message-ID: On Wed, 10 Feb 2021 15:25:33 GMT, Aleksey Shipilev wrote: >> src/hotspot/share/gc/shenandoah/shenandoahHeap.inline.hpp line 149: >> >>> 147: assert(is_aligned(addr, sizeof(narrowOop)), "Address should be aligned: " PTR_FORMAT, p2i(addr)); >>> 148: narrowOop val = CompressedOops::encode(n); >>> 149: return CompressedOops::decode(Atomic::cmpxchg(addr, c, val, memory_order_relaxed)); >> >> Are you sure it is sufficient? I would think it needs acq/rel pair, otherwise, read side can see incomplete oop ... > > Actually, I think you are right: we must ensure the cumulativity of the barriers. Let me think a bit more about it. I think I convinced myself there is a need for `memory_order_acq_rel`. I added a sketch of (counter-)example in code comments. See if that what you were concerned about? ------------- PR: https://git.openjdk.java.net/jdk/pull/2498 From shade at openjdk.java.net Wed Feb 10 19:10:43 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Wed, 10 Feb 2021 19:10:43 GMT Subject: RFR: 8261504: Shenandoah: reconsider ShenandoahJavaThreadsIterator::claim memory ordering Message-ID: JDK-8256298 added the thread iterator for thread roots, and I don't think we need the Hotspot's default memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. The simple "relaxed" should do. 
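The claim operation only hands out indices into a thread array that is built before the workers start; a standalone sketch of that shape (std::atomic, invented names, not the actual iterator code):

```
#include <atomic>
#include <cstddef>
#include <vector>

// Toy work-claiming iterator: workers grab disjoint chunks of a fixed
// array by bumping a shared index. The array is built and published
// before the workers start, so the claim itself only needs atomicity,
// not ordering; hence relaxed.
class ToyThreadsIterator {
  std::vector<int> _threads;            // stands in for the thread list
  std::atomic<size_t> _claimed{0};
  static constexpr size_t kChunk = 16;

public:
  explicit ToyThreadsIterator(size_t n) : _threads(n, 0) {}

  // Returns the next unclaimed [begin, end) chunk; false when exhausted.
  bool claim(size_t& begin, size_t& end) {
    size_t from = _claimed.fetch_add(kChunk, std::memory_order_relaxed);
    if (from >= _threads.size()) return false;
    begin = from;
    end = (from + kChunk < _threads.size()) ? from + kChunk : _threads.size();
    return true;
  }
};

int main() {
  ToyThreadsIterator it(40);
  size_t b, e;
  int chunks = 0;
  while (it.claim(b, e)) chunks++;
  return chunks == 3 ? 0 : 1;   // 40 threads in chunks of 16 -> 3 chunks
}
```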
Additional testing: - [x] Linux x86_64 hotspot_gc_shenandoah - [x] Linux AArch64 hotspot_gc_shenandoah - [x] Linux AArch64 tier1 with Shenandoah ------------- Commit messages: - 8261504: Shenandoah: reconsider ShenandoahJavaThreadsIterator::claim memory ordering Changes: https://git.openjdk.java.net/jdk/pull/2506/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2506&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261504 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2506.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2506/head:pull/2506 PR: https://git.openjdk.java.net/jdk/pull/2506 From zgu at openjdk.java.net Wed Feb 10 19:23:39 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Wed, 10 Feb 2021 19:23:39 GMT Subject: RFR: 8261495: Shenandoah: reconsider update references memory ordering [v2] In-Reply-To: References: <4RLKvcdaWu0Cu6owC3yGoVY1KVEsYjBZEFJhfdwnhWg=.65fbeae1-58f6-48d3-a2ed-981858ef7da9@github.com> <-XbC4UcEc8lhp2-6w1hq2sOHrX2R-x7nfdgMuUWTxwg=.b38923c7-ca15-4f17-804d-e44942f71621@github.com> Message-ID: On Wed, 10 Feb 2021 18:59:42 GMT, Aleksey Shipilev wrote: >> Actually, I think you are right: we must ensure the cumulativity of the barriers. Let me think a bit more about it. > > I think I convinced myself there is a need for `memory_order_acq_rel`. I added a sketch of (counter-)example in code comments. See if that what you were concerned about? Actually, what I meant is that CAS here should use memory_order_release and load barrier needs memory_order_acquire. I am not sure memory_order_acq_rel is sufficient, C++ states _memory_order_acq_rel_: A read-modify-write operation with this memory order is both an acquire operation and a release operation. No memory reads or writes in the current thread can be reordered before or after this store. All writes in other threads that release the same atomic variable are visible before the modification and **the modification is visible in other threads that acquire the same atomic variable.** ------------- PR: https://git.openjdk.java.net/jdk/pull/2498 From zgu at openjdk.java.net Wed Feb 10 19:25:38 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Wed, 10 Feb 2021 19:25:38 GMT Subject: RFR: 8261504: Shenandoah: reconsider ShenandoahJavaThreadsIterator::claim memory ordering In-Reply-To: References: Message-ID: On Wed, 10 Feb 2021 12:00:38 GMT, Aleksey Shipilev wrote: > JDK-8256298 added the thread iterator for thread roots, and I don't think we need the Hotspot's default memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. The simple "relaxed" should do. > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah Looks good. ------------- Marked as reviewed by zgu (Reviewer). 
PR: https://git.openjdk.java.net/jdk/pull/2506 From zgu at openjdk.java.net Wed Feb 10 20:13:52 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Wed, 10 Feb 2021 20:13:52 GMT Subject: RFR: 8261473: Shenandoah: Add breakpoint support [v2] In-Reply-To: References: Message-ID: <9gXmTI0gU9zTr-HffSqSsVEVjmUED0rNINulpr_mjQM=.b353278c-a4f1-45f5-bf8d-ed1e33fbb0c9@github.com> > Please review this patch that adds breakpoint support for Shenandoah, which allows Shenandoah to run a few tests: > > gc/TestConcurrentGCBreakpoints.java > gc/TestJNIWeak/TestJNIWeak.java > gc/TestReferenceClearDuringMarking.java > gc/TestReferenceClearDuringReferenceProcessing.java > gc/TestReferenceRefersTo.java > > The drawback is that the above tests cannot run in passive mode, which can cause the tests to hang, as breakpoints only apply to concurrent GC. > > Test: > - [x] hotspot_gc_shenandoah > - [x] tier1 with Shenandoah Zhengyu Gu has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains three commits: - Merge - update - init update ------------- Changes: https://git.openjdk.java.net/jdk/pull/2489/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2489&range=01 Stats: 170 lines in 8 files changed: 158 ins; 2 del; 10 mod Patch: https://git.openjdk.java.net/jdk/pull/2489.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2489/head:pull/2489 PR: https://git.openjdk.java.net/jdk/pull/2489 From zgu at openjdk.java.net Wed Feb 10 20:47:37 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Wed, 10 Feb 2021 20:47:37 GMT Subject: RFR: 8261495: Shenandoah: reconsider update references memory ordering [v2] In-Reply-To: References: <4RLKvcdaWu0Cu6owC3yGoVY1KVEsYjBZEFJhfdwnhWg=.65fbeae1-58f6-48d3-a2ed-981858ef7da9@github.com> <-XbC4UcEc8lhp2-6w1hq2sOHrX2R-x7nfdgMuUWTxwg=.b38923c7-ca15-4f17-804d-e44942f71621@github.com> Message-ID: On Wed, 10 Feb 2021 18:59:42 GMT, Aleksey Shipilev wrote: >> Actually, I think you are right: we must ensure the cumulativity of the barriers. Let me think a bit more about it. > > I think I convinced myself there is a need for `memory_order_acq_rel`. I added a sketch of (counter-)example in code comments. See if that is what you were concerned about? Actually, what I meant is that the CAS here should use memory_order_release and the load barrier needs memory_order_acquire. I am not sure memory_order_acq_rel is sufficient; C++ defines _memory_order_acq_rel_ as: A read-modify-write operation with this memory order is both an acquire operation and a release operation. No memory reads or writes in the current thread can be reordered before or after this store. All writes in other threads that release the same atomic variable are visible before the modification and **the modification is visible in other threads that acquire the same atomic variable.** ------------- PR: https://git.openjdk.java.net/jdk/pull/2498 From cjplummer at openjdk.java.net Wed Feb 10 21:14:43 2021 From: cjplummer at openjdk.java.net (Chris Plummer) Date: Wed, 10 Feb 2021 21:14:43 GMT Subject: RFR: 8247514: Improve clhsdb 'findpc' ability to determine what an address points to by improving PointerFinder and PointerLocation classes In-Reply-To: References: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> Message-ID: <3oj2UVlWuk4yylfNEKxWcKSqUAw7p0oG9C9QsGxYidc=.6ddd8f76-bba7-43b4-b508-c55ab893c7d1@github.com> On Wed, 10 Feb 2021 17:52:59 GMT, Kevin Walls wrote: >> See the bug for most details.
A few notes here about some implementation details: >> >> In the `PointerLocation` class, I added more consistency w.r.t. whether or not a newline is printed. It used to for some address types, but not others. Now it always does. And if you see a comment something like the following: >> >> ` getTLAB().printOn(tty); // includes "\n" ` >> >> That's just clarifying whether or not the `printOn()` method called will include the newline. Some do and some don't, and knowing what the various `printOn()` methods do makes getting the proper inclusion of the newline easier to understand. >> >> I added `verbose` and `printAddress` boolean arguments to `PointerLocation.printOn()`. Currently they are always `true`. The false arguments will be used when I complete [JDK-8250801](https://bugs.openjdk.java.net/browse/JDK-8250801), which will use `PointerFinder/Location` to show what each register points to. >> >> The CR mentions that the main motivation for this work is for eventual replacement of the old clhsdb `whatis` command, which was implemented in javascript. It used to resolve DSO symbols, whereas `findpc` did not. The `whatis` code did this with the following: >> >> var dso = loadObjectContainingPC(addr); >> if (dso == null) { >> return ptrLoc.toString(); >> } >> var sym = dso.closestSymbolToPC(addr); >> if (sym != null) { >> return sym.name + '+' + sym.offset; >> } >> And now you'll see something similar in the PointerFinder code: >> >> loc.loadObject = cdbg.loadObjectContainingPC(a); >> if (loc.loadObject != null) { >> loc.nativeSymbol = loc.loadObject.closestSymbolToPC(a); >> return loc; >> } >> Note that now that `findpc` does everything that `whatis` used to (and more), we don't really need to add a java version of `whatis`, but I'll probably do so anyway just help out people who are used to using the `whatis` command. That will be done using [JDK-8244670](https://bugs.openjdk.java.net/browse/JDK-8244670) > > src/jdk.hotspot.agent/share/classes/sun/jvm/hotspot/utilities/PointerLocation.java line 247: > >> 245: stackThread.getStackBase(), stackThread.lastSPDbg(), >> 246: stackThread.getStackBase().addOffsetTo(-stackThread.getStackSize()), >> 247: stackThread); > > When we print a JavaThread, in the verbose block, > the final argument to tty.format in line 247, I wonder what that prints? > > We then call printThreadInfoOn() which will first print the quoted thread name, > so maybe we don't need that item. > Or maybe we want the JavaThread.toString()? `stackThread.toString()` ends up in `VMObject.toString()`: public String toString() { return getClass().getName() + "@" + addr; } And here's an example output: hsdb> + findpc 0x0000152f45df6000 Address 0x0000152f45df6000: In java stack [0x0000152f45df8000,0x0000152f45df6580,0x0000152f45cf7000] for thread sun.jvm.hotspot.runtime.JavaThread at 0x0000152f3c026f70: "main" #1 prio=5 tid=0x0000152f3c026f70 nid=0x308e waiting on condition [0x0000152f45df6000] java.lang.Thread.State: TIMED_WAITING (sleeping) JavaThread state: _thread_blocked So I think the `stackThread` argument is doing what was intended, and there is no duplication in the output. 
------------- PR: https://git.openjdk.java.net/jdk/pull/2111 From duke at openjdk.java.net Wed Feb 10 21:41:41 2021 From: duke at openjdk.java.net (duke) Date: Wed, 10 Feb 2021 21:41:41 GMT Subject: Withdrawn: 8257774: G1: Trigger collect when free region count drops below threshold to prevent evacuation failures In-Reply-To: References: Message-ID: On Sun, 6 Dec 2020 17:39:54 GMT, Charlie Gracie wrote: > Bursts of short lived Humongous object allocations can cause GCs to be initiated with 0 free regions. When these GCs happen they take significantly longer to complete. No objects are evacuated so there is a large amount of time spent in reversing self forwarded pointers and the only memory recovered is from the short lived humongous objects. My proposal is to add a check to the slow allocation path which will force a GC to happen if the number of free regions drops below the amount that would be required to complete the GC if it happened at that moment. The threshold will be based on the survival rates from Eden and survivor spaces along with the space required for Tenure space evacuations. > > The goal is to resolve the issue with bursts of short lived humongous objects without impacting other workloads negatively. I would appreciate reviews and any feedback that you might have. Thanks. > > Here are the links to the threads on the mailing list where I initially discussion the issue and my idea to resolve it: > https://mail.openjdk.java.net/pipermail/hotspot-gc-dev/2020-November/032189.html > https://mail.openjdk.java.net/pipermail/hotspot-gc-dev/2020-December/032677.html This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.java.net/jdk/pull/1650 From cjplummer at openjdk.java.net Thu Feb 11 00:06:48 2021 From: cjplummer at openjdk.java.net (Chris Plummer) Date: Thu, 11 Feb 2021 00:06:48 GMT Subject: RFR: 8247514: Improve clhsdb 'findpc' ability to determine what an address points to by improving PointerFinder and PointerLocation classes [v2] In-Reply-To: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> References: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> Message-ID: > See the bug for most details. A few notes here about some implementation details: > > In the `PointerLocation` class, I added more consistency w.r.t. whether or not a newline is printed. It used to for some address types, but not others. Now it always does. And if you see a comment something like the following: > > ` getTLAB().printOn(tty); // includes "\n" ` > > That's just clarifying whether or not the `printOn()` method called will include the newline. Some do and some don't, and knowing what the various `printOn()` methods do makes getting the proper inclusion of the newline easier to understand. > > I added `verbose` and `printAddress` boolean arguments to `PointerLocation.printOn()`. Currently they are always `true`. The false arguments will be used when I complete [JDK-8250801](https://bugs.openjdk.java.net/browse/JDK-8250801), which will use `PointerFinder/Location` to show what each register points to. > > The CR mentions that the main motivation for this work is for eventual replacement of the old clhsdb `whatis` command, which was implemented in javascript. It used to resolve DSO symbols, whereas `findpc` did not. 
The `whatis` code did this with the following: > > var dso = loadObjectContainingPC(addr); > if (dso == null) { > return ptrLoc.toString(); > } > var sym = dso.closestSymbolToPC(addr); > if (sym != null) { > return sym.name + '+' + sym.offset; > } > And now you'll see something similar in the PointerFinder code: > > loc.loadObject = cdbg.loadObjectContainingPC(a); > if (loc.loadObject != null) { > loc.nativeSymbol = loc.loadObject.closestSymbolToPC(a); > return loc; > } > Note that now that `findpc` does everything that `whatis` used to (and more), we don't really need to add a java version of `whatis`, but I'll probably do so anyway just help out people who are used to using the `whatis` command. That will be done using [JDK-8244670](https://bugs.openjdk.java.net/browse/JDK-8244670) Chris Plummer has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains two commits: - Merge master - Improvements for PointerFinder and findpc command. ------------- Changes: https://git.openjdk.java.net/jdk/pull/2111/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2111&range=01 Stats: 291 lines in 5 files changed: 237 ins; 8 del; 46 mod Patch: https://git.openjdk.java.net/jdk/pull/2111.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2111/head:pull/2111 PR: https://git.openjdk.java.net/jdk/pull/2111 From kim.barrett at oracle.com Thu Feb 11 03:59:27 2021 From: kim.barrett at oracle.com (Kim Barrett) Date: Wed, 10 Feb 2021 22:59:27 -0500 Subject: Atomic operations: your thoughts are welocme In-Reply-To: References: Message-ID: <448C638F-D688-4913-875C-5D8BA9235126@oracle.com> > On Feb 8, 2021, at 1:14 PM, Andrew Haley wrote: > > I've been looking at the hottest Atomic operations in HotSpot, with a view to > finding out if the default memory_order_conservative (which is very expensive > on some architectures) can be weakened to something less. It's impossible to > fix all of them, but perhaps we can fix some of the most frequent. Is there any information about the possible performance improvement from such changes? 1.5-3M occurrences doesn't mean much without context. We don't presently have support for sequentially consistent semantics, only "conservative". My recollection is that this is in part because there might be code that is assuming the possibly stronger "conservative" semantics, and in part because there are different and incompatible approaches to implementing sequentially consistent semantics on some hardware platforms and we didn't want to make assumptions there. We also don't presently have any cmpxchg implementation that really supports anything between conservative and relaxed, nor do we support different order constraints for the success vs failure cases. Things can be complicated enough as is; while we *could* fill some of that in, I'm not sure we should. > These are the hottest compare-and-swap uses in HotSpot, with the count > at the end of each line. > > : :: = 16406757 > > This one is already memory_order_relaxed, so no problem. Right. Although I?m now wondering why this doesn?t need to do anything on the failure side, similar to what is needed in the similar place in ParallelGC when that was changed to use a relaxed cmpxchg. > ::Table::oop_oop_iterate(G1CMOopClosure*, oopDesc*, Klass*)+336>: :: = 3903178 > > This is actually MarkBitMap::par_mark calling BitMap::par_set_bit. Does this > need to be memory_order_conservative, or would something weaker do? Even > acq_rel or seq_cst would be better. 
I think for setting bits in a bitmap the thing to do would be to identify places that are safe and useful (impacts performance) to do so first. Then add a weaker variant for use in those places, assuming any are found. > : :: = 2376632 > : :: = 2003895 > > I can't imagine that either of these actually need memory_order_conservative, > they're just reference counts. The "usual" refcount implementation involves relaxed increment and stronger ordering for decrement. (If I'm remembering correctly, dec-acquire and a release fence on the zero value path before deleting. But I've not thought about what one might want for this CAS-based variant that handles boundary cases specially.) And as you say, whether any of these could be weakened depends on whether there is any code surrounding a use that depends on the stronger ordering semantics. At a guess I suspect increment could be changed to relaxed, but I've not looked. This one is probably a question for runtime folks. > : :: = 1719614 > > BitMap::par_set_bit again. > > , (MEMFLAGS)5>*)+432>: :: = 1617659 > > This one is GenericTaskQueue::pop_global calling cmpxchg_age(). > Again, do we need conservative here? This needs at least sequentially consistent semantics on the success path. See the referenced paper by Le, et al. There is also a cmpxchg_age in pop_local_slow. The Le, et al paper doesn't deal with that path. But it's also not in your list, which is good since this is supposed to be infrequently taken. From kim.barrett at oracle.com Thu Feb 11 04:09:56 2021 From: kim.barrett at oracle.com (Kim Barrett) Date: Wed, 10 Feb 2021 23:09:56 -0500 Subject: Atomic operations: your thoughts are welocme In-Reply-To: <448C638F-D688-4913-875C-5D8BA9235126@oracle.com> References: <448C638F-D688-4913-875C-5D8BA9235126@oracle.com> Message-ID: <74CD1B2A-E99A-4A97-BBE3-3DF6ED506A11@oracle.com> > On Feb 10, 2021, at 10:59 PM, Kim Barrett wrote: > We also don't presently have any cmpxchg implementation that really supports > anything between conservative and relaxed, nor do we support different order > constraints for the success vs failure cases. Things can be complicated > enough as is; while we *could* fill some of that in, I'm not sure we should. I forgot that the linux-ppc port tries harder in this area. This was so a release-cmpxchg could be used in ParallelGC's PSPromotionManager::copy_to_survivor_space and benefit from that. The initial proposal was to use relaxed-cmpxchg, but that was shown to be insufficient. From shade at openjdk.java.net Thu Feb 11 06:37:56 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Thu, 11 Feb 2021 06:37:56 GMT Subject: RFR: 8261495: Shenandoah: reconsider update references memory ordering [v3] In-Reply-To: <4RLKvcdaWu0Cu6owC3yGoVY1KVEsYjBZEFJhfdwnhWg=.65fbeae1-58f6-48d3-a2ed-981858ef7da9@github.com> References: <4RLKvcdaWu0Cu6owC3yGoVY1KVEsYjBZEFJhfdwnhWg=.65fbeae1-58f6-48d3-a2ed-981858ef7da9@github.com> Message-ID: > Shenandoah update heap references code uses default Atomic::cmpxchg to avoid races with mutator updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > This seems to be excessive for Shenandoah update references code, and "acq_rel" is enough. We do not seem to piggyback on update-references memory effects anywhere (in fact, if not for mutator, we would not even need a CAS). But, there is an interplay with concurrent evacuation and updates from self-healing. 
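(To make the self-healing interplay concrete: the evacuating thread copies the object first and then publishes the new location, and a reader that observes the new location must also observe the copied payload. A standalone release/acquire sketch with toy types and invented names, not the actual Shenandoah code:)

```
#include <atomic>

struct ToyObj {
  int payload[4];
};

// The forwarding slot that publishes where the object now lives.
std::atomic<ToyObj*> forwardee{nullptr};

// Evacuating thread: copy first, then publish with "release" so that
// anyone who observes the new address also observes the copied payload.
void evacuate(ToyObj* from, ToyObj* to) {
  *to = *from;                                     // copy the object
  ToyObj* expected = nullptr;
  forwardee.compare_exchange_strong(expected, to,
                                    std::memory_order_release,
                                    std::memory_order_relaxed);
  // On failure another thread won the race; its copy is the published one.
}

// Reading thread: "acquire" pairs with the release above, so seeing the
// forwardee implies seeing its contents.
int read_payload() {
  ToyObj* fwd = forwardee.load(std::memory_order_acquire);
  return fwd ? fwd->payload[0] : -1;
}

int main() {
  static ToyObj from_space = {{7, 0, 0, 0}};
  static ToyObj to_space;
  evacuate(&from_space, &to_space);
  return read_payload() == 7 ? 0 : 1;
}
```

With relaxed ordering on both sides the reader could observe the new address but a stale payload; the release on the publishing CAS plus acquire on the reading load is what rules that out.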
> > Sample run with aggressive (back-to-back cycles) on SPECjvm2008:compiler.compiler on AArch64: > > # Baseline > [135.065s][info][gc,stats] Concurrent Update Refs = 73.685 s (a = 295924 us) (n = 249) > (lvls, us = 354, 3418, 349609, 564453, 715405) > > # Patched > [127.649s][info][gc,stats] Concurrent Update Refs = 54.389 s (a = 169437 us) (n = 321) > (lvls, us = 324, 2188, 183594, 322266, 394495) > > Average time goes down, the number of GC cycles go up, since the cycles are shorter. > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah Aleksey Shipilev has updated the pull request incrementally with one additional commit since the last revision: Use release only ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2498/files - new: https://git.openjdk.java.net/jdk/pull/2498/files/87a609f4..36bee3a9 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2498&range=02 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2498&range=01-02 Stats: 3 lines in 1 file changed: 0 ins; 0 del; 3 mod Patch: https://git.openjdk.java.net/jdk/pull/2498.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2498/head:pull/2498 PR: https://git.openjdk.java.net/jdk/pull/2498 From kevinw at openjdk.java.net Thu Feb 11 09:00:44 2021 From: kevinw at openjdk.java.net (Kevin Walls) Date: Thu, 11 Feb 2021 09:00:44 GMT Subject: RFR: 8247514: Improve clhsdb 'findpc' ability to determine what an address points to by improving PointerFinder and PointerLocation classes [v2] In-Reply-To: <3oj2UVlWuk4yylfNEKxWcKSqUAw7p0oG9C9QsGxYidc=.6ddd8f76-bba7-43b4-b508-c55ab893c7d1@github.com> References: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> <3oj2UVlWuk4yylfNEKxWcKSqUAw7p0oG9C9QsGxYidc=.6ddd8f76-bba7-43b4-b508-c55ab893c7d1@github.com> Message-ID: On Wed, 10 Feb 2021 21:12:19 GMT, Chris Plummer wrote: >> src/jdk.hotspot.agent/share/classes/sun/jvm/hotspot/utilities/PointerLocation.java line 247: >> >>> 245: stackThread.getStackBase(), stackThread.lastSPDbg(), >>> 246: stackThread.getStackBase().addOffsetTo(-stackThread.getStackSize()), >>> 247: stackThread); >> >> When we print a JavaThread, in the verbose block, >> the final argument to tty.format in line 247, I wonder what that prints? >> >> We then call printThreadInfoOn() which will first print the quoted thread name, >> so maybe we don't need that item. >> Or maybe we want the JavaThread.toString()? > > `stackThread.toString()` ends up in `VMObject.toString()`: > > public String toString() { > return getClass().getName() + "@" + addr; > } > And here's an example output: > hsdb> + findpc 0x0000152f45df6000 > Address 0x0000152f45df6000: In java stack [0x0000152f45df8000,0x0000152f45df6580,0x0000152f45cf7000] for thread sun.jvm.hotspot.runtime.JavaThread at 0x0000152f3c026f70: > "main" #1 prio=5 tid=0x0000152f3c026f70 nid=0x308e waiting on condition [0x0000152f45df6000] > java.lang.Thread.State: TIMED_WAITING (sleeping) > JavaThread state: _thread_blocked > So I think the `stackThread` argument is doing what was intended, and there is no duplication in the output. Great, thanks. 
------------- PR: https://git.openjdk.java.net/jdk/pull/2111 From shade at redhat.com Thu Feb 11 13:13:39 2021 From: shade at redhat.com (Aleksey Shipilev) Date: Thu, 11 Feb 2021 14:13:39 +0100 Subject: Atomic operations: your thoughts are welocme In-Reply-To: <448C638F-D688-4913-875C-5D8BA9235126@oracle.com> References: <448C638F-D688-4913-875C-5D8BA9235126@oracle.com> Message-ID: On 2/11/21 4:59 AM, Kim Barrett wrote: >> On Feb 8, 2021, at 1:14 PM, Andrew Haley wrote: >> >> I've been looking at the hottest Atomic operations in HotSpot, with a view to >> finding out if the default memory_order_conservative (which is very expensive >> on some architectures) can be weakened to something less. It's impossible to >> fix all of them, but perhaps we can fix some of the most frequent. > > Is there any information about the possible performance improvement from > such changes? 1.5-3M occurrences doesn't mean much without context. I am going through the exercise of relaxing some of the memory orders in Shenandoah code, and AArch64 benefits greatly from it (= two-way barriers are bad in hot code). There are obvious things like relaxing counter updates: JDK-8261503: Shenandoah: reconsider verifier memory ordering JDK-8261501: Shenandoah: reconsider heap statistics memory ordering JDK-8261500: Shenandoah: reconsider region live data memory ordering JDK-8261496: Shenandoah: reconsider pacing updates memory ordering There are more interesting things like relaxing accesses to marking bitmap (which is a large counter array in disguise) -- which effectively implies a CAS (and thus two FULL_MEM_BARRIER-s on AArch64) per marked object: JDK-8261493: Shenandoah: reconsider bitmap access memory ordering These five relaxations above cut down marking phase time on AArch64 for about 10..15%. And there is more advanced stuff where relaxed is not enough, but conservative is too conservative. There, acq/rel should be enough -- but we cannot yet test it, because AArch64 cmpxchg does not do anything except relaxed/conservative (JDK-8261579): JDK-8261492: Shenandoah: reconsider forwardee accesses memory ordering JDK-8261495: Shenandoah: reconsider update references memory ordering These two (along with experimental 8261579 fix) cut down evacuation and update-references phase times for about 25..30% and 10..15%, respectively. All in all, this cuts down Shenandoah GC cycle times on AArch64 for about 15..20%! So, I believe this shows enough benefit to invest our time. Heavy-duty GC code is where I expect the most benefit. -- Thanks, -Aleksey From zgu at openjdk.java.net Thu Feb 11 13:30:43 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Thu, 11 Feb 2021 13:30:43 GMT Subject: RFR: 8261495: Shenandoah: reconsider update references memory ordering [v3] In-Reply-To: References: <4RLKvcdaWu0Cu6owC3yGoVY1KVEsYjBZEFJhfdwnhWg=.65fbeae1-58f6-48d3-a2ed-981858ef7da9@github.com> <-XbC4UcEc8lhp2-6w1hq2sOHrX2R-x7nfdgMuUWTxwg=.b38923c7-ca15-4f17-804d-e44942f71621@github.com> Message-ID: On Wed, 10 Feb 2021 20:44:34 GMT, Zhengyu Gu wrote: >> I think I convinced myself there is a need for `memory_order_acq_rel`. I added a sketch of (counter-)example in code comments. See if that what you were concerned about? > > Actually, what I meant is that CAS here should use memory_order_release and load barrier needs memory_order_acquire. > I am not sure memory_order_acq_rel is sufficient, C++ states _memory_order_acq_rel_: > > A read-modify-write operation with this memory order is both an acquire operation and a release operation. 
No memory reads or writes in the current thread can be reordered before or after this store. All writes in other threads that release the same atomic variable are visible before the modification and **the modification is visible in other threads that acquire the same atomic variable.** > > For atomic_update_oop(), the leading acquire is probably unnecessary, since the oop is either evacuated by the current thread or there is a safepoint in between. My concern is a missing read barrier on the read side: e.g. an oop is evacuated and updated by a mutator, then a second Java thread follows the fwdptr and loads the oop; without a read barrier on that load, it may see an incomplete, newly evacuated oop. Ah, I missed JDK-8261495, which is the counterpart of this change. I think memory_order_release is good. ------------- PR: https://git.openjdk.java.net/jdk/pull/2498 From zgu at openjdk.java.net Thu Feb 11 13:30:42 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Thu, 11 Feb 2021 13:30:42 GMT Subject: RFR: 8261495: Shenandoah: reconsider update references memory ordering [v3] In-Reply-To: References: Message-ID: On Thu, 11 Feb 2021 06:37:56 GMT, Aleksey Shipilev wrote: >> Shenandoah update heap references code uses default Atomic::cmpxchg to avoid races with mutator updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. >> >> This seems to be excessive for Shenandoah update references code, and "release" is enough. We do not seem to piggyback on update-references memory effects anywhere (in fact, if not for mutator, we would not even need a CAS). But, there is an interplay with concurrent evacuation and updates from self-healing. >> >> Average time goes down, the number of GC cycles goes up, since the cycles are shorter. >> >> Additional testing: >> - [x] Linux x86_64 hotspot_gc_shenandoah >> - [x] Linux AArch64 hotspot_gc_shenandoah >> - [x] Linux AArch64 tier1 with Shenandoah > > Aleksey Shipilev has updated the pull request incrementally with one additional commit since the last revision: > > Use release only Looks good to me. ------------- Marked as reviewed by zgu (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2498 From aph at redhat.com Thu Feb 11 13:33:30 2021 From: aph at redhat.com (Andrew Haley) Date: Thu, 11 Feb 2021 13:33:30 +0000 Subject: Atomic operations: your thoughts are welocme In-Reply-To: <448C638F-D688-4913-875C-5D8BA9235126@oracle.com> References: <448C638F-D688-4913-875C-5D8BA9235126@oracle.com> Message-ID: <49d0408a-13f9-ddc8-06e3-e0eb27a708dd@redhat.com> On 11/02/2021 03:59, Kim Barrett wrote: >> On Feb 8, 2021, at 1:14 PM, Andrew Haley wrote: >> >> I've been looking at the hottest Atomic operations in HotSpot, with a view to >> finding out if the default memory_order_conservative (which is very expensive >> on some architectures) can be weakened to something less. It's impossible to >> fix all of them, but perhaps we can fix some of the most frequent. > > Is there any information about the possible performance improvement from > such changes? 1.5-3M occurrences doesn't mean much without context. > > We don't presently have support for sequentially consistent semantics, only > "conservative".
My recollection is that this is in part because there might > be code that is assuming the possibly stronger "conservative" semantics, and > in part because there are different and incompatible approaches to > implementing sequentially consistent semantics on some hardware platforms > and we didn't want to make assumptions there. > > We also don't presently have any cmpxchg implementation that really supports > anything between conservative and relaxed, nor do we support different order > constraints for the success vs failure cases. Things can be complicated > enough as is; while we *could* fill some of that in, I'm not sure we should. OK. However, even though we don't implement any of them, we do have an API that includes acq, rel, and seq_cst. The fact that we don't have anything behind them is, I thought, To Be Done rather than Won't Do. >> ::Table::oop_oop_iterate(G1CMOopClosure*, oopDesc*, Klass*)+336>: :: = 3903178 >> >> This is actually MarkBitMap::par_mark calling BitMap::par_set_bit. Does this >> need to be memory_order_conservative, or would something weaker do? Even >> acq_rel or seq_cst would be better. > > I think for setting bits in a bitmap the thing to do would be to identify > places that are safe and useful (impacts performance) to do so first. Then > add a weaker variant for use in those places, assuming any are found. I see. I'm assuming that frequency of use is a useful proxy for impact. Aleksey has already, very helpfully, measured how significant these are for Shenandoah, and I suspect all concurrent GCs would benefit in a similar fashion. >> : :: = 2376632 >> : :: = 2003895 >> >> I can't imagine that either of these actually need memory_order_conservative, >> they're just reference counts. > > The "usual" refcount implementation involves relaxed increment and stronger > ordering for decrement. (If I'm remembering correctly, dec-acquire and a > release fence on the zero value path before deleting. But I've not thought > about what one might want for this CAS-based variant that handles boundary > cases specially.) And as you say, whether any of these could be weakened > depends on whether there is any code surrounding a use that depends on the > stronger ordering semantics. At a guess I suspect increment could be changed > to relaxed, but I've not looked. This one is probably a question for runtime > folks. OK, this makes sense. I'm thinking of the long road to getting this stuff documented so that we can see what side effects of atomic operations are actually required. >> : :: = 1719614 >> >> BitMap::par_set_bit again. >> >> , (MEMFLAGS)5>*)+432>: :: = 1617659 >> >> This one is GenericTaskQueue::pop_global calling cmpxchg_age(). >> Again, do we need conservative here? > > This needs at least sequentially consistent semantics on the success path. Yep. That's easy, it's the full barrier in the failure path that I'd love to eliminate. > See the referenced paper by Le, et al. > > There is also a cmpxchg_age in pop_local_slow. The Le, et al paper doesn't > deal with that path. But it's also not in your list, which is good since > this is supposed to be infrequently taken. Right. I'm trying to concentrate on the low-hanging fruit. Thank you for the very detailed and informative reply. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. 
https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From ayang at openjdk.java.net Thu Feb 11 15:54:41 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Thu, 11 Feb 2021 15:54:41 GMT Subject: RFR: 8261230: GC tracing of page sizes are wrong in a few places In-Reply-To: References: Message-ID: On Tue, 9 Feb 2021 19:42:15 GMT, Stefan Johansson wrote: > The usage of `os::trace_page_sizes()` and friends are wrongly assuming that we always get the page size requested and needs to be updated. This is done by using the helper `ReservedSpace::actual_reserved_page_size()` instead of blindly trusting we get what we ask for. I have plans for the future to get rid of this helper and instead record the page size used in the `ReservedSpace`, but for now the helper is good enough. > > In G1 we used the helper but switched the order of the page size and the alignment parameter, which in turn helped the test to pass since the alignment will match the page size we expect in the test. The test had to be improved to recognize mapping failures. In the test, there's some string matching to detect if large page is properly set up. I think it's best to include an excerpt of the log showing both the success and failure modes in the comments. This way even readers who are not intimately familiar with the gc-logs output could still feel fairly confident that the output parsing part is indeed correct. ------------- Changes requested by ayang (Author). PR: https://git.openjdk.java.net/jdk/pull/2486 From sjohanss at openjdk.java.net Thu Feb 11 16:22:42 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Thu, 11 Feb 2021 16:22:42 GMT Subject: RFR: 8261230: GC tracing of page sizes are wrong in a few places In-Reply-To: References: Message-ID: On Tue, 9 Feb 2021 19:42:15 GMT, Stefan Johansson wrote: > The usage of `os::trace_page_sizes()` and friends are wrongly assuming that we always get the page size requested and needs to be updated. This is done by using the helper `ReservedSpace::actual_reserved_page_size()` instead of blindly trusting we get what we ask for. I have plans for the future to get rid of this helper and instead record the page size used in the `ReservedSpace`, but for now the helper is good enough. > > In G1 we used the helper but switched the order of the page size and the alignment parameter, which in turn helped the test to pass since the alignment will match the page size we expect in the test. The test had to be improved to recognize mapping failures. test/hotspot/jtreg/gc/g1/TestLargePageUseForAuxMemory.java line 59: > 57: static void checkSize(OutputAnalyzer output, long expectedSize, String pattern) { > 58: // First check if there is a large page failure associated with > 59: // the data structure being checked. Are you thinking something like this @albertnetymk? Suggestion: // First check if there is a large page failure associated with // the data structure being checked. In case of a large page // allocation failure the output will include logs like this for // the affected data structure: // [0.048s][debug][gc,heap,coops] Reserve regular memory without large pages // [0.048s][info ][pagesize ] Next Bitmap: ... page_size=2M ... 
------------- PR: https://git.openjdk.java.net/jdk/pull/2486 From ayang at openjdk.java.net Thu Feb 11 16:32:39 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Thu, 11 Feb 2021 16:32:39 GMT Subject: RFR: 8261230: GC tracing of page sizes are wrong in a few places In-Reply-To: References: Message-ID: <4iQZOAzPHRx26UBskJ6u6TY_gGyr0QwcsbqD9KCNiiU=.4bcea2f1-d5a7-4880-81e8-611bb78dba5c@github.com> On Thu, 11 Feb 2021 16:20:07 GMT, Stefan Johansson wrote: >> The usage of `os::trace_page_sizes()` and friends are wrongly assuming that we always get the page size requested and needs to be updated. This is done by using the helper `ReservedSpace::actual_reserved_page_size()` instead of blindly trusting we get what we ask for. I have plans for the future to get rid of this helper and instead record the page size used in the `ReservedSpace`, but for now the helper is good enough. >> >> In G1 we used the helper but switched the order of the page size and the alignment parameter, which in turn helped the test to pass since the alignment will match the page size we expect in the test. The test had to be improved to recognize mapping failures. > > test/hotspot/jtreg/gc/g1/TestLargePageUseForAuxMemory.java line 59: > >> 57: static void checkSize(OutputAnalyzer output, long expectedSize, String pattern) { >> 58: // First check if there is a large page failure associated with >> 59: // the data structure being checked. > > Are you thinking something like this @albertnetymk? > Suggestion: > > // First check if there is a large page failure associated with > // the data structure being checked. In case of a large page > // allocation failure the output will include logs like this for > // the affected data structure: > // [0.048s][debug][gc,heap,coops] Reserve regular memory without large pages > // [0.048s][info ][pagesize ] Next Bitmap: ... page_size=4K ... Yes, and also for `checkLargePagesEnabled`. It's not obvious to me why we parse the output in those two places, one looking for the failure mode, and the other looking for the success mode. That's why I asked for an sample of the "expected" log output. ------------- PR: https://git.openjdk.java.net/jdk/pull/2486 From jaroslav.bachorik at datadoghq.com Thu Feb 11 17:29:32 2021 From: jaroslav.bachorik at datadoghq.com (=?UTF-8?Q?Jaroslav_Bachor=C3=ADk?=) Date: Thu, 11 Feb 2021 18:29:32 +0100 Subject: Can GC implementations provide a cheap estimation of live set size? In-Reply-To: References: Message-ID: Hi Roman, Thanks for your response. I checked ZGC implementation and, indeed, it is very easy to get the liveness information just by extending `ZStatHeap` class to report the last valid value of `_at_mark_end.live`. I am also able to get this info from Shenandoah, although my first attempt still involves a safepointing VM operation since I need to iterate over regions to get the liveness info for each of them and sum it up. I think it is still an acceptable trade-off, though. The next one in the queue is the Serial GC. My assumptions, based on reading the code, are that for young gen 'live = used' at the end of DefNewGeneration::collect() method and for old gen 'live = used - slack' (slack is the cumulative size of objects considered to be alive for the purpose of compaction although they are really dead - see CompactibleSpace::scan_and_forward()). Does this sound reasonable? I will post my findings for Parallel GC and G1 GC later. 
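For illustration, here is a minimal C++ sketch of the per-region summation described above. It assumes Shenandoah's region accessors keep their current names (ShenandoahHeap::heap(), num_regions(), get_region(), ShenandoahHeapRegion::get_live_data_bytes()); those names and include paths are taken from current sources and may differ between releases, and a real implementation would still have to fix the point (e.g. final mark) at which the sum is meaningful:

#include "gc/shenandoah/shenandoahHeap.hpp"
#include "gc/shenandoah/shenandoahHeapRegion.hpp"

// Sketch only: walks all regions and sums their live data, as described above.
static size_t estimated_live_bytes() {
  ShenandoahHeap* heap = ShenandoahHeap::heap();
  size_t live = 0;
  for (size_t i = 0; i < heap->num_regions(); i++) {
    live += heap->get_region(i)->get_live_data_bytes();
  }
  return live;
}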
Cheers, -JB- On Wed, Feb 10, 2021 at 11:34 AM Roman Kennke wrote: > > Hello Jaroslav, > > > In connection with https://bugs.openjdk.java.net/browse/JDK-8258431 I > > am trying to figure out whether providing a cheap estimation of live > > set size is something actually achievable across various GC > > implementations. > > > > What I am looking at is piggy-backing on a concurrent mark task to get > > the summary size of live objects - using the 'straight-forward' > > heap-inspection like approach is prohibitively expensive. > > In Shenandoah, this information is already collected during concurrent > marking. We currently don't print it directly, but we could certainly do > that. I'll look into implementing it. I'll also look into exposing > liveness info via JMX. > > I'm not quite sure about G1: that information would only be collected > during mixed or full collections. I am not sure if G1 prints it, though. > > ZGC prints this under -Xlog:gc+heap: > > [6,502s][info][gc,heap ] GC(0) Mark Start > Mark End Relocate Start Relocate End High > Low > [6,502s][info][gc,heap ] GC(0) Capacity: 834M (10%) > 1076M (13%) 1092M (14%) 1092M (14%) 1092M (14%) > 834M (10%) > [6,502s][info][gc,heap ] GC(0) Free: 7154M (90%) > 6912M (87%) 6916M (87%) 7388M (92%) 7388M (92%) > 6896M (86%) > [6,502s][info][gc,heap ] GC(0) Used: 834M (10%) > 1076M (13%) 1072M (13%) 600M (8%) 1092M (14%) > 600M (8%) > [6,502s][info][gc,heap ] GC(0) Live: - > 195M (2%) 195M (2%) 195M (2%) - > - > [6,502s][info][gc,heap ] GC(0) Allocated: - > 242M (3%) 270M (3%) 380M (5%) - > - > [6,502s][info][gc,heap ] GC(0) Garbage: - > 638M (8%) 606M (8%) 24M (0%) - > - > [6,502s][info][gc,heap ] GC(0) Reclaimed: - > - 32M (0%) 614M (8%) - > - > > I hope that is useful? > > Thanks, > Roman > From stuefe at openjdk.java.net Thu Feb 11 17:34:38 2021 From: stuefe at openjdk.java.net (Thomas Stuefe) Date: Thu, 11 Feb 2021 17:34:38 GMT Subject: RFR: 8261230: GC tracing of page sizes are wrong in a few places In-Reply-To: References: Message-ID: On Tue, 9 Feb 2021 19:42:15 GMT, Stefan Johansson wrote: > The usage of `os::trace_page_sizes()` and friends are wrongly assuming that we always get the page size requested and needs to be updated. This is done by using the helper `ReservedSpace::actual_reserved_page_size()` instead of blindly trusting we get what we ask for. I have plans for the future to get rid of this helper and instead record the page size used in the `ReservedSpace`, but for now the helper is good enough. > > In G1 we used the helper but switched the order of the page size and the alignment parameter, which in turn helped the test to pass since the alignment will match the page size we expect in the test. The test had to be improved to recognize mapping failures. Looks good. `os::trace_page_sizes_for_requested_size` is not easy to understand, especially with the alignment vs preferred_page_size semantic. Not sure what alignment has to do with preferred page size. We could prevent these kind of errors and make the code more readable by introducing a page size enum. We only have a handful of valid values anyway. ------------- Marked as reviewed by stuefe (Reviewer). 
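As a purely hypothetical illustration of the page-size enum idea suggested above (no such type exists in HotSpot), a distinct type would keep page-size and alignment arguments from being swapped silently at call sites; the names and the 4K/2M/1G values below are assumptions, not existing code:

#include <cstddef>

// Hypothetical sketch of the suggestion above, not existing HotSpot code.
enum class PageSize : size_t {
  Base    = 4 * 1024,           // regular 4K pages (common Linux default)
  Large2M = 2 * 1024 * 1024,    // 2M large pages
  Large1G = 1024 * 1024 * 1024  // 1G large pages
};

inline size_t page_size_in_bytes(PageSize p) { return static_cast<size_t>(p); }

// A trace helper that took a PageSize here could no longer be handed the
// alignment argument by accident.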
PR: https://git.openjdk.java.net/jdk/pull/2486 From sjohanss at openjdk.java.net Thu Feb 11 17:34:40 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Thu, 11 Feb 2021 17:34:40 GMT Subject: RFR: 8261230: GC tracing of page sizes are wrong in a few places In-Reply-To: <4iQZOAzPHRx26UBskJ6u6TY_gGyr0QwcsbqD9KCNiiU=.4bcea2f1-d5a7-4880-81e8-611bb78dba5c@github.com> References: <4iQZOAzPHRx26UBskJ6u6TY_gGyr0QwcsbqD9KCNiiU=.4bcea2f1-d5a7-4880-81e8-611bb78dba5c@github.com> Message-ID: On Thu, 11 Feb 2021 16:29:53 GMT, Albert Mingkun Yang wrote: >> test/hotspot/jtreg/gc/g1/TestLargePageUseForAuxMemory.java line 59: >> >>> 57: static void checkSize(OutputAnalyzer output, long expectedSize, String pattern) { >>> 58: // First check if there is a large page failure associated with >>> 59: // the data structure being checked. >> >> Are you thinking something like this @albertnetymk? >> Suggestion: >> >> // First check if there is a large page failure associated with >> // the data structure being checked. In case of a large page >> // allocation failure the output will include logs like this for >> // the affected data structure: >> // [0.048s][debug][gc,heap,coops] Reserve regular memory without large pages >> // [0.048s][info ][pagesize ] Next Bitmap: ... page_size=4K ... > > Yes, and also for `checkLargePagesEnabled`. It's not obvious to me why we parse the output in those two places, one looking for the failure mode, and the other looking for the success mode. That's why I asked for an sample of the "expected" log output. I think the check if large pages are enable is pretty straight forward. We should never expect large page sizes in the output unless large pages are enabled. I do however agree that this check is a bit clunky. Would it help to extract it to a separate function? Something like `largePagesAllocationFailure(pattern)`, I could also change the name of the function above to just be `largePagesEnabled()` then the code reads even better? ------------- PR: https://git.openjdk.java.net/jdk/pull/2486 From rkennke at redhat.com Thu Feb 11 17:55:24 2021 From: rkennke at redhat.com (Roman Kennke) Date: Thu, 11 Feb 2021 18:55:24 +0100 Subject: Can GC implementations provide a cheap estimation of live set size? In-Reply-To: References: Message-ID: Notice that liveness information is only somewhat reliable right after marking. In Shenandoah, this is in the final-mark pause, and then the program is at a safepoint already. This is where you'd want to emit a JMX event or something similar. You can't simply query a counter and assume it represents current liveness in the middle or outside of GC cycle. This should be true for all GCs. For Serial and Parallel I am not sure at all that you can do this. AFAIK, they don't count liveness at all. Roman > Hi Roman, > > Thanks for your response. I checked ZGC implementation and, indeed, it > is very easy to get the liveness information just by extending > `ZStatHeap` class to report the last valid value of > `_at_mark_end.live`. > > I am also able to get this info from Shenandoah, although my first > attempt still involves a safepointing VM operation since I need to > iterate over regions to get the liveness info for each of them and sum > it up. I think it is still an acceptable trade-off, though. > > The next one in the queue is the Serial GC. 
My assumptions, based on > reading the code, are that for young gen 'live = used' at the end of > DefNewGeneration::collect() method and for old gen 'live = used - > slack' (slack is the cumulative size of objects considered to be alive > for the purpose of compaction although they are really dead - see > CompactibleSpace::scan_and_forward()). Does this sound reasonable? > > I will post my findings for Parallel GC and G1 GC later. > > Cheers, > > -JB- > > On Wed, Feb 10, 2021 at 11:34 AM Roman Kennke wrote: >> >> Hello Jaroslav, >> >>> In connection with https://bugs.openjdk.java.net/browse/JDK-8258431 I >>> am trying to figure out whether providing a cheap estimation of live >>> set size is something actually achievable across various GC >>> implementations. >>> >>> What I am looking at is piggy-backing on a concurrent mark task to get >>> the summary size of live objects - using the 'straight-forward' >>> heap-inspection like approach is prohibitively expensive. >> >> In Shenandoah, this information is already collected during concurrent >> marking. We currently don't print it directly, but we could certainly do >> that. I'll look into implementing it. I'll also look into exposing >> liveness info via JMX. >> >> I'm not quite sure about G1: that information would only be collected >> during mixed or full collections. I am not sure if G1 prints it, though. >> >> ZGC prints this under -Xlog:gc+heap: >> >> [6,502s][info][gc,heap ] GC(0) Mark Start >> Mark End Relocate Start Relocate End High >> Low >> [6,502s][info][gc,heap ] GC(0) Capacity: 834M (10%) >> 1076M (13%) 1092M (14%) 1092M (14%) 1092M (14%) >> 834M (10%) >> [6,502s][info][gc,heap ] GC(0) Free: 7154M (90%) >> 6912M (87%) 6916M (87%) 7388M (92%) 7388M (92%) >> 6896M (86%) >> [6,502s][info][gc,heap ] GC(0) Used: 834M (10%) >> 1076M (13%) 1072M (13%) 600M (8%) 1092M (14%) >> 600M (8%) >> [6,502s][info][gc,heap ] GC(0) Live: - >> 195M (2%) 195M (2%) 195M (2%) - >> - >> [6,502s][info][gc,heap ] GC(0) Allocated: - >> 242M (3%) 270M (3%) 380M (5%) - >> - >> [6,502s][info][gc,heap ] GC(0) Garbage: - >> 638M (8%) 606M (8%) 24M (0%) - >> - >> [6,502s][info][gc,heap ] GC(0) Reclaimed: - >> - 32M (0%) 614M (8%) - >> - >> >> I hope that is useful? >> >> Thanks, >> Roman >> > From jaroslav.bachorik at datadoghq.com Thu Feb 11 18:09:38 2021 From: jaroslav.bachorik at datadoghq.com (=?UTF-8?Q?Jaroslav_Bachor=C3=ADk?=) Date: Thu, 11 Feb 2021 19:09:38 +0100 Subject: Can GC implementations provide a cheap estimation of live set size? In-Reply-To: References: Message-ID: On Thu, Feb 11, 2021 at 6:55 PM Roman Kennke wrote: > > Notice that liveness information is only somewhat reliable right after > marking. In Shenandoah, this is in the final-mark pause, and then the Yes, I understand this. What I am looking at is to have something like 'last known liveness' value - captured at a well defined point and providing an estimate within the bounds of GC implementation. > program is at a safepoint already. This is where you'd want to emit a > JMX event or something similar. You can't simply query a counter and > assume it represents current liveness in the middle or outside of GC > cycle. This should be true for all GCs. > > For Serial and Parallel I am not sure at all that you can do this. > AFAIK, they don't count liveness at all. > > Roman > > > Hi Roman, > > > > Thanks for your response. 
I checked ZGC implementation and, indeed, it > > is very easy to get the liveness information just by extending > > `ZStatHeap` class to report the last valid value of > > `_at_mark_end.live`. > > > > I am also able to get this info from Shenandoah, although my first > > attempt still involves a safepointing VM operation since I need to > > iterate over regions to get the liveness info for each of them and sum > > it up. I think it is still an acceptable trade-off, though. > > > > The next one in the queue is the Serial GC. My assumptions, based on > > reading the code, are that for young gen 'live = used' at the end of > > DefNewGeneration::collect() method and for old gen 'live = used - > > slack' (slack is the cumulative size of objects considered to be alive > > for the purpose of compaction although they are really dead - see > > CompactibleSpace::scan_and_forward()). Does this sound reasonable? > > > > I will post my findings for Parallel GC and G1 GC later. > > > > Cheers, > > > > -JB- > > > > On Wed, Feb 10, 2021 at 11:34 AM Roman Kennke wrote: > >> > >> Hello Jaroslav, > >> > >>> In connection with https://bugs.openjdk.java.net/browse/JDK-8258431 I > >>> am trying to figure out whether providing a cheap estimation of live > >>> set size is something actually achievable across various GC > >>> implementations. > >>> > >>> What I am looking at is piggy-backing on a concurrent mark task to get > >>> the summary size of live objects - using the 'straight-forward' > >>> heap-inspection like approach is prohibitively expensive. > >> > >> In Shenandoah, this information is already collected during concurrent > >> marking. We currently don't print it directly, but we could certainly do > >> that. I'll look into implementing it. I'll also look into exposing > >> liveness info via JMX. > >> > >> I'm not quite sure about G1: that information would only be collected > >> during mixed or full collections. I am not sure if G1 prints it, though. > >> > >> ZGC prints this under -Xlog:gc+heap: > >> > >> [6,502s][info][gc,heap ] GC(0) Mark Start > >> Mark End Relocate Start Relocate End High > >> Low > >> [6,502s][info][gc,heap ] GC(0) Capacity: 834M (10%) > >> 1076M (13%) 1092M (14%) 1092M (14%) 1092M (14%) > >> 834M (10%) > >> [6,502s][info][gc,heap ] GC(0) Free: 7154M (90%) > >> 6912M (87%) 6916M (87%) 7388M (92%) 7388M (92%) > >> 6896M (86%) > >> [6,502s][info][gc,heap ] GC(0) Used: 834M (10%) > >> 1076M (13%) 1072M (13%) 600M (8%) 1092M (14%) > >> 600M (8%) > >> [6,502s][info][gc,heap ] GC(0) Live: - > >> 195M (2%) 195M (2%) 195M (2%) - > >> - > >> [6,502s][info][gc,heap ] GC(0) Allocated: - > >> 242M (3%) 270M (3%) 380M (5%) - > >> - > >> [6,502s][info][gc,heap ] GC(0) Garbage: - > >> 638M (8%) 606M (8%) 24M (0%) - > >> - > >> [6,502s][info][gc,heap ] GC(0) Reclaimed: - > >> - 32M (0%) 614M (8%) - > >> - > >> > >> I hope that is useful? > >> > >> Thanks, > >> Roman > >> > > > From zgu at openjdk.java.net Thu Feb 11 18:10:41 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Thu, 11 Feb 2021 18:10:41 GMT Subject: RFR: 8261493: Shenandoah: reconsider bitmap access memory ordering In-Reply-To: References: Message-ID: On Wed, 10 Feb 2021 09:32:18 GMT, Aleksey Shipilev wrote: > Shenandoah currently uses its own marking bitmap (added by JDK-8254315). It accesses the marking bitmap with "acquire" for reads and "conservative" for updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. 
> > I think both are actually excessive for marking bitmap accesses: we do not piggyback object updates on it, the atomics there are only to guarantee the access atomicity and CAS updates to bits. It seems "relaxed" is enough for marking bitmap accesses. > > Sample run with "compact" (frequent GC cycles) on SPECjvm2008:compiler.sunflow on AArch64: > > # Baseline > # Baseline > [146.028s][info][gc,stats] Concurrent Marking = 50.315 s (a = 258024 us) (n = 195) (lvls, us = 31836, 230469, 273438, 306641, 464255) > [141.458s][info][gc,stats] Concurrent Marking = 47.819 s (a = 242737 us) (n = 197) (lvls, us = 42773, 197266, 267578, 287109, 433948) > [144.108s][info][gc,stats] Concurrent Marking = 49.806 s (a = 250283 us) (n = 199) (lvls, us = 32227, 201172, 267578, 296875, 448549) > > # Patched > [144.238s][info][gc,stats] Concurrent Marking = 46.627 s (a = 220981 us) (n = 211) (lvls, us = 24414, 197266, 238281, 259766, 345112) > [138.406s][info][gc,stats] Concurrent Marking = 45.022 s (a = 227383 us) (n = 198) (lvls, us = 20508, 205078, 244141, 271484, 427658) > [140.950s][info][gc,stats] Concurrent Marking = 45.073 s (a = 222036 us) (n = 203) (lvls, us = 21680, 181641, 240234, 265625, 375750) > > Average time goes down, total marking time goes down. > > Additional testing: > - [x] Linux x86_64 `hotspot_gc_shenandoah` > - [x] Linux AArch64 `hotspot_gc_shenandoah` > - [x] Linux AArch64 `tier1` with Shenandoah Now, you need load barrier on read side (e.g. ShenandoahMarkBitMap::at()). Although, it is not a correctness issue, but seeing stale value means extra unnecessary work. ------------- Changes requested by zgu (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2497 From sjohanss at openjdk.java.net Thu Feb 11 18:10:57 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Thu, 11 Feb 2021 18:10:57 GMT Subject: RFR: 8261230: GC tracing of page sizes are wrong in a few places [v2] In-Reply-To: References: Message-ID: > The usage of `os::trace_page_sizes()` and friends are wrongly assuming that we always get the page size requested and needs to be updated. This is done by using the helper `ReservedSpace::actual_reserved_page_size()` instead of blindly trusting we get what we ask for. I have plans for the future to get rid of this helper and instead record the page size used in the `ReservedSpace`, but for now the helper is good enough. > > In G1 we used the helper but switched the order of the page size and the alignment parameter, which in turn helped the test to pass since the alignment will match the page size we expect in the test. The test had to be improved to recognize mapping failures. Stefan Johansson has updated the pull request incrementally with one additional commit since the last revision: Albert review Renamed helper to improve how the code read. Also extracted the failure check into a separate function. 
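Returning to the marking-bitmap ordering question from the 8261493 review above, here is a standard-C++ sketch (not the actual BitMap/ShenandoahMarkBitMap code) of why a relaxed CAS is enough when nothing piggybacks on the bit for ordering -- the loop only needs atomicity of the bit update itself:

#include <atomic>
#include <cstdint>

// Illustration only: set one bit in a word with relaxed ordering.
bool par_set_bit_relaxed(std::atomic<uintptr_t>& word, unsigned bit) {
  const uintptr_t mask = uintptr_t(1) << bit;
  uintptr_t old = word.load(std::memory_order_relaxed);
  while ((old & mask) == 0) {
    if (word.compare_exchange_weak(old, old | mask,
                                   std::memory_order_relaxed,
                                   std::memory_order_relaxed)) {
      return true;   // this thread set the bit
    }
    // compare_exchange_weak reloaded 'old' on failure; re-check it.
  }
  return false;      // bit was already set by another thread
}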
------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2486/files - new: https://git.openjdk.java.net/jdk/pull/2486/files/76faa2e4..b46f6a75 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2486&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2486&range=00-01 Stats: 22 lines in 1 file changed: 15 ins; 1 del; 6 mod Patch: https://git.openjdk.java.net/jdk/pull/2486.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2486/head:pull/2486 PR: https://git.openjdk.java.net/jdk/pull/2486 From martin.doerr at sap.com Thu Feb 11 18:29:42 2021 From: martin.doerr at sap.com (Doerr, Martin) Date: Thu, 11 Feb 2021 18:29:42 +0000 Subject: Atomic operations: your thoughts are welocme In-Reply-To: <49d0408a-13f9-ddc8-06e3-e0eb27a708dd@redhat.com> References: <448C638F-D688-4913-875C-5D8BA9235126@oracle.com> <49d0408a-13f9-ddc8-06e3-e0eb27a708dd@redhat.com> Message-ID: Hi, I appreciate this investigation. PPC64 has optimized versions for _relaxed, _acquire, _release and _acq_rel which are substantially faster than the other memory order modes. So we should be able to observe performance improvements when any of these ones are used in hot code. Best regards, Martin > -----Original Message----- > From: hotspot-dev On Behalf Of > Andrew Haley > Sent: Donnerstag, 11. Februar 2021 14:34 > To: Kim Barrett > Cc: hotspot-gc-dev openjdk.java.net ; > hotspot-dev at openjdk.java.net > Subject: Re: Atomic operations: your thoughts are welocme > > On 11/02/2021 03:59, Kim Barrett wrote: > >> On Feb 8, 2021, at 1:14 PM, Andrew Haley wrote: > >> > >> I've been looking at the hottest Atomic operations in HotSpot, with a view > to > >> finding out if the default memory_order_conservative (which is very > expensive > >> on some architectures) can be weakened to something less. It's > impossible to > >> fix all of them, but perhaps we can fix some of the most frequent. > > > > Is there any information about the possible performance improvement > from > > such changes? 1.5-3M occurrences doesn't mean much without context. > > > > We don't presently have support for sequentially consistent semantics, > only > > "conservative". My recollection is that this is in part because there might > > be code that is assuming the possibly stronger "conservative" semantics, > and > > in part because there are different and incompatible approaches to > > implementing sequentially consistent semantics on some hardware > platforms > > and we didn't want to make assumptions there. > > > > We also don't presently have any cmpxchg implementation that really > supports > > anything between conservative and relaxed, nor do we support different > order > > constraints for the success vs failure cases. Things can be complicated > > enough as is; while we *could* fill some of that in, I'm not sure we should. > > OK. However, even though we don't implement any of them, we do have an > API that includes acq, rel, and seq_cst. The fact that we don't have > anything behind them is, I thought, To Be Done rather than Won't Do. > > >> > ::Table::oop_oop_iterate anceKlass, narrowOop>(G1CMOopClosure*, oopDesc*, Klass*)+336>: :: = > 3903178 > >> > >> This is actually MarkBitMap::par_mark calling BitMap::par_set_bit. Does > this > >> need to be memory_order_conservative, or would something weaker > do? Even > >> acq_rel or seq_cst would be better. > > > > I think for setting bits in a bitmap the thing to do would be to identify > > places that are safe and useful (impacts performance) to do so first. 
Then > > add a weaker variant for use in those places, assuming any are found. > > I see. I'm assuming that frequency of use is a useful proxy for impact. > Aleksey has already, very helpfully, measured how significant these are > for Shenandoah, and I suspect all concurrent GCs would benefit in a > similar fashion. > > >> : :: = 2376632 > >> : :: = 2003895 > >> > >> I can't imagine that either of these actually need > memory_order_conservative, > >> they're just reference counts. > > > > The "usual" refcount implementation involves relaxed increment and > stronger > > ordering for decrement. (If I'm remembering correctly, dec-acquire and a > > release fence on the zero value path before deleting. But I've not thought > > about what one might want for this CAS-based variant that handles > boundary > > cases specially.) And as you say, whether any of these could be weakened > > depends on whether there is any code surrounding a use that depends on > the > > stronger ordering semantics. At a guess I suspect increment could be > changed > > to relaxed, but I've not looked. This one is probably a question for runtime > > folks. > > OK, this makes sense. I'm thinking of the long road to getting this stuff > documented so that we can see what side effects of atomic operations are > actually required. > > >> : :: = > 1719614 > >> > >> BitMap::par_set_bit again. > >> > >> > erflowTaskQueue, > (MEMFLAGS)5>*)+432>: :: = 1617659 > >> > >> This one is GenericTaskQueue::pop_global calling cmpxchg_age(). > >> Again, do we need conservative here? > > > > This needs at least sequentially consistent semantics on the success path. > > Yep. That's easy, it's the full barrier in the failure path that > I'd love to eliminate. > > > See the referenced paper by Le, et al. > > > > There is also a cmpxchg_age in pop_local_slow. The Le, et al paper doesn't > > deal with that path. But it's also not in your list, which is good since > > this is supposed to be infrequently taken. > > Right. I'm trying to concentrate on the low-hanging fruit. > > Thank you for the very detailed and informative reply. > > -- > Andrew Haley (he/him) > Java Platform Lead Engineer > Red Hat UK Ltd. > https://keybase.io/andrewhaley > EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From sjohanss at openjdk.java.net Thu Feb 11 18:30:38 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Thu, 11 Feb 2021 18:30:38 GMT Subject: RFR: 8261230: GC tracing of page sizes are wrong in a few places [v2] In-Reply-To: References: Message-ID: On Thu, 11 Feb 2021 17:31:57 GMT, Thomas Stuefe wrote: > Looks good. `os::trace_page_sizes_for_requested_size` is not easy to understand, especially with the alignment vs preferred_page_size semantic. Not sure what alignment has to do with preferred page size. Thanks. I agree that it is not that straight forward but I think the intention here is to pass in both page size and alignment to make it easier to understand why the actual size might be large than the requested size. For example in cases like this: [debug][gc,heap,coops] Reserve regular memory without large pages [info ][pagesize ] Next Bitmap: req_size=32000K base=0x00007fa4ef400000 page_size=4K alignment=2M size=32M > We could prevent these kind of errors and make the code more readable by introducing a page size enum. We only have a handful of valid values anyway. 
Yes, there is certainly room for improvement in this area =) ------------- PR: https://git.openjdk.java.net/jdk/pull/2486 From sjohanss at openjdk.java.net Thu Feb 11 19:42:40 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Thu, 11 Feb 2021 19:42:40 GMT Subject: RFR: 8261505: Test test/hotspot/jtreg/gc/parallel/TestDynShrinkHeap.java killed by Linux OOM Killer In-Reply-To: <3qpvJUngDwJhM4n1g-8LcHrKmxzS45welhYUokK9u9o=.1b0db34b-c388-4bea-9f59-2e82dc4e36ce@github.com> References: <3qpvJUngDwJhM4n1g-8LcHrKmxzS45welhYUokK9u9o=.1b0db34b-c388-4bea-9f59-2e82dc4e36ce@github.com> Message-ID: On Wed, 10 Feb 2021 12:13:54 GMT, Christoph G?ttschkes wrote: > On memory constrained devices, the test might get killed by the linux kernel OOM Killer. > > Executing the test with the JTreg test harness makes the test fail and get killed by the OOM Killer. > Executing the test manually, by using the JTreg provided "rerun" command line, the test succeeds. > This happened on a Raspberry PI 2, which has only 1G of memory available. > > I added an "os.maxMemory" requirement, so the test gets skipped. Marked as reviewed by sjohanss (Reviewer). Hi Christoph, This looks good. The test is setting a 1GB max heap so it seems reasonable to require the system to have at least that. Another thing to look at when tests are getting killed by the OOM killer is the number of concurrent test jobs. For a system with 1GB of memory that should be 1, so in your case you can't go lower. To be certain you only run one test at a time you could run `configure` with `--with-test-jobs=1 `, but according to `doc/testing.md` this should be the default for your system: The test concurrency (`-concurrency`). Defaults to TEST_JOBS (if set by `--with-test-jobs=`), otherwise it defaults to JOBS, except for Hotspot, where the default is *number of CPU cores/2*, but never more than *memory size in GB/2*. The reason that the rerun succeeds is most likely because then you don't have the JTREG process running along side the test and consuming resources. ------------- PR: https://git.openjdk.java.net/jdk/pull/2507 From cjplummer at openjdk.java.net Thu Feb 11 23:52:58 2021 From: cjplummer at openjdk.java.net (Chris Plummer) Date: Thu, 11 Feb 2021 23:52:58 GMT Subject: RFR: 8247514: Improve clhsdb 'findpc' ability to determine what an address points to by improving PointerFinder and PointerLocation classes [v3] In-Reply-To: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> References: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> Message-ID: > See the bug for most details. A few notes here about some implementation details: > > In the `PointerLocation` class, I added more consistency w.r.t. whether or not a newline is printed. It used to for some address types, but not others. Now it always does. And if you see a comment something like the following: > > ` getTLAB().printOn(tty); // includes "\n" ` > > That's just clarifying whether or not the `printOn()` method called will include the newline. Some do and some don't, and knowing what the various `printOn()` methods do makes getting the proper inclusion of the newline easier to understand. > > I added `verbose` and `printAddress` boolean arguments to `PointerLocation.printOn()`. Currently they are always `true`. 
The false arguments will be used when I complete [JDK-8250801](https://bugs.openjdk.java.net/browse/JDK-8250801), which will use `PointerFinder/Location` to show what each register points to. > > The CR mentions that the main motivation for this work is for eventual replacement of the old clhsdb `whatis` command, which was implemented in javascript. It used to resolve DSO symbols, whereas `findpc` did not. The `whatis` code did this with the following: > > var dso = loadObjectContainingPC(addr); > if (dso == null) { > return ptrLoc.toString(); > } > var sym = dso.closestSymbolToPC(addr); > if (sym != null) { > return sym.name + '+' + sym.offset; > } > And now you'll see something similar in the PointerFinder code: > > loc.loadObject = cdbg.loadObjectContainingPC(a); > if (loc.loadObject != null) { > loc.nativeSymbol = loc.loadObject.closestSymbolToPC(a); > return loc; > } > Note that now that `findpc` does everything that `whatis` used to (and more), we don't really need to add a java version of `whatis`, but I'll probably do so anyway just help out people who are used to using the `whatis` command. That will be done using [JDK-8244670](https://bugs.openjdk.java.net/browse/JDK-8244670) Chris Plummer has updated the pull request incrementally with one additional commit since the last revision: Fix issue with parsing 'examine' output when there is unexecptected output due to CDS logging or -Xcheck:jni warnings. ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2111/files - new: https://git.openjdk.java.net/jdk/pull/2111/files/69a8ae59..79cb1080 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2111&range=02 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2111&range=01-02 Stats: 5 lines in 1 file changed: 2 ins; 1 del; 2 mod Patch: https://git.openjdk.java.net/jdk/pull/2111.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2111/head:pull/2111 PR: https://git.openjdk.java.net/jdk/pull/2111 From cjplummer at openjdk.java.net Fri Feb 12 00:05:39 2021 From: cjplummer at openjdk.java.net (Chris Plummer) Date: Fri, 12 Feb 2021 00:05:39 GMT Subject: RFR: 8247514: Improve clhsdb 'findpc' ability to determine what an address points to by improving PointerFinder and PointerLocation classes [v3] In-Reply-To: <-09XRqbxFbZGkzqDVewiXrJjVNjuLMdZqfxjnxJf3Oc=.2da660b7-a5c1-40e1-81af-8dc814e199ca@github.com> References: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> <-09XRqbxFbZGkzqDVewiXrJjVNjuLMdZqfxjnxJf3Oc=.2da660b7-a5c1-40e1-81af-8dc814e199ca@github.com> Message-ID: On Tue, 2 Feb 2021 23:21:50 GMT, Yasumasa Suenaga wrote: >> Chris Plummer has updated the pull request incrementally with one additional commit since the last revision: >> >> Fix issue with parsing 'examine' output when there is unexecptected output due to CDS logging or -Xcheck:jni warnings. > > LGTM @YaSuenag and @kevinjwalls I had to make a minor fix to the test. Can you please review it. The issued turned up when I ran some higher test tiers, one of which enabled CDS with some tracing and the other enabled `-Xcheck:jni`, which produced output due to [JDK-8261607](https://bugs.openjdk.java.net/browse/JDK-8261607). Both caused extra output that resulted in improperly parsing the `examine` output and not actually finding that address that it produced. This was because there were lines of output before even issuing the `examine` command that matched the pattern being looked for. I made the pattern more specific by including the tid of the thread. 
I also cleaned up the comments around that code a bit. thanks. ------------- PR: https://git.openjdk.java.net/jdk/pull/2111 From kbarrett at openjdk.java.net Fri Feb 12 03:50:41 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Fri, 12 Feb 2021 03:50:41 GMT Subject: RFR: 8260044: Parallel GC: Concurrent allocation after heap expansion may cause unnecessary full gc [v2] In-Reply-To: References: Message-ID: On Sun, 31 Jan 2021 17:10:55 GMT, Thomas Schatzl wrote: >> Kim Barrett has updated the pull request incrementally with one additional commit since the last revision: >> >> require non-zero expand size > > Lgtm. Thanks. Thanks @tschatzl , @kstefanj , and @walulyai for reviews. ------------- PR: https://git.openjdk.java.net/jdk/pull/2309 From shade at openjdk.java.net Fri Feb 12 07:34:38 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Fri, 12 Feb 2021 07:34:38 GMT Subject: RFR: 8261493: Shenandoah: reconsider bitmap access memory ordering In-Reply-To: References: Message-ID: On Thu, 11 Feb 2021 18:07:58 GMT, Zhengyu Gu wrote: > Now, you need load barrier on read side (e.g. ShenandoahMarkBitMap::at()). Although, it is not a correctness issue, but seeing stale value means extra unnecessary work. I don't see why. Adding load barriers would not affect promptness of seeing the memory updates to the bitmap itself. It might affect the promptness of seeing the object contents that we are reading after asking `is_marked` -- but that would be a race either way, because we do not use mark bitmap for memory ordering at all (i.e. there is no "release" on bitmap update). ------------- PR: https://git.openjdk.java.net/jdk/pull/2497 From cgo at openjdk.java.net Fri Feb 12 07:41:38 2021 From: cgo at openjdk.java.net (Christoph =?UTF-8?B?R8O2dHRzY2hrZXM=?=) Date: Fri, 12 Feb 2021 07:41:38 GMT Subject: RFR: 8261505: Test test/hotspot/jtreg/gc/parallel/TestDynShrinkHeap.java killed by Linux OOM Killer In-Reply-To: References: <3qpvJUngDwJhM4n1g-8LcHrKmxzS45welhYUokK9u9o=.1b0db34b-c388-4bea-9f59-2e82dc4e36ce@github.com> Message-ID: On Thu, 11 Feb 2021 19:39:23 GMT, Stefan Johansson wrote: >> On memory constrained devices, the test might get killed by the linux kernel OOM Killer. >> >> Executing the test with the JTreg test harness makes the test fail and get killed by the OOM Killer. >> Executing the test manually, by using the JTreg provided "rerun" command line, the test succeeds. >> This happened on a Raspberry PI 2, which has only 1G of memory available. >> >> I added an "os.maxMemory" requirement, so the test gets skipped. > > Marked as reviewed by sjohanss (Reviewer). Hi Stefan, thanks for the review. I am aware of the concurrency feature of the JTreg runner and am always using a concurrency of 1 on embedded devices. Even if they are more powerful, since it makes the test execution less reliable. I found some more tests with the same problem, but will file a single bug and fix all in one go, as soon as I have time for that. ------------- PR: https://git.openjdk.java.net/jdk/pull/2507 From kbarrett at openjdk.java.net Fri Feb 12 08:24:59 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Fri, 12 Feb 2021 08:24:59 GMT Subject: RFR: 8260044: Parallel GC: Concurrent allocation after heap expansion may cause unnecessary full gc [v4] In-Reply-To: References: Message-ID: > Please review this change to ParallelGC to avoid unnecessary full GCs when > concurrent threads attempt oldgen allocations during evacuation. 
> > When a GC thread fails an oldgen allocation it expands the heap and retries > the allocation. If the second allocation attempt fails then allocation > failure is reported to the caller, which can lead to a full GC. But the > retried allocation could fail because, after expansion, some other thread > allocated enough of the available space that the retry fails. This can > happen even though there is plenty of space available, if only that retry > were to perform another expansion. > > Rather than trying to combine the allocation retry with the expansion (it's > not clear there's a way to do so without breaking invariants), we instead > simply loop on the allocation attempt + expand, until either the allocation > succeeds or the expand fails. If some other thread "steals" space from the > expanding thread and causes its next allocation attempt to fail and do > another expansion, that's functionally no different from the expanding > thread succeeding and causing the other thread to fail allocation and do the > expand instead. > > This change includes modifying PSOldGen::expand_to_reserved to return false > when there is no space available, where it previously returned true. It's > not clear why it returned true; that seems wrong, but was harmless. But it > must not do so with the new looping behavior for allocation, else it would > never terminate. > > Testing: > mach5 tier1-3, tier5 (tier2-3, 5 do a lot of ParallelGC testing) Kim Barrett has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains six additional commits since the last revision: - Merge branch 'master' into retry_alloc - Merge branch 'master' into retry_alloc - avoid expand storms - Merge branch 'master' into retry_alloc - require non-zero expand size - retry failed allocation if expand succeeds ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2309/files - new: https://git.openjdk.java.net/jdk/pull/2309/files/72431d39..d463925f Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2309&range=03 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2309&range=02-03 Stats: 8695 lines in 369 files changed: 4618 ins; 2020 del; 2057 mod Patch: https://git.openjdk.java.net/jdk/pull/2309.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2309/head:pull/2309 PR: https://git.openjdk.java.net/jdk/pull/2309 From kbarrett at openjdk.java.net Fri Feb 12 08:25:00 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Fri, 12 Feb 2021 08:25:00 GMT Subject: Integrated: 8260044: Parallel GC: Concurrent allocation after heap expansion may cause unnecessary full gc In-Reply-To: References: Message-ID: <6tGBh9-yA3lFIiEsjOhL7Me8t0vlXB9unieHrubSIiU=.bc444e47-0595-4f8b-ae3e-d3c335ddc4bf@github.com> On Fri, 29 Jan 2021 08:24:13 GMT, Kim Barrett wrote: > Please review this change to ParallelGC to avoid unnecessary full GCs when > concurrent threads attempt oldgen allocations during evacuation. > > When a GC thread fails an oldgen allocation it expands the heap and retries > the allocation. If the second allocation attempt fails then allocation > failure is reported to the caller, which can lead to a full GC. But the > retried allocation could fail because, after expansion, some other thread > allocated enough of the available space that the retry fails. This can > happen even though there is plenty of space available, if only that retry > were to perform another expansion. 
> > Rather than trying to combine the allocation retry with the expansion (it's > not clear there's a way to do so without breaking invariants), we instead > simply loop on the allocation attempt + expand, until either the allocation > succeeds or the expand fails. If some other thread "steals" space from the > expanding thread and causes its next allocation attempt to fail and do > another expansion, that's functionally no different from the expanding > thread succeeding and causing the other thread to fail allocation and do the > expand instead. > > This change includes modifying PSOldGen::expand_to_reserved to return false > when there is no space available, where it previously returned true. It's > not clear why it returned true; that seems wrong, but was harmless. But it > must not do so with the new looping behavior for allocation, else it would > never terminate. > > Testing: > mach5 tier1-3, tier5 (tier2-3, 5 do a lot of ParallelGC testing) This pull request has now been integrated. Changeset: 6a84ec68 Author: Kim Barrett URL: https://git.openjdk.java.net/jdk/commit/6a84ec68 Stats: 57 lines in 4 files changed: 33 ins; 6 del; 18 mod 8260044: Parallel GC: Concurrent allocation after heap expansion may cause unnecessary full gc 8260045: Parallel GC: Waiting on ExpandHeap_lock may cause "expansion storm" Loop to retry allocation if expand succeeds. Treat space available after obtaining expand lock as expand success. Reviewed-by: tschatzl, iwalulya, sjohanss ------------- PR: https://git.openjdk.java.net/jdk/pull/2309 From sjohanss at openjdk.java.net Fri Feb 12 08:51:40 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Fri, 12 Feb 2021 08:51:40 GMT Subject: RFR: 8261505: Test test/hotspot/jtreg/gc/parallel/TestDynShrinkHeap.java killed by Linux OOM Killer In-Reply-To: References: <3qpvJUngDwJhM4n1g-8LcHrKmxzS45welhYUokK9u9o=.1b0db34b-c388-4bea-9f59-2e82dc4e36ce@github.com> Message-ID: On Fri, 12 Feb 2021 07:38:38 GMT, Christoph G?ttschkes wrote: >> Marked as reviewed by sjohanss (Reviewer). > > Hi Stefan, > > thanks for the review. I am aware of the concurrency feature of the JTreg runner and am always using a concurrency of 1 on embedded devices. Even if they are more powerful, since it makes the test execution less reliable. > > I found some more tests with the same problem, but will file a single bug and fix all in one go, as soon as I have time for that. I'll wait for a second reviewer before sponsoring this, just in case anyone has a different view on how to handle this. ------------- PR: https://git.openjdk.java.net/jdk/pull/2507 From tschatzl at openjdk.java.net Fri Feb 12 09:37:40 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Fri, 12 Feb 2021 09:37:40 GMT Subject: RFR: 8260044: Parallel GC: Concurrent allocation after heap expansion may cause unnecessary full gc [v2] In-Reply-To: References: Message-ID: On Fri, 12 Feb 2021 03:47:36 GMT, Kim Barrett wrote: >> Lgtm. Thanks. > > Thanks @tschatzl , @kstefanj , and @walulyai for reviews. Still looks good. Thanks. (I am aware this change has already been integrated). 
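For readers following the 8260044 thread above, a schematic of the allocate-then-expand retry loop being reviewed; the types and helper names below (OldGen, try_allocate, try_expand_for) are illustrative stand-ins, not the actual PSOldGen interface:

#include <cstddef>

// Illustrative shape only -- not the real ParallelGC code.
struct OldGen {
  void* try_allocate(size_t word_size);    // assumed: returns null on failure
  bool  try_expand_for(size_t word_size);  // assumed: false once the reserve is exhausted
};

void* allocate_with_expansion(OldGen* gen, size_t word_size) {
  while (true) {
    void* result = gen->try_allocate(word_size);
    if (result != nullptr) {
      return result;                       // allocation succeeded
    }
    if (!gen->try_expand_for(word_size)) {
      return nullptr;                      // cannot expand further: report failure
    }
    // Expansion succeeded (perhaps on behalf of a competing thread): retry the
    // allocation rather than reporting failure and risking a full GC.
  }
}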
------------- PR: https://git.openjdk.java.net/jdk/pull/2309 From ysuenaga at openjdk.java.net Fri Feb 12 09:38:46 2021 From: ysuenaga at openjdk.java.net (Yasumasa Suenaga) Date: Fri, 12 Feb 2021 09:38:46 GMT Subject: RFR: 8247514: Improve clhsdb 'findpc' ability to determine what an address points to by improving PointerFinder and PointerLocation classes [v3] In-Reply-To: References: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> Message-ID: On Thu, 11 Feb 2021 23:52:58 GMT, Chris Plummer wrote: >> See the bug for most details. A few notes here about some implementation details: >> >> In the `PointerLocation` class, I added more consistency w.r.t. whether or not a newline is printed. It used to for some address types, but not others. Now it always does. And if you see a comment something like the following: >> >> ` getTLAB().printOn(tty); // includes "\n" ` >> >> That's just clarifying whether or not the `printOn()` method called will include the newline. Some do and some don't, and knowing what the various `printOn()` methods do makes getting the proper inclusion of the newline easier to understand. >> >> I added `verbose` and `printAddress` boolean arguments to `PointerLocation.printOn()`. Currently they are always `true`. The false arguments will be used when I complete [JDK-8250801](https://bugs.openjdk.java.net/browse/JDK-8250801), which will use `PointerFinder/Location` to show what each register points to. >> >> The CR mentions that the main motivation for this work is for eventual replacement of the old clhsdb `whatis` command, which was implemented in javascript. It used to resolve DSO symbols, whereas `findpc` did not. The `whatis` code did this with the following: >> >> var dso = loadObjectContainingPC(addr); >> if (dso == null) { >> return ptrLoc.toString(); >> } >> var sym = dso.closestSymbolToPC(addr); >> if (sym != null) { >> return sym.name + '+' + sym.offset; >> } >> And now you'll see something similar in the PointerFinder code: >> >> loc.loadObject = cdbg.loadObjectContainingPC(a); >> if (loc.loadObject != null) { >> loc.nativeSymbol = loc.loadObject.closestSymbolToPC(a); >> return loc; >> } >> Note that now that `findpc` does everything that `whatis` used to (and more), we don't really need to add a java version of `whatis`, but I'll probably do so anyway just help out people who are used to using the `whatis` command. That will be done using [JDK-8244670](https://bugs.openjdk.java.net/browse/JDK-8244670) > > Chris Plummer has updated the pull request incrementally with one additional commit since the last revision: > > Fix issue with parsing 'examine' output when there is unexecptected output due to CDS logging or -Xcheck:jni warnings. LGTM We may be able to use regex to collect any addresses from jstack output, but I'm not sure it makes the test code simpler... ------------- Marked as reviewed by ysuenaga (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2111 From kevinw at openjdk.java.net Fri Feb 12 09:38:46 2021 From: kevinw at openjdk.java.net (Kevin Walls) Date: Fri, 12 Feb 2021 09:38:46 GMT Subject: RFR: 8247514: Improve clhsdb 'findpc' ability to determine what an address points to by improving PointerFinder and PointerLocation classes [v3] In-Reply-To: References: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> Message-ID: On Thu, 11 Feb 2021 23:52:58 GMT, Chris Plummer wrote: >> See the bug for most details. 
A few notes here about some implementation details: >> >> In the `PointerLocation` class, I added more consistency w.r.t. whether or not a newline is printed. It used to for some address types, but not others. Now it always does. And if you see a comment something like the following: >> >> ` getTLAB().printOn(tty); // includes "\n" ` >> >> That's just clarifying whether or not the `printOn()` method called will include the newline. Some do and some don't, and knowing what the various `printOn()` methods do makes getting the proper inclusion of the newline easier to understand. >> >> I added `verbose` and `printAddress` boolean arguments to `PointerLocation.printOn()`. Currently they are always `true`. The false arguments will be used when I complete [JDK-8250801](https://bugs.openjdk.java.net/browse/JDK-8250801), which will use `PointerFinder/Location` to show what each register points to. >> >> The CR mentions that the main motivation for this work is for eventual replacement of the old clhsdb `whatis` command, which was implemented in javascript. It used to resolve DSO symbols, whereas `findpc` did not. The `whatis` code did this with the following: >> >> var dso = loadObjectContainingPC(addr); >> if (dso == null) { >> return ptrLoc.toString(); >> } >> var sym = dso.closestSymbolToPC(addr); >> if (sym != null) { >> return sym.name + '+' + sym.offset; >> } >> And now you'll see something similar in the PointerFinder code: >> >> loc.loadObject = cdbg.loadObjectContainingPC(a); >> if (loc.loadObject != null) { >> loc.nativeSymbol = loc.loadObject.closestSymbolToPC(a); >> return loc; >> } >> Note that now that `findpc` does everything that `whatis` used to (and more), we don't really need to add a java version of `whatis`, but I'll probably do so anyway just help out people who are used to using the `whatis` command. That will be done using [JDK-8244670](https://bugs.openjdk.java.net/browse/JDK-8244670) > > Chris Plummer has updated the pull request incrementally with one additional commit since the last revision: > > Fix issue with parsing 'examine' output when there is unexecptected output due to CDS logging or -Xcheck:jni warnings. Yes, joy of text processing. Looks good. ------------- Marked as reviewed by kevinw (Committer). PR: https://git.openjdk.java.net/jdk/pull/2111 From tschatzl at openjdk.java.net Fri Feb 12 09:39:38 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Fri, 12 Feb 2021 09:39:38 GMT Subject: RFR: 8261505: Test test/hotspot/jtreg/gc/parallel/TestDynShrinkHeap.java killed by Linux OOM Killer In-Reply-To: <3qpvJUngDwJhM4n1g-8LcHrKmxzS45welhYUokK9u9o=.1b0db34b-c388-4bea-9f59-2e82dc4e36ce@github.com> References: <3qpvJUngDwJhM4n1g-8LcHrKmxzS45welhYUokK9u9o=.1b0db34b-c388-4bea-9f59-2e82dc4e36ce@github.com> Message-ID: <-gM9imm9AV0u9JQOKkDJ5MpM23p31Tvipg1iP7-26u0=.fbac4b67-194f-4bea-8382-609b7b2092da@github.com> On Wed, 10 Feb 2021 12:13:54 GMT, Christoph G?ttschkes wrote: > On memory constrained devices, the test might get killed by the linux kernel OOM Killer. > > Executing the test with the JTreg test harness makes the test fail and get killed by the OOM Killer. > Executing the test manually, by using the JTreg provided "rerun" command line, the test succeeds. > This happened on a Raspberry PI 2, which has only 1G of memory available. > > I added an "os.maxMemory" requirement, so the test gets skipped. Lgtm. ------------- Marked as reviewed by tschatzl (Reviewer). 
PR: https://git.openjdk.java.net/jdk/pull/2507 From cgo at openjdk.java.net Fri Feb 12 09:44:39 2021 From: cgo at openjdk.java.net (Christoph =?UTF-8?B?R8O2dHRzY2hrZXM=?=) Date: Fri, 12 Feb 2021 09:44:39 GMT Subject: Integrated: 8261505: Test test/hotspot/jtreg/gc/parallel/TestDynShrinkHeap.java killed by Linux OOM Killer In-Reply-To: <3qpvJUngDwJhM4n1g-8LcHrKmxzS45welhYUokK9u9o=.1b0db34b-c388-4bea-9f59-2e82dc4e36ce@github.com> References: <3qpvJUngDwJhM4n1g-8LcHrKmxzS45welhYUokK9u9o=.1b0db34b-c388-4bea-9f59-2e82dc4e36ce@github.com> Message-ID: On Wed, 10 Feb 2021 12:13:54 GMT, Christoph G?ttschkes wrote: > On memory constrained devices, the test might get killed by the linux kernel OOM Killer. > > Executing the test with the JTreg test harness makes the test fail and get killed by the OOM Killer. > Executing the test manually, by using the JTreg provided "rerun" command line, the test succeeds. > This happened on a Raspberry PI 2, which has only 1G of memory available. > > I added an "os.maxMemory" requirement, so the test gets skipped. This pull request has now been integrated. Changeset: ebaa58d9 Author: Christoph G?ttschkes Committer: Stefan Johansson URL: https://git.openjdk.java.net/jdk/commit/ebaa58d9 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod 8261505: Test test/hotspot/jtreg/gc/parallel/TestDynShrinkHeap.java killed by Linux OOM Killer Reviewed-by: sjohanss, tschatzl ------------- PR: https://git.openjdk.java.net/jdk/pull/2507 From kim.barrett at oracle.com Fri Feb 12 09:58:37 2021 From: kim.barrett at oracle.com (Kim Barrett) Date: Fri, 12 Feb 2021 09:58:37 +0000 Subject: Atomic operations: your thoughts are welocme In-Reply-To: <49d0408a-13f9-ddc8-06e3-e0eb27a708dd@redhat.com> References: <448C638F-D688-4913-875C-5D8BA9235126@oracle.com> <49d0408a-13f9-ddc8-06e3-e0eb27a708dd@redhat.com> Message-ID: <032D7C47-8862-4FDE-9B88-CE209D64C46F@oracle.com> > On Feb 11, 2021, at 8:33 AM, Andrew Haley wrote: > > On 11/02/2021 03:59, Kim Barrett wrote: >>> On Feb 8, 2021, at 1:14 PM, Andrew Haley wrote: >>> >>> I've been looking at the hottest Atomic operations in HotSpot, with a view to >>> finding out if the default memory_order_conservative (which is very expensive >>> on some architectures) can be weakened to something less. It's impossible to >>> fix all of them, but perhaps we can fix some of the most frequent. >> >> Is there any information about the possible performance improvement from >> such changes? 1.5-3M occurrences doesn't mean much without context. >> >> We don't presently have support for sequentially consistent semantics, only >> "conservative". My recollection is that this is in part because there might >> be code that is assuming the possibly stronger "conservative" semantics, and >> in part because there are different and incompatible approaches to >> implementing sequentially consistent semantics on some hardware platforms >> and we didn't want to make assumptions there. >> >> We also don't presently have any cmpxchg implementation that really supports >> anything between conservative and relaxed, nor do we support different order >> constraints for the success vs failure cases. Things can be complicated >> enough as is; while we *could* fill some of that in, I'm not sure we should. > > OK. However, even though we don't implement any of them, we do have an > API that includes acq, rel, and seq_cst. The fact that we don't have > anything behind them is, I thought, To Be Done rather than Won't Do. My inclination is to be pretty conservative in this area. 
(No pun intended.) I'm not eager to have a lot of reviews like that for JDK-8154736. (And in looking back at that, I see we ended up not addressing non-ppc platforms, even though there was specific concern at the time that by not dealing with them (particularly arm/aarch64) that we might be fobbing off some really hard debugging on some poor future person.) >>> ::Table::oop_oop_iterate(G1CMOopClosure*, oopDesc*, Klass*)+336>: :: = 3903178 >>> >>> This is actually MarkBitMap::par_mark calling BitMap::par_set_bit. Does this >>> need to be memory_order_conservative, or would something weaker do? Even >>> acq_rel or seq_cst would be better. >> >> I think for setting bits in a bitmap the thing to do would be to identify >> places that are safe and useful (impacts performance) to do so first. Then >> add a weaker variant for use in those places, assuming any are found. > > I see. I'm assuming that frequency of use is a useful proxy for impact. > Aleksey has already, very helpfully, measured how significant these are > for Shenandoah, and I suspect all concurrent GCs would benefit in a > similar fashion. Absolute counts don't say much without context. So what if there are a million of these, if they are swamped by the 100 bazillion not-these? Aleksey's measurements turned out to be less informative to me than they seemed at first reading. Many of the proposed changes involve simple counters or accumulators. Changing such to use relaxed atomic addition operations is likely an easy improvement. But even that can suffer badly from contention. If one is serious about reducing the cost of multi-threaded accumulators, much better would be something like http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2020/p0261r4.html >>> , (MEMFLAGS)5>*)+432>: :: = 1617659 >>> >>> This one is GenericTaskQueue::pop_global calling cmpxchg_age(). >>> Again, do we need conservative here? >> >> This needs at least sequentially consistent semantics on the success path. > > Yep. That's easy, it's the full barrier in the failure path that > I'd love to eliminate. Why does the failure path matter here? It should be rare [*], since it only fails when either there is contention between a thief and the owner for the sole entry in the queue, or there is contention between multiple thieves. The former should be rare because non-empty queues usually contain more than one element. The latter should be rare because of the random selection of queues the steal from. And in both cases a losing thief will look for a new queue to steal from. [*] The age/top (where pop_global takes from) and bottom (where push adds and pop_local takes from) used to be adjacent members, so local operations might induce false-sharing failures for the age/top CAS. These members were separated in JDK 15. 
From aph at redhat.com Fri Feb 12 10:25:42 2021 From: aph at redhat.com (Andrew Haley) Date: Fri, 12 Feb 2021 10:25:42 +0000 Subject: Atomic operations: your thoughts are welocme In-Reply-To: <032D7C47-8862-4FDE-9B88-CE209D64C46F@oracle.com> References: <448C638F-D688-4913-875C-5D8BA9235126@oracle.com> <49d0408a-13f9-ddc8-06e3-e0eb27a708dd@redhat.com> <032D7C47-8862-4FDE-9B88-CE209D64C46F@oracle.com> Message-ID: <28181731-2880-5f85-80db-354881440295@redhat.com> On 12/02/2021 09:58, Kim Barrett wrote: >> On Feb 11, 2021, at 8:33 AM, Andrew Haley wrote: >> >> On 11/02/2021 03:59, Kim Barrett wrote: >>> >>> We also don't presently have any cmpxchg implementation that really supports >>> anything between conservative and relaxed, nor do we support different order >>> constraints for the success vs failure cases. Things can be complicated >>> enough as is; while we *could* fill some of that in, I'm not sure we should. >> >> OK. However, even though we don't implement any of them, we do have an >> API that includes acq, rel, and seq_cst. The fact that we don't have >> anything behind them is, I thought, To Be Done rather than Won't Do. > > My inclination is to be pretty conservative in this area. (No pun intended.) > I'm not eager to have a lot of reviews like that for JDK-8154736. (And in > looking back at that, I see we ended up not addressing non-ppc platforms, > even though there was specific concern at the time that by not dealing with > them (particularly arm/aarch64) that we might be fobbing off some really > hard debugging on some poor future person.) Sure, and as you are probably aware I've had to do that, more than once, on dusty old GC code that didn't follow the memory model. IMVHO, there are not many places where seq_cst won't be adequate. >> I see. I'm assuming that frequency of use is a useful proxy for impact. >> Aleksey has already, very helpfully, measured how significant these are >> for Shenandoah, and I suspect all concurrent GCs would benefit in a >> similar fashion. > > Absolute counts don't say much without context. So what if there are a > million of these, if they are swamped by the 100 bazillion not-these? > > Aleksey's measurements turned out to be less informative to me than they > seemed at first reading. Many of the proposed changes involve simple > counters or accumulators. Changing such to use relaxed atomic addition > operations is likely an easy improvement. But even that can suffer badly > from contention. If one is serious about reducing the cost of multi-threaded > accumulators, much better would be something like > http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2020/p0261r4.html I very strongly disagree. Aleksey managed to prove a substantial gain with only a couple of hours' work. We're talking about low- hanging fruit here. >>>> , (MEMFLAGS)5>*)+432>: :: = 1617659 >>>> >>>> This one is GenericTaskQueue::pop_global calling cmpxchg_age(). >>>> Again, do we need conservative here? >>> >>> This needs at least sequentially consistent semantics on the success path. >> >> Yep. That's easy, it's the full barrier in the failure path that >> I'd love to eliminate. > > Why does the failure path matter here? > > It should be rare [*], since it only fails when either there is contention > between a thief and the owner for the sole entry in the queue, or there is > contention between multiple thieves. OK, so that's useful guidance for an implementer: full barriers for CAS failures should be wrapped in a conditional. 
That is a pain, because it complexifies the code, but OK. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From ayang at openjdk.java.net Fri Feb 12 12:27:45 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Fri, 12 Feb 2021 12:27:45 GMT Subject: RFR: 8261230: GC tracing of page sizes are wrong in a few places [v2] In-Reply-To: References: Message-ID: On Thu, 11 Feb 2021 18:10:57 GMT, Stefan Johansson wrote: >> The usage of `os::trace_page_sizes()` and friends are wrongly assuming that we always get the page size requested and needs to be updated. This is done by using the helper `ReservedSpace::actual_reserved_page_size()` instead of blindly trusting we get what we ask for. I have plans for the future to get rid of this helper and instead record the page size used in the `ReservedSpace`, but for now the helper is good enough. >> >> In G1 we used the helper but switched the order of the page size and the alignment parameter, which in turn helped the test to pass since the alignment will match the page size we expect in the test. The test had to be improved to recognize mapping failures. > > Stefan Johansson has updated the pull request incrementally with one additional commit since the last revision: > > Albert review > > Renamed helper to improve how the code read. Also extracted the failure check into a separate function. Marked as reviewed by ayang (Author). ------------- PR: https://git.openjdk.java.net/jdk/pull/2486 From ayang at openjdk.java.net Fri Feb 12 12:27:45 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Fri, 12 Feb 2021 12:27:45 GMT Subject: RFR: 8261230: GC tracing of page sizes are wrong in a few places [v2] In-Reply-To: References: <4iQZOAzPHRx26UBskJ6u6TY_gGyr0QwcsbqD9KCNiiU=.4bcea2f1-d5a7-4880-81e8-611bb78dba5c@github.com> Message-ID: On Thu, 11 Feb 2021 17:29:34 GMT, Stefan Johansson wrote: >> Yes, and also for `checkLargePagesEnabled`. It's not obvious to me why we parse the output in those two places, one looking for the failure mode, and the other looking for the success mode. That's why I asked for an sample of the "expected" log output. > > I think the check if large pages are enable is pretty straight forward. We should never expect large page sizes in the output unless large pages are enabled. I do however agree that this check is a bit clunky. Would it help to extract it to a separate function? Something like `largePagesAllocationFailure(pattern)`, I could also change the name of the function above to just be `largePagesEnabled()` then the code reads even better? I overlooked the fact that `largePagesAllocationFailure` is pattern/data structure specific. I am happy with the current patch. Thank you. ------------- PR: https://git.openjdk.java.net/jdk/pull/2486 From zgu at openjdk.java.net Fri Feb 12 13:18:41 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Fri, 12 Feb 2021 13:18:41 GMT Subject: RFR: 8261493: Shenandoah: reconsider bitmap access memory ordering In-Reply-To: References: Message-ID: On Fri, 12 Feb 2021 07:31:34 GMT, Aleksey Shipilev wrote: > > Now, you need load barrier on read side (e.g. ShenandoahMarkBitMap::at()). Although, it is not a correctness issue, but seeing stale value means extra unnecessary work. > > I don't see why. Adding load barriers would not affect promptness of seeing the memory updates to the bitmap itself. 
It might affect the promptness of seeing the object contents that we are reading after asking `is_marked` -- but that would be a race either way, because we do not use mark bitmap for memory ordering at all (i.e. there is no "release" on bitmap update). The `load barrier` I were thinking, is something that can prompt seeing the updating of bitmap. I did a little digging, it does not seem we have something like that. We rarely call is_marked() and variants during mark phase, except SATB filtering. I wonder if adding a leading fence in SH::requires_marking() can accomplish that. Although, it is still a race, but I think 1) small price to pay compares to the work of enqueuing a marked oop 2) might help the termination by not enqueuing marked oops. ------------- PR: https://git.openjdk.java.net/jdk/pull/2497 From sjohanss at openjdk.java.net Fri Feb 12 14:59:38 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Fri, 12 Feb 2021 14:59:38 GMT Subject: Integrated: 8261230: GC tracing of page sizes are wrong in a few places In-Reply-To: References: Message-ID: On Tue, 9 Feb 2021 19:42:15 GMT, Stefan Johansson wrote: > The usage of `os::trace_page_sizes()` and friends are wrongly assuming that we always get the page size requested and needs to be updated. This is done by using the helper `ReservedSpace::actual_reserved_page_size()` instead of blindly trusting we get what we ask for. I have plans for the future to get rid of this helper and instead record the page size used in the `ReservedSpace`, but for now the helper is good enough. > > In G1 we used the helper but switched the order of the page size and the alignment parameter, which in turn helped the test to pass since the alignment will match the page size we expect in the test. The test had to be improved to recognize mapping failures. This pull request has now been integrated. Changeset: 9f81ca81 Author: Stefan Johansson URL: https://git.openjdk.java.net/jdk/commit/9f81ca81 Stats: 47 lines in 4 files changed: 40 ins; 1 del; 6 mod 8261230: GC tracing of page sizes are wrong in a few places Reviewed-by: ayang, stuefe ------------- PR: https://git.openjdk.java.net/jdk/pull/2486 From sjohanss at openjdk.java.net Fri Feb 12 15:19:40 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Fri, 12 Feb 2021 15:19:40 GMT Subject: RFR: 8261230: GC tracing of page sizes are wrong in a few places [v2] In-Reply-To: References: Message-ID: <0v1p1uxvWd-13AOErjQP0Lri9b2nSJLbhqhvde802E4=.10c5cace-b517-4674-a0b3-2dbef10dca64@github.com> On Fri, 12 Feb 2021 12:24:29 GMT, Albert Mingkun Yang wrote: >> Stefan Johansson has updated the pull request incrementally with one additional commit since the last revision: >> >> Albert review >> >> Renamed helper to improve how the code read. Also extracted the failure check into a separate function. > > Marked as reviewed by ayang (Author). Thanks for the reviews @albertnetymk and @tstuefe! ------------- PR: https://git.openjdk.java.net/jdk/pull/2486 From shade at openjdk.java.net Fri Feb 12 15:53:38 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Fri, 12 Feb 2021 15:53:38 GMT Subject: RFR: 8261493: Shenandoah: reconsider bitmap access memory ordering In-Reply-To: References: Message-ID: On Fri, 12 Feb 2021 13:16:03 GMT, Zhengyu Gu wrote: > The `load barrier` I were thinking, is something that can prompt seeing the updating of bitmap. I did a little digging, it does not seem we have something like that. 
> > We rarely call is_marked() and variants during mark phase, except SATB filtering. I wonder if adding a leading fence in SH::requires_marking() can accomplish that. Although, it is still a race, but I think 1) small price to pay compares to the work of enqueuing a marked oop 2) might help the termination by not enqueuing marked oops. I suspect it would not help much, mostly because hardware does not hoard mutable data on the timescales that are important for performance (they do it on timescales that are important for correctness though: data coming out of order "just" 100ps later is still out of order). I believe we would be paying barrier costs for a very little gain in promptness. Our current handshaking-before-final-mark and SATB locking provides enough of memory bashing, I think. ------------- PR: https://git.openjdk.java.net/jdk/pull/2497 From zgu at openjdk.java.net Fri Feb 12 19:03:39 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Fri, 12 Feb 2021 19:03:39 GMT Subject: RFR: 8261493: Shenandoah: reconsider bitmap access memory ordering In-Reply-To: References: Message-ID: On Wed, 10 Feb 2021 09:32:18 GMT, Aleksey Shipilev wrote: > Shenandoah currently uses its own marking bitmap (added by JDK-8254315). It accesses the marking bitmap with "acquire" for reads and "conservative" for updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > I think both are actually excessive for marking bitmap accesses: we do not piggyback object updates on it, the atomics there are only to guarantee the access atomicity and CAS updates to bits. It seems "relaxed" is enough for marking bitmap accesses. > > Sample run with "compact" (frequent GC cycles) on SPECjvm2008:compiler.sunflow on AArch64: > > # Baseline > # Baseline > [146.028s][info][gc,stats] Concurrent Marking = 50.315 s (a = 258024 us) (n = 195) (lvls, us = 31836, 230469, 273438, 306641, 464255) > [141.458s][info][gc,stats] Concurrent Marking = 47.819 s (a = 242737 us) (n = 197) (lvls, us = 42773, 197266, 267578, 287109, 433948) > [144.108s][info][gc,stats] Concurrent Marking = 49.806 s (a = 250283 us) (n = 199) (lvls, us = 32227, 201172, 267578, 296875, 448549) > > # Patched > [144.238s][info][gc,stats] Concurrent Marking = 46.627 s (a = 220981 us) (n = 211) (lvls, us = 24414, 197266, 238281, 259766, 345112) > [138.406s][info][gc,stats] Concurrent Marking = 45.022 s (a = 227383 us) (n = 198) (lvls, us = 20508, 205078, 244141, 271484, 427658) > [140.950s][info][gc,stats] Concurrent Marking = 45.073 s (a = 222036 us) (n = 203) (lvls, us = 21680, 181641, 240234, 265625, 375750) > > Average time goes down, total marking time goes down. > > Additional testing: > - [x] Linux x86_64 `hotspot_gc_shenandoah` > - [x] Linux AArch64 `hotspot_gc_shenandoah` > - [x] Linux AArch64 `tier1` with Shenandoah Marked as reviewed by zgu (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2497 From zgu at openjdk.java.net Fri Feb 12 19:03:40 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Fri, 12 Feb 2021 19:03:40 GMT Subject: RFR: 8261493: Shenandoah: reconsider bitmap access memory ordering In-Reply-To: References: Message-ID: <-k9uEgpi0Kil5Jze_XXSAHngq5nuRTatgFxReISTyIQ=.15e7429b-9a2c-4e73-b8c9-80741bcc781d@github.com> On Fri, 12 Feb 2021 15:50:31 GMT, Aleksey Shipilev wrote: > > The `load barrier` I were thinking, is something that can prompt seeing the updating of bitmap. 
I did a little digging, it does not seem we have something like that. > > We rarely call is_marked() and variants during mark phase, except SATB filtering. I wonder if adding a leading fence in SH::requires_marking() can accomplish that. Although, it is still a race, but I think 1) small price to pay compares to the work of enqueuing a marked oop 2) might help the termination by not enqueuing marked oops. > > I suspect it would not help much, mostly because hardware does not hoard mutable data on the timescales that are important for performance (they do it on timescales that are important for correctness though: data coming out of order "just" 100ps later is still out of order). I believe we would be paying barrier costs for a very little gain in promptness. Our current handshaking-before-final-mark and SATB locking provides enough of memory bashing, I think. Okay, then. ------------- PR: https://git.openjdk.java.net/jdk/pull/2497 From dcubed at openjdk.java.net Fri Feb 12 19:59:52 2021 From: dcubed at openjdk.java.net (Daniel D.Daugherty) Date: Fri, 12 Feb 2021 19:59:52 GMT Subject: RFR: 8261661: gc/stress/TestReclaimStringsLeaksMemory.java fails because Reserved memory size is too big In-Reply-To: References: Message-ID: <7Io5yLfZ2FY7XuBAzV5iLmU6CLhvCG-MrFtSKiO6FY4=.c05e6878-b879-4e87-87ac-319fd466b7c5@github.com> On Fri, 12 Feb 2021 19:53:23 GMT, Daniel D. Daugherty wrote: > A trivial fix to adjust a test to work with the fix from: > > https://bugs.openjdk.java.net/browse/JDK-8261297 NMT: Final report should use scale 1 > > The idea for the fix came from @albertnetymk. Thanks! > The failure reproduces on my local MBP13 and does not reproduce > with this fix in place. @tstuefe and @zhengyu123 - Please check out this test adjustment that was needed due to the fix for: https://bugs.openjdk.java.net/browse/JDK-8261297 NMT: Final report should use scale 1 ------------- PR: https://git.openjdk.java.net/jdk/pull/2557 From dcubed at openjdk.java.net Fri Feb 12 19:59:51 2021 From: dcubed at openjdk.java.net (Daniel D.Daugherty) Date: Fri, 12 Feb 2021 19:59:51 GMT Subject: RFR: 8261661: gc/stress/TestReclaimStringsLeaksMemory.java fails because Reserved memory size is too big Message-ID: A trivial fix to adjust a test to work with the fix from: https://bugs.openjdk.java.net/browse/JDK-8261297 NMT: Final report should use scale 1 The idea for the fix came from @albertnetymk. Thanks! The failure reproduces on my local MBP13 and does not reproduce with this fix in place. 
------------- Commit messages: - 8261661: gc/stress/TestReclaimStringsLeaksMemory.java fails because Reserved memory size is too big Changes: https://git.openjdk.java.net/jdk/pull/2557/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2557&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261661 Stats: 4 lines in 1 file changed: 0 ins; 0 del; 4 mod Patch: https://git.openjdk.java.net/jdk/pull/2557.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2557/head:pull/2557 PR: https://git.openjdk.java.net/jdk/pull/2557 From dcubed at openjdk.java.net Fri Feb 12 20:06:38 2021 From: dcubed at openjdk.java.net (Daniel D.Daugherty) Date: Fri, 12 Feb 2021 20:06:38 GMT Subject: RFR: 8261661: gc/stress/TestReclaimStringsLeaksMemory.java fails because Reserved memory size is too big In-Reply-To: <7Io5yLfZ2FY7XuBAzV5iLmU6CLhvCG-MrFtSKiO6FY4=.c05e6878-b879-4e87-87ac-319fd466b7c5@github.com> References: <7Io5yLfZ2FY7XuBAzV5iLmU6CLhvCG-MrFtSKiO6FY4=.c05e6878-b879-4e87-87ac-319fd466b7c5@github.com> Message-ID: <0uGNzLkZbcRjLCmVFdxlj5vT-38Qwqhji8vtgp1F5iA=.ac86279e-9061-4488-92ae-70fc05d1e2ff@github.com> On Fri, 12 Feb 2021 19:56:37 GMT, Daniel D. Daugherty wrote: >> A trivial fix to adjust a test to work with the fix from: >> >> https://bugs.openjdk.java.net/browse/JDK-8261297 NMT: Final report should use scale 1 >> >> The idea for the fix came from @albertnetymk. Thanks! >> The failure reproduces on my local MBP13 and does not reproduce >> with this fix in place. > > @tstuefe and @zhengyu123 - Please check out this test adjustment that was needed > due to the fix for: > https://bugs.openjdk.java.net/browse/JDK-8261297 NMT: Final report should use scale 1 The reason I'm pursuing this fix is to reduce the noise in the JDK17 CI for the weekend. ------------- PR: https://git.openjdk.java.net/jdk/pull/2557 From ayang at openjdk.java.net Fri Feb 12 20:21:41 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Fri, 12 Feb 2021 20:21:41 GMT Subject: RFR: 8261661: gc/stress/TestReclaimStringsLeaksMemory.java fails because Reserved memory size is too big In-Reply-To: References: Message-ID: On Fri, 12 Feb 2021 19:53:23 GMT, Daniel D. Daugherty wrote: > A trivial fix to adjust a test to work with the fix from: > > https://bugs.openjdk.java.net/browse/JDK-8261297 NMT: Final report should use scale 1 > > The idea for the fix came from @albertnetymk. Thanks! > The failure reproduces on my local MBP13 and does not reproduce > with this fix in place. Marked as reviewed by ayang (Author). ------------- PR: https://git.openjdk.java.net/jdk/pull/2557 From dcubed at openjdk.java.net Fri Feb 12 20:25:39 2021 From: dcubed at openjdk.java.net (Daniel D.Daugherty) Date: Fri, 12 Feb 2021 20:25:39 GMT Subject: RFR: 8261661: gc/stress/TestReclaimStringsLeaksMemory.java fails because Reserved memory size is too big In-Reply-To: References: Message-ID: On Fri, 12 Feb 2021 20:18:41 GMT, Albert Mingkun Yang wrote: >> A trivial fix to adjust a test to work with the fix from: >> >> https://bugs.openjdk.java.net/browse/JDK-8261297 NMT: Final report should use scale 1 >> >> The idea for the fix came from @albertnetymk. Thanks! >> The failure reproduces on my local MBP13 and does not reproduce >> with this fix in place. > > Marked as reviewed by ayang (Author). @albertnetymk - Thanks for the fast review! 
------------- PR: https://git.openjdk.java.net/jdk/pull/2557 From rkennke at openjdk.java.net Fri Feb 12 21:45:57 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Fri, 12 Feb 2021 21:45:57 GMT Subject: RFR: 8261448: Preserve GC stack watermark across safepoints in StackWalk [v2] In-Reply-To: References: Message-ID: > I am observing the following assert: > > # Internal Error (/home/rkennke/src/openjdk/loom/src/hotspot/share/runtime/stackWatermark.cpp:178), pid=54418, tid=54534 > # assert(is_frame_safe(f)) failed: Frame must be safe > > (see issue for full hs_err) > > In StackWalk::fetchNextBatch() we prepare the entire stack to be processed by calling StackWatermarkSet::finish_processing(jt, NULL, StackWatermarkKind::gc), but then subsequently, during frames scan, perform allocations to fill in the frame information (fill_in_frames => LiveFrameStream::fill_frame => fill_live_stackframe) at where we could safepoint for GC, which could reset the stack watermark. > > This is only relevant for GCs that use the StackWatermark, e.g. ZGC and Shenandoah at the moment. > > Solution is to preserve the stack-watermark across safepoints in StackWalk::fetchNextBatch(). StackWalk::fetchFirstBatch() doesn't look to be affected by this: it is not using the stack-watermark. > > Testing: > - [x] StackWalk tests with Shenandoah/aggressive > - [x] StackWalk tests with ZGC/aggressive > - [ ] tier1 (+Shenandoah/ZGC) > - [ ] tier2 (+Shenandoah/ZGC) Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: Make KeepStackGCProcessedMark reentrant; Place a KeepStackGCProcessedMark in StackWalker::fetchFirstBatch() ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2500/files - new: https://git.openjdk.java.net/jdk/pull/2500/files/72f20e13..6946499c Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2500&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2500&range=00-01 Stats: 12 lines in 4 files changed: 10 ins; 0 del; 2 mod Patch: https://git.openjdk.java.net/jdk/pull/2500.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2500/head:pull/2500 PR: https://git.openjdk.java.net/jdk/pull/2500 From rkennke at openjdk.java.net Fri Feb 12 21:45:57 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Fri, 12 Feb 2021 21:45:57 GMT Subject: RFR: 8261448: Preserve GC stack watermark across safepoints in StackWalk In-Reply-To: <2X3mb-VkqGf_YYSIeb3n9pxXmocT1GkUYDYI_C8cOZo=.3f2fab17-f8f6-4860-a6b4-0a6bb6a1256f@github.com> References: <2X3mb-VkqGf_YYSIeb3n9pxXmocT1GkUYDYI_C8cOZo=.3f2fab17-f8f6-4860-a6b4-0a6bb6a1256f@github.com> Message-ID: On Wed, 10 Feb 2021 12:38:10 GMT, Roman Kennke wrote: >> I am observing the following assert: >> >> # Internal Error (/home/rkennke/src/openjdk/loom/src/hotspot/share/runtime/stackWatermark.cpp:178), pid=54418, tid=54534 >> # assert(is_frame_safe(f)) failed: Frame must be safe >> >> (see issue for full hs_err) >> >> In StackWalk::fetchNextBatch() we prepare the entire stack to be processed by calling StackWatermarkSet::finish_processing(jt, NULL, StackWatermarkKind::gc), but then subsequently, during frames scan, perform allocations to fill in the frame information (fill_in_frames => LiveFrameStream::fill_frame => fill_live_stackframe) at where we could safepoint for GC, which could reset the stack watermark. >> >> This is only relevant for GCs that use the StackWatermark, e.g. ZGC and Shenandoah at the moment. 
>> >> Solution is to preserve the stack-watermark across safepoints in StackWalk::fetchNextBatch(). StackWalk::fetchFirstBatch() doesn't look to be affected by this: it is not using the stack-watermark. >> >> Testing: >> - [x] StackWalk tests with Shenandoah/aggressive >> - [x] StackWalk tests with ZGC/aggressive >> - [ ] tier1 (+Shenandoah/ZGC) >> - [ ] tier2 (+Shenandoah/ZGC) > > I'm converting back to draft. The Loom tests (test/jdk/java/lang/Continuation/*) are still failing and it looks like fetchFirstBatch() does indeed require treatment, and it's complicated because fetchFirstBatch() may end up calling fetchNextBatch() and the KeepStackGCProcessedMark is not reentrant. I tested the original patch in Loom with tests that use stack-walking and it failed because we'd need another KeepStackGCProcessedMark in fetchFirstBatch() too. Unfortunately, fetchFirstBatch() can wind up calling fetchNextBatch() recursively, but we *also* can call fetchNextBatch() without calling fetchFirstBatch() on outer frame, thus we need KeepStackGCProcessedMark to be reentrant. I achieved this by linking together nested linked watermark. I am not sure this is the right way to achieve it. It fixes all tests in Loom *and* mainline JDK though. ------------- PR: https://git.openjdk.java.net/jdk/pull/2500 From kbarrett at openjdk.java.net Fri Feb 12 21:51:38 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Fri, 12 Feb 2021 21:51:38 GMT Subject: RFR: 8261661: gc/stress/TestReclaimStringsLeaksMemory.java fails because Reserved memory size is too big In-Reply-To: References: Message-ID: On Fri, 12 Feb 2021 19:53:23 GMT, Daniel D. Daugherty wrote: > A trivial fix to adjust a test to work with the fix from: > > https://bugs.openjdk.java.net/browse/JDK-8261297 NMT: Final report should use scale 1 > > The idea for the fix came from @albertnetymk. Thanks! > The failure reproduces on my local MBP13 and does not reproduce > with this fix in place. Looks good. ------------- Marked as reviewed by kbarrett (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2557 From cjplummer at openjdk.java.net Fri Feb 12 22:04:41 2021 From: cjplummer at openjdk.java.net (Chris Plummer) Date: Fri, 12 Feb 2021 22:04:41 GMT Subject: Integrated: 8247514: Improve clhsdb 'findpc' ability to determine what an address points to by improving PointerFinder and PointerLocation classes In-Reply-To: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> References: <4YKNpyXQ9QGrLhR61tkh71Q3A7VvCj5Ete_4OvzAA-o=.28b7be8c-6f05-42d4-892b-87ebea907b24@github.com> Message-ID: On Sun, 17 Jan 2021 03:57:59 GMT, Chris Plummer wrote: > See the bug for most details. A few notes here about some implementation details: > > In the `PointerLocation` class, I added more consistency w.r.t. whether or not a newline is printed. It used to for some address types, but not others. Now it always does. And if you see a comment something like the following: > > ` getTLAB().printOn(tty); // includes "\n" ` > > That's just clarifying whether or not the `printOn()` method called will include the newline. Some do and some don't, and knowing what the various `printOn()` methods do makes getting the proper inclusion of the newline easier to understand. > > I added `verbose` and `printAddress` boolean arguments to `PointerLocation.printOn()`. Currently they are always `true`. 
The false arguments will be used when I complete [JDK-8250801](https://bugs.openjdk.java.net/browse/JDK-8250801), which will use `PointerFinder/Location` to show what each register points to. > > The CR mentions that the main motivation for this work is for eventual replacement of the old clhsdb `whatis` command, which was implemented in javascript. It used to resolve DSO symbols, whereas `findpc` did not. The `whatis` code did this with the following: > > var dso = loadObjectContainingPC(addr); > if (dso == null) { > return ptrLoc.toString(); > } > var sym = dso.closestSymbolToPC(addr); > if (sym != null) { > return sym.name + '+' + sym.offset; > } > And now you'll see something similar in the PointerFinder code: > > loc.loadObject = cdbg.loadObjectContainingPC(a); > if (loc.loadObject != null) { > loc.nativeSymbol = loc.loadObject.closestSymbolToPC(a); > return loc; > } > Note that now that `findpc` does everything that `whatis` used to (and more), we don't really need to add a java version of `whatis`, but I'll probably do so anyway just help out people who are used to using the `whatis` command. That will be done using [JDK-8244670](https://bugs.openjdk.java.net/browse/JDK-8244670) This pull request has now been integrated. Changeset: e29c560a Author: Chris Plummer URL: https://git.openjdk.java.net/jdk/commit/e29c560a Stats: 292 lines in 5 files changed: 238 ins; 8 del; 46 mod 8247514: Improve clhsdb 'findpc' ability to determine what an address points to by improving PointerFinder and PointerLocation classes Reviewed-by: ysuenaga, kevinw ------------- PR: https://git.openjdk.java.net/jdk/pull/2111 From dcubed at openjdk.java.net Fri Feb 12 22:08:39 2021 From: dcubed at openjdk.java.net (Daniel D.Daugherty) Date: Fri, 12 Feb 2021 22:08:39 GMT Subject: RFR: 8261661: gc/stress/TestReclaimStringsLeaksMemory.java fails because Reserved memory size is too big In-Reply-To: References: Message-ID: <4sToElO22vfEpbWPzcab-lsik2ntERqa3XqgzeWZHmQ=.03f022fb-6f6f-4441-9995-56c0ed6c7f40@github.com> On Fri, 12 Feb 2021 21:48:28 GMT, Kim Barrett wrote: >> A trivial fix to adjust a test to work with the fix from: >> >> https://bugs.openjdk.java.net/browse/JDK-8261297 NMT: Final report should use scale 1 >> >> The idea for the fix came from @albertnetymk. Thanks! >> The failure reproduces on my local MBP13 and does not reproduce >> with this fix in place. > > Looks good. @kimbarrett - Thanks for the review! Do you think we need to wait for a Runtime/NMT reviewer? ------------- PR: https://git.openjdk.java.net/jdk/pull/2557 From lkorinth at openjdk.java.net Fri Feb 12 22:37:51 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Fri, 12 Feb 2021 22:37:51 GMT Subject: RFR: 8260414: Remove unused set_single_threaded_mode() method in task executor Message-ID: This code is not used any more. 
------------- Commit messages: - 8260414: Remove unused set_single_threaded_mode() method in task executor Changes: https://git.openjdk.java.net/jdk/pull/2558/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2558&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8260414 Stats: 9 lines in 2 files changed: 0 ins; 8 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2558.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2558/head:pull/2558 PR: https://git.openjdk.java.net/jdk/pull/2558 From dcubed at openjdk.java.net Fri Feb 12 22:44:39 2021 From: dcubed at openjdk.java.net (Daniel D.Daugherty) Date: Fri, 12 Feb 2021 22:44:39 GMT Subject: Integrated: 8261661: gc/stress/TestReclaimStringsLeaksMemory.java fails because Reserved memory size is too big In-Reply-To: References: Message-ID: On Fri, 12 Feb 2021 19:53:23 GMT, Daniel D. Daugherty wrote: > A trivial fix to adjust a test to work with the fix from: > > https://bugs.openjdk.java.net/browse/JDK-8261297 NMT: Final report should use scale 1 > > The idea for the fix came from @albertnetymk. Thanks! > The failure reproduces on my local MBP13 and does not reproduce > with this fix in place. This pull request has now been integrated. Changeset: 735757f1 Author: Daniel D. Daugherty URL: https://git.openjdk.java.net/jdk/commit/735757f1 Stats: 4 lines in 1 file changed: 0 ins; 0 del; 4 mod 8261661: gc/stress/TestReclaimStringsLeaksMemory.java fails because Reserved memory size is too big Co-authored-by: Albert Mingkun Yang Reviewed-by: ayang, kbarrett ------------- PR: https://git.openjdk.java.net/jdk/pull/2557 From dcubed at openjdk.java.net Fri Feb 12 22:44:38 2021 From: dcubed at openjdk.java.net (Daniel D.Daugherty) Date: Fri, 12 Feb 2021 22:44:38 GMT Subject: RFR: 8261661: gc/stress/TestReclaimStringsLeaksMemory.java fails because Reserved memory size is too big In-Reply-To: References: Message-ID: On Fri, 12 Feb 2021 21:48:28 GMT, Kim Barrett wrote: >> A trivial fix to adjust a test to work with the fix from: >> >> https://bugs.openjdk.java.net/browse/JDK-8261297 NMT: Final report should use scale 1 >> >> The idea for the fix came from @albertnetymk. Thanks! >> The failure reproduces on my local MBP13 and does not reproduce >> with this fix in place. > > Looks good. @kimbarrett and @coleenp just told me that I don't really need to wait for a Runtime/NMT review. ------------- PR: https://git.openjdk.java.net/jdk/pull/2557 From eosterlund at openjdk.java.net Fri Feb 12 23:17:39 2021 From: eosterlund at openjdk.java.net (Erik =?UTF-8?B?w5ZzdGVybHVuZA==?=) Date: Fri, 12 Feb 2021 23:17:39 GMT Subject: RFR: 8261448: Preserve GC stack watermark across safepoints in StackWalk In-Reply-To: References: <2X3mb-VkqGf_YYSIeb3n9pxXmocT1GkUYDYI_C8cOZo=.3f2fab17-f8f6-4860-a6b4-0a6bb6a1256f@github.com> Message-ID: On Fri, 12 Feb 2021 21:43:20 GMT, Roman Kennke wrote: >> I'm converting back to draft. The Loom tests (test/jdk/java/lang/Continuation/*) are still failing and it looks like fetchFirstBatch() does indeed require treatment, and it's complicated because fetchFirstBatch() may end up calling fetchNextBatch() and the KeepStackGCProcessedMark is not reentrant. > > I tested the original patch in Loom with tests that use stack-walking and it failed because we'd need another KeepStackGCProcessedMark in fetchFirstBatch() too. 
Unfortunately, fetchFirstBatch() can wind up calling fetchNextBatch() recursively, but we *also* can call fetchNextBatch() without calling fetchFirstBatch() on outer frame, thus we need KeepStackGCProcessedMark to be reentrant. I achieved this by linking together nested linked watermark. I am not sure this is the right way to achieve it. It fixes all tests in Loom *and* mainline JDK though. I think this solution is wrong, regarding nesting. There is only a single node but it looks like you think there are multiple. The result is seemingly that the unlink function won't unlink anything, which permanently disables incremental stack scanning on that thread. Is there any way the mark can be placed closer to the problematic allocation so we don't need nesting? ------------- PR: https://git.openjdk.java.net/jdk/pull/2500 From eosterlund at openjdk.java.net Fri Feb 12 23:17:38 2021 From: eosterlund at openjdk.java.net (Erik =?UTF-8?B?w5ZzdGVybHVuZA==?=) Date: Fri, 12 Feb 2021 23:17:38 GMT Subject: RFR: 8261448: Preserve GC stack watermark across safepoints in StackWalk [v2] In-Reply-To: References: Message-ID: On Fri, 12 Feb 2021 21:45:57 GMT, Roman Kennke wrote: >> I am observing the following assert: >> >> # Internal Error (/home/rkennke/src/openjdk/loom/src/hotspot/share/runtime/stackWatermark.cpp:178), pid=54418, tid=54534 >> # assert(is_frame_safe(f)) failed: Frame must be safe >> >> (see issue for full hs_err) >> >> In StackWalk::fetchNextBatch() we prepare the entire stack to be processed by calling StackWatermarkSet::finish_processing(jt, NULL, StackWatermarkKind::gc), but then subsequently, during frames scan, perform allocations to fill in the frame information (fill_in_frames => LiveFrameStream::fill_frame => fill_live_stackframe) at where we could safepoint for GC, which could reset the stack watermark. >> >> This is only relevant for GCs that use the StackWatermark, e.g. ZGC and Shenandoah at the moment. >> >> Solution is to preserve the stack-watermark across safepoints in StackWalk::fetchNextBatch(). StackWalk::fetchFirstBatch() doesn't look to be affected by this: it is not using the stack-watermark. >> >> Testing: >> - [x] StackWalk tests with Shenandoah/aggressive >> - [x] StackWalk tests with ZGC/aggressive >> - [ ] tier1 (+Shenandoah/ZGC) >> - [ ] tier2 (+Shenandoah/ZGC) > > Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: > > Make KeepStackGCProcessedMark reentrant; Place a KeepStackGCProcessedMark in StackWalker::fetchFirstBatch() Nesting code looks wrong. ------------- Changes requested by eosterlund (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2500 From kbarrett at openjdk.java.net Sat Feb 13 00:59:42 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Sat, 13 Feb 2021 00:59:42 GMT Subject: RFR: 8260414: Remove unused set_single_threaded_mode() method in task executor In-Reply-To: References: Message-ID: On Fri, 12 Feb 2021 22:32:41 GMT, Leo Korinth wrote: > This code is not used any more. Looks good, and trivial. ------------- Marked as reviewed by kbarrett (Reviewer). 
PR: https://git.openjdk.java.net/jdk/pull/2558 From stuefe at openjdk.java.net Sat Feb 13 04:38:39 2021 From: stuefe at openjdk.java.net (Thomas Stuefe) Date: Sat, 13 Feb 2021 04:38:39 GMT Subject: RFR: 8261661: gc/stress/TestReclaimStringsLeaksMemory.java fails because Reserved memory size is too big In-Reply-To: References: Message-ID: <_x78XJVdntIn9mTH3xs5vZZnwl0GAcAOL7IiMKReVI4=.860ccd6a-1dc3-4cdd-808c-beb89c28c804@github.com> On Fri, 12 Feb 2021 22:40:17 GMT, Daniel D. Daugherty wrote: >> Looks good. > > @kimbarrett and @coleenp just told me that I don't really need to wait for a Runtime/NMT review. This looks good. Sorry for the trouble. ------------- PR: https://git.openjdk.java.net/jdk/pull/2557 From ayang at openjdk.java.net Sat Feb 13 09:44:39 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Sat, 13 Feb 2021 09:44:39 GMT Subject: RFR: 8260414: Remove unused set_single_threaded_mode() method in task executor In-Reply-To: References: Message-ID: On Fri, 12 Feb 2021 22:32:41 GMT, Leo Korinth wrote: > This code is not used any more. Marked as reviewed by ayang (Author). ------------- PR: https://git.openjdk.java.net/jdk/pull/2558 From shade at openjdk.java.net Mon Feb 15 08:44:41 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Mon, 15 Feb 2021 08:44:41 GMT Subject: Integrated: 8261503: Shenandoah: reconsider verifier memory ordering In-Reply-To: References: Message-ID: On Wed, 10 Feb 2021 11:41:45 GMT, Aleksey Shipilev wrote: > Shenandoah verifier uses lots of atomic operations. Unfortunately, Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > In most cases, that is excessive for verifier, and "relaxed" would do. > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah This pull request has now been integrated. Changeset: 7c931591 Author: Aleksey Shipilev URL: https://git.openjdk.java.net/jdk/commit/7c931591 Stats: 8 lines in 1 file changed: 0 ins; 0 del; 8 mod 8261503: Shenandoah: reconsider verifier memory ordering Reviewed-by: zgu ------------- PR: https://git.openjdk.java.net/jdk/pull/2505 From shade at openjdk.java.net Mon Feb 15 08:45:40 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Mon, 15 Feb 2021 08:45:40 GMT Subject: Integrated: 8261496: Shenandoah: reconsider pacing updates memory ordering In-Reply-To: <_BlnOgWoSTjE1myt9WfuiZpM9hiIP7sGp38IJmzuyYg=.8a578dda-dbf7-4780-bc74-cf3710609005@github.com> References: <_BlnOgWoSTjE1myt9WfuiZpM9hiIP7sGp38IJmzuyYg=.8a578dda-dbf7-4780-bc74-cf3710609005@github.com> Message-ID: On Wed, 10 Feb 2021 10:13:47 GMT, Aleksey Shipilev wrote: > Shenandoah pacer uses atomic operations to update budget, progress, allocations seen. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > This is excessive for pacing, as we do not piggyback memory effects on it. All pacing updates can use "relaxed". > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah This pull request has now been integrated. 
Changeset: 4642730b Author: Aleksey Shipilev URL: https://git.openjdk.java.net/jdk/commit/4642730b Stats: 7 lines in 3 files changed: 0 ins; 0 del; 7 mod 8261496: Shenandoah: reconsider pacing updates memory ordering Reviewed-by: zgu ------------- PR: https://git.openjdk.java.net/jdk/pull/2501 From shade at openjdk.java.net Mon Feb 15 08:46:45 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Mon, 15 Feb 2021 08:46:45 GMT Subject: RFR: 8261501: Shenandoah: reconsider heap statistics memory ordering In-Reply-To: <0O1tXXs991770rhrpYioXIWr6m-OhDFMZINDiQ_UXc4=.92460035-468e-4bf5-97cb-bff58d1a2ede@github.com> References: <0O1tXXs991770rhrpYioXIWr6m-OhDFMZINDiQ_UXc4=.92460035-468e-4bf5-97cb-bff58d1a2ede@github.com> Message-ID: On Wed, 10 Feb 2021 11:10:35 GMT, Aleksey Shipilev wrote: > ShenandoahHeap collects heap-wide statistics (used, committed, etc). It does so by atomically updating them with default CASes. Unfortunately, Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > This is excessive for statistics gathering, and "relaxed" should be just as good. > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah Friendly reminder. ------------- PR: https://git.openjdk.java.net/jdk/pull/2504 From shade at openjdk.java.net Mon Feb 15 08:47:44 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Mon, 15 Feb 2021 08:47:44 GMT Subject: Integrated: 8261493: Shenandoah: reconsider bitmap access memory ordering In-Reply-To: References: Message-ID: On Wed, 10 Feb 2021 09:32:18 GMT, Aleksey Shipilev wrote: > Shenandoah currently uses its own marking bitmap (added by JDK-8254315). It accesses the marking bitmap with "acquire" for reads and "conservative" for updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > I think both are actually excessive for marking bitmap accesses: we do not piggyback object updates on it, the atomics there are only to guarantee the access atomicity and CAS updates to bits. It seems "relaxed" is enough for marking bitmap accesses. > > Sample run with "compact" (frequent GC cycles) on SPECjvm2008:compiler.sunflow on AArch64: > > # Baseline > # Baseline > [146.028s][info][gc,stats] Concurrent Marking = 50.315 s (a = 258024 us) (n = 195) (lvls, us = 31836, 230469, 273438, 306641, 464255) > [141.458s][info][gc,stats] Concurrent Marking = 47.819 s (a = 242737 us) (n = 197) (lvls, us = 42773, 197266, 267578, 287109, 433948) > [144.108s][info][gc,stats] Concurrent Marking = 49.806 s (a = 250283 us) (n = 199) (lvls, us = 32227, 201172, 267578, 296875, 448549) > > # Patched > [144.238s][info][gc,stats] Concurrent Marking = 46.627 s (a = 220981 us) (n = 211) (lvls, us = 24414, 197266, 238281, 259766, 345112) > [138.406s][info][gc,stats] Concurrent Marking = 45.022 s (a = 227383 us) (n = 198) (lvls, us = 20508, 205078, 244141, 271484, 427658) > [140.950s][info][gc,stats] Concurrent Marking = 45.073 s (a = 222036 us) (n = 203) (lvls, us = 21680, 181641, 240234, 265625, 375750) > > Average time goes down, total marking time goes down. > > Additional testing: > - [x] Linux x86_64 `hotspot_gc_shenandoah` > - [x] Linux AArch64 `hotspot_gc_shenandoah` > - [x] Linux AArch64 `tier1` with Shenandoah This pull request has now been integrated. 
Changeset: 745c0b91 Author: Aleksey Shipilev URL: https://git.openjdk.java.net/jdk/commit/745c0b91 Stats: 18 lines in 2 files changed: 0 ins; 14 del; 4 mod 8261493: Shenandoah: reconsider bitmap access memory ordering Reviewed-by: rkennke, zgu ------------- PR: https://git.openjdk.java.net/jdk/pull/2497 From shade at openjdk.java.net Mon Feb 15 08:47:42 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Mon, 15 Feb 2021 08:47:42 GMT Subject: Integrated: 8261504: Shenandoah: reconsider ShenandoahJavaThreadsIterator::claim memory ordering In-Reply-To: References: Message-ID: On Wed, 10 Feb 2021 12:00:38 GMT, Aleksey Shipilev wrote: > JDK-8256298 added the thread iterator for thread roots, and I don't think we need the Hotspot's default memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. The simple "relaxed" should do. > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah This pull request has now been integrated. Changeset: df0897ea Author: Aleksey Shipilev URL: https://git.openjdk.java.net/jdk/commit/df0897ea Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod 8261504: Shenandoah: reconsider ShenandoahJavaThreadsIterator::claim memory ordering Reviewed-by: zgu ------------- PR: https://git.openjdk.java.net/jdk/pull/2506 From shade at openjdk.java.net Mon Feb 15 08:47:44 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Mon, 15 Feb 2021 08:47:44 GMT Subject: Integrated: 8261500: Shenandoah: reconsider region live data memory ordering In-Reply-To: References: Message-ID: On Wed, 10 Feb 2021 10:40:26 GMT, Aleksey Shipilev wrote: > Current Shenandoah region live data tracking uses default CAS updates to achieve atomicity of updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > This seems to be excessive for live data tracking, and "relaxed" could be used instead. The only serious user of that data is collection set chooser, which runs at safepoint and so everything should be quiescent when that happens. > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah This pull request has now been integrated. Changeset: c6eedda8 Author: Aleksey Shipilev URL: https://git.openjdk.java.net/jdk/commit/c6eedda8 Stats: 3 lines in 1 file changed: 0 ins; 0 del; 3 mod 8261500: Shenandoah: reconsider region live data memory ordering Reviewed-by: zgu ------------- PR: https://git.openjdk.java.net/jdk/pull/2503 From lkorinth at openjdk.java.net Mon Feb 15 08:55:39 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Mon, 15 Feb 2021 08:55:39 GMT Subject: RFR: 8260414: Remove unused set_single_threaded_mode() method in task executor In-Reply-To: References: Message-ID: On Sat, 13 Feb 2021 09:41:33 GMT, Albert Mingkun Yang wrote: >> This code is not used any more. > > Marked as reviewed by ayang (Author). Thanks Kim and Albert! 
------------- PR: https://git.openjdk.java.net/jdk/pull/2558 From lkorinth at openjdk.java.net Mon Feb 15 08:55:40 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Mon, 15 Feb 2021 08:55:40 GMT Subject: Integrated: 8260414: Remove unused set_single_threaded_mode() method in task executor In-Reply-To: References: Message-ID: On Fri, 12 Feb 2021 22:32:41 GMT, Leo Korinth wrote: > This code is not used any more. This pull request has now been integrated. Changeset: 3882fda8 Author: Leo Korinth URL: https://git.openjdk.java.net/jdk/commit/3882fda8 Stats: 9 lines in 2 files changed: 0 ins; 8 del; 1 mod 8260414: Remove unused set_single_threaded_mode() method in task executor Reviewed-by: kbarrett, ayang ------------- PR: https://git.openjdk.java.net/jdk/pull/2558 From stefank at openjdk.java.net Mon Feb 15 09:28:41 2021 From: stefank at openjdk.java.net (Stefan Karlsson) Date: Mon, 15 Feb 2021 09:28:41 GMT Subject: RFR: 8261448: Preserve GC stack watermark across safepoints in StackWalk [v2] In-Reply-To: References: Message-ID: On Fri, 12 Feb 2021 23:14:47 GMT, Erik ?sterlund wrote: >> Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: >> >> Make KeepStackGCProcessedMark reentrant; Place a KeepStackGCProcessedMark in StackWalker::fetchFirstBatch() > > Nesting code looks wrong. I incorrectly read Erik's comment as "Nesting code looks **good**", so I created a unit test to show the problem with the patch: https://github.com/stefank/jdk/commit/8760f1b0409b3cccf76a8ea417b90e66da31af72 Maybe you could build a few more test based on this? ------------- PR: https://git.openjdk.java.net/jdk/pull/2500 From jaroslav.bachorik at datadoghq.com Mon Feb 15 09:44:04 2021 From: jaroslav.bachorik at datadoghq.com (=?UTF-8?Q?Jaroslav_Bachor=C3=ADk?=) Date: Mon, 15 Feb 2021 10:44:04 +0100 Subject: Can GC implementations provide a cheap estimation of live set size? In-Reply-To: References: Message-ID: Hi again, I continued experimenting with Shenandoah and ZGC which already are tracking liveness. I am emitting a (partially filled) GCHeapSummary JFR event to capture used/live sizes. For Shenandoah the event is emitted at the very end of the `ShenandoahConcurrentGC::op_final_mark()` method and for ZGC it is the `ZMark::end()` method. The exact changes can be checked via branch comparison (https://github.com/openjdk/jdk/compare/master...DataDog:jb/live_set_1) but bear in mind that this is just an experimental code with no intention being checked in in its current form. Unfortunately, when I run an application on such modified JVM and collect a JFR recording the live set size numbers seem a bit 'low' - eg. on both ZGC and Shenandoah (using an already available liveness info) the reported liveness is ~50% of the reported usage. Is there a good explanation for this? Thanks! -JB- On Thu, Feb 11, 2021 at 7:09 PM Jaroslav Bachor?k wrote: > > On Thu, Feb 11, 2021 at 6:55 PM Roman Kennke wrote: > > > > Notice that liveness information is only somewhat reliable right after > > marking. In Shenandoah, this is in the final-mark pause, and then the > > Yes, I understand this. What I am looking at is to have something like > 'last known liveness' value - captured at a well defined point and > providing an estimate within the bounds of GC implementation. > > > program is at a safepoint already. This is where you'd want to emit a > > JMX event or something similar. 
You can't simply query a counter and > > assume it represents current liveness in the middle or outside of GC > > cycle. This should be true for all GCs. > > > > For Serial and Parallel I am not sure at all that you can do this. > > AFAIK, they don't count liveness at all. > > > > Roman > > > > > Hi Roman, > > > > > > Thanks for your response. I checked ZGC implementation and, indeed, it > > > is very easy to get the liveness information just by extending > > > `ZStatHeap` class to report the last valid value of > > > `_at_mark_end.live`. > > > > > > I am also able to get this info from Shenandoah, although my first > > > attempt still involves a safepointing VM operation since I need to > > > iterate over regions to get the liveness info for each of them and sum > > > it up. I think it is still an acceptable trade-off, though. > > > > > > The next one in the queue is the Serial GC. My assumptions, based on > > > reading the code, are that for young gen 'live = used' at the end of > > > DefNewGeneration::collect() method and for old gen 'live = used - > > > slack' (slack is the cumulative size of objects considered to be alive > > > for the purpose of compaction although they are really dead - see > > > CompactibleSpace::scan_and_forward()). Does this sound reasonable? > > > > > > I will post my findings for Parallel GC and G1 GC later. > > > > > > Cheers, > > > > > > -JB- > > > > > > On Wed, Feb 10, 2021 at 11:34 AM Roman Kennke wrote: > > >> > > >> Hello Jaroslav, > > >> > > >>> In connection with https://bugs.openjdk.java.net/browse/JDK-8258431 I > > >>> am trying to figure out whether providing a cheap estimation of live > > >>> set size is something actually achievable across various GC > > >>> implementations. > > >>> > > >>> What I am looking at is piggy-backing on a concurrent mark task to get > > >>> the summary size of live objects - using the 'straight-forward' > > >>> heap-inspection like approach is prohibitively expensive. > > >> > > >> In Shenandoah, this information is already collected during concurrent > > >> marking. We currently don't print it directly, but we could certainly do > > >> that. I'll look into implementing it. I'll also look into exposing > > >> liveness info via JMX. > > >> > > >> I'm not quite sure about G1: that information would only be collected > > >> during mixed or full collections. I am not sure if G1 prints it, though. > > >> > > >> ZGC prints this under -Xlog:gc+heap: > > >> > > >> [6,502s][info][gc,heap ] GC(0) Mark Start > > >> Mark End Relocate Start Relocate End High > > >> Low > > >> [6,502s][info][gc,heap ] GC(0) Capacity: 834M (10%) > > >> 1076M (13%) 1092M (14%) 1092M (14%) 1092M (14%) > > >> 834M (10%) > > >> [6,502s][info][gc,heap ] GC(0) Free: 7154M (90%) > > >> 6912M (87%) 6916M (87%) 7388M (92%) 7388M (92%) > > >> 6896M (86%) > > >> [6,502s][info][gc,heap ] GC(0) Used: 834M (10%) > > >> 1076M (13%) 1072M (13%) 600M (8%) 1092M (14%) > > >> 600M (8%) > > >> [6,502s][info][gc,heap ] GC(0) Live: - > > >> 195M (2%) 195M (2%) 195M (2%) - > > >> - > > >> [6,502s][info][gc,heap ] GC(0) Allocated: - > > >> 242M (3%) 270M (3%) 380M (5%) - > > >> - > > >> [6,502s][info][gc,heap ] GC(0) Garbage: - > > >> 638M (8%) 606M (8%) 24M (0%) - > > >> - > > >> [6,502s][info][gc,heap ] GC(0) Reclaimed: - > > >> - 32M (0%) 614M (8%) - > > >> - > > >> > > >> I hope that is useful? 
> > >> > > >> Thanks, > > >> Roman > > >> > > > > > From per.liden at oracle.com Mon Feb 15 10:24:03 2021 From: per.liden at oracle.com (Per Liden) Date: Mon, 15 Feb 2021 11:24:03 +0100 Subject: Can GC implementations provide a cheap estimation of live set size? In-Reply-To: References: Message-ID: <440f220b-c1ae-574b-741f-c52bdb1230e2@oracle.com> Hi, On 2/15/21 10:44 AM, Jaroslav Bachor?k wrote: > Hi again, > > I continued experimenting with Shenandoah and ZGC which already are > tracking liveness. I am emitting a (partially filled) GCHeapSummary > JFR event to capture used/live sizes. > For Shenandoah the event is emitted at the very end of the > `ShenandoahConcurrentGC::op_final_mark()` method and for ZGC it is the > `ZMark::end()` method. The exact changes can be checked via branch > comparison (https://github.com/openjdk/jdk/compare/master...DataDog:jb/live_set_1) > but bear in mind that this is just an experimental code with no > intention being checked in in its current form. > > Unfortunately, when I run an application on such modified JVM and > collect a JFR recording the live set size numbers seem a bit 'low' - > eg. on both ZGC and Shenandoah (using an already available liveness > info) the reported liveness is ~50% of the reported usage. Is there a > good explanation for this? When you create the GCHeapSummary, the "live" value reflects what was live after marking, while the "used" value reflects the usage when the GC cycle ended. So, after marking ended, some amount of garbage was likely reclaimed, but then new objects were also allocated. For ZGC (don't know if Shenandoah shows this), you can see details of how much was reclaimed and how much was allocated in the GC log. /Per > > Thanks! > > -JB- > > On Thu, Feb 11, 2021 at 7:09 PM Jaroslav Bachor?k > wrote: >> >> On Thu, Feb 11, 2021 at 6:55 PM Roman Kennke wrote: >>> >>> Notice that liveness information is only somewhat reliable right after >>> marking. In Shenandoah, this is in the final-mark pause, and then the >> >> Yes, I understand this. What I am looking at is to have something like >> 'last known liveness' value - captured at a well defined point and >> providing an estimate within the bounds of GC implementation. >> >>> program is at a safepoint already. This is where you'd want to emit a >>> JMX event or something similar. You can't simply query a counter and >>> assume it represents current liveness in the middle or outside of GC >>> cycle. This should be true for all GCs. >>> >>> For Serial and Parallel I am not sure at all that you can do this. >>> AFAIK, they don't count liveness at all. >>> >>> Roman >>> >>>> Hi Roman, >>>> >>>> Thanks for your response. I checked ZGC implementation and, indeed, it >>>> is very easy to get the liveness information just by extending >>>> `ZStatHeap` class to report the last valid value of >>>> `_at_mark_end.live`. >>>> >>>> I am also able to get this info from Shenandoah, although my first >>>> attempt still involves a safepointing VM operation since I need to >>>> iterate over regions to get the liveness info for each of them and sum >>>> it up. I think it is still an acceptable trade-off, though. >>>> >>>> The next one in the queue is the Serial GC. 
My assumptions, based on >>>> reading the code, are that for young gen 'live = used' at the end of >>>> DefNewGeneration::collect() method and for old gen 'live = used - >>>> slack' (slack is the cumulative size of objects considered to be alive >>>> for the purpose of compaction although they are really dead - see >>>> CompactibleSpace::scan_and_forward()). Does this sound reasonable? >>>> >>>> I will post my findings for Parallel GC and G1 GC later. >>>> >>>> Cheers, >>>> >>>> -JB- >>>> >>>> On Wed, Feb 10, 2021 at 11:34 AM Roman Kennke wrote: >>>>> >>>>> Hello Jaroslav, >>>>> >>>>>> In connection with https://bugs.openjdk.java.net/browse/JDK-8258431 I >>>>>> am trying to figure out whether providing a cheap estimation of live >>>>>> set size is something actually achievable across various GC >>>>>> implementations. >>>>>> >>>>>> What I am looking at is piggy-backing on a concurrent mark task to get >>>>>> the summary size of live objects - using the 'straight-forward' >>>>>> heap-inspection like approach is prohibitively expensive. >>>>> >>>>> In Shenandoah, this information is already collected during concurrent >>>>> marking. We currently don't print it directly, but we could certainly do >>>>> that. I'll look into implementing it. I'll also look into exposing >>>>> liveness info via JMX. >>>>> >>>>> I'm not quite sure about G1: that information would only be collected >>>>> during mixed or full collections. I am not sure if G1 prints it, though. >>>>> >>>>> ZGC prints this under -Xlog:gc+heap: >>>>> >>>>> [6,502s][info][gc,heap ] GC(0) Mark Start >>>>> Mark End Relocate Start Relocate End High >>>>> Low >>>>> [6,502s][info][gc,heap ] GC(0) Capacity: 834M (10%) >>>>> 1076M (13%) 1092M (14%) 1092M (14%) 1092M (14%) >>>>> 834M (10%) >>>>> [6,502s][info][gc,heap ] GC(0) Free: 7154M (90%) >>>>> 6912M (87%) 6916M (87%) 7388M (92%) 7388M (92%) >>>>> 6896M (86%) >>>>> [6,502s][info][gc,heap ] GC(0) Used: 834M (10%) >>>>> 1076M (13%) 1072M (13%) 600M (8%) 1092M (14%) >>>>> 600M (8%) >>>>> [6,502s][info][gc,heap ] GC(0) Live: - >>>>> 195M (2%) 195M (2%) 195M (2%) - >>>>> - >>>>> [6,502s][info][gc,heap ] GC(0) Allocated: - >>>>> 242M (3%) 270M (3%) 380M (5%) - >>>>> - >>>>> [6,502s][info][gc,heap ] GC(0) Garbage: - >>>>> 638M (8%) 606M (8%) 24M (0%) - >>>>> - >>>>> [6,502s][info][gc,heap ] GC(0) Reclaimed: - >>>>> - 32M (0%) 614M (8%) - >>>>> - >>>>> >>>>> I hope that is useful? >>>>> >>>>> Thanks, >>>>> Roman >>>>> >>>> >>> From jaroslav.bachorik at datadoghq.com Mon Feb 15 10:47:01 2021 From: jaroslav.bachorik at datadoghq.com (=?UTF-8?Q?Jaroslav_Bachor=C3=ADk?=) Date: Mon, 15 Feb 2021 11:47:01 +0100 Subject: Can GC implementations provide a cheap estimation of live set size? In-Reply-To: <440f220b-c1ae-574b-741f-c52bdb1230e2@oracle.com> References: <440f220b-c1ae-574b-741f-c52bdb1230e2@oracle.com> Message-ID: On Mon, Feb 15, 2021 at 11:24 AM Per Liden wrote: > > Hi, > > On 2/15/21 10:44 AM, Jaroslav Bachor?k wrote: > > Hi again, > > > > I continued experimenting with Shenandoah and ZGC which already are > > tracking liveness. I am emitting a (partially filled) GCHeapSummary > > JFR event to capture used/live sizes. > > For Shenandoah the event is emitted at the very end of the > > `ShenandoahConcurrentGC::op_final_mark()` method and for ZGC it is the > > `ZMark::end()` method. 
The exact changes can be checked via branch > > comparison (https://github.com/openjdk/jdk/compare/master...DataDog:jb/live_set_1) > > but bear in mind that this is just an experimental code with no > > intention being checked in in its current form. > > > > Unfortunately, when I run an application on such modified JVM and > > collect a JFR recording the live set size numbers seem a bit 'low' - > > eg. on both ZGC and Shenandoah (using an already available liveness > > info) the reported liveness is ~50% of the reported usage. Is there a > > good explanation for this? > > When you create the GCHeapSummary, the "live" value reflects what was > live after marking, while the "used" value reflects the usage when the > GC cycle ended. So, after marking ended, some amount of garbage was > likely reclaimed, but then new objects were also allocated. For ZGC > (don't know if Shenandoah shows this), you can see details of how much > was reclaimed and how much was allocated in the GC log. Definitely - it's just that a diff of >100MB (eg. for ZGC 350MB used vs. 170MB live) struck me as a bit suspicious. But maybe it is expected. -JB- > > /Per > > > > > Thanks! > > > > -JB- > > > > On Thu, Feb 11, 2021 at 7:09 PM Jaroslav Bachor?k > > wrote: > >> > >> On Thu, Feb 11, 2021 at 6:55 PM Roman Kennke wrote: > >>> > >>> Notice that liveness information is only somewhat reliable right after > >>> marking. In Shenandoah, this is in the final-mark pause, and then the > >> > >> Yes, I understand this. What I am looking at is to have something like > >> 'last known liveness' value - captured at a well defined point and > >> providing an estimate within the bounds of GC implementation. > >> > >>> program is at a safepoint already. This is where you'd want to emit a > >>> JMX event or something similar. You can't simply query a counter and > >>> assume it represents current liveness in the middle or outside of GC > >>> cycle. This should be true for all GCs. > >>> > >>> For Serial and Parallel I am not sure at all that you can do this. > >>> AFAIK, they don't count liveness at all. > >>> > >>> Roman > >>> > >>>> Hi Roman, > >>>> > >>>> Thanks for your response. I checked ZGC implementation and, indeed, it > >>>> is very easy to get the liveness information just by extending > >>>> `ZStatHeap` class to report the last valid value of > >>>> `_at_mark_end.live`. > >>>> > >>>> I am also able to get this info from Shenandoah, although my first > >>>> attempt still involves a safepointing VM operation since I need to > >>>> iterate over regions to get the liveness info for each of them and sum > >>>> it up. I think it is still an acceptable trade-off, though. > >>>> > >>>> The next one in the queue is the Serial GC. My assumptions, based on > >>>> reading the code, are that for young gen 'live = used' at the end of > >>>> DefNewGeneration::collect() method and for old gen 'live = used - > >>>> slack' (slack is the cumulative size of objects considered to be alive > >>>> for the purpose of compaction although they are really dead - see > >>>> CompactibleSpace::scan_and_forward()). Does this sound reasonable? > >>>> > >>>> I will post my findings for Parallel GC and G1 GC later. 
> >>>> > >>>> Cheers, > >>>> > >>>> -JB- > >>>> > >>>> On Wed, Feb 10, 2021 at 11:34 AM Roman Kennke wrote: > >>>>> > >>>>> Hello Jaroslav, > >>>>> > >>>>>> In connection with https://bugs.openjdk.java.net/browse/JDK-8258431 I > >>>>>> am trying to figure out whether providing a cheap estimation of live > >>>>>> set size is something actually achievable across various GC > >>>>>> implementations. > >>>>>> > >>>>>> What I am looking at is piggy-backing on a concurrent mark task to get > >>>>>> the summary size of live objects - using the 'straight-forward' > >>>>>> heap-inspection like approach is prohibitively expensive. > >>>>> > >>>>> In Shenandoah, this information is already collected during concurrent > >>>>> marking. We currently don't print it directly, but we could certainly do > >>>>> that. I'll look into implementing it. I'll also look into exposing > >>>>> liveness info via JMX. > >>>>> > >>>>> I'm not quite sure about G1: that information would only be collected > >>>>> during mixed or full collections. I am not sure if G1 prints it, though. > >>>>> > >>>>> ZGC prints this under -Xlog:gc+heap: > >>>>> > >>>>> [6,502s][info][gc,heap ] GC(0) Mark Start > >>>>> Mark End Relocate Start Relocate End High > >>>>> Low > >>>>> [6,502s][info][gc,heap ] GC(0) Capacity: 834M (10%) > >>>>> 1076M (13%) 1092M (14%) 1092M (14%) 1092M (14%) > >>>>> 834M (10%) > >>>>> [6,502s][info][gc,heap ] GC(0) Free: 7154M (90%) > >>>>> 6912M (87%) 6916M (87%) 7388M (92%) 7388M (92%) > >>>>> 6896M (86%) > >>>>> [6,502s][info][gc,heap ] GC(0) Used: 834M (10%) > >>>>> 1076M (13%) 1072M (13%) 600M (8%) 1092M (14%) > >>>>> 600M (8%) > >>>>> [6,502s][info][gc,heap ] GC(0) Live: - > >>>>> 195M (2%) 195M (2%) 195M (2%) - > >>>>> - > >>>>> [6,502s][info][gc,heap ] GC(0) Allocated: - > >>>>> 242M (3%) 270M (3%) 380M (5%) - > >>>>> - > >>>>> [6,502s][info][gc,heap ] GC(0) Garbage: - > >>>>> 638M (8%) 606M (8%) 24M (0%) - > >>>>> - > >>>>> [6,502s][info][gc,heap ] GC(0) Reclaimed: - > >>>>> - 32M (0%) 614M (8%) - > >>>>> - > >>>>> > >>>>> I hope that is useful? > >>>>> > >>>>> Thanks, > >>>>> Roman > >>>>> > >>>> > >>> From per.liden at oracle.com Mon Feb 15 10:58:52 2021 From: per.liden at oracle.com (Per Liden) Date: Mon, 15 Feb 2021 11:58:52 +0100 Subject: Can GC implementations provide a cheap estimation of live set size? In-Reply-To: References: <440f220b-c1ae-574b-741f-c52bdb1230e2@oracle.com> Message-ID: <3496985f-7041-c792-7d5b-d7d569836437@oracle.com> On 2/15/21 11:47 AM, Jaroslav Bachor?k wrote: > On Mon, Feb 15, 2021 at 11:24 AM Per Liden wrote: >> >> Hi, >> >> On 2/15/21 10:44 AM, Jaroslav Bachor?k wrote: >>> Hi again, >>> >>> I continued experimenting with Shenandoah and ZGC which already are >>> tracking liveness. I am emitting a (partially filled) GCHeapSummary >>> JFR event to capture used/live sizes. >>> For Shenandoah the event is emitted at the very end of the >>> `ShenandoahConcurrentGC::op_final_mark()` method and for ZGC it is the >>> `ZMark::end()` method. The exact changes can be checked via branch >>> comparison (https://github.com/openjdk/jdk/compare/master...DataDog:jb/live_set_1) >>> but bear in mind that this is just an experimental code with no >>> intention being checked in in its current form. >>> >>> Unfortunately, when I run an application on such modified JVM and >>> collect a JFR recording the live set size numbers seem a bit 'low' - >>> eg. on both ZGC and Shenandoah (using an already available liveness >>> info) the reported liveness is ~50% of the reported usage. 
Is there a >>> good explanation for this? >> >> When you create the GCHeapSummary, the "live" value reflects what was >> live after marking, while the "used" value reflects the usage when the >> GC cycle ended. So, after marking ended, some amount of garbage was >> likely reclaimed, but then new objects were also allocated. For ZGC >> (don't know if Shenandoah shows this), you can see details of how much >> was reclaimed and how much was allocated in the GC log. > > Definitely - it's just that a diff of >100MB (eg. for ZGC 350MB used > vs. 170MB live) struck me as a bit suspicious. But maybe it is > expected. It's impossible to say if it's expected or not, without knowing what the application is doing, it's allocation rate, etc. The application could be allocating several gigabytes per second, in which case the diff could be large. However, if the application is just idling and isn't allocating anything, then live is expected to be equal (or close to equal) to used. /Per > > -JB- > >> >> /Per >> >>> >>> Thanks! >>> >>> -JB- >>> >>> On Thu, Feb 11, 2021 at 7:09 PM Jaroslav Bachor?k >>> wrote: >>>> >>>> On Thu, Feb 11, 2021 at 6:55 PM Roman Kennke wrote: >>>>> >>>>> Notice that liveness information is only somewhat reliable right after >>>>> marking. In Shenandoah, this is in the final-mark pause, and then the >>>> >>>> Yes, I understand this. What I am looking at is to have something like >>>> 'last known liveness' value - captured at a well defined point and >>>> providing an estimate within the bounds of GC implementation. >>>> >>>>> program is at a safepoint already. This is where you'd want to emit a >>>>> JMX event or something similar. You can't simply query a counter and >>>>> assume it represents current liveness in the middle or outside of GC >>>>> cycle. This should be true for all GCs. >>>>> >>>>> For Serial and Parallel I am not sure at all that you can do this. >>>>> AFAIK, they don't count liveness at all. >>>>> >>>>> Roman >>>>> >>>>>> Hi Roman, >>>>>> >>>>>> Thanks for your response. I checked ZGC implementation and, indeed, it >>>>>> is very easy to get the liveness information just by extending >>>>>> `ZStatHeap` class to report the last valid value of >>>>>> `_at_mark_end.live`. >>>>>> >>>>>> I am also able to get this info from Shenandoah, although my first >>>>>> attempt still involves a safepointing VM operation since I need to >>>>>> iterate over regions to get the liveness info for each of them and sum >>>>>> it up. I think it is still an acceptable trade-off, though. >>>>>> >>>>>> The next one in the queue is the Serial GC. My assumptions, based on >>>>>> reading the code, are that for young gen 'live = used' at the end of >>>>>> DefNewGeneration::collect() method and for old gen 'live = used - >>>>>> slack' (slack is the cumulative size of objects considered to be alive >>>>>> for the purpose of compaction although they are really dead - see >>>>>> CompactibleSpace::scan_and_forward()). Does this sound reasonable? >>>>>> >>>>>> I will post my findings for Parallel GC and G1 GC later. >>>>>> >>>>>> Cheers, >>>>>> >>>>>> -JB- >>>>>> >>>>>> On Wed, Feb 10, 2021 at 11:34 AM Roman Kennke wrote: >>>>>>> >>>>>>> Hello Jaroslav, >>>>>>> >>>>>>>> In connection with https://bugs.openjdk.java.net/browse/JDK-8258431 I >>>>>>>> am trying to figure out whether providing a cheap estimation of live >>>>>>>> set size is something actually achievable across various GC >>>>>>>> implementations. 
>>>>>>>> >>>>>>>> What I am looking at is piggy-backing on a concurrent mark task to get >>>>>>>> the summary size of live objects - using the 'straight-forward' >>>>>>>> heap-inspection like approach is prohibitively expensive. >>>>>>> >>>>>>> In Shenandoah, this information is already collected during concurrent >>>>>>> marking. We currently don't print it directly, but we could certainly do >>>>>>> that. I'll look into implementing it. I'll also look into exposing >>>>>>> liveness info via JMX. >>>>>>> >>>>>>> I'm not quite sure about G1: that information would only be collected >>>>>>> during mixed or full collections. I am not sure if G1 prints it, though. >>>>>>> >>>>>>> ZGC prints this under -Xlog:gc+heap: >>>>>>> >>>>>>> [6,502s][info][gc,heap ] GC(0) Mark Start >>>>>>> Mark End Relocate Start Relocate End High >>>>>>> Low >>>>>>> [6,502s][info][gc,heap ] GC(0) Capacity: 834M (10%) >>>>>>> 1076M (13%) 1092M (14%) 1092M (14%) 1092M (14%) >>>>>>> 834M (10%) >>>>>>> [6,502s][info][gc,heap ] GC(0) Free: 7154M (90%) >>>>>>> 6912M (87%) 6916M (87%) 7388M (92%) 7388M (92%) >>>>>>> 6896M (86%) >>>>>>> [6,502s][info][gc,heap ] GC(0) Used: 834M (10%) >>>>>>> 1076M (13%) 1072M (13%) 600M (8%) 1092M (14%) >>>>>>> 600M (8%) >>>>>>> [6,502s][info][gc,heap ] GC(0) Live: - >>>>>>> 195M (2%) 195M (2%) 195M (2%) - >>>>>>> - >>>>>>> [6,502s][info][gc,heap ] GC(0) Allocated: - >>>>>>> 242M (3%) 270M (3%) 380M (5%) - >>>>>>> - >>>>>>> [6,502s][info][gc,heap ] GC(0) Garbage: - >>>>>>> 638M (8%) 606M (8%) 24M (0%) - >>>>>>> - >>>>>>> [6,502s][info][gc,heap ] GC(0) Reclaimed: - >>>>>>> - 32M (0%) 614M (8%) - >>>>>>> - >>>>>>> >>>>>>> I hope that is useful? >>>>>>> >>>>>>> Thanks, >>>>>>> Roman >>>>>>> >>>>>> >>>>> From cgo at openjdk.java.net Mon Feb 15 13:29:53 2021 From: cgo at openjdk.java.net (Christoph =?UTF-8?B?R8O2dHRzY2hrZXM=?=) Date: Mon, 15 Feb 2021 13:29:53 GMT Subject: RFR: 8261752: Multiple GC test are missing memory requirements Message-ID: I used systemd to figure out which memory requirement makes sense for which test: $ systemd-run --user --scope -p MemoryMax=768M -p MemorySwapMax=0 /usr/bin/make TEST="..." test Tests succeeding with `768M` of MemoryMax got a requirement of 1G, all others got 2G and succeeded with a MemoryMax of 1536M. ------------- Commit messages: - Adds memory requirements. 
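In practice the fix boils down to adding a memory requirement line to each affected test header, along these lines (illustrative only, assuming the usual jtreg @requires syntax; the exact tests and thresholds are in the PR diff):

```
 * @requires os.maxMemory >= 1G
```

with `>= 2G` for the tests that needed more than 768M to pass.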
Changes: https://git.openjdk.java.net/jdk/pull/2575/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2575&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261752 Stats: 7 lines in 7 files changed: 2 ins; 0 del; 5 mod Patch: https://git.openjdk.java.net/jdk/pull/2575.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2575/head:pull/2575 PR: https://git.openjdk.java.net/jdk/pull/2575 From rkennke at openjdk.java.net Mon Feb 15 15:20:58 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 15 Feb 2021 15:20:58 GMT Subject: RFR: 8261448: Preserve GC stack watermark across safepoints in StackWalk [v3] In-Reply-To: References: Message-ID: > I am observing the following assert: > > # Internal Error (/home/rkennke/src/openjdk/loom/src/hotspot/share/runtime/stackWatermark.cpp:178), pid=54418, tid=54534 > # assert(is_frame_safe(f)) failed: Frame must be safe > > (see issue for full hs_err) > > In StackWalk::fetchNextBatch() we prepare the entire stack to be processed by calling StackWatermarkSet::finish_processing(jt, NULL, StackWatermarkKind::gc), but then subsequently, during frames scan, perform allocations to fill in the frame information (fill_in_frames => LiveFrameStream::fill_frame => fill_live_stackframe) at where we could safepoint for GC, which could reset the stack watermark. > > This is only relevant for GCs that use the StackWatermark, e.g. ZGC and Shenandoah at the moment. > > Solution is to preserve the stack-watermark across safepoints in StackWalk::fetchNextBatch(). StackWalk::fetchFirstBatch() doesn't look to be affected by this: it is not using the stack-watermark. > > Testing: > - [x] StackWalk tests with Shenandoah/aggressive > - [x] StackWalk tests with ZGC/aggressive > - [ ] tier1 (+Shenandoah/ZGC) > - [ ] tier2 (+Shenandoah/ZGC) Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: Make KeepStackGCProcessedMark non-reentrant again ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2500/files - new: https://git.openjdk.java.net/jdk/pull/2500/files/6946499c..345f78b4 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2500&range=02 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2500&range=01-02 Stats: 11 lines in 3 files changed: 0 ins; 9 del; 2 mod Patch: https://git.openjdk.java.net/jdk/pull/2500.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2500/head:pull/2500 PR: https://git.openjdk.java.net/jdk/pull/2500 From rkennke at openjdk.java.net Mon Feb 15 15:20:59 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 15 Feb 2021 15:20:59 GMT Subject: RFR: 8261448: Preserve GC stack watermark across safepoints in StackWalk [v2] In-Reply-To: References: Message-ID: <2KGNm2sghEHT4velRWjE5yMCU5lBdvYhk2UkPUZktV8=.01c29e94-e5e6-47b1-815b-e327076d8c74@github.com> On Mon, 15 Feb 2021 09:26:03 GMT, Stefan Karlsson wrote: >> Nesting code looks wrong. > > I incorrectly read Erik's comment as "Nesting code looks **good**", so I created a unit test to show the problem with the patch: > https://github.com/stefank/jdk/commit/8760f1b0409b3cccf76a8ea417b90e66da31af72 > > Maybe you could build a few more test based on this? > I think this solution is wrong, regarding nesting. There is only a single node but it looks like you think there are multiple. The result is seemingly that the unlink function won't unlink anything, which permanently disables incremental stack scanning on that thread. 
> Is there any way the mark can be placed closer to the problematic allocation so we don't need nesting? I just realized that the reentrancy comes from the Java call lower in fetchFirstBatch(). The problem can be easily avoided by putting the KeepStackGCProcessedMark in sensible scope that excludes the call. ------------- PR: https://git.openjdk.java.net/jdk/pull/2500 From cgo at openjdk.java.net Mon Feb 15 15:25:54 2021 From: cgo at openjdk.java.net (Christoph =?UTF-8?B?R8O2dHRzY2hrZXM=?=) Date: Mon, 15 Feb 2021 15:25:54 GMT Subject: RFR: 8261758: [TESTBUG] gc/g1/TestGCLogMessages.java fails if ergonomics detect too small InitialHeapSize Message-ID: Adds an explicit -Xms to one part of the test case, to not rely on ergonomics to detect the correct InitialHeapSize. It looks like one part of the whole test case implicitly relied on the fact, that `InitialHeapSize` == `MaxHeapSize`. Since the `MaxHeapSize` is very small (32M), this is almost always true. But if the test device has less than 2G of memory, the ergonomics configure the `InitialHeapSize` to be smaller than the `MaxHeapSize`. ------------- Commit messages: - Fixes indention. - Adds -Xms = -Xmx to not rely on ergonomics. Changes: https://git.openjdk.java.net/jdk/pull/2577/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2577&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261758 Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod Patch: https://git.openjdk.java.net/jdk/pull/2577.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2577/head:pull/2577 PR: https://git.openjdk.java.net/jdk/pull/2577 From lkorinth at openjdk.java.net Mon Feb 15 15:56:53 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Mon, 15 Feb 2021 15:56:53 GMT Subject: RFR: 8260415: Remove unused class ReferenceProcessorMTProcMutator Message-ID: ReferenceProcessorMTProcMutator is not used. ReferenceProcessorMTDiscoveryMutator seems to do the same and is still being used. ------------- Commit messages: - 8260415 Changes: https://git.openjdk.java.net/jdk/pull/2578/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2578&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8260415 Stats: 22 lines in 1 file changed: 0 ins; 22 del; 0 mod Patch: https://git.openjdk.java.net/jdk/pull/2578.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2578/head:pull/2578 PR: https://git.openjdk.java.net/jdk/pull/2578 From ayang at openjdk.java.net Mon Feb 15 16:02:38 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Mon, 15 Feb 2021 16:02:38 GMT Subject: RFR: 8260415: Remove unused class ReferenceProcessorMTProcMutator In-Reply-To: References: Message-ID: On Mon, 15 Feb 2021 15:51:30 GMT, Leo Korinth wrote: > ReferenceProcessorMTProcMutator is not used. ReferenceProcessorMTDiscoveryMutator seems to do the same and is still being used. Marked as reviewed by ayang (Author). ------------- PR: https://git.openjdk.java.net/jdk/pull/2578 From sjohanss at openjdk.java.net Mon Feb 15 20:41:38 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Mon, 15 Feb 2021 20:41:38 GMT Subject: RFR: 8260415: Remove unused class ReferenceProcessorMTProcMutator In-Reply-To: References: Message-ID: On Mon, 15 Feb 2021 15:51:30 GMT, Leo Korinth wrote: > ReferenceProcessorMTProcMutator is not used. ReferenceProcessorMTDiscoveryMutator seems to do the same and is still being used. Looks good and trivial. ------------- Marked as reviewed by sjohanss (Reviewer). 
PR: https://git.openjdk.java.net/jdk/pull/2578 From rkennke at openjdk.java.net Mon Feb 15 21:12:53 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 15 Feb 2021 21:12:53 GMT Subject: RFR: 8261413: Shenandoah: Disable class-unloading in I-U mode [v3] In-Reply-To: References: Message-ID: <5rxb-j7jLWGsanoowSjvLIzFwCtlH_FgBHo-GM7fkyQ=.4cf0989c-f784-48ad-b60c-e4613f35270d@github.com> > JDK-8261341 describes a serious problem with I-U mode and class-unloading. Let's disable class-unloading in I-U for now as a workaround. > > Testing: > - [ ] hotspot_gc_shenandoah > - [ ] tier1 (+UseShenandoahGC +IU) > - [x] runtime/CreateMirror/ArraysNewInstanceBug.java (+UseShenandoahGC +IU +aggressive) many times in a row w/o failure Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: Don't disable all class-unloading with I-U, disabling concurrent class-unloading is sufficient ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2477/files - new: https://git.openjdk.java.net/jdk/pull/2477/files/e3c1b459..6e99cc98 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2477&range=02 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2477&range=01-02 Stats: 3 lines in 1 file changed: 0 ins; 0 del; 3 mod Patch: https://git.openjdk.java.net/jdk/pull/2477.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2477/head:pull/2477 PR: https://git.openjdk.java.net/jdk/pull/2477 From shade at openjdk.java.net Tue Feb 16 07:26:40 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Tue, 16 Feb 2021 07:26:40 GMT Subject: RFR: 8261413: Shenandoah: Disable class-unloading in I-U mode [v3] In-Reply-To: <5rxb-j7jLWGsanoowSjvLIzFwCtlH_FgBHo-GM7fkyQ=.4cf0989c-f784-48ad-b60c-e4613f35270d@github.com> References: <5rxb-j7jLWGsanoowSjvLIzFwCtlH_FgBHo-GM7fkyQ=.4cf0989c-f784-48ad-b60c-e4613f35270d@github.com> Message-ID: On Mon, 15 Feb 2021 21:12:53 GMT, Roman Kennke wrote: >> JDK-8261341 describes a serious problem with I-U mode and class-unloading. Let's disable class-unloading in I-U for now as a workaround. >> >> Testing: >> - [x] hotspot_gc_shenandoah >> - [x] tier1 (+UseShenandoahGC +IU) >> - [x] runtime/CreateMirror/ArraysNewInstanceBug.java (+UseShenandoahGC +IU +aggressive) many times in a row w/o failure > > Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: > > Don't disable all class-unloading with I-U, disabling concurrent class-unloading is sufficient Marked as reviewed by shade (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2477 From rkennke at openjdk.java.net Tue Feb 16 08:20:39 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Tue, 16 Feb 2021 08:20:39 GMT Subject: Integrated: 8261413: Shenandoah: Disable class-unloading in I-U mode In-Reply-To: References: Message-ID: On Tue, 9 Feb 2021 11:58:58 GMT, Roman Kennke wrote: > JDK-8261341 describes a serious problem with I-U mode and class-unloading. Let's disable class-unloading in I-U for now as a workaround. > > Testing: > - [x] hotspot_gc_shenandoah > - [x] tier1 (+UseShenandoahGC +IU) > - [x] runtime/CreateMirror/ArraysNewInstanceBug.java (+UseShenandoahGC +IU +aggressive) many times in a row w/o failure This pull request has now been integrated. 
Changeset: e2d52ae2 Author: Roman Kennke URL: https://git.openjdk.java.net/jdk/commit/e2d52ae2 Stats: 5 lines in 1 file changed: 5 ins; 0 del; 0 mod 8261413: Shenandoah: Disable class-unloading in I-U mode Reviewed-by: shade, zgu ------------- PR: https://git.openjdk.java.net/jdk/pull/2477 From tschatzl at openjdk.java.net Tue Feb 16 08:53:42 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Tue, 16 Feb 2021 08:53:42 GMT Subject: RFR: 8261758: [TESTBUG] gc/g1/TestGCLogMessages.java fails if ergonomics detect too small InitialHeapSize In-Reply-To: References: Message-ID: <4GPMkO2QkdP7_JXlyDsFYP_BBUEWCaS0VVDSs3Go7aE=.7538081c-4662-4158-a8c4-36e9a15f5d0d@github.com> On Mon, 15 Feb 2021 15:20:56 GMT, Christoph G?ttschkes wrote: > Adds an explicit -Xms to one part of the test case, to not rely on ergonomics to detect the correct InitialHeapSize. > > It looks like one part of the whole test case implicitly relied on the fact, that `InitialHeapSize` == `MaxHeapSize`. Since the `MaxHeapSize` is very small (32M), this is almost always true. But if the test device has less than 2G of memory, the ergonomics configure the `InitialHeapSize` to be smaller than the `MaxHeapSize`. Marked as reviewed by tschatzl (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2577 From tschatzl at openjdk.java.net Tue Feb 16 08:55:40 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Tue, 16 Feb 2021 08:55:40 GMT Subject: RFR: 8261752: Multiple GC test are missing memory requirements In-Reply-To: References: Message-ID: On Mon, 15 Feb 2021 13:24:57 GMT, Christoph G?ttschkes wrote: > I used systemd to figure out which memory requirement makes sense for which test: > > $ systemd-run --user --scope -p MemoryMax=768M -p MemorySwapMax=0 /usr/bin/make TEST="..." test > > Tests succeeding with `768M` of MemoryMax got a requirement of 1G, all others got 2G and succeeded with a MemoryMax of 1536M. Thanks. Lgtm. ------------- Marked as reviewed by tschatzl (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2575 From rkennke at openjdk.java.net Tue Feb 16 10:14:40 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Tue, 16 Feb 2021 10:14:40 GMT Subject: RFR: 8261501: Shenandoah: reconsider heap statistics memory ordering In-Reply-To: <0O1tXXs991770rhrpYioXIWr6m-OhDFMZINDiQ_UXc4=.92460035-468e-4bf5-97cb-bff58d1a2ede@github.com> References: <0O1tXXs991770rhrpYioXIWr6m-OhDFMZINDiQ_UXc4=.92460035-468e-4bf5-97cb-bff58d1a2ede@github.com> Message-ID: <3w0bggGwSikPsnaGTFPIMjsNNLUNu1vxVCLraAf6nhA=.8f35a639-9325-4c18-9b09-7b62e67b8dd8@github.com> On Wed, 10 Feb 2021 11:10:35 GMT, Aleksey Shipilev wrote: > ShenandoahHeap collects heap-wide statistics (used, committed, etc). It does so by atomically updating them with default CASes. Unfortunately, Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > This is excessive for statistics gathering, and "relaxed" should be just as good. > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah Looks good to me! Thanks! ------------- Marked as reviewed by rkennke (Reviewer). 
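For readers following along, the change amounts to passing an explicit memory order to the existing Atomic calls. A minimal sketch (not the actual patch, the counter name is illustrative):

```
#include "runtime/atomic.hpp"

// Heap statistics only need atomic updates, not ordering guarantees, so the
// relaxed variant avoids the two-way fences implied by memory_order_conservative.
static void increase_counter(volatile size_t* counter, size_t bytes) {
  Atomic::add(counter, bytes, memory_order_relaxed);
}
```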
PR: https://git.openjdk.java.net/jdk/pull/2504 From shade at openjdk.java.net Tue Feb 16 11:35:51 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Tue, 16 Feb 2021 11:35:51 GMT Subject: Integrated: 8261501: Shenandoah: reconsider heap statistics memory ordering In-Reply-To: <0O1tXXs991770rhrpYioXIWr6m-OhDFMZINDiQ_UXc4=.92460035-468e-4bf5-97cb-bff58d1a2ede@github.com> References: <0O1tXXs991770rhrpYioXIWr6m-OhDFMZINDiQ_UXc4=.92460035-468e-4bf5-97cb-bff58d1a2ede@github.com> Message-ID: On Wed, 10 Feb 2021 11:10:35 GMT, Aleksey Shipilev wrote: > ShenandoahHeap collects heap-wide statistics (used, committed, etc). It does so by atomically updating them with default CASes. Unfortunately, Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > This is excessive for statistics gathering, and "relaxed" should be just as good. > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah This pull request has now been integrated. Changeset: 3f8819c6 Author: Aleksey Shipilev URL: https://git.openjdk.java.net/jdk/commit/3f8819c6 Stats: 9 lines in 1 file changed: 0 ins; 1 del; 8 mod 8261501: Shenandoah: reconsider heap statistics memory ordering Reviewed-by: rkennke ------------- PR: https://git.openjdk.java.net/jdk/pull/2504 From shade at openjdk.java.net Tue Feb 16 13:21:00 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Tue, 16 Feb 2021 13:21:00 GMT Subject: RFR: 8261495: Shenandoah: reconsider update references memory ordering [v4] In-Reply-To: <4RLKvcdaWu0Cu6owC3yGoVY1KVEsYjBZEFJhfdwnhWg=.65fbeae1-58f6-48d3-a2ed-981858ef7da9@github.com> References: <4RLKvcdaWu0Cu6owC3yGoVY1KVEsYjBZEFJhfdwnhWg=.65fbeae1-58f6-48d3-a2ed-981858ef7da9@github.com> Message-ID: > Shenandoah update heap references code uses default Atomic::cmpxchg to avoid races with mutator updates. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > This seems to be excessive for Shenandoah update references code, and "release" is enough. We do not seem to piggyback on update-references memory effects anywhere (in fact, if not for mutator, we would not even need a CAS). But, there is an interplay with concurrent evacuation and updates from self-healing. > > Average time goes down, the number of GC cycles go up, since the cycles are shorter. > > Additional testing: > - [x] Linux x86_64 hotspot_gc_shenandoah > - [x] Linux AArch64 hotspot_gc_shenandoah > - [x] Linux AArch64 tier1 with Shenandoah Aleksey Shipilev has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. 
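To illustrate the ordering argument, a simplified sketch (uncompressed oops, no forwarding-pointer details, not the actual Shenandoah code):

```
#include "oops/oop.hpp"
#include "runtime/atomic.hpp"

// The updater only needs to publish an already-initialized forwardee, so a
// release CAS suffices; racing mutator updates (self-healing) provide their
// own ordering on the load/CAS side.
static void update_reference(oop volatile* p, oop expected, oop forwardee) {
  Atomic::cmpxchg(p, expected, forwardee, memory_order_release);
}
```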
The pull request contains 10 additional commits since the last revision: - Comment touchup - Specialize out witness-checking methods, drop acquire again - Even more explanation - Move the comment - Also handle clearing the oops - Minor touchups to the comment - Merge branch 'master' into JDK-8261495-shenandoah-updaterefs-memord - Use release only - Do acq_rel instead - 8261495: Shenandoah: reconsider update references memory ordering ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2498/files - new: https://git.openjdk.java.net/jdk/pull/2498/files/36bee3a9..0d299968 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2498&range=03 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2498&range=02-03 Stats: 12253 lines in 405 files changed: 6246 ins; 3773 del; 2234 mod Patch: https://git.openjdk.java.net/jdk/pull/2498.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2498/head:pull/2498 PR: https://git.openjdk.java.net/jdk/pull/2498 From github.com+168222+mgkwill at openjdk.java.net Tue Feb 16 16:32:56 2021 From: github.com+168222+mgkwill at openjdk.java.net (Marcus G K Williams) Date: Tue, 16 Feb 2021 16:32:56 GMT Subject: RFR: JDK-8256155: os::Linux Populate all large_page_sizes, select smallest page size in reserve_memory_special_huge_tlbfs* [v16] In-Reply-To: References: Message-ID: > When using LargePageSizeInBytes=1G, os::Linux::reserve_memory_special_huge_tlbfs* cannot select large pages smaller than 1G. Code heap usually uses less than 1G, so currently the code precludes code heap from using > Large pages in this circumstance and when os::Linux::reserve_memory_special_huge_tlbfs* is called page sizes fall back to Linux::page_size() (usually 4k). > > This change allows the above use case by populating all large_page_sizes present in /sys/kernel/mm/hugepages in _page_sizes upon calling os::Linux::setup_large_page_size(). > > In os::Linux::reserve_memory_special_huge_tlbfs* we then select the largest large page size available in _page_sizes that is smaller than bytes being reserved. Marcus G K Williams has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 25 commits: - Merge branch 'master' into pull/1153 - kstefanj update Signed-off-by: Marcus G K Williams - Merge branch 'master' into update_hlp - Merge branch 'master' into update_hlp - Remove extraneous ' from warning Signed-off-by: Marcus G K Williams - Merge branch 'master' into update_hlp - Merge branch 'master' into update_hlp - Merge branch 'master' into update_hlp - Fix os::large_page_size() in last update Signed-off-by: Marcus G K Williams - Ivan W. Requested Changes Removed os::Linux::select_large_page_size and use os::page_size_for_region instead Removed Linux::find_large_page_size and use register_large_page_sizes. Streamlined Linux::setup_large_page_size Signed-off-by: Marcus G K Williams - ... 
and 15 more: https://git.openjdk.java.net/jdk/compare/f4cfd758...f2e44ac7 ------------- Changes: https://git.openjdk.java.net/jdk/pull/1153/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=1153&range=15 Stats: 71 lines in 2 files changed: 32 ins; 10 del; 29 mod Patch: https://git.openjdk.java.net/jdk/pull/1153.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/1153/head:pull/1153 PR: https://git.openjdk.java.net/jdk/pull/1153 From lkorinth at openjdk.java.net Tue Feb 16 18:32:38 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Tue, 16 Feb 2021 18:32:38 GMT Subject: RFR: 8260415: Remove unused class ReferenceProcessorMTProcMutator In-Reply-To: References: Message-ID: On Mon, 15 Feb 2021 20:39:16 GMT, Stefan Johansson wrote: >> ReferenceProcessorMTProcMutator is not used. ReferenceProcessorMTDiscoveryMutator seems to do the same and is still being used. > > Looks good and trivial. Thanks Albert and Stefan! ------------- PR: https://git.openjdk.java.net/jdk/pull/2578 From lkorinth at openjdk.java.net Tue Feb 16 18:32:39 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Tue, 16 Feb 2021 18:32:39 GMT Subject: Integrated: 8260415: Remove unused class ReferenceProcessorMTProcMutator In-Reply-To: References: Message-ID: On Mon, 15 Feb 2021 15:51:30 GMT, Leo Korinth wrote: > ReferenceProcessorMTProcMutator is not used. ReferenceProcessorMTDiscoveryMutator seems to do the same and is still being used. This pull request has now been integrated. Changeset: 61a659f4 Author: Leo Korinth URL: https://git.openjdk.java.net/jdk/commit/61a659f4 Stats: 22 lines in 1 file changed: 0 ins; 22 del; 0 mod 8260415: Remove unused class ReferenceProcessorMTProcMutator Reviewed-by: ayang, sjohanss ------------- PR: https://git.openjdk.java.net/jdk/pull/2578 From lkorinth at openjdk.java.net Tue Feb 16 18:59:51 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Tue, 16 Feb 2021 18:59:51 GMT Subject: RFR: 8260416: Remove unused method ReferenceProcessor::is_mt_processing_set_up() Message-ID: Code is not used. ------------- Commit messages: - 8260416: Remove unused method ReferenceProcessor::is_mt_processing_set_up() Changes: https://git.openjdk.java.net/jdk/pull/2591/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2591&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8260416 Stats: 7 lines in 2 files changed: 0 ins; 7 del; 0 mod Patch: https://git.openjdk.java.net/jdk/pull/2591.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2591/head:pull/2591 PR: https://git.openjdk.java.net/jdk/pull/2591 From shade at openjdk.java.net Tue Feb 16 19:17:53 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Tue, 16 Feb 2021 19:17:53 GMT Subject: RFR: 8261842: Shenandoah: cleanup ShenandoahHeapRegionSet Message-ID: There are a couple of stale/unused methods in ShenandoahHeapRegionSet that we can eliminate instead of improving them, for example in JDK-8261838. 
Additional testing: - [x] Linux x86_64 `hotspot_gc_shenandoah` ------------- Commit messages: - Update - 8261842: Shenandoah: cleanup ShenandoahHeapRegionSet Changes: https://git.openjdk.java.net/jdk/pull/2592/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2592&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261842 Stats: 88 lines in 3 files changed: 0 ins; 82 del; 6 mod Patch: https://git.openjdk.java.net/jdk/pull/2592.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2592/head:pull/2592 PR: https://git.openjdk.java.net/jdk/pull/2592 From shade at openjdk.java.net Tue Feb 16 19:19:00 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Tue, 16 Feb 2021 19:19:00 GMT Subject: RFR: 8261838: Shenandoah: reconsider heap region iterators memory ordering Message-ID: We use CASes to distributed workers between regions. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. This seems to be excessive for region distribution code, and "relaxed" is enough, since we don't piggyback memory ordering on these. This also calls for some refactoring in the code itself. Additional testing: - [x] `hotspot_gc_shenandoah` - [ ] Ad-hoc performance runs ------------- Commit messages: - 8261838: Shenandoah: reconsider heap region iterators memory ordering Changes: https://git.openjdk.java.net/jdk/pull/2593/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2593&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261838 Stats: 24 lines in 4 files changed: 2 ins; 3 del; 19 mod Patch: https://git.openjdk.java.net/jdk/pull/2593.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2593/head:pull/2593 PR: https://git.openjdk.java.net/jdk/pull/2593 From rkennke at openjdk.java.net Tue Feb 16 19:28:40 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Tue, 16 Feb 2021 19:28:40 GMT Subject: RFR: 8261842: Shenandoah: cleanup ShenandoahHeapRegionSet In-Reply-To: References: Message-ID: On Tue, 16 Feb 2021 19:11:27 GMT, Aleksey Shipilev wrote: > There are a couple of stale/unused methods in ShenandoahHeapRegionSet that we can eliminate instead of improving them, for example in JDK-8261838. > > Additional testing: > - [x] Linux x86_64 `hotspot_gc_shenandoah` Ok! ------------- Marked as reviewed by rkennke (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2592 From rkennke at openjdk.java.net Tue Feb 16 19:32:40 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Tue, 16 Feb 2021 19:32:40 GMT Subject: RFR: 8261838: Shenandoah: reconsider heap region iterators memory ordering In-Reply-To: References: Message-ID: <5iWqnhftllWZb8XoOXKQcGTgR1pbf3odYvN8BGW6Xwg=.3856984a-aeae-44f9-beba-ed476d9f6e22@github.com> On Tue, 16 Feb 2021 19:13:03 GMT, Aleksey Shipilev wrote: > We use CASes to distributed workers between regions. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > This seems to be excessive for region distribution code, and "relaxed" is enough, since we don't piggyback memory ordering on these. > > This also calls for some refactoring in the code itself. > > Additional testing: > - [x] `hotspot_gc_shenandoah` > - [ ] Ad-hoc performance runs Looks good! ------------- Marked as reviewed by rkennke (Reviewer). 
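For context, the claiming pattern under discussion is essentially the following (sketch with made-up names, not the actual iterator code):

```
#include "runtime/atomic.hpp"

// Workers race with a CAS to take the next region index. The counter only
// partitions work, nothing is published through it, so relaxed ordering is
// sufficient and the conservative default's two-way fences buy nothing.
static size_t claim_next_index(volatile size_t* next, size_t limit) {
  size_t cur = Atomic::load(next);
  while (cur < limit) {
    size_t witness = Atomic::cmpxchg(next, cur, cur + 1, memory_order_relaxed);
    if (witness == cur) {
      return cur;    // we own index 'cur'
    }
    cur = witness;   // another worker advanced the counter, retry
  }
  return limit;      // nothing left to claim
}
```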
PR: https://git.openjdk.java.net/jdk/pull/2593 From ayang at openjdk.java.net Tue Feb 16 19:38:41 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Tue, 16 Feb 2021 19:38:41 GMT Subject: RFR: 8260416: Remove unused method ReferenceProcessor::is_mt_processing_set_up() In-Reply-To: References: Message-ID: On Tue, 16 Feb 2021 18:53:54 GMT, Leo Korinth wrote: > Code is not used. Marked as reviewed by ayang (Author). ------------- PR: https://git.openjdk.java.net/jdk/pull/2591 From kbarrett at openjdk.java.net Wed Feb 17 06:43:47 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Wed, 17 Feb 2021 06:43:47 GMT Subject: RFR: 8260416: Remove unused method ReferenceProcessor::is_mt_processing_set_up() In-Reply-To: References: Message-ID: On Tue, 16 Feb 2021 18:53:54 GMT, Leo Korinth wrote: > Code is not used. Marked as reviewed by kbarrett (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2591 From shade at openjdk.java.net Wed Feb 17 07:00:40 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Wed, 17 Feb 2021 07:00:40 GMT Subject: Integrated: 8261842: Shenandoah: cleanup ShenandoahHeapRegionSet In-Reply-To: References: Message-ID: On Tue, 16 Feb 2021 19:11:27 GMT, Aleksey Shipilev wrote: > There are a couple of stale/unused methods in ShenandoahHeapRegionSet that we can eliminate instead of improving them, for example in JDK-8261838. > > Additional testing: > - [x] Linux x86_64 `hotspot_gc_shenandoah` This pull request has now been integrated. Changeset: d1950335 Author: Aleksey Shipilev URL: https://git.openjdk.java.net/jdk/commit/d1950335 Stats: 88 lines in 3 files changed: 0 ins; 82 del; 6 mod 8261842: Shenandoah: cleanup ShenandoahHeapRegionSet Reviewed-by: rkennke ------------- PR: https://git.openjdk.java.net/jdk/pull/2592 From ayang at openjdk.java.net Wed Feb 17 08:05:01 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Wed, 17 Feb 2021 08:05:01 GMT Subject: RFR: 8228748: Remove GCLocker::_doing_gc Message-ID: Some refactoring in `GCLocker` and more comments in `jni_lock` on how the synchronization works there. ------------- Commit messages: - lock Changes: https://git.openjdk.java.net/jdk/pull/2602/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2602&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8228748 Stats: 15 lines in 2 files changed: 4 ins; 5 del; 6 mod Patch: https://git.openjdk.java.net/jdk/pull/2602.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2602/head:pull/2602 PR: https://git.openjdk.java.net/jdk/pull/2602 From cgo at openjdk.java.net Wed Feb 17 08:12:38 2021 From: cgo at openjdk.java.net (Christoph =?UTF-8?B?R8O2dHRzY2hrZXM=?=) Date: Wed, 17 Feb 2021 08:12:38 GMT Subject: RFR: 8261752: Multiple GC test are missing memory requirements In-Reply-To: References: Message-ID: On Tue, 16 Feb 2021 08:53:21 GMT, Thomas Schatzl wrote: >> I used systemd to figure out which memory requirement makes sense for which test: >> >> $ systemd-run --user --scope -p MemoryMax=768M -p MemorySwapMax=0 /usr/bin/make TEST="..." test >> >> Tests succeeding with `768M` of MemoryMax got a requirement of 1G, all others got 2G and succeeded with a MemoryMax of 1536M. > > Thanks. Lgtm. Thanks for the review. 
------------- PR: https://git.openjdk.java.net/jdk/pull/2575 From cgo at openjdk.java.net Wed Feb 17 08:13:45 2021 From: cgo at openjdk.java.net (Christoph =?UTF-8?B?R8O2dHRzY2hrZXM=?=) Date: Wed, 17 Feb 2021 08:13:45 GMT Subject: RFR: 8261758: [TESTBUG] gc/g1/TestGCLogMessages.java fails if ergonomics detect too small InitialHeapSize In-Reply-To: <4GPMkO2QkdP7_JXlyDsFYP_BBUEWCaS0VVDSs3Go7aE=.7538081c-4662-4158-a8c4-36e9a15f5d0d@github.com> References: <4GPMkO2QkdP7_JXlyDsFYP_BBUEWCaS0VVDSs3Go7aE=.7538081c-4662-4158-a8c4-36e9a15f5d0d@github.com> Message-ID: On Tue, 16 Feb 2021 08:50:48 GMT, Thomas Schatzl wrote: >> Adds an explicit -Xms to one part of the test case, to not rely on ergonomics to detect the correct InitialHeapSize. >> >> It looks like one part of the whole test case implicitly relied on the fact, that `InitialHeapSize` == `MaxHeapSize`. Since the `MaxHeapSize` is very small (32M), this is almost always true. But if the test device has less than 2G of memory, the ergonomics configure the `InitialHeapSize` to be smaller than the `MaxHeapSize`. > > Marked as reviewed by tschatzl (Reviewer). Thanks for the review. ------------- PR: https://git.openjdk.java.net/jdk/pull/2577 From sjohanss at openjdk.java.net Wed Feb 17 09:06:44 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Wed, 17 Feb 2021 09:06:44 GMT Subject: RFR: 8261758: [TESTBUG] gc/g1/TestGCLogMessages.java fails if ergonomics detect too small InitialHeapSize In-Reply-To: References: Message-ID: On Mon, 15 Feb 2021 15:20:56 GMT, Christoph G?ttschkes wrote: > Adds an explicit -Xms to one part of the test case, to not rely on ergonomics to detect the correct InitialHeapSize. > > It looks like one part of the whole test case implicitly relied on the fact, that `InitialHeapSize` == `MaxHeapSize`. Since the `MaxHeapSize` is very small (32M), this is almost always true. But if the test device has less than 2G of memory, the ergonomics configure the `InitialHeapSize` to be smaller than the `MaxHeapSize`. Marked as reviewed by sjohanss (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2577 From sjohanss at openjdk.java.net Wed Feb 17 09:09:41 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Wed, 17 Feb 2021 09:09:41 GMT Subject: RFR: 8261752: Multiple GC test are missing memory requirements In-Reply-To: References: Message-ID: On Mon, 15 Feb 2021 13:24:57 GMT, Christoph G?ttschkes wrote: > I used systemd to figure out which memory requirement makes sense for which test: > > $ systemd-run --user --scope -p MemoryMax=768M -p MemorySwapMax=0 /usr/bin/make TEST="..." test > > Tests succeeding with `768M` of MemoryMax got a requirement of 1G, all others got 2G and succeeded with a MemoryMax of 1536M. Looks good. ------------- Marked as reviewed by sjohanss (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2575 From cgo at openjdk.java.net Wed Feb 17 10:44:51 2021 From: cgo at openjdk.java.net (Christoph =?UTF-8?B?R8O2dHRzY2hrZXM=?=) Date: Wed, 17 Feb 2021 10:44:51 GMT Subject: Integrated: 8261758: [TESTBUG] gc/g1/TestGCLogMessages.java fails if ergonomics detect too small InitialHeapSize In-Reply-To: References: Message-ID: On Mon, 15 Feb 2021 15:20:56 GMT, Christoph G?ttschkes wrote: > Adds an explicit -Xms to one part of the test case, to not rely on ergonomics to detect the correct InitialHeapSize. > > It looks like one part of the whole test case implicitly relied on the fact, that `InitialHeapSize` == `MaxHeapSize`. 
Since the `MaxHeapSize` is very small (32M), this is almost always true. But if the test device has less than 2G of memory, the ergonomics configure the `InitialHeapSize` to be smaller than the `MaxHeapSize`. This pull request has now been integrated. Changeset: c7885eb1 Author: Christoph G?ttschkes Committer: Thomas Schatzl URL: https://git.openjdk.java.net/jdk/commit/c7885eb1 Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod 8261758: [TESTBUG] gc/g1/TestGCLogMessages.java fails if ergonomics detect too small InitialHeapSize Reviewed-by: tschatzl, sjohanss ------------- PR: https://git.openjdk.java.net/jdk/pull/2577 From cgo at openjdk.java.net Wed Feb 17 10:44:38 2021 From: cgo at openjdk.java.net (Christoph =?UTF-8?B?R8O2dHRzY2hrZXM=?=) Date: Wed, 17 Feb 2021 10:44:38 GMT Subject: Integrated: 8261752: Multiple GC test are missing memory requirements In-Reply-To: References: Message-ID: On Mon, 15 Feb 2021 13:24:57 GMT, Christoph G?ttschkes wrote: > I used systemd to figure out which memory requirement makes sense for which test: > > $ systemd-run --user --scope -p MemoryMax=768M -p MemorySwapMax=0 /usr/bin/make TEST="..." test > > Tests succeeding with `768M` of MemoryMax got a requirement of 1G, all others got 2G and succeeded with a MemoryMax of 1536M. This pull request has now been integrated. Changeset: 2e18b52a Author: Christoph G?ttschkes Committer: Thomas Schatzl URL: https://git.openjdk.java.net/jdk/commit/2e18b52a Stats: 7 lines in 7 files changed: 2 ins; 0 del; 5 mod 8261752: Multiple GC test are missing memory requirements Reviewed-by: tschatzl, sjohanss ------------- PR: https://git.openjdk.java.net/jdk/pull/2575 From kbarrett at openjdk.java.net Wed Feb 17 15:23:54 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Wed, 17 Feb 2021 15:23:54 GMT Subject: RFR: 8261905: Move implementation of OopStorage num_dead related functions Message-ID: Please review this trivial change which just moves several functions to a different location in the same file. The old location is in the middle of some unrelated functionality. Testing: mach5 tier1 ------------- Commit messages: - move num_dead functions Changes: https://git.openjdk.java.net/jdk/pull/2608/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2608&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261905 Stats: 30 lines in 1 file changed: 15 ins; 15 del; 0 mod Patch: https://git.openjdk.java.net/jdk/pull/2608.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2608/head:pull/2608 PR: https://git.openjdk.java.net/jdk/pull/2608 From ayang at openjdk.java.net Wed Feb 17 15:41:49 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Wed, 17 Feb 2021 15:41:49 GMT Subject: RFR: 8261905: Move implementation of OopStorage num_dead related functions In-Reply-To: References: Message-ID: On Wed, 17 Feb 2021 15:18:40 GMT, Kim Barrett wrote: > Please review this trivial change which just moves several functions to a > different location in the same file. The old location is in the middle of > some unrelated functionality. > > Testing: > mach5 tier1 Marked as reviewed by ayang (Author). 
------------- PR: https://git.openjdk.java.net/jdk/pull/2608 From rkennke at openjdk.java.net Wed Feb 17 17:36:45 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Wed, 17 Feb 2021 17:36:45 GMT Subject: RFR: 8261473: Shenandoah: Add breakpoint suppoprt [v2] In-Reply-To: <9gXmTI0gU9zTr-HffSqSsVEVjmUED0rNINulpr_mjQM=.b353278c-a4f1-45f5-bf8d-ed1e33fbb0c9@github.com> References: <9gXmTI0gU9zTr-HffSqSsVEVjmUED0rNINulpr_mjQM=.b353278c-a4f1-45f5-bf8d-ed1e33fbb0c9@github.com> Message-ID: On Wed, 10 Feb 2021 20:13:52 GMT, Zhengyu Gu wrote: >> Please review this patch that adds breakpoint support for Shenandoah, that allows Shenandoah to access a few tests: >> >> gc/TestConcurrentGCBreakpoints.java >> gc/TestJNIWeak/TestJNIWeak.java >> gc/TestReferenceClearDuringMarking.java >> gc/TestReferenceClearDuringReferenceProcessing.java >> gc/TestReferenceRefersTo.java >> >> The drawback is that above tests can not run with passive mode, which can result tests to hang, as breakpoints only apply to concurrent GC. >> >> Test: >> - [x] hotspot_gc_shenandoah >> - [x] tier1 with Shenandoah > > Zhengyu Gu has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains three commits: > > - Merge > - update > - init update The change looks good to me, thanks! ------------- Marked as reviewed by rkennke (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2489 From zgu at openjdk.java.net Wed Feb 17 18:35:40 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Wed, 17 Feb 2021 18:35:40 GMT Subject: Withdrawn: 8259647: Add support for JFR event ObjectCountAfterGC to Shenandoah In-Reply-To: References: Message-ID: On Wed, 3 Feb 2021 20:05:33 GMT, Zhengyu Gu wrote: > Please review this patch that adds JFR ObjectCountAfterGC event support. > > AFAICT, the event is off by default. If it is enabled, it distorts Shenandoah pause characteristics, since it performs heap walk during final mark pause. > > When event is disabled: > `[191.033s][info][gc,stats] Pause Init Mark (G) 454 us` > `[191.033s][info][gc,stats] Pause Init Mark (N) 13 us` > > When event is enabled: > `[396.631s][info][gc,stats] Pause Final Mark (G) 43199 us` > `[396.631s][info][gc,stats] Pause Final Mark (N) 42982 us` > > Test: > - [x] hotspot_gc_shenandoah This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.java.net/jdk/pull/2386 From tschatzl at openjdk.java.net Wed Feb 17 21:29:39 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Wed, 17 Feb 2021 21:29:39 GMT Subject: RFR: 8261905: Move implementation of OopStorage num_dead related functions In-Reply-To: References: Message-ID: <5pirmUU8MAKjn9tKXjnVa7bgmqtbGkn3tJ_DatXPMy4=.af779ba6-9828-48b7-95c0-aa908294a4c2@github.com> On Wed, 17 Feb 2021 15:18:40 GMT, Kim Barrett wrote: > Please review this trivial change which just moves several functions to a > different location in the same file. The old location is in the middle of > some unrelated functionality. > > Testing: > mach5 tier1 Marked as reviewed by tschatzl (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2608 From samuel_thomas at brown.edu Wed Feb 17 21:55:18 2021 From: samuel_thomas at brown.edu (Sam Thomas) Date: Wed, 17 Feb 2021 16:55:18 -0500 Subject: Flush Cache from Within Hotspot Message-ID: <22f38dda-f4b3-6dd8-09b7-487df03c761d@brown.edu> Hello all, I am working on a project that explores different architecture simulations using JDK14, and I am trying to flush the cache from the source code. 
We are working with an aarch64 target and I have tried the following to no avail: 1. Use the clflush C++ built-in instruction (this is an x86 instruction) 2. Dynamically allocate a large (2 MB) block of memory, and assign random numbers to the array (Hotspot assertions do not allow for dynamic memory allocation) 3. Allocate a large (2 MB) block of memory on the stack, and assign random numbers to the array (this properly compiles, but the stack overflows and a segmentation fault is raised) Has anyone done projects like this in the past, or is there a standard for how to perform such a procedure? Thank you for your help! Best, Sam From aver.shining at gmail.com Thu Feb 18 01:42:28 2021 From: aver.shining at gmail.com (=?UTF-8?B?0JDQvdC00YDQtdC5INCS0LXRgNGI0LjQvdC40L0=?=) Date: Thu, 18 Feb 2021 04:42:28 +0300 Subject: RFR: 8254239: G1ConcurrentMark.hpp unnecessarily disables MSVC++ warning 4522 Message-ID: Hello everybody, My name is Andrey Vershinin, I've chosen to fix JDK-8254239 as my first contribution to the project. I'm a Java developer, with some C++ knowledge (in the process of improving it). My goal is to gain a deeper understanding of the inner workings of the platform I'm interested in, its concepts and the code itself, and contribute to the best of my ability. The bug I've selected is a 'starter' one, to get involved in the process. The patch is attached below. Thanks, Andrey =================================================================== diff --git a/src/hotspot/share/gc/g1/g1ConcurrentMark.hpp b/src/hotspot/share/gc/g1/g1ConcurrentMark.hpp --- a/src/hotspot/share/gc/g1/g1ConcurrentMark.hpp (revision 06348dfcae0b6b82970e8c56391396affd311f90) +++ b/src/hotspot/share/gc/g1/g1ConcurrentMark.hpp (revision e635cee530e414503d6e84261ce636123d282ee9) @@ -50,10 +50,6 @@ class G1SurvivorRegions; class ThreadClosure; -PRAGMA_DIAG_PUSH -// warning C4522: multiple assignment operators specified -PRAGMA_DISABLE_MSVC_WARNING(4522) - // This is a container class for either an oop or a continuation address for // mark stack entries. Both are pushed onto the mark stack. class G1TaskQueueEntry { @@ -89,8 +85,6 @@ bool is_null() const { return _holder == NULL; } }; -PRAGMA_DIAG_POP - typedef GenericTaskQueue G1CMTaskQueue; typedef GenericTaskQueueSet G1CMTaskQueueSet; From jbachorik at openjdk.java.net Thu Feb 18 10:09:02 2021 From: jbachorik at openjdk.java.net (Jaroslav Bachorik) Date: Thu, 18 Feb 2021 10:09:02 GMT Subject: RFR: 8258431: Provide a JFR event with live set size estimate Message-ID: <7HVs4jngEbNIQIPQByuE6IRYAxdijfa82uhEFWHld5U=.a7784482-d7e1-4d59-88ee-455d8691631e@github.com> The purpose of this change is to expose a 'cheap' estimate of the current live set size (the meaning of 'current' is dependent on each particular GC implementation but in worst case 'at last full GC') in form of a periodically emitted JFR event. ## Introducing new JFR event While there is already 'GC Heap Summary' JFR event it does not fit the requirements as it is closely tied to GC cycle so eg. for ZGC or Shenandoah it may not happen for quite a long time, increasing the risk of not having the heap summary events being present in the JFR recording at all. Because of this I am proposing to add a new 'Heap Usage Summary' event which will be emitted periodically, by default on each JFR chunk, and will contain the information abut the heap capacity, the used and live bytes. This information is available from all GC implementations and can be provided at literally any time. 
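To make the proposal concrete, here is a rough sketch of what such a periodic emission could look like. `TRACE_REQUEST_FUNC`, `EventHeapUsageSummary` and `set_heapLive()` appear in the patch hunks quoted later in this thread; the capacity/used setter names are assumptions for illustration only.

```c++
// Sketch only -- not the actual patch. Run by the JFR periodic framework,
// independent of GC cycles, so the data is present even in chunks with no GC.
TRACE_REQUEST_FUNC(HeapUsageSummary) {
  CollectedHeap* heap = Universe::heap();
  EventHeapUsageSummary event;
  event.set_heapCapacity(heap->capacity()); // assumed setter name
  event.set_heapUsed(heap->used());         // assumed setter name
  event.set_heapLive(heap->live());         // setter seen in the quoted patch
  event.commit();
}
```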
## Implementation

The implementation differs from GC to GC because each GC algorithm/implementation provides a slightly different way to track the liveness. The common part is the `size_t live() const` method added to the `CollectedHeap` superclass and the use of a cached 'liveness' value computed after the last GC cycle. If `liveness` hasn't been calculated yet, the implementation will default to returning the 'used' value.

The implementations are based on my (rather shallow) knowledge of the inner workings of the respective GC engines and I am open to suggestions to make them better/correct.

### Epsilon GC

Trivial implementation - just return `used()` instead.

### Serial GC

Here we utilize the fact that the mark-copy phase is naturally compacting, so the number of bytes after copy is 'live', and that the mark-sweep implementation keeps internal info about objects being 'dead' but excluded from the compaction effort, so we can use these numbers to derive the old-gen live set size (used bytes minus the cumulative size of the 'un-dead' objects).

### Parallel GC

For Parallel GC the liveness is calculated as the sum of used bytes in all regions after the last GC cycle. This seems to be a safe bet because this collector is always compacting (AFAIK).

### G1 GC

Using the `G1ConcurrentMark::remark()` method, the live set size is computed as a sum of `_live_words` from the associated `G1RegionMarkStats` objects. Here I am not 100% sure this approach covers all eventualities and it would be great to have someone skilled in the G1 implementation chime in so I can fix it. However, the numbers I am getting for G1 are comparable to other GCs for the same application.

### Shenandoah

In Shenandoah, the regions are keeping the liveness info. However, the VM op that is used for iterating regions is a safe-pointing one, so it would be great to run it in an already safe-pointed context. This leads to hooking into `ShenandoahConcurrentMark::finish_mark()` and `ShenandoahSTWMark::mark()`, where at the end of the marking process the liveness info is summarized and set to the `ShenandoahHeap::_live` volatile field - which is later read by the event emitting code.

### ZGC

`ZStatHeap` is already holding the liveness info - so this implementation is just making it accessible via the `ZCollectedHeap::live()` method.

-------------

Commit messages:
- 8258431: Provide a JFR event with live set size estimate

Changes: https://git.openjdk.java.net/jdk/pull/2579/files
Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2579&range=00
Issue: https://bugs.openjdk.java.net/browse/JDK-8258431
Stats: 177 lines in 33 files changed: 172 ins; 1 del; 4 mod
Patch: https://git.openjdk.java.net/jdk/pull/2579.diff
Fetch: git fetch https://git.openjdk.java.net/jdk pull/2579/head:pull/2579

PR: https://git.openjdk.java.net/jdk/pull/2579

From shade at openjdk.java.net Thu Feb 18 10:29:41 2021
From: shade at openjdk.java.net (Aleksey Shipilev)
Date: Thu, 18 Feb 2021 10:29:41 GMT
Subject: RFR: 8258431: Provide a JFR event with live set size estimate
In-Reply-To:
References: <7HVs4jngEbNIQIPQByuE6IRYAxdijfa82uhEFWHld5U=.a7784482-d7e1-4d59-88ee-455d8691631e@github.com>
Message-ID:

On Thu, 18 Feb 2021 10:23:37 GMT, Aleksey Shipilev wrote:

>> The purpose of this change is to expose a 'cheap' estimate of the current live set size (the meaning of 'current' is dependent on each particular GC implementation but in worst case 'at last full GC') in form of a periodically emitted JFR event.
>> >> ## Introducing new JFR event >> >> While there is already 'GC Heap Summary' JFR event it does not fit the requirements as it is closely tied to GC cycle so eg. for ZGC or Shenandoah it may not happen for quite a long time, increasing the risk of not having the heap summary events being present in the JFR recording at all. >> Because of this I am proposing to add a new 'Heap Usage Summary' event which will be emitted periodically, by default on each JFR chunk, and will contain the information abut the heap capacity, the used and live bytes. This information is available from all GC implementations and can be provided at literally any time. >> >> ## Implementation >> >> The implementation differs from GC to GC because each GC algorithm/implementation provides a slightly different way to track the liveness. The common part is `size_t live() const` method added to `CollectedHeap` superclass and the use of a cached 'liveness' value computed after the last GC cycle. If `liveness` hasn't been calculated yet the implementation will default to returning 'used' value. >> >> The implementations are based on my (rather shallow) knowledge of inner working of the respective GC engines and I am open to suggestions to make them better/correct. >> >> ### Epsilon GC >> >> Trivial implementation - just return `used()` instead. >> >> ### Serial GC >> >> Here we utilize the fact that mark-copy phase is naturally compacting so the number of bytes after copy is 'live' and that the mark-sweep implementation keeps an internal info about objects being 'dead' but excluded from the compaction effort and we can these numbers to derive the old-gen live set size (used bytes minus the cumulative size of the 'un-dead' objects). >> >> ### Parallel GC >> >> For Parallel GC the liveness is calculated as the sum of used bytes in all regions after the last GC cycle. This seems to be a safe bet because this collector is always compacting (AFAIK). >> >> ### G1 GC >> >> Using `G1ConcurrentMark::remark()` method the live set size is computed as a sum of `_live_words` from the associated `G1RegionMarkStats` objects. Here I am not 100% sure this approach covers all eventualities and it would be great to have someone skilled in G1 implementation to chime in so I can fix it. However, the numbers I am getting for G1 are comparable to other GCs for the same application. >> >> ### Shenandoah >> >> In Shenandoah, the regions are keeping the liveness info. However, the VM op that is used for iterating regions is a safe-pointing one so it would be great to run it in an already safe-pointed context. >> This leads to hooking into `ShenandoahConcurrentMark::finish_mark()` and `ShenandoahSTWMark::mark()` where at the end of the marking process the liveness info is summarized and set to `ShenandoahHeap::_live` volatile field - which is later read by the event emitting code. >> >> ### ZGC >> >> `ZStatHeap` is already holding the liveness info - so this implementation is just making it accessible via `ZCollectedHeap::live()` method. > > src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 627: > >> 625: >> 626: size_t ShenandoahHeap::live() const { >> 627: size_t live = Atomic::load_acquire(&_live); > > I understand you copy-pasted from the same file. We have removed `_acquire` with #2504. Do `Atomic::load` here. ...which also means you want to merge from master to get recent changes? 
------------- PR: https://git.openjdk.java.net/jdk/pull/2579 From shade at openjdk.java.net Thu Feb 18 10:29:40 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Thu, 18 Feb 2021 10:29:40 GMT Subject: RFR: 8258431: Provide a JFR event with live set size estimate In-Reply-To: <7HVs4jngEbNIQIPQByuE6IRYAxdijfa82uhEFWHld5U=.a7784482-d7e1-4d59-88ee-455d8691631e@github.com> References: <7HVs4jngEbNIQIPQByuE6IRYAxdijfa82uhEFWHld5U=.a7784482-d7e1-4d59-88ee-455d8691631e@github.com> Message-ID: On Mon, 15 Feb 2021 17:23:44 GMT, Jaroslav Bachorik wrote: > The purpose of this change is to expose a 'cheap' estimate of the current live set size (the meaning of 'current' is dependent on each particular GC implementation but in worst case 'at last full GC') in form of a periodically emitted JFR event. > > ## Introducing new JFR event > > While there is already 'GC Heap Summary' JFR event it does not fit the requirements as it is closely tied to GC cycle so eg. for ZGC or Shenandoah it may not happen for quite a long time, increasing the risk of not having the heap summary events being present in the JFR recording at all. > Because of this I am proposing to add a new 'Heap Usage Summary' event which will be emitted periodically, by default on each JFR chunk, and will contain the information abut the heap capacity, the used and live bytes. This information is available from all GC implementations and can be provided at literally any time. > > ## Implementation > > The implementation differs from GC to GC because each GC algorithm/implementation provides a slightly different way to track the liveness. The common part is `size_t live() const` method added to `CollectedHeap` superclass and the use of a cached 'liveness' value computed after the last GC cycle. If `liveness` hasn't been calculated yet the implementation will default to returning 'used' value. > > The implementations are based on my (rather shallow) knowledge of inner working of the respective GC engines and I am open to suggestions to make them better/correct. > > ### Epsilon GC > > Trivial implementation - just return `used()` instead. > > ### Serial GC > > Here we utilize the fact that mark-copy phase is naturally compacting so the number of bytes after copy is 'live' and that the mark-sweep implementation keeps an internal info about objects being 'dead' but excluded from the compaction effort and we can these numbers to derive the old-gen live set size (used bytes minus the cumulative size of the 'un-dead' objects). > > ### Parallel GC > > For Parallel GC the liveness is calculated as the sum of used bytes in all regions after the last GC cycle. This seems to be a safe bet because this collector is always compacting (AFAIK). > > ### G1 GC > > Using `G1ConcurrentMark::remark()` method the live set size is computed as a sum of `_live_words` from the associated `G1RegionMarkStats` objects. Here I am not 100% sure this approach covers all eventualities and it would be great to have someone skilled in G1 implementation to chime in so I can fix it. However, the numbers I am getting for G1 are comparable to other GCs for the same application. > > ### Shenandoah > > In Shenandoah, the regions are keeping the liveness info. However, the VM op that is used for iterating regions is a safe-pointing one so it would be great to run it in an already safe-pointed context. 
> This leads to hooking into `ShenandoahConcurrentMark::finish_mark()` and `ShenandoahSTWMark::mark()` where at the end of the marking process the liveness info is summarized and set to `ShenandoahHeap::_live` volatile field - which is later read by the event emitting code. > > ### ZGC > > `ZStatHeap` is already holding the liveness info - so this implementation is just making it accessible via `ZCollectedHeap::live()` method. Interesting! Cursory review follows. src/hotspot/share/gc/g1/g1CollectedHeap.cpp line 4578: > 4576: > 4577: void G1CollectedHeap::set_live(size_t bytes) { > 4578: Atomic::release_store(&_live_size, bytes); I don't think this requires `release_store`, regular `store` would be enough. G1 folks can say for sure. src/hotspot/share/gc/parallel/parallelScavengeHeap.hpp line 100: > 98: HeapWord* mem_allocate_old_gen(size_t size); > 99: > 100: Excess newline? src/hotspot/share/gc/shared/collectedHeap.hpp line 217: > 215: virtual size_t capacity() const = 0; > 216: virtual size_t used() const = 0; > 217: // a best-effort estimate of the live set size Suggestion: // Returns the estimate of live set size. Because live set changes over time, // this is a best-effort estimate by each of the implementations. These usually // are most precise right after the GC cycle. src/hotspot/share/gc/shared/genCollectedHeap.cpp line 1144: > 1142: _old_gen->prepare_for_compaction(&cp); > 1143: _young_gen->prepare_for_compaction(&cp); > 1144: Stray newline? src/hotspot/share/gc/shared/genCollectedHeap.hpp line 183: > 181: size_t live = _live_size; > 182: return live > 0 ? live : used(); > 183: }; I think the implementation belongs to `genCollectedHeap.cpp`. src/hotspot/share/gc/shared/generation.hpp line 140: > 138: virtual size_t used() const = 0; // The number of used bytes in the gen. > 139: virtual size_t free() const = 0; // The number of free bytes in the gen. > 140: virtual size_t live() const = 0; Needs a comment to match the lines above? Say, `// The estimate of live bytes in the gen.` src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 579: > 577: event.set_heapLive(heap->live()); > 578: event.commit(); > 579: } On the first sight, this belongs in `ShenandoahConcurrentMark::finish_mark()`. Placing the event here would fire the event when concurrent GC is cancelled, which is not what you want. src/hotspot/share/gc/shenandoah/shenandoahConcurrentMark.cpp line 265: > 263: ShenandoahHeap* const heap = ShenandoahHeap::heap(); > 264: heap->set_concurrent_mark_in_progress(false); > 265: heap->mark_finished(); Let's not rename this method. Introduce a new method, `ShenandoahHeap::update_live`, and call it every time after `mark_complete_marking_context()` is called. src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 627: > 625: > 626: size_t ShenandoahHeap::live() const { > 627: size_t live = Atomic::load_acquire(&_live); I understand you copy-pasted from the same file. We have removed `_acquire` with #2504. Do `Atomic::load` here. src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 655: > 653: > 654: void ShenandoahHeap::set_live(size_t bytes) { > 655: Atomic::release_store_fence(&_live, bytes); Same, do `Atomic::store` here. src/hotspot/share/gc/shenandoah/shenandoahHeap.inline.hpp line 494: > 492: mark_complete_marking_context(); > 493: > 494: class ShenandoahCollectLiveSizeClosure : public ShenandoahHeapRegionClosure { We don't usually use the in-method declarations like these, pull it out of the method. 
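For clarity, a minimal sketch of the quoted getter with the suggested plain load (only this accessor is shown; nothing else in the class changes):

```c++
// Plain load is sufficient here; no memory ordering is piggybacked on this read.
size_t ShenandoahHeap::live() const {
  return Atomic::load(&_live);
}
```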
src/hotspot/share/gc/shenandoah/shenandoahHeap.inline.hpp line 511: > 509: > 510: ShenandoahCollectLiveSizeClosure cl; > 511: heap_region_iterate(&cl); I think you want `parallel_heap_region_iterate` on this path, and do `Atomic::add(&_live, r->get_live_data_bytes())` in the closure. We shall see if this makes sense to make fully concurrently... src/hotspot/share/gc/epsilon/epsilonHeap.hpp line 80: > 78: virtual size_t capacity() const { return _virtual_space.committed_size(); } > 79: virtual size_t used() const { return _space->used(); } > 80: virtual size_t live() const { return used(); } I'd prefer to call `_space->used()` directly here. Minor optimization, I know. ------------- Changes requested by shade (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2579 From aph at redhat.com Thu Feb 18 11:35:54 2021 From: aph at redhat.com (Andrew Haley) Date: Thu, 18 Feb 2021 11:35:54 +0000 Subject: Flush Cache from Within Hotspot In-Reply-To: <22f38dda-f4b3-6dd8-09b7-487df03c761d@brown.edu> References: <22f38dda-f4b3-6dd8-09b7-487df03c761d@brown.edu> Message-ID: <22855e6a-9eb7-84a7-7148-e72d0b2f0127@redhat.com> On 17/02/2021 21:55, Sam Thomas wrote: > I am working on a project that explores different architecture > simulations using JDK14, and I am trying to flush the cache Which cache? There is more than one. > from the > source code. We are working with an aarch64 target and I have tried the > following to no avail: > > 1. Use the clflush C++ built-in instruction (this is an x86 instruction) > > 2. Dynamically allocate a large (2 MB) block of memory, and assign > random numbers to the array (Hotspot assertions do not allow for dynamic > memory allocation) > > 3. Allocate a large (2 MB) block of memory on the stack, and assign > random numbers to the array (this properly compiles, but the stack > overflows and a segmentation fault is raised) > > Has anyone done projects like this in the past, or is there a standard > for how to perform such a procedure? Thank you for your help! GCC (and LLVM) have __builtin___clear_cache (char *BEGIN, char *END) ... which clears all caches in the address space to the point of unification. Windows has FlushInstructionCache which does the same. I don't think there's any builtin that flushes the data caches without flushing the instruction caches too, to do that you need to use assembly language. DC CVAU is what you need. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From aph at redhat.com Thu Feb 18 11:37:21 2021 From: aph at redhat.com (Andrew Haley) Date: Thu, 18 Feb 2021 11:37:21 +0000 Subject: Flush Cache from Within Hotspot In-Reply-To: <22855e6a-9eb7-84a7-7148-e72d0b2f0127@redhat.com> References: <22f38dda-f4b3-6dd8-09b7-487df03c761d@brown.edu> <22855e6a-9eb7-84a7-7148-e72d0b2f0127@redhat.com> Message-ID: <384d9768-4bc4-85fa-6211-8d3e2ad17cbb@redhat.com> On 18/02/2021 11:35, Andrew Haley wrote: > GCC (and LLVM) have > __builtin___clear_cache (char *BEGIN, char *END) > > ... which clears all caches in the address ... region between BEGIN and END. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. 
https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From lkorinth at openjdk.java.net Thu Feb 18 11:46:40 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Thu, 18 Feb 2021 11:46:40 GMT Subject: RFR: 8260416: Remove unused method ReferenceProcessor::is_mt_processing_set_up() In-Reply-To: References: Message-ID: On Wed, 17 Feb 2021 06:41:04 GMT, Kim Barrett wrote: >> Code is not used. > > Marked as reviewed by kbarrett (Reviewer). Thanks Albert and Kim! ------------- PR: https://git.openjdk.java.net/jdk/pull/2591 From lkorinth at openjdk.java.net Thu Feb 18 11:46:41 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Thu, 18 Feb 2021 11:46:41 GMT Subject: Integrated: 8260416: Remove unused method ReferenceProcessor::is_mt_processing_set_up() In-Reply-To: References: Message-ID: On Tue, 16 Feb 2021 18:53:54 GMT, Leo Korinth wrote: > Code is not used. This pull request has now been integrated. Changeset: 1a7adc86 Author: Leo Korinth URL: https://git.openjdk.java.net/jdk/commit/1a7adc86 Stats: 7 lines in 2 files changed: 0 ins; 7 del; 0 mod 8260416: Remove unused method ReferenceProcessor::is_mt_processing_set_up() Reviewed-by: ayang, kbarrett ------------- PR: https://git.openjdk.java.net/jdk/pull/2591 From shade at openjdk.java.net Thu Feb 18 13:23:40 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Thu, 18 Feb 2021 13:23:40 GMT Subject: RFR: 8261473: Shenandoah: Add breakpoint suppoprt [v2] In-Reply-To: <9gXmTI0gU9zTr-HffSqSsVEVjmUED0rNINulpr_mjQM=.b353278c-a4f1-45f5-bf8d-ed1e33fbb0c9@github.com> References: <9gXmTI0gU9zTr-HffSqSsVEVjmUED0rNINulpr_mjQM=.b353278c-a4f1-45f5-bf8d-ed1e33fbb0c9@github.com> Message-ID: On Wed, 10 Feb 2021 20:13:52 GMT, Zhengyu Gu wrote: >> Please review this patch that adds breakpoint support for Shenandoah, that allows Shenandoah to access a few tests: >> >> gc/TestConcurrentGCBreakpoints.java >> gc/TestJNIWeak/TestJNIWeak.java >> gc/TestReferenceClearDuringMarking.java >> gc/TestReferenceClearDuringReferenceProcessing.java >> gc/TestReferenceRefersTo.java >> >> The drawback is that above tests can not run with passive mode, which can result tests to hang, as breakpoints only apply to concurrent GC. >> >> Test: >> - [x] hotspot_gc_shenandoah >> - [x] tier1 with Shenandoah > > Zhengyu Gu has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains three commits: > > - Merge > - update > - init update Looks fine, minor nits. src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 52: > 50: > 51: // Breakpoint support > 52: class ShenandoahConcurrentGCScope : public StackObj { Let's call these `ShenandoahGCBreakpointScope` and `ShenandoahMarkBreakpointScope`? src/hotspot/share/gc/shenandoah/shenandoahControlThread.cpp line 476: > 474: bool ShenandoahControlThread::is_async_gc(GCCause::Cause cause) const { > 475: return cause == GCCause::_wb_breakpoint; > 476: } Do we really need this method? What is "async gc" anyway? I think you can just inline the method at its only use. ------------- Marked as reviewed by shade (Reviewer). 
PR: https://git.openjdk.java.net/jdk/pull/2489 From zgu at openjdk.java.net Thu Feb 18 14:06:56 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Thu, 18 Feb 2021 14:06:56 GMT Subject: RFR: 8261473: Shenandoah: Add breakpoint suppoprt [v2] In-Reply-To: References: <9gXmTI0gU9zTr-HffSqSsVEVjmUED0rNINulpr_mjQM=.b353278c-a4f1-45f5-bf8d-ed1e33fbb0c9@github.com> Message-ID: On Thu, 18 Feb 2021 13:19:18 GMT, Aleksey Shipilev wrote: >> Zhengyu Gu has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains three commits: >> >> - Merge >> - update >> - init update > > src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 52: > >> 50: >> 51: // Breakpoint support >> 52: class ShenandoahConcurrentGCScope : public StackObj { > > Let's call these `ShenandoahGCBreakpointScope` and `ShenandoahMarkBreakpointScope`? Done > src/hotspot/share/gc/shenandoah/shenandoahControlThread.cpp line 476: > >> 474: bool ShenandoahControlThread::is_async_gc(GCCause::Cause cause) const { >> 475: return cause == GCCause::_wb_breakpoint; >> 476: } > > Do we really need this method? What is "async gc" anyway? I think you can just inline the method at its only use. Done ------------- PR: https://git.openjdk.java.net/jdk/pull/2489 From zgu at openjdk.java.net Thu Feb 18 14:06:54 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Thu, 18 Feb 2021 14:06:54 GMT Subject: RFR: 8261473: Shenandoah: Add breakpoint suppoprt [v3] In-Reply-To: References: Message-ID: > Please review this patch that adds breakpoint support for Shenandoah, that allows Shenandoah to access a few tests: > > gc/TestConcurrentGCBreakpoints.java > gc/TestJNIWeak/TestJNIWeak.java > gc/TestReferenceClearDuringMarking.java > gc/TestReferenceClearDuringReferenceProcessing.java > gc/TestReferenceRefersTo.java > > The drawback is that above tests can not run with passive mode, which can result tests to hang, as breakpoints only apply to concurrent GC. > > Test: > - [x] hotspot_gc_shenandoah > - [x] tier1 with Shenandoah Zhengyu Gu has updated the pull request incrementally with one additional commit since the last revision: Shade's comments and fixing a merge error ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2489/files - new: https://git.openjdk.java.net/jdk/pull/2489/files/2b88f7a6..4f19c4cd Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2489&range=02 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2489&range=01-02 Stats: 18 lines in 3 files changed: 2 ins; 7 del; 9 mod Patch: https://git.openjdk.java.net/jdk/pull/2489.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2489/head:pull/2489 PR: https://git.openjdk.java.net/jdk/pull/2489 From shade at openjdk.java.net Thu Feb 18 15:32:41 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Thu, 18 Feb 2021 15:32:41 GMT Subject: RFR: 8261473: Shenandoah: Add breakpoint suppoprt [v3] In-Reply-To: References: Message-ID: On Thu, 18 Feb 2021 14:06:54 GMT, Zhengyu Gu wrote: >> Please review this patch that adds breakpoint support for Shenandoah, that allows Shenandoah to access a few tests: >> >> gc/TestConcurrentGCBreakpoints.java >> gc/TestJNIWeak/TestJNIWeak.java >> gc/TestReferenceClearDuringMarking.java >> gc/TestReferenceClearDuringReferenceProcessing.java >> gc/TestReferenceRefersTo.java >> >> The drawback is that above tests can not run with passive mode, which can result tests to hang, as breakpoints only apply to concurrent GC. 
>> >> Test: >> - [x] hotspot_gc_shenandoah >> - [x] tier1 with Shenandoah > > Zhengyu Gu has updated the pull request incrementally with one additional commit since the last revision: > > Shade's comments and fixing a merge error Marked as reviewed by shade (Reviewer). src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 52: > 50: > 51: // Breakpoint support > 52: class ShenandoahBreakpointScope : public StackObj { Should probably be `ShenandoahBreakpointGCScope` to match that other "MarkScope". ------------- PR: https://git.openjdk.java.net/jdk/pull/2489 From lkorinth at openjdk.java.net Thu Feb 18 15:32:53 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Thu, 18 Feb 2021 15:32:53 GMT Subject: RFR: 8261799: Remove unnecessary cast in psParallelCompact.hpp Message-ID: <_FiQTtJC_l4VWbY5gIUZPcpPqCkaDfz4QHH3nylSPFM=.f636b4c3-c715-486e-a9a1-b14c7ef2fbd7@github.com> Unnecessary casts confuses me. ------------- Commit messages: - 8261799: Remove unnessesary cast in psParallelCompact.hpp Changes: https://git.openjdk.java.net/jdk/pull/2628/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2628&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261799 Stats: 2 lines in 1 file changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.java.net/jdk/pull/2628.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2628/head:pull/2628 PR: https://git.openjdk.java.net/jdk/pull/2628 From lkorinth at openjdk.java.net Thu Feb 18 15:42:46 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Thu, 18 Feb 2021 15:42:46 GMT Subject: RFR: 8261803: Remove unused TaskTerminator in g1 full gc ref proc executor Message-ID: 8261803: Remove unused TaskTerminator in g1 full gc ref proc executor ------------- Commit messages: - 8261803: Remove unused TaskTerminator in g1 full gc ref proc executor Changes: https://git.openjdk.java.net/jdk/pull/2629/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2629&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261803 Stats: 5 lines in 2 files changed: 0 ins; 2 del; 3 mod Patch: https://git.openjdk.java.net/jdk/pull/2629.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2629/head:pull/2629 PR: https://git.openjdk.java.net/jdk/pull/2629 From ayang at openjdk.java.net Thu Feb 18 15:43:39 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Thu, 18 Feb 2021 15:43:39 GMT Subject: RFR: 8261799: Remove unnecessary cast in psParallelCompact.hpp In-Reply-To: <_FiQTtJC_l4VWbY5gIUZPcpPqCkaDfz4QHH3nylSPFM=.f636b4c3-c715-486e-a9a1-b14c7ef2fbd7@github.com> References: <_FiQTtJC_l4VWbY5gIUZPcpPqCkaDfz4QHH3nylSPFM=.f636b4c3-c715-486e-a9a1-b14c7ef2fbd7@github.com> Message-ID: On Thu, 18 Feb 2021 15:26:38 GMT, Leo Korinth wrote: > Unnecessary casts confuses me. Marked as reviewed by ayang (Author). ------------- PR: https://git.openjdk.java.net/jdk/pull/2628 From ayang at openjdk.java.net Thu Feb 18 15:46:39 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Thu, 18 Feb 2021 15:46:39 GMT Subject: RFR: 8261803: Remove unused TaskTerminator in g1 full gc ref proc executor In-Reply-To: References: Message-ID: On Thu, 18 Feb 2021 15:37:30 GMT, Leo Korinth wrote: > 8261803: Remove unused TaskTerminator in g1 full gc ref proc executor Marked as reviewed by ayang (Author). 
------------- PR: https://git.openjdk.java.net/jdk/pull/2629 From zgu at openjdk.java.net Thu Feb 18 15:52:56 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Thu, 18 Feb 2021 15:52:56 GMT Subject: RFR: 8261473: Shenandoah: Add breakpoint suppoprt [v4] In-Reply-To: References: Message-ID: > Please review this patch that adds breakpoint support for Shenandoah, that allows Shenandoah to access a few tests: > > gc/TestConcurrentGCBreakpoints.java > gc/TestJNIWeak/TestJNIWeak.java > gc/TestReferenceClearDuringMarking.java > gc/TestReferenceClearDuringReferenceProcessing.java > gc/TestReferenceRefersTo.java > > The drawback is that above tests can not run with passive mode, which can result tests to hang, as breakpoints only apply to concurrent GC. > > Test: > - [x] hotspot_gc_shenandoah > - [x] tier1 with Shenandoah Zhengyu Gu has updated the pull request incrementally with one additional commit since the last revision: More renaming per @shade ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2489/files - new: https://git.openjdk.java.net/jdk/pull/2489/files/4f19c4cd..a9629c8f Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2489&range=03 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2489&range=02-03 Stats: 5 lines in 1 file changed: 0 ins; 0 del; 5 mod Patch: https://git.openjdk.java.net/jdk/pull/2489.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2489/head:pull/2489 PR: https://git.openjdk.java.net/jdk/pull/2489 From shade at openjdk.java.net Thu Feb 18 15:53:41 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Thu, 18 Feb 2021 15:53:41 GMT Subject: Integrated: 8261838: Shenandoah: reconsider heap region iterators memory ordering In-Reply-To: References: Message-ID: <62us8rPYIjJz0Im-fzSBkSgAdtRZUaB0UzvBAdVPMNA=.2127c9ef-271e-4d4e-b962-914ee54f906b@github.com> On Tue, 16 Feb 2021 19:13:03 GMT, Aleksey Shipilev wrote: > We use CASes to distributed workers between regions. Hotspot's default for atomic operations is memory_order_conservative, which emits two-way memory fences around the CASes at least on AArch64 and PPC64. > > This seems to be excessive for region distribution code, and "relaxed" is enough, since we don't piggyback memory ordering on these. > > This also calls for some refactoring in the code itself. > > Additional testing: > - [x] `hotspot_gc_shenandoah` > - [ ] Ad-hoc performance runs This pull request has now been integrated. 
Changeset: fd098e71 Author: Aleksey Shipilev URL: https://git.openjdk.java.net/jdk/commit/fd098e71 Stats: 24 lines in 4 files changed: 2 ins; 3 del; 19 mod 8261838: Shenandoah: reconsider heap region iterators memory ordering Reviewed-by: rkennke ------------- PR: https://git.openjdk.java.net/jdk/pull/2593 From shade at openjdk.java.net Thu Feb 18 15:55:40 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Thu, 18 Feb 2021 15:55:40 GMT Subject: RFR: 8261473: Shenandoah: Add breakpoint suppoprt [v4] In-Reply-To: References: Message-ID: <0DbP2gE5bq0tJ2vwI6sJfWfjBd_5HmyVSdRsaHyhoMg=.b832313e-a0b7-47ef-9d0c-58579ea3e7b1@github.com> On Thu, 18 Feb 2021 15:52:56 GMT, Zhengyu Gu wrote: >> Please review this patch that adds breakpoint support for Shenandoah, that allows Shenandoah to access a few tests: >> >> gc/TestConcurrentGCBreakpoints.java >> gc/TestJNIWeak/TestJNIWeak.java >> gc/TestReferenceClearDuringMarking.java >> gc/TestReferenceClearDuringReferenceProcessing.java >> gc/TestReferenceRefersTo.java >> >> The drawback is that above tests can not run with passive mode, which can result tests to hang, as breakpoints only apply to concurrent GC. >> >> Test: >> - [x] hotspot_gc_shenandoah >> - [x] tier1 with Shenandoah > > Zhengyu Gu has updated the pull request incrementally with one additional commit since the last revision: > > More renaming per @shade Marked as reviewed by shade (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2489 From zgu at openjdk.java.net Thu Feb 18 18:34:44 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Thu, 18 Feb 2021 18:34:44 GMT Subject: Integrated: 8261473: Shenandoah: Add breakpoint support In-Reply-To: References: Message-ID: On Tue, 9 Feb 2021 21:19:11 GMT, Zhengyu Gu wrote: > Please review this patch that adds breakpoint support for Shenandoah, that allows Shenandoah to access a few tests: > > gc/TestConcurrentGCBreakpoints.java > gc/TestJNIWeak/TestJNIWeak.java > gc/TestReferenceClearDuringMarking.java > gc/TestReferenceClearDuringReferenceProcessing.java > gc/TestReferenceRefersTo.java > > The drawback is that above tests can not run with passive mode, which can result tests to hang, as breakpoints only apply to concurrent GC. > > Test: > - [x] hotspot_gc_shenandoah > - [x] tier1 with Shenandoah This pull request has now been integrated. Changeset: 9cf4f90d Author: Zhengyu Gu URL: https://git.openjdk.java.net/jdk/commit/9cf4f90d Stats: 165 lines in 7 files changed: 153 ins; 2 del; 10 mod 8261473: Shenandoah: Add breakpoint support Reviewed-by: rkennke, shade ------------- PR: https://git.openjdk.java.net/jdk/pull/2489 From zgu at openjdk.java.net Thu Feb 18 21:08:46 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Thu, 18 Feb 2021 21:08:46 GMT Subject: RFR: 8261984: Shenandoah: Remove unused ShenandoahPushWorkerQueuesScope class Message-ID: Please review this trivial change that removes unused ShenandoahPushWorkerQueuesScope class. 
------------- Commit messages: - 8261984: Shenandoah: Remove unused ShenandoahPushWorkerQueuesScope class Changes: https://git.openjdk.java.net/jdk/pull/2632/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2632&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261984 Stats: 21 lines in 2 files changed: 0 ins; 19 del; 2 mod Patch: https://git.openjdk.java.net/jdk/pull/2632.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2632/head:pull/2632 PR: https://git.openjdk.java.net/jdk/pull/2632 From stefank at openjdk.java.net Thu Feb 18 21:27:44 2021 From: stefank at openjdk.java.net (Stefan Karlsson) Date: Thu, 18 Feb 2021 21:27:44 GMT Subject: RFR: 8261799: Remove unnecessary cast in psParallelCompact.hpp In-Reply-To: <_FiQTtJC_l4VWbY5gIUZPcpPqCkaDfz4QHH3nylSPFM=.f636b4c3-c715-486e-a9a1-b14c7ef2fbd7@github.com> References: <_FiQTtJC_l4VWbY5gIUZPcpPqCkaDfz4QHH3nylSPFM=.f636b4c3-c715-486e-a9a1-b14c7ef2fbd7@github.com> Message-ID: On Thu, 18 Feb 2021 15:26:38 GMT, Leo Korinth wrote: > Unnecessary casts confuses me. Marked as reviewed by stefank (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2628 From tschatzl at openjdk.java.net Thu Feb 18 23:06:38 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Thu, 18 Feb 2021 23:06:38 GMT Subject: RFR: 8261803: Remove unused TaskTerminator in g1 full gc ref proc executor In-Reply-To: References: Message-ID: <72rVWOBACSiwuThBbz6GlG2erbvhW3EsYthSuwbgFhY=.8a943c88-7a30-4eee-b776-e0ce07d4d821@github.com> On Thu, 18 Feb 2021 15:37:30 GMT, Leo Korinth wrote: > 8261803: Remove unused TaskTerminator in g1 full gc ref proc executor Marked as reviewed by tschatzl (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2629 From kbarrett at openjdk.java.net Fri Feb 19 02:53:56 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Fri, 19 Feb 2021 02:53:56 GMT Subject: RFR: 8261905: Move implementation of OopStorage num_dead related functions [v2] In-Reply-To: References: Message-ID: On Wed, 17 Feb 2021 15:39:02 GMT, Albert Mingkun Yang wrote: >> Kim Barrett has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: >> >> - Merge branch 'master' into move_num_dead >> - move num_dead functions > > Marked as reviewed by ayang (Author). Thanks @albertnetymk and @tschatzl for reviews. ------------- PR: https://git.openjdk.java.net/jdk/pull/2608 From kbarrett at openjdk.java.net Fri Feb 19 02:53:55 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Fri, 19 Feb 2021 02:53:55 GMT Subject: RFR: 8261905: Move implementation of OopStorage num_dead related functions [v2] In-Reply-To: References: Message-ID: > Please review this trivial change which just moves several functions to a > different location in the same file. The old location is in the middle of > some unrelated functionality. > > Testing: > mach5 tier1 Kim Barrett has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. 
The pull request contains two additional commits since the last revision: - Merge branch 'master' into move_num_dead - move num_dead functions ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2608/files - new: https://git.openjdk.java.net/jdk/pull/2608/files/baa9f5d2..578009f4 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2608&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2608&range=00-01 Stats: 947 lines in 49 files changed: 661 ins; 155 del; 131 mod Patch: https://git.openjdk.java.net/jdk/pull/2608.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2608/head:pull/2608 PR: https://git.openjdk.java.net/jdk/pull/2608 From kbarrett at openjdk.java.net Fri Feb 19 02:53:57 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Fri, 19 Feb 2021 02:53:57 GMT Subject: Integrated: 8261905: Move implementation of OopStorage num_dead related functions In-Reply-To: References: Message-ID: <-F3VzaS6AJrLtGZ52t6viHGKRzatvd0u5d_x8Smyp6I=.a78938fd-748a-4bda-ae57-e22c0422c934@github.com> On Wed, 17 Feb 2021 15:18:40 GMT, Kim Barrett wrote: > Please review this trivial change which just moves several functions to a > different location in the same file. The old location is in the middle of > some unrelated functionality. > > Testing: > mach5 tier1 This pull request has now been integrated. Changeset: 7e78c777 Author: Kim Barrett URL: https://git.openjdk.java.net/jdk/commit/7e78c777 Stats: 30 lines in 1 file changed: 15 ins; 15 del; 0 mod 8261905: Move implementation of OopStorage num_dead related functions Reviewed-by: ayang, tschatzl ------------- PR: https://git.openjdk.java.net/jdk/pull/2608 From shade at openjdk.java.net Fri Feb 19 06:13:38 2021 From: shade at openjdk.java.net (Aleksey Shipilev) Date: Fri, 19 Feb 2021 06:13:38 GMT Subject: RFR: 8261984: Shenandoah: Remove unused ShenandoahPushWorkerQueuesScope class In-Reply-To: References: Message-ID: On Thu, 18 Feb 2021 21:03:34 GMT, Zhengyu Gu wrote: > Please review this trivial change that removes unused ShenandoahPushWorkerQueuesScope class. Marked as reviewed by shade (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2632 From ayang at openjdk.java.net Fri Feb 19 08:36:40 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Fri, 19 Feb 2021 08:36:40 GMT Subject: RFR: 8258431: Provide a JFR event with live set size estimate In-Reply-To: <7HVs4jngEbNIQIPQByuE6IRYAxdijfa82uhEFWHld5U=.a7784482-d7e1-4d59-88ee-455d8691631e@github.com> References: <7HVs4jngEbNIQIPQByuE6IRYAxdijfa82uhEFWHld5U=.a7784482-d7e1-4d59-88ee-455d8691631e@github.com> Message-ID: On Mon, 15 Feb 2021 17:23:44 GMT, Jaroslav Bachorik wrote: > The purpose of this change is to expose a 'cheap' estimate of the current live set size (the meaning of 'current' is dependent on each particular GC implementation but in worst case 'at last full GC') in form of a periodically emitted JFR event. > > ## Introducing new JFR event > > While there is already 'GC Heap Summary' JFR event it does not fit the requirements as it is closely tied to GC cycle so eg. for ZGC or Shenandoah it may not happen for quite a long time, increasing the risk of not having the heap summary events being present in the JFR recording at all. > Because of this I am proposing to add a new 'Heap Usage Summary' event which will be emitted periodically, by default on each JFR chunk, and will contain the information abut the heap capacity, the used and live bytes. 
This information is available from all GC implementations and can be provided at literally any time. > > ## Implementation > > The implementation differs from GC to GC because each GC algorithm/implementation provides a slightly different way to track the liveness. The common part is `size_t live() const` method added to `CollectedHeap` superclass and the use of a cached 'liveness' value computed after the last GC cycle. If `liveness` hasn't been calculated yet the implementation will default to returning 'used' value. > > The implementations are based on my (rather shallow) knowledge of inner working of the respective GC engines and I am open to suggestions to make them better/correct. > > ### Epsilon GC > > Trivial implementation - just return `used()` instead. > > ### Serial GC > > Here we utilize the fact that mark-copy phase is naturally compacting so the number of bytes after copy is 'live' and that the mark-sweep implementation keeps an internal info about objects being 'dead' but excluded from the compaction effort and we can these numbers to derive the old-gen live set size (used bytes minus the cumulative size of the 'un-dead' objects). > > ### Parallel GC > > For Parallel GC the liveness is calculated as the sum of used bytes in all regions after the last GC cycle. This seems to be a safe bet because this collector is always compacting (AFAIK). > > ### G1 GC > > Using `G1ConcurrentMark::remark()` method the live set size is computed as a sum of `_live_words` from the associated `G1RegionMarkStats` objects. Here I am not 100% sure this approach covers all eventualities and it would be great to have someone skilled in G1 implementation to chime in so I can fix it. However, the numbers I am getting for G1 are comparable to other GCs for the same application. > > ### Shenandoah > > In Shenandoah, the regions are keeping the liveness info. However, the VM op that is used for iterating regions is a safe-pointing one so it would be great to run it in an already safe-pointed context. > This leads to hooking into `ShenandoahConcurrentMark::finish_mark()` and `ShenandoahSTWMark::mark()` where at the end of the marking process the liveness info is summarized and set to `ShenandoahHeap::_live` volatile field - which is later read by the event emitting code. > > ### ZGC > > `ZStatHeap` is already holding the liveness info - so this implementation is just making it accessible via `ZCollectedHeap::live()` method. Additionally, some test(s) on this new feature would be nice. Maybe you can add sth in `HeapSummaryEventAllGcs`? PS: I was looking into how to get periodic heap usage info just a few days ago, and settled for `MemProfiling` as a workaround. Thank you for the patch. src/hotspot/share/jfr/periodic/jfrPeriodic.cpp line 649: > 647: TRACE_REQUEST_FUNC(HeapUsageSummary) { > 648: EventHeapUsageSummary event; > 649: if (event.should_commit()) { I believe the `should_commit` check is not needed; the period check is handle by the caller. src/hotspot/share/gc/parallel/parallelScavengeHeap.hpp line 79: > 77: size_t _young_live; > 78: size_t _eden_live; > 79: size_t _old_live; It's only the sum that's ever exposed, right? I wonder if it makes sense to merge them into one var to only track the sum. ------------- Changes requested by ayang (Author). 
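A rough sketch of that last suggestion, with illustrative names (not taken from the patch), keeping the "fall back to `used()`" behavior described in the proposal:

```c++
// Inside ParallelScavengeHeap (illustrative field/method names):
volatile size_t _live_bytes; // single sum, replaces _young_live / _eden_live / _old_live

void set_live(size_t bytes) { _live_bytes = bytes; } // set once after each GC cycle

size_t live() const {
  size_t live = _live_bytes;
  return live > 0 ? live : used(); // fall back to used() before the first cycle
}
```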
PR: https://git.openjdk.java.net/jdk/pull/2579 From sjohanss at openjdk.java.net Fri Feb 19 08:39:38 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Fri, 19 Feb 2021 08:39:38 GMT Subject: RFR: 8261803: Remove unused TaskTerminator in g1 full gc ref proc executor In-Reply-To: References: Message-ID: On Thu, 18 Feb 2021 15:37:30 GMT, Leo Korinth wrote: > 8261803: Remove unused TaskTerminator in g1 full gc ref proc executor Thanks for cleaning this up. ------------- Marked as reviewed by sjohanss (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2629 From github.com+779991+jaokim at openjdk.java.net Fri Feb 19 09:30:03 2021 From: github.com+779991+jaokim at openjdk.java.net (Joakim =?UTF-8?B?Tm9yZHN0csO2bQ==?=) Date: Fri, 19 Feb 2021 09:30:03 GMT Subject: RFR: 8242032: G1 region remembered sets may contain non-coarse level PRTs for already coarsened regions Message-ID: <4IlVuwhMhFGZty7e8RXnSx_E57BJ9tVAhgCpL6pc7ts=.c7457983-075d-4c7b-a6d5-81e48ce54f2a@github.com> This fix adds a check for coarsened region in mutex guarded section, when adding a reference to a remembered set. Haven't been able to produce a testcase -- please advice on how to, or if not necessary. **Testing:** * hs-tier, hs-tier2 ------------- Commit messages: - Added check for coarsened region in mutex guarded section. Changes: https://git.openjdk.java.net/jdk/pull/2545/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2545&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8242032 Stats: 9 lines in 1 file changed: 9 ins; 0 del; 0 mod Patch: https://git.openjdk.java.net/jdk/pull/2545.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2545/head:pull/2545 PR: https://git.openjdk.java.net/jdk/pull/2545 From github.com+779991+jaokim at openjdk.java.net Fri Feb 19 11:26:17 2021 From: github.com+779991+jaokim at openjdk.java.net (Joakim =?UTF-8?B?Tm9yZHN0csO2bQ==?=) Date: Fri, 19 Feb 2021 11:26:17 GMT Subject: RFR: 8242032: G1 region remembered sets may contain non-coarse level PRTs for already coarsened regions [v2] In-Reply-To: <4IlVuwhMhFGZty7e8RXnSx_E57BJ9tVAhgCpL6pc7ts=.c7457983-075d-4c7b-a6d5-81e48ce54f2a@github.com> References: <4IlVuwhMhFGZty7e8RXnSx_E57BJ9tVAhgCpL6pc7ts=.c7457983-075d-4c7b-a6d5-81e48ce54f2a@github.com> Message-ID: > This fix adds a check for coarsened region in mutex guarded section, when adding a reference to a remembered set. > > Haven't been able to produce a testcase -- please advice on how to, or if not necessary. 
> > **Testing:** > * hs-tier, hs-tier2 Joakim Nordstr?m has updated the pull request incrementally with two additional commits since the last revision: - Clarified comment on re-checking coarsening - Minor typo ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2545/files - new: https://git.openjdk.java.net/jdk/pull/2545/files/7398c89b..4bc0f4b7 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2545&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2545&range=00-01 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2545.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2545/head:pull/2545 PR: https://git.openjdk.java.net/jdk/pull/2545 From ayang at openjdk.java.net Fri Feb 19 11:26:18 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Fri, 19 Feb 2021 11:26:18 GMT Subject: RFR: 8242032: G1 region remembered sets may contain non-coarse level PRTs for already coarsened regions [v2] In-Reply-To: References: <4IlVuwhMhFGZty7e8RXnSx_E57BJ9tVAhgCpL6pc7ts=.c7457983-075d-4c7b-a6d5-81e48ce54f2a@github.com> Message-ID: On Fri, 19 Feb 2021 10:59:45 GMT, Joakim Nordstr?m wrote: >> This fix adds a check for coarsened region in mutex guarded section, when adding a reference to a remembered set. >> >> Haven't been able to produce a testcase -- please advice on how to, or if not necessary. >> >> **Testing:** >> * hs-tier, hs-tier2 > > Joakim Nordstr?m has updated the pull request incrementally with two additional commits since the last revision: > > - Clarified comment on re-checking coarsening > - Minor typo Marked as reviewed by ayang (Author). src/hotspot/share/gc/g1/heapRegionRemSet.cpp line 149: > 147: MutexLocker x(_m, Mutex::_no_safepoint_check_flag); > 148: > 149: // Make sure region hasn't been coarsened by other thread. Maybe mentioning this is an intentional re-check under lock, sth like, "Rechecking if the region is coarsened while holding the lock." ------------- PR: https://git.openjdk.java.net/jdk/pull/2545 From github.com+779991+jaokim at openjdk.java.net Fri Feb 19 11:26:19 2021 From: github.com+779991+jaokim at openjdk.java.net (Joakim =?UTF-8?B?Tm9yZHN0csO2bQ==?=) Date: Fri, 19 Feb 2021 11:26:19 GMT Subject: RFR: 8242032: G1 region remembered sets may contain non-coarse level PRTs for already coarsened regions [v2] In-Reply-To: References: <4IlVuwhMhFGZty7e8RXnSx_E57BJ9tVAhgCpL6pc7ts=.c7457983-075d-4c7b-a6d5-81e48ce54f2a@github.com> Message-ID: On Fri, 19 Feb 2021 10:38:35 GMT, Albert Mingkun Yang wrote: >> Joakim Nordstr?m has updated the pull request incrementally with two additional commits since the last revision: >> >> - Clarified comment on re-checking coarsening >> - Minor typo > > src/hotspot/share/gc/g1/heapRegionRemSet.cpp line 149: > >> 147: MutexLocker x(_m, Mutex::_no_safepoint_check_flag); >> 148: >> 149: // Make sure region hasn't been coarsened by other thread. > > Maybe mentioning this is an intentional re-check under lock, sth like, "Rechecking if the region is coarsened while holding the lock." Yes, thank you! 
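The pattern being discussed, sketched roughly (the coarse-map query and the surrounding function are simplified; the helper name is illustrative rather than copied from the patch):

```c++
// Roughly what OtherRegionsTable::add_reference() does with the fix:
if (is_region_coarsened(from_hrm_ind)) {
  return; // already covered by a coarse-grained entry, nothing to add
}

MutexLocker x(_m, Mutex::_no_safepoint_check_flag);

// Re-check while holding the lock: another thread may have coarsened this
// region between the unlocked check above and acquiring _m.
if (is_region_coarsened(from_hrm_ind)) {
  return;
}
// ... proceed to add the fine-grained PRT entry ...
```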
-------------

PR: https://git.openjdk.java.net/jdk/pull/2545

From zgu at openjdk.java.net Fri Feb 19 13:46:39 2021
From: zgu at openjdk.java.net (Zhengyu Gu)
Date: Fri, 19 Feb 2021 13:46:39 GMT
Subject: Integrated: 8261984: Shenandoah: Remove unused ShenandoahPushWorkerQueuesScope class
In-Reply-To:
References:
Message-ID:

On Thu, 18 Feb 2021 21:03:34 GMT, Zhengyu Gu wrote:

> Please review this trivial change that removes unused ShenandoahPushWorkerQueuesScope class.

This pull request has now been integrated.

Changeset: 55463b04
Author: Zhengyu Gu
URL: https://git.openjdk.java.net/jdk/commit/55463b04
Stats: 21 lines in 2 files changed: 0 ins; 19 del; 2 mod

8261984: Shenandoah: Remove unused ShenandoahPushWorkerQueuesScope class

Reviewed-by: shade

-------------

PR: https://git.openjdk.java.net/jdk/pull/2632

From zgu at openjdk.java.net Fri Feb 19 13:58:59 2021
From: zgu at openjdk.java.net (Zhengyu Gu)
Date: Fri, 19 Feb 2021 13:58:59 GMT
Subject: RFR: 8261973: Shenandoah: Cleanup/simplify root verifier
Message-ID:

Root processing has gone through significant changes. For example, we used to mark through weak roots when class unloading is off; that is no longer the case, and OopStorages also simplify the roots. The Shenandoah root verifier can be simplified into 2 cases: with and without class unloading.

- [x] hotspot_gc_shenandoah with -XX:+ShenandoahVerify

-------------

Commit messages:
- update
- 8261973

Changes: https://git.openjdk.java.net/jdk/pull/2643/files
Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2643&range=00
Issue: https://bugs.openjdk.java.net/browse/JDK-8261973
Stats: 198 lines in 4 files changed: 10 ins; 159 del; 29 mod
Patch: https://git.openjdk.java.net/jdk/pull/2643.diff
Fetch: git fetch https://git.openjdk.java.net/jdk pull/2643/head:pull/2643

PR: https://git.openjdk.java.net/jdk/pull/2643

From kim.barrett at oracle.com Fri Feb 19 14:01:14 2021
From: kim.barrett at oracle.com (Kim Barrett)
Date: Fri, 19 Feb 2021 14:01:14 +0000
Subject: RFR: 8254239: G1ConcurrentMark.hpp unnecessarily disables MSVC++ warning 4522
In-Reply-To:
References:
Message-ID:

> On Feb 17, 2021, at 8:42 PM, Andrey Vershinin wrote:
>
> Hello everybody,
>
> My name is Andrey Vershinin, I've chosen to fix JDK-8254239 as my first
> contribution to the project. I'm a Java developer, with some C++ knowledge
> (in the process of improving it).
> My goal is to gain a deeper understanding of the inner workings of the
> platform I'm interested in, its concepts and the code itself, and
> contribute to the best of my ability.
> The bug I've selected is a 'starter' one, to get involved in the process.
> The patch is attached below.

The normal way to make openjdk changes now is via github pull requests, rather than patches in email. But there are some preliminary steps as well. In particular, have you signed the OCA? I suggest you take a look here:

https://openjdk.java.net/guide/
https://openjdk.java.net/guide/#i-have-a-patch-what-do-i-do

When you get to the point of finding a sponsor, I can do that.
> Thanks, > Andrey > > =================================================================== > diff --git a/src/hotspot/share/gc/g1/g1ConcurrentMark.hpp > b/src/hotspot/share/gc/g1/g1ConcurrentMark.hpp > --- a/src/hotspot/share/gc/g1/g1ConcurrentMark.hpp (revision > 06348dfcae0b6b82970e8c56391396affd311f90) > +++ b/src/hotspot/share/gc/g1/g1ConcurrentMark.hpp (revision > e635cee530e414503d6e84261ce636123d282ee9) > @@ -50,10 +50,6 @@ > class G1SurvivorRegions; > class ThreadClosure; > > -PRAGMA_DIAG_PUSH > -// warning C4522: multiple assignment operators specified > -PRAGMA_DISABLE_MSVC_WARNING(4522) > - > // This is a container class for either an oop or a continuation address > for > // mark stack entries. Both are pushed onto the mark stack. > class G1TaskQueueEntry { > @@ -89,8 +85,6 @@ > bool is_null() const { return _holder == NULL; } > }; > > -PRAGMA_DIAG_POP > - > typedef GenericTaskQueue G1CMTaskQueue; > typedef GenericTaskQueueSet G1CMTaskQueueSet; From github.com+31506961+vshining at openjdk.java.net Fri Feb 19 15:09:47 2021 From: github.com+31506961+vshining at openjdk.java.net (Andrey Vershinin) Date: Fri, 19 Feb 2021 15:09:47 GMT Subject: RFR: JDK-8254239: G1ConcurrentMark.hpp unnecessarily disables MSVC++ warning 4522 Message-ID: <5QLZk5Yxoc3tVFvm0sKBQZ4KGgEG-obeHlxRPOza0yI=.c53526e0-80cc-4bdd-a765-e8590988cf99@github.com> This is a simple change removing disabling of MSVC++ warning 4522. Since it only affects build process, no tests were ran. ------------- Commit messages: - 8254239: G1ConcurrentMark.hpp unnecessarily disables MSVC++ warning 4522 Changes: https://git.openjdk.java.net/jdk/pull/2646/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2646&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8254239 Stats: 6 lines in 1 file changed: 0 ins; 6 del; 0 mod Patch: https://git.openjdk.java.net/jdk/pull/2646.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2646/head:pull/2646 PR: https://git.openjdk.java.net/jdk/pull/2646 From aver.shining at gmail.com Fri Feb 19 15:11:54 2021 From: aver.shining at gmail.com (=?UTF-8?B?0JDQvdC00YDQtdC5INCS0LXRgNGI0LjQvdC40L0=?=) Date: Fri, 19 Feb 2021 18:11:54 +0300 Subject: RFR: 8254239: G1ConcurrentMark.hpp unnecessarily disables MSVC++ warning 4522 In-Reply-To: References: Message-ID: Hello Kim, I've signed the OCA with my Github username written in. PR: https://github.com/openjdk/jdk/pull/2646 I would be grateful if you could sponsor this ??, 19 ????. 2021 ?. ? 17:01, Kim Barrett : > > > On Feb 17, 2021, at 8:42 PM, ?????? ???????? > wrote: > > > > Hello everybody, > > > > My name is Andrey Vershinin, I've chosen to fix JDK-8254239 as my first > > contribution to the project. I'm a Java developer, with some C++ > knowledge > > (in the process of improving it). > > My goal is to gain a deeper understanding of the inner workings of the > > platform I'm interested in, its concepts and the code itself, and > > contribute to the best of my ability. > > The bug I've selected is a 'starter' one, to get involved in the process. > > The patch is attached below. > > The normal way to make openjdk changes now is via github pull > requests, rather than patches in email. But there are some preliminary > steps as well. In particular, have you signed the OCA? I suggest you > take a look here: > > https://openjdk.java.net/guide/ > https://openjdk.java.net/guide/#i-have-a-patch-what-do-i-do > > When you get to the point of finding a sponser, I can do that. 
> > > Thanks, > > Andrey > > > > =================================================================== > > diff --git a/src/hotspot/share/gc/g1/g1ConcurrentMark.hpp > > b/src/hotspot/share/gc/g1/g1ConcurrentMark.hpp > > --- a/src/hotspot/share/gc/g1/g1ConcurrentMark.hpp (revision > > 06348dfcae0b6b82970e8c56391396affd311f90) > > +++ b/src/hotspot/share/gc/g1/g1ConcurrentMark.hpp (revision > > e635cee530e414503d6e84261ce636123d282ee9) > > @@ -50,10 +50,6 @@ > > class G1SurvivorRegions; > > class ThreadClosure; > > > > -PRAGMA_DIAG_PUSH > > -// warning C4522: multiple assignment operators specified > > -PRAGMA_DISABLE_MSVC_WARNING(4522) > > - > > // This is a container class for either an oop or a continuation address > > for > > // mark stack entries. Both are pushed onto the mark stack. > > class G1TaskQueueEntry { > > @@ -89,8 +85,6 @@ > > bool is_null() const { return _holder == NULL; } > > }; > > > > -PRAGMA_DIAG_POP > > - > > typedef GenericTaskQueue G1CMTaskQueue; > > typedef GenericTaskQueueSet G1CMTaskQueueSet; > > From iklam at openjdk.java.net Fri Feb 19 17:21:48 2021 From: iklam at openjdk.java.net (Ioi Lam) Date: Fri, 19 Feb 2021 17:21:48 GMT Subject: RFR: JDK-8254239: G1ConcurrentMark.hpp unnecessarily disables MSVC++ warning 4522. In-Reply-To: <5QLZk5Yxoc3tVFvm0sKBQZ4KGgEG-obeHlxRPOza0yI=.c53526e0-80cc-4bdd-a765-e8590988cf99@github.com> References: <5QLZk5Yxoc3tVFvm0sKBQZ4KGgEG-obeHlxRPOza0yI=.c53526e0-80cc-4bdd-a765-e8590988cf99@github.com> Message-ID: On Fri, 19 Feb 2021 15:05:24 GMT, Andrey Vershinin wrote: > This is a simple change removing disabling of MSVC++ warning 4522. Since it only affects build process, no tests were ran. Pre-submit test was skipped because: > Testing is not configured > In order to run pre-submit tests, the source repository must be properly configured to allow test execution. See https://wiki.openjdk.java.net/display/SKARA/Testing for more information on how to configure this. Since this is a build change, please enable pre-submit testing to make sure it doesn't break anything. ------------- PR: https://git.openjdk.java.net/jdk/pull/2646 From rkennke at openjdk.java.net Fri Feb 19 19:53:52 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Fri, 19 Feb 2021 19:53:52 GMT Subject: RFR: 8262049: [TESTBUG] Shenandoah: Adjustments in TestReferenceRefersTo.java for IU mode Message-ID: Shenandoah's IU mode allows referents to be cleared even when accessed during concurrent marking. The test TestReferenceRefersTo.java needs to be adjusted to allow for that. 
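To make the behavioural difference behind this adjustment concrete, here is a toy, self-contained sketch (illustrative only, not Shenandoah code) of why a referent read during concurrent marking is kept alive under a SATB-style keep-alive barrier but may still be cleared under an incremental-update (IU) scheme.

```
// Toy model of the difference discussed above: a SATB-style keep-alive
// barrier marks a weak referent when it is loaded during concurrent marking,
// so that cycle cannot clear the reference; an IU-style scheme makes no such
// promise for a plain load.
#include <cstdio>

enum class MarkingMode { SATB, IU };

struct Referent {
  bool marked_live = false;
};

// Models what the GC barrier does when Reference.get() is called while
// concurrent marking is running.
Referent* load_referent_during_marking(Referent* referent, MarkingMode mode) {
  if (mode == MarkingMode::SATB && referent != nullptr) {
    referent->marked_live = true;   // keep-alive: the loaded referent is treated as reachable
  }
  return referent;                  // IU: the load alone says nothing about liveness
}

int main() {
  Referent r_iu, r_satb;
  load_referent_during_marking(&r_iu, MarkingMode::IU);
  load_referent_during_marking(&r_satb, MarkingMode::SATB);
  std::printf("IU   referent marked live: %d (may still be cleared)\n", r_iu.marked_live);
  std::printf("SATB referent marked live: %d (will not be cleared by this cycle)\n", r_satb.marked_live);
  return 0;
}
```

This is why the shared test cannot unconditionally assert that an accessed referent is still present: the expected outcome depends on the marking mode, which is what splitting the test into a generic part and mode-specific variants addresses.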
Test: - [x] TestReferenceRefersTo.java + Shenandoah/IU - [x] TestReferenceRefersTo.java + Shenandoah/SATB - [x] TestReferenceRefersTo.java + G1 ------------- Commit messages: - 8262049: [TESTBUG] Shenandoah: Adjustments in TestReferenceRefersTo.java for IU mode Changes: https://git.openjdk.java.net/jdk/pull/2653/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2653&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8262049 Stats: 18 lines in 1 file changed: 12 ins; 0 del; 6 mod Patch: https://git.openjdk.java.net/jdk/pull/2653.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2653/head:pull/2653 PR: https://git.openjdk.java.net/jdk/pull/2653 From kbarrett at openjdk.java.net Sat Feb 20 09:55:50 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Sat, 20 Feb 2021 09:55:50 GMT Subject: RFR: 8228748: Remove GCLocker::_doing_gc In-Reply-To: References: Message-ID: On Wed, 17 Feb 2021 08:00:04 GMT, Albert Mingkun Yang wrote: > Some refactoring in `GCLocker` and more comments in `jni_lock` on how the synchronization works there. Looks good. Copyright for gcLocker.hpp needs to be updated. ------------- Marked as reviewed by kbarrett (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2602 From ayang at openjdk.java.net Sat Feb 20 10:27:04 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Sat, 20 Feb 2021 10:27:04 GMT Subject: RFR: 8228748: Remove GCLocker::_doing_gc [v2] In-Reply-To: References: Message-ID: > Some refactoring in `GCLocker` and more comments in `jni_lock` on how the synchronization works there. Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: review ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2602/files - new: https://git.openjdk.java.net/jdk/pull/2602/files/17ff21e7..41081d99 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2602&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2602&range=00-01 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2602.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2602/head:pull/2602 PR: https://git.openjdk.java.net/jdk/pull/2602 From iwalulya at openjdk.java.net Sat Feb 20 13:36:40 2021 From: iwalulya at openjdk.java.net (Ivan Walulya) Date: Sat, 20 Feb 2021 13:36:40 GMT Subject: RFR: 8228748: Remove GCLocker::_doing_gc [v2] In-Reply-To: References: Message-ID: On Sat, 20 Feb 2021 10:27:04 GMT, Albert Mingkun Yang wrote: >> Some refactoring in `GCLocker` and more comments in `jni_lock` on how the synchronization works there. > > Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: > > review Lgtm! ------------- Marked as reviewed by iwalulya (Committer). PR: https://git.openjdk.java.net/jdk/pull/2602 From kbarrett at openjdk.java.net Sat Feb 20 14:04:39 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Sat, 20 Feb 2021 14:04:39 GMT Subject: RFR: 8262049: [TESTBUG] Shenandoah: Adjustments in TestReferenceRefersTo.java for IU mode In-Reply-To: References: Message-ID: <_U1Cx7873x8vv9p4mF8vpeEh6lxR7synbooaKqFoR1E=.f893fa98-bb06-4a69-8cb2-1d751f5ef650@github.com> On Fri, 19 Feb 2021 19:48:51 GMT, Roman Kennke wrote: > Shenandoah's IU mode allows referents to be cleared even when accessed during concurrent marking. The test TestReferenceRefersTo.java needs to be adjusted to allow for that. 
> > Test: > - [x] TestReferenceRefersTo.java + Shenandoah/IU > - [x] TestReferenceRefersTo.java + Shenandoah/SATB > - [x] TestReferenceRefersTo.java + G1 Changes requested by kbarrett (Reviewer). test/hotspot/jtreg/gc/TestReferenceRefersTo.java line 166: > 164: > 165: private static boolean isShenandoahIUMode() { > 166: return WB.getBooleanVMFlag("UseShenandoahGC") && "iu".equals(WB.getStringVMFlag("ShenandoahGCMode")); This should be using sun.hotspot.gc.GC.Shenandoah.isSelected() test/hotspot/jtreg/gc/TestReferenceRefersTo.java line 211: > 209: } else { > 210: expectNotCleared(testWeak4, "testWeak4"); > 211: } I think I would prefer to keep this test program "generic", rather than having this Shenandoah IU mode intrusion. So remove the old check of testWeak4 state here, and remove the check of obj4 below. Instead, change the later check of testWeak4 being notified, where the new test is that either testWeak4 and obj4 are both null (IU and the like) or both are non-null (SATB and others). Then add a couple of tests programs for the specific clearing or not clearing expected behaviors, with appropriate `@requires` restrictions. ------------- PR: https://git.openjdk.java.net/jdk/pull/2653 From kbarrett at openjdk.java.net Sat Feb 20 14:08:50 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Sat, 20 Feb 2021 14:08:50 GMT Subject: RFR: 8262049: [TESTBUG] Shenandoah: Adjustments in TestReferenceRefersTo.java for IU mode In-Reply-To: <_U1Cx7873x8vv9p4mF8vpeEh6lxR7synbooaKqFoR1E=.f893fa98-bb06-4a69-8cb2-1d751f5ef650@github.com> References: <_U1Cx7873x8vv9p4mF8vpeEh6lxR7synbooaKqFoR1E=.f893fa98-bb06-4a69-8cb2-1d751f5ef650@github.com> Message-ID: <-Tb0x43_7gBFH4oPVCoeC9ogfvU_TSBilaQ6jDdYPrE=.21639387-3f88-4ac6-8f5c-35564fc8f6fe@github.com> On Sat, 20 Feb 2021 14:01:51 GMT, Kim Barrett wrote: >> Shenandoah's IU mode allows referents to be cleared even when accessed during concurrent marking. The test TestReferenceRefersTo.java needs to be adjusted to allow for that. >> >> Test: >> - [x] TestReferenceRefersTo.java + Shenandoah/IU >> - [x] TestReferenceRefersTo.java + Shenandoah/SATB >> - [x] TestReferenceRefersTo.java + G1 > > Changes requested by kbarrett (Reviewer). Because this is a shared test, I suggest renaming the bug to something like "[TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode", and remove the gc-shenandoah label. ------------- PR: https://git.openjdk.java.net/jdk/pull/2653 From ayang at openjdk.java.net Sat Feb 20 15:37:39 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Sat, 20 Feb 2021 15:37:39 GMT Subject: RFR: 8228748: Remove GCLocker::_doing_gc [v2] In-Reply-To: References: Message-ID: On Sat, 20 Feb 2021 13:33:44 GMT, Ivan Walulya wrote: >> Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: >> >> review > > Lgtm! Thanks for the review. ------------- PR: https://git.openjdk.java.net/jdk/pull/2602 From github.com+31506961+vshining at openjdk.java.net Sat Feb 20 16:47:39 2021 From: github.com+31506961+vshining at openjdk.java.net (Andrey Vershinin) Date: Sat, 20 Feb 2021 16:47:39 GMT Subject: RFR: JDK-8254239: G1ConcurrentMark.hpp unnecessarily disables MSVC++ warning 4522. In-Reply-To: References: <5QLZk5Yxoc3tVFvm0sKBQZ4KGgEG-obeHlxRPOza0yI=.c53526e0-80cc-4bdd-a765-e8590988cf99@github.com> Message-ID: On Fri, 19 Feb 2021 17:19:15 GMT, Ioi Lam wrote: >> This is a simple change removing disabling of MSVC++ warning 4522. Since it only affects build process, no tests were ran. 
> > Pre-submit test was skipped because: > >> Testing is not configured >> In order to run pre-submit tests, the source repository must be properly configured to allow test execution. See https://wiki.openjdk.java.net/display/SKARA/Testing for more information on how to configure this. > > Since this is a build change, please enable pre-submit testing to make sure it doesn't break anything. @iklam Thanks for the notice, the tests have passed now. ------------- PR: https://git.openjdk.java.net/jdk/pull/2646 From iklam at openjdk.java.net Sat Feb 20 19:53:39 2021 From: iklam at openjdk.java.net (Ioi Lam) Date: Sat, 20 Feb 2021 19:53:39 GMT Subject: RFR: JDK-8254239: G1ConcurrentMark.hpp unnecessarily disables MSVC++ warning 4522. In-Reply-To: <5QLZk5Yxoc3tVFvm0sKBQZ4KGgEG-obeHlxRPOza0yI=.c53526e0-80cc-4bdd-a765-e8590988cf99@github.com> References: <5QLZk5Yxoc3tVFvm0sKBQZ4KGgEG-obeHlxRPOza0yI=.c53526e0-80cc-4bdd-a765-e8590988cf99@github.com> Message-ID: On Fri, 19 Feb 2021 15:05:24 GMT, Andrey Vershinin wrote: > This is a simple change removing disabling of MSVC++ warning 4522. Since it only affects build process, no tests were ran. Marked as reviewed by iklam (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2646 From kbarrett at openjdk.java.net Sun Feb 21 03:00:41 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Sun, 21 Feb 2021 03:00:41 GMT Subject: RFR: JDK-8254239: G1ConcurrentMark.hpp unnecessarily disables MSVC++ warning 4522. In-Reply-To: <5QLZk5Yxoc3tVFvm0sKBQZ4KGgEG-obeHlxRPOza0yI=.c53526e0-80cc-4bdd-a765-e8590988cf99@github.com> References: <5QLZk5Yxoc3tVFvm0sKBQZ4KGgEG-obeHlxRPOza0yI=.c53526e0-80cc-4bdd-a765-e8590988cf99@github.com> Message-ID: On Fri, 19 Feb 2021 15:05:24 GMT, Andrey Vershinin wrote: > This is a simple change removing disabling of MSVC++ warning 4522. Since it only affects build process, no tests were ran. Marked as reviewed by kbarrett (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2646 From ayang at openjdk.java.net Sun Feb 21 11:35:56 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Sun, 21 Feb 2021 11:35:56 GMT Subject: RFR: 8262087: Use atomic boolean type in G1FullGCAdjustTask Message-ID: Use atomic boolean type to make the intention clear. ------------- Commit messages: - atomic_bool Changes: https://git.openjdk.java.net/jdk/pull/2664/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2664&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8262087 Stats: 4 lines in 2 files changed: 0 ins; 1 del; 3 mod Patch: https://git.openjdk.java.net/jdk/pull/2664.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2664/head:pull/2664 PR: https://git.openjdk.java.net/jdk/pull/2664 From kbarrett at openjdk.java.net Mon Feb 22 05:55:41 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Mon, 22 Feb 2021 05:55:41 GMT Subject: RFR: 8262087: Use atomic boolean type in G1FullGCAdjustTask In-Reply-To: References: Message-ID: <3pSbtGhZgKwu29czJJnR8LyvGJOxtytSSu0ZqAZCtDE=.e4fc80d6-ea1d-4e93-b789-7cf5d0bfdda5@github.com> On Sun, 21 Feb 2021 11:30:52 GMT, Albert Mingkun Yang wrote: > Use atomic boolean type to make the intention clear. The change looks fine. I would hope though that in the future this flag will be eliminated and this can instead invoke parallel reference processing, rather than forcing it to be done single threaded. Doing anything about that is a task for after Leo's in-progress work on cleaning up reference processing tasking. 
------------- Marked as reviewed by kbarrett (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2664 From stefank at openjdk.java.net Mon Feb 22 08:28:40 2021 From: stefank at openjdk.java.net (Stefan Karlsson) Date: Mon, 22 Feb 2021 08:28:40 GMT Subject: RFR: 8261448: Preserve GC stack watermark across safepoints in StackWalk [v3] In-Reply-To: References: Message-ID: On Mon, 15 Feb 2021 15:20:58 GMT, Roman Kennke wrote: >> I am observing the following assert: >> >> # Internal Error (/home/rkennke/src/openjdk/loom/src/hotspot/share/runtime/stackWatermark.cpp:178), pid=54418, tid=54534 >> # assert(is_frame_safe(f)) failed: Frame must be safe >> >> (see issue for full hs_err) >> >> In StackWalk::fetchNextBatch() we prepare the entire stack to be processed by calling StackWatermarkSet::finish_processing(jt, NULL, StackWatermarkKind::gc), but then subsequently, during frames scan, perform allocations to fill in the frame information (fill_in_frames => LiveFrameStream::fill_frame => fill_live_stackframe) at where we could safepoint for GC, which could reset the stack watermark. >> >> This is only relevant for GCs that use the StackWatermark, e.g. ZGC and Shenandoah at the moment. >> >> Solution is to preserve the stack-watermark across safepoints in StackWalk::fetchNextBatch(). StackWalk::fetchFirstBatch() doesn't look to be affected by this: it is not using the stack-watermark. >> >> Testing: >> - [x] StackWalk tests with Shenandoah/aggressive >> - [x] StackWalk tests with ZGC/aggressive >> - [ ] tier1 (+Shenandoah/ZGC) >> - [ ] tier2 (+Shenandoah/ZGC) > > Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: > > Make KeepStackGCProcessedMark non-reentrant again Looks good. ------------- Marked as reviewed by stefank (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2500 From github.com+31506961+vshining at openjdk.java.net Mon Feb 22 08:34:52 2021 From: github.com+31506961+vshining at openjdk.java.net (Andrey Vershinin) Date: Mon, 22 Feb 2021 08:34:52 GMT Subject: Integrated: JDK-8254239: G1ConcurrentMark.hpp unnecessarily disables MSVC++ warning 4522. In-Reply-To: <5QLZk5Yxoc3tVFvm0sKBQZ4KGgEG-obeHlxRPOza0yI=.c53526e0-80cc-4bdd-a765-e8590988cf99@github.com> References: <5QLZk5Yxoc3tVFvm0sKBQZ4KGgEG-obeHlxRPOza0yI=.c53526e0-80cc-4bdd-a765-e8590988cf99@github.com> Message-ID: On Fri, 19 Feb 2021 15:05:24 GMT, Andrey Vershinin wrote: > This is a simple change removing disabling of MSVC++ warning 4522. Since it only affects build process, no tests were ran. This pull request has now been integrated. Changeset: 26c1db90 Author: Andrey Vershinin Committer: Kim Barrett URL: https://git.openjdk.java.net/jdk/commit/26c1db90 Stats: 6 lines in 1 file changed: 0 ins; 6 del; 0 mod 8254239: G1ConcurrentMark.hpp unnecessarily disables MSVC++ warning 4522. 
Reviewed-by: iklam, kbarrett ------------- PR: https://git.openjdk.java.net/jdk/pull/2646 From pliden at openjdk.java.net Mon Feb 22 08:47:39 2021 From: pliden at openjdk.java.net (Per Liden) Date: Mon, 22 Feb 2021 08:47:39 GMT Subject: RFR: 8258431: Provide a JFR event with live set size estimate In-Reply-To: <7HVs4jngEbNIQIPQByuE6IRYAxdijfa82uhEFWHld5U=.a7784482-d7e1-4d59-88ee-455d8691631e@github.com> References: <7HVs4jngEbNIQIPQByuE6IRYAxdijfa82uhEFWHld5U=.a7784482-d7e1-4d59-88ee-455d8691631e@github.com> Message-ID: On Mon, 15 Feb 2021 17:23:44 GMT, Jaroslav Bachorik wrote: > The purpose of this change is to expose a 'cheap' estimate of the current live set size (the meaning of 'current' is dependent on each particular GC implementation but in worst case 'at last full GC') in form of a periodically emitted JFR event. > > ## Introducing new JFR event > > While there is already 'GC Heap Summary' JFR event it does not fit the requirements as it is closely tied to GC cycle so eg. for ZGC or Shenandoah it may not happen for quite a long time, increasing the risk of not having the heap summary events being present in the JFR recording at all. > Because of this I am proposing to add a new 'Heap Usage Summary' event which will be emitted periodically, by default on each JFR chunk, and will contain the information abut the heap capacity, the used and live bytes. This information is available from all GC implementations and can be provided at literally any time. > > ## Implementation > > The implementation differs from GC to GC because each GC algorithm/implementation provides a slightly different way to track the liveness. The common part is `size_t live() const` method added to `CollectedHeap` superclass and the use of a cached 'liveness' value computed after the last GC cycle. If `liveness` hasn't been calculated yet the implementation will default to returning 'used' value. > > The implementations are based on my (rather shallow) knowledge of inner working of the respective GC engines and I am open to suggestions to make them better/correct. > > ### Epsilon GC > > Trivial implementation - just return `used()` instead. > > ### Serial GC > > Here we utilize the fact that mark-copy phase is naturally compacting so the number of bytes after copy is 'live' and that the mark-sweep implementation keeps an internal info about objects being 'dead' but excluded from the compaction effort and we can these numbers to derive the old-gen live set size (used bytes minus the cumulative size of the 'un-dead' objects). > > ### Parallel GC > > For Parallel GC the liveness is calculated as the sum of used bytes in all regions after the last GC cycle. This seems to be a safe bet because this collector is always compacting (AFAIK). > > ### G1 GC > > Using `G1ConcurrentMark::remark()` method the live set size is computed as a sum of `_live_words` from the associated `G1RegionMarkStats` objects. Here I am not 100% sure this approach covers all eventualities and it would be great to have someone skilled in G1 implementation to chime in so I can fix it. However, the numbers I am getting for G1 are comparable to other GCs for the same application. > > ### Shenandoah > > In Shenandoah, the regions are keeping the liveness info. However, the VM op that is used for iterating regions is a safe-pointing one so it would be great to run it in an already safe-pointed context. 
> This leads to hooking into `ShenandoahConcurrentMark::finish_mark()` and `ShenandoahSTWMark::mark()` where at the end of the marking process the liveness info is summarized and set to `ShenandoahHeap::_live` volatile field - which is later read by the event emitting code. > > ### ZGC > > `ZStatHeap` is already holding the liveness info - so this implementation is just making it accessible via `ZCollectedHeap::live()` method. src/hotspot/share/gc/z/zStat.hpp line 549: > 547: static size_t used_at_mark_start(); > 548: static size_t used_at_relocate_end(); > 549: static size_t live(); Please call this `live_at_mark_end()` to match the names of the neighboring functions. ------------- PR: https://git.openjdk.java.net/jdk/pull/2579 From tschatzl at openjdk.java.net Mon Feb 22 08:54:49 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Mon, 22 Feb 2021 08:54:49 GMT Subject: RFR: 8262087: Use atomic boolean type in G1FullGCAdjustTask In-Reply-To: References: Message-ID: On Sun, 21 Feb 2021 11:30:52 GMT, Albert Mingkun Yang wrote: > Use atomic boolean type to make the intention clear. Marked as reviewed by tschatzl (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2664 From rkennke at openjdk.java.net Mon Feb 22 09:35:46 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 22 Feb 2021 09:35:46 GMT Subject: RFR: 8261448: Preserve GC stack watermark across safepoints in StackWalk [v3] In-Reply-To: References: Message-ID: On Mon, 22 Feb 2021 08:26:19 GMT, Stefan Karlsson wrote: > Looks good. Thanks, Stefan! @fisk also good? ------------- PR: https://git.openjdk.java.net/jdk/pull/2500 From pliden at openjdk.java.net Mon Feb 22 09:39:44 2021 From: pliden at openjdk.java.net (Per Liden) Date: Mon, 22 Feb 2021 09:39:44 GMT Subject: RFR: 8228748: Remove GCLocker::_doing_gc [v2] In-Reply-To: References: Message-ID: On Sat, 20 Feb 2021 10:27:04 GMT, Albert Mingkun Yang wrote: >> Some refactoring in `GCLocker` and more comments in `jni_lock` on how the synchronization works there. > > Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: > > review Looks good. ------------- Marked as reviewed by pliden (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2602 From eosterlund at openjdk.java.net Mon Feb 22 09:42:40 2021 From: eosterlund at openjdk.java.net (Erik =?UTF-8?B?w5ZzdGVybHVuZA==?=) Date: Mon, 22 Feb 2021 09:42:40 GMT Subject: RFR: 8261448: Preserve GC stack watermark across safepoints in StackWalk [v3] In-Reply-To: References: Message-ID: On Mon, 15 Feb 2021 15:20:58 GMT, Roman Kennke wrote: >> I am observing the following assert: >> >> # Internal Error (/home/rkennke/src/openjdk/loom/src/hotspot/share/runtime/stackWatermark.cpp:178), pid=54418, tid=54534 >> # assert(is_frame_safe(f)) failed: Frame must be safe >> >> (see issue for full hs_err) >> >> In StackWalk::fetchNextBatch() we prepare the entire stack to be processed by calling StackWatermarkSet::finish_processing(jt, NULL, StackWatermarkKind::gc), but then subsequently, during frames scan, perform allocations to fill in the frame information (fill_in_frames => LiveFrameStream::fill_frame => fill_live_stackframe) at where we could safepoint for GC, which could reset the stack watermark. >> >> This is only relevant for GCs that use the StackWatermark, e.g. ZGC and Shenandoah at the moment. >> >> Solution is to preserve the stack-watermark across safepoints in StackWalk::fetchNextBatch(). 
StackWalk::fetchFirstBatch() doesn't look to be affected by this: it is not using the stack-watermark. >> >> Testing: >> - [x] StackWalk tests with Shenandoah/aggressive >> - [x] StackWalk tests with ZGC/aggressive >> - [ ] tier1 (+Shenandoah/ZGC) >> - [ ] tier2 (+Shenandoah/ZGC) > > Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: > > Make KeepStackGCProcessedMark non-reentrant again Also good! ------------- Marked as reviewed by eosterlund (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2500 From tschatzl at openjdk.java.net Mon Feb 22 10:04:40 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Mon, 22 Feb 2021 10:04:40 GMT Subject: RFR: 8242032: G1 region remembered sets may contain non-coarse level PRTs for already coarsened regions [v2] In-Reply-To: References: <4IlVuwhMhFGZty7e8RXnSx_E57BJ9tVAhgCpL6pc7ts=.c7457983-075d-4c7b-a6d5-81e48ce54f2a@github.com> Message-ID: On Fri, 19 Feb 2021 11:26:17 GMT, Joakim Nordstr?m wrote: >> This fix adds a check for coarsened region in mutex guarded section, when adding a reference to a remembered set. >> >> Haven't been able to produce a testcase -- please advice on how to, or if not necessary. >> >> **Testing:** >> * hs-tier, hs-tier2 > > Joakim Nordstr?m has updated the pull request incrementally with two additional commits since the last revision: > > - Clarified comment on re-checking coarsening > - Minor typo Marked as reviewed by tschatzl (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2545 From github.com+779991+jaokim at openjdk.java.net Mon Feb 22 10:08:40 2021 From: github.com+779991+jaokim at openjdk.java.net (Joakim =?UTF-8?B?Tm9yZHN0csO2bQ==?=) Date: Mon, 22 Feb 2021 10:08:40 GMT Subject: RFR: 8242032: G1 region remembered sets may contain non-coarse level PRTs for already coarsened regions [v2] In-Reply-To: References: <4IlVuwhMhFGZty7e8RXnSx_E57BJ9tVAhgCpL6pc7ts=.c7457983-075d-4c7b-a6d5-81e48ce54f2a@github.com> Message-ID: On Fri, 19 Feb 2021 10:38:48 GMT, Albert Mingkun Yang wrote: >> Joakim Nordstr?m has updated the pull request incrementally with two additional commits since the last revision: >> >> - Clarified comment on re-checking coarsening >> - Minor typo > > Marked as reviewed by ayang (Author). Thank you @albertnetymk and @tschatzl for review! ------------- PR: https://git.openjdk.java.net/jdk/pull/2545 From tschatzl at openjdk.java.net Mon Feb 22 10:13:39 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Mon, 22 Feb 2021 10:13:39 GMT Subject: RFR: 8228748: Remove GCLocker::_doing_gc [v2] In-Reply-To: References: Message-ID: On Sat, 20 Feb 2021 10:27:04 GMT, Albert Mingkun Yang wrote: >> Some refactoring in `GCLocker` and more comments in `jni_lock` on how the synchronization works there. > > Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: > > review Marked as reviewed by tschatzl (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2602 From ayang at openjdk.java.net Mon Feb 22 10:13:41 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Mon, 22 Feb 2021 10:13:41 GMT Subject: Integrated: 8228748: Remove GCLocker::_doing_gc In-Reply-To: References: Message-ID: On Wed, 17 Feb 2021 08:00:04 GMT, Albert Mingkun Yang wrote: > Some refactoring in `GCLocker` and more comments in `jni_lock` on how the synchronization works there. This pull request has now been integrated. 
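The 8228748 change noted above is largely about documenting how the jni_lock / needs-GC synchronization works. The following is a deliberately simplified, hypothetical model (this is not the HotSpot GCLocker and it glosses over the real locking protocol): JNI critical regions hold off GC, and a GC requested while regions are active is deferred until the last thread leaves.

```
#include <condition_variable>
#include <cstdio>
#include <mutex>

// Toy model of the GCLocker idea: critical regions block GC; a GC request
// made while regions are active sets a needs-GC flag and is triggered by the
// last thread leaving its critical region.
class ToyGCLocker {
  std::mutex _lock;                 // plays the role of the JNI critical lock
  std::condition_variable _cv;
  int _critical = 0;                // threads currently inside a critical region
  bool _needs_gc = false;           // a GC was requested while regions were active
public:
  void enter_critical() {
    std::unique_lock<std::mutex> l(_lock);
    _cv.wait(l, [this] { return !_needs_gc; });  // hold new entries while a GC is pending
    ++_critical;
  }
  void exit_critical() {
    std::unique_lock<std::mutex> l(_lock);
    if (--_critical == 0 && _needs_gc) {
      _needs_gc = false;
      l.unlock();
      std::puts("last thread left its critical region: run the deferred GC here");
      _cv.notify_all();
    }
  }
  // Called by a thread that wants a GC; returns true if the GC must be deferred.
  bool defer_gc_if_locked() {
    std::lock_guard<std::mutex> g(_lock);
    if (_critical > 0) { _needs_gc = true; return true; }
    return false;
  }
};

int main() {
  ToyGCLocker gcl;
  gcl.enter_critical();
  std::printf("GC deferred: %d\n", gcl.defer_gc_if_locked());
  gcl.exit_critical();   // last exit triggers the deferred GC
  return 0;
}
```

The real synchronization is considerably more involved, since it also has to coordinate with safepoints; the comments added by the change itself are the authoritative description.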
Changeset: 6b7575bb Author: Albert Mingkun Yang Committer: Thomas Schatzl URL: https://git.openjdk.java.net/jdk/commit/6b7575bb Stats: 16 lines in 2 files changed: 4 ins; 5 del; 7 mod 8228748: Remove GCLocker::_doing_gc Reviewed-by: kbarrett, iwalulya, pliden, tschatzl ------------- PR: https://git.openjdk.java.net/jdk/pull/2602 From rkennke at openjdk.java.net Mon Feb 22 10:13:48 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 22 Feb 2021 10:13:48 GMT Subject: Integrated: 8261448: Preserve GC stack watermark across safepoints in StackWalk In-Reply-To: References: Message-ID: On Wed, 10 Feb 2021 10:07:20 GMT, Roman Kennke wrote: > I am observing the following assert: > > # Internal Error (/home/rkennke/src/openjdk/loom/src/hotspot/share/runtime/stackWatermark.cpp:178), pid=54418, tid=54534 > # assert(is_frame_safe(f)) failed: Frame must be safe > > (see issue for full hs_err) > > In StackWalk::fetchNextBatch() we prepare the entire stack to be processed by calling StackWatermarkSet::finish_processing(jt, NULL, StackWatermarkKind::gc), but then subsequently, during frames scan, perform allocations to fill in the frame information (fill_in_frames => LiveFrameStream::fill_frame => fill_live_stackframe) at where we could safepoint for GC, which could reset the stack watermark. > > This is only relevant for GCs that use the StackWatermark, e.g. ZGC and Shenandoah at the moment. > > Solution is to preserve the stack-watermark across safepoints in StackWalk::fetchNextBatch(). StackWalk::fetchFirstBatch() doesn't look to be affected by this: it is not using the stack-watermark. > > Testing: > - [x] StackWalk tests with Shenandoah/aggressive > - [x] StackWalk tests with ZGC/aggressive > - [x] tier1 (+Shenandoah/ZGC) > - [x] tier2 (+Shenandoah/ZGC) This pull request has now been integrated. Changeset: c20fb5db Author: Roman Kennke URL: https://git.openjdk.java.net/jdk/commit/c20fb5db Stats: 3 lines in 1 file changed: 2 ins; 0 del; 1 mod 8261448: Preserve GC stack watermark across safepoints in StackWalk Reviewed-by: eosterlund, stefank ------------- PR: https://git.openjdk.java.net/jdk/pull/2500 From ayang at openjdk.java.net Mon Feb 22 10:40:39 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Mon, 22 Feb 2021 10:40:39 GMT Subject: RFR: 8262087: Use atomic boolean type in G1FullGCAdjustTask In-Reply-To: References: Message-ID: On Mon, 22 Feb 2021 08:51:32 GMT, Thomas Schatzl wrote: >> Use atomic boolean type to make the intention clear. > > Marked as reviewed by tschatzl (Reviewer). Thanks for the review. ------------- PR: https://git.openjdk.java.net/jdk/pull/2664 From lkorinth at openjdk.java.net Mon Feb 22 11:31:43 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Mon, 22 Feb 2021 11:31:43 GMT Subject: RFR: 8261799: Remove unnecessary cast in psParallelCompact.hpp In-Reply-To: References: <_FiQTtJC_l4VWbY5gIUZPcpPqCkaDfz4QHH3nylSPFM=.f636b4c3-c715-486e-a9a1-b14c7ef2fbd7@github.com> Message-ID: <5UFK8bDnB9CqgPe6TLi0tQDfCpxutaL51ZjsEVXBodQ=.310e9827-f4c1-4f3c-be02-04517bdac615@github.com> On Thu, 18 Feb 2021 21:25:10 GMT, Stefan Karlsson wrote: >> Unnecessary casts confuses me. > > Marked as reviewed by stefank (Reviewer). Thanks Stefan and Albert! 
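The 8261448 fix recorded above amounts to keeping the stack "GC processed" for the whole duration of the batch rather than only finishing processing once at its start. Below is a hypothetical, self-contained sketch of that RAII idea; the names are illustrative, while the actual change uses the KeepStackGCProcessedMark utility mentioned in the review thread.

```
#include <cstdio>

// Stand-in for the real VM thread type; the flag models "this thread's stack
// is kept in the GC-processed state".
struct JavaThreadStub {
  bool stack_kept_processed = false;
};

// RAII guard modeling the idea: while the guard is alive, safepoints that
// occur during the stack walk cannot reset the stack watermark underneath
// the walker.
class KeepStackProcessedScope {
  JavaThreadStub* _thread;
public:
  explicit KeepStackProcessedScope(JavaThreadStub* t) : _thread(t) {
    _thread->stack_kept_processed = true;   // the real VM achieves this through its stack watermark machinery
  }
  ~KeepStackProcessedScope() {
    _thread->stack_kept_processed = false;  // allow the watermark to move again
  }
};

static void fetch_next_batch(JavaThreadStub* t) {
  KeepStackProcessedScope keep(t);  // spans the whole batch, including allocations that may safepoint
  std::printf("walking frames, stack kept processed = %d\n", t->stack_kept_processed);
}

int main() {
  JavaThreadStub t;
  fetch_next_batch(&t);
  return 0;
}
```

The important property is simply that the guard's scope covers every allocation in the frame-filling loop that could safepoint, so a concurrent GC cannot move the watermark back underneath the walker.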
------------- PR: https://git.openjdk.java.net/jdk/pull/2628 From lkorinth at openjdk.java.net Mon Feb 22 11:34:40 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Mon, 22 Feb 2021 11:34:40 GMT Subject: Integrated: 8261799: Remove unnecessary cast in psParallelCompact.hpp In-Reply-To: <_FiQTtJC_l4VWbY5gIUZPcpPqCkaDfz4QHH3nylSPFM=.f636b4c3-c715-486e-a9a1-b14c7ef2fbd7@github.com> References: <_FiQTtJC_l4VWbY5gIUZPcpPqCkaDfz4QHH3nylSPFM=.f636b4c3-c715-486e-a9a1-b14c7ef2fbd7@github.com> Message-ID: On Thu, 18 Feb 2021 15:26:38 GMT, Leo Korinth wrote: > Unnecessary casts confuses me. This pull request has now been integrated. Changeset: 011f5a54 Author: Leo Korinth URL: https://git.openjdk.java.net/jdk/commit/011f5a54 Stats: 2 lines in 1 file changed: 0 ins; 0 del; 2 mod 8261799: Remove unnecessary cast in psParallelCompact.hpp Reviewed-by: ayang, stefank ------------- PR: https://git.openjdk.java.net/jdk/pull/2628 From lkorinth at openjdk.java.net Mon Feb 22 11:36:46 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Mon, 22 Feb 2021 11:36:46 GMT Subject: RFR: 8261803: Remove unused TaskTerminator in g1 full gc ref proc executor In-Reply-To: References: Message-ID: On Fri, 19 Feb 2021 08:37:13 GMT, Stefan Johansson wrote: >> 8261803: Remove unused TaskTerminator in g1 full gc ref proc executor > > Thanks for cleaning this up. Thanks Albert, Thomas and Stefan! ------------- PR: https://git.openjdk.java.net/jdk/pull/2629 From lkorinth at openjdk.java.net Mon Feb 22 11:36:47 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Mon, 22 Feb 2021 11:36:47 GMT Subject: Integrated: 8261803: Remove unused TaskTerminator in g1 full gc ref proc executor In-Reply-To: References: Message-ID: On Thu, 18 Feb 2021 15:37:30 GMT, Leo Korinth wrote: > 8261803: Remove unused TaskTerminator in g1 full gc ref proc executor This pull request has now been integrated. Changeset: 419717dd Author: Leo Korinth URL: https://git.openjdk.java.net/jdk/commit/419717dd Stats: 5 lines in 2 files changed: 0 ins; 2 del; 3 mod 8261803: Remove unused TaskTerminator in g1 full gc ref proc executor Reviewed-by: ayang, tschatzl, sjohanss ------------- PR: https://git.openjdk.java.net/jdk/pull/2629 From rkennke at openjdk.java.net Mon Feb 22 11:39:17 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 22 Feb 2021 11:39:17 GMT Subject: RFR: 8262049: [TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode [v2] In-Reply-To: References: Message-ID: > Shenandoah's IU mode allows referents to be cleared even when accessed during concurrent marking. The test TestReferenceRefersTo.java needs to be adjusted to allow for that. 
> > Test: > - [x] TestReferenceRefersTo.java + Shenandoah/IU > - [x] TestReferenceRefersTo.java + Shenandoah/SATB > - [x] TestReferenceRefersTo.java + G1 Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: Split TestReferenceRefersTo test in generic and non-Shenandoah parts ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2653/files - new: https://git.openjdk.java.net/jdk/pull/2653/files/b1fffb02..9746bc85 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2653&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2653&range=00-01 Stats: 230 lines in 2 files changed: 206 ins; 20 del; 4 mod Patch: https://git.openjdk.java.net/jdk/pull/2653.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2653/head:pull/2653 PR: https://git.openjdk.java.net/jdk/pull/2653 From rkennke at openjdk.java.net Mon Feb 22 11:39:18 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 22 Feb 2021 11:39:18 GMT Subject: RFR: 8262049: [TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode [v2] In-Reply-To: <_U1Cx7873x8vv9p4mF8vpeEh6lxR7synbooaKqFoR1E=.f893fa98-bb06-4a69-8cb2-1d751f5ef650@github.com> References: <_U1Cx7873x8vv9p4mF8vpeEh6lxR7synbooaKqFoR1E=.f893fa98-bb06-4a69-8cb2-1d751f5ef650@github.com> Message-ID: On Sat, 20 Feb 2021 13:55:33 GMT, Kim Barrett wrote: > This should be using sun.hotspot.gc.GC.Shenandoah.isSelected() Yes, but which int constant should be used there? Doesn't matter much, I'm not using this at all anymore, following your other suggestions. > test/hotspot/jtreg/gc/TestReferenceRefersTo.java line 211: > >> 209: } else { >> 210: expectNotCleared(testWeak4, "testWeak4"); >> 211: } > > I think I would prefer to keep this test program "generic", rather than having this Shenandoah IU mode intrusion. So remove the old check of testWeak4 state here, and remove the check of obj4 below. Instead, change the later check of testWeak4 being notified, where the new test is that either testWeak4 and obj4 are both null (IU and the like) or both are non-null (SATB and others). Then add a couple of tests programs for the specific clearing or not clearing expected behaviors, with appropriate `@requires` restrictions. Right, that is even better. I made the base test generic, extracted the offending parts into its own test, and will push the Shenandoah specific test under a different PR. Thank you! ------------- PR: https://git.openjdk.java.net/jdk/pull/2653 From rkennke at openjdk.java.net Mon Feb 22 11:57:56 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 22 Feb 2021 11:57:56 GMT Subject: RFR: 8262122: [TESTBUG] Shenandoah-specific variant of TestReferenceRefersTo Message-ID: Before JDK-8262049, the test TestReferenceRefersTo.java has been failing with I-U mode, because it asserted that weak references would not be cleared when accessed during mark. JDK-8262049 split up the test into a generic part that removed the offending test, and a non-Shenandoah part that contains the test. I think it would be useful to add the full test with Shenandoah runners under gc/shenandoah to include it in hotspot_gc_shenandoah runs. 
Test: - [x] TestReferenceRefersToShenandoah.java - [ ] hotspot_gc_shenandoah ------------- Commit messages: - 8262122: [TESTBUG] Shenandoah-specific variant of TestReferenceRefersTo Changes: https://git.openjdk.java.net/jdk/pull/2674/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2674&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8262122 Stats: 325 lines in 1 file changed: 325 ins; 0 del; 0 mod Patch: https://git.openjdk.java.net/jdk/pull/2674.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2674/head:pull/2674 PR: https://git.openjdk.java.net/jdk/pull/2674 From rkennke at openjdk.java.net Mon Feb 22 15:12:03 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 22 Feb 2021 15:12:03 GMT Subject: RFR: 8262049: [TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode [v3] In-Reply-To: References: Message-ID: > Shenandoah's IU mode allows referents to be cleared even when accessed during concurrent marking. The test TestReferenceRefersTo.java needs to be adjusted to allow for that. > > Test: > - [x] TestReferenceRefersTo.java + Shenandoah/IU > - [x] TestReferenceRefersTo.java + Shenandoah/SATB > - [x] TestReferenceRefersTo.java + G1 Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: Fix compilation failures after renames ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2653/files - new: https://git.openjdk.java.net/jdk/pull/2653/files/9746bc85..54a027c9 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2653&range=02 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2653&range=01-02 Stats: 8 lines in 1 file changed: 0 ins; 0 del; 8 mod Patch: https://git.openjdk.java.net/jdk/pull/2653.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2653/head:pull/2653 PR: https://git.openjdk.java.net/jdk/pull/2653 From kbarrett at openjdk.java.net Mon Feb 22 15:34:42 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Mon, 22 Feb 2021 15:34:42 GMT Subject: RFR: 8262049: [TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode [v3] In-Reply-To: References: Message-ID: On Mon, 22 Feb 2021 15:12:03 GMT, Roman Kennke wrote: >> Shenandoah's IU mode allows referents to be cleared even when accessed during concurrent marking. The test TestReferenceRefersTo.java needs to be adjusted to allow for that. >> >> Test: >> - [x] TestReferenceRefersTo.java + Shenandoah/IU >> - [x] TestReferenceRefersTo.java + Shenandoah/SATB >> - [x] TestReferenceRefersTo.java + G1 > > Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: > > Fix compilation failures after renames Changes requested by kbarrett (Reviewer). test/hotspot/jtreg/gc/TestReferenceRefersTo.java line 241: > 239: } > 240: if ((testWeak4 == null) != (obj4 == null)) { > 241: fail("either referent is cleared and we got notified, or neither of this happened"); It might be helpful if the failure reported which one was non-null. Not that it should ever fail... test/hotspot/jtreg/gc/TestReferenceRefersToDuringConcMark.java line 96: > 94: } > 95: > 96: private static void expectCleared(Reference ref, Unused. test/hotspot/jtreg/gc/TestReferenceRefersToDuringConcMark.java line 27: > 25: > 26: /* @test > 27: * @requires vm.gc != "Epsilon" I think this test "works" for Epsilon just as well as it does for Serial or Parallel or any other GC that doesn't support concurrent breakpoints, and either all should be excluded or none. 
test/hotspot/jtreg/gc/TestReferenceRefersToDuringConcMark.java line 28: > 26: /* @test > 27: * @requires vm.gc != "Epsilon" > 28: * @requires vm.gc != "Shenandoah" I think this test works for Shenandoah so long as it's not in IU mode. Is that possible to exclude with another `@requires` constraint? test/hotspot/jtreg/gc/TestReferenceRefersToDuringConcMark.java line 39: > 37: */ > 38: > 39: import java.lang.ref.PhantomReference; PhantomReference is unused. test/hotspot/jtreg/gc/TestReferenceRefersToDuringConcMark.java line 145: > 143: > 144: progress("acquire control of concurrent cycles"); > 145: WB.concurrentGCAcquireControl(); I think this test could be made a lot smaller and more obvious if it was explicitly just testing the keep-alive behavior of Reference.get for most concurrent collectors, rather than being a trimmed down copy of the earlier test. ------------- PR: https://git.openjdk.java.net/jdk/pull/2653 From github.com+779991+jaokim at openjdk.java.net Mon Feb 22 16:19:41 2021 From: github.com+779991+jaokim at openjdk.java.net (Joakim =?UTF-8?B?Tm9yZHN0csO2bQ==?=) Date: Mon, 22 Feb 2021 16:19:41 GMT Subject: Integrated: 8242032: G1 region remembered sets may contain non-coarse level PRTs for already coarsened regions In-Reply-To: <4IlVuwhMhFGZty7e8RXnSx_E57BJ9tVAhgCpL6pc7ts=.c7457983-075d-4c7b-a6d5-81e48ce54f2a@github.com> References: <4IlVuwhMhFGZty7e8RXnSx_E57BJ9tVAhgCpL6pc7ts=.c7457983-075d-4c7b-a6d5-81e48ce54f2a@github.com> Message-ID: On Fri, 12 Feb 2021 13:01:52 GMT, Joakim Nordstr?m wrote: > This fix adds a check for coarsened region in mutex guarded section, when adding a reference to a remembered set. > > Haven't been able to produce a testcase -- please advice on how to, or if not necessary. > > **Testing:** > * hs-tier, hs-tier2 This pull request has now been integrated. Changeset: a6a7e439 Author: Joakim Nordstr?m Committer: Thomas Schatzl URL: https://git.openjdk.java.net/jdk/commit/a6a7e439 Stats: 9 lines in 1 file changed: 9 ins; 0 del; 0 mod 8242032: G1 region remembered sets may contain non-coarse level PRTs for already coarsened regions Reviewed-by: ayang, tschatzl ------------- PR: https://git.openjdk.java.net/jdk/pull/2545 From rkennke at openjdk.java.net Mon Feb 22 16:26:00 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 22 Feb 2021 16:26:00 GMT Subject: RFR: 8262049: [TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode [v4] In-Reply-To: References: Message-ID: > Shenandoah's IU mode allows referents to be cleared even when accessed during concurrent marking. The test TestReferenceRefersTo.java needs to be adjusted to allow for that. 
> > Test: > - [x] TestReferenceRefersTo.java + Shenandoah/IU > - [x] TestReferenceRefersTo.java + Shenandoah/SATB > - [x] TestReferenceRefersTo.java + G1 Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: Some more trimming ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2653/files - new: https://git.openjdk.java.net/jdk/pull/2653/files/54a027c9..51f2d695 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2653&range=03 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2653&range=02-03 Stats: 90 lines in 2 files changed: 1 ins; 79 del; 10 mod Patch: https://git.openjdk.java.net/jdk/pull/2653.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2653/head:pull/2653 PR: https://git.openjdk.java.net/jdk/pull/2653 From rkennke at openjdk.java.net Mon Feb 22 16:26:03 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 22 Feb 2021 16:26:03 GMT Subject: RFR: 8262049: [TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode [v3] In-Reply-To: References: Message-ID: On Mon, 22 Feb 2021 15:25:46 GMT, Kim Barrett wrote: >> Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: >> >> Fix compilation failures after renames > > test/hotspot/jtreg/gc/TestReferenceRefersToDuringConcMark.java line 27: > >> 25: >> 26: /* @test >> 27: * @requires vm.gc != "Epsilon" > > I think this test "works" for Epsilon just as well as it does for Serial or Parallel or any other GC that doesn't support concurrent breakpoints, and either all should be excluded or none. Right. I excluded none. > test/hotspot/jtreg/gc/TestReferenceRefersToDuringConcMark.java line 28: > >> 26: /* @test >> 27: * @requires vm.gc != "Epsilon" >> 28: * @requires vm.gc != "Shenandoah" > > I think this test works for Shenandoah so long as it's not in IU mode. Is that possible to exclude with another `@requires` constraint? How would I do that? IU mode can only be distinguished by VM flag. > test/hotspot/jtreg/gc/TestReferenceRefersToDuringConcMark.java line 145: > >> 143: >> 144: progress("acquire control of concurrent cycles"); >> 145: WB.concurrentGCAcquireControl(); > > I think this test could be made a lot smaller and more obvious if it was explicitly just testing the keep-alive behavior of Reference.get for most concurrent collectors, rather than being a trimmed down copy of the earlier test. Right. I trimmed it some more. I think it's cleaner now. ------------- PR: https://git.openjdk.java.net/jdk/pull/2653 From tschatzl at openjdk.java.net Mon Feb 22 17:23:43 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Mon, 22 Feb 2021 17:23:43 GMT Subject: RFR: 8258431: Provide a JFR event with live set size estimate In-Reply-To: <7HVs4jngEbNIQIPQByuE6IRYAxdijfa82uhEFWHld5U=.a7784482-d7e1-4d59-88ee-455d8691631e@github.com> References: <7HVs4jngEbNIQIPQByuE6IRYAxdijfa82uhEFWHld5U=.a7784482-d7e1-4d59-88ee-455d8691631e@github.com> Message-ID: On Mon, 15 Feb 2021 17:23:44 GMT, Jaroslav Bachorik wrote: > The purpose of this change is to expose a 'cheap' estimate of the current live set size (the meaning of 'current' is dependent on each particular GC implementation but in worst case 'at last full GC') in form of a periodically emitted JFR event. > > ## Introducing new JFR event > > While there is already 'GC Heap Summary' JFR event it does not fit the requirements as it is closely tied to GC cycle so eg. 
for ZGC or Shenandoah it may not happen for quite a long time, increasing the risk of not having the heap summary events being present in the JFR recording at all. > Because of this I am proposing to add a new 'Heap Usage Summary' event which will be emitted periodically, by default on each JFR chunk, and will contain the information abut the heap capacity, the used and live bytes. This information is available from all GC implementations and can be provided at literally any time. > > ## Implementation > > The implementation differs from GC to GC because each GC algorithm/implementation provides a slightly different way to track the liveness. The common part is `size_t live() const` method added to `CollectedHeap` superclass and the use of a cached 'liveness' value computed after the last GC cycle. If `liveness` hasn't been calculated yet the implementation will default to returning 'used' value. > > The implementations are based on my (rather shallow) knowledge of inner working of the respective GC engines and I am open to suggestions to make them better/correct. > > ### Epsilon GC > > Trivial implementation - just return `used()` instead. > > ### Serial GC > > Here we utilize the fact that mark-copy phase is naturally compacting so the number of bytes after copy is 'live' and that the mark-sweep implementation keeps an internal info about objects being 'dead' but excluded from the compaction effort and we can these numbers to derive the old-gen live set size (used bytes minus the cumulative size of the 'un-dead' objects). > > ### Parallel GC > > For Parallel GC the liveness is calculated as the sum of used bytes in all regions after the last GC cycle. This seems to be a safe bet because this collector is always compacting (AFAIK). > > ### G1 GC > > Using `G1ConcurrentMark::remark()` method the live set size is computed as a sum of `_live_words` from the associated `G1RegionMarkStats` objects. Here I am not 100% sure this approach covers all eventualities and it would be great to have someone skilled in G1 implementation to chime in so I can fix it. However, the numbers I am getting for G1 are comparable to other GCs for the same application. > > ### Shenandoah > > In Shenandoah, the regions are keeping the liveness info. However, the VM op that is used for iterating regions is a safe-pointing one so it would be great to run it in an already safe-pointed context. > This leads to hooking into `ShenandoahConcurrentMark::finish_mark()` and `ShenandoahSTWMark::mark()` where at the end of the marking process the liveness info is summarized and set to `ShenandoahHeap::_live` volatile field - which is later read by the event emitting code. > > ### ZGC > > `ZStatHeap` is already holding the liveness info - so this implementation is just making it accessible via `ZCollectedHeap::live()` method. The change also misses liveness update after G1 Full GC: it should at least reset the internal liveness counter to 0 so that `used()` is used. I think there is the same issue for Parallel Full GC. Serial seems to be handled. src/hotspot/share/gc/shared/collectedHeap.hpp line 217: > 215: virtual size_t capacity() const = 0; > 216: virtual size_t used() const = 0; > 217: // a best-effort estimate of the live set size I would prefer @shipilev's comment. Also I would like to suggest to call this method `live_estimate()` to set the expectations right. src/hotspot/share/gc/g1/g1ConcurrentMark.cpp line 1114: > 1112: > 1113: _g1h->set_live(live_size * HeapWordSize); > 1114: This code is located in the wrong place. 
It will return only the live words for the areas that have been marked, not eden or objects allocated in old gen after the marking started. Further it iterates over all regions, which can be large compared to actually active regions. A better place is in `G1UpdateRemSetTrackingBeforeRebuild::do_heap_region()` after the last method call - at that point, `HeapRegion::live_bytes()` contains the per-region number of live data for all regions. `G1UpdateRemSetTrackingBeforeRebuild` is instantiated and then called by multiple threads. It's probably best that that `HeapClosure` locally sums up the live byte estimates and then in the caller `G1UpdateRemSetTrackingBeforeRebuildTask::work()` sums up the per thread results like is done for `G1UpdateRemSetTrackingBeforeRebuildTask::_total_selected_for_rebuild`, which is then set in the caller of the `G1UpdateRemSetTrackingBeforeRebuildTask`. src/hotspot/share/gc/g1/g1CollectedHeap.cpp line 1850: > 1848: size_t G1CollectedHeap::live() const { > 1849: size_t size = Atomic::load(&_live_size); > 1850: return size > 0 ? size : used(); note that `used()` is susceptible to fluttering due to memory ordering problems: since its result consists of multiple reads, you can get readings from very different situations. It is recommended to use `used_unlocked()` instead, which does not take allocation regions and archive regions into account, but at least it is not susceptible to jumping around when re-reading it in quick succession. src/hotspot/share/gc/parallel/parallelScavengeHeap.inline.hpp line 49: > 47: _young_live = young_gen()->used_in_bytes(); > 48: _eden_live = young_gen()->eden_space()->used_in_bytes(); > 49: _old_live = old_gen()->used_in_bytes(); `_young_live` already seems to contain `_eden_live` looking at the implementation of `PSYoungGen::used_in_bytes()`: I.e. `size_t PSYoungGen::used_in_bytes() const { return eden_space()->used_in_bytes() + from_space()->used_in_bytes(); // to_space() is only used during scavenge } ` but maybe I'm wrong here. src/hotspot/share/gc/shared/genCollectedHeap.cpp line 683: > 681: } > 682: // update the live size after last GC > 683: _live_size = _young_gen->live() + _old_gen->live(); I would prefer if that code were placed into `gc_epilogue`. src/hotspot/share/gc/shared/space.inline.hpp line 189: > 187: oop obj = oop(cur_obj); > 188: size_t obj_size = obj->size(); > 189: live_offset += obj_size; It seems more natural to me to put this counting into the `DeadSpacer` as this is what this change does. Also, the actual dead space "used" can be calculated from the difference between the `_allowed_deadspace_words` and the maximum (calculated in the constructor of `DeadSpacer`) afaict at the end of evacuation. So there is no need to incur per-object costs during evacuation at all. ------------- Changes requested by tschatzl (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2579 From tschatzl at openjdk.java.net Mon Feb 22 17:23:44 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Mon, 22 Feb 2021 17:23:44 GMT Subject: RFR: 8258431: Provide a JFR event with live set size estimate In-Reply-To: References: <7HVs4jngEbNIQIPQByuE6IRYAxdijfa82uhEFWHld5U=.a7784482-d7e1-4d59-88ee-455d8691631e@github.com> Message-ID: On Thu, 18 Feb 2021 10:15:37 GMT, Aleksey Shipilev wrote: >> The purpose of this change is to expose a 'cheap' estimate of the current live set size (the meaning of 'current' is dependent on each particular GC implementation but in worst case 'at last full GC') in form of a periodically emitted JFR event. 
>> >> ## Introducing new JFR event >> >> While there is already 'GC Heap Summary' JFR event it does not fit the requirements as it is closely tied to GC cycle so eg. for ZGC or Shenandoah it may not happen for quite a long time, increasing the risk of not having the heap summary events being present in the JFR recording at all. >> Because of this I am proposing to add a new 'Heap Usage Summary' event which will be emitted periodically, by default on each JFR chunk, and will contain the information abut the heap capacity, the used and live bytes. This information is available from all GC implementations and can be provided at literally any time. >> >> ## Implementation >> >> The implementation differs from GC to GC because each GC algorithm/implementation provides a slightly different way to track the liveness. The common part is `size_t live() const` method added to `CollectedHeap` superclass and the use of a cached 'liveness' value computed after the last GC cycle. If `liveness` hasn't been calculated yet the implementation will default to returning 'used' value. >> >> The implementations are based on my (rather shallow) knowledge of inner working of the respective GC engines and I am open to suggestions to make them better/correct. >> >> ### Epsilon GC >> >> Trivial implementation - just return `used()` instead. >> >> ### Serial GC >> >> Here we utilize the fact that mark-copy phase is naturally compacting so the number of bytes after copy is 'live' and that the mark-sweep implementation keeps an internal info about objects being 'dead' but excluded from the compaction effort and we can these numbers to derive the old-gen live set size (used bytes minus the cumulative size of the 'un-dead' objects). >> >> ### Parallel GC >> >> For Parallel GC the liveness is calculated as the sum of used bytes in all regions after the last GC cycle. This seems to be a safe bet because this collector is always compacting (AFAIK). >> >> ### G1 GC >> >> Using `G1ConcurrentMark::remark()` method the live set size is computed as a sum of `_live_words` from the associated `G1RegionMarkStats` objects. Here I am not 100% sure this approach covers all eventualities and it would be great to have someone skilled in G1 implementation to chime in so I can fix it. However, the numbers I am getting for G1 are comparable to other GCs for the same application. >> >> ### Shenandoah >> >> In Shenandoah, the regions are keeping the liveness info. However, the VM op that is used for iterating regions is a safe-pointing one so it would be great to run it in an already safe-pointed context. >> This leads to hooking into `ShenandoahConcurrentMark::finish_mark()` and `ShenandoahSTWMark::mark()` where at the end of the marking process the liveness info is summarized and set to `ShenandoahHeap::_live` volatile field - which is later read by the event emitting code. >> >> ### ZGC >> >> `ZStatHeap` is already holding the liveness info - so this implementation is just making it accessible via `ZCollectedHeap::live()` method. > > src/hotspot/share/gc/g1/g1CollectedHeap.cpp line 4578: > >> 4576: >> 4577: void G1CollectedHeap::set_live(size_t bytes) { >> 4578: Atomic::release_store(&_live_size, bytes); > > I don't think this requires `release_store`, regular `store` would be enough. G1 folks can say for sure. Not required. > src/hotspot/share/gc/shared/genCollectedHeap.hpp line 183: > >> 181: size_t live = _live_size; >> 182: return live > 0 ? live : used(); >> 183: }; > > I think the implementation belongs to `genCollectedHeap.cpp`. 
+1. Does not seem to be performance sensitive. ------------- PR: https://git.openjdk.java.net/jdk/pull/2579 From tschatzl at openjdk.java.net Mon Feb 22 17:23:45 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Mon, 22 Feb 2021 17:23:45 GMT Subject: RFR: 8258431: Provide a JFR event with live set size estimate In-Reply-To: References: <7HVs4jngEbNIQIPQByuE6IRYAxdijfa82uhEFWHld5U=.a7784482-d7e1-4d59-88ee-455d8691631e@github.com> Message-ID: On Fri, 19 Feb 2021 08:22:56 GMT, Albert Mingkun Yang wrote: >> The purpose of this change is to expose a 'cheap' estimate of the current live set size (the meaning of 'current' is dependent on each particular GC implementation but in worst case 'at last full GC') in form of a periodically emitted JFR event. >> >> ## Introducing new JFR event >> >> While there is already 'GC Heap Summary' JFR event it does not fit the requirements as it is closely tied to GC cycle so eg. for ZGC or Shenandoah it may not happen for quite a long time, increasing the risk of not having the heap summary events being present in the JFR recording at all. >> Because of this I am proposing to add a new 'Heap Usage Summary' event which will be emitted periodically, by default on each JFR chunk, and will contain the information abut the heap capacity, the used and live bytes. This information is available from all GC implementations and can be provided at literally any time. >> >> ## Implementation >> >> The implementation differs from GC to GC because each GC algorithm/implementation provides a slightly different way to track the liveness. The common part is `size_t live() const` method added to `CollectedHeap` superclass and the use of a cached 'liveness' value computed after the last GC cycle. If `liveness` hasn't been calculated yet the implementation will default to returning 'used' value. >> >> The implementations are based on my (rather shallow) knowledge of inner working of the respective GC engines and I am open to suggestions to make them better/correct. >> >> ### Epsilon GC >> >> Trivial implementation - just return `used()` instead. >> >> ### Serial GC >> >> Here we utilize the fact that mark-copy phase is naturally compacting so the number of bytes after copy is 'live' and that the mark-sweep implementation keeps an internal info about objects being 'dead' but excluded from the compaction effort and we can these numbers to derive the old-gen live set size (used bytes minus the cumulative size of the 'un-dead' objects). >> >> ### Parallel GC >> >> For Parallel GC the liveness is calculated as the sum of used bytes in all regions after the last GC cycle. This seems to be a safe bet because this collector is always compacting (AFAIK). >> >> ### G1 GC >> >> Using `G1ConcurrentMark::remark()` method the live set size is computed as a sum of `_live_words` from the associated `G1RegionMarkStats` objects. Here I am not 100% sure this approach covers all eventualities and it would be great to have someone skilled in G1 implementation to chime in so I can fix it. However, the numbers I am getting for G1 are comparable to other GCs for the same application. >> >> ### Shenandoah >> >> In Shenandoah, the regions are keeping the liveness info. However, the VM op that is used for iterating regions is a safe-pointing one so it would be great to run it in an already safe-pointed context. 
>> This leads to hooking into `ShenandoahConcurrentMark::finish_mark()` and `ShenandoahSTWMark::mark()` where at the end of the marking process the liveness info is summarized and set to `ShenandoahHeap::_live` volatile field - which is later read by the event emitting code. >> >> ### ZGC >> >> `ZStatHeap` is already holding the liveness info - so this implementation is just making it accessible via `ZCollectedHeap::live()` method. > > src/hotspot/share/gc/parallel/parallelScavengeHeap.hpp line 79: > >> 77: size_t _young_live; >> 78: size_t _eden_live; >> 79: size_t _old_live; > > It's only the sum that's ever exposed, right? I wonder if it makes sense to merge them into one var to only track the sum. I agree because they seem to be always read and written at the same time. ------------- PR: https://git.openjdk.java.net/jdk/pull/2579 From rkennke at openjdk.java.net Mon Feb 22 18:56:39 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 22 Feb 2021 18:56:39 GMT Subject: RFR: 8261973: Shenandoah: Cleanup/simplify root verifier In-Reply-To: References: Message-ID: On Fri, 19 Feb 2021 13:53:26 GMT, Zhengyu Gu wrote: > Root processing has gone through significant changes. For example, we used to mark through weak roots when class unloading is off, that is no long the case, OopStorages also simplify roots. > > Shenandoah root verifier can be simplified into 2 cases, with/without class unloading. > > - [x] hotspot_gc_shenandoah with -XX:+ShenandoahVerify Looks good! Thank you! ------------- Marked as reviewed by rkennke (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2643 From rkennke at openjdk.java.net Mon Feb 22 19:03:06 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Mon, 22 Feb 2021 19:03:06 GMT Subject: RFR: 8262049: [TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode [v5] In-Reply-To: References: Message-ID: > Shenandoah's IU mode allows referents to be cleared even when accessed during concurrent marking. The test TestReferenceRefersTo.java needs to be adjusted to allow for that. > > Test: > - [x] TestReferenceRefersTo.java + Shenandoah/IU > - [x] TestReferenceRefersTo.java + Shenandoah/SATB > - [x] TestReferenceRefersTo.java + G1 Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: Fix another compilation failure ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2653/files - new: https://git.openjdk.java.net/jdk/pull/2653/files/51f2d695..44f99b86 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2653&range=04 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2653&range=03-04 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2653.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2653/head:pull/2653 PR: https://git.openjdk.java.net/jdk/pull/2653 From zgu at openjdk.java.net Mon Feb 22 19:16:39 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Mon, 22 Feb 2021 19:16:39 GMT Subject: Integrated: 8261973: Shenandoah: Cleanup/simplify root verifier In-Reply-To: References: Message-ID: On Fri, 19 Feb 2021 13:53:26 GMT, Zhengyu Gu wrote: > Root processing has gone through significant changes. For example, we used to mark through weak roots when class unloading is off, that is no long the case, OopStorages also simplify roots. > > Shenandoah root verifier can be simplified into 2 cases, with/without class unloading. 
> > - [x] hotspot_gc_shenandoah with -XX:+ShenandoahVerify This pull request has now been integrated. Changeset: 7b924d8a Author: Zhengyu Gu URL: https://git.openjdk.java.net/jdk/commit/7b924d8a Stats: 198 lines in 4 files changed: 10 ins; 159 del; 29 mod 8261973: Shenandoah: Cleanup/simplify root verifier Reviewed-by: rkennke ------------- PR: https://git.openjdk.java.net/jdk/pull/2643 From egahlin at openjdk.java.net Mon Feb 22 19:40:39 2021 From: egahlin at openjdk.java.net (Erik Gahlin) Date: Mon, 22 Feb 2021 19:40:39 GMT Subject: RFR: 8258431: Provide a JFR event with live set size estimate In-Reply-To: <7HVs4jngEbNIQIPQByuE6IRYAxdijfa82uhEFWHld5U=.a7784482-d7e1-4d59-88ee-455d8691631e@github.com> References: <7HVs4jngEbNIQIPQByuE6IRYAxdijfa82uhEFWHld5U=.a7784482-d7e1-4d59-88ee-455d8691631e@github.com> Message-ID: On Mon, 15 Feb 2021 17:23:44 GMT, Jaroslav Bachorik wrote: > The purpose of this change is to expose a 'cheap' estimate of the current live set size (the meaning of 'current' is dependent on each particular GC implementation but in worst case 'at last full GC') in form of a periodically emitted JFR event. > > ## Introducing new JFR event > > While there is already 'GC Heap Summary' JFR event it does not fit the requirements as it is closely tied to GC cycle so eg. for ZGC or Shenandoah it may not happen for quite a long time, increasing the risk of not having the heap summary events being present in the JFR recording at all. > Because of this I am proposing to add a new 'Heap Usage Summary' event which will be emitted periodically, by default on each JFR chunk, and will contain the information abut the heap capacity, the used and live bytes. This information is available from all GC implementations and can be provided at literally any time. > > ## Implementation > > The implementation differs from GC to GC because each GC algorithm/implementation provides a slightly different way to track the liveness. The common part is `size_t live() const` method added to `CollectedHeap` superclass and the use of a cached 'liveness' value computed after the last GC cycle. If `liveness` hasn't been calculated yet the implementation will default to returning 'used' value. > > The implementations are based on my (rather shallow) knowledge of inner working of the respective GC engines and I am open to suggestions to make them better/correct. > > ### Epsilon GC > > Trivial implementation - just return `used()` instead. > > ### Serial GC > > Here we utilize the fact that mark-copy phase is naturally compacting so the number of bytes after copy is 'live' and that the mark-sweep implementation keeps an internal info about objects being 'dead' but excluded from the compaction effort and we can these numbers to derive the old-gen live set size (used bytes minus the cumulative size of the 'un-dead' objects). > > ### Parallel GC > > For Parallel GC the liveness is calculated as the sum of used bytes in all regions after the last GC cycle. This seems to be a safe bet because this collector is always compacting (AFAIK). > > ### G1 GC > > Using `G1ConcurrentMark::remark()` method the live set size is computed as a sum of `_live_words` from the associated `G1RegionMarkStats` objects. Here I am not 100% sure this approach covers all eventualities and it would be great to have someone skilled in G1 implementation to chime in so I can fix it. However, the numbers I am getting for G1 are comparable to other GCs for the same application. 
> > ### Shenandoah > > In Shenandoah, the regions are keeping the liveness info. However, the VM op that is used for iterating regions is a safe-pointing one so it would be great to run it in an already safe-pointed context. > This leads to hooking into `ShenandoahConcurrentMark::finish_mark()` and `ShenandoahSTWMark::mark()` where at the end of the marking process the liveness info is summarized and set to `ShenandoahHeap::_live` volatile field - which is later read by the event emitting code. > > ### ZGC > > `ZStatHeap` is already holding the liveness info - so this implementation is just making it accessible via `ZCollectedHeap::live()` method. src/hotspot/share/jfr/metadata/metadata.xml line 205: > 203: > 204: > 205: I think it would be good to mention in the description that it is an estimate, i.e. "Estimate of live bytes ....". ------------- PR: https://git.openjdk.java.net/jdk/pull/2579 From kbarrett at openjdk.java.net Tue Feb 23 08:27:41 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Tue, 23 Feb 2021 08:27:41 GMT Subject: RFR: 8262049: [TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode [v3] In-Reply-To: References: Message-ID: On Mon, 22 Feb 2021 16:22:36 GMT, Roman Kennke wrote: >> test/hotspot/jtreg/gc/TestReferenceRefersToDuringConcMark.java line 28: >> >>> 26: /* @test >>> 27: * @requires vm.gc != "Epsilon" >>> 28: * @requires vm.gc != "Shenandoah" >> >> I think this test works for Shenandoah so long as it's not in IU mode. Is that possible to exclude with another `@requires` constraint? > > How would I do that? IU mode can only be distinguished by VM flag. I've not tried it, but this might work. `@requires vm.gc != "Shenandoah" | vm.opt.ShenandoahGCMode != "iu"` ------------- PR: https://git.openjdk.java.net/jdk/pull/2653 From rkennke at openjdk.java.net Tue Feb 23 08:44:58 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Tue, 23 Feb 2021 08:44:58 GMT Subject: RFR: 8262049: [TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode [v6] In-Reply-To: References: Message-ID: > Shenandoah's IU mode allows referents to be cleared even when accessed during concurrent marking. The test TestReferenceRefersTo.java needs to be adjusted to allow for that. > > Test: > - [x] TestReferenceRefersTo.java + Shenandoah/IU > - [x] TestReferenceRefersTo.java + Shenandoah/SATB > - [x] TestReferenceRefersTo.java + G1 Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: Allow Shenandoah for TestReferenceRefersToDuringConcMark test, except IU mode ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2653/files - new: https://git.openjdk.java.net/jdk/pull/2653/files/44f99b86..8f4d8606 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2653&range=05 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2653&range=04-05 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2653.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2653/head:pull/2653 PR: https://git.openjdk.java.net/jdk/pull/2653 From rkennke at openjdk.java.net Tue Feb 23 08:44:58 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Tue, 23 Feb 2021 08:44:58 GMT Subject: RFR: 8262049: [TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode [v3] In-Reply-To: References: Message-ID: On Tue, 23 Feb 2021 08:24:51 GMT, Kim Barrett wrote: >> How would I do that? IU mode can only be distinguished by VM flag. > > I've not tried it, but this might work. 
> `@requires vm.gc != "Shenandoah" | vm.opt.ShenandoahGCMode != "iu"` Indeed, it does. I changed the test requires accordingly. ------------- PR: https://git.openjdk.java.net/jdk/pull/2653 From ayang at openjdk.java.net Tue Feb 23 09:30:42 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Tue, 23 Feb 2021 09:30:42 GMT Subject: Integrated: 8262087: Use atomic boolean type in G1FullGCAdjustTask In-Reply-To: References: Message-ID: On Sun, 21 Feb 2021 11:30:52 GMT, Albert Mingkun Yang wrote: > Use atomic boolean type to make the intention clear. This pull request has now been integrated. Changeset: 12f6ba0d Author: Albert Mingkun Yang Committer: Thomas Schatzl URL: https://git.openjdk.java.net/jdk/commit/12f6ba0d Stats: 4 lines in 2 files changed: 0 ins; 1 del; 3 mod 8262087: Use atomic boolean type in G1FullGCAdjustTask Reviewed-by: kbarrett, tschatzl ------------- PR: https://git.openjdk.java.net/jdk/pull/2664 From tschatzl at openjdk.java.net Tue Feb 23 14:16:00 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Tue, 23 Feb 2021 14:16:00 GMT Subject: RFR: 8262197: JDK-8242032 uses wrong contains_reference() in assertion code Message-ID: <66c7QAgFeccKl5T9du1SQOo8XsPazCW5kbnxlwf2HD0=.b34272fa-277b-42ee-a331-71f5da075504@github.com> Hi all, can I have reviews for this change that fixes use of the wrong HeapRegionRemSet::contains_reference() method, causing a thread lock the same mutex again, resulting in problems like assertion failures? The code in question has been introduced in JDK-8242032: + // Rechecking if the region is coarsened, while holding the lock. + if (is_region_coarsened(from_hrm_ind)) { + assert(contains_reference(from), "We just found " PTR_FORMAT " in the Coarse table", p2i(from)); + return; + } The problem is the call to `contains_reference`, which locks the same lock that "we know we are already locking" per the comment above. Correct is using `contains_reference_locked` added for just this purpose. In the original change the PR already mentioned that the situation where this condition should hold could not be reproduced - now we know that it actually occurs ;) Testing: tier1. Trying to reproduce with the original some of the failing tests without luck - however the problematic line and the fix is very obvious. Thanks, Thomas ------------- Commit messages: - Use correct contains_reference_locked Changes: https://git.openjdk.java.net/jdk/pull/2690/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2690&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8262197 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2690.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2690/head:pull/2690 PR: https://git.openjdk.java.net/jdk/pull/2690 From ayang at openjdk.java.net Tue Feb 23 15:28:41 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Tue, 23 Feb 2021 15:28:41 GMT Subject: RFR: 8262197: JDK-8242032 uses wrong contains_reference() in assertion code In-Reply-To: <66c7QAgFeccKl5T9du1SQOo8XsPazCW5kbnxlwf2HD0=.b34272fa-277b-42ee-a331-71f5da075504@github.com> References: <66c7QAgFeccKl5T9du1SQOo8XsPazCW5kbnxlwf2HD0=.b34272fa-277b-42ee-a331-71f5da075504@github.com> Message-ID: On Tue, 23 Feb 2021 13:22:25 GMT, Thomas Schatzl wrote: > Hi all, > > can I have reviews for this change that fixes use of the wrong HeapRegionRemSet::contains_reference() method, causing a thread lock the same mutex again, resulting in problems like assertion failures? 
> > The code in question has been introduced in JDK-8242032: > > > + // Rechecking if the region is coarsened, while holding the lock. > + if (is_region_coarsened(from_hrm_ind)) { > + assert(contains_reference(from), "We just found " PTR_FORMAT " in the Coarse table", p2i(from)); > + return; > + } > > The problem is the call to `contains_reference`, which locks the same lock that "we know we are already locking" per the comment above. Correct is using `contains_reference_locked` added for just this purpose. > > In the original change the PR already mentioned that the situation where this condition should hold could not be reproduced - now we know that it actually occurs ;) > > Testing: tier1. Trying to reproduce with the original some of the failing tests without luck - however the problematic line and the fix is very obvious. > > Thanks, > Thomas Marked as reviewed by ayang (Author). ------------- PR: https://git.openjdk.java.net/jdk/pull/2690 From kbarrett at openjdk.java.net Tue Feb 23 15:48:39 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Tue, 23 Feb 2021 15:48:39 GMT Subject: RFR: 8262049: [TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode [v6] In-Reply-To: References: Message-ID: On Tue, 23 Feb 2021 08:44:58 GMT, Roman Kennke wrote: >> Shenandoah's IU mode allows referents to be cleared even when accessed during concurrent marking. The test TestReferenceRefersTo.java needs to be adjusted to allow for that. >> >> Test: >> - [x] TestReferenceRefersTo.java + Shenandoah/IU >> - [x] TestReferenceRefersTo.java + Shenandoah/SATB >> - [x] TestReferenceRefersTo.java + G1 > > Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: > > Allow Shenandoah for TestReferenceRefersToDuringConcMark test, except IU mode Marked as reviewed by kbarrett (Reviewer). ------------- PR: https://git.openjdk.java.net/jdk/pull/2653 From kbarrett at openjdk.java.net Tue Feb 23 15:53:40 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Tue, 23 Feb 2021 15:53:40 GMT Subject: RFR: 8262197: JDK-8242032 uses wrong contains_reference() in assertion code In-Reply-To: <66c7QAgFeccKl5T9du1SQOo8XsPazCW5kbnxlwf2HD0=.b34272fa-277b-42ee-a331-71f5da075504@github.com> References: <66c7QAgFeccKl5T9du1SQOo8XsPazCW5kbnxlwf2HD0=.b34272fa-277b-42ee-a331-71f5da075504@github.com> Message-ID: On Tue, 23 Feb 2021 13:22:25 GMT, Thomas Schatzl wrote: > Hi all, > > can I have reviews for this change that fixes use of the wrong HeapRegionRemSet::contains_reference() method, causing a thread lock the same mutex again, resulting in problems like assertion failures? > > The code in question has been introduced in JDK-8242032: > > > + // Rechecking if the region is coarsened, while holding the lock. > + if (is_region_coarsened(from_hrm_ind)) { > + assert(contains_reference(from), "We just found " PTR_FORMAT " in the Coarse table", p2i(from)); > + return; > + } > > The problem is the call to `contains_reference`, which locks the same lock that "we know we are already locking" per the comment above. Correct is using `contains_reference_locked` added for just this purpose. > > In the original change the PR already mentioned that the situation where this condition should hold could not be reproduced - now we know that it actually occurs ;) > > Testing: tier1. Trying to reproduce with the original some of the failing tests without luck - however the problematic line and the fix is very obvious. > > Thanks, > Thomas Marked as reviewed by kbarrett (Reviewer). 
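For reference, the one-line fix under review (reconstructed from the snippet quoted above, not the verbatim patch) amounts to calling the _locked variant from the already-locked path:

```
// Rechecking if the region is coarsened, while holding the lock.
if (is_region_coarsened(from_hrm_ind)) {
  // The caller already holds the lock, so use contains_reference_locked();
  // plain contains_reference() would try to take the same mutex again.
  assert(contains_reference_locked(from), "We just found " PTR_FORMAT " in the Coarse table", p2i(from));
  return;
}
```

Everything else in the assertion stays as it was; only the lookup that would re-acquire the mutex changes.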
------------- PR: https://git.openjdk.java.net/jdk/pull/2690 From kbarrett at openjdk.java.net Tue Feb 23 15:53:40 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Tue, 23 Feb 2021 15:53:40 GMT Subject: RFR: 8262197: JDK-8242032 uses wrong contains_reference() in assertion code In-Reply-To: References: <66c7QAgFeccKl5T9du1SQOo8XsPazCW5kbnxlwf2HD0=.b34272fa-277b-42ee-a331-71f5da075504@github.com> Message-ID: On Tue, 23 Feb 2021 15:49:17 GMT, Kim Barrett wrote: >> Hi all, >> >> can I have reviews for this change that fixes use of the wrong HeapRegionRemSet::contains_reference() method, causing a thread lock the same mutex again, resulting in problems like assertion failures? >> >> The code in question has been introduced in JDK-8242032: >> >> >> + // Rechecking if the region is coarsened, while holding the lock. >> + if (is_region_coarsened(from_hrm_ind)) { >> + assert(contains_reference(from), "We just found " PTR_FORMAT " in the Coarse table", p2i(from)); >> + return; >> + } >> >> The problem is the call to `contains_reference`, which locks the same lock that "we know we are already locking" per the comment above. Correct is using `contains_reference_locked` added for just this purpose. >> >> In the original change the PR already mentioned that the situation where this condition should hold could not be reproduced - now we know that it actually occurs ;) >> >> Testing: tier1. Trying to reproduce with the original some of the failing tests without luck - however the problematic line and the fix is very obvious. >> >> Thanks, >> Thomas > > Marked as reviewed by kbarrett (Reviewer). As this is pretty simple, and seems likely to introduce a lot of testing noise, I'd be okay with pushing without waiting for 24h. ------------- PR: https://git.openjdk.java.net/jdk/pull/2690 From tschatzl at openjdk.java.net Tue Feb 23 15:57:41 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Tue, 23 Feb 2021 15:57:41 GMT Subject: RFR: 8262197: JDK-8242032 uses wrong contains_reference() in assertion code In-Reply-To: References: <66c7QAgFeccKl5T9du1SQOo8XsPazCW5kbnxlwf2HD0=.b34272fa-277b-42ee-a331-71f5da075504@github.com> Message-ID: On Tue, 23 Feb 2021 15:25:56 GMT, Albert Mingkun Yang wrote: >> Hi all, >> >> can I have reviews for this change that fixes use of the wrong HeapRegionRemSet::contains_reference() method, causing a thread lock the same mutex again, resulting in problems like assertion failures? >> >> The code in question has been introduced in JDK-8242032: >> >> >> + // Rechecking if the region is coarsened, while holding the lock. >> + if (is_region_coarsened(from_hrm_ind)) { >> + assert(contains_reference(from), "We just found " PTR_FORMAT " in the Coarse table", p2i(from)); >> + return; >> + } >> >> The problem is the call to `contains_reference`, which locks the same lock that "we know we are already locking" per the comment above. Correct is using `contains_reference_locked` added for just this purpose. >> >> In the original change the PR already mentioned that the situation where this condition should hold could not be reproduced - now we know that it actually occurs ;) >> >> Testing: tier1. Trying to reproduce with the original some of the failing tests without luck - however the problematic line and the fix is very obvious. >> >> Thanks, >> Thomas > > Marked as reviewed by ayang (Author). Thanks @albertnetymk @kimbarrett for the reviews. 
------------- PR: https://git.openjdk.java.net/jdk/pull/2690 From tschatzl at openjdk.java.net Tue Feb 23 15:57:43 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Tue, 23 Feb 2021 15:57:43 GMT Subject: Integrated: 8262197: JDK-8242032 uses wrong contains_reference() in assertion code In-Reply-To: <66c7QAgFeccKl5T9du1SQOo8XsPazCW5kbnxlwf2HD0=.b34272fa-277b-42ee-a331-71f5da075504@github.com> References: <66c7QAgFeccKl5T9du1SQOo8XsPazCW5kbnxlwf2HD0=.b34272fa-277b-42ee-a331-71f5da075504@github.com> Message-ID: On Tue, 23 Feb 2021 13:22:25 GMT, Thomas Schatzl wrote: > Hi all, > > can I have reviews for this change that fixes use of the wrong HeapRegionRemSet::contains_reference() method, causing a thread lock the same mutex again, resulting in problems like assertion failures? > > The code in question has been introduced in JDK-8242032: > > > + // Rechecking if the region is coarsened, while holding the lock. > + if (is_region_coarsened(from_hrm_ind)) { > + assert(contains_reference(from), "We just found " PTR_FORMAT " in the Coarse table", p2i(from)); > + return; > + } > > The problem is the call to `contains_reference`, which locks the same lock that "we know we are already locking" per the comment above. Correct is using `contains_reference_locked` added for just this purpose. > > In the original change the PR already mentioned that the situation where this condition should hold could not be reproduced - now we know that it actually occurs ;) > > Testing: tier1. Trying to reproduce with the original some of the failing tests without luck - however the problematic line and the fix is very obvious. > > Thanks, > Thomas This pull request has now been integrated. Changeset: 67762de6 Author: Thomas Schatzl URL: https://git.openjdk.java.net/jdk/commit/67762de6 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod 8262197: JDK-8242032 uses wrong contains_reference() in assertion code Reviewed-by: ayang, kbarrett ------------- PR: https://git.openjdk.java.net/jdk/pull/2690 From zgu at openjdk.java.net Tue Feb 23 18:46:43 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Tue, 23 Feb 2021 18:46:43 GMT Subject: RFR: 8262049: [TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode [v6] In-Reply-To: References: Message-ID: <6sTm2s7uGccTmtNgAPHE0f4jo8xnLq2wO48h77eKvms=.a7e251f4-f02a-41c9-a4a5-18e9a4a1b6c1@github.com> On Tue, 23 Feb 2021 08:44:58 GMT, Roman Kennke wrote: >> Shenandoah's IU mode allows referents to be cleared even when accessed during concurrent marking. The test TestReferenceRefersTo.java needs to be adjusted to allow for that. >> >> Test: >> - [x] TestReferenceRefersTo.java + Shenandoah/IU >> - [x] TestReferenceRefersTo.java + Shenandoah/SATB >> - [x] TestReferenceRefersTo.java + G1 > > Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: > > Allow Shenandoah for TestReferenceRefersToDuringConcMark test, except IU mode Good to me. ------------- Marked as reviewed by zgu (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2653 From rkennke at openjdk.java.net Tue Feb 23 21:46:42 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Tue, 23 Feb 2021 21:46:42 GMT Subject: Integrated: 8262049: [TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode In-Reply-To: References: Message-ID: On Fri, 19 Feb 2021 19:48:51 GMT, Roman Kennke wrote: > Shenandoah's IU mode allows referents to be cleared even when accessed during concurrent marking. 
The test TestReferenceRefersTo.java needs to be adjusted to allow for that. > > Test: > - [x] TestReferenceRefersTo.java + Shenandoah/IU > - [x] TestReferenceRefersTo.java + Shenandoah/SATB > - [x] TestReferenceRefersTo.java + G1 This pull request has now been integrated. Changeset: c6eae061 Author: Roman Kennke URL: https://git.openjdk.java.net/jdk/commit/c6eae061 Stats: 139 lines in 2 files changed: 127 ins; 7 del; 5 mod 8262049: [TESTBUG] Fix TestReferenceRefersTo.java for Shenandoah IU mode Reviewed-by: kbarrett, zgu ------------- PR: https://git.openjdk.java.net/jdk/pull/2653 From dholmes at openjdk.java.net Tue Feb 23 22:11:53 2021 From: dholmes at openjdk.java.net (David Holmes) Date: Tue, 23 Feb 2021 22:11:53 GMT Subject: Integrated: 8262266: JDK-8262049 fails validate-source In-Reply-To: References: Message-ID: On Tue, 23 Feb 2021 22:05:59 GMT, Daniel D. Daugherty wrote: > A trivial copyright header fix. > Now passes "make CONF=macosx-x86_64-normal-server-release validate-headers". LGTM. Thanks, David ------------- Marked as reviewed by dholmes (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2699 From dcubed at openjdk.java.net Tue Feb 23 22:11:52 2021 From: dcubed at openjdk.java.net (Daniel D.Daugherty) Date: Tue, 23 Feb 2021 22:11:52 GMT Subject: Integrated: 8262266: JDK-8262049 fails validate-source Message-ID: A trivial copyright header fix. Now passes "make CONF=macosx-x86_64-normal-server-release validate-headers". ------------- Commit messages: - 8262266: JDK-8262049 fails validate-source Changes: https://git.openjdk.java.net/jdk/pull/2699/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2699&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8262266 Stats: 2 lines in 2 files changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.java.net/jdk/pull/2699.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2699/head:pull/2699 PR: https://git.openjdk.java.net/jdk/pull/2699 From dcubed at openjdk.java.net Tue Feb 23 22:11:53 2021 From: dcubed at openjdk.java.net (Daniel D.Daugherty) Date: Tue, 23 Feb 2021 22:11:53 GMT Subject: Integrated: 8262266: JDK-8262049 fails validate-source In-Reply-To: References: Message-ID: <_YzfSSdluXPIo-ys8ftnOmsjbUcZ7cut9w4hv42GLno=.8044827c-d1e9-4306-9de4-e6f4486264e9@github.com> On Tue, 23 Feb 2021 22:05:59 GMT, Daniel D. Daugherty wrote: > A trivial copyright header fix. > Now passes "make CONF=macosx-x86_64-normal-server-release validate-headers". This pull request has now been integrated. Changeset: c769388d Author: Daniel D. Daugherty URL: https://git.openjdk.java.net/jdk/commit/c769388d Stats: 2 lines in 2 files changed: 0 ins; 0 del; 2 mod 8262266: JDK-8262049 fails validate-source Reviewed-by: dholmes ------------- PR: https://git.openjdk.java.net/jdk/pull/2699 From dcubed at openjdk.java.net Tue Feb 23 22:11:53 2021 From: dcubed at openjdk.java.net (Daniel D.Daugherty) Date: Tue, 23 Feb 2021 22:11:53 GMT Subject: Integrated: 8262266: JDK-8262049 fails validate-source In-Reply-To: References: Message-ID: On Tue, 23 Feb 2021 22:07:45 GMT, David Holmes wrote: >> A trivial copyright header fix. >> Now passes "make CONF=macosx-x86_64-normal-server-release validate-headers". > > LGTM. > > Thanks, > David @dholmes-ora - Thanks for the fast review! 
------------- PR: https://git.openjdk.java.net/jdk/pull/2699 From stuefe at openjdk.java.net Wed Feb 24 08:25:50 2021 From: stuefe at openjdk.java.net (Thomas Stuefe) Date: Wed, 24 Feb 2021 08:25:50 GMT Subject: RFR: JDK-8256155: os::Linux Populate all large_page_sizes, select smallest page size in reserve_memory_special_huge_tlbfs* [v16] In-Reply-To: References: Message-ID: On Wed, 24 Feb 2021 08:12:52 GMT, Thomas Stuefe wrote: >> Marcus G K Williams has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 25 commits: >> >> - Merge branch 'master' into pull/1153 >> - kstefanj update >> >> Signed-off-by: Marcus G K Williams >> - Merge branch 'master' into update_hlp >> - Merge branch 'master' into update_hlp >> - Remove extraneous ' from warning >> >> Signed-off-by: Marcus G K Williams >> - Merge branch 'master' into update_hlp >> - Merge branch 'master' into update_hlp >> - Merge branch 'master' into update_hlp >> - Fix os::large_page_size() in last update >> >> Signed-off-by: Marcus G K Williams >> - Ivan W. Requested Changes >> >> Removed os::Linux::select_large_page_size and >> use os::page_size_for_region instead >> >> Removed Linux::find_large_page_size and use >> register_large_page_sizes. Streamlined >> Linux::setup_large_page_size >> >> Signed-off-by: Marcus G K Williams >> - ... and 15 more: https://git.openjdk.java.net/jdk/compare/f4cfd758...f2e44ac7 > > src/hotspot/os/linux/os_linux.cpp line 3670: > >> 3668: // If we can't open /sys/kernel/mm/hugepages >> 3669: // Add _default_large_page_size to _page_sizes >> 3670: _page_sizes.add(_default_large_page_size); > > missing return here. But see my general remarks. I would modify this function to not change outside state at all, just to return the found page sizes in a os::PageSizes object. ------------- PR: https://git.openjdk.java.net/jdk/pull/1153 From stuefe at openjdk.java.net Wed Feb 24 08:25:48 2021 From: stuefe at openjdk.java.net (Thomas Stuefe) Date: Wed, 24 Feb 2021 08:25:48 GMT Subject: RFR: JDK-8256155: os::Linux Populate all large_page_sizes, select smallest page size in reserve_memory_special_huge_tlbfs* [v16] In-Reply-To: References: Message-ID: On Tue, 16 Feb 2021 16:32:56 GMT, Marcus G K Williams wrote: >> When using LargePageSizeInBytes=1G, os::Linux::reserve_memory_special_huge_tlbfs* cannot select large pages smaller than 1G. Code heap usually uses less than 1G, so currently the code precludes code heap from using >> Large pages in this circumstance and when os::Linux::reserve_memory_special_huge_tlbfs* is called page sizes fall back to Linux::page_size() (usually 4k). >> >> This change allows the above use case by populating all large_page_sizes present in /sys/kernel/mm/hugepages in _page_sizes upon calling os::Linux::setup_large_page_size(). >> >> In os::Linux::reserve_memory_special_huge_tlbfs* we then select the largest large page size available in _page_sizes that is smaller than bytes being reserved. > > Marcus G K Williams has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains 25 commits: > > - Merge branch 'master' into pull/1153 > - kstefanj update > > Signed-off-by: Marcus G K Williams > - Merge branch 'master' into update_hlp > - Merge branch 'master' into update_hlp > - Remove extraneous ' from warning > > Signed-off-by: Marcus G K Williams > - Merge branch 'master' into update_hlp > - Merge branch 'master' into update_hlp > - Merge branch 'master' into update_hlp > - Fix os::large_page_size() in last update > > Signed-off-by: Marcus G K Williams > - Ivan W. Requested Changes > > Removed os::Linux::select_large_page_size and > use os::page_size_for_region instead > > Removed Linux::find_large_page_size and use > register_large_page_sizes. Streamlined > Linux::setup_large_page_size > > Signed-off-by: Marcus G K Williams > - ... and 15 more: https://git.openjdk.java.net/jdk/compare/f4cfd758...f2e44ac7 Hi Markus, Many apologies for letting this cook too long, the last months have been hectic. I looked closer at the code today, at least the initialization parts, and have some suggestions and remarks. Will look at the runtime side later. A lot of my remarks will be referring to pre-existing code without me pointing it out each time, just know that I am aware that a lot of that stuff has nothing to do with your work. I propose some simplifications and streamlining with initialization. Main point would be to clearly separate getting information from the OS from post-processing (consistency checks and decisions), in addition to a bit clearer naming. --- We have `find_default_large_page_size()` and `register_large_page_sizes()`. The names could be a bit clearer, and I do not think they should be known outside of this file, so I would propose to redefine them to be local convenience functions which just scan the proc fs and do not change outside state, just return values, like this: - `static size_t scan_default_large_page_size();` - `static os::PageSizes scan_multiple_page_support();` (naming is lent from vm/hugetlbpage.txt) --- Today, in `find_default_large_page_size()`, if we have no default huge page configured, currently we return a hard coded default: https://github.com/openjdk/jdk/blob/f2e44ac726bad2e7db1ec9f5e77703a99ccfb683/src/hotspot/os/linux/os_linux.cpp#L3627-L3636 I am not sure this makes sense. The kernel documentation states that if this entry does not exist, we cannot use huge pages. I would consider removing this and just return 0 in that case. The point is that these low level convenience functions should read OS information and not make up stuff. Making up stuff should be done, if at all, in the caller. --- When consistency checking and post-processing what we got from the OS, note that there are slight inconsistencies (preexisting) how we handle things: - we gracefully handle the non-existence of /sys/kernel/mm/hugepages - or if it exists, the fact that the default page size may be missing from it - by transparently adding the default huge page size to os::_page_sizes. - But if the user specifies UseLargePageSize in bytes, overwriting the default large page size, we now require the multi page size to be present in os::_page_sizes. So in that case, /sys/kernel/mm/hugepages had to be present. I mean, either we trust /sys/kernel/mm/hugepages, or we don't. We happily make up page sizes in find_default_large_page_size(), but here we check rather strictly. It makes sense to check the user input for validity, but then, could we not always just require /sys/kernel/mm/hugepages to be present and consistent with /proc/meminfo? 
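To make the proposal above a little more concrete, a minimal sketch of the two scanners might look as follows. This is an assumption-laden illustration, not a patch: it presumes the usual Linux interfaces (/proc/meminfo reporting "Hugepagesize: <n> kB", and one "hugepages-<n>kB" directory per configured size under /sys/kernel/mm/hugepages), and that os::PageSizes is default-constructible and cheap to return by value.

```
#include <dirent.h>
#include <stdio.h>

// Returns the system default huge page size in bytes, or 0 if the kernel
// reports none (in which case huge pages cannot be used at all).
static size_t scan_default_large_page_size() {
  size_t result = 0;
  FILE* f = fopen("/proc/meminfo", "r");
  if (f != NULL) {
    char line[128];
    while (fgets(line, sizeof(line), f) != NULL) {
      size_t sz = 0;
      if (sscanf(line, "Hugepagesize: %zu kB", &sz) == 1) {
        result = sz * K;  // kernel reports kB, hotspot uses bytes
        break;
      }
    }
    fclose(f);
  }
  return result;
}

// Returns every huge page size configured on the system, without touching
// any global state; the caller decides what to do with the result.
static os::PageSizes scan_multiple_page_support() {
  os::PageSizes pagesizes;
  DIR* dir = opendir("/sys/kernel/mm/hugepages");
  if (dir != NULL) {
    struct dirent* entry;
    size_t page_size = 0;
    while ((entry = readdir(dir)) != NULL) {
      if (sscanf(entry->d_name, "hugepages-%zukB", &page_size) == 1) {
        pagesizes.add(page_size * K);
      }
    }
    closedir(dir);
  }
  return pagesizes;
}
```

os::large_page_init() would then do all post-processing (LargePageSizeInBytes handling, consistency checks, choosing the large page type) on the returned values, as sketched further below.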
--- I am not sure of the usefulness of `os::Linux::setup_large_page_size()`. Its just a thin wrapper. I would remove it and merge it directly into `os::large_page_init()`, which would be easier to understand. So, `os::large_page_init()` could look like this: void os::large_page_init() { // 1) Handle the case where we do not want to use huge pages and hence // there is no need to scan the OS for related info if (!UseLargePages && !UseTransparentHugePages && !UseHugeTLBFS && !UseSHM) { // Not using large pages. return; } if (!FLAG_IS_DEFAULT(UseLargePages) && !UseLargePages) { // The user explicitly turned off large pages. // Ignore the rest of the large pages flags. UseTransparentHugePages = false; UseHugeTLBFS = false; UseSHM = false; return; } // 2) Scan OS info size_t default_large_page_size = scan_default_large_page_size(); if (default_large_page_size == 0) { // We are done, no large pages configured. UseTransparentHugePages = false; UseHugeTLBFS = false; UseSHM = false; return; } os::PageSizes all_pages = scan_multiple_page_support(); // 3) Consistency check and post-processing // It is unclear if /sys/kernel/mm/hugepages/ and /proc/meminfo could disagree. Manually // re-add the default page size to the list of page sizes to be sure. all_pages.add(default_large_page_size); // Handle LargePageSizeInBytes if (!FLAG_IS_DEFAULT(LargePageSizeInBytes) && LargePageSizeInBytes != _default_large_page_size) { ... blabla default_large_page_size = LargePageSizeInBytes log_info(os)("Overriding default huge page size.."); ... } // Now determine the type of large pages to use: os::Linux::setup_large_page_type() set_coredump_filter(LARGEPAGES_BIT); // Any final logging: logloglog } What do you think? I think this would be a bit easier to read and understand, and we have that clear separation between scanning OS info and deciding what we do with it. Still a small nit is that we let the user override the OS info with LargePageSizeInBytes. I rather would have a variable containing unmodified OS info, and a separate variable for whatever we make up. But thats just a small issue. src/hotspot/os/linux/os_linux.cpp line 3679: > 3677: sscanf(entry->d_name, "hugepages-%zukB", &page_size) == 1) { > 3678: // The kernel is using kB, hotspot uses bytes > 3679: if (page_size * K > (size_t)Linux::page_size()) { I do not think excluding the base page size is needed here. The directory only contains entries for huge pages. If for any weird reason this is the same as the base page size (which I have never seen) I would include it, since its a huge page too. But I do not think this can happen. src/hotspot/os/linux/os_linux.cpp line 3670: > 3668: // If we can't open /sys/kernel/mm/hugepages > 3669: // Add _default_large_page_size to _page_sizes > 3670: _page_sizes.add(_default_large_page_size); missing return here. src/hotspot/os/linux/os_linux.cpp line 3692: > 3690: ls.print("Available page sizes: "); > 3691: _page_sizes.print_on(&ls); > 3692: } Does this work and show something? I know UL is initialization time sensitive (which is annoying btw). ------------- Changes requested by stuefe (Reviewer). 
PR: https://git.openjdk.java.net/jdk/pull/1153 From tschatzl at openjdk.java.net Wed Feb 24 08:36:48 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Wed, 24 Feb 2021 08:36:48 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early Message-ID: Hello, can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? This can save lots of memory with negligible other impact. Long version: Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. This means that in some cases a significant amount of memory is used for regions that will never be evacuated. In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). In some cases doing this can save half of peak remembered set memory usage. There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. 
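To make the proposed pruning step concrete, candidate selection would end with something roughly like the following. This is an illustrative sketch only; prune()'s two parameters match the patch, while the caller-side names and the exact threshold expression are assumptions.

```
// Candidates are kept sorted, so pruning drops the least promising regions
// from the tail of the list, never giving up more reclaimable bytes than
// G1HeapWastePercent of the current heap capacity, and always keeping
// enough regions for at least one "minimal" mixed collection.
size_t allowed_waste = _g1h->capacity() * G1HeapWastePercent / 100;  // assumed expression
uint keep_min_regions = min_regions_for_one_mixed_gc();              // hypothetical helper
candidates->prune(keep_min_regions, allowed_waste);
```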
Testing: tier1-5, Oracle internal performance test suite Thanks, Thomas ------------- Commit messages: - Prune early initial commit Changes: https://git.openjdk.java.net/jdk/pull/2693/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8262185 Stats: 59 lines in 5 files changed: 43 ins; 10 del; 6 mod Patch: https://git.openjdk.java.net/jdk/pull/2693.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2693/head:pull/2693 PR: https://git.openjdk.java.net/jdk/pull/2693 From sjohanss at openjdk.java.net Wed Feb 24 08:40:45 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Wed, 24 Feb 2021 08:40:45 GMT Subject: RFR: JDK-8256155: os::Linux Populate all large_page_sizes, select smallest page size in reserve_memory_special_huge_tlbfs* [v16] In-Reply-To: References: Message-ID: On Wed, 24 Feb 2021 08:14:48 GMT, Thomas Stuefe wrote: >> Marcus G K Williams has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 25 commits: >> >> - Merge branch 'master' into pull/1153 >> - kstefanj update >> >> Signed-off-by: Marcus G K Williams >> - Merge branch 'master' into update_hlp >> - Merge branch 'master' into update_hlp >> - Remove extraneous ' from warning >> >> Signed-off-by: Marcus G K Williams >> - Merge branch 'master' into update_hlp >> - Merge branch 'master' into update_hlp >> - Merge branch 'master' into update_hlp >> - Fix os::large_page_size() in last update >> >> Signed-off-by: Marcus G K Williams >> - Ivan W. Requested Changes >> >> Removed os::Linux::select_large_page_size and >> use os::page_size_for_region instead >> >> Removed Linux::find_large_page_size and use >> register_large_page_sizes. Streamlined >> Linux::setup_large_page_size >> >> Signed-off-by: Marcus G K Williams >> - ... and 15 more: https://git.openjdk.java.net/jdk/compare/f4cfd758...f2e44ac7 > > src/hotspot/os/linux/os_linux.cpp line 3692: > >> 3690: ls.print("Available page sizes: "); >> 3691: _page_sizes.print_on(&ls); >> 3692: } > > Does this work and show something? I know UL is initialization time sensitive (which is annoying btw). This comes from I comment I made and UL is initialized here. Not sure this is exactly where this should end up since it will only be printed when large pages are enabled. I think it might make sense to move somewhere else or make it a completely separate change. ------------- PR: https://git.openjdk.java.net/jdk/pull/1153 From sjohanss at openjdk.java.net Wed Feb 24 09:00:43 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Wed, 24 Feb 2021 09:00:43 GMT Subject: RFR: JDK-8256155: os::Linux Populate all large_page_sizes, select smallest page size in reserve_memory_special_huge_tlbfs* [v16] In-Reply-To: References: Message-ID: On Wed, 24 Feb 2021 08:23:13 GMT, Thomas Stuefe wrote: > What do you think? I think this would be a bit easier to read and understand, and we have that clear separation between scanning OS info and deciding what we do with it. > I think what you propose Thomas looks good. One additional thing to keep in mind and think about here is how we should do the "sanity checking" when allowing multiple large page sizes. I think the best thing would be to sanity check all and if none succeeds disable `UseLargePages`. > Still a small nit is that we let the user override the OS info with LargePageSizeInBytes. I rather would have a variable containing unmodified OS info, and a separate variable for whatever we make up. 
But thats just a small issue. I think we need to rethink exactly what `LargePageSizeInBytes` means when allowing multiple large page sizes. I've poked around in this area quite a bit lately and I'm not sure this flag is needed when we scan for available page sizes. But to allow it to go away we would have to change the APIs a bit to start passing down the page size we want to use for a certain mapping rather than using `os::large_page_size()` to get the page size. ------------- PR: https://git.openjdk.java.net/jdk/pull/1153 From lkorinth at openjdk.java.net Wed Feb 24 12:05:55 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Wed, 24 Feb 2021 12:05:55 GMT Subject: RFR: 8261804: Remove field _processing_is_mt, calculate it instead Message-ID: In the reference processor, remove _processing_is_mt. Instead calculate its value in its accessor, processing_is_mt(). This change will remove some state, make RefProcMTDegreeAdjuster a bit simpler and make it much easier to derive when processing is indeed multi threaded. ------------- Commit messages: - 8261804: Remove field _processing_is_mt, calculate it instead Changes: https://git.openjdk.java.net/jdk/pull/2704/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2704&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8261804 Stats: 41 lines in 5 files changed: 2 ins; 16 del; 23 mod Patch: https://git.openjdk.java.net/jdk/pull/2704.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2704/head:pull/2704 PR: https://git.openjdk.java.net/jdk/pull/2704 From ayang at openjdk.java.net Wed Feb 24 12:44:38 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Wed, 24 Feb 2021 12:44:38 GMT Subject: RFR: 8261804: Remove field _processing_is_mt, calculate it instead In-Reply-To: References: Message-ID: On Wed, 24 Feb 2021 11:59:48 GMT, Leo Korinth wrote: > In the reference processor, remove _processing_is_mt. Instead calculate its value in its accessor, processing_is_mt(). > > This change will remove some state, make RefProcMTDegreeAdjuster a bit simpler and make it much easier to derive when processing is indeed multi threaded. I am curious if `mt_discovery` could be removed as well following the same reasoning. ------------- Marked as reviewed by ayang (Author). PR: https://git.openjdk.java.net/jdk/pull/2704 From iwalulya at openjdk.java.net Wed Feb 24 12:59:40 2021 From: iwalulya at openjdk.java.net (Ivan Walulya) Date: Wed, 24 Feb 2021 12:59:40 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early In-Reply-To: References: Message-ID: <8NBL2PJqPm-dFtKwt9Ys9-wR-QPanaMa51NdhEU8WgU=.bba2d13c-6b41-40b5-aaa5-1e4ee24358ee@github.com> On Tue, 23 Feb 2021 14:07:33 GMT, Thomas Schatzl wrote: > Hello, > > can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? > > This can save lots of memory with negligible other impact. > > Long version: > Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. > > Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. > > This means that in some cases a significant amount of memory is used for regions that will never be evacuated. 
In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. > > In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. > > The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). > > In some cases doing this can save half of peak remembered set memory usage. > > There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. > > * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. > (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). > This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. > > * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). > You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. > > Testing: tier1-5, Oracle internal performance test suite > > Thanks, > Thomas Lgtm! ------------- Marked as reviewed by iwalulya (Committer). PR: https://git.openjdk.java.net/jdk/pull/2693 From lkorinth at openjdk.java.net Wed Feb 24 13:22:39 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Wed, 24 Feb 2021 13:22:39 GMT Subject: RFR: 8261804: Remove field _processing_is_mt, calculate it instead In-Reply-To: References: Message-ID: On Wed, 24 Feb 2021 12:42:07 GMT, Albert Mingkun Yang wrote: > I am curious if `mt_discovery` could be removed as well following the same reasoning. Possibly, I will look into it. The logic is a bit different though. ------------- PR: https://git.openjdk.java.net/jdk/pull/2704 From tschatzl at openjdk.java.net Wed Feb 24 13:39:59 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Wed, 24 Feb 2021 13:39:59 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v2] In-Reply-To: References: Message-ID: > Hello, > > can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? > > This can save lots of memory with negligible other impact. > > Long version: > Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. > > Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. 
> > This means that in some cases a significant amount of memory is used for regions that will never be evacuated. In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. > > In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. > > The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). > > In some cases doing this can save half of peak remembered set memory usage. > > There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. > > * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. > (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). > This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. > > * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). > You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. > > Testing: tier1-5, Oracle internal performance test suite > > Thanks, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: iwalulya review ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2693/files - new: https://git.openjdk.java.net/jdk/pull/2693/files/bcdd94ed..66736efa Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=00-01 Stats: 8 lines in 2 files changed: 1 ins; 0 del; 7 mod Patch: https://git.openjdk.java.net/jdk/pull/2693.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2693/head:pull/2693 PR: https://git.openjdk.java.net/jdk/pull/2693 From tschatzl at openjdk.java.net Wed Feb 24 13:39:59 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Wed, 24 Feb 2021 13:39:59 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v2] In-Reply-To: <8NBL2PJqPm-dFtKwt9Ys9-wR-QPanaMa51NdhEU8WgU=.bba2d13c-6b41-40b5-aaa5-1e4ee24358ee@github.com> References: <8NBL2PJqPm-dFtKwt9Ys9-wR-QPanaMa51NdhEU8WgU=.bba2d13c-6b41-40b5-aaa5-1e4ee24358ee@github.com> Message-ID: On Wed, 24 Feb 2021 12:56:39 GMT, Ivan Walulya wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> iwalulya review > > Lgtm! @walulyai and me privately discussed some minor naming changes that the most recent push adds since I think they are good. Also tried to improve the comment for `G1CollectionSetCandidates::prune()` a bit. 
------------- PR: https://git.openjdk.java.net/jdk/pull/2693 From iwalulya at openjdk.java.net Wed Feb 24 13:58:40 2021 From: iwalulya at openjdk.java.net (Ivan Walulya) Date: Wed, 24 Feb 2021 13:58:40 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v2] In-Reply-To: References: Message-ID: On Wed, 24 Feb 2021 13:39:59 GMT, Thomas Schatzl wrote: >> Hello, >> >> can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? >> >> This can save lots of memory with negligible other impact. >> >> Long version: >> Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. >> >> Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. >> >> This means that in some cases a significant amount of memory is used for regions that will never be evacuated. In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. >> >> In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. >> >> The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). >> >> In some cases doing this can save half of peak remembered set memory usage. >> >> There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. >> >> * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. >> (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). >> This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. >> >> * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). >> You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. >> >> Testing: tier1-5, Oracle internal performance test suite >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > iwalulya review still good! ------------- Marked as reviewed by iwalulya (Committer). 
PR: https://git.openjdk.java.net/jdk/pull/2693 From sjohanss at openjdk.java.net Wed Feb 24 14:10:44 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Wed, 24 Feb 2021 14:10:44 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v2] In-Reply-To: References: Message-ID: On Wed, 24 Feb 2021 13:39:59 GMT, Thomas Schatzl wrote: >> Hello, >> >> can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? >> >> This can save lots of memory with negligible other impact. >> >> Long version: >> Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. >> >> Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. >> >> This means that in some cases a significant amount of memory is used for regions that will never be evacuated. In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. >> >> In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. >> >> The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). >> >> In some cases doing this can save half of peak remembered set memory usage. >> >> There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. >> >> * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. >> (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). >> This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. >> >> * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). >> You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. >> >> Testing: tier1-5, Oracle internal performance test suite >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > iwalulya review Looks good, just a few small comments. src/hotspot/share/gc/g1/g1CollectionSetCandidates.cpp line 40: > 38: void G1CollectionSetCandidates::prune(uint keep_min_regions, size_t prune_total_bytes) { > 39: uint regions_left = _num_regions; > 40: size_t reclaimed_bytes = 0; Wouldn't `pruned_bytes` be a better name? 
src/hotspot/share/gc/g1/g1CollectionSetCandidates.cpp line 43: > 41: while (regions_left > keep_min_regions && > 42: (at(regions_left - 1)->reclaimable_bytes() + reclaimed_bytes) <= prune_total_bytes) { > 43: uint cur_idx = regions_left - 1; I would prefer to split the second condition into an if+break (to avoid two `regions_left - 1`). What do you think about something like this: Suggestion: while (regions_left > keep_min_regions) { uint cur_idx = regions_left - 1; // Never prune more than prune_total_bytes. if ((at(cur_idx)->reclaimable_bytes() + reclaimed_bytes) > prune_total_bytes) { break; } ------------- Changes requested by sjohanss (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2693 From sjohanss at openjdk.java.net Wed Feb 24 15:01:42 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Wed, 24 Feb 2021 15:01:42 GMT Subject: RFR: JDK-8256155: os::Linux Populate all large_page_sizes, select smallest page size in reserve_memory_special_huge_tlbfs* In-Reply-To: References: Message-ID: On Wed, 11 Nov 2020 01:48:46 GMT, Marcus G K Williams wrote: > When using LargePageSizeInBytes=1G, os::Linux::reserve_memory_special_huge_tlbfs* cannot select large pages smaller than 1G. Code heap usually uses less than 1G, so currently the code precludes code heap from using > Large pages in this circumstance and when os::Linux::reserve_memory_special_huge_tlbfs* is called page sizes fall back to Linux::page_size() (usually 4k). > > This change allows the above use case by populating all large_page_sizes present in /sys/kernel/mm/hugepages in _page_sizes upon calling os::Linux::setup_large_page_size(). > > In os::Linux::reserve_memory_special_huge_tlbfs* we then select the largest large page size available in _page_sizes that is smaller than bytes being reserved. @mgkwill, I've been doing some measurements trying to see what kind of improvements to expect from backing the code-cache with 2m pages and the heap with 1g pages. Can you share what benchmarks you've used when analyzing the performance of this change and also what kind of setups you've used (heap-size, code-cache size, etc). ------------- PR: https://git.openjdk.java.net/jdk/pull/1153 From stuefe at openjdk.java.net Wed Feb 24 15:11:41 2021 From: stuefe at openjdk.java.net (Thomas Stuefe) Date: Wed, 24 Feb 2021 15:11:41 GMT Subject: RFR: JDK-8256155: os::Linux Populate all large_page_sizes, select smallest page size in reserve_memory_special_huge_tlbfs* [v16] In-Reply-To: References: Message-ID: <5Hmhp7S8616Kfbdsu5ObzFNy2uUFgJPCp0kvHr-U310=.3cabbe74-fe65-436b-973d-d6f3e64cd743@github.com> On Wed, 24 Feb 2021 08:57:29 GMT, Stefan Johansson wrote: > > What do you think? I think this would be a bit easier to read and understand, and we have that clear separation between scanning OS info and deciding what we do with it. > > I think what you propose Thomas looks good. One additional thing to keep in mind and think about here is how we should do the "sanity checking" when allowing multiple large page sizes. I think the best thing would be to sanity check all and if none succeeds disable `UseLargePages`. Oh, sure. I made this not explicit but implied this under "post processing and deciding". Presumably in the context of setup_large_page_type(). > > > Still a small nit is that we let the user override the OS info with LargePageSizeInBytes. I rather would have a variable containing unmodified OS info, and a separate variable for whatever we make up. But thats just a small issue. 
> > I think we need to rethink exactly what `LargePageSizeInBytes` means when allowing multiple large page sizes. I've poked around in this area quite a bit lately and I'm not sure this flag is needed when we scan for available page sizes. But to allow it to go away we would have to change the APIs a bit to start passing down the page size we want to use for a certain mapping rather than using `os::large_page_size()` to get the page size. If we could do without this flag this would be fine for me too. But how would you let the user specify that the VM is to use a different default page size than is set on system level? ------------- PR: https://git.openjdk.java.net/jdk/pull/1153 From tschatzl at openjdk.java.net Wed Feb 24 15:36:57 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Wed, 24 Feb 2021 15:36:57 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v3] In-Reply-To: References: Message-ID: > Hello, > > can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? > > This can save lots of memory with negligible other impact. > > Long version: > Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. > > Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. > > This means that in some cases a significant amount of memory is used for regions that will never be evacuated. In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. > > In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. > > The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). > > In some cases doing this can save half of peak remembered set memory usage. > > There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. > > * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. > (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). > This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. > > * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). > You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. 
> > Testing: tier1-5, Oracle internal performance test suite > > Thanks, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: sjohanss review ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2693/files - new: https://git.openjdk.java.net/jdk/pull/2693/files/66736efa..5288539a Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=02 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=01-02 Stats: 9 lines in 1 file changed: 4 ins; 0 del; 5 mod Patch: https://git.openjdk.java.net/jdk/pull/2693.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2693/head:pull/2693 PR: https://git.openjdk.java.net/jdk/pull/2693 From sjohanss at openjdk.java.net Wed Feb 24 16:02:43 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Wed, 24 Feb 2021 16:02:43 GMT Subject: RFR: JDK-8256155: os::Linux Populate all large_page_sizes, select smallest page size in reserve_memory_special_huge_tlbfs* [v16] In-Reply-To: <5Hmhp7S8616Kfbdsu5ObzFNy2uUFgJPCp0kvHr-U310=.3cabbe74-fe65-436b-973d-d6f3e64cd743@github.com> References: <5Hmhp7S8616Kfbdsu5ObzFNy2uUFgJPCp0kvHr-U310=.3cabbe74-fe65-436b-973d-d6f3e64cd743@github.com> Message-ID: On Wed, 24 Feb 2021 15:09:15 GMT, Thomas Stuefe wrote: > > > What do you think? I think this would be a bit easier to read and understand, and we have that clear separation between scanning OS info and deciding what we do with it. > > > > > > I think what you propose Thomas looks good. One additional thing to keep in mind and think about here is how we should do the "sanity checking" when allowing multiple large page sizes. I think the best thing would be to sanity check all and if none succeeds disable `UseLargePages`. > > Oh, sure. I made this not explicit but implied this under "post processing and deciding". Presumably in the context of setup_large_page_type(). > Sure, got that, just wanted to highlight that we need to figure out how to handle the sanity check for multiple sizes. Should a size that fail the sanity check be removed from the `_page_sizes` member. Maybe `_page_sizes` should include all page sizes, and then we have an additional member for "useable large page sizes". As I said, not sure how to best handle this. > > > Still a small nit is that we let the user override the OS info with LargePageSizeInBytes. I rather would have a variable containing unmodified OS info, and a separate variable for whatever we make up. But thats just a small issue. > > > > > > I think we need to rethink exactly what `LargePageSizeInBytes` means when allowing multiple large page sizes. I've poked around in this area quite a bit lately and I'm not sure this flag is needed when we scan for available page sizes. But to allow it to go away we would have to change the APIs a bit to start passing down the page size we want to use for a certain mapping rather than using `os::large_page_size()` to get the page size. > > If we could do without this flag this would be fine for me too. But how would you let the user specify that the VM is to use a different default page size than is set on system level? I agree, it's not obvious how to make this work in a good way. But using the `os::page_size_for_region*` functions in the upper layers to request a page size could be one solution. But we probably need to have a way to change the "default" value for some cases. 
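As a rough illustration of the selection step described earlier in this thread (picking, per reservation, the largest large page size that still fits), here is a small self-contained sketch; the function and parameter names are made up for the example and are not the HotSpot ones.

```c++
#include <cstddef>
#include <set>

// Toy model of the page-size selection described for JDK-8256155: from the
// set of large page sizes discovered on the system, pick the largest one
// that is not larger than the requested reservation size; fall back to the
// base page size when none fits.
std::size_t select_page_size(const std::set<std::size_t>& large_page_sizes,
                             std::size_t bytes_to_reserve,
                             std::size_t base_page_size) {
  std::size_t chosen = base_page_size;
  for (std::size_t page_size : large_page_sizes) {  // std::set iterates ascending
    if (page_size <= bytes_to_reserve && page_size > chosen) {
      chosen = page_size;
    }
  }
  return chosen;
}
```

For example, with 2M and 1G huge pages available, a 4G heap reservation would pick 1G pages while a 48M code-cache segment would pick 2M pages, which is the kind of behaviour being discussed here.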
Another thing to think about/discuss is what should be done if a reservation-request within the VM for 4G with 1G pages fail, should we fall straight back to 4k page, should we try 2M page or possible fail hard to show something is probably wrong with the config. ------------- PR: https://git.openjdk.java.net/jdk/pull/1153 From tschatzl at openjdk.java.net Wed Feb 24 16:12:00 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Wed, 24 Feb 2021 16:12:00 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v4] In-Reply-To: References: Message-ID: > Hello, > > can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? > > This can save lots of memory with negligible other impact. > > Long version: > Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. > > Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. > > This means that in some cases a significant amount of memory is used for regions that will never be evacuated. In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. > > In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. > > The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). > > In some cases doing this can save half of peak remembered set memory usage. > > There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. > > * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. > (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). > This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. > > * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). > You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. 
> > Testing: tier1-5, Oracle internal performance test suite > > Thanks, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: Update src/hotspot/share/gc/g1/g1CollectionSetCandidates.cpp Co-authored-by: Stefan Johansson <54407259+kstefanj at users.noreply.github.com> ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2693/files - new: https://git.openjdk.java.net/jdk/pull/2693/files/5288539a..5675f39c Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=03 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=02-03 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2693.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2693/head:pull/2693 PR: https://git.openjdk.java.net/jdk/pull/2693 From sjohanss at openjdk.java.net Wed Feb 24 16:12:00 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Wed, 24 Feb 2021 16:12:00 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v4] In-Reply-To: References: Message-ID: On Wed, 24 Feb 2021 16:09:32 GMT, Thomas Schatzl wrote: >> Hello, >> >> can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? >> >> This can save lots of memory with negligible other impact. >> >> Long version: >> Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. >> >> Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. >> >> This means that in some cases a significant amount of memory is used for regions that will never be evacuated. In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. >> >> In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. >> >> The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). >> >> In some cases doing this can save half of peak remembered set memory usage. >> >> There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. >> >> * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. >> (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). >> This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. 
>> >> * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). >> You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. >> >> Testing: tier1-5, Oracle internal performance test suite >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > Update src/hotspot/share/gc/g1/g1CollectionSetCandidates.cpp > > Co-authored-by: Stefan Johansson <54407259+kstefanj at users.noreply.github.com> Looks good =) ------------- Marked as reviewed by sjohanss (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2693 From sjohanss at openjdk.java.net Wed Feb 24 16:12:02 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Wed, 24 Feb 2021 16:12:02 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v3] In-Reply-To: References: Message-ID: On Wed, 24 Feb 2021 15:36:57 GMT, Thomas Schatzl wrote: >> Hello, >> >> can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? >> >> This can save lots of memory with negligible other impact. >> >> Long version: >> Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. >> >> Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. >> >> This means that in some cases a significant amount of memory is used for regions that will never be evacuated. In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. >> >> In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. >> >> The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). >> >> In some cases doing this can save half of peak remembered set memory usage. >> >> There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. >> >> * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. >> (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). >> This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. 
>> >> * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). >> You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. >> >> Testing: tier1-5, Oracle internal performance test suite >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > sjohanss review src/hotspot/share/gc/g1/g1CollectionSetCandidates.cpp line 45: > 43: uint cur_idx = regions_left - 1; > 44: // Do not prune more than prune_total_bytes. > 45: if ((at(cur_idx)->reclaimable_bytes() + pruned_bytes) <= prune_total_bytes) { This check should be the other way around, right? Suggestion: if ((at(cur_idx)->reclaimable_bytes() + pruned_bytes) > prune_total_bytes) { ------------- PR: https://git.openjdk.java.net/jdk/pull/2693 From kbarrett at openjdk.java.net Wed Feb 24 18:01:46 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Wed, 24 Feb 2021 18:01:46 GMT Subject: RFR: 8261804: Remove field _processing_is_mt, calculate it instead In-Reply-To: References: Message-ID: On Wed, 24 Feb 2021 11:59:48 GMT, Leo Korinth wrote: > In the reference processor, remove _processing_is_mt. Instead calculate its value in its accessor, processing_is_mt(). > > This change will remove some state, make RefProcMTDegreeAdjuster a bit simpler and make it much easier to derive when processing is indeed multi threaded. Other than possibly moving the definition of `processing_is_mt()` to the .cpp file, this looks good. src/hotspot/share/gc/shared/referenceProcessor.hpp line 417: > 415: > 416: // Whether we are in a phase when _processing_ is MT. > 417: bool processing_is_mt() const { return ParallelRefProcEnabled && _num_queues > 1; } I don't think this needs to be inline, and I think moving it to the .cpp file would avoid needing to include gc_globals.hpp here. ------------- Marked as reviewed by kbarrett (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2704 From tschatzl at openjdk.java.net Thu Feb 25 12:00:45 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Thu, 25 Feb 2021 12:00:45 GMT Subject: RFR: 8261804: Remove field _processing_is_mt, calculate it instead In-Reply-To: References: Message-ID: On Wed, 24 Feb 2021 11:59:48 GMT, Leo Korinth wrote: > In the reference processor, remove _processing_is_mt. Instead calculate its value in its accessor, processing_is_mt(). > > This change will remove some state, make RefProcMTDegreeAdjuster a bit simpler and make it much easier to derive when processing is indeed multi threaded. Please move the implementation of `processing_is_mt` to the .cpp files as @kimbarrett suggested. Lgtm. ------------- Marked as reviewed by tschatzl (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2704 From lkorinth at openjdk.java.net Thu Feb 25 13:09:40 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Thu, 25 Feb 2021 13:09:40 GMT Subject: RFR: 8261804: Remove field _processing_is_mt, calculate it instead In-Reply-To: References: Message-ID: On Wed, 24 Feb 2021 17:58:29 GMT, Kim Barrett wrote: >> In the reference processor, remove _processing_is_mt. Instead calculate its value in its accessor, processing_is_mt(). 
>> >> This change will remove some state, make RefProcMTDegreeAdjuster a bit simpler and make it much easier to derive when processing is indeed multi threaded. > > src/hotspot/share/gc/shared/referenceProcessor.hpp line 417: > >> 415: >> 416: // Whether we are in a phase when _processing_ is MT. >> 417: bool processing_is_mt() const { return ParallelRefProcEnabled && _num_queues > 1; } > > I don't think this needs to be inline, and I think moving it to the .cpp file would avoid needing to include gc_globals.hpp here. I agree and will fix it, thanks for spotting it! ------------- PR: https://git.openjdk.java.net/jdk/pull/2704 From tschatzl at openjdk.java.net Thu Feb 25 14:21:07 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Thu, 25 Feb 2021 14:21:07 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v5] In-Reply-To: References: Message-ID: > Hello, > > can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? > > This can save lots of memory with negligible other impact. > > Long version: > Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. > > Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. > > This means that in some cases a significant amount of memory is used for regions that will never be evacuated. In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. > > In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. > > The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). > > In some cases doing this can save half of peak remembered set memory usage. > > There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. > > * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. > (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). > This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. > > * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). > You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. 
> > Testing: tier1-5, Oracle internal performance test suite > > Thanks, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: ayang-review ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2693/files - new: https://git.openjdk.java.net/jdk/pull/2693/files/5675f39c..62b498df Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=04 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=03-04 Stats: 28 lines in 5 files changed: 13 ins; 2 del; 13 mod Patch: https://git.openjdk.java.net/jdk/pull/2693.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2693/head:pull/2693 PR: https://git.openjdk.java.net/jdk/pull/2693 From ayang at openjdk.java.net Thu Feb 25 14:26:41 2021 From: ayang at openjdk.java.net (Albert Mingkun Yang) Date: Thu, 25 Feb 2021 14:26:41 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v5] In-Reply-To: References: Message-ID: On Thu, 25 Feb 2021 14:21:07 GMT, Thomas Schatzl wrote: >> Hello, >> >> can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? >> >> This can save lots of memory with negligible other impact. >> >> Long version: >> Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. >> >> Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. >> >> This means that in some cases a significant amount of memory is used for regions that will never be evacuated. In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. >> >> In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. >> >> The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). >> >> In some cases doing this can save half of peak remembered set memory usage. >> >> There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. >> >> * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. >> (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). >> This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. >> >> * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). 
>> You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. >> >> Testing: tier1-5, Oracle internal performance test suite >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > ayang-review Marked as reviewed by ayang (Author). ------------- PR: https://git.openjdk.java.net/jdk/pull/2693 From lkorinth at openjdk.java.net Thu Feb 25 14:35:08 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Thu, 25 Feb 2021 14:35:08 GMT Subject: RFR: 8261804: Remove field _processing_is_mt, calculate it instead [v2] In-Reply-To: References: Message-ID: <3shY2Cc79MQM7T80q69rpHVT4mqvpNdQfZ0mQZO_KqY=.cee50bfa-a063-4e57-bed4-a20f41977e17@github.com> > In the reference processor, remove _processing_is_mt. Instead calculate its value in its accessor, processing_is_mt(). > > This change will remove some state, make RefProcMTDegreeAdjuster a bit simpler and make it much easier to derive when processing is indeed multi threaded. Leo Korinth has updated the pull request incrementally with one additional commit since the last revision: Fixup suggested by Kim ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2704/files - new: https://git.openjdk.java.net/jdk/pull/2704/files/5b80656f..eff32142 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2704&range=01 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2704&range=00-01 Stats: 8 lines in 2 files changed: 5 ins; 2 del; 1 mod Patch: https://git.openjdk.java.net/jdk/pull/2704.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2704/head:pull/2704 PR: https://git.openjdk.java.net/jdk/pull/2704 From tschatzl at openjdk.java.net Thu Feb 25 14:42:40 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Thu, 25 Feb 2021 14:42:40 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v5] In-Reply-To: References: Message-ID: On Thu, 25 Feb 2021 14:23:28 GMT, Albert Mingkun Yang wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> ayang-review > > Marked as reviewed by ayang (Author). Albert and me were discussing some further code improvements that this last change implements. ------------- PR: https://git.openjdk.java.net/jdk/pull/2693 From lkorinth at openjdk.java.net Thu Feb 25 14:42:42 2021 From: lkorinth at openjdk.java.net (Leo Korinth) Date: Thu, 25 Feb 2021 14:42:42 GMT Subject: RFR: 8261804: Remove field _processing_is_mt, calculate it instead [v2] In-Reply-To: References: Message-ID: On Thu, 25 Feb 2021 11:58:01 GMT, Thomas Schatzl wrote: >> Leo Korinth has updated the pull request incrementally with one additional commit since the last revision: >> >> Fixup suggested by Kim > > Please move the implementation of `processing_is_mt` to the .cpp files as @kimbarrett suggested. > > Lgtm. I moved the flag-reading into the .cpp as suggested by Kim. I will not change _discovery_is_mt in this fix; I created https://bugs.openjdk.java.net/browse/JDK-8262367 to possibly fix it in the future. 
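To illustrate the shape being discussed for 8261804 (declaration in the header, flag read in the .cpp definition so no cached field is needed), here is a small self-contained sketch; all names are stand-ins rather than the real HotSpot declarations, and the global below merely imitates the -XX:ParallelRefProcEnabled flag.

```c++
// Illustrative split of the accessor: the header keeps only the declaration,
// so it does not need the flag definitions (gc_globals.hpp in the real code),
// and callers always see the current flag value instead of a cached
// _processing_is_mt field.

bool ParallelRefProcEnabled = true;   // stand-in for the real -XX flag

class RefProcSketch {
  unsigned _num_queues;
 public:
  explicit RefProcSketch(unsigned num_queues) : _num_queues(num_queues) {}
  bool processing_is_mt() const;      // declaration only, as in the .hpp
};

// In the real change this definition lives in the .cpp file.
bool RefProcSketch::processing_is_mt() const {
  return ParallelRefProcEnabled && _num_queues > 1;
}
```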
------------- PR: https://git.openjdk.java.net/jdk/pull/2704 From zgu at openjdk.java.net Thu Feb 25 20:10:53 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Thu, 25 Feb 2021 20:10:53 GMT Subject: RFR: 8262398: Shenandoah: Disable nmethod barrier and stack watermark when running with passive mode Message-ID: nmethod barrier and stack watermark allow GC not to process nmethods at GC pauses, and aim to reduce GC latency, they do not benefit STW GCs, who process nmethods at pauses anyway. Test: - [x] hotspot_gc_shenandoah - [x] tier1 with -XX:+UseShenandoahGC - [x] tier1 with -XX:+UseShenandoahGC -XX:ShenandoahGCMode=passive - [x] tier1 with -XX:+UseShenandoahGC -XX:ShenandoahGCMode=passive -XX:-ShenandoahDegeneratedGC ------------- Commit messages: - JDK-8262398 - init Changes: https://git.openjdk.java.net/jdk/pull/2727/files Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2727&range=00 Issue: https://bugs.openjdk.java.net/browse/JDK-8262398 Stats: 39 lines in 11 files changed: 21 ins; 0 del; 18 mod Patch: https://git.openjdk.java.net/jdk/pull/2727.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2727/head:pull/2727 PR: https://git.openjdk.java.net/jdk/pull/2727 From rkennke at openjdk.java.net Thu Feb 25 20:23:38 2021 From: rkennke at openjdk.java.net (Roman Kennke) Date: Thu, 25 Feb 2021 20:23:38 GMT Subject: RFR: 8262398: Shenandoah: Disable nmethod barrier and stack watermark when running with passive mode In-Reply-To: References: Message-ID: On Thu, 25 Feb 2021 20:06:40 GMT, Zhengyu Gu wrote: > nmethod barrier and stack watermark allow GC not to process nmethods at GC pauses, and aim to reduce GC latency, they do not benefit STW GCs, who process nmethods at pauses anyway. > > Test: > > - [x] hotspot_gc_shenandoah > - [x] tier1 with -XX:+UseShenandoahGC > - [x] tier1 with -XX:+UseShenandoahGC -XX:ShenandoahGCMode=passive > - [x] tier1 with -XX:+UseShenandoahGC -XX:ShenandoahGCMode=passive -XX:-ShenandoahDegeneratedGC If it is possible at all, it would be most useful if we could have flags like ShenandoahNMethodBarrier and ShenandoahStackWatermark, and let them drive enabling and disabling of the corresponding barriers/features. It would be even better, if we could configure concurrent modes such that they do STW class-unloading and stack-processing (if possible with reasonable amount of work). That would help ports where those features are not yet present (current ports under development with the problem are PPC and Graal). ------------- PR: https://git.openjdk.java.net/jdk/pull/2727 From zgu at openjdk.java.net Thu Feb 25 21:15:39 2021 From: zgu at openjdk.java.net (Zhengyu Gu) Date: Thu, 25 Feb 2021 21:15:39 GMT Subject: RFR: 8262398: Shenandoah: Disable nmethod barrier and stack watermark when running with passive mode In-Reply-To: References: Message-ID: On Thu, 25 Feb 2021 20:20:55 GMT, Roman Kennke wrote: > If it is possible at all, it would be most useful if we could have flags like ShenandoahNMethodBarrier and ShenandoahStackWatermark, and let them drive enabling and disabling of the corresponding barriers/features. It would be even better, if we could configure concurrent modes such that they do STW class-unloading and stack-processing (if possible with reasonable amount of work). That would help ports where those features are not yet present (current ports under development with the problem are PPC and Graal). Disabling nmethod barrier and/or stack watermark for concurrent GC means crashes, we no longer have backup (e.g. 
processing thread roots and code roots at pauses) so I don't think it will help ports. ------------- PR: https://git.openjdk.java.net/jdk/pull/2727 From tschatzl at openjdk.java.net Fri Feb 26 09:03:02 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Fri, 26 Feb 2021 09:03:02 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v6] In-Reply-To: References: Message-ID: > Hello, > > can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? > > This can save lots of memory with negligible other impact. > > Long version: > Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. > > Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. > > This means that in some cases a significant amount of memory is used for regions that will never be evacuated. In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. > > In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. > > The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). > > In some cases doing this can save half of peak remembered set memory usage. > > There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. > > * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. > (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). > This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. > > * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). > You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. 
> > Testing: tier1-5, Oracle internal performance test suite > > Thanks, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: refactoring ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2693/files - new: https://git.openjdk.java.net/jdk/pull/2693/files/62b498df..60cbd5d4 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=05 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=04-05 Stats: 123 lines in 6 files changed: 88 ins; 24 del; 11 mod Patch: https://git.openjdk.java.net/jdk/pull/2693.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2693/head:pull/2693 PR: https://git.openjdk.java.net/jdk/pull/2693 From tschatzl at openjdk.java.net Fri Feb 26 09:03:02 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Fri, 26 Feb 2021 09:03:02 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v4] In-Reply-To: References: Message-ID: On Wed, 24 Feb 2021 16:08:59 GMT, Stefan Johansson wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> Update src/hotspot/share/gc/g1/g1CollectionSetCandidates.cpp >> >> Co-authored-by: Stefan Johansson <54407259+kstefanj at users.noreply.github.com> > > Looks good =) ... and then @kstefanj also chimed in and we thought of moving the logic completely away from `G1CollectionSetCandidates` which is in spirit of the current implementation. tier1 tested again, local testing that it works. ------------- PR: https://git.openjdk.java.net/jdk/pull/2693 From sjohanss at openjdk.java.net Fri Feb 26 10:30:41 2021 From: sjohanss at openjdk.java.net (Stefan Johansson) Date: Fri, 26 Feb 2021 10:30:41 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v6] In-Reply-To: References: Message-ID: On Fri, 26 Feb 2021 09:03:02 GMT, Thomas Schatzl wrote: >> Hello, >> >> can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? >> >> This can save lots of memory with negligible other impact. >> >> Long version: >> Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. >> >> Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. >> >> This means that in some cases a significant amount of memory is used for regions that will never be evacuated. In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. >> >> In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. >> >> The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). >> >> In some cases doing this can save half of peak remembered set memory usage. >> >> There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. 
>> >> * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. >> (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). >> This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. >> >> * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). >> You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. >> >> Testing: tier1-5, Oracle internal performance test suite >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > refactoring Looks good, apart from the now unused code. src/hotspot/share/gc/g1/g1CollectionSetCandidates.cpp line 73: > 71: _remaining_reclaimable_bytes -= pruned_bytes; > 72: _num_regions = regions_left; > 73: } Not used anymore, right? Suggestion: src/hotspot/share/gc/g1/g1CollectionSetCandidates.hpp line 81: > 79: // collection set candidate regions first. Applies cl on the pruned regions. > 80: void prune(uint keep_min_regions, size_t prune_total_bytes, HeapRegionClosure* cl); > 81: Same here I guess. Suggestion: ------------- Marked as reviewed by sjohanss (Reviewer). PR: https://git.openjdk.java.net/jdk/pull/2693 From tschatzl at openjdk.java.net Fri Feb 26 10:41:00 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Fri, 26 Feb 2021 10:41:00 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v7] In-Reply-To: References: Message-ID: > Hello, > > can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? > > This can save lots of memory with negligible other impact. > > Long version: > Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. > > Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. > > This means that in some cases a significant amount of memory is used for regions that will never be evacuated. In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. > > In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. > > The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). 
> > In some cases doing this can save half of peak remembered set memory usage. > > There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. > > * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. > (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). > This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. > > * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). > You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. > > Testing: tier1-5, Oracle internal performance test suite > > Thanks, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: Remove now unused code ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2693/files - new: https://git.openjdk.java.net/jdk/pull/2693/files/60cbd5d4..ac6a2709 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=06 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=05-06 Stats: 27 lines in 2 files changed: 0 ins; 27 del; 0 mod Patch: https://git.openjdk.java.net/jdk/pull/2693.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2693/head:pull/2693 PR: https://git.openjdk.java.net/jdk/pull/2693 From tschatzl at openjdk.java.net Fri Feb 26 10:45:43 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Fri, 26 Feb 2021 10:45:43 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v6] In-Reply-To: References: Message-ID: On Fri, 26 Feb 2021 10:27:52 GMT, Stefan Johansson wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> refactoring > > Looks good, apart from the now unused code. Thanks, removed the obsolete code. ------------- PR: https://git.openjdk.java.net/jdk/pull/2693 From tschatzl at openjdk.java.net Fri Feb 26 15:11:00 2021 From: tschatzl at openjdk.java.net (Thomas Schatzl) Date: Fri, 26 Feb 2021 15:11:00 GMT Subject: RFR: 8262185: G1: Prune collection set candidates early [v8] In-Reply-To: References: Message-ID: > Hello, > > can I have reviews for this change to the collection candidate selection procedure, moving the G1HeapWastePercent exclusion criteria right after candidate selection instead of at the end of mixed gcs? > > This can save lots of memory with negligible other impact. > > Long version: > Currently G1 maintains collection set candidates from the marking phase (where it determines those) until the end of the mixed gc phase. > > Mixed gc phase ends when the amount of space that can be reclaimed in the java heap with the remaining collection set candidates is smaller than G1HeapWastePercent of the (current) heap capacity. 
> > This means that in some cases a significant amount of memory is used for regions that will never be evacuated. In addition to that, maintaining the remembered sets for these never evacuated regions uses execution time and more memory for the remembered set. > > In fact, in some cases, it can happen that in the first few mixed gcs of a mixed gc phase, remembered set memory consumption *increases* even though the remembered sets of recently evacuated old gen regions are freed. > > The proposed alternative is to prune the collection set candidates as early as possible, filtering out regions that are never going to be evacuated (or have a very low probability). > > In some cases doing this can save half of peak remembered set memory usage. > > There are a few drawbacks here that should be considered during evaluation, also comparing against the old heuristic. > > * In the old heuristic, G1 checks *at the end* of gc whether the remaining collection set candidates are worth collecting (remaining collectible space < G1HeapWastePercent means: stop). Which helps with ensuring at least some forward progress in evacuating the heap because (assuming there are candidates) at least some space will be reclaimed. > (I do not know whether this behavior as is is intentional for that reason, but it has been there since initial implementation; then it did not really matter because G1 has been maintaining the old gen remembered sets for all regions all the time anyway). > This is approximated by not removing all of the candidates to have at least one "minimal" mixed collection in this change. > > * In some cases in the old heuristic G1 would just evacuate all candidate regions (in the extreme in a single gc) if the pause time permitted, reclaiming a bit more space (i.e. that amount < G1HeapWastePercent of the total heap). > You would expect (given the default value of 5), there will be more mixed cycles because of that with the new heuristic. > > Testing: tier1-5, Oracle internal performance test suite > > Thanks, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: Optimizations ------------- Changes: - all: https://git.openjdk.java.net/jdk/pull/2693/files - new: https://git.openjdk.java.net/jdk/pull/2693/files/ac6a2709..d3de8f66 Webrevs: - full: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=07 - incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=2693&range=06-07 Stats: 11 lines in 3 files changed: 5 ins; 0 del; 6 mod Patch: https://git.openjdk.java.net/jdk/pull/2693.diff Fetch: git fetch https://git.openjdk.java.net/jdk pull/2693/head:pull/2693 PR: https://git.openjdk.java.net/jdk/pull/2693 From kbarrett at openjdk.java.net Fri Feb 26 20:05:49 2021 From: kbarrett at openjdk.java.net (Kim Barrett) Date: Fri, 26 Feb 2021 20:05:49 GMT Subject: RFR: 8261804: Remove field _processing_is_mt, calculate it instead [v2] In-Reply-To: <3shY2Cc79MQM7T80q69rpHVT4mqvpNdQfZ0mQZO_KqY=.cee50bfa-a063-4e57-bed4-a20f41977e17@github.com> References: <3shY2Cc79MQM7T80q69rpHVT4mqvpNdQfZ0mQZO_KqY=.cee50bfa-a063-4e57-bed4-a20f41977e17@github.com> Message-ID: On Thu, 25 Feb 2021 14:35:08 GMT, Leo Korinth wrote: >> In the reference processor, remove _processing_is_mt. Instead calculate its value in its accessor, processing_is_mt(). >> >> This change will remove some state, make RefProcMTDegreeAdjuster a bit simpler and make it much easier to derive when processing is indeed multi threaded. 
>
> Leo Korinth has updated the pull request incrementally with one additional commit since the last revision:
>
>   Fixup suggested by Kim

Marked as reviewed by kbarrett (Reviewer).

-------------

PR: https://git.openjdk.java.net/jdk/pull/2704

From lkorinth at openjdk.java.net Fri Feb 26 20:11:44 2021
From: lkorinth at openjdk.java.net (Leo Korinth)
Date: Fri, 26 Feb 2021 20:11:44 GMT
Subject: Integrated: 8261804: Remove field _processing_is_mt, calculate it instead
In-Reply-To: 
References: 
Message-ID: 

On Wed, 24 Feb 2021 11:59:48 GMT, Leo Korinth wrote:

> In the reference processor, remove _processing_is_mt. Instead calculate its value in its accessor, processing_is_mt().
>
> This change will remove some state, make RefProcMTDegreeAdjuster a bit simpler and make it much easier to derive when processing is indeed multi-threaded.

This pull request has now been integrated.

Changeset: 03d888f4
Author: Leo Korinth
URL: https://git.openjdk.java.net/jdk/commit/03d888f4
Stats: 45 lines in 5 files changed: 6 ins; 17 del; 22 mod

8261804: Remove field _processing_is_mt, calculate it instead

Reviewed-by: ayang, kbarrett, tschatzl

-------------

PR: https://git.openjdk.java.net/jdk/pull/2704

From lihuaming3 at huawei.com Sat Feb 27 06:32:29 2021
From: lihuaming3 at huawei.com (Hamlin)
Date: Sat, 27 Feb 2021 14:32:29 +0800
Subject: RFR of JDK-8262068: Improve G1 Full GC by skipping compaction for regions with high survival ratio
Message-ID: <5dd6445b-402c-8ad0-267e-19becb477044@huawei.com>

Hi,

Would you please help to review an improvement in G1 full GC?

bug: https://bugs.openjdk.java.net/browse/JDK-8262068

webrev: https://github.com/openjdk/jdk/pull/2760/commits/4be6c18e2201fc8d22ee0f31d11ca7892be21a43

Summary
-----------

Improve G1 Full GC by skipping compaction for regions with a high survival ratio.

Background
-----------

There are 4 steps in a G1 full gc:
- mark live objects
- prepare forwardee
- adjust pointers
- compact

When a full gc occurs, there may be a very high percentage of live bytes in some regions. For these regions it is not efficient to compact them, and better to skip them, as there is little space to save but many objects to copy.

Description
-----------

We enhance the full gc implementation for the above situation through the following steps:
- accumulate the live bytes of every heap region in the mark phase;
- add heap regions with a high percentage of live bytes into a "no moving" set rather than the normal compaction set in the prepare phase, and fill dummy objects into the places of dead objects in these regions;
- nothing special is done in the adjust phase;
- just compact the regions in the compaction set.

VM options added
-----------

- G1FullGCNoMoving: enable "no moving region" mode in G1 Full GC.
- G1NoMovingRegionLiveBytesLowerThreshold: the lower threshold (percent) of heap region live bytes to skip compaction

Test
-----------

- specjbb2015: no regression
- dacapo: 3%-11% improvement of full gc pause. Attachment is the dacapo h2 full gc pause.

$ java -Xmx1g -Xms1g -XX:ParallelGCThreads=4 -Xlog:gc*=info:file=gc.log -jar dacapo-9.12-bach.jar --iterations 5 --size huge --no-pre-iteration-gc h2

Thanks,

Hamlin

From mli at openjdk.java.net Sat Feb 27 06:36:01 2021
From: mli at openjdk.java.net (Hamlin Li)
Date: Sat, 27 Feb 2021 06:36:01 GMT
Subject: RFR: JDK-8262068: Improve G1 Full GC by skipping compaction for regions with high survival ratio
Message-ID: 

Summary
-----------

Improve G1 Full GC by skipping compaction for regions with a high survival ratio.

Background
-----------

There are 4 steps in a G1 full gc:
- mark live objects
- prepare forwardee
- adjust pointers
- compact

When a full gc occurs, there may be a very high percentage of live bytes in some regions. For these regions it is not efficient to compact them, and better to skip them, as there is little space to save but many objects to copy.

Description
-----------

We enhance the full gc implementation for the above situation through the following steps:
- accumulate the live bytes of every heap region in the mark phase;
- add heap regions with a high percentage of live bytes into a "no moving" set rather than the normal compaction set in the prepare phase, and fill dummy objects into the places of dead objects in these regions;
- nothing special is done in the adjust phase;
- just compact the regions in the compaction set.

VM options added
-----------

- G1FullGCNoMoving: enable "no moving region" mode in G1 Full GC.
- G1NoMovingRegionLiveBytesLowerThreshold: the lower threshold (percent) of heap region live bytes to skip compaction

Test
-----------

- specjbb2015: no regression
- dacapo: 3%-11% improvement of full gc pause. Attachment is the dacapo h2 full gc pause.

$ java -Xmx1g -Xms1g -XX:ParallelGCThreads=4 -Xlog:gc*=info:file=gc.log -jar dacapo-9.12-bach.jar --iterations 5 --size huge --no-pre-iteration-gc h2

-------------

Commit messages:
 - JDK-8262068: Improve G1 Full GC by skipping compaction for regions with high survival ratio

Changes: https://git.openjdk.java.net/jdk/pull/2760/files
Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2760&range=00
Issue: https://bugs.openjdk.java.net/browse/JDK-8262068
Stats: 287 lines in 16 files changed: 274 ins; 1 del; 12 mod
Patch: https://git.openjdk.java.net/jdk/pull/2760.diff
Fetch: git fetch https://git.openjdk.java.net/jdk pull/2760/head:pull/2760

PR: https://git.openjdk.java.net/jdk/pull/2760

From lihuaming3 at huawei.com Sat Feb 27 06:59:03 2021
From: lihuaming3 at huawei.com (Hamlin)
Date: Sat, 27 Feb 2021 14:59:03 +0800
Subject: RFR of JDK-8262068: Improve G1 Full GC by skipping compaction for regions with high survival ratio
In-Reply-To: <5dd6445b-402c-8ad0-267e-19becb477044@huawei.com>
References: <5dd6445b-402c-8ad0-267e-19becb477044@huawei.com>
Message-ID: <95ff2979-75d5-ddd2-c066-714752c01986@huawei.com>

Just modified some typos in the original email, please check the content below to review again.

Thanks,

Hamlin

On 2021/2/27 14:32, Hamlin wrote:
>
> Hi,
>
> Would you please help to review an improvement in G1 full GC?
>
> bug: https://bugs.openjdk.java.net/browse/JDK-8262068
>
> webrev: https://github.com/openjdk/jdk/pull/2760/commits/4be6c18e2201fc8d22ee0f31d11ca7892be21a43
>
> Summary
> -----------
>
> Improve G1 Full GC by skipping compaction for regions with a high survival ratio.
>
> Background
> -----------
>
> There are 4 steps in a G1 full gc:
> - mark live objects
> - prepare forwardee
> - adjust pointers
> - compact
>
> When a full gc occurs, there may be a very high percentage of live bytes in some regions. For these regions it is not efficient to compact them, and better to skip them, as there is little space to save but many objects to copy.
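To make the proposal concrete before the quoted description continues, the selection step it describes boils down to a check like the following standalone sketch. The region layout, the threshold handling and all names are simplified assumptions, not the actual patch.

```
// Sketch of "skip compaction for regions with a high survival ratio":
// a region whose live percentage is at or above a threshold is left in
// place, and its dead gaps are plugged with dummy (filler) objects so the
// region stays walkable.
#include <cstddef>
#include <cstdio>
#include <utility>
#include <vector>

struct RegionSketch {
  size_t capacity_bytes;
  size_t live_bytes;  // accumulated during the mark phase
  std::vector<std::pair<size_t, size_t> > dead_ranges;  // offset, length
};

// Roughly what the proposed threshold flag would control (default assumed 98).
static const unsigned kLivePercentThreshold = 98;

static bool should_skip_compaction(const RegionSketch& r) {
  return r.live_bytes * 100 >= r.capacity_bytes * kLivePercentThreshold;
}

// Stand-in for filling dead ranges with dummy objects in the prepare phase.
static void fill_dead_ranges_with_dummies(const RegionSketch& r) {
  for (const std::pair<size_t, size_t>& range : r.dead_ranges) {
    printf("fill dummy object at offset %zu, %zu bytes\n",
           range.first, range.second);
  }
}

int main() {
  RegionSketch mostly_live = {1u << 20, 1040000, {{1000, 8000}}};
  RegionSketch half_dead   = {1u << 20, 500000, {{0, 500000}}};
  if (should_skip_compaction(mostly_live)) {
    fill_dead_ranges_with_dummies(mostly_live);
  }
  printf("skip mostly_live: %d, skip half_dead: %d\n",
         should_skip_compaction(mostly_live), should_skip_compaction(half_dead));
  return 0;
}
```

The trade-off the sketch captures is the one argued above: copying a nearly full region buys almost no space, so marking it "no moving" avoids the copy at the cost of leaving its small dead gaps unreclaimed until a later cycle.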
>
> Description
> -----------
>
> We enhance the full gc implementation for the above situation through the following steps:
> - accumulate the live bytes of every heap region in the mark phase;
> - add heap regions with a high percentage of live bytes into a "no moving" set rather than the normal compaction set in the prepare phase, and fill dummy objects into the places of dead objects in these regions;
> - nothing special is done in the adjust phase;
> - just compact the regions in the compaction set.
>
> VM options added
> -----------
>
> - G1SkipCompactionLiveBytesLowerThreshold: the lower threshold of heap region live bytes percent in a G1 full GC
>
> Test
> -----------
>
> - specjbb2015: no regression
> - dacapo: 3%-11% improvement of full gc pause. Attachment is the dacapo h2 full gc pause.
>
> $ java -Xmx1g -Xms1g -XX:G1SkipCompactionLiveBytesLowerThreshold=98 -XX:ParallelGCThreads=4 -Xlog:gc*=info:file=gc.log -jar dacapo-9.12-bach.jar --iterations 5 --size huge --no-pre-iteration-gc h2
>
> Thanks,
>
> Hamlin
>

From lihuaming3 at huawei.com Sat Feb 27 07:01:41 2021
From: lihuaming3 at huawei.com (Hamlin)
Date: Sat, 27 Feb 2021 15:01:41 +0800
Subject: RFR of JDK-8262068: Improve G1 Full GC by skipping compaction for regions with high survival ratio
In-Reply-To: <95ff2979-75d5-ddd2-c066-714752c01986@huawei.com>
References: <5dd6445b-402c-8ad0-267e-19becb477044@huawei.com> <95ff2979-75d5-ddd2-c066-714752c01986@huawei.com>
Message-ID: <1c90f849-0df8-dbe5-8b3f-fb50976494bd@huawei.com>

webrev of original style: https://openjdk.github.io/cr/?repo=jdk&pr=2760&range=00

On 2021/2/27 14:59, Hamlin wrote:
>
> Just modified some typos in the original email, please check the content below to review again.
>
> Thanks,
>
> Hamlin
>
> On 2021/2/27 14:32, Hamlin wrote:
>>
>> Hi,
>>
>> Would you please help to review an improvement in G1 full GC?
>>
>> bug: https://bugs.openjdk.java.net/browse/JDK-8262068
>>
>> webrev: https://github.com/openjdk/jdk/pull/2760/commits/4be6c18e2201fc8d22ee0f31d11ca7892be21a43
>>
>> Summary
>> -----------
>>
>> Improve G1 Full GC by skipping compaction for regions with a high survival ratio.
>>
>> Background
>> -----------
>>
>> There are 4 steps in a G1 full gc:
>> - mark live objects
>> - prepare forwardee
>> - adjust pointers
>> - compact
>>
>> When a full gc occurs, there may be a very high percentage of live bytes in some regions. For these regions it is not efficient to compact them, and better to skip them, as there is little space to save but many objects to copy.
>>
>> Description
>> -----------
>>
>> We enhance the full gc implementation for the above situation through the following steps:
>> - accumulate the live bytes of every heap region in the mark phase;
>> - add heap regions with a high percentage of live bytes into a "no moving" set rather than the normal compaction set in the prepare phase, and fill dummy objects into the places of dead objects in these regions;
>> - nothing special is done in the adjust phase;
>> - just compact the regions in the compaction set.
>>
>> VM options added
>> -----------
>>
>> - G1SkipCompactionLiveBytesLowerThreshold: the lower threshold of heap region live bytes percent in a G1 full GC
>>
>> Test
>> -----------
>>
>> - specjbb2015: no regression
>> - dacapo: 3%-11% improvement of full gc pause. Attachment is the dacapo h2 full gc pause.
>>
>> $ java -Xmx1g -Xms1g -XX:G1SkipCompactionLiveBytesLowerThreshold=98 -XX:ParallelGCThreads=4 -Xlog:gc*=info:file=gc.log -jar dacapo-9.12-bach.jar --iterations 5 --size huge --no-pre-iteration-gc h2
>>
>> Thanks,
>>
>> Hamlin
>>

From ofirg6 at gmail.com Sat Feb 27 16:04:30 2021
From: ofirg6 at gmail.com (Ofir Gordon)
Date: Sat, 27 Feb 2021 18:04:30 +0200
Subject: G1 Full GC Write Barrier Mechanism
Message-ID: 

Hi all,

I'm currently working on the JDK 14 version source code.
I'm trying to understand the process of a full collection using the G1 gc. Specifically, I can't follow the write barrier mechanism--where is it being activated? What exactly happens in the write barrier? (I see that there are both remembered sets and card tables, are they both being used for tracking changes in the heap during the collection?)

Can anyone help me understand the basic flow of this gc in the code?
I can see that the marking phase begins in G1FullCollector::collect()->phase1_mark_live_objects(), is this part running concurrently with the program? Is the barrier being activated beforehand? (Where?)

In addition, is there a way to verify that the barrier is being executed during the marking? Which code is supposed to run for this part?

I know these are a lot of questions, I'm just trying to figure out the basic flow of the process, so if someone can give me an explanation that will help me start I'll be really thankful.

Thanks a lot,
Ofir

From kim.barrett at oracle.com Sun Feb 28 19:34:27 2021
From: kim.barrett at oracle.com (Kim Barrett)
Date: Sun, 28 Feb 2021 19:34:27 +0000
Subject: G1 Full GC Write Barrier Mechanism
In-Reply-To: 
References: 
Message-ID: <7CE7AF30-8437-4FB8-BCF1-E120D3B87D84@oracle.com>

> On Feb 27, 2021, at 11:04 AM, Ofir Gordon wrote:
>
> Hi all,
>
> I'm currently working on the JDK 14 version source code.
> I'm trying to understand the process of a full collection using the G1 gc. Specifically, I can't follow the write barrier mechanism--where is it being activated? What exactly happens in the write barrier? (I see that there are both remembered sets and card tables, are they both being used for tracking changes in the heap during the collection?)
>
> Can anyone help me understand the basic flow of this gc in the code?
> I can see that the marking phase begins in G1FullCollector::collect()->phase1_mark_live_objects(), is this part running concurrently with the program? Is the barrier being activated beforehand? (Where?)

G1FullCollector and related classes (in files g1Full*) are for the G1 STW full GC, which doesn't use the write barriers.

The write post-barrier is used by G1 young/mixed STW collections to track locations that might contain inter-region references that need to be updated when objects are moved by those collections. This is what uses the card table and remembered sets.

The write pre-barrier is used by the G1 concurrent oldgen collector to track values that were reachable at the start of the concurrent collection but might no longer be so because references have been overwritten before the concurrent collector gets around to examining the location. This is needed to maintain the SATB invariant. This doesn't involve the card table and remembered sets.

> In addition, is there a way to verify that the barrier is being executed during the marking? Which code is supposed to run for this part?

The barrier implementations are part of the interpreter/compilers.
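Conceptually, the two barriers described above amount to something like the following plain-C++ sketch. Queue handling, card values and all fast-path details are simplified, and every name here is invented rather than taken from HotSpot.

```
// Conceptual model of a SATB pre-barrier and a card-marking post-barrier,
// not the generated HotSpot barrier code.
#include <cstddef>
#include <cstdint>
#include <deque>

static const size_t kRegionShift = 20;         // pretend 1 MB heap regions
static const size_t kCardShift = 9;            // pretend 512-byte cards
static uint8_t g_card_table[1u << 20];         // pretend card table
static std::deque<void*> g_satb_queue;         // stand-in for a SATB mark queue
static std::deque<size_t> g_dirty_card_queue;  // stand-in for a dirty card queue

// Pre-barrier: while concurrent marking is active, record the value about to
// be overwritten so the marker can still trace it (the SATB invariant).
static void pre_write_barrier(void** field, bool marking_active) {
  if (marking_active) {
    void* old_value = *field;
    if (old_value != nullptr) {
      g_satb_queue.push_back(old_value);
    }
  }
}

// Post-barrier: if the store creates a reference from one region into
// another, dirty the card covering the field so the referenced region's
// remembered set can later be updated from it.
static void post_write_barrier(void** field, void* new_value) {
  if (new_value == nullptr) {
    return;
  }
  uintptr_t from = reinterpret_cast<uintptr_t>(field);
  uintptr_t to = reinterpret_cast<uintptr_t>(new_value);
  if (((from ^ to) >> kRegionShift) == 0) {
    return;  // same region: no remembered set entry needed
  }
  size_t card = (from >> kCardShift) % sizeof(g_card_table);
  if (g_card_table[card] == 0) {
    g_card_table[card] = 1;  // mark the card dirty exactly once
    g_dirty_card_queue.push_back(card);
  }
}

// A reference store `holder.field = value` conceptually expands to:
static void oop_store(void** field, void* new_value, bool marking_active) {
  pre_write_barrier(field, marking_active);
  *field = new_value;
  post_write_barrier(field, new_value);
}

int main() {
  void* slot = nullptr;
  int referent = 0;
  oop_store(&slot, &referent, /*marking_active=*/true);
  return 0;
}
```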
The interpreter invokes relevant functions, and the compilers insert barriers in the generated code. This is all managed through the Access API, which provides an abstraction over the various GCs for use by non-GC code that needs to read or write object locations while maintaining whatever information the selected GC requires.

> I know these are a lot of questions, I'm just trying to figure out the basic flow of the process, so if someone can give me an explanation that will help me start I'll be really thankful.
>
> Thanks a lot,
> Ofir

From kbarrett at openjdk.java.net Sun Feb 28 19:36:49 2021
From: kbarrett at openjdk.java.net (Kim Barrett)
Date: Sun, 28 Feb 2021 19:36:49 GMT
Subject: RFR: 8261859: gc/g1/TestStringDeduplicationTableRehash.java failed with "RuntimeException: 'Rehash Count: 0' found in stdout"
Message-ID: 

Please review this fix of an intermittently failing string deduplication test.

There are several problems.

(1) Because of environmental or other unrelated changes, the test might simply fail. The test was considering it a failure if any GC reported a zero rehash count. But if the first GC triggered a resize, that would suppress the requested "rehash a lot", and could report a zero rehash count, failing the test. So the test criterion is changed to require at least one non-zero rehash count rather than no zero rehash counts. Since rehashes are normally unlikely, and the primary point is to exercise the rehash code, having some reported non-zero rehash count is sufficient.

(2) Reporting only occurs if the string dedup thread was triggered and had work to do. If the initial collections all need resizes, and none of the subsequent ones have any work for the thread to do, then again we won't have any reported rehashes. The test is changed to always generate some new strings for later GCs to discover and queue for deduplication processing, causing the dedup thread to run and report at the end.

(3) The table resizing mechanism was only doing one step (doubling or halving) of size change per collection. If the number of table entries is large (small), several GCs might be required for the table to grow (shrink) to the desired size. Once again, this could suppress table rehashes, causing the test to fail. It also wastes effort because the table needs to be resized multiple times when one right-sized resize would be sufficient. Resizing now computes the "final" size based on the number of entries and load factors, and may increase or decrease the table size by multiple powers of 2 in one resizing operation.

Testing:
Manual execution of the string dedup tests and examining their logs.

Manual execution of the resize and rehash string dedup tests with a small initial table size, to simulate an environment with a larger initial set of strings that triggers early resize.

mach5 tier1

-------------

Commit messages:
 - fix rehash test
 - jump resize

Changes: https://git.openjdk.java.net/jdk/pull/2769/files
Webrev: https://webrevs.openjdk.java.net/?repo=jdk&pr=2769&range=00
Issue: https://bugs.openjdk.java.net/browse/JDK-8261859
Stats: 45 lines in 2 files changed: 20 ins; 6 del; 19 mod
Patch: https://git.openjdk.java.net/jdk/pull/2769.diff
Fetch: git fetch https://git.openjdk.java.net/jdk pull/2769/head:pull/2769

PR: https://git.openjdk.java.net/jdk/pull/2769

From ayang at openjdk.java.net Sun Feb 28 21:56:40 2021
From: ayang at openjdk.java.net (Albert Mingkun Yang)
Date: Sun, 28 Feb 2021 21:56:40 GMT
Subject: RFR: 8261859: gc/g1/TestStringDeduplicationTableRehash.java failed with "RuntimeException: 'Rehash Count: 0' found in stdout"
In-Reply-To: 
References: 
Message-ID: 

On Sun, 28 Feb 2021 19:31:48 GMT, Kim Barrett wrote:

> Please review this fix of an intermittently failing string deduplication test.
>
> There are several problems.
>
> (1) Because of environmental or other unrelated changes, the test might simply fail. The test was considering it a failure if any GC reported a zero rehash count. But if the first GC triggered a resize, that would suppress the requested "rehash a lot", and could report a zero rehash count, failing the test. So the test criterion is changed to require at least one non-zero rehash count rather than no zero rehash counts. Since rehashes are normally unlikely, and the primary point is to exercise the rehash code, having some reported non-zero rehash count is sufficient.
>
> (2) Reporting only occurs if the string dedup thread was triggered and had work to do. If the initial collections all need resizes, and none of the subsequent ones have any work for the thread to do, then again we won't have any reported rehashes. The test is changed to always generate some new strings for later GCs to discover and queue for deduplication processing, causing the dedup thread to run and report at the end.
>
> (3) The table resizing mechanism was only doing one step (doubling or halving) of size change per collection. If the number of table entries is large (small), several GCs might be required for the table to grow (shrink) to the desired size. Once again, this could suppress table rehashes, causing the test to fail. It also wastes effort because the table needs to be resized multiple times when one right-sized resize would be sufficient. Resizing now computes the "final" size based on the number of entries and load factors, and may increase or decrease the table size by multiple powers of 2 in one resizing operation.
>
> Testing:
> Manual execution of the string dedup tests and examining their logs.
>
> Manual execution of the resize and rehash string dedup tests with a small initial table size, to simulate an environment with a larger initial set of strings that triggers early resize.
>
> mach5 tier1

Changes requested by ayang (Author).

src/hotspot/share/gc/shared/stringdedup/stringDedupTable.cpp line 405:

> 403: size = round_up_power_of_2(needed);
> 404: } else {
> 405: size = _max_size;

It's not obvious to me which one is larger, between `round_up_power_of_2(needed)` and `_max_size`.

src/hotspot/share/gc/shared/stringdedup/stringDedupTable.cpp line 426:

> 424: }
> 425: }
> 426: 

How about adding an assertion, something like `assert(size >= _min_size && size <= _max_size)`?

-------------

PR: https://git.openjdk.java.net/jdk/pull/2769
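For readers following the review, here is a standalone sketch of the kind of single-step "compute the final size and clamp it" logic being discussed, including the assertion suggested above. The constants, the load factor and the structure are assumptions for illustration, not the actual stringDedupTable.cpp code.

```
// Sketch of resizing a hash table to its "final" size in one step: derive
// the needed bucket count from the entry count and a target load factor,
// round up to a power of two, and clamp to configured bounds.
#include <cassert>
#include <cstddef>

struct DedupTableSketch {
  static constexpr size_t _min_size = 1u << 10;  // smallest allowed bucket count
  static constexpr size_t _max_size = 1u << 24;  // largest allowed bucket count (a power of two)
  static constexpr double _grow_load_factor = 2.0;  // entries per bucket before growing

  static size_t round_up_power_of_2(size_t v) {
    size_t p = 1;
    while (p < v) p <<= 1;
    return p;
  }

  static size_t desired_size(size_t num_entries) {
    size_t needed = (size_t)((double)num_entries / _grow_load_factor) + 1;
    size_t size;
    if (needed <= _max_size) {
      // Because _max_size is a power of two here, rounding a value that is
      // at most _max_size up to a power of two cannot exceed _max_size.
      size = round_up_power_of_2(needed);
      if (size < _min_size) size = _min_size;
    } else {
      // When needed is already above the cap, rounding it up could only get
      // larger (or overflow), so clamp without rounding.
      size = _max_size;
    }
    assert(size >= _min_size && size <= _max_size);
    return size;
  }
};

int main() {
  assert(DedupTableSketch::desired_size(10) == DedupTableSketch::_min_size);        // shrink to the floor in one step
  assert(DedupTableSketch::desired_size(5u << 20) == (size_t(1) << 22));             // grow by several doublings at once
  assert(DedupTableSketch::desired_size(size_t(1) << 30) == DedupTableSketch::_max_size);  // clamped at the cap
  return 0;
}
```

Under these assumed bounds, the answer to the ordering question is that `round_up_power_of_2(needed)` can only exceed `_max_size` in the branch where `needed` itself is above the cap, which is exactly why clamping happens there; the closing assertion then documents the invariant either way.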