From ysr at openjdk.org Thu Feb 1 07:21:11 2024 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 1 Feb 2024 07:21:11 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Wed, 31 Jan 2024 16:28:15 GMT, Kelvin Nilsen wrote: >> Several objectives: >> 1. Reduce humongous allocation failures by segregating regular regions from humongous regions >> 2. Do not retire regions just because an allocation failed within the region if the memory remaining within the region is large enough to represent a LAB >> 3. Track range of empty regions in addition to range of available regions in order to expedite humongous allocations >> 4. Treat collector reserves as available for Mutator allocations after evacuation completes >> 5. Improve encapsulation so as to enable an OldCollector reserve for future integration of generational Shenandoah >> >> On internal performance pipelines, this change shows: >> >> 1. some Increase in page faults and rss_max with certain workloads, presumably because of "segregation" of humongous from regular regions. >> 2. An increase in System CPU time on certain benchmarks: sunflow (+165%), scimark.sparse.large (+50%), lusearch (+43%). This system CPU time increase appears to correlate with increased page faults and/or rss. >> 3. An increase in trigger_failure for the hyperalloc_a2048_o4096 experiment (not yet understood) >> 4. 2-30x improvements on multiple metrics of the Extremem phased workload latencies (most likely resulting from fewer degenerated or full GCs) >> >> Shenandoah >> ------------------------------------------------------------------------------------------------------- >> +166.55% scimark.sparse.large/minor_page_fault_count p=0.00000 >> Control: 819938.875 (+/-5724.56 ) 40 >> Test: 2185552.625 (+/-26378.64 ) 20 >> >> +166.16% scimark.sparse.large/rss_max p=0.00000 >> Control: 3285226.375 (+/-22812.93 ) 40 >> Test: 8743881.500 (+/-104906.69 ) 20 >> >> +164.78% sunflow/cpu_system p=0.00000 >> Control: 1.280s (+/- 0.10s ) 40 >> Test: 3.390s (+/- 0.13s ) 20 >> >> +149.29% hyperalloc_a2048_o4096/trigger_failure p=0.00000 >> Control: 3.259 (+/- 1.46 ) 33 >> Test: 8.125 (+/- 2.05 ) 20 >> >> +143.75% pmd/major_page_fault_count p=0.03622 >> Control: 1.000 (+/- 0.00 ) 40 >> Test: 2.438 (+/- 2.59 ) 20 >> >> +80.22% lusearch/minor_page_fault_count p=0.00000 >> Control: 2043930.938 (+/-4777.14 ) 40 >> Test: 3683477.625 (+/-5650.29 ) 20 >> >> +50.50% scimark.sparse.small/minor_page_fault_count p=0.00000 >> Control: 697899.156 (+/-3457.82 ) 40 >> Test: 1050363.812 (+/-175... > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Rename and comments for _capacity_of and _used_by A few more comments. I expect my next round to be the last one. I think we are almost there. Sorry for the delay and for the length of the review comments. src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 1001: > 999: > 1000: if (VerifyAfterGC) { > 1001: Universe::verify(); This line deletion seems to be the only change now in this file. So this file can be removed from the diffs. src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 114: > 112: } > 113: > 114: inline void ShenandoahRegionPartition::shrink_interval_if_boundary_modified(ShenandoahFreeSetPartitionId partition, size_t idx) { const both the parameters. src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 36: > 34: enum ShenandoahFreeSetPartitionId : uint8_t { > 35: NotFree, // Region has been retired and is not in any free set: there is no available memory. > 36: Mutator, // Region is in the Mutator free set: available memory is available to mutators. Just want to make sure: "available to mutators" -- is this both for object allocation as well as for possible evacuation as part of the mutator LRB? src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 37: > 35: NotFree, // Region has been retired and is not in any free set: there is no available memory. > 36: Mutator, // Region is in the Mutator free set: available memory is available to mutators. > 37: Collector, // Region is in the Collector free set: available memory is reserved for evacuations. When mutators evacuate the target of an LRB, do they use `Mutator` or `Collector`. I assume the former? In that case, I'd say for Collector: `available memory is reserved for collector threads for evacuation`. src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 44: > 42: > 43: // This class implements partitioning of regions into distinct sets. Each ShenandoahHeapRegion is either in the Mutator free set, > 44: // the Collector free set, or in neither free set (NotFree). I noticed that you use the term "free partition" quite a lot later, I'd just start using that term early on when talking about these sets. You could, for example, say: // Whenever we say "free partition", we mean any partition other than the "NotFree" partition. src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 50: > 48: const size_t _max; // The maximum number of heap regions > 49: const size_t _region_size_bytes; > 50: const ShenandoahFreeSet* _free_set; Interesting: why does the partitioning need a reference to its containing free set? src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 54: > 52: > 53: // For each type, we track an interval outside of which a region affiliated with that partition is guaranteed > 54: // not to be found. This makes searches for free space more efficient. For each partition p, _leftmosts[p] I am being a bit pedantic here. Partition is usually identified with the _set of equivalence classes_. Thus a partition is an equivalence relation, and each equivalence class in the partition has, in this case, a distinct partition id (i.e. each region is either in the Mutator equivalence class aka Mutator free set, the Collector equivalence class aka Collector free set, or the NotFree equivalence class aka NotFree set). In your terminology, each equivalence class is a "free set". src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 60: > 58: size_t _rightmosts[NumPartitions]; > 59: > 60: // Allocation for humongous objects needs to find regions that are entirely empty. For each partion p, _leftmosts[p] `_leftmosts_empty` and, similarly, `_rightmosts_empty`. src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 63: > 61: // represents the first region belonging to this partition that is completely empty and _rightmosts[p] represents the > 62: // last region that is completely empty. If there are no completely empty regions in this partition, this is represented > 63: // by canonical [_max, 0]. ... is no completely empty region in this partition id, ... ... the canonical ... src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 68: > 66: > 67: // For each partition p, _capacity[p] represents the total amount of memory within the partition at the time > 68: // of the most recent rebuild, _used[p] represents the total amount of memory that has been consumed within this instead of consumed, can we just say used (or allocated)? src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 74: > 72: // and _used[p], even though the region may have been removed from the free set. > 73: size_t _capacity[NumPartitions]; > 74: size_t _used[NumPartitions]; In light of your earlier documentation of leftmost/righmost/empty/available etc. then, would it be fair to say that the following statement is always true: for p = NotFree: 1. leftmosts[p] = leftmosts_empty[p] = _max 2. rightmosts_empty[p] = rightmosts_empty[p] = 0 3. capacity[p] = used[p] = region_size Are the "NotFree" entries for these arrays ever used? If not, is there any point in keeping them in a product build? Is there any point in keeping them in a non-product build? Does it have some other role that makes it important to keep it, anyway? src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 75: > 73: size_t _capacity[NumPartitions]; > 74: size_t _used[NumPartitions]; > 75: size_t _region_counts[NumPartitions]; If tracked, is this an invariant of these fields? - region_counts[NotFree] == _max - (region_counts[Mutator] + region_counts[Collector]) (This would also make the region_counts[NotFree] unnecessary? See my previous comment.) src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 95: > 93: void make_free(size_t idx, ShenandoahFreeSetPartitionId which_partition, size_t region_capacity); > 94: > 95: // Place region idx into free partition new_partition. Requires that idx is currently not NotFree. Include semantics of region_capacity in comment, e.g.: // Move region idx, with region_capacity bytes of available free space, // from the NotFree partition to the free partition new_partition. src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 99: > 97: > 98: // Returns the ShenandoahFreeSetPartitionId affiliation of region idx, NotFree if this region is not currently free. > 99: // This does not enforce that free_set membership implies allocation capacity. I think "NotFree if this region is not currently free" is unnecessary and frankly confusing (why are we mentioning membership in the NotFree partition specially?) I also do not understand (am confused by) the second sentence in the comment. src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 103: > 101: > 102: // Returns true iff region idx is in the test_set free_set. Before returning true, asserts that the free > 103: // set is not empty. Requires that test_set != NotFree or NumPartitions. This comment probably needs to be updated. Something simple like: // Is the region, idx, part of which_partition? As it stands, the comment is pretty confusing. In general concise statements of specification are best for documenting APIs. If the presence of APIs needs to be motivated, that should all be done early on in a block comment that motivates the class and why it is what it is. It makes for much terser, clearer, and maintainable documentation. src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 112: > 110: // In other words: > 111: // if the requested which_partition is empty: > 112: // leftmost() and leftmost_empty() return _max, rightmost() and rightmost_empty() return 0 There are mutually contradictory statements in the highlighted portion of the documentation above. I suspect the earlier reference to -1 is obsolete and needs to be deleted. src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 132: > 130: assert (which_partition > NotFree && which_partition < NumPartitions, "selected free set must be valid"); > 131: return _used[which_partition]; > 132: } The assertions here indicate to me that it is likely my earlier suspicion that many of these fields are not needed for NotFree is true. (See my earlier comment about many of these fields for NotFree.) I feel it may be best to let the enum type system enforce this, rather than use these assertions. NotFree then becomes a sentinel value that is not part of the legal index set that can be passed in here. May be that can be done later, as I realize the type contagion might necessitate more changes (although I think it will conceptually simplify this) by maintaining the disctinction between the tags in the _membership[] array, and the types used for the so-called free partitions. src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 142: > 140: assert (which_partition > NotFree && which_partition < NumPartitions, "selected free set must be valid"); > 141: _used[which_partition] = value; > 142: } Same for these assertions. src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 171: > 169: }; > 170: > 171: class ShenandoahFreeSet : public CHeapObj { It would be good to have a block comment here motivating this class. It seems (from looking at some of its public APIs) as if it publicly exports only the "mutator view", which I find interesting. The other partitions in `ShenandoahRegionPartition` appears to be for efficiency of the implementation in service of the public APIs for ShenandoahFreeSet. src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 174: > 172: private: > 173: ShenandoahHeap* const _heap; > 174: ShenandoahRegionPartition _partitions; I think the use of a plural for the field illustrates the English language interpretation of partition. To be consistent, I'd rename the class name also to the plural `ShenandoahRegionPartitions` as remarked earlier. src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 184: > 182: HeapWord* allocate_single(ShenandoahAllocRequest& req, bool& in_new_region); > 183: > 184: // While holding the heap lock, allocate memory for a humongous object which will span multiple contiguous heap `which will` or `which may`? (Is a humongous object allowed to span just a single region as well?) Or are objects humongous only if they won't fit in a region? In which case the "will" is correct. I was confused by tests that use `ShenandoahHumongousThreshold=50` , `=90`, etc. May be in those cases, we go through the `allocate_single()` despite allocating an object (or block) bigger than `ShenandoahHeapRegion::humongous_threshold_words()` ? (That would make the pre-condition of the previous method suspect, though.) src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 187: > 185: // regions. > 186: // > 187: // Precondition: req.size() > ShenandoahHeapRegion::humongous_threshold_words(). `>` or `>=` ? src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 221: > 219: // > 220: // Note that we plan to replenish the Collector reserve at the end of update refs, at which time all > 221: // of the regions recycled from the collection set will be available. I see that you are trying to motivate this API. I feel that these comments belong in the caller. The API should not need to motivate where the caller must call this from. The API came about because there was a need for this in its clients. A good API spec should state its actions. Motivating its uses drags in context that detracts from clarity of the class and method. I realize this is a somewhat subjective stance but from experience it makes for better documentation and more maintainable/readable code. The place for such documentation is usually in a block comment motivating the general design of the class and why it offers the APIs that it does, and who its clients are. src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 231: > 229: inline size_t available() const { > 230: assert(used() <= capacity(), "must use less than capacity"); > 231: return capacity() - used(); So `ShenandoahFreeSet` publicly exports only the mutator view? src/hotspot/share/gc/shenandoah/shenandoahFullGC.cpp line 1069: > 1067: heap->collection_set()->clear(); > 1068: > 1069: // Since Full GC directly manipulates top of certain regions, certain ShenandoahFreeSet abstractions may have been corrupted. Instead of "may have been corrupted", which can be alarming and confusing, I'd state this as: // Full GC doesn't use or maintain the ShenandoahFreeSet abstractions, // so we rebuild the free set from scratch following a Full GC. ------------- PR Review: https://git.openjdk.org/jdk/pull/17561#pullrequestreview-1855229569 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473702305 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473831169 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473809906 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473809223 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473835542 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473810650 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473709592 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473710297 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473712934 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473721984 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473826640 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473828523 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473842682 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473845614 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473848754 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473851167 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473859958 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473860619 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473739467 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473725085 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473726454 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473728080 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473883437 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473738298 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473899033 From ysr at openjdk.org Thu Feb 1 07:21:11 2024 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 1 Feb 2024 07:21:11 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 02:04:43 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 54: > >> 52: >> 53: // For each type, we track an interval outside of which a region affiliated with that partition is guaranteed >> 54: // not to be found. This makes searches for free space more efficient. For each partition p, _leftmosts[p] > > I am being a bit pedantic here. > Partition is usually identified with the _set of equivalence classes_. Thus a partition is an equivalence relation, and each equivalence class in the partition has, in this case, a distinct partition id (i.e. each region is either in the Mutator equivalence class aka Mutator free set, the Collector equivalence class aka Collector free set, or the NotFree equivalence class aka NotFree set). In your terminology, each equivalence class is a "free set". However, upon reading further, I see that you have used "partition" not in the mathematical sense of an equivalence relation on a set, but in the English language sense as a subset of a set. In that case, you can continue to use the terminology you are using, but I'd change the class `ShenandoahRegionPartition` to the plural `ShenandoahRegionPartitions`, since you think of it as the combination of 3 partitions (in the English language sense): a Mutator partition, a Collector partition, and a NotFree partition. Or you could call it `ShenandoahRegionPartitioning`. Indeed, in your comment above, you say "This class represents a partitioning of ...". > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 184: > >> 182: HeapWord* allocate_single(ShenandoahAllocRequest& req, bool& in_new_region); >> 183: >> 184: // While holding the heap lock, allocate memory for a humongous object which will span multiple contiguous heap > > `which will` or `which may`? (Is a humongous object allowed to span just a single region as well?) > > Or are objects humongous only if they won't fit in a region? In which case the "will" is correct. > > I was confused by tests that use `ShenandoahHumongousThreshold=50` , `=90`, etc. > > May be in those cases, we go through the `allocate_single()` despite allocating an object (or block) bigger than `ShenandoahHeapRegion::humongous_threshold_words()` ? (That would make the pre-condition of the previous method suspect, though.) Same remark applies to the precondition comment below (which is correct, but could be made stronger to say `req.size() > ShenandoahHeapRegion::RegionSizeWords` or such? > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 221: > >> 219: // >> 220: // Note that we plan to replenish the Collector reserve at the end of update refs, at which time all >> 221: // of the regions recycled from the collection set will be available. > > I see that you are trying to motivate this API. I feel that these comments belong in the caller. The API should not need to motivate where the caller must call this from. The API came about because there was a need for this in its clients. A good API spec should state its actions. > > Motivating its uses drags in context that detracts from clarity of the class and method. > > I realize this is a somewhat subjective stance but from experience it makes for better documentation and more maintainable/readable code. > > The place for such documentation is usually in a block comment motivating the general design of the class and why it offers the APIs that it does, and who its clients are. So the documentation here might be: // Move cset_regions number of regions from being available to the collector to // being available to the mutator. // // Typical usage is at the end of evacuation, when the collector no longer needs // the regions that were reserved for evacuation, and these can now be // made available for mutator allocation. BTW, why call the number of regions `cset_regions`? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473717128 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473736515 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473892964 From ysr at openjdk.org Thu Feb 1 07:21:11 2024 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 1 Feb 2024 07:21:11 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 07:10:17 GMT, Y. Srinivas Ramakrishna wrote: >> src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 221: >> >>> 219: // >>> 220: // Note that we plan to replenish the Collector reserve at the end of update refs, at which time all >>> 221: // of the regions recycled from the collection set will be available. >> >> I see that you are trying to motivate this API. I feel that these comments belong in the caller. The API should not need to motivate where the caller must call this from. The API came about because there was a need for this in its clients. A good API spec should state its actions. >> >> Motivating its uses drags in context that detracts from clarity of the class and method. >> >> I realize this is a somewhat subjective stance but from experience it makes for better documentation and more maintainable/readable code. >> >> The place for such documentation is usually in a block comment motivating the general design of the class and why it offers the APIs that it does, and who its clients are. > > So the documentation here might be: > > // Move cset_regions number of regions from being available to the collector to > // being available to the mutator. > // > // Typical usage is at the end of evacuation, when the collector no longer needs > // the regions that were reserved for evacuation, and these can now be > // made available for mutator allocation. > > BTW, why call the number of regions `cset_regions`? Also, the concept of partition is itself an internal implementation detail that you have carefully encapsulated in this class. There is no point in leaking that out in the naming of the method. The method can just be called `move_regions_from_collector_to_mutator(size_t num)` and be done? "Partition" here adds no value and can be confusing leakage of abstraction. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473896202 From ysr at openjdk.org Thu Feb 1 07:30:02 2024 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 1 Feb 2024 07:30:02 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 06:06:06 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 44: > >> 42: >> 43: // This class implements partitioning of regions into distinct sets. Each ShenandoahHeapRegion is either in the Mutator free set, >> 44: // the Collector free set, or in neither free set (NotFree). > > I noticed that you use the term "free partition" quite a lot later, I'd just start using that term early on when talking about these sets. You could, for example, say: > > // Whenever we say "free partition", we mean any partition other than the "NotFree" partition. Or: // Any partition that is not the "NotFree" partition is a "free partition". ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1473912384 From ysr at openjdk.org Thu Feb 1 08:11:02 2024 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 1 Feb 2024 08:11:02 GMT Subject: RFR: 8324995: Shenandoah: Skip to full gc for humongous allocation failures [v3] In-Reply-To: References:

Message-ID: <4EwZEdlRxSvpaYstvt3imSXBD9lqRDjBxsNw0IhIcVk=.3e826fd9-36fd-4c29-b2a5-663ae40a21c9@github.com> On Wed, 31 Jan 2024 21:50:06 GMT, William Kemper wrote: >> Shenandoah degenerated cycles do not compact regions. When a humongous allocation fails, it is likely due to fragmentation which is better addressed by a full gc. > > William Kemper has updated the pull request incrementally with one additional commit since the last revision: > > Fix typo in comment Changes look great; are there any performance numbers to share for the change? ------------- Marked as reviewed by ysr (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/17638#pullrequestreview-1855782924 From shade at openjdk.org Thu Feb 1 11:02:05 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Thu, 1 Feb 2024 11:02:05 GMT Subject: RFR: 8324995: Shenandoah: Skip to full gc for humongous allocation failures [v3] In-Reply-To: References:

Message-ID: On Tue, 30 Jan 2024 09:08:01 GMT, Erik ?sterlund wrote: >> ICStubs solve an atomicity problem when setting both the destination and data of an inline cache. Unfortunately, it also leads to occasional safepoint carpets when multiple threads need to ICRefill the stubs at the same time, and spurious GuaranteedSafepointInterval "Cleanup" safepoints every second. This patch changes inline caches to not change the data part at all during the nmethod life cycle, hence removing the need for ICStubs. >> >> The new scheme is less stateful. Instead of adding and removing callsite metadata back and forth when transitioning inline cache states, it installs all state any shape of call will ever need at resolution time in a struct that I call CompiledICData. This reduces inline cache state changes to simply changing the destination of the call, and it doesn't really matter what state transitions to what other state. >> >> With this patch, we get rid of ICStub and ICBuffer classes and the related ICRefill and almost all Cleanup safepoints in practice. It also makes the inline cache code much simpler. >> >> I have tested the changes from tier1-7, and run through full aurora performance tests. > > Erik ?sterlund has updated the pull request incrementally with one additional commit since the last revision: > > ARM32 fixes Thanks for the improvements! Tests are still passing on SAP supported platforms. ------------- PR Comment: https://git.openjdk.org/jdk/pull/17495#issuecomment-1922760746 From wkemper at openjdk.org Fri Feb 2 14:15:42 2024 From: wkemper at openjdk.org (William Kemper) Date: Fri, 2 Feb 2024 14:15:42 GMT Subject: RFR: Merge openjdk/jdk:master Message-ID: Merges tag jdk-23+8 ------------- Commit messages: - 8324174: assert(m->is_entered(current)) failed: invariant - 8325042: remove unused JVMDITools test files - 8323621: JDK build should exclude snippet class in java.lang.foreign - 8324238: [macOS] java/awt/Frame/ShapeNotSetSometimes/ShapeNotSetSometimes.java fails with the shape has not been applied msg - 8320342: Use PassFailJFrame for TruncatedPopupMenuTest.java - 8324981: Shenandoah: Move commit and soft max heap changed methods into heap - 8303374: Implement JEP 455: Primitive Types in Patterns, instanceof, and switch (Preview) - 8320712: Rewrite BadFactoryTest in pure Java - 8324771: Obsolete RAMFraction related flags - 8324970: Serial: Refactor signature of maintain_old_to_young_invariant - ... and 61 more: https://git.openjdk.org/shenandoah/compare/6d36eb78...5b9b176c The webrev contains the conflicts with master: - merge conflicts: https://webrevs.openjdk.org/?repo=shenandoah&pr=389&range=00.conflicts Changes: https://git.openjdk.org/shenandoah/pull/389/files Stats: 18822 lines in 1229 files changed: 7186 ins; 1716 del; 9920 mod Patch: https://git.openjdk.org/shenandoah/pull/389.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/389/head:pull/389 PR: https://git.openjdk.org/shenandoah/pull/389 From eosterlund at openjdk.org Fri Feb 2 15:37:05 2024 From: eosterlund at openjdk.org (Erik =?UTF-8?B?w5ZzdGVybHVuZA==?=) Date: Fri, 2 Feb 2024 15:37:05 GMT Subject: RFR: 8322630: Remove ICStubs and related safepoints [v6] In-Reply-To: References:

Message-ID: On Fri, 2 Feb 2024 03:52:04 GMT, Martin Doerr wrote: > Thanks for the improvements! Tests are still passing on SAP supported platforms. Thank you for running through your tests! ------------- PR Comment: https://git.openjdk.org/jdk/pull/17495#issuecomment-1924112606 From kdnilsen at openjdk.org Fri Feb 2 23:43:05 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 2 Feb 2024 23:43:05 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 02:19:17 GMT, Y. Srinivas Ramakrishna wrote: >> src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 54: >> >>> 52: >>> 53: // For each type, we track an interval outside of which a region affiliated with that partition is guaranteed >>> 54: // not to be found. This makes searches for free space more efficient. For each partition p, _leftmosts[p] >> >> I am being a bit pedantic here. >> Partition is usually identified with the _set of equivalence classes_. Thus a partition is an equivalence relation, and each equivalence class in the partition has, in this case, a distinct partition id (i.e. each region is either in the Mutator equivalence class aka Mutator free set, the Collector equivalence class aka Collector free set, or the NotFree equivalence class aka NotFree set). In your terminology, each equivalence class is a "free set". > > However, upon reading further, I see that you have used "partition" not in the mathematical sense of an equivalence relation on a set, but in the English language sense as a subset of a set. In that case, you can continue to use the terminology you are using, but I'd change the class `ShenandoahRegionPartition` to the plural `ShenandoahRegionPartitions`, since you think of it as the combination of 3 partitions (in the English language sense): a Mutator partition, a Collector partition, and a NotFree partition. Or you could call it `ShenandoahRegionPartitioning`. Indeed, in your comment above, you say "This class represents a partitioning of ...". I'll go with ShenandoahRegionPartitions. Thanks. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1476874936 From kdnilsen at openjdk.org Fri Feb 2 23:43:04 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 2 Feb 2024 23:43:04 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 01:49:38 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 1001: > >> 999: >> 1000: if (VerifyAfterGC) { >> 1001: Universe::verify(); > > This line deletion seems to be the only change now in this file. So this file can be removed from the diffs. Thanks. Adding this line back in. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1476874711 From kdnilsen at openjdk.org Sat Feb 3 00:01:04 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 00:01:04 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 02:06:12 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 60: > >> 58: size_t _rightmosts[NumPartitions]; >> 59: >> 60: // Allocation for humongous objects needs to find regions that are entirely empty. For each partion p, _leftmosts[p] > > `_leftmosts_empty` and, similarly, `_rightmosts_empty`. Oops. Thanks. > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 63: > >> 61: // represents the first region belonging to this partition that is completely empty and _rightmosts[p] represents the >> 62: // last region that is completely empty. If there are no completely empty regions in this partition, this is represented >> 63: // by canonical [_max, 0]. > > ... is no completely empty region in this partition id, ... > > > > ... the canonical ... Thanks. fixed. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1476881647 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1476881947 From kdnilsen at openjdk.org Sat Feb 3 00:08:03 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 00:08:03 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 02:28:32 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 68: > >> 66: >> 67: // For each partition p, _capacity[p] represents the total amount of memory within the partition at the time >> 68: // of the most recent rebuild, _used[p] represents the total amount of memory that has been consumed within this > > instead of consumed, can we just say used (or allocated)? Replaced. Thanks. > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 112: > >> 110: // In other words: >> 111: // if the requested which_partition is empty: >> 112: // leftmost() and leftmost_empty() return _max, rightmost() and rightmost_empty() return 0 > > There are mutually contradictory statements in the highlighted portion of the documentation above. I suspect the earlier reference to -1 is obsolete and needs to be deleted. Good catch. Thank you. Fixed. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1476883719 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1476884451 From kdnilsen at openjdk.org Sat Feb 3 03:02:06 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 03:02:06 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: <8kwTr_bw237-Z58WNoxRWqfVzQlcHqssT_2Lp5Rwi6c=.e8ea536e-3e0a-4e54-a0df-e679d68ae696@github.com> On Thu, 1 Feb 2024 06:37:54 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 132: > >> 130: assert (which_partition > NotFree && which_partition < NumPartitions, "selected free set must be valid"); >> 131: return _used[which_partition]; >> 132: } > > The assertions here indicate to me that it is likely my earlier suspicion that many of these fields are not needed for NotFree is true. (See my earlier comment about many of these fields for NotFree.) I feel it may be best to let the enum type system enforce this, rather than use these assertions. NotFree then becomes a sentinel value that is not part of the legal index set that can be passed in here. > > May be that can be done later, as I realize the type contagion might necessitate more changes (although I think it will conceptually simplify this) by maintaining the disctinction between the tags in the _membership[] array, and the types used for the so-called free partitions. Thanks for sorting through this. You are right. I do not maintain used, capacity for NotFree regions. I've made adjustments to the enum declaration and to the assertions to make this more clear. My efforts to do so may have increased the "dissonance" with mathematical definition of partition. Please let me know if you see a better way to approach this. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1476926935 From duke at openjdk.org Sat Feb 3 07:59:12 2024 From: duke at openjdk.org (Lei Zaakjyu) Date: Sat, 3 Feb 2024 07:59:12 GMT Subject: RFR: 8325081: Move '_soft_ref_policy' to 'CollectedHeap' Message-ID: trivial ------------- Commit messages: - move '_soft_ref_policy' to 'CollectedHeap' Changes: https://git.openjdk.org/jdk/pull/17693/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=17693&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8325081 Stats: 49 lines in 13 files changed: 3 ins; 44 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/17693.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/17693/head:pull/17693 PR: https://git.openjdk.org/jdk/pull/17693 From ysr at openjdk.org Sat Feb 3 08:55:07 2024 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Sat, 3 Feb 2024 08:55:07 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v4] In-Reply-To: References:

Message-ID: <95jH1WMe6Vm3bgOl_bPPsOQLdwLml-YV6aT1Z0lktmw=.e52e5398-23a1-40dd-baf2-fa844bfa0244@github.com> On Wed, 31 Jan 2024 00:45:20 GMT, Kelvin Nilsen wrote: >> src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 401: >> >>> 399: >>> 400: HeapWord* ShenandoahFreeSet::allocate_single(ShenandoahAllocRequest& req, bool& in_new_region) { >>> 401: shenandoah_assert_heaplocked(); >> >> In addition, another precondition for this method appears to be that req.size() <= humongous size threshold. Perhaps that check should also be disposed of here. (Based on the documentation at the previous review comment above.) > > Added similar documentation here. Thanks. I meant something like: assert(req.size() <= ShenandoahHeapRegion::humongous_threshold_words(), "Can't exceed humongous size"); Unless the precondition is that it shouldn't exceed a region's worth in size, in which case: `<= ShenandoahHeapRegion::region_size_words().` ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1476987025 From ysr at openjdk.org Sat Feb 3 08:55:06 2024 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Sat, 3 Feb 2024 08:55:06 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Sat, 3 Feb 2024 07:51:14 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 467: > >> 465: log_debug(gc, free)("Flipped region " SIZE_FORMAT " to gc for request: " PTR_FORMAT, idx, p2i(&req)); >> 466: return result; >> 467: } > > It seems like this can cause potentially many (because of the loop) Mutator regions to flip to Collector (can we call the method `flip_to_collector`?) sometimes even when the request won't be satisfied. Why not flip to Collector only _after_ the allocation is successful? I assume the attempt to allocate would run afoul of assertion checks if it happened before the flip, but I worry about flipping a bunch of stuff unnecessarily and failing to allocate in them after all. Is that futile flipping cause for concern? Can it be avoided (e.g. by repositioning the assertion checks using a proxy variable to signal intent to flip following a successful allocation, then using it to ensure the post-allocation flip, or something similar)? May be such futile flips are uncommon and not a cause for concern? On further thought, when is it that the allocation may fail (lines 463,464)? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477017088 From kdnilsen at openjdk.org Sat Feb 3 14:04:03 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 14:04:03 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v4] In-Reply-To: <7zz-C2deQM2LpSf1Jo5E4C5WYbCyUuNO7CQPMX6Q47s=.0d200023-1b71-462e-bcee-01bcf41b5702@github.com> References:

<7zz-C2deQM2LpSf1Jo5E4C5WYbCyUuNO7CQPMX6Q47s=.0d200023-1b71-462e-bcee-01bcf41b5702@github.com> Message-ID: <8ILT34a4sce3LK0nqtsOPVa80WQ5xXr0Zrat2_TUlVM=.a0a2aa14-bae1-4d40-85bb-a3f18e6d637b@github.com> On Wed, 31 Jan 2024 17:33:17 GMT, Y. Srinivas Ramakrishna wrote: >> The problem happens when leftmost is zero. If we decrement idx beyond zero, we would get MAXINT rather than -1, so the test that idx >= leftmost is always true and loop never terminates... I think I discovered this the hard way... :( > > `ssize_t` is signed, unlike `size_t` which is unsigned. Thanks for clarifying. I did know ssize_t. I'm adjusting the code. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477069379 From kdnilsen at openjdk.org Sat Feb 3 14:07:03 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 14:07:03 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Sat, 3 Feb 2024 08:47:48 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 155: > >> 153: // Remove this region from its free partition, but leave its capacity and used as part of the original free partition's totals. >> 154: // When retiring a region, add any remnant of available memory within the region to the used total for the original free partition. >> 155: void ShenandoahRegionPartition::retire_within_partition(size_t idx, size_t used_bytes) { > > Why is the method called `retire_within_partition()` instead of `retire_from_partition()` ? > > (i.e. why _within_ partition, since it's leaving its free partition?) It's a little subtle. Maybe it needs more documentation. We retire the region so it no longer is within the range searched when new allocations are made. However, its totals (capacity and used) are still counted toward the Mutator or Collector partition's total. We've probably created some of our own pain by calling this a partition instead of a "free set". Advice? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477070065 From kdnilsen at openjdk.org Sat Feb 3 14:24:03 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 14:24:03 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 07:12:58 GMT, Y. Srinivas Ramakrishna wrote: >> So the documentation here might be: >> >> // Move cset_regions number of regions from being available to the collector to >> // being available to the mutator. >> // >> // Typical usage is at the end of evacuation, when the collector no longer needs >> // the regions that were reserved for evacuation, and these can now be >> // made available for mutator allocation. >> >> BTW, why call the number of regions `cset_regions`? > > Also, the concept of partition is itself an internal implementation detail that you have carefully encapsulated in this class. There is no point in leaking that out in the naming of the method. > > The method can just be called `move_regions_from_collector_to_mutator(size_t num)` and be done? "Partition" here adds no value and can be confusing leakage of abstraction. Agree. Thanks for these improvements. Done. (also, have enhanced comment to clarify the intent of cset_regions argument, which represents the number of regions in the collection set. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477072830 From kdnilsen at openjdk.org Sat Feb 3 14:29:03 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 14:29:03 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 07:15:51 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFullGC.cpp line 1069: > >> 1067: heap->collection_set()->clear(); >> 1068: >> 1069: // Since Full GC directly manipulates top of certain regions, certain ShenandoahFreeSet abstractions may have been corrupted. > > Instead of "may have been corrupted", which can be alarming and confusing, I'd state this as: > > // Full GC doesn't use or maintain the ShenandoahFreeSet abstractions, > // so we rebuild the free set from scratch following a Full GC. I'm just going to remove that comment. It raises concerns when none is necessary. "Obviously", we need to rebuild the free set following a full gc. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477073747 From kdnilsen at openjdk.org Sat Feb 3 14:33:06 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 14:33:06 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Sat, 3 Feb 2024 14:04:53 GMT, Kelvin Nilsen wrote: >> src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 155: >> >>> 153: // Remove this region from its free partition, but leave its capacity and used as part of the original free partition's totals. >>> 154: // When retiring a region, add any remnant of available memory within the region to the used total for the original free partition. >>> 155: void ShenandoahRegionPartition::retire_within_partition(size_t idx, size_t used_bytes) { >> >> Why is the method called `retire_within_partition()` instead of `retire_from_partition()` ? >> >> (i.e. why _within_ partition, since it's leaving its free partition?) > > It's a little subtle. Maybe it needs more documentation. We retire the region so it no longer is within the range searched when new allocations are made. However, its totals (capacity and used) are still counted toward the Mutator or Collector partition's total. > > We've probably created some of our own pain by calling this a partition instead of a "free set". Advice? I'll change the name, as you suggest. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477074268 From kbarrett at openjdk.org Sat Feb 3 14:35:00 2024 From: kbarrett at openjdk.org (Kim Barrett) Date: Sat, 3 Feb 2024 14:35:00 GMT Subject: RFR: 8325081: Move '_soft_ref_policy' to 'CollectedHeap' In-Reply-To: References: Message-ID: On Sat, 3 Feb 2024 07:54:54 GMT, Lei Zaakjyu wrote: > trivial In the constructor for CollectedHeap, I'd like the newly added `_soft_ref_policy` to be explicitly initialized, rather than relying on implicit member initialization. That is, add a mem-initializer to the mem-initializer-list of that constructor. ------------- Changes requested by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/17693#pullrequestreview-1861039681 From kdnilsen at openjdk.org Sat Feb 3 14:41:04 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 14:41:04 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Sat, 3 Feb 2024 01:44:29 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 766: > >> 764: >> 765: // This places regions that have alloc_capacity into the mutator partition. >> 766: find_regions_with_alloc_capacity(cset_regions); > > In conjunction with `clear()` above, it looks like we are doing two walks of the _membership array in the implementation as a result of this. Why not just have a single API from `ShenandoahRegionPartitions` that walks over the regions and sorts them into the NotFree or the Mutator partition in one go, rather than one walk to clear and another to then move some into Mutator? > > Also the method should probably be renamed to `move_alloc_regions_to_mutator()`, which should be moved into `ShenandoahRegionPartitions` class as a public API for this class `ShenandoahFreeSet` to call. We walk twice, first to figure out how much memory is available, and how many regions are completely empty. This information eventually feeds into GenShen's transfer of regions between young-gen and old-gen. There is less motivation for that distinction in single-generation Shenandoah because we do not need to make these informed transfers. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477075161 From kdnilsen at openjdk.org Sat Feb 3 15:49:03 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 15:49:03 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Sat, 3 Feb 2024 01:36:35 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 761: > >> 759: void ShenandoahFreeSet::prepare_to_rebuild(size_t &cset_regions) { >> 760: shenandoah_assert_heaplocked(); >> 761: // This resets all state information, removing all regions from all partitions. > > I thought it makes them all unavailable, placing them all into the NotFree partition. The wording has been a bit imprecise, possibly made even worse by some global search and replaces on free-set and partition. I've tried to clarify in most recent draft that we consider Collector and Mutator to be partitions, but the NotFree labels means "not in a partition". Maybe you can help me find the right wording here... ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477084264 From kdnilsen at openjdk.org Sat Feb 3 15:49:04 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 15:49:04 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Sat, 3 Feb 2024 14:38:44 GMT, Kelvin Nilsen wrote: >> src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 766: >> >>> 764: >>> 765: // This places regions that have alloc_capacity into the mutator partition. >>> 766: find_regions_with_alloc_capacity(cset_regions); >> >> In conjunction with `clear()` above, it looks like we are doing two walks of the _membership array in the implementation as a result of this. Why not just have a single API from `ShenandoahRegionPartitions` that walks over the regions and sorts them into the NotFree or the Mutator partition in one go, rather than one walk to clear and another to then move some into Mutator? >> >> Also the method should probably be renamed to `move_alloc_regions_to_mutator()`, which should be moved into `ShenandoahRegionPartitions` class as a public API for this class `ShenandoahFreeSet` to call. > > We walk twice, first to figure out how much memory is available, and how many regions are completely empty. This information eventually feeds into GenShen's transfer of regions between young-gen and old-gen. There is less motivation for that distinction in single-generation Shenandoah because we do not need to make these informed transfers. But even before genshen changes, there were two walks through the regions. This is because the rebuild wants to "optimize" the organization of the mutator free set and the collector free set. Certain regions that may been in the mutator set during previous GC will be in the collector set during the next gc, and vice versa. We strive to arrange that each free set is "tightly packed" over a subrange of the regions, with collector free set at the high end of memory and mutator set at the lower end of memory. With GenShen integration, we will place the old collector set above the collector set. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477083826 From kdnilsen at openjdk.org Sat Feb 3 15:53:03 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 15:53:03 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Sat, 3 Feb 2024 01:42:44 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 683: > >> 681: // move some of the mutator regions into the collector partition with the intent of packing collector memory into the >> 682: // highest (rightmost) addresses of the heap, with mutator memory consuming the lowest addresses of the heap. >> 683: void ShenandoahFreeSet::find_regions_with_alloc_capacity(size_t &cset_regions) { > > This method seems to belong to a public API of `ShenandoahRegionPartitions`. See also comment at the call site of this method. There is a public API for prepare_to_rebuild() followed by finish_rebuild(). This public API is exercised by GenShen, which adjusts the sizes of old-gen and young-gen between the two calls. Single-gen shenandoah does not distinguish between these two steps, because it has no notion of adjusting generation sizes. Single-gen shenandoah invokes the public api rebuild(), which simply delegates to these two functions. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477085137 From kdnilsen at openjdk.org Sat Feb 3 16:10:04 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 16:10:04 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: <5hm7G2JJ9Kmcc7DHdRPNqeZKJcgQXkf-N1kK1VmDAyI=.7141723b-23bf-4705-9fa9-aeb77e38602f@github.com> On Sat, 3 Feb 2024 15:46:04 GMT, Kelvin Nilsen wrote: >> src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 761: >> >>> 759: void ShenandoahFreeSet::prepare_to_rebuild(size_t &cset_regions) { >>> 760: shenandoah_assert_heaplocked(); >>> 761: // This resets all state information, removing all regions from all partitions. >> >> I thought it makes them all unavailable, placing them all into the NotFree partition. > > The wording has been a bit imprecise, possibly made even worse by some global search and replaces on free-set and partition. I've tried to clarify in most recent draft that we consider Collector and Mutator to be partitions, but the NotFree labels means "not in a partition". Maybe you can help me find the right wording here... I'm adjusting this comment in hopes of making the intent more clear. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477087368 From duke at openjdk.org Sat Feb 3 16:13:24 2024 From: duke at openjdk.org (Lei Zaakjyu) Date: Sat, 3 Feb 2024 16:13:24 GMT Subject: RFR: 8325081: Move '_soft_ref_policy' to 'CollectedHeap' [v2] In-Reply-To: References: Message-ID: > trivial Lei Zaakjyu has updated the pull request incrementally with one additional commit since the last revision: construct '_soft_ref_policy' explicitly ------------- Changes: - all: https://git.openjdk.org/jdk/pull/17693/files - new: https://git.openjdk.org/jdk/pull/17693/files/d1eff994..be6500d0 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=17693&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=17693&range=00-01 Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/17693.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/17693/head:pull/17693 PR: https://git.openjdk.org/jdk/pull/17693 From kdnilsen at openjdk.org Sat Feb 3 16:23:05 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 16:23:05 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: <_PQOYVsBqw0NBV7hjtMa9-iX7-KaM24DRZSQVgizphI=.f721790b-5192-4460-ad33-e065a54c4e35@github.com> On Sat, 3 Feb 2024 01:51:24 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 715: > >> 713: } >> 714: >> 715: // Move no more than max_xfer_regions from the existing Collector free partitions to the Mutator free partition. > > I'd avoid the somewhat redundant "Mutator free partition" or "Collector free partition", but merely say "Mutator partition" and "Collector partition". I'd reserve the term "free partition" for a partition that is not a NotFree partition. This allows terseness and precision at the same time. Agree. Thanks. Fixed. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477088887 From kdnilsen at openjdk.org Sat Feb 3 16:31:03 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 16:31:03 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Sat, 3 Feb 2024 08:08:34 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 287: > >> 285: _leftmosts_empty[which_partition] = _max; >> 286: _rightmosts_empty[which_partition] = 0; >> 287: return 0; > > To my earlier comment of using `ssize_t`, that would allow us to signal failure here by returning a -1. In the interest of stability, I'm inclined to leave this convention as is. Could be persuaded to make the change, but there are probably more than 5 touchpoints that also need to be changed (all invocations, existing documentation, etc.) ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477090051 From kdnilsen at openjdk.org Sat Feb 3 16:42:05 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 16:42:05 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Sat, 3 Feb 2024 08:23:06 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 441: > >> 439: // size_t is unsigned, need to dodge underflow when _leftmost = 0 >> 440: // Fast-path: try to allocate in the collector view first >> 441: for (size_t c = _partitions.rightmost(Collector) + 1; c > _partitions.leftmost(Collector); c--) { > > Use `ssize_t` for c. Thanks. Done. > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 457: > >> 455: >> 456: // Try to steal an empty region from the mutator view. >> 457: for (size_t c = _partitions.rightmost_empty(Mutator) + 1; c > _partitions.leftmost_empty(Mutator); c--) { > > `ssize_t` to keep all these loops uniform. Agree. Done. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477094507 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477094575 From kdnilsen at openjdk.org Sat Feb 3 16:52:03 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 16:52:03 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Sat, 3 Feb 2024 08:49:52 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 550: > >> 548: // allocate within. This was observed to result in large amounts of available memory being ignored >> 549: // following a failed shared allocation request. TLAB requests will generally downsize to absorb all >> 550: // memory available within the region even if this is less than the desired size. > > I don't understand this comment, since you are it seems to me retiring the region below at line 553. (Also see comment elsewhere on calling the method `retire_within_partition`, instead of the more natural (to me) `retire_from_partition`. I'm fixing this comment to make more clear. In the current implementation, we only retire a region if the remaining capacity is less than PLAB::min_size(). The previous implementation was observed to retire some regions even when there was 50% of the region still available (in the case that a very large shared alloc failed). ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1477097271 From kdnilsen at openjdk.org Sat Feb 3 21:06:50 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 3 Feb 2024 21:06:50 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v7] In-Reply-To: References: Message-ID: <2KNDqLswo1RO4cNekrDD_nwWxz9QsLNxdYlNyayuzfI=.67115614-d5d4-483a-9691-16f14f9be51a@github.com> > Several objectives: > 1. Reduce humongous allocation failures by segregating regular regions from humongous regions > 2. Do not retire regions just because an allocation failed within the region if the memory remaining within the region is large enough to represent a LAB > 3. Track range of empty regions in addition to range of available regions in order to expedite humongous allocations > 4. Treat collector reserves as available for Mutator allocations after evacuation completes > 5. Improve encapsulation so as to enable an OldCollector reserve for future integration of generational Shenandoah > > On internal performance pipelines, this change shows: > > 1. some Increase in page faults and rss_max with certain workloads, presumably because of "segregation" of humongous from regular regions. > 2. An increase in System CPU time on certain benchmarks: sunflow (+165%), scimark.sparse.large (+50%), lusearch (+43%). This system CPU time increase appears to correlate with increased page faults and/or rss. > 3. An increase in trigger_failure for the hyperalloc_a2048_o4096 experiment (not yet understood) > 4. 2-30x improvements on multiple metrics of the Extremem phased workload latencies (most likely resulting from fewer degenerated or full GCs) > > Shenandoah > ------------------------------------------------------------------------------------------------------- > +166.55% scimark.sparse.large/minor_page_fault_count p=0.00000 > Control: 819938.875 (+/-5724.56 ) 40 > Test: 2185552.625 (+/-26378.64 ) 20 > > +166.16% scimark.sparse.large/rss_max p=0.00000 > Control: 3285226.375 (+/-22812.93 ) 40 > Test: 8743881.500 (+/-104906.69 ) 20 > > +164.78% sunflow/cpu_system p=0.00000 > Control: 1.280s (+/- 0.10s ) 40 > Test: 3.390s (+/- 0.13s ) 20 > > +149.29% hyperalloc_a2048_o4096/trigger_failure p=0.00000 > Control: 3.259 (+/- 1.46 ) 33 > Test: 8.125 (+/- 2.05 ) 20 > > +143.75% pmd/major_page_fault_count p=0.03622 > Control: 1.000 (+/- 0.00 ) 40 > Test: 2.438 (+/- 2.59 ) 20 > > +80.22% lusearch/minor_page_fault_count p=0.00000 > Control: 2043930.938 (+/-4777.14 ) 40 > Test: 3683477.625 (+/-5650.29 ) 20 > > +50.50% scimark.sparse.small/minor_page_fault_count p=0.00000 > Control: 697899.156 (+/-3457.82 ) 40 > Test: 1050363.812 (+/-175237.63 ) 20 > > +49.97% scimark.sparse.small/rss_max p=0.00000 > Control: 277075... Kelvin Nilsen has updated the pull request incrementally with 15 additional commits since the last revision: - Correct an invalid assertion - Remove extraneous assertion - Fix comment describing retirement of regions following failed allocation - Use ssize_t for iterating over partition regions - Fix description of move_regions_from_collector_to_mutator - Rename retire_within_partition New name is retire_from_partition - Remove unhelpful comment that might cause undue concern to maintainers - Rename move_regions_from_collector_to_mutator_partition New name is move_regions_from_collector_to_mutator. Hide the partition abstraction from public api. - Change loop iterator to ssize_t from int - Adjust enum ShenandoahFreeSetPartitionId to clarify NotFree is not a partition - ... and 5 more: https://git.openjdk.org/jdk/compare/fb1f5bfe...5e27a585 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/17561/files - new: https://git.openjdk.org/jdk/pull/17561/files/fb1f5bfe..5e27a585 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=17561&range=06 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=17561&range=05-06 Stats: 125 lines in 5 files changed: 12 ins; 13 del; 100 mod Patch: https://git.openjdk.org/jdk/pull/17561.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/17561/head:pull/17561 PR: https://git.openjdk.org/jdk/pull/17561 From tschatzl at openjdk.org Mon Feb 5 13:42:03 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 5 Feb 2024 13:42:03 GMT Subject: RFR: 8325081: Move '_soft_ref_policy' to 'CollectedHeap' [v2] In-Reply-To: References:

Message-ID: <0hDALL2-YckvKJySdlXRA9_3WsnKgKfJhrydobHXW-A=.8b3d566f-3e51-4863-9456-6af61aa95ed0@github.com> On Sat, 3 Feb 2024 16:13:24 GMT, Lei Zaakjyu wrote: >> trivial > > Lei Zaakjyu has updated the pull request incrementally with one additional commit since the last revision: > > construct '_soft_ref_policy' explicitly lgtm (but not trivial) ------------- Marked as reviewed by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/17693#pullrequestreview-1862827219 From ysr at openjdk.org Mon Feb 5 15:15:08 2024 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Mon, 5 Feb 2024 15:15:08 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Sat, 3 Feb 2024 16:28:43 GMT, Kelvin Nilsen wrote: >> src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 287: >> >>> 285: _leftmosts_empty[which_partition] = _max; >>> 286: _rightmosts_empty[which_partition] = 0; >>> 287: return 0; >> >> To my earlier comment of using `ssize_t`, that would allow us to signal failure here by returning a -1. > > In the interest of stability, I'm inclined to leave this convention as is. Could be persuaded to make the change, but there are probably more than 5 touchpoints that also need to be changed (all invocations, existing documentation, etc.) That sounds reasonable; can be addressed later. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1478408600 From ysr at openjdk.org Mon Feb 5 15:24:04 2024 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Mon, 5 Feb 2024 15:24:04 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: <5hm7G2JJ9Kmcc7DHdRPNqeZKJcgQXkf-N1kK1VmDAyI=.7141723b-23bf-4705-9fa9-aeb77e38602f@github.com> References:

<5hm7G2JJ9Kmcc7DHdRPNqeZKJcgQXkf-N1kK1VmDAyI=.7141723b-23bf-4705-9fa9-aeb77e38602f@github.com> Message-ID: On Sat, 3 Feb 2024 16:07:21 GMT, Kelvin Nilsen wrote: >> The wording has been a bit imprecise, possibly made even worse by some global search and replaces on free-set and partition. I've tried to clarify in most recent draft that we consider Collector and Mutator to be partitions, but the NotFree labels means "not in a partition". Maybe you can help me find the right wording here... > > I'm adjusting this comment in hopes of making the intent more clear. ah, ok. I'll reread the new understanding/documentation with this in mind; thanks! ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1478426572 From ysr at openjdk.org Mon Feb 5 15:32:04 2024 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Mon, 5 Feb 2024 15:32:04 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Sat, 3 Feb 2024 15:44:03 GMT, Kelvin Nilsen wrote: >> We walk twice, first to figure out how much memory is available, and how many regions are completely empty. This information eventually feeds into GenShen's transfer of regions between young-gen and old-gen. There is less motivation for that distinction in single-generation Shenandoah because we do not need to make these informed transfers. > > But even before genshen changes, there were two walks through the regions. This is because the rebuild wants to "optimize" the organization of the mutator free set and the collector free set. Certain regions that may have been in the mutator set during previous GC will be in the collector set during the next gc, and vice versa. We strive to arrange that each free set is "tightly packed" over a subrange of the regions, with collector free set at the high end of memory and mutator set at the lower end of memory. With GenShen integration, we will place the old collector set above the collector set. I suppose I'll need to look through this more carefully. In the case of single gen, it still sounded to me like the "clear" really doesn't accomplish anything other than taking stuff out of the free partitions and then the `find_..` sorts them into the new free partitions, and it looked like that could be accomplished by a single walk. If GenShen then wants to break them into two sequences with some other step in between, may be one offers the three API's: one the single-gen optimized one that avoids two walks, and the two APIs `clear` and `find_...` separately for GenShen. But it sounds like you are saying that there is a _need_ for these two walks in the case of single gen as well. Let me discuss this with you offline so I understand this better as I am probably missing something crucial here. Thanks! ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1478437768 From aboldtch at openjdk.org Mon Feb 5 15:50:08 2024 From: aboldtch at openjdk.org (Axel Boldt-Christmas) Date: Mon, 5 Feb 2024 15:50:08 GMT Subject: RFR: 8322630: Remove ICStubs and related safepoints [v6] In-Reply-To: References:

Message-ID: On Sat, 3 Feb 2024 16:13:24 GMT, Lei Zaakjyu wrote: >> trivial > > Lei Zaakjyu has updated the pull request incrementally with one additional commit since the last revision: > > construct '_soft_ref_policy' explicitly Looks good (and agree with @tschatzl about not trivial, but that doesn't matter now). Also, for future reference, I think the PR description ought to provide more information than was provided here. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/17693#pullrequestreview-1863165729 From duke at openjdk.org Mon Feb 5 23:50:00 2024 From: duke at openjdk.org (Lei Zaakjyu) Date: Mon, 5 Feb 2024 23:50:00 GMT Subject: RFR: 8325081: Move '_soft_ref_policy' to 'CollectedHeap' [v2] In-Reply-To: References:

Message-ID: <2mZDTGNU_wbUwHR9TE2ji1rOwddFz7yXcEFBwzsGSs8=.31aaedbc-7d0d-4d82-ad37-a241e1305e31@github.com> On Sat, 3 Feb 2024 16:13:24 GMT, Lei Zaakjyu wrote: >> trivial > > Lei Zaakjyu has updated the pull request incrementally with one additional commit since the last revision: > > construct '_soft_ref_policy' explicitly thanks for the review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/17693#issuecomment-1927598071 From eosterlund at openjdk.org Mon Feb 5 23:51:34 2024 From: eosterlund at openjdk.org (Erik =?UTF-8?B?w5ZzdGVybHVuZA==?=) Date: Mon, 5 Feb 2024 23:51:34 GMT Subject: RFR: 8322630: Remove ICStubs and related safepoints [v6] In-Reply-To: References:

Message-ID: On Mon, 5 Feb 2024 15:47:29 GMT, Axel Boldt-Christmas wrote: > All my comments have been addressed. > > As previously mentioned as long as the performance is there, then this looks good. Thanks for the review, @xmas92. Any other takers? ------------- PR Comment: https://git.openjdk.org/jdk/pull/17495#issuecomment-1928143097 From wkemper at openjdk.org Tue Feb 6 00:09:36 2024 From: wkemper at openjdk.org (William Kemper) Date: Tue, 6 Feb 2024 00:09:36 GMT Subject: RFR: Merge openjdk/jdk:master [v2] In-Reply-To: References: Message-ID: > Merges tag jdk-23+8 William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 72 commits: - Merge tag 'jdk-23+8' into merge-jdk-23+8 Added tag jdk-23+8 for changeset 5b9b176c - 8324174: assert(m->is_entered(current)) failed: invariant Reviewed-by: epeter, dlong, thartmann - 8325042: remove unused JVMDITools test files Reviewed-by: coleenp - 8323621: JDK build should exclude snippet class in java.lang.foreign Reviewed-by: mcimadamore - 8324238: [macOS] java/awt/Frame/ShapeNotSetSometimes/ShapeNotSetSometimes.java fails with the shape has not been applied msg Reviewed-by: azvegint, dnguyen - 8320342: Use PassFailJFrame for TruncatedPopupMenuTest.java Reviewed-by: honkar, aivanov - 8324981: Shenandoah: Move commit and soft max heap changed methods into heap Reviewed-by: shade - 8303374: Implement JEP 455: Primitive Types in Patterns, instanceof, and switch (Preview) Co-authored-by: Jan Lahoda Co-authored-by: Maurizio Cimadamore Co-authored-by: Gavin Bierman Co-authored-by: Brian Goetz Co-authored-by: Raffaello Giulietti Co-authored-by: Aggelos Biboudis Reviewed-by: vromero, jlahoda - 8320712: Rewrite BadFactoryTest in pure Java Reviewed-by: jpai, sundar - 8324771: Obsolete RAMFraction related flags Reviewed-by: dholmes, mbaesken, tschatzl - ... and 62 more: https://git.openjdk.org/shenandoah/compare/1ecdc046...b775a88f ------------- Changes: https://git.openjdk.org/shenandoah/pull/389/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=389&range=01 Stats: 18823 lines in 1229 files changed: 7186 ins; 1717 del; 9920 mod Patch: https://git.openjdk.org/shenandoah/pull/389.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/389/head:pull/389 PR: https://git.openjdk.org/shenandoah/pull/389 From wkemper at openjdk.org Tue Feb 6 00:13:58 2024 From: wkemper at openjdk.org (William Kemper) Date: Tue, 6 Feb 2024 00:13:58 GMT Subject: RFR: Merge openjdk/jdk21u-dev:master [v2] In-Reply-To: References: Message-ID: > Merges tag jdk-21.0.3+1 William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 200 commits: - Merge remote-tracking branch 'shenandoah-jdk21u/master' into merge-jdk-21.0.3+1 - 8323154: C2: assert(cmp != nullptr && cmp->Opcode() == Op_Cmp(bt)) failed: no exit test Backport-of: 6997bfc68def7f80fbf6a7486a4b9f61225fc471 - 8320943: Files/probeContentType/Basic.java fails on latest Windows 11 - content type mismatch Backport-of: 87516e29dc5015c4cab2c07c5539ad30f2768667 - 8313507: Remove pkcs11/Cipher/TestKATForGCM.java from ProblemList Backport-of: e8471f6bbe692a0d1e293f9e09aaa4f32312eb6a - 8315600: Open source few more headless Swing misc tests Backport-of: b05198a4f354934bc344fe9cbc19d98fd8bc3977 - 8274122: java/io/File/createTempFile/SpecialTempFile.java fails in Windows 11 Backport-of: 4a142c3b0831d60b3d5540f58973e8ad3d1304bf - 8324280: RISC-V: Incorrect implementation in VM_Version::parse_satp_mode Backport-of: e7fdac9d5ce56d2f589df59a7fd2869e35ba2991 - 8324659: GHA: Generic jtreg errors are not reported Backport-of: c313d451a513eb08de0b295c1ce66d0d849d2374 - 8315761: Open source few swing JList and JMenuBar tests Backport-of: bb6b3f2486b07a6ccdeea18519453e6d9c05c2c3 - 8322142: JFR: Periodic tasks aren't orphaned between recordings Backport-of: 1551928502c8ed96350e7b4f1316ea35587407fe - ... and 190 more: https://git.openjdk.org/shenandoah-jdk21u/compare/66665613...4e4e70b0 ------------- Changes: https://git.openjdk.org/shenandoah-jdk21u/pull/19/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=19&range=01 Stats: 30106 lines in 1346 files changed: 15608 ins; 5805 del; 8693 mod Patch: https://git.openjdk.org/shenandoah-jdk21u/pull/19.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk21u.git pull/19/head:pull/19 PR: https://git.openjdk.org/shenandoah-jdk21u/pull/19 From duke at openjdk.org Tue Feb 6 01:08:57 2024 From: duke at openjdk.org (Lei Zaakjyu) Date: Tue, 6 Feb 2024 01:08:57 GMT Subject: Integrated: 8325081: Move '_soft_ref_policy' to 'CollectedHeap' In-Reply-To: References: Message-ID: On Sat, 3 Feb 2024 07:54:54 GMT, Lei Zaakjyu wrote: > trivial This pull request has now been integrated. Changeset: e0fd3f4d Author: Lei Zaakjyu Committer: Kim Barrett URL: https://git.openjdk.org/jdk/commit/e0fd3f4dababad7189b9e02b37a40ea1a3907554 Stats: 50 lines in 14 files changed: 4 ins; 44 del; 2 mod 8325081: Move '_soft_ref_policy' to 'CollectedHeap' Reviewed-by: kbarrett, tschatzl ------------- PR: https://git.openjdk.org/jdk/pull/17693 From kdnilsen at openjdk.org Tue Feb 6 01:37:54 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 6 Feb 2024 01:37:54 GMT Subject: RFR: 8324995: Shenandoah: Skip to full gc for humongous allocation failures [v3] In-Reply-To: References:

Message-ID: On Wed, 31 Jan 2024 21:50:06 GMT, William Kemper wrote: >> Shenandoah degenerated cycles do not compact regions. When a humongous allocation fails, it is likely due to fragmentation which is better addressed by a full gc. > > William Kemper has updated the pull request incrementally with one additional commit since the last revision: > > Fix typo in comment Out of cycle alloc failure might as well do full GC. I would keep that behavior in my experiment. ------------- PR Comment: https://git.openjdk.org/jdk/pull/17638#issuecomment-1928644014 From ysr at openjdk.org Tue Feb 6 03:33:58 2024 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Tue, 6 Feb 2024 03:33:58 GMT Subject: RFR: 8323634: Shenandoah: Document behavior of EvacOOM protocol [v5] In-Reply-To: References: <6ciSyKdz9hA6RBOZeDicFetK_G4AUBpx40YX7yT1O1M=.870e1ba1-6f4b-48e9-8360-dab141a3041d@github.com> Message-ID: On Wed, 24 Jan 2024 17:53:39 GMT, Kelvin Nilsen wrote: >> The protocol for handling OOM during evacuation is subtle and critical for correct operation. This PR does NOT change behavior. It provides improved documentation of existing behavior. > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Fix spelling error and mismatched parentheses. I like the longer block comment you wrote in the .hpp file describing the protocol because it provides fuller context and defines the intent of the protocol in greater detail. I am not sure which I would go with as both descriptions look good to me in their own way. The existing one has the benefit of being both concrete and concise. With that in mind, I also left a few comments on the original documentation on the left side. I am good with whatever you choose to use. The smaller individual documentation comments for each method look good to me, irrespective. Sorry for the long delay in getting back on this. The protocol is subtle, and I like your ideas about potentially improving it in the future. src/hotspot/share/gc/shenandoah/shenandoahEvacOOMHandler.cpp line 46: > 44: void ShenandoahEvacOOMCounter::clear() { > 45: assert(unmasked_count() == 0, "sanity"); > 46: Atomic::release_store_fence(&_bits, (jint)0); Leaving a comment here but it applies to the comment at line 40 above, which reads: // NOTE: It's ok to simply decrement, even with mask set, because unmasked value is positive. May be leave a block comment at the start of the method at line 37 that states: // Decrement the counter atomically, leaving the OOM bit unchanged at its original state. Then, the comment at current line 40, could : // The value is necessarily positive before we decrement, as we assert above, because // this thread incremented it earlier. Since we atomically decrement a positive value, // the state of the OOM bit is left unchanged at its original value. src/hotspot/share/gc/shenandoah/shenandoahEvacOOMHandler.cpp line 52: > 50: // associated with this counter. After all _num_counters OOM bits have been set, all threads newly attempting to enter_evacuation > 51: // will be informed that they cannot allocate for evacation. Threads that entered evacuation before the OOM bit was set may > 52: // continue to allocate for evacuation until they exit_evacuation. This can simply state: // Set the OOM bit, and optionally decrement the counter I don't think you need to describe how this fits into the OOM protocol, at least not here. That confuses the documentation and the reader. That can be put in the caller or in a block comment describing the protocol. src/hotspot/share/gc/shenandoah/shenandoahEvacOOMHandler.cpp line 62: > 60: jint other = Atomic::cmpxchg(&_bits, threads_in_evac, newval); > 61: if (other == threads_in_evac) { > 62: // Success: return so we can wait for other threads to stop allocating. I would simplify this comment to: // Successfully set the OOM bit (and optionally decremented the counter of threads_in_evac) The context of what happens after we return should be described in the caller, not here. src/hotspot/share/gc/shenandoah/shenandoahEvacOOMHandler.cpp line 65: > 63: break; > 64: } else { > 65: // Failure: try again with updated new value. Adding comment here, but applies to `ShenandoahEvacOOMCounter::try_increment()` below, lines 71-89, as a block comment before line 71; one could document it as: // Unless OOM bit is set, increment the counter and return true. // If OOM bit is set, simply return false without incrementing the counter. The context of what the caller does, should be described in the caller, not in this method. src/hotspot/share/gc/shenandoah/shenandoahEvacOOMHandler.hpp line 85: > 83: * OOM-during-evac-handler. The handler allows multiple threads to enter and exit > 84: * evacuation path, but on OOME it requires all threads that experienced OOME to wait > 85: * for current threads to leave, and blocks other threads from entering. The counter state After the period on line 85, I'd add one sentence: // The counter not only tracks the number of threads in the evacuation path, // but also whether any thread has encountered an OOM-during-evac. It thus // captures all of the state needed to track the execution of the protocol. src/hotspot/share/gc/shenandoah/shenandoahEvacOOMHandler.hpp line 87: > 85: * for current threads to leave, and blocks other threads from entering. The counter state > 86: * is striped across multiple cache lines to reduce contention when many threads attempt > 87: * to enter or leave the protocol at the same time. At the end of the period at line 87, I'd add: // As a result, the protocol needs special steps, in the event of an OOM-during-evac, // to ensure that all of the striped counters are zero before the protocol can terminate. // Once the protocol terminates with the OOM bit set, no threads will attempt // further allocations for evacuation, so any unresolved forwarding pointer uniquely // to either its new already-forwarded location or to its original to-space location. src/hotspot/share/gc/shenandoah/shenandoahEvacOOMHandler.hpp line 130: > 128: * safepoint. Marking by Full GC will finish updating references that might > 129: * be inconsistent within the heap, and will then compact all live memory within > 130: * the heap. I like the longer comment you wrote because it provides fuller context and defines the intent of the protocol in greater detail. I am not sure which I would go with as both descriptions look good to me in their own way. With that in mind, I also left a few comments on the original documentation on the left side. I am good with whatever you choose to use. src/hotspot/share/gc/shenandoah/shenandoahEvacOOMHandler.hpp line 136: > 134: * Maintain a count of how many threads are on an evac-path (which is allocating for evacuation) > 135: * > 136: * Upon entry of the evac-path, entering thread will attempt to increase the counter, "atomically increment the counter, if the OOM-bit isn't set." src/hotspot/share/gc/shenandoah/shenandoahEvacOOMHandler.hpp line 147: > 145: * > 146: * > 147: * Upon exit, exiting thread will decrease the counter using atomic dec. atomically decrement the counter; rather than "decrease the counter using atomic dec." src/hotspot/share/gc/shenandoah/shenandoahEvacOOMHandler.hpp line 172: > 170: * make the protocol more efficient. > 171: * > 172: * TODO: make refinements to the OOM-during-evac protocol so that it is less disruptive and more efficient. May be all of this and the remainder of this comment in terms of improvements from line 162 above up to line 203 below should instead go in a JBS ticket, include here only a terse TODO with a pointer to the ticket for details: // TODO: JDK-XXXX will investigate potential performance/efficiency improvements to this protocol. src/hotspot/share/gc/shenandoah/shenandoahEvacOOMHandler.hpp line 212: > 210: _oom_not_evacuating > 211: }; > 212: volatile ShenandoahEvacuationState _evacuation_state; Leave a single line of documentation here stating that this is an auxiliary field introduced just for the sake of checking an invariant of the protocol. ------------- Marked as reviewed by ysr (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/17385#pullrequestreview-1842713670 PR Review Comment: https://git.openjdk.org/jdk/pull/17385#discussion_r1479084937 PR Review Comment: https://git.openjdk.org/jdk/pull/17385#discussion_r1479059992 PR Review Comment: https://git.openjdk.org/jdk/pull/17385#discussion_r1479060963 PR Review Comment: https://git.openjdk.org/jdk/pull/17385#discussion_r1479063865 PR Review Comment: https://git.openjdk.org/jdk/pull/17385#discussion_r1479121544 PR Review Comment: https://git.openjdk.org/jdk/pull/17385#discussion_r1479122394 PR Review Comment: https://git.openjdk.org/jdk/pull/17385#discussion_r1479174040 PR Review Comment: https://git.openjdk.org/jdk/pull/17385#discussion_r1479158182 PR Review Comment: https://git.openjdk.org/jdk/pull/17385#discussion_r1479123287 PR Review Comment: https://git.openjdk.org/jdk/pull/17385#discussion_r1479102766 PR Review Comment: https://git.openjdk.org/jdk/pull/17385#discussion_r1479066055 From wkemper at openjdk.org Tue Feb 6 17:09:24 2024 From: wkemper at openjdk.org (William Kemper) Date: Tue, 6 Feb 2024 17:09:24 GMT Subject: Integrated: Merge openjdk/jdk:master In-Reply-To: References: Message-ID: On Fri, 2 Feb 2024 14:09:54 GMT, William Kemper wrote: > Merges tag jdk-23+8 This pull request has now been integrated. Changeset: 55068c2e Author: William Kemper URL: https://git.openjdk.org/shenandoah/commit/55068c2ee4a7d3189b930158780774a1bb36636d Stats: 18823 lines in 1229 files changed: 7186 ins; 1717 del; 9920 mod Merge ------------- PR: https://git.openjdk.org/shenandoah/pull/389 From wkemper at openjdk.org Tue Feb 6 22:02:14 2024 From: wkemper at openjdk.org (William Kemper) Date: Tue, 6 Feb 2024 22:02:14 GMT Subject: RFR: Merge openjdk/jdk21u-dev:master [v3] In-Reply-To: References: Message-ID: > Merges tag jdk-21.0.3+1 William Kemper has updated the pull request incrementally with one additional commit since the last revision: Fix wrong API usage in wrongly resolved conflict ------------- Changes: - all: https://git.openjdk.org/shenandoah-jdk21u/pull/19/files - new: https://git.openjdk.org/shenandoah-jdk21u/pull/19/files/4e4e70b0..8dfe163f Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=19&range=02 - incr: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=19&range=01-02 Stats: 2 lines in 1 file changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.org/shenandoah-jdk21u/pull/19.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk21u.git pull/19/head:pull/19 PR: https://git.openjdk.org/shenandoah-jdk21u/pull/19 From kdnilsen at openjdk.org Wed Feb 7 01:42:10 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 7 Feb 2024 01:42:10 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v8] In-Reply-To: References: Message-ID: > Several objectives: > 1. Reduce humongous allocation failures by segregating regular regions from humongous regions > 2. Do not retire regions just because an allocation failed within the region if the memory remaining within the region is large enough to represent a LAB > 3. Track range of empty regions in addition to range of available regions in order to expedite humongous allocations > 4. Treat collector reserves as available for Mutator allocations after evacuation completes > 5. Improve encapsulation so as to enable an OldCollector reserve for future integration of generational Shenandoah > > On internal performance pipelines, this change shows: > > 1. some Increase in page faults and rss_max with certain workloads, presumably because of "segregation" of humongous from regular regions. > 2. An increase in System CPU time on certain benchmarks: sunflow (+165%), scimark.sparse.large (+50%), lusearch (+43%). This system CPU time increase appears to correlate with increased page faults and/or rss. > 3. An increase in trigger_failure for the hyperalloc_a2048_o4096 experiment (not yet understood) > 4. 2-30x improvements on multiple metrics of the Extremem phased workload latencies (most likely resulting from fewer degenerated or full GCs) > > Shenandoah > ------------------------------------------------------------------------------------------------------- > +166.55% scimark.sparse.large/minor_page_fault_count p=0.00000 > Control: 819938.875 (+/-5724.56 ) 40 > Test: 2185552.625 (+/-26378.64 ) 20 > > +166.16% scimark.sparse.large/rss_max p=0.00000 > Control: 3285226.375 (+/-22812.93 ) 40 > Test: 8743881.500 (+/-104906.69 ) 20 > > +164.78% sunflow/cpu_system p=0.00000 > Control: 1.280s (+/- 0.10s ) 40 > Test: 3.390s (+/- 0.13s ) 20 > > +149.29% hyperalloc_a2048_o4096/trigger_failure p=0.00000 > Control: 3.259 (+/- 1.46 ) 33 > Test: 8.125 (+/- 2.05 ) 20 > > +143.75% pmd/major_page_fault_count p=0.03622 > Control: 1.000 (+/- 0.00 ) 40 > Test: 2.438 (+/- 2.59 ) 20 > > +80.22% lusearch/minor_page_fault_count p=0.00000 > Control: 2043930.938 (+/-4777.14 ) 40 > Test: 3683477.625 (+/-5650.29 ) 20 > > +50.50% scimark.sparse.small/minor_page_fault_count p=0.00000 > Control: 697899.156 (+/-3457.82 ) 40 > Test: 1050363.812 (+/-175237.63 ) 20 > > +49.97% scimark.sparse.small/rss_max p=0.00000 > Control: 277075... Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Combine first two passes over freeset during rebuild ------------- Changes: - all: https://git.openjdk.org/jdk/pull/17561/files - new: https://git.openjdk.org/jdk/pull/17561/files/5e27a585..07fe812a Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=17561&range=07 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=17561&range=06-07 Stats: 87 lines in 3 files changed: 73 ins; 6 del; 8 mod Patch: https://git.openjdk.org/jdk/pull/17561.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/17561/head:pull/17561 PR: https://git.openjdk.org/jdk/pull/17561 From dlong at openjdk.org Wed Feb 7 02:59:58 2024 From: dlong at openjdk.org (Dean Long) Date: Wed, 7 Feb 2024 02:59:58 GMT Subject: RFR: 8322630: Remove ICStubs and related safepoints [v6] In-Reply-To: References:

Message-ID: On Wed, 7 Feb 2024 02:56:54 GMT, Dean Long wrote: > I saw an earlier version, but I plan to look at your latest, soon. Thank you @dean-long I appreciate it. ------------- PR Comment: https://git.openjdk.org/jdk/pull/17495#issuecomment-1931405479 From kdnilsen at openjdk.org Wed Feb 7 16:26:54 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 7 Feb 2024 16:26:54 GMT Subject: RFR: 8324995: Shenandoah: Skip to full gc for humongous allocation failures [v3] In-Reply-To: References:

Message-ID: On Mon, 5 Feb 2024 15:29:22 GMT, Y. Srinivas Ramakrishna wrote: >> But even before genshen changes, there were two walks through the regions. This is because the rebuild wants to "optimize" the organization of the mutator free set and the collector free set. Certain regions that may have been in the mutator set during previous GC will be in the collector set during the next gc, and vice versa. We strive to arrange that each free set is "tightly packed" over a subrange of the regions, with collector free set at the high end of memory and mutator set at the lower end of memory. With GenShen integration, we will place the old collector set above the collector set. > > I suppose I'll need to look through this more carefully. In the case of single gen, it still sounded to me like the "clear" really doesn't accomplish anything other than taking stuff out of the free partitions and then the `find_..` sorts them into the new free partitions, and it looked like that could be accomplished by a single walk. > > If GenShen then wants to break them into two sequences with some other step in between, may be one offers the three API's: one the single-gen optimized one that avoids two walks, and the two APIs `clear` and `find_...` separately for GenShen. But it sounds like you are saying that there is a _need_ for these two walks in the case of single gen as well. Let me discuss this with you offline so I understand this better as I am probably missing something crucial here. Thanks! In the most recent commit, I have merged the first two passes through the free set. We no longer have a separate clear pass that precedes moving regions with available capacity into the mutator free set. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1481913157 From kdnilsen at openjdk.org Wed Feb 7 18:33:11 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 7 Feb 2024 18:33:11 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 05:20:56 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 36: > >> 34: enum ShenandoahFreeSetPartitionId : uint8_t { >> 35: NotFree, // Region has been retired and is not in any free set: there is no available memory. >> 36: Mutator, // Region is in the Mutator free set: available memory is available to mutators. > > Just want to make sure: "available to mutators" -- is this both for object allocation as well as for possible evacuation as part of the mutator LRB? For personal clarification: when the mutator LRB needs to evacuate an object, it uses the collector set. Each mutator has three TLABS: one for mutator allocations, one for young-gen evacuations, and one for old-gen evacuations. Let me know if you think we need more documentation around this. > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 37: > >> 35: NotFree, // Region has been retired and is not in any free set: there is no available memory. >> 36: Mutator, // Region is in the Mutator free set: available memory is available to mutators. >> 37: Collector, // Region is in the Collector free set: available memory is reserved for evacuations. > > When mutators evacuate the target of an LRB, do they use `Mutator` or `Collector`. I assume the former? In that case, I'd say for Collector: `available memory is reserved for collector threads for evacuation`. actually, the collector reserve is for all evacuation, whether performed by collector worker threads or by mutator threads doing LRB handling. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1481915638 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1481916799 From kdnilsen at openjdk.org Wed Feb 7 18:39:10 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 7 Feb 2024 18:39:10 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 05:22:21 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 50: > >> 48: const size_t _max; // The maximum number of heap regions >> 49: const size_t _region_size_bytes; >> 50: const ShenandoahFreeSet* _free_set; > > Interesting: why does the partitioning need a reference to its containing free set? There are a few places where ShenandoahRegionPartitions invokes services provided by ShenandoahFreeSet, such as calling _free_set->alloc_capacity() when enforcing certain assertions and when adjusting interval boundaries. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1481922483 From kdnilsen at openjdk.org Wed Feb 7 18:47:59 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 7 Feb 2024 18:47:59 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 07:27:46 GMT, Y. Srinivas Ramakrishna wrote: >> src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 44: >> >>> 42: >>> 43: // This class implements partitioning of regions into distinct sets. Each ShenandoahHeapRegion is either in the Mutator free set, >>> 44: // the Collector free set, or in neither free set (NotFree). >> >> I noticed that you use the term "free partition" quite a lot later, I'd just start using that term early on when talking about these sets. You could, for example, say: >> >> // Whenever we say "free partition", we mean any partition other than the "NotFree" partition. > > Or: > > // Any partition that is not the "NotFree" partition is a "free partition". Thanks. I've made this change. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1481929525 From kdnilsen at openjdk.org Wed Feb 7 18:47:58 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 7 Feb 2024 18:47:58 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Wed, 7 Feb 2024 18:29:41 GMT, Kelvin Nilsen wrote: >> src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 36: >> >>> 34: enum ShenandoahFreeSetPartitionId : uint8_t { >>> 35: NotFree, // Region has been retired and is not in any free set: there is no available memory. >>> 36: Mutator, // Region is in the Mutator free set: available memory is available to mutators. >> >> Just want to make sure: "available to mutators" -- is this both for object allocation as well as for possible evacuation as part of the mutator LRB? > > For personal clarification: when the mutator LRB needs to evacuate an object, it uses the collector set. Each mutator has three TLABS: one for mutator allocations, one for young-gen evacuations, and one for old-gen evacuations. Let me know if you think we need more documentation around this. (actually, the old-gen TLAB is not in single-generation Shenandoah, only in GenShen.) ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1481928053 From kdnilsen at openjdk.org Wed Feb 7 18:48:00 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 7 Feb 2024 18:48:00 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 05:51:14 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 74: > >> 72: // and _used[p], even though the region may have been removed from the free set. >> 73: size_t _capacity[NumPartitions]; >> 74: size_t _used[NumPartitions]; > > In light of your earlier documentation of leftmost/righmost/empty/available etc. then, would it be fair to say that the following statement is always true: > > for p = NotFree: > 1. leftmosts[p] = leftmosts_empty[p] = _max > 2. rightmosts_empty[p] = rightmosts_empty[p] = 0 > 3. capacity[p] = used[p] = region_size > > Are the "NotFree" entries for these arrays ever used? > > If not, is there any point in keeping them in a product build? Is there any point in keeping them in a non-product build? Does it have some other role that makes it important to keep it, anyway? In most recent change, I shrunk the sizes of the arrays to not include an entry for NotFree. We only maintain entries for Mutator and Collector. > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 75: > >> 73: size_t _capacity[NumPartitions]; >> 74: size_t _used[NumPartitions]; >> 75: size_t _region_counts[NumPartitions]; > > If tracked, is this an invariant of these fields? > > - region_counts[NotFree] == _max - (region_counts[Mutator] + region_counts[Collector]) > > (This would also make the region_counts[NotFree] unnecessary? See my previous comment.) This is not tracked, and no longer relevant because I removed region_counts[NotFree]. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1481931675 PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1481932447 From kdnilsen at openjdk.org Wed Feb 7 18:56:57 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 7 Feb 2024 18:56:57 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 06:17:25 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 95: > >> 93: void make_free(size_t idx, ShenandoahFreeSetPartitionId which_partition, size_t region_capacity); >> 94: >> 95: // Place region idx into free partition new_partition. Requires that idx is currently not NotFree. > > Include semantics of region_capacity in comment, e.g.: > > > // Move region idx, with region_capacity bytes of available free space, > // from the NotFree partition to the free partition new_partition. Thanks. I've adjusted this comment to make the intent more clear. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1481941805 From wkemper at openjdk.org Wed Feb 7 19:16:59 2024 From: wkemper at openjdk.org (William Kemper) Date: Wed, 7 Feb 2024 19:16:59 GMT Subject: RFR: Merge openjdk/jdk21u-dev:master [v4] In-Reply-To: References: Message-ID: > Merges tag jdk-21.0.3+1 William Kemper has updated the pull request incrementally with one additional commit since the last revision: Finally fix these funny tests ------------- Changes: - all: https://git.openjdk.org/shenandoah-jdk21u/pull/19/files - new: https://git.openjdk.org/shenandoah-jdk21u/pull/19/files/8dfe163f..3b76ed75 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=19&range=03 - incr: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=19&range=02-03 Stats: 3 lines in 2 files changed: 0 ins; 0 del; 3 mod Patch: https://git.openjdk.org/shenandoah-jdk21u/pull/19.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk21u.git pull/19/head:pull/19 PR: https://git.openjdk.org/shenandoah-jdk21u/pull/19 From kdnilsen at openjdk.org Wed Feb 7 21:00:57 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 7 Feb 2024 21:00:57 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: <2-le3X33u0wR8EuyrQnG0rn2YtiMUDWydFCq0-R9U4s=.1ec26721-bb1c-4d50-894e-277aacc2170d@github.com> On Thu, 1 Feb 2024 03:01:08 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 171: > >> 169: }; >> 170: >> 171: class ShenandoahFreeSet : public CHeapObj { > > It would be good to have a block comment here motivating this class. > It seems (from looking at some of its public APIs) as if it publicly exports only the "mutator view", which I find interesting. > > The other partitions in `ShenandoahRegionPartition` appears to be for efficiency of the implementation in service of the public APIs for ShenandoahFreeSet. Thanks. I've added a block comment to describe ShenandoahFreeSet and have enhanced the comment that describes ShenandoahRegionPartition. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1482087963 From kdnilsen at openjdk.org Wed Feb 7 21:18:58 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 7 Feb 2024 21:18:58 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 02:39:33 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 187: > >> 185: // regions. >> 186: // >> 187: // Precondition: req.size() > ShenandoahHeapRegion::humongous_threshold_words(). > > `>` or `>=` ? >. See the only invocation from ShenandoahFreeSet::allocate(). ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1482106597 From kdnilsen at openjdk.org Wed Feb 7 21:25:12 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 7 Feb 2024 21:25:12 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v10] In-Reply-To: References: Message-ID: > Several objectives: > 1. Reduce humongous allocation failures by segregating regular regions from humongous regions > 2. Do not retire regions just because an allocation failed within the region if the memory remaining within the region is large enough to represent a LAB > 3. Track range of empty regions in addition to range of available regions in order to expedite humongous allocations > 4. Treat collector reserves as available for Mutator allocations after evacuation completes > 5. Improve encapsulation so as to enable an OldCollector reserve for future integration of generational Shenandoah > > On internal performance pipelines, this change shows: > > 1. some Increase in page faults and rss_max with certain workloads, presumably because of "segregation" of humongous from regular regions. > 2. An increase in System CPU time on certain benchmarks: sunflow (+165%), scimark.sparse.large (+50%), lusearch (+43%). This system CPU time increase appears to correlate with increased page faults and/or rss. > 3. An increase in trigger_failure for the hyperalloc_a2048_o4096 experiment (not yet understood) > 4. 2-30x improvements on multiple metrics of the Extremem phased workload latencies (most likely resulting from fewer degenerated or full GCs) > > Shenandoah > ------------------------------------------------------------------------------------------------------- > +166.55% scimark.sparse.large/minor_page_fault_count p=0.00000 > Control: 819938.875 (+/-5724.56 ) 40 > Test: 2185552.625 (+/-26378.64 ) 20 > > +166.16% scimark.sparse.large/rss_max p=0.00000 > Control: 3285226.375 (+/-22812.93 ) 40 > Test: 8743881.500 (+/-104906.69 ) 20 > > +164.78% sunflow/cpu_system p=0.00000 > Control: 1.280s (+/- 0.10s ) 40 > Test: 3.390s (+/- 0.13s ) 20 > > +149.29% hyperalloc_a2048_o4096/trigger_failure p=0.00000 > Control: 3.259 (+/- 1.46 ) 33 > Test: 8.125 (+/- 2.05 ) 20 > > +143.75% pmd/major_page_fault_count p=0.03622 > Control: 1.000 (+/- 0.00 ) 40 > Test: 2.438 (+/- 2.59 ) 20 > > +80.22% lusearch/minor_page_fault_count p=0.00000 > Control: 2043930.938 (+/-4777.14 ) 40 > Test: 3683477.625 (+/-5650.29 ) 20 > > +50.50% scimark.sparse.small/minor_page_fault_count p=0.00000 > Control: 697899.156 (+/-3457.82 ) 40 > Test: 1050363.812 (+/-175237.63 ) 20 > > +49.97% scimark.sparse.small/rss_max p=0.00000 > Control: 277075... Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Respond to review feedback ------------- Changes: - all: https://git.openjdk.org/jdk/pull/17561/files - new: https://git.openjdk.org/jdk/pull/17561/files/7d5c1fc6..b2ba4cf2 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=17561&range=09 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=17561&range=08-09 Stats: 43 lines in 2 files changed: 27 ins; 0 del; 16 mod Patch: https://git.openjdk.org/jdk/pull/17561.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/17561/head:pull/17561 PR: https://git.openjdk.org/jdk/pull/17561 From kdnilsen at openjdk.org Wed Feb 7 21:25:12 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 7 Feb 2024 21:25:12 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 02:55:49 GMT, Y. Srinivas Ramakrishna wrote: >> src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 184: >> >>> 182: HeapWord* allocate_single(ShenandoahAllocRequest& req, bool& in_new_region); >>> 183: >>> 184: // While holding the heap lock, allocate memory for a humongous object which will span multiple contiguous heap >> >> `which will` or `which may`? (Is a humongous object allowed to span just a single region as well?) >> >> Or are objects humongous only if they won't fit in a region? In which case the "will" is correct. >> >> I was confused by tests that use `ShenandoahHumongousThreshold=50` , `=90`, etc. >> >> May be in those cases, we go through the `allocate_single()` despite allocating an object (or block) bigger than `ShenandoahHeapRegion::humongous_threshold_words()` ? (That would make the pre-condition of the previous method suspect, though.) > > Same remark applies to the precondition comment below (which is correct, but could be made stronger to say `req.size() > ShenandoahHeapRegion::RegionSizeWords` or such? Thanks for prodding with these questions. My comment was not accurate. I've endeavored to fix the comment. A humongous object may span 1 or more regions. The extra memory within the region that is not used to represent the humongous object is wasted. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1482111007 From kdnilsen at openjdk.org Wed Feb 7 21:25:12 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 7 Feb 2024 21:25:12 GMT Subject: RFR: 8324649: Shenandoah: refactor implementation of free set [v6] In-Reply-To: References:

Message-ID: On Thu, 1 Feb 2024 02:58:51 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename and comments for _capacity_of and _used_by > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.hpp line 231: > >> 229: inline size_t available() const { >> 230: assert(used() <= capacity(), "must use less than capacity"); >> 231: return capacity() - used(); > > So `ShenandoahFreeSet` publicly exports only the mutator view? I think of this as "public to the mutator" and "friendly public to the collector". I've tried to clarify with new comments. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17561#discussion_r1482111723 From wkemper at openjdk.org Wed Feb 7 21:48:04 2024 From: wkemper at openjdk.org (William Kemper) Date: Wed, 7 Feb 2024 21:48:04 GMT Subject: RFR: Merge openjdk/jdk21u-dev:master [v5] In-Reply-To: References: Message-ID: > Merges tag jdk-21.0.3+1 William Kemper has updated the pull request incrementally with one additional commit since the last revision: Fix whitespace ------------- Changes: - all: https://git.openjdk.org/shenandoah-jdk21u/pull/19/files - new: https://git.openjdk.org/shenandoah-jdk21u/pull/19/files/3b76ed75..af5fd615 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=19&range=04 - incr: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=19&range=03-04 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/shenandoah-jdk21u/pull/19.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk21u.git pull/19/head:pull/19 PR: https://git.openjdk.org/shenandoah-jdk21u/pull/19 From dlong at openjdk.org Thu Feb 8 09:21:00 2024 From: dlong at openjdk.org (Dean Long) Date: Thu, 8 Feb 2024 09:21:00 GMT Subject: RFR: 8322630: Remove ICStubs and related safepoints [v6] In-Reply-To: References:

Message-ID: On Tue, 30 Jan 2024 09:08:01 GMT, Erik ?sterlund wrote: >> ICStubs solve an atomicity problem when setting both the destination and data of an inline cache. Unfortunately, it also leads to occasional safepoint carpets when multiple threads need to ICRefill the stubs at the same time, and spurious GuaranteedSafepointInterval "Cleanup" safepoints every second. This patch changes inline caches to not change the data part at all during the nmethod life cycle, hence removing the need for ICStubs. >> >> The new scheme is less stateful. Instead of adding and removing callsite metadata back and forth when transitioning inline cache states, it installs all state any shape of call will ever need at resolution time in a struct that I call CompiledICData. This reduces inline cache state changes to simply changing the destination of the call, and it doesn't really matter what state transitions to what other state. >> >> With this patch, we get rid of ICStub and ICBuffer classes and the related ICRefill and almost all Cleanup safepoints in practice. It also makes the inline cache code much simpler. >> >> I have tested the changes from tier1-7, and run through full aurora performance tests. > > Erik ?sterlund has updated the pull request incrementally with one additional commit since the last revision: > > ARM32 fixes src/hotspot/cpu/aarch64/aarch64.ad line 2224: > 2222: // This is the unverified entry point. > 2223: C2_MacroAssembler _masm(&cbuf); > 2224: __ ic_check(CodeEntryAlignment); I'm not sure we want to increase the alignement to CodeEntryAlignment here. I believe C2 already aligns the root block to CodeEntryAlignment. @theRealAph, what do you think? src/hotspot/share/opto/output.cpp line 3416: > 3414: } else { > 3415: if (!target->is_static()) { > 3416: _code_offsets.set_value(CodeOffsets::Entry, _first_block_size - MacroAssembler::ic_check_size()); This looks tricky. I think it means CodeOffsets::Entry starts after the alignment padding NOPs. If that's true then the `ic_check` functions could use a comment explaining that alignment needs to come first, not last. A comment here wouldn't hurt either. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/17495#discussion_r1482646992 PR Review Comment: https://git.openjdk.org/jdk/pull/17495#discussion_r1482643531 From dlong at openjdk.org Thu Feb 8 09:23:57 2024 From: dlong at openjdk.org (Dean Long) Date: Thu, 8 Feb 2024 09:23:57 GMT Subject: RFR: 8322630: Remove ICStubs and related safepoints [v6] In-Reply-To: References:

Message-ID: