JEP 248: Make G1 the Default Garbage Collector
Staffan Friberg
staffan.friberg at oracle.com
Thu Jun 25 17:10:51 UTC 2015
Hi,
As JEP 248 moved to target, we would like to expand on the testing we
have done. We want to ask the community to help us make sure we get the
best data points. These data points will be used to evaluate if there
are critical issues for G1 that can not be addressed in JDK 9 time frame.
Switching to G1 as the default GC in JDK 9 will in the end come down to
a judgment call with the data collected at the decision time.
We would like to get your feedback on all experiences, good, neutral and
bad. Usually bad experiences is what generates blogs and emails, the
other data points are not always easy to get.
*
**Metrics*
The following we see as the key metrics to gather data around to
determine the impact of the switch. Not all metrics will be interesting
for every application, but a default collector needs to behave well in
them as a whole.
* *Startup Time *- VM Initialization
o G1 is a more complex GC than Parallel GC and requires a bit more
infrastructure to be configured to enable concurrent marking and
mixed collections.
o This is most critical for short lived applications (such as
simple command line tools or build tools), where the execution
time is largely impacted by the time it takes to initialize the VM.
o *Goal:* The VM initialization time must not have a major impact
the run time of short running applications
* *Throughput* - Code Performance
o The pure application performance should not be too adversely
affected by which GC is used.
o Due to the heavier write barriers used in G1 there may be a
performance impact. But the impact varies for different
applications.
o *Goal:* The performance is good enough so the general
recommendation does not end up being to set the Parallel GC as
part of application launch scripts.
* *Footprint *- Total Memory Usage
o G1 is a more complex GC than Parallel GC and requires a bit more
infrastructure to be configured to enable concurrent marking and
mixed collections.
o This also ties into the Ergonomics metric below in that the
Ergonomics need to keep the memory usage balanced by not
expanding the heap too aggressively.
o *Goal*: The total memory usage of the Java process must be kept
reasonable.
* *Ergonomics *- GC Behavior and Pauses
o A default GC need to be able to handle a reasonably large set of
different types of applications Out-Of-The-Box without extensive
tuning.
+ A user should be able to just run their Java application,
potentially just setting the heap size and pause time goal,
and have it behave well.
+ The GC needs to be able size the heap and its sub components
using dynamic ergonomics.
o The default GC needs to be able to avoid long costly Full
collections by using Ergonomics to self tune depending on the
workload.
o *Goal*: The GC Ergonomics must be able to handle common
applications without causing performance overhead or Full GCs
*Applications*
We are focusing on general Java applications, and not on applications
that are very dependent on extreme GC tuning. Applications not
considered a target for this switch includes applications requiring low
latency with milliseconds, micro or no-pause requirements, nor very
large heap applications that use multiple of 10s or 100s of GB of heap.
The focus is on applications that are normally not tuned, either because
there has never been a need to do so or they are used by people not
accustomed to GC tuning.
* *Unaffected Applications*
o Client VM Application: The Client VM will continue to use the
Serial Collector, and applications such as Applets and regular
Java applications using the 32bit JRE on Windows with only the
client VM would not be affected.
o "Tuned Applications": Any application that specifically sets
the GC will not be impacted by this change.
* *Affected Applications*
o Command Line Tools
+ Command line tools are often batch type jobs and are often
fairly short running, so the GC pauses are of less concern
as long as the execution is finished as soon as possible.
Example tools are the Java Compiler, Scala Compiler, Maven
and Gradle. The key metrics are Startup Time and Ergonomics
to make sure the tool starts fast and that the GC can
quickly adapt to the behavior of the application. For semi
long running compilations the Throughput will become a
factor as well.
o UI Applications
+ UI applications often don't set a specific GC, they might
tune the heap size depending on the requirements of the
application. End users of UI:s most likely don't know how to
tune the VM, or the fact that the application is written in
Java. As an example NetBeans by default only increases the
minimum heap size as the only GC related tuning and let the
ergonomics handle the rest. As a data point here, Java
Mission Control is configured to run with G1 without issues
so far. For most desktop UI application pure throughput is
less critical. What tends to most noticeable is stalls due
to GC pauses, so Ergonomics is probably the most important
metric, together with Footprint as you do not want you
application to use up all the memory on you desktop/laptop.
o Applications using the Default GC
+ This is the area where we need the most help. We have
internal workloads and benchmarks we can run, but that set
will always be infinitely small compared to what is used
across the Java community. This is also the reason why we
enable this early to expand the amount of testing and
feedback we are able to get. For these applications all the
metrics apply, but their importance will vary from workload
to workload.
We would like to encourage and collaborate on further community
polls and surveys on this matter, to gain more valuable insights on
questions such as:
* Do you specify GC on the command line?
* Are you tuning GC, but do not specify UseParallelGC since it was
the default anyway?
* Do you specify the minimum and/or maximum heap size and nursery size
* Is this a distributed end-user application or a server application?
Regards,
Staffan and Jenny
Java SE Performance
On 06/05/2015 10:11 AM, Ben Evans wrote:
> Hi,
>
> I like the plan, but this statement is really quite bullish,
> especially (IMO) on the available evidence.
>
> G1 was always marketed as the replacement for CMS. In field experience
> so far, that hasn't happened, and I feel that we are glossing over the
> fact that we are now thinking of G1 as a replacement for Parallel.
>
> Certainly none of the testing I've done has been about Parallel -> G1
> - it's all been CMS vs G1. I will be happy to start working with
> clients to try to cover the Parallel vs G1 space, but that hasn't
> happened up until now.
>
> Neither do I think we should be under any illusions that a very large
> number of installs are going to be affected.
>
> It may not be all that representative, but here are the results of a
> quick community straw poll I ran over the last couple of days:
>
> A single question: "Which Garbage Collector Does Your Application Use?
> " yielded these results:
>
> Concurrent Mark & Sweep (CMS)
> 23.94%
> 85
> –
> Garbage First (G1GC)
> 10.70%
> 38
> –
> Parallel (because I explicitly set it)
> 5.07%
> 18
> –
> Default (this actually gives you Parallel)
> 39.15%
> 139
> –
> I Don't Know
> 21.13%
> 75
>
> Total: 355
>
> The clear, stand-out winner is Default, with ~40% of installs using
> this, and therefore being exposed to the proposed change. That makes
> me very nervous.
>
> So, whilst I'm not saying we shouldn't do this, and I know that
> community members, including Kirk, myself & the other jClarity folks
> will help to get some better data, I'd argue that we're still a long
> way from being certain that we're ready for this change.
>
> Thanks,
>
> Ben
>
>
> On Fri, Jun 5, 2015 at 1:12 PM, Stefan Johansson
> <stefan.johansson at oracle.com> wrote:
>> On 2015-06-05 00:08, mark.reinhold at oracle.com wrote:
>>> 2015/6/4 6:44 -0700, charlie.hunt at oracle.com:
>>>> Wanted to come back to this thread, continue the dialog, reiterate the
>>>> objective, (try to) summarize the concerns and put forth a potential
>>>> plan for this JEP going forward.
>>>>
>>>> Intent: Use G1 GC as the default collector chosen by the JVM when no
>>>> GC is explicitly set at the JVM command line.
>>>>
>>>> ...
>>> Charlie -- thanks for the excellent summary of this wide-ranging
>>> discussion!
>>>
>>>> ...
>>>>
>>>> Suggested plan for moving forward:
>>>> - Make G1 the default collector in JDK 9, continue to evaluate G1 and
>>>> enhance G1 in JDK 9
>>>> - Mitigate risk by reverting back to Parallel GC before JDK 9 goes
>>>> “Generally Available” (Sept 22, 2016 [1]) if warranted by continuing
>>>> to monitor observations and experiences with G1 in both JDK 9
>>>> pre-releases and latest JDK 8 update releases
>>>> - Address enhancing ergonomics for selecting a default GC as a
>>>> separate JEP if future observations suggests its needed
>>> I think this is a fine plan.
>>>
>>> Stefan -- To move forward with JEP 248, could you please revise the
>>> second item in the "Risks and Assumptions" section to note that there
>>> is some concern that G1 might not be ready to become the default, that
>>> making it the default now will allow us to get more feedback on it, and
>>> that if it proves to be not ready then we'll revert the default to the
>>> Parallel GC in time for JDK 9 GA?
>> Mark, I've extended second item as follows:
>> - G1 is seen as a robust and well-tested collector. It is not expected
>> to have stability problems, but becoming the default collector will
>> increase its visibility and may reveal previously-unknown issues. If
>> a critical issue is found that can't be addressed in the JDK 9 time
>> frame, we will revert back to use Parallel GC as the default for the
>> JDK 9 GA.
>>
>> Thanks,
>> Stefan
>>
>>
>>> Ben -- Can you live with this plan?
>>>
>>> - Mark
>>
>
>
More information about the hotspot-dev
mailing list