RFR: 8170307: Stack size option -Xss is ignored

Thu Dec 15 10:47:27 UTC 2016

David,

I kind of late at for review but the fix looks good to me.

Nit - no need to cast to intptr_t:

   1087 uintptr_t stack_top;
   . . .
1122 intptr_t(&rlim) < intptr_t(stack_top);

   But you probably added the cast intentionally to underline the values 
are of the same type.

Thanks,
Serguei

On 12/14/16 13:06, David Holmes wrote:
> Can I get a second review please!
>
> Thanks,
> David
>
> On 14/12/2016 3:49 PM, David Holmes wrote:
>> Hi Dan,
>>
>> Thanks for the re-review. I apologize for losing the edits you
>> previously suggested.
>>
>> More inline ...
>>
>> On 14/12/2016 3:12 AM, Daniel D. Daugherty wrote:
>>> On 12/12/16 9:41 PM, David Holmes wrote:
>>>> Okay here's the updated webrev complete with nice logging:
>>>>
>>>> http://cr.openjdk.java.net/~dholmes/8170307/webrev.v2/
>>>
>>> src/os/linux/vm/os_linux.cpp
>>>     L936:   // a user-specified value known to be greater than the
>>> minimum needed.
>>>         Perhaps: ... known to be at least the minimum needed.
>>
>> Changed.
>>
>>>     L932:   // can not do anything to emulate a larger stack than what
>>> has been provided by
>>>         Typo: 'can not' -> 'cannot'
>>
>> Changed.
>>
>>>     L936:   // Mamimum stack size is the easy part, get it from
>>> RLIMIT_STACK
>>>         Typo: 'Mamimum' -> 'Maximum'
>>>         nit - please add a '.' to the end.
>>
>> Fixed.
>>
>>>
>>>     L1125:                          SIZE_FORMAT "K, top=" INTPTR_FORMAT
>>> ", bottom=" INTPTR_FORMAT "\n",
>>>         Does the logging subsystem convert the "\n" into the proper
>>>         output for non-*NIX platforms, e.g., Windows?
>>
>> No idea :) But that was leftover from when this was a ::printf (I wasn't
>> sure logging would work this early in VM init - but it does).
>>
>> Removed.
>>
>>
>>>     L1126: primordial ? "Primordial" : "User",
>>> max_size/K,  _initial_thread_stack_size/K,
>>>         Please add spaces around the div operator.
>>
>> Changed.
>>
>>>         Any particular reason that "Primordial" and "User" start with
>>> upper case?
>>
>> They used to be the first things printed. :) Fixed.
>>
>>> Thumbs up!
>>>
>>> I don't need to see a new webrev if you decide to make the
>>> minor edits above.
>>
>> Updated in place for the second reviewer (whomever they may be).
>>
>> Thanks,
>> David
>> -----
>>
>>> Dan
>>>
>>>
>>>>
>>>> The stack size will be the smaller of the rlimit stack and the
>>>> -Xss/ThreadStackSize value. If the rlimit stack is unlimited and
>>>> ThreadStackSize==0 then we clamp it at 8MB as we do on Solaris. So you
>>>> can now get whatever primordial thread stack size you want by using
>>>> ulimit and -Xss appropriately.
>>>>
>>>> Thanks,
>>>> David
>>>>
>>>> On 3/12/2016 2:11 PM, David Holmes wrote:
>>>>> On 3/12/2016 9:12 AM, Daniel D. Daugherty wrote:
>>>>>> On 12/1/16 10:51 PM, David Holmes wrote:
>>>>>>> Investigating this further the history is quite complex, especially
>>>>>>> when we start looking at other platforms. E.g. see
>>>>>>>
>>>>>>> https://bugs.openjdk.java.net/browse/JDK-6269555
>>>>>>>
>>>>>>> Solaris actually hard-wires an 8MB limit for the primordial thread.
>>>>>>>
>>>>>>> I'm very tempted to do the same on Linux.
>>>>>>
>>>>>> Vote: yes
>>>>>
>>>>> Excellent! Other votes?
>>>>>
>>>>>> This latest problem only comes up with -XX:ThreadStackSize=0 when 
>>>>>> the
>>>>>> stack is unlimited right?
>>>>>
>>>>> Right.
>>>>>
>>>>>> When -XX:ThreadStackSize=0 is specified, is taking the smaller of
>>>>>> 8MB or the ulimit a viable option?
>>>>>
>>>>> I think so.
>>>>>
>>>>>> Also, it looks like Hui had some things to say about not setting the
>>>>>> red/yellow zone pages on the primordial thread when we aren't using
>>>>>> the
>>>>>> 'java' launcher because we don't know the environment of the code 
>>>>>> that
>>>>>> is using the JNI invocation API...
>>>>>
>>>>> Yeah but those comments seem a bit confused to me. They suggest we
>>>>> shouldn't add guard pages but in fact we do add guard pages. And 
>>>>> to me
>>>>> it is no different in the primordial thread than any other natively
>>>>> attached thread ie why should the initially attached thread be 
>>>>> treated
>>>>> differently to any other?** I suspect if I keep researching on this I
>>>>> will find bugs regarding such differences in behaviour (eg the fact
>>>>> that
>>>>> -Xss wasn't working on the main thread).
>>>>>
>>>>> ** There are arguments both ways as to how natively attached threads
>>>>> should behave. The main argument against guard page insertion is
>>>>> that we
>>>>> don't know how far down the existing stack we actually are - we
>>>>> could be
>>>>> past the depth where the guard page would be inserted! The main
>>>>> argument
>>>>> for (which seems to have won the day) is so that we don't get 
>>>>> arbitrary
>>>>> differences in behaviour between threads created and attached by
>>>>> application native code; and threads created direct from application
>>>>> Java code.
>>>>>
>>>>> Anyway, simply upping the 2M limit on Linux to 8M seems a simple
>>>>> solution - assuming it addresses the needs of the folk that ran into
>>>>> this problem.
>>>>>
>>>>> Thanks,
>>>>> David
>>>>>
>>>>>> Dan
>>>>>>
>>>>>>
>>>>>>>
>>>>>>> David
>>>>>>> -----
>>>>>>>
>>>>>>> On 30/11/2016 6:46 PM, David Holmes wrote:
>>>>>>>> On 30/11/2016 6:17 PM, Thomas Stüfe wrote:
>>>>>>>>> On Wed, Nov 30, 2016 at 8:35 AM, David Holmes
>>>>>>>>> <david.holmes at oracle.com
>>>>>>>>> <mailto:david.holmes at oracle.com>> wrote:
>>>>>>>>>
>>>>>>>>>     On 29/11/2016 10:25 PM, David Holmes wrote:
>>>>>>>>>
>>>>>>>>>         I just realized I overlooked the case where
>>>>>>>>> ThreadStackSize=0
>>>>>>>>>         and the
>>>>>>>>>         stack is unlimited. In that case it isn't clear where the
>>>>>>>>> guard
>>>>>>>>>         pages
>>>>>>>>>         will get inserted - I do know that I don't get a
>>>>>>>>> stackoverflow
>>>>>>>>>         error.
>>>>>>>>>
>>>>>>>>>         This needs further investigation.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>     So what happens here is that the massive stack-size causes
>>>>>>>>>     stack-bottom to be higher than stack-top! So we will set a
>>>>>>>>>     guard-page goodness knows where, and we can consume the 
>>>>>>>>> current
>>>>>>>>>     stack until such time as we hit an unmapped or protected
>>>>>>>>> region at
>>>>>>>>>     which point we are killed.
>>>>>>>>>
>>>>>>>>>     I'm not sure what to do here. My gut feel is that in such a
>>>>>>>>> case we
>>>>>>>>>     should not attempt to create a guard page in the initial
>>>>>>>>> thread.
>>>>>>>>>     That would require using a sentinel value for the stack-size.
>>>>>>>>> Though
>>>>>>>>>     it also presents a problem for stack-bottom - which is
>>>>>>>>> implicitly
>>>>>>>>>     zero. It may also give false positives in the
>>>>>>>>> is_initial_thread()
>>>>>>>>> check!
>>>>>>>>>
>>>>>>>>>     Thoughts? Suggestions?
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> Maybe I am overlooking something, but should
>>>>>>>>> os::capture_initial_thread() not call pthread_getattr_np() 
>>>>>>>>> first to
>>>>>>>>> handle the case where the VM was created on a pthread which is
>>>>>>>>> not the
>>>>>>>>> primordial thread and may have a different stack size than what
>>>>>>>>> getrlimit returns? And fall back to getrlimit only if
>>>>>>>>> pthread_getattr_np() fails?
>>>>>>>>
>>>>>>>> My understanding of the problem (which likely no longer exists) is
>>>>>>>> that
>>>>>>>> pthread_getattr_np didn't fail as such but returned bogus values
>>>>>>>> - so
>>>>>>>> the problem was not detectable and so we just had to not use
>>>>>>>> pthread_getattr_np.
>>>>>>>>
>>>>>>>>> And then we also should handle
>>>>>>>>> RLIM_INFINITY. For that case, I also think not setting guard 
>>>>>>>>> pages
>>>>>>>>> would
>>>>>>>>> be safest.
>>>>>>>>>
>>>>>>>>> We also may just refuse to run in that case, because the 
>>>>>>>>> workaround
>>>>>>>>> for
>>>>>>>>> the user is easy - just set the limit before process start. Note
>>>>>>>>> that on
>>>>>>>>> AIX, we currently refuse to run on the primordial thread 
>>>>>>>>> because it
>>>>>>>>> may
>>>>>>>>> have different page sizes than pthreads and it is impossible 
>>>>>>>>> to get
>>>>>>>>> the
>>>>>>>>> exact stack locations.
>>>>>>>>
>>>>>>>> I was wondering why the AIX set up seemed so simple in 
>>>>>>>> comparison :)
>>>>>>>>
>>>>>>>> Thanks,
>>>>>>>> David
>>>>>>>>
>>>>>>>>>
>>>>>>>>> Thomas
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>         David
>>>>>>>>>
>>>>>>>>>         On 29/11/2016 9:59 PM, David Holmes wrote:
>>>>>>>>>
>>>>>>>>>             Hi Thomas,
>>>>>>>>>
>>>>>>>>>             On 29/11/2016 8:39 PM, Thomas Stüfe wrote:
>>>>>>>>>
>>>>>>>>>                 Hi David,
>>>>>>>>>
>>>>>>>>>                 thanks for the good explanation. Change looks
>>>>>>>>> good, I
>>>>>>>>>                 really like the
>>>>>>>>>                 comment in capture_initial_stack().
>>>>>>>>>
>>>>>>>>>                 Question, with -Xss given and being smaller than
>>>>>>>>> current
>>>>>>>>>                 thread stack
>>>>>>>>>                 size, guard pages may appear in the middle of the
>>>>>>>>>                 invoking thread stack?
>>>>>>>>>                 I always thought this is a bit dangerous. If your
>>>>>>>>> model
>>>>>>>>>                 is to have the
>>>>>>>>>                 VM created from the main thread, which then goes
>>>>>>>>> off to
>>>>>>>>>                 do different
>>>>>>>>>                 things, and have other threads then attach and 
>>>>>>>>> run
>>>>>>>>> java
>>>>>>>>>                 code, main
>>>>>>>>>                 thread later may crash in unrelated native code
>>>>>>>>> just
>>>>>>>>>                 because it reached
>>>>>>>>>                 the stack depth of the hava threads? Or am I
>>>>>>>>>                 misunderstanding something?
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>             There is no change to the general behaviour other 
>>>>>>>>> than
>>>>>>>>>             allowing a
>>>>>>>>>             primordial process thread that launches the VM, to
>>>>>>>>> now not
>>>>>>>>>             have an
>>>>>>>>>             effective stack limited at 2MB. The current logic 
>>>>>>>>> will
>>>>>>>>>             insert guard
>>>>>>>>>             pages where ever -Xss states (as long as less than 
>>>>>>>>> 2MB
>>>>>>>>> else
>>>>>>>>>             2MB), while
>>>>>>>>>             with the fix the guard pages will be inserted 
>>>>>>>>> above 2MB
>>>>>>>>> - as
>>>>>>>>>             dictated by
>>>>>>>>>             -Xss.
>>>>>>>>>
>>>>>>>>>             David
>>>>>>>>>             -----
>>>>>>>>>
>>>>>>>>>                 Thanks, Thomas
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>                 On Fri, Nov 25, 2016 at 11:38 AM, David Holmes
>>>>>>>>>                 <david.holmes at oracle.com
>>>>>>>>> <mailto:david.holmes at oracle.com>
>>>>>>>>>                 <mailto:david.holmes at oracle.com
>>>>>>>>> <mailto:david.holmes at oracle.com>>> wrote:
>>>>>>>>>
>>>>>>>>>                     Bug:
>>>>>>>>> https://bugs.openjdk.java.net/browse/JDK-8170307
>>>>>>>>> <https://bugs.openjdk.java.net/browse/JDK-8170307>
>>>>>>>>> <https://bugs.openjdk.java.net/browse/JDK-8170307
>>>>>>>>> <https://bugs.openjdk.java.net/browse/JDK-8170307>>
>>>>>>>>>
>>>>>>>>>                     The bug is not public unfortunately for
>>>>>>>>>                 non-technical reasons - but
>>>>>>>>>                     see my eval below.
>>>>>>>>>
>>>>>>>>>                     Background: if you load the JVM from the
>>>>>>>>> primordial
>>>>>>>>>                 thread of a
>>>>>>>>>                     process (not done by the java launcher since
>>>>>>>>> JDK
>>>>>>>>> 6),
>>>>>>>>>                 there is an
>>>>>>>>>                     artificial stack limit imposed on the initial
>>>>>>>>> thread
>>>>>>>>>                 (by sticking
>>>>>>>>>                     the guard page at the limit position of the
>>>>>>>>> actual
>>>>>>>>>                 stack) of the
>>>>>>>>>                     minimum of the -Xss setting and 2M. So if you
>>>>>>>>> set
>>>>>>>>>                 -Xss to > 2M it is
>>>>>>>>>                     ignored for the main thread even if the true
>>>>>>>>> stack
>>>>>>>>>                 is, say, 8M. This
>>>>>>>>>                     limitation dates back 10-15 years and is no
>>>>>>>>> longer
>>>>>>>>>                 relevant today
>>>>>>>>>                     and should be removed (see below). I've also
>>>>>>>>> added
>>>>>>>>>                 additional
>>>>>>>>>                     explanatory notes.
>>>>>>>>>
>>>>>>>>>                     webrev:
>>>>>>>>> http://cr.openjdk.java.net/~dholmes/8170307/webrev/
>>>>>>>>> <http://cr.openjdk.java.net/~dholmes/8170307/webrev/>
>>>>>>>>> <http://cr.openjdk.java.net/~dholmes/8170307/webrev/
>>>>>>>>> <http://cr.openjdk.java.net/~dholmes/8170307/webrev/>>
>>>>>>>>>
>>>>>>>>>                     Testing was manually done by modifying the
>>>>>>>>> launcher
>>>>>>>>>                 to not run the
>>>>>>>>>                     VM in a new thread, and checking the 
>>>>>>>>> resulting
>>>>>>>>> stack
>>>>>>>>>                 size used.
>>>>>>>>>
>>>>>>>>>                     This change will only affect hosted JVMs
>>>>>>>>> launched
>>>>>>>>>                 with a -Xss value
>>>>>>>>>                     > 2M.
>>>>>>>>>
>>>>>>>>>                     Thanks,
>>>>>>>>>                     David
>>>>>>>>>                     -----
>>>>>>>>>
>>>>>>>>>                     Bug eval:
>>>>>>>>>
>>>>>>>>>                     JDK-4441425 limits the stack to 8M as a
>>>>>>>>> safeguard
>>>>>>>>>                 against an
>>>>>>>>>                     unlimited value from getrlimit in 1.3.1, but
>>>>>>>>> further
>>>>>>>>>                 constrained
>>>>>>>>>                     that to 2M in 1.4.0 due to JDK-4466587.
>>>>>>>>>
>>>>>>>>>                     By 1.4.2 we have the basic form of the 
>>>>>>>>> current
>>>>>>>>>                 problematic code:
>>>>>>>>>
>>>>>>>>>                     #ifndef IA64
>>>>>>>>>                       if (rlim.rlim_cur > 2 * K * K)
>>>>>>>>> rlim.rlim_cur =
>>>>>>>>> 2 *
>>>>>>>>>                 K * K;
>>>>>>>>>                     #else
>>>>>>>>>                       // Problem still exists RH7.2 (IA64 anyway)
>>>>>>>>> but
>>>>>>>>>                 2MB is a little
>>>>>>>>>                 small
>>>>>>>>>                       if (rlim.rlim_cur > 4 * K * K)
>>>>>>>>> rlim.rlim_cur =
>>>>>>>>> 4 *
>>>>>>>>>                 K * K;
>>>>>>>>>                     #endif
>>>>>>>>>
>>>>>>>>>                       _initial_thread_stack_size = 
>>>>>>>>> rlim.rlim_cur &
>>>>>>>>>                 ~(page_size() - 1);
>>>>>>>>>
>>>>>>>>>                       if (max_size && 
>>>>>>>>> _initial_thread_stack_size >
>>>>>>>>>                 max_size) {
>>>>>>>>> _initial_thread_stack_size = max_size;
>>>>>>>>>                       }
>>>>>>>>>
>>>>>>>>>                     This was added by JDK-4678676 to allow the
>>>>>>>>> stack of
>>>>>>>>>                 the main thread
>>>>>>>>>                     to be _reduced_ below the default 2M/4M if 
>>>>>>>>> the
>>>>>>>>> -Xss
>>>>>>>>>                 value was
>>>>>>>>>                     smaller than that.** There was no intent to
>>>>>>>>> allow
>>>>>>>>>                 the stack size to
>>>>>>>>>                     follow -Xss arbitrarily due to the 
>>>>>>>>> operational
>>>>>>>>>                 constraints imposed
>>>>>>>>>                     by the OS/glibc at the time when dealing with
>>>>>>>>> the
>>>>>>>>>                 primordial process
>>>>>>>>>                     thread.
>>>>>>>>>
>>>>>>>>>                     ** It could not actually change the actual
>>>>>>>>> stack
>>>>>>>>>                 size of course, but
>>>>>>>>>                     set the guard pages to limit use to the
>>>>>>>>> expected
>>>>>>>>>                 stack size.
>>>>>>>>>
>>>>>>>>>                     In JDK 6, under JDK-6316197, the launcher was
>>>>>>>>>                 changed to create the
>>>>>>>>>                     JVM in a new thread, so that it was not
>>>>>>>>> limited by
>>>>>>>>> the
>>>>>>>>>                     idiosyncracies of the OS or thread library
>>>>>>>>>                 primordial thread
>>>>>>>>>                     handling. However, the stack size limitations
>>>>>>>>>                 remained in place in
>>>>>>>>>                     case the VM was launched from the primordial
>>>>>>>>> thread
>>>>>>>>>                 of a user
>>>>>>>>>                     application via the JNI invocation API.
>>>>>>>>>
>>>>>>>>>                     I believe it should be safe to remove the 2M
>>>>>>>>>                 limitation now.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>
>>>