From david.holmes at oracle.com Mon Jan 2 21:41:20 2012 From: david.holmes at oracle.com (David Holmes) Date: Tue, 03 Jan 2012 15:41:20 +1000 Subject: RFR (S): 7125793: MAC: test_gamma should always work In-Reply-To: <4EFF3BC6.3060407@oracle.com> References: <4EFF3BC6.3060407@oracle.com> Message-ID: <4F029500.4080407@oracle.com> Hi Jim, I'm getting increasingly concerned about platform specific code being in, or being added to, what is nominally shared code, so I'd like to see if we can reduce some of that: src/os/posix/launcher/java_md.c Instead of the ifdef APPLE can we not factor out a JRE_LIB_PATH (or something like that) that is set to jre/lib or jre/lib/ as appropriate by the platform specific build files? This would also allow the new case /* Is the JRE universal, i.e. no arch dir? */ to be handled by the existing code. --- src/os/bsd/vm/os_bsd.cpp Similar comment - use JRE_LIB_PATH instead of jre/lib/%s etc --- David ----- On 1/01/2012 2:43 AM, James Melvin wrote: > Hi, > > This change fixes the 'gamma' simple launcher for HotSpot on Mac OS X. > There were 3 primary changes required to re-enable gamma... > > 1) Statically link with CoreFoundation framework to resolve symbols > > The gamma launcher dlopen()s libjava.dylib from $JAVA_HOME/jre/lib. > Because Mac OS X files are case-insensitive by default, we collide on > $FRAMEWORK/libJPEG.dylib and ${JAVA_HOME}/jre/lib/libjpeg.dylib. This > resulted in unresolved symbols in the Mac OS X framework libraries. The > solution for gamma was to statically link with CoreFoundation framework > to properly resolve framework symbols and allow gamma to successfully > dlopen() libjava.dylib. > > 2) Adjust various paths to reflect no arch subdirs > > On Mac OS X, there are no arch subdirs, e.g jre/lib vs jre/lib/. > Instead, one can use universal binaries to package multiple > architectures in a single binary. At the moment though, we are only > building 64-bit non-universal binaries. Note, the test_gamma script > assumes an Oracle JDK layout for JAVA_HOME, derived from ALT_BOOTDIR. > Using an Apple JDK for ALT_BOOTDIR will fail the test_gamma script > gracefully, as libjava.dylib is in a different, unexpected place. > > 3) Modify test_gamma script to set library path only for gamma launch > > Setting DYLD_LIBRARY_PATH adversely affects the real java launcher(s). > Instead, set this later in the script only for the gamma launcher test > run. While in there, I took the liberty of decrypting the script to make > it more maintainable and more easily merged whenever we reconcile the > unix ports into a single codebase. There is no change to the script output. > > Feedback welcome... > > WEBREV: > http://cr.openjdk.java.net/~jmelvin/7125793/webrev.00 > > TESTS RUN: > JPRT 2011-12-31-061123.jmelvin.7125793 > local Mac OS X builds/tests > > > Thanks and Happy New Year! > > Jim From daniel.daugherty at oracle.com Tue Jan 3 10:39:13 2012 From: daniel.daugherty at oracle.com (Daniel D. Daugherty) Date: Tue, 03 Jan 2012 11:39:13 -0700 Subject: RFR (S): 7125793: MAC: test_gamma should always work In-Reply-To: <4EFECA5D.6010905@oracle.com> References: <4EFECA5D.6010905@oracle.com> Message-ID: <4F034B51.3070609@oracle.com> > http://cr.openjdk.java.net/~jmelvin/7125793/webrev.00 Jim, Thanks for diving in and improving the MacOS X port! Comments below. Dan make/bsd/makefiles/buildtree.make line 422: The new 'java -fullversion' invocation does not include the $(JAVA_FLAG) option like the old code did. Any particular reason for the change? Looks like that means the '-d32' or '-d64' options won't be specified as they were before. line 447: Why not just echo FULL_VERSION? Why pipe to awk? line 465: The 'jre/lib/libjava.dylib' part of the new check is MacOS X specific. Other BSDs don't necessarily use the '.dylib' extension (instead of .so) and I don't think that other BSDs have dropped the "arch" subdir. line 484: The DYLD_LIBRARY_PATH part is MacOS X specific. Will still need to set LD_LIBRARY_PATH for other BSDs. line 492: You switched from $(TESTFLAGS) to literal flag values, but you left the TESTFLAGS variable around. Any reason for the switch? make/bsd/makefiles/launcher.make Please add a comment explaining why '-framework CoreFoundation' is needed. Your explanatory block below is a really good start. make/bsd/makefiles/vm.make No comments. src/os/bsd/vm/os_bsd.cpp line 2585: Uses a suffix of ".so". That shouldn't work on MacOS X since MacOS X uses '.dylib'. That's OK for other BSDs, but not MacOS X. Also the comments that mention '.so' should be updated to include '.dylib' (not caused by your changes). To David H. - Yes, this change added another '#fdef __APPLE__'. It is not the first and it likely won't be the last since we're not done yet with the MacOS X port. There are a number of things that need to be cleaned up and we're tracking them. However, as you know, we don't have enough folks to handle all of the work so we'll just have to live with the warts for now. src/os/posix/launcher/java_md.c No comments. On 12/31/11 1:39 AM, James Melvin wrote: > Hi, > > This change fixes the 'gamma' simple launcher for HotSpot on Mac OS X. > There were 3 primary changes required to re-enable gamma... > > 1) Statically link with CoreFoundation framework to resolve symbols > > The gamma launcher dlopen()s libjava.dylib from $JAVA_HOME/jre/lib. > Because Mac OS X files are case-insensitive by default, we collide on > $FRAMEWORK/libJPEG.dylib and ${JAVA_HOME}/jre/lib/libjpeg.dylib. This > resulted in unresolved symbols in the Mac OS X framework libraries. The > solution for gamma was to statically link with CoreFoundation framework > to properly resolve framework symbols and allow gamma to successfully > dlopen() libjava.dylib. > > 2) Adjust various paths to reflect no arch subdirs > > On Mac OS X, there are no arch subdirs, e.g jre/lib vs jre/lib/. > Instead, one can use universal binaries to package multiple > architectures in a single binary. At the moment though, we are only > building 64-bit non-universal binaries. Note, the test_gamma script > assumes an Oracle JDK layout for JAVA_HOME, derived from ALT_BOOTDIR. > Using an Apple JDK for ALT_BOOTDIR will fail the test_gamma script > gracefully, as libjava.dylib is in a different, unexpected place. > > 3) Modify test_gamma script to set library path only for gamma launch > > Setting DYLD_LIBRARY_PATH adversely affects the real java launcher(s). > Instead, set this later in the script only for the gamma launcher test > run. While in there, I took the liberty of decrypting the script to make > it more maintainable and more easily merged whenever we reconcile the > unix ports into a single codebase. There is no change to the script > output. > > Feedback welcome... > > WEBREV: > http://cr.openjdk.java.net/~jmelvin/7125793/webrev.00 > > TESTS RUN: > JPRT 2011-12-31-061123.jmelvin.7125793 > local Mac OS X builds/tests > > > Thanks and Happy New Year! > > Jim From kelly.ohair at oracle.com Tue Jan 3 11:23:26 2012 From: kelly.ohair at oracle.com (Kelly O'Hair) Date: Tue, 3 Jan 2012 11:23:26 -0800 Subject: RFR (S): 7125793: MAC: test_gamma should always work In-Reply-To: <4F029500.4080407@oracle.com> References: <4EFF3BC6.3060407@oracle.com> <4F029500.4080407@oracle.com> Message-ID: <462344E2-8F4D-46B2-8BF2-473DBCBC2C1A@oracle.com> I was having similar concerns, but wasn't able to come up with a clean alternative at the time. Having a ARCH_LIB_PATH_APPEND and a ARCH_BIN_PATH_APPEND might be better, (I think bin and lib might use different names sometimes? Or they did once upon a time? bin/v9 lib/sparcv9?) Anyway, I'd like to see the proprietary names left out. Or minimized. Or isolated to the Makefiles? -kto On Jan 2, 2012, at 9:41 PM, David Holmes wrote: > Hi Jim, > > I'm getting increasingly concerned about platform specific code being in, or being added to, what is nominally shared code, so I'd like to see if we can reduce some of that: > > src/os/posix/launcher/java_md.c > > Instead of the ifdef APPLE can we not factor out a JRE_LIB_PATH (or something like that) that is set to jre/lib or jre/lib/ as appropriate by the platform specific build files? This would also allow the new case > > /* Is the JRE universal, i.e. no arch dir? */ > > to be handled by the existing code. > > --- > > src/os/bsd/vm/os_bsd.cpp > > Similar comment - use JRE_LIB_PATH instead of jre/lib/%s etc > > --- > > David > ----- > > > > On 1/01/2012 2:43 AM, James Melvin wrote: >> Hi, >> >> This change fixes the 'gamma' simple launcher for HotSpot on Mac OS X. >> There were 3 primary changes required to re-enable gamma... >> >> 1) Statically link with CoreFoundation framework to resolve symbols >> >> The gamma launcher dlopen()s libjava.dylib from $JAVA_HOME/jre/lib. >> Because Mac OS X files are case-insensitive by default, we collide on >> $FRAMEWORK/libJPEG.dylib and ${JAVA_HOME}/jre/lib/libjpeg.dylib. This >> resulted in unresolved symbols in the Mac OS X framework libraries. The >> solution for gamma was to statically link with CoreFoundation framework >> to properly resolve framework symbols and allow gamma to successfully >> dlopen() libjava.dylib. >> >> 2) Adjust various paths to reflect no arch subdirs >> >> On Mac OS X, there are no arch subdirs, e.g jre/lib vs jre/lib/. >> Instead, one can use universal binaries to package multiple >> architectures in a single binary. At the moment though, we are only >> building 64-bit non-universal binaries. Note, the test_gamma script >> assumes an Oracle JDK layout for JAVA_HOME, derived from ALT_BOOTDIR. >> Using an Apple JDK for ALT_BOOTDIR will fail the test_gamma script >> gracefully, as libjava.dylib is in a different, unexpected place. >> >> 3) Modify test_gamma script to set library path only for gamma launch >> >> Setting DYLD_LIBRARY_PATH adversely affects the real java launcher(s). >> Instead, set this later in the script only for the gamma launcher test >> run. While in there, I took the liberty of decrypting the script to make >> it more maintainable and more easily merged whenever we reconcile the >> unix ports into a single codebase. There is no change to the script output. >> >> Feedback welcome... >> >> WEBREV: >> http://cr.openjdk.java.net/~jmelvin/7125793/webrev.00 >> >> TESTS RUN: >> JPRT 2011-12-31-061123.jmelvin.7125793 >> local Mac OS X builds/tests >> >> >> Thanks and Happy New Year! >> >> Jim From john.coomes at oracle.com Tue Jan 3 12:46:09 2012 From: john.coomes at oracle.com (john.coomes at oracle.com) Date: Tue, 03 Jan 2012 20:46:09 +0000 Subject: hg: hsx/hotspot-main/langtools: 12 new changesets Message-ID: <20120103204637.CC12547861@hg.openjdk.java.net> Changeset: 4822dfe0922b Author: ohair Date: 2011-12-12 08:15 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/langtools/rev/4822dfe0922b 7119829: Adjust default jprt testing configuration Reviewed-by: alanb ! make/jprt.properties Changeset: 3809292620c9 Author: jjg Date: 2011-12-13 11:21 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/langtools/rev/3809292620c9 7120736: refactor javac option handling Reviewed-by: mcimadamore ! src/share/classes/com/sun/tools/javac/api/JavacTool.java ! src/share/classes/com/sun/tools/javac/code/Source.java ! src/share/classes/com/sun/tools/javac/comp/Check.java ! src/share/classes/com/sun/tools/javac/comp/Enter.java ! src/share/classes/com/sun/tools/javac/comp/Lower.java ! src/share/classes/com/sun/tools/javac/file/Locations.java ! src/share/classes/com/sun/tools/javac/jvm/ClassReader.java ! src/share/classes/com/sun/tools/javac/jvm/ClassWriter.java ! src/share/classes/com/sun/tools/javac/jvm/Gen.java ! src/share/classes/com/sun/tools/javac/jvm/Target.java ! src/share/classes/com/sun/tools/javac/main/JavaCompiler.java ! src/share/classes/com/sun/tools/javac/main/Main.java ! src/share/classes/com/sun/tools/javac/nio/JavacPathFileManager.java ! src/share/classes/com/sun/tools/javac/processing/JavacProcessingEnvironment.java ! src/share/classes/com/sun/tools/javac/util/BaseFileManager.java ! src/share/classes/com/sun/tools/javac/util/Log.java ! src/share/classes/com/sun/tools/javac/util/Options.java ! test/tools/javac/diags/examples/UnsupportedEncoding.java Changeset: 4e4fed1d02f9 Author: jjg Date: 2011-12-13 14:33 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/langtools/rev/4e4fed1d02f9 7121164: renamed files not committed Reviewed-by: ksrini - src/share/classes/com/sun/tools/javac/main/JavacOption.java + src/share/classes/com/sun/tools/javac/main/Option.java + src/share/classes/com/sun/tools/javac/main/OptionHelper.java - src/share/classes/com/sun/tools/javac/main/OptionName.java - src/share/classes/com/sun/tools/javac/main/RecognizedOptions.java Changeset: 4261dc8af622 Author: jjg Date: 2011-12-14 16:16 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/langtools/rev/4261dc8af622 7111022: javac no long prints last round of processing 7121323: Sqe tests using -Xstdout option fail with an invalid flag error message Reviewed-by: darcy ! src/share/classes/com/sun/tools/javac/main/Option.java ! src/share/classes/com/sun/tools/javac/processing/JavacProcessingEnvironment.java ! src/share/classes/com/sun/tools/javac/util/Log.java ! test/tools/javac/4846262/Test.sh + test/tools/javac/processing/options/testPrintProcessorInfo/TestWithXstdout.java ! test/tools/javac/util/T6597678.java Changeset: 281eeedf9755 Author: jjg Date: 2011-12-14 17:52 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/langtools/rev/281eeedf9755 7121681: compiler message file broken for javac -fullversion Reviewed-by: jjh ! src/share/classes/com/sun/tools/javac/main/Option.java Changeset: 42ffceeceeca Author: jjg Date: 2011-12-14 21:52 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/langtools/rev/42ffceeceeca 7121682: remove obsolete import Reviewed-by: jjh ! test/tools/javac/api/T6838467.java Changeset: ab2a880cc23b Author: lana Date: 2011-12-15 19:53 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/langtools/rev/ab2a880cc23b Merge Changeset: 6b773fdeb633 Author: jjg Date: 2011-12-16 13:49 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/langtools/rev/6b773fdeb633 7121961: javadoc is missing a resource property Reviewed-by: bpatel ! src/share/classes/com/sun/tools/doclets/formats/html/resources/standard.properties Changeset: a7a2720c7897 Author: jjh Date: 2011-12-16 16:41 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/langtools/rev/a7a2720c7897 7122342: testPrintProcessorInfo/TestWithXstdout.java failed for JDK8 nightly build at 12/16/2011 Summary: Do not pass empty args to javac Reviewed-by: jjg ! test/tools/javac/processing/options/testPrintProcessorInfo/TestWithXstdout.java Changeset: 1ae5988e201b Author: mcimadamore Date: 2011-12-19 12:07 +0000 URL: http://hg.openjdk.java.net/hsx/hotspot-main/langtools/rev/1ae5988e201b 7120463: Fix method reference parser support in order to avoid ambiguities Summary: Add lookahead routine to disambiguate between method reference in method context and binary expression Reviewed-by: jjg, dlsmith ! src/share/classes/com/sun/tools/javac/parser/JavacParser.java ! test/tools/javac/lambda/MethodReferenceParserTest.java Changeset: 77b2c066084c Author: lana Date: 2011-12-23 16:39 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/langtools/rev/77b2c066084c Merge - src/share/classes/com/sun/tools/javac/main/JavacOption.java - src/share/classes/com/sun/tools/javac/main/OptionName.java - src/share/classes/com/sun/tools/javac/main/RecognizedOptions.java Changeset: ffd294128a48 Author: katleman Date: 2011-12-29 15:14 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/langtools/rev/ffd294128a48 Added tag jdk8-b19 for changeset 77b2c066084c ! .hgtags From jon.masamitsu at oracle.com Tue Jan 3 12:47:30 2012 From: jon.masamitsu at oracle.com (jon.masamitsu at oracle.com) Date: Tue, 03 Jan 2012 20:47:30 +0000 Subject: hg: hsx/hotspot-main/hotspot: 3 new changesets Message-ID: <20120103204741.A0F8E47862@hg.openjdk.java.net> Changeset: 20bfb6d15a94 Author: iveresov Date: 2011-12-27 16:43 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/20bfb6d15a94 7124829: NUMA: memory leak on Linux with large pages Summary: In os::free_memory() use mmap with the same attributes as for the heap space Reviewed-by: kvn Contributed-by: Aleksey Ignatenko ! src/os/bsd/vm/os_bsd.cpp ! src/os/linux/vm/os_linux.cpp ! src/os/solaris/vm/os_solaris.cpp ! src/os/windows/vm/os_windows.cpp ! src/share/vm/gc_implementation/shared/mutableNUMASpace.cpp ! src/share/vm/gc_implementation/shared/mutableSpace.cpp ! src/share/vm/runtime/os.hpp Changeset: 776173fc2df9 Author: stefank Date: 2011-12-29 07:37 +0100 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/776173fc2df9 7125516: G1: ~ConcurrentMark() frees incorrectly Summary: Replaced the code with a ShouldNotReachHere Reviewed-by: tonyp, jmasa ! src/share/vm/gc_implementation/g1/concurrentMark.cpp Changeset: 5ee33ff9b1c4 Author: jmasa Date: 2012-01-03 10:22 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/5ee33ff9b1c4 Merge From John.Coomes at oracle.com Tue Jan 3 15:17:58 2012 From: John.Coomes at oracle.com (John Coomes) Date: Tue, 3 Jan 2012 15:17:58 -0800 Subject: RFR (S): 7125793: MAC: test_gamma should always work In-Reply-To: <4EFF3BC6.3060407@oracle.com> References: <4EFF3BC6.3060407@oracle.com> Message-ID: <20227.36006.993739.512423@oracle.com> James Melvin (james.melvin at oracle.com) wrote: > Hi, > > This change fixes the 'gamma' simple launcher for HotSpot on Mac OS X. > There were 3 primary changes required to re-enable gamma... > > 1) Statically link with CoreFoundation framework to resolve symbols > > The gamma launcher dlopen()s libjava.dylib from $JAVA_HOME/jre/lib. > Because Mac OS X files are case-insensitive by default, we collide on > $FRAMEWORK/libJPEG.dylib and ${JAVA_HOME}/jre/lib/libjpeg.dylib. This > resulted in unresolved symbols in the Mac OS X framework libraries. The > solution for gamma was to statically link with CoreFoundation framework > to properly resolve framework symbols and allow gamma to successfully > dlopen() libjava.dylib. > > 2) Adjust various paths to reflect no arch subdirs > > On Mac OS X, there are no arch subdirs, e.g jre/lib vs jre/lib/. > Instead, one can use universal binaries to package multiple > architectures in a single binary. At the moment though, we are only > building 64-bit non-universal binaries. Note, the test_gamma script > assumes an Oracle JDK layout for JAVA_HOME, derived from ALT_BOOTDIR. > Using an Apple JDK for ALT_BOOTDIR will fail the test_gamma script > gracefully, as libjava.dylib is in a different, unexpected place. Hi Jim, As others have said, avoid #ifdef __APPLE__ in the shared sources if at all possible. > 3) Modify test_gamma script to set library path only for gamma launch > > Setting DYLD_LIBRARY_PATH adversely affects the real java launcher(s). > Instead, set this later in the script only for the gamma launcher test > run. While in there, I took the liberty of decrypting the script to make > it more maintainable and more easily merged whenever we reconcile the > unix ports into a single codebase. There is no change to the script output. Better not to take that liberty, as it will make reconciling things more difficult. With your change, the osx version can't be easily compared with the other ports; any semantic differences are lost in the noise. Also, brevity is a virtue--what once fit on a few lines has bloated to multiple screenfulls. -John From rednaxelafx at gmail.com Wed Jan 4 03:10:53 2012 From: rednaxelafx at gmail.com (Krystal Mok) Date: Wed, 4 Jan 2012 19:10:53 +0800 Subject: Fwd: Request for review (XS): SA should cope with partially loaded ConstantPool In-Reply-To: References: Message-ID: cc'ing hotspot-dev ---------- Forwarded message ---------- From: Krystal Mok Date: Fri, Dec 30, 2011 at 8:36 PM Subject: Request for review (XS): SA should cope with partially loaded ConstantPool To: serviceability-dev at openjdk.java.net Hi all, I was using CLHSDB to dump the contents of PermGen the other day, and ran into a ClassCastException, as shown in [1]. It turns out that there was a partially loaded constantPoolOopDesc instance in the PermGen, which is actually dead already, but not collected yet (because no GC has happened yet). The way it's marked to be "partially loaded" is setting a pointer to this constantPoolOopDesc object itself to its _pool_holder field, which caused the exception in the Serviceability Agent. There's no problem with VM, but I think SA should cope with this behavior. So here's a patch to fix SA: diff -r fe2c87649981 agent/src/share/classes/sun/jvm/hotspot/oops/ConstantPool.java --- a/agent/src/share/classes/sun/jvm/hotspot/oops/ConstantPool.java Thu Dec 29 15:14:33 2011 -0800 +++ b/agent/src/share/classes/sun/jvm/hotspot/oops/ConstantPool.java Fri Dec 30 20:15:10 2011 +0800 @@ -648,7 +648,12 @@ } public void printValueOn(PrintStream tty) { - tty.print("ConstantPool for " + getPoolHolder().getName().asString()); + Oop holder = poolHolder.getValue(this); + if (holder instanceof Klass) { + tty.print("ConstantPool for " + ((Klass) holder).getName().asString()); + } else { + tty.print("ConstantPool for partially loaded class"); + } } public long getObjectSize() { By the way, there's another bug in current tip version of SA. In 6990754 [2], Symbols were moved into native memory, and SA was updated accordingly. But it missed a case in ConstantPool.iterateFields(OopVisitor visitor, boolean doVMFields). A quick-n-dirty fix would be: diff -r fe2c87649981 agent/src/share/classes/sun/jvm/hotspot/oops/ConstantPool.java --- a/agent/src/share/classes/sun/jvm/hotspot/oops/ConstantPool.java Thu Dec 29 15:14:33 2011 -0800 +++ b/agent/src/share/classes/sun/jvm/hotspot/oops/ConstantPool.java Fri Dec 30 20:15:10 2011 +0800 @@ -454,7 +454,7 @@ case JVM_CONSTANT_Class: case JVM_CONSTANT_UnresolvedString: case JVM_CONSTANT_Utf8: - visitor.doOop(new OopField(new NamedFieldIdentifier(nameForTag(ctag)), indexOffset(index), true), true); + visitor.doInt(new IntField(new NamedFieldIdentifier(nameForTag(ctag)), indexOffset(index), true), true); break; case JVM_CONSTANT_Fieldref: But fixing it like this would make it hard to see the connection between a ConstantPool and the Symbols it's referencing. I'm not so sure about what the best fix would look like. Tried adding a "SymbolField" type, but it felt too heavy. Any suggestions? Regards, Kris Mok [1]: https://gist.github.com/1526668#file_clhsdb_session [2]: http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6990754 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120104/ca0fadc2/attachment.html From bob.vandette at oracle.com Wed Jan 4 16:56:22 2012 From: bob.vandette at oracle.com (bob.vandette at oracle.com) Date: Thu, 05 Jan 2012 00:56:22 +0000 Subject: hg: hsx/hotspot-main/hotspot: 9 new changesets Message-ID: <20120105005643.961564788C@hg.openjdk.java.net> Changeset: 75c0a73eee98 Author: coleenp Date: 2011-11-17 12:53 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/75c0a73eee98 7102776: Pack instanceKlass boolean fields into single u1 field Summary: Reduce class runtime memory usage by packing 4 instanceKlass boolean fields into single u1 field. Save 4-byte for each loaded class. Reviewed-by: dholmes, bobv, phh, twisti, never, coleenp Contributed-by: Jiangli Zhou ! agent/src/share/classes/sun/jvm/hotspot/oops/InstanceKlass.java ! src/share/vm/code/dependencies.cpp ! src/share/vm/oops/instanceKlass.hpp ! src/share/vm/oops/instanceKlassKlass.cpp ! src/share/vm/runtime/vmStructs.cpp Changeset: da4dd142ea01 Author: bobv Date: 2011-11-29 14:44 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/da4dd142ea01 Merge ! src/share/vm/code/dependencies.cpp Changeset: 52b5d32fbfaf Author: coleenp Date: 2011-12-06 18:28 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/52b5d32fbfaf 7117052: instanceKlass::_init_state can be u1 type Summary: Change instanceKlass::_init_state field to u1 type. Reviewed-by: bdelsart, coleenp, dholmes, phh, never Contributed-by: Jiangli Zhou ! src/cpu/sparc/vm/c1_LIRAssembler_sparc.cpp ! src/cpu/sparc/vm/c1_Runtime1_sparc.cpp ! src/cpu/sparc/vm/templateTable_sparc.cpp ! src/cpu/x86/vm/c1_LIRAssembler_x86.cpp ! src/cpu/x86/vm/c1_Runtime1_x86.cpp ! src/cpu/x86/vm/templateTable_x86_32.cpp ! src/cpu/x86/vm/templateTable_x86_64.cpp ! src/share/vm/ci/ciInstanceKlass.cpp ! src/share/vm/memory/dump.cpp ! src/share/vm/oops/instanceKlass.cpp ! src/share/vm/oops/instanceKlass.hpp ! src/share/vm/opto/library_call.cpp ! src/share/vm/opto/parseHelper.cpp ! src/share/vm/runtime/vmStructs.cpp Changeset: eccc4b1f8945 Author: vladidan Date: 2011-12-07 16:47 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/eccc4b1f8945 7050298: ARM: SIGSEGV in JNIHandleBlock::allocate_handle Summary: missing release barrier in Monitor::IUnlock Reviewed-by: dholmes, dice ! src/share/vm/runtime/mutex.cpp Changeset: 2685ea97b89f Author: jiangli Date: 2011-12-09 11:29 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/2685ea97b89f Merge ! src/cpu/sparc/vm/c1_LIRAssembler_sparc.cpp Changeset: 8fdf463085e1 Author: jiangli Date: 2011-12-16 17:33 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/8fdf463085e1 Merge Changeset: dca455dea3a7 Author: bdelsart Date: 2011-12-20 12:33 +0100 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/dca455dea3a7 7116216: StackOverflow GC crash Summary: GC crash for explicit stack overflow checks after a C2I transition. Reviewed-by: coleenp, never Contributed-by: yang02.wang at sap.com, bertrand.delsart at oracle.com ! src/cpu/sparc/vm/stubGenerator_sparc.cpp ! src/cpu/sparc/vm/templateInterpreter_sparc.cpp ! src/cpu/x86/vm/stubGenerator_x86_32.cpp ! src/cpu/x86/vm/stubGenerator_x86_64.cpp ! src/cpu/x86/vm/templateInterpreter_x86_32.cpp ! src/cpu/x86/vm/templateInterpreter_x86_64.cpp + test/compiler/7116216/LargeFrame.java + test/compiler/7116216/StackOverflow.java Changeset: cd5d8cafcc84 Author: jiangli Date: 2011-12-28 12:15 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/cd5d8cafcc84 7123315: instanceKlass::_static_oop_field_count and instanceKlass::_java_fields_count should be u2 type. Summary: Change instanceKlass::_static_oop_field_count and instanceKlass::_java_fields_count to u2 type. Reviewed-by: never, bdelsart, dholmes Contributed-by: Jiangli Zhou ! src/share/vm/classfile/classFileParser.cpp ! src/share/vm/classfile/classFileParser.hpp ! src/share/vm/oops/instanceKlass.hpp ! src/share/vm/runtime/vmStructs.cpp Changeset: 05de27e852c4 Author: jiangli Date: 2012-01-04 12:36 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/05de27e852c4 Merge ! src/share/vm/classfile/classFileParser.cpp From tony.printezis at oracle.com Thu Jan 5 08:12:02 2012 From: tony.printezis at oracle.com (Tony Printezis) Date: Thu, 05 Jan 2012 11:12:02 -0500 Subject: JEP 132: More-prompt finalization In-Reply-To: <4899B594-66EA-444B-9224-9EA660B4E346@kodewerk.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <4E9BFF43-DAC8-480A-AF46-3AADD3FE2ED7@kodewerk.com> <4EFB37E0.1090509@oracle.com> <4899B594-66EA-444B-9224-9EA660B4E346@kodewerk.com> Message-ID: <4F05CBD2.6080406@oracle.com> Kirk, Trying to clean my inbox post-holidays.... I hope your temperature is now back no normal. :-) On 12/28/2011 11:21 AM, Kirk Pepperdine wrote: > Ok, I'm reading this with a 38 degree temp so maybe that's why I'm not > getting it, my brain is slow?. I've looked at the link Jon provided.. > very nice but still leaves me puzzled. Wouldn't simply implementing > Unreferenced be enough to trigger the clean up? I would imagine a > broken pipe or some other fault should cause the distributed objects > to be dereferenced (i.e. become collectable). At the end of the day, > this seems like calling System.gc() in Servlet.destroy(). I'm also not sure I understand your point above about simply implementing Unreferenced. Consider the following: Host 1: has object A Host 2: has object B that has a remote reference to A Host 1 does not know anything about what's happening in Host 2. The only thing it has been told is that there's a remote reference to object A. When Host 2 discovers that B is dead it has to somehow tell Host 1 that the remote reference to A does not exist any more. This will allow Host 1 to collect A as long as it's not otherwise unreachable. If Host 2 crashes, that message will never be sent. Not sure what happens in that case, I assume hosts have to frequently refresh the remote references so the ref to A will not be refreshed and eventually be considered dead? Tony -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120105/dddac8a2/attachment.html From kirk at kodewerk.com Thu Jan 5 08:29:52 2012 From: kirk at kodewerk.com (Kirk Pepperdine) Date: Thu, 5 Jan 2012 17:29:52 +0100 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F05CBD2.6080406@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <4E9BFF43-DAC8-480A-AF46-3AADD3FE2ED7@kodewerk.com> <4EFB37E0.1090509@oracle.com> <4899B594-66EA-444B-9224-9EA660B4E346@kodewerk.com> <4F05CBD2.6080406@oracle.com> Message-ID: <4DD6120C-4B9B-49F4-A793-80628081F79E@kodewerk.com> Hi Tony, Temp is back to normal and I'm much better. On 2012-01-05, at 5:12 PM, Tony Printezis wrote: > Kirk, > > Trying to clean my inbox post-holidays.... I hope your temperature is now back no normal. :-) > > On 12/28/2011 11:21 AM, Kirk Pepperdine wrote: >> >> Ok, I'm reading this with a 38 degree temp so maybe that's why I'm not getting it, my brain is slow?. I've looked at the link Jon provided.. very nice but still leaves me puzzled. Wouldn't simply implementing Unreferenced be enough to trigger the clean up? I would imagine a broken pipe or some other fault should cause the distributed objects to be dereferenced (i.e. become collectable). At the end of the day, this seems like calling System.gc() in Servlet.destroy(). > > I'm also not sure I understand your point above about simply implementing Unreferenced. Consider the following: > > Host 1: has object A > Host 2: has object B that has a remote reference to A > > Host 1 does not know anything about what's happening in Host 2. The only thing it has been told is that there's a remote reference to object A. When Host 2 discovers that B is dead it has to somehow tell Host 1 that the remote reference to A does not exist any more. This will allow Host 1 to collect A as long as it's not otherwise unreachable. If Host 2 crashes, that message will never be sent. Not sure what happens in that case, I assume hosts have to frequently refresh the remote references so the ref to A will not be refreshed and eventually be considered dead? If host2 dies, I would assume that the socket connection it had opened with host1 would break. But that is an exceptional case. In the functional case, B will dereference A which should pass a single along to the B proxy running in host 1 that A should be dereferenced. If B is collected in Host2, B proxy should be released and collected in Host1 via normal dereferencing. I don't see a need to call System.gc(). Kirk -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120105/44f31bf2/attachment.html From tony.printezis at oracle.com Thu Jan 5 08:45:20 2012 From: tony.printezis at oracle.com (Tony Printezis) Date: Thu, 05 Jan 2012 11:45:20 -0500 Subject: JEP 132: More-prompt finalization In-Reply-To: <4EFB5583.5000005@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> Message-ID: <4F05D3A0.1050503@oracle.com> Dmitry, On 12/28/2011 12:44 PM, Dmitry Samersoff wrote: >>> Each of them has it's own workaround (e.g. connection pool manager with refcounting or separate checker thread) >> I'm not sure that I'd call these work-arounds as they all serve a multitude of purposes.. but, beyond the scope > Nowdays we have plenty of memory so we can delay socket (an other > resources) reclamation but save some CPU power. Well, having lots of memory can allow us to have lots of "room" in the heap to postpone GC. However, there are native resources that are reclaimed by finalization that are scarce (typically there's a fixed number of them, or limited amount of memory we can dedicate to them, etc.) so extra memory is just not going to help: there are likely to run out before the heap is full enough to cause a GC. Increasing their max number is a short-term fix and will only postpone the inevitable. > It's especially valuable > if an application have clear visible pick and spare hours. > I agree with you - there is no reason to have an API to trigger GC or > finalization explicitly. I totally agree with this ....but also see below. > I dream about a time when JVM would be able to > detect low load time and start GC/finalization automatically. I can't see how this is going to help: - If you detect that the machine load is low it doesn't also mean that there are garbage objects in the heap that need to be reclaimed or finalized. So, triggering GC "opportunistically" will be, I'd guess, unproductive most of the time. - In fact, if the machine load is low it means that the application is not doing much, therefore maybe there are not many objects to be reclaimed / finalized. Which means that this is probably the worst time to trigger GC. - It's not always desirable to do work that might be unproductive when the machine load is low. Consider battery powered mobile devices: doing potentially unproductive work could drain battery unnecessarily. > But today there are a cu's cases that can't be solved without such API. I agree with you that giving users an API to trigger GCs / finalization is not optimal given that they will most likely mis-use it (and they do). However, giving an API to library writers to inform the GC how much of a native resource is currently being consumed (say: 43 files are open out of a max of 100) and let the GC decide what to do is probably a better approach (IMHO). Tony From tony.printezis at oracle.com Thu Jan 5 08:52:24 2012 From: tony.printezis at oracle.com (Tony Printezis) Date: Thu, 05 Jan 2012 11:52:24 -0500 Subject: JEP 132: More-prompt finalization In-Reply-To: <4DD6120C-4B9B-49F4-A793-80628081F79E@kodewerk.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <4E9BFF43-DAC8-480A-AF46-3AADD3FE2ED7@kodewerk.com> <4EFB37E0.1090509@oracle.com> <4899B594-66EA-444B-9224-9EA660B4E346@kodewerk.com> <4F05CBD2.6080406@oracle.com> <4DD6120C-4B9B-49F4-A793-80628081F79E@kodewerk.com> Message-ID: <4F05D548.6060606@oracle.com> Kirk, Consider the case where B becomes garbage in Host 2 but Host 2 is not doing much so the next Full GC in Host 2 happens 2 days later. During that time A cannot be collected as Host 1 still thinks there's a remote reference to A and Host 2 knows that it holds a remote reference from B to A but does not know that B is dead. Tony On 01/05/2012 11:29 AM, Kirk Pepperdine wrote: > >> Host 1: has object A >> Host 2: has object B that has a remote reference to A >> >> Host 1 does not know anything about what's happening in Host 2. The >> only thing it has been told is that there's a remote reference to >> object A. When Host 2 discovers that B is dead it has to somehow tell >> Host 1 that the remote reference to A does not exist any more. This >> will allow Host 1 to collect A as long as it's not otherwise >> unreachable. If Host 2 crashes, that message will never be sent. Not >> sure what happens in that case, I assume hosts have to frequently >> refresh the remote references so the ref to A will not be refreshed >> and eventually be considered dead? > > If host2 dies, I would assume that the socket connection it had opened > with host1 would break. But that is an exceptional case. In the > functional case, B will dereference A which should pass a single along > to the B proxy running in host 1 that A should be dereferenced. If B > is collected in Host2, B proxy should be released and collected in > Host1 via normal dereferencing. I don't see a need to call System.gc(). > > Kirk > > From kirk at kodewerk.com Thu Jan 5 09:37:41 2012 From: kirk at kodewerk.com (Kirk Pepperdine) Date: Thu, 5 Jan 2012 18:37:41 +0100 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F05D3A0.1050503@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> Message-ID: <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> On 2012-01-05, at 5:45 PM, Tony Printezis wrote: > Dmitry, > > On 12/28/2011 12:44 PM, Dmitry Samersoff wrote: >>>> Each of them has it's own workaround (e.g. connection pool manager with refcounting or separate checker thread) >>> I'm not sure that I'd call these work-arounds as they all serve a multitude of purposes.. but, beyond the scope >> Nowdays we have plenty of memory so we can delay socket (an other >> resources) reclamation but save some CPU power. > > Well, having lots of memory can allow us to have lots of "room" in the heap to postpone GC. However, there are native resources that are reclaimed by finalization that are scarce (typically there's a fixed number of them, or limited amount of memory we can dedicate to them, etc.) so extra memory is just not going to help: there are likely to run out before the heap is full enough to cause a GC. Increasing their max number is a short-term fix and will only postpone the inevitable. Right, so you're relying on finalization when you should be using a normal close mechanism. > >> It's especially valuable >> if an application have clear visible pick and spare hours. > >> I agree with you - there is no reason to have an API to trigger GC or >> finalization explicitly. > > I totally agree with this ....but also see below. > >> I dream about a time when JVM would be able to >> detect low load time and start GC/finalization automatically. > > I can't see how this is going to help: > > - If you detect that the machine load is low it doesn't also mean that there are garbage objects in the heap that need to be reclaimed or finalized. So, triggering GC "opportunistically" will be, I'd guess, unproductive most of the time. > > - In fact, if the machine load is low it means that the application is not doing much, therefore maybe there are not many objects to be reclaimed / finalized. Which means that this is probably the worst time to trigger GC. > > - It's not always desirable to do work that might be unproductive when the machine load is low. Consider battery powered mobile devices: doing potentially unproductive work could drain battery unnecessarily. Right, this assumes that the app is creating objects when it's busy. We both know that this isn't necessarily true in the low-latency case. That said, triggering a GC when the machine appears to be idle in a low latency application is unlikely to do much more than burn CPU cycles (and battery). > >> But today there are a cu's cases that can't be solved without such API. > > I agree with you that giving users an API to trigger GCs / finalization is not optimal given that they will most likely mis-use it (and they do). However, giving an API to library writers to inform the GC how much of a native resource is currently being consumed (say: 43 files are open out of a max of 100) and let the GC decide what to do is probably a better approach (IMHO). I get the feeling that what people are looking for is a destructor.. and in Java the destructor is close(). One has to consider finalization of any resource to be the mechanism of last resort or when all else fails, finalization will catch it (assuming it has a chance to run). Eg, it's the application's responsibility to call close(). If you know enough to call System.gc() (or any other API), you should know enough to call close. Regards, Kirk From vitalyd at gmail.com Thu Jan 5 09:48:10 2012 From: vitalyd at gmail.com (Vitaly Davidovich) Date: Thu, 5 Jan 2012 12:48:10 -0500 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F05D3A0.1050503@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> Message-ID: Hi Tony, One case I can see where doing a GC during low activity would help is so that when activity resumes the GC spaces are as clean (and compacted, if necessary) as can be and thus throughput and latency can be reduced when activity resumes. Presumably you'd run just one such GC per low activity period. On Jan 5, 2012 11:46 AM, "Tony Printezis" wrote: > Dmitry, > > On 12/28/2011 12:44 PM, Dmitry Samersoff wrote: > >> Each of them has it's own workaround (e.g. connection pool manager with >>>> refcounting or separate checker thread) >>>> >>> I'm not sure that I'd call these work-arounds as they all serve a >>> multitude of purposes.. but, beyond the scope >>> >> Nowdays we have plenty of memory so we can delay socket (an other >> resources) reclamation but save some CPU power. >> > > Well, having lots of memory can allow us to have lots of "room" in the > heap to postpone GC. However, there are native resources that are reclaimed > by finalization that are scarce (typically there's a fixed number of them, > or limited amount of memory we can dedicate to them, etc.) so extra memory > is just not going to help: there are likely to run out before the heap is > full enough to cause a GC. Increasing their max number is a short-term fix > and will only postpone the inevitable. > > It's especially valuable >> if an application have clear visible pick and spare hours. >> > > I agree with you - there is no reason to have an API to trigger GC or >> finalization explicitly. >> > > I totally agree with this ....but also see below. > > I dream about a time when JVM would be able to >> detect low load time and start GC/finalization automatically. >> > > I can't see how this is going to help: > > - If you detect that the machine load is low it doesn't also mean that > there are garbage objects in the heap that need to be reclaimed or > finalized. So, triggering GC "opportunistically" will be, I'd guess, > unproductive most of the time. > > - In fact, if the machine load is low it means that the application is not > doing much, therefore maybe there are not many objects to be reclaimed / > finalized. Which means that this is probably the worst time to trigger GC. > > - It's not always desirable to do work that might be unproductive when the > machine load is low. Consider battery powered mobile devices: doing > potentially unproductive work could drain battery unnecessarily. > > But today there are a cu's cases that can't be solved without such API. >> > > I agree with you that giving users an API to trigger GCs / finalization is > not optimal given that they will most likely mis-use it (and they do). > However, giving an API to library writers to inform the GC how much of a > native resource is currently being consumed (say: 43 files are open out of > a max of 100) and let the GC decide what to do is probably a better > approach (IMHO). > > Tony > > -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120105/1334e3df/attachment.html From david.holmes at oracle.com Thu Jan 5 18:41:33 2012 From: david.holmes at oracle.com (David Holmes) Date: Fri, 06 Jan 2012 12:41:33 +1000 Subject: JEP 132: More-prompt finalization In-Reply-To: <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> Message-ID: <4F065F5D.6040605@oracle.com> On 6/01/2012 3:37 AM, Kirk Pepperdine wrote: > I get the feeling that what people are looking for is a destructor.. > and in Java the destructor is close(). One has to consider finalization > of any resource to be the mechanism of last resort or when all else > fails, finalization will catch it (assuming it has a chance to run). Eg, > it's the application's responsibility to call close(). If you know > enough to call System.gc() (or any other API), you should know enough to > call close. Not to defend finalization in any way but the key difference is that any part of the code can call System.gc() or System.runFinalization() without needing to know what exactly needs to be finalized. Afterall the key thing about GC is it relieves the programmer from having to manage object lifetimes, so if you don't know when the object is no longer used you don't know when to call close. David From john.coomes at oracle.com Thu Jan 5 20:32:10 2012 From: john.coomes at oracle.com (john.coomes at oracle.com) Date: Fri, 06 Jan 2012 04:32:10 +0000 Subject: hg: hsx/hotspot-main: Added tag jdk8-b20 for changeset 5a5eaf6374bc Message-ID: <20120106043210.38CD0478AD@hg.openjdk.java.net> Changeset: cc771d92284f Author: katleman Date: 2012-01-05 08:42 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/rev/cc771d92284f Added tag jdk8-b20 for changeset 5a5eaf6374bc ! .hgtags From john.coomes at oracle.com Thu Jan 5 20:32:16 2012 From: john.coomes at oracle.com (john.coomes at oracle.com) Date: Fri, 06 Jan 2012 04:32:16 +0000 Subject: hg: hsx/hotspot-main/corba: Added tag jdk8-b20 for changeset 51d8b6cb18c0 Message-ID: <20120106043220.3E9DF478AE@hg.openjdk.java.net> Changeset: f157fc2a71a3 Author: katleman Date: 2012-01-05 08:42 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/corba/rev/f157fc2a71a3 Added tag jdk8-b20 for changeset 51d8b6cb18c0 ! .hgtags From john.coomes at oracle.com Thu Jan 5 20:32:27 2012 From: john.coomes at oracle.com (john.coomes at oracle.com) Date: Fri, 06 Jan 2012 04:32:27 +0000 Subject: hg: hsx/hotspot-main/jaxp: Added tag jdk8-b20 for changeset f052abb8f374 Message-ID: <20120106043227.31275478AF@hg.openjdk.java.net> Changeset: d41eeadf5c13 Author: katleman Date: 2012-01-05 08:42 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/jaxp/rev/d41eeadf5c13 Added tag jdk8-b20 for changeset f052abb8f374 ! .hgtags From john.coomes at oracle.com Thu Jan 5 20:32:34 2012 From: john.coomes at oracle.com (john.coomes at oracle.com) Date: Fri, 06 Jan 2012 04:32:34 +0000 Subject: hg: hsx/hotspot-main/jaxws: Added tag jdk8-b20 for changeset 2b2818e3386f Message-ID: <20120106043234.31631478B0@hg.openjdk.java.net> Changeset: dc2ee8b87884 Author: katleman Date: 2012-01-05 08:42 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/jaxws/rev/dc2ee8b87884 Added tag jdk8-b20 for changeset 2b2818e3386f ! .hgtags From john.coomes at oracle.com Thu Jan 5 20:33:27 2012 From: john.coomes at oracle.com (john.coomes at oracle.com) Date: Fri, 06 Jan 2012 04:33:27 +0000 Subject: hg: hsx/hotspot-main/jdk: 9 new changesets Message-ID: <20120106043526.1C178478B1@hg.openjdk.java.net> Changeset: 172d70c50c65 Author: cgruszka Date: 2011-09-15 13:59 -0400 URL: http://hg.openjdk.java.net/hsx/hotspot-main/jdk/rev/172d70c50c65 7066713: Separate demos from the bundles on Solaris and Linux Summary: add new license files to demos and samples, new directory for bundling Reviewed-by: katleman, ohair, billyh ! make/common/Release.gmk ! make/common/shared/Defs-control.gmk Changeset: eaf967fd25c5 Author: cgruszka Date: 2011-10-18 14:21 -0400 URL: http://hg.openjdk.java.net/hsx/hotspot-main/jdk/rev/eaf967fd25c5 7099017: jdk7u2-dev does not build Summary: changes to skip demo/DEMOS_LICENSE and sample/SAMPLES_LICENSE when building OPENJDK Reviewed-by: ohair, billyh ! make/common/Release.gmk Changeset: 39b7f01203c9 Author: cgruszka Date: 2011-11-17 16:57 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/jdk/rev/39b7f01203c9 Merge Changeset: b64e7263b4fd Author: cgruszka Date: 2011-11-18 01:03 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/jdk/rev/b64e7263b4fd Merge Changeset: e98869ff9f1e Author: cgruszka Date: 2011-12-05 12:41 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/jdk/rev/e98869ff9f1e Merge - test/java/io/FileDescriptor/FileChannelFDTest.java - test/java/io/etc/FileDescriptorSharing.java Changeset: ffa36a6a46f5 Author: cgruszka Date: 2011-12-16 15:01 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/jdk/rev/ffa36a6a46f5 Merge - make/sun/motif12/reorder-i586 - make/sun/motif12/reorder-sparc - make/sun/motif12/reorder-sparcv9 - src/share/native/java/util/zip/zlib-1.2.3/ChangeLog - src/share/native/java/util/zip/zlib-1.2.3/README - src/share/native/java/util/zip/zlib-1.2.3/compress.c - src/share/native/java/util/zip/zlib-1.2.3/crc32.h - src/share/native/java/util/zip/zlib-1.2.3/deflate.c - src/share/native/java/util/zip/zlib-1.2.3/deflate.h - src/share/native/java/util/zip/zlib-1.2.3/gzio.c - src/share/native/java/util/zip/zlib-1.2.3/infback.c - src/share/native/java/util/zip/zlib-1.2.3/inffast.c - src/share/native/java/util/zip/zlib-1.2.3/inffast.h - src/share/native/java/util/zip/zlib-1.2.3/inffixed.h - src/share/native/java/util/zip/zlib-1.2.3/inflate.c - src/share/native/java/util/zip/zlib-1.2.3/inflate.h - src/share/native/java/util/zip/zlib-1.2.3/inftrees.c - src/share/native/java/util/zip/zlib-1.2.3/inftrees.h - src/share/native/java/util/zip/zlib-1.2.3/patches/ChangeLog_java - src/share/native/java/util/zip/zlib-1.2.3/patches/crc32.c.diff - src/share/native/java/util/zip/zlib-1.2.3/patches/inflate.c.diff - src/share/native/java/util/zip/zlib-1.2.3/patches/zconf.h.diff - src/share/native/java/util/zip/zlib-1.2.3/patches/zlib.h.diff - src/share/native/java/util/zip/zlib-1.2.3/trees.c - src/share/native/java/util/zip/zlib-1.2.3/trees.h - src/share/native/java/util/zip/zlib-1.2.3/uncompr.c - src/share/native/java/util/zip/zlib-1.2.3/zadler32.c - src/share/native/java/util/zip/zlib-1.2.3/zconf.h - src/share/native/java/util/zip/zlib-1.2.3/zcrc32.c - src/share/native/java/util/zip/zlib-1.2.3/zlib.h - src/share/native/java/util/zip/zlib-1.2.3/zutil.c - src/share/native/java/util/zip/zlib-1.2.3/zutil.h - src/solaris/classes/sun/awt/motif/AWTLockAccess.java - src/solaris/classes/sun/awt/motif/MFontPeer.java - src/solaris/classes/sun/awt/motif/MToolkit.java - src/solaris/classes/sun/awt/motif/MToolkitThreadBlockedHandler.java - src/solaris/classes/sun/awt/motif/MWindowAttributes.java - src/solaris/classes/sun/awt/motif/X11FontMetrics.java - src/solaris/native/sun/awt/MouseInfo.c - src/solaris/native/sun/awt/XDrawingArea.c - src/solaris/native/sun/awt/XDrawingArea.h - src/solaris/native/sun/awt/XDrawingAreaP.h - src/solaris/native/sun/awt/awt_Cursor.h - src/solaris/native/sun/awt/awt_KeyboardFocusManager.h - src/solaris/native/sun/awt/awt_MToolkit.c - src/solaris/native/sun/awt/awt_MToolkit.h - src/solaris/native/sun/awt/awt_MenuItem.h - src/solaris/native/sun/awt/awt_PopupMenu.h - src/solaris/native/sun/awt/awt_TopLevel.h - src/solaris/native/sun/awt/awt_Window.h - src/solaris/native/sun/awt/awt_mgrsel.c - src/solaris/native/sun/awt/awt_mgrsel.h - src/solaris/native/sun/awt/awt_motif.h - src/solaris/native/sun/awt/awt_wm.c - src/solaris/native/sun/awt/awt_wm.h - src/solaris/native/sun/awt/awt_xembed.h - src/solaris/native/sun/awt/awt_xembed_server.c - src/solaris/native/sun/awt/awt_xembed_server.h - test/java/util/ResourceBundle/Control/ExpirationTest.java - test/java/util/ResourceBundle/Control/ExpirationTest.sh Changeset: 5fe1525e6e2c Author: cgruszka Date: 2011-12-23 10:43 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/jdk/rev/5fe1525e6e2c Merge Changeset: 39e938cd1b82 Author: cgruszka Date: 2012-01-03 14:34 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/jdk/rev/39e938cd1b82 Merge Changeset: fc050750f8a0 Author: katleman Date: 2012-01-05 08:42 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/jdk/rev/fc050750f8a0 Added tag jdk8-b20 for changeset 39e938cd1b82 ! .hgtags From john.coomes at oracle.com Thu Jan 5 20:36:49 2012 From: john.coomes at oracle.com (john.coomes at oracle.com) Date: Fri, 06 Jan 2012 04:36:49 +0000 Subject: hg: hsx/hotspot-main/langtools: Added tag jdk8-b20 for changeset ffd294128a48 Message-ID: <20120106043656.C11D7478B2@hg.openjdk.java.net> Changeset: 020819eb56d2 Author: katleman Date: 2012-01-05 08:42 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/langtools/rev/020819eb56d2 Added tag jdk8-b20 for changeset ffd294128a48 ! .hgtags From kirk at kodewerk.com Thu Jan 5 22:24:18 2012 From: kirk at kodewerk.com (Kirk Pepperdine) Date: Fri, 6 Jan 2012 07:24:18 +0100 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F065F5D.6040605@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> Message-ID: On 2012-01-06, at 3:41 AM, David Holmes wrote: > On 6/01/2012 3:37 AM, Kirk Pepperdine wrote: >> I get the feeling that what people are looking for is a destructor.. >> and in Java the destructor is close(). One has to consider finalization >> of any resource to be the mechanism of last resort or when all else >> fails, finalization will catch it (assuming it has a chance to run). Eg, >> it's the application's responsibility to call close(). If you know >> enough to call System.gc() (or any other API), you should know enough to >> call close. > > Not to defend finalization in any way but the key difference is that any part of the code can call System.gc() or System.runFinalization() without needing to know what exactly needs to be finalized. Afterall the key thing about GC is it relieves the programmer from having to manage object lifetimes, so if you don't know when the object is no longer used you don't know when to call close. I think this is my point, band aiding over not knowing when to call close with a call to the system saying do something expensive that will most likely have little value isn't a call that I'd like to see. More over, if you dig deeper into these types of problems it seems like there are safer, more viable solutions. Regards, Kirk From tony.printezis at oracle.com Fri Jan 6 02:26:22 2012 From: tony.printezis at oracle.com (Tony Printezis) Date: Fri, 06 Jan 2012 05:26:22 -0500 Subject: JEP 132: More-prompt finalization In-Reply-To: <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> Message-ID: <4F06CC4E.7020302@oracle.com> Kirk, On 01/05/2012 12:37 PM, Kirk Pepperdine wrote: >> Well, having lots of memory can allow us to have lots of "room" in the heap to postpone GC. However, there are native resources that are reclaimed by finalization that are scarce (typically there's a fixed number of them, or limited amount of memory we can dedicate to them, etc.) so extra memory is just not going to help: there are likely to run out before the heap is full enough to cause a GC. Increasing their max number is a short-term fix and will only postpone the inevitable. > Right, so you're relying on finalization when you should be using a normal close mechanism. See below. > >>> I dream about a time when JVM would be able to >>> detect low load time and start GC/finalization automatically. >> I can't see how this is going to help: >> >> - If you detect that the machine load is low it doesn't also mean that there are garbage objects in the heap that need to be reclaimed or finalized. So, triggering GC "opportunistically" will be, I'd guess, unproductive most of the time. >> >> - In fact, if the machine load is low it means that the application is not doing much, therefore maybe there are not many objects to be reclaimed / finalized. Which means that this is probably the worst time to trigger GC. >> >> - It's not always desirable to do work that might be unproductive when the machine load is low. Consider battery powered mobile devices: doing potentially unproductive work could drain battery unnecessarily. > Right, this assumes that the app is creating objects when it's busy. We both know that this isn't necessarily true in the low-latency case. Sure, but that's the exception. I'd bet that the above is true 95+% of the time. > That said, triggering a GC when the machine appears to be idle in a low latency application is unlikely to do much more than burn CPU cycles (and battery). Bingo. >>> But today there are a cu's cases that can't be solved without such API. >> I agree with you that giving users an API to trigger GCs / finalization is not optimal given that they will most likely mis-use it (and they do). However, giving an API to library writers to inform the GC how much of a native resource is currently being consumed (say: 43 files are open out of a max of 100) and let the GC decide what to do is probably a better approach (IMHO). > I get the feeling that what people are looking for is a destructor.. and in Java the destructor is close(). ....inside a finally { } clause. :-) > One has to consider finalization of any resource to be the mechanism of last resort or when all else fails, finalization will catch it (assuming it has a chance to run). Eg, it's the application's responsibility to call close(). I agree. And I'm sure I can dig up some of my slides from past talks that make this exact point: if you know you're done with an object, please call close() on it. However, as it's already been pointed out in an earlier reply to this thread, it's not always possible to rely on close(): reclaiming DirectByteBuffers that are not guaranteed to be unreachable could be a security issue. Tony > If you know enough to call System.gc() (or any other API), you should know enough to call close. > > Regards, > Kirk > From tony.printezis at oracle.com Fri Jan 6 02:34:08 2012 From: tony.printezis at oracle.com (Tony Printezis) Date: Fri, 06 Jan 2012 05:34:08 -0500 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F065F5D.6040605@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> Message-ID: <4F06CE20.10809@oracle.com> On 01/05/2012 09:41 PM, David Holmes wrote: > On 6/01/2012 3:37 AM, Kirk Pepperdine wrote: >> I get the feeling that what people are looking for is a destructor.. >> and in Java the destructor is close(). One has to consider finalization >> of any resource to be the mechanism of last resort or when all else >> fails, finalization will catch it (assuming it has a chance to run). Eg, >> it's the application's responsibility to call close(). If you know >> enough to call System.gc() (or any other API), you should know enough to >> call close. > > Not to defend finalization Good. :-) > in any way but the key difference is that any part of the code can > call System.gc() or System.runFinalization() without needing to know > what exactly needs to be finalized. If each library could indicate to the GC whether the resource it manages is running low or not, using the API I mentioned, the GC could do the above automatically, behind the scenes, without the user having to do anything else. Tony > Afterall the key thing about GC is it relieves the programmer from > having to manage object lifetimes, so if you don't know when the > object is no longer used you don't know when to call close. > > David From tony.printezis at oracle.com Fri Jan 6 02:34:34 2012 From: tony.printezis at oracle.com (Tony Printezis) Date: Fri, 06 Jan 2012 05:34:34 -0500 Subject: JEP 132: More-prompt finalization In-Reply-To: References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> Message-ID: <4F06CE3A.2040101@oracle.com> Vitaly, Sure, but if the GC detects that the load is low it doesn't know whether the load will remain low for 5 ms or 5 hours (and it's impossible to know, maybe not even the application knows). I can already imagine the bug reports: a spike suddenly happened in the market and the JVM was "locked up" for several seconds!!! Tony On 01/05/2012 12:48 PM, Vitaly Davidovich wrote: > > Hi Tony, > > One case I can see where doing a GC during low activity would help is > so that when activity resumes the GC spaces are as clean (and > compacted, if necessary) as can be and thus throughput and latency can > be reduced when activity resumes. Presumably you'd run just one such > GC per low activity period. > > On Jan 5, 2012 11:46 AM, "Tony Printezis" > wrote: > > Dmitry, > > On 12/28/2011 12:44 PM, Dmitry Samersoff wrote: > > Each of them has it's own workaround (e.g. connection > pool manager with refcounting or separate checker thread) > > I'm not sure that I'd call these work-arounds as they all > serve a multitude of purposes.. but, beyond the scope > > Nowdays we have plenty of memory so we can delay socket (an other > resources) reclamation but save some CPU power. > > > Well, having lots of memory can allow us to have lots of "room" in > the heap to postpone GC. However, there are native resources that > are reclaimed by finalization that are scarce (typically there's a > fixed number of them, or limited amount of memory we can dedicate > to them, etc.) so extra memory is just not going to help: there > are likely to run out before the heap is full enough to cause a > GC. Increasing their max number is a short-term fix and will only > postpone the inevitable. > > It's especially valuable > if an application have clear visible pick and spare hours. > > > I agree with you - there is no reason to have an API to > trigger GC or > finalization explicitly. > > > I totally agree with this ....but also see below. > > I dream about a time when JVM would be able to > detect low load time and start GC/finalization automatically. > > > I can't see how this is going to help: > > - If you detect that the machine load is low it doesn't also mean > that there are garbage objects in the heap that need to be > reclaimed or finalized. So, triggering GC "opportunistically" will > be, I'd guess, unproductive most of the time. > > - In fact, if the machine load is low it means that the > application is not doing much, therefore maybe there are not many > objects to be reclaimed / finalized. Which means that this is > probably the worst time to trigger GC. > > - It's not always desirable to do work that might be unproductive > when the machine load is low. Consider battery powered mobile > devices: doing potentially unproductive work could drain battery > unnecessarily. > > But today there are a cu's cases that can't be solved > without such API. > > > I agree with you that giving users an API to trigger GCs / > finalization is not optimal given that they will most likely > mis-use it (and they do). However, giving an API to library > writers to inform the GC how much of a native resource is > currently being consumed (say: 43 files are open out of a max of > 100) and let the GC decide what to do is probably a better > approach (IMHO). > > Tony > -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120106/a43c670e/attachment.html From david.holmes at oracle.com Fri Jan 6 03:14:47 2012 From: david.holmes at oracle.com (David Holmes) Date: Fri, 06 Jan 2012 21:14:47 +1000 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F06CE20.10809@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> Message-ID: <4F06D7A7.6010603@oracle.com> Hi Tony, On 6/01/2012 8:34 PM, Tony Printezis wrote: > If each library could indicate to the GC whether the resource it manages > is running low or not, using the API I mentioned, the GC could do the > above automatically, behind the scenes, without the user having to do > anything else. I'm not sure a library writer has the necessary information to do this. Seems to me that how an application uses a particular type determines the scarcity of the resource. I can imagine something like: void setFinalizationLimit(Class cls, int limit) so that GC runs finalization once a "reference count" reaches "limit". That limits the frequency of finalization, but the actual finalization cost may still be unacceptably high. Cheers, David > Tony > >> Afterall the key thing about GC is it relieves the programmer from >> having to manage object lifetimes, so if you don't know when the >> object is no longer used you don't know when to call close. >> >> David From kirk at kodewerk.com Fri Jan 6 05:57:51 2012 From: kirk at kodewerk.com (Kirk Pepperdine) Date: Fri, 6 Jan 2012 14:57:51 +0100 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F06D7A7.6010603@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> Message-ID: <3EF8333A-5812-41DD-8180-4AE70F581161@kodewerk.com> Tony, David, I really feel that this problem is being solved at the wrong level. The JVM lacks application semantics to know when to do the "right" thing. So, IMHO, this should be managed by the application. For the same reason, there are other things that the application shouldn't touch like telling the JVM it's time to run a collection. That said, one thing the JVM might know about is how many file descriptors it's allowed and it could trigger an attempt to recover them (i.e. run finalization) once they start running low.. just as the JVM manages memory by recovering it when it runs low. That said, I did run a bench where we opened 1,000,000,000 sockets on a single VM. That is just about as many sockets as can be opened on a machine without recompiling the kernel to configure a bigger file descriptor bitmap. I'm quite happy that the JVM wasn't fighting me as I was trying to open all these sockets. While this bench was a bit extreme, it's clear that WebSocket gateways *will* stress file descriptor. Not sure about direct buffers though I do feel they should implement finalization as a fallback position. Cheers, Kirk On 2012-01-06, at 12:14 PM, David Holmes wrote: > Hi Tony, > > On 6/01/2012 8:34 PM, Tony Printezis wrote: >> If each library could indicate to the GC whether the resource it manages >> is running low or not, using the API I mentioned, the GC could do the >> above automatically, behind the scenes, without the user having to do >> anything else. > > I'm not sure a library writer has the necessary information to do this. Seems to me that how an application uses a particular type determines the scarcity of the resource. I can imagine something like: > > void setFinalizationLimit(Class cls, int limit) > > so that GC runs finalization once a "reference count" reaches "limit". That limits the frequency of finalization, but the actual finalization cost may still be unacceptably high. > > Cheers, > David > >> Tony >> >>> Afterall the key thing about GC is it relieves the programmer from >>> having to manage object lifetimes, so if you don't know when the >>> object is no longer used you don't know when to call close. >>> >>> David From Alan.Bateman at oracle.com Fri Jan 6 06:58:27 2012 From: Alan.Bateman at oracle.com (Alan Bateman) Date: Fri, 06 Jan 2012 14:58:27 +0000 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F06CC4E.7020302@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F06CC4E.7020302@oracle.com> Message-ID: <4F070C13.3090407@oracle.com> On 06/01/2012 10:26, Tony Printezis wrote: > > I agree. And I'm sure I can dig up some of my slides from past talks > that make this exact point: if you know you're done with an object, > please call close() on it. However, as it's already been pointed out > in an earlier reply to this thread, it's not always possible to rely > on close(): reclaiming DirectByteBuffers that are not guaranteed to be > unreachable could be a security issue. Just to mention that in Java SE then all APIs for sockets and files implement Closeable so they have a close method, can be used with the try-with-resources construct where appropriate. While some of the older APIs (FileInputStream, FileOutputStream, the default SocketImpls) have finalizers, the newer APIs do not and so the resources must be explicitly closed. As you mention, direct and mapped buffers can't be explicitly reclaimed. We don't have a good solution to that problem which is why System.gc is invoked when limits are reached. It would be nice to re-visit this some day. Off-hand I think this is the only other place in the JDK, aside from RMI DGC, where System.gc is invoked explicitly. -Alan. From tony.printezis at oracle.com Fri Jan 6 08:07:39 2012 From: tony.printezis at oracle.com (Tony Printezis) Date: Fri, 06 Jan 2012 11:07:39 -0500 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F06D7A7.6010603@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> Message-ID: <4F071C4B.8010002@oracle.com> David, On 01/06/2012 06:14 AM, David Holmes wrote: > Hi Tony, > > On 6/01/2012 8:34 PM, Tony Printezis wrote: >> If each library could indicate to the GC whether the resource it manages >> is running low or not, using the API I mentioned, the GC could do the >> above automatically, behind the scenes, without the user having to do >> anything else. > > I'm not sure a library writer has the necessary information to do > this. Seems to me that how an application uses a particular type > determines the scarcity of the resource. I will be very surprised if the majority of application developers will be willing to measure at what rate the application consumes certain resources and update said measure when the application changes, load increases, OS changes, hardware changes, etc. And, sure, I think there will be many cases where the library writer has a good idea / can find out the max number of instances of a particular resource (file descriptors, sockets, etc.). Tony > I can imagine something like: > > void setFinalizationLimit(Class cls, int limit) > > so that GC runs finalization once a "reference count" reaches "limit". > That limits the frequency of finalization, but the actual finalization > cost may still be unacceptably high. > > Cheers, > David > >> Tony >> >>> Afterall the key thing about GC is it relieves the programmer from >>> having to manage object lifetimes, so if you don't know when the >>> object is no longer used you don't know when to call close. >>> >>> David From tony.printezis at oracle.com Fri Jan 6 08:16:21 2012 From: tony.printezis at oracle.com (Tony Printezis) Date: Fri, 06 Jan 2012 11:16:21 -0500 Subject: JEP 132: More-prompt finalization In-Reply-To: <3EF8333A-5812-41DD-8180-4AE70F581161@kodewerk.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <3EF8333A-5812-41DD-8180-4AE70F581161@kodewerk.com> Message-ID: <4F071E55.9050203@oracle.com> Kirk, Inline. On 01/06/2012 08:57 AM, Kirk Pepperdine wrote: > Tony, David, > > I really feel that this problem is being solved at the wrong level. The JVM lacks application semantics to know when to do the "right" thing. I absolutely agree. > So, IMHO, this should be managed by the application. I absolutely disagree. :-) First, let's define some terminology to make sure we're all on the same page: JVM : HotSpot library : the Java code that manages certain native resource (e.g., classes in the java.io package) - I do not consider this part of the JVM application : what the user writes which runs on top of HotSpot and uses java.io.File's. > For the same reason, there are other things that the application shouldn't touch like telling the JVM it's time to run a collection. Amen to that. :-) > That said, one thing the JVM might know about is how many file descriptors it's allowed and it could trigger an attempt to recover them (i.e. run finalization) once they start running low.. Indeed. But the JVM (as defined above) does not know anything about file descriptors. It's the library, in this case classes in java.io, that does. However, I don't think said library should also be calling System.gc() either. It should be providing information to the GC, via the API I have been suggesting, on how much of a certain resource we have and the GC should be making informed decisions on whether it when it should trigger a cycle. Tony > just as the JVM manages memory by recovering it when it runs low. That said, I did run a bench where we opened 1,000,000,000 sockets on a single VM. That is just about as many sockets as can be opened on a machine without recompiling the kernel to configure a bigger file descriptor bitmap. I'm quite happy that the JVM wasn't fighting me as I was trying to open all these sockets. While this bench was a bit extreme, it's clear that WebSocket gateways *will* stress file descriptor. > > Not sure about direct buffers though I do feel they should implement finalization as a fallback position. > > Cheers, > Kirk > > On 2012-01-06, at 12:14 PM, David Holmes wrote: > >> Hi Tony, >> >> On 6/01/2012 8:34 PM, Tony Printezis wrote: >>> If each library could indicate to the GC whether the resource it manages >>> is running low or not, using the API I mentioned, the GC could do the >>> above automatically, behind the scenes, without the user having to do >>> anything else. >> I'm not sure a library writer has the necessary information to do this. Seems to me that how an application uses a particular type determines the scarcity of the resource. I can imagine something like: >> >> void setFinalizationLimit(Class cls, int limit) >> >> so that GC runs finalization once a "reference count" reaches "limit". That limits the frequency of finalization, but the actual finalization cost may still be unacceptably high. >> >> Cheers, >> David >> >>> Tony >>> >>>> Afterall the key thing about GC is it relieves the programmer from >>>> having to manage object lifetimes, so if you don't know when the >>>> object is no longer used you don't know when to call close. >>>> >>>> David From vitalyd at gmail.com Fri Jan 6 08:32:07 2012 From: vitalyd at gmail.com (Vitaly Davidovich) Date: Fri, 6 Jan 2012 11:32:07 -0500 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F071E55.9050203@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <3EF8333A-5812-41DD-8180-4AE70F581161@kodewerk.com> <4F071E55.9050203@oracle.com> Message-ID: Tony, Interestingly the .NET CLR has an API to allow apps to tell the runtime(gc specifically) to be more aggressive - System.GC.AddMemoryPressure/RemoveMemoryPressure. The idea being that you'd call this when a managed object allocates native memory and thus adds overall pressure. This hint is then used by the gc. What are your thoughts on that? Granted this is more amenable for mem management and not other scarce resources. On Jan 6, 2012 11:17 AM, "Tony Printezis" wrote: > Kirk, > > Inline. > > On 01/06/2012 08:57 AM, Kirk Pepperdine wrote: > >> Tony, David, >> >> I really feel that this problem is being solved at the wrong level. The >> JVM lacks application semantics to know when to do the "right" thing. >> > > I absolutely agree. > > So, IMHO, this should be managed by the application. >> > > I absolutely disagree. :-) > > First, let's define some terminology to make sure we're all on the same > page: > > JVM : HotSpot > library : the Java code that manages certain native resource (e.g., > classes in the java.io package) - I do not consider this part of the JVM > application : what the user writes which runs on top of HotSpot and uses > java.io.File's. > > For the same reason, there are other things that the application >> shouldn't touch like telling the JVM it's time to run a collection. >> > > Amen to that. :-) > > That said, one thing the JVM might know about is how many file >> descriptors it's allowed and it could trigger an attempt to recover them >> (i.e. run finalization) once they start running low.. >> > > Indeed. But the JVM (as defined above) does not know anything about file > descriptors. It's the library, in this case classes in java.io, that > does. However, I don't think said library should also be calling > System.gc() either. It should be providing information to the GC, via the > API I have been suggesting, on how much of a certain resource we have and > the GC should be making informed decisions on whether it when it should > trigger a cycle. > > Tony > > just as the JVM manages memory by recovering it when it runs low. That >> said, I did run a bench where we opened 1,000,000,000 sockets on a single >> VM. That is just about as many sockets as can be opened on a machine >> without recompiling the kernel to configure a bigger file descriptor >> bitmap. I'm quite happy that the JVM wasn't fighting me as I was trying to >> open all these sockets. While this bench was a bit extreme, it's clear that >> WebSocket gateways *will* stress file descriptor. >> >> Not sure about direct buffers though I do feel they should implement >> finalization as a fallback position. >> >> Cheers, >> Kirk >> >> On 2012-01-06, at 12:14 PM, David Holmes wrote: >> >> Hi Tony, >>> >>> On 6/01/2012 8:34 PM, Tony Printezis wrote: >>> >>>> If each library could indicate to the GC whether the resource it manages >>>> is running low or not, using the API I mentioned, the GC could do the >>>> above automatically, behind the scenes, without the user having to do >>>> anything else. >>>> >>> I'm not sure a library writer has the necessary information to do this. >>> Seems to me that how an application uses a particular type determines the >>> scarcity of the resource. I can imagine something like: >>> >>> void setFinalizationLimit(Class cls, int limit) >>> >>> so that GC runs finalization once a "reference count" reaches "limit". >>> That limits the frequency of finalization, but the actual finalization cost >>> may still be unacceptably high. >>> >>> Cheers, >>> David >>> >>> Tony >>>> >>>> Afterall the key thing about GC is it relieves the programmer from >>>>> having to manage object lifetimes, so if you don't know when the >>>>> object is no longer used you don't know when to call close. >>>>> >>>>> David >>>>> >>>> -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120106/57cc47ed/attachment.html From tony.printezis at oracle.com Fri Jan 6 08:37:14 2012 From: tony.printezis at oracle.com (Tony Printezis) Date: Fri, 06 Jan 2012 11:37:14 -0500 Subject: JEP 132: More-prompt finalization In-Reply-To: References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <3EF8333A-5812-41DD-8180-4AE70F581161@kodewerk.com> <4F071E55.9050203@oracle.com> Message-ID: <4F07233A.5010001@oracle.com> Does AddMemoryPressure() take any parameters? Creating a new socket (out of a max of say 1,000,000) should have different weight to, say, creating a new file (out of a of say 50). Tony On 01/06/2012 11:32 AM, Vitaly Davidovich wrote: > > Tony, > > Interestingly the .NET CLR has an API to allow apps to tell the > runtime(gc specifically) to be more aggressive - > System.GC.AddMemoryPressure/RemoveMemoryPressure. The idea being that > you'd call this when a managed object allocates native memory and thus > adds overall pressure. This hint is then used by the gc. What are > your thoughts on that? Granted this is more amenable for mem > management and not other scarce resources. > > On Jan 6, 2012 11:17 AM, "Tony Printezis" > wrote: > > Kirk, > > Inline. > > On 01/06/2012 08:57 AM, Kirk Pepperdine wrote: > > Tony, David, > > I really feel that this problem is being solved at the wrong > level. The JVM lacks application semantics to know when to do > the "right" thing. > > > I absolutely agree. > > So, IMHO, this should be managed by the application. > > > I absolutely disagree. :-) > > First, let's define some terminology to make sure we're all on the > same page: > > JVM : HotSpot > library : the Java code that manages certain native resource > (e.g., classes in the java.io package) - I do not > consider this part of the JVM > application : what the user writes which runs on top of HotSpot > and uses java.io.File's. > > For the same reason, there are other things that the > application shouldn't touch like telling the JVM it's time to > run a collection. > > > Amen to that. :-) > > That said, one thing the JVM might know about is how many file > descriptors it's allowed and it could trigger an attempt to > recover them (i.e. run finalization) once they start running low.. > > > Indeed. But the JVM (as defined above) does not know anything > about file descriptors. It's the library, in this case classes in > java.io , that does. However, I don't think said > library should also be calling System.gc() either. It should be > providing information to the GC, via the API I have been > suggesting, on how much of a certain resource we have and the GC > should be making informed decisions on whether it when it should > trigger a cycle. > > Tony > > just as the JVM manages memory by recovering it when it runs > low. That said, I did run a bench where we opened > 1,000,000,000 sockets on a single VM. That is just about as > many sockets as can be opened on a machine without recompiling > the kernel to configure a bigger file descriptor bitmap. I'm > quite happy that the JVM wasn't fighting me as I was trying to > open all these sockets. While this bench was a bit extreme, > it's clear that WebSocket gateways *will* stress file descriptor. > > Not sure about direct buffers though I do feel they should > implement finalization as a fallback position. > > Cheers, > Kirk > > On 2012-01-06, at 12:14 PM, David Holmes wrote: > > Hi Tony, > > On 6/01/2012 8:34 PM, Tony Printezis wrote: > > If each library could indicate to the GC whether the > resource it manages > is running low or not, using the API I mentioned, the > GC could do the > above automatically, behind the scenes, without the > user having to do > anything else. > > I'm not sure a library writer has the necessary > information to do this. Seems to me that how an > application uses a particular type determines the scarcity > of the resource. I can imagine something like: > > void setFinalizationLimit(Class cls, int limit) > > so that GC runs finalization once a "reference count" > reaches "limit". That limits the frequency of > finalization, but the actual finalization cost may still > be unacceptably high. > > Cheers, > David > > Tony > > Afterall the key thing about GC is it relieves the > programmer from > having to manage object lifetimes, so if you don't > know when the > object is no longer used you don't know when to > call close. > > David > -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120106/5111f34c/attachment-0001.html From kirk at kodewerk.com Fri Jan 6 08:54:37 2012 From: kirk at kodewerk.com (Kirk Pepperdine) Date: Fri, 6 Jan 2012 17:54:37 +0100 Subject: JEP 132: More-prompt finalization In-Reply-To: References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <3EF8333A-5812-41DD-8180-4AE70F581161@kodewerk.com> <4F071E55.9050203@oracle.com> Message-ID: when it comes to performance, I'm not sure that the CLR is a good role model ;-) Regards, Kirk On 2012-01-06, at 5:32 PM, Vitaly Davidovich wrote: > Tony, > > Interestingly the .NET CLR has an API to allow apps to tell the runtime(gc specifically) to be more aggressive - System.GC.AddMemoryPressure/RemoveMemoryPressure. The idea being that you'd call this when a managed object allocates native memory and thus adds overall pressure. This hint is then used by the gc. What are your thoughts on that? Granted this is more amenable for mem management and not other scarce resources. > > On Jan 6, 2012 11:17 AM, "Tony Printezis" wrote: > Kirk, > > Inline. > > On 01/06/2012 08:57 AM, Kirk Pepperdine wrote: > Tony, David, > > I really feel that this problem is being solved at the wrong level. The JVM lacks application semantics to know when to do the "right" thing. > > I absolutely agree. > > So, IMHO, this should be managed by the application. > > I absolutely disagree. :-) > > First, let's define some terminology to make sure we're all on the same page: > > JVM : HotSpot > library : the Java code that manages certain native resource (e.g., classes in the java.io package) - I do not consider this part of the JVM > application : what the user writes which runs on top of HotSpot and uses java.io.File's. > > For the same reason, there are other things that the application shouldn't touch like telling the JVM it's time to run a collection. > > Amen to that. :-) > > That said, one thing the JVM might know about is how many file descriptors it's allowed and it could trigger an attempt to recover them (i.e. run finalization) once they start running low.. > > Indeed. But the JVM (as defined above) does not know anything about file descriptors. It's the library, in this case classes in java.io, that does. However, I don't think said library should also be calling System.gc() either. It should be providing information to the GC, via the API I have been suggesting, on how much of a certain resource we have and the GC should be making informed decisions on whether it when it should trigger a cycle. > > Tony > > just as the JVM manages memory by recovering it when it runs low. That said, I did run a bench where we opened 1,000,000,000 sockets on a single VM. That is just about as many sockets as can be opened on a machine without recompiling the kernel to configure a bigger file descriptor bitmap. I'm quite happy that the JVM wasn't fighting me as I was trying to open all these sockets. While this bench was a bit extreme, it's clear that WebSocket gateways *will* stress file descriptor. > > Not sure about direct buffers though I do feel they should implement finalization as a fallback position. > > Cheers, > Kirk > > On 2012-01-06, at 12:14 PM, David Holmes wrote: > > Hi Tony, > > On 6/01/2012 8:34 PM, Tony Printezis wrote: > If each library could indicate to the GC whether the resource it manages > is running low or not, using the API I mentioned, the GC could do the > above automatically, behind the scenes, without the user having to do > anything else. > I'm not sure a library writer has the necessary information to do this. Seems to me that how an application uses a particular type determines the scarcity of the resource. I can imagine something like: > > void setFinalizationLimit(Class cls, int limit) > > so that GC runs finalization once a "reference count" reaches "limit". That limits the frequency of finalization, but the actual finalization cost may still be unacceptably high. > > Cheers, > David > > Tony > > Afterall the key thing about GC is it relieves the programmer from > having to manage object lifetimes, so if you don't know when the > object is no longer used you don't know when to call close. > > David -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120106/b26e0c83/attachment.html From vitalyd at gmail.com Fri Jan 6 08:57:05 2012 From: vitalyd at gmail.com (Vitaly Davidovich) Date: Fri, 6 Jan 2012 11:57:05 -0500 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F07233A.5010001@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <3EF8333A-5812-41DD-8180-4AE70F581161@kodewerk.com> <4F071E55.9050203@oracle.com> <4F07233A.5010001@oracle.com> Message-ID: Yes it takes an int64. On Jan 6, 2012 11:37 AM, "Tony Printezis" wrote: > Does AddMemoryPressure() take any parameters? Creating a new socket (out > of a max of say 1,000,000) should have different weight to, say, creating a > new file (out of a of say 50). > > Tony > > On 01/06/2012 11:32 AM, Vitaly Davidovich wrote: > > Tony, > > Interestingly the .NET CLR has an API to allow apps to tell the > runtime(gc specifically) to be more aggressive - > System.GC.AddMemoryPressure/RemoveMemoryPressure. The idea being that > you'd call this when a managed object allocates native memory and thus adds > overall pressure. This hint is then used by the gc. What are your > thoughts on that? Granted this is more amenable for mem management and not > other scarce resources. > On Jan 6, 2012 11:17 AM, "Tony Printezis" > wrote: > >> Kirk, >> >> Inline. >> >> On 01/06/2012 08:57 AM, Kirk Pepperdine wrote: >> >>> Tony, David, >>> >>> I really feel that this problem is being solved at the wrong level. The >>> JVM lacks application semantics to know when to do the "right" thing. >>> >> >> I absolutely agree. >> >> So, IMHO, this should be managed by the application. >>> >> >> I absolutely disagree. :-) >> >> First, let's define some terminology to make sure we're all on the same >> page: >> >> JVM : HotSpot >> library : the Java code that manages certain native resource (e.g., >> classes in the java.io package) - I do not consider this part of the JVM >> application : what the user writes which runs on top of HotSpot and uses >> java.io.File's. >> >> For the same reason, there are other things that the application >>> shouldn't touch like telling the JVM it's time to run a collection. >>> >> >> Amen to that. :-) >> >> That said, one thing the JVM might know about is how many file >>> descriptors it's allowed and it could trigger an attempt to recover them >>> (i.e. run finalization) once they start running low.. >>> >> >> Indeed. But the JVM (as defined above) does not know anything about file >> descriptors. It's the library, in this case classes in java.io, that >> does. However, I don't think said library should also be calling >> System.gc() either. It should be providing information to the GC, via the >> API I have been suggesting, on how much of a certain resource we have and >> the GC should be making informed decisions on whether it when it should >> trigger a cycle. >> >> Tony >> >> just as the JVM manages memory by recovering it when it runs low. That >>> said, I did run a bench where we opened 1,000,000,000 sockets on a single >>> VM. That is just about as many sockets as can be opened on a machine >>> without recompiling the kernel to configure a bigger file descriptor >>> bitmap. I'm quite happy that the JVM wasn't fighting me as I was trying to >>> open all these sockets. While this bench was a bit extreme, it's clear that >>> WebSocket gateways *will* stress file descriptor. >>> >>> Not sure about direct buffers though I do feel they should implement >>> finalization as a fallback position. >>> >>> Cheers, >>> Kirk >>> >>> On 2012-01-06, at 12:14 PM, David Holmes wrote: >>> >>> Hi Tony, >>>> >>>> On 6/01/2012 8:34 PM, Tony Printezis wrote: >>>> >>>>> If each library could indicate to the GC whether the resource it >>>>> manages >>>>> is running low or not, using the API I mentioned, the GC could do the >>>>> above automatically, behind the scenes, without the user having to do >>>>> anything else. >>>>> >>>> I'm not sure a library writer has the necessary information to do this. >>>> Seems to me that how an application uses a particular type determines the >>>> scarcity of the resource. I can imagine something like: >>>> >>>> void setFinalizationLimit(Class cls, int limit) >>>> >>>> so that GC runs finalization once a "reference count" reaches "limit". >>>> That limits the frequency of finalization, but the actual finalization cost >>>> may still be unacceptably high. >>>> >>>> Cheers, >>>> David >>>> >>>> Tony >>>>> >>>>> Afterall the key thing about GC is it relieves the programmer from >>>>>> having to manage object lifetimes, so if you don't know when the >>>>>> object is no longer used you don't know when to call close. >>>>>> >>>>>> David >>>>>> >>>>> -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120106/f4cbd233/attachment.html From vitalyd at gmail.com Fri Jan 6 08:59:04 2012 From: vitalyd at gmail.com (Vitaly Davidovich) Date: Fri, 6 Jan 2012 11:59:04 -0500 Subject: JEP 132: More-prompt finalization In-Reply-To: References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <3EF8333A-5812-41DD-8180-4AE70F581161@kodewerk.com> <4F071E55.9050203@oracle.com>

Message-ID: Curious - why do you say this? I don't want to diverge this thread but that's a bold statement :). On Jan 6, 2012 11:54 AM, "Kirk Pepperdine" wrote: > when it comes to performance, I'm not sure that the CLR is a good role > model ;-) > > Regards, > Kirk > > On 2012-01-06, at 5:32 PM, Vitaly Davidovich wrote: > > Tony, > > Interestingly the .NET CLR has an API to allow apps to tell the > runtime(gc specifically) to be more aggressive - > System.GC.AddMemoryPressure/RemoveMemoryPressure. The idea being that > you'd call this when a managed object allocates native memory and thus adds > overall pressure. This hint is then used by the gc. What are your > thoughts on that? Granted this is more amenable for mem management and not > other scarce resources. > On Jan 6, 2012 11:17 AM, "Tony Printezis" > wrote: > >> Kirk, >> >> Inline. >> >> On 01/06/2012 08:57 AM, Kirk Pepperdine wrote: >> >>> Tony, David, >>> >>> I really feel that this problem is being solved at the wrong level. The >>> JVM lacks application semantics to know when to do the "right" thing. >>> >> >> I absolutely agree. >> >> So, IMHO, this should be managed by the application. >>> >> >> I absolutely disagree. :-) >> >> First, let's define some terminology to make sure we're all on the same >> page: >> >> JVM : HotSpot >> library : the Java code that manages certain native resource (e.g., >> classes in the java.io package) - I do not consider this part of the JVM >> application : what the user writes which runs on top of HotSpot and uses >> java.io.File's. >> >> For the same reason, there are other things that the application >>> shouldn't touch like telling the JVM it's time to run a collection. >>> >> >> Amen to that. :-) >> >> That said, one thing the JVM might know about is how many file >>> descriptors it's allowed and it could trigger an attempt to recover them >>> (i.e. run finalization) once they start running low.. >>> >> >> Indeed. But the JVM (as defined above) does not know anything about file >> descriptors. It's the library, in this case classes in java.io, that >> does. However, I don't think said library should also be calling >> System.gc() either. It should be providing information to the GC, via the >> API I have been suggesting, on how much of a certain resource we have and >> the GC should be making informed decisions on whether it when it should >> trigger a cycle. >> >> Tony >> >> just as the JVM manages memory by recovering it when it runs low. That >>> said, I did run a bench where we opened 1,000,000,000 sockets on a single >>> VM. That is just about as many sockets as can be opened on a machine >>> without recompiling the kernel to configure a bigger file descriptor >>> bitmap. I'm quite happy that the JVM wasn't fighting me as I was trying to >>> open all these sockets. While this bench was a bit extreme, it's clear that >>> WebSocket gateways *will* stress file descriptor. >>> >>> Not sure about direct buffers though I do feel they should implement >>> finalization as a fallback position. >>> >>> Cheers, >>> Kirk >>> >>> On 2012-01-06, at 12:14 PM, David Holmes wrote: >>> >>> Hi Tony, >>>> >>>> On 6/01/2012 8:34 PM, Tony Printezis wrote: >>>> >>>>> If each library could indicate to the GC whether the resource it >>>>> manages >>>>> is running low or not, using the API I mentioned, the GC could do the >>>>> above automatically, behind the scenes, without the user having to do >>>>> anything else. >>>>> >>>> I'm not sure a library writer has the necessary information to do this. >>>> Seems to me that how an application uses a particular type determines the >>>> scarcity of the resource. I can imagine something like: >>>> >>>> void setFinalizationLimit(Class cls, int limit) >>>> >>>> so that GC runs finalization once a "reference count" reaches "limit". >>>> That limits the frequency of finalization, but the actual finalization cost >>>> may still be unacceptably high. >>>> >>>> Cheers, >>>> David >>>> >>>> Tony >>>>> >>>>> Afterall the key thing about GC is it relieves the programmer from >>>>>> having to manage object lifetimes, so if you don't know when the >>>>>> object is no longer used you don't know when to call close. >>>>>> >>>>>> David >>>>>> >>>>> > -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120106/7b55777e/attachment-0001.html From kirk at kodewerk.com Fri Jan 6 09:02:14 2012 From: kirk at kodewerk.com (Kirk Pepperdine) Date: Fri, 6 Jan 2012 18:02:14 +0100 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F071E55.9050203@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <3EF8333A-5812-41DD-8180-4AE70F581161@kodewerk.com> <4F071E55.9050203@oracle.com> Message-ID: On 2012-01-06, at 5:16 PM, Tony Printezis wrote: > Kirk, > > Inline. > > On 01/06/2012 08:57 AM, Kirk Pepperdine wrote: >> Tony, David, >> >> I really feel that this problem is being solved at the wrong level. The JVM lacks application semantics to know when to do the "right" thing. > > I absolutely agree. > >> So, IMHO, this should be managed by the application. > > I absolutely disagree. :-) > > First, let's define some terminology to make sure we're all on the same page: > > JVM : HotSpot > library : the Java code that manages certain native resource (e.g., classes in the java.io package) - I do not consider this part of the JVM > application : what the user writes which runs on top of HotSpot and uses java.io.File's. Ok, so with these definitions I'd say that anything but the JVM should manage those things to which it doesn't have semantics knowledge. That would be the application or library (I bundled these together when I shouldn't have). > >> For the same reason, there are other things that the application shouldn't touch like telling the JVM it's time to run a collection. > > Amen to that. :-) > >> That said, one thing the JVM might know about is how many file descriptors it's allowed and it could trigger an attempt to recover them (i.e. run finalization) once they start running low.. > > Indeed. But the JVM (as defined above) does not know anything about file descriptors. It's the library, in this case classes in java.io, that does. However, I don't think said library should also be calling System.gc() either. It should be providing information to the GC, via the API I have been suggesting, on how much of a certain resource we have and the GC should be making informed decisions on whether it when it should trigger a cycle. Ok, again, my confused terminology? basically I don't see how the collector could make an informed decision on anything other than file descriptors. Would it have to track rates of consumption and desposal to build an understanding of when it should take action? Regards, Kirk From jon.masamitsu at oracle.com Fri Jan 6 10:16:37 2012 From: jon.masamitsu at oracle.com (jon.masamitsu at oracle.com) Date: Fri, 06 Jan 2012 18:16:37 +0000 Subject: hg: hsx/hotspot-main/hotspot: 3 new changesets Message-ID: <20120106181649.EF4B1478D1@hg.openjdk.java.net> Changeset: b6a04c79ccbc Author: stefank Date: 2012-01-02 10:01 +0100 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/b6a04c79ccbc 7125503: Compiling collectedHeap.cpp fails with -Werror=int-to-pointer-cast with g++ 4.6.1 Summary: Used uintptr_t and void* for all the casts and checks in test_is_in. Reviewed-by: tonyp, jmasa ! src/share/vm/gc_interface/collectedHeap.cpp Changeset: 4753e3dda3c8 Author: jmasa Date: 2012-01-04 07:56 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/4753e3dda3c8 Merge Changeset: 2ee4167627a3 Author: jmasa Date: 2012-01-05 21:02 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/2ee4167627a3 Merge From daniel.daugherty at oracle.com Fri Jan 6 19:47:45 2012 From: daniel.daugherty at oracle.com (daniel.daugherty at oracle.com) Date: Sat, 07 Jan 2012 03:47:45 +0000 Subject: hg: hsx/hotspot-main/hotspot: 7 new changesets Message-ID: <20120107034759.8E1ED478DD@hg.openjdk.java.net> Changeset: 7ab5f6318694 Author: phh Date: 2012-01-01 11:17 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/7ab5f6318694 7125934: Add a fast unordered timestamp capability to Hotspot on x86/x64 Summary: Add rdtsc detection and inline generation. Reviewed-by: kamg, dholmes Contributed-by: karen.kinnear at oracle.com ! src/cpu/x86/vm/vm_version_x86.cpp ! src/cpu/x86/vm/vm_version_x86.hpp ! src/os_cpu/bsd_x86/vm/os_bsd_x86.hpp + src/os_cpu/bsd_x86/vm/os_bsd_x86.inline.hpp ! src/os_cpu/linux_x86/vm/os_linux_x86.hpp + src/os_cpu/linux_x86/vm/os_linux_x86.inline.hpp ! src/os_cpu/solaris_x86/vm/os_solaris_x86.hpp + src/os_cpu/solaris_x86/vm/os_solaris_x86.inline.hpp ! src/os_cpu/solaris_x86/vm/solaris_x86_32.il ! src/os_cpu/solaris_x86/vm/solaris_x86_64.il ! src/os_cpu/windows_x86/vm/os_windows_x86.hpp + src/os_cpu/windows_x86/vm/os_windows_x86.inline.hpp ! src/share/vm/runtime/init.cpp ! src/share/vm/runtime/os.cpp ! src/share/vm/runtime/os.hpp + src/share/vm/runtime/os_ext.hpp Changeset: b16494a69d3d Author: phh Date: 2012-01-03 15:11 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/b16494a69d3d 7126185: Clean up lasterror handling, add os::get_last_error() Summary: Add os::get_last_error(), replace getLastErrorString() by os::lasterror() in os_windows.cpp. Reviewed-by: kamg, dholmes Contributed-by: erik.gahlin at oracle.com ! src/os/posix/vm/os_posix.cpp ! src/os/windows/vm/os_windows.cpp ! src/share/vm/runtime/os.hpp Changeset: 5b58979183f9 Author: dcubed Date: 2012-01-05 06:24 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/5b58979183f9 7127032: fix for 7122253 adds a JvmtiThreadState earlier than necessary Summary: Use JavaThread::jvmti_thread_state() instead of JvmtiThreadState::state_for(). Reviewed-by: coleenp, poonam, acorn ! src/share/vm/classfile/classFileParser.cpp Changeset: 8a63c6323842 Author: fparain Date: 2012-01-05 07:26 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/8a63c6323842 7125594: C-heap growth issue in ThreadService::find_deadlocks_at_safepoint Reviewed-by: sspitsyn, dcubed, mchung, dholmes ! src/share/vm/services/threadService.cpp Changeset: 2e0ef19fc891 Author: phh Date: 2012-01-05 17:14 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/2e0ef19fc891 7126480: Make JVM start time in milliseconds since the Java epoch available Summary: Expose existing Management::_begin_vm_creation_time via new accessor Management::begin_vm_creation_time(). Reviewed-by: acorn, dcubed ! src/share/vm/services/management.hpp Changeset: 66259eca2bf7 Author: phh Date: 2012-01-05 17:16 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/66259eca2bf7 Merge Changeset: 2b3acb34791f Author: dcubed Date: 2012-01-06 16:18 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/2b3acb34791f Merge ! src/os/windows/vm/os_windows.cpp ! src/share/vm/classfile/classFileParser.cpp ! src/share/vm/runtime/os.hpp From vladimir.kozlov at oracle.com Fri Jan 6 22:35:41 2012 From: vladimir.kozlov at oracle.com (vladimir.kozlov at oracle.com) Date: Sat, 07 Jan 2012 06:35:41 +0000 Subject: hg: hsx/hotspot-main/hotspot: 16 new changesets Message-ID: <20120107063612.AD879478DE@hg.openjdk.java.net> Changeset: abcceac2f7cd Author: iveresov Date: 2011-12-12 12:44 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/abcceac2f7cd 7119730: Tiered: SIGSEGV in AdvancedThresholdPolicy::is_method_profiled(methodOop) Summary: Added handles for references to methods in select_task() Reviewed-by: twisti, kvn ! src/share/vm/runtime/advancedThresholdPolicy.cpp Changeset: 7bca37d28f32 Author: roland Date: 2011-12-13 10:54 +0100 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/7bca37d28f32 7114106: C1: assert(goto_state->is_same(sux_state)) failed: states must match now Summary: fix C1's CEE to take inlining into account when the stacks in states are compared. Reviewed-by: iveresov, never ! src/share/vm/c1/c1_Optimizer.cpp Changeset: d725f0affb1a Author: iveresov Date: 2011-12-13 17:10 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/d725f0affb1a 7121111: -server -Xcomp -XX:+TieredCompilation does not invoke C2 compiler Summary: Exercise C2 more in tiered mode with Xcomp Reviewed-by: kvn, never ! src/share/vm/runtime/arguments.cpp Changeset: 127b3692c168 Author: kvn Date: 2011-12-14 14:54 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/127b3692c168 7116452: Add support for AVX instructions Summary: Added support for AVX extension to the x86 instruction set. Reviewed-by: never ! src/cpu/x86/vm/assembler_x86.cpp ! src/cpu/x86/vm/assembler_x86.hpp ! src/cpu/x86/vm/assembler_x86.inline.hpp ! src/cpu/x86/vm/nativeInst_x86.cpp ! src/cpu/x86/vm/nativeInst_x86.hpp ! src/cpu/x86/vm/register_definitions_x86.cpp ! src/cpu/x86/vm/vm_version_x86.cpp ! src/cpu/x86/vm/vm_version_x86.hpp ! src/cpu/x86/vm/x86_32.ad ! src/cpu/x86/vm/x86_64.ad ! src/share/vm/runtime/globals.hpp Changeset: 669f6a7d5b70 Author: never Date: 2011-12-19 14:16 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/669f6a7d5b70 7121073: secondary_super_cache memory slice has incorrect bounds in flatten_alias_type Reviewed-by: kvn ! src/share/vm/opto/compile.cpp Changeset: 65149e74c706 Author: kvn Date: 2011-12-20 00:55 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/65149e74c706 7121648: Use 3-operands SIMD instructions on x86 with AVX Summary: Use 3-operands SIMD instructions in C2 generated code for machines with AVX. Reviewed-by: never ! make/bsd/makefiles/adlc.make ! make/linux/makefiles/adlc.make ! make/solaris/makefiles/adlc.make ! make/windows/makefiles/adlc.make ! src/cpu/x86/vm/assembler_x86.cpp ! src/cpu/x86/vm/assembler_x86.hpp + src/cpu/x86/vm/x86.ad ! src/cpu/x86/vm/x86_32.ad ! src/cpu/x86/vm/x86_64.ad ! src/share/vm/opto/matcher.cpp Changeset: 069ab3f976d3 Author: stefank Date: 2011-12-07 11:35 +0100 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/069ab3f976d3 7118863: Move sizeof(klassOopDesc) into the *Klass::*_offset_in_bytes() functions Summary: Moved sizeof(klassOopDesc), changed the return type to ByteSize and removed the _in_bytes suffix. Reviewed-by: never, bdelsart, coleenp, jrose ! src/cpu/sparc/vm/assembler_sparc.cpp ! src/cpu/sparc/vm/c1_CodeStubs_sparc.cpp ! src/cpu/sparc/vm/c1_LIRAssembler_sparc.cpp ! src/cpu/sparc/vm/c1_MacroAssembler_sparc.cpp ! src/cpu/sparc/vm/c1_Runtime1_sparc.cpp ! src/cpu/sparc/vm/cppInterpreter_sparc.cpp ! src/cpu/sparc/vm/methodHandles_sparc.cpp ! src/cpu/sparc/vm/stubGenerator_sparc.cpp ! src/cpu/sparc/vm/templateInterpreter_sparc.cpp ! src/cpu/sparc/vm/templateTable_sparc.cpp ! src/cpu/x86/vm/assembler_x86.cpp ! src/cpu/x86/vm/c1_CodeStubs_x86.cpp ! src/cpu/x86/vm/c1_LIRAssembler_x86.cpp ! src/cpu/x86/vm/c1_MacroAssembler_x86.cpp ! src/cpu/x86/vm/c1_Runtime1_x86.cpp ! src/cpu/x86/vm/cppInterpreter_x86.cpp ! src/cpu/x86/vm/methodHandles_x86.cpp ! src/cpu/x86/vm/stubGenerator_x86_32.cpp ! src/cpu/x86/vm/stubGenerator_x86_64.cpp ! src/cpu/x86/vm/templateInterpreter_x86_32.cpp ! src/cpu/x86/vm/templateInterpreter_x86_64.cpp ! src/cpu/x86/vm/templateTable_x86_32.cpp ! src/cpu/x86/vm/templateTable_x86_64.cpp ! src/cpu/x86/vm/x86_64.ad ! src/share/vm/c1/c1_LIRGenerator.cpp ! src/share/vm/oops/arrayKlass.hpp ! src/share/vm/oops/instanceKlass.hpp ! src/share/vm/oops/klass.cpp ! src/share/vm/oops/klass.hpp ! src/share/vm/oops/klassOop.hpp ! src/share/vm/oops/objArrayKlass.hpp ! src/share/vm/opto/compile.cpp ! src/share/vm/opto/graphKit.cpp ! src/share/vm/opto/library_call.cpp ! src/share/vm/opto/macro.cpp ! src/share/vm/opto/memnode.cpp ! src/share/vm/opto/parse1.cpp ! src/share/vm/opto/parseHelper.cpp ! src/share/vm/shark/sharkIntrinsics.cpp ! src/share/vm/shark/sharkTopLevelBlock.cpp Changeset: 1dc233a8c7fe Author: roland Date: 2011-12-20 16:56 +0100 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/1dc233a8c7fe 7121140: Allocation paths require explicit memory synchronization operations for RMO systems Summary: adds store store barrier after initialization of header and body of objects. Reviewed-by: never, kvn ! src/cpu/sparc/vm/sparc.ad ! src/cpu/x86/vm/x86_32.ad ! src/cpu/x86/vm/x86_64.ad ! src/share/vm/adlc/formssel.cpp ! src/share/vm/opto/callnode.hpp ! src/share/vm/opto/classes.hpp ! src/share/vm/opto/escape.cpp ! src/share/vm/opto/graphKit.cpp ! src/share/vm/opto/library_call.cpp ! src/share/vm/opto/macro.cpp ! src/share/vm/opto/memnode.cpp ! src/share/vm/opto/memnode.hpp ! src/share/vm/opto/node.hpp Changeset: e5ac210043cd Author: roland Date: 2011-12-22 10:55 +0100 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/e5ac210043cd 7123108: C1: assert(if_state != NULL) failed: states do not match up Summary: In CEE, ensure if and common successor state are at the same inline level Reviewed-by: never ! src/share/vm/c1/c1_Optimizer.cpp + test/compiler/7123108/Test7123108.java Changeset: b642b49f9738 Author: roland Date: 2011-12-23 09:36 +0100 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/b642b49f9738 7123253: C1: in store check code, usage of registers may be incorrect Summary: fix usage of input register in assembly code for store check. Reviewed-by: never ! src/share/vm/c1/c1_LIR.cpp Changeset: 40c2484c09e1 Author: kvn Date: 2011-12-23 15:24 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/40c2484c09e1 7110832: ctw/.../org_apache_avalon_composition_util_StringHelper crashes the VM Summary: Distance is too large for one short branch in string_indexofC8(). Reviewed-by: iveresov ! src/cpu/x86/vm/assembler_x86.cpp ! src/share/vm/asm/assembler.cpp ! src/share/vm/asm/assembler.hpp Changeset: d12a66fa3820 Author: kvn Date: 2011-12-27 15:08 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/d12a66fa3820 7123954: Some CTW test crash with SIGSEGV Summary: Correct Allocate expansion code to preserve i_o when only slow call is generated. Reviewed-by: iveresov ! src/share/vm/opto/compile.cpp ! src/share/vm/opto/macro.cpp Changeset: 8940fd98d540 Author: kvn Date: 2011-12-29 11:37 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/8940fd98d540 Merge ! src/cpu/x86/vm/assembler_x86.cpp ! src/share/vm/runtime/globals.hpp Changeset: 9c87bcb3b4dd Author: kvn Date: 2011-12-30 11:43 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/9c87bcb3b4dd 7125879: assert(proj != NULL) failed: must be found Summary: Leave i_o attached to slow allocation call when there are no i_o users after the call. Reviewed-by: iveresov, twisti ! src/share/vm/opto/macro.cpp + test/compiler/7125879/Test7125879.java Changeset: 1cb50d7a9d95 Author: iveresov Date: 2012-01-05 17:25 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/1cb50d7a9d95 7119294: Two command line options cause JVM to crash Summary: Setup thread register in MacroAssembler::incr_allocated_bytes() on x64 Reviewed-by: kvn ! src/cpu/x86/vm/assembler_x86.cpp Changeset: 22cee0ee8927 Author: kvn Date: 2012-01-06 20:09 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/22cee0ee8927 Merge ! src/cpu/sparc/vm/c1_LIRAssembler_sparc.cpp ! src/cpu/sparc/vm/c1_Runtime1_sparc.cpp ! src/cpu/sparc/vm/stubGenerator_sparc.cpp ! src/cpu/sparc/vm/templateInterpreter_sparc.cpp ! src/cpu/sparc/vm/templateTable_sparc.cpp ! src/cpu/x86/vm/c1_LIRAssembler_x86.cpp ! src/cpu/x86/vm/c1_Runtime1_x86.cpp ! src/cpu/x86/vm/stubGenerator_x86_32.cpp ! src/cpu/x86/vm/stubGenerator_x86_64.cpp ! src/cpu/x86/vm/templateInterpreter_x86_32.cpp ! src/cpu/x86/vm/templateInterpreter_x86_64.cpp ! src/cpu/x86/vm/templateTable_x86_32.cpp ! src/cpu/x86/vm/templateTable_x86_64.cpp ! src/cpu/x86/vm/vm_version_x86.cpp ! src/cpu/x86/vm/vm_version_x86.hpp ! src/share/vm/oops/instanceKlass.hpp ! src/share/vm/opto/library_call.cpp ! src/share/vm/opto/parseHelper.cpp From james.melvin at oracle.com Sat Jan 7 08:38:46 2012 From: james.melvin at oracle.com (James Melvin) Date: Sat, 07 Jan 2012 11:38:46 -0500 Subject: RFR (S): 7125793: MAC: test_gamma should always work In-Reply-To: <4F034B51.3070609@oracle.com> References: <4EFECA5D.6010905@oracle.com> <4F034B51.3070609@oracle.com> Message-ID: <4F087516.40505@oracle.com> Hi Dan, Finally getting back on the trail to fix the gamma launcher. Sorry for the delayed response. Thanks for the review, Dan and David. Replies inline... On 1/3/12 1:39 PM, Daniel D. Daugherty wrote: >> http://cr.openjdk.java.net/~jmelvin/7125793/webrev.00 > > Jim, > > Thanks for diving in and improving the MacOS X port! > Comments below. > > Dan > > > make/bsd/makefiles/buildtree.make > > line 422: The new 'java -fullversion' invocation does not include > the $(JAVA_FLAG) option like the old code did. Any particular > reason for the change? > > Looks like that means the '-d32' or '-d64' options won't be > specified as they were before. Originally, this no longer made sense as both -d32 and -d64 were mapped to 64-bit. After further review, I'm going to readd this option in case we ever change our minds and decide to support both 32 and 64-bit JVMs on Mac OS X. > > line 447: Why not just echo FULL_VERSION? Why pipe to awk? To preserve the original script output, I needed to trim the extra newline... 1 from $FULLVERSION and 1 from echo. > > line 465: The 'jre/lib/libjava.dylib' part of the new check is > MacOS X specific. Other BSDs don't necessarily use the > '.dylib' extension (instead of .so) and I don't think that > other BSDs have dropped the "arch" subdir. To be more friendly to other BSDs, I've added a $(LIBARCH) subdir check and $(LIBRARY_SUFFIX) instead of hardcoded .dylib. However, I really don't have a way of testing this for those other BSDs. > > line 484: The DYLD_LIBRARY_PATH part is MacOS X specific. Will > still need to set LD_LIBRARY_PATH for other BSDs. Also, a good point. I've re-added LD_LIBRARY_PATH with it's original setting. > > line 492: You switched from $(TESTFLAGS) to literal flag values, > but you left the TESTFLAGS variable around. Any reason for > the switch? Nice find. Cut-paste overwrite. Fixed by restoring $(TESTFLAGS). > > > make/bsd/makefiles/launcher.make > Please add a comment explaining why '-framework CoreFoundation' > is needed. Your explanatory block below is a really good start. Done. > > > make/bsd/makefiles/vm.make > No comments. > > > src/os/bsd/vm/os_bsd.cpp > line 2585: Uses a suffix of ".so". That shouldn't work on MacOS X > since MacOS X uses '.dylib'. That's OK for other BSDs, but not > MacOS X. Also the comments that mention '.so' should be updated > to include '.dylib' (not caused by your changes). I've replaced .so with $JNI_LIB_SUFFIX defined earlier in the source. In the area comments, I've just dropped .so extension altogether to cheaply ambiguate. > > To David H. - Yes, this change added another '#fdef __APPLE__'. It > is not the first and it likely won't be the last since we're > not done yet with the MacOS X port. There are a number of > things that need to be cleaned up and we're tracking them. > However, as you know, we don't have enough folks to handle all > of the work so we'll just have to live with the warts for now. For this particular change to fix gamma, I've managed to resolve David's concerns by adding support for no-arch paths in the code rather than using #ifdefs. However, ifdefs are sprinkled everywhere and this will need to be resolved whenever we reconcile the unix platforms into a more common codebase. > > src/os/posix/launcher/java_md.c > No comments. > Thanks for the review comments. I've also added a 1 line change in make/bsd/makefiles/defs.make to fix a build warning around duplicate targets for Xusage.txt due to a variable expansion. This has already been resolved for other platforms. Changes included in new webrev. More feedback welcome. WEBREV: http://cr.openjdk.java.net/~jmelvin/7125793/webrev.01 TESTS RUN: JPRT 2012-01-07-064433.jmelvin.7125793 local Mac OS X builds/tests - Jim > > On 12/31/11 1:39 AM, James Melvin wrote: >> Hi, >> >> This change fixes the 'gamma' simple launcher for HotSpot on Mac OS X. >> There were 3 primary changes required to re-enable gamma... >> >> 1) Statically link with CoreFoundation framework to resolve symbols >> >> The gamma launcher dlopen()s libjava.dylib from $JAVA_HOME/jre/lib. >> Because Mac OS X files are case-insensitive by default, we collide on >> $FRAMEWORK/libJPEG.dylib and ${JAVA_HOME}/jre/lib/libjpeg.dylib. This >> resulted in unresolved symbols in the Mac OS X framework libraries. The >> solution for gamma was to statically link with CoreFoundation framework >> to properly resolve framework symbols and allow gamma to successfully >> dlopen() libjava.dylib. >> >> 2) Adjust various paths to reflect no arch subdirs >> >> On Mac OS X, there are no arch subdirs, e.g jre/lib vs jre/lib/. >> Instead, one can use universal binaries to package multiple >> architectures in a single binary. At the moment though, we are only >> building 64-bit non-universal binaries. Note, the test_gamma script >> assumes an Oracle JDK layout for JAVA_HOME, derived from ALT_BOOTDIR. >> Using an Apple JDK for ALT_BOOTDIR will fail the test_gamma script >> gracefully, as libjava.dylib is in a different, unexpected place. >> >> 3) Modify test_gamma script to set library path only for gamma launch >> >> Setting DYLD_LIBRARY_PATH adversely affects the real java launcher(s). >> Instead, set this later in the script only for the gamma launcher test >> run. While in there, I took the liberty of decrypting the script to make >> it more maintainable and more easily merged whenever we reconcile the >> unix ports into a single codebase. There is no change to the script >> output. >> >> Feedback welcome... >> >> WEBREV: >> http://cr.openjdk.java.net/~jmelvin/7125793/webrev.00 >> >> TESTS RUN: >> JPRT 2011-12-31-061123.jmelvin.7125793 >> local Mac OS X builds/tests >> >> >> Thanks and Happy New Year! >> >> Jim From david.holmes at oracle.com Sun Jan 8 15:23:38 2012 From: david.holmes at oracle.com (David Holmes) Date: Mon, 09 Jan 2012 09:23:38 +1000 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F071C4B.8010002@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <4F071C4B.8010002@oracle.com> Message-ID: <4F0A257A.50104@oracle.com> On 7/01/2012 2:07 AM, Tony Printezis wrote: > David, > > On 01/06/2012 06:14 AM, David Holmes wrote: >> Hi Tony, >> >> On 6/01/2012 8:34 PM, Tony Printezis wrote: >>> If each library could indicate to the GC whether the resource it manages >>> is running low or not, using the API I mentioned, the GC could do the >>> above automatically, behind the scenes, without the user having to do >>> anything else. >> >> I'm not sure a library writer has the necessary information to do >> this. Seems to me that how an application uses a particular type >> determines the scarcity of the resource. > > I will be very surprised if the majority of application developers will > be willing to measure at what rate the application consumes certain > resources and update said measure when the application changes, load > increases, OS changes, hardware changes, etc. And, sure, I think there > will be many cases where the library writer has a good idea / can find > out the max number of instances of a particular resource (file > descriptors, sockets, etc.). Perhaps, but the library writer may only have partial knowledge. The library can use reference counts to track how much of a resource it has handed out, and what has been handed back. It might know what the absolute limit for a resource is (via getrlimit etc). But can it know the absolute usage rate of a given resource? Can you query how many available file descriptors a process has left? Some resources will be used by native code outside the libs (including the VM). So at best this is a heuristic, so the library tracks the resource and at some threshold it invoke System.runFinalization. The GC itself doesn't make an informed decision because, as you said your self, the VM (and hence GC) knows nothing about the resource being tracked. But would we want to burden all users of these classes with the overhead of resource tracking? Cheers, David > Tony > >> I can imagine something like: >> >> void setFinalizationLimit(Class cls, int limit) >> >> so that GC runs finalization once a "reference count" reaches "limit". >> That limits the frequency of finalization, but the actual finalization >> cost may still be unacceptably high. >> >> Cheers, >> David >> >>> Tony >>> >>>> Afterall the key thing about GC is it relieves the programmer from >>>> having to manage object lifetimes, so if you don't know when the >>>> object is no longer used you don't know when to call close. >>>> >>>> David From david.holmes at oracle.com Sun Jan 8 15:51:19 2012 From: david.holmes at oracle.com (David Holmes) Date: Mon, 09 Jan 2012 09:51:19 +1000 Subject: RFR (S): 7125793: MAC: test_gamma should always work In-Reply-To: <4F087516.40505@oracle.com> References: <4EFECA5D.6010905@oracle.com> <4F034B51.3070609@oracle.com> <4F087516.40505@oracle.com> Message-ID: <4F0A2BF7.8060903@oracle.com> Thanks Jim, I'm much happier now. David ----- On 8/01/2012 2:38 AM, James Melvin wrote: > Hi Dan, > > Finally getting back on the trail to fix the gamma launcher. Sorry for > the delayed response. Thanks for the review, Dan and David. Replies > inline... > > > On 1/3/12 1:39 PM, Daniel D. Daugherty wrote: >>> http://cr.openjdk.java.net/~jmelvin/7125793/webrev.00 >> >> Jim, >> >> Thanks for diving in and improving the MacOS X port! >> Comments below. >> >> Dan >> >> >> make/bsd/makefiles/buildtree.make >> >> line 422: The new 'java -fullversion' invocation does not include >> the $(JAVA_FLAG) option like the old code did. Any particular >> reason for the change? >> >> Looks like that means the '-d32' or '-d64' options won't be >> specified as they were before. > > Originally, this no longer made sense as both -d32 and -d64 were mapped > to 64-bit. After further review, I'm going to readd this option in case > we ever change our minds and decide to support both 32 and 64-bit JVMs > on Mac OS X. > > >> >> line 447: Why not just echo FULL_VERSION? Why pipe to awk? > > To preserve the original script output, I needed to trim the extra > newline... 1 from $FULLVERSION and 1 from echo. > >> >> line 465: The 'jre/lib/libjava.dylib' part of the new check is >> MacOS X specific. Other BSDs don't necessarily use the >> '.dylib' extension (instead of .so) and I don't think that >> other BSDs have dropped the "arch" subdir. > > To be more friendly to other BSDs, I've added a $(LIBARCH) subdir check > and $(LIBRARY_SUFFIX) instead of hardcoded .dylib. However, I really > don't have a way of testing this for those other BSDs. > >> >> line 484: The DYLD_LIBRARY_PATH part is MacOS X specific. Will >> still need to set LD_LIBRARY_PATH for other BSDs. > > Also, a good point. I've re-added LD_LIBRARY_PATH with it's original > setting. > >> >> line 492: You switched from $(TESTFLAGS) to literal flag values, >> but you left the TESTFLAGS variable around. Any reason for >> the switch? > > Nice find. Cut-paste overwrite. Fixed by restoring $(TESTFLAGS). > >> >> >> make/bsd/makefiles/launcher.make >> Please add a comment explaining why '-framework CoreFoundation' >> is needed. Your explanatory block below is a really good start. > > Done. > >> >> >> make/bsd/makefiles/vm.make >> No comments. >> >> >> src/os/bsd/vm/os_bsd.cpp >> line 2585: Uses a suffix of ".so". That shouldn't work on MacOS X >> since MacOS X uses '.dylib'. That's OK for other BSDs, but not >> MacOS X. Also the comments that mention '.so' should be updated >> to include '.dylib' (not caused by your changes). > > I've replaced .so with $JNI_LIB_SUFFIX defined earlier in the source. > In the area comments, I've just dropped .so extension altogether to > cheaply ambiguate. > >> >> To David H. - Yes, this change added another '#fdef __APPLE__'. It >> is not the first and it likely won't be the last since we're >> not done yet with the MacOS X port. There are a number of >> things that need to be cleaned up and we're tracking them. >> However, as you know, we don't have enough folks to handle all >> of the work so we'll just have to live with the warts for now. > > For this particular change to fix gamma, I've managed to resolve David's > concerns by adding support for no-arch paths in the code rather than > using #ifdefs. However, ifdefs are sprinkled everywhere and this will > need to be resolved whenever we reconcile the unix platforms into a more > common codebase. > > >> >> src/os/posix/launcher/java_md.c >> No comments. >> > > Thanks for the review comments. I've also added a 1 line change in > make/bsd/makefiles/defs.make to fix a build warning around duplicate > targets for Xusage.txt due to a variable expansion. This has already > been resolved for other platforms. > > > Changes included in new webrev. More feedback welcome. > > WEBREV: > http://cr.openjdk.java.net/~jmelvin/7125793/webrev.01 > > TESTS RUN: > JPRT 2012-01-07-064433.jmelvin.7125793 > local Mac OS X builds/tests > > > - Jim > > >> >> On 12/31/11 1:39 AM, James Melvin wrote: >>> Hi, >>> >>> This change fixes the 'gamma' simple launcher for HotSpot on Mac OS X. >>> There were 3 primary changes required to re-enable gamma... >>> >>> 1) Statically link with CoreFoundation framework to resolve symbols >>> >>> The gamma launcher dlopen()s libjava.dylib from $JAVA_HOME/jre/lib. >>> Because Mac OS X files are case-insensitive by default, we collide on >>> $FRAMEWORK/libJPEG.dylib and ${JAVA_HOME}/jre/lib/libjpeg.dylib. This >>> resulted in unresolved symbols in the Mac OS X framework libraries. The >>> solution for gamma was to statically link with CoreFoundation framework >>> to properly resolve framework symbols and allow gamma to successfully >>> dlopen() libjava.dylib. >>> >>> 2) Adjust various paths to reflect no arch subdirs >>> >>> On Mac OS X, there are no arch subdirs, e.g jre/lib vs jre/lib/. >>> Instead, one can use universal binaries to package multiple >>> architectures in a single binary. At the moment though, we are only >>> building 64-bit non-universal binaries. Note, the test_gamma script >>> assumes an Oracle JDK layout for JAVA_HOME, derived from ALT_BOOTDIR. >>> Using an Apple JDK for ALT_BOOTDIR will fail the test_gamma script >>> gracefully, as libjava.dylib is in a different, unexpected place. >>> >>> 3) Modify test_gamma script to set library path only for gamma launch >>> >>> Setting DYLD_LIBRARY_PATH adversely affects the real java launcher(s). >>> Instead, set this later in the script only for the gamma launcher test >>> run. While in there, I took the liberty of decrypting the script to make >>> it more maintainable and more easily merged whenever we reconcile the >>> unix ports into a single codebase. There is no change to the script >>> output. >>> >>> Feedback welcome... >>> >>> WEBREV: >>> http://cr.openjdk.java.net/~jmelvin/7125793/webrev.00 >>> >>> TESTS RUN: >>> JPRT 2011-12-31-061123.jmelvin.7125793 >>> local Mac OS X builds/tests >>> >>> >>> Thanks and Happy New Year! >>> >>> Jim From ysr1729 at gmail.com Sun Jan 8 16:08:14 2012 From: ysr1729 at gmail.com (Srinivas Ramakrishna) Date: Sun, 8 Jan 2012 16:08:14 -0800 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F0A257A.50104@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <4F071C4B.8010002@oracle.com> <4F0A257A.50104@oracle.com> Message-ID: Hi David -- On Sun, Jan 8, 2012 at 3:23 PM, David Holmes wrote: > On 7/01/2012 2:07 AM, Tony Printezis wrote: > >> David, >> >> On 01/06/2012 06:14 AM, David Holmes wrote: >> >>> Hi Tony, >>> >>> On 6/01/2012 8:34 PM, Tony Printezis wrote: >>> >>>> If each library could indicate to the GC whether the resource it manages >>>> is running low or not, using the API I mentioned, the GC could do the >>>> above automatically, behind the scenes, without the user having to do >>>> anything else. >>>> >>> >>> I'm not sure a library writer has the necessary information to do >>> this. Seems to me that how an application uses a particular type >>> determines the scarcity of the resource. >>> >> >> I will be very surprised if the majority of application developers will >> be willing to measure at what rate the application consumes certain >> resources and update said measure when the application changes, load >> increases, OS changes, hardware changes, etc. And, sure, I think there >> will be many cases where the library writer has a good idea / can find >> out the max number of instances of a particular resource (file >> descriptors, sockets, etc.). >> > > Perhaps, but the library writer may only have partial knowledge. The > library can use reference counts to track how much of a resource it has > handed out, and what has been handed back. It might know what the absolute > limit for a resource is (via getrlimit etc). But can it know the absolute > usage rate of a given resource? Can you query how many available file > descriptors a process has left? Some resources will be used by native code > outside the libs (including the VM). > The VM can probably keep track of its own descriptors, but you are right that pure native code may make the final calculation noisy unless the underlying platform provides suitable query API's. Rates are easy (modulo the noise in numbers) given that the VM/libs can keep track of the numbers it understands. Don't think of it as (this is what the libraries cab do abd this is what the VM can do, but rather as, this is what the VM+libs can provide to the (pure) Java application. > > So at best this is a heuristic, so the library tracks the resource and at > some threshold it invoke System.runFinalization. The GC itself doesn't make > an informed decision because, as you said your self, the VM (and hence GC) > knows nothing about the resource being tracked. > Yes, it would be a heuristic-driven policy (or set of policy choices), but it's better than each application rolling its own policy and infrastructure from scratch. > > But would we want to burden all users of these classes with the overhead > of resource tracking? > The objective is to do the resource tracking from within the libraries (+JVM), so the user wouldn't have to bother. For example, by providing suitable high level API's at the library level which would be wrappers around specific native resources that would be subject to such tracking. -- ramki > > Cheers, > David > > > Tony >> >> I can imagine something like: >>> >>> void setFinalizationLimit(Class cls, int limit) >>> >>> so that GC runs finalization once a "reference count" reaches "limit". >>> That limits the frequency of finalization, but the actual finalization >>> cost may still be unacceptably high. >>> >>> Cheers, >>> David >>> >>> Tony >>>> >>>> Afterall the key thing about GC is it relieves the programmer from >>>>> having to manage object lifetimes, so if you don't know when the >>>>> object is no longer used you don't know when to call close. >>>>> >>>>> David >>>>> >>>> -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120108/cbbc302d/attachment.html From david.holmes at oracle.com Sun Jan 8 16:14:51 2012 From: david.holmes at oracle.com (David Holmes) Date: Mon, 09 Jan 2012 10:14:51 +1000 Subject: JEP 132: More-prompt finalization In-Reply-To: References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <4F071C4B.8010002@oracle.com> <4F0A257A.50104@oracle.com> Message-ID: <4F0A317B.6000702@oracle.com> Hi Ramki, On 9/01/2012 10:08 AM, Srinivas Ramakrishna wrote: > So at best this is a heuristic, so the library tracks the resource > and at some threshold it invoke System.runFinalization. The GC > itself doesn't make an informed decision because, as you said your > self, the VM (and hence GC) knows nothing about the resource being > tracked. > > > Yes, it would be a heuristic-driven policy (or set of policy choices), > but it's better than each application rolling its own policy and infrastructure from scratch. If it can be provided as an additional feature that applications only pay for if they need it. > But would we want to burden all users of these classes with the > overhead of resource tracking? > > > The objective is to do the resource tracking from within the libraries > (+JVM), so the user wouldn't have to bother. > For example, by providing suitable high level API's at the library level > which would be wrappers > around specific native resources that would be subject to such tracking. That's exactly the overhead I'm referring to. If these resources are always managed then all applications pay the price for the few that need it. David ----- > -- ramki > > > Cheers, > David > > > Tony > > I can imagine something like: > > void setFinalizationLimit(Class cls, int limit) > > so that GC runs finalization once a "reference count" > reaches "limit". > That limits the frequency of finalization, but the actual > finalization > cost may still be unacceptably high. > > Cheers, > David > > Tony > > Afterall the key thing about GC is it relieves the > programmer from > having to manage object lifetimes, so if you don't > know when the > object is no longer used you don't know when to call > close. > > David > > From kirk at kodewerk.com Sun Jan 8 22:31:11 2012 From: kirk at kodewerk.com (Kirk Pepperdine) Date: Mon, 9 Jan 2012 07:31:11 +0100 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F0A257A.50104@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <4F071C4B.8010002@oracle.com> <4F0A257A.50104@oracle.com> Message-ID: >> > > Perhaps, but the library writer may only have partial knowledge. The library can use reference counts to track how much of a resource it has handed out, and what has been handed back. It might know what the absolute limit for a resource is (via getrlimit etc). But can it know the absolute usage rate of a given resource? Can you query how many available file descriptors a process has left? Some resources will be used by native code outside the libs (including the VM). > > So at best this is a heuristic, so the library tracks the resource and at some threshold it invoke System.runFinalization. The GC itself doesn't make an informed decision because, as you said your self, the VM (and hence GC) knows nothing about the resource being tracked. > > But would we want to burden all users of these classes with the overhead of resource tracking? Isn't this what a pool is for? I don't know but IMHO, this as going off very quickly. My feeling we're trying to hack in a work-around for an API to do have GC do something that it was never intended for it to do. Further more, if reference objects need to be treated specially then maybe the should be tracked specially so that a different mechanism can be triggered, one that doesn't interfere with GC. Crazy idea, what if reference objects were loosely connected to a special GC root so one could trace them on their own and if you could quickly determine that they weren't connected to anything else,.... Regards, Kirk From david.holmes at oracle.com Sun Jan 8 22:56:56 2012 From: david.holmes at oracle.com (David Holmes) Date: Mon, 09 Jan 2012 16:56:56 +1000 Subject: JEP 132: More-prompt finalization In-Reply-To: References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <4F071C4B.8010002@oracle.com> <4F0A257A.50104@oracle.com> Message-ID: <4F0A8FB8.7080600@oracle.com> On 9/01/2012 4:31 PM, Kirk Pepperdine wrote: >>> >> >> Perhaps, but the library writer may only have partial knowledge. The library can use reference counts to track how much of a resource it has handed out, and what has been handed back. It might know what the absolute limit for a resource is (via getrlimit etc). But can it know the absolute usage rate of a given resource? Can you query how many available file descriptors a process has left? Some resources will be used by native code outside the libs (including the VM). >> >> So at best this is a heuristic, so the library tracks the resource and at some threshold it invoke System.runFinalization. The GC itself doesn't make an informed decision because, as you said your self, the VM (and hence GC) knows nothing about the resource being tracked. >> >> But would we want to burden all users of these classes with the overhead of resource tracking? > > Isn't this what a pool is for? I don't know but IMHO, this as going off very quickly. My feeling we're trying to hack in a work-around for an API to do have GC do something that it was never intended for it to do. Further more, if reference objects need to be treated specially then maybe the should be tracked specially so that a different mechanism can be triggered, one that doesn't interfere with GC. Crazy idea, what if reference objects were loosely connected to a special GC root so one could trace them on their own and if you could quickly determine that they weren't connected to anything else,.... I'm not quite sure what you are arguing for or against :) but isn't what you describe called "reference counting"? You can imagine a class that uses a "finalizable" resource tracking construction and close() itself and also initiating some kind of cleanup mechanism. The cleanup has to involve the GC though because unless you have reference-counted classes (ala C++ auto_ptr - which Java can not support) there is no way to know when a resource is reclaimable. I also think the there will be an overhead associated with such resource tracking, and I don't think everyone should have to pay the price for that if they don't need it. Anyway I'm just repeating myself so I'll stop :) Cheers, David > Regards, > Kirk > From kirk at kodewerk.com Sun Jan 8 23:17:00 2012 From: kirk at kodewerk.com (Kirk Pepperdine) Date: Mon, 9 Jan 2012 08:17:00 +0100 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F0A8FB8.7080600@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <4F071C4B.8010002@oracle.com> <4F0A257A.50104@oracle.com> <4F0A8FB8.7080600@oracle.com> Message-ID: <8EC11CA6-B90E-466D-86F7-E37070A1EF88@kodewerk.com> On 2012-01-09, at 7:56 AM, David Holmes wrote: > On 9/01/2012 4:31 PM, Kirk Pepperdine wrote: >>>> >>> >>> Perhaps, but the library writer may only have partial knowledge. The library can use reference counts to track how much of a resource it has handed out, and what has been handed back. It might know what the absolute limit for a resource is (via getrlimit etc). But can it know the absolute usage rate of a given resource? Can you query how many available file descriptors a process has left? Some resources will be used by native code outside the libs (including the VM). >>> >>> So at best this is a heuristic, so the library tracks the resource and at some threshold it invoke System.runFinalization. The GC itself doesn't make an informed decision because, as you said your self, the VM (and hence GC) knows nothing about the resource being tracked. >>> >>> But would we want to burden all users of these classes with the overhead of resource tracking? >> >> Isn't this what a pool is for? I don't know but IMHO, this as going off very quickly. My feeling we're trying to hack in a work-around for an API to do have GC do something that it was never intended for it to do. Further more, if reference objects need to be treated specially then maybe the should be tracked specially so that a different mechanism can be triggered, one that doesn't interfere with GC. Crazy idea, what if reference objects were loosely connected to a special GC root so one could trace them on their own and if you could quickly determine that they weren't connected to anything else,.... > > I'm not quite sure what you are arguing for or against :) but isn't what you describe called "reference counting"? no no no, definitely not reference counting, I was thinking partial mark sweep focused on the finalizable objects with no guarantees that it would find objects ready to be finalized. One that would find the reference objects that were very obviously collectable. > > I also think the there will be an overhead associated with such resource tracking, and I don't think everyone should have to pay the price for that if they don't need it. Well, I think the whole idea is insane which is the *only* reason I've engaged in this conversation. I much prefer just being an observer and cherry picking the diagnostic/troubleshooting postings. Regards, Kirk From ysr1729 at gmail.com Sun Jan 8 23:27:54 2012 From: ysr1729 at gmail.com (Srinivas Ramakrishna) Date: Sun, 8 Jan 2012 23:27:54 -0800 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F0A317B.6000702@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <4F071C4B.8010002@oracle.com> <4F0A257A.50104@oracle.com> <4F0A317B.6000702@oracle.com> Message-ID: Ah, I see what you meant. Yes, there'd probably be some (theoretical) overhead from such tracking, if there are multiple applications within a single JVM some of which wanted to use these APIs and some did not, then the ones that do not care for such tracking or policies would pay the price (and besides one would have to resolve composition issues in such cases). But if one confined oneself for the moment to the simpler case of a single application per JVM, then if that application chose not to use such tracking, then there would be little to no overhead for it. Even in the case of multiple applications, my guess is that the overhead for other applications could be managed. I suspect that, compared to the JNI transition overheads, resource tracking overheads per se (and not the implementation of specific finalization triggering policies thereof) could, i think, be engineered to be very small. If, on the other hand, you are referring to the overhead of the work done for prompt finalization related to specific resources that one of a set of applications running on a JVM cares about and that others do not, that is a legitimate concern, but seems to me to be no different from these disparate applications sharing the same JVM heap which is subject to a global collection which stops the threads of each application (or of concurrent collection which imposes an overhead on all the applications sharing that JVM, or indeed the hardware), if you know what i am getting at... -- ramki On Sun, Jan 8, 2012 at 4:14 PM, David Holmes wrote: > Hi Ramki, > > On 9/01/2012 10:08 AM, Srinivas Ramakrishna wrote: > > > > So at best this is a heuristic, so the library tracks the resource >> and at some threshold it invoke System.runFinalization. The GC >> itself doesn't make an informed decision because, as you said your >> self, the VM (and hence GC) knows nothing about the resource being >> tracked. >> >> >> Yes, it would be a heuristic-driven policy (or set of policy choices), >> but it's better than each application rolling its own policy and >> infrastructure from scratch. >> > > If it can be provided as an additional feature that applications only pay > for if they need it. > > > But would we want to burden all users of these classes with the >> overhead of resource tracking? >> >> >> The objective is to do the resource tracking from within the libraries >> (+JVM), so the user wouldn't have to bother. >> For example, by providing suitable high level API's at the library level >> which would be wrappers >> around specific native resources that would be subject to such tracking. >> > > That's exactly the overhead I'm referring to. If these resources are > always managed then all applications pay the price for the few that need it. > > David > ----- > > > -- ramki >> >> >> Cheers, >> David >> >> >> Tony >> >> I can imagine something like: >> >> void setFinalizationLimit(Class cls, int limit) >> >> so that GC runs finalization once a "reference count" >> reaches "limit". >> That limits the frequency of finalization, but the actual >> finalization >> cost may still be unacceptably high. >> >> Cheers, >> David >> >> Tony >> >> Afterall the key thing about GC is it relieves the >> programmer from >> having to manage object lifetimes, so if you don't >> know when the >> object is no longer used you don't know when to call >> close. >> >> David >> >> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120108/eb15576d/attachment.html From david.holmes at oracle.com Sun Jan 8 23:44:35 2012 From: david.holmes at oracle.com (David Holmes) Date: Mon, 09 Jan 2012 17:44:35 +1000 Subject: JEP 132: More-prompt finalization In-Reply-To: References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <4F071C4B.8010002@oracle.com> <4F0A257A.50104@oracle.com> <4F0A317B.6000702@oracle.com> Message-ID: <4F0A9AE3.7000602@oracle.com> On 9/01/2012 5:27 PM, Srinivas Ramakrishna wrote: > Ah, I see what you meant. Yes, there'd probably be some (theoretical) > overhead from such tracking, I'm not considering multiple apps per VM I'm just thinking of a programmer using java.io.* or java.net.* classes which have been "enhanced" to support resource tracking. If the proposal involves new wrapper classes that an application has to opt-in to use then that is fine. But I think this has gone off-track somewhat - finding alternatives to finalization. The JEP is more about speeding up finalization for those apps that (for better or worse) rely on it. I think there are two main aspects of this: - adding more threads to process finalizers and/or reference objects - adding an API to request and run finalization etc (more efficiently that doing: System.gc(); System.gc(); System.runFinalization() ) Although we "all" know "finalizers are evil", they exist and are (mis-)used by some apps and cause some issues. Cheers, David > if there are multiple applications within a single JVM some of which > wanted to use these APIs > and some did not, then the ones that do not care for such tracking or > policies would pay > the price (and besides one would have to resolve composition issues in > such cases). But > if one confined oneself for the moment to the simpler case of a single > application per JVM, > then if that application chose not to use such tracking, then there > would be little to no overhead > for it. > > Even in the case of multiple applications, my guess is that the overhead > for other applications > could be managed. I suspect that, compared to the JNI transition > overheads, resource tracking > overheads per se (and not the implementation of specific finalization > triggering policies thereof) > could, i think, be engineered to be very small. > > If, on the other hand, you are referring to the overhead of the work > done for prompt finalization > related to specific resources that one of a set of applications running > on a JVM cares about > and that others do not, that is a legitimate concern, but seems to me to > be no different from these disparate > applications sharing the same JVM heap which is subject to a global > collection which stops > the threads of each application (or of concurrent collection which > imposes an overhead on > all the applications sharing that JVM, or indeed the hardware), if you > know what i am getting at... > > -- ramki > > On Sun, Jan 8, 2012 at 4:14 PM, David Holmes > wrote: > > Hi Ramki, > > On 9/01/2012 10:08 AM, Srinivas Ramakrishna wrote: > > > > So at best this is a heuristic, so the library tracks the > resource > and at some threshold it invoke System.runFinalization. The GC > itself doesn't make an informed decision because, as you > said your > self, the VM (and hence GC) knows nothing about the resource > being > tracked. > > > Yes, it would be a heuristic-driven policy (or set of policy > choices), > but it's better than each application rolling its own policy and > infrastructure from scratch. > > > If it can be provided as an additional feature that applications > only pay for if they need it. > > > But would we want to burden all users of these classes with the > overhead of resource tracking? > > > The objective is to do the resource tracking from within the > libraries > (+JVM), so the user wouldn't have to bother. > For example, by providing suitable high level API's at the > library level > which would be wrappers > around specific native resources that would be subject to such > tracking. > > > That's exactly the overhead I'm referring to. If these resources are > always managed then all applications pay the price for the few that > need it. > > David > ----- > > > -- ramki > > > Cheers, > David > > > Tony > > I can imagine something like: > > void setFinalizationLimit(Class cls, int limit) > > so that GC runs finalization once a "reference count" > reaches "limit". > That limits the frequency of finalization, but the > actual > finalization > cost may still be unacceptably high. > > Cheers, > David > > Tony > > Afterall the key thing about GC is it > relieves the > programmer from > having to manage object lifetimes, so if you > don't > know when the > object is no longer used you don't know when > to call > close. > > David > > > From kirk at kodewerk.com Mon Jan 9 04:21:49 2012 From: kirk at kodewerk.com (Kirk Pepperdine) Date: Mon, 9 Jan 2012 13:21:49 +0100 Subject: Very long young gc pause (ParNew with CMS) In-Reply-To: <4F0AD41E.6000109@java4.info> References: <4F0ACAAC.8020103@java4.info> <997C208B-7920-446E-8A90-A0D6B752996F@kodewerk.com> <4F0AD41E.6000109@java4.info> Message-ID: <5C4AFCB8-764B-4BA1-B1BA-075416FE1A58@kodewerk.com> There is so much stuff in this gc log it's hard to see the real problem... As as much data as there is in the flags, we're missing the trigger for the CMS cycle That said, these settings will cause premature promotion in most applications. This will put more stress on CMS >>> -XX:SurvivorRatio=8 \ >>> -XX:TargetSurvivorRatio=90 \ The initiating occupancy fraction is very high IMHO and that will cause more difficulties. In most cases, it's better to have a lower threshold as this will give more head room to sort out fragmentation. Also, new size pinned to 256m is also causing a lot of premature promotion. And -Xmx==-Xms.... you can read one of my blog postings on that subject. I do find it interesting that you have a 28G heap with new size set to 256m. IMHO, too much work has been done setting flags. I'd be included to turn most of them back to default settings. I'd turn off most of the data being collected in the GC log as I think it's mostly for debugging GC and it's not very helpful for sizing analysis. Last point, the benefit of CMS is shorter pause times and if you're not seeing that, you might be better off with the parallel collector. Note for the guys I have having the RMI/full gc discussion with... >>> -Dsun.rmi.dgc.server.gcInterval=9223372036854775807 \ >>> -Dsun.rmi.dgc.client.gcInterval=9223372036854775807 \ You've gotta love these settings... ;And I'll have to say they are quite typical. 8^) Regards, Kirk On 2012-01-09, at 12:48 PM, Florian Binder wrote: > Hi Kirk, > > I have attached the log since 8:45 am today to this mail. > Every full hour the application is creating a lot of objects which are kept in memory for a long time. This process may take a few minutes (should be done in less than 10 min) and may explain the premature promotion. But not all young gcs, which are taking so long, are during this process. > > Regards, > Flo > > > Am 09.01.2012 12:23, schrieb Kirk Pepperdine: >> Can you post the complete log? I see a premature promotion event but you'd need the entire log to see if this is pathological case or just an anomaly. >> >> Regards, >> Kirk >> >> On 2012-01-09, at 12:08 PM, Florian Binder wrote: >> >>> Hi everybody, >>> >>> I am using CMS (with ParNew) gc and have very long (> 6 seconds) young >>> gc pauses. >>> As you can see in the log below the old-gen-heap consists of one large >>> block, the new Size has 256m, it uses 13 worker threads and it has to >>> copy 27505761 words (~210mb) directly from eden to old gen. >>> I have seen that this problem occurs only after about one week of >>> uptime. Even thought we make a full (compacting) gc every night. >>> Since real-time> user-time I assume it might be a synchronization >>> problem. Can this be true? >>> >>> Do you have any Ideas how I can speed up this gcs? >>> >>> Please let me know, if you need more informations. >>> >>> Thank you, >>> Flo >>> >>> >>> ##### java -version ##### >>> java version "1.6.0_29" >>> Java(TM) SE Runtime Environment (build 1.6.0_29-b11) >>> Java HotSpot(TM) 64-Bit Server VM (build 20.4-b02, mixed mode) >>> >>> ##### The startup parameters: ##### >>> -Xms28G -Xmx28G >>> -XX:+UseConcMarkSweepGC \ >>> -XX:CMSMaxAbortablePrecleanTime=10000 \ >>> -XX:SurvivorRatio=8 \ >>> -XX:TargetSurvivorRatio=90 \ >>> -XX:MaxTenuringThreshold=31 \ >>> -XX:CMSInitiatingOccupancyFraction=80 \ >>> -XX:NewSize=256M \ >>> >>> -verbose:gc \ >>> -XX:+PrintFlagsFinal \ >>> -XX:PrintFLSStatistics=1 \ >>> -XX:+PrintGCDetails \ >>> -XX:+PrintGCDateStamps \ >>> -XX:-TraceClassUnloading \ >>> -XX:+PrintGCApplicationConcurrentTime \ >>> -XX:+PrintGCApplicationStoppedTime \ >>> -XX:+PrintTenuringDistribution \ >>> -XX:+CMSClassUnloadingEnabled \ >>> -Dsun.rmi.dgc.server.gcInterval=9223372036854775807 \ >>> -Dsun.rmi.dgc.client.gcInterval=9223372036854775807 \ >>> >>> -Djava.awt.headless=true >>> >>> ##### From the out-file (as of +PrintFlagsFinal): ##### >>> ParallelGCThreads = 13 >>> >>> ##### The gc.log-excerpt: ##### >>> Application time: 20,0617700 seconds >>> 2011-12-22T12:02:03.289+0100: [GC Before GC: >>> Statistics for BinaryTreeDictionary: >>> ------------------------------------ >>> Total Free Space: 1183290963 >>> Max Chunk Size: 1183290963 >>> Number of Blocks: 1 >>> Av. Block Size: 1183290963 >>> Tree Height: 1 >>> Before GC: >>> Statistics for BinaryTreeDictionary: >>> ------------------------------------ >>> Total Free Space: 0 >>> Max Chunk Size: 0 >>> Number of Blocks: 0 >>> Tree Height: 0 >>> [ParNew >>> Desired survivor size 25480392 bytes, new threshold 1 (max 31) >>> - age 1: 28260160 bytes, 28260160 total >>> : 249216K->27648K(249216K), 6,1808130 secs] >>> 20061765K->20056210K(29332480K)After GC: >>> Statistics for BinaryTreeDictionary: >>> ------------------------------------ >>> Total Free Space: 1155785202 >>> Max Chunk Size: 1155785202 >>> Number of Blocks: 1 >>> Av. Block Size: 1155785202 >>> Tree Height: 1 >>> After GC: >>> Statistics for BinaryTreeDictionary: >>> ------------------------------------ >>> Total Free Space: 0 >>> Max Chunk Size: 0 >>> Number of Blocks: 0 >>> Tree Height: 0 >>> , 6,1809440 secs] [Times: user=3,08 sys=0,51, real=6,18 secs] >>> Total time for which application threads were stopped: 6,1818730 seconds >>> _______________________________________________ >>> hotspot-gc-use mailing list >>> hotspot-gc-use at openjdk.java.net >>> http://mail.openjdk.java.net/mailman/listinfo/hotspot-gc-use > > From daniel.daugherty at oracle.com Mon Jan 9 15:37:24 2012 From: daniel.daugherty at oracle.com (Daniel D. Daugherty) Date: Mon, 09 Jan 2012 16:37:24 -0700 Subject: RFR (S): 7125793: MAC: test_gamma should always work In-Reply-To: <4F087516.40505@oracle.com> References: <4EFECA5D.6010905@oracle.com> <4F034B51.3070609@oracle.com> <4F087516.40505@oracle.com> Message-ID: <4F0B7A34.5070406@oracle.com> On 1/7/12 9:38 AM, James Melvin wrote: > WEBREV: > http://cr.openjdk.java.net/~jmelvin/7125793/webrev.01 make/bsd/Makefile No comments. make/bsd/makefiles/buildtree.make No comments. make/bsd/makefiles/defs.make Thanks for fixing this one for BSD platforms. make/bsd/makefiles/launcher.make line 60: typo: 'inadvertenly' -> 'inadvertently' Sorry I missed this in my first review, but the addition of '-framework CoreFoundation' to LFLAGS_LAUNCHER is probably MacOS X specific. I think: ifeq ($(OS_VENDOR), Darwin) else endif will work in launcher.make also. make/bsd/makefiles/vm.make No comments. src/os/bsd/vm/os_bsd.cpp line 2544: typo: 'overriden' -> 'overridden' line 2588: typo: 'overriden' -> 'overridden' Looks like old code line 2576 depended on the 'hotspot' symlink to refer to either 'client' or 'server' or whatever JVM you wanted to run. I'm fairly certain that the 'hotspot' symlink was retired; I'm just not sure when. src/os/posix/launcher/java_md.c No comments. Dan From mikael.gerdin at oracle.com Tue Jan 10 11:27:06 2012 From: mikael.gerdin at oracle.com (Mikael Gerdin) Date: Tue, 10 Jan 2012 20:27:06 +0100 Subject: Review request: White box testing API for HotSpot In-Reply-To: <4ED50287.3070102@oracle.com> References: <4ED50287.3070102@oracle.com> Message-ID: <4F0C910A.5070205@oracle.com> Hi all Back from vacations now with an updated version of the webrev based on the feedback received in this thread. Changes include: * removed install target from makefiles * renamed flag form EnableWhiteBoxAPI to remove redundant Enable * installs wb.jar into jre/lib and made -XX:+WhiteBoxAPI add wb.jar to the boot class path from inside the VM. http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.2/ Thanks Mikael Gerdin On 2011-11-29 17:04, Mikael Gerdin wrote: > Hi > > I've been working on a white box testing API for HotSpot in order to > allow for improved precision in vm testing. > > The basic idea is to open up the possibility for tests written in Java > to call native methods which query or poke the vm in some way. > > The API is accessible by using the class sun/hotspot/WhiteBox which is > not intended to be available in public builds. > In order to allow the WhiteBox class access to the VM the > registerNatives function is linked to JVM_RegisterWhiteBoxMethods. That > function then links all the implementation functions using normal JNI > RegisterNatives. > > The API is not meant to be used by end users for any intent or purpose > and as such it is both guarded by "-XX:+UnlockDiagnosticVMOptions > -XX:+EnableWhiteboxAPI" and the fact that the class files will not be > present in an end user build of a JDK. > If the VM crashes after this API has been accessed a note will be > written in the hs_err file to signal that the API has been used. > > Webrev: > http://cr.openjdk.java.net/~stefank/mgerdin/wbapi.0/webrev/ > (thanks to stefank for hosting my webrev :) > > CR: > I'll file a CR tomorrow. > > Change comments: > > make/jprt.properties > > Add a test target to make sure that the API is available on all > supported platforms > > make/** > > Makefile changes to build the class sun/hotspot/WhiteBox, put it in a > JAR file and copy it to the jre/lib/endorsed directory in the export > targets. > The BSD makefile changes are not tested since I don't have access to any > BSD/OSX machine to test them on. > > src/share/vm/prims/nativeLookup.cpp > > Special-case the method sun/hotspot/WhiteBox/registerNatives and link it > to JVM_RegisterWhiteBoxMethods > > src/share/vm/prims/whitebox.* > > The implementation of the white box API. The actual API functions are > only examples of what we want to be able to do using the API. > > src/share/vm/runtime/globals.hpp > > Add the command line flag > > src/share/vm/utilities/vmError.cpp > > Print a message in hs_err files when white box API has been used. > > test/Makefile > > Add a makefile test target for the white box API test > > test/sanity/wbapi.java > > JTreg test to ensure that the API works. > > > Thanks > /Mikael Gerdin From james.melvin at oracle.com Tue Jan 10 13:05:13 2012 From: james.melvin at oracle.com (James Melvin) Date: Tue, 10 Jan 2012 16:05:13 -0500 Subject: RFR (S): 7125793: MAC: test_gamma should always work In-Reply-To: <4F0B7A34.5070406@oracle.com> References: <4EFECA5D.6010905@oracle.com> <4F034B51.3070609@oracle.com> <4F087516.40505@oracle.com> <4F0B7A34.5070406@oracle.com> Message-ID: <4F0CA809.7020001@oracle.com> Hi Dan, Final webrev to reflect your comments... http://cr.openjdk.java.net/~jmelvin/7125793/webrev.02 Minor changes this round: make/bsd/makefiles/buildtree.make # Fail gracefully on Apple BOOTDIR make/bsd/makefiles/launcher.make # Link with framework only on Mac src/os/bsd/vm/os_bsd.cpp # Just spelling fix Lastly, I wanted to reply to John Coomes comments earlier about the test_gamma script simplification. Although I also value economy of expression, in this case I think the use of more advanced shell constructs increases the time for fresh eyes to decipher. Given performance and such is not an issue, I'd prefer to keep the simpler version I'm proposing with this change on Mac OS X, to make it easier on future maintenance. This should be a model for the other platforms when we reconcile. I've attached the before and after copies should there be further comments on the simplified short script. Thanks, Jim On 1/9/12 6:37 PM, Daniel D. Daugherty wrote: > On 1/7/12 9:38 AM, James Melvin wrote: >> WEBREV: >> http://cr.openjdk.java.net/~jmelvin/7125793/webrev.01 > > make/bsd/Makefile > No comments. > > make/bsd/makefiles/buildtree.make > No comments. > > make/bsd/makefiles/defs.make > Thanks for fixing this one for BSD platforms. > > make/bsd/makefiles/launcher.make > line 60: typo: 'inadvertenly' -> 'inadvertently' > > Sorry I missed this in my first review, but the addition > of '-framework CoreFoundation' to LFLAGS_LAUNCHER is > probably MacOS X specific. I think: > > ifeq ($(OS_VENDOR), Darwin) > else > endif > > will work in launcher.make also. > > make/bsd/makefiles/vm.make > No comments. > > src/os/bsd/vm/os_bsd.cpp > line 2544: typo: 'overriden' -> 'overridden' > line 2588: typo: 'overriden' -> 'overridden' > > Looks like old code line 2576 depended on the 'hotspot' > symlink to refer to either 'client' or 'server' or whatever > JVM you wanted to run. I'm fairly certain that the 'hotspot' > symlink was retired; I'm just not sure when. > > src/os/posix/launcher/java_md.c > No comments. > > Dan > > -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: test_gamma.before Url: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120110/6a912938/test_gamma.before -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: test_gamma.after Url: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120110/6a912938/test_gamma.after From daniel.daugherty at oracle.com Tue Jan 10 13:27:17 2012 From: daniel.daugherty at oracle.com (Daniel D. Daugherty) Date: Tue, 10 Jan 2012 14:27:17 -0700 Subject: RFR (S): 7125793: MAC: test_gamma should always work In-Reply-To: <4F0CA809.7020001@oracle.com> References: <4EFECA5D.6010905@oracle.com> <4F034B51.3070609@oracle.com> <4F087516.40505@oracle.com> <4F0B7A34.5070406@oracle.com> <4F0CA809.7020001@oracle.com> Message-ID: <4F0CAD35.40809@oracle.com> On 1/10/12 2:05 PM, James Melvin wrote: > Hi Dan, > > Final webrev to reflect your comments... > > http://cr.openjdk.java.net/~jmelvin/7125793/webrev.02 > > Minor changes this round: > > make/bsd/makefiles/buildtree.make # Fail gracefully on Apple BOOTDIR > make/bsd/makefiles/launcher.make # Link with framework only on Mac > src/os/bsd/vm/os_bsd.cpp # Just spelling fix Thumbs up on the current version. To close the loop on one of my earlier comments: $ ls -l binaries/solsparc/jre/lib/sparc/hotspot lrwxrwxrwx 1 nobody nobody 6 Apr 1 2009 binaries/solsparc/jre/lib/sparc/hotspot -> client This symlink exists in JDK1.3.1, but I didn't find it in JDK1.4.0. > Lastly, I wanted to reply to John Coomes comments earlier about the > test_gamma script simplification. Although I also value economy of > expression, in this case I think the use of more advanced shell > constructs increases the time for fresh eyes to decipher. Given > performance and such is not an issue, I'd prefer to keep the simpler > version I'm proposing with this change on Mac OS X, to make it easier on > future maintenance. This should be a model for the other platforms when > we reconcile. I've attached the before and after copies should there be > further comments on the simplified short script. The attachments didn't come through because your e-mail went through the OpenJDK list servers. Just to be clear: I vote for the newer version. It is more straight forward and has comments. Dan > > Thanks, > > Jim > > > On 1/9/12 6:37 PM, Daniel D. Daugherty wrote: >> On 1/7/12 9:38 AM, James Melvin wrote: >>> WEBREV: >>> http://cr.openjdk.java.net/~jmelvin/7125793/webrev.01 >> >> make/bsd/Makefile >> No comments. >> >> make/bsd/makefiles/buildtree.make >> No comments. >> >> make/bsd/makefiles/defs.make >> Thanks for fixing this one for BSD platforms. >> >> make/bsd/makefiles/launcher.make >> line 60: typo: 'inadvertenly' -> 'inadvertently' >> >> Sorry I missed this in my first review, but the addition >> of '-framework CoreFoundation' to LFLAGS_LAUNCHER is >> probably MacOS X specific. I think: >> >> ifeq ($(OS_VENDOR), Darwin) >> else >> endif >> >> will work in launcher.make also. >> >> make/bsd/makefiles/vm.make >> No comments. >> >> src/os/bsd/vm/os_bsd.cpp >> line 2544: typo: 'overriden' -> 'overridden' >> line 2588: typo: 'overriden' -> 'overridden' >> >> Looks like old code line 2576 depended on the 'hotspot' >> symlink to refer to either 'client' or 'server' or whatever >> JVM you wanted to run. I'm fairly certain that the 'hotspot' >> symlink was retired; I'm just not sure when. >> >> src/os/posix/launcher/java_md.c >> No comments. >> >> Dan >> >> From John.Coomes at oracle.com Wed Jan 11 12:01:04 2012 From: John.Coomes at oracle.com (John Coomes) Date: Wed, 11 Jan 2012 12:01:04 -0800 Subject: RFR (S): 7125793: MAC: test_gamma should always work In-Reply-To: <4F0CA809.7020001@oracle.com> References: <4EFECA5D.6010905@oracle.com> <4F034B51.3070609@oracle.com> <4F087516.40505@oracle.com> <4F0B7A34.5070406@oracle.com> <4F0CA809.7020001@oracle.com> Message-ID: <20237.60032.473316.761490@oracle.com> James Melvin (james.melvin at oracle.com) wrote: > Hi Dan, > > Final webrev to reflect your comments... > > http://cr.openjdk.java.net/~jmelvin/7125793/webrev.02 > > Minor changes this round: > > make/bsd/makefiles/buildtree.make # Fail gracefully on Apple BOOTDIR > make/bsd/makefiles/launcher.make # Link with framework only on Mac > src/os/bsd/vm/os_bsd.cpp # Just spelling fix > > Lastly, I wanted to reply to John Coomes comments earlier about the > test_gamma script simplification. Although I also value economy of > expression, in this case I think the use of more advanced shell > constructs increases the time for fresh eyes to decipher. Given > performance and such is not an issue, I'd prefer to keep the simpler > version I'm proposing with this change on Mac OS X, to make it easier on > future maintenance. This should be a model for the other platforms when > we reconcile. I've attached the before and after copies should there be > further comments on the simplified short script. As mentioned before, I have problems with you're proposed change beyond just brevity. First, it makes reconciling changes with the other platforms more difficult, so do all (if you get agreement) or do none. Second, any semantic changes are lost in the reformatting noise. Do the reformatting separately from the semantic change. And the expanded version is waaaay to bloated. I'm open to some code expansion, but the whitespace decorators around comment blocks for each trivial statement is too much. It reminds me of beginning cobol. -John > On 1/9/12 6:37 PM, Daniel D. Daugherty wrote: > > On 1/7/12 9:38 AM, James Melvin wrote: > >> WEBREV: > >> http://cr.openjdk.java.net/~jmelvin/7125793/webrev.01 > > > > make/bsd/Makefile > > No comments. > > > > make/bsd/makefiles/buildtree.make > > No comments. > > > > make/bsd/makefiles/defs.make > > Thanks for fixing this one for BSD platforms. > > > > make/bsd/makefiles/launcher.make > > line 60: typo: 'inadvertenly' -> 'inadvertently' > > > > Sorry I missed this in my first review, but the addition > > of '-framework CoreFoundation' to LFLAGS_LAUNCHER is > > probably MacOS X specific. I think: > > > > ifeq ($(OS_VENDOR), Darwin) > > else > > endif > > > > will work in launcher.make also. > > > > make/bsd/makefiles/vm.make > > No comments. > > > > src/os/bsd/vm/os_bsd.cpp > > line 2544: typo: 'overriden' -> 'overridden' > > line 2588: typo: 'overriden' -> 'overridden' > > > > Looks like old code line 2576 depended on the 'hotspot' > > symlink to refer to either 'client' or 'server' or whatever > > JVM you wanted to run. I'm fairly certain that the 'hotspot' > > symlink was retired; I'm just not sure when. > > > > src/os/posix/launcher/java_md.c > > No comments. > > > > Dan > > > > > > ---------------------------------------------------------------------- > #!/bin/sh > # Generated by /Users/jmelvin/dev/testing/make/bsd/makefiles/buildtree.make > . ./env.sh > if [ "" != "" ]; then { echo Cross compiling for ARCH , skipping gamma run.; exit 0; }; fi > if [ -z $JAVA_HOME ]; then { echo JAVA_HOME must be set to run this test.; exit 0; }; fi > if ! ${JAVA_HOME}/bin/java -d32 -fullversion 2>&1 > /dev/null > then > echo JAVA_HOME must point to 32bit JDK.; exit 0; > fi > rm -f Queens.class > ${JAVA_HOME}/bin/javac -d . /Users/jmelvin/dev/testing/make/test/Queens.java > [ -f gamma_g ] && { gamma=gamma_g; } > ./${gamma:-gamma} -Xbatch -showversion Queens < /dev/null > exit 0 > > ---------------------------------------------------------------------- > #!/bin/sh > > # Generated by /Users/jmelvin/dev/7125793/make/bsd/makefiles/buildtree.make > > # > # Include environment settings for gamma run > # > > . ./env.sh > > > # > # Do not run gamma test for cross compiles > # > > if [ -n "" ]; then > echo Cross compiling for ARCH , skipping gamma run. > exit 0 > fi > > > # > # Make sure JAVA_HOME is set as it is required for gamma > # > > if [ -z "${JAVA_HOME}" ]; then > echo JAVA_HOME must be set to run this test. > exit 0 > fi > > > # > # Report JAVA_HOME version to be used for the test > # > > FULL_VERSION="`${JAVA_HOME}/bin/java -d64 -fullversion`" > if [ $? -ne 0 ]; then > echo JAVA_HOME must point to a 64-bit OpenJDK. > exit 0 > fi > echo "${FULL_VERSION}" | awk '{printf("%s",$0);}' > > > # > # Use gamma_g if it exists > # > > GAMMA_PROG=gamma > if [ -f gamma_g ]; then > GAMMA_PROG=gamma_g > fi > > > # > # Ensure architecture for gamma and JAVA_HOME is the same. > # NOTE: gamma assumes the OpenJDK directory layout. > # > > GAMMA_ARCH="`file ${GAMMA_PROG} | awk '{print $NF}'`" > JVM_LIB="${JAVA_HOME}/jre/lib/libjava.dylib" > if [ ! -f ${JVM_LIB} ]; then > JVM_LIB="${JAVA_HOME}/jre/lib/amd64/libjava.dylib" > fi > if [ ! -f ${JVM_LIB} ] || [ -z "`file ${JVM_LIB} | grep ${GAMMA_ARCH}`" ]; then > echo JAVA_HOME must point to a 64-bit OpenJDK. > exit 0 > fi > > > # > # Compile Queens program for test > # > > rm -f Queens.class > ${JAVA_HOME}/bin/javac -d . /Users/jmelvin/dev/7125793/make/test/Queens.java > > > # > # Set library path solely for gamma launcher test run > # > > LD_LIBRARY_PATH=.:${JAVA_HOME}/jre/lib/amd64/native_threads:${JAVA_HOME}/jre/lib/amd64: > DYLD_LIBRARY_PATH=.:${JAVA_HOME}/jre/lib/native_threads:${JAVA_HOME}/jre/lib:${JAVA_HOME}/jre/lib/amd64/native_threads:${JAVA_HOME}/jre/lib/amd64: > export DYLD_LIBRARY_PATH > > > # > # Use the gamma launcher and JAVA_HOME to run the test > # > > ./${GAMMA_PROG} -Xbatch -showversion Queens < /dev/null From david.holmes at oracle.com Wed Jan 11 23:26:08 2012 From: david.holmes at oracle.com (David Holmes) Date: Thu, 12 Jan 2012 17:26:08 +1000 Subject: Review request: White box testing API for HotSpot In-Reply-To: <4F0C910A.5070205@oracle.com> References: <4ED50287.3070102@oracle.com> <4F0C910A.5070205@oracle.com> Message-ID: <4F0E8B10.3030600@oracle.com> Hi Mikael, This seems to address my concerns with the previous implementation of this. Some further minor comments: In whitebox.cpp: 181 if (result != 0) { 182 WhiteBox::set_used(); shouldn't the above be testing for == 0 ? --- wbapi.java: normal Java naming style is to use camel-case for class names. Though as WB is itself an acronym I'd be okay with WBApi. In fact I'd be happy with anything other than initial lower-case :) --- test/Makefile: does wbapitest need to be added to the phoney list? --- Cheers, David ----- On 11/01/2012 5:27 AM, Mikael Gerdin wrote: > Hi all > > Back from vacations now with an updated version of the webrev based on > the feedback received in this thread. > Changes include: > * removed install target from makefiles > * renamed flag form EnableWhiteBoxAPI to remove redundant Enable > * installs wb.jar into jre/lib and made -XX:+WhiteBoxAPI add wb.jar to > the boot class path from inside the VM. > > http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.2/ > > Thanks > Mikael Gerdin > > On 2011-11-29 17:04, Mikael Gerdin wrote: >> Hi >> >> I've been working on a white box testing API for HotSpot in order to >> allow for improved precision in vm testing. >> >> The basic idea is to open up the possibility for tests written in Java >> to call native methods which query or poke the vm in some way. >> >> The API is accessible by using the class sun/hotspot/WhiteBox which is >> not intended to be available in public builds. >> In order to allow the WhiteBox class access to the VM the >> registerNatives function is linked to JVM_RegisterWhiteBoxMethods. That >> function then links all the implementation functions using normal JNI >> RegisterNatives. >> >> The API is not meant to be used by end users for any intent or purpose >> and as such it is both guarded by "-XX:+UnlockDiagnosticVMOptions >> -XX:+EnableWhiteboxAPI" and the fact that the class files will not be >> present in an end user build of a JDK. >> If the VM crashes after this API has been accessed a note will be >> written in the hs_err file to signal that the API has been used. >> >> Webrev: >> http://cr.openjdk.java.net/~stefank/mgerdin/wbapi.0/webrev/ >> (thanks to stefank for hosting my webrev :) >> >> CR: >> I'll file a CR tomorrow. >> >> Change comments: >> >> make/jprt.properties >> >> Add a test target to make sure that the API is available on all >> supported platforms >> >> make/** >> >> Makefile changes to build the class sun/hotspot/WhiteBox, put it in a >> JAR file and copy it to the jre/lib/endorsed directory in the export >> targets. >> The BSD makefile changes are not tested since I don't have access to any >> BSD/OSX machine to test them on. >> >> src/share/vm/prims/nativeLookup.cpp >> >> Special-case the method sun/hotspot/WhiteBox/registerNatives and link it >> to JVM_RegisterWhiteBoxMethods >> >> src/share/vm/prims/whitebox.* >> >> The implementation of the white box API. The actual API functions are >> only examples of what we want to be able to do using the API. >> >> src/share/vm/runtime/globals.hpp >> >> Add the command line flag >> >> src/share/vm/utilities/vmError.cpp >> >> Print a message in hs_err files when white box API has been used. >> >> test/Makefile >> >> Add a makefile test target for the white box API test >> >> test/sanity/wbapi.java >> >> JTreg test to ensure that the API works. >> >> >> Thanks >> /Mikael Gerdin > From aph at redhat.com Thu Jan 12 03:53:39 2012 From: aph at redhat.com (Andrew Haley) Date: Thu, 12 Jan 2012 11:53:39 +0000 Subject: Question re safepoints and monitors Message-ID: <4F0EC9C3.7000909@redhat.com> This is re HS20 on Zero, but it might apply to any HotSpot port AFAIK. We invoke Thread.stop on a thread. To do this, we need to wait for the thread to reach a safepoint. So, thread->set_pending_exception() is called, and SafepointSynchronize::_state is set to SafepointSynchronize::_synchronizing. The thread needs to acquire a lock, so it enters InterpreterRuntime::monitorenter(). This does the safepoint check. monitorenter() is marked IRT_ENTRY_NO_ASYNC, so it does not check for pending async exceptions. Control returns from monitorenter and the thread continues to execute Java. The async exception is not processed. As far as I can see, there is still a pending exception for this thread but it won't be processed until something else causes the thread to move to a safepoint. Is that right? Thanks, Andrew. From david.holmes at oracle.com Thu Jan 12 04:39:35 2012 From: david.holmes at oracle.com (David Holmes) Date: Thu, 12 Jan 2012 22:39:35 +1000 Subject: Question re safepoints and monitors In-Reply-To: <4F0EC9C3.7000909@redhat.com> References: <4F0EC9C3.7000909@redhat.com> Message-ID: <4F0ED487.8060504@oracle.com> Andrew, On 12/01/2012 9:53 PM, Andrew Haley wrote: > This is re HS20 on Zero, but it might apply to any HotSpot port AFAIK. > > We invoke Thread.stop on a thread. To do this, we need to wait for > the thread to reach a safepoint. So, thread->set_pending_exception() > is called, and SafepointSynchronize::_state is set to > SafepointSynchronize::_synchronizing. > > The thread needs to acquire a lock, so it enters > InterpreterRuntime::monitorenter(). This does the safepoint check. > monitorenter() is marked IRT_ENTRY_NO_ASYNC, so it does not check for > pending async exceptions. Control returns from monitorenter and the > thread continues to execute Java. The async exception is not > processed. Yes it is - the interpreter executes the monitorenter() call inside a CALL_VM macro which checks for the pending exception. David ----- > As far as I can see, there is still a pending exception for this > thread but it won't be processed until something else causes the > thread to move to a safepoint. Is that right? > > Thanks, > Andrew. From aph at redhat.com Thu Jan 12 05:01:00 2012 From: aph at redhat.com (Andrew Haley) Date: Thu, 12 Jan 2012 13:01:00 +0000 Subject: Question re safepoints and monitors In-Reply-To: <4F0ED487.8060504@oracle.com> References: <4F0EC9C3.7000909@redhat.com> <4F0ED487.8060504@oracle.com> Message-ID: <4F0ED98C.4060007@redhat.com> On 01/12/2012 12:39 PM, David Holmes wrote: > Yes it is - the interpreter executes the monitorenter() call inside a > CALL_VM macro which checks for the pending exception. Thanks, but I still don't quite get it. The interrupting thread calls send_thread_stop(), which sets thread._pending_async_exception: void set_pending_async_exception(oop e) { _pending_async_exception = e; _special_runtime_exit_condition = _async_exception; set_has_async_exception(); } As far as I can see the code that actually reads thread._pending_async_exception and sets thread._pending_exception is JavaThread::check_and_handle_async_exceptions(): // Check for pending async. exception if (_pending_async_exception != NULL) { // Only overwrite an already pending exception, if it is not a threadDeath. if (!has_pending_exception() || !pending_exception()->is_a(SystemDictionary::ThreadDeath_klass())) { // We cannot call Exceptions::_throw(...) here because we cannot block set_pending_exception(_pending_async_exception, __FILE__, __LINE__); but check_and_handle_async_exceptions() is not called in the case of a monitor operation. We need something to copy _pending_async_exception to _pending_exception, but I can't see when that would ever be called. Andrew. From daniel.daugherty at oracle.com Thu Jan 12 05:17:46 2012 From: daniel.daugherty at oracle.com (daniel.daugherty at oracle.com) Date: Thu, 12 Jan 2012 13:17:46 +0000 Subject: hg: hsx/hotspot-main/hotspot: 7129240: backout fix for 7102776 until 7128770 is resolved Message-ID: <20120112131754.38E694793A@hg.openjdk.java.net> Changeset: 8f8b94305aff Author: dcubed Date: 2012-01-11 19:54 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/8f8b94305aff 7129240: backout fix for 7102776 until 7128770 is resolved Reviewed-by: phh, bobv, coleenp, dcubed Contributed-by: Jiangli Zhou ! agent/src/share/classes/sun/jvm/hotspot/oops/InstanceKlass.java ! src/share/vm/code/dependencies.cpp ! src/share/vm/oops/instanceKlass.hpp ! src/share/vm/oops/instanceKlassKlass.cpp ! src/share/vm/runtime/vmStructs.cpp From david.holmes at oracle.com Thu Jan 12 05:21:27 2012 From: david.holmes at oracle.com (David Holmes) Date: Thu, 12 Jan 2012 23:21:27 +1000 Subject: Question re safepoints and monitors In-Reply-To: <4F0ED98C.4060007@redhat.com> References: <4F0EC9C3.7000909@redhat.com> <4F0ED487.8060504@oracle.com> <4F0ED98C.4060007@redhat.com> Message-ID: <4F0EDE57.1010307@oracle.com> On 12/01/2012 11:01 PM, Andrew Haley wrote: > On 01/12/2012 12:39 PM, David Holmes wrote: >> Yes it is - the interpreter executes the monitorenter() call inside a >> CALL_VM macro which checks for the pending exception. > > Thanks, but I still don't quite get it. > > The interrupting thread calls send_thread_stop(), which sets > thread._pending_async_exception: > > void set_pending_async_exception(oop e) { > _pending_async_exception = e; > _special_runtime_exit_condition = _async_exception; > set_has_async_exception(); > } > > As far as I can see the code that actually reads > thread._pending_async_exception and sets thread._pending_exception is > JavaThread::check_and_handle_async_exceptions(): > > // Check for pending async. exception > if (_pending_async_exception != NULL) { > // Only overwrite an already pending exception, if it is not a threadDeath. > if (!has_pending_exception() || !pending_exception()->is_a(SystemDictionary::ThreadDeath_klass())) { > > // We cannot call Exceptions::_throw(...) here because we cannot block > set_pending_exception(_pending_async_exception, __FILE__, __LINE__); > > but check_and_handle_async_exceptions() is not called in the case of a > monitor operation. We need something to copy _pending_async_exception > to _pending_exception, but I can't see when that would ever be called. Yes I think you are correct. The exception will remain as a pending async exception until the next thread state transition that causes handle_special_runtime_exit_condition to be called (or there may be some other path that forces an async check - like the next safepoint). There is a lot of history to the way Thread.stop operates ... David ----- > Andrew. From aph at redhat.com Thu Jan 12 07:26:34 2012 From: aph at redhat.com (Andrew Haley) Date: Thu, 12 Jan 2012 15:26:34 +0000 Subject: Question re safepoints and monitors In-Reply-To: <4F0EDE57.1010307@oracle.com> References: <4F0EC9C3.7000909@redhat.com> <4F0ED487.8060504@oracle.com> <4F0ED98C.4060007@redhat.com> <4F0EDE57.1010307@oracle.com> Message-ID: <4F0EFBAA.2000304@redhat.com> On 01/12/2012 01:21 PM, David Holmes wrote: > Yes I think you are correct. The exception will remain as a pending > async exception until the next thread state transition that causes > handle_special_runtime_exit_condition to be called (or there may be some > other path that forces an async check - like the next safepoint). > > There is a lot of history to the way Thread.stop operates ... I can well imagine! Thanks very much, Andrew. From mikael.gerdin at oracle.com Thu Jan 12 08:20:32 2012 From: mikael.gerdin at oracle.com (Mikael Gerdin) Date: Thu, 12 Jan 2012 17:20:32 +0100 Subject: Review request: White box testing API for HotSpot In-Reply-To: <4F0E8B10.3030600@oracle.com> References: <4ED50287.3070102@oracle.com> <4F0C910A.5070205@oracle.com> <4F0E8B10.3030600@oracle.com> Message-ID: <4F0F0850.8050402@oracle.com> Hi David, On 2012-01-12 08:26, David Holmes wrote: > Hi Mikael, > > This seems to address my concerns with the previous implementation of > this. Some further minor comments: > > In whitebox.cpp: > > 181 if (result != 0) { > 182 WhiteBox::set_used(); > > shouldn't the above be testing for == 0 ? Yes it should, fixed. > > --- > > wbapi.java: normal Java naming style is to use camel-case for class > names. Though as WB is itself an acronym I'd be okay with WBApi. In fact > I'd be happy with anything other than initial lower-case :) Many of our existing tests have lower-case names so I guess I thought that was some sort of convention, it does not really matter to me. WBApi it is then. > > --- > > test/Makefile: does wbapitest need to be added to the phoney list? Yes, fixed. New webrev at: http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.3/ Incremental at: http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.2-to-3/webrev/ /Mikael > > --- > > Cheers, > David > ----- > > > On 11/01/2012 5:27 AM, Mikael Gerdin wrote: >> Hi all >> >> Back from vacations now with an updated version of the webrev based on >> the feedback received in this thread. >> Changes include: >> * removed install target from makefiles >> * renamed flag form EnableWhiteBoxAPI to remove redundant Enable >> * installs wb.jar into jre/lib and made -XX:+WhiteBoxAPI add wb.jar to >> the boot class path from inside the VM. >> >> http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.2/ >> >> Thanks >> Mikael Gerdin >> >> On 2011-11-29 17:04, Mikael Gerdin wrote: >>> Hi >>> >>> I've been working on a white box testing API for HotSpot in order to >>> allow for improved precision in vm testing. >>> >>> The basic idea is to open up the possibility for tests written in Java >>> to call native methods which query or poke the vm in some way. >>> >>> The API is accessible by using the class sun/hotspot/WhiteBox which is >>> not intended to be available in public builds. >>> In order to allow the WhiteBox class access to the VM the >>> registerNatives function is linked to JVM_RegisterWhiteBoxMethods. That >>> function then links all the implementation functions using normal JNI >>> RegisterNatives. >>> >>> The API is not meant to be used by end users for any intent or purpose >>> and as such it is both guarded by "-XX:+UnlockDiagnosticVMOptions >>> -XX:+EnableWhiteboxAPI" and the fact that the class files will not be >>> present in an end user build of a JDK. >>> If the VM crashes after this API has been accessed a note will be >>> written in the hs_err file to signal that the API has been used. >>> >>> Webrev: >>> http://cr.openjdk.java.net/~stefank/mgerdin/wbapi.0/webrev/ >>> (thanks to stefank for hosting my webrev :) >>> >>> CR: >>> I'll file a CR tomorrow. >>> >>> Change comments: >>> >>> make/jprt.properties >>> >>> Add a test target to make sure that the API is available on all >>> supported platforms >>> >>> make/** >>> >>> Makefile changes to build the class sun/hotspot/WhiteBox, put it in a >>> JAR file and copy it to the jre/lib/endorsed directory in the export >>> targets. >>> The BSD makefile changes are not tested since I don't have access to any >>> BSD/OSX machine to test them on. >>> >>> src/share/vm/prims/nativeLookup.cpp >>> >>> Special-case the method sun/hotspot/WhiteBox/registerNatives and link it >>> to JVM_RegisterWhiteBoxMethods >>> >>> src/share/vm/prims/whitebox.* >>> >>> The implementation of the white box API. The actual API functions are >>> only examples of what we want to be able to do using the API. >>> >>> src/share/vm/runtime/globals.hpp >>> >>> Add the command line flag >>> >>> src/share/vm/utilities/vmError.cpp >>> >>> Print a message in hs_err files when white box API has been used. >>> >>> test/Makefile >>> >>> Add a makefile test target for the white box API test >>> >>> test/sanity/wbapi.java >>> >>> JTreg test to ensure that the API works. >>> >>> >>> Thanks >>> /Mikael Gerdin >> From david.holmes at oracle.com Thu Jan 12 15:00:21 2012 From: david.holmes at oracle.com (David Holmes) Date: Fri, 13 Jan 2012 09:00:21 +1000 Subject: Review request: White box testing API for HotSpot In-Reply-To: <4F0F0850.8050402@oracle.com> References: <4ED50287.3070102@oracle.com> <4F0C910A.5070205@oracle.com> <4F0E8B10.3030600@oracle.com> <4F0F0850.8050402@oracle.com> Message-ID: <4F0F6605.5080900@oracle.com> Hi Mikael, On 13/01/2012 2:20 AM, Mikael Gerdin wrote: >> wbapi.java: normal Java naming style is to use camel-case for class >> names. Though as WB is itself an acronym I'd be okay with WBApi. In fact >> I'd be happy with anything other than initial lower-case :) > > Many of our existing tests have lower-case names so I guess I thought > that was some sort of convention, it does not really matter to me. I think those tests must have been written by C programers ;-) > WBApi it is then. Thanks.There is a slight typo in that the file is WBapi.java not WBApi.java David ----- > >> >> --- >> >> test/Makefile: does wbapitest need to be added to the phoney list? > > Yes, fixed. > > New webrev at: > http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.3/ > Incremental at: > http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.2-to-3/webrev/ > > /Mikael > >> >> --- >> >> Cheers, >> David >> ----- >> >> >> On 11/01/2012 5:27 AM, Mikael Gerdin wrote: >>> Hi all >>> >>> Back from vacations now with an updated version of the webrev based on >>> the feedback received in this thread. >>> Changes include: >>> * removed install target from makefiles >>> * renamed flag form EnableWhiteBoxAPI to remove redundant Enable >>> * installs wb.jar into jre/lib and made -XX:+WhiteBoxAPI add wb.jar to >>> the boot class path from inside the VM. >>> >>> http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.2/ >>> >>> Thanks >>> Mikael Gerdin >>> >>> On 2011-11-29 17:04, Mikael Gerdin wrote: >>>> Hi >>>> >>>> I've been working on a white box testing API for HotSpot in order to >>>> allow for improved precision in vm testing. >>>> >>>> The basic idea is to open up the possibility for tests written in Java >>>> to call native methods which query or poke the vm in some way. >>>> >>>> The API is accessible by using the class sun/hotspot/WhiteBox which is >>>> not intended to be available in public builds. >>>> In order to allow the WhiteBox class access to the VM the >>>> registerNatives function is linked to JVM_RegisterWhiteBoxMethods. That >>>> function then links all the implementation functions using normal JNI >>>> RegisterNatives. >>>> >>>> The API is not meant to be used by end users for any intent or purpose >>>> and as such it is both guarded by "-XX:+UnlockDiagnosticVMOptions >>>> -XX:+EnableWhiteboxAPI" and the fact that the class files will not be >>>> present in an end user build of a JDK. >>>> If the VM crashes after this API has been accessed a note will be >>>> written in the hs_err file to signal that the API has been used. >>>> >>>> Webrev: >>>> http://cr.openjdk.java.net/~stefank/mgerdin/wbapi.0/webrev/ >>>> (thanks to stefank for hosting my webrev :) >>>> >>>> CR: >>>> I'll file a CR tomorrow. >>>> >>>> Change comments: >>>> >>>> make/jprt.properties >>>> >>>> Add a test target to make sure that the API is available on all >>>> supported platforms >>>> >>>> make/** >>>> >>>> Makefile changes to build the class sun/hotspot/WhiteBox, put it in a >>>> JAR file and copy it to the jre/lib/endorsed directory in the export >>>> targets. >>>> The BSD makefile changes are not tested since I don't have access to >>>> any >>>> BSD/OSX machine to test them on. >>>> >>>> src/share/vm/prims/nativeLookup.cpp >>>> >>>> Special-case the method sun/hotspot/WhiteBox/registerNatives and >>>> link it >>>> to JVM_RegisterWhiteBoxMethods >>>> >>>> src/share/vm/prims/whitebox.* >>>> >>>> The implementation of the white box API. The actual API functions are >>>> only examples of what we want to be able to do using the API. >>>> >>>> src/share/vm/runtime/globals.hpp >>>> >>>> Add the command line flag >>>> >>>> src/share/vm/utilities/vmError.cpp >>>> >>>> Print a message in hs_err files when white box API has been used. >>>> >>>> test/Makefile >>>> >>>> Add a makefile test target for the white box API test >>>> >>>> test/sanity/wbapi.java >>>> >>>> JTreg test to ensure that the API works. >>>> >>>> >>>> Thanks >>>> /Mikael Gerdin >>> From keith.mcguigan at oracle.com Thu Jan 12 18:38:57 2012 From: keith.mcguigan at oracle.com (keith.mcguigan at oracle.com) Date: Fri, 13 Jan 2012 02:38:57 +0000 Subject: hg: hsx/hotspot-main/hotspot: 3 new changesets Message-ID: <20120113023907.D45364794E@hg.openjdk.java.net> Changeset: 4f25538b54c9 Author: fparain Date: 2012-01-09 10:27 +0100 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/4f25538b54c9 7120511: Add diagnostic commands Reviewed-by: acorn, phh, dcubed, sspitsyn ! src/share/vm/classfile/vmSymbols.hpp ! src/share/vm/runtime/arguments.cpp ! src/share/vm/runtime/globals.cpp ! src/share/vm/runtime/globals.hpp ! src/share/vm/runtime/init.cpp ! src/share/vm/services/attachListener.cpp ! src/share/vm/services/diagnosticCommand.cpp ! src/share/vm/services/diagnosticCommand.hpp ! src/share/vm/services/diagnosticFramework.cpp ! src/share/vm/services/diagnosticFramework.hpp ! src/share/vm/services/management.cpp Changeset: 865e0817f32b Author: kamg Date: 2012-01-10 15:47 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/865e0817f32b Merge ! src/share/vm/runtime/arguments.cpp ! src/share/vm/runtime/globals.hpp Changeset: efdf6985a3a2 Author: kamg Date: 2012-01-12 09:59 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/efdf6985a3a2 Merge From john.r.rose at oracle.com Fri Jan 13 03:30:19 2012 From: john.r.rose at oracle.com (john.r.rose at oracle.com) Date: Fri, 13 Jan 2012 11:30:19 +0000 Subject: hg: hsx/hotspot-main/hotspot: 5 new changesets Message-ID: <20120113113032.725D34795C@hg.openjdk.java.net> Changeset: 5da7201222d5 Author: kvn Date: 2012-01-07 10:39 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/5da7201222d5 7110824: ctw/jarfiles/GUI3rdParty_jar/ob_mask_DateField crashes VM Summary: Change yank_if_dead() to recursive method to remove all dead inputs. Reviewed-by: never ! src/cpu/sparc/vm/sparc.ad ! src/share/vm/opto/chaitin.hpp ! src/share/vm/opto/postaloc.cpp Changeset: e9a5e0a812c8 Author: kvn Date: 2012-01-07 13:26 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/e9a5e0a812c8 7125896: Eliminate nested locks Summary: Nested locks elimination done before lock nodes expansion by looking for outer locks of the same object. Reviewed-by: never, twisti ! src/cpu/sparc/vm/sparc.ad ! src/cpu/x86/vm/x86_32.ad ! src/cpu/x86/vm/x86_64.ad ! src/share/vm/ci/ciTypeFlow.cpp ! src/share/vm/ci/ciTypeFlow.hpp ! src/share/vm/opto/c2_globals.hpp ! src/share/vm/opto/callnode.cpp ! src/share/vm/opto/callnode.hpp ! src/share/vm/opto/escape.cpp ! src/share/vm/opto/locknode.cpp ! src/share/vm/opto/locknode.hpp ! src/share/vm/opto/macro.cpp ! src/share/vm/opto/macro.hpp ! src/share/vm/opto/output.cpp ! src/share/vm/opto/parse1.cpp ! src/share/vm/runtime/arguments.cpp ! src/share/vm/runtime/deoptimization.cpp Changeset: 35acf8f0a2e4 Author: kvn Date: 2012-01-10 18:05 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/35acf8f0a2e4 7128352: assert(obj_node == obj) failed Summary: Compare uncasted object nodes. Reviewed-by: never ! src/share/vm/opto/callnode.cpp ! src/share/vm/opto/cfgnode.cpp ! src/share/vm/opto/library_call.cpp ! src/share/vm/opto/locknode.cpp ! src/share/vm/opto/macro.cpp ! src/share/vm/opto/memnode.cpp ! src/share/vm/opto/node.cpp ! src/share/vm/opto/node.hpp ! src/share/vm/opto/phaseX.hpp ! src/share/vm/opto/subnode.cpp ! test/compiler/7116216/StackOverflow.java Changeset: c8d8e124380c Author: kvn Date: 2012-01-12 12:28 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/c8d8e124380c 7064302: JDK7 build 147 crashed after testing my java 6-compiled web app Summary: Don't split CMove node if it's control edge is different from split region. Reviewed-by: never ! src/share/vm/opto/loopnode.cpp ! src/share/vm/opto/loopnode.hpp ! src/share/vm/opto/loopopts.cpp Changeset: 31a5b9aad4bc Author: jrose Date: 2012-01-13 00:27 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/31a5b9aad4bc Merge ! src/share/vm/runtime/arguments.cpp From Dmitry.Samersoff at oracle.com Fri Jan 13 06:34:57 2012 From: Dmitry.Samersoff at oracle.com (Dmitry Samersoff) Date: Fri, 13 Jan 2012 18:34:57 +0400 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F06CE3A.2040101@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <4F06CE3A.2040101@oracle.com> Message-ID: <4F104111.5090809@oracle.com> Tony, On 2012-01-06 14:34, Tony Printezis wrote: > Vitaly, > > Sure, but if the GC detects that the load is low it doesn't know whether > the load will remain low for 5 ms or 5 hours (and it's impossible to > know, maybe not even the application knows). I can already imagine the > bug reports: a spike suddenly happened in the market and the JVM was > "locked up" for several seconds!!! To be more practical, lets consider a vital case - a shop selling something to particular region. (e.g city's best pizza) It has clear visible pick hours (evening) and clear visible spare time (early morning) So cu would like to have as much free resources as possible at a pick hours. Solution widely used today - create a cluster, then restart nodes one by one at a spare time. What we(java) can do for such case: 1 (simple, doable today). Give a way to setup ergonomic profile e.g. something like -XX:PreferredGCTime. During this time VM sets GC parameters to collect as much as possible at the cost of higher load by GC thread. 2. (advanced) Collect heap usage statistics and calculate a probability of low load time. -Dmitry -- Dmitry Samersoff Java Hotspot development team, SPB04 * There will come soft rains ... From tony.printezis at oracle.com Fri Jan 13 08:47:27 2012 From: tony.printezis at oracle.com (Tony Printezis) Date: Fri, 13 Jan 2012 11:47:27 -0500 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F104111.5090809@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <4F06CE3A.2040101@oracle.com> <4F104111.5090809@oracle.com> Message-ID: <4F10601F.9010208@oracle.com> Dmitry, I totally appreciate that a lot of apps have non-trivial periods of idle time. However, you assume that there will be enough resources available to handle the peak time load so that we can wait to reclaim them when the load drops. I just don't think this is always a reasonable assumption. Depending what it's doing, an app can consume a lot of resources during peak time, If peak time lasts for a few hours (as it's probably the case in your pizza delivery example below). FWIW, Tony On 01/13/2012 09:34 AM, Dmitry Samersoff wrote: > Tony, > > > On 2012-01-06 14:34, Tony Printezis wrote: >> Vitaly, >> >> Sure, but if the GC detects that the load is low it doesn't know whether >> the load will remain low for 5 ms or 5 hours (and it's impossible to >> know, maybe not even the application knows). I can already imagine the >> bug reports: a spike suddenly happened in the market and the JVM was >> "locked up" for several seconds!!! > To be more practical, lets consider a vital case - a shop selling > something to particular region. (e.g city's best pizza) > > It has clear visible pick hours (evening) and clear visible spare time > (early morning) > > So cu would like to have as much free resources as possible at a pick > hours. Solution widely used today - create a cluster, then restart > nodes one by one at a spare time. > > What we(java) can do for such case: > > 1 (simple, doable today). Give a way to setup ergonomic profile e.g. > something like -XX:PreferredGCTime. During this time VM sets GC > parameters to collect as much as possible at the cost of higher > load by GC thread. > > 2. (advanced) Collect heap usage statistics and calculate a > probability of low load time. > > -Dmitry > From bengt.rutisson at oracle.com Fri Jan 13 09:02:16 2012 From: bengt.rutisson at oracle.com (bengt.rutisson at oracle.com) Date: Fri, 13 Jan 2012 17:02:16 +0000 Subject: hg: hsx/hotspot-main/hotspot: 9 new changesets Message-ID: <20120113170236.16C4847962@hg.openjdk.java.net> Changeset: bacb651cf5bf Author: tonyp Date: 2012-01-05 05:54 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/bacb651cf5bf 7113006: G1: excessive ergo output when an evac failure happens Summary: Introduce a flag that is set when a heap expansion attempt during a GC fails so that we do not consantly attempt to expand the heap when it's going to fail anyway. This not only prevents the excessive ergo output (which is generated when a region allocation fails) but also avoids excessive and ultimately unsuccessful expansion attempts. Reviewed-by: jmasa, johnc ! src/share/vm/gc_implementation/g1/g1CollectedHeap.cpp ! src/share/vm/gc_implementation/g1/g1CollectedHeap.hpp Changeset: 5fd354a959c5 Author: jmasa Date: 2012-01-05 21:21 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/5fd354a959c5 Merge Changeset: 023652e49ac0 Author: johnc Date: 2011-12-23 11:14 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/023652e49ac0 7121496: G1: do the per-region evacuation failure handling work in parallel Summary: Parallelize the removal of self forwarding pointers etc. by wrapping in a HeapRegion closure, which is then wrapped inside an AbstractGangTask. Reviewed-by: tonyp, iveresov ! src/share/vm/gc_implementation/g1/concurrentMark.cpp ! src/share/vm/gc_implementation/g1/g1CollectedHeap.cpp ! src/share/vm/gc_implementation/g1/g1CollectedHeap.hpp + src/share/vm/gc_implementation/g1/g1EvacFailure.hpp ! src/share/vm/gc_implementation/g1/heapRegion.hpp Changeset: 02838862dec8 Author: tonyp Date: 2012-01-07 00:43 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/02838862dec8 7121623: G1: always be able to reliably calculate the length of a forwarded chunked array Summary: Store the "next chunk start index" in the length field of the to-space object, instead of the from-space object, so that we can always reliably read the size of all from-space objects. Reviewed-by: johnc, ysr, jmasa ! src/share/vm/gc_implementation/g1/g1CollectedHeap.cpp Changeset: 97c00e21fecb Author: tonyp Date: 2012-01-09 23:50 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/97c00e21fecb 7125281: G1: heap expansion code is replicated Reviewed-by: brutisso, johnc ! src/share/vm/gc_implementation/g1/g1CollectedHeap.cpp Changeset: 1d6185f732aa Author: brutisso Date: 2012-01-10 20:02 +0100 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/1d6185f732aa 7128532: G1: Change default value of G1DefaultMaxNewGenPercent to 80 Reviewed-by: tonyp, jmasa ! src/share/vm/gc_implementation/g1/g1_globals.hpp Changeset: 2ace1c4ee8da Author: tonyp Date: 2012-01-10 18:58 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/2ace1c4ee8da 6888336: G1: avoid explicitly marking and pushing objects in survivor spaces Summary: This change simplifies the interaction between GC and concurrent marking. By disabling survivor spaces during the initial-mark pause we don't need to propagate marks of objects we copy during each GC (since we never need to copy an explicitly marked object). Reviewed-by: johnc, brutisso ! src/share/vm/gc_implementation/g1/concurrentMark.cpp ! src/share/vm/gc_implementation/g1/concurrentMark.hpp ! src/share/vm/gc_implementation/g1/concurrentMark.inline.hpp ! src/share/vm/gc_implementation/g1/g1CollectedHeap.cpp ! src/share/vm/gc_implementation/g1/g1CollectedHeap.hpp ! src/share/vm/gc_implementation/g1/g1CollectorPolicy.cpp ! src/share/vm/gc_implementation/g1/g1CollectorPolicy.hpp ! src/share/vm/gc_implementation/g1/g1EvacFailure.hpp ! src/share/vm/gc_implementation/g1/g1OopClosures.hpp ! src/share/vm/gc_implementation/g1/heapRegion.cpp ! src/share/vm/gc_implementation/g1/heapRegion.hpp ! src/share/vm/gc_implementation/g1/heapRegion.inline.hpp ! src/share/vm/gc_implementation/g1/ptrQueue.hpp ! src/share/vm/gc_implementation/g1/satbQueue.cpp ! src/share/vm/gc_implementation/g1/satbQueue.hpp Changeset: 9d4f4a1825e4 Author: brutisso Date: 2012-01-13 01:55 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/9d4f4a1825e4 Merge Changeset: 5acd82522540 Author: brutisso Date: 2012-01-13 06:18 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/5acd82522540 Merge From ysr1729 at gmail.com Fri Jan 13 09:15:44 2012 From: ysr1729 at gmail.com (Srinivas Ramakrishna) Date: Fri, 13 Jan 2012 09:15:44 -0800 Subject: JEP 132: More-prompt finalization In-Reply-To: <4F0A9AE3.7000602@oracle.com> References: <20111222230542.48A451084@eggemoggin.niobe.net> <4EF3E44F.1060409@oracle.com> <4EF4A8A4.1060502@oracle.com> <4EF5C695.4060206@oracle.com> <4EF9F709.4030101@oracle.com> <4EFA2869.6070105@oracle.com> <712FF280-2C59-47A0-99DE-ECEAD5FCFB15@kodewerk.com> <4EFAC2C7.7060508@oracle.com> <32CBF5EE-6AB8-4888-895E-37E495ACCD74@kodewerk.com> <4EFB5583.5000005@oracle.com> <4F05D3A0.1050503@oracle.com> <08196B9F-49AC-4DBF-B0C7-76903539C217@kodewerk.com> <4F065F5D.6040605@oracle.com> <4F06CE20.10809@oracle.com> <4F06D7A7.6010603@oracle.com> <4F071C4B.8010002@oracle.com> <4F0A257A.50104@oracle.com> <4F0A317B.6000702@oracle.com> <4F0A9AE3.7000602@oracle.com> Message-ID: On Sun, Jan 8, 2012 at 11:44 PM, David Holmes wrote: > On 9/01/2012 5:27 PM, Srinivas Ramakrishna wrote: > >> Ah, I see what you meant. Yes, there'd probably be some (theoretical) >> overhead from such tracking, >> > > I'm not considering multiple apps per VM I'm just thinking of a programmer > using java.io.* or java.net.* classes which have been "enhanced" to support > resource tracking. > I don't think the overhead would be high. The tracking we are considering is of a very simple nature, with the libraries doing some (atomic) incrementing and decrementing of a counter per allocation or finalization respectively (we could avoid contention by deferring updates to a global counter except at specific points although that may make rate calculations noisy), and GC polling using these counters to decide if a GC cycle should be run because the resource may be close to exhaustion in a little while. (One could alternatively have the allocation itself detecting that fact and nudging the GC subsystem to check; one can engineer this in a manner such that the cost itself can be made fairly small.) I suspect that the inline cost of these actions on native resource allocation will likely be quite small, even if not negligible, compared with the total cost of such a native resource allocation, i am guessing. > If the proposal involves new wrapper classes that an application has to > opt-in to use then that is fine. > Certainly, any such API should allow for opt-in (perhaps on a per-native-resource basis), albeit likely on a JVM-wide basis (per-resource) to make the resource-tracking effective. > > But I think this has gone off-track somewhat - finding alternatives to > finalization. The JEP is more about speeding up finalization for those apps > that (for better or worse) rely on it. I think there are two main aspects > of this: > - adding more threads to process finalizers and/or reference objects > - adding an API to request and run finalization etc (more efficiently that > doing: System.gc(); System.gc(); System.runFinalization() ) > Certainly that is the first step and who knows, it may allow us to mostly kick the can further down the road for a bit longer. > Although we "all" know "finalizers are evil", they exist and are > (mis-)used by some apps and cause some issues. > Fully agree. -- ramki > > Cheers, > David > > -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120113/3054bcbf/attachment-0001.html From vladimir.kozlov at oracle.com Fri Jan 13 19:02:50 2012 From: vladimir.kozlov at oracle.com (vladimir.kozlov at oracle.com) Date: Sat, 14 Jan 2012 03:02:50 +0000 Subject: hg: hsx/hotspot-main/hotspot: 4 new changesets Message-ID: <20120114030303.0F4D347973@hg.openjdk.java.net> Changeset: b0ff910edfc9 Author: kvn Date: 2012-01-12 14:45 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/b0ff910edfc9 7128355: assert(!nocreate) failed: Cannot build a phi for a block already parsed Summary: Do not common BoxLock nodes and avoid creating phis of boxes. Reviewed-by: never ! src/share/vm/opto/callnode.cpp ! src/share/vm/opto/locknode.cpp ! src/share/vm/opto/locknode.hpp ! src/share/vm/opto/macro.cpp ! src/share/vm/opto/parse1.cpp Changeset: f4d8930a45b9 Author: jrose Date: 2012-01-13 00:51 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/f4d8930a45b9 Merge Changeset: 89d0a5d40008 Author: kvn Date: 2012-01-13 12:58 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/89d0a5d40008 7129618: assert(obj_node->eqv_uncast(obj),""); Summary: Relax verification and locks elimination checks for new implementation (EliminateNestedLocks). Reviewed-by: iveresov ! src/share/vm/opto/locknode.cpp ! src/share/vm/opto/macro.cpp Changeset: e504fd26c073 Author: kvn Date: 2012-01-13 14:21 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/e504fd26c073 Merge From john.coomes at oracle.com Sat Jan 14 03:42:36 2012 From: john.coomes at oracle.com (john.coomes at oracle.com) Date: Sat, 14 Jan 2012 11:42:36 +0000 Subject: hg: hsx/hsx23/hotspot: 75 new changesets Message-ID: <20120114114512.BED8E47978@hg.openjdk.java.net> Changeset: fe2c87649981 Author: katleman Date: 2011-12-29 15:14 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/fe2c87649981 Added tag jdk8-b19 for changeset 9232e0ecbc2c ! .hgtags Changeset: 9952d1c439d6 Author: katleman Date: 2012-01-05 08:42 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/9952d1c439d6 Added tag jdk8-b20 for changeset fe2c87649981 ! .hgtags Changeset: ed621d125d02 Author: katleman Date: 2012-01-13 10:05 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/ed621d125d02 Added tag jdk8-b21 for changeset 9952d1c439d6 ! .hgtags Changeset: 0841c0ec2ed6 Author: amurillo Date: 2011-12-23 15:29 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/0841c0ec2ed6 7123810: new hotspot build - hs23-b10 Reviewed-by: jcoomes ! make/hotspot_version Changeset: 3b2b58fb1425 Author: tonyp Date: 2011-12-20 12:59 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/3b2b58fb1425 7123165: G1: output during parallel verification can get messed up Summary: Serialize the worker threads that are generating output during parallel heap verification to make sure the output is consistent. Reviewed-by: brutisso, johnc, jmasa ! src/share/vm/gc_implementation/g1/heapRegion.cpp Changeset: d15b458c4225 Author: jmasa Date: 2011-12-20 20:29 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/d15b458c4225 Merge Changeset: 67fdcb391461 Author: tonyp Date: 2011-12-21 07:53 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/67fdcb391461 7119027: G1: use atomics to update RS length / predict time of inc CSet Summary: Make sure that the updates to the RS length and inc CSet predicted time are updated in an MT-safe way. Reviewed-by: brutisso, iveresov ! src/share/vm/gc_implementation/g1/g1CollectorPolicy.cpp ! src/share/vm/gc_implementation/g1/g1CollectorPolicy.hpp Changeset: 441e946dc1af Author: jmasa Date: 2011-12-14 13:34 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/441e946dc1af 7121618: Change type of number of GC workers to unsigned int. Summary: Change variables representing the number of GC workers to uint from int and size_t. Change the parameter in work(int i) to work(uint worker_id). Reviewed-by: brutisso, tonyp ! src/share/vm/gc_implementation/concurrentMarkSweep/compactibleFreeListSpace.cpp ! src/share/vm/gc_implementation/concurrentMarkSweep/compactibleFreeListSpace.hpp ! src/share/vm/gc_implementation/concurrentMarkSweep/concurrentMarkSweepGeneration.cpp ! src/share/vm/gc_implementation/g1/collectionSetChooser.cpp ! src/share/vm/gc_implementation/g1/concurrentMark.cpp ! src/share/vm/gc_implementation/g1/concurrentMark.hpp ! src/share/vm/gc_implementation/g1/g1CollectedHeap.cpp ! src/share/vm/gc_implementation/g1/g1CollectedHeap.hpp ! src/share/vm/gc_implementation/g1/g1CollectorPolicy.cpp ! src/share/vm/gc_implementation/g1/g1RemSet.cpp ! src/share/vm/gc_implementation/g1/g1RemSet.hpp ! src/share/vm/gc_implementation/g1/g1RemSet.inline.hpp ! src/share/vm/gc_implementation/parNew/parCardTableModRefBS.cpp ! src/share/vm/gc_implementation/parNew/parNewGeneration.cpp ! src/share/vm/gc_implementation/parNew/parNewGeneration.hpp ! src/share/vm/gc_interface/collectedHeap.hpp ! src/share/vm/memory/genCollectedHeap.cpp ! src/share/vm/memory/genCollectedHeap.hpp ! src/share/vm/memory/referenceProcessor.cpp ! src/share/vm/memory/referenceProcessor.hpp ! src/share/vm/memory/sharedHeap.cpp ! src/share/vm/memory/sharedHeap.hpp ! src/share/vm/runtime/globals.hpp ! src/share/vm/utilities/workgroup.cpp ! src/share/vm/utilities/workgroup.hpp ! src/share/vm/utilities/yieldingWorkgroup.cpp ! src/share/vm/utilities/yieldingWorkgroup.hpp Changeset: 1cbe7978b021 Author: brutisso Date: 2011-12-21 22:13 +0100 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/1cbe7978b021 7113021: G1: automatically enable young gen size auto-tuning when -Xms==-Xmx Summary: Use a percentage of -Xms as min and another percentage of -Xmx as max for the young gen size Reviewed-by: tonyp, johnc ! src/share/vm/gc_implementation/g1/g1CollectorPolicy.cpp ! src/share/vm/gc_implementation/g1/g1CollectorPolicy.hpp ! src/share/vm/gc_implementation/g1/g1_globals.hpp Changeset: 7faca6dfa2ed Author: jmasa Date: 2011-12-27 12:38 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/7faca6dfa2ed Merge ! src/share/vm/runtime/globals.hpp Changeset: 4ceaf61479fc Author: dcubed Date: 2011-12-22 12:50 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/4ceaf61479fc 7122253: Instrumentation.retransformClasses() leaks class bytes Summary: Change ClassFileParser::parseClassFile() to use the instanceKlass:_cached_class_file_bytes field to avoid leaking the cache. Reviewed-by: coleenp, acorn, poonam ! src/share/vm/classfile/classFileParser.cpp ! src/share/vm/prims/jvmtiEnv.cpp ! src/share/vm/prims/jvmtiExport.cpp ! src/share/vm/prims/jvmtiRedefineClasses.cpp Changeset: 4ec93d767458 Author: vladidan Date: 2011-12-26 20:36 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/4ec93d767458 Merge Changeset: 3db6ea5ce021 Author: vladidan Date: 2011-12-29 20:09 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/3db6ea5ce021 Merge Changeset: 20bfb6d15a94 Author: iveresov Date: 2011-12-27 16:43 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/20bfb6d15a94 7124829: NUMA: memory leak on Linux with large pages Summary: In os::free_memory() use mmap with the same attributes as for the heap space Reviewed-by: kvn Contributed-by: Aleksey Ignatenko ! src/os/bsd/vm/os_bsd.cpp ! src/os/linux/vm/os_linux.cpp ! src/os/solaris/vm/os_solaris.cpp ! src/os/windows/vm/os_windows.cpp ! src/share/vm/gc_implementation/shared/mutableNUMASpace.cpp ! src/share/vm/gc_implementation/shared/mutableSpace.cpp ! src/share/vm/runtime/os.hpp Changeset: 776173fc2df9 Author: stefank Date: 2011-12-29 07:37 +0100 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/776173fc2df9 7125516: G1: ~ConcurrentMark() frees incorrectly Summary: Replaced the code with a ShouldNotReachHere Reviewed-by: tonyp, jmasa ! src/share/vm/gc_implementation/g1/concurrentMark.cpp Changeset: 5ee33ff9b1c4 Author: jmasa Date: 2012-01-03 10:22 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/5ee33ff9b1c4 Merge Changeset: 75c0a73eee98 Author: coleenp Date: 2011-11-17 12:53 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/75c0a73eee98 7102776: Pack instanceKlass boolean fields into single u1 field Summary: Reduce class runtime memory usage by packing 4 instanceKlass boolean fields into single u1 field. Save 4-byte for each loaded class. Reviewed-by: dholmes, bobv, phh, twisti, never, coleenp Contributed-by: Jiangli Zhou ! agent/src/share/classes/sun/jvm/hotspot/oops/InstanceKlass.java ! src/share/vm/code/dependencies.cpp ! src/share/vm/oops/instanceKlass.hpp ! src/share/vm/oops/instanceKlassKlass.cpp ! src/share/vm/runtime/vmStructs.cpp Changeset: da4dd142ea01 Author: bobv Date: 2011-11-29 14:44 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/da4dd142ea01 Merge ! src/share/vm/code/dependencies.cpp Changeset: 52b5d32fbfaf Author: coleenp Date: 2011-12-06 18:28 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/52b5d32fbfaf 7117052: instanceKlass::_init_state can be u1 type Summary: Change instanceKlass::_init_state field to u1 type. Reviewed-by: bdelsart, coleenp, dholmes, phh, never Contributed-by: Jiangli Zhou ! src/cpu/sparc/vm/c1_LIRAssembler_sparc.cpp ! src/cpu/sparc/vm/c1_Runtime1_sparc.cpp ! src/cpu/sparc/vm/templateTable_sparc.cpp ! src/cpu/x86/vm/c1_LIRAssembler_x86.cpp ! src/cpu/x86/vm/c1_Runtime1_x86.cpp ! src/cpu/x86/vm/templateTable_x86_32.cpp ! src/cpu/x86/vm/templateTable_x86_64.cpp ! src/share/vm/ci/ciInstanceKlass.cpp ! src/share/vm/memory/dump.cpp ! src/share/vm/oops/instanceKlass.cpp ! src/share/vm/oops/instanceKlass.hpp ! src/share/vm/opto/library_call.cpp ! src/share/vm/opto/parseHelper.cpp ! src/share/vm/runtime/vmStructs.cpp Changeset: eccc4b1f8945 Author: vladidan Date: 2011-12-07 16:47 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/eccc4b1f8945 7050298: ARM: SIGSEGV in JNIHandleBlock::allocate_handle Summary: missing release barrier in Monitor::IUnlock Reviewed-by: dholmes, dice ! src/share/vm/runtime/mutex.cpp Changeset: 2685ea97b89f Author: jiangli Date: 2011-12-09 11:29 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/2685ea97b89f Merge ! src/cpu/sparc/vm/c1_LIRAssembler_sparc.cpp Changeset: 8fdf463085e1 Author: jiangli Date: 2011-12-16 17:33 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/8fdf463085e1 Merge Changeset: dca455dea3a7 Author: bdelsart Date: 2011-12-20 12:33 +0100 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/dca455dea3a7 7116216: StackOverflow GC crash Summary: GC crash for explicit stack overflow checks after a C2I transition. Reviewed-by: coleenp, never Contributed-by: yang02.wang at sap.com, bertrand.delsart at oracle.com ! src/cpu/sparc/vm/stubGenerator_sparc.cpp ! src/cpu/sparc/vm/templateInterpreter_sparc.cpp ! src/cpu/x86/vm/stubGenerator_x86_32.cpp ! src/cpu/x86/vm/stubGenerator_x86_64.cpp ! src/cpu/x86/vm/templateInterpreter_x86_32.cpp ! src/cpu/x86/vm/templateInterpreter_x86_64.cpp + test/compiler/7116216/LargeFrame.java + test/compiler/7116216/StackOverflow.java Changeset: cd5d8cafcc84 Author: jiangli Date: 2011-12-28 12:15 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/cd5d8cafcc84 7123315: instanceKlass::_static_oop_field_count and instanceKlass::_java_fields_count should be u2 type. Summary: Change instanceKlass::_static_oop_field_count and instanceKlass::_java_fields_count to u2 type. Reviewed-by: never, bdelsart, dholmes Contributed-by: Jiangli Zhou ! src/share/vm/classfile/classFileParser.cpp ! src/share/vm/classfile/classFileParser.hpp ! src/share/vm/oops/instanceKlass.hpp ! src/share/vm/runtime/vmStructs.cpp Changeset: 05de27e852c4 Author: jiangli Date: 2012-01-04 12:36 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/05de27e852c4 Merge ! src/share/vm/classfile/classFileParser.cpp Changeset: b6a04c79ccbc Author: stefank Date: 2012-01-02 10:01 +0100 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/b6a04c79ccbc 7125503: Compiling collectedHeap.cpp fails with -Werror=int-to-pointer-cast with g++ 4.6.1 Summary: Used uintptr_t and void* for all the casts and checks in test_is_in. Reviewed-by: tonyp, jmasa ! src/share/vm/gc_interface/collectedHeap.cpp Changeset: 4753e3dda3c8 Author: jmasa Date: 2012-01-04 07:56 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/4753e3dda3c8 Merge Changeset: 2ee4167627a3 Author: jmasa Date: 2012-01-05 21:02 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/2ee4167627a3 Merge Changeset: 7ab5f6318694 Author: phh Date: 2012-01-01 11:17 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/7ab5f6318694 7125934: Add a fast unordered timestamp capability to Hotspot on x86/x64 Summary: Add rdtsc detection and inline generation. Reviewed-by: kamg, dholmes Contributed-by: karen.kinnear at oracle.com ! src/cpu/x86/vm/vm_version_x86.cpp ! src/cpu/x86/vm/vm_version_x86.hpp ! src/os_cpu/bsd_x86/vm/os_bsd_x86.hpp + src/os_cpu/bsd_x86/vm/os_bsd_x86.inline.hpp ! src/os_cpu/linux_x86/vm/os_linux_x86.hpp + src/os_cpu/linux_x86/vm/os_linux_x86.inline.hpp ! src/os_cpu/solaris_x86/vm/os_solaris_x86.hpp + src/os_cpu/solaris_x86/vm/os_solaris_x86.inline.hpp ! src/os_cpu/solaris_x86/vm/solaris_x86_32.il ! src/os_cpu/solaris_x86/vm/solaris_x86_64.il ! src/os_cpu/windows_x86/vm/os_windows_x86.hpp + src/os_cpu/windows_x86/vm/os_windows_x86.inline.hpp ! src/share/vm/runtime/init.cpp ! src/share/vm/runtime/os.cpp ! src/share/vm/runtime/os.hpp + src/share/vm/runtime/os_ext.hpp Changeset: b16494a69d3d Author: phh Date: 2012-01-03 15:11 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/b16494a69d3d 7126185: Clean up lasterror handling, add os::get_last_error() Summary: Add os::get_last_error(), replace getLastErrorString() by os::lasterror() in os_windows.cpp. Reviewed-by: kamg, dholmes Contributed-by: erik.gahlin at oracle.com ! src/os/posix/vm/os_posix.cpp ! src/os/windows/vm/os_windows.cpp ! src/share/vm/runtime/os.hpp Changeset: 5b58979183f9 Author: dcubed Date: 2012-01-05 06:24 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/5b58979183f9 7127032: fix for 7122253 adds a JvmtiThreadState earlier than necessary Summary: Use JavaThread::jvmti_thread_state() instead of JvmtiThreadState::state_for(). Reviewed-by: coleenp, poonam, acorn ! src/share/vm/classfile/classFileParser.cpp Changeset: 8a63c6323842 Author: fparain Date: 2012-01-05 07:26 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/8a63c6323842 7125594: C-heap growth issue in ThreadService::find_deadlocks_at_safepoint Reviewed-by: sspitsyn, dcubed, mchung, dholmes ! src/share/vm/services/threadService.cpp Changeset: 2e0ef19fc891 Author: phh Date: 2012-01-05 17:14 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/2e0ef19fc891 7126480: Make JVM start time in milliseconds since the Java epoch available Summary: Expose existing Management::_begin_vm_creation_time via new accessor Management::begin_vm_creation_time(). Reviewed-by: acorn, dcubed ! src/share/vm/services/management.hpp Changeset: 66259eca2bf7 Author: phh Date: 2012-01-05 17:16 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/66259eca2bf7 Merge Changeset: 2b3acb34791f Author: dcubed Date: 2012-01-06 16:18 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/2b3acb34791f Merge ! src/os/windows/vm/os_windows.cpp ! src/share/vm/classfile/classFileParser.cpp ! src/share/vm/runtime/os.hpp Changeset: abcceac2f7cd Author: iveresov Date: 2011-12-12 12:44 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/abcceac2f7cd 7119730: Tiered: SIGSEGV in AdvancedThresholdPolicy::is_method_profiled(methodOop) Summary: Added handles for references to methods in select_task() Reviewed-by: twisti, kvn ! src/share/vm/runtime/advancedThresholdPolicy.cpp Changeset: 7bca37d28f32 Author: roland Date: 2011-12-13 10:54 +0100 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/7bca37d28f32 7114106: C1: assert(goto_state->is_same(sux_state)) failed: states must match now Summary: fix C1's CEE to take inlining into account when the stacks in states are compared. Reviewed-by: iveresov, never ! src/share/vm/c1/c1_Optimizer.cpp Changeset: d725f0affb1a Author: iveresov Date: 2011-12-13 17:10 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/d725f0affb1a 7121111: -server -Xcomp -XX:+TieredCompilation does not invoke C2 compiler Summary: Exercise C2 more in tiered mode with Xcomp Reviewed-by: kvn, never ! src/share/vm/runtime/arguments.cpp Changeset: 127b3692c168 Author: kvn Date: 2011-12-14 14:54 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/127b3692c168 7116452: Add support for AVX instructions Summary: Added support for AVX extension to the x86 instruction set. Reviewed-by: never ! src/cpu/x86/vm/assembler_x86.cpp ! src/cpu/x86/vm/assembler_x86.hpp ! src/cpu/x86/vm/assembler_x86.inline.hpp ! src/cpu/x86/vm/nativeInst_x86.cpp ! src/cpu/x86/vm/nativeInst_x86.hpp ! src/cpu/x86/vm/register_definitions_x86.cpp ! src/cpu/x86/vm/vm_version_x86.cpp ! src/cpu/x86/vm/vm_version_x86.hpp ! src/cpu/x86/vm/x86_32.ad ! src/cpu/x86/vm/x86_64.ad ! src/share/vm/runtime/globals.hpp Changeset: 669f6a7d5b70 Author: never Date: 2011-12-19 14:16 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/669f6a7d5b70 7121073: secondary_super_cache memory slice has incorrect bounds in flatten_alias_type Reviewed-by: kvn ! src/share/vm/opto/compile.cpp Changeset: 65149e74c706 Author: kvn Date: 2011-12-20 00:55 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/65149e74c706 7121648: Use 3-operands SIMD instructions on x86 with AVX Summary: Use 3-operands SIMD instructions in C2 generated code for machines with AVX. Reviewed-by: never ! make/bsd/makefiles/adlc.make ! make/linux/makefiles/adlc.make ! make/solaris/makefiles/adlc.make ! make/windows/makefiles/adlc.make ! src/cpu/x86/vm/assembler_x86.cpp ! src/cpu/x86/vm/assembler_x86.hpp + src/cpu/x86/vm/x86.ad ! src/cpu/x86/vm/x86_32.ad ! src/cpu/x86/vm/x86_64.ad ! src/share/vm/opto/matcher.cpp Changeset: 069ab3f976d3 Author: stefank Date: 2011-12-07 11:35 +0100 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/069ab3f976d3 7118863: Move sizeof(klassOopDesc) into the *Klass::*_offset_in_bytes() functions Summary: Moved sizeof(klassOopDesc), changed the return type to ByteSize and removed the _in_bytes suffix. Reviewed-by: never, bdelsart, coleenp, jrose ! src/cpu/sparc/vm/assembler_sparc.cpp ! src/cpu/sparc/vm/c1_CodeStubs_sparc.cpp ! src/cpu/sparc/vm/c1_LIRAssembler_sparc.cpp ! src/cpu/sparc/vm/c1_MacroAssembler_sparc.cpp ! src/cpu/sparc/vm/c1_Runtime1_sparc.cpp ! src/cpu/sparc/vm/cppInterpreter_sparc.cpp ! src/cpu/sparc/vm/methodHandles_sparc.cpp ! src/cpu/sparc/vm/stubGenerator_sparc.cpp ! src/cpu/sparc/vm/templateInterpreter_sparc.cpp ! src/cpu/sparc/vm/templateTable_sparc.cpp ! src/cpu/x86/vm/assembler_x86.cpp ! src/cpu/x86/vm/c1_CodeStubs_x86.cpp ! src/cpu/x86/vm/c1_LIRAssembler_x86.cpp ! src/cpu/x86/vm/c1_MacroAssembler_x86.cpp ! src/cpu/x86/vm/c1_Runtime1_x86.cpp ! src/cpu/x86/vm/cppInterpreter_x86.cpp ! src/cpu/x86/vm/methodHandles_x86.cpp ! src/cpu/x86/vm/stubGenerator_x86_32.cpp ! src/cpu/x86/vm/stubGenerator_x86_64.cpp ! src/cpu/x86/vm/templateInterpreter_x86_32.cpp ! src/cpu/x86/vm/templateInterpreter_x86_64.cpp ! src/cpu/x86/vm/templateTable_x86_32.cpp ! src/cpu/x86/vm/templateTable_x86_64.cpp ! src/cpu/x86/vm/x86_64.ad ! src/share/vm/c1/c1_LIRGenerator.cpp ! src/share/vm/oops/arrayKlass.hpp ! src/share/vm/oops/instanceKlass.hpp ! src/share/vm/oops/klass.cpp ! src/share/vm/oops/klass.hpp ! src/share/vm/oops/klassOop.hpp ! src/share/vm/oops/objArrayKlass.hpp ! src/share/vm/opto/compile.cpp ! src/share/vm/opto/graphKit.cpp ! src/share/vm/opto/library_call.cpp ! src/share/vm/opto/macro.cpp ! src/share/vm/opto/memnode.cpp ! src/share/vm/opto/parse1.cpp ! src/share/vm/opto/parseHelper.cpp ! src/share/vm/shark/sharkIntrinsics.cpp ! src/share/vm/shark/sharkTopLevelBlock.cpp Changeset: 1dc233a8c7fe Author: roland Date: 2011-12-20 16:56 +0100 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/1dc233a8c7fe 7121140: Allocation paths require explicit memory synchronization operations for RMO systems Summary: adds store store barrier after initialization of header and body of objects. Reviewed-by: never, kvn ! src/cpu/sparc/vm/sparc.ad ! src/cpu/x86/vm/x86_32.ad ! src/cpu/x86/vm/x86_64.ad ! src/share/vm/adlc/formssel.cpp ! src/share/vm/opto/callnode.hpp ! src/share/vm/opto/classes.hpp ! src/share/vm/opto/escape.cpp ! src/share/vm/opto/graphKit.cpp ! src/share/vm/opto/library_call.cpp ! src/share/vm/opto/macro.cpp ! src/share/vm/opto/memnode.cpp ! src/share/vm/opto/memnode.hpp ! src/share/vm/opto/node.hpp Changeset: e5ac210043cd Author: roland Date: 2011-12-22 10:55 +0100 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/e5ac210043cd 7123108: C1: assert(if_state != NULL) failed: states do not match up Summary: In CEE, ensure if and common successor state are at the same inline level Reviewed-by: never ! src/share/vm/c1/c1_Optimizer.cpp + test/compiler/7123108/Test7123108.java Changeset: b642b49f9738 Author: roland Date: 2011-12-23 09:36 +0100 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/b642b49f9738 7123253: C1: in store check code, usage of registers may be incorrect Summary: fix usage of input register in assembly code for store check. Reviewed-by: never ! src/share/vm/c1/c1_LIR.cpp Changeset: 40c2484c09e1 Author: kvn Date: 2011-12-23 15:24 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/40c2484c09e1 7110832: ctw/.../org_apache_avalon_composition_util_StringHelper crashes the VM Summary: Distance is too large for one short branch in string_indexofC8(). Reviewed-by: iveresov ! src/cpu/x86/vm/assembler_x86.cpp ! src/share/vm/asm/assembler.cpp ! src/share/vm/asm/assembler.hpp Changeset: d12a66fa3820 Author: kvn Date: 2011-12-27 15:08 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/d12a66fa3820 7123954: Some CTW test crash with SIGSEGV Summary: Correct Allocate expansion code to preserve i_o when only slow call is generated. Reviewed-by: iveresov ! src/share/vm/opto/compile.cpp ! src/share/vm/opto/macro.cpp Changeset: 8940fd98d540 Author: kvn Date: 2011-12-29 11:37 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/8940fd98d540 Merge ! src/cpu/x86/vm/assembler_x86.cpp ! src/share/vm/runtime/globals.hpp Changeset: 9c87bcb3b4dd Author: kvn Date: 2011-12-30 11:43 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/9c87bcb3b4dd 7125879: assert(proj != NULL) failed: must be found Summary: Leave i_o attached to slow allocation call when there are no i_o users after the call. Reviewed-by: iveresov, twisti ! src/share/vm/opto/macro.cpp + test/compiler/7125879/Test7125879.java Changeset: 1cb50d7a9d95 Author: iveresov Date: 2012-01-05 17:25 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/1cb50d7a9d95 7119294: Two command line options cause JVM to crash Summary: Setup thread register in MacroAssembler::incr_allocated_bytes() on x64 Reviewed-by: kvn ! src/cpu/x86/vm/assembler_x86.cpp Changeset: 22cee0ee8927 Author: kvn Date: 2012-01-06 20:09 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/22cee0ee8927 Merge ! src/cpu/sparc/vm/c1_LIRAssembler_sparc.cpp ! src/cpu/sparc/vm/c1_Runtime1_sparc.cpp ! src/cpu/sparc/vm/stubGenerator_sparc.cpp ! src/cpu/sparc/vm/templateInterpreter_sparc.cpp ! src/cpu/sparc/vm/templateTable_sparc.cpp ! src/cpu/x86/vm/c1_LIRAssembler_x86.cpp ! src/cpu/x86/vm/c1_Runtime1_x86.cpp ! src/cpu/x86/vm/stubGenerator_x86_32.cpp ! src/cpu/x86/vm/stubGenerator_x86_64.cpp ! src/cpu/x86/vm/templateInterpreter_x86_32.cpp ! src/cpu/x86/vm/templateInterpreter_x86_64.cpp ! src/cpu/x86/vm/templateTable_x86_32.cpp ! src/cpu/x86/vm/templateTable_x86_64.cpp ! src/cpu/x86/vm/vm_version_x86.cpp ! src/cpu/x86/vm/vm_version_x86.hpp ! src/share/vm/oops/instanceKlass.hpp ! src/share/vm/opto/library_call.cpp ! src/share/vm/opto/parseHelper.cpp Changeset: 8f8b94305aff Author: dcubed Date: 2012-01-11 19:54 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/8f8b94305aff 7129240: backout fix for 7102776 until 7128770 is resolved Reviewed-by: phh, bobv, coleenp, dcubed Contributed-by: Jiangli Zhou ! agent/src/share/classes/sun/jvm/hotspot/oops/InstanceKlass.java ! src/share/vm/code/dependencies.cpp ! src/share/vm/oops/instanceKlass.hpp ! src/share/vm/oops/instanceKlassKlass.cpp ! src/share/vm/runtime/vmStructs.cpp Changeset: 4f25538b54c9 Author: fparain Date: 2012-01-09 10:27 +0100 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/4f25538b54c9 7120511: Add diagnostic commands Reviewed-by: acorn, phh, dcubed, sspitsyn ! src/share/vm/classfile/vmSymbols.hpp ! src/share/vm/runtime/arguments.cpp ! src/share/vm/runtime/globals.cpp ! src/share/vm/runtime/globals.hpp ! src/share/vm/runtime/init.cpp ! src/share/vm/services/attachListener.cpp ! src/share/vm/services/diagnosticCommand.cpp ! src/share/vm/services/diagnosticCommand.hpp ! src/share/vm/services/diagnosticFramework.cpp ! src/share/vm/services/diagnosticFramework.hpp ! src/share/vm/services/management.cpp Changeset: 865e0817f32b Author: kamg Date: 2012-01-10 15:47 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/865e0817f32b Merge ! src/share/vm/runtime/arguments.cpp ! src/share/vm/runtime/globals.hpp Changeset: efdf6985a3a2 Author: kamg Date: 2012-01-12 09:59 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/efdf6985a3a2 Merge Changeset: 5da7201222d5 Author: kvn Date: 2012-01-07 10:39 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/5da7201222d5 7110824: ctw/jarfiles/GUI3rdParty_jar/ob_mask_DateField crashes VM Summary: Change yank_if_dead() to recursive method to remove all dead inputs. Reviewed-by: never ! src/cpu/sparc/vm/sparc.ad ! src/share/vm/opto/chaitin.hpp ! src/share/vm/opto/postaloc.cpp Changeset: e9a5e0a812c8 Author: kvn Date: 2012-01-07 13:26 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/e9a5e0a812c8 7125896: Eliminate nested locks Summary: Nested locks elimination done before lock nodes expansion by looking for outer locks of the same object. Reviewed-by: never, twisti ! src/cpu/sparc/vm/sparc.ad ! src/cpu/x86/vm/x86_32.ad ! src/cpu/x86/vm/x86_64.ad ! src/share/vm/ci/ciTypeFlow.cpp ! src/share/vm/ci/ciTypeFlow.hpp ! src/share/vm/opto/c2_globals.hpp ! src/share/vm/opto/callnode.cpp ! src/share/vm/opto/callnode.hpp ! src/share/vm/opto/escape.cpp ! src/share/vm/opto/locknode.cpp ! src/share/vm/opto/locknode.hpp ! src/share/vm/opto/macro.cpp ! src/share/vm/opto/macro.hpp ! src/share/vm/opto/output.cpp ! src/share/vm/opto/parse1.cpp ! src/share/vm/runtime/arguments.cpp ! src/share/vm/runtime/deoptimization.cpp Changeset: 35acf8f0a2e4 Author: kvn Date: 2012-01-10 18:05 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/35acf8f0a2e4 7128352: assert(obj_node == obj) failed Summary: Compare uncasted object nodes. Reviewed-by: never ! src/share/vm/opto/callnode.cpp ! src/share/vm/opto/cfgnode.cpp ! src/share/vm/opto/library_call.cpp ! src/share/vm/opto/locknode.cpp ! src/share/vm/opto/macro.cpp ! src/share/vm/opto/memnode.cpp ! src/share/vm/opto/node.cpp ! src/share/vm/opto/node.hpp ! src/share/vm/opto/phaseX.hpp ! src/share/vm/opto/subnode.cpp ! test/compiler/7116216/StackOverflow.java Changeset: c8d8e124380c Author: kvn Date: 2012-01-12 12:28 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/c8d8e124380c 7064302: JDK7 build 147 crashed after testing my java 6-compiled web app Summary: Don't split CMove node if it's control edge is different from split region. Reviewed-by: never ! src/share/vm/opto/loopnode.cpp ! src/share/vm/opto/loopnode.hpp ! src/share/vm/opto/loopopts.cpp Changeset: 31a5b9aad4bc Author: jrose Date: 2012-01-13 00:27 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/31a5b9aad4bc Merge ! src/share/vm/runtime/arguments.cpp Changeset: bacb651cf5bf Author: tonyp Date: 2012-01-05 05:54 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/bacb651cf5bf 7113006: G1: excessive ergo output when an evac failure happens Summary: Introduce a flag that is set when a heap expansion attempt during a GC fails so that we do not consantly attempt to expand the heap when it's going to fail anyway. This not only prevents the excessive ergo output (which is generated when a region allocation fails) but also avoids excessive and ultimately unsuccessful expansion attempts. Reviewed-by: jmasa, johnc ! src/share/vm/gc_implementation/g1/g1CollectedHeap.cpp ! src/share/vm/gc_implementation/g1/g1CollectedHeap.hpp Changeset: 5fd354a959c5 Author: jmasa Date: 2012-01-05 21:21 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/5fd354a959c5 Merge Changeset: 023652e49ac0 Author: johnc Date: 2011-12-23 11:14 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/023652e49ac0 7121496: G1: do the per-region evacuation failure handling work in parallel Summary: Parallelize the removal of self forwarding pointers etc. by wrapping in a HeapRegion closure, which is then wrapped inside an AbstractGangTask. Reviewed-by: tonyp, iveresov ! src/share/vm/gc_implementation/g1/concurrentMark.cpp ! src/share/vm/gc_implementation/g1/g1CollectedHeap.cpp ! src/share/vm/gc_implementation/g1/g1CollectedHeap.hpp + src/share/vm/gc_implementation/g1/g1EvacFailure.hpp ! src/share/vm/gc_implementation/g1/heapRegion.hpp Changeset: 02838862dec8 Author: tonyp Date: 2012-01-07 00:43 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/02838862dec8 7121623: G1: always be able to reliably calculate the length of a forwarded chunked array Summary: Store the "next chunk start index" in the length field of the to-space object, instead of the from-space object, so that we can always reliably read the size of all from-space objects. Reviewed-by: johnc, ysr, jmasa ! src/share/vm/gc_implementation/g1/g1CollectedHeap.cpp Changeset: 97c00e21fecb Author: tonyp Date: 2012-01-09 23:50 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/97c00e21fecb 7125281: G1: heap expansion code is replicated Reviewed-by: brutisso, johnc ! src/share/vm/gc_implementation/g1/g1CollectedHeap.cpp Changeset: 1d6185f732aa Author: brutisso Date: 2012-01-10 20:02 +0100 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/1d6185f732aa 7128532: G1: Change default value of G1DefaultMaxNewGenPercent to 80 Reviewed-by: tonyp, jmasa ! src/share/vm/gc_implementation/g1/g1_globals.hpp Changeset: 2ace1c4ee8da Author: tonyp Date: 2012-01-10 18:58 -0500 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/2ace1c4ee8da 6888336: G1: avoid explicitly marking and pushing objects in survivor spaces Summary: This change simplifies the interaction between GC and concurrent marking. By disabling survivor spaces during the initial-mark pause we don't need to propagate marks of objects we copy during each GC (since we never need to copy an explicitly marked object). Reviewed-by: johnc, brutisso ! src/share/vm/gc_implementation/g1/concurrentMark.cpp ! src/share/vm/gc_implementation/g1/concurrentMark.hpp ! src/share/vm/gc_implementation/g1/concurrentMark.inline.hpp ! src/share/vm/gc_implementation/g1/g1CollectedHeap.cpp ! src/share/vm/gc_implementation/g1/g1CollectedHeap.hpp ! src/share/vm/gc_implementation/g1/g1CollectorPolicy.cpp ! src/share/vm/gc_implementation/g1/g1CollectorPolicy.hpp ! src/share/vm/gc_implementation/g1/g1EvacFailure.hpp ! src/share/vm/gc_implementation/g1/g1OopClosures.hpp ! src/share/vm/gc_implementation/g1/heapRegion.cpp ! src/share/vm/gc_implementation/g1/heapRegion.hpp ! src/share/vm/gc_implementation/g1/heapRegion.inline.hpp ! src/share/vm/gc_implementation/g1/ptrQueue.hpp ! src/share/vm/gc_implementation/g1/satbQueue.cpp ! src/share/vm/gc_implementation/g1/satbQueue.hpp Changeset: 9d4f4a1825e4 Author: brutisso Date: 2012-01-13 01:55 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/9d4f4a1825e4 Merge Changeset: 5acd82522540 Author: brutisso Date: 2012-01-13 06:18 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/5acd82522540 Merge Changeset: b0ff910edfc9 Author: kvn Date: 2012-01-12 14:45 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/b0ff910edfc9 7128355: assert(!nocreate) failed: Cannot build a phi for a block already parsed Summary: Do not common BoxLock nodes and avoid creating phis of boxes. Reviewed-by: never ! src/share/vm/opto/callnode.cpp ! src/share/vm/opto/locknode.cpp ! src/share/vm/opto/locknode.hpp ! src/share/vm/opto/macro.cpp ! src/share/vm/opto/parse1.cpp Changeset: f4d8930a45b9 Author: jrose Date: 2012-01-13 00:51 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/f4d8930a45b9 Merge Changeset: 89d0a5d40008 Author: kvn Date: 2012-01-13 12:58 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/89d0a5d40008 7129618: assert(obj_node->eqv_uncast(obj),""); Summary: Relax verification and locks elimination checks for new implementation (EliminateNestedLocks). Reviewed-by: iveresov ! src/share/vm/opto/locknode.cpp ! src/share/vm/opto/macro.cpp Changeset: e504fd26c073 Author: kvn Date: 2012-01-13 14:21 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/e504fd26c073 Merge Changeset: 513351373923 Author: amurillo Date: 2012-01-14 00:47 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/513351373923 Merge Changeset: 24727fb37561 Author: amurillo Date: 2012-01-14 00:47 -0800 URL: http://hg.openjdk.java.net/hsx/hsx23/hotspot/rev/24727fb37561 Added tag hs23-b10 for changeset 513351373923 ! .hgtags From james.melvin at oracle.com Sat Jan 14 09:02:47 2012 From: james.melvin at oracle.com (James Melvin) Date: Sat, 14 Jan 2012 12:02:47 -0500 Subject: RFR (XS): 7126732: MAC: Require Mac OS X builds/tests for JPRT integrate jobs for HotSpot Message-ID: <4F11B537.2000504@oracle.com> Greetings, We're ready to require HotSpot builds on Mac OS X for JPRT integrate jobs. There are 3 mac-minis in each queue. Build/Test times are short relative to other platforms. Uses the stable Linux testlist for now. http://cr.openjdk.java.net/~jmelvin/7126732/webrev.00 Tested with *several* JPRT submissions for other bugfixes. I'd like to integrate this change right after the current snapshot window. Feedback welcome. Thanks, Jim From john.coomes at oracle.com Sat Jan 14 09:15:38 2012 From: john.coomes at oracle.com (john.coomes at oracle.com) Date: Sat, 14 Jan 2012 17:15:38 +0000 Subject: hg: hsx/hotspot-main/hotspot: 6 new changesets Message-ID: <20120114171557.8674747979@hg.openjdk.java.net> Changeset: fe2c87649981 Author: katleman Date: 2011-12-29 15:14 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/fe2c87649981 Added tag jdk8-b19 for changeset 9232e0ecbc2c ! .hgtags Changeset: 9952d1c439d6 Author: katleman Date: 2012-01-05 08:42 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/9952d1c439d6 Added tag jdk8-b20 for changeset fe2c87649981 ! .hgtags Changeset: ed621d125d02 Author: katleman Date: 2012-01-13 10:05 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/ed621d125d02 Added tag jdk8-b21 for changeset 9952d1c439d6 ! .hgtags Changeset: 513351373923 Author: amurillo Date: 2012-01-14 00:47 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/513351373923 Merge Changeset: 24727fb37561 Author: amurillo Date: 2012-01-14 00:47 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/24727fb37561 Added tag hs23-b10 for changeset 513351373923 ! .hgtags Changeset: 4e80db53c323 Author: amurillo Date: 2012-01-14 00:52 -0800 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/4e80db53c323 7129512: new hotspot build - hs23-b11 Reviewed-by: jcoomes ! make/hotspot_version From manojo10386 at gmail.com Sun Jan 15 09:59:26 2012 From: manojo10386 at gmail.com (Manohar Jonnalagedda) Date: Sun, 15 Jan 2012 18:59:26 +0100 Subject: Detecting range check elimination with PrintAssembly Message-ID: Hello, following this reference on Range Check Elimination done by the Hotspot compiler [1], I was keen in knowing how I can detect whether range checks are taking place in loops by inspecting output using the PrintAssembly flag; with the old PrintOptoAssembly flag, I have seen output such as the following, which I assume to be range checks : B11: # B73 B12 <- B10 Freq: 1.21365 139 movq RAX, [rsp + #24] # spill 13e movl RSI, [RAX + #12 (8-bit)] # range 141 NullCheck RAX What is the equivalent with the new PrintAssembly flag (using hsdis)? Moreover, as stated on the wiki page [1], loops are optimized if the stride is a compile-time constant. I performed a few tests on a kmeans program, with 3 nested loops, having the following (high-level) structure: === void method1(){ //loop 1 for(int i = 0; i< rows1; i++){ //... for(int j = 0; j< rows2; j++){ //... for(int k = 0; j < cols; k++){ array[j * cols + k] = //...} } } } void method2(){ //loop 2 for(int i =0; i < rows1; i++){ for(int j=0 ; i< rows2; j++){ for(int k=0 ; k< cols; k++){ array[i*cols+k] = //... } } } } void main(){ do{ method1(); method2(); }while(!converged) } ==== In the first test, cols is an int whose value is determined at runtime (by reading a file), in the second test, it is given as a compile-time constant(3). In the second test, there is a **significant** speed-up (around 40%). However, when studying the diff of the output of PrintOptoAssembly for both method1 and method2, there is no difference (apart from slight value changes in frequency). Would you have any hints as to where I could look for differences? Thanks a lot, Manohar [1] https://wikis.oracle.com/display/HotSpotInternals/RangeCheckElimination -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120115/819ae1fc/attachment.html From rednaxelafx at gmail.com Sun Jan 15 12:22:59 2012 From: rednaxelafx at gmail.com (Krystal Mok) Date: Mon, 16 Jan 2012 04:22:59 +0800 Subject: Detecting range check elimination with PrintAssembly In-Reply-To: References: Message-ID: Hi, In your PrintOptoAssembly output snippet, the instruction at 0x13e is a LoadRange, which loads the range from the header of an array: (from x86_64.ad) // Load Range instruct loadRange(rRegI dst, memory mem) %{ match(Set dst (LoadRange mem)); ins_cost(125); // XXX format %{ "movl $dst, $mem\t# range" %} opcode(0x8B); ins_encode(REX_reg_mem(dst, mem), OpcP, reg_mem(dst, mem)); ins_pipe(ialu_reg_mem); %} That's not a range check just yet; the real check, if any, should come after the null check, in the form of comparing something else with RSI. But you didn't show what's after the null check, how RSI is used, so it's hard to say what you're seeing in your example. As for the two test examples, could you paste the entire source code, with the PrintOptoAssembly output of method1() and method2() ? The first example looks weird, maybe it's a typo but you're using "j < cols" as the loop condition for the inner loop. I'd guess it's the difference in locality that made the difference in performance in your two tests. - Kris On Mon, Jan 16, 2012 at 1:59 AM, Manohar Jonnalagedda wrote: > Hello, > > following this reference on Range Check Elimination done by the Hotspot > compiler [1], I was keen in knowing how I can detect whether range checks > are taking place in loops by inspecting output using the PrintAssembly > flag; with the old PrintOptoAssembly flag, I have seen output such as the > following, which I assume to be range checks : > > B11: # B73 B12 <- B10 Freq: 1.21365 > 139 movq RAX, [rsp + #24] # spill > 13e movl RSI, [RAX + #12 (8-bit)] # range > 141 NullCheck RAX > > What is the equivalent with the new PrintAssembly flag (using hsdis)? > > Moreover, as stated on the wiki page [1], loops are optimized if the > stride is a compile-time constant. I performed a few tests on a kmeans > program, with 3 nested loops, having the following (high-level) structure: > > === > void method1(){ > //loop 1 > for(int i = 0; i< rows1; i++){ > //... > for(int j = 0; j< rows2; j++){ > //... > for(int k = 0; j < cols; k++){ array[j * cols + k] = //...} > } > } > } > > void method2(){ > //loop 2 > for(int i =0; i < rows1; i++){ > for(int j=0 ; i< rows2; j++){ > for(int k=0 ; k< cols; k++){ > array[i*cols+k] = //... > } > } > } > } > > void main(){ > > do{ > method1(); method2(); > }while(!converged) > > } > ==== > > In the first test, cols is an int whose value is determined at runtime > (by reading a file), in the second test, it is given as a compile-time > constant(3). In the second test, there is a **significant** speed-up > (around 40%). However, when studying the diff of the output of > PrintOptoAssembly for both method1 and method2, there is no difference > (apart from slight value changes in frequency). Would you have any hints as > to where I could look for differences? > > Thanks a lot, > Manohar > > [1] > https://wikis.oracle.com/display/HotSpotInternals/RangeCheckElimination > -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120116/691adbe4/attachment.html From david.holmes at oracle.com Sun Jan 15 13:47:09 2012 From: david.holmes at oracle.com (David Holmes) Date: Mon, 16 Jan 2012 07:47:09 +1000 Subject: RFR (XS): 7126732: MAC: Require Mac OS X builds/tests for JPRT integrate jobs for HotSpot In-Reply-To: <4F11B537.2000504@oracle.com> References: <4F11B537.2000504@oracle.com> Message-ID: <4F13495D.9040302@oracle.com> Hi Jim, On 15/01/2012 3:02 AM, James Melvin wrote: > Greetings, > > We're ready to require HotSpot builds on Mac OS X for JPRT integrate > jobs. There are 3 mac-minis in each queue. Build/Test times are short > relative to other platforms. Uses the stable Linux testlist for now. Maybe I'm missing something but isn't the OSX code currently primarily in the jdk7u-macosx repo? Or is all the hotspot code already in mainline? David ----- > http://cr.openjdk.java.net/~jmelvin/7126732/webrev.00 > > Tested with *several* JPRT submissions for other bugfixes. I'd like to > integrate this change right after the current snapshot window. > > Feedback welcome. > > Thanks, > > Jim From vladimir.kozlov at oracle.com Sun Jan 15 14:30:12 2012 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Sun, 15 Jan 2012 14:30:12 -0800 Subject: Detecting range check elimination with PrintAssembly In-Reply-To: References:

Message-ID: <4F135374.8020004@oracle.com> Manohar, As Kris pointed you need to fix your example: void method1(){ ... for(int k = 0; j < cols; k++){ ^ 'k' ? void method2(){ ... for(int j=0 ; i< rows2; j++){ ^ 'j' ? Second, your two test methods are different so you can't directly compare them. method1() iterates over rows using middle loop index 'j' and method2() uses external loop index 'i'. Unless they are typos again. Third, PrintOptoAssembly flag is not 'old'. It can be only used in debug VM to print 'pseudo' assembler by JIT compiler itself and not by hsdis disassembler. PrintAssembly flag can be used with product VM but it needs hsdis library. If you are using jdk7 there are few flags you can use to print loop optimizations information. They need debug version of VM but it is not problem for you, I think, since you can use debug PrintOptoAssembly flag. -XX:+TraceLoopOpts prints all happened loop optimizations and loop tree after each round of loop opts, -XX:+TraceLoopPredicate prints RC information when it is moved from a loop, -XX:+TraceRangeLimitCheck prints additional information for RC elimination optimization. Fourth, range check expression in your example is not what you think. RC expression should be next: (i*stride+offset) where 'i' is loop variable, 'stride' is constant and 'offset' is loop invariant. In your example 'offset' is (j * cols) since it is loop invariant, 'k' is loop variable and stride is '1' (one). In both your methods RC will be moved out of inner loop so the code for it will be the same. The only difference in these methods will be where and how (j * cols) and (i * cols) expressions are calculated. Regards, Vladimir On 1/15/12 12:22 PM, Krystal Mok wrote: > Hi, > > In your PrintOptoAssembly output snippet, the instruction at 0x13e is a LoadRange, which loads the range from the header > of an array: > > (from x86_64.ad ) > // Load Range > instruct loadRange(rRegI dst, memory mem) > %{ > match(Set dst (LoadRange mem)); > > ins_cost(125); // XXX > format %{ "movl $dst, $mem\t# range" %} > opcode(0x8B); > ins_encode(REX_reg_mem(dst, mem), OpcP, reg_mem(dst, mem)); > ins_pipe(ialu_reg_mem); > %} > > That's not a range check just yet; the real check, if any, should come after the null check, in the form of comparing > something else with RSI. But you didn't show what's after the null check, how RSI is used, so it's hard to say what > you're seeing in your example. > > As for the two test examples, could you paste the entire source code, with the PrintOptoAssembly output of method1() and > method2() ? The first example looks weird, maybe it's a typo but you're using "j < cols" as the loop condition for the > inner loop. > > I'd guess it's the difference in locality that made the difference in performance in your two tests. > > - Kris > > On Mon, Jan 16, 2012 at 1:59 AM, Manohar Jonnalagedda > wrote: > > Hello, > > following this reference on Range Check Elimination done by the Hotspot compiler [1], I was keen in knowing how I > can detect whether range checks are taking place in loops by inspecting output using the PrintAssembly flag; with > the old PrintOptoAssembly flag, I have seen output such as the following, which I assume to be range checks : > > B11: # B73 B12 <- B10 Freq: 1.21365 > 139 movq RAX, [rsp + #24] # spill > 13e movl RSI, [RAX + #12 (8-bit)] # range > 141 NullCheck RAX > > What is the equivalent with the new PrintAssembly flag (using hsdis)? > > Moreover, as stated on the wiki page [1], loops are optimized if the stride is a compile-time constant. I performed > a few tests on a kmeans program, with 3 nested loops, having the following (high-level) structure: > > === > void method1(){ > //loop 1 > for(int i = 0; i< rows1; i++){ > //... > for(int j = 0; j< rows2; j++){ > //... > for(int k = 0; j < cols; k++){ array[j * cols + k] = //...} > } > } > } > > void method2(){ > //loop 2 > for(int i =0; i < rows1; i++){ > for(int j=0 ; i< rows2; j++){ > for(int k=0 ; k< cols; k++){ > array[i*cols+k] = //... > } > } > } > } > > void main(){ > > do{ > method1(); method2(); > }while(!converged) > > } > ==== > > In the first test, cols is an int whose value is determined at runtime (by reading a file), in the second test, it > is given as a compile-time constant(3). In the second test, there is a */significant*/ speed-up (around 40%). > However, when studying the diff of the output of PrintOptoAssembly for both method1 and method2, there is no > difference (apart from slight value changes in frequency). Would you have any hints as to where I could look for > differences? > > Thanks a lot, > Manohar > > [1] https://wikis.oracle.com/display/HotSpotInternals/RangeCheckElimination > > From daniel.daugherty at oracle.com Sun Jan 15 15:35:31 2012 From: daniel.daugherty at oracle.com (Daniel D. Daugherty) Date: Sun, 15 Jan 2012 16:35:31 -0700 Subject: RFR (XS): 7126732: MAC: Require Mac OS X builds/tests for JPRT integrate jobs for HotSpot In-Reply-To: <4F13495D.9040302@oracle.com> References: <4F11B537.2000504@oracle.com> <4F13495D.9040302@oracle.com> Message-ID: <4F1362C3.4060600@oracle.com> The hotspot part of the MacOS X port was pushed to Main_Baseline on 2011.10.21. I've attached the changeset notification. The rest of the MacOS X port is primarily in jdk7u-osx forest... Item of interest: the second reviewer that got back to me on those hotspot bits was this 'dholmes' guy... :-) Dan On 1/15/12 2:47 PM, David Holmes wrote: > Hi Jim, > > On 15/01/2012 3:02 AM, James Melvin wrote: >> Greetings, >> >> We're ready to require HotSpot builds on Mac OS X for JPRT integrate >> jobs. There are 3 mac-minis in each queue. Build/Test times are short >> relative to other platforms. Uses the stable Linux testlist for now. > > Maybe I'm missing something but isn't the OSX code currently primarily > in the jdk7u-macosx repo? Or is all the hotspot code already in mainline? > > David > ----- > >> http://cr.openjdk.java.net/~jmelvin/7126732/webrev.00 >> >> Tested with *several* JPRT submissions for other bugfixes. I'd like to >> integrate this change right after the current snapshot window. >> >> Feedback welcome. >> >> Thanks, >> >> Jim -------------- next part -------------- An embedded message was scrubbed... From: daniel.daugherty at oracle.com Subject: hg: hsx/hotspot-main/hotspot: 3 new changesets Date: Fri, 21 Oct 2011 07:29:37 +0000 Size: 6613 Url: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120115/97113485/AttachedMessage-0001.nws From james.melvin at oracle.com Sun Jan 15 18:25:30 2012 From: james.melvin at oracle.com (James Melvin) Date: Sun, 15 Jan 2012 21:25:30 -0500 Subject: RFR (XS): 7126732: MAC: Require Mac OS X builds/tests for JPRT integrate jobs for HotSpot In-Reply-To: <4F1362C3.4060600@oracle.com> References: <4F11B537.2000504@oracle.com> <4F13495D.9040302@oracle.com> <4F1362C3.4060600@oracle.com> Message-ID: <4F138A9A.8030102@oracle.com> > The hotspot part of the MacOS X port was pushed to Main_Baseline on > 2011.10.21. I've attached the changeset notification. Yes, the current HotSpot source includes the port to Mac OS X. The change I am proposing will simply add Mac OS X to the default set of platforms in JPRT which must pass for integrate jobs. If the job does not pass, the changeset will not be integrated. This required smoke test ensures HotSpot is beta quality or better at any point in time. The only risk is if future changesets break the Mac OS X build or test. Such breakage would need to be resolved before the change can be integrated. - Jim On 1/15/12 6:35 PM, Daniel D. Daugherty wrote: > The hotspot part of the MacOS X port was pushed to Main_Baseline on > 2011.10.21. I've attached the changeset notification. > > The rest of the MacOS X port is primarily in jdk7u-osx forest... > > Item of interest: the second reviewer that got back to me on those > hotspot bits was this 'dholmes' guy... :-) > > Dan > > > On 1/15/12 2:47 PM, David Holmes wrote: >> Hi Jim, >> >> On 15/01/2012 3:02 AM, James Melvin wrote: >>> Greetings, >>> >>> We're ready to require HotSpot builds on Mac OS X for JPRT integrate >>> jobs. There are 3 mac-minis in each queue. Build/Test times are short >>> relative to other platforms. Uses the stable Linux testlist for now. >> >> Maybe I'm missing something but isn't the OSX code currently primarily >> in the jdk7u-macosx repo? Or is all the hotspot code already in mainline? >> >> David >> ----- >> >>> http://cr.openjdk.java.net/~jmelvin/7126732/webrev.00 >>> >>> Tested with *several* JPRT submissions for other bugfixes. I'd like to >>> integrate this change right after the current snapshot window. >>> >>> Feedback welcome. >>> >>> Thanks, >>> >>> Jim From david.holmes at oracle.com Sun Jan 15 18:42:49 2012 From: david.holmes at oracle.com (David Holmes) Date: Mon, 16 Jan 2012 12:42:49 +1000 Subject: RFR (XS): 7126732: MAC: Require Mac OS X builds/tests for JPRT integrate jobs for HotSpot In-Reply-To: <4F1362C3.4060600@oracle.com> References: <4F11B537.2000504@oracle.com> <4F13495D.9040302@oracle.com> <4F1362C3.4060600@oracle.com> Message-ID: <4F138EA9.2050203@oracle.com> On 16/01/2012 9:35 AM, Daniel D. Daugherty wrote: > The hotspot part of the MacOS X port was pushed to Main_Baseline on > 2011.10.21. I've attached the changeset notification. > > The rest of the MacOS X port is primarily in jdk7u-osx forest... > > Item of interest: the second reviewer that got back to me on those > hotspot bits was this 'dholmes' guy... :-) Yes but since then I've been watching the 7u-osx updates and seeing bunches of bugs filed, and so was left wondering exactly what state the OSX "port" is in on the hotspot side. Or for that matter on the JDK side as the hotspot build has to be placed into an OSX JDK. As long as the code builds and passes the tests for 7 and 8 then that should be fine - and Jim indicates that is the case. Cheers, David > Dan > > > On 1/15/12 2:47 PM, David Holmes wrote: >> Hi Jim, >> >> On 15/01/2012 3:02 AM, James Melvin wrote: >>> Greetings, >>> >>> We're ready to require HotSpot builds on Mac OS X for JPRT integrate >>> jobs. There are 3 mac-minis in each queue. Build/Test times are short >>> relative to other platforms. Uses the stable Linux testlist for now. >> >> Maybe I'm missing something but isn't the OSX code currently primarily >> in the jdk7u-macosx repo? Or is all the hotspot code already in mainline? >> >> David >> ----- >> >>> http://cr.openjdk.java.net/~jmelvin/7126732/webrev.00 >>> >>> Tested with *several* JPRT submissions for other bugfixes. I'd like to >>> integrate this change right after the current snapshot window. >>> >>> Feedback welcome. >>> >>> Thanks, >>> >>> Jim From manojo10386 at gmail.com Mon Jan 16 01:39:49 2012 From: manojo10386 at gmail.com (Manohar Jonnalagedda) Date: Mon, 16 Jan 2012 10:39:49 +0100 Subject: Detecting range check elimination with PrintAssembly In-Reply-To: <4F135374.8020004@oracle.com> References:

<4F135374.8020004@oracle.com> Message-ID: Hi Kris, Vladimir, thanks for both your responses. Second, your two test methods are different so you can't directly compare > them. method1() iterates over rows using middle loop index 'j' and > method2() uses external loop index 'i'. Unless they are typos again. > You are right, these are indeed typos. As Kris suggested, I have the code printed here: http://pastebin.com/xRFD1Nt1. The methods corresponding to method1, and method2 are constructNearestClusterVector and computeNewCentroids. Their PrintOptoAssembly outputs are respectively at http://pastebin.com/1evN8b3K and http://pastebin.com/FxkVWTD5 Also, it seems I have not explained myself correctly. I am not trying to compare the performance of method1 with respect to that of method2: method1 and method2 both run in the same program. What I am trying to compare is their performance in two cases: - when cols is a compile-time constant (much faster) - when cols is a value determined at run-time > If you are using jdk7 there are few flags you can use to print loop > optimizations information. They need debug version of VM but it is not > problem for you, I think, since you can use debug PrintOptoAssembly flag. > > -XX:+TraceLoopOpts prints all happened loop optimizations and loop tree > after each round of loop opts, > -XX:+TraceLoopPredicate prints RC information when it is moved from a loop, > -XX:+TraceRangeLimitCheck prints additional information for RC elimination > optimization. > Thanks for these, I will have a look at what they output. Fourth, range check expression in your example is not what you think. RC > expression should be next: > (i*stride+offset) where 'i' is loop variable, 'stride' is constant and > 'offset' is loop invariant. > > In your example 'offset' is (j * cols) since it is loop invariant, 'k' is > loop variable and stride is '1' (one). > In both your methods RC will be moved out of inner loop so the code for it > will be the same. The only difference in these methods will be where and > how (j * cols) and (i * cols) expressions are calculated. > I'd guess it's the difference in locality that made the difference in >> performance in your two tests. >> > Thanks for the explanation. I understand from the above that the assembly output in both cases mentioned above may not be different, because the expressions are similar. The difference in runtime (due to cols being a compile-time constant) will be visible elsewhere. Is that right? If so, where would I be able to detect this? Cheers, Manohar > In your PrintOptoAssembly output snippet, the instruction at 0x13e is a >> LoadRange, which loads the range from the header >> of an array: >> >> (from x86_64.ad ) >> >> // Load Range >> instruct loadRange(rRegI dst, memory mem) >> %{ >> match(Set dst (LoadRange mem)); >> >> ins_cost(125); // XXX >> format %{ "movl $dst, $mem\t# range" %} >> opcode(0x8B); >> ins_encode(REX_reg_mem(dst, mem), OpcP, reg_mem(dst, mem)); >> ins_pipe(ialu_reg_mem); >> %} >> >> That's not a range check just yet; the real check, if any, should come >> after the null check, in the form of comparing >> something else with RSI. But you didn't show what's after the null check, >> how RSI is used, so it's hard to say what >> you're seeing in your example. >> >> As for the two test examples, could you paste the entire source code, >> with the PrintOptoAssembly output of method1() and >> method2() ? The first example looks weird, maybe it's a typo but you're >> using "j < cols" as the loop condition for the >> inner loop. >> >> > >> - Kris >> >> On Mon, Jan 16, 2012 at 1:59 AM, Manohar Jonnalagedda < >> manojo10386 at gmail.com > wrote: >> >> Hello, >> >> following this reference on Range Check Elimination done by the >> Hotspot compiler [1], I was keen in knowing how I >> can detect whether range checks are taking place in loops by >> inspecting output using the PrintAssembly flag; with >> the old PrintOptoAssembly flag, I have seen output such as the >> following, which I assume to be range checks : >> >> B11: # B73 B12 <- B10 Freq: 1.21365 >> 139 movq RAX, [rsp + #24] # spill >> 13e movl RSI, [RAX + #12 (8-bit)] # range >> 141 NullCheck RAX >> >> What is the equivalent with the new PrintAssembly flag (using hsdis)? >> >> Moreover, as stated on the wiki page [1], loops are optimized if the >> stride is a compile-time constant. I performed >> a few tests on a kmeans program, with 3 nested loops, having the >> following (high-level) structure: >> >> === >> void method1(){ >> //loop 1 >> for(int i = 0; i< rows1; i++){ >> //... >> for(int j = 0; j< rows2; j++){ >> //... >> for(int k = 0; j < cols; k++){ array[j * cols + k] = //...} >> } >> } >> } >> >> void method2(){ >> //loop 2 >> for(int i =0; i < rows1; i++){ >> for(int j=0 ; i< rows2; j++){ >> for(int k=0 ; k< cols; k++){ >> array[i*cols+k] = //... >> } >> } >> } >> } >> >> void main(){ >> >> do{ >> method1(); method2(); >> }while(!converged) >> >> } >> ==== >> >> In the first test, cols is an int whose value is determined at runtime >> (by reading a file), in the second test, it >> is given as a compile-time constant(3). In the second test, there is a >> */significant*/ speed-up (around 40%). >> >> However, when studying the diff of the output of PrintOptoAssembly for >> both method1 and method2, there is no >> difference (apart from slight value changes in frequency). Would you >> have any hints as to where I could look for >> differences? >> >> Thanks a lot, >> Manohar >> >> [1] >> https://wikis.oracle.com/display/HotSpotInternals/RangeCheckElimination >> >> >> > As Kris pointed you need to fix your example: -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120116/65204a22/attachment.html From mikael.gerdin at oracle.com Mon Jan 16 01:44:46 2012 From: mikael.gerdin at oracle.com (Mikael Gerdin) Date: Mon, 16 Jan 2012 10:44:46 +0100 Subject: RFR (XS): 7126732: MAC: Require Mac OS X builds/tests for JPRT integrate jobs for HotSpot In-Reply-To: <4F11B537.2000504@oracle.com> References: <4F11B537.2000504@oracle.com> Message-ID: <4F13F18E.1080300@oracle.com> Hi Jim on line 575, is there any particular reason that you don't add macosx to jprt.make.rule.test.targets.standard.internalvmtests? /Mikael On 2012-01-14 18:02, James Melvin wrote: > Greetings, > > We're ready to require HotSpot builds on Mac OS X for JPRT integrate > jobs. There are 3 mac-minis in each queue. Build/Test times are short > relative to other platforms. Uses the stable Linux testlist for now. > > http://cr.openjdk.java.net/~jmelvin/7126732/webrev.00 > > Tested with *several* JPRT submissions for other bugfixes. I'd like to > integrate this change right after the current snapshot window. > > Feedback welcome. > > Thanks, > > Jim From vitalyd at gmail.com Mon Jan 16 10:55:11 2012 From: vitalyd at gmail.com (Vitaly Davidovich) Date: Mon, 16 Jan 2012 13:55:11 -0500 Subject: Detecting range check elimination with PrintAssembly In-Reply-To: References:

<4F135374.8020004@oracle.com> Message-ID: Hi Manohar, Are you repeatedly seeing ~40% speedup with a compile-time constant? In the two assembly dumps you posted, the computeNewCentroids seems to have some loop unrolling + non-temporal prefetch instructions that I don't see (at a cursory glance, albeit) in the 2nd method. How big is the input array for these functions? If it's larger than your highest level cache, can you try running the same tests (constant vs non-constant) with a size that would fit into L2/L3? What cpu are you running these tests on? Vitaly On Mon, Jan 16, 2012 at 4:39 AM, Manohar Jonnalagedda wrote: > Hi Kris, Vladimir, > > thanks for both your responses. > > Second, your two test methods are different so you can't directly compare >> them. method1() iterates over rows using middle loop index 'j' and >> method2() uses external loop index 'i'. Unless they are typos again. >> > > You are right, these are indeed typos. As Kris suggested, I have the code > printed here: http://pastebin.com/xRFD1Nt1. The methods corresponding to > method1, and method2 are constructNearestClusterVector and > computeNewCentroids. Their PrintOptoAssembly outputs are respectively at > http://pastebin.com/1evN8b3K and http://pastebin.com/FxkVWTD5 > > Also, it seems I have not explained myself correctly. I am not trying to > compare the performance of method1 with respect to that of method2: method1 > and method2 both run in the same program. What I am trying to compare is > their performance in two cases: > - when cols is a compile-time constant (much faster) > - when cols is a value determined at run-time > > > >> If you are using jdk7 there are few flags you can use to print loop >> optimizations information. They need debug version of VM but it is not >> problem for you, I think, since you can use debug PrintOptoAssembly flag. >> >> -XX:+TraceLoopOpts prints all happened loop optimizations and loop tree >> after each round of loop opts, >> -XX:+TraceLoopPredicate prints RC information when it is moved from a >> loop, >> -XX:+TraceRangeLimitCheck prints additional information for RC >> elimination optimization. >> > > Thanks for these, I will have a look at what they output. > > Fourth, range check expression in your example is not what you think. RC >> expression should be next: >> (i*stride+offset) where 'i' is loop variable, 'stride' is constant and >> 'offset' is loop invariant. >> >> In your example 'offset' is (j * cols) since it is loop invariant, 'k' is >> loop variable and stride is '1' (one). >> In both your methods RC will be moved out of inner loop so the code for >> it will be the same. The only difference in these methods will be where and >> how (j * cols) and (i * cols) expressions are calculated. >> > > I'd guess it's the difference in locality that made the difference in >>> performance in your two tests. >>> >> > Thanks for the explanation. I understand from the above that the assembly > output in both cases mentioned above may not be different, because the > expressions are similar. The difference in runtime (due to cols being a > compile-time constant) will be visible elsewhere. Is that right? If so, > where would I be able to detect this? > > Cheers, > Manohar > > >> In your PrintOptoAssembly output snippet, the instruction at 0x13e is a >>> LoadRange, which loads the range from the header >>> of an array: >>> >>> (from x86_64.ad ) >>> >>> // Load Range >>> instruct loadRange(rRegI dst, memory mem) >>> %{ >>> match(Set dst (LoadRange mem)); >>> >>> ins_cost(125); // XXX >>> format %{ "movl $dst, $mem\t# range" %} >>> opcode(0x8B); >>> ins_encode(REX_reg_mem(dst, mem), OpcP, reg_mem(dst, mem)); >>> ins_pipe(ialu_reg_mem); >>> %} >>> >>> That's not a range check just yet; the real check, if any, should come >>> after the null check, in the form of comparing >>> something else with RSI. But you didn't show what's after the null >>> check, how RSI is used, so it's hard to say what >>> you're seeing in your example. >>> >>> As for the two test examples, could you paste the entire source code, >>> with the PrintOptoAssembly output of method1() and >>> method2() ? The first example looks weird, maybe it's a typo but you're >>> using "j < cols" as the loop condition for the >>> inner loop. >>> >>> >> >>> - Kris >>> >>> On Mon, Jan 16, 2012 at 1:59 AM, Manohar Jonnalagedda < >>> manojo10386 at gmail.com > wrote: >>> >>> Hello, >>> >>> following this reference on Range Check Elimination done by the >>> Hotspot compiler [1], I was keen in knowing how I >>> can detect whether range checks are taking place in loops by >>> inspecting output using the PrintAssembly flag; with >>> the old PrintOptoAssembly flag, I have seen output such as the >>> following, which I assume to be range checks : >>> >>> B11: # B73 B12 <- B10 Freq: 1.21365 >>> 139 movq RAX, [rsp + #24] # spill >>> 13e movl RSI, [RAX + #12 (8-bit)] # range >>> 141 NullCheck RAX >>> >>> What is the equivalent with the new PrintAssembly flag (using hsdis)? >>> >>> Moreover, as stated on the wiki page [1], loops are optimized if the >>> stride is a compile-time constant. I performed >>> a few tests on a kmeans program, with 3 nested loops, having the >>> following (high-level) structure: >>> >>> === >>> void method1(){ >>> //loop 1 >>> for(int i = 0; i< rows1; i++){ >>> //... >>> for(int j = 0; j< rows2; j++){ >>> //... >>> for(int k = 0; j < cols; k++){ array[j * cols + k] = //...} >>> } >>> } >>> } >>> >>> void method2(){ >>> //loop 2 >>> for(int i =0; i < rows1; i++){ >>> for(int j=0 ; i< rows2; j++){ >>> for(int k=0 ; k< cols; k++){ >>> array[i*cols+k] = //... >>> } >>> } >>> } >>> } >>> >>> void main(){ >>> >>> do{ >>> method1(); method2(); >>> }while(!converged) >>> >>> } >>> ==== >>> >>> In the first test, cols is an int whose value is determined at >>> runtime (by reading a file), in the second test, it >>> is given as a compile-time constant(3). In the second test, there is >>> a */significant*/ speed-up (around 40%). >>> >>> However, when studying the diff of the output of PrintOptoAssembly >>> for both method1 and method2, there is no >>> difference (apart from slight value changes in frequency). Would you >>> have any hints as to where I could look for >>> differences? >>> >>> Thanks a lot, >>> Manohar >>> >>> [1] >>> https://wikis.oracle.com/display/HotSpotInternals/RangeCheckElimination >>> >>> >>> >> As Kris pointed you need to fix your example: > > -- Vitaly 617-548-7007 (mobile) -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120116/59687f71/attachment-0001.html From james.melvin at oracle.com Mon Jan 16 13:46:34 2012 From: james.melvin at oracle.com (James Melvin) Date: Mon, 16 Jan 2012 16:46:34 -0500 Subject: RFR (XS): 7126732: MAC: Require Mac OS X builds/tests for JPRT integrate jobs for HotSpot In-Reply-To: <4F13F18E.1080300@oracle.com> References: <4F11B537.2000504@oracle.com> <4F13F18E.1080300@oracle.com> Message-ID: <4F149ABA.3000708@oracle.com> Hi Mikael, Nice find! I've added the missing line to add internalvmtests to each JPRT run on Mac OS X. New webrev posted and JPRT test job complete. WEBREV: http://cr.openjdk.java.net/~jmelvin/7126732/webrev.01 JPRT: http://prt-web.us.oracle.com//archive/2012/01/2012-01-16-172504.jmelvin.7126732 Thanks! Jim On 1/16/12 4:44 AM, Mikael Gerdin wrote: > Hi Jim > > on line 575, is there any particular reason that you don't add macosx to > jprt.make.rule.test.targets.standard.internalvmtests? > > /Mikael > > On 2012-01-14 18:02, James Melvin wrote: >> Greetings, >> >> We're ready to require HotSpot builds on Mac OS X for JPRT integrate >> jobs. There are 3 mac-minis in each queue. Build/Test times are short >> relative to other platforms. Uses the stable Linux testlist for now. >> >> http://cr.openjdk.java.net/~jmelvin/7126732/webrev.00 >> >> Tested with *several* JPRT submissions for other bugfixes. I'd like to >> integrate this change right after the current snapshot window. >> >> Feedback welcome. >> >> Thanks, >> >> Jim From vladimir.kozlov at oracle.com Mon Jan 16 13:58:32 2012 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Mon, 16 Jan 2012 13:58:32 -0800 Subject: Detecting range check elimination with PrintAssembly In-Reply-To: References:

<4F135374.8020004@oracle.com> Message-ID: <4F149D88.2060305@oracle.com> > be different, because the expressions are similar. The difference in runtime (due to cols being a compile-time > constant) will be visible elsewhere. Is that right? If so, where would I be able to detect this? In such situations we usually use some visual tools to see difference between log outputs. At least you can use 'diff'. You may need to replace instructions addresses in outputs (number at the beginning of lines) with the same value to match. There are few tricks you may use to get similar PrintOptoAssembly output. Use next flags to avoid mixing output from program output and from 2 compiler threads (flags stop program until a method is compiled and run only one compiler thread): -Xbatch -XX:CICompilerCount=1 Also add -XX:+PrintCompilation -XX:+PrintInlining to see what method is compiled and inlined. Note that you may see similar output for individual methods but could be big difference in compiled caller (computeAll()) method where 2 loop methods could be inlined. So you need to compare all compiled methods. In general, to have constant as loop limit is always win because some checks in generated code could be avoided and more optimizations could be done for such loops. Use -XX:+TraceLoopOpts to see what loop optimizations are done in both cases. For example, in your code you set 'x_col = 3', as result the next loop in constructNearestClusterVector() will be fully unrolled when this method is inlined into computeAll() and x_col is replaced with '3': for(k = 0; k < x_col; k++) { double tmp = x[i*x_col + k] - mu[j* mu_col + k]; dist += tmp * tmp; } Vladimir On 1/16/12 1:39 AM, Manohar Jonnalagedda wrote: > Hi Kris, Vladimir, > > thanks for both your responses. > > Second, your two test methods are different so you can't directly compare them. method1() iterates over rows using > middle loop index 'j' and method2() uses external loop index 'i'. Unless they are typos again. > > > You are right, these are indeed typos. As Kris suggested, I have the code printed here: http://pastebin.com/xRFD1Nt1. > The methods corresponding to method1, and method2 are constructNearestClusterVector and computeNewCentroids. Their > PrintOptoAssembly outputs are respectively at http://pastebin.com/1evN8b3K and http://pastebin.com/FxkVWTD5 > > Also, it seems I have not explained myself correctly. I am not trying to compare the performance of method1 with respect > to that of method2: method1 and method2 both run in the same program. What I am trying to compare is their performance > in two cases: > - when cols is a compile-time constant (much faster) > - when cols is a value determined at run-time > > If you are using jdk7 there are few flags you can use to print loop optimizations information. They need debug > version of VM but it is not problem for you, I think, since you can use debug PrintOptoAssembly flag. > > -XX:+TraceLoopOpts prints all happened loop optimizations and loop tree after each round of loop opts, > -XX:+TraceLoopPredicate prints RC information when it is moved from a loop, > -XX:+TraceRangeLimitCheck prints additional information for RC elimination optimization. > > > Thanks for these, I will have a look at what they output. > > Fourth, range check expression in your example is not what you think. RC expression should be next: > (i*stride+offset) where 'i' is loop variable, 'stride' is constant and 'offset' is loop invariant. > > In your example 'offset' is (j * cols) since it is loop invariant, 'k' is loop variable and stride is '1' (one). > In both your methods RC will be moved out of inner loop so the code for it will be the same. The only difference in > these methods will be where and how (j * cols) and (i * cols) expressions are calculated. > > > I'd guess it's the difference in locality that made the difference in performance in your two tests. > > Thanks for the explanation. I understand from the above that the assembly output in both cases mentioned above may not > be different, because the expressions are similar. The difference in runtime (due to cols being a compile-time constant) > will be visible elsewhere. Is that right? If so, where would I be able to detect this? > > Cheers, > Manohar > > In your PrintOptoAssembly output snippet, the instruction at 0x13e is a LoadRange, which loads the range from > the header > of an array: > > (from x86_64.ad ) > > // Load Range > instruct loadRange(rRegI dst, memory mem) > %{ > match(Set dst (LoadRange mem)); > > ins_cost(125); // XXX > format %{ "movl $dst, $mem\t# range" %} > opcode(0x8B); > ins_encode(REX_reg_mem(dst, mem), OpcP, reg_mem(dst, mem)); > ins_pipe(ialu_reg_mem); > %} > > That's not a range check just yet; the real check, if any, should come after the null check, in the form of > comparing > something else with RSI. But you didn't show what's after the null check, how RSI is used, so it's hard to say what > you're seeing in your example. > > As for the two test examples, could you paste the entire source code, with the PrintOptoAssembly output of > method1() and > method2() ? The first example looks weird, maybe it's a typo but you're using "j < cols" as the loop condition > for the > inner loop. > > > - Kris > > On Mon, Jan 16, 2012 at 1:59 AM, Manohar Jonnalagedda > >> wrote: > > Hello, > > following this reference on Range Check Elimination done by the Hotspot compiler [1], I was keen in knowing > how I > can detect whether range checks are taking place in loops by inspecting output using the PrintAssembly flag; > with > the old PrintOptoAssembly flag, I have seen output such as the following, which I assume to be range checks : > > B11: # B73 B12 <- B10 Freq: 1.21365 > 139 movq RAX, [rsp + #24] # spill > 13e movl RSI, [RAX + #12 (8-bit)] # range > 141 NullCheck RAX > > What is the equivalent with the new PrintAssembly flag (using hsdis)? > > Moreover, as stated on the wiki page [1], loops are optimized if the stride is a compile-time constant. I > performed > a few tests on a kmeans program, with 3 nested loops, having the following (high-level) structure: > > === > void method1(){ > //loop 1 > for(int i = 0; i< rows1; i++){ > //... > for(int j = 0; j< rows2; j++){ > //... > for(int k = 0; j < cols; k++){ array[j * cols + k] = //...} > } > } > } > > void method2(){ > //loop 2 > for(int i =0; i < rows1; i++){ > for(int j=0 ; i< rows2; j++){ > for(int k=0 ; k< cols; k++){ > array[i*cols+k] = //... > } > } > } > } > > void main(){ > > do{ > method1(); method2(); > }while(!converged) > > } > ==== > > In the first test, cols is an int whose value is determined at runtime (by reading a file), in the second > test, it > is given as a compile-time constant(3). In the second test, there is a */significant*/ speed-up (around 40%). > > However, when studying the diff of the output of PrintOptoAssembly for both method1 and method2, there is no > difference (apart from slight value changes in frequency). Would you have any hints as to where I could look for > differences? > > Thanks a lot, > Manohar > > [1] https://wikis.oracle.com/display/HotSpotInternals/RangeCheckElimination > > > > As Kris pointed you need to fix your example: > From vitalyd at gmail.com Mon Jan 16 14:37:42 2012 From: vitalyd at gmail.com (Vitaly Davidovich) Date: Mon, 16 Jan 2012 17:37:42 -0500 Subject: Detecting range check elimination with PrintAssembly In-Reply-To: <4F149D88.2060305@oracle.com> References:

<4F135374.8020004@oracle.com> <4F149D88.2060305@oracle.com> Message-ID: Hi Vladimir, If x_col is always seen to be same value in the profile shouldn't the loop be unrolled as well with some deopt guard? Or does this not participate in profiling? Thanks On Jan 16, 2012 4:57 PM, "Vladimir Kozlov" wrote: > > be different, because the expressions are similar. The difference in > runtime (due to cols being a compile-time > > constant) will be visible elsewhere. Is that right? If so, where would I > be able to detect this? > > In such situations we usually use some visual tools to see difference > between log outputs. At least you can use 'diff'. You may need to replace > instructions addresses in outputs (number at the beginning of lines) with > the same value to match. There are few tricks you may use to get similar > PrintOptoAssembly output. Use next flags to avoid mixing output from > program output and from 2 compiler threads (flags stop program until a > method is compiled and run only one compiler thread): > > -Xbatch -XX:CICompilerCount=1 > > Also add -XX:+PrintCompilation -XX:+PrintInlining to see what method is > compiled and inlined. Note that you may see similar output for individual > methods but could be big difference in compiled caller (computeAll()) > method where 2 loop methods could be inlined. So you need to compare all > compiled methods. > > In general, to have constant as loop limit is always win because some > checks in generated code could be avoided and more optimizations could be > done for such loops. Use -XX:+TraceLoopOpts to see what loop optimizations > are done in both cases. > > For example, in your code you set 'x_col = 3', as result the next loop in > constructNearestClusterVector(**) will be fully unrolled when this method > is inlined into computeAll() and x_col is replaced with '3': > > for(k = 0; k < x_col; k++) { > double tmp = x[i*x_col + k] - mu[j* mu_col + k]; > dist += tmp * tmp; > } > > Vladimir > > On 1/16/12 1:39 AM, Manohar Jonnalagedda wrote: > >> Hi Kris, Vladimir, >> >> thanks for both your responses. >> >> Second, your two test methods are different so you can't directly >> compare them. method1() iterates over rows using >> middle loop index 'j' and method2() uses external loop index 'i'. >> Unless they are typos again. >> >> >> You are right, these are indeed typos. As Kris suggested, I have the code >> printed here: http://pastebin.com/xRFD1Nt1. >> The methods corresponding to method1, and method2 are >> constructNearestClusterVector and computeNewCentroids. Their >> PrintOptoAssembly outputs are respectively at >> http://pastebin.com/1evN8b3K and http://pastebin.com/FxkVWTD5 >> >> Also, it seems I have not explained myself correctly. I am not trying to >> compare the performance of method1 with respect >> to that of method2: method1 and method2 both run in the same program. >> What I am trying to compare is their performance >> in two cases: >> - when cols is a compile-time constant (much faster) >> - when cols is a value determined at run-time >> >> If you are using jdk7 there are few flags you can use to print loop >> optimizations information. They need debug >> version of VM but it is not problem for you, I think, since you can >> use debug PrintOptoAssembly flag. >> >> -XX:+TraceLoopOpts prints all happened loop optimizations and loop >> tree after each round of loop opts, >> -XX:+TraceLoopPredicate prints RC information when it is moved from a >> loop, >> -XX:+TraceRangeLimitCheck prints additional information for RC >> elimination optimization. >> >> >> Thanks for these, I will have a look at what they output. >> >> Fourth, range check expression in your example is not what you think. >> RC expression should be next: >> (i*stride+offset) where 'i' is loop variable, 'stride' is constant and >> 'offset' is loop invariant. >> >> In your example 'offset' is (j * cols) since it is loop invariant, 'k' >> is loop variable and stride is '1' (one). >> In both your methods RC will be moved out of inner loop so the code >> for it will be the same. The only difference in >> these methods will be where and how (j * cols) and (i * cols) >> expressions are calculated. >> >> >> I'd guess it's the difference in locality that made the difference >> in performance in your two tests. >> >> Thanks for the explanation. I understand from the above that the >> assembly output in both cases mentioned above may not >> be different, because the expressions are similar. The difference in >> runtime (due to cols being a compile-time constant) >> will be visible elsewhere. Is that right? If so, where would I be able to >> detect this? >> >> Cheers, >> Manohar >> >> In your PrintOptoAssembly output snippet, the instruction at 0x13e >> is a LoadRange, which loads the range from >> the header >> of an array: >> >> (from x86_64.ad ) >> >> // Load Range >> instruct loadRange(rRegI dst, memory mem) >> %{ >> match(Set dst (LoadRange mem)); >> >> ins_cost(125); // XXX >> format %{ "movl $dst, $mem\t# range" %} >> opcode(0x8B); >> ins_encode(REX_reg_mem(dst, mem), OpcP, reg_mem(dst, mem)); >> ins_pipe(ialu_reg_mem); >> %} >> >> That's not a range check just yet; the real check, if any, should >> come after the null check, in the form of >> comparing >> something else with RSI. But you didn't show what's after the null >> check, how RSI is used, so it's hard to say what >> you're seeing in your example. >> >> As for the two test examples, could you paste the entire source >> code, with the PrintOptoAssembly output of >> method1() and >> method2() ? The first example looks weird, maybe it's a typo but >> you're using "j < cols" as the loop condition >> for the >> inner loop. >> >> >> - Kris >> >> On Mon, Jan 16, 2012 at 1:59 AM, Manohar Jonnalagedda < >> manojo10386 at gmail.com >> **>> >> wrote: >> >> Hello, >> >> following this reference on Range Check Elimination done by >> the Hotspot compiler [1], I was keen in knowing >> how I >> can detect whether range checks are taking place in loops by >> inspecting output using the PrintAssembly flag; >> with >> the old PrintOptoAssembly flag, I have seen output such as the >> following, which I assume to be range checks : >> >> B11: # B73 B12 <- B10 Freq: 1.21365 >> 139 movq RAX, [rsp + #24] # spill >> 13e movl RSI, [RAX + #12 (8-bit)] # range >> 141 NullCheck RAX >> >> What is the equivalent with the new PrintAssembly flag (using >> hsdis)? >> >> Moreover, as stated on the wiki page [1], loops are optimized >> if the stride is a compile-time constant. I >> performed >> a few tests on a kmeans program, with 3 nested loops, having >> the following (high-level) structure: >> >> === >> void method1(){ >> //loop 1 >> for(int i = 0; i< rows1; i++){ >> //... >> for(int j = 0; j< rows2; j++){ >> //... >> for(int k = 0; j < cols; k++){ array[j * cols + k] = //...} >> } >> } >> } >> >> void method2(){ >> //loop 2 >> for(int i =0; i < rows1; i++){ >> for(int j=0 ; i< rows2; j++){ >> for(int k=0 ; k< cols; k++){ >> array[i*cols+k] = //... >> } >> } >> } >> } >> >> void main(){ >> >> do{ >> method1(); method2(); >> }while(!converged) >> >> } >> ==== >> >> In the first test, cols is an int whose value is determined at >> runtime (by reading a file), in the second >> test, it >> is given as a compile-time constant(3). In the second test, >> there is a */significant*/ speed-up (around 40%). >> >> However, when studying the diff of the output of >> PrintOptoAssembly for both method1 and method2, there is no >> difference (apart from slight value changes in frequency). >> Would you have any hints as to where I could look for >> differences? >> >> Thanks a lot, >> Manohar >> >> [1] https://wikis.oracle.com/**display/HotSpotInternals/** >> RangeCheckElimination >> >> >> >> As Kris pointed you need to fix your example: >> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120116/ef3cf0f0/attachment-0001.html From vladimir.kozlov at oracle.com Mon Jan 16 15:45:14 2012 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Mon, 16 Jan 2012 15:45:14 -0800 Subject: Detecting range check elimination with PrintAssembly In-Reply-To: References:

<4F135374.8020004@oracle.com> <4F149D88.2060305@oracle.com> Message-ID: <4F14B68A.6020808@oracle.com> Vitaly, We do use profile_trip_cnt during loop unroll calculation but not during fully unroll because we can't trust it 100% since program's phase and number of iterations could change after method is compiled. See policy_unroll() and policy_maximally_unroll(): http://hg.openjdk.java.net/hsx/hotspot-comp/hotspot/file/89d0a5d40008/src/share/vm/opto/loopTransform.cpp We could use deopt as you suggested but deoptimization is double-edge sword, when method recompiled after deoptimization some aggressive optimizations will not be executed for it so the new generated code could be slower. Regards, Vladimir On 1/16/12 2:37 PM, Vitaly Davidovich wrote: > Hi Vladimir, > > If x_col is always seen to be same value in the profile shouldn't the loop be unrolled as well with some deopt guard? Or > does this not participate in profiling? > > Thanks > > On Jan 16, 2012 4:57 PM, "Vladimir Kozlov" > wrote: > > > be different, because the expressions are similar. The difference in runtime (due to cols being a compile-time > > constant) will be visible elsewhere. Is that right? If so, where would I be able to detect this? > > In such situations we usually use some visual tools to see difference between log outputs. At least you can use > 'diff'. You may need to replace instructions addresses in outputs (number at the beginning of lines) with the same > value to match. There are few tricks you may use to get similar PrintOptoAssembly output. Use next flags to avoid > mixing output from program output and from 2 compiler threads (flags stop program until a method is compiled and run > only one compiler thread): > > -Xbatch -XX:CICompilerCount=1 > > Also add -XX:+PrintCompilation -XX:+PrintInlining to see what method is compiled and inlined. Note that you may see > similar output for individual methods but could be big difference in compiled caller (computeAll()) method where 2 > loop methods could be inlined. So you need to compare all compiled methods. > > In general, to have constant as loop limit is always win because some checks in generated code could be avoided and > more optimizations could be done for such loops. Use -XX:+TraceLoopOpts to see what loop optimizations are done in > both cases. > > For example, in your code you set 'x_col = 3', as result the next loop in constructNearestClusterVector(__) will be > fully unrolled when this method is inlined into computeAll() and x_col is replaced with '3': > > for(k = 0; k < x_col; k++) { > double tmp = x[i*x_col + k] - mu[j* mu_col + k]; > dist += tmp * tmp; > } > > Vladimir > > On 1/16/12 1:39 AM, Manohar Jonnalagedda wrote: > > Hi Kris, Vladimir, > > thanks for both your responses. > > Second, your two test methods are different so you can't directly compare them. method1() iterates over rows > using > middle loop index 'j' and method2() uses external loop index 'i'. Unless they are typos again. > > > You are right, these are indeed typos. As Kris suggested, I have the code printed here: > http://pastebin.com/xRFD1Nt1. > The methods corresponding to method1, and method2 are constructNearestClusterVector and computeNewCentroids. Their > PrintOptoAssembly outputs are respectively at http://pastebin.com/1evN8b3K and http://pastebin.com/FxkVWTD5 > > Also, it seems I have not explained myself correctly. I am not trying to compare the performance of method1 with > respect > to that of method2: method1 and method2 both run in the same program. What I am trying to compare is their > performance > in two cases: > - when cols is a compile-time constant (much faster) > - when cols is a value determined at run-time > > If you are using jdk7 there are few flags you can use to print loop optimizations information. They need debug > version of VM but it is not problem for you, I think, since you can use debug PrintOptoAssembly flag. > > -XX:+TraceLoopOpts prints all happened loop optimizations and loop tree after each round of loop opts, > -XX:+TraceLoopPredicate prints RC information when it is moved from a loop, > -XX:+TraceRangeLimitCheck prints additional information for RC elimination optimization. > > > Thanks for these, I will have a look at what they output. > > Fourth, range check expression in your example is not what you think. RC expression should be next: > (i*stride+offset) where 'i' is loop variable, 'stride' is constant and 'offset' is loop invariant. > > In your example 'offset' is (j * cols) since it is loop invariant, 'k' is loop variable and stride is '1' (one). > In both your methods RC will be moved out of inner loop so the code for it will be the same. The only > difference in > these methods will be where and how (j * cols) and (i * cols) expressions are calculated. > > > I'd guess it's the difference in locality that made the difference in performance in your two tests. > > Thanks for the explanation. I understand from the above that the assembly output in both cases mentioned above > may not > be different, because the expressions are similar. The difference in runtime (due to cols being a compile-time > constant) > will be visible elsewhere. Is that right? If so, where would I be able to detect this? > > Cheers, > Manohar > > In your PrintOptoAssembly output snippet, the instruction at 0x13e is a LoadRange, which loads the range > from > the header > of an array: > > (from x86_64.ad ) > > // Load Range > instruct loadRange(rRegI dst, memory mem) > %{ > match(Set dst (LoadRange mem)); > > ins_cost(125); // XXX > format %{ "movl $dst, $mem\t# range" %} > opcode(0x8B); > ins_encode(REX_reg_mem(dst, mem), OpcP, reg_mem(dst, mem)); > ins_pipe(ialu_reg_mem); > %} > > That's not a range check just yet; the real check, if any, should come after the null check, in the form of > comparing > something else with RSI. But you didn't show what's after the null check, how RSI is used, so it's hard > to say what > you're seeing in your example. > > As for the two test examples, could you paste the entire source code, with the PrintOptoAssembly output of > method1() and > method2() ? The first example looks weird, maybe it's a typo but you're using "j < cols" as the loop > condition > for the > inner loop. > > > - Kris > > On Mon, Jan 16, 2012 at 1:59 AM, Manohar Jonnalagedda > > >__>> wrote: > > Hello, > > following this reference on Range Check Elimination done by the Hotspot compiler [1], I was keen in > knowing > how I > can detect whether range checks are taking place in loops by inspecting output using the > PrintAssembly flag; > with > the old PrintOptoAssembly flag, I have seen output such as the following, which I assume to be range > checks : > > B11: # B73 B12 <- B10 Freq: 1.21365 > 139 movq RAX, [rsp + #24] # spill > 13e movl RSI, [RAX + #12 (8-bit)] # range > 141 NullCheck RAX > > What is the equivalent with the new PrintAssembly flag (using hsdis)? > > Moreover, as stated on the wiki page [1], loops are optimized if the stride is a compile-time > constant. I > performed > a few tests on a kmeans program, with 3 nested loops, having the following (high-level) structure: > > === > void method1(){ > //loop 1 > for(int i = 0; i< rows1; i++){ > //... > for(int j = 0; j< rows2; j++){ > //... > for(int k = 0; j < cols; k++){ array[j * cols + k] = //...} > } > } > } > > void method2(){ > //loop 2 > for(int i =0; i < rows1; i++){ > for(int j=0 ; i< rows2; j++){ > for(int k=0 ; k< cols; k++){ > array[i*cols+k] = //... > } > } > } > } > > void main(){ > > do{ > method1(); method2(); > }while(!converged) > > } > ==== > > In the first test, cols is an int whose value is determined at runtime (by reading a file), in the > second > test, it > is given as a compile-time constant(3). In the second test, there is a */significant*/ speed-up > (around 40%). > > However, when studying the diff of the output of PrintOptoAssembly for both method1 and method2, > there is no > difference (apart from slight value changes in frequency). Would you have any hints as to where I > could look for > differences? > > Thanks a lot, > Manohar > > [1] https://wikis.oracle.com/__display/HotSpotInternals/__RangeCheckElimination > > > > > As Kris pointed you need to fix your example: > From vitalyd at gmail.com Mon Jan 16 16:15:18 2012 From: vitalyd at gmail.com (Vitaly Davidovich) Date: Mon, 16 Jan 2012 19:15:18 -0500 Subject: Detecting range check elimination with PrintAssembly In-Reply-To: <4F14B68A.6020808@oracle.com> References:

<4F135374.8020004@oracle.com> <4F149D88.2060305@oracle.com> <4F14B68A.6020808@oracle.com> Message-ID: Vladimir, thanks for the explanation and the code pointer. Intuitively, it would seem like a good idea to trust the profile 100% if it reports the same value used 100% of the time (I can see how anything less than 100%, even a very high probability of same value, is not trustworthy) given sufficient trips through the loop. Although I can see how an app may have phases where same value is seen for a while before it's switched, but that's where I thought deopt would help. There must be a good chunk of code out there that doesn't know at static compilation time the loop count (so can't use compile-time constant), but at runtime the actual value doesn't change for many many trips through the loop; I know I have code like that in various places. What's the reason a compilation after deopt would not be as aggressive as the 1st time? Is it because the profile information may be "weaker" (i.e. more uncertainty in it)? I thought the profile is completely reset after deopt, so I would think if the loop is now executed with a different "constant" value (e.g. in our example, instead of 3 it's now 4), then the same optimizations will be applied (of course if unrolling the loop is no longer advantageous due to a much different value, I can see how different optimizations will be applied). Thanks On Mon, Jan 16, 2012 at 6:45 PM, Vladimir Kozlov wrote: > Vitaly, > > We do use profile_trip_cnt during loop unroll calculation but not during > fully unroll because we can't trust it 100% since program's phase and > number of iterations could change after method is compiled. See > policy_unroll() and policy_maximally_unroll(): > > http://hg.openjdk.java.net/**hsx/hotspot-comp/hotspot/file/** > 89d0a5d40008/src/share/vm/**opto/loopTransform.cpp > > We could use deopt as you suggested but deoptimization is double-edge > sword, when method recompiled after deoptimization some aggressive > optimizations will not be executed for it so the new generated code could > be slower. > > Regards, > Vladimir > > > On 1/16/12 2:37 PM, Vitaly Davidovich wrote: > >> Hi Vladimir, >> >> If x_col is always seen to be same value in the profile shouldn't the >> loop be unrolled as well with some deopt guard? Or >> does this not participate in profiling? >> >> Thanks >> >> On Jan 16, 2012 4:57 PM, "Vladimir Kozlov" > vladimir.kozlov@**oracle.com >> wrote: >> >> > be different, because the expressions are similar. The difference >> in runtime (due to cols being a compile-time >> > constant) will be visible elsewhere. Is that right? If so, where >> would I be able to detect this? >> >> In such situations we usually use some visual tools to see difference >> between log outputs. At least you can use >> 'diff'. You may need to replace instructions addresses in outputs >> (number at the beginning of lines) with the same >> value to match. There are few tricks you may use to get similar >> PrintOptoAssembly output. Use next flags to avoid >> mixing output from program output and from 2 compiler threads (flags >> stop program until a method is compiled and run >> only one compiler thread): >> >> -Xbatch -XX:CICompilerCount=1 >> >> Also add -XX:+PrintCompilation -XX:+PrintInlining to see what method >> is compiled and inlined. Note that you may see >> similar output for individual methods but could be big difference in >> compiled caller (computeAll()) method where 2 >> loop methods could be inlined. So you need to compare all compiled >> methods. >> >> In general, to have constant as loop limit is always win because some >> checks in generated code could be avoided and >> more optimizations could be done for such loops. Use >> -XX:+TraceLoopOpts to see what loop optimizations are done in >> both cases. >> >> For example, in your code you set 'x_col = 3', as result the next loop >> in constructNearestClusterVector(**__) will be >> >> fully unrolled when this method is inlined into computeAll() and x_col >> is replaced with '3': >> >> for(k = 0; k < x_col; k++) { >> double tmp = x[i*x_col + k] - mu[j* mu_col + k]; >> dist += tmp * tmp; >> } >> >> Vladimir >> >> On 1/16/12 1:39 AM, Manohar Jonnalagedda wrote: >> >> Hi Kris, Vladimir, >> >> thanks for both your responses. >> >> Second, your two test methods are different so you can't >> directly compare them. method1() iterates over rows >> using >> middle loop index 'j' and method2() uses external loop index >> 'i'. Unless they are typos again. >> >> >> You are right, these are indeed typos. As Kris suggested, I have >> the code printed here: >> http://pastebin.com/xRFD1Nt1. >> The methods corresponding to method1, and method2 are >> constructNearestClusterVector and computeNewCentroids. Their >> PrintOptoAssembly outputs are respectively at >> http://pastebin.com/1evN8b3K and http://pastebin.com/FxkVWTD5 >> >> Also, it seems I have not explained myself correctly. I am not >> trying to compare the performance of method1 with >> respect >> to that of method2: method1 and method2 both run in the same >> program. What I am trying to compare is their >> performance >> in two cases: >> - when cols is a compile-time constant (much faster) >> - when cols is a value determined at run-time >> >> If you are using jdk7 there are few flags you can use to print >> loop optimizations information. They need debug >> version of VM but it is not problem for you, I think, since >> you can use debug PrintOptoAssembly flag. >> >> -XX:+TraceLoopOpts prints all happened loop optimizations and >> loop tree after each round of loop opts, >> -XX:+TraceLoopPredicate prints RC information when it is moved >> from a loop, >> -XX:+TraceRangeLimitCheck prints additional information for RC >> elimination optimization. >> >> >> Thanks for these, I will have a look at what they output. >> >> Fourth, range check expression in your example is not what you >> think. RC expression should be next: >> (i*stride+offset) where 'i' is loop variable, 'stride' is >> constant and 'offset' is loop invariant. >> >> In your example 'offset' is (j * cols) since it is loop >> invariant, 'k' is loop variable and stride is '1' (one). >> In both your methods RC will be moved out of inner loop so the >> code for it will be the same. The only >> difference in >> these methods will be where and how (j * cols) and (i * cols) >> expressions are calculated. >> >> >> I'd guess it's the difference in locality that made the >> difference in performance in your two tests. >> >> Thanks for the explanation. I understand from the above that the >> assembly output in both cases mentioned above >> may not >> be different, because the expressions are similar. The difference >> in runtime (due to cols being a compile-time >> constant) >> will be visible elsewhere. Is that right? If so, where would I be >> able to detect this? >> >> Cheers, >> Manohar >> >> In your PrintOptoAssembly output snippet, the instruction >> at 0x13e is a LoadRange, which loads the range >> from >> the header >> of an array: >> >> (from x86_64.ad < >> http://x86_64.ad>) >> >> >> // Load Range >> instruct loadRange(rRegI dst, memory mem) >> %{ >> match(Set dst (LoadRange mem)); >> >> ins_cost(125); // XXX >> format %{ "movl $dst, $mem\t# range" %} >> opcode(0x8B); >> ins_encode(REX_reg_mem(dst, mem), OpcP, reg_mem(dst, >> mem)); >> ins_pipe(ialu_reg_mem); >> %} >> >> That's not a range check just yet; the real check, if any, >> should come after the null check, in the form of >> comparing >> something else with RSI. But you didn't show what's after >> the null check, how RSI is used, so it's hard >> to say what >> you're seeing in your example. >> >> As for the two test examples, could you paste the entire >> source code, with the PrintOptoAssembly output of >> method1() and >> method2() ? The first example looks weird, maybe it's a >> typo but you're using "j < cols" as the loop >> condition >> for the >> inner loop. >> >> >> - Kris >> >> On Mon, Jan 16, 2012 at 1:59 AM, Manohar Jonnalagedda < >> manojo10386 at gmail.com >> > manojo10386 at gmail.com>**> >> >> > **>__>> wrote: >> >> Hello, >> >> following this reference on Range Check Elimination >> done by the Hotspot compiler [1], I was keen in >> knowing >> how I >> can detect whether range checks are taking place in >> loops by inspecting output using the >> PrintAssembly flag; >> with >> the old PrintOptoAssembly flag, I have seen output >> such as the following, which I assume to be range >> checks : >> >> B11: # B73 B12 <- B10 Freq: 1.21365 >> 139 movq RAX, [rsp + #24] # spill >> 13e movl RSI, [RAX + #12 (8-bit)] # range >> 141 NullCheck RAX >> >> What is the equivalent with the new PrintAssembly flag >> (using hsdis)? >> >> Moreover, as stated on the wiki page [1], loops are >> optimized if the stride is a compile-time >> constant. I >> performed >> a few tests on a kmeans program, with 3 nested loops, >> having the following (high-level) structure: >> >> === >> void method1(){ >> //loop 1 >> for(int i = 0; i< rows1; i++){ >> //... >> for(int j = 0; j< rows2; j++){ >> //... >> for(int k = 0; j < cols; k++){ array[j * cols + k] >> = //...} >> } >> } >> } >> >> void method2(){ >> //loop 2 >> for(int i =0; i < rows1; i++){ >> for(int j=0 ; i< rows2; j++){ >> for(int k=0 ; k< cols; k++){ >> array[i*cols+k] = //... >> } >> } >> } >> } >> >> void main(){ >> >> do{ >> method1(); method2(); >> }while(!converged) >> >> } >> ==== >> >> In the first test, cols is an int whose value is >> determined at runtime (by reading a file), in the >> second >> test, it >> is given as a compile-time constant(3). In the second >> test, there is a */significant*/ speed-up >> (around 40%). >> >> However, when studying the diff of the output of >> PrintOptoAssembly for both method1 and method2, >> there is no >> difference (apart from slight value changes in >> frequency). Would you have any hints as to where I >> could look for >> differences? >> >> Thanks a lot, >> Manohar >> >> [1] https://wikis.oracle.com/__** >> display/HotSpotInternals/__**RangeCheckElimination >> >> > RangeCheckElimination >> > >> >> >> >> As Kris pointed you need to fix your example: >> >> -- Vitaly 617-548-7007 (mobile) -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120116/cb2bce5a/attachment-0001.html From robert.ottenhag at oracle.com Mon Jan 16 18:04:18 2012 From: robert.ottenhag at oracle.com (Robert Ottenhag) Date: Mon, 16 Jan 2012 18:04:18 -0800 (PST) Subject: RFR (S): 7130391: Add a framework for vendor-specific validation control of setting command-line switches in Hotspot Message-ID: <5a181ad1-ef64-4b00-88ce-69829f2ce80b@default> Hi, Webrev: http://cr.openjdk.java.net/~rottenha/7130391/webrev.00 This fix adds optional validation control to the setting of command-line switches in Hotspot, and allows it to have vendor-specific extensions if necessary. The design follows the previously added framework for vendor-specific command-line switch extensions in CR7117389. The validation control is handled by new boolean methods Flag::is_valid_(value,origin) that are called at the beginning of each call to CommandLineFlags[Ex]::AtPut() to verify that the new value and origin are valid replacements for the current value and origin for this flag. When parsing the command line options, a failed validation will typically result in an error message and exit with "Unrecognized VM option ''". When used dynamically using the attach API or management API the resulting operation will fail, leaving it up to the caller to handle it as appropriate. A simple use case for validation is a manageable flag whose current value can not be less than the previous value, while a more complex example may base the validation on multiple other flags, etc. Thanks, /Robert -- Oracle Robert Ottenhag | Senior Member of Technical Staff Oracle Oracle Java VM | ORACLE Sweden | Stockholm From david.holmes at oracle.com Mon Jan 16 19:09:21 2012 From: david.holmes at oracle.com (David Holmes) Date: Tue, 17 Jan 2012 13:09:21 +1000 Subject: RFR (S): 7130391: Add a framework for vendor-specific validation control of setting command-line switches in Hotspot In-Reply-To: <5a181ad1-ef64-4b00-88ce-69829f2ce80b@default> References: <5a181ad1-ef64-4b00-88ce-69829f2ce80b@default> Message-ID: <4F14E661.5080405@oracle.com> Hi Robert, I've added serviceability to the cc list. On 17/01/2012 12:04 PM, Robert Ottenhag wrote: > Webrev: http://cr.openjdk.java.net/~rottenha/7130391/webrev.00 > > This fix adds optional validation control to the setting of command-line switches in Hotspot, and allows it to have vendor-specific extensions if necessary. Does this imply that the Java management APIs (eg com.sun.management.VMOption) need to be changed to reflect these restrictions? Presently VMOptions are either writeable or not, but this makes them conditionally-writeable. > The design follows the previously added framework for vendor-specific command-line switch extensions in CR7117389. > > The validation control is handled by new boolean methods Flag::is_valid_(value,origin) that are called at the beginning of each call to CommandLineFlags[Ex]::AtPut() to verify that the new value and origin are valid replacements for the current value and origin for this flag. > > When parsing the command line options, a failed validation will typically result in an error message and exit with "Unrecognized VM option ''". When used dynamically using the attach API or management API the resulting operation will fail, leaving it up to the caller to handle it as appropriate. The error message doesn't really seem appropriate - it may well be a recognized option, you just can't set it to that value in that way. Ideally there would be a way for the validation logic to supply a meaningful error message. In its absence the top-level message should reflect the new type of error. Also some of the failures lead to crashes - which seems wrong to me - see below. ---- src/share/vm/services/management.cpp: 1821 if (!succeed) { 1822 THROW_MSG(vmSymbols::java_lang_IllegalArgumentException(), 1823 "This flag is not writeable with this value or origin."); That's a rather cryptic error message. How about: "Flag can not be set to the requested value using this API" ? ---- src/share/vm/runtime/globals_ext.hpp With all the inline bool Flag::is_valid_ext_T(T value, FlagValueOrigin origin) functions, is it necessary to include the type T in the function name? ----- src/share/vm/runtime/globals.cpp The use of the guarantees seems wrong as it means an invalid option will trigger a VM crash rather than a clean exit during initialization. It seems to me that none of the code in arguments.cpp that uses the FLAG_SET_* macros (which in turn use the CommandLineFlagsEx functions you added the guarantees to) anticipates any possibility for failure. I think if you are going this path then you have no choice but to change the CommandLineFlagsEx methods to return bool and update the FLAG_SET macros to try and perform appropriate error handling. David ----- > A simple use case for validation is a manageable flag whose current value can not be less than the previous value, while a more complex example may base the validation on multiple other flags, etc. > > Thanks, > > /Robert > From vladimir.kozlov at oracle.com Mon Jan 16 20:52:49 2012 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Mon, 16 Jan 2012 20:52:49 -0800 Subject: Detecting range check elimination with PrintAssembly In-Reply-To: References:

<4F135374.8020004@oracle.com> <4F149D88.2060305@oracle.com> <4F14B68A.6020808@oracle.com> Message-ID: <4F14FEA1.6020704@oracle.com> I need to be more clear. We can't get 100% correct profiling result for counters. First, profiling structure (MethodData) are created only after method is executed for some time (for Server compiler it is 3000 invocations). It is done to avoid profiling short programs since profiling will slowdown Interpreter by about 20%. This delayed profiling introduce first discrepancy between counters. MD could be also created when method has a hot loop (a lot of iterations) and OSR compilation is requested. As result counters in the loop will not correlate with counters before the loop. An other reason for not precise profiling counters is execution of method by multiple threads. We have only one MD structure per method which is updated (not atomically for speed) by all such threads. And when method is prepared for compilation compiler thread creates snapshot of MD which could happened in the middle of method execution by java threads. We don't reset MD during deoptimization because we still need previous values (in addition to counters we collect type information). So counters are accumulated. And profiling is resumed from deoptimization point which again adds discrepancy. I hope it explains why we can't trust 100% to profiling counters. Regards, Vladimir On 1/16/12 4:15 PM, Vitaly Davidovich wrote: > Vladimir, thanks for the explanation and the code pointer. > > Intuitively, it would seem like a good idea to trust the profile 100% if it reports the same value used 100% of the time > (I can see how anything less than 100%, even a very high probability of same value, is not trustworthy) given sufficient > trips through the loop. Although I can see how an app may have phases where same value is seen for a while before it's > switched, but that's where I thought deopt would help. There must be a good chunk of code out there that doesn't know > at static compilation time the loop count (so can't use compile-time constant), but at runtime the actual value doesn't > change for many many trips through the loop; I know I have code like that in various places. > > What's the reason a compilation after deopt would not be as aggressive as the 1st time? Is it because the profile > information may be "weaker" (i.e. more uncertainty in it)? I thought the profile is completely reset after deopt, so I > would think if the loop is now executed with a different "constant" value (e.g. in our example, instead of 3 it's now > 4), then the same optimizations will be applied (of course if unrolling the loop is no longer advantageous due to a much > different value, I can see how different optimizations will be applied). > > Thanks > > On Mon, Jan 16, 2012 at 6:45 PM, Vladimir Kozlov > wrote: > > Vitaly, > > We do use profile_trip_cnt during loop unroll calculation but not during fully unroll because we can't trust it 100% > since program's phase and number of iterations could change after method is compiled. See policy_unroll() and > policy_maximally_unroll(): > > http://hg.openjdk.java.net/__hsx/hotspot-comp/hotspot/file/__89d0a5d40008/src/share/vm/__opto/loopTransform.cpp > > > We could use deopt as you suggested but deoptimization is double-edge sword, when method recompiled after > deoptimization some aggressive optimizations will not be executed for it so the new generated code could be slower. > > Regards, > Vladimir > > > On 1/16/12 2:37 PM, Vitaly Davidovich wrote: > > Hi Vladimir, > > If x_col is always seen to be same value in the profile shouldn't the loop be unrolled as well with some deopt > guard? Or > does this not participate in profiling? > > Thanks > > On Jan 16, 2012 4:57 PM, "Vladimir Kozlov" > >> wrote: > > > be different, because the expressions are similar. The difference in runtime (due to cols being a compile-time > > constant) will be visible elsewhere. Is that right? If so, where would I be able to detect this? > > In such situations we usually use some visual tools to see difference between log outputs. At least you can use > 'diff'. You may need to replace instructions addresses in outputs (number at the beginning of lines) with the same > value to match. There are few tricks you may use to get similar PrintOptoAssembly output. Use next flags to > avoid > mixing output from program output and from 2 compiler threads (flags stop program until a method is compiled > and run > only one compiler thread): > > -Xbatch -XX:CICompilerCount=1 > > Also add -XX:+PrintCompilation -XX:+PrintInlining to see what method is compiled and inlined. Note that you > may see > similar output for individual methods but could be big difference in compiled caller (computeAll()) method > where 2 > loop methods could be inlined. So you need to compare all compiled methods. > > In general, to have constant as loop limit is always win because some checks in generated code could be > avoided and > more optimizations could be done for such loops. Use -XX:+TraceLoopOpts to see what loop optimizations are > done in > both cases. > > For example, in your code you set 'x_col = 3', as result the next loop in > constructNearestClusterVector(____) will be > > fully unrolled when this method is inlined into computeAll() and x_col is replaced with '3': > > for(k = 0; k < x_col; k++) { > double tmp = x[i*x_col + k] - mu[j* mu_col + k]; > dist += tmp * tmp; > } > > Vladimir > > On 1/16/12 1:39 AM, Manohar Jonnalagedda wrote: > > Hi Kris, Vladimir, > > thanks for both your responses. > > Second, your two test methods are different so you can't directly compare them. method1() iterates > over rows > using > middle loop index 'j' and method2() uses external loop index 'i'. Unless they are typos again. > > > You are right, these are indeed typos. As Kris suggested, I have the code printed here: > http://pastebin.com/xRFD1Nt1. > The methods corresponding to method1, and method2 are constructNearestClusterVector and > computeNewCentroids. Their > PrintOptoAssembly outputs are respectively at http://pastebin.com/1evN8b3K and http://pastebin.com/FxkVWTD5 > > Also, it seems I have not explained myself correctly. I am not trying to compare the performance of > method1 with > respect > to that of method2: method1 and method2 both run in the same program. What I am trying to compare is their > performance > in two cases: > - when cols is a compile-time constant (much faster) > - when cols is a value determined at run-time > > If you are using jdk7 there are few flags you can use to print loop optimizations information. They > need debug > version of VM but it is not problem for you, I think, since you can use debug PrintOptoAssembly flag. > > -XX:+TraceLoopOpts prints all happened loop optimizations and loop tree after each round of loop opts, > -XX:+TraceLoopPredicate prints RC information when it is moved from a loop, > -XX:+TraceRangeLimitCheck prints additional information for RC elimination optimization. > > > Thanks for these, I will have a look at what they output. > > Fourth, range check expression in your example is not what you think. RC expression should be next: > (i*stride+offset) where 'i' is loop variable, 'stride' is constant and 'offset' is loop invariant. > > In your example 'offset' is (j * cols) since it is loop invariant, 'k' is loop variable and stride > is '1' (one). > In both your methods RC will be moved out of inner loop so the code for it will be the same. The only > difference in > these methods will be where and how (j * cols) and (i * cols) expressions are calculated. > > > I'd guess it's the difference in locality that made the difference in performance in your two tests. > > Thanks for the explanation. I understand from the above that the assembly output in both cases > mentioned above > may not > be different, because the expressions are similar. The difference in runtime (due to cols being a > compile-time > constant) > will be visible elsewhere. Is that right? If so, where would I be able to detect this? > > Cheers, > Manohar > > In your PrintOptoAssembly output snippet, the instruction at 0x13e is a LoadRange, which loads > the range > from > the header > of an array: > > (from x86_64.ad ) > > > // Load Range > instruct loadRange(rRegI dst, memory mem) > %{ > match(Set dst (LoadRange mem)); > > ins_cost(125); // XXX > format %{ "movl $dst, $mem\t# range" %} > opcode(0x8B); > ins_encode(REX_reg_mem(dst, mem), OpcP, reg_mem(dst, mem)); > ins_pipe(ialu_reg_mem); > %} > > That's not a range check just yet; the real check, if any, should come after the null check, in > the form of > comparing > something else with RSI. But you didn't show what's after the null check, how RSI is used, so > it's hard > to say what > you're seeing in your example. > > As for the two test examples, could you paste the entire source code, with the PrintOptoAssembly > output of > method1() and > method2() ? The first example looks weird, maybe it's a typo but you're using "j < cols" as the loop > condition > for the > inner loop. > > > - Kris > > On Mon, Jan 16, 2012 at 1:59 AM, Manohar Jonnalagedda > > >__> > > > >__>__>> wrote: > > Hello, > > following this reference on Range Check Elimination done by the Hotspot compiler [1], I was > keen in > knowing > how I > can detect whether range checks are taking place in loops by inspecting output using the > PrintAssembly flag; > with > the old PrintOptoAssembly flag, I have seen output such as the following, which I assume to > be range > checks : > > B11: # B73 B12 <- B10 Freq: 1.21365 > 139 movq RAX, [rsp + #24] # spill > 13e movl RSI, [RAX + #12 (8-bit)] # range > 141 NullCheck RAX > > What is the equivalent with the new PrintAssembly flag (using hsdis)? > > Moreover, as stated on the wiki page [1], loops are optimized if the stride is a compile-time > constant. I > performed > a few tests on a kmeans program, with 3 nested loops, having the following (high-level) > structure: > > === > void method1(){ > //loop 1 > for(int i = 0; i< rows1; i++){ > //... > for(int j = 0; j< rows2; j++){ > //... > for(int k = 0; j < cols; k++){ array[j * cols + k] = //...} > } > } > } > > void method2(){ > //loop 2 > for(int i =0; i < rows1; i++){ > for(int j=0 ; i< rows2; j++){ > for(int k=0 ; k< cols; k++){ > array[i*cols+k] = //... > } > } > } > } > > void main(){ > > do{ > method1(); method2(); > }while(!converged) > > } > ==== > > In the first test, cols is an int whose value is determined at runtime (by reading a file), > in the > second > test, it > is given as a compile-time constant(3). In the second test, there is a */significant*/ speed-up > (around 40%). > > However, when studying the diff of the output of PrintOptoAssembly for both method1 and method2, > there is no > difference (apart from slight value changes in frequency). Would you have any hints as to > where I > could look for > differences? > > Thanks a lot, > Manohar > > [1] https://wikis.oracle.com/____display/HotSpotInternals/____RangeCheckElimination > > > > > > > > As Kris pointed you need to fix your example: > > > > > -- > Vitaly > 617-548-7007 (mobile) From vitalyd at gmail.com Mon Jan 16 21:23:34 2012 From: vitalyd at gmail.com (Vitaly Davidovich) Date: Tue, 17 Jan 2012 00:23:34 -0500 Subject: Detecting range check elimination with PrintAssembly In-Reply-To: <4F14FEA1.6020704@oracle.com> References:

<4F135374.8020004@oracle.com> <4F149D88.2060305@oracle.com> <4F14B68A.6020808@oracle.com> <4F14FEA1.6020704@oracle.com> Message-ID: Vladimir, thanks for that -- very useful and interesting to know. So I take it a compilation after initial deopt may be less aggressive than 1st time around simply because the accumulated MD might be in a state such that it guides optimization in a different direction, is that right? I guess how does one reason about the generated assembly when it's so (possibly) dynamic? :) I've looked at the assembly in a product hotspot via hsdis, but then you realize that you may be looking at just one compilation of it, and it may be different at some point later. Also, for something like Intel's VTune (or Amplifier) that support showing JIT'd assembly, I guess it's the same issue? This is quite different, of course, with .NET CLR's JIT because there's no profiling there -- 1st time method is hit, it gets JIT'd and there's no code pitching (makes looking at the assembly seem a bit more "reassuring"). In general, would you say that the C2 compiler favors "over aggressiveness" with deopt guards? Or does it try to avoid deopts and only perform aggressive optimizations where the profile is quite conclusive on type and/or counters? Thanks On Mon, Jan 16, 2012 at 11:52 PM, Vladimir Kozlov < vladimir.kozlov at oracle.com> wrote: > I need to be more clear. We can't get 100% correct profiling result for > counters. First, profiling structure (MethodData) are created only after > method is executed for some time (for Server compiler it is 3000 > invocations). It is done to avoid profiling short programs since profiling > will slowdown Interpreter by about 20%. This delayed profiling introduce > first discrepancy between counters. MD could be also created when method > has a hot loop (a lot of iterations) and OSR compilation is requested. As > result counters in the loop will not correlate with counters before the > loop. An other reason for not precise profiling counters is execution of > method by multiple threads. We have only one MD structure per method which > is updated (not atomically for speed) by all such threads. And when method > is prepared for compilation compiler thread creates snapshot of MD which > could happened in the middle of method execution by java threads. > > We don't reset MD during deoptimization because we still need previous > values (in addition to counters we collect type information). So counters > are accumulated. And profiling is resumed from deoptimization point which > again adds discrepancy. > > I hope it explains why we can't trust 100% to profiling counters. > > Regards, > Vladimir > > > On 1/16/12 4:15 PM, Vitaly Davidovich wrote: > >> Vladimir, thanks for the explanation and the code pointer. >> >> Intuitively, it would seem like a good idea to trust the profile 100% if >> it reports the same value used 100% of the time >> (I can see how anything less than 100%, even a very high probability of >> same value, is not trustworthy) given sufficient >> trips through the loop. Although I can see how an app may have phases >> where same value is seen for a while before it's >> switched, but that's where I thought deopt would help. There must be a >> good chunk of code out there that doesn't know >> at static compilation time the loop count (so can't use compile-time >> constant), but at runtime the actual value doesn't >> change for many many trips through the loop; I know I have code like that >> in various places. >> >> What's the reason a compilation after deopt would not be as aggressive as >> the 1st time? Is it because the profile >> information may be "weaker" (i.e. more uncertainty in it)? I thought the >> profile is completely reset after deopt, so I >> would think if the loop is now executed with a different "constant" value >> (e.g. in our example, instead of 3 it's now >> 4), then the same optimizations will be applied (of course if unrolling >> the loop is no longer advantageous due to a much >> different value, I can see how different optimizations will be applied). >> >> Thanks >> >> On Mon, Jan 16, 2012 at 6:45 PM, Vladimir Kozlov < >> vladimir.kozlov at oracle.com >> >> wrote: >> >> Vitaly, >> >> We do use profile_trip_cnt during loop unroll calculation but not >> during fully unroll because we can't trust it 100% >> since program's phase and number of iterations could change after >> method is compiled. See policy_unroll() and >> policy_maximally_unroll(): >> >> http://hg.openjdk.java.net/__**hsx/hotspot-comp/hotspot/file/** >> __89d0a5d40008/src/share/vm/__**opto/loopTransform.cpp >> >> > 89d0a5d40008/src/share/vm/**opto/loopTransform.cpp >> > >> >> We could use deopt as you suggested but deoptimization is double-edge >> sword, when method recompiled after >> deoptimization some aggressive optimizations will not be executed for >> it so the new generated code could be slower. >> >> Regards, >> Vladimir >> >> >> On 1/16/12 2:37 PM, Vitaly Davidovich wrote: >> >> Hi Vladimir, >> >> If x_col is always seen to be same value in the profile shouldn't >> the loop be unrolled as well with some deopt >> guard? Or >> does this not participate in profiling? >> >> Thanks >> >> On Jan 16, 2012 4:57 PM, "Vladimir Kozlov" < >> vladimir.kozlov at oracle.com >> > >> > vladimir.kozlov@**oracle.com >>> wrote: >> >> > be different, because the expressions are similar. The >> difference in runtime (due to cols being a compile-time >> > constant) will be visible elsewhere. Is that right? If so, >> where would I be able to detect this? >> >> In such situations we usually use some visual tools to see >> difference between log outputs. At least you can use >> 'diff'. You may need to replace instructions addresses in outputs >> (number at the beginning of lines) with the same >> value to match. There are few tricks you may use to get >> similar PrintOptoAssembly output. Use next flags to >> avoid >> mixing output from program output and from 2 compiler threads >> (flags stop program until a method is compiled >> and run >> only one compiler thread): >> >> -Xbatch -XX:CICompilerCount=1 >> >> Also add -XX:+PrintCompilation -XX:+PrintInlining to see what >> method is compiled and inlined. Note that you >> may see >> similar output for individual methods but could be big >> difference in compiled caller (computeAll()) method >> where 2 >> loop methods could be inlined. So you need to compare all >> compiled methods. >> >> In general, to have constant as loop limit is always win >> because some checks in generated code could be >> avoided and >> more optimizations could be done for such loops. Use >> -XX:+TraceLoopOpts to see what loop optimizations are >> done in >> both cases. >> >> For example, in your code you set 'x_col = 3', as result the >> next loop in >> constructNearestClusterVector(**____) will be >> >> >> fully unrolled when this method is inlined into computeAll() >> and x_col is replaced with '3': >> >> for(k = 0; k < x_col; k++) { >> double tmp = x[i*x_col + k] - mu[j* mu_col + k]; >> dist += tmp * tmp; >> } >> >> Vladimir >> >> On 1/16/12 1:39 AM, Manohar Jonnalagedda wrote: >> >> Hi Kris, Vladimir, >> >> thanks for both your responses. >> >> Second, your two test methods are different so you >> can't directly compare them. method1() iterates >> over rows >> using >> middle loop index 'j' and method2() uses external loop >> index 'i'. Unless they are typos again. >> >> >> You are right, these are indeed typos. As Kris suggested, >> I have the code printed here: >> http://pastebin.com/xRFD1Nt1. >> The methods corresponding to method1, and method2 are >> constructNearestClusterVector and >> computeNewCentroids. Their >> PrintOptoAssembly outputs are respectively at >> http://pastebin.com/1evN8b3K and http://pastebin.com/FxkVWTD5 >> >> Also, it seems I have not explained myself correctly. I am >> not trying to compare the performance of >> method1 with >> respect >> to that of method2: method1 and method2 both run in the >> same program. What I am trying to compare is their >> performance >> in two cases: >> - when cols is a compile-time constant (much faster) >> - when cols is a value determined at run-time >> >> If you are using jdk7 there are few flags you can use >> to print loop optimizations information. They >> need debug >> version of VM but it is not problem for you, I think, >> since you can use debug PrintOptoAssembly flag. >> >> -XX:+TraceLoopOpts prints all happened loop >> optimizations and loop tree after each round of loop opts, >> -XX:+TraceLoopPredicate prints RC information when it >> is moved from a loop, >> -XX:+TraceRangeLimitCheck prints additional >> information for RC elimination optimization. >> >> >> Thanks for these, I will have a look at what they output. >> >> Fourth, range check expression in your example is not >> what you think. RC expression should be next: >> (i*stride+offset) where 'i' is loop variable, 'stride' >> is constant and 'offset' is loop invariant. >> >> In your example 'offset' is (j * cols) since it is >> loop invariant, 'k' is loop variable and stride >> is '1' (one). >> In both your methods RC will be moved out of inner >> loop so the code for it will be the same. The only >> difference in >> these methods will be where and how (j * cols) and (i >> * cols) expressions are calculated. >> >> >> I'd guess it's the difference in locality that >> made the difference in performance in your two tests. >> >> Thanks for the explanation. I understand from the above >> that the assembly output in both cases >> mentioned above >> may not >> be different, because the expressions are similar. The >> difference in runtime (due to cols being a >> compile-time >> constant) >> will be visible elsewhere. Is that right? If so, where >> would I be able to detect this? >> >> Cheers, >> Manohar >> >> In your PrintOptoAssembly output snippet, the >> instruction at 0x13e is a LoadRange, which loads >> the range >> from >> the header >> of an array: >> >> (from x86_64.ad < >> http://x86_64.ad> ) >> >> >> >> // Load Range >> instruct loadRange(rRegI dst, memory mem) >> %{ >> match(Set dst (LoadRange mem)); >> >> ins_cost(125); // XXX >> format %{ "movl $dst, $mem\t# range" %} >> opcode(0x8B); >> ins_encode(REX_reg_mem(dst, mem), OpcP, >> reg_mem(dst, mem)); >> ins_pipe(ialu_reg_mem); >> %} >> >> That's not a range check just yet; the real check, >> if any, should come after the null check, in >> the form of >> comparing >> something else with RSI. But you didn't show >> what's after the null check, how RSI is used, so >> it's hard >> to say what >> you're seeing in your example. >> >> As for the two test examples, could you paste the >> entire source code, with the PrintOptoAssembly >> output of >> method1() and >> method2() ? The first example looks weird, maybe >> it's a typo but you're using "j < cols" as the loop >> condition >> for the >> inner loop. >> >> >> - Kris >> >> On Mon, Jan 16, 2012 at 1:59 AM, Manohar >> Jonnalagedda > >> **> >> > > manojo10386 at gmail.com>**>__> >> >> > **> > manojo10386 at gmail.com> >> **>__>__>> >> wrote: >> >> Hello, >> >> following this reference on Range Check >> Elimination done by the Hotspot compiler [1], I was >> keen in >> knowing >> how I >> can detect whether range checks are taking >> place in loops by inspecting output using the >> PrintAssembly flag; >> with >> the old PrintOptoAssembly flag, I have seen >> output such as the following, which I assume to >> be range >> checks : >> >> B11: # B73 B12 <- B10 Freq: 1.21365 >> 139 movq RAX, [rsp + #24] # spill >> 13e movl RSI, [RAX + #12 (8-bit)] # >> range >> 141 NullCheck RAX >> >> What is the equivalent with the new >> PrintAssembly flag (using hsdis)? >> >> Moreover, as stated on the wiki page [1], >> loops are optimized if the stride is a compile-time >> constant. I >> performed >> a few tests on a kmeans program, with 3 nested >> loops, having the following (high-level) >> structure: >> >> === >> void method1(){ >> //loop 1 >> for(int i = 0; i< rows1; i++){ >> //... >> for(int j = 0; j< rows2; j++){ >> //... >> for(int k = 0; j < cols; k++){ array[j * >> cols + k] = //...} >> } >> } >> } >> >> void method2(){ >> //loop 2 >> for(int i =0; i < rows1; i++){ >> for(int j=0 ; i< rows2; j++){ >> for(int k=0 ; k< cols; k++){ >> array[i*cols+k] = //... >> } >> } >> } >> } >> >> void main(){ >> >> do{ >> method1(); method2(); >> }while(!converged) >> >> } >> ==== >> >> In the first test, cols is an int whose value >> is determined at runtime (by reading a file), >> in the >> second >> test, it >> is given as a compile-time constant(3). In the >> second test, there is a */significant*/ speed-up >> (around 40%). >> >> However, when studying the diff of the output >> of PrintOptoAssembly for both method1 and method2, >> there is no >> difference (apart from slight value changes in >> frequency). Would you have any hints as to >> where I >> could look for >> differences? >> >> Thanks a lot, >> Manohar >> >> [1] https://wikis.oracle.com/____** >> display/HotSpotInternals/____**RangeCheckElimination >> > RangeCheckElimination >> > >> >> >> > RangeCheckElimination >> > RangeCheckElimination >> >> >> >> >> >> As Kris pointed you need to fix your example: >> >> >> >> >> -- >> Vitaly >> 617-548-7007 (mobile) >> > -- Vitaly 617-548-7007 (mobile) -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120117/784a0a21/attachment-0001.html From robert.ottenhag at oracle.com Tue Jan 17 04:07:50 2012 From: robert.ottenhag at oracle.com (Robert Ottenhag) Date: Tue, 17 Jan 2012 13:07:50 +0100 Subject: RFR (S): 7130391: Add a framework for vendor-specific validation control of setting command-line switches in Hotspot In-Reply-To: <4F14E661.5080405@oracle.com> References: <5a181ad1-ef64-4b00-88ce-69829f2ce80b@default> <4F14E661.5080405@oracle.com> Message-ID: <4F156496.3060206@oracle.com> David, Thanks for the review. On 01/17/2012 04:09 AM, David Holmes wrote: > Hi Robert, > > I've added serviceability to the cc list. Good, will try to remember that ;-) > > On 17/01/2012 12:04 PM, Robert Ottenhag wrote: >> Webrev: http://cr.openjdk.java.net/~rottenha/7130391/webrev.00 >> >> This fix adds optional validation control to the setting of >> command-line switches in Hotspot, and allows it to have >> vendor-specific extensions if necessary. > > Does this imply that the Java management APIs (eg > com.sun.management.VMOption) need to be changed to reflect these > restrictions? Presently VMOptions are either writeable or not, but > this makes them conditionally-writeable. No, since the Java management APIs already cares for conditional writes. According to com.sun.management.HotSpotDiagnosticMXBean.setVMOption() it will throw IllegalArgumentException if the new value is invalid. > >> The design follows the previously added framework for vendor-specific >> command-line switch extensions in CR7117389. >> >> The validation control is handled by new boolean methods >> Flag::is_valid_(value,origin) that are called at the beginning >> of each call to CommandLineFlags[Ex]::AtPut() to verify that >> the new value and origin are valid replacements for the current value >> and origin for this flag. >> >> When parsing the command line options, a failed validation will >> typically result in an error message and exit with "Unrecognized VM >> option ''". When used dynamically using the attach API or >> management API the resulting operation will fail, leaving it up to >> the caller to handle it as appropriate. > > The error message doesn't really seem appropriate - it may well be a > recognized option, you just can't set it to that value in that way. > Ideally there would be a way for the validation logic to supply a > meaningful error message. In its absence the top-level message should > reflect the new type of error. You are absolutely right, but the current fix is in line with the existing bad error messages where any kind of malformatted command line flags results in Unrecognized VM option, whether the reason is an unknown name, bad type semantics (using +- for bool semantics on an integer flag), or if the flag is locked. I will target meaningful error messages for command line parsing in a direct follow up bug to this fix. > > Also some of the failures lead to crashes - which seems wrong to me - > see below. > > ---- > > src/share/vm/services/management.cpp: > > 1821 if (!succeed) { > 1822 THROW_MSG(vmSymbols::java_lang_IllegalArgumentException(), > 1823 "This flag is not writeable with this value or > origin."); > > That's a rather cryptic error message. How about: > > "Flag can not be set to the requested value using this API" > > ? Yes, "origin" does not make sense to the upper Java layer. I will use your suggestion. > > ---- > > src/share/vm/runtime/globals_ext.hpp > > With all the > > inline bool Flag::is_valid_ext_T(T value, FlagValueOrigin origin) > > functions, is it necessary to include the type T in the function name? It is necessary if using type safe variants with T value as argument since overloading does not differ between different typedef names that resolves to the same native types, e.g. uintx and uint64_t are both unsigned long int. I am considering a condensed variant that replaces T by void* instead, and do the type casting based on the targeted flag, reducing the number of functions. > > > ----- > > src/share/vm/runtime/globals.cpp > > The use of the guarantees seems wrong as it means an invalid option > will trigger a VM crash rather than a clean exit during > initialization. It seems to me that none of the code in arguments.cpp > that uses the FLAG_SET_* macros (which in turn use the > CommandLineFlagsEx functions you added the guarantees to) anticipates > any possibility for failure. I think if you are going this path then > you have no choice but to change the CommandLineFlagsEx methods to > return bool and update the FLAG_SET macros to try and perform > appropriate error handling. I see your point, and in theory such as VM crash could occur anytime later in a JVM session if rarely running code would make use of FLAG_SET_* to change the value of a VM flag to an invalid value or origin. Seems as if the options are either to a) ignore validation tests for the FLAG_SET_* macros, and trust that they always set valid values. This can be partly verified by static code inspection by looking for any variables that actually have validation logic associated to them (since the variable name is defined at compile time), assuming one has access to all code, but it is not perfect in case code for changing a variable with validation logic exists. b) contain the error handling within the FLAG_SET_* macros, like using guarantee(), but maybe exception logic can help? c) require usage of the FLAG_SET_* macros to handle result codes and pass it up the call chain. Also, the current macro FLAG_SET_DEFAULT does a direct write to the flag value without going through AtPut(). This macro must be rewritten to have validation control to close the holes. The current call format will require all call sites to include type name as with FLAG_SET_{CMDLINE,ERGO} has, or to use slower lookup by variable name. /Robert > > David > ----- > > >> A simple use case for validation is a manageable flag whose current >> value can not be less than the previous value, while a more complex >> example may base the validation on multiple other flags, etc. >> >> Thanks, >> >> /Robert >> -- Oracle Robert Ottenhag | Senior Member of Technical Staff Phone: +46850630961 | Fax: +46850630911 | Mobile: +46707106161 Oracle Java HotSpot Virtual Machine ORACLE Sweden | Folkungagatan 122 | SE-116 30 Stockholm Oracle Svenska AB, Kronborgsgr?nd 17, S-164 28 KISTA, reg.no. 556254-6746 Green Oracle Oracle is committed to developing practices and products that help protect the environment -- From Dmitry.Samersoff at oracle.com Tue Jan 17 04:22:11 2012 From: Dmitry.Samersoff at oracle.com (Dmitry Samersoff) Date: Tue, 17 Jan 2012 16:22:11 +0400 Subject: RFR (S): 7130391: Add a framework for vendor-specific validation control of setting command-line switches in Hotspot In-Reply-To: <5a181ad1-ef64-4b00-88ce-69829f2ce80b@default> References: <5a181ad1-ef64-4b00-88ce-69829f2ce80b@default> Message-ID: <4F1567F3.70106@oracle.com> Robert, I'm second to David,a message "wrong flag value or origin" looks very cryptic. Besides it, looks good for me. -Dmitry On 2012-01-17 06:04, Robert Ottenhag wrote: > Hi, > > Webrev: http://cr.openjdk.java.net/~rottenha/7130391/webrev.00 > > This fix adds optional validation control to the setting of command-line switches in Hotspot, and allows it to have vendor-specific extensions if necessary. > > The design follows the previously added framework for vendor-specific command-line switch extensions in CR7117389. > > The validation control is handled by new boolean methods Flag::is_valid_(value,origin) that are called at the beginning of each call to CommandLineFlags[Ex]::AtPut() to verify that the new value and origin are valid replacements for the current value and origin for this flag. > > When parsing the command line options, a failed validation will typically result in an error message and exit with "Unrecognized VM option ''". When used dynamically using the attach API or management API the resulting operation will fail, leaving it up to the caller to handle it as appropriate. > > A simple use case for validation is a manageable flag whose current value can not be less than the previous value, while a more complex example may base the validation on multiple other flags, etc. > > Thanks, > > /Robert > -- Dmitry Samersoff Java Hotspot development team, SPB04 * There will come soft rains ... From robert.ottenhag at oracle.com Tue Jan 17 05:33:01 2012 From: robert.ottenhag at oracle.com (Robert Ottenhag) Date: Tue, 17 Jan 2012 14:33:01 +0100 Subject: RFR (S): 7130391: Add a framework for vendor-specific validation control of setting command-line switches in Hotspot In-Reply-To: <4F1567F3.70106@oracle.com> References: <5a181ad1-ef64-4b00-88ce-69829f2ce80b@default> <4F1567F3.70106@oracle.com> Message-ID: <4F15788D.7020004@oracle.com> Dmitry, Thanks for the review. /Robert On 01/17/2012 01:22 PM, Dmitry Samersoff wrote: > Robert, > > I'm second to David,a message "wrong flag value or origin" looks very > cryptic. > > Besides it, looks good for me. > > -Dmitry > > > On 2012-01-17 06:04, Robert Ottenhag wrote: >> Hi, >> >> Webrev: http://cr.openjdk.java.net/~rottenha/7130391/webrev.00 >> >> This fix adds optional validation control to the setting of command-line switches in Hotspot, and allows it to have vendor-specific extensions if necessary. >> >> The design follows the previously added framework for vendor-specific command-line switch extensions in CR7117389. >> >> The validation control is handled by new boolean methods Flag::is_valid_(value,origin) that are called at the beginning of each call to CommandLineFlags[Ex]::AtPut() to verify that the new value and origin are valid replacements for the current value and origin for this flag. >> >> When parsing the command line options, a failed validation will typically result in an error message and exit with "Unrecognized VM option ''". When used dynamically using the attach API or management API the resulting operation will fail, leaving it up to the caller to handle it as appropriate. >> >> A simple use case for validation is a manageable flag whose current value can not be less than the previous value, while a more complex example may base the validation on multiple other flags, etc. >> >> Thanks, >> >> /Robert >> > -- Oracle Robert Ottenhag | Senior Member of Technical Staff Phone: +46850630961 | Fax: +46850630911 | Mobile: +46707106161 Oracle Java HotSpot Virtual Machine ORACLE Sweden | Folkungagatan 122 | SE-116 30 Stockholm Oracle Svenska AB, Kronborgsgr?nd 17, S-164 28 KISTA, reg.no. 556254-6746 Green Oracle Oracle is committed to developing practices and products that help protect the environment -- From robert.ottenhag at oracle.com Tue Jan 17 05:41:37 2012 From: robert.ottenhag at oracle.com (Robert Ottenhag) Date: Tue, 17 Jan 2012 14:41:37 +0100 Subject: RFR (S): 7130391: Add a framework for vendor-specific validation control of setting command-line switches in Hotspot In-Reply-To: <4F156496.3060206@oracle.com> References: <5a181ad1-ef64-4b00-88ce-69829f2ce80b@default> <4F14E661.5080405@oracle.com> <4F156496.3060206@oracle.com> Message-ID: <4F157A91.8080907@oracle.com> David, Regarding the FLAG_SET_* macros, I am thinking that we can leave them to a follow up bug instead. The reason is that it can be verified by code inspection (of preprocessed sources) if any FLAG_SET_* macro writes to a variable known to have validation control. Also, fixing that hole would require any access to the variables to occur through interface get/set functions, preventing direct read and write access (wrapping the variable in a class to prevent direct writes), a change too intrusive for now. Will come back with an updated and cleaned up patch. /Robert On 01/17/2012 01:07 PM, Robert Ottenhag wrote: > David, > > Thanks for the review. > > On 01/17/2012 04:09 AM, David Holmes wrote: >> Hi Robert, >> >> I've added serviceability to the cc list. > > Good, will try to remember that ;-) > >> >> On 17/01/2012 12:04 PM, Robert Ottenhag wrote: >>> Webrev: http://cr.openjdk.java.net/~rottenha/7130391/webrev.00 >>> >>> This fix adds optional validation control to the setting of >>> command-line switches in Hotspot, and allows it to have >>> vendor-specific extensions if necessary. >> >> Does this imply that the Java management APIs (eg >> com.sun.management.VMOption) need to be changed to reflect these >> restrictions? Presently VMOptions are either writeable or not, but >> this makes them conditionally-writeable. > > No, since the Java management APIs already cares for conditional > writes. According to > com.sun.management.HotSpotDiagnosticMXBean.setVMOption() it will throw > IllegalArgumentException if the new value is invalid. > >> >>> The design follows the previously added framework for >>> vendor-specific command-line switch extensions in CR7117389. >>> >>> The validation control is handled by new boolean methods >>> Flag::is_valid_(value,origin) that are called at the beginning >>> of each call to CommandLineFlags[Ex]::AtPut() to verify that >>> the new value and origin are valid replacements for the current >>> value and origin for this flag. >>> >>> When parsing the command line options, a failed validation will >>> typically result in an error message and exit with "Unrecognized VM >>> option ''". When used dynamically using the attach API >>> or management API the resulting operation will fail, leaving it up >>> to the caller to handle it as appropriate. >> >> The error message doesn't really seem appropriate - it may well be a >> recognized option, you just can't set it to that value in that way. >> Ideally there would be a way for the validation logic to supply a >> meaningful error message. In its absence the top-level message should >> reflect the new type of error. > > You are absolutely right, but the current fix is in line with the > existing bad error messages where any kind of malformatted command > line flags results in Unrecognized VM option, whether the reason is an > unknown name, bad type semantics (using +- for bool semantics on an > integer flag), or if the flag is locked. > > I will target meaningful error messages for command line parsing in a > direct follow up bug to this fix. > >> >> Also some of the failures lead to crashes - which seems wrong to me - >> see below. >> >> ---- >> >> src/share/vm/services/management.cpp: >> >> 1821 if (!succeed) { >> 1822 THROW_MSG(vmSymbols::java_lang_IllegalArgumentException(), >> 1823 "This flag is not writeable with this value or >> origin."); >> >> That's a rather cryptic error message. How about: >> >> "Flag can not be set to the requested value using this API" >> >> ? > > Yes, "origin" does not make sense to the upper Java layer. I will use > your suggestion. > >> >> ---- >> >> src/share/vm/runtime/globals_ext.hpp >> >> With all the >> >> inline bool Flag::is_valid_ext_T(T value, FlagValueOrigin origin) >> >> functions, is it necessary to include the type T in the function name? > > It is necessary if using type safe variants with T value as argument > since overloading does not differ between different typedef names that > resolves to the same native types, e.g. uintx and uint64_t are both > unsigned long int. > > I am considering a condensed variant that replaces T by void* instead, > and do the type casting based on the targeted flag, reducing the > number of functions. > >> >> >> ----- >> >> src/share/vm/runtime/globals.cpp >> >> The use of the guarantees seems wrong as it means an invalid option >> will trigger a VM crash rather than a clean exit during >> initialization. It seems to me that none of the code in arguments.cpp >> that uses the FLAG_SET_* macros (which in turn use the >> CommandLineFlagsEx functions you added the guarantees to) anticipates >> any possibility for failure. I think if you are going this path then >> you have no choice but to change the CommandLineFlagsEx methods to >> return bool and update the FLAG_SET macros to try and perform >> appropriate error handling. > > I see your point, and in theory such as VM crash could occur anytime > later in a JVM session if rarely running code would make use of > FLAG_SET_* to change the value of a VM flag to an invalid value or > origin. > > Seems as if the options are either to > a) ignore validation tests for the FLAG_SET_* macros, and trust that > they always set valid values. This can be partly verified by static > code inspection by looking for any variables that actually have > validation logic associated to them (since the variable name is > defined at compile time), assuming one has access to all code, but it > is not perfect in case code for changing a variable with validation > logic exists. > b) contain the error handling within the FLAG_SET_* macros, like using > guarantee(), but maybe exception logic can help? > c) require usage of the FLAG_SET_* macros to handle result codes and > pass it up the call chain. > > Also, the current macro FLAG_SET_DEFAULT does a direct write to the > flag value without going through AtPut(). This macro must be > rewritten to have validation control to close the holes. The current > call format will require all call sites to include type name as with > FLAG_SET_{CMDLINE,ERGO} has, or to use slower lookup by variable name. > > /Robert > >> >> David >> ----- >> >> >>> A simple use case for validation is a manageable flag whose current >>> value can not be less than the previous value, while a more complex >>> example may base the validation on multiple other flags, etc. >>> >>> Thanks, >>> >>> /Robert >>> > > -- Oracle Robert Ottenhag | Senior Member of Technical Staff Phone: +46850630961 | Fax: +46850630911 | Mobile: +46707106161 Oracle Java HotSpot Virtual Machine ORACLE Sweden | Folkungagatan 122 | SE-116 30 Stockholm Oracle Svenska AB, Kronborgsgr?nd 17, S-164 28 KISTA, reg.no. 556254-6746 Green Oracle Oracle is committed to developing practices and products that help protect the environment -- From mikael.gerdin at oracle.com Tue Jan 17 07:34:53 2012 From: mikael.gerdin at oracle.com (Mikael Gerdin) Date: Tue, 17 Jan 2012 16:34:53 +0100 Subject: Review request: White box testing API for HotSpot In-Reply-To: <4F0F6605.5080900@oracle.com> References: <4ED50287.3070102@oracle.com> <4F0C910A.5070205@oracle.com> <4F0E8B10.3030600@oracle.com> <4F0F0850.8050402@oracle.com> <4F0F6605.5080900@oracle.com> Message-ID: <4F15951D.7080508@oracle.com> David, On 2012-01-13 00:00, David Holmes wrote: > Hi Mikael, > > On 13/01/2012 2:20 AM, Mikael Gerdin wrote: >>> wbapi.java: normal Java naming style is to use camel-case for class >>> names. Though as WB is itself an acronym I'd be okay with WBApi. In fact >>> I'd be happy with anything other than initial lower-case :) >> >> Many of our existing tests have lower-case names so I guess I thought >> that was some sort of convention, it does not really matter to me. > > I think those tests must have been written by C programers ;-) > >> WBApi it is then. > > Thanks.There is a slight typo in that the file is WBapi.java not WBApi.java Fixed. I also re-ran JPRT to verify that it still builds on all platforms and found that the size of a region in G1 had changed to size_t, so I added a cast to jint (region sizes of >2G seems to be unreasonable). I also tried with Jim Melvin's patch to run OS X and verified that "wbapitest" works on OS X as well. /Mikael > > David > ----- > >> >>> >>> --- >>> >>> test/Makefile: does wbapitest need to be added to the phoney list? >> >> Yes, fixed. >> >> New webrev at: >> http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.3/ >> Incremental at: >> http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.2-to-3/webrev/ >> >> /Mikael >> >>> >>> --- >>> >>> Cheers, >>> David >>> ----- >>> >>> >>> On 11/01/2012 5:27 AM, Mikael Gerdin wrote: >>>> Hi all >>>> >>>> Back from vacations now with an updated version of the webrev based on >>>> the feedback received in this thread. >>>> Changes include: >>>> * removed install target from makefiles >>>> * renamed flag form EnableWhiteBoxAPI to remove redundant Enable >>>> * installs wb.jar into jre/lib and made -XX:+WhiteBoxAPI add wb.jar to >>>> the boot class path from inside the VM. >>>> >>>> http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.2/ >>>> >>>> Thanks >>>> Mikael Gerdin >>>> >>>> On 2011-11-29 17:04, Mikael Gerdin wrote: >>>>> Hi >>>>> >>>>> I've been working on a white box testing API for HotSpot in order to >>>>> allow for improved precision in vm testing. >>>>> >>>>> The basic idea is to open up the possibility for tests written in Java >>>>> to call native methods which query or poke the vm in some way. >>>>> >>>>> The API is accessible by using the class sun/hotspot/WhiteBox which is >>>>> not intended to be available in public builds. >>>>> In order to allow the WhiteBox class access to the VM the >>>>> registerNatives function is linked to JVM_RegisterWhiteBoxMethods. >>>>> That >>>>> function then links all the implementation functions using normal JNI >>>>> RegisterNatives. >>>>> >>>>> The API is not meant to be used by end users for any intent or purpose >>>>> and as such it is both guarded by "-XX:+UnlockDiagnosticVMOptions >>>>> -XX:+EnableWhiteboxAPI" and the fact that the class files will not be >>>>> present in an end user build of a JDK. >>>>> If the VM crashes after this API has been accessed a note will be >>>>> written in the hs_err file to signal that the API has been used. >>>>> >>>>> Webrev: >>>>> http://cr.openjdk.java.net/~stefank/mgerdin/wbapi.0/webrev/ >>>>> (thanks to stefank for hosting my webrev :) >>>>> >>>>> CR: >>>>> I'll file a CR tomorrow. >>>>> >>>>> Change comments: >>>>> >>>>> make/jprt.properties >>>>> >>>>> Add a test target to make sure that the API is available on all >>>>> supported platforms >>>>> >>>>> make/** >>>>> >>>>> Makefile changes to build the class sun/hotspot/WhiteBox, put it in a >>>>> JAR file and copy it to the jre/lib/endorsed directory in the export >>>>> targets. >>>>> The BSD makefile changes are not tested since I don't have access to >>>>> any >>>>> BSD/OSX machine to test them on. >>>>> >>>>> src/share/vm/prims/nativeLookup.cpp >>>>> >>>>> Special-case the method sun/hotspot/WhiteBox/registerNatives and >>>>> link it >>>>> to JVM_RegisterWhiteBoxMethods >>>>> >>>>> src/share/vm/prims/whitebox.* >>>>> >>>>> The implementation of the white box API. The actual API functions are >>>>> only examples of what we want to be able to do using the API. >>>>> >>>>> src/share/vm/runtime/globals.hpp >>>>> >>>>> Add the command line flag >>>>> >>>>> src/share/vm/utilities/vmError.cpp >>>>> >>>>> Print a message in hs_err files when white box API has been used. >>>>> >>>>> test/Makefile >>>>> >>>>> Add a makefile test target for the white box API test >>>>> >>>>> test/sanity/wbapi.java >>>>> >>>>> JTreg test to ensure that the API works. >>>>> >>>>> >>>>> Thanks >>>>> /Mikael Gerdin >>>> From mikael.gerdin at oracle.com Tue Jan 17 07:36:10 2012 From: mikael.gerdin at oracle.com (Mikael Gerdin) Date: Tue, 17 Jan 2012 16:36:10 +0100 Subject: Review request: White box testing API for HotSpot In-Reply-To: <4F15951D.7080508@oracle.com> References: <4ED50287.3070102@oracle.com> <4F0C910A.5070205@oracle.com> <4F0E8B10.3030600@oracle.com> <4F0F0850.8050402@oracle.com> <4F0F6605.5080900@oracle.com> <4F15951D.7080508@oracle.com> Message-ID: <4F15956A.2000300@oracle.com> .. and the new webrev is at http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.4/ On 2012-01-17 16:34, Mikael Gerdin wrote: > David, > > On 2012-01-13 00:00, David Holmes wrote: >> Hi Mikael, >> >> On 13/01/2012 2:20 AM, Mikael Gerdin wrote: >>>> wbapi.java: normal Java naming style is to use camel-case for class >>>> names. Though as WB is itself an acronym I'd be okay with WBApi. In >>>> fact >>>> I'd be happy with anything other than initial lower-case :) >>> >>> Many of our existing tests have lower-case names so I guess I thought >>> that was some sort of convention, it does not really matter to me. >> >> I think those tests must have been written by C programers ;-) >> >>> WBApi it is then. >> >> Thanks.There is a slight typo in that the file is WBapi.java not >> WBApi.java > > Fixed. > > I also re-ran JPRT to verify that it still builds on all platforms and > found that the size of a region in G1 had changed to size_t, so I added > a cast to jint (region sizes of >2G seems to be unreasonable). > > I also tried with Jim Melvin's patch to run OS X and verified that > "wbapitest" works on OS X as well. > > /Mikael > >> >> David >> ----- >> >>> >>>> >>>> --- >>>> >>>> test/Makefile: does wbapitest need to be added to the phoney list? >>> >>> Yes, fixed. >>> >>> New webrev at: >>> http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.3/ >>> Incremental at: >>> http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.2-to-3/webrev/ >>> >>> /Mikael >>> >>>> >>>> --- >>>> >>>> Cheers, >>>> David >>>> ----- >>>> >>>> >>>> On 11/01/2012 5:27 AM, Mikael Gerdin wrote: >>>>> Hi all >>>>> >>>>> Back from vacations now with an updated version of the webrev based on >>>>> the feedback received in this thread. >>>>> Changes include: >>>>> * removed install target from makefiles >>>>> * renamed flag form EnableWhiteBoxAPI to remove redundant Enable >>>>> * installs wb.jar into jre/lib and made -XX:+WhiteBoxAPI add wb.jar to >>>>> the boot class path from inside the VM. >>>>> >>>>> http://cr.openjdk.java.net/~mgerdin/wbapi/webrev.2/ >>>>> >>>>> Thanks >>>>> Mikael Gerdin >>>>> >>>>> On 2011-11-29 17:04, Mikael Gerdin wrote: >>>>>> Hi >>>>>> >>>>>> I've been working on a white box testing API for HotSpot in order to >>>>>> allow for improved precision in vm testing. >>>>>> >>>>>> The basic idea is to open up the possibility for tests written in >>>>>> Java >>>>>> to call native methods which query or poke the vm in some way. >>>>>> >>>>>> The API is accessible by using the class sun/hotspot/WhiteBox >>>>>> which is >>>>>> not intended to be available in public builds. >>>>>> In order to allow the WhiteBox class access to the VM the >>>>>> registerNatives function is linked to JVM_RegisterWhiteBoxMethods. >>>>>> That >>>>>> function then links all the implementation functions using normal JNI >>>>>> RegisterNatives. >>>>>> >>>>>> The API is not meant to be used by end users for any intent or >>>>>> purpose >>>>>> and as such it is both guarded by "-XX:+UnlockDiagnosticVMOptions >>>>>> -XX:+EnableWhiteboxAPI" and the fact that the class files will not be >>>>>> present in an end user build of a JDK. >>>>>> If the VM crashes after this API has been accessed a note will be >>>>>> written in the hs_err file to signal that the API has been used. >>>>>> >>>>>> Webrev: >>>>>> http://cr.openjdk.java.net/~stefank/mgerdin/wbapi.0/webrev/ >>>>>> (thanks to stefank for hosting my webrev :) >>>>>> >>>>>> CR: >>>>>> I'll file a CR tomorrow. >>>>>> >>>>>> Change comments: >>>>>> >>>>>> make/jprt.properties >>>>>> >>>>>> Add a test target to make sure that the API is available on all >>>>>> supported platforms >>>>>> >>>>>> make/** >>>>>> >>>>>> Makefile changes to build the class sun/hotspot/WhiteBox, put it in a >>>>>> JAR file and copy it to the jre/lib/endorsed directory in the export >>>>>> targets. >>>>>> The BSD makefile changes are not tested since I don't have access to >>>>>> any >>>>>> BSD/OSX machine to test them on. >>>>>> >>>>>> src/share/vm/prims/nativeLookup.cpp >>>>>> >>>>>> Special-case the method sun/hotspot/WhiteBox/registerNatives and >>>>>> link it >>>>>> to JVM_RegisterWhiteBoxMethods >>>>>> >>>>>> src/share/vm/prims/whitebox.* >>>>>> >>>>>> The implementation of the white box API. The actual API functions are >>>>>> only examples of what we want to be able to do using the API. >>>>>> >>>>>> src/share/vm/runtime/globals.hpp >>>>>> >>>>>> Add the command line flag >>>>>> >>>>>> src/share/vm/utilities/vmError.cpp >>>>>> >>>>>> Print a message in hs_err files when white box API has been used. >>>>>> >>>>>> test/Makefile >>>>>> >>>>>> Add a makefile test target for the white box API test >>>>>> >>>>>> test/sanity/wbapi.java >>>>>> >>>>>> JTreg test to ensure that the API works. >>>>>> >>>>>> >>>>>> Thanks >>>>>> /Mikael Gerdin >>>>> From daniel.daugherty at oracle.com Tue Jan 17 09:47:37 2012 From: daniel.daugherty at oracle.com (Daniel D. Daugherty) Date: Tue, 17 Jan 2012 10:47:37 -0700 Subject: RFR (XS): 7126732: MAC: Require Mac OS X builds/tests for JPRT integrate jobs for HotSpot In-Reply-To: <4F149ABA.3000708@oracle.com> References: <4F11B537.2000504@oracle.com> <4F13F18E.1080300@oracle.com> <4F149ABA.3000708@oracle.com> Message-ID: <4F15B439.1030604@oracle.com> Thumbs up! Dan On 1/16/12 2:46 PM, James Melvin wrote: > Hi Mikael, > > Nice find! I've added the missing line to add internalvmtests to each > JPRT run on Mac OS X. New webrev posted and JPRT test job complete. > > WEBREV: http://cr.openjdk.java.net/~jmelvin/7126732/webrev.01 > JPRT: > http://prt-web.us.oracle.com//archive/2012/01/2012-01-16-172504.jmelvin.7126732 > > Thanks! > > Jim > > > > On 1/16/12 4:44 AM, Mikael Gerdin wrote: >> Hi Jim >> >> on line 575, is there any particular reason that you don't add macosx to >> jprt.make.rule.test.targets.standard.internalvmtests? >> >> /Mikael >> >> On 2012-01-14 18:02, James Melvin wrote: >>> Greetings, >>> >>> We're ready to require HotSpot builds on Mac OS X for JPRT integrate >>> jobs. There are 3 mac-minis in each queue. Build/Test times are short >>> relative to other platforms. Uses the stable Linux testlist for now. >>> >>> http://cr.openjdk.java.net/~jmelvin/7126732/webrev.00 >>> >>> Tested with *several* JPRT submissions for other bugfixes. I'd like to >>> integrate this change right after the current snapshot window. >>> >>> Feedback welcome. >>> >>> Thanks, >>> >>> Jim From vladimir.kozlov at oracle.com Tue Jan 17 10:02:19 2012 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Tue, 17 Jan 2012 10:02:19 -0800 Subject: Detecting range check elimination with PrintAssembly In-Reply-To: References:

<4F135374.8020004@oracle.com> <4F149D88.2060305@oracle.com> <4F14B68A.6020808@oracle.com> <4F14FEA1.6020704@oracle.com> Message-ID: <4F15B7AB.2050107@oracle.com> Vitaly Davidovich wrote: > Vladimir, thanks for that -- very useful and interesting to know. So I > take it a compilation after initial deopt may be less aggressive than > 1st time around simply because the accumulated MD might be in a state > such that it guides optimization in a different direction, is that right? Yes, it is right, especially for inlining dynamic calls based on target profiling when after some time an other method is called. We also have limit how many times method could be recompiled after which the method will be only interpreted. Also we have limit on how many uncommon traps (deoptimizations) happened in method after which we stop generating some optimizations which use uncommon traps. Usually it means generating slow paths in compiled code which we thought (based on original profiling) should not be executed (so we generated uncommon trap). As result compiled code size will be changed and inlining also. Note, it happens not immediately but only after significant number of deoptimizations: PerMethodTrapLimit is 100 and PerMethodRecompilationCutoff is 400. > > I guess how does one reason about the generated assembly when it's so > (possibly) dynamic? :) I've looked at the assembly in a product hotspot > via hsdis, but then you realize that you may be looking at just one > compilation of it, and it may be different at some point later. Also, > for something like Intel's VTune (or Amplifier) that support showing > JIT'd assembly, I guess it's the same issue? This is quite different, of > course, with .NET CLR's JIT because there's no profiling there -- 1st > time method is hit, it gets JIT'd and there's no code pitching (makes > looking at the assembly seem a bit more "reassuring"). VTune shows several compiled methods. We have our profiling tool on Solaris (Analizer) which do the same. Note that deoptimization and recompilation usually happen only during startup. So only latest compiled version used in long run. > > In general, would you say that the C2 compiler favors "over > aggressiveness" with deopt guards? Or does it try to avoid deopts and > only perform aggressive optimizations where the profile is quite > conclusive on type and/or counters? C2 does compilations as aggressive as possible. And it will be less aggressive only during later recompilations. Regards, Vladimir > > Thanks > > On Mon, Jan 16, 2012 at 11:52 PM, Vladimir Kozlov > > wrote: > > I need to be more clear. We can't get 100% correct profiling result > for counters. First, profiling structure (MethodData) are created > only after method is executed for some time (for Server compiler it > is 3000 invocations). It is done to avoid profiling short programs > since profiling will slowdown Interpreter by about 20%. This delayed > profiling introduce first discrepancy between counters. MD could be > also created when method has a hot loop (a lot of iterations) and > OSR compilation is requested. As result counters in the loop will > not correlate with counters before the loop. An other reason for not > precise profiling counters is execution of method by multiple > threads. We have only one MD structure per method which is updated > (not atomically for speed) by all such threads. And when method is > prepared for compilation compiler thread creates snapshot of MD > which could happened in the middle of method execution by java threads. > > We don't reset MD during deoptimization because we still need > previous values (in addition to counters we collect type > information). So counters are accumulated. And profiling is resumed > from deoptimization point which again adds discrepancy. > > I hope it explains why we can't trust 100% to profiling counters. > > Regards, > Vladimir > > > On 1/16/12 4:15 PM, Vitaly Davidovich wrote: > > Vladimir, thanks for the explanation and the code pointer. > > Intuitively, it would seem like a good idea to trust the profile > 100% if it reports the same value used 100% of the time > (I can see how anything less than 100%, even a very high > probability of same value, is not trustworthy) given sufficient > trips through the loop. Although I can see how an app may have > phases where same value is seen for a while before it's > switched, but that's where I thought deopt would help. There > must be a good chunk of code out there that doesn't know > at static compilation time the loop count (so can't use > compile-time constant), but at runtime the actual value doesn't > change for many many trips through the loop; I know I have code > like that in various places. > > What's the reason a compilation after deopt would not be as > aggressive as the 1st time? Is it because the profile > information may be "weaker" (i.e. more uncertainty in it)? I > thought the profile is completely reset after deopt, so I > would think if the loop is now executed with a different > "constant" value (e.g. in our example, instead of 3 it's now > 4), then the same optimizations will be applied (of course if > unrolling the loop is no longer advantageous due to a much > different value, I can see how different optimizations will be > applied). > > Thanks > > On Mon, Jan 16, 2012 at 6:45 PM, Vladimir Kozlov > > >> wrote: > > Vitaly, > > We do use profile_trip_cnt during loop unroll calculation but > not during fully unroll because we can't trust it 100% > since program's phase and number of iterations could change > after method is compiled. See policy_unroll() and > policy_maximally_unroll(): > > > http://hg.openjdk.java.net/____hsx/hotspot-comp/hotspot/file/____89d0a5d40008/src/share/vm/____opto/loopTransform.cpp > > > > > > > We could use deopt as you suggested but deoptimization is > double-edge sword, when method recompiled after > deoptimization some aggressive optimizations will not be > executed for it so the new generated code could be slower. > > Regards, > Vladimir > > > On 1/16/12 2:37 PM, Vitaly Davidovich wrote: > > Hi Vladimir, > > If x_col is always seen to be same value in the profile > shouldn't the loop be unrolled as well with some deopt > guard? Or > does this not participate in profiling? > > Thanks > > On Jan 16, 2012 4:57 PM, "Vladimir Kozlov" > > > > __orac__le.com > >>> wrote: > > > be different, because the expressions are similar. The > difference in runtime (due to cols being a compile-time > > constant) will be visible elsewhere. Is that right? If > so, where would I be able to detect this? > > In such situations we usually use some visual tools > to see difference between log outputs. At least you can use > 'diff'. You may need to replace instructions addresses in > outputs (number at the beginning of lines) with the same > value to match. There are few tricks you may use to > get similar PrintOptoAssembly output. Use next flags to > avoid > mixing output from program output and from 2 compiler > threads (flags stop program until a method is compiled > and run > only one compiler thread): > > -Xbatch -XX:CICompilerCount=1 > > Also add -XX:+PrintCompilation -XX:+PrintInlining to > see what method is compiled and inlined. Note that you > may see > similar output for individual methods but could be > big difference in compiled caller (computeAll()) method > where 2 > loop methods could be inlined. So you need to compare > all compiled methods. > > In general, to have constant as loop limit is always > win because some checks in generated code could be > avoided and > more optimizations could be done for such loops. Use > -XX:+TraceLoopOpts to see what loop optimizations are > done in > both cases. > > For example, in your code you set 'x_col = 3', as > result the next loop in > constructNearestClusterVector(______) will be > > > fully unrolled when this method is inlined into > computeAll() and x_col is replaced with '3': > > for(k = 0; k < x_col; k++) { > double tmp = x[i*x_col + k] - mu[j* mu_col > + k]; > dist += tmp * tmp; > } > > Vladimir > > On 1/16/12 1:39 AM, Manohar Jonnalagedda wrote: > > Hi Kris, Vladimir, > > thanks for both your responses. > > Second, your two test methods are different > so you can't directly compare them. method1() iterates > over rows > using > middle loop index 'j' and method2() uses > external loop index 'i'. Unless they are typos again. > > > You are right, these are indeed typos. As Kris > suggested, I have the code printed here: > http://pastebin.com/xRFD1Nt1. > The methods corresponding to method1, and method2 > are constructNearestClusterVector and > computeNewCentroids. Their > PrintOptoAssembly outputs are respectively at > http://pastebin.com/1evN8b3K and http://pastebin.com/FxkVWTD5 > > Also, it seems I have not explained myself > correctly. I am not trying to compare the performance of > method1 with > respect > to that of method2: method1 and method2 both run > in the same program. What I am trying to compare is their > performance > in two cases: > - when cols is a compile-time constant (much faster) > - when cols is a value determined at run-time > > If you are using jdk7 there are few flags you > can use to print loop optimizations information. They > need debug > version of VM but it is not problem for you, > I think, since you can use debug PrintOptoAssembly flag. > > -XX:+TraceLoopOpts prints all happened loop > optimizations and loop tree after each round of loop opts, > -XX:+TraceLoopPredicate prints RC information > when it is moved from a loop, > -XX:+TraceRangeLimitCheck prints additional > information for RC elimination optimization. > > > Thanks for these, I will have a look at what they > output. > > Fourth, range check expression in your > example is not what you think. RC expression should be next: > (i*stride+offset) where 'i' is loop variable, > 'stride' is constant and 'offset' is loop invariant. > > In your example 'offset' is (j * cols) since > it is loop invariant, 'k' is loop variable and stride > is '1' (one). > In both your methods RC will be moved out of > inner loop so the code for it will be the same. The only > difference in > these methods will be where and how (j * > cols) and (i * cols) expressions are calculated. > > > I'd guess it's the difference in locality > that made the difference in performance in your two tests. > > Thanks for the explanation. I understand from > the above that the assembly output in both cases > mentioned above > may not > be different, because the expressions are > similar. The difference in runtime (due to cols being a > compile-time > constant) > will be visible elsewhere. Is that right? If so, > where would I be able to detect this? > > Cheers, > Manohar > > In your PrintOptoAssembly output snippet, > the instruction at 0x13e is a LoadRange, which loads > the range > from > the header > of an array: > > (from x86_64.ad > > ) > > > > // Load Range > instruct loadRange(rRegI dst, memory mem) > %{ > match(Set dst (LoadRange mem)); > > ins_cost(125); // XXX > format %{ "movl $dst, $mem\t# range" %} > opcode(0x8B); > ins_encode(REX_reg_mem(dst, mem), > OpcP, reg_mem(dst, mem)); > ins_pipe(ialu_reg_mem); > %} > > That's not a range check just yet; the > real check, if any, should come after the null check, in > the form of > comparing > something else with RSI. But you didn't > show what's after the null check, how RSI is used, so > it's hard > to say what > you're seeing in your example. > > As for the two test examples, could you > paste the entire source code, with the PrintOptoAssembly > output of > method1() and > method2() ? The first example looks > weird, maybe it's a typo but you're using "j < cols" as the loop > condition > for the > inner loop. > > > - Kris > > On Mon, Jan 16, 2012 at 1:59 AM, Manohar > Jonnalagedda > > > >__> > > >__>__> > > > >__> > > >__>__>__>> wrote: > > Hello, > > following this reference on Range > Check Elimination done by the Hotspot compiler [1], I was > keen in > knowing > how I > can detect whether range checks are > taking place in loops by inspecting output using the > PrintAssembly flag; > with > the old PrintOptoAssembly flag, I > have seen output such as the following, which I assume to > be range > checks : > > B11: # B73 B12 <- B10 Freq: 1.21365 > 139 movq RAX, [rsp + #24] # spill > 13e movl RSI, [RAX + #12 > (8-bit)] # range > 141 NullCheck RAX > > What is the equivalent with the new > PrintAssembly flag (using hsdis)? > > Moreover, as stated on the wiki page > [1], loops are optimized if the stride is a compile-time > constant. I > performed > a few tests on a kmeans program, with > 3 nested loops, having the following (high-level) > structure: > > === > void method1(){ > //loop 1 > for(int i = 0; i< rows1; i++){ > //... > for(int j = 0; j< rows2; j++){ > //... > for(int k = 0; j < cols; k++){ > array[j * cols + k] = //...} > } > } > } > > void method2(){ > //loop 2 > for(int i =0; i < rows1; i++){ > for(int j=0 ; i< rows2; j++){ > for(int k=0 ; k< cols; k++){ > array[i*cols+k] = //... > } > } > } > } > > void main(){ > > do{ > method1(); method2(); > }while(!converged) > > } > ==== > > In the first test, cols is an int > whose value is determined at runtime (by reading a file), > in the > second > test, it > is given as a compile-time > constant(3). In the second test, there is a */significant*/ speed-up > (around 40%). > > However, when studying the diff of > the output of PrintOptoAssembly for both method1 and method2, > there is no > difference (apart from slight value > changes in frequency). Would you have any hints as to > where I > could look for > differences? > > Thanks a lot, > Manohar > > [1] > https://wikis.oracle.com/______display/HotSpotInternals/______RangeCheckElimination > > > > > > > > > > >> > > > > As Kris pointed you need to fix your example: > > > > > -- > Vitaly > 617-548-7007 (mobile) > > > > > -- > Vitaly > 617-548-7007 (mobile) From vitalyd at gmail.com Tue Jan 17 10:10:05 2012 From: vitalyd at gmail.com (Vitaly Davidovich) Date: Tue, 17 Jan 2012 13:10:05 -0500 Subject: Detecting range check elimination with PrintAssembly In-Reply-To: <4F15B7AB.2050107@oracle.com> References:

<4F135374.8020004@oracle.com> <4F149D88.2060305@oracle.com> <4F14B68A.6020808@oracle.com> <4F14FEA1.6020704@oracle.com> <4F15B7AB.2050107@oracle.com> Message-ID: Awesome info - thanks very much for taking the time to answer. Sent from my phone On Jan 17, 2012 1:02 PM, "Vladimir Kozlov" wrote: > Vitaly Davidovich wrote: > >> Vladimir, thanks for that -- very useful and interesting to know. So I >> take it a compilation after initial deopt may be less aggressive than 1st >> time around simply because the accumulated MD might be in a state such that >> it guides optimization in a different direction, is that right? >> > > Yes, it is right, especially for inlining dynamic calls based on target > profiling when after some time an other method is called. We also have > limit how many times method could be recompiled after which the method will > be only interpreted. Also we have limit on how many uncommon traps > (deoptimizations) happened in method after which we stop generating some > optimizations which use uncommon traps. Usually it means generating slow > paths in compiled code which we thought (based on original profiling) > should not be executed (so we generated uncommon trap). As result compiled > code size will be changed and inlining also. Note, it happens not > immediately but only after significant number of deoptimizations: > PerMethodTrapLimit is 100 and PerMethodRecompilationCutoff is 400. > > >> I guess how does one reason about the generated assembly when it's so >> (possibly) dynamic? :) I've looked at the assembly in a product hotspot via >> hsdis, but then you realize that you may be looking at just one compilation >> of it, and it may be different at some point later. Also, for something >> like Intel's VTune (or Amplifier) that support showing JIT'd assembly, I >> guess it's the same issue? This is quite different, of course, with .NET >> CLR's JIT because there's no profiling there -- 1st time method is hit, it >> gets JIT'd and there's no code pitching (makes looking at the assembly seem >> a bit more "reassuring"). >> > > VTune shows several compiled methods. We have our profiling tool on > Solaris (Analizer) which do the same. Note that deoptimization and > recompilation usually happen only during startup. So only latest compiled > version used in long run. > > >> In general, would you say that the C2 compiler favors "over >> aggressiveness" with deopt guards? Or does it try to avoid deopts and only >> perform aggressive optimizations where the profile is quite conclusive on >> type and/or counters? >> > > C2 does compilations as aggressive as possible. And it will be less > aggressive only during later recompilations. > > Regards, > Vladimir > > >> Thanks >> >> On Mon, Jan 16, 2012 at 11:52 PM, Vladimir Kozlov < >> vladimir.kozlov at oracle.com >> >> wrote: >> >> I need to be more clear. We can't get 100% correct profiling result >> for counters. First, profiling structure (MethodData) are created >> only after method is executed for some time (for Server compiler it >> is 3000 invocations). It is done to avoid profiling short programs >> since profiling will slowdown Interpreter by about 20%. This delayed >> profiling introduce first discrepancy between counters. MD could be >> also created when method has a hot loop (a lot of iterations) and >> OSR compilation is requested. As result counters in the loop will >> not correlate with counters before the loop. An other reason for not >> precise profiling counters is execution of method by multiple >> threads. We have only one MD structure per method which is updated >> (not atomically for speed) by all such threads. And when method is >> prepared for compilation compiler thread creates snapshot of MD >> which could happened in the middle of method execution by java threads. >> >> We don't reset MD during deoptimization because we still need >> previous values (in addition to counters we collect type >> information). So counters are accumulated. And profiling is resumed >> from deoptimization point which again adds discrepancy. >> >> I hope it explains why we can't trust 100% to profiling counters. >> >> Regards, >> Vladimir >> >> >> On 1/16/12 4:15 PM, Vitaly Davidovich wrote: >> >> Vladimir, thanks for the explanation and the code pointer. >> >> Intuitively, it would seem like a good idea to trust the profile >> 100% if it reports the same value used 100% of the time >> (I can see how anything less than 100%, even a very high >> probability of same value, is not trustworthy) given sufficient >> trips through the loop. Although I can see how an app may have >> phases where same value is seen for a while before it's >> switched, but that's where I thought deopt would help. There >> must be a good chunk of code out there that doesn't know >> at static compilation time the loop count (so can't use >> compile-time constant), but at runtime the actual value doesn't >> change for many many trips through the loop; I know I have code >> like that in various places. >> >> What's the reason a compilation after deopt would not be as >> aggressive as the 1st time? Is it because the profile >> information may be "weaker" (i.e. more uncertainty in it)? I >> thought the profile is completely reset after deopt, so I >> would think if the loop is now executed with a different >> "constant" value (e.g. in our example, instead of 3 it's now >> 4), then the same optimizations will be applied (of course if >> unrolling the loop is no longer advantageous due to a much >> different value, I can see how different optimizations will be >> applied). >> >> Thanks >> >> On Mon, Jan 16, 2012 at 6:45 PM, Vladimir Kozlov >> >> > >> >> >>> >> wrote: >> >> Vitaly, >> >> We do use profile_trip_cnt during loop unroll calculation but >> not during fully unroll because we can't trust it 100% >> since program's phase and number of iterations could change >> after method is compiled. See policy_unroll() and >> policy_maximally_unroll(): >> >> http://hg.openjdk.java.net/___** >> _hsx/hotspot-comp/hotspot/**file/____89d0a5d40008/src/** >> share/vm/____opto/**loopTransform.cpp >> > __89d0a5d40008/src/share/vm/__**opto/loopTransform.cpp >> > >> >> > hsx/hotspot-comp/hotspot/file/**__89d0a5d40008/src/share/vm/__** >> opto/loopTransform.cpp >> > 89d0a5d40008/src/share/vm/**opto/loopTransform.cpp >> >> >> >> We could use deopt as you suggested but deoptimization is >> double-edge sword, when method recompiled after >> deoptimization some aggressive optimizations will not be >> executed for it so the new generated code could be slower. >> >> Regards, >> Vladimir >> >> >> On 1/16/12 2:37 PM, Vitaly Davidovich wrote: >> >> Hi Vladimir, >> >> If x_col is always seen to be same value in the profile >> shouldn't the loop be unrolled as well with some deopt >> guard? Or >> does this not participate in profiling? >> >> Thanks >> >> On Jan 16, 2012 4:57 PM, "Vladimir Kozlov" >> >> > >> >> >> >> >> > __ora**c__le.com < >> http://oracle.com> >> >> >>>> >> wrote: >> >> > be different, because the expressions are similar. The >> difference in runtime (due to cols being a compile-time >> > constant) will be visible elsewhere. Is that right? If >> so, where would I be able to detect this? >> >> In such situations we usually use some visual tools >> to see difference between log outputs. At least you can use >> 'diff'. You may need to replace instructions addresses in >> outputs (number at the beginning of lines) with the same >> value to match. There are few tricks you may use to >> get similar PrintOptoAssembly output. Use next flags to >> avoid >> mixing output from program output and from 2 compiler >> threads (flags stop program until a method is compiled >> and run >> only one compiler thread): >> >> -Xbatch -XX:CICompilerCount=1 >> >> Also add -XX:+PrintCompilation -XX:+PrintInlining to >> see what method is compiled and inlined. Note that you >> may see >> similar output for individual methods but could be >> big difference in compiled caller (computeAll()) method >> where 2 >> loop methods could be inlined. So you need to compare >> all compiled methods. >> >> In general, to have constant as loop limit is always >> win because some checks in generated code could be >> avoided and >> more optimizations could be done for such loops. Use >> -XX:+TraceLoopOpts to see what loop optimizations are >> done in >> both cases. >> >> For example, in your code you set 'x_col = 3', as >> result the next loop in >> constructNearestClusterVector(**______) will be >> >> >> fully unrolled when this method is inlined into >> computeAll() and x_col is replaced with '3': >> >> for(k = 0; k < x_col; k++) { >> double tmp = x[i*x_col + k] - mu[j* mu_col >> + k]; >> dist += tmp * tmp; >> } >> >> Vladimir >> >> On 1/16/12 1:39 AM, Manohar Jonnalagedda wrote: >> >> Hi Kris, Vladimir, >> >> thanks for both your responses. >> >> Second, your two test methods are different >> so you can't directly compare them. method1() iterates >> over rows >> using >> middle loop index 'j' and method2() uses >> external loop index 'i'. Unless they are typos again. >> >> >> You are right, these are indeed typos. As Kris >> suggested, I have the code printed here: >> http://pastebin.com/xRFD1Nt1. >> The methods corresponding to method1, and method2 >> are constructNearestClusterVector and >> computeNewCentroids. Their >> PrintOptoAssembly outputs are respectively at >> http://pastebin.com/1evN8b3K and http://pastebin.com/FxkVWTD5 >> >> Also, it seems I have not explained myself >> correctly. I am not trying to compare the performance of >> method1 with >> respect >> to that of method2: method1 and method2 both run >> in the same program. What I am trying to compare is their >> performance >> in two cases: >> - when cols is a compile-time constant (much faster) >> - when cols is a value determined at run-time >> >> If you are using jdk7 there are few flags you >> can use to print loop optimizations information. They >> need debug >> version of VM but it is not problem for you, >> I think, since you can use debug PrintOptoAssembly flag. >> >> -XX:+TraceLoopOpts prints all happened loop >> optimizations and loop tree after each round of loop opts, >> -XX:+TraceLoopPredicate prints RC information >> when it is moved from a loop, >> -XX:+TraceRangeLimitCheck prints additional >> information for RC elimination optimization. >> >> >> Thanks for these, I will have a look at what they >> output. >> >> Fourth, range check expression in your >> example is not what you think. RC expression should be next: >> (i*stride+offset) where 'i' is loop variable, >> 'stride' is constant and 'offset' is loop invariant. >> >> In your example 'offset' is (j * cols) since >> it is loop invariant, 'k' is loop variable and stride >> is '1' (one). >> In both your methods RC will be moved out of >> inner loop so the code for it will be the same. The only >> difference in >> these methods will be where and how (j * >> cols) and (i * cols) expressions are calculated. >> >> >> I'd guess it's the difference in locality >> that made the difference in performance in your two tests. >> >> Thanks for the explanation. I understand from >> the above that the assembly output in both cases >> mentioned above >> may not >> be different, because the expressions are >> similar. The difference in runtime (due to cols being a >> compile-time >> constant) >> will be visible elsewhere. Is that right? If so, >> where would I be able to detect this? >> >> Cheers, >> Manohar >> >> In your PrintOptoAssembly output snippet, >> the instruction at 0x13e is a LoadRange, which loads >> the range >> from >> the header >> of an array: >> >> (from x86_64.ad >> >> ) >> >> >> >> // Load Range >> instruct loadRange(rRegI dst, memory mem) >> %{ >> match(Set dst (LoadRange mem)); >> >> ins_cost(125); // XXX >> format %{ "movl $dst, $mem\t# range" >> %} >> opcode(0x8B); >> ins_encode(REX_reg_mem(dst, mem), >> OpcP, reg_mem(dst, mem)); >> ins_pipe(ialu_reg_mem); >> %} >> >> That's not a range check just yet; the >> real check, if any, should come after the null check, in >> the form of >> comparing >> something else with RSI. But you didn't >> show what's after the null check, how RSI is used, so >> it's hard >> to say what >> you're seeing in your example. >> >> As for the two test examples, could you >> paste the entire source code, with the PrintOptoAssembly >> output of >> method1() and >> method2() ? The first example looks >> weird, maybe it's a typo but you're using "j < cols" as the loop >> condition >> for the >> inner loop. >> >> >> - Kris >> >> On Mon, Jan 16, 2012 at 1:59 AM, Manohar >> Jonnalagedda >> > >**> >> > > **>__> > >> > **> > > **>__>__> >> > > **> > >> > **>__> > > **> >> > > **>__>__>__>> wrote: >> >> Hello, >> >> following this reference on Range >> Check Elimination done by the Hotspot compiler [1], I was >> keen in >> knowing >> how I >> can detect whether range checks are >> taking place in loops by inspecting output using the >> PrintAssembly flag; >> with >> the old PrintOptoAssembly flag, I >> have seen output such as the following, which I assume to >> be range >> checks : >> >> B11: # B73 B12 <- B10 Freq: 1.21365 >> 139 movq RAX, [rsp + #24] # >> spill >> 13e movl RSI, [RAX + #12 >> (8-bit)] # range >> 141 NullCheck RAX >> >> What is the equivalent with the new >> PrintAssembly flag (using hsdis)? >> >> Moreover, as stated on the wiki page >> [1], loops are optimized if the stride is a compile-time >> constant. I >> performed >> a few tests on a kmeans program, with >> 3 nested loops, having the following (high-level) >> structure: >> >> === >> void method1(){ >> //loop 1 >> for(int i = 0; i< rows1; i++){ >> //... >> for(int j = 0; j< rows2; j++){ >> //... >> for(int k = 0; j < cols; k++){ >> array[j * cols + k] = //...} >> } >> } >> } >> >> void method2(){ >> //loop 2 >> for(int i =0; i < rows1; i++){ >> for(int j=0 ; i< rows2; j++){ >> for(int k=0 ; k< cols; k++){ >> array[i*cols+k] = //... >> } >> } >> } >> } >> >> void main(){ >> >> do{ >> method1(); method2(); >> }while(!converged) >> >> } >> ==== >> >> In the first test, cols is an int >> whose value is determined at runtime (by reading a file), >> in the >> second >> test, it >> is given as a compile-time >> constant(3). In the second test, there is a */significant*/ >> speed-up >> (around 40%). >> >> However, when studying the diff of >> the output of PrintOptoAssembly for both method1 and method2, >> there is no >> difference (apart from slight value >> changes in frequency). Would you have any hints as to >> where I >> could look for >> differences? >> >> Thanks a lot, >> Manohar >> >> [1] >> https://wikis.oracle.com/_____**_display/HotSpotInternals/____** >> __RangeCheckElimination >> > RangeCheckElimination >> > >> > display/HotSpotInternals/____**RangeCheckElimination >> > RangeCheckElimination >> >> >> >> >> > display/HotSpotInternals/____**RangeCheckElimination >> > RangeCheckElimination >> > >> > display/HotSpotInternals/__**RangeCheckElimination >> > RangeCheckElimination >> >>> >> >> >> >> As Kris pointed you need to fix your example: >> >> >> >> >> -- >> Vitaly >> 617-548-7007 (mobile) >> >> >> >> >> -- >> Vitaly >> 617-548-7007 (mobile) >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.openjdk.java.net/pipermail/hotspot-dev/attachments/20120117/033303e0/attachment-0001.html From kelly.ohair at oracle.com Tue Jan 17 11:44:39 2012 From: kelly.ohair at oracle.com (Kelly O'Hair) Date: Tue, 17 Jan 2012 11:44:39 -0800 Subject: RFR (XS): 7126732: MAC: Require Mac OS X builds/tests for JPRT integrate jobs for HotSpot In-Reply-To: <4F11B537.2000504@oracle.com> References: <4F11B537.2000504@oracle.com> Message-ID: <4814BBC3-95A0-49D0-B3F0-157B0525E898@oracle.com> It seems to me that with this change, only jdk7 or jdk8 builds will work properly. Which is fine with me, but then it argues that all these lines that mention: jdk7b107 jdk7temp jdk6 jdk6perf jdk6u10 jdk6u14 jdk6u18 jdk6u20 ejdk7 ejdk6 should all just be deleted. I doubt that a 'jprt submit -release XXX' where XXX is any of the above releases will even work. -kto On Jan 14, 2012, at 9:02 AM, James Melvin wrote: > Greetings, > > We're ready to require HotSpot builds on Mac OS X for JPRT integrate > jobs. There are 3 mac-minis in each queue. Build/Test times are short > relative to other platforms. Uses the stable Linux testlist for now. > > http://cr.openjdk.java.net/~jmelvin/7126732/webrev.00 > > Tested with *several* JPRT submissions for other bugfixes. I'd like to > integrate this change right after the current snapshot window. > > Feedback welcome. > > Thanks, > > Jim From dalibor.topic at oracle.com Tue Jan 17 16:11:55 2012 From: dalibor.topic at oracle.com (Dalibor Topic) Date: Wed, 18 Jan 2012 01:11:55 +0100 Subject: Fwd: OpenJDK wikis have moved In-Reply-To: <4F160D49.3090107@oracle.com> References: <4F160D49.3090107@oracle.com> Message-ID: <4F160E4B.40409@oracle.com> -------- Original Message -------- Subject: OpenJDK wikis have moved Date: Wed, 18 Jan 2012 01:07:37 +0100 From: Dalibor Topic To: web-discuss at openjdk.java.net See http://robilad.livejournal.com/111187.html . cheers, dalibor topic -- Oracle Dalibor Topic | Java F/OSS Ambassador Phone: +494023646738 | Mobile: +491772664192 Oracle Java Platform Group ORACLE Deutschland B.V. & Co. KG | Nagelsweg 55 | 20097 Hamburg ORACLE Deutschland B.V. & Co. KG Hauptverwaltung: Riesstr. 25, D-80992 M?nchen Registergericht: Amtsgericht M?nchen, HRA 95603 Gesch?ftsf?hrer: J?rgen Kunz Komplement?rin: ORACLE Deutschland Verwaltung B.V. Hertogswetering 163/167, 3543 AS Utrecht, Niederlande Handelsregister der Handelskammer Midden-Niederlande, Nr. 30143697 Gesch?ftsf?hrer: Alexander van der Ven, Astrid Kepper, Val Maher Green Oracle Oracle is committed to developing practices and products that help protect the environment From david.holmes at oracle.com Tue Jan 17 19:06:55 2012 From: david.holmes at oracle.com (David Holmes) Date: Wed, 18 Jan 2012 13:06:55 +1000 Subject: RFR (XS): 7126732: MAC: Require Mac OS X builds/tests for JPRT integrate jobs for HotSpot In-Reply-To: <4814BBC3-95A0-49D0-B3F0-157B0525E898@oracle.com> References: <4F11B537.2000504@oracle.com> <4814BBC3-95A0-49D0-B3F0-157B0525E898@oracle.com> Message-ID: <4F16374F.7050709@oracle.com> On 18/01/2012 5:44 AM, Kelly O'Hair wrote: > It seems to me that with this change, only jdk7 or jdk8 builds will work properly. > Which is fine with me, but then it argues that all these lines that mention: > jdk7b107 > jdk7temp > jdk6 > jdk6perf > jdk6u10 > jdk6u14 > jdk6u18 > jdk6u20 > ejdk7 > ejdk6 > should all just be deleted. I doubt that a 'jprt submit -release XXX' where XXX is any of the > above releases will even work. Oops! Yes I missed that too. When we added the embedded targets we had to split things into two groups (standard and all) and only include the embedded targets in "all". Then we only used "all" for releases that supported embedded. This argues for removing macosx from the set of standard targets, and adding it to the set of all targets via another grouping. David ----- > -kto > > On Jan 14, 2012, at 9:02 AM, James Melvin wrote: > >> Greetings, >> >> We're ready to require HotSpot builds on Mac OS X for JPRT integrate >> jobs. There are 3 mac-minis in each queue. Build/Test times are short >> relative to other platforms. Uses the stable Linux testlist for now. >> >> http://cr.openjdk.java.net/~jmelvin/7126732/webrev.00 >> >> Tested with *several* JPRT submissions for other bugfixes. I'd like to >> integrate this change right after the current snapshot window. >> >> Feedback welcome. >> >> Thanks, >> >> Jim > From david.holmes at oracle.com Tue Jan 17 19:36:55 2012 From: david.holmes at oracle.com (David Holmes) Date: Wed, 18 Jan 2012 13:36:55 +1000 Subject: RFR (S): 7130391: Add a framework for vendor-specific validation control of setting command-line switches in Hotspot In-Reply-To: <4F156496.3060206@oracle.com> References: <5a181ad1-ef64-4b00-88ce-69829f2ce80b@default> <4F14E661.5080405@oracle.com> <4F156496.3060206@oracle.com> Message-ID: <4F163E57.5010401@oracle.com> Hi Robert, Comments inline ... On 17/01/2012 10:07 PM, Robert Ottenhag wrote: >> On 17/01/2012 12:04 PM, Robert Ottenhag wrote: >>> Webrev: http://cr.openjdk.java.net/~rottenha/7130391/webrev.00 >>> >>> This fix adds optional validation control to the setting of >>> command-line switches in Hotspot, and allows it to have >>> vendor-specific extensions if necessary. >> >> Does this imply that the Java management APIs (eg >> com.sun.management.VMOption) need to be changed to reflect these >> restrictions? Presently VMOptions are either writeable or not, but >> this makes them conditionally-writeable. > > No, since the Java management APIs already cares for conditional writes. > According to com.sun.management.HotSpotDiagnosticMXBean.setVMOption() it > will throw IllegalArgumentException if the new value is invalid. I think there is a significant difference between trying to set an invalid value and setting to a valid value at a time that it is not permitted. >> src/share/vm/runtime/globals_ext.hpp >> >> With all the >> >> inline bool Flag::is_valid_ext_T(T value, FlagValueOrigin origin) >> >> functions, is it necessary to include the type T in the function name? > > It is necessary if using type safe variants with T value as argument > since overloading does not differ between different typedef names that > resolves to the same native types, e.g. uintx and uint64_t are both > unsigned long int. > > I am considering a condensed variant that replaces T by void* instead, > and do the type casting based on the targeted flag, reducing the number > of functions. Ok. I thought there might be some conflict with the integral types. >> src/share/vm/runtime/globals.cpp >> >> The use of the guarantees seems wrong as it means an invalid option >> will trigger a VM crash rather than a clean exit during >> initialization. It seems to me that none of the code in arguments.cpp >> that uses the FLAG_SET_* macros (which in turn use the >> CommandLineFlagsEx functions you added the guarantees to) anticipates >> any possibility for failure. I think if you are going this path then >> you have no choice but to change the CommandLineFlagsEx methods to >> return bool and update the FLAG_SET macros to try and perform >> appropriate error handling. > > I see your point, and in theory such as VM crash could occur anytime > later in a JVM session if rarely running code would make use of > FLAG_SET_* to change the value of a VM flag to an invalid value or origin. > > Seems as if the options are either to > a) ignore validation tests for the FLAG_SET_* macros, and trust that > they always set valid values. This can be partly verified by static code > inspection by looking for any variables that actually have validation > logic associated to them (since the variable name is defined at compile > time), assuming one has access to all code, but it is not perfect in > case code for changing a variable with validation logic exists. > b) contain the error handling within the FLAG_SET_* macros, like using > guarantee(), but maybe exception logic can help? > c) require usage of the FLAG_SET_* macros to handle result codes and > pass it up the call chain. > > Also, the current macro FLAG_SET_DEFAULT does a direct write to the flag > value without going through AtPut(). This macro must be rewritten > to have validation control to close the holes. The current call format > will require all call sites to include type name as with > FLAG_SET_{CMDLINE,ERGO} has, or to use slower lookup by variable name. I think you touched on the real problem in your later email - really these flags/options and the ways you can interact with them should be encapsulated in objects. Each different flag can then define its valid values, whether it is "locked", "writeable" etc. But that means every use of those flags in the VM would need changing - which is indeed a very intrusive change. But I can't help but feel that we are going to far in what we are trying to do with these flags when they are in fact simple variables. Also I think we may be overcomplicating this. I don't see why we can't consider the uses of the flags at initialization time and runtime to be distinct use-cases and use different APIs to interact with them. For initialization we have the FLAGS_SET_* macros, and the end result is that we have a set of flags that are either at their default values or have been set to a valid value. I don't think we need to consider (as I believe the current proposal does) multiple settings of a given flag at initialization time ie: java -XX:+UseFoo ... -XX:-UseFoo ... -XX:+UseFoo should simply result in UseFoo==true. Even if we have stated that once UseFoo is turned on it can't be turned off again. To me that should only relate to true "dynamic" runtime setting of the flags. In which case only the management APIs need to be augmented to support this and we may be able to create "shadow" objects for flags we need to handle specially at runtime. David ----- > /Robert > >> >> David >> ----- >> >> >>> A simple use case for validation is a manageable flag whose current >>> value can not be less than the previous value, while a more complex >>> example may base the validation on multiple other flags, etc. >>> >>> Thanks, >>> >>> /Robert >>> > > From david.holmes at oracle.com Tue Jan 17 21:21:09 2012 From: david.holmes at oracle.com (David Holmes) Date: Wed, 18 Jan 2012 15:21:09 +1000 Subject: RFR (XS): 7126732: MAC: Require Mac OS X builds/tests for JPRT integrate jobs for HotSpot In-Reply-To: <4F16374F.7050709@oracle.com> References: <4F11B537.2000504@oracle.com> <4814BBC3-95A0-49D0-B3F0-157B0525E898@oracle.com> <4F16374F.7050709@oracle.com> Message-ID: <4F1656C5.8070308@oracle.com> On 18/01/2012 1:06 PM, David Holmes wrote: > On 18/01/2012 5:44 AM, Kelly O'Hair wrote: >> It seems to me that with this change, only jdk7 or jdk8 builds will >> work properly. >> Which is fine with me, but then it argues that all these lines that >> mention: >> jdk7b107 >> jdk7temp >> jdk6 >> jdk6perf >> jdk6u10 >> jdk6u14 >> jdk6u18 >> jdk6u20 >> ejdk7 >> ejdk6 >> should all just be deleted. I doubt that a 'jprt submit -release XXX' >> where XXX is any of the >> above releases will even work. > > Oops! Yes I missed that too. When we added the embedded targets we had > to split things into two groups (standard and all) and only include the > embedded targets in "all". Then we only used "all" for releases that > supported embedded. > > This argues for removing macosx from the set of standard targets, and > adding it to the set of all targets via another grouping. On further reflection it argues for cleaning up the obsolete release definitions. Is there any reason someone with the latest hotspot workspace would be trying to do a JDK6 build in JPRT? Doesn't seem reasonable to me given we are not using latest hotspot in JDK6. And the two jdk7 releases mentioned were only temporary fixes for something or other anyway. The ejdk releases are obsolete as well. David > David > ----- > >> -kto >> >> On Jan 14, 2012, at 9:02 AM, James Melvin wrote: >> >>> Greetings, >>> >>> We're ready to require HotSpot builds on Mac OS X for JPRT integrate >>> jobs. There are 3 mac-minis in each queue. Build/Test times are short >>> relative to other platforms. Uses the stable Linux testlist for now. >>> >>> http://cr.openjdk.java.net/~jmelvin/7126732/webrev.00 >>> >>> Tested with *several* JPRT submissions for other bugfixes. I'd like to >>> integrate this change right after the current snapshot window. >>> >>> Feedback welcome. >>> >>> Thanks, >>> >>> Jim >> From robert.ottenhag at oracle.com Wed Jan 18 06:33:13 2012 From: robert.ottenhag at oracle.com (Robert Ottenhag) Date: Wed, 18 Jan 2012 15:33:13 +0100 Subject: RFR (S): 7130391: Add a framework for vendor-specific validation control of setting command-line switches in Hotspot In-Reply-To: <4F163E57.5010401@oracle.com> References: <5a181ad1-ef64-4b00-88ce-69829f2ce80b@default> <4F14E661.5080405@oracle.com> <4F156496.3060206@oracle.com> <4F163E57.5010401@oracle.com> Message-ID: <4F16D829.7090201@oracle.com> Hi David, More comments inline... On 01/18/2012 04:36 AM, David Holmes wrote: > Hi Robert, > > Comments inline ... > > On 17/01/2012 10:07 PM, Robert Ottenhag wrote: >>> On 17/01/2012 12:04 PM, Robert Ottenhag wrote: >>>> Webrev: http://cr.openjdk.java.net/~rottenha/7130391/webrev.00 >>>> >>>> This fix adds optional validation control to the setting of >>>> command-line switches in Hotspot, and allows it to have >>>> vendor-specific extensions if necessary. >>> >>> Does this imply that the Java management APIs (eg >>> com.sun.management.VMOption) need to be changed to reflect these >>> restrictions? Presently VMOptions are either writeable or not, but >>> this makes them conditionally-writeable. >> >> No, since the Java management APIs already cares for conditional writes. >> According to com.sun.management.HotSpotDiagnosticMXBean.setVMOption() it >> will throw IllegalArgumentException if the new value is invalid. > > I think there is a significant difference between trying to set an > invalid value and setting to a valid value at a time that it is not > permitted. Agreed. There is a semantic difference here that can be confusing. > >>> src/share/vm/runtime/globals.cpp >>> >>> The use of the guarantees seems wrong as it means an invalid option >>> will trigger a VM crash rather than a clean exit during >>> initialization. It seems to me that none of the code in arguments.cpp >>> that uses the FLAG_SET_* macros (which in turn use the >>> CommandLineFlagsEx functions you added the guarantees to) anticipates >>> any possibility for failure. I think if you are going this path then >>> you have no choice but to change the CommandLineFlagsEx methods to >>> return bool and update the FLAG_SET macros to try and perform >>> appropriate error handling. >> >> I see your point, and in theory such as VM crash could occur anytime >> later in a JVM session if rarely running code would make use of >> FLAG_SET_* to change the value of a VM flag to an invalid value or >> origin. >> >> Seems as if the options are either to >> a) ignore validation tests for the FLAG_SET_* macros, and trust that >> they always set valid values. This can be partly verified by static code >> inspection by looking for any variables that actually have validation >> logic associated to them (since the variable name is defined at compile >> time), assuming one has access to all code, but it is not perfect in >> case code for changing a variable with validation logic exists. >> b) contain the error handling within the FLAG_SET_* macros, like using >> guarantee(), but maybe exception logic can help? >> c) require usage of the FLAG_SET_* macros to handle result codes and >> pass it up the call chain. >> >> Also, the current macro FLAG_SET_DEFAULT does a direct write to the flag >> value without going through AtPut(). This macro must be rewritten >> to have validation control to close the holes. The current call format >> will require all call sites to include type name as with >> FLAG_SET_{CMDLINE,ERGO} has, or to use slower lookup by variable name. > > I think you touched on the real problem in your later email - really > these flags/options and the ways you can interact with them should be > encapsulated in objects. Each different flag can then define its valid > values, whether it is "locked", "writeable" etc. But that means every > use of those flags in the VM would need changing - which is indeed a > very intrusive change. We might be able to keep existing usage by encapsulating it within a class that overrides the assignment and type conversion operators, but as you say it is a little more work than expected right now. > > But I can't help but feel that we are going to far in what we are > trying to do with these flags when they are in fact simple variables. > > Also I think we may be overcomplicating this. I don't see why we can't > consider the uses of the flags at initialization time and runtime to > be distinct use-cases and use different APIs to interact with them. > For initialization we have the FLAGS_SET_* macros, and the end result > is that we have a set of flags that are either at their default values > or have been set to a valid value. I don't think we need to consider > (as I believe the current proposal does) multiple settings of a given > flag at initialization time ie: > > java -XX:+UseFoo ... -XX:-UseFoo ... -XX:+UseFoo > > should simply result in UseFoo==true. Even if we have stated that once > UseFoo is turned on it can't be turned off again. To me that should > only relate to true "dynamic" runtime setting of the flags. In which > case only the management APIs need to be augmented to support this and > we may be able to create "shadow" objects for flags we need to handle > specially at runtime. The problem with the FLAG_SET_* macros is that they can also be used after initialization, or for that matter direct variable assignment can be also done, to bypass any validation logic that is observed by the dynamic interfaces. However, at this point I am also leaning towards a design that only focuses on the dynamic setting, which will not change existing behavior of command line flags parsing, i.e. any variable is writable with any value during the initialization phase. > > David > ----- > >> /Robert >> >>> >>> David >>> ----- >>> >>> >>>> A simple use case for validation is a manageable flag whose current >>>> value can not be less than the previous value, while a more complex >>>> example may base the validation on multiple other flags, etc. >>>> >>>> Thanks, >>>> >>>> /Robert >>>> >> >> -- Oracle Robert Ottenhag | Senior Member of Technical Staff Phone: +46850630961 | Fax: +46850630911 | Mobile: +46707106161 Oracle Java HotSpot Virtual Machine ORACLE Sweden | Folkungagatan 122 | SE-116 30 Stockholm Oracle Svenska AB, Kronborgsgr?nd 17, S-164 28 KISTA, reg.no. 556254-6746 Green Oracle Oracle is committed to developing practices and products that help protect the environment -- From robert.ottenhag at oracle.com Wed Jan 18 07:00:30 2012 From: robert.ottenhag at oracle.com (Robert Ottenhag) Date: Wed, 18 Jan 2012 16:00:30 +0100 Subject: RFR (S): 7130391: Add a framework for vendor-specific validation control of setting command-line switches in Hotspot In-Reply-To: <4F157A91.8080907@oracle.com> References: <5a181ad1-ef64-4b00-88ce-69829f2ce80b@default> <4F14E661.5080405@oracle.com> <4F156496.3060206@oracle.com> <4F157A91.8080907@oracle.com> Message-ID: <4F16DE8E.1090609@oracle.com> Updated webrev: http://cr.openjdk.java.net/~rottenha/7130391/webrev.01 Changes to the previous version are: * src/share/vm/runtime/globals.cpp: Remove validation control from AtPut(CommandLineFlagsWithType, ...), that is only used by FLAG_SET_* macros in globals_extension.hpp. * src/share/vm/runtime/{globals.hpp, globals.cpp, globals_ext.hpp}: Replace multiple public type safe functions Flag::is_valid[_ext]_( value, ...) by single protected type generic functions CommandLineFlags::is_valid[_ext (const Flag*, const void*, ...), then do internal type casts on the values based on the type of the targeted flag (and assert on type correctness). * src/share/vm/services/management.cpp: Use a better error message (David Holmes). /Robert On 01/17/2012 02:41 PM, Robert Ottenhag wrote: > David, > > Regarding the FLAG_SET_* macros, I am thinking that we can leave them > to a follow up bug instead. > > The reason is that it can be verified by code inspection (of > preprocessed sources) if any FLAG_SET_* macro writes to a variable > known to have validation control. > > Also, fixing that hole would require any access to the variables to > occur through interface get/set functions, preventing direct read and > write access (wrapping the variable in a class to prevent direct > writes), a change too intrusive for now. > > Will come back with an updated and cleaned up patch. > > /Robert > > On 01/17/2012 01:07 PM, Robert Ottenhag wrote: >> David, >> >> Thanks for the review. >> >> On 01/17/2012 04:09 AM, David Holmes wrote: >>> Hi Robert, >>> >>> I've added serviceability to the cc list. >> >> Good, will try to remember that ;-) >> >>> >>> On 17/01/2012 12:04 PM, Robert Ottenhag wrote: >>>> Webrev: http://cr.openjdk.java.net/~rottenha/7130391/webrev.00 >>>> >>>> This fix adds optional validation control to the setting of >>>> command-line switches in Hotspot, and allows it to have >>>> vendor-specific extensions if necessary. >>> >>> Does this imply that the Java management APIs (eg >>> com.sun.management.VMOption) need to be changed to reflect these >>> restrictions? Presently VMOptions are either writeable or not, but >>> this makes them conditionally-writeable. >> >> No, since the Java management APIs already cares for conditional >> writes. According to >> com.sun.management.HotSpotDiagnosticMXBean.setVMOption() it will >> throw IllegalArgumentException if the new value is invalid. >> >>> >>>> The design follows the previously added framework for >>>> vendor-specific command-line switch extensions in CR7117389. >>>> >>>> The validation control is handled by new boolean methods >>>> Flag::is_valid_(value,origin) that are called at the >>>> beginning of each call to CommandLineFlags[Ex]::AtPut() to >>>> verify that the new value and origin are valid replacements for the >>>> current value and origin for this flag. >>>> >>>> When parsing the command line options, a failed validation will >>>> typically result in an error message and exit with "Unrecognized VM >>>> option ''". When used dynamically using the attach API >>>> or management API the resulting operation will fail, leaving it up >>>> to the caller to handle it as appropriate. >>> >>> The error message doesn't really seem appropriate - it may well be a >>> recognized option, you just can't set it to that value in that way. >>> Ideally there would be a way for the validation logic to supply a >>> meaningful error message. In its absence the top-level message >>> should reflect the new type of error. >> >> You are absolutely right, but the current fix is in line with the >> existing bad error messages where any kind of malformatted command >> line flags results in Unrecognized VM option, whether the reason is >> an unknown name, bad type semantics (using +- for bool semantics on >> an integer flag), or if the flag is locked. >> >> I will target meaningful error messages for command line parsing in a >> direct follow up bug to this fix. >> >>> >>> Also some of the failures lead to crashes - which seems wrong to me >>> - see below. >>> >>> ---- >>> >>> src/share/vm/services/management.cpp: >>> >>> 1821 if (!succeed) { >>> 1822 THROW_MSG(vmSymbols::java_lang_IllegalArgumentException(), >>> 1823 "This flag is not writeable with this value or >>> origin."); >>> >>> That's a rather cryptic error message. How about: >>> >>> "Flag can not be set to the requested value using this API" >>> >>> ? >> >> Yes, "origin" does not make sense to the upper Java layer. I will use >> your suggestion. >> >>> >>> ---- >>> >>> src/share/vm/runtime/globals_ext.hpp >>> >>> With all the >>> >>> inline bool Flag::is_valid_ext_T(T value, FlagValueOrigin origin) >>> >>> functions, is it necessary to include the type T in the function name? >> >> It is necessary if using type safe variants with T value as argument >> since overloading does not differ between different typedef names >> that resolves to the same native types, e.g. uintx and uint64_t are >> both unsigned long int. >> >> I am considering a condensed variant that replaces T by void* >> instead, and do the type casting based on the targeted flag, reducing >> the number of functions. >> >>> >>> >>> ----- >>> >>> src/share/vm/runtime/globals.cpp >>> >>> The use of the guarantees seems wrong as it means an invalid option >>> will trigger a VM crash rather than a clean exit during >>> initialization. It seems to me that none of the code in >>> arguments.cpp that uses the FLAG_SET_* macros (which in turn use the >>> CommandLineFlagsEx functions you added the guarantees to) >>> anticipates any possibility for failure. I think if you are going >>> this path then you have no choice but to change the >>> CommandLineFlagsEx methods to return bool and update the FLAG_SET >>> macros to try and perform appropriate error handling. >> >> I see your point, and in theory such as VM crash could occur anytime >> later in a JVM session if rarely running code would make use of >> FLAG_SET_* to change the value of a VM flag to an invalid value or >> origin. >> >> Seems as if the options are either to >> a) ignore validation tests for the FLAG_SET_* macros, and trust that >> they always set valid values. This can be partly verified by static >> code inspection by looking for any variables that actually have >> validation logic associated to them (since the variable name is >> defined at compile time), assuming one has access to all code, but it >> is not perfect in case code for changing a variable with validation >> logic exists. >> b) contain the error handling within the FLAG_SET_* macros, like >> using guarantee(), but maybe exception logic can help? >> c) require usage of the FLAG_SET_* macros to handle result codes and >> pass it up the call chain. >> >> Also, the current macro FLAG_SET_DEFAULT does a direct write to the >> flag value without going through AtPut(). This macro must be >> rewritten to have validation control to close the holes. The current >> call format will require all call sites to include type name as with >> FLAG_SET_{CMDLINE,ERGO} has, or to use slower lookup by variable name. >> >> /Robert >> >>> >>> David >>> ----- >>> >>> >>>> A simple use case for validation is a manageable flag whose current >>>> value can not be less than the previous value, while a more complex >>>> example may base the validation on multiple other flags, etc. >>>> >>>> Thanks, >>>> >>>> /Robert >>>> >> >> > > -- Oracle Robert Ottenhag | Senior Member of Technical Staff Phone: +46850630961 | Fax: +46850630911 | Mobile: +46707106161 Oracle Java HotSpot Virtual Machine ORACLE Sweden | Folkungagatan 122 | SE-116 30 Stockholm Oracle Svenska AB, Kronborgsgr?nd 17, S-164 28 KISTA, reg.no. 556254-6746 Green Oracle Oracle is committed to developing practices and products that help protect the environment -- From keith.mcguigan at oracle.com Wed Jan 18 14:14:24 2012 From: keith.mcguigan at oracle.com (keith.mcguigan at oracle.com) Date: Wed, 18 Jan 2012 22:14:24 +0000 Subject: hg: hsx/hotspot-main/hotspot: 6 new changesets Message-ID: <20120118221444.EB9F3479D8@hg.openjdk.java.net> Changeset: 94ec88ca68e2 Author: phh Date: 2012-01-11 17:34 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/94ec88ca68e2 7115199: Add event tracing hooks and Java Flight Recorder infrastructure Summary: Added a nop tracing infrastructure, JFR makefile changes and other infrastructure used only by JFR. Reviewed-by: acorn, sspitsyn Contributed-by: markus.gronlund at oracle.com ! make/Makefile ! make/bsd/makefiles/vm.make ! make/defs.make ! make/linux/makefiles/vm.make ! make/solaris/makefiles/vm.make ! make/windows/build.bat ! make/windows/create_obj_files.sh ! make/windows/makefiles/projectcreator.make ! make/windows/makefiles/vm.make ! src/share/vm/classfile/symbolTable.cpp ! src/share/vm/classfile/symbolTable.hpp ! src/share/vm/classfile/systemDictionary.cpp ! src/share/vm/oops/klass.cpp ! src/share/vm/oops/klass.hpp ! src/share/vm/oops/methodKlass.cpp ! src/share/vm/oops/methodOop.hpp ! src/share/vm/prims/jni.cpp + src/share/vm/prims/jniExport.hpp ! src/share/vm/runtime/java.cpp ! src/share/vm/runtime/mutexLocker.cpp ! src/share/vm/runtime/mutexLocker.hpp ! src/share/vm/runtime/os.cpp ! src/share/vm/runtime/thread.cpp ! src/share/vm/runtime/thread.hpp ! src/share/vm/runtime/vm_operations.hpp + src/share/vm/trace/traceEventTypes.hpp + src/share/vm/trace/traceMacros.hpp + src/share/vm/trace/tracing.hpp ! src/share/vm/utilities/globalDefinitions.hpp Changeset: 4f3ce9284781 Author: phh Date: 2012-01-11 17:58 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/4f3ce9284781 Merge ! src/share/vm/oops/klass.cpp ! src/share/vm/oops/klass.hpp Changeset: f1cd52d6ce02 Author: kamg Date: 2012-01-17 10:16 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/f1cd52d6ce02 Merge Changeset: d7e3846464d0 Author: zgu Date: 2012-01-17 13:08 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/d7e3846464d0 7071311: Decoder enhancement Summary: Made decoder thread-safe Reviewed-by: coleenp, kamg - src/os/bsd/vm/decoder_bsd.cpp + src/os/bsd/vm/decoder_machO.cpp + src/os/bsd/vm/decoder_machO.hpp ! src/os/linux/vm/decoder_linux.cpp ! src/os/linux/vm/os_linux.cpp ! src/os/solaris/vm/decoder_solaris.cpp ! src/os/solaris/vm/os_solaris.cpp ! src/os/windows/vm/decoder_windows.cpp + src/os/windows/vm/decoder_windows.hpp ! src/os/windows/vm/os_windows.cpp ! src/share/vm/utilities/decoder.cpp ! src/share/vm/utilities/decoder.hpp + src/share/vm/utilities/decoder_elf.cpp + src/share/vm/utilities/decoder_elf.hpp ! src/share/vm/utilities/elfFile.cpp ! src/share/vm/utilities/elfFile.hpp ! src/share/vm/utilities/elfStringTable.cpp ! src/share/vm/utilities/elfStringTable.hpp ! src/share/vm/utilities/elfSymbolTable.cpp ! src/share/vm/utilities/elfSymbolTable.hpp ! src/share/vm/utilities/vmError.cpp Changeset: 6520f9861937 Author: kamg Date: 2012-01-17 21:25 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/6520f9861937 Merge Changeset: db18ca98d237 Author: zgu Date: 2012-01-18 11:45 -0500 URL: http://hg.openjdk.java.net/hsx/hotspot-main/hotspot/rev/db18ca98d237 7131050: fix for "7071311 Decoder enhancement" does not build on MacOS X Summary: Decoder API changes did not reflect in os_bsd Reviewed-by: kamg, dcubed ! src/os/bsd/vm/os_bsd.cpp From manojo10386 at gmail.com Thu Jan 19 03:18:09 2012 From: manojo10386 at gmail.com (Manohar Jonnalagedda) Date: Thu, 19 Jan 2012 12:18:09 +0100 Subject: Detecting range check elimination with PrintAssembly In-Reply-To: <4F149D88.2060305@oracle.com> References:

<4F135374.8020004@oracle.com> <4F149D88.2060305@oracle.com> Message-ID: Hi Vladimir, Vitaly, In such situations we usually use some visual tools to see difference > between log outputs. Would you recommend any? I have had troubles using IdealGraphVisualizer: either the overhead of using it makes the methods not get JIT compiled, or, if I decrese the CompileThreshold, the VM crashes with an out of memory error. > At least you can use 'diff'. You may need to replace instructions > addresses in outputs (number at the beginning of lines) with the same value > to match. There are few tricks you may use to get similar PrintOptoAssembly > output. Use next flags to avoid mixing output from program output and from > 2 compiler threads (flags stop program until a method is compiled and run > only one compiler thread): > > -Xbatch -XX:CICompilerCount=1 > > Also add -XX:+PrintCompilation -XX:+PrintInlining to see what method is > compiled and inlined. Note that you may see similar output for individual > methods but could be big difference in compiled caller (computeAll()) > method where 2 loop methods could be inlined. So you need to compare all > compiled methods. > PrintInlining tells me that constructNearestClusterVector is inlined into ComputeAll, and from what I understand, computeNewCentroids is not: @ 109 Kmeans::computeNewCentroids (163 bytes) already compiled into a big method I have this message 3 times, do these belong to the Pre-Main-Post of the loop in the computeAll method? > In general, to have constant as loop limit is always win because some > checks in generated code could be avoided and more optimizations could be > done for such loops. Use -XX:+TraceLoopOpts to see what loop optimizations > are done in both cases. > I am new to this flag: should I use it in conjunction with the PrintOptoAssembly output for identifying loop numbers with the code they represent? Would there be any other way to interpret these? Thanks, Manohar > > For example, in your code you set 'x_col = 3', as result the next loop in > constructNearestClusterVector() will be fully unrolled when this method is > inlined into computeAll() and x_col is replaced with '3': > > for(k = 0; k < x_col; k++) { > double tmp = x[i*x_col + k] - mu[j* mu_col + k]; > dist += tmp * tmp; > } > > Vladimir > > > On 1/16/12 1:39 AM, Manohar Jonnalagedda wrote: > >> Hi Kris, Vladimir, >> >> thanks for both your responses. >> >> Second, your two test methods are different so you can't directly >> compare them. method1() iterates over rows using >> middle loop index 'j' and method2() uses external loop index 'i'. >> Unless they are typos again. >> >> >> You are right, these are indeed typos. As Kris suggested, I have the code >> printed here: http://pastebin.com/xRFD1Nt1. >> The methods corresponding to method1, and method2 are >> constructNearestClusterVector and computeNewCentroids. Their >> PrintOptoAssembly outputs are respectively at >> http://pastebin.com/1evN8b3K and http://pastebin.com/FxkVWTD5 >> >> Also, it seems I have not explained myself correctly. I am not trying to >> compare the performance of method1 with respect >> to that of method2: method1 and method2 both run in the same program. >> What I am trying to compare is their performance >> in two cases: >> - when cols is a compile-time constant (much faster) >> - when cols is a value determined at run-time >> >> If you are using jdk7 there are few flags you can use to print loop >> optimizations information. They need debug >> version of VM but it is not problem for you, I think, since you can >> use debug PrintOptoAssembly flag. >> >> -XX:+TraceLoopOpts prints all happened loop optimizations and loop >> tree after each round of loop opts, >> -XX:+TraceLoopPredicate prints RC information when it is moved from a >> loop, >> -XX:+TraceRangeLimitCheck prints additional information for RC >> elimination optimization. >> >> >> Thanks for these, I will have a look at what they output. >> >> Fourth, range check expression in your example is not what you think. >> RC expression should be next: >> (i*stride+offset) where 'i' is loop variable, 'stride' is constant and >> 'offset' is loop invariant. >> >> In your example 'offset' is (j * cols) since it is loop invariant, 'k' >> is loop variable and stride is '1' (one). >> In both your methods RC will be moved out of inner loop so the code >> for it will be the same. The only difference in >> these methods will be where and how (j * cols) and (i * cols) >> expressions are calculated. >> >> >> I'd guess it's the difference in locality that made the difference >> in performance in your two tests. >> >> Thanks for the explanation. I understand from the above that the >> assembly output in both cases mentioned above may not >> be different, because the expressions are similar. The difference in >> runtime (due to cols being a compile-time constant) >> will be visible elsewhere. Is that right? If so, where would I be able to >> detect this? >> >> Cheers, >> Manohar >> >> In your PrintOptoAssembly output snippet, the instruction at 0x13e >> is a LoadRange, which loads the range from >> the header >> of an array: >> >> (from x86_64.ad