From erik.joelsson at oracle.com Thu Jan 2 07:57:07 2020 From: erik.joelsson at oracle.com (Erik Joelsson) Date: Thu, 2 Jan 2020 08:57:07 +0100 Subject: [15] Review Request: 8235975 Update copyright year to match last edit in jdk repository for 2014/15/16/17/18 In-Reply-To: <3460a6f6-6178-cc45-5840-0f215eebc53f@oracle.com> References: <1e7d0395-fc57-4d5b-9cfa-c33e0f6462d5@oracle.com> <3460a6f6-6178-cc45-5840-0f215eebc53f@oracle.com> Message-ID: <7b67e39d-752c-50a8-8e18-9a8f86bd641c@oracle.com> Build files look good. /Erik On 2019-12-24 19:22, Sergey Bylokhov wrote: > Hello. > > Here is an updated version: > Bug: https://bugs.openjdk.java.net/browse/JDK-8235975 > Patch (2 Mb): > http://cr.openjdk.java.net/~serb/8235975/webrev.03/open.patch > Fix: http://cr.openjdk.java.net/~serb/8235975/webrev.03/ > > - "jdk.internal.vm.compiler" is removed from the patch. > - "Aes128CtsHmacSha2EType.java" is updated to "Copyright (c) 2018" > > On 12/22/19 11:24 pm, Sergey Bylokhov wrote: >> Hello. >> Please review the fix for JDK 15. >> >> Bug: https://bugs.openjdk.java.net/browse/JDK-8235975 >> Patch (2 Mb): >> http://cr.openjdk.java.net/~serb/8235975/webrev.02/open.patch >> Fix: http://cr.openjdk.java.net/~serb/8235975/webrev.02 >> >> I have updated the source code copyrights by the >> "update_copyright_year.sh" >> script for 2014/15/16/18/19 years, unfortunately, cannot run it for 2017 >> because of: "JDK-8187443: Forest Consolidation: Move files to unified >> layout" >> which touched all files. 
>> >> > > From Sergey.Bylokhov at oracle.com Thu Jan 2 12:02:14 2020 From: Sergey.Bylokhov at oracle.com (Sergey Bylokhov) Date: Thu, 2 Jan 2020 15:02:14 +0300 Subject: [15] Review Request: 8235975 Update copyright year to match last edit in jdk repository for 2014/15/16/17/18 In-Reply-To: <7b67e39d-752c-50a8-8e18-9a8f86bd641c@oracle.com> References: <1e7d0395-fc57-4d5b-9cfa-c33e0f6462d5@oracle.com> <3460a6f6-6178-cc45-5840-0f215eebc53f@oracle.com> <7b67e39d-752c-50a8-8e18-9a8f86bd641c@oracle.com> Message-ID: <0caab700-28e3-16e7-db00-b698557443f0@oracle.com> I guess it is too late to fix it, will need to update the files at the end of 2020. On 1/2/20 10:57 am, Erik Joelsson wrote: > Build files look good. > > /Erik > > On 2019-12-24 19:22, Sergey Bylokhov wrote: >> Hello. >> >> Here is an updated version: >> Bug: https://bugs.openjdk.java.net/browse/JDK-8235975 >> Patch (2 Mb): http://cr.openjdk.java.net/~serb/8235975/webrev.03/open.patch >> Fix: http://cr.openjdk.java.net/~serb/8235975/webrev.03/ >> >> - "jdk.internal.vm.compiler" is removed from the patch. >> - "Aes128CtsHmacSha2EType.java" is updated to "Copyright (c) 2018" >> >> On 12/22/19 11:24 pm, Sergey Bylokhov wrote: >>> Hello. >>> Please review the fix for JDK 15. >>> >>> Bug: https://bugs.openjdk.java.net/browse/JDK-8235975 >>> Patch (2 Mb): http://cr.openjdk.java.net/~serb/8235975/webrev.02/open.patch >>> Fix: http://cr.openjdk.java.net/~serb/8235975/webrev.02 >>> >>> I have updated the source code copyrights by the "update_copyright_year.sh" >>> script for 2014/15/16/18/19 years, unfortunately, cannot run it for 2017 >>> because of: "JDK-8187443: Forest Consolidation: Move files to unified layout" >>> which touched all files. >>> >>> >> >> -- Best regards, Sergey. 
From igor.veresov at oracle.com Thu Jan 2 15:00:48 2020 From: igor.veresov at oracle.com (Igor Veresov) Date: Thu, 2 Jan 2020 07:00:48 -0800 Subject: [14] RFR(S) 8236050: Some compiler tests fail when executed with custom TieredLevel Message-ID: Webrev: http://cr.openjdk.java.net/~iveresov/8236050/webrev.00 This change makes the TieredStopAtLevel processing aware of various compilation modes. Thanks! igor From vladimir.x.ivanov at oracle.com Thu Jan 2 23:16:00 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Fri, 3 Jan 2020 02:16:00 +0300 Subject: [14] RFR(S): 8236443 : Issues with specializing vector register type for phi operand with generic operands In-Reply-To: References: Message-ID: Hi Jatin, > Webrev: http://cr.openjdk.java.net/~jbhateja/8236443/webrev.02/ I like that you put the check on the use side (ShiftV + ideal_in_idx). Though ShiftCntV def is the only viable option in practice (phis shouldn't occur on x86 and would trigger a bug due to mismatch between node type and ideal_reg), it's nice to see such a case being ruled out. Some comments: ======================================================================= src/hotspot/cpu/x86/x86.ad: +const uint Matcher::special_vector_ideal_reg(int ideal_reg, int opcode, int ideal_in_idx, int size) { I don't see a compelling reason why it should be a platform-specific check. ShiftCntV nodes override Node::ideal_reg() and putting the check in shared code looks appropriate. Considering how LShiftCntVNode::ideal_reg is defined [1], IMO calling Matcher::vector_shift_count_ideal_reg(t->length_in_bytes()) is better than hardcoding the VecS ideal register (which is x86-specific). ======================================================================= + case Op_RShiftCntV: + case Op_LShiftCntV: + // Ideal index corresponding to def operand. + if (ideal_in_idx == -1) + return Op_VecS; + break; Shouldn't ideal_in_idx be 0 instead? 
Some stylistic notes: ======================================================================= src/hotspot/cpu/x86/x86.ad: +MachOper* Matcher::specialize_vector_operand_helper(Node* m, MachNode* n, int op_idx) { It's a bit confusing to see Node* named "m" and MachNode* named "n". In the code base it's usually the opposite: n stands for Node and m stands for MachNode. But maybe "def" and "use" are better names? ======================================================================= + switch(opcode) { + default: + return ideal_reg; + case Op_LShiftVB: ... + case Op_URShiftVL: + if (ideal_in_idx == 2) + return Op_VecS; + break; + case Op_RShiftCntV: + case Op_LShiftCntV: + // Ideal index corresponding to def operand. + if (ideal_in_idx == -1) + return Op_VecS; + break; + } Does it make sense to introduce ShiftV and ShiftCntV node classes [2] and use Node::is_ShiftV()/is_ShiftCntV() checks instead? ======================================================================= + switch(opcode) { + default: + return ideal_reg; ... + } + return ideal_reg; The trailing return statement is redundant. ======================================================================= src/hotspot/share/opto/matcher.cpp: - MachNode* m = live_nodes.pop()->isa_Mach(); + Node* n = live_nodes.pop(); + if (!n->is_Mach()) + continue; + MachNode * m = n->as_Mach(); if (m != NULL) { Node::isa_Mach() + != NULL check is a well-established convention in the C2 code base to perform type checks on nodes (is_Mach()+as_Mach()). I'd prefer to keep the original version. Best regards, Vladimir Ivanov [1] class LShiftCntVNode : public VectorNode { virtual uint ideal_reg() const { return Matcher::vector_shift_count_ideal_reg(vect_type()->length_in_bytes()); } }; [2] http://hg.openjdk.java.net/jdk/jdk/file/tip/src/hotspot/share/opto/node.hpp#l716 > Generic operand processing has been skipped for non-machine nodes (e.g. Phi) since they are skipped by the matcher. 
> Non-definition operand resolution of a machine node will be able to pull the type information from a non-machine node. > > Re-organized the code by adding a target specific routine which can be used to return the operand types for special ideal nodes e.g. RShiftCntV/LShiftCntV. > For such nodes, the definition machine operand varies for different targets and vector lengths, so a type based generic operand resolution > is not possible in such cases. > > Best Regards, > Jatin > > From claes.redestad at oracle.com Fri Jan 3 10:53:25 2020 From: claes.redestad at oracle.com (Claes Redestad) Date: Fri, 3 Jan 2020 11:53:25 +0100 Subject: [14] RFR(S) 8236050: Some compiler tests fail when executed with custom TieredLevel In-Reply-To: References: Message-ID: <6188d3c7-839f-b822-4293-97526cecac6a@oracle.com> Hi, logic in limit_level is a bit hard to follow. In case TieredStopAtLevel is 4, level is 2 or 3 and compilation mode is high-only or high-only- quick-internal it seems it'll return level, which is an invalid level for the compilation mode..? /Claes On 2020-01-02 16:00, Igor Veresov wrote: > Webrev: http://cr.openjdk.java.net/~iveresov/8236050/webrev.00 > > This change makes the TieredStopAtLevel processing aware of various compilation modes. > > Thanks! > igor > > > From igor.veresov at oracle.com Fri Jan 3 15:16:59 2020 From: igor.veresov at oracle.com (Igor Veresov) Date: Fri, 3 Jan 2020 07:16:59 -0800 Subject: [14] RFR(S) 8236050: Some compiler tests fail when executed with custom TieredLevel In-Reply-To: <6188d3c7-839f-b822-4293-97526cecac6a@oracle.com> References: <6188d3c7-839f-b822-4293-97526cecac6a@oracle.com> Message-ID: <6ED127C5-BFCB-4955-97FE-DBFF0498FEE0@oracle.com> That's not a problem, the level argument is always valid. The case this change is fixing is the situation when the level needs to be changed due to TieredStopAtLevel (i.e. when level > TieredStopAtLevel). 
igor > On Jan 3, 2020, at 2:53 AM, Claes Redestad wrote: > > Hi, > > logic in limit_level is a bit hard to follow. In case TieredStopAtLevel > is 4, level is 2 or 3 and compilation mode is high-only or high-only- > quick-internal it seems it'll return level, which is an invalid level > for the compilation mode..? > > /Claes > > On 2020-01-02 16:00, Igor Veresov wrote: >> Webrev: http://cr.openjdk.java.net/~iveresov/8236050/webrev.00 >> This change makes the TieredStopAtLevel processing aware of various compilation modes. >> Thanks! >> igor From claes.redestad at oracle.com Fri Jan 3 15:44:01 2020 From: claes.redestad at oracle.com (Claes Redestad) Date: Fri, 3 Jan 2020 16:44:01 +0100 Subject: [14] RFR(S) 8236050: Some compiler tests fail when executed with custom TieredLevel In-Reply-To: <6ED127C5-BFCB-4955-97FE-DBFF0498FEE0@oracle.com> References: <6188d3c7-839f-b822-4293-97526cecac6a@oracle.com> <6ED127C5-BFCB-4955-97FE-DBFF0498FEE0@oracle.com> Message-ID: <474c5c62-6679-3ddb-0fc8-06ebc03f76bf@oracle.com> Right, it seemed to be the case, but I couldn't conclude it by reading the surrounding/calling code. That's why I commented that the logic is a bit hard to follow. :-) I think an assert that the level argument is always valid would be a sufficient guide here. /Claes On 2020-01-03 16:16, Igor Veresov wrote: > That's not a problem, the level argument is always valid. The case this change is fixing is the situation when the level needs to be changed due to TieredStopAtLevel (i.e. when level > TieredStopAtLevel). > > igor > >> On Jan 3, 2020, at 2:53 AM, Claes Redestad wrote: >> >> Hi, >> >> logic in limit_level is a bit hard to follow. In case TieredStopAtLevel >> is 4, level is 2 or 3 and compilation mode is high-only or high-only- >> quick-internal it seems it'll return level, which is an invalid level >> for the compilation mode..? 
>> >> /Claes >> >> On 2020-01-02 16:00, Igor Veresov wrote: >>> Webrev: http://cr.openjdk.java.net/~iveresov/8236050/webrev.00 >>> This change makes the TieredStopAtLevel processing aware of various compilation modes. >>> Thanks! >>> igor > From goetz.lindenmaier at sap.com Fri Jan 3 16:32:33 2020 From: goetz.lindenmaier at sap.com (Lindenmaier, Goetz) Date: Fri, 3 Jan 2020 16:32:33 +0000 Subject: Help with downporting 8225019: Update JVMCI Message-ID: Hi, I am working on downporting 11.0.7-oracle fixes to OpenJDK 11.0.7. I encountered "8225019: Update JVMCI" which is applying badly. It has lots of rejects. There are various places that are not trivial to resolve. I don't have any background in JVMCI/AOT and SAP does not use it (yet). I would appreciate if someone with appropriate background or interest could step up and bring this change to jdk11u-dev. https://bugs.openjdk.java.net/browse/JDK-8225019 http://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2019-May/034050.html Best regards, Goetz. 
From matthias.baesken at sap.com Fri Jan 3 15:49:11 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Fri, 3 Jan 2020 15:49:11 +0000 Subject: struct SwitchRange and C++ One Definition Rule [-Wodr] Message-ID: Hello, when experimenting with gcc8 and the -flto compiler flag I was running into these warnings in the c1 coding : /open_jdk/jdk_3/jdk/src/hotspot/share/c1/c1_LIRGenerator.hpp:50:7: warning: type 'struct SwitchRange' violates the C++ One Definition Rule [-Wodr] class SwitchRange: public CompilationResourceObj { ^ /open_jdk/jdk_3/jdk/src/hotspot/share/opto/parse2.cpp:319: note: a different type is defined in another translation unit class SwitchRange : public StackObj { /usr/work/d040975/open_jdk/jdk_3/jdk/src/hotspot/share/c1/c1_LIRGenerator.hpp:52:7: note: the first difference of corresponding definitions is field '_low_key' int _low_key; ^ /open_jdk/jdk_3/jdk/src/hotspot/share/opto/parse2.cpp:321: note: a field with different name is defined in another translation unit jint _lo; // inclusive lower limit Do you think this should be fixed ( renaming one SwitchRange ) ? Martin suggested that even without flto added it could be problematic . 
Thanks, Matthias From kim.barrett at oracle.com Fri Jan 3 19:12:52 2020 From: kim.barrett at oracle.com (Kim Barrett) Date: Fri, 3 Jan 2020 14:12:52 -0500 Subject: struct SwitchRange and C++ One Definition Rule [-Wodr] In-Reply-To: References: Message-ID: <8EF4DC8C-37C0-4E17-AE92-2F2C8DD3869B@oracle.com> > On Jan 3, 2020, at 10:49 AM, Baesken, Matthias wrote: > > Hello, when experimenting with gcc8 and the -flto compiler flag I was running into these warnings in the c1 coding : > > > /open_jdk/jdk_3/jdk/src/hotspot/share/c1/c1_LIRGenerator.hpp:50:7: warning: type 'struct SwitchRange' violates the C++ One Definition Rule [-Wodr] > class SwitchRange: public CompilationResourceObj { > ^ > /open_jdk/jdk_3/jdk/src/hotspot/share/opto/parse2.cpp:319: note: a different type is defined in another translation unit > class SwitchRange : public StackObj { > > /usr/work/d040975/open_jdk/jdk_3/jdk/src/hotspot/share/c1/c1_LIRGenerator.hpp:52:7: note: the first difference of corresponding definitions is field '_low_key' > int _low_key; > ^ > /open_jdk/jdk_3/jdk/src/hotspot/share/opto/parse2.cpp:321: note: a field with different name is defined in another translation unit > jint _lo; // inclusive lower limit > > > > Do you think this should be fixed ( renaming one SwitchRange ) ? > > Martin suggested that even without flto added it could be problematic . Yes, please file a bug and let's get this fixed quickly. This sort of thing can lead to really weird looking problems, like using the same vtable (which one picked "at random") for instances of both classes. 
From igor.veresov at oracle.com Sat Jan 4 00:28:22 2020 From: igor.veresov at oracle.com (Igor Veresov) Date: Fri, 3 Jan 2020 16:28:22 -0800 Subject: [14] RFR(S) 8236050: Some compiler tests fail when executed with custom TieredLevel In-Reply-To: <474c5c62-6679-3ddb-0fc8-06ebc03f76bf@oracle.com> References: <6188d3c7-839f-b822-4293-97526cecac6a@oracle.com> <6ED127C5-BFCB-4955-97FE-DBFF0498FEE0@oracle.com> <474c5c62-6679-3ddb-0fc8-06ebc03f76bf@oracle.com> Message-ID: <58742AC8-84BD-425F-A64A-9CAA93DBD5A3@oracle.com> Ok, we can add some verification: http://cr.openjdk.java.net/~iveresov/8236050/webrev.01/ igor > On Jan 3, 2020, at 7:44 AM, Claes Redestad wrote: > > Right, it seemed to be the case, but I couldn't conclude it by reading > the surrounding/calling code. That's why I commented that the logic is a > bit hard to follow. :-) > > I think an assert that the level argument is always valid would be a > sufficient guide here. > > /Claes > > On 2020-01-03 16:16, Igor Veresov wrote: >> That's not a problem, the level argument is always valid. The case this change is fixing is the situation when the level needs to be changed due to TieredStopAtLevel (i.e. when level > TieredStopAtLevel). >> igor >>> On Jan 3, 2020, at 2:53 AM, Claes Redestad wrote: >>> >>> Hi, >>> >>> logic in limit_level is a bit hard to follow. In case TieredStopAtLevel >>> is 4, level is 2 or 3 and compilation mode is high-only or high-only- >>> quick-internal it seems it'll return level, which is an invalid level >>> for the compilation mode..? >>> >>> /Claes >>> >>> On 2020-01-02 16:00, Igor Veresov wrote: >>>> Webrev: http://cr.openjdk.java.net/~iveresov/8236050/webrev.00 >>>> This change makes the TieredStopAtLevel processing aware of various compilation modes. >>>> Thanks! 
>>>> igor From claes.redestad at oracle.com Sat Jan 4 00:43:50 2020 From: claes.redestad at oracle.com (Claes Redestad) Date: Sat, 4 Jan 2020 01:43:50 +0100 Subject: [14] RFR(S) 8236050: Some compiler tests fail when executed with custom TieredLevel In-Reply-To: <58742AC8-84BD-425F-A64A-9CAA93DBD5A3@oracle.com> References: <6188d3c7-839f-b822-4293-97526cecac6a@oracle.com> <6ED127C5-BFCB-4955-97FE-DBFF0498FEE0@oracle.com> <474c5c62-6679-3ddb-0fc8-06ebc03f76bf@oracle.com> <58742AC8-84BD-425F-A64A-9CAA93DBD5A3@oracle.com> Message-ID: Thanks, looks good to me! /Claes On 2020-01-04 01:28, Igor Veresov wrote: > Ok, we can add some verification: > http://cr.openjdk.java.net/~iveresov/8236050/webrev.01/ > > igor > > > >> On Jan 3, 2020, at 7:44 AM, Claes Redestad > > wrote: >> >> Right, it seemed to be the case, but I couldn't conclude it by reading >> the surrounding/calling code. That's why I commented that the logic is a >> bit hard to follow. :-) >> >> I think an assert that the level argument is always valid would be a >> sufficient guide here. >> >> /Claes >> >> On 2020-01-03 16:16, Igor Veresov wrote: >>> That's not a problem, the level argument is always valid. The case >>> this change is fixing is the situation when the level needs to be >>> changed due to TieredStopAtLevel (i.e. when level > TieredStopAtLevel). >>> igor >>>> On Jan 3, 2020, at 2:53 AM, Claes Redestad >>>> > wrote: >>>> >>>> Hi, >>>> >>>> logic in limit_level is a bit hard to follow. In case TieredStopAtLevel >>>> is 4, level is 2 or 3 and compilation mode is high-only or high-only- >>>> quick-internal it seems it'll return level, which is an invalid level >>>> for the compilation mode..? >>>> >>>> /Claes >>>> >>>> On 2020-01-02 16:00, Igor Veresov wrote: >>>>> Webrev: http://cr.openjdk.java.net/~iveresov/8236050/webrev.00 >>>>> >>>>> This change makes the TieredStopAtLevel processing aware of various >>>>> compilation modes. >>>>> Thanks! 
>>>>> igor > From igor.veresov at oracle.com Sun Jan 5 02:25:43 2020 From: igor.veresov at oracle.com (Igor Veresov) Date: Sat, 4 Jan 2020 18:25:43 -0800 Subject: [14] RFR(S) 8236050: Some compiler tests fail when executed with custom TieredLevel In-Reply-To: References: <6188d3c7-839f-b822-4293-97526cecac6a@oracle.com> <6ED127C5-BFCB-4955-97FE-DBFF0498FEE0@oracle.com> <474c5c62-6679-3ddb-0fc8-06ebc03f76bf@oracle.com> <58742AC8-84BD-425F-A64A-9CAA93DBD5A3@oracle.com> Message-ID: <8F745069-EAAE-4120-9577-8AEDA5A8EDCA@oracle.com> Actually, the verification found a problem with how the compilation levels are chosen in case of -Xcomp. So I had to fix that: http://cr.openjdk.java.net/~iveresov/8236050/webrev.02/ igor > On Jan 3, 2020, at 4:43 PM, Claes Redestad wrote: > > Thanks, looks good to me! > > /Claes > > On 2020-01-04 01:28, Igor Veresov wrote: >> Ok, we can add some verification: http://cr.openjdk.java.net/~iveresov/8236050/webrev.01/ >> igor >>> On Jan 3, 2020, at 7:44 AM, Claes Redestad > wrote: >>> >>> Right, it seemed to be the case, but I couldn't conclude it by reading >>> the surrounding/calling code. That's why I commented that the logic is a >>> bit hard to follow. :-) >>> >>> I think an assert that the level argument is always valid would be a >>> sufficient guide here. >>> >>> /Claes >>> >>> On 2020-01-03 16:16, Igor Veresov wrote: >>>> That?s not a problem, the level argument is always valid. The case this change is fixing is the situation when the level needs to be changed due to TieredStopAtLevel (i.e. when level > TieredStopAtLevel). >>>> igor >>>>> On Jan 3, 2020, at 2:53 AM, Claes Redestad > wrote: >>>>> >>>>> Hi, >>>>> >>>>> logic in limit_level is a bit hard to follow. In case TieredStopAtLevel >>>>> is 4, level is 2 or 3 and compilation mode is high-only or high-only- >>>>> quick-internal it seems it'll return level, which is an invalid level >>>>> for the compilation mode..? 
>>>>> >>>>> /Claes >>>>> >>>>> On 2020-01-02 16:00, Igor Veresov wrote: >>>>>> Webrev: http://cr.openjdk.java.net/~iveresov/8236050/webrev.00 >>>>>> This change makes the TieredStopAtLevel processing aware of various compilation modes. >>>>>> Thanks! >>>>>> igor From tobias.hartmann at oracle.com Mon Jan 6 07:20:57 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Mon, 6 Jan 2020 08:20:57 +0100 Subject: [14] RFR(S) 8236050: Some compiler tests fail when executed with custom TieredLevel In-Reply-To: <8F745069-EAAE-4120-9577-8AEDA5A8EDCA@oracle.com> References: <6188d3c7-839f-b822-4293-97526cecac6a@oracle.com> <6ED127C5-BFCB-4955-97FE-DBFF0498FEE0@oracle.com> <474c5c62-6679-3ddb-0fc8-06ebc03f76bf@oracle.com> <58742AC8-84BD-425F-A64A-9CAA93DBD5A3@oracle.com> <8F745069-EAAE-4120-9577-8AEDA5A8EDCA@oracle.com> Message-ID: <1e4fb10a-2462-f43d-2ae6-2479f45daae9@oracle.com> Hi Igor, On 05.01.20 03:25, Igor Veresov wrote: > Actually, the verification found a problem with how the compilation levels are chosen in case of -Xcomp. > So I had to fix that: http://cr.openjdk.java.net/~iveresov/8236050/webrev.02/ Looks good to me. Maybe adding a test such that we catch similar problems before ATR would be good. 
Best regards, Tobias From igor.veresov at oracle.com Mon Jan 6 18:47:18 2020 From: igor.veresov at oracle.com (Igor Veresov) Date: Mon, 6 Jan 2020 10:47:18 -0800 Subject: [14] RFR(S) 8236050: Some compiler tests fail when executed with custom TieredLevel In-Reply-To: <1e4fb10a-2462-f43d-2ae6-2479f45daae9@oracle.com> References: <6188d3c7-839f-b822-4293-97526cecac6a@oracle.com> <6ED127C5-BFCB-4955-97FE-DBFF0498FEE0@oracle.com> <474c5c62-6679-3ddb-0fc8-06ebc03f76bf@oracle.com> <58742AC8-84BD-425F-A64A-9CAA93DBD5A3@oracle.com> <8F745069-EAAE-4120-9577-8AEDA5A8EDCA@oracle.com> <1e4fb10a-2462-f43d-2ae6-2479f45daae9@oracle.com> Message-ID: > On Jan 5, 2020, at 11:20 PM, Tobias Hartmann wrote: > > Hi Igor, > > On 05.01.20 03:25, Igor Veresov wrote: >> Actually, the verification found a problem with how the compilation levels are chosen in case of -Xcomp. >> So I had to fix that: http://cr.openjdk.java.net/~iveresov/8236050/webrev.02/ > > Looks good to me. Maybe adding a test such that we catch similar problems before ATR would be good. > Sure thing. A quick test added: http://cr.openjdk.java.net/~iveresov/8236050/webrev.03/ Thanks, Igor > Best regards, > Tobias From tobias.hartmann at oracle.com Tue Jan 7 07:31:23 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Tue, 7 Jan 2020 08:31:23 +0100 Subject: [14] RFR(S) 8236050: Some compiler tests fail when executed with custom TieredLevel In-Reply-To: References: <6188d3c7-839f-b822-4293-97526cecac6a@oracle.com> <6ED127C5-BFCB-4955-97FE-DBFF0498FEE0@oracle.com> <474c5c62-6679-3ddb-0fc8-06ebc03f76bf@oracle.com> <58742AC8-84BD-425F-A64A-9CAA93DBD5A3@oracle.com> <8F745069-EAAE-4120-9577-8AEDA5A8EDCA@oracle.com> <1e4fb10a-2462-f43d-2ae6-2479f45daae9@oracle.com> Message-ID: <7707d1d8-d5b0-7af1-877c-8fd0ffa5c84b@oracle.com> On 06.01.20 19:47, Igor Veresov wrote: > Sure thing. A quick test added:?http://cr.openjdk.java.net/~iveresov/8236050/webrev.03/ Looks good, thanks for adding! 
Best regards, Tobias From tobias.hartmann at oracle.com Tue Jan 7 07:44:33 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Tue, 7 Jan 2020 08:44:33 +0100 Subject: RFR: 8236181: C2: Remove useless step_over_gc_barrier() in int->bool conversion In-Reply-To: References: Message-ID: Hi Roman, this looks good to me. Best regards, Tobias On 18.12.19 13:40, Roman Kennke wrote: > In cfgnode.cpp, in is_x2logic() that converts a diamond-shape if/else to simple bool patterns, we > have a step_over_gc_barrier() at the end. This has been introduced by Shenandoah. I believe the > intention was to convert obj vs null check to a simple boolean expression and eliminate the barrier > on the unneeded path. However, it is not needed because in Shenandoah we already eliminate barriers > when the only user is a null-check, and it might actually be counter-productive if the barrier is > needed on other paths, because it keeps the input of the barrier alive. This is probably a left-over > from pre-LRB. > > Bug: > https://bugs.openjdk.java.net/browse/JDK-8236181 > Webrev: > http://cr.openjdk.java.net/~rkennke/JDK-8236181/webrev.00/ > > Testing: hotspot_gc_shenandoah, submit-repo (in-progress) > > Can I please get a review? > > Thanks, > Roman > From christian.hagedorn at oracle.com Tue Jan 7 11:53:27 2020 From: christian.hagedorn at oracle.com (Christian Hagedorn) Date: Tue, 7 Jan 2020 12:53:27 +0100 Subject: [14] RFR(XS): 8236140: assert(!VerifyHashTableKeys || _hash_lock == 0) failed: remove node from hash table before modifying it Message-ID: <2d1fd1d5-dfaa-a1d3-3b18-c0e9f0a58a01@oracle.com> Hi Please review the following patch: https://bugs.openjdk.java.net/browse/JDK-8236140 http://cr.openjdk.java.net/~chagedorn/8236140/webrev.00/ An input of the StoreL node in the test case was set without rehashing the store, which triggers the assertion failure. Thank you! 
Best regards, Christian From rwestrel at redhat.com Tue Jan 7 12:07:33 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Tue, 07 Jan 2020 13:07:33 +0100 Subject: RFR: 8236181: C2: Remove useless step_over_gc_barrier() in int->bool conversion In-Reply-To: References: Message-ID: <875zhn79mi.fsf@redhat.com> > However, it is not needed because Shenandoah we already eliminate > barriers when the only user is a null-check, and it might actually be > counter-productive if the barrier is needed on other paths, because it > keeps the input of the barrier alive. This is probably a left-over > from pre-LRB. I'm not sure this is useless. If this triggers during igvn, it could be that the Phi is processed before the CmpP and so the barrier at the CmpP has no chance of being eliminated. Anyway, this is also likely uncommon so I'm fine with removing it. Roland. From matthias.baesken at sap.com Tue Jan 7 07:54:44 2020 From: matthias.baesken at sap.com (Baesken, Matthias) Date: Tue, 7 Jan 2020 07:54:44 +0000 Subject: struct SwitchRange and C++ One Definition Rule [-Wodr] In-Reply-To: <8EF4DC8C-37C0-4E17-AE92-2F2C8DD3869B@oracle.com> References: <8EF4DC8C-37C0-4E17-AE92-2F2C8DD3869B@oracle.com> Message-ID: Hi Kim, thanks for the feedback . 
I opened : 8236709: struct SwitchRange in HS violates C++ One Definition Rule https://bugs.openjdk.java.net/browse/JDK-8236709 Best regards, Matthias > > Hello, when experimenting with gcc8 and the -flto compiler flag I was > running into these warnings in the c1 coding : > > > > /open_jdk/jdk_3/jdk/src/hotspot/share/c1/c1_LIRGenerator.hpp:50:7: > warning: type 'struct SwitchRange' violates the C++ One Definition Rule [- > Wodr] > > class SwitchRange: public CompilationResourceObj { > > ^ > > /open_jdk/jdk_3/jdk/src/hotspot/share/opto/parse2.cpp:319: note: a > different type is defined in another translation unit > > class SwitchRange : public StackObj { > > > > > /usr/work/d040975/open_jdk/jdk_3/jdk/src/hotspot/share/c1/c1_LIRGener > ator.hpp:52:7: note: the first difference of corresponding definitions is field > '_low_key' > > int _low_key; > > ^ > > /open_jdk/jdk_3/jdk/src/hotspot/share/opto/parse2.cpp:321: note: a > field with different name is defined in another translation unit > > jint _lo; // inclusive lower limit > > > > > > > > Do you think this should be fixed ( renaming one SwitchRange ) ? > > > > > Martin suggested that even without flto added it could be problematic . > > Yes, please file a bug and let's get this fixed quickly. This sort of thing can > lead > to really weird looking problems, like using the same vtable (which one > picked > ?at random?) for instances of both classes. 
From nils.eliasson at oracle.com Tue Jan 7 14:59:08 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Tue, 7 Jan 2020 15:59:08 +0100 Subject: [14] 8235984: C2: assert(out->in(PhiNode::Region) == head || out->in(PhiNode::Region) == slow_head) failed: phi must be either part of the slow or the fast loop In-Reply-To: <08f66862-fd59-feb1-5f3b-a0df81c07721@oracle.com> References: <08f66862-fd59-feb1-5f3b-a0df81c07721@oracle.com> Message-ID: <6b15705e-8b49-488f-02a0-5ccf4e7a7d6b@oracle.com> Hi Christian, I agree with Tobias - let's go with fix b for 14, it seems to have the lowest risk. Reviewed. Best Regards, Nils On 2019-12-23 14:12, Tobias Hartmann wrote: > Hi Christian, > > version b) looks good to me and I would go with that for JDK 14. > > Best regards, > Tobias > > On 23.12.19 10:30, Christian Hagedorn wrote: >> Hi >> >> Please review the following options for: >> https://bugs.openjdk.java.net/browse/JDK-8235984 >> >> The problem: >> The original fix for JDK-8233033 [1] assumed that partially peeled statements always have a >> dependency inside the loop to be unswitched (i.e. when following their outputs eventually a phi node >> is hit belonging to either the slow or the fast loop). However, that is not always the case. As a >> result the assert was hit. >> >> I suggest the following options: >> >> Option a) http://cr.openjdk.java.net/~chagedorn/8235984/webrev.a.00/ >> We first check at the start of loop unswitching if there are loop predicates for the loop to be >> unswitched and if they have an additional control dependency to partially peeled statements (outcnt >>> 1). Then we explicitly check the assumption that partially peeled statements have a dependency in >> the loop to be unswitched (in order to keep the fix for JDK-8233033). If that is not the case, we >> bailout. We could then file an RFE for JDK-15 to handle this missing case properly and remove the >> bailout. 
>> >> Option b) http://cr.openjdk.java.net/~chagedorn/8235984/webrev.b.00/ >> We only check at the start of loop unswitching if there are loop predicates for the loop to be >> unswitched and if they have an additional control dependency to partially peeled statements (outcnt >>> 1). If that's the case we bailout without having the fix from JDK-8233033. We could then file an >> RFE for JDK-15 to properly handle partially peeled statements all together (kinda a REDO of the fix >> for JDK-8233033). >> >> Option c) >> Trying to fix the missing cases from JDK-8233033 for JDK-14 without a bailout. >> >> >> I've tried to come up with a fix (option c) last week but without success so far. The idea was to >> also clone the partially peeled statements without a dependency in the loop to be unswitched, change >> their control input to the correct cloned predicates and then add a phi node on each loop exit, >> where we merge the slow and fast loop, to select the correct value. However, this has not worked >> properly, yet and also involves a higher risk due to its complexity. I think we should not target >> that option for JDK-14 but do it for JDK-15 in an RFE. >> >> Thus, I'd opt for either option a) or b). I tested Tier 1-7 for the complete bailout b) and Tier 1-8 >> for the "conditioned" bailout a). Both look fine. I also ran some standard benchmarks comparing a) >> and b) to a baseline where I excluded the fix for JDK-8233033 (without bailing out and trying to fix >> the problem). I could not see any difference in performance. Therefore, it suggests to go with the >> low risk option b) for JDK-14 and do the entire fix in an RFE for JDK-15. >> >> What do you think? >> >> >> Thank you! 
>> >> Best regards, >> Christian >> >> >> [1] https://bugs.openjdk.java.net/browse/JDK-8233033 From nils.eliasson at oracle.com Tue Jan 7 15:02:15 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Tue, 7 Jan 2020 16:02:15 +0100 Subject: [14] RFR(XS): 8236140: assert(!VerifyHashTableKeys || _hash_lock == 0) failed: remove node from hash table before modifying it In-Reply-To: <2d1fd1d5-dfaa-a1d3-3b18-c0e9f0a58a01@oracle.com> References: <2d1fd1d5-dfaa-a1d3-3b18-c0e9f0a58a01@oracle.com> Message-ID: <38250d1f-f995-ff98-c683-7cfc6e111cd0@oracle.com> Hi Christian! The fix looks good to me. Reviewed. Best regards, Nils On 2020-01-07 12:53, Christian Hagedorn wrote: > Hi > > Please review the following patch: > https://bugs.openjdk.java.net/browse/JDK-8236140 > http://cr.openjdk.java.net/~chagedorn/8236140/webrev.00/ > > An input of the StoreL node in the test case was set without rehashing > the store which triggers the assertion failure. > > Thank you! > > Best regards, > Christian From igor.veresov at oracle.com Tue Jan 7 15:07:03 2020 From: igor.veresov at oracle.com (Igor Veresov) Date: Tue, 7 Jan 2020 07:07:03 -0800 Subject: [14] RFR(S) 8236050: Some compiler tests fail when executed with custom TieredLevel In-Reply-To: <7707d1d8-d5b0-7af1-877c-8fd0ffa5c84b@oracle.com> References: <6188d3c7-839f-b822-4293-97526cecac6a@oracle.com> <6ED127C5-BFCB-4955-97FE-DBFF0498FEE0@oracle.com> <474c5c62-6679-3ddb-0fc8-06ebc03f76bf@oracle.com> <58742AC8-84BD-425F-A64A-9CAA93DBD5A3@oracle.com> <8F745069-EAAE-4120-9577-8AEDA5A8EDCA@oracle.com> <1e4fb10a-2462-f43d-2ae6-2479f45daae9@oracle.com> <7707d1d8-d5b0-7af1-877c-8fd0ffa5c84b@oracle.com> Message-ID: <2F0775F8-2FD2-459B-A18F-4CB96D231DCA@oracle.com> Thanks, Tobias! igor > On Jan 6, 2020, at 11:31 PM, Tobias Hartmann wrote: > > > On 06.01.20 19:47, Igor Veresov wrote: >> Sure thing. A quick test added: http://cr.openjdk.java.net/~iveresov/8236050/webrev.03/ > > Looks good, thanks for adding! 
> > Best regards, > Tobias From christian.hagedorn at oracle.com Tue Jan 7 15:18:08 2020 From: christian.hagedorn at oracle.com (Christian Hagedorn) Date: Tue, 7 Jan 2020 16:18:08 +0100 Subject: [14] RFR(XS): 8236140: assert(!VerifyHashTableKeys || _hash_lock == 0) failed: remove node from hash table before modifying it In-Reply-To: <38250d1f-f995-ff98-c683-7cfc6e111cd0@oracle.com> References: <2d1fd1d5-dfaa-a1d3-3b18-c0e9f0a58a01@oracle.com> <38250d1f-f995-ff98-c683-7cfc6e111cd0@oracle.com> Message-ID: <328e7f32-c525-c471-8158-d80eb94cf7c2@oracle.com> Thank you Nils for the review! Best regards, Christian On 07.01.20 16:02, Nils Eliasson wrote: > Hi Christian! > > The fix looks good to me. > > Reviewed. > > Best regards, > Nils > > On 2020-01-07 12:53, Christian Hagedorn wrote: >> Hi >> >> Please review the following patch: >> https://bugs.openjdk.java.net/browse/JDK-8236140 >> http://cr.openjdk.java.net/~chagedorn/8236140/webrev.00/ >> >> An input of the StoreL node in the test case was set without rehashing >> the store which triggers the assertion failure. >> >> Thank you! >> >> Best regards, >> Christian > From christian.hagedorn at oracle.com Tue Jan 7 15:33:56 2020 From: christian.hagedorn at oracle.com (Christian Hagedorn) Date: Tue, 7 Jan 2020 16:33:56 +0100 Subject: [14] 8235984: C2: assert(out->in(PhiNode::Region) == head || out->in(PhiNode::Region) == slow_head) failed: phi must be either part of the slow or the fast loop In-Reply-To: <6b15705e-8b49-488f-02a0-5ccf4e7a7d6b@oracle.com> References: <08f66862-fd59-feb1-5f3b-a0df81c07721@oracle.com> <6b15705e-8b49-488f-02a0-5ccf4e7a7d6b@oracle.com> Message-ID: <820c7ec7-b7dc-134f-49ca-461e8516e88a@oracle.com> Thank you Nils for the review! Then let's go with option b). I filed an RFE to do a proper fix in 15 [1]. 
Best regards, Christian [1] https://bugs.openjdk.java.net/browse/JDK-8236722 On 07.01.20 15:59, Nils Eliasson wrote: > Hi Christian, > > I agree with Tobias - lets go with fix b for 14, it seems to have the > lowest risk. > > Reviewed. > > Best Regards, > Nils > > On 2019-12-23 14:12, Tobias Hartmann wrote: >> Hi Christian, >> >> version b) looks good to me and I would go with that for JDK 14. >> >> Best regards, >> Tobias >> >> On 23.12.19 10:30, Christian Hagedorn wrote: >>> Hi >>> >>> Please review the following options for: >>> https://bugs.openjdk.java.net/browse/JDK-8235984 >>> >>> The problem: >>> The original fix for JDK-8233033 [1] assumed that partially peeled >>> statements always have a >>> dependency inside the loop to be unswitched (i.e. when following >>> their outputs eventually a phi node >>> is hit belonging to either the slow or the fast loop). However, that >>> is not always the case. As a >>> result the assert was hit. >>> >>> I suggest the following options: >>> >>> Option a) http://cr.openjdk.java.net/~chagedorn/8235984/webrev.a.00/ >>> We first check at the start of loop unswitching if there are loop >>> predicates for the loop to be >>> unswitched and if they have an additional control dependency to >>> partially peeled statements (outcnt >>>> 1). Then we explicitly check the assumption that partially peeled >>>> statements have a dependency in >>> the loop to be unswitched (in order to keep the fix for JDK-8233033). >>> If that is not the case, we >>> bailout. We could then file an RFE for JDK-15 to handle this missing >>> case properly and remove the >>> bailout. >>> >>> Option b) http://cr.openjdk.java.net/~chagedorn/8235984/webrev.b.00/ >>> We only check at the start of loop unswitching if there are loop >>> predicates for the loop to be >>> unswitched and if they have an additional control dependency to >>> partially peeled statements (outcnt >>>> 1). If that's the case we bailout without having the fix from >>>> JDK-8233033. 
We could then file an >>> RFE for JDK-15 to properly handle partially peeled statements all >>> together (kinda a REDO of the fix >>> for JDK-8233033). >>> >>> Option c) >>> Trying to fix the missing cases from JDK-8233033 for JDK-14 without a >>> bailout. >>> >>> >>> I've tried to come up with a fix (option c) last week but without >>> success so far. The idea was to >>> also clone the partially peeled statements without a dependency in >>> the loop to be unswitched, change >>> their control input to the correct cloned predicates and then add a >>> phi node on each loop exit, >>> where we merge the slow and fast loop, to select the correct value. >>> However, this has not worked >>> properly, yet and also involves a higher risk due to its complexity. >>> I think we should not target >>> that option for JDK-14 but do it for JDK-15 in an RFE. >>> >>> Thus, I'd opt for either option a) or b). I tested Tier 1-7 for the >>> complete bailout b) and Tier 1-8 >>> for the "conditioned" bailout a). Both look fine. I also ran some >>> standard benchmarks comparing a) >>> and b) to a baseline where I excluded the fix for JDK-8233033 >>> (without bailing out and trying to fix >>> the problem). I could not see any difference in performance. >>> Therefore, it suggests to go with the >>> low risk option b) for JDK-14 and do the entire fix in an RFE for >>> JDK-15. >>> >>> What do you think? >>> >>> >>> Thank you! 
>>> >>> Best regards, >>> Christian >>> >>> >>> [1] https://bugs.openjdk.java.net/browse/JDK-8233033 > From tobias.hartmann at oracle.com Tue Jan 7 16:27:20 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Tue, 7 Jan 2020 17:27:20 +0100 Subject: RFR(XS): 8229855: C2 fails with assert(false) failed: bad AD file Message-ID: <790571f6-2a0e-613f-ebc8-0317f4a8f50c@oracle.com> Hi, please review the following patch: https://bugs.openjdk.java.net/browse/JDK-8229855 http://cr.openjdk.java.net/~thartmann/8229855/webrev.00/ For the switch statement in TestJumpTable.java, C2 creates the following binary search tree with a jump table at (*): if (key_val > 45) { if (key_val != 60) { if (key_val <= 60) { (*) } else { [...] } } else { [...] } } else { [...] } The jump table at (*) handles the following key_val ranges: {46..50}, {51}, {52..59}. Parse::jump_switch_ranges -> Parse::create_jump_tables adds a CastII to narrow the type of key_val to type int:46..59 (type int:0..13 after normalizing to zero): http://hg.openjdk.java.net/jdk/jdk/file/44cb1f517839/src/hotspot/share/opto/parse2.cpp#l884 Now after some rounds of loop unrolling, key_val is known to be int:60..260 (normalized to int:14..214) in one of the loop bodies. As a result, the narrow CastII node is replaced by TOP and we crash in the matcher because the JumpNode user didn't go away. The problem is that the key_val != 60 && key_val <= 60 combination is *not* found to be always false if key_val becomes known as int:60..260 because C2's type system does not narrow the type after the first check. The fix is to simply strengthen the second check to ensure that both control and data paths die consistently. 
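At the source level, the always-false combination described above can be sketched in plain Java (an illustrative sketch, not the actual regression test from the webrev):

```java
// Sketch of the control flow described above: once key_val is known to be
// in the range 60..260, the path "key_val != 60 && key_val <= 60" is dead.
class DeadPathSketch {
    // Returns true iff the inner (dead-for-the-narrowed-range) branch is reached.
    static boolean reachesDeadPath(int keyVal) {
        if (keyVal != 60) {
            if (keyVal <= 60) {
                return true; // only reachable for keyVal <= 59
            }
        }
        return false;
    }

    // For every value in [lo, hi] the inner branch must never be taken.
    static boolean deadForRange(int lo, int hi) {
        for (int v = lo; v <= hi; v++) {
            if (reachesDeadPath(v)) {
                return false;
            }
        }
        return true;
    }
}
```

Exhaustively checking the range 60..260 confirms the branch is unreachable there, which is exactly the consistency between control and data paths that the fix enforces.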
I would like to push this simple fix into JDK 14 and follow-up in JDK 15 with an enhancement that adds a general optimization for such integer checks: https://bugs.openjdk.java.net/browse/JDK-8236721 I've quickly hacked a prototype that shows promising results (and would also fix this issue): http://cr.openjdk.java.net/~thartmann/8236721/webrev.00/ Thanks, Tobias From tobias.hartmann at oracle.com Tue Jan 7 16:31:23 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Tue, 7 Jan 2020 17:31:23 +0100 Subject: [14] RFR(XS): 8236140: assert(!VerifyHashTableKeys || _hash_lock == 0) failed: remove node from hash table before modifying it In-Reply-To: <38250d1f-f995-ff98-c683-7cfc6e111cd0@oracle.com> References: <2d1fd1d5-dfaa-a1d3-3b18-c0e9f0a58a01@oracle.com> <38250d1f-f995-ff98-c683-7cfc6e111cd0@oracle.com> Message-ID: +1 Best regards, Tobias On 07.01.20 16:02, Nils Eliasson wrote: > Hi Christian! > > The fix looks good to me. > > Reviewed. > > Best regards, > Nils > > On 2020-01-07 12:53, Christian Hagedorn wrote: >> Hi >> >> Please review the following patch: >> https://bugs.openjdk.java.net/browse/JDK-8236140 >> http://cr.openjdk.java.net/~chagedorn/8236140/webrev.00/ >> >> An input of the StoreL node in the test case was set without rehashing the store which triggers >> the assertion failure. >> >> Thank you! >> >> Best regards, >> Christian > From rwestrel at redhat.com Tue Jan 7 16:37:56 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Tue, 07 Jan 2020 17:37:56 +0100 Subject: RFR(S): 8231291: C2: loop opts before EA should maximally unroll loops In-Reply-To: <7bc32dc2-1612-cb77-a467-ac67987b148c@oracle.com> References: <87o8w8ptsq.fsf@redhat.com> <8736dfkko7.fsf@redhat.com> <7bc32dc2-1612-cb77-a467-ac67987b148c@oracle.com> Message-ID: <87woa35ijf.fsf@redhat.com> Hi Vladimir, >>> Why you added check in must_be_not_null()? It is used only in library_call.cpp and should not relate to this changes. >>> Did you find some issues? 
>> >> I hit a crash when doing some CTW testing because an argument to >> must_be_not_null() was already known to be not null. I suppose it's >> unrelated to that change and something changed in the libraries instead. > > That's interesting. I initially thought it's just to simplify the IR and > avoid keeping redundant null check until loop opts are over. must_be_not_null() is there for correctness. For some intrinsics, we know that some parameter is not null and so the intrinsic implementation skips the null check. Without must_be_not_null() calls, we, for instance, add a load from an oop whose type is maybe a null pointer. After some transformation we may end up with a load from null. For instance if we have the following code shape: for (;;) { if (p == null) { .. } v = p.f; // intrinsic. Load without a null check } after unswitching: if (p == null) { for (;;) { v = null.f; // crashes the compiler } } else { for (;;) { v = p.f; } } > Doesn't it signal about a more general problem with > GraphKit::must_be_not_null()? > > What if a value becomes non-null after GraphKit::must_be_not_null()? Is > there a chance to hit the very same problem you observed? The problem I hit is that if the value passed to must_be_not_null is already known to be not null. The null branch of must_be_not_null optimizes out at parse time and: C->root()->add_req(halt); adds top to the RootNode which causes some failures during matching because top is unexpected from Root. Running RootNode::Ideal() would clean it up but it's not called. If the null branch becomes dead after parsing then, AFAICT, RootNode::Ideal() would be executed and the same problem would not happen. Roland. 
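For readers less familiar with the transformation in Roland's example, loop unswitching corresponds to the following source-level rewrite (a sketch only; C2 performs it on its IR, not on Java source, and the method names here are illustrative):

```java
class UnswitchSketch {
    // Before unswitching: the loop-invariant test sits inside the loop body.
    static int sumBefore(int[] a, boolean negate) {
        int s = 0;
        for (int i = 0; i < a.length; i++) {
            if (negate) {
                s -= a[i];
            } else {
                s += a[i];
            }
        }
        return s;
    }

    // After unswitching: the invariant test is hoisted and the loop is
    // duplicated, so each copy runs a branch-free body.
    static int sumAfter(int[] a, boolean negate) {
        int s = 0;
        if (negate) {
            for (int i = 0; i < a.length; i++) s -= a[i];
        } else {
            for (int i = 0; i < a.length; i++) s += a[i];
        }
        return s;
    }
}
```

Both versions compute the same result; the danger Roland describes arises when a load without its guarding null check is duplicated into the branch where the guard's condition is false.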
From christian.hagedorn at oracle.com Tue Jan 7 16:48:48 2020 From: christian.hagedorn at oracle.com (Christian Hagedorn) Date: Tue, 7 Jan 2020 17:48:48 +0100 Subject: [14] RFR(XS): 8236140: assert(!VerifyHashTableKeys || _hash_lock == 0) failed: remove node from hash table before modifying it In-Reply-To: References: <2d1fd1d5-dfaa-a1d3-3b18-c0e9f0a58a01@oracle.com> <38250d1f-f995-ff98-c683-7cfc6e111cd0@oracle.com> Message-ID: <1937385b-c626-b697-cd36-8b4dd1c4087a@oracle.com> Thank you for the review Tobias! Best regards, Christian On 07.01.20 17:31, Tobias Hartmann wrote: > +1 > > Best regards, > Tobias > > On 07.01.20 16:02, Nils Eliasson wrote: >> Hi Christian! >> >> The fix looks good to me. >> >> Reviewed. >> >> Best regards, >> Nils >> >> On 2020-01-07 12:53, Christian Hagedorn wrote: >>> Hi >>> >>> Please review the following patch: >>> https://bugs.openjdk.java.net/browse/JDK-8236140 >>> http://cr.openjdk.java.net/~chagedorn/8236140/webrev.00/ >>> >>> An input of the StoreL node in the test case was set without rehashing the store which triggers >>> the assertion failure. >>> >>> Thank you! >>> >>> Best regards, >>> Christian >> From luhenry at microsoft.com Tue Jan 7 22:16:05 2020 From: luhenry at microsoft.com (Ludovic Henry) Date: Tue, 7 Jan 2020 22:16:05 +0000 Subject: Fix potential bug in jaotc Message-ID: Hello, As I was learning and reading over the codebase of src/jdk.aot and src/hotspot/share/aot, I discovered a discrepancy which might lead to future bugs. The issue is around how some of the metadata is generated and loaded between jaotc and the runtime. In the runtime, `aot_metadata` (src/hotspot/share/aot/aotCompiledMethod.hpp) contains metadata about each method that is compiled in the AOT image - for example, information like the implicit and explicit exception table, or the oop map. 
Because the size of this data is specific to each method, `aot_metadata` contains fields that store offsets to the `this` pointer, and the data is stored right after the `aot_metadata` structure in the AOT image. The structure of `aot_metadata` is the following: class aot_metadata { int _size; int _code_size; int _entry; int _verified_entry; int _exception_handler_offset; int _deopt_handler_offset; int _stubs_offset; int _frame_size; int _orig_pc_offset; int _unsafe_access; // contain the offset to the `this` pointer int _pc_desc_begin; int _scopes_begin; int _reloc_begin; int _exception_table_begin; int _nul_chk_table_begin; int _oopmap_begin; } When creating the AOT image and generating the information in `jdk.tools.jaotc.MetadataBuilder` (src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java), the `aot_metadata._nul_chk_table_begin` field is conditionally generated. In the case where there is no implicit exceptions in the method, no data/field is generated in the AOT image. However, there is no check for the existence of this field when loading the `aot_metadata` from the AOT image. This issue is mitigated by the fact that the `HotSpotMetaData.implicitExceptionBytes` is always set, even if only with an empty `byte[]`. Nonetheless, this field is populated in the runtime in `getMetadata` (in src/hotspot/share/jvmci/jvmciCompilerToVM.cpp), making the initialization and usage of this field far apart, complexifying the overall solution. The following patch proposes to always generate the `aot_metadata._nul_chk_table_begin` field. It also has the advantage to unify how this field and the others are generated. 
diff --git a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java index 34ce4d8c8b..fa89d34796 100644 --- a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java +++ b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java @@ -98,7 +98,8 @@ final class MetadataBuilder { byte[] scopeDesc = metaData.scopesDescBytes(); byte[] relocationInfo = metaData.relocBytes(); byte[] oopMapInfo = metaData.oopMaps(); - byte[] implicitExceptionBytes = HotSpotGraalServices.getImplicitExceptionBytes(metaData); + byte[] exceptionBytes = metaData.exceptionBytes(); + byte[] implicitExceptionBytes = metaData.implicitExceptionBytes(); // create a global symbol at this position for this method NativeOrderOutputStream metadataStream = new NativeOrderOutputStream(); @@ -143,10 +144,7 @@ final class MetadataBuilder { NativeOrderOutputStream.PatchableInt scopeOffset = metadataStream.patchableInt(); NativeOrderOutputStream.PatchableInt relocationOffset = metadataStream.patchableInt(); NativeOrderOutputStream.PatchableInt exceptionOffset = metadataStream.patchableInt(); - NativeOrderOutputStream.PatchableInt implictTableOffset = null; - if (implicitExceptionBytes != null) { - implictTableOffset = metadataStream.patchableInt(); - } + NativeOrderOutputStream.PatchableInt implictTableOffset = metadataStream.patchableInt(); NativeOrderOutputStream.PatchableInt oopMapOffset = metadataStream.patchableInt(); metadataStream.align(8); @@ -160,12 +158,10 @@ final class MetadataBuilder { metadataStream.put(relocationInfo).align(8); exceptionOffset.set(metadataStream.position()); - metadataStream.put(metaData.exceptionBytes()).align(8); + metadataStream.put(exceptionBytes).align(8); - if (implicitExceptionBytes != null) { - implictTableOffset.set(metadataStream.position()); - metadataStream.put(implicitExceptionBytes).align(8); - } + 
implictTableOffset.set(metadataStream.position()); + metadataStream.put(implicitExceptionBytes).align(8); // oopmaps should be last oopMapOffset.set(metadataStream.position()); diff --git a/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java b/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java index fc54c41312..3acb6191b9 100644 --- a/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java +++ b/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java @@ -41,6 +41,7 @@ public class HotSpotMetaData { scopesDescBytes = new byte[0]; relocBytes = new byte[0]; exceptionBytes = new byte[0]; + implicitExceptionBytes = new byte[0]; oopMaps = new byte[0]; metadata = new String[0]; // ...some of them will be overwritten by the VM: Thank you, -- Ludovic From tobias.hartmann at oracle.com Wed Jan 8 10:09:01 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Wed, 8 Jan 2020 11:09:01 +0100 Subject: RFR(S): 8235762: JVM crash in SWPointer during C2 compilation In-Reply-To: References: <4ef39906-839b-9a08-3225-f99b974f08de@oracle.com> Message-ID: <4103e17e-57c4-ed78-bf64-1bb3cb99e9f4@oracle.com> Hi Felix, On 16.12.19 11:14, Yangfei (Felix) wrote: > -- OK. I will add one assertion immediately before breaking the loop. Thanks. >> - Why do you need max_idx? Isn't is the case that if find_align_to_ref returns >> NULL, there aren't any comparable memory operations left and it essentially >> doesn't matter which one you chose? > > -- I was considering minimizing the performance impact of the patch here. > best_align_to_mem_ref is used in SuperWord::align_initial_loop_index for adjusting the pre-loop limit. > For the test case in the webrev, best_align_to_mem_ref was chosen from node 470 (StoreB) and node 431 (StoreL). 
> The vector width for these two memory operations are different on aarch64 platform: vw = 16 bytes for node 431 and 2 bytes for node 470. > SuperWord::align_initial_loop_index will emit different code sequences for the test case. > The max_idx tells us which memory operation has the biggest vector size. Okay, thanks for the details. Could you add a comment that explains this? >> Also, is it guaranteed that max_idx is always initialized in SuperWord::find_align_to_ref? > > -- Yes, I think so. > When memops is not empty and the memory ops in memops are not comparable, find_align_to_ref will always sets its max_idx. Okay. Thanks, Tobias From goetz.lindenmaier at sap.com Wed Jan 8 13:41:52 2020 From: goetz.lindenmaier at sap.com (Lindenmaier, Goetz) Date: Wed, 8 Jan 2020 13:41:52 +0000 Subject: [14] RFR: 8236772: Fix build for windows 32-bit after 8212160 and 8234331. Message-ID: Hi, Two warnings fail the windows 32 bit build with VS 2017. In count_leading_zeroes.hpp, a declaration is not included in the _LP64 that covers all its uses. Thus a warning about an unused variable is raised. A library for a test lacks the JNIEXPORT and JNICALL required to match the signature in jvmti.h. I'd like to push this to jdk14. Please review: http://cr.openjdk.java.net/~goetz/wr20/8236772-fix_ntintel_build-jdk14/01/ Best regards, Goetz. From christoph.langer at sap.com Wed Jan 8 14:50:44 2020 From: christoph.langer at sap.com (Langer, Christoph) Date: Wed, 8 Jan 2020 14:50:44 +0000 Subject: [14] RFR: 8236772: Fix build for windows 32-bit after 8212160 and 8234331. In-Reply-To: References: Message-ID: Hi Goetz, looks good and trivial to me. Cheers Christoph > -----Original Message----- > From: hotspot-compiler-dev bounces at openjdk.java.net> On Behalf Of Lindenmaier, Goetz > Sent: Mittwoch, 8. Januar 2020 14:42 > To: hotspot-compiler-dev at openjdk.java.net > Subject: [SUSPECTED SPAM] [CAUTION] [14] RFR: 8236772: Fix build for > windows 32-bit after 8212160 and 8234331.
> > Hi, > > Two warning fail the windows 32 bit build with VS 2017. > > In count_leading_zeroes.hpp, a declaration is not included > in the _LP64 that covers all it's uses. Thus a warning > about an unused variable is raised. > > A library for a test lacks the JNIEXPORT and JNICALL > required to match the signature in jvmti.h. > > I'd like to push this to jdk14. > > Please review: > http://cr.openjdk.java.net/~goetz/wr20/8236772-fix_ntintel_build- > jdk14/01/ > > Best regards, > Goetz. From smita.kamath at intel.com Thu Jan 9 01:22:14 2020 From: smita.kamath at intel.com (Kamath, Smita) Date: Thu, 9 Jan 2020 01:22:14 +0000 Subject: RFR(M):8167065: Add intrinsic support for double precision shifting on x86_64 In-Reply-To: References: <6563F381B547594081EF9DE181D07912B2D6B7A7@fmsmsx123.amr.corp.intel.com> <70b5bff1-a2ad-6fea-24ee-f0748e2ccae6@oracle.com> <3914890d-74e3-c69e-6688-55cf9f0c8551@oracle.com> <8919a5ce-727a-fb25-e739-4c14da108a7a@oracle.com> Message-ID: Hi Alan, Thanks for the comment. I added these checks for caution but on further analysis it appears that these checks are not needed. 1) For PrimitiveRightShift, we have the input array and its length passed as arguments. Since the input array, result[], is copied using arraycopy, we know that IOOBE will be thrown. Therefore, it is safe to remove the checkIndex condition. int result[] = new int[nInts+len+1]; System.arraycopy(a, 0, result, 0, len); primitiveRightShift(result, result.length, 32 - nBits); 2) In case of PrimitiveLeftShift, I checked the call stack and verified that the length passed to the method is same as the length of the array. Therefore, the check is not necessary in this scenario. 3) In case of shiftLeft and shiftRight, the number of iterations (numIter) is computed from the mag.length. The newMag array length is also computed and adjusted based on mag.length. If these changes look fine, I can send an updated webrev removing these checks. Please let me know. 
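For context, the bounds check under discussion behaves as follows (a standalone illustration of java.util.Objects.checkFromToIndex, not BigInteger internals):

```java
import java.util.Objects;

class BoundsCheckDemo {
    // Returns true if checkFromToIndex(from, to, length) throws
    // IndexOutOfBoundsException; it throws when from < 0, from > to,
    // or to > length, and otherwise returns from.
    static boolean rejects(int from, int to, int length) {
        try {
            Objects.checkFromToIndex(from, to, length);
            return false;
        } catch (IndexOutOfBoundsException e) {
            return true;
        }
    }
}
```

This is why the checks read as always-on asserts: if the surrounding code already guarantees the range is valid, the call can never throw and is safe to remove.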
Thanks and Regards, Smita Kamath -----Original Message----- From: Alan Bateman Sent: Saturday, December 28, 2019 12:22 AM To: Vladimir Kozlov ; Kamath, Smita Cc: core-libs-dev at openjdk.java.net; 'hotspot compiler' Subject: Re: RFR(M):8167065: Add intrinsic support for double precision shifting on x86_64 On 20/12/2019 22:19, Vladimir Kozlov wrote: > We should have added core-libs to review since you modified > BigInteger.java. > This adds Objects.checkFromToIndex checks in the middle of several supporting methods. Is IOOBE really possible in these cases or are these stand-ins for always-on asserts to ensure the intrinsic is never used when the preconditions aren't satisfied? -Alan From Pengfei.Li at arm.com Thu Jan 9 09:36:21 2020 From: Pengfei.Li at arm.com (Pengfei Li) Date: Thu, 9 Jan 2020 09:36:21 +0000 Subject: RFR(S): 8230591: AArch64: Missing intrinsics for Math.ceil, floor, rint Message-ID: Hi, JBS: https://bugs.openjdk.java.net/browse/JDK-8230591 Webrev: http://cr.openjdk.java.net/~pli/rfr/8230591/webrev.00/ This patch adds missing intrinsics for Math.ceil, floor and rint on AArch64 with Instructions FRINTP, FRINTM and FRINTN, respectively. These 3 instructions respect the Java API Spec of the 3 math methods quite well, especially for special cases such as +/-0.0, +/-Inf and NaN. This also adds vectorized forms of these intrinsics. Intel has already added jtreg [1] and benchmark [2] for these intrinsics in its x86 patch [3]. Tests: Jtreg: no new failure after my change. JMH FpRoundingBenchmark.java[2]: (Before) Benchmark (TESTSIZE) Mode Cnt Score Error Units FpRoundingBenchmark.testceil 1024 thrpt 15 255.867 ± 0.123 ops/ms FpRoundingBenchmark.testfloor 1024 thrpt 15 249.013 ± 0.118 ops/ms FpRoundingBenchmark.testrint 1024 thrpt 15 419.425 ± 5.868 ops/ms (After) Benchmark (TESTSIZE) Mode Cnt Score Error Units FpRoundingBenchmark.testceil 1024 thrpt 15 1510.861 ± 2.626 ops/ms FpRoundingBenchmark.testfloor 1024 thrpt 15 1510.289 ±

6.147 ops/ms FpRoundingBenchmark.testrint 1024 thrpt 15 1510.780 ± 1.091 ops/ms [1] https://hg.openjdk.java.net/jdk/jdk/file/b94889c7e153/test/hotspot/jtreg/compiler/c2/cr6340864/TestDoubleVect.java [2] https://hg.openjdk.java.net/jdk/jdk/file/b94889c7e153/test/micro/org/openjdk/bench/java/math/FpRoundingBenchmark.java [3] https://hg.openjdk.java.net/jdk/jdk/rev/6fc57e391539 -- Thanks, Pengfei From aph at redhat.com Thu Jan 9 10:28:03 2020 From: aph at redhat.com (Andrew Haley) Date: Thu, 9 Jan 2020 10:28:03 +0000 Subject: RFR(S): 8230591: AArch64: Missing intrinsics for Math.ceil, floor, rint In-Reply-To: References: Message-ID: On 1/9/20 9:36 AM, Pengfei Li wrote: > JBS: https://bugs.openjdk.java.net/browse/JDK-8230591 > Webrev: http://cr.openjdk.java.net/~pli/rfr/8230591/webrev.00/ These instructions fall into the category "AdvSIMD two-reg misc". Please put them there. Otherwise OK, thanks. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671
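The special cases called out in the RFR (±0.0, ±Inf, NaN) are pinned down by the Math API spec; a quick Java sanity check of the expected semantics (illustrative only, unrelated to the AArch64 webrev itself):

```java
class RoundingSpecDemo {
    // Distinguishes -0.0 from +0.0 via the raw bit pattern, since
    // (-0.0 == 0.0) is true under ordinary double comparison.
    static boolean isNegativeZero(double d) {
        return Double.doubleToRawLongBits(d) == Double.doubleToRawLongBits(-0.0d);
    }
}
```

Per the spec, Math.ceil of a value in (-1.0, 0) is -0.0, Math.rint rounds halfway cases to even, and NaN/infinities pass through unchanged, which is why an instruction must honor all three to be a valid intrinsic.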
From tobias.hartmann at oracle.com Thu Jan 9 13:08:47 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Thu, 9 Jan 2020 14:08:47 +0100 Subject: RFR(XS): 8229855: C2 fails with assert(false) failed: bad AD file In-Reply-To: <87lfqg6cln.fsf@redhat.com> References: <790571f6-2a0e-613f-ebc8-0317f4a8f50c@oracle.com> <87lfqg6cln.fsf@redhat.com> Message-ID: <3ce85c62-2cfc-0e94-ba7d-28c3b9961a9f@oracle.com> Hi Roland, thanks for the review! Best regards, Tobias On 09.01.20 13:25, Roland Westrelin wrote: > >> I would like to push this simple fix into JDK 14 and follow-up in JDK 15 with an enhancement that >> adds a general optimization for such integer checks: >> https://bugs.openjdk.java.net/browse/JDK-8236721 > > That sounds ok to me. > > Roland. > From tobias.hartmann at oracle.com Thu Jan 9 13:22:24 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Thu, 9 Jan 2020 14:22:24 +0100 Subject: [15] RFR(S): 8236721: C2 should better optimize not-equal integer comparisons Message-ID: <480a02e6-1d4b-900f-28cf-ce37b6382e81@oracle.com> Hi, please review the following patch: https://bugs.openjdk.java.net/browse/JDK-8236721 http://cr.openjdk.java.net/~thartmann/8236721/webrev.00/ While working on JDK-8229855 [1], I've noticed that C2 is not able to fold not-equal integer comparisons of the following kind: val = int:lo..hi if (val != lo) { if (val <= lo) { // Dead code } } In general, C2's type system is quite limited when it comes to representing "sparse" types. But if an integer value is found to be unequal to its lower/upper boundary, we can still narrow the type. With this enhancement, all checks in the 8 methods in TestFoldNECompares.java are folded and the methods are compiled as empty.
Thanks, Tobias [1] https://bugs.openjdk.java.net/browse/JDK-8229855 From felix.yang at huawei.com Fri Jan 10 01:13:23 2020 From: felix.yang at huawei.com (Yangfei (Felix)) Date: Fri, 10 Jan 2020 01:13:23 +0000 Subject: RFR(S): 8235762: JVM crash in SWPointer during C2 compilation In-Reply-To: <4103e17e-57c4-ed78-bf64-1bb3cb99e9f4@oracle.com> References: <4ef39906-839b-9a08-3225-f99b974f08de@oracle.com> <4103e17e-57c4-ed78-bf64-1bb3cb99e9f4@oracle.com> Message-ID: Hi Tobias, Updated webrev: http://cr.openjdk.java.net/~fyang/8235762/webrev.01/ This also updated copyright years of the newly-added test case. Still test OK with the latest repo. Is this one better? Thanks, Felix > > Hi Felix, > > On 16.12.19 11:14, Yangfei (Felix) wrote: > > -- OK. I will add one assertion immediately before breaking the loop. > > Thanks. > > >> - Why do you need max_idx? Isn't is the case that if > >> find_align_to_ref returns NULL, there aren't any comparable memory > >> operations left and it essentially doesn't matter which one you chose? > > > > -- I was considering minimizing the performance impact of the patch here. > > best_align_to_mem_ref is used in SuperWord::align_initial_loop_index for > adjusting the pre-loop limit. > > For the test case in the webrev, best_align_to_mem_ref was chosen from > node 470 (StoreB) and node 431 (StoreL). > > The vector width for these two memory operations are different on > aarch64 platform: vw = 16 bytes for node 431 and 2 bytes for node 470. > > SuperWord::align_initial_loop_index will emit different code sequences for > the test case. > > The max_idx tells us which memory operation has the biggest vector size. > > Okay, thanks for the details. Could you add a comment that explains this? > > >> Also, is it guaranteed that max_idx is always initialized in > SuperWord::find_align_to_ref? > > > > -- Yes, I think so. > > When memops is not empty and the memory ops in memops are not > comparable, find_align_to_ref will always sets its max_idx. 
> > Okay. > > Thanks, > Tobias From Pengfei.Li at arm.com Fri Jan 10 03:34:38 2020 From: Pengfei.Li at arm.com (Pengfei Li) Date: Fri, 10 Jan 2020 03:34:38 +0000 Subject: RFR(S): 8230591: AArch64: Missing intrinsics for Math.ceil, floor, rint In-Reply-To: References: Message-ID: Hi Andrew, > These instructions fall into the category "AdvSIMD two-reg misc". Please put > them there. Otherwise OK, thanks. Some of these rounding instructions do not perfectly match category "AdvSIMD two-reg misc". In that category, we use "f((int)(T >> 1), 23, 22)" where the SIMD_Arrangement T is "100", "101" or "111" for "T2S", "T4S" or "T2D" respectively. In this case, the 23rd bit is always set to 1. But for some rounding instructions (FRINTN/FRINTM), the 23rd bit should be 0. Moreover, I don't think instructions FSQRT/FABS/FNEG should be put into that category as their 23rd bit is not actually derived from the data type (or we say the SIMD_Arrangement T), although there's no error since those bits happen to be 1. So for better instruction categories, I would like to add a new category (perhaps named "AdvSIMD two-reg arithmetic") and put rounding instructions as well as FSQRT/FABS/FNEG into this. I will upload a new webrev if you agree with me. -- Thanks, Pengfei From aph at redhat.com Fri Jan 10 09:38:57 2020 From: aph at redhat.com (Andrew Haley) Date: Fri, 10 Jan 2020 09:38:57 +0000 Subject: RFR(S): 8230591: AArch64: Missing intrinsics for Math.ceil, floor, rint In-Reply-To: References: Message-ID: <98800e19-9bc1-654f-443c-d1fc1adfe4b7@redhat.com> hi, On 1/10/20 3:34 AM, Pengfei Li wrote: >> These instructions fall into the category "AdvSIMD two-reg misc". Please put >> them there. Otherwise OK, thanks. > > Some of these rounding instructions do not perfectly match category > "AdvSIMD two-reg misc". In that category, we use "f((int)(T >> 1), > 23, 22)" where the SIMD_Arrangement T is "100", "101" or "111" for > "T2S", "T4S" or "T2D" respectively. 
In this case, the 23rd bit is > always set to 1. But for some rounding instructions (FRINTN/FRINTM), > the 23rd bit should be 0. Sure, but the code in "AdvSIMD two-reg misc" can easily be made flexible enough to accommodate that. All you have to do as far as I can see is make size a parameter: everything else is the same. > Moreover, I don't think instructions FSQRT/FABS/FNEG should be put > into that category as their 23rd bit is not actually derived from > the data type (or we say the SIMD_Arrangement T), although there's > no error since those bits happen to be 1. > > So for better instruction categories, I would like to add a new > category (perhaps named "AdvSIMD two-reg arithmetic") and put > rounding instructions as well as FSQRT/FABS/FNEG into this. I will > upload a new webrev if you agree with me. The grouping names are defined by the ARM reference manual, they are not something we should invent for ourselves. If you go to the RM, see "Advanced SIMD two-register miscellaneous"; the encodings are all defined in terms of U, size, and opcode. The important thing is that our assembler should follow the layout and classification in that manual as closely as we possibly can. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From vladimir.x.ivanov at oracle.com Fri Jan 10 18:34:19 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Fri, 10 Jan 2020 21:34:19 +0300 Subject: RFR(S): 8231291: C2: loop opts before EA should maximally unroll loops In-Reply-To: <87woa35ijf.fsf@redhat.com> References: <87o8w8ptsq.fsf@redhat.com> <8736dfkko7.fsf@redhat.com> <7bc32dc2-1612-cb77-a467-ac67987b148c@oracle.com> <87woa35ijf.fsf@redhat.com> Message-ID: <9f364210-53e4-0b4d-df3e-a02aa5219cb0@oracle.com> Thanks for the clarifications, Roland. >> Doesn't it signal about a more general problem with >> GraphKit::must_be_not_null()?
>> >> What if a value becomes non-null after GraphKit::must_be_not_null()? Is >> there a chance to hit the very same problem you observed? > > The problem I hit is that the value passed to must_be_not_null is > already known to be not null. The null branch of must_be_not_null > optimizes out at parse time and: > > C->root()->add_req(halt); > > adds top to the RootNode which causes some failures during matching > because top is unexpected from Root. > > Running RootNode::Ideal() would clean it up but it's not called. If the > null branch becomes dead after parsing then, AFAICT, RootNode::Ideal() > would be executed and the same problem would not happen. Makes sense. Do you know why RootNode::Ideal() isn't called during IGVN pass after parsing is over? Best regards, Vladimir Ivanov From sandhya.viswanathan at intel.com Fri Jan 10 19:35:12 2020 From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya) Date: Fri, 10 Jan 2020 19:35:12 +0000 Subject: [14]: RFR(XXS): 8236944:The legVecZ operand should be limited to zmm0-zmm15 registers Message-ID: JBS: https://bugs.openjdk.java.net/browse/JDK-8236944 Webrev: http://cr.openjdk.java.net/~sviswanathan/8236944/webrev.00/ Currently legVecZ operand is defined as follows: operand legVecZ() %{ constraint(ALLOC_IN_RC(vectorz_reg_vl)); match(VecZ); format %{ %} interface(REG_INTER); %} This allows legVecZ to be zmm0-zmm31 on platforms supporting AVX 512 vl. The legVecZ should instead be defined as: operand legVecZ() %{ constraint(ALLOC_IN_RC(vectorz_reg_legacy)); match(VecZ); format %{ %} interface(REG_INTER); %} Please consider this for JDK 14. Best Regards, Sandhya From vladimir.x.ivanov at oracle.com Fri Jan 10 21:50:39 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Sat, 11 Jan 2020 00:50:39 +0300 Subject: [14]: RFR(XXS): 8236944:The legVecZ operand should be limited to zmm0-zmm15 registers In-Reply-To: References: Message-ID: Looks good. 
Best regards, Vladimir Ivanov On 10.01.2020 22:35, Viswanathan, Sandhya wrote: > JBS: https://bugs.openjdk.java.net/browse/JDK-8236944 > > Webrev: http://cr.openjdk.java.net/~sviswanathan/8236944/webrev.00/ > > Currently legVecZ operand is defined as follows: > > operand legVecZ() %{ > > constraint(ALLOC_IN_RC(vectorz_reg_vl)); > > match(VecZ); > > format %{ %} > > interface(REG_INTER); > > %} > > This allows legVecZ to be zmm0-zmm31 on platforms supporting AVX 512 vl. > > The legVecZ should instead be defined as: > > operand legVecZ() %{ > > constraint(ALLOC_IN_RC(vectorz_reg_legacy)); > > match(VecZ); > > format %{ %} > > interface(REG_INTER); > > %} > > Please consider this for JDK 14. > > Best Regards, > > Sandhya > From vladimir.x.ivanov at oracle.com Fri Jan 10 23:39:23 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Sat, 11 Jan 2020 02:39:23 +0300 Subject: [14]: RFR(XXS): 8236944:The legVecZ operand should be limited to zmm0-zmm15 registers In-Reply-To: References: Message-ID: <951f79b5-3418-0803-b1ec-627270cac012@oracle.com> FYI test results are clean (hs-precheckin-comp,hs-tier1,hs-tier2). Best regards, Vladimir Ivanov On 11.01.2020 00:50, Vladimir Ivanov wrote: > Looks good. > > Best regards, > Vladimir Ivanov > > On 10.01.2020 22:35, Viswanathan, Sandhya wrote: >> JBS: https://bugs.openjdk.java.net/browse/JDK-8236944 >> >> Webrev: http://cr.openjdk.java.net/~sviswanathan/8236944/webrev.00/ >> >> Currently legVecZ operand is defined as follows: >> >> operand legVecZ() %{ >> >> constraint(ALLOC_IN_RC(vectorz_reg_vl)); >> >> match(VecZ); >> >> format %{ %} >> >> interface(REG_INTER); >> >> %} >> >> This allows legVecZ to be zmm0-zmm31 on platforms supporting AVX 512 vl. >> >> The legVecZ should instead be defined as: >> >> operand legVecZ() %{ >> >> constraint(ALLOC_IN_RC(vectorz_reg_legacy)); >> >> match(VecZ); >> >> format %{ %} >> >> 
interface(REG_INTER); >> >> %} >> >> Please consider this for JDK 14. >> >> Best Regards, >> >> Sandhya >> From sandhya.viswanathan at intel.com Sat Jan 11 00:25:27 2020 From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya) Date: Sat, 11 Jan 2020 00:25:27 +0000 Subject: [14]: RFR(XXS): 8236944:The legVecZ operand should be limited to zmm0-zmm15 registers In-Reply-To: <951f79b5-3418-0803-b1ec-627270cac012@oracle.com> References: <951f79b5-3418-0803-b1ec-627270cac012@oracle.com> Message-ID: Thanks a lot Vladimir! Can I push this onto JDK 14 branch? Best Regards, Sandhya -----Original Message----- From: Vladimir Ivanov Sent: Friday, January 10, 2020 3:39 PM To: Viswanathan, Sandhya ; hotspot compiler Subject: Re: [14]: RFR(XXS): 8236944:The legVecZ operand should be limited to zmm0-zmm15 registers FYI test results are clean (hs-precheckin-comp,hs-tier1,hs-tier2). Best regards, Vladimir Ivanov On 11.01.2020 00:50, Vladimir Ivanov wrote: > Looks good. > > Best regards, > Vladimir Ivanov > > On 10.01.2020 22:35, Viswanathan, Sandhya wrote: >> JBS: https://bugs.openjdk.java.net/browse/JDK-8236944 >> >> Webrev: http://cr.openjdk.java.net/~sviswanathan/8236944/webrev.00/ >> >> Currently legVecZ operand is defined as follows: >> >> operand legVecZ() %{ >> >> constraint(ALLOC_IN_RC(vectorz_reg_vl)); >> >> match(VecZ); >> >> format %{ %} >> >> interface(REG_INTER); >> >> %} >> >> This allows legVecZ to be zmm0-zmm31 on platforms supporting AVX 512 vl. >> >> The legVecZ should instead be defined as: >> >> operand legVecZ() %{ >> >> constraint(ALLOC_IN_RC(vectorz_reg_legacy)); >> >> match(VecZ); >> >> format %{ %} >> >> interface(REG_INTER); >> >> %} >> >> Please consider this for JDK 14. 
>> >> Best Regards, >> >> Sandhya >> From jatin.bhateja at intel.com Sun Jan 12 18:12:16 2020 From: jatin.bhateja at intel.com (Bhateja, Jatin) Date: Sun, 12 Jan 2020 18:12:16 +0000 Subject: [14] RFR(S): 8236443 : Issues with specializing vector register type for phi operand with generic operands In-Reply-To: References: Message-ID: Hi Vladimir, Please find the updated patch at the link below. http://cr.openjdk.java.net/~jbhateja/8236443/webrev.04/ Regards, Jatin > -----Original Message----- > From: Vladimir Ivanov > Sent: Friday, January 3, 2020 4:46 AM > To: Bhateja, Jatin ; hotspot-compiler- > dev at openjdk.java.net > Subject: Re: [14] RFR(S): 8236443 : Issues with specializing vector register type > for phi operand with generic operands > > Hi Jatin, > > > Webrev: http://cr.openjdk.java.net/~jbhateja/8236443/webrev.02/ > > I like that you put the check on use side (ShiftV + ideal_in_idx). > > Though ShiftCntV def is the only viable option in practice (phis shouldn't occur > on x86 and would trigger a bug due to mismatch between node type and > ideal_reg), it's nice to see such case being ruled out. > > Some comments: > > ================================================================ > ======= > src/hotspot/cpu/x86/x86.ad: > > +const uint Matcher::special_vector_ideal_reg(int ideal_reg, int opcode, > int ideal_in_idx, int size) { > > I don't see a compelling reason why it should be a platform-specific > check. ShiftCntV nodes override Node::ideal_reg() and putting the check > in shared code looks appropriate. > > Considering how LShiftCntVNode::ideal_reg is defined [1], IMO calling > Matcher::vector_shift_count_ideal_reg(t->length_in_bytes()) is better > than hardcoding VecS ideal register (which is x86-specific). > > > ================================================================ > ======= > + case Op_RShiftCntV: > + case Op_LShiftCntV: > + // Ideal index corresponding to def operand. 
> + if (ideal_in_idx == -1) > + return Op_VecS; > + break; > > Shouldn't ideal_in_idx be 0 instead? > > Some stylistic notes: > > ================================================================ > ======= > src/hotspot/cpu/x86/x86.ad: > > +MachOper* Matcher::specialize_vector_operand_helper(Node* m, > MachNode* > n, int op_idx) { > > It's a bit confusing to see Node* named "m" and MachNode* named "n". In > the code base it's usually the opposite: n stands for Node and m stands > for MachNode. > > But maybe "def" and "use" are better names? > > > ================================================================ > ======= > > + switch(opcode) { > + default: > + return ideal_reg; > + case Op_LShiftVB: > ... > + case Op_URShiftVL: > + if (ideal_in_idx == 2) > + return Op_VecS; > + break; > + case Op_RShiftCntV: > + case Op_LShiftCntV: > + // Ideal index corresponding to def operand. > + if (ideal_in_idx == -1) > + return Op_VecS; > + break; > + } > > Does it make sense to introduce ShiftV and ShiftCntV node classes [2] > and use Node::is_ShiftV()/is_ShiftCntV() checks instead? > > ================================================================ > ======= > > + switch(opcode) { > + default: > + return ideal_reg; > ... > + } > + return ideal_reg; > > return statement is redundant. > > > ================================================================ > ======= > > src/hotspot/share/opto/matcher.cpp: > > - MachNode* m = live_nodes.pop()->isa_Mach(); > + Node* n = live_nodes.pop(); > + if (!n->is_Mach()) > + continue; > + MachNode * m = n->as_Mach(); > if (m != NULL) { > > > Node::isa_Mach() + != NULL check is a well-established convention in C2 > code base to perform type checks on nodes (is_Mach()+as_Mach()). I'd > prefer to keep the original version. 
> > Best regards, > Vladimir Ivanov > > [1] class LShiftCntVNode : public VectorNode { > virtual uint ideal_reg() const { return > Matcher::vector_shift_count_ideal_reg(vect_type()->length_in_bytes()); } > }; > > [2] > http://hg.openjdk.java.net/jdk/jdk/file/tip/src/hotspot/share/opto/node.hpp > #l716 > > > Generic operand processing has been skipped for non-machine nodes (e.g. > Phi) since they are skipped by the matcher. > > Non-definition operand resolution of a machine node will be able to pull the > type information from non-machine node. > > > > Re-organized the code by adding a target specific routine which can be used > to return the operand types for special ideal nodes e.g. RShiftCntV/LShiftCntV. > > For such nodes, definition machine operand vary for different targets and > vector lengths, a type based generic operand resolution > > is not possible in such cases. > > > > Best Regards, > > Jatin > > > > From fairoz.matte at oracle.com Mon Jan 13 06:00:34 2020 From: fairoz.matte at oracle.com (Fairoz Matte) Date: Sun, 12 Jan 2020 22:00:34 -0800 (PST) Subject: RFR: 8236701: [TESTBUG] compiler/loopopts/superword/Vec_MulAddS2I.java uses wrong flag -XX:-SuperWord Message-ID: <7e5c671e-6fef-46c1-9429-51cafe243c65@default> Hi Please review the small change to fix the test bug. Changes done to replace SuperWord to UseSuperWord JBS - https://bugs.openjdk.java.net/browse/JDK-8236701 Webrev - http://cr.openjdk.java.net/~fmatte/8236701/webrev.00/ Thanks, Fairoz From tobias.hartmann at oracle.com Mon Jan 13 06:21:18 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Mon, 13 Jan 2020 07:21:18 +0100 Subject: [14]: RFR(XXS): 8236944:The legVecZ operand should be limited to zmm0-zmm15 registers In-Reply-To: References: <951f79b5-3418-0803-b1ec-627270cac012@oracle.com> Message-ID: <6fbb3509-c51f-7c52-832a-3af4c992b149@oracle.com> Hi Sandhya, looks good to me too. 
Please push to http://hg.openjdk.java.net/jdk/jdk14 Thanks, Tobias On 11.01.20 01:25, Viswanathan, Sandhya wrote: > Thanks a lot Vladimir! Can I push this onto JDK 14 branch? > > Best Regards, > Sandhya > > > -----Original Message----- > From: Vladimir Ivanov > Sent: Friday, January 10, 2020 3:39 PM > To: Viswanathan, Sandhya ; hotspot compiler > Subject: Re: [14]: RFR(XXS): 8236944:The legVecZ operand should be limited to zmm0-zmm15 registers > > FYI test results are clean (hs-precheckin-comp,hs-tier1,hs-tier2). > > Best regards, > Vladimir Ivanov > > On 11.01.2020 00:50, Vladimir Ivanov wrote: >> Looks good. >> >> Best regards, >> Vladimir Ivanov >> >> On 10.01.2020 22:35, Viswanathan, Sandhya wrote: >>> JBS: https://bugs.openjdk.java.net/browse/JDK-8236944 >>> >>> Webrev: http://cr.openjdk.java.net/~sviswanathan/8236944/webrev.00/ >>> >>> Currently legVecZ operand is defined as follows: >>> >>> operand legVecZ() %{ >>> >>> constraint(ALLOC_IN_RC(vectorz_reg_vl)); >>> >>> match(VecZ); >>> >>> format %{ %} >>> >>> interface(REG_INTER); >>> >>> %} >>> >>> This allows legVecZ to be zmm0-zmm31 on platforms supporting AVX 512 vl. >>> >>> The legVecZ should instead be defined as: >>> >>> operand legVecZ() %{ >>> >>> constraint(ALLOC_IN_RC(vectorz_reg_legacy)); >>> >>> match(VecZ); >>> >>> format %{ %} >>> >>> interface(REG_INTER); >>> >>> %} >>> >>> Please consider this for JDK 14. >>> >>> Best Regards, >>> >>> Sandhya >>> From ioi.lam at oracle.com Mon Jan 13 07:21:49 2020 From: ioi.lam at oracle.com (Ioi Lam) Date: Sun, 12 Jan 2020 23:21:49 -0800 Subject: RFR: 8236701: [TESTBUG] compiler/loopopts/superword/Vec_MulAddS2I.java uses wrong flag -XX:-SuperWord In-Reply-To: <7e5c671e-6fef-46c1-9429-51cafe243c65@default> References: <7e5c671e-6fef-46c1-9429-51cafe243c65@default> Message-ID: <1205f0b4-2353-ac86-a519-5fc7ace2a4e5@oracle.com> Hi Fairoz, Should the -XX:+IgnoreUnrecognizedVMOptions be removed? 
Thanks - Ioi On 1/12/20 10:00 PM, Fairoz Matte wrote: > Hi > > Please review the small change to fix the test bug. > Changes done to replace SuperWord to UseSuperWord > > JBS - https://bugs.openjdk.java.net/browse/JDK-8236701 > Webrev - http://cr.openjdk.java.net/~fmatte/8236701/webrev.00/ > > Thanks, > Fairoz From fairoz.matte at oracle.com Mon Jan 13 08:29:00 2020 From: fairoz.matte at oracle.com (Fairoz Matte) Date: Mon, 13 Jan 2020 00:29:00 -0800 (PST) Subject: RFR: 8236701: [TESTBUG] compiler/loopopts/superword/Vec_MulAddS2I.java uses wrong flag -XX:-SuperWord In-Reply-To: <1205f0b4-2353-ac86-a519-5fc7ace2a4e5@oracle.com> References: <7e5c671e-6fef-46c1-9429-51cafe243c65@default> <1205f0b4-2353-ac86-a519-5fc7ace2a4e5@oracle.com> Message-ID: <68c8cbca-4f0e-41cd-bc07-d8f0268e0205@default> Hi Ioi, I verified on my Ubuntu Linux machine: the test case also passes without -XX:+IgnoreUnrecognizedVMOptions. I was not sure why that option was added. Let me know if it is OK to remove, and I will send the next webrev. Thanks, Fairoz > -----Original Message----- > From: Ioi Lam > Sent: Monday, January 13, 2020 12:52 PM > To: hotspot-compiler-dev at openjdk.java.net > Subject: Re: RFR: 8236701: [TESTBUG] > compiler/loopopts/superword/Vec_MulAddS2I.java uses wrong flag -XX:- > SuperWord > > Hi Fairoz, > > Should the -XX:+IgnoreUnrecognizedVMOptions be removed? > > Thanks > - Ioi > > On 1/12/20 10:00 PM, Fairoz Matte wrote: > > Hi > > > > Please review the small change to fix the test bug. 
> > Changes done to replace SuperWord to UseSuperWord > > > > JBS - https://bugs.openjdk.java.net/browse/JDK-8236701 > > Webrev - http://cr.openjdk.java.net/~fmatte/8236701/webrev.00/ > > > > Thanks, > > Fairoz > From rwestrel at redhat.com Mon Jan 13 09:58:50 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Mon, 13 Jan 2020 10:58:50 +0100 Subject: RFR(S): 8236759: ShouldNotReachHere in PhaseIdealLoop::verify_strip_mined_scheduling Message-ID: <87zher1xut.fsf@redhat.com> http://cr.openjdk.java.net/~roland/8236759/webrev.00/ Splitting a load thru phi, if the phi is at a strip mined loop head, causes a load to be pinned at the outer strip mined loop head. That logic should pin the load above the outer loop instead. Roland. From rwestrel at redhat.com Mon Jan 13 10:20:30 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Mon, 13 Jan 2020 11:20:30 +0100 Subject: RFR(S): 8237007: Shenandoah: assert(_base == Tuple) failure during C2 compilation Message-ID: <87wo9v1wup.fsf@redhat.com> http://cr.openjdk.java.net/~roland/8237007/webrev.00/ The assert fails because C2 expands a barrier between a membar and its control projection. That happens because the gc barrier is assigned the membar as control by loop optimizations (and not its control projection). The gc barrier has a load as input which itself depends on the memory state produced by the membar so the barrier has to be below the membar. The gc barrier has a CMove as use. That CMove has a control input that's above the membar and because it's considered pinned by loop opts, it's assigned earliest possible control: the membar. Considering the CMove as pinned is overly conservative AFAICT. My fix consists in excluding CMove nodes from being pinned. Roland. 
From rwestrel at redhat.com Mon Jan 13 10:54:26 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Mon, 13 Jan 2020 11:54:26 +0100 Subject: RFR(S): 8235584: UseProfiledLoopPredicate fails with assert(_phase->get_loop(c) == loop) failed: have to be in the same loop Message-ID: <87r2031va5.fsf@redhat.com> http://cr.openjdk.java.net/~roland/8235584/webrev.00/ This is related to 8235452. As explained by Tobias: https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2019-December/036344.html a strip mined loop is transformed so the CountedLoopEnd and the outer strip mined loop are optimized out. There's still a loop, but it's no longer strip mined and not even a valid counted loop anymore; it stays a CountedLoopNode with the strip mined flag set to true, though. The loop predication code fails with an assert because when it encounters the CountedLoop, the call to skip_strip_mined() sees it's flagged as strip mined and so tries to skip the outer strip mined loop; but because the CountedLoop is no longer strip mined, it skips over control nodes that are not the outer loop. The fix is to make skip_strip_mined() check both that the loop is flagged as strip mined and that it is a valid counted loop. Also, the fix that Tobias pushed for 8235452 erroneously disabled verification for the outer strip mined loop. This change restores it. Roland. From rwestrel at redhat.com Mon Jan 13 11:05:56 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Mon, 13 Jan 2020 11:05:56 +0100 Subject: RFR(S): 8231291: C2: loop opts before EA should maximally unroll loops In-Reply-To: <9f364210-53e4-0b4d-df3e-a02aa5219cb0@oracle.com> References: <87o8w8ptsq.fsf@redhat.com> <8736dfkko7.fsf@redhat.com> <7bc32dc2-1612-cb77-a467-ac67987b148c@oracle.com> <87woa35ijf.fsf@redhat.com> <9f364210-53e4-0b4d-df3e-a02aa5219cb0@oracle.com> Message-ID: <87o8v71uqz.fsf@redhat.com> > Do you know why RootNode::Ideal() isn't called during IGVN pass after > parsing is over? 
The root node would need to be enqueued in the igvn worklist during parsing. The code pattern when a HaltNode is added to the graph is to use add_req(), which doesn't cause the root node to be put on the igvn worklist. Roland. From patric.hedlin at oracle.com Mon Jan 13 11:11:56 2020 From: patric.hedlin at oracle.com (Patric Hedlin) Date: Mon, 13 Jan 2020 12:11:56 +0100 Subject: RFR(XS): 8235870: C2 crashes in IdealLoopTree::est_loop_flow_merge_sz() Message-ID: <1f4bd3ac-8a0f-3d77-677b-1bdbdbaa49a2@oracle.com> Dear all, I would like to ask for help to review the following change/update: Issue: https://bugs.openjdk.java.net/browse/JDK-8235870 Webrev: http://cr.openjdk.java.net/~phedlin/tr8235870/ 8235870: C2 crashes in IdealLoopTree::est_loop_flow_merge_sz() Method missing necessary checks, made more robust. Testing: hs-tier1-3, LuceneBench24H Best regards, Patric From tobias.hartmann at oracle.com Mon Jan 13 11:27:46 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Mon, 13 Jan 2020 12:27:46 +0100 Subject: RFR(S): 8235762: JVM crash in SWPointer during C2 compilation In-Reply-To: References: <4ef39906-839b-9a08-3225-f99b974f08de@oracle.com> <4103e17e-57c4-ed78-bf64-1bb3cb99e9f4@oracle.com> Message-ID: <8797205a-c4ff-253d-404b-62a178c6e169@oracle.com> Hi Felix, looks reasonable to me. Thanks for updating. Another review would be good. Best regards, Tobias On 10.01.20 02:13, Yangfei (Felix) wrote: > Hi Tobias, > > Updated webrev: http://cr.openjdk.java.net/~fyang/8235762/webrev.01/ > This also updated copyright years of the newly-added test case. > Still tests OK with the latest repo. Is this one better? > > Thanks, > Felix > >> >> Hi Felix, >> >> On 16.12.19 11:14, Yangfei (Felix) wrote: >>> -- OK. I will add one assertion immediately before breaking the loop. >> >> Thanks. >> >>>> - Why do you need max_idx? 
Isn't it the case that if >>>> find_align_to_ref returns NULL, there aren't any comparable memory >>>> operations left and it essentially doesn't matter which one you choose? >>> >>> -- I was considering minimizing the performance impact of the patch here. >>> best_align_to_mem_ref is used in SuperWord::align_initial_loop_index for >> adjusting the pre-loop limit. >>> For the test case in the webrev, best_align_to_mem_ref was chosen from >> node 470 (StoreB) and node 431 (StoreL). >>> The vector widths for these two memory operations are different on >> the aarch64 platform: vw = 16 bytes for node 431 and 2 bytes for node 470. >>> SuperWord::align_initial_loop_index will emit different code sequences for >> the test case. >>> The max_idx tells us which memory operation has the biggest vector size. >> >> Okay, thanks for the details. Could you add a comment that explains this? >> >>>> Also, is it guaranteed that max_idx is always initialized in >> SuperWord::find_align_to_ref? >>> >>> -- Yes, I think so. >>> When memops is not empty and the memory ops in memops are not >> comparable, find_align_to_ref will always set its max_idx. >> >> Okay. >> >> Thanks, >> Tobias From tobias.hartmann at oracle.com Mon Jan 13 11:29:50 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Mon, 13 Jan 2020 12:29:50 +0100 Subject: RFR: 8236701: [TESTBUG] compiler/loopopts/superword/Vec_MulAddS2I.java uses wrong flag -XX:-SuperWord In-Reply-To: <68c8cbca-4f0e-41cd-bc07-d8f0268e0205@default> References: <7e5c671e-6fef-46c1-9429-51cafe243c65@default> <1205f0b4-2353-ac86-a519-5fc7ace2a4e5@oracle.com> <68c8cbca-4f0e-41cd-bc07-d8f0268e0205@default> Message-ID: <7ad3ddcb-b2d2-d815-a2f6-192612da9abc@oracle.com> Hi Fairoz, On 13.01.20 09:29, Fairoz Matte wrote: > I was not sure why that option was added. I think it's just because the author incorrectly assumed that some of the flags are only available in debug builds (which is not the case).
> Let me know, if that is ok to be removed, I will send the next webrev. Yes, please do so. The change looks good to me. Thanks, Tobias From tobias.hartmann at oracle.com Mon Jan 13 11:30:53 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Mon, 13 Jan 2020 12:30:53 +0100 Subject: RFR: 8236701: [TESTBUG] compiler/loopopts/superword/Vec_MulAddS2I.java uses wrong flag -XX:-SuperWord In-Reply-To: <7ad3ddcb-b2d2-d815-a2f6-192612da9abc@oracle.com> References: <7e5c671e-6fef-46c1-9429-51cafe243c65@default> <1205f0b4-2353-ac86-a519-5fc7ace2a4e5@oracle.com> <68c8cbca-4f0e-41cd-bc07-d8f0268e0205@default> <7ad3ddcb-b2d2-d815-a2f6-192612da9abc@oracle.com> Message-ID: <8f0d6d6c-927e-6b9e-6213-3274a099d288@oracle.com> Please also update the copyright. Best regards, Tobias On 13.01.20 12:29, Tobias Hartmann wrote: > Hi Fairoz, > > On 13.01.20 09:29, Fairoz Matte wrote: >> I was not sure why that option was added. > > I think it's just because the author incorrectly assumed that some of the flags are only available > in debug builds (which is not the case). > >> Let me know, if that is ok to be removed, I will send the next webrev. > > Yes, please do so. The change looks good to me. > > Thanks, > Tobias > From tobias.hartmann at oracle.com Mon Jan 13 11:32:47 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Mon, 13 Jan 2020 12:32:47 +0100 Subject: RFR(XS): 8235870: C2 crashes in IdealLoopTree::est_loop_flow_merge_sz() In-Reply-To: <1f4bd3ac-8a0f-3d77-677b-1bdbdbaa49a2@oracle.com> References: <1f4bd3ac-8a0f-3d77-677b-1bdbdbaa49a2@oracle.com> Message-ID: <3581fc6c-ca0d-597d-c702-f52a4a599172@oracle.com> Hi Patric, looks good and trivial to me. Best regards, Tobias On 13.01.20 12:11, Patric Hedlin wrote: > Dear all, > > I would like to ask for help to review the following change/update: > > Issue: 
https://bugs.openjdk.java.net/browse/JDK-8235870 > Webrev: http://cr.openjdk.java.net/~phedlin/tr8235870/ > > 8235870: C2 crashes in IdealLoopTree::est_loop_flow_merge_sz() > > Method missing necessary checks, made more robust. > > Testing: hs-tier1-3, LuceneBench24H > > > Best regards, > Patric From tobias.hartmann at oracle.com Mon Jan 13 11:46:32 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Mon, 13 Jan 2020 12:46:32 +0100 Subject: RFR(S): 8235584: UseProfiledLoopPredicate fails with assert(_phase->get_loop(c) == loop) failed: have to be in the same loop In-Reply-To: <87r2031va5.fsf@redhat.com> References: <87r2031va5.fsf@redhat.com> Message-ID: <1a4c0874-946b-aea3-fe83-ac5e2b9f1862@oracle.com> Hi Roland, On 13.01.20 11:54, Roland Westrelin wrote: > http://cr.openjdk.java.net/~roland/8235584/webrev.00/ Looks good to me. > Also, the fix that Tobias pushed for 8235452 erroneously disabled > verification for the outer strip mined loop. This change restores it. Thanks for fixing. We should get this into JDK 14. Best regards, Tobias From tobias.hartmann at oracle.com Mon Jan 13 12:10:44 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Mon, 13 Jan 2020 13:10:44 +0100 Subject: RFR(S): 8236759: ShouldNotReachHere in PhaseIdealLoop::verify_strip_mined_scheduling In-Reply-To: <87zher1xut.fsf@redhat.com> References: <87zher1xut.fsf@redhat.com> Message-ID: Hi Roland, On 13.01.20 10:58, Roland Westrelin wrote: > Splitting a load thru phi, if the phi is at a strip mined loop head, > causes a load to be pinned at the outer strip mined loop head. That > logic should pin the load above the outer loop instead. Just wondering why you need the checks in line 1487? Isn't in->is_OuterStripMinedLoop() sufficient since we should never schedule a data node in the outer strip mined loop? 
Best regards, Tobias From vladimir.x.ivanov at oracle.com Mon Jan 13 12:17:41 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Mon, 13 Jan 2020 15:17:41 +0300 Subject: [14] RFR(S): 8236443 : Issues with specializing vector register type for phi operand with generic operands In-Reply-To: References: Message-ID: <6e7ecab1-e174-2c3d-50c3-92de6e1cfa67@oracle.com> > http://cr.openjdk.java.net/~jbhateja/8236443/webrev.04/ Looks good. Test results are clean (hs-precheckin-comp,hs-tier1,hs-tier2). Best regards, Vladimir Ivanov > > Regards, > Jatin > >> -----Original Message----- >> From: Vladimir Ivanov >> Sent: Friday, January 3, 2020 4:46 AM >> To: Bhateja, Jatin ; hotspot-compiler- >> dev at openjdk.java.net >> Subject: Re: [14] RFR(S): 8236443 : Issues with specializing vector register type >> for phi operand with generic operands >> >> Hi Jatin, >> >>> Webrev: http://cr.openjdk.java.net/~jbhateja/8236443/webrev.02/ >> >> I like that you put the check on use side (ShiftV + ideal_in_idx). >> >> Though ShiftCntV def is the only viable option in practice (phis shouldn't occur >> on x86 and would trigger a bug due to mismatch between node type and >> ideal_reg), it's nice to see such case being ruled out. >> >> Some comments: >> >> ================================================================ >> ======= >> src/hotspot/cpu/x86/x86.ad: >> >> +const uint Matcher::special_vector_ideal_reg(int ideal_reg, int opcode, >> int ideal_in_idx, int size) { >> >> I don't see a compelling reason why it should be a platform-specific >> check. ShiftCntV nodes override Node::ideal_reg() and putting the check >> in shared code looks appropriate. >> >> Considering how LShiftCntVNode::ideal_reg is defined [1], IMO calling >> Matcher::vector_shift_count_ideal_reg(t->length_in_bytes()) is better >> than hardcoding VecS ideal register (which is x86-specific). 
>> >> >> ================================================================ >> ======= >> + case Op_RShiftCntV: >> + case Op_LShiftCntV: >> + // Ideal index corresponding to def operand. >> + if (ideal_in_idx == -1) >> + return Op_VecS; >> + break; >> >> Shouldn't ideal_in_idx be 0 instead? >> >> >> Some stylistic notes: >> >> ================================================================ >> ======= >> src/hotspot/cpu/x86/x86.ad: >> >> +MachOper* Matcher::specialize_vector_operand_helper(Node* m, >> MachNode* >> n, int op_idx) { >> >> It's a bit confusing to see Node* named "m" and MachNode* named "n". In >> the code base it's usually the opposite: n stands for Node and m stands >> for MachNode. >> >> But maybe "def" and "use" are better names? >> >> >> ================================================================ >> ======= >> >> + switch(opcode) { >> + default: >> + return ideal_reg; >> + case Op_LShiftVB: >> ... >> + case Op_URShiftVL: >> + if (ideal_in_idx == 2) >> + return Op_VecS; >> + break; >> + case Op_RShiftCntV: >> + case Op_LShiftCntV: >> + // Ideal index corresponding to def operand. >> + if (ideal_in_idx == -1) >> + return Op_VecS; >> + break; >> + } >> >> Does it make sense to introduce ShiftV and ShiftCntV node classes [2] >> and use Node::is_ShiftV()/is_ShiftCntV() checks instead? >> >> ================================================================ >> ======= >> >> + switch(opcode) { >> + default: >> + return ideal_reg; >> ... >> + } >> + return ideal_reg; >> >> return statement is redundant. >> >> >> ================================================================ >> ======= >> >> src/hotspot/share/opto/matcher.cpp: >> >> - MachNode* m = live_nodes.pop()->isa_Mach(); >> + Node* n = live_nodes.pop(); >> + if (!n->is_Mach()) >> + continue; >> + MachNode * m = n->as_Mach(); >> if (m != NULL) { >> >> >> Node::isa_Mach() + != NULL check is a well-established convention in C2 >> code base to perform type checks on nodes (is_Mach()+as_Mach()). 
I'd >> prefer to keep the original version. >> >> Best regards, >> Vladimir Ivanov >> >> [1] class LShiftCntVNode : public VectorNode { >> virtual uint ideal_reg() const { return >> Matcher::vector_shift_count_ideal_reg(vect_type()->length_in_bytes()); } >> }; >> >> [2] >> http://hg.openjdk.java.net/jdk/jdk/file/tip/src/hotspot/share/opto/node.hpp >> #l716 >> >>> Generic operand processing has been skipped for non-machine nodes (e.g. >> Phi) since they are skipped by the matcher. >>> Non-definition operand resolution of a machine node will be able to pull the >> type information from non-machine node. >>> >>> Re-organized the code by adding a target specific routine which can be used >> to return the operand types for special ideal nodes e.g. RShiftCntV/LShiftCntV. >>> For such nodes, definition machine operand vary for different targets and >> vector lengths, a type based generic operand resolution >>> is not possible in such cases. >>> >>> Best Regards, >>> Jatin >>> >>> From vladimir.x.ivanov at oracle.com Mon Jan 13 12:23:12 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Mon, 13 Jan 2020 15:23:12 +0300 Subject: RFR(XS): 8229855: C2 fails with assert(false) failed: bad AD file In-Reply-To: <790571f6-2a0e-613f-ebc8-0317f4a8f50c@oracle.com> References: <790571f6-2a0e-613f-ebc8-0317f4a8f50c@oracle.com> Message-ID: <7c3f6220-dd80-78f2-9c92-2003eafe2ea9@oracle.com> > http://cr.openjdk.java.net/~thartmann/8229855/webrev.00/ - IfNode *iff_le = jump_if_fork_int(key_val, test_val, BoolTest::le, if_prob(cnt, total_cnt), if_cnt(cnt)); + IfNode *iff_le = jump_if_fork_int(key_val, test_val, BoolTest::lt, if_prob(cnt, total_cnt), if_cnt(cnt)); Would be nice to rename iff_le to iff_lt as well. Otherwise, looks good. 
Best regards, Vladimir Ivanov > For the switch statement in TestJumpTable.java, C2 creates the following binary search tree with a > jump table at (*): > > if (key_val > 45) { > if (key_val != 60) { > if (key_val <= 60) { > (*) > } else { > [...] > } > } else { > [...] > } > } else { > [...] > } > > The jump table at (*) handles the following key_val ranges: {46..50}, {51}, {52..59}. > Parse::jump_switch_ranges -> Parse::create_jump_tables adds a CastII to narrow the type of key_val > to type int:46..59 (type int:0..13 after normalizing to zero): > http://hg.openjdk.java.net/jdk/jdk/file/44cb1f517839/src/hotspot/share/opto/parse2.cpp#l884 > > Now after some rounds of loop unrolling, key_val is known to be int:60..260 (normalized to > int:14..214) in one of the loop bodies. As a result, the narrow CastII node is replaced by TOP and > we crash in the matcher because the JumpNode user didn't go away. > > The problem is that the key_val != 60 && key_val <= 60 combination is *not* found to be always false > if key_val becomes known as int:60..260 because C2's type system does not narrow the type after the > first check. The fix is to simply strengthen the second check to ensure that both control and data > paths die consistently. 
> > I would like to push this simple fix into JDK 14 and follow-up in JDK 15 with an enhancement that > adds a general optimization for such integer checks: > https://bugs.openjdk.java.net/browse/JDK-8236721 > > I've quickly hacked a prototype that shows promising results (and would also fix this issue): > http://cr.openjdk.java.net/~thartmann/8236721/webrev.00/ > > Thanks, > Tobias > From rwestrel at redhat.com Mon Jan 13 12:28:54 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Mon, 13 Jan 2020 13:28:54 +0100 Subject: RFR(S): 8235584: UseProfiledLoopPredicate fails with assert(_phase->get_loop(c) == loop) failed: have to be in the same loop In-Reply-To: <1a4c0874-946b-aea3-fe83-ac5e2b9f1862@oracle.com> References: <87r2031va5.fsf@redhat.com> <1a4c0874-946b-aea3-fe83-ac5e2b9f1862@oracle.com> Message-ID: <87imlf1qwp.fsf@redhat.com> Thanks for the review, Tobias. Roland. From rwestrel at redhat.com Mon Jan 13 12:30:45 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Mon, 13 Jan 2020 13:30:45 +0100 Subject: RFR(S): 8236759: ShouldNotReachHere in PhaseIdealLoop::verify_strip_mined_scheduling In-Reply-To: References: <87zher1xut.fsf@redhat.com> Message-ID: <87ftgj1qtm.fsf@redhat.com> Thanks for reviewing this, Tobias. > Just wondering why you need the checks in line 1487? Isn't in->is_OuterStripMinedLoop() sufficient > since we should never schedule a data node in the outer strip mined loop? It could be top too. Also I guess, I'm becoming paranoid about little things going wrong with strip mining... Roland. From patric.hedlin at oracle.com Mon Jan 13 12:31:24 2020 From: patric.hedlin at oracle.com (Patric Hedlin) Date: Mon, 13 Jan 2020 13:31:24 +0100 Subject: RFR(XS): 8235870: C2 crashes in IdealLoopTree::est_loop_flow_merge_sz() In-Reply-To: <3581fc6c-ca0d-597d-c702-f52a4a599172@oracle.com> References: <1f4bd3ac-8a0f-3d77-677b-1bdbdbaa49a2@oracle.com> <3581fc6c-ca0d-597d-c702-f52a4a599172@oracle.com> Message-ID: Thanks for reviewing Tobias. 
/Patric On 13/01/2020 12:32, Tobias Hartmann wrote: > Hi Patric, > > looks good and trivial to me. > > Best regards, > Tobias > > On 13.01.20 12:11, Patric Hedlin wrote: >> Dear all, >> >> I would like to ask for help to review the following change/update: >> >> Issue: https://bugs.openjdk.java.net/browse/JDK-8235870 >> Webrev: http://cr.openjdk.java.net/~phedlin/tr8235870/ >> >> 8235870: C2 crashes in IdealLoopTree::est_loop_flow_merge_sz() >> >> Method missing necessary checks, made more robust. >> >> Testing: hs-tier1-3, LuceneBench24H >> >> >> Best regards, >> Patric From tobias.hartmann at oracle.com Mon Jan 13 12:33:24 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Mon, 13 Jan 2020 13:33:24 +0100 Subject: RFR(XS): 8229855: C2 fails with assert(false) failed: bad AD file In-Reply-To: <7c3f6220-dd80-78f2-9c92-2003eafe2ea9@oracle.com> References: <790571f6-2a0e-613f-ebc8-0317f4a8f50c@oracle.com> <7c3f6220-dd80-78f2-9c92-2003eafe2ea9@oracle.com> Message-ID: <721d5f78-a08c-4978-ae65-97c454cc4f85@oracle.com> Hi Vladimir, On 13.01.20 13:23, Vladimir Ivanov wrote: > Would be nice to rename iff_le to iff_lt as well. Good catch, I'll adjust that before pushing. > Otherwise, looks good. Thanks for the review! Best regards, Tobias From nils.eliasson at oracle.com Mon Jan 13 12:36:22 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Mon, 13 Jan 2020 13:36:22 +0100 Subject: RFR(XS): 8235870: C2 crashes in IdealLoopTree::est_loop_flow_merge_sz() In-Reply-To: <1f4bd3ac-8a0f-3d77-677b-1bdbdbaa49a2@oracle.com> References: <1f4bd3ac-8a0f-3d77-677b-1bdbdbaa49a2@oracle.com> Message-ID: <2edeb499-a777-6e92-9b7b-3977de94d68e@oracle.com> Hi Patric, Looks good! Regards, Nils Eliasson On 2020-01-13 12:11, Patric Hedlin wrote: > Dear all, > > I would like to ask for help to review the following change/update: > > Issue:
https://bugs.openjdk.java.net/browse/JDK-8235870 > Webrev: http://cr.openjdk.java.net/~phedlin/tr8235870/ > > 8235870: C2 crashes in IdealLoopTree::est_loop_flow_merge_sz() > > Method missing necessary checks, made more robust. > > Testing: hs-tier1-3, LuceneBench24H > > > Best regards, > Patric > From vladimir.x.ivanov at oracle.com Mon Jan 13 14:00:47 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Mon, 13 Jan 2020 17:00:47 +0300 Subject: RFR(S): 8231291: C2: loop opts before EA should maximally unroll loops In-Reply-To: <87o8v71uqz.fsf@redhat.com> References: <87o8w8ptsq.fsf@redhat.com> <8736dfkko7.fsf@redhat.com> <7bc32dc2-1612-cb77-a467-ac67987b148c@oracle.com> <87woa35ijf.fsf@redhat.com> <9f364210-53e4-0b4d-df3e-a02aa5219cb0@oracle.com> <87o8v71uqz.fsf@redhat.com> Message-ID: <467329e0-10e7-3dfa-ab4c-9f011630a756@oracle.com> >> Do you know why RootNode::Ideal() isn't called during IGVN pass after >> parsing is over? > > The root node would need to be enqueued in the igvn worklist during > parsing. The code pattern when a HaltNode is added to the graph is to > use add_req() which doesn't cause the root node to be put on the igvn > worklist. I think the right way to fix the bug is to transform HaltNode after it is added to RootNode (e.g., as done in AllocateArrayNode::Ideal() [1]). In case HaltNode is eliminated, RootNode should be put on the worklist. I'm fine with leaving your change in GraphKit::must_be_not_null() as is. With the shortcut it becomes easier to read without double-checking how Opaque4 behaves. I quickly looked through other places with the same idiom and MemNode::Ideal_common() looks suspicious [3]. I don't see how it can suffer from the same problem, but it would be nice to fix it as well. I'm fine with filing a separate bug for it.
Best regards, Vladimir Ivanov [1] http://hg.openjdk.java.net/jdk/jdk/file/tip/src/hotspot/share/opto/callnode.cpp#l1445 [2] ifnode.cpp:1234: igvn->C->root()->add_req(halt); loopTransform.cpp:1271: C->root()->add_req(halt); loopTransform.cpp:2434: C->root()->add_req(halt); callnode.cpp:1447: phase->C->root()->add_req(halt); loopnode.cpp:3676: C->root()->add_req(halt); memnode.cpp:343: phase->C->root()->add_req(halt); graphKit.cpp:1417: C->root()->add_req(halt); [3] http://hg.openjdk.java.net/jdk/jdk/file/tip/src/hotspot/share/opto/memnode.cpp#l342 From rwestrel at redhat.com Mon Jan 13 15:25:28 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Mon, 13 Jan 2020 16:25:28 +0100 Subject: RFR(S): 8231291: C2: loop opts before EA should maximally unroll loops In-Reply-To: <467329e0-10e7-3dfa-ab4c-9f011630a756@oracle.com> References: <87o8w8ptsq.fsf@redhat.com> <8736dfkko7.fsf@redhat.com> <7bc32dc2-1612-cb77-a467-ac67987b148c@oracle.com> <87woa35ijf.fsf@redhat.com> <9f364210-53e4-0b4d-df3e-a02aa5219cb0@oracle.com> <87o8v71uqz.fsf@redhat.com> <467329e0-10e7-3dfa-ab4c-9f011630a756@oracle.com> Message-ID: <87blr71iqf.fsf@redhat.com> > I think the right way to fix the bug is to transform HaltNode after it > is added to RootNode (e.g., as done in AllocateArrayNode::Ideal() [1]). > In case HaltNode is eliminated, RootNode should be put on worklist. AFAICT, that would not work. PhaseGVN::transform() doesn't change uses of the node being transformed. The RootNode would need to be explicitly pushed on the igvn worklist with a call to Compile::record_for_igvn(). But with the change to must_be_not_null(), there's no need for that. So I don't see anything that needs fixing, to be honest.
For the Halt node to be top, ctl would need to be top (but that's handled before), or frame would need to be top, which doesn't seem possible, so I don't see a problem there. Roland. From fairoz.matte at oracle.com Mon Jan 13 15:53:08 2020 From: fairoz.matte at oracle.com (Fairoz Matte) Date: Mon, 13 Jan 2020 07:53:08 -0800 (PST) Subject: RFR: 8236701: [TESTBUG] compiler/loopopts/superword/Vec_MulAddS2I.java uses wrong flag -XX:-SuperWord In-Reply-To: <8f0d6d6c-927e-6b9e-6213-3274a099d288@oracle.com> References: <7e5c671e-6fef-46c1-9429-51cafe243c65@default> <1205f0b4-2353-ac86-a519-5fc7ace2a4e5@oracle.com> <68c8cbca-4f0e-41cd-bc07-d8f0268e0205@default> <7ad3ddcb-b2d2-d815-a2f6-192612da9abc@oracle.com> <8f0d6d6c-927e-6b9e-6213-3274a099d288@oracle.com> Message-ID: <015d7aac-a487-41bb-a0c4-d51f58b9f85d@default> Hi Tobias, Thanks for the review. It does look like we still need the "-XX:+IgnoreUnrecognizedVMOptions" flag. The summary of the test case shows that Superword support exists only for limited platforms: * @summary Add C2 x86 Superword support for VNNI VPDPWSSD Instruction * @requires os.arch=="x86" | os.arch=="i386" | os.arch=="amd64" | os.arch=="x86_64" I have updated the copyright. Thanks, Fairoz -----Original Message----- From: Tobias Hartmann Sent: Monday, January 13, 2020 5:01 PM To: Fairoz Matte ; Ioi Lam ; hotspot-compiler-dev at openjdk.java.net Subject: Re: RFR: 8236701: [TESTBUG] compiler/loopopts/superword/Vec_MulAddS2I.java uses wrong flag -XX:-SuperWord Please also update the copyright. Best regards, Tobias On 13.01.20 12:29, Tobias Hartmann wrote: > Hi Fairoz, > > On 13.01.20 09:29, Fairoz Matte wrote: >> I was not sure why that option was added. > > I think it's just because the author incorrectly assumed that some of > the flags are only available in debug builds (which is not the case). > >> Let me know if that is OK to be removed, and I will send the next webrev. > > Yes, please do so. The change looks good to me.
> > Thanks, > Tobias > From sandhya.viswanathan at intel.com Mon Jan 13 16:29:45 2020 From: sandhya.viswanathan at intel.com (Viswanathan, Sandhya) Date: Mon, 13 Jan 2020 16:29:45 +0000 Subject: [14]: RFR(XXS): 8236944:The legVecZ operand should be limited to zmm0-zmm15 registers In-Reply-To: <6fbb3509-c51f-7c52-832a-3af4c992b149@oracle.com> References: <951f79b5-3418-0803-b1ec-627270cac012@oracle.com> <6fbb3509-c51f-7c52-832a-3af4c992b149@oracle.com> Message-ID: Pushed to http://hg.openjdk.java.net/jdk/jdk14. Thanks, Sandhya -----Original Message----- From: Tobias Hartmann Sent: Sunday, January 12, 2020 10:21 PM To: Viswanathan, Sandhya ; Vladimir Ivanov ; hotspot compiler Subject: Re: [14]: RFR(XXS): 8236944:The legVecZ operand should be limited to zmm0-zmm15 registers Hi Sandhya, looks good to me too. Please push to http://hg.openjdk.java.net/jdk/jdk14 Thanks, Tobias On 11.01.20 01:25, Viswanathan, Sandhya wrote: > Thanks a lot Vladimir! Can I push this onto JDK 14 branch? > > Best Regards, > Sandhya > > > -----Original Message----- > From: Vladimir Ivanov > Sent: Friday, January 10, 2020 3:39 PM > To: Viswanathan, Sandhya ; hotspot compiler > Subject: Re: [14]: RFR(XXS): 8236944:The legVecZ operand should be limited to zmm0-zmm15 registers > > FYI test results are clean (hs-precheckin-comp,hs-tier1,hs-tier2). > > Best regards, > Vladimir Ivanov > > On 11.01.2020 00:50, Vladimir Ivanov wrote: >> Looks good. >> >> Best regards, >> Vladimir Ivanov >> >> On 10.01.2020 22:35, Viswanathan, Sandhya wrote: >>> JBS: https://bugs.openjdk.java.net/browse/JDK-8236944 >>> >>> Webrev: http://cr.openjdk.java.net/~sviswanathan/8236944/webrev.00/ >>> >>> Currently legVecZ operand is defined as follows: >>> >>> operand legVecZ() %{ >>> >>> constraint(ALLOC_IN_RC(vectorz_reg_vl)); >>> >>> match(VecZ); >>> >>> format %{ %} >>> >>> interface(REG_INTER); >>> >>> %} >>> >>> This allows legVecZ to be zmm0-zmm31 on platforms supporting AVX 512 vl.
>>> >>> The legVecZ should instead be defined as: >>> >>> operand legVecZ() %{ >>> >>> constraint(ALLOC_IN_RC(vectorz_reg_legacy)); >>> >>> match(VecZ); >>> >>> format %{ %} >>> >>> interface(REG_INTER); >>> >>> %} >>> >>> Please consider this for JDK 14. >>> >>> Best Regards, >>> >>> Sandhya >>> From vladimir.x.ivanov at oracle.com Mon Jan 13 17:03:03 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Mon, 13 Jan 2020 20:03:03 +0300 Subject: RFR(S): 8231291: C2: loop opts before EA should maximally unroll loops In-Reply-To: <87blr71iqf.fsf@redhat.com> References: <87o8w8ptsq.fsf@redhat.com> <8736dfkko7.fsf@redhat.com> <7bc32dc2-1612-cb77-a467-ac67987b148c@oracle.com> <87woa35ijf.fsf@redhat.com> <9f364210-53e4-0b4d-df3e-a02aa5219cb0@oracle.com> <87o8v71uqz.fsf@redhat.com> <467329e0-10e7-3dfa-ab4c-9f011630a756@oracle.com> <87blr71iqf.fsf@redhat.com> Message-ID: <580b9af0-1523-0cde-9b29-e09278e2fa1c@oracle.com> >> I think the right way to fix the bug is to transform HaltNode after it >> is added to RootNode (e.g., as done in AllocateArrayNode::Ideal() [1]). >> In case HaltNode is eliminated, RootNode should be put on worklist. > > AFAICT, that would not work. PhaseGVN::transform() doesn't change uses > of the node being transformed. The RootNode would need to be explicitly > pushed on the igvn worklist with a call to > Compile::record_for_igvn(). But with the change to must_be_not_null(), > there's no need for that. So I don't see anything that needs fixing, to be > honest. Yes, you are right. PhaseGVN::transform() doesn't register existing users for IGVN and the only way to make it work is to explicitly register RootNode when a HaltNode goes dead. I understand that the change you have is enough to avoid the problematic case in GraphKit::must_be_not_null(). What I'd like to get rid of is the need for a correctness check to ensure that HaltNode can't go away. It's error-prone and can eventually get out of sync.
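The IGVN property at issue in this exchange (a node is only re-examined if something explicitly enqueues it) can be illustrated with a generic worklist sketch; this is plain Java with string labels, not HotSpot code:

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashSet;
import java.util.Set;

public class WorklistSketch {
    // Simplified stand-in for an IGVN pass: only nodes that were put
    // on the worklist are ever (re)visited.
    static Set<String> visit(Deque<String> worklist) {
        Set<String> visited = new HashSet<>();
        while (!worklist.isEmpty()) {
            visited.add(worklist.pop());
            // A real pass would simplify the node here and enqueue its
            // users whenever its value changed.
        }
        return visited;
    }

    public static void main(String[] args) {
        Deque<String> worklist = new ArrayDeque<>();
        worklist.push("HaltNode"); // the newly transformed node
        // "RootNode" gained a new input via add_req(), but nothing
        // enqueued it, so it is never re-examined: the situation the
        // thread proposes to avoid by registering it unconditionally.
        System.out.println(visit(worklist).contains("RootNode")); // prints false
    }
}
```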
If you think it isn't worth the effort, feel free to proceed with the current fix and I'll file a follow-up RFE. Also, I'd like the bug to be handled separately (from 8231291). It'll simplify possible backporting: as I see from the history, the problematic code (JDK-8176506) was introduced in 10 and it may show up in 11u eventually. >> I quickly looked through other places with the same idiom and >> MemNode::Ideal_common() looks suspicious [3]. I don't see how it can >> suffer from the same problem, but it would be nice to fix it as well. >> I'm fine with filing a separate bug for it. > > For the Halt node to be top, ctl would need to be top but that's handled > before or frame would need to be top which doesn't seem possible so I > don't see a problem there. My point is it would be nice to avoid the need for such reasoning (especially for correctness purposes) in the first place. Best regards, Vladimir Ivanov From hohensee at amazon.com Mon Jan 13 22:53:50 2020 From: hohensee at amazon.com (Hohensee, Paul) Date: Mon, 13 Jan 2020 22:53:50 +0000 Subject: RFR(XXS): 8236228: clean up BarrierSet headers in c1_LIRAssembler In-Reply-To: References: <1982726B-9702-4087-B962-354D2B9FAD92@amazon.com> <3C45D717-C0BF-40A4-93F5-EDA78FE91350@amazon.com> <3CC88DEC-56BE-4FDA-B775-7678F44E252A@amazon.com> Message-ID: <2CDFB80B-04EA-4C95-A67A-1C1284C80C66@amazon.com> Ping on behalf of Xin. Thanks, Paul On 12/30/19, 2:39 PM, "Liu, Xin" wrote: Hi, Reviewers, Thanks for reviewing it. Paul is a reviewer. Martin reviewed PPC and s390 and Ningsheng reviewed Arm&Aarch64. Is that good to push? Thanks, --lx On 12/30/19, 11:16 AM, "Hohensee, Paul" wrote: Lgtm. Seems very low risk to me too. Paul On 12/26/19, 7:06 PM, "hotspot-compiler-dev on behalf of Liu, Xin" wrote: Hi, Ningsheng, Thank you for submitting the trial repo. I got the result email and it has passed all 80 tests. Here is the updated webrev. I updated the reviewers.
https://cr.openjdk.java.net/~xliu/8236228/01/webrev/ This is a low-risk patch. As long as we can compile it, it won't have any side-effect at runtime. thanks, --lx On 12/24/19, 9:52 PM, "Ningsheng Jian" wrote: Hi lx, I've submitted it for you: http://hg.openjdk.java.net/jdk/submit/rev/9d61b00d5982 I've also verified aarch64 and arm build locally, and it looks fine. Thanks, Ningsheng > -----Original Message----- > From: hotspot-compiler-dev > On Behalf Of Liu, Xin > Sent: Wednesday, December 25, 2019 4:57 AM > To: Vladimir Kozlov ; 'hotspot-compiler- > dev at openjdk.java.net' > Subject: Re: RFR(XXS): 8236228: clean up BarrierSet headers in c1_LIRAssembler > > I update the subject and switch back the corporation email. Sorry, I didn't > realize that there's a code of conduct. I will pay more attention to it. > > I validate the patch without PCH using --disable-precompiled-headers on both > x86_64 and aarch64. Both of them built well. > I still have difficulty to verify SPARC. I try the submit repo but I don't have > permission to push an experimental branch. May I ask a sponsor to submit it on > behalf of me? > > Thanks, > --lx > > > > > Hi Lx > > First, when you post RFR, please, include bug id in email's Subject. > > You can use jdk/submit testing to verify builds on SPARC. We are still building on > it, with warning. > Changes seem fine to me but make sure to verify that it builds without PCH. > > Regards, > Vladimir > > On 12/20/19 9:24 AM, Liu Xin wrote: > > Martin, > > > > Thank you very much. May I know how to validate Sparc? I don't have > > any SPARC machine to access. > > > > Thanks, > > --lx > > > > > > On Fri, Dec 20, 2019, 7:05 AM Doerr, Martin > > wrote: > > > >> Hi lx, > >> > >> PPC and s390 parts are ok. > >> > >> Best regards, > >> Martin > >> > >> > >>> -----Original Message----- > >>> From: hotspot-compiler-dev >>> bounces at openjdk.java.net> On Behalf > >>> Of Liu, Xin > >>> Sent: Freitag, 20.
Dezember 2019 04:48 > >>> To: > >>> 'hotspot-compiler-dev at openjdk.java.net >>> penjdk.java.net>' >>> dev at openjdk.java.net> > >>> Subject: [DMARC FAILURE] RFR(XXS): clean up BarrierSet headers in > >>> c1_LIRAssembler > >>> > >>> Hi, Reviewers, > >>> > >>> Could you take a look at my webrev? I feel that those BarrierSet > >> interfaces > >>> have nothing to do with c1_LIRAssembler. > >>> > >>> Bug: https://bugs.openjdk.java.net/browse/JDK-8236228 > >>> Webrev: https://cr.openjdk.java.net/~xliu/8236228/00/webrev/ > >>> > >>> I try to build on aarch64 and x86_64 and it's fine. > >>> > >>> Thanks, > >>> --lx > >> > >> From jiefu at tencent.com Tue Jan 14 01:26:12 2020 From: jiefu at tencent.com (jiefu(傅杰)) Date: Tue, 14 Jan 2020 01:26:12 +0000 Subject: RFR: 8237055: [TESTBUG] compiler/c2/TestJumpTable.java fails with release VMs Message-ID: <51CA1A9F-883F-49D2-8CC0-629F9CDAD420@tencent.com> Hi all, JBS: https://bugs.openjdk.java.net/browse/JDK-8237055 May I get reviews for this one-line fix? The patch just adds -XX:+UnlockDiagnosticVMOptions for the test. ----------------------------------------------------------------------- diff -r db9bdbeaed29 test/hotspot/jtreg/compiler/c2/TestJumpTable.java --- a/test/hotspot/jtreg/compiler/c2/TestJumpTable.java Mon Jan 13 14:09:30 2020 -0800 +++ b/test/hotspot/jtreg/compiler/c2/TestJumpTable.java Tue Jan 14 09:11:06 2020 +0800 @@ -26,7 +26,7 @@ * @bug 8229855 * @summary Test jump table with key value that gets out of bounds after loop unrolling. * @run main/othervm -XX:CompileCommand=dontinline,compiler.c2.TestJumpTable::test* - * -Xbatch -XX:-TieredCompilation -XX:-UseSwitchProfiling + * -Xbatch -XX:+UnlockDiagnosticVMOptions -XX:-TieredCompilation -XX:-UseSwitchProfiling * compiler.c2.TestJumpTable */ ----------------------------------------------------------------------- Thanks a lot.
Best regards, Jie From felix.yang at huawei.com Tue Jan 14 06:33:08 2020 From: felix.yang at huawei.com (Yangfei (Felix)) Date: Tue, 14 Jan 2020 06:33:08 +0000 Subject: RFR(S): 8235762: JVM crash in SWPointer during C2 compilation In-Reply-To: <8797205a-c4ff-253d-404b-62a178c6e169@oracle.com> References: <4ef39906-839b-9a08-3225-f99b974f08de@oracle.com> <4103e17e-57c4-ed78-bf64-1bb3cb99e9f4@oracle.com> <8797205a-c4ff-253d-404b-62a178c6e169@oracle.com> Message-ID: Hi Tobias, Thanks for reviewing this. May I have another review please? Felix > -----Original Message----- > From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com] > Sent: Monday, January 13, 2020 7:28 PM > To: Yangfei (Felix) ; Christian Hagedorn > ; hotspot-compiler-dev at openjdk.java.net > Subject: Re: RFR(S): 8235762: JVM crash in SWPointer during C2 compilation > > Hi Felix, > > looks reasonable to me. Thanks for updating. > > Another review would be good. > > Best regards, > Tobias > > On 10.01.20 02:13, Yangfei (Felix) wrote: > > Hi Tobias, > > > > Updated webrev: http://cr.openjdk.java.net/~fyang/8235762/webrev.01/ > > This also updated copyright years of the newly-added test case. > > Still test OK with the latest repo. Is this one better? > > > > Thanks, > > Felix > > From tobias.hartmann at oracle.com Tue Jan 14 08:51:03 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Tue, 14 Jan 2020 09:51:03 +0100 Subject: RFR: 8237055: [TESTBUG] compiler/c2/TestJumpTable.java fails with release VMs In-Reply-To: <51CA1A9F-883F-49D2-8CC0-629F9CDAD420@tencent.com> References: <51CA1A9F-883F-49D2-8CC0-629F9CDAD420@tencent.com> Message-ID: <11f12f70-bfdc-1b9f-9cca-e9119f8f8752@oracle.com> Hi Jie, looks good and trivial to me. Thanks for fixing! Please push this into the JDK 14 repo: http://hg.openjdk.java.net/jdk/jdk14 Thanks, Tobias On 14.01.20 02:26, jiefu(傅杰) wrote: > Hi all, > > > > JBS: https://bugs.openjdk.java.net/browse/JDK-8237055 > > > > May I get reviews for this one-line fix?
> > The patch just adds -XX:+UnlockDiagnosticVMOptions for the test. > > > > ----------------------------------------------------------------------- > > diff -r db9bdbeaed29 test/hotspot/jtreg/compiler/c2/TestJumpTable.java > > --- a/test/hotspot/jtreg/compiler/c2/TestJumpTable.java Mon Jan 13 14:09:30 2020 -0800 > > +++ b/test/hotspot/jtreg/compiler/c2/TestJumpTable.java Tue Jan 14 09:11:06 2020 +0800 > > @@ -26,7 +26,7 @@ > > * @bug 8229855 > > * @summary Test jump table with key value that gets out of bounds after loop unrolling. > > * @run main/othervm -XX:CompileCommand=dontinline,compiler.c2.TestJumpTable::test* > > - * -Xbatch -XX:-TieredCompilation -XX:-UseSwitchProfiling > > + * -Xbatch -XX:+UnlockDiagnosticVMOptions -XX:-TieredCompilation -XX:-UseSwitchProfiling > > * compiler.c2.TestJumpTable > > */ > > ----------------------------------------------------------------------- > > > > Thanks a lot. > > Best regards, > > Jie > From Pengfei.Li at arm.com Tue Jan 14 08:54:50 2020 From: Pengfei.Li at arm.com (Pengfei Li) Date: Tue, 14 Jan 2020 08:54:50 +0000 Subject: RFR(S): 8230591: AArch64: Missing intrinsics for Math.ceil, floor, rint In-Reply-To: <98800e19-9bc1-654f-443c-d1fc1adfe4b7@redhat.com> References: <98800e19-9bc1-654f-443c-d1fc1adfe4b7@redhat.com> Message-ID: Hi Andrew, > Sure, but the code in "AdvSIMD two-reg misc" can easily be made flexible > enough to accommodate that. All you have to do as far as I can see is make > size a parameter: everything else is the same. > The important thing is that our assembler should follow the layout and > classification in that manual as closely as we possibly can. Thanks for your suggestion. I agree that we should align the layout and classification of the assembler code with ArmARM as closely as we can. But I found that just adding one "size" parameter is not enough. If it were, the RBIT instruction wouldn't have needed its odd special-case encoding in this category (from _rbit to rbit).
The problem here is that, after adding a "size" parameter for this misc category, the 2 bits in the size field [23:22] may be determined by the parameter "size" or the "SIMD_Arrangement T", or both. And which parameter determines which bit differs from instruction to instruction. To solve this problem in my latest webrev [1], I added an additional parameter "tmask", a 2-bit mask used together with "size" to indicate which bits in [23:22] are determined by the "SIMD_Arrangement T". A bit of "tmask" should be set to 1 if the corresponding bit is marked as "x" in the ArmARM doc. With this, we can also encode RBIT normally like the other instructions in the misc category. [1] http://cr.openjdk.java.net/~pli/rfr/8230591/webrev.01/ I ran full jtreg and no new failure is found. What do you think of this? -- Thanks, Pengfei From martin.doerr at sap.com Tue Jan 14 09:01:42 2020 From: martin.doerr at sap.com (Doerr, Martin) Date: Tue, 14 Jan 2020 09:01:42 +0000 Subject: RFR(XXS): 8236228: clean up BarrierSet headers in c1_LIRAssembler In-Reply-To: <2CDFB80B-04EA-4C95-A67A-1C1284C80C66@amazon.com> References: <1982726B-9702-4087-B962-354D2B9FAD92@amazon.com> <3C45D717-C0BF-40A4-93F5-EDA78FE91350@amazon.com> <3CC88DEC-56BE-4FDA-B775-7678F44E252A@amazon.com> <2CDFB80B-04EA-4C95-A67A-1C1284C80C66@amazon.com> Message-ID: Hi Xin and Paul, yes, if submission forest testing has passed, it can get pushed. Best regards, Martin > -----Original Message----- > From: hotspot-compiler-dev bounces at openjdk.java.net> On Behalf Of Hohensee, Paul > Sent: Montag, 13. Januar 2020 23:54 > To: Liu, Xin ; Ningsheng Jian > ; Vladimir Kozlov ; > 'hotspot-compiler-dev at openjdk.java.net' dev at openjdk.java.net> > Cc: nd > Subject: Re: RFR(XXS): 8236228: clean up BarrierSet headers in > c1_LIRAssembler > > Ping on behalf of Xin. > > Thanks, > > Paul > > On 12/30/19, 2:39 PM, "Liu, Xin" wrote: > > Hi, Reviewers, > > Thanks for reviewing it. Paul is a reviewer.
Martin reviewed PPC and s390 > and Ningsheng reviewed Arm&Aarch64. > Is that good to push? > > Thanks, > --lx > > > > > On 12/30/19, 11:16 AM, "Hohensee, Paul" > wrote: > > Lgtm. Seems very low risk to me too. > > Paul > > On 12/26/19, 7:06 PM, "hotspot-compiler-dev on behalf of Liu, Xin" > xxinliu at amazon.com> wrote: > > Hi, Ningsheng, > Thank you for submitting the trial repo. I got the result email and it has > passed all 80 tests. > > Here is the updated webrev. I updated the reviewers. > https://cr.openjdk.java.net/~xliu/8236228/01/webrev/ > > This is a low-risk patch. As long as we can compile it, it won't have any > side-effect at runtime. > > thanks, > --lx > > On 12/24/19, 9:52 PM, "Ningsheng Jian" > wrote: > > Hi lx, > > I've submitted it for you: > > http://hg.openjdk.java.net/jdk/submit/rev/9d61b00d5982 > > I've also verified aarch64 and arm build locally, and it looks fine. > > Thanks, > Ningsheng > > > -----Original Message----- > > From: hotspot-compiler-dev bounces at openjdk.java.net> > > On Behalf Of Liu, Xin > > Sent: Wednesday, December 25, 2019 4:57 AM > > To: Vladimir Kozlov ; 'hotspot- > compiler- > > dev at openjdk.java.net' dev at openjdk.java.net> > > Subject: Re: RFR(XXS): 8236228: clean up BarrierSet headers in > c1_LIRAssembler > > > > I update the subject and switch back the corporation email. Sorry, I > didn't > > realize that there's code of conduct. I will pay more attention on it. > > > > I validate the patch without PCH using --disable-precompiled- > headers on both > > x86_64 and aarch64. Both of them built well. > > I still have difficulty to verify SPARC. I try the submit repo but I > don't have > > permission to push an experimental branch. May I ask a sponsor > submit it on > > behalf of me? > > > > Thanks, > > --lx > > > > > > > > > > Hi Lx > > > > First, when you post RFR, please, include bug id in email's Subject. > > > > You can use jdk/submit testing to verify builds on SPARC.
We are > still building on > it, with warning. > > Changes seem fine to me but make sure to verify that it builds > without PCH. > > > > Regards, > > Vladimir > > > > On 12/20/19 9:24 AM, Liu Xin wrote: > > > Martin, > > > > > > Thank you very much. May I know how to validate Sparc? I don't > have > > > any SPARC machine to access. > > > > > > Thanks, > > > --lx > > > > > > > > > On Fri, Dec 20, 2019, 7:05 AM Doerr, Martin > > > wrote: > > > > > >> Hi lx, > > >> > > >> PPC and s390 parts are ok. > > >> > > >> Best regards, > > >> Martin > > >> > > >> > > >>> -----Original Message----- > > >>> From: hotspot-compiler-dev > >>> bounces at openjdk.java.net> On Behalf > > >>> Of Liu, Xin > > >>> Sent: Freitag, 20. Dezember 2019 04:48 > > >>> To: > > >>> 'hotspot-compiler-dev at openjdk.java.net compiler-dev at o > > >>> penjdk.java.net>' > > >>> dev at openjdk.java.net> > > >>> Subject: [DMARC FAILURE] RFR(XXS): clean up BarrierSet > headers in > > >>> c1_LIRAssembler > > >>> > > >>> Hi, Reviewers, > > >>> > > >>> Could you take a look at my webrev? I feel that those > BarrierSet > > >> interfaces > > >>> have nothing to do with c1_LIRAssembler. > > >>> > > >>> Bug: https://bugs.openjdk.java.net/browse/JDK-8236228 > > >>> Webrev: > https://cr.openjdk.java.net/~xliu/8236228/00/webrev/ > > >>> > > >>> I try to build on aarch64 and x86_64 and it's fine.
> > >>> > > >>> Thanks, > > >>> --lx > > >> > > >> > > > > > > > From rwestrel at redhat.com Tue Jan 14 09:43:34 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Tue, 14 Jan 2020 10:43:34 +0100 Subject: RFR(S): 8231291: C2: loop opts before EA should maximally unroll loops In-Reply-To: <580b9af0-1523-0cde-9b29-e09278e2fa1c@oracle.com> References: <87o8w8ptsq.fsf@redhat.com> <8736dfkko7.fsf@redhat.com> <7bc32dc2-1612-cb77-a467-ac67987b148c@oracle.com> <87woa35ijf.fsf@redhat.com> <9f364210-53e4-0b4d-df3e-a02aa5219cb0@oracle.com> <87o8v71uqz.fsf@redhat.com> <467329e0-10e7-3dfa-ab4c-9f011630a756@oracle.com> <87blr71iqf.fsf@redhat.com> <580b9af0-1523-0cde-9b29-e09278e2fa1c@oracle.com> Message-ID: <878sma1igp.fsf@redhat.com> > I understand that the change you have is enough to avoid the problematic > case in GraphKit::must_be_not_null(). What I'd like to get rid of is the > need for a correctness check to ensure that HaltNode can't go away. It's > error-prone and can eventually get out of sync. Fair enough. Then why not always enqueue the root node for igvn at the end of parsing? > Also, I'd like the bug to be handled separately (from 8231291). It'll > simplify possible backporting: as I see from the history, the > problematic code (JDK-8176506) was introduced in 10 and it may show up > in 11u eventually. You're right. That's cleaner. I'll send new webrevs. Roland. From aph at redhat.com Tue Jan 14 10:04:04 2020 From: aph at redhat.com (Andrew Haley) Date: Tue, 14 Jan 2020 10:04:04 +0000 Subject: RFR(S): 8230591: AArch64: Missing intrinsics for Math.ceil, floor, rint In-Reply-To: References: <98800e19-9bc1-654f-443c-d1fc1adfe4b7@redhat.com> Message-ID: On 1/14/20 8:54 AM, Pengfei Li wrote: > [1] http://cr.openjdk.java.net/~pli/rfr/8230591/webrev.01/ > > I ran full jtreg and no new failure is found. How do you think of this? Perfect, thanks. That's as well as it could be done, given the awkward encoding. 
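Pengfei's "tmask" idea earlier in this thread amounts to a two-bit merge: each bit of the size field [23:22] comes either from the fixed size argument or from the arrangement-derived bits, selected by tmask. A minimal sketch of that merge (plain Java with a hypothetical helper, not the actual assembler code):

```java
public class SizeFieldSketch {
    // Merge the two [23:22] bits: where a tmask bit is 1 the bit comes
    // from the arrangement-derived value (tBits), where it is 0 it
    // comes from the fixed 'size' argument.
    static int sizeField(int size, int tBits, int tmask) {
        return ((size & ~tmask) | (tBits & tmask)) & 0b11;
    }

    public static void main(String[] args) {
        System.out.println(sizeField(0b10, 0b01, 0b00)); // prints 2 (both bits fixed)
        System.out.println(sizeField(0b10, 0b01, 0b11)); // prints 1 (both bits from T)
        System.out.println(sizeField(0b10, 0b01, 0b01)); // prints 3 (one bit from each)
    }
}
```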
-- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From jiefu at tencent.com Tue Jan 14 10:19:40 2020 From: jiefu at tencent.com (jiefu(傅杰)) Date: Tue, 14 Jan 2020 10:19:40 +0000 Subject: 8237055: [TESTBUG] compiler/c2/TestJumpTable.java fails with release VMs(Internet mail) Message-ID: <4A3B3BF9-BAAC-4636-B2AD-34684DD12D36@tencent.com> Hi Tobias, Thanks for your review. I have some problems pushing the code and accessing my webrev through ssh (maybe caused by my local network config). So I need your help to sponsor it. I have prepared the patch in the attachment. Thanks a lot. Best regards, Jie On 2020/1/14, 4:52 PM, "Tobias Hartmann" wrote: Hi Jie, looks good and trivial to me. Thanks for fixing! Please push this into the JDK 14 repo: http://hg.openjdk.java.net/jdk/jdk14 Thanks, Tobias On 14.01.20 02:26, jiefu(傅杰) wrote: > Hi all, > > > > JBS: https://bugs.openjdk.java.net/browse/JDK-8237055 > > > > May I get reviews for this one-line fix?
> > Best regards, > > Jie > From tobias.hartmann at oracle.com Tue Jan 14 11:17:43 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Tue, 14 Jan 2020 12:17:43 +0100 Subject: 8237055: [TESTBUG] compiler/c2/TestJumpTable.java fails with release VMs(Internet mail) In-Reply-To: <4A3B3BF9-BAAC-4636-B2AD-34684DD12D36@tencent.com> References: <4A3B3BF9-BAAC-4636-B2AD-34684DD12D36@tencent.com> Message-ID: <28670645-10e3-363c-4e2b-b91495860be9@oracle.com> Hi Jie, sure, pushed. Best regards, Tobias On 14.01.20 11:19, jiefu(傅杰) wrote: > Hi Tobias, > > Thanks for your review. > > I have some problems to push the code and access my webrev through ssh (maybe caused by my local network config). > So I need your help to sponsor it. > I had prepared the patch in the attachment. > > Thanks a lot. > Best regards, > Jie > > On 2020/1/14, 4:52 PM, "Tobias Hartmann" wrote: > > Hi Jie, > > looks good and trivial to me. Thanks for fixing! > > Please push this into the JDK 14 repo: > http://hg.openjdk.java.net/jdk/jdk14 > > Thanks, > Tobias > > On 14.01.20 02:26, jiefu(傅杰) wrote: > > Hi all, > > > > > > > > JBS: https://bugs.openjdk.java.net/browse/JDK-8237055 > > > > > > > > May I get reviews for this one-line fix? > > > > The patch just adds -XX:+UnlockDiagnosticVMOptions for the test. > > > > > > > > ----------------------------------------------------------------------- > > > > diff -r db9bdbeaed29 test/hotspot/jtreg/compiler/c2/TestJumpTable.java > > > > --- a/test/hotspot/jtreg/compiler/c2/TestJumpTable.java Mon Jan 13 14:09:30 2020 -0800 > > > > +++ b/test/hotspot/jtreg/compiler/c2/TestJumpTable.java Tue Jan 14 09:11:06 2020 +0800 > > > > @@ -26,7 +26,7 @@ > > > > * @bug 8229855 > > > > * @summary Test jump table with key value that gets out of bounds after loop unrolling.
> > > > * @run main/othervm -XX:CompileCommand=dontinline,compiler.c2.TestJumpTable::test* > > > > - * -Xbatch -XX:-TieredCompilation -XX:-UseSwitchProfiling > > > > + * -Xbatch -XX:+UnlockDiagnosticVMOptions -XX:-TieredCompilation -XX:-UseSwitchProfiling > > > > * compiler.c2.TestJumpTable > > > > */ > > > > ----------------------------------------------------------------------- > > > > > > > > Thanks a lot. > > > > Best regards, > > > > Jie > > > > > From vladimir.x.ivanov at oracle.com Tue Jan 14 11:22:46 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Tue, 14 Jan 2020 14:22:46 +0300 Subject: RFR(S): 8231291: C2: loop opts before EA should maximally unroll loops In-Reply-To: <878sma1igp.fsf@redhat.com> References: <878sma1igp.fsf@redhat.com> Message-ID: >> I understand that the change you have is enough to avoid the problematic >> case in GraphKit::must_be_not_null(). What I'd like to get rid of is the >> need for a correctness check to ensure that HaltNode can't go away. It's >> error-prone and can eventually get out of sync. > > Fair enough. Then why not always enqueue the root node for igvn at the > end of parsing? Sounds good. FTR I'm fine with both options. But unconditionally registering RootNode for IGVN looks like a simpler and cleaner solution. > >> Also, I'd like the bug to be handled separately (from 8231291). It'll >> simplify possible backporting: as I see from the history, the >> problematic code (JDK-8176506) was introduced in 10 and it may show up >> in 11u eventually. > > You're right. That's cleaner. I'll send new webrevs. Thanks! Best regards, Vladimir Ivanov > > Roland.
> From jiefu at tencent.com Tue Jan 14 11:53:55 2020 From: jiefu at tencent.com (=?utf-8?B?amllZnUo5YKF5p2wKQ==?=) Date: Tue, 14 Jan 2020 11:53:55 +0000 Subject: 8237055: [TESTBUG] compiler/c2/TestJumpTable.java fails with release VMs(Internet mail) In-Reply-To: <28670645-10e3-363c-4e2b-b91495860be9@oracle.com> References: <4A3B3BF9-BAAC-4636-B2AD-34684DD12D36@tencent.com> <28670645-10e3-363c-4e2b-b91495860be9@oracle.com> Message-ID: <23418799-5240-4FF4-B1D9-DDCABD178180@tencent.com> Thank you so much, Tobias. On 2020/1/14, 7:18 PM, "Tobias Hartmann" wrote: Hi Jie, sure, pushed. Best regards, Tobias On 14.01.20 11:19, jiefu(傅杰) wrote: > Hi Tobias, > > Thanks for your review. > > I have some problems to push the code and access my webrev through ssh (maybe caused by my local network config). > So I need your help to sponsor it. > I had prepared the patch in the attachment. > > Thanks a lot. > Best regards, > Jie > > On 2020/1/14, 4:52 PM, "Tobias Hartmann" wrote: > > Hi Jie, > > looks good and trivial to me. Thanks for fixing! > > Please push this into the JDK 14 repo: > http://hg.openjdk.java.net/jdk/jdk14 > > Thanks, > Tobias > > On 14.01.20 02:26, jiefu(傅杰) wrote: > > Hi all, > > > > > > > > JBS: https://bugs.openjdk.java.net/browse/JDK-8237055 > > > > > > > > May I get reviews for this one-line fix? > > > > The patch just adds -XX:+UnlockDiagnosticVMOptions for the test. > > > > > > > > ----------------------------------------------------------------------- > > > > diff -r db9bdbeaed29 test/hotspot/jtreg/compiler/c2/TestJumpTable.java > > > > --- a/test/hotspot/jtreg/compiler/c2/TestJumpTable.java Mon Jan 13 14:09:30 2020 -0800 > > > > +++ b/test/hotspot/jtreg/compiler/c2/TestJumpTable.java Tue Jan 14 09:11:06 2020 +0800 > > > > @@ -26,7 +26,7 @@ > > > > * @bug 8229855 > > > > * @summary Test jump table with key value that gets out of bounds after loop unrolling.
> > > > * @run main/othervm -XX:CompileCommand=dontinline,compiler.c2.TestJumpTable::test* > > > > - * -Xbatch -XX:-TieredCompilation -XX:-UseSwitchProfiling > > > > + * -Xbatch -XX:+UnlockDiagnosticVMOptions -XX:-TieredCompilation -XX:-UseSwitchProfiling > > > > * compiler.c2.TestJumpTable > > > > */ > > > > ----------------------------------------------------------------------- > > > > > > > > Thanks a lot. > > > > Best regards, > > > > Jie > > > > > From fairoz.matte at oracle.com Tue Jan 14 13:32:47 2020 From: fairoz.matte at oracle.com (Fairoz Matte) Date: Tue, 14 Jan 2020 05:32:47 -0800 (PST) Subject: RFR: 8236701: [TESTBUG] compiler/loopopts/superword/Vec_MulAddS2I.java uses wrong flag -XX:-SuperWord In-Reply-To: <015d7aac-a487-41bb-a0c4-d51f58b9f85d@default> References: <7e5c671e-6fef-46c1-9429-51cafe243c65@default> <1205f0b4-2353-ac86-a519-5fc7ace2a4e5@oracle.com> <68c8cbca-4f0e-41cd-bc07-d8f0268e0205@default> <7ad3ddcb-b2d2-d815-a2f6-192612da9abc@oracle.com> <8f0d6d6c-927e-6b9e-6213-3274a099d288@oracle.com> <015d7aac-a487-41bb-a0c4-d51f58b9f85d@default> Message-ID: Hi Tobias, Sorry for my misunderstanding, @requires will restrict the execution on other platforms. We don't really need "-XX:+IgnoreUnrecognizedVMOptions" flag. Below is the updated webrev http://cr.openjdk.java.net/~fmatte/8236701/webrev.01/ Thanks, Fairoz -----Original Message----- From: Fairoz Matte Sent: Monday, January 13, 2020 9:23 PM To: Tobias Hartmann ; Ioi Lam ; hotspot-compiler-dev at openjdk.java.net Subject: RE: RFR: 8236701: [TESTBUG] compiler/loopopts/superword/Vec_MulAddS2I.java uses wrong flag -XX:-SuperWord Hi Tobias, Thanks for the review, It does look like we still need "-XX:+IgnoreUnrecognizedVMOptions" flag. Summary of the test case shows, Superword support is only for limited platforms. * @summary Add C2 x86 Superword support for VNNI VPDPWSSD Instruction * @requires os.arch=="x86" | os.arch=="i386" | os.arch=="amd64" | os.arch=="x86_64" I have updated copyright. 
Thanks, Fairoz -----Original Message----- From: Tobias Hartmann Sent: Monday, January 13, 2020 5:01 PM To: Fairoz Matte ; Ioi Lam ; hotspot-compiler-dev at openjdk.java.net Subject: Re: RFR: 8236701: [TESTBUG] compiler/loopopts/superword/Vec_MulAddS2I.java uses wrong flag -XX:-SuperWord Please also update the copyright. Best regards, Tobias On 13.01.20 12:29, Tobias Hartmann wrote: > Hi Fairoz, > > On 13.01.20 09:29, Fairoz Matte wrote: >> I was not sure why that option was added. > > I think it's just because the author incorrectly assumed that some of > the flags are only available in debug builds (which is not the case). > >> Let me know, if that is ok to be removed, I will send the next webrev. > > Yes, please do so. The change looks good to me. > > Thanks, > Tobias > From tobias.hartmann at oracle.com Tue Jan 14 13:47:49 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Tue, 14 Jan 2020 14:47:49 +0100 Subject: RFR: 8236701: [TESTBUG] compiler/loopopts/superword/Vec_MulAddS2I.java uses wrong flag -XX:-SuperWord In-Reply-To: References: <7e5c671e-6fef-46c1-9429-51cafe243c65@default> <1205f0b4-2353-ac86-a519-5fc7ace2a4e5@oracle.com> <68c8cbca-4f0e-41cd-bc07-d8f0268e0205@default> <7ad3ddcb-b2d2-d815-a2f6-192612da9abc@oracle.com> <8f0d6d6c-927e-6b9e-6213-3274a099d288@oracle.com> <015d7aac-a487-41bb-a0c4-d51f58b9f85d@default> Message-ID: Hi Fairoz, On 14.01.20 14:32, Fairoz Matte wrote: > http://cr.openjdk.java.net/~fmatte/8236701/webrev.01/ Looks good to me.
Best regards, Tobias From tobias.hartmann at oracle.com Tue Jan 14 14:06:35 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Tue, 14 Jan 2020 15:06:35 +0100 Subject: [15] RFR(M): 8236493: C2: Inconsistent known instance type information after EA Message-ID: <4ee2aba1-2e74-fdc5-f2e0-edb96f809d7b@oracle.com> Hi, please review the following patch: https://bugs.openjdk.java.net/browse/JDK-8236493 http://cr.openjdk.java.net/~thartmann/8236493/webrev.00/ While working on 8233164, I've noticed that escape analysis does not set known instance type information for LoadNodes and their users if the loaded object does not escape but that information is only (incrementally) propagated during IGVN. That's a problem because depending on the sequence in which nodes on the worklist are processed, optimizations might still see the not-yet-updated (and therefore incorrect) type. For example, another LoadNode optimization might rely on its memory input to be a general memory slice and re-wire its inputs accordingly (ignoring known instance type memory), when it is really reading from a known instance. For a detailed description of an instance of such a problem, see my RFR for 8233164 [1]. There, the type of a CastPPNode is not updated (yet) and a Load user is incorrectly re-wired. To catch this case before things go horribly wrong, I've added an assert to Node::raise_bottom_type(). Reverting the fix for 8233164 now immediately triggers the assert because the known instance id of a DecodeNNode is only updated during IGVN [2]. All new tests in TestInstanceIdPropagation.java trigger the assert as well. The fix is to always eagerly propagate that information during EA. If an object loaded from the field of a non-escaping object is also non-escaping, we need to propagate that information.
Thanks, Tobias [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2019-December/036616.html [2] Failure with TestArrayCopyMemoryChain.java when reverting fix for 8233164: 300 Proj === 288 [[ 301 305 312 ]] #5 !jvms: TestArrayCopyMemoryChain::test2 @ bci:25 305 CheckCastPP === 302 300 [[ 385 316 ]] #byte[int:1]:NotNull:exact *,iid=288 !jvms: TestArrayCopyMemoryChain::test2 @ bci:25 385 EncodeP === _ 305 [[ 387 403 ]] #narrowoop: byte[int:1]:NotNull:exact *,iid=288 !orig=[402] !jvms: TestArrayCopyMemoryChain$Wrapper:: @ bci:11 TestArrayCopyMemoryChain::test2 @ bci:41 403 > DecodeN === _ 385 [[ 494 420 ]] #byte[int:1]:NotNull:exact *,iid=288 !jvms: TestArrayCopyMemoryChain::test2 @ bci:48 494 CastPP === 486 403 [[ 526 514 ]] #byte[int:>=0]:NotNull:exact * !jvms: TestArrayCopyMemoryChain::test2 @ bci:71 420 CallStaticJava === 404 327 341 8 9 ( 419 1 1 1 64 1 1 1 403 257 43 64 11 230 ) [[ 421 ]] # Static uncommon_trap(reason='intrinsic_or_type_checked_inlining' action='make_not_entrant' debug_id='0') void ( int ) C=0.000100 TestArrayCopyMemoryChain::test2 @ bci:61 !jvms: TestArrayCopyMemoryChain::test2 @ bci:61 # To suppress the following error report, specify this argument # after -XX: or in .hotspotrc: SuppressErrorAt=/node.cpp:1088 # # A fatal error has been detected by the Java Runtime Environment: # # Internal Error (/oracle/jdk_jdk/open/src/hotspot/share/opto/node.cpp:1088), pid=6482, tid=6496 # assert(false) failed: Known instance type should have been set during EA From rwestrel at redhat.com Tue Jan 14 14:09:38 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Tue, 14 Jan 2020 15:09:38 +0100 Subject: RFR(S): 8231291: C2: loop opts before EA should maximally unroll loops In-Reply-To: References: <878sma1igp.fsf@redhat.com> Message-ID: <875zhe1659.fsf@redhat.com> New webrev without the change GraphKit::must_be_not_null(): http://cr.openjdk.java.net/~roland/8231291/webrev.03/ Roland. 
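[Editor's note, not part of the archived thread: as background to the escape-analysis messages above, here is a minimal sketch of the kind of non-escaping allocation that C2's escape analysis is designed to eliminate. Class and method names are invented for illustration; this is not code from any webrev or test in the thread.]

```java
public class EscapeDemo {
    static final class Point {
        final int x, y;
        Point(int x, int y) { this.x = x; this.y = y; }
    }

    // The Point never leaves this method, so escape analysis can
    // scalar-replace it: its fields live in registers and the
    // allocation disappears from the compiled code.
    static int distSq(int x, int y) {
        Point p = new Point(x, y); // does not escape
        return p.x * p.x + p.y * p.y;
    }

    public static void main(String[] args) {
        long sum = 0;
        for (int i = 0; i < 1_000_000; i++) {
            sum += distSq(i & 15, i & 7);
        }
        // The functional result is identical with or without EA;
        // only the allocation behavior of the compiled code differs.
        System.out.println(sum);
    }
}
```

The bugs discussed above arise because the "known instance" type information EA attaches to such non-escaping objects must be kept consistent across every node that observes their memory state.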
From rwestrel at redhat.com Tue Jan 14 14:27:01 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Tue, 14 Jan 2020 15:27:01 +0100 Subject: [15] RFR(S): 8236721: C2 should better optimize not-equal integer comparisons In-Reply-To: <480a02e6-1d4b-900f-28cf-ce37b6382e81@oracle.com> References: <480a02e6-1d4b-900f-28cf-ce37b6382e81@oracle.com> Message-ID: <8736ci15ca.fsf@redhat.com> > http://cr.openjdk.java.net/~thartmann/8236721/webrev.00/ Looks good to me. Roland. From christian.hagedorn at oracle.com Tue Jan 14 14:29:31 2020 From: christian.hagedorn at oracle.com (Christian Hagedorn) Date: Tue, 14 Jan 2020 15:29:31 +0100 Subject: [15] RFR(S): 8235332: TestInstanceCloneAsLoadsStores.java fails with -XX:+StressGCM Message-ID: <2ef5e2fd-d73c-5e1f-e50f-8ea36cd24029@oracle.com> Hi Please review the following patch http://cr.openjdk.java.net/~chagedorn/8235332/webrev.00/ https://bugs.openjdk.java.net/browse/JDK-8235332 An ArrayCopyNode for a clone is not inlined for more than 8 (=ArrayCopyLoadStoreMaxElem) fields. Therefore, the test case uses 9 fields. There is a GC barrier for 109 ArrayCopy [1] when disabling ReduceInitialCardMarks which is not taken into account in the pattern matching check for a clone at [2] in ConnectionGraph::find_inst_mem() during escape analysis. As a result, 161 MemBarStoreStore is skipped and the search continues at 160 MergeMem and the result is set to 97 Proj at [3]. Escape Analysis will then optimize 171 LoadI to directly use the memory output of the uninitialized allocation instead of the output of the ArrayCopyNode because it thinks the memory is the same [4]. This bug, however, is only observed when enabling the StressGCM flag which emits the field loading code before the array copy code which results in reads from uninitialized memory. The existing pattern matching code to find an ArrayCopyNode over the MergeMemNode is replaced by a pattern matching over the MemBarNode taking GC barriers into account. Thank you!
Best regards, Christian [1] https://bugs.openjdk.java.net/secure/attachment/86398/array_copy_with_gc_barrier.png [2] http://hg.openjdk.java.net/jdk/jdk/file/f7edb9ca045c/src/hotspot/share/opto/escape.cpp#l2749 [3] http://hg.openjdk.java.net/jdk/jdk/file/f7edb9ca045c/src/hotspot/share/opto/escape.cpp#l2765 [4] https://bugs.openjdk.java.net/secure/attachment/86397/field_load_wrong_memory.png From rwestrel at redhat.com Tue Jan 14 14:36:33 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Tue, 14 Jan 2020 15:36:33 +0100 Subject: [15] RFR(S): 8235332: TestInstanceCloneAsLoadsStores.java fails with -XX:+StressGCM In-Reply-To: <2ef5e2fd-d73c-5e1f-e50f-8ea36cd24029@oracle.com> References: <2ef5e2fd-d73c-5e1f-e50f-8ea36cd24029@oracle.com> Message-ID: <87zheqyuj2.fsf@redhat.com> Hi Christian, > http://cr.openjdk.java.net/~chagedorn/8235332/webrev.00/ If there's no GC barrier then: Node* control_proj_ac = bs->step_over_gc_barrier(proj_in->in(0)); control_proj_ac is not a projection, it's proj_in->in(0) so the test below can never succeed? Roland. From vladimir.x.ivanov at oracle.com Tue Jan 14 14:41:16 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Tue, 14 Jan 2020 16:41:16 +0200 Subject: RFR(S): 8231291: C2: loop opts before EA should maximally unroll loops In-Reply-To: <875zhe1659.fsf@redhat.com> References: <875zhe1659.fsf@redhat.com> Message-ID: > http://cr.openjdk.java.net/~roland/8231291/webrev.03/ Looks good. Best regards, Vladimir Ivanov From tobias.hartmann at oracle.com Tue Jan 14 15:00:07 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Tue, 14 Jan 2020 16:00:07 +0100 Subject: [15] RFR(S): 8236721: C2 should better optimize not-equal integer comparisons In-Reply-To: <8736ci15ca.fsf@redhat.com> References: <480a02e6-1d4b-900f-28cf-ce37b6382e81@oracle.com> <8736ci15ca.fsf@redhat.com> Message-ID: <85a387b3-b71b-b96c-b70b-c58be54f4aab@oracle.com> Hi Roland, thanks for the review! 
Best regards, Tobias On 14.01.20 15:27, Roland Westrelin wrote: > >> http://cr.openjdk.java.net/~thartmann/8236721/webrev.00/ > > Looks good to me. > > Roland. > From nils.eliasson at oracle.com Tue Jan 14 15:14:55 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Tue, 14 Jan 2020 16:14:55 +0100 Subject: RFR(S): 8235584: UseProfiledLoopPredicate fails with assert(_phase->get_loop(c) == loop) failed: have to be in the same loop In-Reply-To: <1a4c0874-946b-aea3-fe83-ac5e2b9f1862@oracle.com> References: <87r2031va5.fsf@redhat.com> <1a4c0874-946b-aea3-fe83-ac5e2b9f1862@oracle.com> Message-ID: +1 Best regards, Nils Eliasson On 2020-01-13 12:46, Tobias Hartmann wrote: > Hi Roland, > > On 13.01.20 11:54, Roland Westrelin wrote: >> http://cr.openjdk.java.net/~roland/8235584/webrev.00/ > Looks good to me. > >> Also, the fix that Tobias pushed for 8235452 erroneously disabled >> verification for the outer strip mined loop. This change restores it. > Thanks for fixing. We should get this into JDK 14. > > Best regards, > Tobias From IOI.LAM at ORACLE.COM Tue Jan 14 15:26:42 2020 From: IOI.LAM at ORACLE.COM (Ioi Lam) Date: Tue, 14 Jan 2020 07:26:42 -0800 Subject: RFR: 8236701: [TESTBUG] compiler/loopopts/superword/Vec_MulAddS2I.java uses wrong flag -XX:-SuperWord In-Reply-To: References: Message-ID: <3B9527AD-F34B-4813-92A2-803129A31061@ORACLE.COM> Looks good to me too. Thanks Ioi Sent from my iPad > On Jan 14, 2020, at 5:47 AM, Tobias Hartmann wrote: > > ?Hi Fairoz, > >> On 14.01.20 14:32, Fairoz Matte wrote: >> http://cr.openjdk.java.net/~fmatte/8236701/webrev.01/ > > Looks good to me. 
> > Best regards, > Tobias From tobias.hartmann at oracle.com Tue Jan 14 15:29:30 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Tue, 14 Jan 2020 16:29:30 +0100 Subject: RFR(S): 8236759: ShouldNotReachHere in PhaseIdealLoop::verify_strip_mined_scheduling In-Reply-To: <87ftgj1qtm.fsf@redhat.com> References: <87zher1xut.fsf@redhat.com> <87ftgj1qtm.fsf@redhat.com> Message-ID: Hi Roland, On 13.01.20 13:30, Roland Westrelin wrote: >> Just wondering why you need the checks in line 1487? Isn't in->is_OuterStripMinedLoop() sufficient >> since we should never schedule a data node in the outer strip mined loop? > > It could be top too. Also I guess, I'm becoming paranoid about little > things going wrong with strip mining... Fair enough. Looks good to me but please add a brief comment (something like "should not schedule a data node in the outer strip mined loop") before pushing. Thanks, Tobias From nils.eliasson at oracle.com Tue Jan 14 15:30:21 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Tue, 14 Jan 2020 16:30:21 +0100 Subject: RFR(S): 8235762: JVM crash in SWPointer during C2 compilation In-Reply-To: References: <4ef39906-839b-9a08-3225-f99b974f08de@oracle.com> <4103e17e-57c4-ed78-bf64-1bb3cb99e9f4@oracle.com> <8797205a-c4ff-253d-404b-62a178c6e169@oracle.com> Message-ID: Hi Felix, Your change looks good. Thanks for fixing! Best regards, Nils On 2020-01-14 07:33, Yangfei (Felix) wrote: > Hi Tobias, > > Thanks for reviewing this. May I have another review please? > > Felix > >> -----Original Message----- >> From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com] >> Sent: Monday, January 13, 2020 7:28 PM >> To: Yangfei (Felix) ; Christian Hagedorn >> ; hotspot-compiler-dev at openjdk.java.net >> Subject: Re: RFR(S): 8235762: JVM crash in SWPointer during C2 compilation >> >> Hi Felix, >> >> looks reasonable to me. Thanks for updating. >> >> Another review would be good. 
>> >> Best regards, >> Tobias >> >> On 10.01.20 02:13, Yangfei (Felix) wrote: >>> Hi Tobias, >>> >>> Updated webrev: http://cr.openjdk.java.net/~fyang/8235762/webrev.01/ >>> This also updated copyright years of the newly-added test case. >>> Still test OK with the latest repo. Is this one better? >>> >>> Thanks, >>> Felix >>> From nils.eliasson at oracle.com Tue Jan 14 15:31:26 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Tue, 14 Jan 2020 16:31:26 +0100 Subject: [15] RFR(S): 8236721: C2 should better optimize not-equal integer comparisons In-Reply-To: <8736ci15ca.fsf@redhat.com> References: <480a02e6-1d4b-900f-28cf-ce37b6382e81@oracle.com> <8736ci15ca.fsf@redhat.com> Message-ID: +1 Best regards, Nils Eliasson On 2020-01-14 15:27, Roland Westrelin wrote: >> http://cr.openjdk.java.net/~thartmann/8236721/webrev.00/ > Looks good to me. > > Roland. > From tobias.hartmann at oracle.com Tue Jan 14 15:36:23 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Tue, 14 Jan 2020 16:36:23 +0100 Subject: [15] RFR(S): 8236721: C2 should better optimize not-equal integer comparisons In-Reply-To: References: <480a02e6-1d4b-900f-28cf-ce37b6382e81@oracle.com> <8736ci15ca.fsf@redhat.com> Message-ID: <54a90eae-33c4-0b83-18ee-694c485f20b8@oracle.com> Thanks Nils! Best regards, Tobias On 14.01.20 16:31, Nils Eliasson wrote: > +1 > > Best regards, > Nils Eliasson > > On 2020-01-14 15:27, Roland Westrelin wrote: >>> http://cr.openjdk.java.net/~thartmann/8236721/webrev.00/ >> Looks good to me. >> >> Roland.
>> > From rwestrel at redhat.com Tue Jan 14 15:53:58 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Tue, 14 Jan 2020 16:53:58 +0100 Subject: RFR(S): 8231291: C2: loop opts before EA should maximally unroll loops In-Reply-To: <878sma1igp.fsf@redhat.com> References: <87o8w8ptsq.fsf@redhat.com> <8736dfkko7.fsf@redhat.com> <7bc32dc2-1612-cb77-a467-ac67987b148c@oracle.com> <87woa35ijf.fsf@redhat.com> <9f364210-53e4-0b4d-df3e-a02aa5219cb0@oracle.com> <87o8v71uqz.fsf@redhat.com> <467329e0-10e7-3dfa-ab4c-9f011630a756@oracle.com> <87blr71iqf.fsf@redhat.com> <580b9af0-1523-0cde-9b29-e09278e2fa1c@oracle.com> <878sma1igp.fsf@redhat.com> Message-ID: <87wo9uyqy1.fsf@redhat.com> >> I understand that the change you have is enough to avoid the problematic >> case in GraphKit::must_be_not_null(). What I'd like to get rid of is the >> need for a correctness check to ensure that HaltNode can't go away. It's >> error-prone and can eventually get out of sync. > > Fair enough. Then why not always enqueue the root node for igvn at the > end of parsing? I filed 8237086 for this. Roland. From rwestrel at redhat.com Tue Jan 14 16:07:24 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Tue, 14 Jan 2020 17:07:24 +0100 Subject: RFR(S): 8237086: assert(is_MachReturn()) running CTW with fix for JDK-8231291 Message-ID: <87tv4yyqbn.fsf@redhat.com> http://cr.openjdk.java.net/~roland/8237086/webrev.00/ As discussed in: https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2020-January/036689.html GraphKit::must_be_not_null() is called with a known to be non null input. This causes the null branch of the test to be optimized to top and a top be added to the RootNode. RootNode::Ideal() is never called to clean the top inputs and the assert is hit during matching. Roland. 
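[Editor's note, not part of the archived thread: at the Java level, the GraphKit::must_be_not_null() scenario Roland describes corresponds to a null check on a value the compiler can prove non-null, so the null branch is dead code. A hedged sketch; names are invented and this is not code from the patch.]

```java
import java.util.Objects;

public class NotNullDemo {
    static int length(String s) {
        // When this method is inlined at the call site below, 's' is a
        // freshly allocated object, so C2 knows it is non-null and the
        // null branch of the check folds away (to "top" in the ideal graph).
        return Objects.requireNonNull(s).length();
    }

    public static void main(String[] args) {
        int total = 0;
        for (int i = 0; i < 100_000; i++) {
            // The receiver is provably non-null at compile time.
            total += length(new String("abc"));
        }
        System.out.println(total); // 300000
    }
}
```

The bug report above is about the compiler-internal bookkeeping for exactly this situation: the dead null path leaves a top input on the RootNode that has to be cleaned up before matching.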
From tobias.hartmann at oracle.com Tue Jan 14 16:13:52 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Tue, 14 Jan 2020 17:13:52 +0100 Subject: RFR(S): 8237007: Shenandoah: assert(_base == Tuple) failure during C2 compilation In-Reply-To: <87wo9v1wup.fsf@redhat.com> References: <87wo9v1wup.fsf@redhat.com> Message-ID: <80847ab4-3b49-68c8-33af-8946fd14cda3@oracle.com> Hi Roland, as we've discussed off-thread, considering the CMove as unpinned seems reasonable because it does not have any (side-)effects that would not be explicit in the graph. Still wondering why the CMove then has a control input in the first place but your change looks good to me. I'll run some testing. Best regards, Tobias On 13.01.20 11:20, Roland Westrelin wrote: > > http://cr.openjdk.java.net/~roland/8237007/webrev.00/ > > The assert fails because C2 expands a barrier between a membar and its > control projection. That happens because the gc barrier is assigned the > membar as control by loop optimizations (and not its control > projection). The gc barrier has a load as input which itself depends on > the memory state produced by the membar so the barrier has to be below > the membar. The gc barrier has a CMove as use. That CMove has a control > input that's above the membar and because it's considered pinned by loop > opts, it's assigned earliest possible control: the membar. Considering > the CMove as pinned is overly conservative AFAICT. My fix consists in > excluding CMove nodes from being pinned. > > Roland. 
> From christian.hagedorn at oracle.com Tue Jan 14 16:18:05 2020 From: christian.hagedorn at oracle.com (Christian Hagedorn) Date: Tue, 14 Jan 2020 17:18:05 +0100 Subject: [15] RFR(S): 8235332: TestInstanceCloneAsLoadsStores.java fails with -XX:+StressGCM In-Reply-To: <87zheqyuj2.fsf@redhat.com> References: <2ef5e2fd-d73c-5e1f-e50f-8ea36cd24029@oracle.com> <87zheqyuj2.fsf@redhat.com> Message-ID: <3f4d0542-5d72-dca4-d14d-7a13402fbf1c@oracle.com> Hi Roland >> http://cr.openjdk.java.net/~chagedorn/8235332/webrev.00/ > > If there's no GC barrier then: > > Node* control_proj_ac = bs->step_over_gc_barrier(proj_in->in(0)); > > control_proj_ac is not a projection, it's proj_in->in(0) so the test > below can never succeed? If there is no GC barrier, then proj_in is a membar with its control input being a control projection from the ArrayCopyNode. As an example, I have taken the test case TestEliminatedCloneBadMemEdge [1] which does not disable ReduceInitialCardMarks and has no GC barrier. We have the following situation at [2]: proj_in is 132 MemBarCPUOrder. The original code then looked at the MergeMem 100 input following in(Compile::AliasIdxRaw) which is 101 Proj and its input 98 ArrayCopy. This passes the check. My fix now directly looks at the control input of 132 MemBarCPUOrder which is the projection 99 Proj coming from the 98 ArrayCopy. This also passes the check. 98 ArrayCopy === 85 1 65 8 1 ( 47 _ 90 _ 97 _ _ _ _ ) [[ 99 101 ]] void ( java/lang/Object *, int, java/lang/Object *, int, int, int, int, BotPTR *+bot, BotPTR *+bot ) (clone, tightly coupled allocation) !jvms: TestEliminatedCloneBadMemEdge::test @ bci:1 101 Proj === 98 [[ 100 ]] #2 Memory: @rawptr:BotPTR, idx=Raw; !jvms: TestEliminatedCloneBadMemEdge::test @ bci:1 100 MergeMem === _ 1 39 101 86 1 86 86 86 86 86 86 86 86 86 86 [[ 132 ]] { N101:rawptr:BotPTR N86:java/lang/Object+8 ... 
99 Proj === 98 [[ 132 ]] #0 !jvms: TestEliminatedCloneBadMemEdge::test @ bci:1 132 MemBarCPUOrder === 99 1 100 1 1 [[ 104 103 ]] !orig=102 !jvms: TestEliminatedCloneBadMemEdge::test @ bci:1 Best regards, Christian [1] http://hg.openjdk.java.net/jdk/jdk/file/2080e45e61ea/test/hotspot/jtreg/compiler/arraycopy/TestEliminatedCloneBadMemEdge.java [2] http://hg.openjdk.java.net/jdk/jdk/file/f7edb9ca045c/src/hotspot/share/opto/escape.cpp#l2749 From rkennke at redhat.com Tue Jan 14 17:06:53 2020 From: rkennke at redhat.com (Roman Kennke) Date: Tue, 14 Jan 2020 18:06:53 +0100 Subject: RFR(S): 8237007: Shenandoah: assert(_base == Tuple) failure during C2 compilation In-Reply-To: <87wo9v1wup.fsf@redhat.com> References: <87wo9v1wup.fsf@redhat.com> Message-ID: <6026815f-10ab-dc72-9322-4484e96ce43a@redhat.com> Hi Roland, It looks ok to me. I'd probably add case Op_CMove cases to the switch instead. Thanks, Roman > http://cr.openjdk.java.net/~roland/8237007/webrev.00/ > > The assert fails because C2 expands a barrier between a membar and its > control projection. That happens because the gc barrier is assigned the > membar as control by loop optimizations (and not its control > projection). The gc barrier has a load as input which itself depends on > the memory state produced by the membar so the barrier has to be below > the membar. The gc barrier has a CMove as use. That CMove has a control > input that's above the membar and because it's considered pinned by loop > opts, it's assigned earliest possible control: the membar. Considering > the CMove as pinned is overly conservative AFAICT. My fix consists in > excluding CMove nodes from being pinned. > > Roland. 
> From hohensee at amazon.com Tue Jan 14 17:17:52 2020 From: hohensee at amazon.com (Hohensee, Paul) Date: Tue, 14 Jan 2020 17:17:52 +0000 Subject: RFR(XXS): 8236228: clean up BarrierSet headers in c1_LIRAssembler In-Reply-To: References: <1982726B-9702-4087-B962-354D2B9FAD92@amazon.com> <3C45D717-C0BF-40A4-93F5-EDA78FE91350@amazon.com> <3CC88DEC-56BE-4FDA-B775-7678F44E252A@amazon.com> <2CDFB80B-04EA-4C95-A67A-1C1284C80C66@amazon.com> Message-ID: <72203708-CF49-47D7-B4C1-737E9E5C0B07@amazon.com> Thanks, Martin. Ningsheng and Xin report that the submit repo run passed (below). Pushed. Paul On 1/14/20, 1:03 AM, "Doerr, Martin" wrote: Hi Xin and Paul, yes, if submission forest testing has passed, it can get pushed. Best regards, Martin > -----Original Message----- > From: hotspot-compiler-dev bounces at openjdk.java.net> On Behalf Of Hohensee, Paul > Sent: Montag, 13. Januar 2020 23:54 > To: Liu, Xin ; Ningsheng Jian > ; Vladimir Kozlov ; > 'hotspot-compiler-dev at openjdk.java.net' dev at openjdk.java.net> > Cc: nd > Subject: Re: RFR(XXS): 8236228: clean up BarrierSet headers in > c1_LIRAssembler > > Ping on behalf of Xin. > > Thanks, > > Paul > > On 12/30/19, 2:39 PM, "Liu, Xin" wrote: > > Hi, Reviewers, > > Thanks for reviewing it. Paul is a reviewer. Martin reviewed PPC and s390 > and Ningsheng reviewed Arm&Aarch64. > Is that good to push? > > Thanks, > --lx > > > > > On 12/30/19, 11:16 AM, "Hohensee, Paul" > wrote: > > Lgtm. Seems very low risk to me too. > > Paul > > On 12/26/19, 7:06 PM, "hotspot-compiler-dev on behalf of Liu, Xin" > xxinliu at amazon.com> wrote: > > Hi, Ningsheng, > Thank you for submitting the trial repo. I got the result email and it has > passed all 80 tests. > > Here is the updated webrev. I updated the reviewers. > https://cr.openjdk.java.net/~xliu/8236228/01/webrev/ > > This is a low-risk patch. As long as we can compile it, it won't have any > side-effect at runtime.
> > thanks, > --lx > > On 12/24/19, 9:52 PM, "Ningsheng Jian" > wrote: > > Hi lx, > > I've submitted it for you: > > http://hg.openjdk.java.net/jdk/submit/rev/9d61b00d5982 > > I've also verified aarch64 and arm build locally, and it looks fine. > > Thanks, > Ningsheng > > > -----Original Message----- > > From: hotspot-compiler-dev bounces at openjdk.java.net> > > On Behalf Of Liu, Xin > > Sent: Wednesday, December 25, 2019 4:57 AM > > To: Vladimir Kozlov ; 'hotspot- > compiler- > > dev at openjdk.java.net' dev at openjdk.java.net> > > Subject: Re: RFR(XXS): clean up BarrierSet headers in > c1_LIRAssembler > > > > I update the subject and switch back the corporation email. Sorry, I > didn't > > realize that there's code of conduct. I will pay more attention on it. > > > > I validate the patch without PCH using --disable-precompiled- > headers on both > > x86_64 and aarch64. Both of them built well. > > I still have difficulty to verify SPARC. I try the submit repo but I > don't have > > permission to push an experimental branch. May I ask a sponsor > submit it on > > behalf of me? > > > > Thanks, > > --lx > > > > > > > > > > Hi Lx > > > > First, when you post RFR, please, include bug id in email's Subject. > > > > You can use jdk/submit testing to verify builds on SPARC. We are > still building on > > it, with warning. > > Changes seems fine to me but make sure verify that it builds > without PCH. > > > > Regards, > > Vladimir > > > > On 12/20/19 9:24 AM, Liu Xin wrote: > > > Martin? > > > > > > Thank you very much. May I know how to validate Sparc? I don't > have > > > any SPARC machine to access. > > > > > > Thanks, > > > --lx > > > > > > > > > On Fri, Dec 20, 2019, 7:05 AM Doerr, Martin > > > wrote: > > > > > >> Hi lx, > > >> > > >> PPC and s390 parts are ok.
> > >> > > >> Best regards, > > >> Martin > > >> > > >> > > >>> -----Original Message----- > > >>> From: hotspot-compiler-dev > >>> > bounces at openjdk.java.net> On Behalf > > >>> Of Liu, Xin > > >>> Sent: Freitag, 20. Dezember 2019 04:48 > > >>> To: > > >>> 'hotspot-compiler-dev at openjdk.java.net compiler-dev at o > > >>> penjdk.java.net>' > >>> dev at openjdk.java.net> > > >>> Subject: [DMARC FAILURE] RFR(XXS): clean up BarrierSet > headers in > > >>> c1_LIRAssembler > > >>> > > >>> Hi, Reviewers, > > >>> > > >>> Could you take a look at my webrev? I feel that those > BarrierSet > > >> interfaces > > >>> have nothing to do with c1_LIRAssembler. > > >>> > > >>> Bug: https://bugs.openjdk.java.net/browse/JDK-8236228 > > >>> Webrev: > https://cr.openjdk.java.net/~xliu/8236228/00/webrev/ > > >>> > > >>> I try to build on aarch64 and x86_64 and it's fine. > > >>> > > >>> Thanks, > > >>> --lx > > >> > > >> > > > > > > > From tobias.hartmann at oracle.com Wed Jan 15 06:06:25 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Wed, 15 Jan 2020 07:06:25 +0100 Subject: RFR(S): 8237007: Shenandoah: assert(_base == Tuple) failure during C2 compilation In-Reply-To: <80847ab4-3b49-68c8-33af-8946fd14cda3@oracle.com> References: <87wo9v1wup.fsf@redhat.com> <80847ab4-3b49-68c8-33af-8946fd14cda3@oracle.com> Message-ID: <069e55af-de92-0525-d3c0-9dfef181381c@oracle.com> On 14.01.20 17:13, Tobias Hartmann wrote: > I'll run some testing. Tests passed. Best regards, Tobias From felix.yang at huawei.com Wed Jan 15 08:55:45 2020 From: felix.yang at huawei.com (Yangfei (Felix)) Date: Wed, 15 Jan 2020 08:55:45 +0000 Subject: RFR(S): 8235762: JVM crash in SWPointer during C2 compilation In-Reply-To: References: <4ef39906-839b-9a08-3225-f99b974f08de@oracle.com> <4103e17e-57c4-ed78-bf64-1bb3cb99e9f4@oracle.com> <8797205a-c4ff-253d-404b-62a178c6e169@oracle.com> Message-ID: Hi Nils, Thanks for giving it another review. Will do the push. 
Best regards, Felix > > Hi Felix, > > Your change looks good. > > Thanks for fixing! > > Best regards, > Nils > > On 2020-01-14 07:33, Yangfei (Felix) wrote: > > Hi Tobias, > > > > Thanks for reviewing this. May I have another review please? > > > > Felix > > > >> -----Original Message----- > >> From: Tobias Hartmann [mailto:tobias.hartmann at oracle.com] > >> Sent: Monday, January 13, 2020 7:28 PM > >> To: Yangfei (Felix) ; Christian Hagedorn > >> ; > >> hotspot-compiler-dev at openjdk.java.net > >> Subject: Re: RFR(S): 8235762: JVM crash in SWPointer during C2 > >> compilation > >> > >> Hi Felix, > >> > >> looks reasonable to me. Thanks for updating. > >> > >> Another review would be good. > >> > >> Best regards, > >> Tobias > >> > >> On 10.01.20 02:13, Yangfei (Felix) wrote: > >>> Hi Tobias, > >>> > >>> Updated webrev: > http://cr.openjdk.java.net/~fyang/8235762/webrev.01/ > >>> This also updated copyright years of the newly-added test case. > >>> Still test OK with the latest repo. Is this one better? > >>> > >>> Thanks, > >>> Felix > >>> From rwestrel at redhat.com Wed Jan 15 09:01:47 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Wed, 15 Jan 2020 10:01:47 +0100 Subject: RFR(S): 8237007: Shenandoah: assert(_base == Tuple) failure during C2 compilation In-Reply-To: <069e55af-de92-0525-d3c0-9dfef181381c@oracle.com> References: <87wo9v1wup.fsf@redhat.com> <80847ab4-3b49-68c8-33af-8946fd14cda3@oracle.com> <069e55af-de92-0525-d3c0-9dfef181381c@oracle.com> Message-ID: <87pnflytxg.fsf@redhat.com> Hi Tobias, Thanks for the review and testing. Roland. 
From rwestrel at redhat.com Wed Jan 15 09:03:39 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Wed, 15 Jan 2020 10:03:39 +0100 Subject: RFR(S): 8237007: Shenandoah: assert(_base == Tuple) failure during C2 compilation In-Reply-To: <6026815f-10ab-dc72-9322-4484e96ce43a@redhat.com> References: <87wo9v1wup.fsf@redhat.com> <6026815f-10ab-dc72-9322-4484e96ce43a@redhat.com> Message-ID: <87muapytuc.fsf@redhat.com> Hi Roman, > It looks ok to me. I'd probably add case Op_CMove cases to the switch > instead. Thanks for looking at this. I would have to list all variants in the switch: CMoveD CMoveVD CMoveF CMoveVF CMoveI CMoveL CMoveP CMoveN That doesn't seem better to me and is quite error prone too. Roland. From rwestrel at redhat.com Wed Jan 15 09:39:41 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Wed, 15 Jan 2020 10:39:41 +0100 Subject: RFR(S): 8235584: UseProfiledLoopPredicate fails with assert(_phase->get_loop(c) == loop) failed: have to be in the same loop In-Reply-To: References: <87r2031va5.fsf@redhat.com> <1a4c0874-946b-aea3-fe83-ac5e2b9f1862@oracle.com> Message-ID: <87k15tys6a.fsf@redhat.com> Thanks for the review, Nils. Roland. From rwestrel at redhat.com Wed Jan 15 09:40:29 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Wed, 15 Jan 2020 10:40:29 +0100 Subject: RFR(S): 8231291: C2: loop opts before EA should maximally unroll loops In-Reply-To: References: <875zhe1659.fsf@redhat.com> Message-ID: <87h80xys4y.fsf@redhat.com> Thanks for the review, Vladimir. Roland. 
From rkennke at redhat.com Wed Jan 15 09:44:10 2020 From: rkennke at redhat.com (Roman Kennke) Date: Wed, 15 Jan 2020 10:44:10 +0100 Subject: RFR(S): 8237007: Shenandoah: assert(_base == Tuple) failure during C2 compilation In-Reply-To: <87muapytuc.fsf@redhat.com> References: <87wo9v1wup.fsf@redhat.com> <6026815f-10ab-dc72-9322-4484e96ce43a@redhat.com> <87muapytuc.fsf@redhat.com> Message-ID: <22c15858-02fd-4606-136b-8a86f8246c7d@redhat.com> Hi Roland, >> It looks ok to me. I'd probably add case Op_CMove cases to the switch >> instead. > > Thanks for looking at this. I would have to list all variants in the > switch: > > CMoveD > CMoveVD > CMoveF > CMoveVF > CMoveI > CMoveL > CMoveP > CMoveN > > That doesn't seem better to me and is quite error prone too. Yes, true. It's the case for Op_Load* above too, and less pronounced for Div, Mod, etc. I don't have a strong preference either way, your call. Patch is good. Thank you! Roman From rwestrel at redhat.com Wed Jan 15 10:08:17 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Wed, 15 Jan 2020 11:08:17 +0100 Subject: [15] RFR(S): 8235332: TestInstanceCloneAsLoadsStores.java fails with -XX:+StressGCM In-Reply-To: <3f4d0542-5d72-dca4-d14d-7a13402fbf1c@oracle.com> References: <2ef5e2fd-d73c-5e1f-e50f-8ea36cd24029@oracle.com> <87zheqyuj2.fsf@redhat.com> <3f4d0542-5d72-dca4-d14d-7a13402fbf1c@oracle.com> Message-ID: <87eew1yqum.fsf@redhat.com> >> If there's no GC barrier then: >> >> Node* control_proj_ac = bs->step_over_gc_barrier(proj_in->in(0)); >> >> control_proj_ac is not a projection, it's proj_in->in(0) so the test >> below can never succeed? > > If there is no GC barrier, then proj_in is a membar with its control > input being a control projection from the ArrayCopyNode. > > As an example, I have taken the test case TestEliminatedCloneBadMemEdge > [1] which does not disable ReduceInitialCardMarks and has no GC barrier. > We have the following situation at [2]: > > proj_in is 132 MemBarCPUOrder. 
The original code then looked at the > MergeMem 100 input following in(Compile::AliasIdxRaw) which is 101 Proj > and its input 98 ArrayCopy. This passes the check. > > My fix now directly looks at the control input of 132 MemBarCPUOrder > which is the projection 99 Proj coming from the 98 ArrayCopy. This also > passes the check. Ah! Right. I wonder if there's a reason why the current code follows the control graph (as you do) instead of the memory graph. I couldn't find one and your change looks good to me. Roland. From christian.hagedorn at oracle.com Wed Jan 15 11:02:52 2020 From: christian.hagedorn at oracle.com (Christian Hagedorn) Date: Wed, 15 Jan 2020 12:02:52 +0100 Subject: [15] RFR(S): 8235332: TestInstanceCloneAsLoadsStores.java fails with -XX:+StressGCM In-Reply-To: <87eew1yqum.fsf@redhat.com> References: <2ef5e2fd-d73c-5e1f-e50f-8ea36cd24029@oracle.com> <87zheqyuj2.fsf@redhat.com> <3f4d0542-5d72-dca4-d14d-7a13402fbf1c@oracle.com> <87eew1yqum.fsf@redhat.com> Message-ID: >>> If there's no GC barrier then: >>> >>> Node* control_proj_ac = bs->step_over_gc_barrier(proj_in->in(0)); >>> >>> control_proj_ac is not a projection, it's proj_in->in(0) so the test >>> below can never succeed? >> >> If there is no GC barrier, then proj_in is a membar with its control >> input being a control projection from the ArrayCopyNode. >> >> As an example, I have taken the test case TestEliminatedCloneBadMemEdge >> [1] which does not disable ReduceInitialCardMarks and has no GC barrier. >> We have the following situation at [2]: >> >> proj_in is 132 MemBarCPUOrder. The original code then looked at the >> MergeMem 100 input following in(Compile::AliasIdxRaw) which is 101 Proj >> and its input 98 ArrayCopy. This passes the check. >> >> My fix now directly looks at the control input of 132 MemBarCPUOrder >> which is the projection 99 Proj coming from the 98 ArrayCopy. This also >> passes the check. > > Ah! Right. 
I wonder if there's a reason why the current code follows the > control graph (as you do) instead of the memory graph. I couldn't find > one and your change looks good to me. Thank you for your review Roland! Was wondering about that, too. Best regards, Christian From vladimir.kozlov at oracle.com Wed Jan 15 18:08:39 2020 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Wed, 15 Jan 2020 10:08:39 -0800 Subject: [14] RFR (T) 8236726: Put vmTestbase/vm/mlvm/indy/stress/java tests on AOT Problem list Message-ID: <37b1ce65-749d-1d14-29e6-3620db823e9e@oracle.com> https://bugs.openjdk.java.net/browse/JDK-8236726 I need more time to work on 8226689 [1]. I can't reproduce failure locally - it seems very intermittent. For JDK 14 I suggest to put failed tests on AOT problem list to skip them. Thanks, Vladimir diff -r 5b2e2fdb4d19 test/hotspot/jtreg/ProblemList-aot.txt --- a/test/hotspot/jtreg/ProblemList-aot.txt +++ b/test/hotspot/jtreg/ProblemList-aot.txt @@ -81,3 +81,5 @@ compiler/intrinsics/sha/sanity/TestSHA1MultiBlockIntrinsics.java 8167430 generic-all compiler/intrinsics/sha/sanity/TestSHA512MultiBlockIntrinsics.java 8167430 generic-all +vmTestbase/vm/mlvm/indy/stress/java/relinkMutableCallSiteFreq/Test.java 8226689 generic-all +vmTestbase/vm/mlvm/indy/stress/java/relinkVolatileCallSiteFreq/Test.java 8226689 generic-all [1] https://bugs.openjdk.java.net/browse/JDK-8226689 From igor.ignatyev at oracle.com Wed Jan 15 18:43:59 2020 From: igor.ignatyev at oracle.com (Igor Ignatev) Date: Wed, 15 Jan 2020 10:43:59 -0800 Subject: [14] RFR (T) 8236726: Put vmTestbase/vm/mlvm/indy/stress/java tests on AOT Problem list In-Reply-To: <37b1ce65-749d-1d14-29e6-3620db823e9e@oracle.com> References: <37b1ce65-749d-1d14-29e6-3620db823e9e@oracle.com> Message-ID: <82CE54AD-5AF2-4C48-8A6C-DEBBE43EA9C4@oracle.com> LGTM -- Igor > On Jan 15, 2020, at 10:09 AM, Vladimir Kozlov wrote: > > https://bugs.openjdk.java.net/browse/JDK-8236726 > > I need more time to work on 8226689 [1]. 
I can't reproduce failure locally - it seems very intermittent. > For JDK 14 I suggest to put failed tests on AOT problem list to skip them. > > Thanks, > Vladimir > > diff -r 5b2e2fdb4d19 test/hotspot/jtreg/ProblemList-aot.txt > --- a/test/hotspot/jtreg/ProblemList-aot.txt > +++ b/test/hotspot/jtreg/ProblemList-aot.txt > @@ -81,3 +81,5 @@ > compiler/intrinsics/sha/sanity/TestSHA1MultiBlockIntrinsics.java 8167430 generic-all > compiler/intrinsics/sha/sanity/TestSHA512MultiBlockIntrinsics.java 8167430 generic-all > > +vmTestbase/vm/mlvm/indy/stress/java/relinkMutableCallSiteFreq/Test.java 8226689 generic-all > +vmTestbase/vm/mlvm/indy/stress/java/relinkVolatileCallSiteFreq/Test.java 8226689 generic-all > > > [1] https://bugs.openjdk.java.net/browse/JDK-8226689 From vladimir.kozlov at oracle.com Wed Jan 15 19:01:20 2020 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Wed, 15 Jan 2020 11:01:20 -0800 Subject: [14] RFR (T) 8236726: Put vmTestbase/vm/mlvm/indy/stress/java tests on AOT Problem list In-Reply-To: <82CE54AD-5AF2-4C48-8A6C-DEBBE43EA9C4@oracle.com> References: <37b1ce65-749d-1d14-29e6-3620db823e9e@oracle.com> <82CE54AD-5AF2-4C48-8A6C-DEBBE43EA9C4@oracle.com> Message-ID: <90ac9bdb-0835-ac27-241e-5741ce1f539d@oracle.com> Thank you, Igor Vladimir On 1/15/20 10:43 AM, Igor Ignatev wrote: > LGTM > > -- Igor > >> On Jan 15, 2020, at 10:09 AM, Vladimir Kozlov wrote: >> >> https://bugs.openjdk.java.net/browse/JDK-8236726 >> >> I need more time to work on 8226689 [1]. I can't reproduce failure locally - it seems very intermittent. >> For JDK 14 I suggest to put failed tests on AOT problem list to skip them. 
>> >> Thanks, >> Vladimir >> >> diff -r 5b2e2fdb4d19 test/hotspot/jtreg/ProblemList-aot.txt >> --- a/test/hotspot/jtreg/ProblemList-aot.txt >> +++ b/test/hotspot/jtreg/ProblemList-aot.txt >> @@ -81,3 +81,5 @@ >> compiler/intrinsics/sha/sanity/TestSHA1MultiBlockIntrinsics.java 8167430 generic-all >> compiler/intrinsics/sha/sanity/TestSHA512MultiBlockIntrinsics.java 8167430 generic-all >> >> +vmTestbase/vm/mlvm/indy/stress/java/relinkMutableCallSiteFreq/Test.java 8226689 generic-all >> +vmTestbase/vm/mlvm/indy/stress/java/relinkVolatileCallSiteFreq/Test.java 8226689 generic-all >> >> >> [1] https://bugs.openjdk.java.net/browse/JDK-8226689 > From martin.doerr at sap.com Thu Jan 16 13:30:37 2020 From: martin.doerr at sap.com (Doerr, Martin) Date: Thu, 16 Jan 2020 13:30:37 +0000 Subject: RFR: 8236179: C1 register allocation error with T_ADDRESS In-Reply-To: References: <87h81wl7vo.fsf@redhat.com> <87bls3ksen.fsf@redhat.com> <875zibkmyq.fsf@redhat.com> <0e22c4e6-74b9-a50b-df44-83c332457f14@redhat.com> Message-ID: Hi Roman, fix is in 11.0.7., 14 and 15, now. Only backport to 8u is missing (I'm not working on 8u backports). Best regards, Martin > -----Original Message----- > From: Roman Kennke > Sent: Freitag, 20. Dezember 2019 21:37 > To: Aditya Mandaleeka ; Doerr, Martin > ; Roland Westrelin ; > hotspot compiler > Cc: shenandoah-dev > Subject: Re: RFR: 8236179: C1 register allocation error with T_ADDRESS > > Hi Aditya, > > > Thanks again to everyone who helped get this change in! I am happy to > help backport it as well if it can wait until January. > > Thank *you* for figuring this out in the first place. This seems a > rather serious bug for Shenandoah GC (and I'm still a bit surprised how > we haven't seen it yet). I just realized we're also gonna need it in > JDK14. I am not even quite sure what the process for this would be. > We'll figure it out. > > Thank you! 
> > Roman > > > > Thanks, > > Aditya > > > > -----Original Message----- > > From: Roman Kennke > > Sent: Friday, December 20, 2019 11:59 AM > > To: Doerr, Martin ; Roland Westrelin > ; Aditya Mandaleeka ; > hotspot compiler > > Cc: shenandoah-dev > > Subject: Re: RFR: 8236179: C1 register allocation error with T_ADDRESS > > > > Hi Martin, > > > >> thanks for reviewing it. Pushed to jdk/jdk. > > > > Thanks a lot! > > > >> I guess we'll have to backport it after some testing time. > > > > Yes, we're gonna need it in 11u and 8u. > > > > Thanks and have a nice weekend (and xmas, etc, if you're also taking time > off)! > > > > Cheers, > > Roman > > > >> Best regards, > >> Martin > >> > >> > >>> -----Original Message----- > >>> From: Roland Westrelin > >>> Sent: Freitag, 20. Dezember 2019 16:59 > >>> To: Doerr, Martin ; Aditya Mandaleeka > >>> ; hotspot compiler >>> dev at openjdk.java.net> > >>> Cc: shenandoah-dev > >>> Subject: RE: RFR: 8236179: C1 register allocation error with > >>> T_ADDRESS > >>> > >>> > >>>> > http://cr.openjdk.java.net/~mdoerr/8236179_C1_T_ADDRESS/webrev.02/ > >>> > >>> That looks good to me. > >>> > >>> Roland. > >> > > From tobias.hartmann at oracle.com Fri Jan 17 11:07:37 2020 From: tobias.hartmann at oracle.com (Tobias Hartmann) Date: Fri, 17 Jan 2020 12:07:37 +0100 Subject: [15] RFR(M): 8236493: C2: Inconsistent known instance type information after EA In-Reply-To: <4ee2aba1-2e74-fdc5-f2e0-edb96f809d7b@oracle.com> References: <4ee2aba1-2e74-fdc5-f2e0-edb96f809d7b@oracle.com> Message-ID: <4a3e8089-bad3-57bf-cd95-27ed2cee3e55@oracle.com> Hi, I found some non-trivial issues with this. I will follow up with a new webrev after fixing. 
Thanks, Tobias On 14.01.20 15:06, Tobias Hartmann wrote: > Hi, > > please review the following patch: > https://bugs.openjdk.java.net/browse/JDK-8236493 > http://cr.openjdk.java.net/~thartmann/8236493/webrev.00/ > > While working on 8233164, I've noticed that escape analysis does not set known instance type > information for LoadNodes and their users if the loaded object does not escape but that information > is only (incrementally) propagated during IGVN. That's a problem because depending on the sequence > in which nodes on the worklist are processed, optimizations might still see the not-yet-updated (and > therefore incorrect) type. For example, another LoadNode optimization might rely on its memory > input to be a general memory slice and re-wire its inputs accordingly (ignoring known instance type > memory), when it is really reading from a known instance. > > For a detailed description of an instance of such a problem, see my RFR for 8233164 [1]. There, the > type of a CastPPNode is not updated (yet) and a Load user is incorrectly re-wired. > > To catch this case before things go horribly wrong, I've added an assert to > Node::raise_bottom_type(). Reverting the fix for 8233164 now immediately triggers the assert because > the known instance id of a DecodeNNode is only updated during IGVN [2]. All new tests in > TestInstanceIdPropagation.java trigger the assert as well. > > The fix is to always eagerly propagate that information during EA. If an object loaded from the > field of a non-escaping object is also non-escaping, we need to propagate that information. 
> > Thanks, > Tobias > > [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2019-December/036616.html > > [2] Failure with TestArrayCopyMemoryChain.java when reverting fix for 8233164: > > 300 Proj === 288 [[ 301 305 312 ]] #5 !jvms: TestArrayCopyMemoryChain::test2 @ bci:25 > 305 CheckCastPP === 302 300 [[ 385 316 ]] #byte[int:1]:NotNull:exact *,iid=288 !jvms: > TestArrayCopyMemoryChain::test2 @ bci:25 > 385 EncodeP === _ 305 [[ 387 403 ]] #narrowoop: byte[int:1]:NotNull:exact *,iid=288 !orig=[402] > !jvms: TestArrayCopyMemoryChain$Wrapper:: @ bci:11 TestArrayCopyMemoryChain::test2 @ bci:41 > 403 > DecodeN === _ 385 [[ 494 420 ]] #byte[int:1]:NotNull:exact *,iid=288 !jvms: > TestArrayCopyMemoryChain::test2 @ bci:48 > 494 CastPP === 486 403 [[ 526 514 ]] #byte[int:>=0]:NotNull:exact * !jvms: > TestArrayCopyMemoryChain::test2 @ bci:71 > 420 CallStaticJava === 404 327 341 8 9 ( 419 1 1 1 64 1 1 1 403 257 43 64 11 230 > ) [[ 421 ]] # Static uncommon_trap(reason='intrinsic_or_type_checked_inlining' > action='make_not_entrant' debug_id='0') void ( int ) C=0.000100 TestArrayCopyMemoryChain::test2 @ > bci:61 !jvms: TestArrayCopyMemoryChain::test2 @ bci:61 > # To suppress the following error report, specify this argument > # after -XX: or in .hotspotrc: SuppressErrorAt=/node.cpp:1088 > # > # A fatal error has been detected by the Java Runtime Environment: > # > # Internal Error (/oracle/jdk_jdk/open/src/hotspot/share/opto/node.cpp:1088), pid=6482, tid=6496 > # assert(false) failed: Known instance type should have been set during EA > From tom.rodriguez at oracle.com Fri Jan 17 21:10:58 2020 From: tom.rodriguez at oracle.com (Tom Rodriguez) Date: Fri, 17 Jan 2020 13:10:58 -0800 Subject: [14] RFR (M) 8231515: [Graal] Crash during exception throwing in InterpreterRuntime::resolve_invoke Message-ID: This fixes two separate but related issues. The primary crash is the JVMTI crash when using the post on exceptions support. 
In that case we use a FrameState which isn't suitable for deopt and we will crash if that deopt is taken. The second issue is related but more rare where our support for explicit exception throwing uses a stub that has a FrameState which is also not suitable for deopt. It's more rare because deoptimization in that path is much less likely. New logic was added to Graal to verify the FrameStates used for deopt which caught both of these problems. Minor changes to JVMCI were made to expose information to Graal but there are no changes that would affect anything besides Graal. New unit tests exercise these paths explicitly and local kitchensink testing of the fix indicates that there were no more crashes with Graal. Preliminary mach5 testing against 15 was clean but mach5 testing against jdk 14 is in progress. http://cr.openjdk.java.net/~never/8231515/webrev https://bugs.openjdk.java.net/browse/JDK-8231515 From vladimir.kozlov at oracle.com Fri Jan 17 23:12:15 2020 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Fri, 17 Jan 2020 15:12:15 -0800 Subject: [14] RFR (M) 8231515: [Graal] Crash during exception throwing in InterpreterRuntime::resolve_invoke In-Reply-To: References: Message-ID: Looks good. Only small nitpick is spacing of '\' in new line in vmStructs_jvmci.cpp. No need for new webrev. Note for records, it was reviewed by Dean already in Graal's PR. Thanks, Vladimir On 1/17/20 1:10 PM, Tom Rodriguez wrote: > This fixes two separate but related issues. The primary crash is the JVMTI crash when using the post on exceptions > support. In that case we use a FrameState which isn't suitable for deopt and we will crash if that deopt is taken. The > second issue is related but more rare where our support for explicit exception throwing uses a stub that has a > FrameState which is also not suitable for deopt. It's more rare because deoptimization in that path is much less likely. 
> New logic was added to Graal to verify the FrameStates used for deopt which caught both of these problems. Minor changes > to JVMCI were made to expose information to Graal but there are no changes that would affect anything besides Graal. New > unit tests exercise these paths explicitly and local kitchensink testing of the fix indicates that there were no more > crashes with Graal. Preliminary mach5 testing against 15 was clean but mach5 testing against jdk 14 is in progress. > > http://cr.openjdk.java.net/~never/8231515/webrev > https://bugs.openjdk.java.net/browse/JDK-8231515 From martin.doerr at sap.com Mon Jan 20 14:25:26 2020 From: martin.doerr at sap.com (Doerr, Martin) Date: Mon, 20 Jan 2020 14:25:26 +0000 Subject: RFR(S): 8237375: SimpleThresholdPolicy misses CounterDecay timestamp initialization Message-ID: Hi, I'd like to fix warmup issues we observed in test compiler/c2/Test8004741.java. The issue is a missing timestamp initialization which leads to a premature counter decay at the first safepoint with -XX:-TieredCompilation. Bug: https://bugs.openjdk.java.net/browse/JDK-8237375 Webrev: http://cr.openjdk.java.net/~mdoerr/8237375_CounterDecay/webrev.00/ The webrev applies on top of the preliminary fix for https://bugs.openjdk.java.net/browse/JDK-8235741. I'll wait until that one gets pushed. What it changes: * Move class CounterDecay up * Introduce reset_last_timestamp() * Use this function in decay() and in SimpleCompPolicy::initialize() Best regards, Martin From volker.simonis at gmail.com Mon Jan 20 17:57:57 2020 From: volker.simonis at gmail.com (Volker Simonis) Date: Mon, 20 Jan 2020 09:57:57 -0800 Subject: RFR(S): 8237375: SimpleThresholdPolicy misses CounterDecay timestamp initialization In-Reply-To: References: Message-ID: Looks good to me. Thanks, Volker On Mon, Jan 20, 2020 at 6:26 AM Doerr, Martin wrote: > > Hi, > > I'd like to fix warmup issues we observed in test compiler/c2/Test8004741.java. 
> The issue is a missing timestamp initialization which leads to a premature counter decay at the first safepoint with -XX:-TieredCompilation. > > Bug: > https://bugs.openjdk.java.net/browse/JDK-8237375 > > Webrev: > http://cr.openjdk.java.net/~mdoerr/8237375_CounterDecay/webrev.00/ > The webrev applies on top of the preliminary fix for https://bugs.openjdk.java.net/browse/JDK-8235741. > I'll wait until that one gets pushed. > > What it changes: > > * Move class CounterDecay up > * Introduce reset_last_timestamp() > * Use this function in decay() and in SimpleCompPolicy::initialize() > > Best regards, > Martin > From david.holmes at oracle.com Mon Jan 20 21:53:36 2020 From: david.holmes at oracle.com (David Holmes) Date: Tue, 21 Jan 2020 07:53:36 +1000 Subject: RFR(S): 8237375: SimpleThresholdPolicy misses CounterDecay timestamp initialization In-Reply-To: References: Message-ID: Hi Martin, On 21/01/2020 12:25 am, Doerr, Martin wrote: > Hi, > > I'd like to fix warmup issues we observed in test > compiler/c2/Test8004741.java. > > The issue is a missing timestamp initialization which leads to a > premature counter decay at the first safepoint with -XX:-TieredCompilation. > > Bug: > > https://bugs.openjdk.java.net/browse/JDK-8237375 > > Webrev: > > http://cr.openjdk.java.net/~mdoerr/8237375_CounterDecay/webrev.00/ > > The webrev applies on top of the preliminary fix for > https://bugs.openjdk.java.net/browse/JDK-8235741. > > I'll wait until that one gets pushed. > > What it changes: > > * Move class CounterDecay up > * Introduce reset_last_timestamp() "reset" suggests clearing or moving back to an original value to me. How about just "set" or "update"? 
Cheers, David > * Use this function in decay() and in SimpleCompPolicy::initialize() > > Best regards, > > Martin > From xxinliu at amazon.com Mon Jan 20 22:24:53 2020 From: xxinliu at amazon.com (Liu, Xin) Date: Mon, 20 Jan 2020 22:24:53 +0000 Subject: Does C2 disable incremental inlining for methods without @ForceInline? Message-ID: <39E5D093-E575-4850-BBC3-9ED0A99B403D@amazon.com> Hi, hotspot-compiler, If I understand correctly, C2 has actually disabled incremental inlining for almost all methods without @ForceInline. Is it intentional? 2 positions check !callee_method->force_inline(). 1. https://hg.openjdk.java.net/jdk/jdk/file/76b9822d2e65/src/hotspot/share/opto/bytecodeInfo.cpp#l368 2. https://hg.openjdk.java.net/jdk/jdk/file/76b9822d2e65/src/hotspot/share/opto/bytecodeInfo.cpp#l474 I ran "dacapo/eclipse" and got 5345 inline failure msg "size > DesiredMethodLimit". If I comment out these 2 checks, those messages will be all gone. By checking LogCompilation, I can confirm that 386 more methods are inlined by incremental inlining. jdk333 is baseline. jdk335 is with the patch that comments out !callee_method->force_inline(). 
diff --git a/src/hotspot/share/opto/bytecodeInfo.cpp b/src/hotspot/share/opto/bytecodeInfo.cpp --- a/src/hotspot/share/opto/bytecodeInfo.cpp +++ b/src/hotspot/share/opto/bytecodeInfo.cpp @@ -365,7 +365,7 @@ WarmCallInfo* wci_result, bool& should_delay) { if (ClipInlining && (int)count_inline_bcs() >= DesiredMethodLimit) { - if (!callee_method->force_inline() || !IncrementalInline) { + if (/*!callee_method->force_inline() || */!IncrementalInline) { set_msg("size > DesiredMethodLimit"); return false; } else if (!C->inlining_incrementally()) { @@ -471,7 +471,7 @@ int size = callee_method->code_size_for_inlining(); if (ClipInlining && (int)count_inline_bcs() + size >= DesiredMethodLimit) { - if (!callee_method->force_inline() || !IncrementalInline) { + if (/*!callee_method->force_inline() || */!IncrementalInline) { set_msg("size > DesiredMethodLimit"); return false; } else if (!C->inlining_incrementally()) { 386 more methods are late_inlined. % grep "late_inline method=" jit_jdk333.xml | wc -l 162 % grep "late_inline method=" jit_jdk335.xml | wc -l 548 Eg. //jdk333.log @ 3299 org.eclipse.jdt.internal.compiler.parser.Parser::consumeClassOrInterfaceName (19 bytes) size > DesiredMethodLimit //Jdk335.log @ 3299 org.eclipse.jdt.internal.compiler.parser.Parser::consumeClassOrInterfaceName (19 bytes) inline (hot) @ 10 org.eclipse.jdt.internal.compiler.parser.Parser::pushOnGenericsIdentifiersLengthStack (53 bytes) inline (hot) @ 15 org.eclipse.jdt.internal.compiler.parser.Parser::pushOnGenericsLengthStack (53 bytes) inline (hot) The reason I ran dacapo/eclipse is because I observed the most "size > DesiredMethodLimit" so far. Unfortunately, I can't observe performance gain. My understanding is dacapo/eclipse is too big. The background noise can easily overshadow this tiny inlining gain. My understanding is incremental inlining is especially useful when C2 optimizer can slim down methods from over-threshold to below-threshold. I plan to make up a microbench to prove my point. 
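Something along these lines, perhaps (a rough sketch with made-up names, not measured yet): a chain of methods padded with trivially dead branches, so the parse-time bytecode estimate is bulky while each level folds down to almost nothing after IGVN.

```java
// Hypothetical microbenchmark sketch: a chain of methods whose bodies
// mostly constant-fold away after parsing. The idea is that the
// cumulative inline-bytecode estimate can blow past the budget even
// though the live graph stays tiny once IGVN cleans it up.
public class InlineBudgetBench {

    // Each level calls the next and adds 1, going through pad().
    static int level0(int x) { return pad(x) + 1; }
    static int level1(int x) { return level0(pad(x)) + 1; }
    static int level2(int x) { return level1(pad(x)) + 1; }
    static int level3(int x) { return level2(pad(x)) + 1; }
    static int level4(int x) { return level3(pad(x)) + 1; }

    // Deliberately bulky but trivially dead code: for non-negative x
    // none of the branches is taken, so the method folds to "return x".
    static int pad(int x) {
        int y = x;
        if (y < 0)  { y = y * 31 + 7; }
        if (y < -1) { y = y * 17 + 5; }
        if (y < -2) { y = y * 13 + 3; }
        if (y < -3) { y = y * 11 + 2; }
        return y;
    }

    public static void main(String[] args) {
        long sum = 0;
        // Hot loop so the level* chain gets compiled by C2.
        for (int i = 0; i < 1_000_000; i++) {
            sum += level4(i & 0xFFFF);
        }
        System.out.println(sum);
    }
}
```

The real thing would need a much deeper chain (or fatter pad bodies) to actually overflow DesiredMethodLimit, and inspecting PrintInlining/LogCompilation output with and without the patch; this is just the shape I have in mind.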
Is it worth digging? Thanks, --lx From Pengfei.Li at arm.com Tue Jan 21 08:40:11 2020 From: Pengfei.Li at arm.com (Pengfei Li) Date: Tue, 21 Jan 2020 08:40:11 +0000 Subject: RFR(S): 8237524: AArch64: String.compareTo() may return incorrect result Message-ID: Hi, Please help review this small fix on AArch64 string intrinsics. JBS: https://bugs.openjdk.java.net/browse/JDK-8237524 Webrev: http://cr.openjdk.java.net/~pli/rfr/8237524/webrev.00/ AArch64 String.compareTo() may return incorrect result in some corner cases. This issue can be reproduced in below program by constructing two strings from the same byte array but with different encoding options. -------- import java.lang.reflect.Constructor; public class Test { public static void main(String[] args) throws Exception { Constructor c = String.class.getDeclaredConstructor(byte[].class, byte.class); c.setAccessible(true); byte[] bytes = new byte[] { 'Y', 'm', '_', 'l', 'V', 'n', 'W', 'S', 'w', 'm', 'W', 'S' }; String s1 = c.newInstance(bytes, (byte) 0); String s2 = c.newInstance(bytes, (byte) 1); System.out.println(s1.compareTo(s2)); } } // $ java -Xint Test // -27904 // $ java -Xcomp -XX:-TieredCompilation Test // 6 -------- Root cause is in the AArch64 String.compareTo intrinsics. In the code of MacroAssembler::string_compare(), we have a fast path that compares the addresses of initial bytes of the two arrays. If the result is equal, we skip the inflation of the Latin one and the byte-wise comparison part. But we shouldn't do in this way if the encodings of the two strings are different (the LU/UL cases). This patch removes the incorrect fast path check. Tests: No new failure found in hotspot::hotspot_all_no_apps, jdk::jdk_core, langtools::tier1 after this patch. A new jtreg case is also added for this issue. 
-- Thanks, Pengfei From rwestrel at redhat.com Tue Jan 21 08:46:24 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Tue, 21 Jan 2020 09:46:24 +0100 Subject: Does C2 disable incremental inlining for methods without @ForceInline? In-Reply-To: <39E5D093-E575-4850-BBC3-9ED0A99B403D@amazon.com> References: <39E5D093-E575-4850-BBC3-9ED0A99B403D@amazon.com> Message-ID: <87y2u1xkm7.fsf@redhat.com> Hi lx, > If I understand correctly, C2 has actually disabled incremental > inlining for almost all methods without @ForceInline. Is it > intentional? Incremental inlining was implemented to help with method handle/invokedynamic and dynamic languages performance (nashorn at the time). The problem there is that inlining through a method handle invoke causes the inlining budget to be exhausted quickly, both because the graph grows fast and because the inlining budget estimate C2 uses (number of nodes in the graph estimated during parsing as number of nodes added to the graph) is especially inaccurate for method handle invokes, because the graph at the call optimizes well. So the idea is that once the inlining budget is exhausted, we run a pass of IGVN to clean the graph, count the number of actual live nodes to have an accurate estimate of the graph size and resume inlining with incremental inlining if the inlining budget allows it. There was no attempt at using incremental inlining for "regular" java programs even though with lambdas, java code relies on invokedynamic and so should trigger incremental inlining. Roland. From aph at redhat.com Tue Jan 21 09:33:33 2020 From: aph at redhat.com (Andrew Haley) Date: Tue, 21 Jan 2020 09:33:33 +0000 Subject: RFR(S): 8237524: AArch64: String.compareTo() may return incorrect result In-Reply-To: References: Message-ID: <5111fcc0-9c3a-6f1b-9d6a-b666a41f4453@redhat.com> On 1/21/20 8:40 AM, Pengfei Li wrote: > Root cause is in the AArch64 String.compareTo intrinsics. 
In the code of > MacroAssembler::string_compare(), we have a fast path that compares the > addresses of initial bytes of the two arrays. If the result is equal, we > skip the inflation of the Latin one and the byte-wise comparison part. > But we shouldn't do in this way if the encodings of the two strings are > different (the LU/UL cases). This patch removes the incorrect fast path > check. Is this really a bug? The test case is violating the protections provided by the Java runtime, and the programmer deserves everything that happens. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From nick.gasson at arm.com Tue Jan 21 10:11:25 2020 From: nick.gasson at arm.com (Nick Gasson) Date: Tue, 21 Jan 2020 18:11:25 +0800 Subject: [aarch64-port-dev ] RFR(S): 8237524: AArch64: String.compareTo() may return incorrect result In-Reply-To: <5111fcc0-9c3a-6f1b-9d6a-b666a41f4453@redhat.com> References: <5111fcc0-9c3a-6f1b-9d6a-b666a41f4453@redhat.com> Message-ID: On 21/01/2020 17:33, Andrew Haley wrote: > On 1/21/20 8:40 AM, Pengfei Li wrote: >> Root cause is in the AArch64 String.compareTo intrinsics. In the code of >> MacroAssembler::string_compare(), we have a fast path that compares the >> addresses of initial bytes of the two arrays. If the result is equal, we >> skip the inflation of the Latin one and the byte-wise comparison part. >> But we shouldn't do in this way if the encodings of the two strings are >> different (the LU/UL cases). This patch removes the incorrect fast path >> check. > > Is this really a bug? The test case is violating the protections > provided by the Java runtime, and the programmer deserves everything > that happens. > Even if it's not strictly a bug, the pointer equality checks in the LU/UL branches are unnecessary: the only time they succeed is when the Strings were constructed in this backdoor way, and in that case they give the wrong result. 
So it seems better to just remove them. Nick From aph at redhat.com Tue Jan 21 10:12:34 2020 From: aph at redhat.com (Andrew Haley) Date: Tue, 21 Jan 2020 10:12:34 +0000 Subject: [aarch64-port-dev ] RFR(S): 8237524: AArch64: String.compareTo() may return incorrect result In-Reply-To: References: <5111fcc0-9c3a-6f1b-9d6a-b666a41f4453@redhat.com> Message-ID: <56670d62-defa-4192-c084-bfa130adde41@redhat.com> On 1/21/20 10:11 AM, Nick Gasson wrote: > On 21/01/2020 17:33, Andrew Haley wrote: >> On 1/21/20 8:40 AM, Pengfei Li wrote: >>> Root cause is in the AArch64 String.compareTo intrinsics. In the code of >>> MacroAssembler::string_compare(), we have a fast path that compares the >>> addresses of initial bytes of the two arrays. If the result is equal, we >>> skip the inflation of the Latin one and the byte-wise comparison part. >>> But we shouldn't do in this way if the encodings of the two strings are >>> different (the LU/UL cases). This patch removes the incorrect fast path >>> check. >> >> Is this really a bug? The test case is violating the protections >> provided by the Java runtime, and the programmer deserves everything >> that happens. > > Even if it's not strictly a bug, the pointer equality checks in the > LU/UL branches are unnecessary: the only time they succeed is when the > Strings were constructed in this backdoor way, and in that case they > give the wrong result. So it seems better to just remove them. True enough; OK. -- Andrew Haley (he/him) Java Platform Lead Engineer Red Hat UK Ltd. https://keybase.io/andrewhaley EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671 From Pengfei.Li at arm.com Tue Jan 21 10:13:30 2020 From: Pengfei.Li at arm.com (Pengfei Li) Date: Tue, 21 Jan 2020 10:13:30 +0000 Subject: RFR(S): 8237524: AArch64: String.compareTo() may return incorrect result In-Reply-To: <5111fcc0-9c3a-6f1b-9d6a-b666a41f4453@redhat.com> References: <5111fcc0-9c3a-6f1b-9d6a-b666a41f4453@redhat.com> Message-ID: Hi Andrew, > Is this really a bug? 
The test case is violating the protections provided by the > Java runtime, and the programmer deserves everything that happens. To be honest, I don't know what kind of incorrect result should be classified as a bug. Indeed, I found this issue several months ago but didn't want to fix it because it doesn't break real Java workloads except the ones that use non-public string constructors. But now I post this RFR with this thought: If the case that two strings with different encodings share the same memory cannot happen in normal applications, why don't we remove the redundant check to save 4 instructions in the generated intrinsics code? From this point, it's likely that this JBS should be classified as an enhancement instead of a bug. -- Thanks, Pengfei From igor.veresov at oracle.com Wed Jan 22 00:12:27 2020 From: igor.veresov at oracle.com (Igor Veresov) Date: Tue, 21 Jan 2020 16:12:27 -0800 Subject: [14] RFR (M) 8231515: [Graal] Crash during exception throwing in InterpreterRuntime::resolve_invoke In-Reply-To: References: Message-ID: I think you need to add setting of _aot_deopt_blob_unpack_with_exception_in_tls in AOTCodeHeap::link_graal_runtime_symbols() in aotCodeHeap.cpp. igor > On Jan 17, 2020, at 1:10 PM, Tom Rodriguez wrote: > > This fixes two separate but related issues. The primary crash is the JVMTI crash when using the post on exceptions support. In that case we use a FrameState which isn't suitable for deopt and we will crash if that deopt is taken. The second issue is related but more rare where our support for explicit exception throwing uses a stub that has a FrameState which is also not suitable for deopt. It's more rare because deoptimization in that path is much less likely. New logic was added to Graal to verify the FrameStates used for deopt which caught both of these problems. Minor changes to JVMCI were made to expose information to Graal but there are no changes that would affect anything besides Graal. 
New unit tests exercise these paths explicitly and local kitchensink testing of the fix indicates that there were no more crashes with Graal. Preliminary mach5 testing against 15 was clean but mach5 testing against jdk 14 is in progress. > > http://cr.openjdk.java.net/~never/8231515/webrev > https://bugs.openjdk.java.net/browse/JDK-8231515 From dean.long at oracle.com Wed Jan 22 02:27:59 2020 From: dean.long at oracle.com (Dean Long) Date: Tue, 21 Jan 2020 18:27:59 -0800 Subject: [14] RFR (M) 8231515: [Graal] Crash during exception throwing in InterpreterRuntime::resolve_invoke In-Reply-To: References: Message-ID: <501caf4e-ff8e-bd9d-59f6-6c64e77ef9bc@oracle.com> Yes, we need the mapping in both directions. Otherwise looks good. dl On 1/21/2020 4:12 PM, Igor Veresov wrote: > I think you need to add setting of _aot_deopt_blob_unpack_with_exception_in_tls in AOTCodeHeap::link_graal_runtime_symbols() in aotCodeHeap.cpp. > > igor > > > >> On Jan 17, 2020, at 1:10 PM, Tom Rodriguez wrote: >> >> This fixes two separate but related issues. The primary crash is the JVMTI crash when using the post on exceptions support. In that case we use a FrameState which isn't suitable for deopt and we will crash if that deopt is taken. The second issue is related but more rare where our support for explicit exception throwing uses a stub that has a FrameState which is also not suitable for deopt. It's more rare because deoptimization in that path is much less likely. New logic was added to Graal to verify the FrameStates used for deopt which caught both of these problems. Minor changes to JVMCI were made to expose information to Graal but there are no changes that would affect anything besides Graal. New unit tests exercise these paths explicitly and local kitchensink testing of the fix indicates that there were no more crashes with Graal. Preliminary mach5 testing against 15 was clean but mach5 testing against jdk 14 is in progress.
>> >> http://cr.openjdk.java.net/~never/8231515/webrev >> https://bugs.openjdk.java.net/browse/JDK-8231515 > From xxinliu at amazon.com Wed Jan 22 07:47:49 2020 From: xxinliu at amazon.com (Liu, Xin) Date: Wed, 22 Jan 2020 07:47:49 +0000 Subject: Does C2 disable incremental inlining for methods without @ForceInline? In-Reply-To: <87y2u1xkm7.fsf@redhat.com> References: <39E5D093-E575-4850-BBC3-9ED0A99B403D@amazon.com> <87y2u1xkm7.fsf@redhat.com> Message-ID: <840D453C-B544-4A0F-ABA6-AF4A6800F1E8@amazon.com> Hi, Roland, Thank you for filling me in on the background. I read the title of 8005071 again and it is for JSR 292. I ran some benchmarks in dacapo with my patch and indeed got nothing. Thanks, --lx On 1/21/20, 12:46 AM, "Roland Westrelin" wrote: Hi lx, > If I understand correctly, C2 has actually disabled incremental > inlining for almost all methods without @ForceInline. Is it > intentional? Incremental inlining was implemented to help with method handle/invokedynamic and dynamic language performance (nashorn at the time). The problem there is that inlining through a method handle invoke causes the inlining budget to be exhausted quickly both because the graph grows fast and the inlining budget estimate C2 uses (number of nodes in the graph estimated during parsing as number of nodes added to the graph) is especially inaccurate for method handle invokes because the graph at the call optimizes well. So the idea is that once the inlining budget is exhausted, we run a pass of IGVN to clean the graph, count the number of actual live nodes to have an accurate estimate of the graph size and resume inlining with incremental inlining if the inlining budget allows it. There was no attempt at using incremental inlining for "regular" java programs even though with lambdas, java code relies on invokedynamic and so should trigger incremental inlining. Roland.
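Roland's description of the mechanism can be caricatured in a few lines of Java (a toy simulation with invented numbers, not C2 code): the parse-time size estimate overshoots, so once the budget looks exhausted, a cleanup pass recounts live nodes and may free up room to continue inlining.

```java
import java.util.ArrayDeque;
import java.util.Queue;

public class InliningBudgetModel {
    static final int BUDGET = 100; // toy node budget, not C2's real limit

    // Each pending call site has a pessimistic parse-time size estimate and
    // a smaller size it actually occupies once the graph is optimized.
    static final class CallSite {
        final int estimatedNodes;
        final int liveNodesAfterIgvn;
        CallSite(int estimatedNodes, int liveNodesAfterIgvn) {
            this.estimatedNodes = estimatedNodes;
            this.liveNodesAfterIgvn = liveNodesAfterIgvn;
        }
    }

    static int inlineAll(Queue<CallSite> worklist) {
        int estimated = 0; // running parse-time estimate
        int live = 0;      // what an IGVN recount would report
        int inlined = 0;
        while (!worklist.isEmpty()) {
            CallSite next = worklist.peek();
            if (estimated + next.estimatedNodes > BUDGET) {
                // Budget "exhausted": run cleanup, recount live nodes, and
                // continue with the accurate figure (incremental inlining).
                estimated = live;
                if (estimated + next.estimatedNodes > BUDGET) {
                    break; // genuinely out of budget
                }
            }
            worklist.poll();
            estimated += next.estimatedNodes;
            live += next.liveNodesAfterIgvn;
            inlined++;
        }
        return inlined;
    }

    public static void main(String[] args) {
        Queue<CallSite> sites = new ArrayDeque<>();
        for (int i = 0; i < 10; i++) {
            sites.add(new CallSite(30, 5)); // estimated at 30, really ~5 live
        }
        // Without the recount only 3 sites fit (3 * 30 <= 100); with the
        // cleanup pass repeatedly making room, all 10 are inlined.
        System.out.println(inlineAll(sites)); // 10
    }
}
```

The numbers exaggerate the effect, but the shape matches Roland's point: the overshoot is worst for method handle invokes, whose call-site graphs optimize down to almost nothing after IGVN.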
From martin.doerr at sap.com Wed Jan 22 10:45:30 2020 From: martin.doerr at sap.com (Doerr, Martin) Date: Wed, 22 Jan 2020 10:45:30 +0000 Subject: RFR(S): 8237375: SimpleThresholdPolicy misses CounterDecay timestamp initialization In-Reply-To: References: Message-ID: Hi Volker, thanks for the review! Best regards, Martin > -----Original Message----- > From: Volker Simonis > Sent: Montag, 20. Januar 2020 18:58 > To: Doerr, Martin > Cc: hotspot-compiler-dev at openjdk.java.net; David Holmes > (david.holmes at oracle.com) > Subject: Re: RFR(S): 8237375: SimpleThresholdPolicy misses CounterDecay > timestamp initialization > > Looks good to me. > > Thanks, > Volker > > On Mon, Jan 20, 2020 at 6:26 AM Doerr, Martin > wrote: > > > > Hi, > > > > I'd like to fix warmup issues we observed in test > compiler/c2/Test8004741.java. > > The issue is a missing timestamp initialization which leads to a premature > counter decay at the first safepoint with -XX:-TieredCompilation. > > > > Bug: > > https://bugs.openjdk.java.net/browse/JDK-8237375 > > > > Webrev: > > http://cr.openjdk.java.net/~mdoerr/8237375_CounterDecay/webrev.00/ > > The webrev applies on top of the preliminary fix for > https://bugs.openjdk.java.net/browse/JDK-8235741. > > I'll wait until that one gets pushed. > > > > What it changes: > > > > * Move class CounterDecay up > > * Introduce reset_last_timestamp() > > * Use this function in decay() and in SimpleCompPolicy::initialize() > > > > Best regards, > > Martin > > From martin.doerr at sap.com Wed Jan 22 10:47:17 2020 From: martin.doerr at sap.com (Doerr, Martin) Date: Wed, 22 Jan 2020 10:47:17 +0000 Subject: RFR(S): 8237375: SimpleThresholdPolicy misses CounterDecay timestamp initialization In-Reply-To: References: Message-ID: Hi David, I'll change it to update_last_timestamp before pushing. Can I add you as reviewer? Best regards, Martin > -----Original Message----- > From: David Holmes > Sent: Montag, 20. 
Januar 2020 22:54 > To: Doerr, Martin ; 'hotspot-compiler- > dev at openjdk.java.net' > Cc: David Holmes > Subject: Re: RFR(S): 8237375: SimpleThresholdPolicy misses CounterDecay > timestamp initialization > > Hi Martin, > > On 21/01/2020 12:25 am, Doerr, Martin wrote: > > Hi, > > > > I'd like to fix warmup issues we observed in test > > compiler/c2/Test8004741.java. > > > > The issue is a missing timestamp initialization which leads to a > > premature counter decay at the first safepoint with -XX:- > TieredCompilation. > > > > Bug: > > > > https://bugs.openjdk.java.net/browse/JDK-8237375 > > > > Webrev: > > > > http://cr.openjdk.java.net/~mdoerr/8237375_CounterDecay/webrev.00/ > > > > The webrev applies on top of the preliminary fix for > > https://bugs.openjdk.java.net/browse/JDK-8235741. > > > > I'll wait until that one gets pushed. > > > > What it changes: > > > > * Move class CounterDecay up > > * Introduce reset_last_timestamp() > > "reset" suggests clearing or moving back to an original value to me. How > about just "set" or "update"? > > Cheers, > David > > > * Use this function in decay() and in SimpleCompPolicy::initialize() > > > > Best regards, > > > > Martin > > From david.holmes at oracle.com Wed Jan 22 11:52:14 2020 From: david.holmes at oracle.com (David Holmes) Date: Wed, 22 Jan 2020 21:52:14 +1000 Subject: RFR(S): 8237375: SimpleThresholdPolicy misses CounterDecay timestamp initialization In-Reply-To: References: Message-ID: On 22/01/2020 8:47 pm, Doerr, Martin wrote: > Hi David, > > I'll change it to update_last_timestamp before pushing. > Can I add you as reviewer? Sure. Thanks, David > Best regards, > Martin > > >> -----Original Message----- >> From: David Holmes >> Sent: Montag, 20. 
Januar 2020 22:54 >> To: Doerr, Martin ; 'hotspot-compiler- >> dev at openjdk.java.net' >> Cc: David Holmes >> Subject: Re: RFR(S): 8237375: SimpleThresholdPolicy misses CounterDecay >> timestamp initialization >> >> Hi Martin, >> >> On 21/01/2020 12:25 am, Doerr, Martin wrote: >>> Hi, >>> >>> I'd like to fix warmup issues we observed in test >>> compiler/c2/Test8004741.java. >>> >>> The issue is a missing timestamp initialization which leads to a >>> premature counter decay at the first safepoint with -XX:- >> TieredCompilation. >>> >>> Bug: >>> >>> https://bugs.openjdk.java.net/browse/JDK-8237375 >>> >>> Webrev: >>> >>> http://cr.openjdk.java.net/~mdoerr/8237375_CounterDecay/webrev.00/ >>> >>> The webrev applies on top of the preliminary fix for >>> https://bugs.openjdk.java.net/browse/JDK-8235741. >>> >>> I'll wait until that one gets pushed. >>> >>> What it changes: >>> >>> * Move class CounterDecay up >>> * Introduce reset_last_timestamp() >> >> "reset" suggests clearing or moving back to an original value to me. How >> about just "set" or "update"? >> >> Cheers, >> David >> >>> * Use this function in decay() and in SimpleCompPolicy::initialize() >>> >>> Best regards, >>> >>> Martin >>> From vladimir.x.ivanov at oracle.com Wed Jan 22 11:52:25 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Wed, 22 Jan 2020 14:52:25 +0300 Subject: [15] RFR (L): 7175279: Don't use x87 FPU on x86-64 In-Reply-To: <7063EB29-D415-4A48-BA4F-B16C5A1F52F8@oracle.com> References: <0b0897e0-1dbc-306d-b2cb-31de13fb8b34@oracle.com> <7063EB29-D415-4A48-BA4F-B16C5A1F52F8@oracle.com> Message-ID: <16d75ed8-896b-25fe-0a0a-babc9087e631@oracle.com> Thanks, Vladimir. Anybody for a second (R)eview, please? 
http://cr.openjdk.java.net/~vlivanov/7175279/webrev.00/ + http://cr.openjdk.java.net/~vlivanov/7175279/webrev.01-00/ Best regards, Vladimir Ivanov On 20.12.2019 01:00, Vladimir Kozlov wrote: > Good > > Thanks > Vladimir > >> On Dec 19, 2019, at 1:58 PM, Vladimir Ivanov wrote: >> >> >>>>> Make ConversionStub x86_32-specific only if possible. From what I see it is only LIR_OpConvert in c1_LIR.hpp we have to deal with. I actually can't see how it could be only 32-specific. Hmm? >>>> >>>> I experimented with it, but it requires #ifdefs in c1_LIR.cpp/hpp which I don't like. So, I don't consider it as an > option right now. >>> Okay, NOT_IA32( ShouldNotReachHere() ) with comment in c1_CodeStubs.hpp should be enough for now. >> >> Incremental diff: >> http://cr.openjdk.java.net/~vlivanov/7175279/webrev.01-00/ >> >> Best regards, >> Vladimir Ivanov >> >>>>>>> c1_LinearScan.cpp - I think IA64 is used for Itanium. For 64-bit x86 we use AMD64: >>>>>>> >>>>>>> https://hg.openjdk.java.net/jdk/jdk/file/cfaa2457a60a/src/hotspot/share/utilities/macros.hpp#l456 >>>>>> >>>>>> >>>>>> >>>>>> Yes, you are right. Good catch! :-) >>>>>> >>>>>> Best regards, >>>>>> Vladimir Ivanov >>>>>> >>>>>>> On 12/17/19 4:50 AM, Vladimir Ivanov wrote: >>>>>>>> http://cr.openjdk.java.net/~vlivanov/7175279/webrev.00/ >>>>>>>> https://bugs.openjdk.java.net/browse/JDK-7175279 >>>>>>>> >>>>>>>> There was a major rewrite of math intrinsics in the JDK 9 time frame which almost completely eliminated x87 code from the x86-64 code base. >>>>>>>> >>>>>>>> Proposed patch removes the rest and makes x86-64 code x87-free. >>>>>>>> >>>>>>>> The main motivation for the patch is to completely eliminate non-strictfp behaving code in order to prepare the JVM for JEP 306 [1] and related enhancements [2]. >>>>>>>> >>>>>>>> Most of the changes are in C1, but there is one case in template interpreter (java_lang_math_abs) which now uses StubRoutines::x86::double_sign_mask().
It forces its initialization to be moved to StubRoutines::initialize1(). >>>>>>>> >>>>>>>> x87 instructions are made available only on x86-32. >>>>>>>> >>>>>>>> C1 changes involve removing FPU support on x86-64 and effectively make x86-specific support in the linear scan allocator [3] x86-32-only. >>>>>>>> >>>>>>>> Testing: tier1-6, build on x86-32, linux-aarch64, linux-arm32. >>>>>>>> >>>>>>>> Best regards, >>>>>>>> Vladimir Ivanov >>>>>>>> >>>>>>>> [1] https://bugs.openjdk.java.net/browse/JDK-8175916 >>>>>>>> >>>>>>>> [2] https://bugs.openjdk.java.net/browse/JDK-8136414 >>>>>>>> >>>>>>>> [3] http://hg.openjdk.java.net/jdk/jdk/file/ff7cd49f2aef/src/hotspot/cpu/x86/c1_LinearScan_x86.cpp > From martin.doerr at sap.com Wed Jan 22 11:56:01 2020 From: martin.doerr at sap.com (Doerr, Martin) Date: Wed, 22 Jan 2020 11:56:01 +0000 Subject: RFR(S): 8237375: SimpleThresholdPolicy misses CounterDecay timestamp initialization In-Reply-To: References: Message-ID: Hi David, thanks for the review. I'll push it after https://bugs.openjdk.java.net/browse/JDK-8235741 Best regards, Martin > -----Original Message----- > From: David Holmes > Sent: Mittwoch, 22. Januar 2020 12:52 > To: Doerr, Martin ; 'hotspot-compiler- > dev at openjdk.java.net' > Cc: Volker Simonis (volker.simonis at gmail.com) > Subject: Re: RFR(S): 8237375: SimpleThresholdPolicy misses CounterDecay > timestamp initialization > > On 22/01/2020 8:47 pm, Doerr, Martin wrote: > > Hi David, > > > > I'll change it to update_last_timestamp before pushing. > > Can I add you as reviewer? > > Sure. > > Thanks, > David > > > Best regards, > > Martin > > > > > >> -----Original Message----- > >> From: David Holmes > >> Sent: Montag, 20.
Januar 2020 22:54 > >> To: Doerr, Martin ; 'hotspot-compiler- > >> dev at openjdk.java.net' > >> Cc: David Holmes > >> Subject: Re: RFR(S): 8237375: SimpleThresholdPolicy misses CounterDecay > >> timestamp initialization > >> > >> Hi Martin, > >> > >> On 21/01/2020 12:25 am, Doerr, Martin wrote: > >>> Hi, > >>> > >>> I'd like to fix warmup issues we observed in test > >>> compiler/c2/Test8004741.java. > >>> > >>> The issue is a missing timestamp initialization which leads to a > >>> premature counter decay at the first safepoint with -XX:- > >> TieredCompilation. > >>> > >>> Bug: > >>> > >>> https://bugs.openjdk.java.net/browse/JDK-8237375 > >>> > >>> Webrev: > >>> > >>> > http://cr.openjdk.java.net/~mdoerr/8237375_CounterDecay/webrev.00/ > >>> > >>> The webrev applies on top of the preliminary fix for > >>> https://bugs.openjdk.java.net/browse/JDK-8235741. > >>> > >>> I'll wait until that one gets pushed. > >>> > >>> What it changes: > >>> > >>> * Move class CounterDecay up > >>> * Introduce reset_last_timestamp() > >> > >> "reset" suggests clearing or moving back to an original value to me. How > >> about just "set" or "update"? > >> > >> Cheers, > >> David > >> > >>> * Use this function in decay() and in SimpleCompPolicy::initialize() > >>> > >>> Best regards, > >>> > >>> Martin > >>> From rwestrel at redhat.com Wed Jan 22 16:50:28 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Wed, 22 Jan 2020 17:50:28 +0100 Subject: RFR(S): 8237086: assert(is_MachReturn()) running CTW with fix for JDK-8231291 In-Reply-To: <87tv4yyqbn.fsf@redhat.com> References: <87tv4yyqbn.fsf@redhat.com> Message-ID: <87muafxwob.fsf@redhat.com> > http://cr.openjdk.java.net/~roland/8237086/webrev.00/ Anyone for this fix? Roland. 
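The premature counter decay discussed in the 8237375 thread above is easy to model in miniature (a toy Java sketch with invented names and intervals, not the HotSpot code): if the "last decay" timestamp is never initialized, the first periodic check sees an enormous elapsed time and halves the counters right away.

```java
public class CounterDecayModel {
    static final long DECAY_INTERVAL_MS = 30_000; // invented interval

    private long lastTimestampMs; // stays 0 unless explicitly initialized
    private int invocationCounter = 1000;

    CounterDecayModel(boolean initializeTimestamp, long nowMs) {
        if (initializeTimestamp) {
            lastTimestampMs = nowMs; // the fix: set the timestamp up front
        }
    }

    // Called at a safepoint: decay only if the interval really elapsed.
    void maybeDecay(long nowMs) {
        if (nowMs - lastTimestampMs >= DECAY_INTERVAL_MS) {
            invocationCounter /= 2;
            lastTimestampMs = nowMs;
        }
    }

    int counter() {
        return invocationCounter;
    }

    public static void main(String[] args) {
        long startup = 1_000_000; // pretend VM start time
        CounterDecayModel buggy = new CounterDecayModel(false, startup);
        CounterDecayModel fixed = new CounterDecayModel(true, startup);
        long firstSafepoint = startup + 100; // only 100 ms after startup
        buggy.maybeDecay(firstSafepoint);
        fixed.maybeDecay(firstSafepoint);
        System.out.println(buggy.counter()); // 500: decayed prematurely
        System.out.println(fixed.counter()); // 1000: interval not elapsed
    }
}
```

Decayed invocation counters make methods look colder than they are, which is exactly the kind of warmup regression Test8004741.java tripped over.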
From rwestrel at redhat.com Wed Jan 22 16:51:52 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Wed, 22 Jan 2020 17:51:52 +0100 Subject: RFR(S): 8236759: ShouldNotReachHere in PhaseIdealLoop::verify_strip_mined_scheduling In-Reply-To: References: <87zher1xut.fsf@redhat.com> <87ftgj1qtm.fsf@redhat.com> Message-ID: <87k15jxwlz.fsf@redhat.com> > Fair enough. Looks good to me but please add a brief comment (something like "should not schedule a > data node in the outer strip mined loop") before pushing. Thanks for the review, Tobias. I'll make the change you suggest. Anyone else for a review? Roland. From tom.rodriguez at oracle.com Wed Jan 22 17:45:28 2020 From: tom.rodriguez at oracle.com (Tom Rodriguez) Date: Wed, 22 Jan 2020 09:45:28 -0800 Subject: [14] RFR (M) 8231515: [Graal] Crash during exception throwing in InterpreterRuntime::resolve_invoke In-Reply-To: References: Message-ID: <2c7e5bb5-9521-72c2-33cc-8552db5288c1@oracle.com> Ok. I'll add that. Why wouldn't testing have shown this problem? Maybe this call path isn't currently exercised by any AOT compiled code? I guess there's no sanity checking between these 2 lists of symbols either. A delta webrev is at http://cr.openjdk.java.net/~never/8231515.delta/webrev with a full webrev at http://cr.openjdk.java.net/~never/8231515_2/webrev Should I redo any of the testing? A local build worked fine. tom Igor Veresov wrote on 1/21/20 4:12 PM: > I think you need to add setting of _aot_deopt_blob_unpack_with_exception_in_tls in AOTCodeHeap::link_graal_runtime_symbols() in aotCodeHeap.cpp. > > igor > > > >> On Jan 17, 2020, at 1:10 PM, Tom Rodriguez wrote: >> >> This fixes two separate but related issues. The primary crash is the JVMTI crash when using the post on exceptions support. In that case we use a FrameState which isn't suitable for deopt and we will crash if that deopt is taken.
The second issue is related but more rare where our support for explicit exception throwing uses a stub that has a FrameState which is also not suitable for deopt. It's more rare because deoptimization in that path is much less likely. New logic was added to Graal to verify the FrameStates used for deopt which caught both of these problems. Minor changes to JVMCI were made to expose information to Graal but there are no changes that would affect anything besides Graal. New unit tests exercise these paths explicitly and local kitchensink testing of the fix indicates that there were no more crashes with Graal. Preliminary mach5 testing against 15 was clean but mach5 testing against jdk 14 is in progress. >> >> http://cr.openjdk.java.net/~never/8231515/webrev >> https://bugs.openjdk.java.net/browse/JDK-8231515 > From vladimir.kozlov at oracle.com Wed Jan 22 18:47:06 2020 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Wed, 22 Jan 2020 10:47:06 -0800 Subject: [14] RFR (M) 8231515: [Graal] Crash during exception throwing in InterpreterRuntime::resolve_invoke In-Reply-To: <2c7e5bb5-9521-72c2-33cc-8552db5288c1@oracle.com> References: <2c7e5bb5-9521-72c2-33cc-8552db5288c1@oracle.com> Message-ID: On 1/22/20 9:45 AM, Tom Rodriguez wrote: > Ok. I'll add that. Why wouldn't testing have shown this problem? Maybe this call path isn't currently exercised by any > AOT compiled code? I guess there's no sanity checking between these 2 lists of symbols either. > > A delta webrev is at http://cr.openjdk.java.net/~never/8231515.delta/webrev with a full webrev at > http://cr.openjdk.java.net/~never/8231515_2/webrev Good. > > Should I redo any of the testing? A local build worked fine. Sync with latest JDK 14 and run tier1 - it will run AOT tests too. I think it should be enough for this incremental change. Thanks, Vladimir PS: testing results you got are good.
> > tom > > Igor Veresov wrote on 1/21/20 4:12 PM: >> I think you need to add setting of _aot_deopt_blob_unpack_with_exception_in_tls in >> AOTCodeHeap::link_graal_runtime_symbols() in aotCodeHeap.cpp. >> >> igor >> >> >> >>> On Jan 17, 2020, at 1:10 PM, Tom Rodriguez wrote: >>> >>> This fixes two separate but related issues. The primary crash is the JVMTI crash when using the post on exceptions >>> support. In that case we use a FrameState which isn't suitable for deopt and we will crash if that deopt is taken. >>> The second issue is related but more rare where our support for explicit exception throwing uses a stub that has a >>> FrameState which is also not suitable for deopt. It's more rare because deoptimization in that path is much less >>> likely. New logic was added to Graal to verify the FrameStates used for deopt which caught both of these problems. >>> Minor changes to JVMCI were made to expose information to Graal but there are no changes that would affect anything >>> besides Graal. New unit tests exercise these paths explicitly and local kitchensink testing of the fix indicates >>> that there were no more crashes with Graal. Preliminary mach5 testing against 15 was clean but mach5 testing against >>> jdk 14 is in progress. >>> >>> http://cr.openjdk.java.net/~never/8231515/webrev >>> https://bugs.openjdk.java.net/browse/JDK-8231515 >> From igor.veresov at oracle.com Wed Jan 22 19:50:02 2020 From: igor.veresov at oracle.com (Igor Veresov) Date: Wed, 22 Jan 2020 11:50:02 -0800 Subject: RFR(XXS): 8237724 Add org.graalvm.compiler.asm.amd64 to the list of packages to be processed by the options annotation processor Message-ID: <51F4D05E-B1D9-4E78-B994-48A0487D39D1@oracle.com> This is needed to reduce the noise in the integration testing with the upstream Graal.
Webrev: http://cr.openjdk.java.net/~iveresov/8237724/webrev.00/ Thanks, igor From vladimir.kozlov at oracle.com Wed Jan 22 21:53:53 2020 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Wed, 22 Jan 2020 13:53:53 -0800 Subject: RFR(XXS): 8237724 Add org.graalvm.compiler.asm.amd64 to the list of packages to be processed by the options annotation processor In-Reply-To: <51F4D05E-B1D9-4E78-B994-48A0487D39D1@oracle.com> References: <51F4D05E-B1D9-4E78-B994-48A0487D39D1@oracle.com> Message-ID: <7F445835-8BD1-46F1-BB74-98C25D3FD224@oracle.com> Looks good. Thanks Vladimir > On Jan 22, 2020, at 11:50 AM, Igor Veresov wrote: > > This is needed to reduce the noise in the integration testing with the upstream Graal. > > Webrev: http://cr.openjdk.java.net/~iveresov/8237724/webrev.00/ > > Thanks, > igor > > > From igor.veresov at oracle.com Wed Jan 22 23:32:05 2020 From: igor.veresov at oracle.com (Igor Veresov) Date: Wed, 22 Jan 2020 15:32:05 -0800 Subject: RFR(XXS): 8237724 Add org.graalvm.compiler.asm.amd64 to the list of packages to be processed by the options annotation processor In-Reply-To: <7F445835-8BD1-46F1-BB74-98C25D3FD224@oracle.com> References: <51F4D05E-B1D9-4E78-B994-48A0487D39D1@oracle.com> <7F445835-8BD1-46F1-BB74-98C25D3FD224@oracle.com> Message-ID: <5F4E9B7E-B3EB-4B44-98C7-E8DA6196BB50@oracle.com> Thanks, Vladimir! igor > On Jan 22, 2020, at 1:53 PM, Vladimir Kozlov wrote: > > Looks good. > > Thanks > Vladimir > >> On Jan 22, 2020, at 11:50 AM, Igor Veresov wrote: >> >> This is needed to reduce the noise in the integration testing with the upstream Graal.
>> >> Webrev: http://cr.openjdk.java.net/~iveresov/8237724/webrev.00/ >> >> Thanks, >> igor >> >> >> > From rwestrel at redhat.com Thu Jan 23 08:05:57 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Thu, 23 Jan 2020 09:05:57 +0100 Subject: [15] RFR (L): 7175279: Don't use x87 FPU on x86-64 In-Reply-To: <16d75ed8-896b-25fe-0a0a-babc9087e631@oracle.com> References: <0b0897e0-1dbc-306d-b2cb-31de13fb8b34@oracle.com> <7063EB29-D415-4A48-BA4F-B16C5A1F52F8@oracle.com> <16d75ed8-896b-25fe-0a0a-babc9087e631@oracle.com> Message-ID: <87h80my4uy.fsf@redhat.com> > http://cr.openjdk.java.net/~vlivanov/7175279/webrev.00/ > + > http://cr.openjdk.java.net/~vlivanov/7175279/webrev.01-00/ Looks good to me. Roland. From adinn at redhat.com Thu Jan 23 09:48:44 2020 From: adinn at redhat.com (Andrew Dinn) Date: Thu, 23 Jan 2020 09:48:44 +0000 Subject: RFR(XXS): 8237724 Add org.graalvm.compiler.asm.amd64 to the list of packages to be processed by the options annotation processor In-Reply-To: <51F4D05E-B1D9-4E78-B994-48A0487D39D1@oracle.com> References: <51F4D05E-B1D9-4E78-B994-48A0487D39D1@oracle.com> Message-ID: <7dfa8f6f-97ca-13c8-3854-348756b60851@redhat.com> Hi Igor, On 22/01/2020 19:50, Igor Veresov wrote: > This is needed to reduce the noise in the integration testing with the upstream Graal. > > Webrev: http://cr.openjdk.java.net/~iveresov/8237724/webrev.00/ PROC_SRC_SUBDIRS := \ + org.graalvm.compiler.asm.amd64 \ org.graalvm.compiler.code \ org.graalvm.compiler.core \ org.graalvm.compiler.core.aarch64 \ org.graalvm.compiler.core.amd64 \ Is there not a similar need to add package org.graalvm.compiler.asm.amd64? regards, Andrew Dinn ----------- Senior Principal Software Engineer Red Hat UK Ltd Registered in England and Wales under Company Registration No. 
03798903 Directors: Michael Cunningham, Michael ("Mike") O'Neill From dean.long at oracle.com Thu Jan 23 10:51:36 2020 From: dean.long at oracle.com (Dean Long) Date: Thu, 23 Jan 2020 02:51:36 -0800 Subject: Fix potential bug in jaotc In-Reply-To: References: Message-ID: Hi Ludovic. Good catch noticing this issue! It may not be obvious, but the code is written this way because the master version of MetadataBuilder.java that JDK syncs with is upstream in the GitHub Graal repo, and it supports multiple versions of JDK, some of which don't support that field. For example: https://github.com/oracle/graal/blob/d8d41988d0438e28df94ef08d5b8573db8f42a81/compiler/src/org.graalvm.compiler.hotspot.jdk12/src/org/graalvm/compiler/hotspot/HotSpotGraalServices.java#L38 http://hg.openjdk.java.net/jdk/jdk12/file/06222165c35f/src/hotspot/share/aot/aotCompiledMethod.hpp#l53 Notice that there is a different HotSpotGraalServices for almost every JDK version, and that JDK versions older than 13 don't have that aot_metadata _nul_chk_table_begin field. dl On 1/7/20 2:16 PM, Ludovic Henry wrote: > Hello, > > As I was learning and reading over the codebase of src/jdk.aot and src/hotspot/share/aot, I discovered a discrepancy which might lead to future bugs. The issue is around how some of the metadata is generated and loaded between jaotc and the runtime. > > In the runtime, `aot_metadata` (src/hotspot/share/aot/aotCompiledMethod.hpp) contains metadata about each method that is compiled in the AOT image - for example, information like the implicit and explicit exception table, or the oop map. Because the size of this data is specific to each method, `aot_metadata` contains fields that store offsets relative to the `this` pointer, and the data is stored right after the `aot_metadata` structure in the AOT image.
>
> The structure of `aot_metadata` is the following:
>
> class aot_metadata {
>   int _size;
>   int _code_size;
>   int _entry;
>   int _verified_entry;
>   int _exception_handler_offset;
>   int _deopt_handler_offset;
>   int _stubs_offset;
>   int _frame_size;
>   int _orig_pc_offset;
>   int _unsafe_access;
>
>   // contain the offset to the `this` pointer
>   int _pc_desc_begin;
>   int _scopes_begin;
>   int _reloc_begin;
>   int _exception_table_begin;
>   int _nul_chk_table_begin;
>   int _oopmap_begin;
> }
>
> When creating the AOT image and generating the information in `jdk.tools.jaotc.MetadataBuilder` (src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java), the `aot_metadata._nul_chk_table_begin` field is conditionally generated. In the case where there are no implicit exceptions in the method, no data/field is generated in the AOT image. However, there is no check for the existence of this field when loading the `aot_metadata` from the AOT image.
>
> This issue is mitigated by the fact that `HotSpotMetaData.implicitExceptionBytes` is always set, even if only with an empty `byte[]`. Nonetheless, this field is populated in the runtime in `getMetadata` (in src/hotspot/share/jvmci/jvmciCompilerToVM.cpp), keeping the initialization and usage of this field far apart and complicating the overall solution.
>
> The following patch proposes to always generate the `aot_metadata._nul_chk_table_begin` field. It also has the advantage of unifying how this field and the others are generated.
>
> diff --git a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java
> index 34ce4d8c8b..fa89d34796 100644
> --- a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java
> +++ b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java
> @@ -98,7 +98,8 @@ final class MetadataBuilder {
>      byte[] scopeDesc = metaData.scopesDescBytes();
>      byte[] relocationInfo = metaData.relocBytes();
>      byte[] oopMapInfo = metaData.oopMaps();
> -    byte[] implicitExceptionBytes = HotSpotGraalServices.getImplicitExceptionBytes(metaData);
> +    byte[] exceptionBytes = metaData.exceptionBytes();
> +    byte[] implicitExceptionBytes = metaData.implicitExceptionBytes();
>
>      // create a global symbol at this position for this method
>      NativeOrderOutputStream metadataStream = new NativeOrderOutputStream();
> @@ -143,10 +144,7 @@ final class MetadataBuilder {
>      NativeOrderOutputStream.PatchableInt scopeOffset = metadataStream.patchableInt();
>      NativeOrderOutputStream.PatchableInt relocationOffset = metadataStream.patchableInt();
>      NativeOrderOutputStream.PatchableInt exceptionOffset = metadataStream.patchableInt();
> -    NativeOrderOutputStream.PatchableInt implictTableOffset = null;
> -    if (implicitExceptionBytes != null) {
> -        implictTableOffset = metadataStream.patchableInt();
> -    }
> +    NativeOrderOutputStream.PatchableInt implictTableOffset = metadataStream.patchableInt();
>      NativeOrderOutputStream.PatchableInt oopMapOffset = metadataStream.patchableInt();
>      metadataStream.align(8);
>
> @@ -160,12 +158,10 @@ final class MetadataBuilder {
>      metadataStream.put(relocationInfo).align(8);
>
>      exceptionOffset.set(metadataStream.position());
> -    metadataStream.put(metaData.exceptionBytes()).align(8);
> +    metadataStream.put(exceptionBytes).align(8);
>
> -    if (implicitExceptionBytes != null) {
> -        implictTableOffset.set(metadataStream.position());
> -        metadataStream.put(implicitExceptionBytes).align(8);
> -    }
> +    implictTableOffset.set(metadataStream.position());
> +    metadataStream.put(implicitExceptionBytes).align(8);
>
>      // oopmaps should be last
>      oopMapOffset.set(metadataStream.position());
> diff --git a/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java b/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java
> index fc54c41312..3acb6191b9 100644
> --- a/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java
> +++ b/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java
> @@ -41,6 +41,7 @@ public class HotSpotMetaData {
>      scopesDescBytes = new byte[0];
>      relocBytes = new byte[0];
>      exceptionBytes = new byte[0];
> +    implicitExceptionBytes = new byte[0];
>      oopMaps = new byte[0];
>      metadata = new String[0];
>      // ...some of them will be overwritten by the VM:
>
> Thank you,
>
> --
> Ludovic
>

From boris.ulasevich at bell-sw.com Thu Jan 23 12:38:26 2020
From: boris.ulasevich at bell-sw.com (Boris Ulasevich)
Date: Thu, 23 Jan 2020 15:38:26 +0300
Subject: RFR: 8231118: ARM32: Math tests failures
Message-ID: 

Hi,

Please review the fix for the floating point Math function failures reproduced on an ARM32 fastdebug build run on Raspbian 10. The issue is a call into the runtime from the c2i_adapter (function arguments still sit in FPU registers at this point) without proper register saving.
http://cr.openjdk.java.net/~bulasevich/8231118/webrev.01 http://bugs.openjdk.java.net/browse/JDK-8231118 thanks, Boris From vladimir.x.ivanov at oracle.com Thu Jan 23 13:23:20 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Thu, 23 Jan 2020 16:23:20 +0300 Subject: [15] RFR (L): 7175279: Don't use x87 FPU on x86-64 In-Reply-To: <87h80my4uy.fsf@redhat.com> References: <0b0897e0-1dbc-306d-b2cb-31de13fb8b34@oracle.com> <7063EB29-D415-4A48-BA4F-B16C5A1F52F8@oracle.com> <16d75ed8-896b-25fe-0a0a-babc9087e631@oracle.com> <87h80my4uy.fsf@redhat.com> Message-ID: Thanks, Roland. Best regards, Vladimir Ivanov On 23.01.2020 11:05, Roland Westrelin wrote: > >> http://cr.openjdk.java.net/~vlivanov/7175279/webrev.00/ >> + >> http://cr.openjdk.java.net/~vlivanov/7175279/webrev.01-00/ > > Looks good to me. > > Roland. > From ekaterina.pavlova at oracle.com Thu Jan 23 16:44:58 2020 From: ekaterina.pavlova at oracle.com (Ekaterina Pavlova) Date: Thu, 23 Jan 2020 08:44:58 -0800 Subject: RFR(T/XXS): 8206113 "Troubles configuring graal tests" Message-ID: <1926d59a-0489-5815-fcbb-50a4ec9ae033@oracle.com> Hello, Please review the trivial fix which removes useless message. Other configure/product issues mentioned in the bug were addressed by JDK-8206135. JBS: https://bugs.openjdk.java.net/browse/JDK-8206113 webrev: http://cr.openjdk.java.net/~epavlova//8206113/webrev.00/index.html testing: graal tiers thanks, -katya From vladimir.kozlov at oracle.com Thu Jan 23 17:26:17 2020 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Thu, 23 Jan 2020 09:26:17 -0800 Subject: RFR(T/XXS): 8206113 "Troubles configuring graal tests" In-Reply-To: <1926d59a-0489-5815-fcbb-50a4ec9ae033@oracle.com> References: <1926d59a-0489-5815-fcbb-50a4ec9ae033@oracle.com> Message-ID: <8C47ED25-96BE-45BF-8313-0FB70E5B0DFC@oracle.com> Good. Thanks Vladimir > On Jan 23, 2020, at 8:45 AM, Ekaterina Pavlova wrote: > > ?Hello, > > Please review the trivial fix which removes useless message. 
> Other configure/product issues mentioned in the bug were > addressed by JDK-8206135. > > JBS: https://bugs.openjdk.java.net/browse/JDK-8206113 > webrev: http://cr.openjdk.java.net/~epavlova//8206113/webrev.00/index.html > testing: graal tiers > > > thanks, > -katya From ekaterina.pavlova at oracle.com Thu Jan 23 18:24:31 2020 From: ekaterina.pavlova at oracle.com (Ekaterina Pavlova) Date: Thu, 23 Jan 2020 10:24:31 -0800 Subject: RFR(T/XXS): 8206113 "Troubles configuring graal tests" In-Reply-To: <8C47ED25-96BE-45BF-8313-0FB70E5B0DFC@oracle.com> References: <1926d59a-0489-5815-fcbb-50a4ec9ae033@oracle.com> <8C47ED25-96BE-45BF-8313-0FB70E5B0DFC@oracle.com> Message-ID: Thanks Vladimir, pushed. -katya On 1/23/20 9:26 AM, Vladimir Kozlov wrote: > Good. > > Thanks > Vladimir > >> On Jan 23, 2020, at 8:45 AM, Ekaterina Pavlova wrote: >> >> ?Hello, >> >> Please review the trivial fix which removes useless message. >> Other configure/product issues mentioned in the bug were >> addressed by JDK-8206135. >> >> JBS: https://bugs.openjdk.java.net/browse/JDK-8206113 >> webrev: http://cr.openjdk.java.net/~epavlova//8206113/webrev.00/index.html >> testing: graal tiers >> >> >> thanks, >> -katya > From igor.ignatyev at oracle.com Thu Jan 23 22:21:53 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Thu, 23 Jan 2020 14:21:53 -0800 Subject: RFR(S) : 8237787 : rewrite vmTestbase/vm/compiler/CodeCacheInfo* from shell to java Message-ID: <8AB60BC1-0228-43FA-9D44-3E1DDAF8A129@oracle.com> http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 > 177 lines changed: 56 ins; 117 del; 4 mod; 7 Hi all, could you please review the patch which rewrites vmTestbase/vm/compiler/CodeCacheInfo and CodeCacheInfoOnCompilation tests from shell to java? 
the only differences between the java and shell versions are:
- instead of skipping execution if there isn't -Xcomp, CodeCacheInfoOnCompilation adds -Xcomp to the command line
- instead of checking the value of the SegmentedCodeCache flag, CodeCacheInfo runs two cases, w/ +SegmentedCodeCache and w/ -SegmentedCodeCache

webrev: http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00
JBS: https://bugs.openjdk.java.net/browse/JDK-8237787
testing: the changed tests on linux-x64, windows-x64, and macosx-x64

Thanks,
-- Igor

From luhenry at microsoft.com Fri Jan 24 06:06:57 2020
From: luhenry at microsoft.com (Ludovic Henry)
Date: Fri, 24 Jan 2020 06:06:57 +0000
Subject: Fix potential bug in jaotc
In-Reply-To: 
References: 
Message-ID: 

Hi Dean,

That indeed makes a lot more sense! Would it then be possible to add a comment in `src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java` explaining just that, so it's not confusing for anyone else? I am thinking of something like the following:

diff --git a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java index 34ce4d8c8b..3cb4c4b65d 100644 --- a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java +++ b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java @@ -98,7 +98,9 @@ final class MetadataBuilder { byte[] scopeDesc = metaData.scopesDescBytes(); byte[] relocationInfo = metaData.relocBytes(); byte[] oopMapInfo = metaData.oopMaps(); + // this may be null as the field does not exist before JDK 13 byte[] implicitExceptionBytes = HotSpotGraalServices.getImplicitExceptionBytes(metaData); + byte[] exceptionBytes = metaData.exceptionBytes(); // create a global symbol at this position for this method NativeOrderOutputStream metadataStream = new NativeOrderOutputStream(); @@ -160,7 +162,7 @@ final class MetadataBuilder {
metadataStream.put(relocationInfo).align(8); exceptionOffset.set(metadataStream.position()); - metadataStream.put(metaData.exceptionBytes()).align(8); + metadataStream.put(exceptionBytes).align(8); if (implicitExceptionBytes != null) { implictTableOffset.set(metadataStream.position()); Thank you, -- Ludovic -----Original Message----- From: Dean Long Sent: Thursday, January 23, 2020 2:52 AM To: Ludovic Henry ; hotspot-compiler-dev at openjdk.java.net; graal-dev Subject: Re: Fix potential bug in jaotc Hi Ludovic.? Good catch noticing this issue!? It may not be obvious, but the code is written this way because the master version of MetadataBuilder.java that JDK syncs with is upstream in the github Graal repo, and it supports multiple versions of JDK, some of which don't support that field.? For example: https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Foracle%2Fgraal%2Fblob%2Fd8d41988d0438e28df94ef08d5b8573db8f42a81%2Fcompiler%2Fsrc%2Forg.graalvm.compiler.hotspot.jdk12%2Fsrc%2Forg%2Fgraalvm%2Fcompiler%2Fhotspot%2FHotSpotGraalServices.java%23L38&data=02%7C01%7Cluhenry%40microsoft.com%7C3f3fd93631df446c5b2108d79ff23d3b%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637153735067898897&sdata=8nBmvFbwZYGzDElkx2jJhLW980hamEFwhh2DbzFgv%2Bw%3D&reserved=0 https://nam06.safelinks.protection.outlook.com/?url=http%3A%2F%2Fhg.openjdk.java.net%2Fjdk%2Fjdk12%2Ffile%2F06222165c35f%2Fsrc%2Fhotspot%2Fshare%2Faot%2FaotCompiledMethod.hpp%23l53&data=02%7C01%7Cluhenry%40microsoft.com%7C3f3fd93631df446c5b2108d79ff23d3b%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637153735067898897&sdata=P%2BzgyEw1VTfz8b790ac8KOjYdYNR5n%2FAv5WFylN%2BsPo%3D&reserved=0 Notice that there is a different HotSpotGraalServices for almost every JDK version, and that JDK versions older than 13 don't have that aot_metadata _nul_chk_table_begin field. 
dl On 1/7/20 2:16 PM, Ludovic Henry wrote: > Hello, > > As I was learning and reading over the codebase of src/jdk.aot and src/hotspot/share/aot, I discovered a discrepancy which might lead to future bugs. The issue is around how some of the metadata is generated and loaded between jaotc and the runtime. > > In the runtime, `aot_metadata` (src/hotspot/share/aot/aotCompiledMethod.hpp) contains metadata about each method that is compiled in the AOT image - for example, information like the implicit and explicit exception table, or the oop map. Because the size of this data is specific to each method, `aot_metadata` contains fields that store offsets to the `this` pointer, and the data is stored right after the `aot_metadata` structure in the AOT image. > > The structure of `aot_metadata` is the following: > > class aot_metadata { > int _size; > int _code_size; > int _entry; > int _verified_entry; > int _exception_handler_offset; > int _deopt_handler_offset; > int _stubs_offset; > int _frame_size; > int _orig_pc_offset; > int _unsafe_access; > > // contain the offset to the `this` pointer > int _pc_desc_begin; > int _scopes_begin; > int _reloc_begin; > int _exception_table_begin; > int _nul_chk_table_begin; > int _oopmap_begin; > } > > When creating the AOT image and generating the information in `jdk.tools.jaotc.MetadataBuilder` (src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java), the `aot_metadata._nul_chk_table_begin` field is conditionally generated. In the case where there is no implicit exceptions in the method, no data/field is generated in the AOT image. However, there is no check for the existence of this field when loading the `aot_metadata` from the AOT image. > > This issue is mitigated by the fact that the `HotSpotMetaData.implicitExceptionBytes` is always set, even if only with an empty `byte[]`. 
Nonetheless, this field is populated in the runtime in `getMetadata` (in src/hotspot/share/jvmci/jvmciCompilerToVM.cpp), making the initialization and usage of this field far apart, complexifying the overall solution. > > The following patch proposes to always generate the `aot_metadata._nul_chk_table_begin` field. It also has the advantage to unify how this field and the others are generated. > > diff --git a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java > index 34ce4d8c8b..fa89d34796 100644 > --- a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java > +++ b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java > @@ -98,7 +98,8 @@ final class MetadataBuilder { > byte[] scopeDesc = metaData.scopesDescBytes(); > byte[] relocationInfo = metaData.relocBytes(); > byte[] oopMapInfo = metaData.oopMaps(); > - byte[] implicitExceptionBytes = HotSpotGraalServices.getImplicitExceptionBytes(metaData); > + byte[] exceptionBytes = metaData.exceptionBytes(); > + byte[] implicitExceptionBytes = metaData.implicitExceptionBytes(); > > // create a global symbol at this position for this method > NativeOrderOutputStream metadataStream = new NativeOrderOutputStream(); > @@ -143,10 +144,7 @@ final class MetadataBuilder { > NativeOrderOutputStream.PatchableInt scopeOffset = metadataStream.patchableInt(); > NativeOrderOutputStream.PatchableInt relocationOffset = metadataStream.patchableInt(); > NativeOrderOutputStream.PatchableInt exceptionOffset = metadataStream.patchableInt(); > - NativeOrderOutputStream.PatchableInt implictTableOffset = null; > - if (implicitExceptionBytes != null) { > - implictTableOffset = metadataStream.patchableInt(); > - } > + NativeOrderOutputStream.PatchableInt implictTableOffset = metadataStream.patchableInt(); > NativeOrderOutputStream.PatchableInt oopMapOffset = 
metadataStream.patchableInt(); > metadataStream.align(8); > > @@ -160,12 +158,10 @@ final class MetadataBuilder { > metadataStream.put(relocationInfo).align(8); > > exceptionOffset.set(metadataStream.position()); > - metadataStream.put(metaData.exceptionBytes()).align(8); > + metadataStream.put(exceptionBytes).align(8); > > - if (implicitExceptionBytes != null) { > - implictTableOffset.set(metadataStream.position()); > - metadataStream.put(implicitExceptionBytes).align(8); > - } > + implictTableOffset.set(metadataStream.position()); > + metadataStream.put(implicitExceptionBytes).align(8); > > // oopmaps should be last > oopMapOffset.set(metadataStream.position()); > diff --git a/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java b/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java > index fc54c41312..3acb6191b9 100644 > --- a/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java > +++ b/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java > @@ -41,6 +41,7 @@ public class HotSpotMetaData { > scopesDescBytes = new byte[0]; > relocBytes = new byte[0]; > exceptionBytes = new byte[0]; > + implicitExceptionBytes = new byte[0]; > oopMaps = new byte[0]; > metadata = new String[0]; > // ...some of them will be overwritten by the VM: > > Thank you, > > -- > Ludovic > > > > > > From nils.eliasson at oracle.com Fri Jan 24 08:36:10 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Fri, 24 Jan 2020 09:36:10 +0100 Subject: [15] RFR (M): 8237581: Improve allocation expansion Message-ID: <2d79a359-6b0b-c576-518c-388a8ffc4890@oracle.com> Hi, This patch improves expand_allocate_common when there is no use of the allocation. Three cases are improved: - With unknown allocation length - don't expand the fast path. No allocation is needed when it isn't used. 
NegativeArraySizeException will still be caught by the slowpath.
- With known length inside the legal range - no fast path or slow path is needed. The allocation node is removed.
- With known length outside the legal range - only a slow path is needed.

I also refactored the code a bit, keeping the general structure for an easy diff, but extracting some routines to make it more readable.

Bug: https://bugs.openjdk.java.net/browse/JDK-8237581
Webrev: http://cr.openjdk.java.net/~neliasso/8237581/webrev.03/

Please review!

Best regards,
Nils Eliasson

From ekaterina.pavlova at oracle.com Fri Jan 24 16:12:35 2020
From: ekaterina.pavlova at oracle.com (Ekaterina Pavlova)
Date: Fri, 24 Jan 2020 08:12:35 -0800
Subject: RFR(T/XS): 8236211 [Graal] compiler/graalunit/GraphTest.java is skipped in all testing
Message-ID: 

Hi All,

Please review the small patch which adds the required --add-opens for the org.graalvm.compiler.graph tests (compiler/graalunit/GraphTest.java). Also removed the unnecessary "@requires vm.graal.enabled" from compiler/graalunit/NodesTest.java. Both tests will start running in graal tiers after this fix (these tests are currently skipped because graalunit tests are tested only when Graal is not used as the JIT).
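[For background, the need for --add-opens can be illustrated with a few lines of plain Java, unrelated to the actual graalunit harness: reflective access into a package that its module does not open is rejected unless the module is opened on the command line.]

```java
import java.lang.reflect.Field;

public class AddOpensSketch {
    public static void main(String[] args) throws Exception {
        // java.base does not open java.lang to other modules by default, so
        // making this field accessible requires a flag such as:
        //   --add-opens java.base/java.lang=ALL-UNNAMED
        Field value = String.class.getDeclaredField("value");
        boolean opened = value.trySetAccessible(); // false without --add-opens on recent JDKs
        System.out.println(opened ? "opened" : "needs --add-opens");
    }
}
```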
JBS: https://bugs.openjdk.java.net/browse/JDK-8236211 webrev: http://cr.openjdk.java.net/~epavlova/8236211/webrev.00/index.html testing: graal tiers thanks, -katya From ekaterina.pavlova at oracle.com Fri Jan 24 16:18:29 2020 From: ekaterina.pavlova at oracle.com (Ekaterina Pavlova) Date: Fri, 24 Jan 2020 08:18:29 -0800 Subject: RFR(S) : 8237787 : rewrite vmTestbase/vm/compiler/CodeCacheInfo* from shell to java In-Reply-To: <8AB60BC1-0228-43FA-9D44-3E1DDAF8A129@oracle.com> References: <8AB60BC1-0228-43FA-9D44-3E1DDAF8A129@oracle.com> Message-ID: <4d1d2e33-019d-36ed-9fed-b4bc177d59bc@oracle.com> On 1/23/20 2:21 PM, Igor Ignatyev wrote: > http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 >> 177 lines changed: 56 ins; 117 del; 4 mod; 7 > > Hi all, > > could you please review the patch which rewrites vmTestbase/vm/compiler/CodeCacheInfo and CodeCacheInfoOnCompilation tests from shell to java? > > the only difference b/w java and shell versions are: > - instead of skipping execution if there isn't -Xcomp, CodeCacheInfoOnCompilation adds -Xcomp to command line Igor, I don't see -Xcomp is added. Am I missing something? > - instead of checking the value of SegmentedCodeCache flag, CodeCacheInfo runs two cases w/ +SegmentedCodeCache and w/ -SegmentedCodeCache > > > webrev: http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 > JBS: https://bugs.openjdk.java.net/browse/JDK-8237787 > testing: the changed tests on linux-x64, windows-x64, and macosx-x64 > > Thanks, > -- Igor > From igor.ignatyev at oracle.com Fri Jan 24 23:06:05 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Fri, 24 Jan 2020 15:06:05 -0800 Subject: RFR(S) : 8237787 : rewrite vmTestbase/vm/compiler/CodeCacheInfo* from shell to java In-Reply-To: <4d1d2e33-019d-36ed-9fed-b4bc177d59bc@oracle.com> References: <8AB60BC1-0228-43FA-9D44-3E1DDAF8A129@oracle.com> <4d1d2e33-019d-36ed-9fed-b4bc177d59bc@oracle.com> Message-ID: <4F60F060-7848-4050-A47F-C97590ECBFF4@oracle.com> no, you are right. 
it seems I uploaded not the latest version of my patch. fixed and uploaded .01 which has -Xcomp -- http://cr.openjdk.java.net/~iignatyev//8237787/webrev.01 -- Igor > On Jan 24, 2020, at 8:18 AM, Ekaterina Pavlova wrote: > > On 1/23/20 2:21 PM, Igor Ignatyev wrote: >> http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 >>> 177 lines changed: 56 ins; 117 del; 4 mod; 7 >> Hi all, >> could you please review the patch which rewrites vmTestbase/vm/compiler/CodeCacheInfo and CodeCacheInfoOnCompilation tests from shell to java? >> the only difference b/w java and shell versions are: >> - instead of skipping execution if there isn't -Xcomp, CodeCacheInfoOnCompilation adds -Xcomp to command line > > Igor, I don't see -Xcomp is added. Am I missing something? > >> - instead of checking the value of SegmentedCodeCache flag, CodeCacheInfo runs two cases w/ +SegmentedCodeCache and w/ -SegmentedCodeCache >> webrev: http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 >> JBS: https://bugs.openjdk.java.net/browse/JDK-8237787 >> testing: the changed tests on linux-x64, windows-x64, and macosx-x64 >> Thanks, >> -- Igor > From ekaterina.pavlova at oracle.com Fri Jan 24 23:26:31 2020 From: ekaterina.pavlova at oracle.com (Ekaterina Pavlova) Date: Fri, 24 Jan 2020 15:26:31 -0800 Subject: RFR(S) : 8237787 : rewrite vmTestbase/vm/compiler/CodeCacheInfo* from shell to java In-Reply-To: <4F60F060-7848-4050-A47F-C97590ECBFF4@oracle.com> References: <8AB60BC1-0228-43FA-9D44-3E1DDAF8A129@oracle.com> <4d1d2e33-019d-36ed-9fed-b4bc177d59bc@oracle.com> <4F60F060-7848-4050-A47F-C97590ECBFF4@oracle.com> Message-ID: <9079dc9a-65d4-92fd-513f-19f87bfc6404@oracle.com> Seems to be good now. thanks, -katya On 1/24/20 3:06 PM, Igor Ignatyev wrote: > no, you are right. it seems I uploaded not the latest version of my patch. 
fixed and uploaded .01 which has -Xcomp -- http://cr.openjdk.java.net/~iignatyev//8237787/webrev.01 > > -- Igor > >> On Jan 24, 2020, at 8:18 AM, Ekaterina Pavlova > wrote: >> >> On 1/23/20 2:21 PM, Igor Ignatyev wrote: >>> http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 >>>> 177 lines changed: 56 ins; 117 del; 4 mod; 7 >>> Hi all, >>> could you please review the patch which rewrites vmTestbase/vm/compiler/CodeCacheInfo and CodeCacheInfoOnCompilation tests from shell to java? >>> the only difference b/w java and shell versions are: >>> ?- instead of skipping execution if there isn't -Xcomp, CodeCacheInfoOnCompilation adds -Xcomp to command line >> >> Igor, I don't see -Xcomp is added. Am I missing something? >> >>> ?- instead of checking the value of SegmentedCodeCache flag, CodeCacheInfo runs two cases w/ +SegmentedCodeCache and w/ -SegmentedCodeCache >>> webrev: http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 >>> JBS: https://bugs.openjdk.java.net/browse/JDK-8237787 >>> testing: the changed tests on linux-x64, windows-x64, and macosx-x64 >>> Thanks, >>> -- Igor >> > From dean.long at oracle.com Sat Jan 25 01:06:40 2020 From: dean.long at oracle.com (Dean Long) Date: Fri, 24 Jan 2020 17:06:40 -0800 Subject: Fix potential bug in jaotc In-Reply-To: References: Message-ID: <507b8790-d708-51c1-8e68-ccfd7b2a3d05@oracle.com> Yes, that's a great idea.? If you'd like to get involved yourself, you can create a Pull Request here: https://github.com/oracle/graal/tree/master/compiler If you need any help with that, you can email me directly. thanks, dl On 1/23/20 10:06 PM, Ludovic Henry wrote: > Hi Dean, > > That indeed makes a lot more sense! Would it then be possible to add a comment in `src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java` explaining just that, so it's not confusing for anyone else? 
I am thinking of something like the following: > > diff --git a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java > index 34ce4d8c8b..3cb4c4b65d 100644 > --- a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java > +++ b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java > @@ -98,7 +98,9 @@ final class MetadataBuilder { > byte[] scopeDesc = metaData.scopesDescBytes(); > byte[] relocationInfo = metaData.relocBytes(); > byte[] oopMapInfo = metaData.oopMaps(); > + // this may be null as the field does not exist before JDK 13 > byte[] implicitExceptionBytes = HotSpotGraalServices.getImplicitExceptionBytes(metaData); > + byte[] exceptionBytes = metaData.exceptionBytes(); > > // create a global symbol at this position for this method > NativeOrderOutputStream metadataStream = new NativeOrderOutputStream(); > @@ -160,7 +162,7 @@ final class MetadataBuilder { > metadataStream.put(relocationInfo).align(8); > > exceptionOffset.set(metadataStream.position()); > - metadataStream.put(metaData.exceptionBytes()).align(8); > + metadataStream.put(exceptionBytes).align(8); > > if (implicitExceptionBytes != null) { > implictTableOffset.set(metadataStream.position()); > > Thank you, > > -- > Ludovic > > -----Original Message----- > From: Dean Long > Sent: Thursday, January 23, 2020 2:52 AM > To: Ludovic Henry ; hotspot-compiler-dev at openjdk.java.net; graal-dev > Subject: Re: Fix potential bug in jaotc > > Hi Ludovic.? Good catch noticing this issue!? It may not be obvious, but > the code is written this way because the master version of > MetadataBuilder.java that JDK syncs with is upstream in the github Graal > repo, and it supports multiple versions of JDK, some of which don't > support that field.? 
For example: > > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Foracle%2Fgraal%2Fblob%2Fd8d41988d0438e28df94ef08d5b8573db8f42a81%2Fcompiler%2Fsrc%2Forg.graalvm.compiler.hotspot.jdk12%2Fsrc%2Forg%2Fgraalvm%2Fcompiler%2Fhotspot%2FHotSpotGraalServices.java%23L38&data=02%7C01%7Cluhenry%40microsoft.com%7C3f3fd93631df446c5b2108d79ff23d3b%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637153735067898897&sdata=8nBmvFbwZYGzDElkx2jJhLW980hamEFwhh2DbzFgv%2Bw%3D&reserved=0 > https://nam06.safelinks.protection.outlook.com/?url=http%3A%2F%2Fhg.openjdk.java.net%2Fjdk%2Fjdk12%2Ffile%2F06222165c35f%2Fsrc%2Fhotspot%2Fshare%2Faot%2FaotCompiledMethod.hpp%23l53&data=02%7C01%7Cluhenry%40microsoft.com%7C3f3fd93631df446c5b2108d79ff23d3b%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637153735067898897&sdata=P%2BzgyEw1VTfz8b790ac8KOjYdYNR5n%2FAv5WFylN%2BsPo%3D&reserved=0 > > Notice that there is a different HotSpotGraalServices for almost every > JDK version, and that JDK versions older than 13 don't have that > aot_metadata _nul_chk_table_begin field. > > dl > > On 1/7/20 2:16 PM, Ludovic Henry wrote: >> Hello, >> >> As I was learning and reading over the codebase of src/jdk.aot and src/hotspot/share/aot, I discovered a discrepancy which might lead to future bugs. The issue is around how some of the metadata is generated and loaded between jaotc and the runtime. >> >> In the runtime, `aot_metadata` (src/hotspot/share/aot/aotCompiledMethod.hpp) contains metadata about each method that is compiled in the AOT image - for example, information like the implicit and explicit exception table, or the oop map. Because the size of this data is specific to each method, `aot_metadata` contains fields that store offsets to the `this` pointer, and the data is stored right after the `aot_metadata` structure in the AOT image. 
>> >> The structure of `aot_metadata` is the following: >> >> class aot_metadata { >> int _size; >> int _code_size; >> int _entry; >> int _verified_entry; >> int _exception_handler_offset; >> int _deopt_handler_offset; >> int _stubs_offset; >> int _frame_size; >> int _orig_pc_offset; >> int _unsafe_access; >> >> // contain the offset to the `this` pointer >> int _pc_desc_begin; >> int _scopes_begin; >> int _reloc_begin; >> int _exception_table_begin; >> int _nul_chk_table_begin; >> int _oopmap_begin; >> } >> >> When creating the AOT image and generating the information in `jdk.tools.jaotc.MetadataBuilder` (src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java), the `aot_metadata._nul_chk_table_begin` field is conditionally generated. In the case where there is no implicit exceptions in the method, no data/field is generated in the AOT image. However, there is no check for the existence of this field when loading the `aot_metadata` from the AOT image. >> >> This issue is mitigated by the fact that the `HotSpotMetaData.implicitExceptionBytes` is always set, even if only with an empty `byte[]`. Nonetheless, this field is populated in the runtime in `getMetadata` (in src/hotspot/share/jvmci/jvmciCompilerToVM.cpp), making the initialization and usage of this field far apart, complexifying the overall solution. >> >> The following patch proposes to always generate the `aot_metadata._nul_chk_table_begin` field. It also has the advantage to unify how this field and the others are generated. 
>> >> diff --git a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java >> index 34ce4d8c8b..fa89d34796 100644 >> --- a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java >> +++ b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java >> @@ -98,7 +98,8 @@ final class MetadataBuilder { >> byte[] scopeDesc = metaData.scopesDescBytes(); >> byte[] relocationInfo = metaData.relocBytes(); >> byte[] oopMapInfo = metaData.oopMaps(); >> - byte[] implicitExceptionBytes = HotSpotGraalServices.getImplicitExceptionBytes(metaData); >> + byte[] exceptionBytes = metaData.exceptionBytes(); >> + byte[] implicitExceptionBytes = metaData.implicitExceptionBytes(); >> >> // create a global symbol at this position for this method >> NativeOrderOutputStream metadataStream = new NativeOrderOutputStream(); >> @@ -143,10 +144,7 @@ final class MetadataBuilder { >> NativeOrderOutputStream.PatchableInt scopeOffset = metadataStream.patchableInt(); >> NativeOrderOutputStream.PatchableInt relocationOffset = metadataStream.patchableInt(); >> NativeOrderOutputStream.PatchableInt exceptionOffset = metadataStream.patchableInt(); >> - NativeOrderOutputStream.PatchableInt implictTableOffset = null; >> - if (implicitExceptionBytes != null) { >> - implictTableOffset = metadataStream.patchableInt(); >> - } >> + NativeOrderOutputStream.PatchableInt implictTableOffset = metadataStream.patchableInt(); >> NativeOrderOutputStream.PatchableInt oopMapOffset = metadataStream.patchableInt(); >> metadataStream.align(8); >> >> @@ -160,12 +158,10 @@ final class MetadataBuilder { >> metadataStream.put(relocationInfo).align(8); >> >> exceptionOffset.set(metadataStream.position()); >> - metadataStream.put(metaData.exceptionBytes()).align(8); >> + metadataStream.put(exceptionBytes).align(8); >> >> - if (implicitExceptionBytes != null) { >> - 
implictTableOffset.set(metadataStream.position()); >> - metadataStream.put(implicitExceptionBytes).align(8); >> - } >> + implictTableOffset.set(metadataStream.position()); >> + metadataStream.put(implicitExceptionBytes).align(8); >> >> // oopmaps should be last >> oopMapOffset.set(metadataStream.position()); >> diff --git a/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java b/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java >> index fc54c41312..3acb6191b9 100644 >> --- a/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java >> +++ b/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java >> @@ -41,6 +41,7 @@ public class HotSpotMetaData { >> scopesDescBytes = new byte[0]; >> relocBytes = new byte[0]; >> exceptionBytes = new byte[0]; >> + implicitExceptionBytes = new byte[0]; >> oopMaps = new byte[0]; >> metadata = new String[0]; >> // ...some of them will be overwritten by the VM: >> >> Thank you, >> >> -- >> Ludovic >> >> >> >> >> >> From luhenry at microsoft.com Sat Jan 25 01:47:30 2020 From: luhenry at microsoft.com (Ludovic Henry) Date: Sat, 25 Jan 2020 01:47:30 +0000 Subject: Fix potential bug in jaotc In-Reply-To: <507b8790-d708-51c1-8e68-ccfd7b2a3d05@oracle.com> References: <507b8790-d708-51c1-8e68-ccfd7b2a3d05@oracle.com> Message-ID: I've created https://github.com/oracle/graal/pull/2096. Thanks for the feedback :) -- Ludovic -----Original Message----- From: Dean Long Sent: Friday, January 24, 2020 5:07 PM To: Ludovic Henry ; hotspot-compiler-dev at openjdk.java.net; graal-dev Subject: Re: Fix potential bug in jaotc Yes, that's a great idea.? 
If you'd like to get involved yourself, you can create a Pull Request here: https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Foracle%2Fgraal%2Ftree%2Fmaster%2Fcompiler&data=02%7C01%7Cluhenry%40microsoft.com%7C50869631ba974939d17708d7a13322c1%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637155113313150283&sdata=K9EJLJChV%2BOz35kjoTZeVoh%2F7WHg80UeWEuw26HgMzE%3D&reserved=0 If you need any help with that, you can email me directly. thanks, dl On 1/23/20 10:06 PM, Ludovic Henry wrote: > Hi Dean, > > That indeed makes a lot more sense! Would it then be possible to add a comment in `src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java` explaining just that, so it's not confusing for anyone else? I am thinking of something like the following: > > diff --git a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java > index 34ce4d8c8b..3cb4c4b65d 100644 > --- a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java > +++ b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java > @@ -98,7 +98,9 @@ final class MetadataBuilder { > byte[] scopeDesc = metaData.scopesDescBytes(); > byte[] relocationInfo = metaData.relocBytes(); > byte[] oopMapInfo = metaData.oopMaps(); > + // this may be null as the field does not exist before JDK 13 > byte[] implicitExceptionBytes = HotSpotGraalServices.getImplicitExceptionBytes(metaData); > + byte[] exceptionBytes = metaData.exceptionBytes(); > > // create a global symbol at this position for this method > NativeOrderOutputStream metadataStream = new NativeOrderOutputStream(); > @@ -160,7 +162,7 @@ final class MetadataBuilder { > metadataStream.put(relocationInfo).align(8); > > exceptionOffset.set(metadataStream.position()); > - metadataStream.put(metaData.exceptionBytes()).align(8); > + 
metadataStream.put(exceptionBytes).align(8); > > if (implicitExceptionBytes != null) { > implictTableOffset.set(metadataStream.position()); > > Thank you, > > -- > Ludovic > > -----Original Message----- > From: Dean Long > Sent: Thursday, January 23, 2020 2:52 AM > To: Ludovic Henry ; hotspot-compiler-dev at openjdk.java.net; graal-dev > Subject: Re: Fix potential bug in jaotc > > Hi Ludovic.? Good catch noticing this issue!? It may not be obvious, but > the code is written this way because the master version of > MetadataBuilder.java that JDK syncs with is upstream in the github Graal > repo, and it supports multiple versions of JDK, some of which don't > support that field.? For example: > > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Foracle%2Fgraal%2Fblob%2Fd8d41988d0438e28df94ef08d5b8573db8f42a81%2Fcompiler%2Fsrc%2Forg.graalvm.compiler.hotspot.jdk12%2Fsrc%2Forg%2Fgraalvm%2Fcompiler%2Fhotspot%2FHotSpotGraalServices.java%23L38&data=02%7C01%7Cluhenry%40microsoft.com%7C50869631ba974939d17708d7a13322c1%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637155113313150283&sdata=jRRK1eMS0kRYufdygtPQIbru%2F2HN1nhJujDEDS3%2FuSE%3D&reserved=0 > https://nam06.safelinks.protection.outlook.com/?url=http%3A%2F%2Fhg.openjdk.java.net%2Fjdk%2Fjdk12%2Ffile%2F06222165c35f%2Fsrc%2Fhotspot%2Fshare%2Faot%2FaotCompiledMethod.hpp%23l53&data=02%7C01%7Cluhenry%40microsoft.com%7C50869631ba974939d17708d7a13322c1%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637155113313150283&sdata=keaNvbokdui6jUhpnG0veJR4r4rGn8b%2Bb8irOSLUEy0%3D&reserved=0 > > Notice that there is a different HotSpotGraalServices for almost every > JDK version, and that JDK versions older than 13 don't have that > aot_metadata _nul_chk_table_begin field. > > dl > > On 1/7/20 2:16 PM, Ludovic Henry wrote: >> Hello, >> >> As I was learning and reading over the codebase of src/jdk.aot and src/hotspot/share/aot, I discovered a discrepancy which might lead to future bugs. 
The issue is around how some of the metadata is generated and loaded between jaotc and the runtime.
>>
>> In the runtime, `aot_metadata` (src/hotspot/share/aot/aotCompiledMethod.hpp) contains metadata about each method that is compiled in the AOT image - for example, information like the implicit and explicit exception table, or the oop map. Because the size of this data is specific to each method, `aot_metadata` contains fields that store offsets to the `this` pointer, and the data is stored right after the `aot_metadata` structure in the AOT image.
>>
>> The structure of `aot_metadata` is the following:
>>
>> class aot_metadata {
>> int _size;
>> int _code_size;
>> int _entry;
>> int _verified_entry;
>> int _exception_handler_offset;
>> int _deopt_handler_offset;
>> int _stubs_offset;
>> int _frame_size;
>> int _orig_pc_offset;
>> int _unsafe_access;
>>
>> // contain the offset to the `this` pointer
>> int _pc_desc_begin;
>> int _scopes_begin;
>> int _reloc_begin;
>> int _exception_table_begin;
>> int _nul_chk_table_begin;
>> int _oopmap_begin;
>> }
>>
>> When creating the AOT image and generating the information in `jdk.tools.jaotc.MetadataBuilder` (src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java), the `aot_metadata._nul_chk_table_begin` field is conditionally generated. In the case where there are no implicit exceptions in the method, no data/field is generated in the AOT image. However, there is no check for the existence of this field when loading the `aot_metadata` from the AOT image.
>>
>> This issue is mitigated by the fact that `HotSpotMetaData.implicitExceptionBytes` is always set, even if only with an empty `byte[]`. Nonetheless, this field is populated in the runtime in `getMetadata` (in src/hotspot/share/jvmci/jvmciCompilerToVM.cpp), making the initialization and usage of this field far apart and complicating the overall solution.
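The writer/reader mismatch described above can be sketched with a toy model. To be clear, the class, field names and sizes below are invented for illustration only; this is not the real aot_metadata layout:

```java
import java.nio.ByteBuffer;

// Toy model: a header of int "offset" slots followed by variable-length
// sections. If the writer skips a slot that the reader unconditionally
// expects, every slot after it is misread.
public class OffsetSkewDemo {
    // Writer: the implicit-exception slot is only emitted on request,
    // mirroring the conditional patchableInt in the original code.
    static ByteBuffer write(boolean emitImplicitSlot) {
        ByteBuffer buf = ByteBuffer.allocate(12);
        buf.putInt(100);                 // exception table offset
        if (emitImplicitSlot) {
            buf.putInt(200);             // implicit-exception table offset
        }
        buf.putInt(300);                 // oop map offset
        buf.flip();
        return buf;
    }

    // Reader that always expects all three slots, like a loader with no
    // existence check for the conditional field.
    static int readOopMapOffset(ByteBuffer buf) {
        buf.getInt();                    // exception table offset
        buf.getInt();                    // implicit-exception offset (maybe absent!)
        return buf.remaining() >= 4 ? buf.getInt() : -1;
    }

    public static void main(String[] args) {
        System.out.println(readOopMapOffset(write(true)));   // 300: layouts agree
        System.out.println(readOopMapOffset(write(false)));  // -1: reader ran off the end
    }
}
```

Always emitting the slot, as the patch below proposes, makes the writer's layout match what the reader expects unconditionally.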
>> >> The following patch proposes to always generate the `aot_metadata._nul_chk_table_begin` field. It also has the advantage to unify how this field and the others are generated. >> >> diff --git a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java >> index 34ce4d8c8b..fa89d34796 100644 >> --- a/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java >> +++ b/src/jdk.aot/share/classes/jdk.tools.jaotc/src/jdk/tools/jaotc/MetadataBuilder.java >> @@ -98,7 +98,8 @@ final class MetadataBuilder { >> byte[] scopeDesc = metaData.scopesDescBytes(); >> byte[] relocationInfo = metaData.relocBytes(); >> byte[] oopMapInfo = metaData.oopMaps(); >> - byte[] implicitExceptionBytes = HotSpotGraalServices.getImplicitExceptionBytes(metaData); >> + byte[] exceptionBytes = metaData.exceptionBytes(); >> + byte[] implicitExceptionBytes = metaData.implicitExceptionBytes(); >> >> // create a global symbol at this position for this method >> NativeOrderOutputStream metadataStream = new NativeOrderOutputStream(); >> @@ -143,10 +144,7 @@ final class MetadataBuilder { >> NativeOrderOutputStream.PatchableInt scopeOffset = metadataStream.patchableInt(); >> NativeOrderOutputStream.PatchableInt relocationOffset = metadataStream.patchableInt(); >> NativeOrderOutputStream.PatchableInt exceptionOffset = metadataStream.patchableInt(); >> - NativeOrderOutputStream.PatchableInt implictTableOffset = null; >> - if (implicitExceptionBytes != null) { >> - implictTableOffset = metadataStream.patchableInt(); >> - } >> + NativeOrderOutputStream.PatchableInt implictTableOffset = metadataStream.patchableInt(); >> NativeOrderOutputStream.PatchableInt oopMapOffset = metadataStream.patchableInt(); >> metadataStream.align(8); >> >> @@ -160,12 +158,10 @@ final class MetadataBuilder { >> metadataStream.put(relocationInfo).align(8); >> >> 
exceptionOffset.set(metadataStream.position()); >> - metadataStream.put(metaData.exceptionBytes()).align(8); >> + metadataStream.put(exceptionBytes).align(8); >> >> - if (implicitExceptionBytes != null) { >> - implictTableOffset.set(metadataStream.position()); >> - metadataStream.put(implicitExceptionBytes).align(8); >> - } >> + implictTableOffset.set(metadataStream.position()); >> + metadataStream.put(implicitExceptionBytes).align(8); >> >> // oopmaps should be last >> oopMapOffset.set(metadataStream.position()); >> diff --git a/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java b/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java >> index fc54c41312..3acb6191b9 100644 >> --- a/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java >> +++ b/src/jdk.internal.vm.ci/share/classes/jdk.vm.ci.hotspot/src/jdk/vm/ci/hotspot/HotSpotMetaData.java >> @@ -41,6 +41,7 @@ public class HotSpotMetaData { >> scopesDescBytes = new byte[0]; >> relocBytes = new byte[0]; >> exceptionBytes = new byte[0]; >> + implicitExceptionBytes = new byte[0]; >> oopMaps = new byte[0]; >> metadata = new String[0]; >> // ...some of them will be overwritten by the VM: >> >> Thank you, >> >> -- >> Ludovic >> >> >> >> >> >> From igor.ignatyev at oracle.com Sat Jan 25 03:02:07 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Fri, 24 Jan 2020 19:02:07 -0800 Subject: RFR(S) : 8237798 : rewrite vmTestbase/jit/tiered from shell to java Message-ID: <7B34F407-C396-48B5-ADFB-C1076DEDF77D@oracle.com> http://cr.openjdk.java.net/~iignatyev//8237798/webrev.00 > 109 lines changed: 35 ins; 71 del; 3 mod; Hi all, could you please review the small patch which rewrites vmTestbase/jit/tiered from shell to java? the only difference compared to the shell version is explicit and separate executions for +TieredCompilation and -TieredCompilation case. 
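A stand-alone sketch of what "explicit and separate executions" for the two flags could look like. The actual test uses the jtreg test library; the class and payload below are illustrative only:

```java
import java.util.List;

public class TieredRunner {
    // Build the command line for one of the two separate executions.
    static List<String> cmdFor(boolean tiered) {
        String java = System.getProperty("java.home") + "/bin/java";
        return List.of(java,
                tiered ? "-XX:+TieredCompilation" : "-XX:-TieredCompilation",
                "-version"); // placeholder for the actual test payload class
    }

    public static void main(String[] args) throws Exception {
        // One explicit run per TieredCompilation setting.
        for (boolean tiered : new boolean[] {true, false}) {
            Process p = new ProcessBuilder(cmdFor(tiered)).inheritIO().start();
            if (p.waitFor() != 0) {
                throw new AssertionError("run failed for tiered=" + tiered);
            }
        }
        System.out.println("both runs passed");
    }
}
```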
JBS: https://bugs.openjdk.java.net/browse/JDK-8237798 webrev: http://cr.openjdk.java.net/~iignatyev//8237798/webrev.00 testing: the changed test on {linux,windows,macosx}-x64 Thanks, -- Igor From igor.ignatyev at oracle.com Sat Jan 25 03:06:28 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Fri, 24 Jan 2020 19:06:28 -0800 Subject: RFR(S) : 8237800 : rewrite vmTestbase/jit/escape/LockCoarsening from shell to java Message-ID: http://cr.openjdk.java.net/~iignatyev//8237800/webrev.00 > 99 lines changed: 6 ins; 80 del; 13 mod; Hi all, could you please review the rewriting of the vmTestbase/jit/escape/LockCoarsening tests from shell to java? JBS: https://bugs.openjdk.java.net/browse/JDK-8237800 webrev: http://cr.openjdk.java.net/~iignatyev//8237800/webrev.00 testing: vmTestbase/jit/escape/LockCoarsening tests on {linux,windows,macosx}-x64 (w/ tests temporarily removed from the problem list) Thanks, -- Igor From rwestrel at redhat.com Mon Jan 27 08:52:54 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Mon, 27 Jan 2020 09:52:54 +0100 Subject: RFR(S) : 8237798 : rewrite vmTestbase/jit/tiered from shell to java In-Reply-To: <7B34F407-C396-48B5-ADFB-C1076DEDF77D@oracle.com> References: <7B34F407-C396-48B5-ADFB-C1076DEDF77D@oracle.com> Message-ID: <87o8upwaah.fsf@redhat.com> > webrev: http://cr.openjdk.java.net/~iignatyev//8237798/webrev.00 Looks good to me. Roland. From rwestrel at redhat.com Mon Jan 27 08:55:26 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Mon, 27 Jan 2020 09:55:26 +0100 Subject: RFR(S) : 8237800 : rewrite vmTestbase/jit/escape/LockCoarsening from shell to java In-Reply-To: References: Message-ID: <87lfptwa69.fsf@redhat.com> > webrev: http://cr.openjdk.java.net/~iignatyev//8237800/webrev.00 Looks good to me. Roland.
From nils.eliasson at oracle.com Mon Jan 27 10:01:26 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Mon, 27 Jan 2020 11:01:26 +0100 Subject: [15] RFR (M): 8237581: Improve allocation expansion In-Reply-To: <2d79a359-6b0b-c576-518c-388a8ffc4890@oracle.com> References: <2d79a359-6b0b-c576-518c-388a8ffc4890@oracle.com> Message-ID: <921f00aa-ecbc-f838-5a4b-6acb6b62215d@oracle.com> Hi, I split the patch into two parts for easier review: Refactoring: http://cr.openjdk.java.net/~neliasso/8237581/webrev.refactor/ The actual change (on top of the refactoring) http://cr.openjdk.java.net/~neliasso/8237581/webrev.change/ Best regards, Nils Eliasson On 2020-01-24 09:36, Nils Eliasson wrote: > Hi, > > This patch improves expand_allocate_common when there is no use of the > allocation. > > Three cases are improved: > > - With unknown allocation length - don't expand the fast path. No > allocation is needed when it isn't used. NegativeArraySizeException > will still be caught by the slowpath. > - With known length inside the legal range - No fast path or slow path > is needed. The allocation node is removed. > - With known length outside the legal range - only a slow path is needed. > > I also refactored the code a bit, keeping the general structure for > easy diff, but extracting some routines too make it more readable. > > Bug: https://bugs.openjdk.java.net/browse/JDK-8237581 > Webrev: http://cr.openjdk.java.net/~neliasso/8237581/webrev.03/ > > Please review! 
> > Best regards, > Nils Eliasson > > From nils.eliasson at oracle.com Mon Jan 27 12:55:37 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Mon, 27 Jan 2020 13:55:37 +0100 Subject: RFR(S) : 8237800 : rewrite vmTestbase/jit/escape/LockCoarsening from shell to java In-Reply-To: <87lfptwa69.fsf@redhat.com> References: <87lfptwa69.fsf@redhat.com> Message-ID: <7855823d-e88c-c916-1277-4a68b0504fa2@oracle.com> +1 Regards, Nils On 2020-01-27 09:55, Roland Westrelin wrote: >> webrev: http://cr.openjdk.java.net/~iignatyev//8237800/webrev.00 > Looks good to me. > > Roland. > From nils.eliasson at oracle.com Mon Jan 27 12:56:25 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Mon, 27 Jan 2020 13:56:25 +0100 Subject: RFR(S) : 8237798 : rewrite vmTestbase/jit/tiered from shell to java In-Reply-To: <87o8upwaah.fsf@redhat.com> References: <7B34F407-C396-48B5-ADFB-C1076DEDF77D@oracle.com> <87o8upwaah.fsf@redhat.com> Message-ID: +1 Regards, Nils On 2020-01-27 09:52, Roland Westrelin wrote: >> webrev: http://cr.openjdk.java.net/~iignatyev//8237798/webrev.00 > Looks good to me. > > Roland. > From Charlie.Gracie at microsoft.com Mon Jan 27 15:58:43 2020 From: Charlie.Gracie at microsoft.com (Charlie Gracie) Date: Mon, 27 Jan 2020 15:58:43 +0000 Subject: Stack allocation prototype for C2 Message-ID: Hi, We have been prototyping Stack Allocation in C2. We are hoping to start a conversation on our approach and to see if there is interest in the community for us to submit our work as a patch set. We would appreciate any comments, concerns or feedback on our current approach or the idea in general. Thanks! Background on why we are investigating Stack Allocation: While investigating the performance of different workloads (a lot of Scala but some plain Java), we noticed a lot of GC and heap pressure due to transient object allocations. Upon further digging we were able to see that C2's Escape Analysis phase was correctly recognizing these allocations as Non Escaping.
The allocations were not being Scalar Replaced due to a few different restrictions, but the most common was due to control flow merge points before accessing the objects. Here is a simplified example of a case where an allocation cannot be Scalar Replaced even though it is recognized as Non Escaping.

int foo(int x, MyObject[] vals) {
    MyObject bar;
    if (x > 127) {
        bar = new MyObject(x);
    } else {
        bar = vals[x];
    }
    return bar.x;
}

In the above example the allocation could successfully be replaced with a Stack Allocation. We believe that a lot of code has control flow merge points that cause Scalar Replacement to be rejected but Stack Allocation could work for many of them. The obvious benefit is the reduction in GC/heap pressure, but you also get the added benefit of improved cache hit rate since the stack allocated objects will likely be in the cache. There are costs with Stack Allocation, including increased compilation times and increased code cache size. These would have to be carefully measured to fully understand the benefits and costs.

Our current approach:
1. Only allow stack allocation in non-compressed oops mode (-XX:-UseCompressedOops)
1.1 This simplifies the prototype and we believe there are solutions for compressed oops
2. Focus on ParallelGC and G1GC for the prototype
2.1 Support for other collectors needs to be investigated
3. Current prototype is based on JDK 11
3.1 We are going to work on moving it up to tip shortly
3.2 We have been following the changes in tip and this work should apply easily
4. Use the results from Escape Analysis to decide which objects are eligible for Stack Allocation
4.1 Scalar Replacement is the most important, so allow that to happen first
4.2 Arrays cannot be stack allocated in the current prototype.
4.2.1 Straightforward to implement
4.3 Stack allocated objects cannot reference other stack allocated objects.
4.3.1 More difficult than arrays but should be doable
4.4 Reject allocations based on the reasons listed below in the details section
5. In macro expansion, during expand allocation, replace allocations eligible for Stack Allocation.
5.1 Replace the AllocNode with a BoxLockNode with the correct stack location and remove the slow path and other unused IR
5.2 Repurpose BoxLock nodes to represent the stack location for the object
5.3 No Stack Allocation in methods that contain synchronization.
5.3.1 This restriction should be easy to remove.
5.4 Remove write barriers on stores into stack allocated objects
6. Grow the frame size to handle the stack allocated objects
7. Modify OopMap scanning to handle stack allocated objects
7.1 Currently handled via a generic OopClosure for all GCs
7.2 More work required for full GC collector coverage
8. Safepoint support reuses SafePointScalarObject
8.1 De-optimization is working and correctly allocating heap objects and initializing their fields

Current Results:
1. Microbenchmarks using JMH simply replace all allocations using Stack Allocation where Scalar Replacement was rejected
2. Workloads in DaCapo and Renaissance are showing benefits
3. We have not measured any negative impacts on application throughput
4. More detailed analysis on compilation times and code cache size is required

Benchmark                    % Improvement
DaCapo tmt                   15
Renaissance db-shootout      10
Renaissance fj-kmeans        5
Renaissance future-genetic   10
Renaissance movie-lens       5
Renaissance neo4j-analytics  45

Details on why allocations should not be replaced with Stack Allocation:
Not all Non Escaping objects can be Stack Allocated, so there will still be cases where Non Escaping allocations will need to be allocated on the heap:
1. Large arrays and objects should likely be ignored as they could significantly impact the stack frame size
2. Similar to 1, limit the total amount of stack memory per frame that should be used for Stack Allocations
3. Objects and arrays without constant sizes/lengths
4.
Allocations with overlapping lifetimes from different loop iterations
5. A stack allocated object cannot be referenced by any other Non Escaping objects that are not scalar replaced or stack allocated, as that would count as an escape

Here is a simple example of what I was describing in 4.

int method(int[] vals) {
    MyObject o1 = new MyObject(vals[0]);
    for (int i = 1; i < vals.length; ++i) {
        MyObject o2 = new MyObject(vals[i]);
        if (!o1.equals(o2)) {
            o1 = o2;
            ...
        }
        ...
    }
    ...
}

In the above example, if o2 was stack allocated and you ever entered the 'if', from that point on o1 and o2 would always be the same object. This is incorrect, so o2 cannot be stack allocated.

Thanks for reading and we look forward to your feedback!
Charlie Gracie & Nikola Grcevski

From adinn at redhat.com Mon Jan 27 16:26:56 2020 From: adinn at redhat.com (Andrew Dinn) Date: Mon, 27 Jan 2020 16:26:56 +0000 Subject: Stack allocation prototype for C2 In-Reply-To: References: Message-ID: <4141eab3-e921-3b09-94a9-4486049a3d35@redhat.com> Hi Charlie, On 27/01/2020 15:58, Charlie Gracie wrote: > We have been prototyping Stack Allocation in C2. We are hoping to start a conversation on our approach > and to see if there is interest in the community for us to submit our work as a patch set. We would > appreciate any comments, concerns or feedback on our current approach or the idea in general. Thanks! > > Background on why we are investigating Stack Allocation: > . . . > Thanks for reading and we look forward to your feedback! > Charlie Gracie & Nikola Grcevski Thanks for offering this. It sounds very interesting. Looking forward to seeing your implementation. regards, Andrew Dinn ----------- Senior Principal Software Engineer Red Hat UK Ltd Registered in England and Wales under Company Registration No.
03798903 Directors: Michael Cunningham, Michael ("Mike") O'Neill From simonisv at amazon.de Mon Jan 27 23:18:39 2020 From: simonisv at amazon.de (Simonis, Volker) Date: Mon, 27 Jan 2020 23:18:39 +0000 Subject: Stack allocation prototype for C2 In-Reply-To: References: Message-ID: <5870f82f-874e-4a95-978f-c2f2b97e88fd@amazon.de> On 27.01.2020 17:00, Charlie Gracie wrote: > > Hi, > > We have been prototyping Stack Allocation in C2. We are hoping to start a conversation on our approach > and to see if there is interest in the community for us to submit our work as a patch set. We would > appreciate any comments, concerns or feedback on our current approach or the idea in general. Thanks! > Hi Charlie, Nikola this looks really very interesting and the performance gains are impressive! If you need help with uploading a version of your patch to cr.openjdk.java.net please let me know. I'll be happy to assist. Looking forward to hearing Nikola's talk about the details at FOSDEM. Safe travels, Volker > Background on why we are investigating Stack Allocation: > While investigating the performance of different workloads (a lot of Scala but some plain Java), we noticed > a lot of GC and heap pressure due to transient object allocations. Upon further digging we were able to see > that C2's Escape Analysis phase was correctly recognizing these allocations as Non Escaping. The allocations > were not being Scalar Replaced due to a few different restrictions but the most common was due to control > flow merge points before accessing the objects. Here is a simplified example of a case where an allocation > cannot be Scalar Replaced even though it is recognized as Non Escaping. > > int foo(int x, MyObject[] vals) { > MyObject bar; > if (x > 127) { > bar = new MyObject(x); > } else { > bar = vals[x]; > } > return bar.x; > } > > In the above example the allocation could successfully be replaced with a Stack Allocation.
We believe that > a lot of code has control flow merge points that cause Scalar Replacement to be rejected but Stack Allocation > could work for many of them. The obvious benefit is the reduction in GC/heap pressure, but you also get the > added benefit of improved cache hit rate since the stack allocated objects will likely be in the cache memory. > There are costs with Stack Allocation including increased compilation times and increased code cache size. > These would have to be carefully measured fully understand the benefits and costs. > > Our current approach: > 1. Only allow stack allocation in non-compressed oops mode (-XX:-UseCompressedOops) > 1.1 This simplifies the prototype and we believe there are solutions for compressed oops > 2. Focus on ParallelGC and G1GC for prototype > 2.1 Support for other collectors needs to be investigated > 3. Current prototype is based on JDK 11 > 3.1 We are going to work on moving it up to tip shortly > 3.2 We have been following the changes in tip and this work should apply easily > 4. Use the results from Escape Analysis to decide which objects are eligible for Stack Allocation > 4.1 Scalar Replacement is the most important, so allow that to happen first > 4.2 Arrays cannot be stack allocated in the current prototype. > 4.2.1 Straight forward to implement > 4.3 Stack allocated objects cannot reference other stack allocated objects. > 4.3.1 More difficult than arrays but should be doable > 4.4 Reject allocations based on the reasons listed below in the details section > 5. In macro expansion during expand allocation replace allocations eligible for Stack Allocation. > 5.1 Replace the AllocNode with a BoxLockNode with the correct stack location and remove the slow > path and other unused IR > 5.2 Repurpose BoxLock nodes to represent the stack location for the object > 5.3 No Stack Allocation in methods that contain synchronization. > 5.3.1 This restriction should be easy to remove. 
> 5.4 Remove write barriers on stores into stack allocated objects > 6. Grow the frame size to handle the stack allocated objects > 7. Modify OopMap scanning to handle stack allocated objects > 7.1 Currently handled via a generic OopClosure for all GCs > 7.2 More work required for full GC collector coverage > 8. Safepoint support reuses SafePointScalarObject > 8.1 De-optimization is working and correctly allocating heap objects and initializing their fields > > Current Results: > 1. Microbenchmarks using JMH simply replace all allocations using Stack Allocation where Scalar Replacement > 2. Workloads in DeCapo and Renaissance are showing benefits > 3. We have not measured any negative impacts on application throughput > 4. More detailed analysis on compilation times and code cache size are required > > Benchmark % Improvement > DeCapo tmt 15 > Renaissance db-shootout 10 > Renaissance fj-kmeans 5 > Renaissance future-genetic 10 > Renaissance movie-lens 5 > Renaissance neo4j-analytics 45 > > Details on why allocations should not be replaced with Stack Allocation: > Not all Non Escaping objects can be Stack Allocated so there will still be cases where Non Escaping allocations > will need to be allocated on the heap: > 1. Large arrays and objects should likely be ignored as they could significantly impact the stack frame size > 2. Similar to 1 limit the total amount of stack memory per frame that should be used for Stack Allocations > 3. Objects and arrays without constant sizes/lengths > 4. Allocations with over lapping lifetimes from different loop iterations > 5. A stack allocated object cannot be referenced by any other Non Escaping objects that are not scalar replaced > or stack allocated as that would count as an escape > > Here is a simple example of what I was describing in 4. > int method(int[] vals) { > MyObject o1 = new MyObject(vals[0]); > for (int i =1; i < vals.length; ++) { > MyObject o2 = new MyObject(vals[i]); > if (!o1.equals(o2)) { > o1 = o2; > ? 
> } > ... > } > ... > } > > In the above example, if o2 was stack allocated and you ever entered the 'if', from that point on o1 and o2 would > always be the same object. This is incorrect, so o2 cannot be stack allocated. > > Thanks for reading and we look forward to your feedback! > Charlie Gracie & Nikola Grcevski > > Amazon Development Center Germany GmbH Krausenstr. 38 10117 Berlin Managing Directors: Christian Schlaeger, Jonathan Weiss Registered at the Charlottenburg Local Court (Amtsgericht Charlottenburg) under HRB 149173 B Registered office: Berlin VAT ID: DE 289 237 879 From igor.ignatyev at oracle.com Mon Jan 27 23:37:58 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Mon, 27 Jan 2020 15:37:58 -0800 Subject: RFR(S) : 8237800 : rewrite vmTestbase/jit/escape/LockCoarsening from shell to java In-Reply-To: <7855823d-e88c-c916-1277-4a68b0504fa2@oracle.com> References: <87lfptwa69.fsf@redhat.com> <7855823d-e88c-c916-1277-4a68b0504fa2@oracle.com> Message-ID: Nils, Roland, thanks for review, pushed. -- Igor > On Jan 27, 2020, at 4:55 AM, Nils Eliasson wrote: > > +1 > > Regards, > Nils > > On 2020-01-27 09:55, Roland Westrelin wrote: >>> webrev: http://cr.openjdk.java.net/~iignatyev//8237800/webrev.00 >> Looks good to me. >> >> Roland. >> > From igor.ignatyev at oracle.com Mon Jan 27 23:38:29 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Mon, 27 Jan 2020 15:38:29 -0800 Subject: RFR(S) : 8237798 : rewrite vmTestbase/jit/tiered from shell to java In-Reply-To: References: <7B34F407-C396-48B5-ADFB-C1076DEDF77D@oracle.com> <87o8upwaah.fsf@redhat.com> Message-ID: <2A8853BB-19AF-4EAC-AA40-D2A62141B09A@oracle.com> Nils, Roland, thanks for review, pushed. -- Igor > On Jan 27, 2020, at 4:56 AM, Nils Eliasson wrote: > > +1 > > Regards, > Nils > > On 2020-01-27 09:52, Roland Westrelin wrote: >>> webrev: http://cr.openjdk.java.net/~iignatyev//8237798/webrev.00 >> Looks good to me. >> >> Roland. >> > From igor.ignatyev at oracle.com Mon Jan 27 23:39:48 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Mon, 27 Jan 2020 15:39:48 -0800 Subject: RFR(S) : 8237787 : rewrite vmTestbase/vm/compiler/CodeCacheInfo* from shell to java In-Reply-To: <9079dc9a-65d4-92fd-513f-19f87bfc6404@oracle.com> References: <8AB60BC1-0228-43FA-9D44-3E1DDAF8A129@oracle.com> <4d1d2e33-019d-36ed-9fed-b4bc177d59bc@oracle.com> <4F60F060-7848-4050-A47F-C97590ECBFF4@oracle.com> <9079dc9a-65d4-92fd-513f-19f87bfc6404@oracle.com> Message-ID: <686A9255-BF7A-4956-BFD4-C969203347C6@oracle.com> @Katya, thank you! @list can I get a 2nd Review? -- Igor > On Jan 24, 2020, at 3:26 PM, Ekaterina Pavlova wrote: > > Seems to be good now. > > thanks, > -katya > > On 1/24/20 3:06 PM, Igor Ignatyev wrote: >> no, you are right. it seems I did not upload the latest version of my patch. fixed and uploaded .01 which has -Xcomp -- http://cr.openjdk.java.net/~iignatyev//8237787/webrev.01 >> -- Igor >>> On Jan 24, 2020, at 8:18 AM, Ekaterina Pavlova > wrote: >>> >>> On 1/23/20 2:21 PM, Igor Ignatyev wrote: >>>> http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 >>>>> 177 lines changed: 56 ins; 117 del; 4 mod; 7 >>>> Hi all, >>>> could you please review the patch which rewrites vmTestbase/vm/compiler/CodeCacheInfo and CodeCacheInfoOnCompilation tests from shell to java? >>>> the only difference b/w java and shell versions are: >>>> - instead of skipping execution if there isn't -Xcomp, CodeCacheInfoOnCompilation adds -Xcomp to the command line >>> >>> Igor, I don't see -Xcomp being added. Am I missing something?
>>> >>>> - instead of checking the value of the SegmentedCodeCache flag, CodeCacheInfo runs two cases w/ +SegmentedCodeCache and w/ -SegmentedCodeCache >>>> webrev: http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 >>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8237787 >>>> testing: the changed tests on linux-x64, windows-x64, and macosx-x64 >>>> Thanks, >>>> -- Igor >>> > From rwestrel at redhat.com Tue Jan 28 10:43:24 2020 From: rwestrel at redhat.com (Roland Westrelin) Date: Tue, 28 Jan 2020 11:43:24 +0100 Subject: RFR(S): 8237837: Shenandoah: assert(mem == __null) failed: only one safepoint Message-ID: <87imkvx3n7.fsf@redhat.com> http://cr.openjdk.java.net/~roland/8237837/webrev.00/ At barrier expansion, raw memory needs to be correctly wired around expanded barriers. For that, the raw memory subgraph is walked starting from the Root node. When there's an infinite loop, the loop is connected to the Root node through a NeverBranch. So that logic looks for a NeverBranch from the Root node and if it finds one, it needs to find the memory state in the loop. The current logic looks for the SafePoint in the infinite loop. That breaks when the infinite loop has a SATB barrier because then the loop body has a runtime call for which is_SafePoint() returns true (even though the call doesn't safepoint). I've reworked the logic so it ignores non-safepointing runtime calls and is more robust for infinite loops. Roland. From nils.eliasson at oracle.com Tue Jan 28 11:15:50 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Tue, 28 Jan 2020 12:15:50 +0100 Subject: [15] RFR (M): 8237581: Improve allocation expansion In-Reply-To: <921f00aa-ecbc-f838-5a4b-6acb6b62215d@oracle.com> References: <2d79a359-6b0b-c576-518c-388a8ffc4890@oracle.com> <921f00aa-ecbc-f838-5a4b-6acb6b62215d@oracle.com> Message-ID: <76b9ffce-8167-6337-cefd-24c18813ce18@oracle.com> Hi, I added a simple micro that triggers the optimization.
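For readers without the webrev handy, the shape of such a micro might look like the following. This is a reconstruction from the benchmark names in the JMH output below, not the actual source in the webrev:

```java
// Plain-Java sketch of the two cases the micro exercises: an unused
// array allocation with a constant length, and one whose length is
// only known at run time.
public class ArrayAllocationSketch {
    // Constant length: C2 can prove the allocation is dead and in range,
    // so the whole allocation can be removed.
    static void eliminateArrayConstLength() {
        byte[] unused = new byte[128];
    }

    // Variable length: the allocation is still unused, but the range
    // check can only happen at run time; a negative length must still
    // throw, which is what the retained slow path is for.
    static void eliminateArrayVarLength(int size) {
        byte[] unused = new byte[size];
    }

    public static void main(String[] args) {
        eliminateArrayConstLength();
        eliminateArrayVarLength(128);
        System.out.println("ok");
    }
}
```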
Micro: http://cr.openjdk.java.net/~neliasso/8237581/webrev.micro/

On my workstation I get these numbers:

JDK/JDK:

Benchmark                                  (size)  Mode  Cnt   Score  Error  Units
ArrayAllocation.eliminateArrayConstLength     128  avgt        61.178         ns/op
ArrayAllocation.eliminateArrayVarLength       128  avgt        55.109         ns/op

JDK/JDK + patch for 8237581:

Benchmark                                  (size)  Mode  Cnt   Score  Error  Units
ArrayAllocation.eliminateArrayConstLength     128  avgt         0.841         ns/op
ArrayAllocation.eliminateArrayVarLength       128  avgt         1.897         ns/op

Speedup 30-70x

Please review, Regards, Nils On 2020-01-27 11:01, Nils Eliasson wrote: > Hi, > > I split the patch into two parts for easier review: > > Refactoring: > > http://cr.openjdk.java.net/~neliasso/8237581/webrev.refactor/ > > The actual change (on top of the refactoring) > > http://cr.openjdk.java.net/~neliasso/8237581/webrev.change/ > > > Best regards, > Nils Eliasson > > On 2020-01-24 09:36, Nils Eliasson wrote: >> Hi, >> >> This patch improves expand_allocate_common when there is no use of >> the allocation. >> >> Three cases are improved: >> >> - With unknown allocation length - don't expand the fast path. No >> allocation is needed when it isn't used. NegativeArraySizeException >> will still be caught by the slowpath. >> - With known length inside the legal range - No fast path or slow >> path is needed. The allocation node is removed. >> - With known length outside the legal range - only a slow path is >> needed. >> >> I also refactored the code a bit, keeping the general structure for >> easy diff, but extracting some routines to make it more readable. >> >> Bug: https://bugs.openjdk.java.net/browse/JDK-8237581 >> Webrev: http://cr.openjdk.java.net/~neliasso/8237581/webrev.03/ >> >> Please review!
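[Editorial note: the three improved cases can be sketched in Java. This is a hypothetical illustration, not code from the webrev; the class and method names are made up and only echo the micro's naming.]

```java
// Hypothetical shapes for the three cases; the allocation result is unused,
// so C2 can remove or shrink the expanded allocation code.
public class DeadAllocations {
    static int constLength() {
        byte[] unused = new byte[128]; // length known and legal: the
        return 1;                      // allocation node is removed entirely
    }

    static int varLength(int len) {
        byte[] unused = new byte[len]; // length unknown: the fast path is not
        return 2;                      // expanded; NegativeArraySizeException
    }                                  // is still thrown by the slow path

    static int illegalLength() {
        byte[] unused = new byte[-1];  // length known but illegal: only the
        return 3;                      // throwing slow path is kept
    }

    public static void main(String[] args) {
        System.out.println(constLength());   // 1
        System.out.println(varLength(16));   // 2
        try {
            varLength(-5);
        } catch (NegativeArraySizeException e) {
            System.out.println("exception preserved");
        }
    }
}
```

Note that the interpreter and C1 still perform the dead allocations; the point is only what C2's expand_allocate_common has to emit.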
>> >> Best regards, >> Nils Eliasson >> >> > From Pengfei.Li at arm.com Tue Jan 28 13:40:58 2020 From: Pengfei.Li at arm.com (Pengfei Li) Date: Tue, 28 Jan 2020 13:40:58 +0000 Subject: [aarch64-port-dev ] RFR(S): 8237524: AArch64: String.compareTo() may return incorrect result In-Reply-To: <56670d62-defa-4192-c084-bfa130adde41@redhat.com> References: <5111fcc0-9c3a-6f1b-9d6a-b666a41f4453@redhat.com> <56670d62-defa-4192-c084-bfa130adde41@redhat.com> Message-ID: Hi, Can any committer help push this fix? http://cr.openjdk.java.net/~pli/rfr/8237524/webrev.00/ Patch is reviewed by aph. See below review thread. > -----Original Message----- > From: Andrew Haley > Sent: Tuesday, January 21, 2020 18:13 > To: Nick Gasson ; Pengfei Li ; > hotspot-compiler-dev at openjdk.java.net; aarch64-port- > dev at openjdk.java.net > Cc: nd > Subject: Re: [aarch64-port-dev ] RFR(S): 8237524: AArch64: > String.compareTo() may return incorrect result > > On 1/21/20 10:11 AM, Nick Gasson wrote: > > On 21/01/2020 17:33, Andrew Haley wrote: > >> On 1/21/20 8:40 AM, Pengfei Li wrote: > >>> Root cause is in the AArch64 String.compareTo intrinsics. In the > >>> code of MacroAssembler::string_compare(), we have a fast path that > >>> compares the addresses of initial bytes of the two arrays. If the > >>> result is equal, we skip the inflation of the Latin one and the byte-wise > comparison part. > >>> But we shouldn't do in this way if the encodings of the two strings > >>> are different (the LU/UL cases). This patch removes the incorrect > >>> fast path check. > >> > >> Is this really a bug? The test case is violating the protections > >> provided by the Java runtime, and the programmer deserves everything > >> that happens. > > > > Even if it's not strictly a bug, the pointer equality checks in the > > LU/UL branches are unnecessary: the only time they succeed is when the > > Strings were constructed in this backdoor way, and in that case they > > give the wrong result. 
So it seems better to just remove them. > > True enough; OK. > Thanks, Pengfei From ron.pressler at oracle.com Tue Jan 28 13:45:46 2020 From: ron.pressler at oracle.com (Ron Pressler) Date: Tue, 28 Jan 2020 13:45:46 +0000 Subject: Stack allocation prototype for C2 Message-ID: ? Hi. Reading between the lines, the difference between scalar replacement and what you call "stack allocation" is that the latter creates an oop pointing into the stack frame. Is that correct? If so, there is another consideration to take into account. Loom continuations can relocate stack frames in memory, and currently, compiled frames do not contain pointers into the stack. This relocation can potentially happen at any safepoint, but the continuation algorithm has two modes. In the slow mode -- used when there are interpreted frames or, for forced preemption, at any arbitrary safepoint -- the stack frames are parsed and walked, and so pointers in the stack that point into the stack could be patched (I assume, from your description that you do not allow oops outside the stack to point into the stack). In the fast mode, we avoid parsing the stack and we just copy the frames as-is, and it can occur at any nmethod call. If your "stack-oops" can currently live across nmethod calls, how much would this optimization be compromised if you disallowed that (i.e. stack oops to stack-allocated objects would not live across nmethod calls)? Also, if my understanding is correct, I'm not sure you need to limit the optimization to -UseCompressedOops. I believe that even when compressed oops are used, stack oops can be wide; i.e. we do find frames with mixed narrow and wide oops in them, and the oopmaps allow for that possibility. Ron On 27 January 2020 at 15:58:43, Charlie Gracie wrote: > > Hi, > > We have been prototyping Stack Allocation in C2. We are hoping to start a conversation on our approach > and to see if there is interest in the community for us to submit our work as a patch set. 
We would > appreciate any comments, concerns or feedback on our current approach or the idea in general. Thanks! > > Background on why we are investigating Stack Allocation: > While investigating the performance of different workloads (a lot of Scala but some plain Java), we noticed > a lot of GC and heap pressure due to transient object allocations. Upon further digging we were able to see > that C2's Escape Analysis phase was correctly recognizing these allocations as Non Escaping. The allocations > were not being Scalar Replaced due to a few different restrictions but the most common was due to control > flow merge points before accessing the objects. Here is a simplified example of a case where an allocation > cannot be Scalar Replaced even though it is recognized as Non Escaping. > > int foo(int x, MyObject[] vals) { > MyObject bar; > if (x > 127) { > bar = new MyObject(x); > } else { > bar = vals[x]; > } > return bar.x; > } > > In the above example the allocation could successfully be replaced with a Stack Allocation. We believe that > a lot of code has control flow merge points that cause Scalar Replacement to be rejected but Stack Allocation > could work for many of them. The obvious benefit is the reduction in GC/heap pressure, but you also get the > added benefit of improved cache hit rate since the stack allocated objects will likely be in the cache memory. > There are costs with Stack Allocation including increased compilation times and increased code cache size. > These would have to be carefully measured to fully understand the benefits and costs. > > Our current approach: > 1. Only allow stack allocation in non-compressed oops mode (-XX:-UseCompressedOops) > 1.1 This simplifies the prototype and we believe there are solutions for compressed oops > 2. Focus on ParallelGC and G1GC for prototype > 2.1 Support for other collectors needs to be investigated > 3.
Current prototype is based on JDK 11 > 3.1 We are going to work on moving it up to tip shortly > 3.2 We have been following the changes in tip and this work should apply easily > 4. Use the results from Escape Analysis to decide which objects are eligible for Stack Allocation > 4.1 Scalar Replacement is the most important, so allow that to happen first > 4.2 Arrays cannot be stack allocated in the current prototype. > 4.2.1 Straightforward to implement > 4.3 Stack allocated objects cannot reference other stack allocated objects. > 4.3.1 More difficult than arrays but should be doable > 4.4 Reject allocations based on the reasons listed below in the details section > 5. In macro expansion during expand allocation replace allocations eligible for Stack Allocation. > 5.1 Replace the AllocNode with a BoxLockNode with the correct stack location and remove the slow > path and other unused IR > 5.2 Repurpose BoxLock nodes to represent the stack location for the object > 5.3 No Stack Allocation in methods that contain synchronization. > 5.3.1 This restriction should be easy to remove. > 5.4 Remove write barriers on stores into stack allocated objects > 6. Grow the frame size to handle the stack allocated objects > 7. Modify OopMap scanning to handle stack allocated objects > 7.1 Currently handled via a generic OopClosure for all GCs > 7.2 More work required for full GC collector coverage > 8. Safepoint support reuses SafePointScalarObject > 8.1 De-optimization is working and correctly allocating heap objects and initializing their fields > > Current Results: > 1. Microbenchmarks using JMH simply replace all allocations using Stack Allocation where Scalar Replacement > 2. Workloads in DaCapo and Renaissance are showing benefits > 3. We have not measured any negative impacts on application throughput > 4.
More detailed analysis on compilation times and code cache size are required > > Benchmark % Improvement > DaCapo tmt 15 > Renaissance db-shootout 10 > Renaissance fj-kmeans 5 > Renaissance future-genetic 10 > Renaissance movie-lens 5 > Renaissance neo4j-analytics 45 > > Details on why allocations should not be replaced with Stack Allocation: > Not all Non Escaping objects can be Stack Allocated so there will still be cases where Non Escaping allocations > will need to be allocated on the heap: > 1. Large arrays and objects should likely be ignored as they could significantly impact the stack frame size > 2. Similar to 1 limit the total amount of stack memory per frame that should be used for Stack Allocations > 3. Objects and arrays without constant sizes/lengths > 4. Allocations with overlapping lifetimes from different loop iterations > 5. A stack allocated object cannot be referenced by any other Non Escaping objects that are not scalar replaced > or stack allocated as that would count as an escape > > Here is a simple example of what I was describing in 4. > int method(int[] vals) { > MyObject o1 = new MyObject(vals[0]); > for (int i = 1; i < vals.length; ++i) { > MyObject o2 = new MyObject(vals[i]); > if (!o1.equals(o2)) { > o1 = o2; > ... > } > ... > } > ... > } > > In the above example if o2 was stack allocated and you ever entered the 'if' from that point on o1 and o2 would > always be the same object. This is incorrect so o2 cannot be stack allocated. > > Thanks for reading and we look forward to your feedback!
> Charlie Gracie & Nikola Grcevski > > From claes.redestad at oracle.com Tue Jan 28 15:17:29 2020 From: claes.redestad at oracle.com (Claes Redestad) Date: Tue, 28 Jan 2020 16:17:29 +0100 Subject: [15] RFR (M): 8237581: Improve allocation expansion In-Reply-To: <76b9ffce-8167-6337-cefd-24c18813ce18@oracle.com> References: <2d79a359-6b0b-c576-518c-388a8ffc4890@oracle.com> <921f00aa-ecbc-f838-5a4b-6acb6b62215d@oracle.com> <76b9ffce-8167-6337-cefd-24c18813ce18@oracle.com> Message-ID: <4d06f2af-ab40-8158-d984-ae87e4233c02@oracle.com> On 2020-01-28 12:15, Nils Eliasson wrote: > Hi, > > I added a simple micro that triggers the optimization. > > Micro: > http://cr.openjdk.java.net/~neliasso/8237581/webrev.micro/ > > On my workstation I get these numbers: > > JDK/JDK: > > Benchmark (size) Mode Cnt Score Error Units > ArrayAllocation.eliminateArrayConstLength 128 avgt 61.178 ns/op > ArrayAllocation.eliminateArrayVarLength 128 avgt 55.109 ns/op > > JDK/JDK + patch for 8237581: > > Benchmark (size) Mode Cnt Score Error Units > ArrayAllocation.eliminateArrayConstLength 128 avgt 0.841 ns/op > ArrayAllocation.eliminateArrayVarLength 128 avgt 1.897 ns/op > > Speedup 30-70x Nice! Micro looks good and to the point. I think the {}'s around "128" are redundant for a single parameter value. Can't really review the compiler code changes but it all looks reasonable to me. /Claes From richard.reingruber at sap.com Tue Jan 28 15:31:31 2020 From: richard.reingruber at sap.com (Reingruber, Richard) Date: Tue, 28 Jan 2020 15:31:31 +0000 Subject: VM_EnterInterpOnlyMode: why make compiled methods on stack not_entrant? Message-ID: Hi, I noticed that VM_EnterInterpOnlyMode makes all compiled methods on stack not_entrant. To me this seems unnecessary. I think it would be sufficient to patch the return pc of compiled frames and let them return to their deopt handler.
Or what would be the reason to make the compiled methods not_entrant? VM_EnterInterpOnlyMode::doit(): all compiled methods on stack that are not native methods get marked. Deoptimization::deoptimize_all_marked(): all marked methods are made not_entrant. Maybe this is for historical reasons? If I remember correctly, many years ago, deoptimizing a compiled frame used to require making the corresponding nmethod not_entrant. And making a nmethod not_entrant meant overriding its code with a slide of nops that led to the deopt handler. I'd like to change this and just apply Deoptimization::deoptimize() on every compiled frame on stack. I've done this locally and tested the change without issues. Thanks, Richard. From ekaterina.pavlova at oracle.com Tue Jan 28 18:33:35 2020 From: ekaterina.pavlova at oracle.com (Ekaterina Pavlova) Date: Tue, 28 Jan 2020 10:33:35 -0800 Subject: RFR(T/XS): 8236211 [Graal] compiler/graalunit/GraphTest.java is skipped in all testing In-Reply-To: References: Message-ID: <7eb0c680-08fd-211d-5ee3-ce52cbb468e3@oracle.com> Hi, can I please get one review? thanks, -katya On 1/24/20 8:12 AM, Ekaterina Pavlova wrote: > Hi All, > > Please review the small patch which adds the --add-opens option required for > org.graalvm.compiler.graph tests (compiler/graalunit/GraphTest.java). > Also removed unnecessary "@requires vm.graal.enabled" from compiler/graalunit/NodesTest.java. > Both tests will start running in graal tiers after this fix > (these tests are now skipped because graalunit tests are tested only in non-Graal-as-JIT mode). > > JBS: https://bugs.openjdk.java.net/browse/JDK-8236211 > webrev: http://cr.openjdk.java.net/~epavlova/8236211/webrev.00/index.html >
testing: graal tiers > > > thanks, > -katya From Charlie.Gracie at microsoft.com Tue Jan 28 22:45:42 2020 From: Charlie.Gracie at microsoft.com (Charlie Gracie) Date: Tue, 28 Jan 2020 22:45:42 +0000 Subject: [EXTERNAL] Stack allocation prototype for C2 Message-ID: <50FAFF89-E32C-4155-965D-E8CEA9C299F2@microsoft.com> Hi Volker, That would be great! We will reach out at FOSDEM to learn about uploading a patch to cr.openjdk.java.net. After we return from Europe we will look into getting the patch uploaded for viewing, suggestions, refinements and testing. Cheers, Charlie From: "Simonis, Volker" Date: Monday, January 27, 2020 at 6:18 PM To: Nikola Grcevski , "hotspot-compiler-dev at openjdk.java.net" , Charlie Gracie Subject: [EXTERNAL] Stack allocation prototype for C2 On 27.01.2020 17:00, Charlie Gracie wrote: > > Hi, > > We have been prototyping Stack Allocation in C2. We are hoping to start a conversation on our approach > and to see if there is interest in the community for us to submit our work as a patch set. We would > appreciate any comments, concerns or feedback on our current approach or the idea in general. Thanks! > Hi Charlie, Nikola, this looks really very interesting and the performance gains are impressive! If you need help with uploading a version of your patch to cr.openjdk.java.net please let me know. I'll be happy to assist. Looking forward to hearing Nikola's talk about the details at FOSDEM. Safe travels, Volker > Background on why we are investigating Stack Allocation: > While investigating the performance of different workloads (a lot of Scala but some plain Java), we noticed > a lot of GC and heap pressure due to transient object allocations. Upon further digging we were able to see > that C2's Escape Analysis phase was correctly recognizing these allocations as Non Escaping.
The allocations > were not being Scalar Replaced due to a few different restrictions but the most common was due to control > flow merge points before accessing the objects. Here is a simplified example of a case where an allocation > cannot be Scalar Replaced even though it is recognized as Non Escaping. > > int foo(int x, MyObject[] vals) { > MyObject bar; > if (x > 127) { > bar = new MyObject(x); > } else { > bar = vals[x]; > } > return bar.x; > } > > In the above example the allocation could successfully be replaced with a Stack Allocation. We believe that > a lot of code has control flow merge points that cause Scalar Replacement to be rejected but Stack Allocation > could work for many of them. The obvious benefit is the reduction in GC/heap pressure, but you also get the > added benefit of improved cache hit rate since the stack allocated objects will likely be in the cache memory. > There are costs with Stack Allocation including increased compilation times and increased code cache size. > These would have to be carefully measured to fully understand the benefits and costs. > > Our current approach: > 1. Only allow stack allocation in non-compressed oops mode (-XX:-UseCompressedOops) > 1.1 This simplifies the prototype and we believe there are solutions for compressed oops > 2. Focus on ParallelGC and G1GC for prototype > 2.1 Support for other collectors needs to be investigated > 3. Current prototype is based on JDK 11 > 3.1 We are going to work on moving it up to tip shortly > 3.2 We have been following the changes in tip and this work should apply easily > 4. Use the results from Escape Analysis to decide which objects are eligible for Stack Allocation > 4.1 Scalar Replacement is the most important, so allow that to happen first > 4.2 Arrays cannot be stack allocated in the current prototype. > 4.2.1 Straightforward to implement >
4.3 Stack allocated objects cannot reference other stack allocated objects. > 4.3.1 More difficult than arrays but should be doable > 4.4 Reject allocations based on the reasons listed below in the details section > 5. In macro expansion during expand allocation replace allocations eligible for Stack Allocation. > 5.1 Replace the AllocNode with a BoxLockNode with the correct stack location and remove the slow > path and other unused IR > 5.2 Repurpose BoxLock nodes to represent the stack location for the object > 5.3 No Stack Allocation in methods that contain synchronization. > 5.3.1 This restriction should be easy to remove. > 5.4 Remove write barriers on stores into stack allocated objects > 6. Grow the frame size to handle the stack allocated objects > 7. Modify OopMap scanning to handle stack allocated objects > 7.1 Currently handled via a generic OopClosure for all GCs > 7.2 More work required for full GC collector coverage > 8. Safepoint support reuses SafePointScalarObject > 8.1 De-optimization is working and correctly allocating heap objects and initializing their fields > > Current Results: > 1. Microbenchmarks using JMH simply replace all allocations using Stack Allocation where Scalar Replacement > 2. Workloads in DaCapo and Renaissance are showing benefits > 3. We have not measured any negative impacts on application throughput > 4. More detailed analysis on compilation times and code cache size are required > > Benchmark % Improvement > DaCapo tmt 15 > Renaissance db-shootout 10 > Renaissance fj-kmeans 5 > Renaissance future-genetic 10 > Renaissance movie-lens 5 > Renaissance neo4j-analytics
45 > > Details on why allocations should not be replaced with Stack Allocation: > Not all Non Escaping objects can be Stack Allocated so there will still be cases where Non Escaping allocations > will need to be allocated on the heap: > 1. Large arrays and objects should likely be ignored as they could significantly impact the stack frame size > 2. Similar to 1 limit the total amount of stack memory per frame that should be used for Stack Allocations > 3. Objects and arrays without constant sizes/lengths > 4. Allocations with overlapping lifetimes from different loop iterations > 5. A stack allocated object cannot be referenced by any other Non Escaping objects that are not scalar replaced > or stack allocated as that would count as an escape > > Here is a simple example of what I was describing in 4. > int method(int[] vals) { > MyObject o1 = new MyObject(vals[0]); > for (int i = 1; i < vals.length; ++i) { > MyObject o2 = new MyObject(vals[i]); > if (!o1.equals(o2)) { > o1 = o2; > ... > } > ... > } > ... > } > > In the above example if o2 was stack allocated and you ever entered the 'if' from that point on o1 and o2 would > always be the same object. This is incorrect so o2 cannot be stack allocated. > > Thanks for reading and we look forward to your feedback! > Charlie Gracie & Nikola Grcevski > > Amazon Development Center Germany GmbH Krausenstr.
38 10117 Berlin Geschaeftsfuehrung: Christian Schlaeger, Jonathan Weiss Eingetragen am Amtsgericht Charlottenburg unter HRB 149173 B Sitz: Berlin Ust-ID: DE 289 237 879 From Nikola.Grcevski at microsoft.com Tue Jan 28 22:50:33 2020 From: Nikola.Grcevski at microsoft.com (Nikola Grcevski) Date: Tue, 28 Jan 2020 22:50:33 +0000 Subject: Stack allocation prototype for C2 In-Reply-To: References: Message-ID: Hi Ron, > Reading between the lines, the difference between scalar replacement and what you call "stack allocation" is that the latter creates an oop pointing into the stack frame. Is that correct? Yes, you are correct. There will be oops on the stack that will point to stack locations in the same nmethod frame. We used the BoxLockNode in C2 and these stack allocated objects end up at the bottom of the frame. > If so, there is another consideration to take into account. Loom continuations can relocate stack frames in memory, and currently, compiled frames do not contain pointers into the stack. This relocation can potentially happen at any safepoint, but the continuation algorithm has two modes. In the slow mode -- used when there are interpreted frames or, for forced preemption, at any arbitrary safepoint -- the stack frames are parsed and walked, and so pointers in the stack that point into the stack could be patched (I assume, from your description that you do not allow oops outside the stack to point into the stack). In the fast mode, we avoid parsing the stack and we just copy the frames as-is, and it can occur at any nmethod call. If your "stack-oops" can currently live across nmethod calls, how much would this optimization be compromised if you disallowed that (i.e. stack oops to stack-allocated objects would not live across nmethod calls)? Thanks so much for this insight. It seems that we would have to add some code to fix-up/patch the stack oops only in the fast relocation mode, unless that's a dealbreaker? 
We currently describe the stack allocated object for deoptimization purposes on every safepoint where the objects are live, so perhaps we can use the same information to fix-up the stack on relocation. We initially disallowed stack allocating any objects that lived across safepoints until we implemented the deoptimization support, so yes the optimization would definitely be compromised with that limitation, quite a bit of opportunities will be rejected. Some of the benchmarks we quoted still improve with this limitation around safepoints, but we'd have to remeasure for final numbers. > Also, if my understanding is correct, I'm not sure you need to limit the optimization to -UseCompressedOops. I believe that even when compressed oops are used, stack oops can be wide; i.e. we do find frames with mixed narrow and wide oops in them, and the oopmaps allow for that possibility. Our present limitation was mainly related to autos that can point to either stack allocated object or heap object, depending on the context. With compressed oops enabled we saw the oop always undergo an EncodeP before it was stored on the stack, after a merge point. It would be great if there's a way to overcome this in the IR, we are both new to OpenJDK and could use guidance on how to approach the problem. Thanks a lot for your insights and feedback! Nikola -----Original Message----- From: Ron Pressler Sent: January 28, 2020 8:46 AM To: hotspot-compiler-dev at openjdk.java.net; Charlie Gracie ; Nikola Grcevski Subject: Re: Stack allocation prototype for C2 ? Hi. Reading between the lines, the difference between scalar replacement and what you call "stack allocation" is that the latter creates an oop pointing into the stack frame. Is that correct? If so, there is another consideration to take into account. Loom continuations can relocate stack frames in memory, and currently, compiled frames do not contain pointers into the stack. 
This relocation can potentially happen at any safepoint, but the continuation algorithm has two modes. In the slow mode -- used when there are interpreted frames or, for forced preemption, at any arbitrary safepoint -- the stack frames are parsed and walked, and so pointers in the stack that point into the stack could be patched (I assume, from your description that you do not allow oops outside the stack to point into the stack). In the fast mode, we avoid parsing the stack and we just copy the frames as-is, and it can occur at any nmethod call. If your "stack-oops" can currently live across nmethod calls, how much would this optimization be compromised if you disallowed that (i.e. stack oops to stack-allocated objects would not live across nmethod calls)? Also, if my understanding is correct, I'm not sure you need to limit the optimization to -UseCompressedOops. I believe that even when compressed oops are used, stack oops can be wide; i.e. we do find frames with mixed narrow and wide oops in them, and the oopmaps allow for that possibility. Ron On 27 January 2020 at 15:58:43, Charlie Gracie wrote: > > Hi, > > We have been prototyping Stack Allocation in C2. We are hoping to > start a conversation on our approach and to see if there is interest > in the community for us to submit our work as a patch set. We would appreciate any comments, concerns or feedback on our current approach or the idea in general. Thanks! > > Background on why we are investigating Stack Allocation: > While investigating the performance of different workloads (a lot of > Scala but some plain Java), we noticed a lot of GC and heap pressure > due to transient object allocations. Upon further digging we were able > to see that C2's Escape Analysis phase was correctly recognizing these > allocations as Non Escaping. The allocations were not being Scalar > Replaced due to a few different restrictions but the most common was due to control flow merge points before accessing the objects.
Here is a simplified example of a case where an allocation cannot be Scalar Replaced even though it is recognized as Non Escaping. > > int foo(int x, MyObject[] vals) { > MyObject bar; > if (x > 127) { > bar = new MyObject(x); > } else { > bar = vals[x]; > } > return bar.x; > } > > In the above example the allocation could successfully be replaced > with a Stack Allocation. We believe that a lot of code has control > flow merge points that cause Scalar Replacement to be rejected but > Stack Allocation could work for many of them. The obvious benefit is the reduction in GC/heap pressure, but you also get the added benefit of improved cache hit rate since the stack allocated objects will likely be in the cache memory. > There are costs with Stack Allocation including increased compilation times and increased code cache size. > These would have to be carefully measured to fully understand the benefits and costs. > > Our current approach: > 1. Only allow stack allocation in non-compressed oops mode > (-XX:-UseCompressedOops) > 1.1 This simplifies the prototype and we believe there are solutions > for compressed oops 2. Focus on ParallelGC and G1GC for prototype > 2.1 Support for other collectors needs to be investigated 3. Current > prototype is based on JDK 11 > 3.1 We are going to work on moving it up to tip shortly > 3.2 We have been following the changes in tip and this work should > apply easily 4. Use the results from Escape Analysis to decide which > objects are eligible for Stack Allocation > 4.1 Scalar Replacement is the most important, so allow that to happen > first > 4.2 Arrays cannot be stack allocated in the current prototype. > 4.2.1 Straightforward to implement > 4.3 Stack allocated objects cannot reference other stack allocated objects. > 4.3.1 More difficult than arrays but should be doable > 4.4 Reject allocations based on the reasons listed below in the > details section 5.
In macro expansion during expand allocation replace allocations eligible for Stack Allocation. > 5.1 Replace the AllocNode with a BoxLockNode with the correct stack > location and remove the slow path and other unused IR > 5.2 Repurpose BoxLock nodes to represent the stack location for the > object > 5.3 No Stack Allocation in methods that contain synchronization. > 5.3.1 This restriction should be easy to remove. > 5.4 Remove write barriers on stores into stack allocated objects 6. > Grow the frame size to handle the stack allocated objects 7. Modify > OopMap scanning to handle stack allocated objects > 7.1 Currently handled via a generic OopClosure for all GCs > 7.2 More work required for full GC collector coverage 8. Safepoint > support reuses SafePointScalarObject > 8.1 De-optimization is working and correctly allocating heap objects > and initializing their fields > > Current Results: > 1. Microbenchmarks using JMH simply replace all allocations using > Stack Allocation where Scalar Replacement 2. Workloads in DaCapo and > Renaissance are showing benefits 3. We have not measured any negative > impacts on application throughput 4. More detailed analysis on > compilation times and code cache size are required > > Benchmark % Improvement > DaCapo tmt 15 > Renaissance db-shootout 10 > Renaissance fj-kmeans 5 > Renaissance future-genetic 10 > Renaissance movie-lens 5 > Renaissance neo4j-analytics 45 > > Details on why allocations should not be replaced with Stack Allocation: > Not all Non Escaping objects can be Stack Allocated so there will > still be cases where Non Escaping allocations will need to be allocated on the heap: > 1. Large arrays and objects should likely be ignored as they could > significantly impact the stack frame size 2. Similar to 1 limit the > total amount of stack memory per frame that should be used for Stack > Allocations 3. Objects and arrays without constant sizes/lengths 4.
> Allocations with overlapping lifetimes from different loop iterations > 5. A stack allocated object cannot be referenced by any other Non > Escaping objects that are not scalar replaced or stack allocated as > that would count as an escape > > Here is a simple example of what I was describing in 4. > int method(int[] vals) { > MyObject o1 = new MyObject(vals[0]); > for (int i = 1; i < vals.length; ++i) { > MyObject o2 = new MyObject(vals[i]); > if (!o1.equals(o2)) { > o1 = o2; > ... > } > ... > } > ... > } > > In the above example if o2 was stack allocated and you ever entered > the 'if' from that point on o1 and o2 would always be the same object. This is incorrect so o2 cannot be stack allocated. > > Thanks for reading and we look forward to your feedback! > Charlie Gracie & Nikola Grcevski > > From igor.ignatyev at oracle.com Tue Jan 28 22:52:00 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Tue, 28 Jan 2020 14:52:00 -0800 Subject: RFR(S) : 8238049 : TieredCompilation shouldn't be re-enabled if CompilationMode is high-only Message-ID: <9A0AFE73-06BA-447A-B75E-72C4B3DF5238@oracle.com> http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/ > 9 lines changed: 3 ins; 2 del; 4 mod; Hi all, could you please review the small and trivial-ish patch? from JBS: > currently, if you run java w/ -XX:-TieredCompilation -XX:+UseJVMCICompiler, TieredCompilation will be set back to true and change CompilationMode from "default" to "high-only-quick-internal" (w/o actually changing the value of the flag though). if you run it w/ '-XX:-TieredCompilation -XX:+UseJVMCICompiler -XX:CompilationMode=high-only', TieredCompilation will be changed to true, but CompilationMode will remain high-only, so effectively it just needlessly enables TieredCompilation. the patch not only leaves TieredCompilation disabled when CompilationMode is high-only, but also changes the actual value of CompilationMode flag, so it will be properly reported.
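[Editorial note: the flag interaction described above can be checked mechanically with -XX:+PrintFlagsFinal. The driver below is a hypothetical sketch, not part of the patch; it assumes a JVMCI-enabled JDK, and on a JDK built without JVMCI the child VM simply reports an unrecognized option, which the sketch prints instead.]

```java
import java.io.BufferedReader;
import java.io.File;
import java.io.InputStreamReader;

// Hypothetical driver mirroring Igor's manual "-XX:+PrintFlagsFinal" testing:
// launch a child VM with one of the flag combinations from the thread and
// print the final values of TieredCompilation and CompilationMode.
public class FlagCheck {
    static int run() throws Exception {
        String java = System.getProperty("java.home")
                + File.separator + "bin" + File.separator + "java";
        Process p = new ProcessBuilder(java,
                "-XX:-TieredCompilation",
                "-XX:+UnlockExperimentalVMOptions", // UseJVMCICompiler is experimental
                "-XX:+UseJVMCICompiler",            // assumption: JVMCI-enabled JDK
                "-XX:CompilationMode=high-only",
                "-XX:+PrintFlagsFinal", "-version")
                .redirectErrorStream(true).start();
        try (BufferedReader r = new BufferedReader(
                new InputStreamReader(p.getInputStream()))) {
            String line;
            while ((line = r.readLine()) != null) {
                // PrintFlagsFinal rows look like: "bool TieredCompilation = false ..."
                if (line.contains(" TieredCompilation ")
                        || line.contains(" CompilationMode ")
                        || line.contains("Unrecognized")) {
                    System.out.println(line.trim());
                }
            }
        }
        return p.waitFor();
    }

    public static void main(String[] args) throws Exception {
        System.out.println("child exit=" + run());
    }
}
```

Running it before and after the patch shows whether TieredCompilation stays false when CompilationMode=high-only is given explicitly.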
JBS: https://bugs.openjdk.java.net/browse/JDK-8238049
webrev: http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/
testing: manual by java -XX:+PrintFlagsFinal w/ {-TieredCompilation,+TieredCompilation}x{CompilationMode=default,CompilationMode=high-only,""}

Thanks,
-- Igor

From ron.pressler at oracle.com Tue Jan 28 23:05:46 2020
From: ron.pressler at oracle.com (Ron Pressler)
Date: Tue, 28 Jan 2020 23:05:46 +0000
Subject: Stack allocation prototype for C2
In-Reply-To: 
References: 
Message-ID: 

In the fast mode we cannot patch any frame contents because we don't know what the frames are (or even where they begin and end) -- that's why that mode is fast. However, that mode is only relevant at nmethod calls, not any other kind of safepoint. So the only limitation is stack oops living across nmethod calls. If even that limitation significantly harms performance, we can weigh our options.

Ron

On 28 January 2020 at 22:52:38, Nikola Grcevski (nikola.grcevski at microsoft.com(mailto:nikola.grcevski at microsoft.com)) wrote:

> Hi Ron,
>
> > Reading between the lines, the difference between scalar replacement and what you call "stack allocation" is that the latter creates an oop pointing into the stack frame. Is that correct?
>
> Yes, you are correct. There will be oops on the stack that will point to stack locations in the same nmethod frame. We used the BoxLockNode in C2 and these stack allocated objects end up at the bottom of the frame.
>
> > If so, there is another consideration to take into account. Loom continuations can relocate stack frames in memory, and currently, compiled frames do not contain pointers into the stack. This relocation can potentially happen at any safepoint, but the continuation algorithm has two modes.
In the slow mode -- used when there are interpreted frames or, for forced preemption, at any arbitrary safepoint -- the stack frames are parsed and walked, and so pointers in the stack that point into the stack could be patched (I assume, from your description that you do not allow oops outside the stack to point into the stack). In the fast mode, we avoid parsing the stack and we just copy the frames as-is, and it can occur at any nmethod call. If your "stack-oops" can currently live across nmethod calls, how much would this optimization be compromised if you disallowed that (i.e. stack oops to stack-allocated objects would not live across nmethod calls)? > > Thanks so much for this insight. It seems that we would have to add some code to fix-up/patch the stack oops only in the fast relocation mode, unless that's a dealbreaker? We currently describe the stack allocated object for deoptimization purposes on every safepoint where the objects are live, so perhaps we can use the same information to fix-up the stack on relocation. We initially disallowed stack allocating any objects that lived across safepoints until we implemented the deoptimization support, so yes the optimization would definitely be compromised with that limitation, quite a bit of opportunities will be rejected. Some of the benchmarks we quoted still improve with this limitation around safepoints, but we'd have to remeasure for final numbers. > > > Also, if my understanding is correct, I'm not sure you need to limit the optimization to -UseCompressedOops. I believe that even when compressed oops are used, stack oops can be wide; i.e. we do find frames with mixed narrow and wide oops in them, and the oopmaps allow for that possibility. > > Our present limitation was mainly related to autos that can point to either stack allocated object or heap object, depending on the context. 
With compressed oops enabled we saw the oop always undergo an EncodeP before it was stored on the stack, after a merge point. It would be great if there's a way to overcome this in the IR, we are both new to OpenJDK and could use guidance on how to approach the problem. > > Thanks a lot for your insights and feedback! > > Nikola > > -----Original Message----- > From: Ron Pressler > Sent: January 28, 2020 8:46 AM > To: hotspot-compiler-dev at openjdk.java.net; Charlie Gracie ; Nikola Grcevski > Subject: Re: Stack allocation prototype for C2 > > > Hi. > > Reading between the lines, the difference between scalar replacement and what you call "stack allocation" is that the latter creates an oop pointing into the stack frame. Is that correct? > > If so, there is another consideration to take into account. Loom continuations can relocate stack frames in memory, and currently, compiled frames do not contain pointers into the stack. This relocation can potentially happen at any safepoint, but the continuation algorithm has two modes. In the slow mode -- used when there are interpreted frames or, for forced preemption, at any arbitrary safepoint -- the stack frames are parsed and walked, and so pointers in the stack that point into the stack could be patched (I assume, from your description that you do not allow oops outside the stack to point into the stack). In the fast mode, we avoid parsing the stack and we just copy the frames as-is, and it can occur at any nmethod call. If your "stack-oops" can currently live across nmethod calls, how much would this optimization be compromised if you disallowed that (i.e. stack oops to stack-allocated objects would not live across nmethod calls)? > > Also, if my understanding is correct, I'm not sure you need to limit the optimization to -UseCompressedOops. I believe that even when compressed oops are used, stack oops can be wide; i.e. 
we do find frames with mixed narrow and wide oops in them, and the oopmaps allow for that possibility.
>
> Ron
>
>
> On 27 January 2020 at 15:58:43, Charlie Gracie wrote:
>
> >
> > Hi,
> >
> > We have been prototyping Stack Allocation in C2. We are hoping to
> > start a conversation on our approach and to see if there is interest
> > in the community for us to submit our work as a patch set. We would appreciate any comments, concerns or feedback on our current approach or the idea in general. Thanks!
> >
> > Background on why we are investigating Stack Allocation:
> > While investigating the performance of different workloads (a lot of
> > Scala but some plain Java), we noticed a lot of GC and heap pressure
> > due to transient object allocations. Upon further digging we were able
> > to see that C2's Escape Analysis phase was correctly recognizing these
> > allocations as Non Escaping. The allocations were not being Scalar
> > Replaced due to a few different restrictions, but the most common was due to control flow merge points before accessing the objects. Here is a simplified example of a case where an allocation cannot be Scalar Replaced even though it is recognized as Non Escaping.
> >
> > int foo(int x, MyObject[] vals) {
> >     MyObject bar;
> >     if (x > 127) {
> >         bar = new MyObject(x);
> >     } else {
> >         bar = vals[x];
> >     }
> >     return bar.x;
> > }
> >
> > In the above example the allocation could successfully be replaced
> > with a Stack Allocation. We believe that a lot of code has control
> > flow merge points that cause Scalar Replacement to be rejected, but
> > Stack Allocation could work for many of them. The obvious benefit is the reduction in GC/heap pressure, but you also get the added benefit of an improved cache hit rate since the stack allocated objects will likely be in cache memory.
> > There are costs with Stack Allocation, including increased compilation times and increased code cache size.
> > These would have to be carefully measured to fully understand the benefits and costs.
> >
> > Our current approach:
> > 1. Only allow stack allocation in non-compressed oops mode
> > (-XX:-UseCompressedOops)
> > 1.1 This simplifies the prototype and we believe there are solutions
> > for compressed oops
> > 2. Focus on ParallelGC and G1GC for prototype
> > 2.1 Support for other collectors needs to be investigated
> > 3. Current prototype is based on JDK 11
> > 3.1 We are going to work on moving it up to tip shortly
> > 3.2 We have been following the changes in tip and this work should
> > apply easily
> > 4. Use the results from Escape Analysis to decide which
> > objects are eligible for Stack Allocation
> > 4.1 Scalar Replacement is the most important, so allow that to happen
> > first
> > 4.2 Arrays cannot be stack allocated in the current prototype.
> > 4.2.1 Straightforward to implement
> > 4.3 Stack allocated objects cannot reference other stack allocated objects.
> > 4.3.1 More difficult than arrays but should be doable
> > 4.4 Reject allocations based on the reasons listed below in the
> > details section
> > 5. In macro expansion during expand allocation replace allocations eligible for Stack Allocation.
> > 5.1 Replace the AllocNode with a BoxLockNode with the correct stack
> > location and remove the slow path and other unused IR
> > 5.2 Repurpose BoxLock nodes to represent the stack location for the
> > object
> > 5.3 No Stack Allocation in methods that contain synchronization.
> > 5.3.1 This restriction should be easy to remove.
> > 5.4 Remove write barriers on stores into stack allocated objects
> > 6. Grow the frame size to handle the stack allocated objects
> > 7. Modify OopMap scanning to handle stack allocated objects
> > 7.1 Currently handled via a generic OopClosure for all GCs
> > 7.2 More work required for full GC collector coverage
> > 8.
Safepoint support reuses SafePointScalarObject
> > 8.1 De-optimization is working and correctly allocating heap objects
> > and initializing their fields
> >
> > Current Results:
> > 1. Microbenchmarks using JMH simply replace all allocations using
> > Stack Allocation where Scalar Replacement does not apply
> > 2. Workloads in DaCapo and Renaissance are showing benefits
> > 3. We have not measured any negative impacts on application throughput
> > 4. More detailed analysis of compilation times and code cache size is required
> >
> > Benchmark                     % Improvement
> > DaCapo tmt                    15
> > Renaissance db-shootout       10
> > Renaissance fj-kmeans         5
> > Renaissance future-genetic    10
> > Renaissance movie-lens        5
> > Renaissance neo4j-analytics   45
> >
> > Details on why allocations should not be replaced with Stack Allocation:
> > Not all Non Escaping objects can be Stack Allocated so there will
> > still be cases where Non Escaping allocations will need to be allocated on the heap:
> > 1. Large arrays and objects should likely be ignored as they could
> > significantly impact the stack frame size
> > 2. Similar to 1, limit the total amount of stack memory per frame
> > that should be used for Stack Allocations
> > 3. Objects and arrays without constant sizes/lengths
> > 4. Allocations with overlapping lifetimes from different loop iterations
> > 5. A stack allocated object cannot be referenced by any other Non
> > Escaping objects that are not scalar replaced or stack allocated, as
> > that would count as an escape
> >
> > Here is a simple example of what I was describing in 4.
> >
> > int method(int[] vals) {
> >     MyObject o1 = new MyObject(vals[0]);
> >     for (int i = 1; i < vals.length; ++i) {
> >         MyObject o2 = new MyObject(vals[i]);
> >         if (!o1.equals(o2)) {
> >             o1 = o2;
> >             ...
> >         }
> >         ...
> >     }
> >     ...
> > }
> >
> > In the above example, if o2 was stack allocated and you ever entered
> > the "if", from that point on o1 and o2 would always be the same object.
This is incorrect, so o2 cannot be stack allocated.
> >
> > Thanks for reading and we look forward to your feedback!
> > Charlie Gracie & Nikola Grcevski
>
>

From Nikola.Grcevski at microsoft.com Wed Jan 29 00:26:07 2020
From: Nikola.Grcevski at microsoft.com (Nikola Grcevski)
Date: Wed, 29 Jan 2020 00:26:07 +0000
Subject: Stack allocation prototype for C2
In-Reply-To: 
References: 
Message-ID: 

Thanks for the explanation Ron, that makes sense. We'll change the prototype code and run an experiment where we reject only safepoints that are nmethod calls. We should be able to produce some numbers soon and share with everyone.

Cheers,
Nikola

-----Original Message-----
From: Ron Pressler 
Sent: January 28, 2020 6:06 PM
To: Nikola Grcevski ; hotspot-compiler-dev at openjdk.java.net; Charlie Gracie 
Subject: RE: Stack allocation prototype for C2

In the fast mode we cannot patch any frame contents because we don't know what the frames are (or even where they begin and end) -- that's why that mode is fast. However, that mode is only relevant at nmethod calls, not any other kind of safepoint. So the only limitation is stack oops living across nmethod calls. If even that limitation significantly harms performance, we can weigh our options.

Ron

On 28 January 2020 at 22:52:38, Nikola Grcevski (nikola.grcevski at microsoft.com(mailto:nikola.grcevski at microsoft.com)) wrote:

> Hi Ron,
>
> > Reading between the lines, the difference between scalar replacement and what you call "stack allocation" is that the latter creates an oop pointing into the stack frame. Is that correct?
>
> Yes, you are correct. There will be oops on the stack that will point to stack locations in the same nmethod frame. We used the BoxLockNode in C2 and these stack allocated objects end up at the bottom of the frame.
>
> > If so, there is another consideration to take into account. Loom continuations can relocate stack frames in memory, and currently, compiled frames do not contain pointers into the stack.
This relocation can potentially happen at any safepoint, but the continuation algorithm has two modes. In the slow mode -- used when there are interpreted frames or, for forced preemption, at any arbitrary safepoint -- the stack frames are parsed and walked, and so pointers in the stack that point into the stack could be patched (I assume, from your description that you do not allow oops outside the stack to point into the stack). In the fast mode, we avoid parsing the stack and we just copy the frames as-is, and it can occur at any nmethod call. If your "stack-oops" can currently live across nmethod calls, how much would this optimization be compromised if you disallowed that (i.e. stack oops to stack-allocated objects would not live across nmethod calls)? > > Thanks so much for this insight. It seems that we would have to add some code to fix-up/patch the stack oops only in the fast relocation mode, unless that's a dealbreaker? We currently describe the stack allocated object for deoptimization purposes on every safepoint where the objects are live, so perhaps we can use the same information to fix-up the stack on relocation. We initially disallowed stack allocating any objects that lived across safepoints until we implemented the deoptimization support, so yes the optimization would definitely be compromised with that limitation, quite a bit of opportunities will be rejected. Some of the benchmarks we quoted still improve with this limitation around safepoints, but we'd have to remeasure for final numbers. > > > Also, if my understanding is correct, I'm not sure you need to limit the optimization to -UseCompressedOops. I believe that even when compressed oops are used, stack oops can be wide; i.e. we do find frames with mixed narrow and wide oops in them, and the oopmaps allow for that possibility. > > Our present limitation was mainly related to autos that can point to either stack allocated object or heap object, depending on the context. 
With compressed oops enabled we saw the oop always undergo an EncodeP before it was stored on the stack, after a merge point. It would be great if there's a way to overcome this in the IR, we are both new to OpenJDK and could use guidance on how to approach the problem. > > Thanks a lot for your insights and feedback! > > Nikola > > -----Original Message----- > From: Ron Pressler > Sent: January 28, 2020 8:46 AM > To: hotspot-compiler-dev at openjdk.java.net; Charlie Gracie ; Nikola > Grcevski > Subject: Re: Stack allocation prototype for C2 > > > Hi. > > Reading between the lines, the difference between scalar replacement and what you call "stack allocation" is that the latter creates an oop pointing into the stack frame. Is that correct? > > If so, there is another consideration to take into account. Loom continuations can relocate stack frames in memory, and currently, compiled frames do not contain pointers into the stack. This relocation can potentially happen at any safepoint, but the continuation algorithm has two modes. In the slow mode -- used when there are interpreted frames or, for forced preemption, at any arbitrary safepoint -- the stack frames are parsed and walked, and so pointers in the stack that point into the stack could be patched (I assume, from your description that you do not allow oops outside the stack to point into the stack). In the fast mode, we avoid parsing the stack and we just copy the frames as-is, and it can occur at any nmethod call. If your "stack-oops" can currently live across nmethod calls, how much would this optimization be compromised if you disallowed that (i.e. stack oops to stack-allocated objects would not live across nmethod calls)? > > Also, if my understanding is correct, I'm not sure you need to limit the optimization to -UseCompressedOops. I believe that even when compressed oops are used, stack oops can be wide; i.e. 
we do find frames with mixed narrow and wide oops in them, and the oopmaps allow for that possibility.
>
> Ron
>
>
> On 27 January 2020 at 15:58:43, Charlie Gracie wrote:
>
> >
> > Hi,
> >
> > We have been prototyping Stack Allocation in C2. We are hoping to
> > start a conversation on our approach and to see if there is interest
> > in the community for us to submit our work as a patch set. We would appreciate any comments, concerns or feedback on our current approach or the idea in general. Thanks!
> >
> > Background on why we are investigating Stack Allocation:
> > While investigating the performance of different workloads (a lot of
> > Scala but some plain Java), we noticed a lot of GC and heap pressure
> > due to transient object allocations. Upon further digging we were
> > able to see that C2's Escape Analysis phase was correctly
> > recognizing these allocations as Non Escaping. The allocations were
> > not being Scalar Replaced due to a few different restrictions, but the most common was due to control flow merge points before accessing the objects. Here is a simplified example of a case where an allocation cannot be Scalar Replaced even though it is recognized as Non Escaping.
> >
> > int foo(int x, MyObject[] vals) {
> >     MyObject bar;
> >     if (x > 127) {
> >         bar = new MyObject(x);
> >     } else {
> >         bar = vals[x];
> >     }
> >     return bar.x;
> > }
> >
> > In the above example the allocation could successfully be replaced
> > with a Stack Allocation. We believe that a lot of code has control
> > flow merge points that cause Scalar Replacement to be rejected, but
> > Stack Allocation could work for many of them. The obvious benefit is the reduction in GC/heap pressure, but you also get the added benefit of an improved cache hit rate since the stack allocated objects will likely be in cache memory.
> > There are costs with Stack Allocation, including increased compilation times and increased code cache size.
> > These would have to be carefully measured to fully understand the benefits and costs.
> >
> > Our current approach:
> > 1. Only allow stack allocation in non-compressed oops mode
> > (-XX:-UseCompressedOops)
> > 1.1 This simplifies the prototype and we believe there are solutions
> > for compressed oops
> > 2. Focus on ParallelGC and G1GC for prototype
> > 2.1 Support for other collectors needs to be investigated
> > 3. Current prototype is based on JDK 11
> > 3.1 We are going to work on moving it up to tip shortly
> > 3.2 We have been following the changes in tip and this work should
> > apply easily
> > 4. Use the results from Escape Analysis to decide which
> > objects are eligible for Stack Allocation
> > 4.1 Scalar Replacement is the most important, so allow that to
> > happen first
> > 4.2 Arrays cannot be stack allocated in the current prototype.
> > 4.2.1 Straightforward to implement
> > 4.3 Stack allocated objects cannot reference other stack allocated objects.
> > 4.3.1 More difficult than arrays but should be doable
> > 4.4 Reject allocations based on the reasons listed below in the
> > details section
> > 5. In macro expansion during expand allocation replace allocations eligible for Stack Allocation.
> > 5.1 Replace the AllocNode with a BoxLockNode with the correct stack
> > location and remove the slow path and other unused IR
> > 5.2 Repurpose BoxLock nodes to represent the stack location for the
> > object
> > 5.3 No Stack Allocation in methods that contain synchronization.
> > 5.3.1 This restriction should be easy to remove.
> > 5.4 Remove write barriers on stores into stack allocated objects
> > 6. Grow the frame size to handle the stack allocated objects
> > 7. Modify OopMap scanning to handle stack allocated objects
> > 7.1 Currently handled via a generic OopClosure for all GCs
> > 7.2 More work required for full GC collector coverage
> > 8.
Safepoint support reuses SafePointScalarObject
> > 8.1 De-optimization is working and correctly allocating heap objects
> > and initializing their fields
> >
> > Current Results:
> > 1. Microbenchmarks using JMH simply replace all allocations using
> > Stack Allocation where Scalar Replacement does not apply
> > 2. Workloads in DaCapo and Renaissance are showing benefits
> > 3. We have not measured any negative impacts on application throughput
> > 4. More detailed analysis on compilation times and code cache size is required
> >
> > Benchmark                     % Improvement
> > DaCapo tmt                    15
> > Renaissance db-shootout       10
> > Renaissance fj-kmeans         5
> > Renaissance future-genetic    10
> > Renaissance movie-lens        5
> > Renaissance neo4j-analytics   45
> >
> > Details on why allocations should not be replaced with Stack Allocation:
> > Not all Non Escaping objects can be Stack Allocated so there will
> > still be cases where Non Escaping allocations will need to be allocated on the heap:
> > 1. Large arrays and objects should likely be ignored as they could
> > significantly impact the stack frame size
> > 2. Similar to 1, limit the total amount of stack memory per frame
> > that should be used for Stack Allocations
> > 3. Objects and arrays without constant sizes/lengths
> > 4. Allocations with overlapping lifetimes from different loop
> > iterations
> > 5. A stack allocated object cannot be referenced by any
> > other Non Escaping objects that are not scalar replaced or stack
> > allocated, as that would count as an escape
> >
> > Here is a simple example of what I was describing in 4.
> >
> > int method(int[] vals) {
> >     MyObject o1 = new MyObject(vals[0]);
> >     for (int i = 1; i < vals.length; ++i) {
> >         MyObject o2 = new MyObject(vals[i]);
> >         if (!o1.equals(o2)) {
> >             o1 = o2;
> >             ...
> >         }
> >         ...
> >     }
> >     ...
> > }
> >
> > In the above example, if o2 was stack allocated and you ever entered
> > the "if", from that point on o1 and o2 would always be the same object. This is incorrect, so o2 cannot be stack allocated.
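[Editor's note] The o1/o2 hazard described above can be made concrete with a small, runnable Java model. This is an illustration, not the prototype's code: the reused "stack slot" for o2 is modelled as a one-element array, and the number of times the `if` fires diverges once o1 aliases the slot.

```java
public class SlotReuseDemo {
    // Heap behaviour: o2 is a fresh "object" (modelled as int[1]) each iteration.
    static int changesWithHeapAlloc(int[] vals) {
        int changes = 0;
        int[] o1 = new int[]{vals[0]};
        for (int i = 1; i < vals.length; ++i) {
            int[] o2 = new int[]{vals[i]};   // fresh allocation per iteration
            if (o1[0] != o2[0]) {
                o1 = o2;                     // keep a reference to this iteration's o2
                changes++;
            }
        }
        return changes;
    }

    // Stack-slot behaviour: every iteration reuses ONE slot for o2, so once
    // o1 points at the slot, o1's value silently changes on each "allocation".
    static int changesWithSlotReuse(int[] vals) {
        int changes = 0;
        int[] slot = new int[1];             // the single reused "stack slot"
        int[] o1 = new int[]{vals[0]};
        for (int i = 1; i < vals.length; ++i) {
            slot[0] = vals[i];               // "allocate" o2 into the reused slot
            if (o1[0] != slot[0]) {
                o1 = slot;                   // o1 now aliases the slot
                changes++;
            }
        }
        return changes;
    }

    public static void main(String[] args) {
        int[] vals = {1, 2, 3};
        System.out.println(changesWithHeapAlloc(vals)); // 2: o1 is replaced twice
        System.out.println(changesWithSlotReuse(vals)); // 1: once aliased, the test never fires again
    }
}
```

Running it prints 2 then 1: the two variants disagree, which is exactly why an allocation with overlapping lifetimes across loop iterations must stay on the heap.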
> >
> > Thanks for reading and we look forward to your feedback!
> > Charlie Gracie & Nikola Grcevski
> >
> >
>

From igor.ignatyev at oracle.com Wed Jan 29 04:17:56 2020
From: igor.ignatyev at oracle.com (Igor Ignatyev)
Date: Tue, 28 Jan 2020 20:17:56 -0800
Subject: RFR(T/XS): 8236211 [Graal] compiler/graalunit/GraphTest.java is skipped in all testing
In-Reply-To: <7eb0c680-08fd-211d-5ee3-ce52cbb468e3@oracle.com>
References: <7eb0c680-08fd-211d-5ee3-ce52cbb468e3@oracle.com>
Message-ID: <03BD513C-E719-4994-ACC7-4FF284D273C5@oracle.com>

LGTM.

-- Igor

> On Jan 28, 2020, at 10:33 AM, Ekaterina Pavlova wrote:
>
> Hi,
>
> can I please get one review?
>
> thanks,
> -katya
>
> On 1/24/20 8:12 AM, Ekaterina Pavlova wrote:
>> Hi All,
>> Please review the small patch which adds the required --add-opens for
>> org.graalvm.compiler.graph tests (compiler/graalunit/GraphTest.java).
>> Also removed unnecessary "@requires vm.graal.enabled" from compiler/graalunit/NodesTest.java.
>> Both tests will start running in graal tiers after this fix
>> (these tests are now skipped because graalunit tests are tested only in non-Graal-as-JIT mode).
>> JBS: https://bugs.openjdk.java.net/browse/JDK-8236211
>> webrev: http://cr.openjdk.java.net/~epavlova/8236211/webrev.00/index.html
>> testing: graal tiers
>> thanks,
>> -katya
>

From rwestrel at redhat.com Wed Jan 29 08:42:40 2020
From: rwestrel at redhat.com (Roland Westrelin)
Date: Wed, 29 Jan 2020 09:42:40 +0100
Subject: RFR(XS): CTW: C2 compilation fails with "malformed control flow"
Message-ID: <87ftfywt4v.fsf@redhat.com>

http://cr.openjdk.java.net/~roland/8237951/webrev.00/

CatchNode::Value() makes the fallthrough path of a virtual or interface call dead if the receiver is null. This failure occurs because the fallthrough is erroneously killed during CCP.
When CCP is executed, the type of the receiver is first set to top, which causes CatchNode::Value() to make the fallthrough path dead; then the type of the receiver changes to non-null, but the CatchNode is not reprocessed by CCP. That happens because:

1758   // If we changed the receiver type to a call, we need to revisit
1759   // the Catch following the call. It's looking for a non-NULL
1760   // receiver to know when to enable the regular fall-through path
1761   // in addition to the NullPtrException path
1762   if (m->is_Call()) {
1763     for (DUIterator_Fast i2max, i2 = m->fast_outs(i2max); i2 < i2max; i2++) {
1764       Node* p = m->fast_out(i2); // Propagate changes to uses
1765       if (p->is_Proj() && p->as_Proj()->_con == TypeFunc::Control && p->outcnt() == 1) {
1766         worklist.push(p->unique_out());

the ProjNode has more than one use: a Load is pinned there. I could write a test case that causes a Load to be pinned on a Proj after a Call but couldn't have the test case trigger the failure (most likely because node processing order during CCP matters). The fix is to enqueue the CatchNode even if p doesn't have a single use.

Note: this triggered with Shenandoah but is not specific to Shenandoah.

Roland.

From adinn at redhat.com Wed Jan 29 10:17:10 2020
From: adinn at redhat.com (Andrew Dinn)
Date: Wed, 29 Jan 2020 10:17:10 +0000
Subject: [aarch64-port-dev ] RFR(S): 8237524: AArch64: String.compareTo() may return incorrect result
In-Reply-To: 
References: <5111fcc0-9c3a-6f1b-9d6a-b666a41f4453@redhat.com> <56670d62-defa-4192-c084-bfa130adde41@redhat.com> 
Message-ID: 

On 28/01/2020 13:40, Pengfei Li wrote:
> Hi,
>
> Can any committer help push this fix? http://cr.openjdk.java.net/~pli/rfr/8237524/webrev.00/
>
> Patch is reviewed by aph. See below review thread.
Thanks Pengfei. I just pushed it.

regards,


Andrew Dinn
-----------
Senior Principal Software Engineer
Red Hat UK Ltd
Registered in England and Wales under Company Registration No.
03798903
Directors: Michael Cunningham, Michael ("Mike") O'Neill

From shade at redhat.com Wed Jan 29 10:51:08 2020
From: shade at redhat.com (Aleksey Shipilev)
Date: Wed, 29 Jan 2020 11:51:08 +0100
Subject: RFR (XS) 8238153: CTW: C2 (Shenandoah) compilation fails with "Unknown node in get_load_addr: CreateEx"
Message-ID: 

Bug:
https://bugs.openjdk.java.net/browse/JDK-8238153

CTW tests caught this failure. I believe CreateEx should just be explicitly excepted, for the same reason we did JDK-8229350: CreateEx has no useful inputs for us. This patch is safe, because it does what the "default" case does in release bits anyway.

diff -r e7c4199e4d32 src/hotspot/share/gc/shenandoah/c2/shenandoahSupport.cpp
--- a/src/hotspot/share/gc/shenandoah/c2/shenandoahSupport.cpp  Mon Jan 27 09:58:27 2020 -0500
+++ b/src/hotspot/share/gc/shenandoah/c2/shenandoahSupport.cpp  Wed Jan 29 11:44:59 2020 +0100
@@ -1796,12 +1796,13 @@
     case Op_CallDynamicJava:
     case Op_CallLeaf:
     case Op_CallStaticJava:
     case Op_ConN:
     case Op_ConP:
     case Op_Parm:
+    case Op_CreateEx:
       return phase->igvn().zerocon(T_OBJECT);
     default:
 #ifdef ASSERT
       fatal("Unknown node in get_load_addr: %s", NodeClassNames[in->Opcode()]);
 #endif
       return phase->igvn().zerocon(T_OBJECT);

Testing: affected CTW tests; will run hotspot_gc_shenandoah before push

--
Thanks,
-Aleksey

From rwestrel at redhat.com Wed Jan 29 12:57:17 2020
From: rwestrel at redhat.com (Roland Westrelin)
Date: Wed, 29 Jan 2020 13:57:17 +0100
Subject: RFR (XS) 8238153: CTW: C2 (Shenandoah) compilation fails with "Unknown node in get_load_addr: CreateEx"
In-Reply-To: 
References: 
Message-ID: <87ftfyh13m.fsf@redhat.com>

> CTW tests caught this failure. I believe CreateEx should just be explicitly excepted, for the same
> reason we did JDK-8229350: CreateEx has no useful inputs for us. This patch is safe, because it does
> what "default" case does in release bits anyway.
>
> diff -r e7c4199e4d32 src/hotspot/share/gc/shenandoah/c2/shenandoahSupport.cpp
> --- a/src/hotspot/share/gc/shenandoah/c2/shenandoahSupport.cpp  Mon Jan 27 09:58:27 2020 -0500
> +++ b/src/hotspot/share/gc/shenandoah/c2/shenandoahSupport.cpp  Wed Jan 29 11:44:59 2020 +0100
> @@ -1796,12 +1796,13 @@
>      case Op_CallDynamicJava:
>      case Op_CallLeaf:
>      case Op_CallStaticJava:
>      case Op_ConN:
>      case Op_ConP:
>      case Op_Parm:
> +    case Op_CreateEx:
>        return phase->igvn().zerocon(T_OBJECT);
>      default:
>  #ifdef ASSERT
>        fatal("Unknown node in get_load_addr: %s", NodeClassNames[in->Opcode()]);
>  #endif
>        return phase->igvn().zerocon(T_OBJECT);
>
> Testing: affected CTW tests; will run hotspot_gc_shenandoah before push

Looks good to me.

Roland.

From rkennke at redhat.com Wed Jan 29 13:03:46 2020
From: rkennke at redhat.com (Roman Kennke)
Date: Wed, 29 Jan 2020 14:03:46 +0100
Subject: RFR (XS) 8238153: CTW: C2 (Shenandoah) compilation fails with "Unknown node in get_load_addr: CreateEx"
In-Reply-To: 
References: 
Message-ID: <95003e3d-d12a-9a77-6993-d1be890acc51@redhat.com>

Yes, that makes sense.
Thank you!

Roman

> Bug:
> https://bugs.openjdk.java.net/browse/JDK-8238153
>
> CTW tests caught this failure. I believe CreateEx should just be explicitly excepted, for the same
> reason we did JDK-8229350: CreateEx has no useful inputs for us. This patch is safe, because it does
> what "default" case does in release bits anyway.
>
> diff -r e7c4199e4d32 src/hotspot/share/gc/shenandoah/c2/shenandoahSupport.cpp
> --- a/src/hotspot/share/gc/shenandoah/c2/shenandoahSupport.cpp  Mon Jan 27 09:58:27 2020 -0500
> +++ b/src/hotspot/share/gc/shenandoah/c2/shenandoahSupport.cpp  Wed Jan 29 11:44:59 2020 +0100
> @@ -1796,12 +1796,13 @@
>      case Op_CallDynamicJava:
>      case Op_CallLeaf:
>      case Op_CallStaticJava:
>      case Op_ConN:
>      case Op_ConP:
>      case Op_Parm:
> +    case Op_CreateEx:
>        return phase->igvn().zerocon(T_OBJECT);
>      default:
>  #ifdef ASSERT
>        fatal("Unknown node in get_load_addr: %s", NodeClassNames[in->Opcode()]);
>  #endif
>        return phase->igvn().zerocon(T_OBJECT);
>
> Testing: affected CTW tests; will run hotspot_gc_shenandoah before push
>

From Pengfei.Li at arm.com Wed Jan 29 13:07:40 2020
From: Pengfei.Li at arm.com (Pengfei Li)
Date: Wed, 29 Jan 2020 13:07:40 +0000
Subject: [aarch64-port-dev ] RFR(S): 8237524: AArch64: String.compareTo() may return incorrect result
In-Reply-To: 
References: <5111fcc0-9c3a-6f1b-9d6a-b666a41f4453@redhat.com> <56670d62-defa-4192-c084-bfa130adde41@redhat.com> 
Message-ID: 

Thanks a lot.

> -----Original Message-----
> From: Andrew Dinn 
> Sent: Wednesday, January 29, 2020 18:17
> To: Pengfei Li ; Andrew Haley ; Nick Gasson ;
> hotspot-compiler-dev at openjdk.java.net; aarch64-port-dev at openjdk.java.net
> Cc: nd 
> Subject: Re: [aarch64-port-dev ] RFR(S): 8237524: AArch64:
> String.compareTo() may return incorrect result
>
> On 28/01/2020 13:40, Pengfei Li wrote:
> > Hi,
> >
> > Can any committer help push this fix?
> http://cr.openjdk.java.net/~pli/rfr/8237524/webrev.00/
> >
> > Patch is reviewed by aph. See below review thread.
> Thanks Pengfei. I just pushed it.
>
> regards,
>
>
> Andrew Dinn
> -----------
> Senior Principal Software Engineer
> Red Hat UK Ltd
> Registered in England and Wales under Company Registration No.
03798903 > Directors: Michael Cunningham, Michael ("Mike") O'Neill From vladimir.x.ivanov at oracle.com Wed Jan 29 13:45:05 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Wed, 29 Jan 2020 16:45:05 +0300 Subject: RFR(XS): CTW: C2 compilation fails with "malformed control flow" In-Reply-To: <87ftfywt4v.fsf@redhat.com> References: <87ftfywt4v.fsf@redhat.com> Message-ID: > http://cr.openjdk.java.net/~roland/8237951/webrev.00/ Looks good. Best regards, Vladimir Ivanov > CatchNode::Value() makes the fallthrough path of a virtual or interface > call dead if the receiver is null. This failure occurs because the > fallthrough is erroneously killed during CCP. When CCP is executed, the > type of the receiver is first set to top which causes CatchNode::Value() > to make the fallthrough path dead then the type of the receiver changes > to non null but the CatchNode is not reprocessed by CCP. That happens > because: > > 1758 // If we changed the receiver type to a call, we need to revisit > 1759 // the Catch following the call. It's looking for a non-NULL > 1760 // receiver to know when to enable the regular fall-through path > 1761 // in addition to the NullPtrException path > 1762 if (m->is_Call()) { > 1763 for (DUIterator_Fast i2max, i2 = m->fast_outs(i2max); i2 < i2max; i2++) { > 1764 Node* p = m->fast_out(i2); // Propagate changes to uses > 1765 if (p->is_Proj() && p->as_Proj()->_con == TypeFunc::Control && p->outcnt() == 1) { > 1766 worklist.push(p->unique_out()); > > the ProjNode has more than one use: a Load is pinned there. I could > write a test case that causes a Load to be pinned on a Proj after a Call > but couldn't have the test case trigger the failure (most likely because > node processing order during CCP matters). The fix is to enqueue the > CatchNode even if p doesn't have a single use. > > Note: this triggered with Shenandoah but is not specific to Shenandoah. > > Roland. 
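[Editor's note] The worklist subtlety Roland describes (pushing only `unique_out()` when the control projection has exactly one user) can be modelled in a few lines of ordinary Java. This is a toy illustration: the Node class below is invented for the sketch and has nothing to do with HotSpot's Node; it only shows that with a second user pinned on the projection, the `outcnt() == 1` check never re-enqueues the Catch, while pushing every user does.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;

public class WorklistModel {
    // Minimal node graph: Call -> Proj -> {Catch, pinned Load}
    static class Node {
        final String name;
        final List<Node> outs = new ArrayList<>();
        Node(String name) { this.name = name; }
    }

    // Returns true if catchNode would be revisited by the CCP-style pass.
    static boolean revisitsCatch(Node call, Node catchNode, boolean fixed) {
        ArrayDeque<Node> worklist = new ArrayDeque<>();
        for (Node p : call.outs) {            // p plays the control projection
            if (fixed) {
                worklist.addAll(p.outs);      // fix: push every user of the projection
            } else if (p.outs.size() == 1) {
                worklist.add(p.outs.get(0));  // old code: only when outcnt() == 1
            }
        }
        return worklist.contains(catchNode);
    }

    public static void main(String[] args) {
        Node call = new Node("Call");
        Node proj = new Node("Proj");
        Node catchNode = new Node("Catch");
        Node pinnedLoad = new Node("Load");   // a Load pinned on the projection
        call.outs.add(proj);
        proj.outs.add(catchNode);
        proj.outs.add(pinnedLoad);

        // With two users on the projection, the old check misses the Catch:
        System.out.println(revisitsCatch(call, catchNode, false)); // false
        System.out.println(revisitsCatch(call, catchNode, true));  // true
    }
}
```

With only the Catch hanging off the projection both variants agree; the pinned Load is what exposes the difference, matching the failure description above.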
> From vladimir.x.ivanov at oracle.com Wed Jan 29 18:18:28 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Wed, 29 Jan 2020 21:18:28 +0300 Subject: [15] RFR (M): 8237581: Improve allocation expansion In-Reply-To: <921f00aa-ecbc-f838-5a4b-6acb6b62215d@oracle.com> References: <2d79a359-6b0b-c576-518c-388a8ffc4890@oracle.com> <921f00aa-ecbc-f838-5a4b-6acb6b62215d@oracle.com> Message-ID: Thanks for separating the fix from the refactoring. > Refactoring: > > http://cr.openjdk.java.net/~neliasso/8237581/webrev.refactor/ Looks good. > The actual change (on top of the refactoring) > > http://cr.openjdk.java.net/~neliasso/8237581/webrev.change/ Overall, looks good. I'd prefer to see yank_alloc_node(alloc) case separated. What do you think about the following? if (!allocation_has_use) { InitializeNode* init = alloc->initialization(); if (init != NULL) { yank_initalize_node(init); assert(init->outcnt() == 0, "all uses must be deleted"); _igvn.remove_dead_node(init); } if (!always_slow) { // Remove allocation node and return. // Size is a non-negative constant -> no initial check needed -> directly to fast path. // Also, no usages -> empty fast path -> no fall out to slow path -> nothing left. yank_alloc_node(alloc); return; } } Best regards, Vladimir Ivanov > > > Best regards, > Nils Eliasson > > On 2020-01-24 09:36, Nils Eliasson wrote: >> Hi, >> >> This patch improves expand_allocate_common when there is no use of the >> allocation. >> >> Three cases are improved: >> >> - With unknown allocation length - don't expand the fast path. No >> allocation is needed when it isn't used. NegativeArraySizeException >> will still be caught by the slowpath. >> - With known length inside the legal range - No fast path or slow path >> is needed. The allocation node is removed. >> - With known length outside the legal range - only a slow path is needed. 
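[The three cases above can be sketched as a small decision table — illustrative C++ only, not the JDK's expand_allocate_common; the function name and the 'max_legal_length' stand-in are invented.]

```cpp
#include <cassert>
#include <string>

// Expansion strategy for an array allocation whose result has no uses,
// following the three cases described in the thread above.
std::string expansion_for_unused_array_alloc(bool length_known, long length,
                                             long max_legal_length) {
    if (!length_known) {
        // Unknown length: skip the fast path entirely; the slow path still
        // raises NegativeArraySizeException when needed.
        return "slow-path-only";
    }
    if (length >= 0 && length <= max_legal_length) {
        // Known length inside the legal range: nothing can fail and the
        // result is unused, so the allocation node can simply be removed.
        return "remove-allocation";
    }
    // Known length outside the legal range: only the throwing slow path
    // is required.
    return "slow-path-only";
}
```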
>> I also refactored the code a bit, keeping the general structure for >> easy diff, but extracting some routines to make it more readable. >> >> Bug: https://bugs.openjdk.java.net/browse/JDK-8237581 >> Webrev: http://cr.openjdk.java.net/~neliasso/8237581/webrev.03/ >> >> Please review! >> >> Best regards, >> Nils Eliasson >> >> > From ekaterina.pavlova at oracle.com Wed Jan 29 18:34:20 2020 From: ekaterina.pavlova at oracle.com (Ekaterina Pavlova) Date: Wed, 29 Jan 2020 10:34:20 -0800 Subject: RFR(T/XS): 8236211 [Graal] compiler/graalunit/GraphTest.java is skipped in all testing In-Reply-To: <03BD513C-E719-4994-ACC7-4FF284D273C5@oracle.com> References: <7eb0c680-08fd-211d-5ee3-ce52cbb468e3@oracle.com> <03BD513C-E719-4994-ACC7-4FF284D273C5@oracle.com> Message-ID: <19402dd4-1e2a-eb38-6b94-91b92b3d219b@oracle.com> Thanks Igor, pushed. -katya On 1/28/20 8:17 PM, Igor Ignatyev wrote: > LGTM. > > -- Igor > >> On Jan 28, 2020, at 10:33 AM, Ekaterina Pavlova wrote: >> >> Hi, >> >> can I please get one review? >> >> thanks, >> -katya >> >> On 1/24/20 8:12 AM, Ekaterina Pavlova wrote: >>> Hi All, >>> Please review the small patch which adds the --add-opens required for >>> org.graalvm.compiler.graph tests (compiler/graalunit/GraphTest.java). >>> Also removed unnecessary "@requires vm.graal.enabled" from compiler/graalunit/NodesTest.java. >>> Both tests will start running in graal tiers after this fix >>> (these tests are now skipped because graalunit tests are tested only in non Graal as JIT mode).
>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8236211 >>> webrev: http://cr.openjdk.java.net/~epavlova/8236211/webrev.00/index.html >>> testing: graal tiers >>> thanks, >>> -katya >> > From igor.ignatyev at oracle.com Wed Jan 29 19:20:31 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Wed, 29 Jan 2020 11:20:31 -0800 Subject: RFR(T) : 8237953 : vmTestbase/jit/tiered/Test.java failure after JDK-8237798 Message-ID: http://cr.openjdk.java.net/~iignatyev//8237953/webrev.00 > 2 lines changed: 2 ins; 0 del; 0 mod; Hi all, could you please review the small fix for vmTestbase/jit/tiered/Test? "TieredCompilation is disabled" case fails when the test is run w/ a JVMCI compiler b/c TieredCompilation is reenabled. to make test more robust, the patch explicitly sets CompilationMode to "high-only" in -TieredCompilation case and to "default" in +TieredCompilation. the patch depends on 8238049[1-3]. webrev: http://cr.openjdk.java.net/~iignatyev//8237953/webrev.00 JBS: https://bugs.openjdk.java.net/browse/JDK-8237953 testing: vmTestbase/jit/tiered/Test on {linux, windows, macosx}-x64 w/ and w/o JVMCI compiler [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2020-January/036850.html [2] https://bugs.openjdk.java.net/browse/JDK-8238049 [3] http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/ Thanks, -- Igor From vladimir.kozlov at oracle.com Wed Jan 29 19:23:45 2020 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Wed, 29 Jan 2020 11:23:45 -0800 Subject: RFR(T) : 8237953 : vmTestbase/jit/tiered/Test.java failure after JDK-8237798 In-Reply-To: References: Message-ID: <97704cc4-7a07-5d5b-3c5f-435f82d02ff9@oracle.com> Good. Vladimir K On 1/29/20 11:20 AM, Igor Ignatyev wrote: > http://cr.openjdk.java.net/~iignatyev//8237953/webrev.00 >> 2 lines changed: 2 ins; 0 del; 0 mod; > > > Hi all, > > could you please review the small fix for vmTestbase/jit/tiered/Test? 
"TieredCompilation is disabled" case fails when the test is run w/ a JVMCI compiler b/c TieredCompilation is reenabled. to make test more robust, the patch explicitly sets CompilationMode to "high-only" in -TieredCompilation case and to "default" in +TieredCompilation. the patch depends on 8238049[1-3]. > > webrev: http://cr.openjdk.java.net/~iignatyev//8237953/webrev.00 > JBS: https://bugs.openjdk.java.net/browse/JDK-8237953 > testing: vmTestbase/jit/tiered/Test on {linux, windows, macosx}-x64 w/ and w/o JVMCI compiler > > [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2020-January/036850.html > [2] https://bugs.openjdk.java.net/browse/JDK-8238049 > [3] http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/ > > Thanks, > -- Igor > From vladimir.kozlov at oracle.com Wed Jan 29 20:08:27 2020 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Wed, 29 Jan 2020 12:08:27 -0800 Subject: RFR(XS): CTW: C2 compilation fails with "malformed control flow" In-Reply-To: References: <87ftfywt4v.fsf@redhat.com> Message-ID: +1 Vladimir K On 1/29/20 5:45 AM, Vladimir Ivanov wrote: > >> http://cr.openjdk.java.net/~roland/8237951/webrev.00/ > > Looks good. > > Best regards, > Vladimir Ivanov > >> CatchNode::Value() makes the fallthrough path of a virtual or interface >> call dead if the receiver is null. This failure occurs because the >> fallthrough is erroneously killed during CCP. When CCP is executed, the >> type of the receiver is first set to top which causes CatchNode::Value() >> to make the fallthrough path dead then the type of the receiver changes >> to non null but the CatchNode is not reprocessed by CCP. That happens >> because: >> >> 1758???????? // If we changed the receiver type to a call, we need to revisit >> 1759???????? // the Catch following the call.? It's looking for a non-NULL >> 1760???????? // receiver to know when to enable the regular fall-through path >> 1761???????? // in addition to the NullPtrException path >> 1762???????? 
if (m->is_Call()) { >> 1763           for (DUIterator_Fast i2max, i2 = m->fast_outs(i2max); i2 < i2max; i2++) { >> 1764             Node* p = m->fast_out(i2);  // Propagate changes to uses >> 1765             if (p->is_Proj() && p->as_Proj()->_con == TypeFunc::Control && p->outcnt() == 1) { >> 1766               worklist.push(p->unique_out()); >> >> the ProjNode has more than one use: a Load is pinned there. I could >> write a test case that causes a Load to be pinned on a Proj after a Call >> but couldn't have the test case trigger the failure (most likely because >> node processing order during CCP matters). The fix is to enqueue the >> CatchNode even if p doesn't have a single use. >> >> Note: this triggered with Shenandoah but is not specific to Shenandoah. >> >> Roland. >> From igor.veresov at oracle.com Wed Jan 29 21:10:38 2020 From: igor.veresov at oracle.com (Igor Veresov) Date: Wed, 29 Jan 2020 13:10:38 -0800 Subject: RFR(S) : 8238049 : TieredCompilation shouldn't be re-enabled if CompilationMode is high-only In-Reply-To: <9A0AFE73-06BA-447A-B75E-72C4B3DF5238@oracle.com> References: <9A0AFE73-06BA-447A-B75E-72C4B3DF5238@oracle.com> Message-ID: <83B1BB7C-C70F-4E71-AA6E-340EA8774416@oracle.com> Setting CompilationMode actually requires tiered to be on in order to be meaningful. It's a mode of tiered compilation. The ultimate goal is to remove the old compilation policies, so long term, assume that tiered is always on. It seems to me that the command line you quoted is actually working as intended. Does it cause any problems? igor > On Jan 28, 2020, at 2:52 PM, Igor Ignatyev wrote: > > http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/ >> 9 lines changed: 3 ins; 2 del; 4 mod; > > > Hi all, > > could you please review the small and trivial-ish patch?
> from JBS: >> currently, if you run java w/ -XX:-TieredCompilation -XX:+UseJVMCICompiler, TieredCompilation will be set back to true and change CompilationMode from "default" to "high-only-quick-internal" (w/o actually changing the value of the flag though). if you run it w/ '-XX:-TieredCompilation -XX:+UseJVMCICompiler -XX:CompilationMode=high-only', TieredCompilation will be changed to true, but CompilationMode will remain high-only, so effectively it just needlessly enables TieredCompilation. > > the patch not only leaves TieredCompilation disabled when CompilationMode is high-only, but also changes the actual value of CompilationMode flag, so it will be properly reported. > > JBS: https://bugs.openjdk.java.net/browse/JDK-8238049 > webrev: http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/ > testing: manual by java -XX:+PrintFlagsFinal w/ {-TieredCompilation,+TieredCompilation}x{CompilationMode=default,CompilationMode=high-only,""} > > Thanks, > -- Igor From igor.ignatyev at oracle.com Wed Jan 29 23:52:48 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Wed, 29 Jan 2020 15:52:48 -0800 Subject: RFR(S) : 8238049 : TieredCompilation shouldn't be re-enabled if CompilationMode is high-only Message-ID: <9E743EB8-0D57-43A8-823D-FB6F6C6CEE59@oracle.com> > On Jan 29, 2020, at 1:10 PM, Igor Veresov wrote: > > Setting CompilationMode actually requires tiered to be on in order to be meaningful. It's a mode of tiered compilation.
the test checks that a tiered-only flag has no effect when tiered compilation is disabled, but w/ JVMCI compiler being enabled there is no way to get tiered compilation really disabled, and from end users' perspective, I'd expect tiered not being reenabled when CompilationMode is high-only and the warning message (Turning on tiered compilation and disabling intermediate compilation levels instead) is irrelevant. [1] https://bugs.openjdk.java.net/browse/JDK-8237953 [2] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2020-January/036863.html -- Igor > > > igor > > > >> On Jan 28, 2020, at 2:52 PM, Igor Ignatyev wrote: >> >> http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/ >>> 9 lines changed: 3 ins; 2 del; 4 mod; >> >> >> Hi all, >> >> could you please review the small and trivial-ish patch? >> from JBS: >>> currently, if you run java w/ -XX:-TieredCompilation -XX:+UseJVMCICompiler, TieredCompilation will be set back to true and change CompilationMode from "default" to "high-only-quick-internal" (w/o actually changing the value of the flag though). if you run it w/ '-XX:-TieredCompilation -XX:+UseJVMCICompiler -XX:CompilationMode=high-only', TieredCompilation will be changed to true, but CompilationMode will remain high-only, so effectively it just needlessly enables TieredCompilation. >> >> the patch not only leaves TieredCompilation disabled when CompilationMode is high-only, but also changes the actual value of CompilationMode flag, so it will be properly reported.
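[The flag interaction quoted from JBS above can be modeled as a small standalone function — illustrative C++ only, not the real HotSpot argument processing; the struct and function names are invented. It encodes the pre-patch behaviour: with a JVMCI compiler, -XX:-TieredCompilation is turned back on, and the effective CompilationMode becomes "high-only-quick-internal" unless it was explicitly set to "high-only".]

```cpp
#include <cassert>
#include <string>

// Resolved state of the two flags after argument processing (toy model).
struct ToyFlags {
    bool tiered;
    std::string mode;
};

ToyFlags resolve(bool tiered, bool use_jvmci_compiler, std::string mode) {
    if (!tiered && use_jvmci_compiler) {
        if (mode != "high-only") {
            mode = "high-only-quick-internal";
        }
        return {true, mode};  // tiered is re-enabled either way (pre-patch)
    }
    return {tiered, mode};
}
```

[Per the thread, the (later withdrawn) 8238049 patch would instead have kept tiered off in the high-only branch.]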
>> >> JBS: https://bugs.openjdk.java.net/browse/JDK-8238049 >> webrev: http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/ >> testing: manual by java -XX:+PrintFlagsFinal w/ {-TieredCompilation,+TieredCompilation}x{CompilationMode=default,CompilationMode=high-only,""} >> >> Thanks, >> -- Igor > From igor.ignatyev at oracle.com Thu Jan 30 00:49:13 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Wed, 29 Jan 2020 16:49:13 -0800 Subject: RFR(S) : 8237787 : rewrite vmTestbase/vm/compiler/CodeCacheInfo* from shell to java In-Reply-To: <686A9255-BF7A-4956-BFD4-C969203347C6@oracle.com> References: <8AB60BC1-0228-43FA-9D44-3E1DDAF8A129@oracle.com> <4d1d2e33-019d-36ed-9fed-b4bc177d59bc@oracle.com> <4F60F060-7848-4050-A47F-C97590ECBFF4@oracle.com> <9079dc9a-65d4-92fd-513f-19f87bfc6404@oracle.com> <686A9255-BF7A-4956-BFD4-C969203347C6@oracle.com> Message-ID: <092B7A89-3DBF-4052-A1F2-C4C03DF72A02@oracle.com> > @list > can I get a 2nd Review? ping? -- Igor > On Jan 27, 2020, at 3:39 PM, Igor Ignatyev wrote: > > @Katya, > > thank you! > > @list > can I get a 2nd Review? > > -- Igor > >> On Jan 24, 2020, at 3:26 PM, Ekaterina Pavlova wrote: >> >> Seems to be good now. >> >> thanks, >> -katya >> >> On 1/24/20 3:06 PM, Igor Ignatyev wrote: >>> no, you are right. it seems I uploaded not the latest version of my patch. fixed and uploaded .01 which has -Xcomp -- http://cr.openjdk.java.net/~iignatyev//8237787/webrev.01 >>> -- Igor >>>> On Jan 24, 2020, at 8:18 AM, Ekaterina Pavlova > wrote: >>>> >>>> On 1/23/20 2:21 PM, Igor Ignatyev wrote: >>>>> http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 >>>>>> 177 lines changed: 56 ins; 117 del; 4 mod; 7 >>>>> Hi all, >>>>> could you please review the patch which rewrites vmTestbase/vm/compiler/CodeCacheInfo and CodeCacheInfoOnCompilation tests from shell to java? 
>>>>> the only difference b/w java and shell versions are: >>>>> - instead of skipping execution if there isn't -Xcomp, CodeCacheInfoOnCompilation adds -Xcomp to command line >>>> >>>> Igor, I don't see -Xcomp is added. Am I missing something? >>>> >>>>> - instead of checking the value of SegmentedCodeCache flag, CodeCacheInfo runs two cases w/ +SegmentedCodeCache and w/ -SegmentedCodeCache >>>>> webrev: http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 >>>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8237787 >>>>> testing: the changed tests on linux-x64, windows-x64, and macosx-x64 >>>>> Thanks, >>>>> -- Igor >>>> >> > From vladimir.kozlov at oracle.com Thu Jan 30 02:08:41 2020 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Wed, 29 Jan 2020 18:08:41 -0800 Subject: RFR(S) : 8237787 : rewrite vmTestbase/vm/compiler/CodeCacheInfo* from shell to java In-Reply-To: <092B7A89-3DBF-4052-A1F2-C4C03DF72A02@oracle.com> References: <8AB60BC1-0228-43FA-9D44-3E1DDAF8A129@oracle.com> <4d1d2e33-019d-36ed-9fed-b4bc177d59bc@oracle.com> <4F60F060-7848-4050-A47F-C97590ECBFF4@oracle.com> <9079dc9a-65d4-92fd-513f-19f87bfc6404@oracle.com> <686A9255-BF7A-4956-BFD4-C969203347C6@oracle.com> <092B7A89-3DBF-4052-A1F2-C4C03DF72A02@oracle.com> Message-ID: <9a9609e7-dd4a-16ed-73c5-29cbfa373f27@oracle.com> Seems good. Thanks, Vladimir On 1/29/20 4:49 PM, Igor Ignatyev wrote: >> @list >> can I get a 2nd Review? > ping? > > -- Igor > >> On Jan 27, 2020, at 3:39 PM, Igor Ignatyev wrote: >> >> @Katya, >> >> thank you! >> >> @list >> can I get a 2nd Review? >> >> -- Igor >> >>> On Jan 24, 2020, at 3:26 PM, Ekaterina Pavlova wrote: >>> >>> Seems to be good now. >>> >>> thanks, >>> -katya >>> >>> On 1/24/20 3:06 PM, Igor Ignatyev wrote: >>>> no, you are right. it seems I uploaded not the latest version of my patch. 
fixed and uploaded .01 which has -Xcomp -- http://cr.openjdk.java.net/~iignatyev//8237787/webrev.01 >>>> -- Igor >>>>> On Jan 24, 2020, at 8:18 AM, Ekaterina Pavlova > wrote: >>>>> >>>>> On 1/23/20 2:21 PM, Igor Ignatyev wrote: >>>>>> http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 >>>>>>> 177 lines changed: 56 ins; 117 del; 4 mod; 7 >>>>>> Hi all, >>>>>> could you please review the patch which rewrites vmTestbase/vm/compiler/CodeCacheInfo and CodeCacheInfoOnCompilation tests from shell to java? >>>>>> the only difference b/w java and shell versions are: >>>>>> - instead of skipping execution if there isn't -Xcomp, CodeCacheInfoOnCompilation adds -Xcomp to command line >>>>> >>>>> Igor, I don't see -Xcomp is added. Am I missing something? >>>>> >>>>>> - instead of checking the value of SegmentedCodeCache flag, CodeCacheInfo runs two cases w/ +SegmentedCodeCache and w/ -SegmentedCodeCache >>>>>> webrev: http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 >>>>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8237787 >>>>>> testing: the changed tests on linux-x64, windows-x64, and macosx-x64 >>>>>> Thanks, >>>>>> -- Igor >>>>> >>> >> > From igor.veresov at oracle.com Thu Jan 30 02:09:56 2020 From: igor.veresov at oracle.com (Igor Veresov) Date: Wed, 29 Jan 2020 18:09:56 -0800 Subject: RFR(S) : 8238049 : TieredCompilation shouldn't be re-enabled if CompilationMode is high-only In-Reply-To: <9E743EB8-0D57-43A8-823D-FB6F6C6CEE59@oracle.com> References: <9E743EB8-0D57-43A8-823D-FB6F6C6CEE59@oracle.com> Message-ID: <45C68B44-85E3-42E7-8352-F2EFE612434E@oracle.com> No, that's not the case. -XX:CompilationMode=high-only switches on a mode within the tiered policy that tries to mimic "-XX:-TieredCompilation". So that instead of turning tiered off we can just use -XX:CompilationMode=high-only. It actually has some tiers, since it supports AOT. Long term goal is to deprecate the non-tiered policy.
When that happens -XX:-TieredCompilation will just imply -XX:CompilationMode=high-only and there will be just one compilation policy (tiered). So, in preparation to this (especially with new tests) we should probably start using -XX:CompilationMode=high-only instead of -XX:-TieredCompilation. igor > On Jan 29, 2020, at 3:52 PM, Igor Ignatyev > wrote: > > > >> On Jan 29, 2020, at 1:10 PM, Igor Veresov > wrote: >> >> Setting CompilationMode actually requires tiered to be on in order to be meaningful. It's a mode of tiered compilation. > I'd not call 'CompilationMode=high-only' a mode of tiered compilation, as there are no tiers and most of the machinery of tiered compilation policies is useless. > >> The ultimate goal is to remove the old compilation policies, so long term, assume that tiered is always on. >> It seems to me that the command line you quoted is actually working as intended. Does it cause any problems? > > in a sense, I was trying to fix 8237953[1,2], where vmTestbase/jit/tiered/Test.java fails w/ graal enabled. the test checks that a tiered-only flag has no effect when tiered compilation is disabled, but w/ JVMCI compiler being enabled there is no way to get tiered compilation really disabled, and from end users' perspective, I'd expect tiered not being reenabled when CompilationMode is high-only and the warning message (Turning on tiered compilation and disabling intermediate compilation levels instead) is irrelevant. > > > [1] https://bugs.openjdk.java.net/browse/JDK-8237953 > [2] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2020-January/036863.html > > -- Igor > >> >> >> igor >> >> >> >>> On Jan 28, 2020, at 2:52 PM, Igor Ignatyev > wrote: >>> >>> http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/ >>>> 9 lines changed: 3 ins; 2 del; 4 mod; >>> >>> >>> Hi all, >>> >>> could you please review the small and trivial-ish patch?
>>> from JBS: >>>> currently, if you run java w/ -XX:-TieredCompilation -XX:+UseJVMCICompiler, TieredCompilation will be set back to true and change CompilationMode from "default" to "high-only-quick-internal" (w/o actually changing the value of the flag though). if you run it w/ '-XX:-TieredCompilation -XX:+UseJVMCICompiler -XX:CompilationMode=high-only', TieredCompilation will be changed to true, but CompilationMode will remain high-only, so effectively it just needlessly enables TieredCompilation. >>> >>> the patch not only leaves TieredCompilation disabled when CompilationMode is high-only, but also changes the actual value of CompilationMode flag, so it will be properly reported. >>> >>> JBS: https://bugs.openjdk.java.net/browse/JDK-8238049 >>> webrev: http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/ >>> testing: manual by java -XX:+PrintFlagsFinal w/ {-TieredCompilation,+TieredCompilation}x{CompilationMode=default,CompilationMode=high-only,""} >>> >>> Thanks, >>> -- Igor From igor.ignatyev at oracle.com Thu Jan 30 03:45:01 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Wed, 29 Jan 2020 19:45:01 -0800 Subject: RFR(S) : 8238049 : TieredCompilation shouldn't be re-enabled if CompilationMode is high-only In-Reply-To: <45C68B44-85E3-42E7-8352-F2EFE612434E@oracle.com> References: <9E743EB8-0D57-43A8-823D-FB6F6C6CEE59@oracle.com> <45C68B44-85E3-42E7-8352-F2EFE612434E@oracle.com> Message-ID: <708A2C05-CB46-4401-BCBA-14D98AC0DCAB@oracle.com> ok... then I'm withdrawing this RFR, closing 8238049 as NAI and will fix 8237953 differently. -- Igor > On Jan 29, 2020, at 6:09 PM, Igor Veresov wrote: > > No, that's not the case. -XX:CompilationMode=high-only switches on a mode within the tiered policy that tries to mimic "-XX:-TieredCompilation". > So that instead of turning tiered off we can just use -XX:CompilationMode=high-only. It actually has some tiers, since it supports AOT. > > Long term goal is to deprecate the non-tiered policy.
When that happens -XX:-TieredCompilation will just imply -XX:CompilationMode=high-only and there will be just one compilation policy (tiered). So, in preparation to this (especially with new tests) we should probably start using -XX:CompilationMode=high-only instead of -XX:-TieredCompilation. > > > igor > > > >> On Jan 29, 2020, at 3:52 PM, Igor Ignatyev > wrote: >> >> >> >>> On Jan 29, 2020, at 1:10 PM, Igor Veresov > wrote: >>> >>> Setting CompilationMode actually requires tiered to be on in order to be meaningful. It's a mode of tiered compilation. >> I'd not call 'CompilationMode=high-only' a mode of tiered compilation, as there are no tiers and most of the machinery of tiered compilation policies is useless. >> >>> The ultimate goal is to remove the old compilation policies, so long term, assume that tiered is always on. >>> It seems to me that the command line you quoted is actually working as intended. Does it cause any problems? >> >> in a sense, I was trying to fix 8237953[1,2], where vmTestbase/jit/tiered/Test.java fails w/ graal enabled. the test checks that a tiered-only flag has no effect when tiered compilation is disabled, but w/ JVMCI compiler being enabled there is no way to get tiered compilation really disabled, and from end users' perspective, I'd expect tiered not being reenabled when CompilationMode is high-only and the warning message (Turning on tiered compilation and disabling intermediate compilation levels instead) is irrelevant. >> >> >> [1] https://bugs.openjdk.java.net/browse/JDK-8237953 >> [2] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2020-January/036863.html >> >> -- Igor >> >>> >>> >>> igor >>> >>> >>> >>>> On Jan 28, 2020, at 2:52 PM, Igor Ignatyev > wrote: >>>> >>>> http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/ >>>>> 9 lines changed: 3 ins; 2 del; 4 mod; >>>> >>>> >>>> Hi all, >>>> >>>> could you please review the small and trivial-ish patch?
>>>> from JBS: >>>>> currently, if you run java w/ -XX:-TieredCompilation -XX:+UseJVMCICompiler, TieredCompilation will be set back to true and change CompilationMode from "default" to "high-only-quick-internal" (w/o actually changing the value of the flag though). if you run it w/ '-XX:-TieredCompilation -XX:+UseJVMCICompile -XX:CompilationMode=high-only', TieredCompilation will be changed to true, but CompilationMode will remain high-only, so effectively it just needlessly enables TieredCompilation. >>>> >>>> the patch not only leaves TieredCompilation disabled when CompilationMode is high-only, but also changes the actual value of CompilationMode flag, so it will be properly reported. >>>> >>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8238049 >>>> webrev: http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/ >>>> testing: manual by java -XX:+PrintFlagsFinal w/ {-TieredCompilation,+TieredCompilation}x{CompilationMode=default,CompilationMode=high-only,""} >>>> >>>> Thanks, >>>> -- Igor From igor.ignatyev at oracle.com Thu Jan 30 04:03:37 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Wed, 29 Jan 2020 20:03:37 -0800 Subject: RFR(T) : 8237953 : vmTestbase/jit/tiered/Test.java failure after JDK-8237798 In-Reply-To: <97704cc4-7a07-5d5b-3c5f-435f82d02ff9@oracle.com> References: <97704cc4-7a07-5d5b-3c5f-435f82d02ff9@oracle.com> Message-ID: as 8238049 got withdrawn, this is to be solved differently. http://cr.openjdk.java.net/~iignatyev//8237953/webrev.01 adds @requires to exclude the test from execution when JVMCI compiler is enabled. -- Igor > On Jan 29, 2020, at 11:23 AM, Vladimir Kozlov wrote: > > Good. > > Vladimir K > > On 1/29/20 11:20 AM, Igor Ignatyev wrote: >> http://cr.openjdk.java.net/~iignatyev//8237953/webrev.00 >>> 2 lines changed: 2 ins; 0 del; 0 mod; >> Hi all, >> could you please review the small fix for vmTestbase/jit/tiered/Test? 
"TieredCompilation is disabled" case fails when the test is run w/ a JVMCI compiler b/c TieredCompilation is reenabled. to make test more robust, the patch explicitly sets CompilationMode to "high-only" in -TieredCompilation case and to "default" in +TieredCompilation. the patch depends on 8238049[1-3]. >> webrev: http://cr.openjdk.java.net/~iignatyev//8237953/webrev.00 >> JBS: https://bugs.openjdk.java.net/browse/JDK-8237953 >> testing: vmTestbase/jit/tiered/Test on {linux, windows, macosx}-x64 w/ and w/o JVMCI compiler >> [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2020-January/036850.html >> [2] https://bugs.openjdk.java.net/browse/JDK-8238049 >> [3] http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/ >> Thanks, >> -- Igor From igor.ignatyev at oracle.com Thu Jan 30 04:27:44 2020 From: igor.ignatyev at oracle.com (Igor Ignatev) Date: Wed, 29 Jan 2020 20:27:44 -0800 Subject: RFR(S) : 8237787 : rewrite vmTestbase/vm/compiler/CodeCacheInfo* from shell to java In-Reply-To: <9a9609e7-dd4a-16ed-73c5-29cbfa373f27@oracle.com> References: <9a9609e7-dd4a-16ed-73c5-29cbfa373f27@oracle.com> Message-ID: Thanks Vladimir, pushed. ? Igor > On Jan 29, 2020, at 6:07 PM, Vladimir Kozlov wrote: > > ?Seems good. > > Thanks, > Vladimir > > On 1/29/20 4:49 PM, Igor Ignatyev wrote: >>> @list >>> can I get a 2nd Review? >> ping? >> -- Igor >>>> On Jan 27, 2020, at 3:39 PM, Igor Ignatyev wrote: >>> >>> @Katya, >>> >>> thank you! >>> >>> @list >>> can I get a 2nd Review? >>> >>> -- Igor >>> >>>> On Jan 24, 2020, at 3:26 PM, Ekaterina Pavlova wrote: >>>> >>>> Seems to be good now. >>>> >>>> thanks, >>>> -katya >>>> >>>> On 1/24/20 3:06 PM, Igor Ignatyev wrote: >>>>> no, you are right. it seems I uploaded not the latest version of my patch. 
fixed and uploaded .01 which has -Xcomp -- http://cr.openjdk.java.net/~iignatyev//8237787/webrev.01 >>>>> -- Igor >>>>>> On Jan 24, 2020, at 8:18 AM, Ekaterina Pavlova > wrote: >>>>>> >>>>>> On 1/23/20 2:21 PM, Igor Ignatyev wrote: >>>>>>> http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 >>>>>>>> 177 lines changed: 56 ins; 117 del; 4 mod; 7 >>>>>>> Hi all, >>>>>>> could you please review the patch which rewrites vmTestbase/vm/compiler/CodeCacheInfo and CodeCacheInfoOnCompilation tests from shell to java? >>>>>>> the only difference b/w java and shell versions are: >>>>>>> - instead of skipping execution if there isn't -Xcomp, CodeCacheInfoOnCompilation adds -Xcomp to command line >>>>>> >>>>>> Igor, I don't see -Xcomp is added. Am I missing something? >>>>>> >>>>>>> - instead of checking the value of SegmentedCodeCache flag, CodeCacheInfo runs two cases w/ +SegmentedCodeCache and w/ -SegmentedCodeCache >>>>>>> webrev: http://cr.openjdk.java.net/~iignatyev//8237787/webrev.00 >>>>>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8237787 >>>>>>> testing: the changed tests on linux-x64, windows-x64, and macosx-x64 >>>>>>> Thanks, >>>>>>> -- Igor >>>>>> >>>> >>> From nils.eliasson at oracle.com Thu Jan 30 10:41:25 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Thu, 30 Jan 2020 11:41:25 +0100 Subject: [15] RFR(S): 8235332: TestInstanceCloneAsLoadsStores.java fails with -XX:+StressGCM In-Reply-To: References: <2ef5e2fd-d73c-5e1f-e50f-8ea36cd24029@oracle.com> <87zheqyuj2.fsf@redhat.com> <3f4d0542-5d72-dca4-d14d-7a13402fbf1c@oracle.com> <87eew1yqum.fsf@redhat.com> Message-ID: <6c85c632-350a-2136-8782-b30c6ca20fcb@oracle.com> Hi Christian, Change looks good, Thanks for fixing! // Nils On 2020-01-15 12:02, Christian Hagedorn wrote: >>>> If there's no GC barrier then: >>>> >>>> Node* control_proj_ac = bs->step_over_gc_barrier(proj_in->in(0)); >>>> >>>> control_proj_ac is not a projection, it's proj_in->in(0) so the test >>>> below can never succeed? 
>>> >>> If there is no GC barrier, then proj_in is a membar with its control >>> input being a control projection from the ArrayCopyNode. >>> >>> As an example, I have taken the test case TestEliminatedCloneBadMemEdge >>> [1] which does not disable ReduceInitialCardMarks and has no GC >>> barrier. >>> We have the following situation at [2]: >>> >>> proj_in is 132 MemBarCPUOrder. The original code then looked at the >>> MergeMem 100 input following in(Compile::AliasIdxRaw) which is 101 Proj >>> and its input 98 ArrayCopy. This passes the check. >>> >>> My fix now directly looks at the control input of 132 MemBarCPUOrder >>> which is the projection 99 Proj coming from the 98 ArrayCopy. This also >>> passes the check. >> >> Ah! Right. I wonder if there's a reason why the current code follows the >> control graph (as you do) instead of the memory graph. I couldn't find >> one and your change looks good to me. > > Thank you for your review Roland! Was wondering about that, too. > > Best regards, > Christian From christian.hagedorn at oracle.com Thu Jan 30 12:55:44 2020 From: christian.hagedorn at oracle.com (Christian Hagedorn) Date: Thu, 30 Jan 2020 13:55:44 +0100 Subject: [15] RFR(S): 8235332: TestInstanceCloneAsLoadsStores.java fails with -XX:+StressGCM In-Reply-To: <6c85c632-350a-2136-8782-b30c6ca20fcb@oracle.com> References: <2ef5e2fd-d73c-5e1f-e50f-8ea36cd24029@oracle.com> <87zheqyuj2.fsf@redhat.com> <3f4d0542-5d72-dca4-d14d-7a13402fbf1c@oracle.com> <87eew1yqum.fsf@redhat.com> <6c85c632-350a-2136-8782-b30c6ca20fcb@oracle.com> Message-ID: <5462f46b-4d5f-85ee-2ae8-45f8fdef9981@oracle.com> Hi Nils Thank you for the review! Best regards, Christian On 30.01.20 11:41, Nils Eliasson wrote: > Hi Christian, > > Change looks good, > > Thanks for fixing! 
> > // Nils > > On 2020-01-15 12:02, Christian Hagedorn wrote: >>>>> If there's no GC barrier then: >>>>> >>>>> Node* control_proj_ac = bs->step_over_gc_barrier(proj_in->in(0)); >>>>> >>>>> control_proj_ac is not a projection, it's proj_in->in(0) so the test >>>>> below can never succeed? >>>> >>>> If there is no GC barrier, then proj_in is a membar with its control >>>> input being a control projection from the ArrayCopyNode. >>>> >>>> As an example, I have taken the test case TestEliminatedCloneBadMemEdge >>>> [1] which does not disable ReduceInitialCardMarks and has no GC >>>> barrier. >>>> We have the following situation at [2]: >>>> >>>> proj_in is 132 MemBarCPUOrder. The original code then looked at the >>>> MergeMem 100 input following in(Compile::AliasIdxRaw) which is 101 Proj >>>> and its input 98 ArrayCopy. This passes the check. >>>> >>>> My fix now directly looks at the control input of 132 MemBarCPUOrder >>>> which is the projection 99 Proj coming from the 98 ArrayCopy. This also >>>> passes the check. >>> >>> Ah! Right. I wonder if there's a reason why the current code follows the >>> control graph (as you do) instead of the memory graph. I couldn't find >>> one and your change looks good to me. >> >> Thank you for your review Roland! Was wondering about that, too. >> >> Best regards, >> Christian > From igor.veresov at oracle.com Thu Jan 30 14:44:46 2020 From: igor.veresov at oracle.com (Igor Veresov) Date: Thu, 30 Jan 2020 06:44:46 -0800 Subject: RFR(T) : 8237953 : vmTestbase/jit/tiered/Test.java failure after JDK-8237798 In-Reply-To: References: <97704cc4-7a07-5d5b-3c5f-435f82d02ff9@oracle.com> Message-ID: <80190DFD-BE7A-4C5A-885B-CBEEB8D39EBE@oracle.com> Looks good. igor > On Jan 29, 2020, at 8:03 PM, Igor Ignatyev wrote: > > as 8238049 got withdrawn, this is to be solved differently. http://cr.openjdk.java.net/~iignatyev//8237953/webrev.01 adds @requires to exclude the test from execution when JVMCI compiler is enabled. 
> > -- Igor > >> On Jan 29, 2020, at 11:23 AM, Vladimir Kozlov wrote: >> >> Good. >> >> Vladimir K >> >> On 1/29/20 11:20 AM, Igor Ignatyev wrote: >>> http://cr.openjdk.java.net/~iignatyev//8237953/webrev.00 >>>> 2 lines changed: 2 ins; 0 del; 0 mod; >>> Hi all, >>> could you please review the small fix for vmTestbase/jit/tiered/Test? "TieredCompilation is disabled" case fails when the test is run w/ a JVMCI compiler b/c TieredCompilation is reenabled. to make test more robust, the patch explicitly sets CompilationMode to "high-only" in -TieredCompilation case and to "default" in +TieredCompilation. the patch depends on 8238049[1-3]. >>> webrev: http://cr.openjdk.java.net/~iignatyev//8237953/webrev.00 >>> JBS: https://bugs.openjdk.java.net/browse/JDK-8237953 >>> testing: vmTestbase/jit/tiered/Test on {linux, windows, macosx}-x64 w/ and w/o JVMCI compiler >>> [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2020-January/036850.html >>> [2] https://bugs.openjdk.java.net/browse/JDK-8238049 >>> [3] http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/ >>> Thanks, >>> -- Igor > From nils.eliasson at oracle.com Thu Jan 30 15:19:14 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Thu, 30 Jan 2020 16:19:14 +0100 Subject: [15] RFR (M): 8237581: Improve allocation expansion In-Reply-To: References: <2d79a359-6b0b-c576-518c-388a8ffc4890@oracle.com> <921f00aa-ecbc-f838-5a4b-6acb6b62215d@oracle.com> Message-ID: <31615df3-3f1d-9a5e-3308-37e652590b80@oracle.com> Hi Vladimir, Thanks for having a look. On 2020-01-29 19:18, Vladimir Ivanov wrote: > Thanks for separating the fix from the refactoring. > >> Refactoring: >> >> http://cr.openjdk.java.net/~neliasso/8237581/webrev.refactor/ > > Looks good. > >> The actual change (on top of the refactoring) >> >> http://cr.openjdk.java.net/~neliasso/8237581/webrev.change/ > > Overall, looks good. > > I'd prefer to see yank_alloc_node(alloc) case separated. > > What do you think about the following? 
>
>  if (!allocation_has_use) {
>    InitializeNode* init = alloc->initialization();
>    if (init != NULL) {
>      yank_initalize_node(init);
>      assert(init->outcnt() == 0, "all uses must be deleted");
>      _igvn.remove_dead_node(init);
>    }
>    if (!always_slow) {
>      // Remove allocation node and return.
>      // Size is a non-negative constant -> no initial check needed -> directly to fast path.
>      // Also, no usages -> empty fast path -> no fall out to slow path -> nothing left.
>      yank_alloc_node(alloc);
>      return;
>    }
>  }

Good, but not complete. The "if (!always_slow) {" line also needs to check if 'initial_slow_test == NULL' (Meaning: We need no initial check, because we know the allocation size is positive and will fit in the TLAB, so we can go directly to the fast path), which is computed further down, so I moved the block down. Also, with the risk of complicating things - I changed '!always_slow' to 'expand_fast_path' because that conveys the meaning better. http://cr.openjdk.java.net/~neliasso/8237581/webrev.change.2/ Best regards, Nils Eliasson > > Best regards, > Vladimir Ivanov > >> >> >> Best regards, >> Nils Eliasson >> >> On 2020-01-24 09:36, Nils Eliasson wrote: >>> Hi, >>> >>> This patch improves expand_allocate_common when there is no use of >>> the allocation. >>> >>> Three cases are improved: >>> >>> - With unknown allocation length - don't expand the fast path. No >>> allocation is needed when it isn't used. NegativeArraySizeException >>> will still be caught by the slowpath. >>> - With known length inside the legal range - No fast path or slow >>> path is needed. The allocation node is removed. >>> - With known length outside the legal range - only a slow path is >>> needed. >>> >>> I also refactored the code a bit, keeping the general structure for >>> easy diff, but extracting some routines to make it more readable. 
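(The three cases described in the quoted patch summary are easy to picture with a small, self-contained Java sketch. This example is editorially added and hypothetical: it is not the ArrayAllocation micro from the webrev, and the class and method names are invented. It only illustrates the allocation shapes the change targets, namely arrays that are allocated but never used.)

```java
// Hypothetical sketch (not the actual webrev.micro): allocation shapes that
// expand_allocate_common can now handle better when the array is unused.
public class UnusedAllocationSketch {

    // Known, in-range constant length and no use of the array:
    // the allocation node can be removed entirely.
    static void constLength() {
        int[] unused = new int[128];
    }

    // Unknown length and no use of the array: the fast path need not be
    // expanded, and a negative length still throws via the slow path.
    static void varLength(int size) {
        int[] unused = new int[size];
    }

    public static void main(String[] args) {
        constLength();
        varLength(128);
        try {
            varLength(-1);
            throw new AssertionError("expected NegativeArraySizeException");
        } catch (NegativeArraySizeException expected) {
            System.out.println("ok");
        }
    }
}
```

Note that the optimization changes only what C2 emits, not what the program observes: the NegativeArraySizeException for a negative length must still be thrown even when the array itself is dead.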
>>> >>> Bug: https://bugs.openjdk.java.net/browse/JDK-8237581 >>> Webrev: http://cr.openjdk.java.net/~neliasso/8237581/webrev.03/ >>> >>> Please review! >>> >>> Best regards, >>> Nils Eliasson >>> >>> >> From vladimir.x.ivanov at oracle.com Thu Jan 30 15:55:17 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Thu, 30 Jan 2020 18:55:17 +0300 Subject: [15] RFR (M): 8237581: Improve allocation expansion In-Reply-To: <31615df3-3f1d-9a5e-3308-37e652590b80@oracle.com> References: <2d79a359-6b0b-c576-518c-388a8ffc4890@oracle.com> <921f00aa-ecbc-f838-5a4b-6acb6b62215d@oracle.com> <31615df3-3f1d-9a5e-3308-37e652590b80@oracle.com> Message-ID: <720ac716-80d6-fb4f-a95e-8dd8a4aae7e6@oracle.com> > Good, but not complete. The "if (!always_slow) {" line also needs to > check if 'initial_slow_test == NULL' (Meaning: We need no initial check, > because we know the allocation size is positive and will fit in the > TLAB, so we can go directly to the fast path), which is computed further > down, so I moved the block down. Looks good. > Also, with the risk of complicating things - I changed '!always_slow' to > 'expand_fast_path' because that conveys the meaning better. Not sure it improves readability, but I'm fine with it. > http://cr.openjdk.java.net/~neliasso/8237581/webrev.change.2/ Looks good. PS: made slight modifications to the comment: // Remove allocation node and return. // Size is a non-negative constant -> no initial check needed -> directly to fast path. // Also, no usages -> empty fast path -> no fall out to slow path -> nothing left. Best regards, Vladimir Ivanov >>> On 2020-01-24 09:36, Nils Eliasson wrote: >>>> Hi, >>>> >>>> This patch improves expand_allocate_common when there is no use of >>>> the allocation. >>>> >>>> Three cases are improved: >>>> >>>> - With unknown allocation length - don't expand the fast path. No >>>> allocation is needed when it isn't used. NegativeArraySizeException >>>> will still be caught by the slowpath. 
>>>> - With known length inside the legal range - No fast path or slow >>>> path is needed. The allocation node is removed. >>>> - With known length outside the legal range - only a slow path is >>>> needed. >>>> >>>> I also refactored the code a bit, keeping the general structure for >>>> easy diff, but extracting some routines to make it more readable. >>>> >>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8237581 >>>> Webrev: http://cr.openjdk.java.net/~neliasso/8237581/webrev.03/ >>>> >>>> Please review! >>>> >>>> Best regards, >>>> Nils Eliasson >>>> >>>> >>> > From vladimir.x.ivanov at oracle.com Thu Jan 30 16:55:25 2020 From: vladimir.x.ivanov at oracle.com (Vladimir Ivanov) Date: Thu, 30 Jan 2020 19:55:25 +0300 Subject: VM_EnterInterpOnlyMode: why make compiled methods on stack not_entrant? In-Reply-To: References: Message-ID: <7dd97d12-dc9f-99cc-384a-4d1ccfe516dd@oracle.com> Hi Richard, > I noticed that VM_EnterInterpOnlyMode makes all compiled methods on stack not_entrant. To me this > seems unnecessary. I think it would be sufficient to patch the return pc of compiled frames and > let them return to their deopt handler. Or what would be the reason to make the compiled methods > not_entrant? > > VM_EnterInterpOnlyMode::doit(): all compiled methods on stack that are not native methods get > marked. > > Deoptimization::deoptimize_all_marked(): all marked methods are made not_entrant. > > Maybe this is for historical reasons? If I remember correctly, many years ago, deoptimizing a > compiled frame used to require making the corresponding nmethod not_entrant. And making a nmethod > not_entrant meant overriding its code with a slide of nops that led to the deopt handler. > > I'd like to change this and just apply Deoptimization::deoptimize() on every compiled frame on > stack. I've done this locally and tested the change without issues. 
I'm not a JVMTI expert, but considering VM_EnterInterpOnlyMode is performed on per-thread basis [1], it looks like per-frame deoptimization (instead of per-nmethod) should be enough to do the job. So, from JIT-compiler perspective, your suggestion to use Deoptimization::deoptimize() instead of CompiledMethod::mark_for_deoptimization()/Deoptimization::deoptimize_all_marked() sounds good. PS: also, it looks like handshakes become a perfect fit for VM_EnterInterpOnlyMode: once you don't need to deoptimize on per-nmethod basis, there's no need to trigger global safepoint. Best regards, Vladimir Ivanov [1] http://hg.openjdk.java.net/jdk/jdk/file/tip/src/hotspot/share/prims/jvmtiEventController.cpp#l227 // If running in fullspeed mode, single stepping is implemented // as follows: first, the interpreter does not dispatch to // compiled code for threads that have single stepping enabled; // second, we deoptimize all methods on the thread's stack when // interpreted-only mode is enabled the first time for a given // thread (nothing to do if no Java frames yet). From igor.ignatyev at oracle.com Thu Jan 30 16:59:02 2020 From: igor.ignatyev at oracle.com (Igor Ignatyev) Date: Thu, 30 Jan 2020 08:59:02 -0800 Subject: RFR(T) : 8237953 : vmTestbase/jit/tiered/Test.java failure after JDK-8237798 In-Reply-To: <80190DFD-BE7A-4C5A-885B-CBEEB8D39EBE@oracle.com> References: <97704cc4-7a07-5d5b-3c5f-435f82d02ff9@oracle.com> <80190DFD-BE7A-4C5A-885B-CBEEB8D39EBE@oracle.com> Message-ID: thanks Igor, pushed. -- Igor > On Jan 30, 2020, at 6:44 AM, Igor Veresov wrote: > > Looks good. > > igor > > > >> On Jan 29, 2020, at 8:03 PM, Igor Ignatyev > wrote: >> >> as 8238049 got withdrawn, this is to be solved differently. http://cr.openjdk.java.net/~iignatyev//8237953/webrev.01 > adds @requires to exclude the test from execution when JVMCI compiler is enabled. >> >> -- Igor >> >>> On Jan 29, 2020, at 11:23 AM, Vladimir Kozlov > wrote: >>> >>> Good. 
>>> >>> Vladimir K >>> >>> On 1/29/20 11:20 AM, Igor Ignatyev wrote: >>>> http://cr.openjdk.java.net/~iignatyev//8237953/webrev.00 >>>>> 2 lines changed: 2 ins; 0 del; 0 mod; >>>> Hi all, >>>> could you please review the small fix for vmTestbase/jit/tiered/Test? "TieredCompilation is disabled" case fails when the test is run w/ a JVMCI compiler b/c TieredCompilation is reenabled. to make test more robust, the patch explicitly sets CompilationMode to "high-only" in -TieredCompilation case and to "default" in +TieredCompilation. the patch depends on 8238049[1-3]. >>>> webrev: http://cr.openjdk.java.net/~iignatyev//8237953/webrev.00 >>>> JBS: https://bugs.openjdk.java.net/browse/JDK-8237953 >>>> testing: vmTestbase/jit/tiered/Test on {linux, windows, macosx}-x64 w/ and w/o JVMCI compiler >>>> [1] https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2020-January/036850.html >>>> [2] https://bugs.openjdk.java.net/browse/JDK-8238049 >>>> [3] http://cr.openjdk.java.net/~iignatyev/8238049/webrev.00/ >>>> Thanks, >>>> -- Igor >> > From david.leopoldseder at oracle.com Fri Jan 31 08:49:09 2020 From: david.leopoldseder at oracle.com (David Leopoldseder) Date: Fri, 31 Jan 2020 09:49:09 +0100 Subject: RFR: 8238190 : [JVMCI] Fix single implementor speculation for diamond shapes. Message-ID: Hi, Please review this tiny change (+ a new test case exercising the pattern) to fix the single implementor logic in instanceKlass for diamond shaped class hierarchies. This bug was revealed in Graal where we use the single implementor logic for invokeinterface devirtualization. https://bugs.openjdk.java.net/browse/JDK-8238190 http://cr.openjdk.java.net/~davleopo/JDK-8238190/webrev.001/ Thank you. 
- david From nils.eliasson at oracle.com Fri Jan 31 14:30:25 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Fri, 31 Jan 2020 15:30:25 +0100 Subject: [15] RFR (M): 8237581: Improve allocation expansion In-Reply-To: <720ac716-80d6-fb4f-a95e-8dd8a4aae7e6@oracle.com> References: <2d79a359-6b0b-c576-518c-388a8ffc4890@oracle.com> <921f00aa-ecbc-f838-5a4b-6acb6b62215d@oracle.com> <31615df3-3f1d-9a5e-3308-37e652590b80@oracle.com> <720ac716-80d6-fb4f-a95e-8dd8a4aae7e6@oracle.com> Message-ID: <565e2cfb-e027-39f9-0553-6e19135d4fb4@oracle.com> Thanks Vladimir! I added your modifications and created a new webrev: http://cr.openjdk.java.net/~neliasso/8237581/webrev.04/ Best regards, Nils On 2020-01-30 16:55, Vladimir Ivanov wrote: > >> Good, but not complete. The "if (!always_slow) {" line also needs to >> check if 'initial_slow_test == NULL' (Meaning: We need no initial >> check, because we know the allocation size is positive and will fit >> in the TLAB, so we can go directly to the fast path), which is >> computed further down, so I moved the block down. > > Looks good. > >> Also, with the risk of complicating things - I changed '!always_slow' >> to 'expand_fast_path' because that conveys the meaning better. > > Not sure it improves readability, but I'm fine with it. > >> http://cr.openjdk.java.net/~neliasso/8237581/webrev.change.2/ > > Looks good. > > PS: made slight modifications to the comment:
>
>       // Remove allocation node and return.
>       // Size is a non-negative constant -> no initial check needed -> directly to fast path.
>       // Also, no usages -> empty fast path -> no fall out to slow path -> nothing left.
>
> Best regards, > Vladimir Ivanov > >>>> On 2020-01-24 09:36, Nils Eliasson wrote: >>>>> Hi, >>>>> >>>>> This patch improves expand_allocate_common when there is no use of >>>>> the allocation. >>>>> >>>>> Three cases are improved: >>>>> >>>>> - With unknown allocation length - don't expand the fast path. 
No >>>>> allocation is needed when it isn't used. >>>>> NegativeArraySizeException will still be caught by the slowpath. >>>>> - With known length inside the legal range - No fast path or slow >>>>> path is needed. The allocation node is removed. >>>>> - With known length outside the legal range - only a slow path is >>>>> needed. >>>>> >>>>> I also refactored the code a bit, keeping the general structure >>>>> for easy diff, but extracting some routines to make it more >>>>> readable. >>>>> >>>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8237581 >>>>> Webrev: http://cr.openjdk.java.net/~neliasso/8237581/webrev.03/ >>>>> >>>>> Please review! >>>>> >>>>> Best regards, >>>>> Nils Eliasson >>>>> >>>>> >>>> >> From nils.eliasson at oracle.com Fri Jan 31 14:34:15 2020 From: nils.eliasson at oracle.com (Nils Eliasson) Date: Fri, 31 Jan 2020 15:34:15 +0100 Subject: [15] RFR (M): 8237581: Improve allocation expansion In-Reply-To: <4d06f2af-ab40-8158-d984-ae87e4233c02@oracle.com> References: <2d79a359-6b0b-c576-518c-388a8ffc4890@oracle.com> <921f00aa-ecbc-f838-5a4b-6acb6b62215d@oracle.com> <76b9ffce-8167-6337-cefd-24c18813ce18@oracle.com> <4d06f2af-ab40-8158-d984-ae87e4233c02@oracle.com> Message-ID: <004bbfdb-5fd7-9f8d-6662-3bb33b88c8f2@oracle.com> Thanks Claes! I removed the redundant {} from the micro. Best regards, Nils Eliasson On 2020-01-28 16:17, Claes Redestad wrote: > > > On 2020-01-28 12:15, Nils Eliasson wrote: >> Hi, >> >> I added a simple micro that triggers the optimization. >> >> Micro: >> http://cr.openjdk.java.net/~neliasso/8237581/webrev.micro/ >> >> On my workstation I get these numbers: >> >> JDK/JDK:
>>
>> Benchmark                                  (size)  Mode  Cnt   Score   Error  Units
>> ArrayAllocation.eliminateArrayConstLength     128  avgt       61.178          ns/op
>> ArrayAllocation.eliminateArrayVarLength       128  avgt       55.109          ns/op
>>
>> JDK/JDK + patch for 8237581:
>>
>> Benchmark                                  (size)  Mode  Cnt   Score   Error  Units
>> ArrayAllocation.eliminateArrayConstLength     128  avgt        0.841          ns/op
>> ArrayAllocation.eliminateArrayVarLength       128  avgt        1.897          ns/op
>>
>> Speedup 30-70x

> Nice! > > Micro looks good and to the point. I think the {}'s around "128" is > redundant for a single parameter value. > > Can't really review the compiler code changes but it all looks > reasonable to me. > > /Claes From richard.reingruber at sap.com Fri Jan 31 17:24:50 2020 From: richard.reingruber at sap.com (Reingruber, Richard) Date: Fri, 31 Jan 2020 17:24:50 +0000 Subject: VM_EnterInterpOnlyMode: why make compiled methods on stack not_entrant? In-Reply-To: <7dd97d12-dc9f-99cc-384a-4d1ccfe516dd@oracle.com> References: <7dd97d12-dc9f-99cc-384a-4d1ccfe516dd@oracle.com> Message-ID: Hi Vladimir, thanks for looking at this and providing feedback. I thought as well about using a handshake there. I'll try it. Thanks, Richard. -----Original Message----- From: Vladimir Ivanov Sent: Donnerstag, 30. Januar 2020 17:55 To: Reingruber, Richard ; serviceability-dev at openjdk.java.net; hotspot-compiler-dev at openjdk.java.net Subject: Re: VM_EnterInterpOnlyMode: why make compiled methods on stack not_entrant? Hi Richard, > I noticed that VM_EnterInterpOnlyMode makes all compiled methods on stack not_entrant. To me this > seems unnecessary. I think it would be sufficient to patch the return pc of compiled frames and > let them return to their deopt handler. Or what would be the reason to make the compiled methods > not_entrant? > > VM_EnterInterpOnlyMode::doit(): all compiled methods on stack that are not native methods get > marked. > > Deoptimization::deoptimize_all_marked(): all marked methods are made not_entrant. > > Maybe this is for historical reasons? If I remember correctly, many years ago, deoptimizing a > compiled frame used to require making the corresponding nmethod not_entrant. 
And making a nmethod > not_entrant meant overriding its code with a slide of nops that led to the deopt handler. > > I'd like to change this and just apply Deoptimization::deoptimize() on every compiled frame on > stack. I've done this locally and tested the change without issues. I'm not a JVMTI expert, but considering VM_EnterInterpOnlyMode is performed on per-thread basis [1], it looks like per-frame deoptimization (instead of per-nmethod) should be enough to do the job. So, from JIT-compiler perspective, your suggestion to use Deoptimization::deoptimize() instead of CompiledMethod::mark_for_deoptimization()/Deoptimization::deoptimize_all_marked() sounds good. PS: also, it looks like handshakes become a perfect fit for VM_EnterInterpOnlyMode: once you don't need to deoptimize on per-nmethod basis, there's no need to trigger global safepoint. Best regards, Vladimir Ivanov [1] http://hg.openjdk.java.net/jdk/jdk/file/tip/src/hotspot/share/prims/jvmtiEventController.cpp#l227 // If running in fullspeed mode, single stepping is implemented // as follows: first, the interpreter does not dispatch to // compiled code for threads that have single stepping enabled; // second, we deoptimize all methods on the thread's stack when // interpreted-only mode is enabled the first time for a given // thread (nothing to do if no Java frames yet). From vladimir.kozlov at oracle.com Fri Jan 31 17:47:57 2020 From: vladimir.kozlov at oracle.com (Vladimir Kozlov) Date: Fri, 31 Jan 2020 09:47:57 -0800 Subject: RFR: 8238190 : [JVMCI] Fix single implementor speculation for diamond shapes. In-Reply-To: References: Message-ID: CCing to Runtime group since it is runtime code. Coleen already looked on this fix but we need official review here. David, it seems you did not pull latest fixes from JDK. Most of failures in your testing should be fixed already. There are testing infra "red" failures which are nothing to do with changes. 
I would suggest to pull latest changes before push (after reviews) and run tier1-2 again. In short, I don't see new failures in tier1-2 testing and tier3-graal. Fix looks good to me. Thanks, Vladimir On 1/31/20 12:49 AM, David Leopoldseder wrote: > Hi, > > Please review this tiny change (+ a new test case exercising the pattern) to > fix the single implementor logic in instanceKlass for diamond shaped class hierarchies. > > This bug was revealed in Graal where we use the single implementor logic for invokeinterface > devirtualization. > > https://bugs.openjdk.java.net/browse/JDK-8238190 > http://cr.openjdk.java.net/~davleopo/JDK-8238190/webrev.001/ > > Thank you. > > - david >
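(The "diamond shape" at issue in the 8238190 thread above can be pictured with a minimal, self-contained Java example. This sketch is editorially added; the class and interface names are invented for illustration and are not taken from the actual test case in the webrev.)

```java
// Hypothetical illustration (invented names): DiamondSketch is the single
// implementor of Top, but it reaches Top along two interface paths
// (Left and Right). This is the "diamond" hierarchy that confused the
// single-implementor bookkeeping used for invokeinterface devirtualization.
interface Top { int m(); }
interface Left extends Top { }
interface Right extends Top { }

public class DiamondSketch implements Left, Right {
    @Override
    public int m() { return 42; }

    public static void main(String[] args) {
        Top t = new DiamondSketch();
        // A call through Top can only be devirtualized if the VM correctly
        // records DiamondSketch as Top's sole implementor, even though the
        // implements relation reaches Top twice.
        System.out.println(t.m());  // prints 42
    }
}
```

The point of the diamond is that the same implementor is discovered along both paths; the single-implementor count must not treat the two discoveries as two distinct implementors.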