switch: using an expicit type as total is dangerous

Sun Aug 30 15:57:15 UTC 2020

One of the things that I think is being missed here is that switch is turning into a much more powerful construct, and as a result, our intuition for what it means needs to change with it.  All of the sky-is-falling hyperbole about action-at-a-distance comes, in no small part, from having not fully upgraded our mental models to understand what switch means.  Of course, we want to strike a balance between preserving suitable existing intuitions and extending he power of the construct, but let’s not fool ourselves that there isn’t something new to learn here.  But given tradeoffs where we damage the expressiveness or consistency or clarity of the new construct to prop up old, obsolete intuitions, there’s really no decision here.  

Legacy switches have only constants as their cases.  This means that at most one case could match, and in turn this means that order is independent.  They implement the equivalent of a dynamically-computed (possibly partial) map.  

New switches are like constrained if-else chains.  The constraints are useful to the compiler because they are more easily scrutable; compilers can optimize away redundant tests, and turn some O(n) switches into O(log n) or O(1).  And they are useful to the user because they are asking a simpler question which can have a less error-prone expression.  But at root, some things have changed, such as non-overlap of cases, and they can have far more interesting structure.  And these structures are not single-level, they are recursive (we will see chains of D(P) … D(Q) just as we will see chains of P .. Q), and our model should embrace that natural structure.  

While it may feel unfamiliar today, we should expect to routinely recognize the shape of: 

     case Foo(Bar x): 
     case Foo(Baz x):
     case Foo f:  // or case Foo(Object o): 
     case Object o: // or default

where the cases form a tree, because these shapes will be ubiquitous.  And developers will develop intuitions about these kinds of shapes too, but right now, they look a little more foreign, and so we’re engaging in “bargaining” to try to make their behaviors less strange. 

So, rather than trying to constrain the model to be more consistent with what we’re familiar with, we should be trying to get familiar with the shapes that will naturally arise (which is not hard, because we can look in code from other languages that have similar features) before we start to make claims about what is “natural” or “confusing.”  Otherwise, we’re just starting the race with bricks tied to our feet.  

> On Aug 30, 2020, at 11:07 AM, Brian Goetz <brian.goetz at oracle.com> wrote:
> 
> Sorry, but I don’t find this example that  compelling either.  And  I certainly don’t think it comes remotely close to “bad enough that we have to throw the design in the trash.”  
> 
> As I said from the beginning, yes, you can construct puzzlers.  But the existence of a puzzler is not necessarily evidence that the language design is broken, and you are lobbying (implicitly) for an extreme solution: to take the entire design that we’ve worked on for years and toss it in the trash, and replace it with something that is not even yet a proposal, I think the bar is much higher than this.  (I don’t think that’s an exaggeration; totality is what makes the entire design stick together without being an ad-hoc bag of “but on tuesday, do it differently”.)  
> 
> This example feels to me in the same category as combining var with diamond.  There’s nothing wrong with that, but by leaving so many things implicit in your program, you may get an inference that is not what you expected.  That doesn’t mean that var is bad, or diamond is bad, or even that we should outlaw their interaction (which some suggested at the time.)  It just means that when you combine features, especially features that involve implicitness, your program becomes more brittle.  
> 
> This example combines a lot of implicitness to get the same kind of brittleness, including switching on a complex expression whose type isn’t obvious, and, more importantly, making an incompatible change to a method.  I don’t think you get to lay the blame on the language inferring totality here, unless you’re advocating that we should never infer anything!  Making a change like this could easily change inferred types, which could silently affect overload decisions, and, when we get to Valhalla, even runtime layouts.  That’s just part of the trade we make when we allow users to leave something unspecified, whether it be a manifest type (var, diamond, generic method witnesses), finality (lambda capture), totality (as here), accessibility (such as when migrating a class to an interface), etc.  
> 
> So, it’s a good example, to call our attention to the consequences of leaving totality implicit.  (We’re having a separate discussion about whether to let the  user opt into making totality explicit, and that’s another tool that could be used to make this example less brittle, just as manifest types would make it less brittle than switching on an expression.)  
> 
> Really, though, I think you’re attacking totality because of .01% imperfections without really appreciating how much worse the alternatives are, and how much more often their pain points would come up.  (Refactoring switches to instanceof should be expected to happen 1000x more often than making an incompatible change to a method signature and hoping nothing changes.)  It’s good to identify the warts, but I’d prefer a little less jumping from “wart, ergo mistake” — it took us three years to converge on this answer precisely because there are no perfect answers.
> 
> 
>> On Aug 30, 2020, at 7:37 AM, forax at univ-mlv.fr wrote:
>> 
>> ----- Mail original -----
>>> De: "Brian Goetz" <brian.goetz at oracle.com>
>>> À: "Remi Forax" <forax at univ-mlv.fr>, "amber-spec-experts" <amber-spec-experts at openjdk.java.net>
>>> Envoyé: Lundi 24 Août 2020 20:57:03
>>> Objet: Re: switch: using an expicit type as total is dangerous
>> 
>>>> 2/ using an explicit type for a total type is a footgun because the semantics
>>>> will change if the hierarchy or the return type of a method switched upon
>>>> change.
>>> 
>>> Sorry, I think this argument is a pure red herring.   I get why this is
>>> one of those "scary the first time you see it" issues, but I think the
>>> fear has been overblown to near-panic proportions.  We've spent a lot of
>>> time talking about it and, the more we talk, the less worried I am.
>> 
>> good for you,
>> the more i talk about it, the more i'm worried because you don't seem to understand that having the semantics that change underneath you is bad.
>> 
>>> 
>>> The conditions that have to combine for this to happen are already
>>> individually rare:
>>>    - a hierarchy change, combined with
>>>    - enough use-site type inference that is not obvious what the type
>>> dependencies are, combined with
>>>    - null actually being a member of the domain, combined with
>>>    - users not realizing null is a member of the domain.
>> 
>> 
>> nope, you don't need a hierarchy change, changing the return type (as noticed by Tagir) and null being part of the domain is enough. 
>> 
>>> 
>>> Then, for it to actually be a problem, not only do all of the above have
>>> to happen, but an unhandled null has to actually show up.
>>> 
>>> Even then, the severity of this case is low -- most likely, the NPE gets
>>> moved from one place to another.
>> 
>> nope see below
>> 
>>> 
>>> Even then, the remediation cost is trivial.
>> 
>> for having remediation, as a user you have to first see the change of semantics, but you don't.
>> 
>> 
>> Ok, let's take an example, i've written a method getLiteral()
>> Number getLiteral(String token) {
>>   if (token.equals("null")) {
>>     return null; // null is part of the domain
>>   }
>>   try {
>>     return Integer.parseInt(token);
>>   } catch(NumberFormatException e) {
>>     return Double.parseDouble(token);
>>   }
>> }
>> 
>> and a statement switch in another package/module
>> switch(getLiteral(token)) {
>>   case Integer -> System.out.println("Integer"); 
>>   case Double -> System.out.println("Double"); 
>>   case Number -> System.out.println("null");
>> }
>> 
>> but now i change getLiteral() to add string literal
>> Object getLiteral(String token) {
>>   if (token.equals("null")) {
>>     return null; // null is part of the domain
>>   }
>>   if (token.startsWith("\"") {
>>     return token.substring(1, token.length() - 1);
>>   }
>>   try {
>>     return Integer.parseInt(token);
>>   } catch(NumberFormatException e) {
>>     return Double.parseDouble(token);
>>   }
>> }
>> 
>> If i only recompile getLiteral(), and run the code containing the switch, i get a ICCE at runtime because the signature of getLiteral() has changed, which is good,
>> but if i now recompile the switch, the code compiles without any error but with a different semantics, duh ?
>> 
>> Using "case var _" as the last case at least keep the same semantics, using "default Number" does not compile.
>> 
>> [...]
>> 
>> Rémi
>