Update on JEP-286 (Local Variable Type Inference)
Brian Goetz
brian.goetz at oracle.com
Tue Dec 6 19:37:04 UTC 2016
[ Please do not respond directly; see below for instructions on
providing feedback. ]
Over 2500 people participated in two rounds of surveys on Local Variable
Type Inference (http://openjdk.java.net/jeps/286).
The results of the first round can be found at:
https://www.surveymonkey.com/results/SM-FLWGS5PW/
The results of the followup survey are here:
https://www.surveymonkey.com/results/SM-Q63DFRZW/
Responses to many of the issues raised in the survey are here:
http://mail.openjdk.java.net/pipermail/platform-jep-discuss/2016-March/000039.html
http://mail.openjdk.java.net/pipermail/platform-jep-discuss/2016-March/000063.html
Some analysis on readability, and goals, can be found here:
http://mail.openjdk.java.net/pipermail/platform-jep-discuss/2016-March/000064.html
Some of these have been collected into a FAQ, here:
http://cr.openjdk.java.net/~briangoetz/jep-286/lvti-faq.html
We'll be pushing an implementation soon that people can try out.
SUMMARY OF THE SURVEY RESULTS
-----------------------------
Overall, the responses were strongly in favor of the feature; 74% were
strongly in favor, with another 12% mildly in favor. 10% thought it was
a bad idea.
The written comments had more of a negative bias, but this shouldn't be
surprising; people generally have more to say in disagreement than in
agreement. The positive comments were very positive; the negative
comments were very negative. So no matter what we do here, some people
are going to be very disappointed.
When given a choice, the most popular syntax choice was "val and var",
with "var only" as the second choice (the others were all obvious
losers). But when asked about how people felt about these choices, we
saw a divergence; more people liked var/val, but more people hated it too.
While there were some passionate arguments against, the numbers speak
loudly: this is a feature that most developers want. (It is the most
frequently repeated request of developers coming to Java from other
languages.)
READABILITY
-----------
The biggest category of negative comments regarded worries about
readability (though most of these came from folks who have never used
the feature; those who have used it in other languages were
overwhelmingly positive). We have always held out "Reading code is more
important than writing code" as a core value of the language; plenty of
folks took the opportunity to throw this seeming inconsistency in our
face, on the assumption that this feature would inevitably lead to less
readable code.
For example, code like the following strawman was offered by several
commenters:
var x = y.getFoo()
as evidence of "see, its unreadable, you have no idea what 'x' is." The
readability problem here, though, stems from the fact that 'x' is just a
poorly chosen variable name. Having a manifest type might make up for
the programmer's laziness, but it would be better to just choose a good
variable name in the first place.
While every situation may be different, we believe that, with judicious
use of this feature in well-written code, readability is actually
*enhanced*. Consider a block of locals:
UserModelHandle userDB = broker.findUserDB();
List<User> users = db.getUsers();
Map<User, Address> addressesByUser = db.getAddresses();
What is the most important thing on each line, the variable *name*, the
variable *type*, or the initializing expression? We claim it is the
name that is most important -- because they describe the role of the
variable *in the current program*. And the variables names are not so
easy to visually pick out from the above code -- they're stuck in the
middle of each line, and at a different place on each line.
Ideally, we'd like for the most important thing to be front-and-center
in the reader's view. If we rewrite the above block with inferred types:
var userDB = broker.findUserDB();
var users = db.getUsers();
var addressesByUser = db.getAddresses();
the true intent of this code pops out much more readily; the variable
names are (almost) front and center. The lack of manifest types is not
an impediment, because we've chosen good variable names.
Another aspect in which this feature could improve readability is that
users frequently construct complex nested and chained expressions, not
because this is the most readable way to write the code, but because the
overhead of declaring additional temporary variable seems burdensome.
(We all do this.) By reducing this overhead, implementation patterns
will likely reequilibrate to a less artificially-condensed form,
enhancing readability.
MUTABILITY
----------
Many of the comments were not so much about the use of type inference,
but about mutability -- a lot of people like the idea of reducing the
ceremony associated with finality. (And we like this idea too.)
Initially, we considered that perhaps inference would only apply to
effectively final locals. But after working with a prototype, we saw
that this had an undesirable side-effect; mutable locals stick out badly:
var immutableLocal = ...
var anotherImmutableLocal = ...
var alsoImmutable = ...
LocalDateTime latestDateSeenSoFar = ...
var thisOneDoesntChangeEither = ...
Fans of immutability might enjoy that mutability is punished in this
way, but the irregularity was definitely annoying to those who tried
this version.
Both of the paired versions (var/val, var/let) suffered from a similar
problem. Some readers found that var/val were so similar that they
could mostly ignore the difference:
var immutableLocal = ...
var anotherImmutableLocal = ...
var alsoImmutable = ...
val latestDateSeenSoFar = ...
var thisOneDoesntChangeEither = ...
whereas others found the small difference distracted them from the more
important parts of the code. Similarly, some found the var/let
distinction distracting, whereas others liked it:
var immutableLocal = ...
var anotherImmutableLocal = ...
var alsoImmutable = ...
let latestDateSeenSoFar = ...
var thisOneDoesntChangeEither = ...
Immutability is important, but in reality, local variables are the least
important place where we need more help making things immutable. Local
variables are immune to data races; most locals are effectively final
anyway. Where we need more help in encouraging immutability is *fields*
-- but applying type inference there would be foolish.
Further, the var/val distinction makes far more sense in other languages
than in Java. In Scala, for example, all variables -- locals and fields
alike -- are declared with "val name : type", where you can omit the ":
type" if you want inference. So not only is mutability orthogonal to
inference, but the power of the var/val distinction is greater because
it is used in multiple contexts -- but we'd only use it for local
variables, and only when their types are inferred. (While the "name :
type" syntax may be superior for this reason (and also because it moves
the more important component forward), this ship sailed twenty years
ago.) So var/val is a feature that *looks* like it carries over
cleanly, but in reality suffers greatly in translation.
SYNTAX
------
I know this is the part people really care about :) After considering
the pros and cons at length, there appears to be an obvious winner --
var-only. Reasons for this include:
- While it was not the most popular choice in the survey, it was
clearly the choice that the most people were OK with. Many hated
var/val; others hated var/let. Almost no one hated var-only.
- Experience with C# -- which has var only -- has shown that this is a
reasonable solution in Java-like languages. There is no groundswell of
demand for "val" in C#.
- The desire to reduce the ceremony of immutability is certainly
well-taken, but in this case is pushing on the wrong end of the lever.
Where we need help for immutability is with fields, not with locals. But
var/val doesn't apply to fields, and it almost certainly never will.
- If the incremental overhead of getting mutability control over that
of type inference were zero, there might be a stronger case, but it was
clear that many people found two different leading keywords to be a
distraction that kept their eyes from quickly settling on the important
stuff. If variable names are more important than types, they're more
important than mutability modifiers too.
PROVIDING FEEDBACK
------------------
To keep the signal-to-noise ratio to an acceptable level, we're asking
people to provide (constructive) feedback through the following survey,
whose results will be made public.
https://www.surveymonkey.com/r/FBT9WYP
Thanks for all your feedback so far!
More information about the platform-jep-discuss
mailing list