Return types array/list/stream [was: API question for point lambdafication]
Stephen Colebourne
scolebourne at joda.org
Wed Feb 20 08:47:41 PST 2013
On 20 February 2013 16:35, Brian Goetz <brian.goetz at oracle.com> wrote:
>> My preference would be as folows:
>> - only use array as a return type to match other methods in the same type
>> - prefer List to Set or Collection
>> - prefer Stream to Iterator and Enumeration (and probably Iterable)
>> - prefer List to Stream for small/bounded data sets
>>
>> Thus, on the last point, I would argue that String.splitAsStream() is
>> wrong, and should be String.splitToList(). (As a side note, with my
>> general "arrays are bad" hat on, returning an array here was another
>> bad choice)
>>
>> Users wanting a stream can easily get it from the list. The reverse is
>> not so true, and almost certainly more wasteful.
>
> This ignores a significant consideration -- List requires buffering all the
> matches before any work can be done, whereas Stream is lazy. What if the
> List is potentially huge (or infinite)? What if you're only going to look
> at the first few? What if producing elements is expensive? What if you
> have an expensive operation to perform on the results, and want to do so in
> parallel? In all of these cases, preferring List over Stream gives up
> significant performance opportunities. And, constructing lists is likely to
> be more expensive than constructing streams.
>
> The argument that we should "prefer List to Stream because you can always
> get a stream from a list" is too simplistic.
And I didn't say that... I said "prefer List to Stream for
small/bounded data sets"
> Regex parsing is a good example of where you might want a stream rather than
> a list, as you might well not consume all the matches.
My regex usage tends to be on small (<128 character, or a typical
"line") strings where I want to extract out groups of text. I can
always cope with the entire list of results, and I often want to
access them in random order. For my use case, returning a list would
often have been more useful than an array.
I still say that the original question stands. And I ask it because I
think, like all new features, there is a danger of over-use. Laying
down some guidelines on usage is a Good Thing IMO. Stream really
shouldn't be the default return type - List is frequently much more
useful to clients.
Stephen
More information about the lambda-dev
mailing list