RFR - 8132734: java.util.jar.* changes to support multi-release jar files

Steve Drach steve.drach at oracle.com
Tue Feb 2 17:32:18 UTC 2016


I think the following is a reasonable solution.

If the JarFile is constructed with any of the 5 constructors that do not contain a Release argument, then entries()/stream() returns the set of all entries in the jar file including those under the META-INF/versions/directory.  The base entries are “raw”, they are not aliases for versioned entries.

If the JarFile is constructed with the 1 constructor that does include a Release argument, then entries()/stream() returns the set of appropriately versioned entries that would result from invoking getEntry()/getJarEntry() with the name of each base entry.  The entries in the tree rooted at the directory META-INF/versions/ are not returned.

> On Feb 1, 2016, at 12:29 PM, Steve Drach <steve.drach at oracle.com> wrote:
> 
> I’m sorry, I didn’t look at the code close enough before I started talking ;-)  Right now entries()/stream() returns all entries and if the JarFile is constructed with a Release object != Release.BASE, the base entries that are returned are the versioned entries.  I think this behavior is a bit confusing and we should just return all entries without regard to versioning.  Then create the two new methods for specific versioned entries.
> 
>> On Feb 1, 2016, at 12:18 PM, Steve Drach <steve.drach at oracle.com> wrote:
>> 
>>>>>> Alan’s point is that traversing using entries()/stream() always returns the versioned entries (if any) rather than all entries, thus in a sense filters.
>>>>>> 
>>>>>> My assumption was the traversal should by default be consistent with a calls to getEntry, thus:
>>>>>> 
>>>>>> jarFile.stream().forEach(e ->  {
>>>>>>   JarEntry je = jarFile.getJarEntry(e.getName());
>>>>>>   assert e.equals(je);
>>>>>> });
>>>>>> 
>>>>>> There might need to be another stream method that returns all entries.
>>>>>> 
>>>>> Right, I'm mostly just wondering if entries()/streams() should override the entries in the stream with versioned entries and filter out the META-INF/versions/ tree.
>>>> I don’t think so.  That kind of behavior might be difficult to understand.  Returning all the entries provides some flexibility.  One can write code like this:
>>>> 
>>>> jarfile.stream().map(JarEntry::getName).filter(s ->  !s.startsWith(“META-INF”)).map(JarFile::getJarEntry).etc
>>>> 
>>>> to get the versioned results for any version you specify when constructing the JarFile.
>>> 
>>> The current specification treats those class files under meta-inf/releases like
>>> kind of "metadata" of those base entries. Ideally those files should not even
>>> be individual "files", but part of their corresponding entries. The consumer of
>>> the MR-Jar should not need to be aware of these version-ed entries at all to use
>>> this MR-jar file to load classes/resources. From this point of view, these entries
>>> probably should be "invisible" from entries()/stream(), when the jar file is opened
>>> with "version-enabled". And all returned entries should have all their "data"
>>> (size, csize, timestamps, crc ...) pointed to the corresponding version-ed entries,
>>> withe the only exception is the "name".
>>> 
>>> On the other hand it might be desired to keep JarFile.entries()/stream() asis to
>>> match their "zip file" perspective, to return "all" raw entries. Then it might also
>>> be desired to have an alternative "versioned streamVersion()" …
>> 
>> It seems to that we have two reasonable alternatives: (1) return all entries, and (2) return all entries except those under the “META-INF/versions/“ directory and for any entries returned, return their versioned equivalent if it exists.  If we choose alternative 2, we can still get alternative 1 by asking for JarFile.super.entries() and JarFile.super.stream().
>> 
>> Or we can do it both ways, leaving entries()/stream() as is and adding two new methods, versionedEntries() and versionedStream().
>> 
>>> 
>>> something like
>>> 
>>>  public Stream<JarEntry> stream(Release r); ?
>> 
>> We should not parametrize the methods with a Release, because what does it mean if we construct the JarFile with one Release but specify a different Release for the stream argument.  Parameterizing methods with a Release object feels like we’re starting to slide down a slippery slope.
>> 
>> I think adding the two new methods is the “right” solution, but I’d like some consensus here.
>> 
>>> 
>>> -sherman
>>> 
>>> 
>>> 
>>> 
>>>>> If I've gone to trouble of specifying the a Release then it seems the right thing to do. On the other hand, it comes at a cost and there will be use-cases like "get the names of all entries" that would be more efficient to just build on the current entries()/stream(). I'm loath to suggest this might need a new method but it might be one of the options to consider here. Minimally there is a javadoc to specify on how these methods behave when the JAR is multi-release and opened by specifying a release.
>>>> How’s this?
>>>> 
>>>> diff -r 68867430065b src/java.base/share/classes/java/util/jar/JarFile.java
>>>> --- a/src/java.base/share/classes/java/util/jar/JarFile.java	Fri Jan 29 12:34:44 2016 -0800
>>>> +++ b/src/java.base/share/classes/java/util/jar/JarFile.java	Mon Feb 01 09:48:05 2016 -0800
>>>> @@ -576,9 +576,11 @@
>>>>    }
>>>> 
>>>>    /**
>>>> -     * Returns an enumeration of the jar file entries.
>>>> +     * Returns an enumeration of all the jar file entries.  Constructing this
>>>> +     * JarFile with the {@link JarFile#JarFile(File, boolean, int, Release)}
>>>> +     * constructor does not modify the behavior of this method.
>>>>     *
>>>> -     * @return an enumeration of the jar file entries
>>>> +     * @return an enumeration of the all jar file entries
>>>>     * @throws IllegalStateException
>>>>     *         may be thrown if the jar file has been closed
>>>>     */
>>>> @@ -587,11 +589,13 @@
>>>>    }
>>>> 
>>>>    /**
>>>> -     * Returns an ordered {@code Stream} over the jar file entries.
>>>> +     * Returns an ordered {@code Stream} over all the jar file entries.
>>>>     * Entries appear in the {@code Stream} in the order they appear in
>>>> -     * the central directory of the jar file.
>>>> +     * the central directory of the jar file.  Constructing this
>>>> +     * JarFile with the {@link JarFile#JarFile(File, boolean, int, Release)}
>>>> +     * constructor does not modify the behavior of this method.
>>>>     *
>>>> -     * @return an ordered {@code Stream} of entries in this jar file
>>>> +     * @return an ordered {@code Stream} of all entries in this jar file
>>>>     * @throws IllegalStateException if the jar file has been closed
>>>>     * @since 1.8
>>>>     */
> 




More information about the core-libs-dev mailing list