Miscellaneous improvements to "jar".

Ulf Zibis Ulf.Zibis at gmx.de
Fri Jun 26 16:31:33 UTC 2009


Martin, thanks for taking the time.

Am 26.06.2009 15:53, Martin Buchholz schrieb:
>
>
> On Fri, Jun 26, 2009 at 01:37, Ulf Zibis <Ulf.Zibis at gmx.de 
> <mailto:Ulf.Zibis at gmx.de>> wrote:
>
>     1. Hopefully some volunteer would be found to fix
>     http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6818737
>     before JDK7 API-freeze.
>     Especially, if jar is not compressed, as in case of normal JDK
>     installation, reading entries from jar should be much faster
>     through java.nio.channels, than via BuffererdInputStream.
>
>
> The way to motivate us around here
> is to provide the prototype implementation that
> demonstrates the speedup.

Sorry, I'm not the specialist on how to provide NIO buffers from native 
memory, and first, I will finish my work on charsets.

Motivation:
Xueming states:
*"dat" based uses less disk space, but it has larger startup time, 
reading an additional "big" dat file during class loading/initializing 
actually takes much longer time than I expected (I hit the extreme when 
I worked on the EUC_TW, which I make the size only 30% of the existing 
one, but startup is a disaster regression, ...
*
If loading x bytes from dat file via getResourceAsStream() takes much 
longer time than loading x+30% bytes from class file, processing the 
UTF-8 conversion, instantiating and initializing additional Class 
objects, I imperatively presume, that there must be a big chance for 
significantly improving read speed from uncompressed jar file (here 
charsets.jar), by using direct channels or how ever. I presume, 
enhancing reading from jar files would be a big fish in performance gain 
for the whole JDK, as it is very common task in JVM's daily work.


>  
>
>
>
>
>         While benchmarking, I discovered to my horror that the simple
>
>         jar cf /tmp/t1 ...
>         jar i /tmp/t1
>
>         fails, because it tries to create the replacement jar in "."
>         and then
>         rename() it into place.  Oops... Better refactor out the code that
>         puts the replacement temp file in the same directory.
>         Better write some tests for that, too.
>
>
>     2. I don't like to refactor out the code in case of only once
>     used, and only to better "comment" what the 2 lines are doing.
>     It blows up the code, and following the code demands annoying
>     scrolling.
>     Better add additional comment to original code.
>
>
> The original code created temp files in *two* places,
> and did it differently.

Oops, at my first search on your code I only found *one* usage of 
createTempFileInSameDirectoryAs(). Did you add the 2nd later?
But there is only one usage of directoryOf(). Shouldn't you inline this?

> I think the name
> createTempFileInSameDirectoryAs
> makes the current code much clearer.

Yes, this is pretty clear, but the cost is 19 lines against 2+2 plus 
demanding the reader for annoying scrolling.
Thinking about directoryOf() I guess, following this strategy you would 
find ten's of locations in Main.java where you could refactor out code 
into small well self-explaining methods, but wouldn't this end up in a 
mess of unreadable blown-up code?

>
> Also, JITs tend to be very good at inlining.

(... after some loops), yes, I know

>  
>
>
>     3. What happens, if original file is exactly named "jartmp"
>     I think you would better add ".tmp" at the end of the filename,
>     and remove it later.
>     Does your new code work with? :
>     jar cf /jartmp/t1 ...
>     jar i /jartmp/t1
>
>
> File.createTempFile doesn't literally create a file named jartmp.
> That's only the prefix.  And it promises to return
> a freshly created empty file.

Now I understand deeper. I just wondered why in fact just renaming "tmp" 
to "jartmp" would resolve your bug. I didn't recognize the 2nd location, 
where wrong "." was used for dir name.

-Ulf

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.openjdk.java.net/pipermail/core-libs-dev/attachments/20090626/83baa1b9/attachment.html>


More information about the core-libs-dev mailing list