[rfc][icedtea-web] ResourceTracker Thread Pool Enhancement

Andrew Azores aazores at redhat.com
Mon Oct 6 18:08:26 UTC 2014


On 10/01/2014 12:26 PM, Jie Kang wrote:
> Hello,
>
> This patch replaces the manual thread management system in Resource Tracker with a fixed thread pool using Java's Executor Service class.
>
> The functionality remains the same as before. In terms of testing, using the Performance Test [1], running the reproducers and performing manual testing has shown no regressions.
>
> Thoughts?
>
>
> [1] : http://mail.openjdk.java.net/pipermail/distro-pkg-dev/2014-August/028980.html
>
>
> Regards,
>

In addition to everything Jacob has already said, I'd also like to 
comment on the pool having always exactly 5 threads. I agree with his 
assessment that it would be nice to have some statistical data and some 
heuristic to use to determine a number of threads to use, but I think 
this is pretty infeasible. Really, the optimization problem here IMO is 
to try to minimize the number of threads used while maximizing network 
bandwidth utilization; in other words, there's no point downloading 5 at 
once if the first 3 would have already saturated the connection, but 
there's also no point downloading 5 if their combined utilization is 
only 50% of the available bandwidth. But really, it's worse to be using 
only 50% of the available bandwidth than to be downloading more 
resources than is required to saturate bandwidth, since the cost of 
extra threads is really not very high, but the cost to the user of 
downloading at less than full speed is (potentially a lot of) wasted 
time for no good reason. So, I think it's better to overshoot the 
optimal number of threads than to undershoot it. The cost of 
overshooting is just so minimal, which Jacob also says.

That said, it would be silly to pre-allocate some huge number of threads 
for this thread pool, say, 1000 threads. The chances of a user's applet 
having 1000 things for us to download is very small. If there are only 5 
resources then there are 995 threads for no reason whatsoever; worse, 
all 1000 threads will be sitting around in the JVM with nothing to do 
until the JVM is exited because the user is done with the applet. 
Although this probably isn't a massive performance issue, it's still 
wasteful.

So, with these thoughts in mind, I'd recommend you consider switching 
from newFixedThreadPool(5) to simply newCachedThreadPool [0]. It does 
mean that every resource will be downloaded at once, rather than queued 
up for processing, but AFAICT this will allow for a good amount of 
architectural simplification in ResourceTracker, and it also guarantees 
that if it's possible to saturate the download bandwidth, then we will, 
and so the task will be completed as quickly as possible (ignoring 
scheduler overhead etc). And, it also means that after the download 
tasks are completed, the additional threads will eventually be cleaned 
up automatically, unless additional tasks come in, in which case some of 
the threads will be reused. IMO this is pretty much exactly the best 
behaviour we can reasonably provide, unless we start gathering that 
statistical data and measuring maximum download throughput vs current 
resource download combined throughput, and all those extra details. To 
me all of that work probably isn't worth the effort - but it would 
definitely be very, very cool if it was done ;)

[0] 
http://docs.oracle.com/javase/7/docs/api/java/util/concurrent/Executors.html#newCachedThreadPool%28%29

Thanks,

-- 
Andrew Azores


More information about the distro-pkg-dev mailing list