[rfc][icedtea-web] ResourceTracker Thread Pool Enhancement

Jacob Wisor gitne at gmx.de
Thu Oct 2 16:27:39 UTC 2014


On 10/01/2014 at 06:26 PM, Jie Kang wrote:
> Hello,
>
> This patch replaces the manual thread management system in Resource Tracker with a fixed thread pool using Java's Executor Service class.
>
> The functionality remains the same as before. In terms of testing, using the Performance Test [1], running the reproducers and performing manual testing has shown no regressions.
>
> Thoughts?
>
>
> [1] : http://mail.openjdk.java.net/pipermail/distro-pkg-dev/2014-August/028980.html
>
>
> Regards,

Hey, this looks interesting. :-)

I am aware that you have been aiming for identical functionality, so please keep 
in mind that this post is more of a discussion than a comment on the code quality.

> +    private static final ExecutorService threadPool = Executors.newFixedThreadPool(5);

A constant pool size of 5 seems rather arbitrary. I suppose the most suiting 
thread pool scheduler/model in this case would be one that tries to utilize the 
CPU(s) most effectively, choosing threads that are most probable to finish 
first. The thread pool size should/could probably be calculated based on the 
available bandwidth, average resource size, average latency to a resource, and 
the number of cores or CPUs. But then, this would also require gathering some 
considerable amount of statistical data which seems not feasible for this 
purpose. Hmm, I do know, 5 seems as arbitrary as 42. ;-)

Furthermore as I understand (please correct me if am I wrong), the fixed thread 
pool does not run queued threads when all slots in the pool are blocked or 
waiting, say for example for a response on the network. In this case, which 
probably happens a lot, this thread pool does not help much. With many resources 
to be download, waiting for the first 5 to complete while others might be 
downloading instead, is a zero-sum game.

Also, the positive runtime effects of recycling or reusing existing threads is 
negligible in this scenario either. Actually, such an effect is only even 
measurable if the threads' average lifetime is below 100 ms and hundreds if not 
thousands of threads are to be created per second. So, a thread pool size of 5 
and the average lifetime of a thread downloading a resource for about several 
ten seconds if not more, has no practical effect on the overall performance. 
Again, I know you are just recreating already existing functionality with the 
concurrent framework. But, if you do so, you may want to consider improvements 
to IcedTea-Web that the concurrent framework might enable.

> -        if (threads == 0) {
> -            synchronized (prefetchTrackers) {
> -                queue.trimToSize(); // these only accessed by threads so no sync needed
> -                active.clear(); // no threads so no trackers actively downloading
> -                active.trimToSize();
> -                prefetchTrackers.trimToSize();
> -            }
> +        synchronized (prefetchTrackers) {
> +            queue.trimToSize(); // these only accessed by threads so no sync needed
> +            prefetchTrackers.trimToSize();
>          }

A few questions:
1. Why is the ResourceTracker.queue still required if there is a thread pool 
managing a queue now? Shouldn't every resource get its own thread to be queued 
in the thread pool?

2. Why is ResourceTracker.queue accessed while holding a lock on 
ResourceTracker.prefetchTrackers? If access to ResourceTracker.queue does need 
to be synchronized, then why not synchronize access to it too?

I hope this helps a bit with thinking. ;-)

Jacob


More information about the distro-pkg-dev mailing list