Performance regression with IntStream.parallel.sum?
Sebastian Zarnekow
Sebastian.Zarnekow at itemis.de
Sat Oct 26 04:03:52 PDT 2013
Hi,
I think I stumbled on some performance regression in b112. Please consider this simple code:
public class C {
public int parallelSum(final int reps) {
IntStream stream = Arrays.stream(array).parallel().map(e ->
e * 5 * reps
);
int result += stream.sum();
return result;
}
C() {
this.length = 2;
setUp();
parallelSum(5);
}
private int[] array;
private int length;
protected void setUp() {
array = new int[length];
for(int i = 0; i < length; i++) {
array[i] = 3 * i;
}
}
public static void main(String[] args) {
new C();
}
}
which was extracted from some Caliper benchmark code. The benchmark results (code: https://gist.github.com/szarnekow/7168147) are here (produced with build 1.8.0-ea-b112):
https://microbenchmarks.appspot.com/runs/a0579577-2df9-4894-9b19-ac345f78b020#r:scenario.benchmarkSpec.methodName,scenario.benchmarkSpec.parameters.length
whereas the following results were produced with an earlier build (1.8.0-ea-b92)
https://microbenchmarks.appspot.com/runs/220a5db5-434c-43f3-9cd9-93ef869d51e8#r:scenario.benchmarkSpec.methodName,scenario.benchmarkSpec.parameters.length
Initially thought it can be tracked down to java.util.stream.IntPipeline#reduce(int, java.util.function.IntBinaryOperator) which ultimately causes boxing / unboxing in ReduceOps.makeInt.ReducingSink.get() but the benchmark results for a sequential stream contradicted that. Also it turned out that #get is only called once in the parallel scenario, too. So here I was on the wrong track. Is there some other information that I could provide to track this regression down? Should the code be written differently?
Best regards,
Sebastian
More information about the lambda-dev
mailing list