> The parallel version is almost certainly suffering from false cache-line
> sharing when adjacent tasks are writing to the shared arrays u0, etc.
> This has nothing to do with streams; it is just a standard parallelism
> gotcha. Cure: don't write to shared arrays from parallel tasks.
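
To make the suggested cure concrete, here is a minimal sketch of one way to restructure such code. It is not the original program: the class name `ChunkedUpdate`, the methods `compute`, `fillChunk` and `computeAll`, and the chunk size are all hypothetical, and `compute` merely stands in for whatever the real update of `u0` does. Each task fills a private array for its own index range and returns it, and the caller assembles the shared array sequentially afterwards, so no two tasks write to neighbouring elements of the same array.

```java
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

class ChunkedUpdate {
    // Hypothetical per-element computation standing in for the real update of u0.
    static double compute(int i) {
        return Math.sqrt(i);
    }

    // Each task fills a private array for its index range [from, to) and returns it.
    // The task never writes to an array that another task is writing to.
    static double[] fillChunk(int from, int to) {
        double[] local = new double[to - from];
        for (int i = from; i < to; i++) {
            local[i - from] = compute(i);
        }
        return local;
    }

    static double[] computeAll(int n, int chunkSize) {
        int chunks = (n + chunkSize - 1) / chunkSize;  // ceiling division
        // IntStream.range is ordered, and Collectors.toList() keeps encounter
        // order even for a parallel stream, so parts come back in index order.
        List<double[]> parts = IntStream.range(0, chunks)
                .parallel()
                .mapToObj(c -> fillChunk(c * chunkSize, Math.min(n, (c + 1) * chunkSize)))
                .collect(Collectors.toList());

        // Sequential assembly: the only writes to the shared array happen here,
        // on a single thread, after the parallel work has finished.
        double[] u0 = new double[n];
        int pos = 0;
        for (double[] part : parts) {
            System.arraycopy(part, 0, u0, pos, part.length);
            pos += part.length;
        }
        return u0;
    }

    public static void main(String[] args) {
        double[] u0 = computeAll(1_000, 128);
        System.out.println(u0.length);
    }
}
```

The extra copy costs one sequential pass over the data, which is usually cheap compared with cache lines bouncing between cores. Separately allocated local arrays are not guaranteed to sit on different cache lines, so this reduces the risk of false sharing rather than ruling it out; larger chunks help further.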