RFR: 8265175: (fs) Files.copy(Path, Path, CopyOption...) should use zero-copy on Linux

Alan Bateman alanb at openjdk.java.net
Wed Apr 14 10:24:58 UTC 2021


On Tue, 13 Apr 2021 21:40:26 GMT, Brian Burkhalter <bpb at openjdk.org> wrote:

> Please consider this request to change the underlying native implementation of `java.nio.file.Files.copy(Path,Path,CopyOption...)` on Linux only to perform zero-copy via `sendfile(2)`. The `sendfile()` system call is already used in `java.nio.channels.FIleChannel.transferTo​(long,long,WritableByteChannel)`. It is intentionally _not_ proposed to use `sendfile()` in the native macOS implementation as on macOS the function requires that the destination file descriptor is for a socket.
> 
> This change showed some performance improvement as measured by JMH.
> 
> **Before: user-space buffers (read() + write())**
> 
> Benchmark           (size)   Mode  Cnt      Score     Error  Units
> FilesCopy.copy       10240  thrpt    5  39167.400 ± 683.887  ops/s
> FilesCopy.copy       51200  thrpt    5  20782.622 ± 558.031  ops/s
> FilesCopy.copy      102400  thrpt    5  13260.709 ± 176.673  ops/s
> FilesCopy.copy      512000  thrpt    5   3171.837 ± 175.803  ops/s
> FilesCopy.copy     1048568  thrpt    5   1654.253 ±  39.419  ops/s
> FilesCopy.copy    10485760  thrpt    5    145.328 ±   7.192  ops/s
> FilesCopy.copy   104857600  thrpt    5     12.440 ±   2.275  ops/s
> FilesCopy.copy  1073741824  thrpt    5      1.073 ±   0.081  ops/s
> 
> **After: zero-copy (sendfile())**
> 
> Benchmark           (size)   Mode  Cnt      Score     Error  Units
> FilesCopy.copy       10240  thrpt    5  40571.516 ± 548.977  ops/s
> FilesCopy.copy       51200  thrpt    5  23993.334 ± 506.817  ops/s
> FilesCopy.copy      102400  thrpt    5  15927.485 ± 309.081  ops/s
> FilesCopy.copy      512000  thrpt    5   4207.129 ±  95.454  ops/s
> FilesCopy.copy     1048568  thrpt    5   2147.046 ±  33.446  ops/s
> FilesCopy.copy    10485760  thrpt    5    148.798 ±   1.329  ops/s
> FilesCopy.copy   104857600  thrpt    5     14.541 ±   0.675  ops/s
> FilesCopy.copy  1073741824  thrpt    5      1.270 ±   0.029  ops/s

I agree it's worth exploring using sendfile64 as file-to-file was not available when this code was written.

Have you measured the user vs. kernel time when testing? I'm also interested to see if you've tried using a buffer size that is a LCM of the source and destination block sizes.

For large files, can you can what byte_sent returns? I'm wondering if the cancellable mechanism is effective with this implementation.

The call already has the file size so the call to fstat is not needed. I think I'd prefer to continue to read to EOF as that it is a bit friendly to files that are changing when being copied. I assume the current patch is problematic on 32-bit (use sendfile64 and off64_t to be consistent with FileChannelImpl). It will also need to handle the EINTR case.

-------------

Changes requested by alanb (Reviewer).

PR: https://git.openjdk.java.net/jdk/pull/3476


More information about the nio-dev mailing list