- 4.2x faster than cp for 100K × 100KB files (local NVMe) - 58% faster than rsync using kTLS for network transfer - Single worker + deep queue beats multi-threading