diff --git a/README.md b/README.md index 9927a35d3bb67ee73ec37f1484b647bf5dce5ae9..1009af21185c3d8214ca59ba7292c55b7fa7c27a 100644 --- a/README.md +++ b/README.md @@ -82,7 +82,7 @@ when batch_size is large (16, 32, 64, 128), throughput_amp > throughput_tf32. <img src="https://github.com/xuagu37/Benchmark_nnU-Net_for_PyTorch/blob/main/figures/benchmark_throughput_batch_size.png" width="400"> -- Observation 2: The coefficient of variation of throughput for the 100 iterations is smallest when batch_size = 128. +- Observation 2: The coefficient of variation of throughput for 100 iterations is smallest when batch_size = 128. **Benchmarking with dim = 2, nodes = 1, 2, gpus = 8, batch_size = 128 can be used for node health check.** - The expected throughput for dim = 2, nodes = 1, gpus = 8, batch_size = 128 would be 4700 ± 500 (TF32).