diff --git a/README.md b/README.md index a18082965802239162620e9afd3c35fc8872b331..3ea26269333b3978e30e9050a746d692d00b6f55 100644 --- a/README.md +++ b/README.md @@ -87,7 +87,7 @@ when batch_size is large (16, 32, 64, 128), throughput_amp > throughput_tf32. Benchmarking with dim = 2, nodes = 1,2, gpus = 8, batch_size = 128 can be used for node health check. For example, the expected throughput for dim = 2, nodes = 1, gpus = 8, batch_size = 128 would be ? ± ? (TF32) and ? ± ? (AMP). -<img src="https://github.com/xuagu37/Benchmark_nnU-Net_for_PyTorch/blob/main/figures/benchmark_cv_example.png" width="400"> +<img src="https://github.com/xuagu37/Benchmark_nnU-Net_for_PyTorch/blob/main/figures/benchmark_throughput_cv.png" width="400"> - Observation 3: Ideally, the improvement of throughput would be linear when batch_size increases. In practice, throughtput stays below the ideal curve when batch_size > 16.