From fb5f181452dfee8725af3a57ad9edd04c5e5bb62 Mon Sep 17 00:00:00 2001
From: Xuan Gu <xuagu37@gmail.com>
Date: Mon, 24 Oct 2022 11:55:12 +0200
Subject: [PATCH] Update README.md

---
 README.md | 9 +++++----
 1 file changed, 5 insertions(+), 4 deletions(-)

diff --git a/README.md b/README.md
index b9077dc..eda7f00 100644
--- a/README.md
+++ b/README.md
@@ -99,10 +99,11 @@
 when batch_size is large (16, 32, 64, 128), throughput_amp > throughput_tf32.
 
 <img src="https://github.com/xuagu37/Benchmark_nnU-Net_for_PyTorch/blob/main/figures/benchmark_throughput_cv.png" width="400">
 
-- The expected throughput for dim = 2, nodes = 1, gpus = 1, batch_size = 256 would be 670 ± 10 (TF32).
-- The expected throughput for dim = 2, nodes = 1, gpus = 4, batch_size = 256 would be 2600 ± 100 (TF32).
-- The expected throughput for dim = 2, nodes = 1, gpus = 8, batch_size = 256 would be 5150 ± 150 (TF32).
-- The expected throughput for dim = 2, nodes = 2, gpus = 16, batch_size = 128 would be 9250 ± 150 (TF32).
+- The expected throughput for dim = 2, nodes = 1, gpus = 8, batch_size = 256 would be 5130 ± 180 (TF32).
+- The expected throughput for dim = 2, nodes = 2, gpus = 16, batch_size = 128 would be 9300 ± 70 (TF32).
+- The expected throughput for dim = 2, nodes = 3, gpus = 24, batch_size = 128 would be 13880 ± 85 (TF32).
+- The expected throughput for dim = 2, nodes = 4, gpus = 24, batch_size = 128 would be 18500 ± 90 (TF32).
+
 
 **Observation 3**: Ideally, the improvement of throughput would be linear when batch_size increases. In practice, throughtput stays below the ideal curve when batch_size > 16.
-- 
GitLab