diff --git a/README.md b/README.md
index 025581c278e0916527ec7f12a7b8b4cd4eb1f5db..40ecf35433c499587b06f2815885f8b5378bdd57 100644
--- a/README.md
+++ b/README.md
@@ -84,8 +84,10 @@ when batch_size is large (16, 32, 64, 128), throughput_amp > throughput_tf32.
 
 - Observation 2: The coefficient of variation of throughput for the 100 iterations is smallest when batch_size = 128.  
 
-**Benchmarking with dim = 2, nodes = 1, 2, gpus = 8, batch_size = 128 can be used for node health check.  
-For example, the expected throughput for dim = 2, nodes = 1, gpus = 8, batch_size = 128 would be 4700 ± 500 (TF32).**
+**Benchmarking with dim = 2, nodes = 1, 2, gpus = 8, batch_size = 128 can be used for node health check.** 
+- The expected throughput for dim = 2, nodes = 1, gpus = 8, batch_size = 128 would be 4700 ± 500 (TF32).
+- The expected throughput for dim = 2, nodes = 2, gpus = 16, batch_size = 128 would be 9250 ± 150 (TF32).
+
 
 <img src="https://github.com/xuagu37/Benchmark_nnU-Net_for_PyTorch/blob/main/figures/benchmark_throughput_cv.png" width="400">