diff --git a/README.md b/README.md
index 4940fe6c251f33203daead3be2f3e99024fefe6d..719c35a0537aa27e1d6e514f2cd1b806dc1c9037 100644
--- a/README.md
+++ b/README.md
@@ -60,7 +60,7 @@ We collect benchmark results of throughput (images/sec) for
 - Dimention = 2
 - Nodes = 1, 2
 - GPUs = 1 - 8 (for 1 node), 16 (for 2 nodes)
-- Batch size = 2, 4, 8, 16, 32, 64, 128
+- Batch size = 1, 2, 4, 8, 16, 32, 64, 128
 
 We run 100 iterations for each set of parameters.
 - Observation 1: throughput_tf32 > throughput_amp when batch_size is small (1, 2, 4, 8);