diff --git a/README.md b/README.md
index 291f08cf39d661fca3a4601b5dc55b84a3c5f46a..e575a420a209adf465bac42e05ddba071c6f38ee 100644
--- a/README.md
+++ b/README.md
@@ -64,6 +64,6 @@ We collect benchmark results of throughput (images/sec) for
 - Batch size = 1, 2, 4, 8, 16, 32, 64, 128
 
 We run 100 iterations for each set of parameters.
-- Observation 1: throughput_tf32 > throughput_amp when batch_size is small (1, 2, 4, 8);
-throughput_tf32 < throughput_amp when batch_size is large (16, 32, 64, 128).
+- Observation 1: when batch_size is small (1, 2, 4, 8), throughput_amp ≈ throughput_tf32;
+when batch_size is large (16, 32, 64, 128), throughput_amp > throughput_tf32.
 - Observation 2: The coefficient of variation of throughput for the 100 iterations is smallest when batch_size = 128.
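
Note on Observation 2 in the hunk above: the README does not show how the coefficient of variation is computed, so the following is only a minimal sketch of one common definition (sample standard deviation divided by the mean) applied to per-iteration throughput. The function name `throughput_stats`, its parameters, and the timings are hypothetical and are not taken from the repository; the benchmark may well use a population standard deviation or a different timing method.

```python
import statistics


def throughput_stats(batch_size, iteration_seconds):
    """Return (mean throughput, coefficient of variation) for one configuration.

    Hypothetical helper: assumes one wall-clock time per iteration, in seconds.
    """
    # Throughput of each iteration in images/sec.
    throughputs = [batch_size / t for t in iteration_seconds]
    mean = statistics.mean(throughputs)
    # Coefficient of variation: sample standard deviation divided by the mean.
    # (Assumption: the README may use the population form instead.)
    cv = statistics.stdev(throughputs) / mean
    return mean, cv


# Made-up timings for illustration only; these are not benchmark results.
mean, cv = throughput_stats(batch_size=4, iteration_seconds=[0.5, 0.4, 0.5, 0.4])
print(f"mean throughput = {mean:.1f} images/sec, CV = {cv:.3f}")
```

Because the coefficient of variation is normalized by the mean, it can be compared across batch sizes even though the raw throughput differs by large factors between them.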