Update README.md

d1e33495 · Xuan Gu · GitHub · 928785af · d1e33495
Unverified Commit d1e33495 authored 2 years ago by Xuan Gu Committed by GitHub 2 years ago
--- a/README.md
+++ b/README.md
@@ -102,6 +102,6 @@ when batch_size is large (16, 32, 64, 128), throughput_amp > throughput_tf32.

 #### Notes
 - It seems running directly via singularity shell will give worse performance (when I WFH). We should run it via sbatch script instead.
- It took around a week to finish all iterations of benchmarking.
+- It took around a week to finish 100 iterations of benchmarking for all sets of parameters.
 - For multi-node benchmarking, we need to use "srun" command; also, the line "#SBATCH --ntasks-per-node=8" has to been added. Otherwise the process will hang.
 - Benchmarking with dim = 2, nodes = 1, gpus = 8, batch_size = 128 takes ~2mins. If we want to finish it within a minute, we can change the number of batches from 150 (the default value) to a smaller number.