Skip to content
Snippets Groups Projects
Unverified Commit 10f32525 authored by Xuan Gu's avatar Xuan Gu Committed by GitHub
Browse files

Update README.md

parent 0f355ccf
No related branches found
No related tags found
No related merge requests found
...@@ -103,3 +103,4 @@ when batch_size is large (16, 32, 64, 128), throughput_amp > throughput_tf32. ...@@ -103,3 +103,4 @@ when batch_size is large (16, 32, 64, 128), throughput_amp > throughput_tf32.
#### Notes #### Notes
- It seems running directly via singularity shell will give worse performance (when I WFH). We should run it via sbatch script instead. - It seems running directly via singularity shell will give worse performance (when I WFH). We should run it via sbatch script instead.
- It took around a week to finish all iterations of benchmarking. - It took around a week to finish all iterations of benchmarking.
- For multi-node bash script, the line "#SBATCH --ntasks-per-node=8" has to been added; otherwise the process will hang.
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment