From ccb26e0fffe75ea1c6da01a77be3e9b0c82a34c3 Mon Sep 17 00:00:00 2001 From: Xuan Gu <xuagu37@gmail.com> Date: Mon, 31 Oct 2022 14:34:18 +0100 Subject: [PATCH] Update README.md --- README.md | 32 +++----------------------------- 1 file changed, 3 insertions(+), 29 deletions(-) diff --git a/README.md b/README.md index 6f5a788..4b645c9 100644 --- a/README.md +++ b/README.md @@ -51,39 +51,13 @@ sbash benchmark_nnunet_pytorch_berzelius.sh sbash benchmark_nnunet_pytorch_berzelius_multi_node.sh ``` -<!-- -#### For single node -- Start an interactive session -``` -interactive -N2 --reservation=nsc-testing -t 600 -``` - -- Pull the image for Singularity and run -``` -cd /proj/nsc/xuan/ngc/DeepLearningExamples/PyTorch/Segmentation/nnUNet -singularity pull nvidia_nnu-net_for_pytorch.sif docker://xuagu37/nvidia_nnu-net_for_pytorch:21.11.0 -singularity shell -B ${PWD}/data:/data -B ${PWD}/results:/results --nv nvidia_nnu-net_for_pytorch.sif -``` -- Run the benchmark script -``` -bash benchmark_nnunet_pytorch_berzelius.sh -``` - -#### For multi-node -- Run the benchmark script -``` -cd /proj/nsc/xuan/ngc/DeepLearningExamples/PyTorch/Segmentation/nnUNet -sbash benchmark_nnunet_pytorch_berzelius_multi_node.sh -``` ---> - ### Results We collect benchmark results of throughput (images/sec) for - Precisions = TF32, AMP - Dimention = 2 -- Nodes = 1, 2 -- GPUs = 1 - 8 (for 1 node), 16 (for 2 nodes) -- Batch size = 1, 2, 4, 8, 16, 32, 64, 128 +- Nodes = 1, 2, 3, 4, 5, 6, 7, 8 +- GPUs = 1 - 8 (for 1 node), all gpus (for multi-node) +- Batch size = 1, 2, 4, 8, 16, 32, 64, 128, 256 TF32 (TensorFloat32) mode is for accelerating FP32 convolutions and matrix multiplications. TF32 mode is the default option for AI training with 32-bit variables on Ampere GPU architecture. -- GitLab