NVIDIA
NVIDIA Data Center GPUs
Name | bits/s | Bytes/s |
---|---|---|
SanDisk Extreme SSD | 550MB/s read | |
USB 3.1 Gen 2 | 10Gb/s | 1.25GB/s |
HDMI | 10Gb/s | 1.25GB/s |
HDMI 4K | 18Gb/s | 2.25GB/s |
M1 Macbook Pro 1TB SSD | 7.4GB/s read | |
M2 Air 1TB SSD | 2.8GB/s read | |
M2 Air Memory | 100GB/s | |
M1 Pro Memory | 200GB/s | |
M1 Max Memory | 400GB/s | |
NVLink(A100) | 600GB/s | |
HBM(A100 40G) | 1.5TB/s | |
HBM(A100 80G) | 1.9TB/s | |
HBM(H100 SXM 80G) | 3.35TB/s |
V100, A100, H100
Streaming Multiprocessor (SM), Compute Unit (AMD)
SuperPOD
Compute Nodes: 40ea DGX A100 system(8x A100)
A100 80GB부터는 VRAM이 80GB. V100까지는 16/32GB까지만 제공되어 insufficient memory error가 잦았다.
$ nvidia-smi topo --matrix
GPU0 GPU1 GPU2 GPU3 GPU4 GPU5 GPU6 GPU7 CPU Affinity NUMA Affinity
GPU0 X NV1 NV1 NV2 NV2 PHB PHB PHB 0-63 0-1
GPU1 NV1 X NV2 NV1 PHB NV2 PHB PHB 0-63 0-1
GPU2 NV1 NV2 X NV2 PHB PHB NV1 PHB 0-63 0-1
GPU3 NV2 NV1 NV2 X PHB PHB PHB NV1 0-63 0-1
GPU4 NV2 PHB PHB PHB X NV1 NV1 NV2 0-63 0-1
GPU5 PHB NV2 PHB PHB NV1 X NV2 NV1 0-63 0-1
GPU6 PHB PHB NV1 PHB NV1 NV2 X NV2 0-63 0-1
GPU7 PHB PHB PHB NV1 NV2 NV1 NV2 X 0-63 0-1
Legend:
X = Self
SYS = Connection traversing PCIe as well as the SMP interconnect between NUMA nodes (e.g., QPI/UPI)
NODE = Connection traversing PCIe as well as the interconnect between PCIe Host Bridges within a NUMA node
PHB = Connection traversing PCIe as well as a PCIe Host Bridge (typically the CPU)
PXB = Connection traversing multiple PCIe bridges (without traversing the PCIe Host Bridge)
PIX = Connection traversing at most a single PCIe bridge
NV# = Connection traversing a bonded set of # NVLinks
Last Modified: 2024/10/14 15:47:40