MIG enables multiple GPU instances to run in parallel on a single physical NVIDIA A100 GPU. MIG mode spatially partitions the GPU hardware so that each instance is fully isolated, with its own streaming multiprocessors (SMs) and high-bandwidth memory. MIG can also partition the available GPU compute resources. See Figure 1.
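
To make the idea of spatial partitioning concrete, the following is a minimal sketch that models an A100 40GB as a fixed budget of compute and memory slices and checks whether a requested set of instances fits within it. The slice counts follow the commonly documented A100 40GB profile names (for example `1g.5gb`, `3g.20gb`); the `Profile` class and the `fits_budget` function are hypothetical names introduced here for illustration and are not part of any NVIDIA tool or API. The check is only a necessary condition: the real driver enforces additional placement rules that this sketch does not model.

```python
from dataclasses import dataclass

# Assumed slice budget for an A100 40GB: 7 compute slices, 8 memory slices.
COMPUTE_SLICES = 7
MEMORY_SLICES = 8


@dataclass(frozen=True)
class Profile:
    """Hypothetical description of one MIG instance profile."""
    name: str
    compute_slices: int
    memory_slices: int


# Profile table assumed from the A100 40GB profile names
# (Ng = compute slices; memory slices are 5 GB each).
PROFILES = {
    "1g.5gb": Profile("1g.5gb", 1, 1),
    "2g.10gb": Profile("2g.10gb", 2, 2),
    "3g.20gb": Profile("3g.20gb", 3, 4),
    "4g.20gb": Profile("4g.20gb", 4, 4),
    "7g.40gb": Profile("7g.40gb", 7, 8),
}


def fits_budget(requested):
    """Return True if the requested profiles fit the slice budget.

    This is a necessary check only; it ignores the driver's
    placement constraints.
    """
    compute = sum(PROFILES[name].compute_slices for name in requested)
    memory = sum(PROFILES[name].memory_slices for name in requested)
    return compute <= COMPUTE_SLICES and memory <= MEMORY_SLICES


if __name__ == "__main__":
    print(fits_budget(["1g.5gb"] * 7))             # seven small instances
    print(fits_budget(["4g.20gb", "3g.20gb"]))     # one large, one medium
    print(fits_budget(["3g.20gb", "3g.20gb", "1g.5gb"]))  # exceeds memory slices
```

Because every instance draws from its own slices, a workload in one instance cannot consume the SMs or memory assigned to another, which is the isolation property described above.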