Introduction

Many conversations revolve around sizing models to GPUs. In enterprise environments, the question becomes how many of which models can be deployed where.

The Calculator

  • a deliberately simple focus on parameter count, context length, and batch size
  • estimates how many MIG instances on NVIDIA GPUs could serve the model
  • the number of MIG instances determines how many model copies can run concurrently
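The last step, mapping a VRAM requirement onto MIG instances, can be sketched as follows. The profile table lists NVIDIA's published MIG profiles for the A100-80GB; picking the smallest profile that fits is a simplification and an assumption about how the tool works, not its actual logic.

```python
# Sketch: pick the smallest A100-80GB MIG profile that fits a model's
# VRAM requirement, and report how many such instances (and therefore
# how many concurrent model copies) a single GPU can host.

# NVIDIA's published MIG profiles for the A100-80GB:
# (profile name, instance memory in GB, max instances per GPU)
A100_80GB_PROFILES = [
    ("1g.10gb", 10, 7),
    ("2g.20gb", 20, 3),
    ("3g.40gb", 40, 2),
    ("4g.40gb", 40, 1),
    ("7g.80gb", 80, 1),
]

def pick_mig(required_gb, profiles=A100_80GB_PROFILES):
    """Return (profile_name, instance_count) for the smallest profile
    whose memory covers required_gb, or None if the model cannot fit."""
    for name, mem_gb, max_instances in profiles:
        if mem_gb >= required_gb:
            return name, max_instances
    return None

print(pick_mig(16))  # a ~16 GB model lands on 2g.20gb -> 3 copies per GPU
```

Note that the smallest-fit choice maximizes instance count; a real deployment might prefer a larger profile for compute headroom.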

The Code

  • the code is in the embedded page (yes, all of it)
  • feel free to examine and improve!

The Tool

MIG Estimator

Inputs:

  • Parameter count
  • Precision
  • Context length
  • Number of concurrent sessions
  • Target GPU (optional)
  • Number of transformer layers
  • Number of attention heads
  • Head dimension
  • Example presets

Outputs:

  • Amount of VRAM required (GB)
  • GPU name
  • MIG name
  • Number of instances
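The inputs above map onto a standard serving-VRAM estimate: the weights take parameters times bytes-per-value, and the KV cache grows with layers, context length, heads, head dimension, and concurrent sessions. A minimal sketch follows; the 20% runtime overhead factor is my assumption, not necessarily what the tool applies.

```python
# Hedged sketch of a VRAM estimate built from the calculator's inputs.
BYTES_PER_VALUE = {"fp32": 4, "fp16": 2, "int8": 1, "int4": 0.5}

def estimate_vram_gb(params_b, precision, context_len, sessions,
                     layers, heads, head_dim, overhead=1.2):
    """Estimate serving VRAM in GB.

    params_b  -- parameter count in billions
    precision -- key into BYTES_PER_VALUE
    overhead  -- fudge factor for activations/runtime (assumption)
    """
    bytes_per = BYTES_PER_VALUE[precision]
    weights = params_b * 1e9 * bytes_per
    # KV cache: two tensors (K and V) per layer, per token, per session
    kv_cache = (2 * layers * context_len * heads * head_dim
                * bytes_per * sessions)
    return (weights + kv_cache) * overhead / 1e9

# Example: a 7B model at fp16, 4096-token context, 4 concurrent sessions,
# 32 layers, 32 heads, head dimension 128
print(round(estimate_vram_gb(7, "fp16", 4096, 4, 32, 32, 128), 1))
```

That result would then feed the MIG-profile lookup to report the GPU name, MIG name, and instance count.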