| 2024 |
Blackwell |
B100, B200, GB200, RTX 50xx |
~20k–30k+ |
HBM3e, 96–288 GB |
~4–8 TB/s |
FP64, FP32, TF32, FP16, BF16, FP8, FP6, FP4, INT8/4 |
| 2022 |
Hopper |
H100, H200, GH200 |
~14k–18k+ |
HBM3, 80–94 GB |
~3–3.9 TB/s |
FP64, FP32, TF32, FP16, BF16, FP8, INT8 |
| 2022 |
Ada Lovelace |
RTX 4090, RTX 4080, RTX 4070, RTX 6000 Ada |
~5k–16k |
GDDR6X, 12–24 GB |
~21 Gbps (~1 TB/s) |
FP32, FP16, BF16, TF32, INT8, limited FP8 |
| 2020 |
Ampere |
RTX 3090, RTX 3080, RTX 3070, A100, A30 |
~3k–10k+ |
GDDR6X / HBM2e, 8–80 GB |
~760 GB/s–1.6 TB/s |
FP64, FP32, TF32, FP16, BF16, INT8/4 |
| 2018 |
Turing |
RTX 2080 Ti, RTX 2080, RTX 2070, T4 |
~2300–4300 |
GDDR6, 6–11 GB |
~14 Gbps |
FP32, FP16, INT8, INT4 |
| 2017 |
Volta |
V100, Titan V |
5120 |
HBM2, 16–32 GB |
~900 GB/s |
FP64, FP32, FP16, INT8 |
| 2016 |
Pascal |
GTX 1080 Ti, GTX 1080, Tesla P100, P40 |
~2560–3584 |
GDDR5X / HBM2, 8–16 GB |
~10–16 Gbps |
FP64, FP32, FP16 (limited) |