NVIDIA's president and CEO is set to take to the stage tomorrow morning at GTC 2017 to talk about the future of GPGPU computing. The company's annual GPU Technology Conference is all about graphics technology and how to employ it to accelerate a plethora of existing, developing and future workloads.
Every year NVIDIA President and CEO Jen-Hsun Huang gives audiences a glimpse into whichever cutting edge graphics architecture the company is working on next. Volta is in line to succeed Pascal so we should hear a good bit about the new architecture tomorrow. And as is tradition, NVIDIA is likely to issue an update to its long-term graphics roadmap, so a Volta successor could be announced for the very first time.
- 815mm2 die
- 12nm FinFet
- 21B Transistors
- 15 FP32 TFLOPS
- 7.5 FP64 TFLOPS
- NEW 120 Tensor TFLOPS
- 16GB HBM2 @ 900 GB/s
- 5120 CUDA cores
Tesla Product | Tesla K40 | Tesla M40 | Tesla P100 | Tesla V100 |
---|---|---|---|---|
GPU | GK110 (Kepler) | GM200 (Maxwell) | GP100 (Pascal) | GV100 (Volta) |
SMs | 15 | 24 | 56 | 80 |
TPCs | 15 | 24 | 28 | 40 |
FP32 Cores / SM | 192 | 128 | 64 | 64 |
FP32 Cores / GPU | 2880 | 3072 | 3584 | 5120 |
FP64 Cores / SM | 64 | 4 | 32 | 32 |
FP64 Cores / GPU | 960 | 96 | 1792 | 2560 |
Tensor Cores / SM | NA | NA | NA | 8 |
Tensor Cores / GPU | NA | NA | NA | 640 |
GPU Boost Clock | 810/875 MHz | 1114 MHz | 1480 MHz | 1455 MHz |
Peak FP32 TFLOP/s* | 5.04 | 6.8 | 10.6 | 15 |
Peak FP64 TFLOP/s* | 1.68 | 2.1 | 5.3 | 7.5 |
Peak Tensor Core TFLOP/s* | NA | NA | NA | 120 |
Texture Units | 240 | 192 | 224 | 320 |
Memory Interface | 384-bit GDDR5 | 384-bit GDDR5 | 4096-bit HBM2 | 4096-bit HBM2 |
Memory Size | Up to 12 GB | Up to 24 GB | 16 GB | 16 GB |
L2 Cache Size | 1536 KB | 3072 KB | 4096 KB | 6144 KB |
Shared Memory Size / SM | 16 KB/32 KB/48 KB | 96 KB | 64 KB | Configurable up to 96 KB |
Register File Size / SM | 256 KB | 256 KB | 256 KB | 256KB |
Register File Size / GPU | 3840 KB | 6144 KB | 14336 KB | 20480 KB |
TDP | 235 Watts | 250 Watts | 300 Watts | 300 Watts |
Transistors | 7.1 billion | 8 billion | 15.3 billion | 21.1 billion |
GPU Die Size | 551 mm² | 601 mm² | 610 mm² | 815 mm² |
Manufacturing Process | 28 nm | 28 nm | 16 nm FinFET+ | 12 nm FFN |
NVIDIA's Next Generation Volta Architecture
The Volta architecture is scheduled to reach gamers in the form of the next generation GeForce GTX graphics cards sometime next year. However, NVIDIA's public plans for Volta indicate that the company intends to launch Volta accelerators for the enterprise markets later this year. Think Telsa P100 successors.
Industry whispers allege that Volta will not be built on 16nm FinFET like Pascal was. Reports indicate that the upcoming architecture will in fact be built on TSMC's new 12nm FinFET manufacturing process. We may get a definitive answer regarding Volta's process tomorrow.
NVIDIA Volta To Support GDDR6 & HBM2 Memory
Moving beyond 2017, Volta is set to replace the Pascal refresh in 2018. The new architecture will, just like Pascal before it, focus on pushing the performance per watt envelope. Although the GPUs are said to feature GDDR6 and HBM2 memory technology, little is actually known about the architecture itself. GDDR6 chips rated at 16Gbps are expected to boost memory bandwidth by 60% compared to current GDDR5X 10Gbps memory technology we see deployed in the GTX 1080 and TITAN Xp.
NVIDIA GeForce 30 Series - Volta GV110, GV102 & GV104 GPUs Launching In 2018
Volta GPUs
- NVIDIA GV110 GPU
- NVIDIA GV102 GPU
- NVIDIA GV104 GPU
GV104 represents NVIDIA's bread & butter Volta gaming GPU. The company will likely target the popular ~$400 segment with a cut down variant of the chip and the high-end with a fully enabled variant. That is if the company doesn't veer too far from what we've seen it historically do with its 104 class chips. GV104 should replace GP104, the chip that powers the current GTX 1080 and GTX 1070 cards.
GV104 based graphics cards, i.e. GTX 3080 & 3070 cards are rumored to feature 256-bit memory interfaces with 16GB of GDDR6 memory running at 16Gbps for a total of 512GB/s of bandwidth, 32GB/s more than what's available to the 384-bit TITAN Xp. The growth in overall memory bandwidth indicates fairly healthy gains in performance as well.
Volta will also see a GV102 chip to replace GP102, the GPU that powers the TITAN Xp. GV102 will be the fastest gaming chip NVIDIA has to offer in the Volta stack. GV110 on the other hand is built purely for the professional market. Think AI inferencing, deep-learning & datacenter workloads. These top-end server grade Volta chips are expected to feature 32GB of HBM2 & 1TB/s of memory bandwidth and debut sometime this year in a host of new supercomputers.
Wccftech | GeForce 10 Series | Tesla V100 | GeForce 20/11 Series | AMD 400 Series | AMD 500 Series | AMD 600 Series |
---|---|---|---|---|---|---|
Architecture | Pascal | Volta | Turing | Polaris | Polaris / Vega | Navi |
Process Node | TSMC 16nm FF | TSMC 12nm FF | TSMC 12nm FF | Samsung 14nm LPP | Samsung 14nm LPP | TSMC 7nm FinFET |
Memory | GDDR5/X | HBM2 | GDDR6 | GDDR5 | GDDR5 / HBM2 | GDDR6/HBM2 |
Year | 2016 | 2017 | 2018 | 2016 | 2017 | 2019 |
GPUs | GP102, GP104, GP106, GP107 | GV110 | TU102, TU104, TU106, TU107 | Polaris 10, Polaris 11 | Polaris 20, Polaris 21, Vega 10, Vega 11 | Navi 10, Navi 11 (Navi Chiplet?) |