Nvidia Pascal GTX 1080 Has 8GB GDDR5X & 320GB/s Of Bandwidth, GTX 1070 Has 8GB GDDR5 & 256GB/s – GP104 GPU Supports GDDR5/X

Khalid Moammer

According to the latest whispers, Nvidia has allegedly designed two reference PCBs, one for GDDR5X and one for GDDR5, for its GP104 GPU based GTX 1080 and GTX 1070 graphics cards. The rumor claims that Nvidia has decided to create a "premium" GP104 board based on the GP104-400 GPU that is going to power the flagship Pascal GeForce GTX graphics card this year. Otherwise known as the GTX 1080 in the web's echo chambers, this "premium" board will allegedly feature GDDR5X rather than GDDR5.

Nvidia's more mainstream GP104 based graphics card, the purported GTX 1070, will be based on a cut down version of the same GP104 chip, code named GP104-200, and feature 8Gbps GDDR5 chips instead. This rumor comes straight from the Chiphell forums via bitsandchips.it, which also brought us the leaked GP104 die shots a few days ago. So while there may be veracity to these claims, we'd still advise our readers to take this with the usual grain of salt.

Nvidia GeForce GTX 1080 And GeForce GTX 1070 To Feature Different PCBs Due to Different GDDR5X & GDDR5 Pin Layout

According to the same source, two different PCB designs are necessary due to the different pin layouts of GDDR5X and GDDR5 chips. So whilst the GP104 GPU is claimed to be compatible with both memory technologies, the different pin layout doesn't allow GDDR5X to be a simple drop-in replacement.

NVIDIA Pascal GP104 GPU

The leaked GP104 die shot revealed that the pictured graphics board features 8Gbps Samsung GDDR5 memory chips. Unfortunately, the info inscribed on the die has been omitted; otherwise we would've been able to determine whether this is GP104-400 or GP104-200 and validate the rumored claims of GP104-400 using GDDR5X. Assuming the whispers are true, this die shot should be of GP104-200, and this should be a GTX 1070 board rather than a GTX 1080.

The first wave of GDDR5X memory chips, which Micron started sampling last month and will be mass producing in the summer, is rated at 10Gbps, 11Gbps and 12Gbps. This means that the fastest GDDR5X configuration will yield up to 50% more bandwidth than the 8Gbps GDDR5 memory chips pictured above.

The GP104 GPU is configured with a 256-bit memory interface, so with 10Gbps GDDR5X chips the GTX 1080 will have access to 320GB/s of memory bandwidth. That's up to 43% more than the GTX 980 and just 5% less than the GTX 980 Ti.
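For readers who want to check the bandwidth figures themselves, here is a minimal sketch of the arithmetic. It assumes the usual rule of thumb that peak bandwidth equals bus width times per-pin data rate, divided by 8 to convert bits to bytes; the function name is ours, and the GTX 1080 and GTX 1070 inputs are the rumored values, not confirmed specs.

```python
def memory_bandwidth_gbs(bus_width_bits, data_rate_gbps):
    """Peak memory bandwidth in GB/s (bus width x per-pin rate, bits to bytes)."""
    return bus_width_bits * data_rate_gbps / 8

# (bus width in bits, per-pin data rate in Gbps); Pascal entries are rumored values
cards = {
    "GTX 980 Ti": (384, 7),
    "GTX 980": (256, 7),
    "GTX 1080 (rumored)": (256, 10),
    "GTX 1070 (rumored)": (256, 8),
}

for name, (bus, rate) in cards.items():
    print(f"{name}: {memory_bandwidth_gbs(bus, rate):.0f} GB/s")

gtx_1080 = memory_bandwidth_gbs(256, 10)
gain_vs_980 = (gtx_1080 / memory_bandwidth_gbs(256, 7) - 1) * 100
deficit_vs_980ti = (1 - gtx_1080 / memory_bandwidth_gbs(384, 7)) * 100
print(f"vs GTX 980: +{gain_vs_980:.0f}%, vs GTX 980 Ti: -{deficit_vs_980ti:.0f}%")
```

The same function shows where the 50% figure for the fastest GDDR5X chips comes from: 12Gbps against 8Gbps on an identical 256-bit bus.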

Nvidia Pascal Specs

WCCF         | GTX 980 Ti      | GTX 980         | GTX 1080         | GTX 1070         | TESLA P100 (GP100)
GPU          | GM200           | GM204           | GP104            | GP104            | GP100
Process Node | 28nm            | 28nm            | 16nm FinFET      | 16nm FinFET      | 16nm FinFET
Transistors  | 8 Billion       | 5.2 Billion     | TBA              | TBA              | 15.3 Billion
CUDA Cores   | 2816 CUDA Cores | 2048 CUDA Cores | 2560 CUDA Cores? | 2048 CUDA Cores? | 3840 CUDA Cores
VRAM         | 6 GB GDDR5      | 4 GB GDDR5      | 8 GB GDDR5X      | 8 GB GDDR5       | 16GB HBM2
Memory Bus   | 384-bit         | 256-bit         | 256-bit          | 256-bit          | 4096-bit
Memory Speed | 7Gbps           | 7Gbps           | 10Gbps           | 8Gbps            | 1.4Gbps
Bandwidth    | 336GB/s         | 224GB/s         | 320GB/s          | 256GB/s          | 720GB/s
TDP          | 250W            | 165W            | TBA              | TBA              | 300W
Launch Date  | May 2015        | September 2014  | June 2016        | June 2016        | Q1 2017

Micron announced late last month that it's already shipping 10Gbps, 11Gbps and 12Gbps samples to its customers. This means that Nvidia, as well as AMD, already has access to GDDR5X chips to test and will be ready to roll out graphics cards featuring the new memory technology as production ramps up this summer. It also indicates that the decision to use both GDDR5X and GDDR5, as opposed to just GDDR5X, was driven mainly by Nvidia's desire to reduce cost.

So far, all rumors and leaks point towards a late May announcement at Computex and a June launch for Nvidia's next generation Pascal GP104 based GTX 1080 and GTX 1070 graphics cards. Whether Nvidia will actually name its next generation GTX 980 and GTX 970 replacements GTX 1080 and GTX 1070 is subject to speculation at this point, but I fully expect Nvidia to roll out a new naming scheme for its new products this year.

Nvidia's Pascal Architecture - Fewer, Faster CUDA Cores With Significantly Higher Per Thread Throughput

We dove deep into Nvidia's Pascal architecture last week after the company's GTC 2016 reveal of the Tesla P100 and the flagship Pascal GP100 GPU that will be launching in 2017. We discussed all the architectural updates that Nvidia has made to Pascal, which I'd highly recommend you check out if you're interested in finding out how much faster Pascal is going to be.

NVIDIA Pascal SMP

A few very significant changes from Maxwell to Pascal stand out. Each Pascal CUDA core has been beefed up considerably compared to Maxwell, and boost clock speeds have gone up by roughly 33% (1114 MHz on the Tesla M40 versus 1480 MHz on the Tesla P100). So core for core, Pascal will be much faster than Maxwell.

Tesla Products             | Tesla K40      | Tesla M40       | Tesla P100
GPU                        | GK110 (Kepler) | GM200 (Maxwell) | GP100 (Pascal)
SMs                        | 15             | 24              | 56
TPCs                       | 15             | 24              | 28
FP32 CUDA Cores / SM       | 192            | 128             | 64
FP32 CUDA Cores / GPU      | 2880           | 3072            | 3584
FP64 CUDA Cores / SM       | 64             | 4               | 32
FP64 CUDA Cores / GPU      | 960            | 96              | 1792
Base Clock                 | 745 MHz        | 948 MHz         | 1328 MHz
GPU Boost Clock            | 810/875 MHz    | 1114 MHz        | 1480 MHz
Compute Performance - FP32 | 5.04 TFLOPS    | 6.82 TFLOPS     | 10.6 TFLOPS
Compute Performance - FP64 | 1.68 TFLOPS    | 0.21 TFLOPS     | 5.3 TFLOPS
Texture Units              | 240            | 192             | 224
Memory Interface           | 384-bit GDDR5  | 384-bit GDDR5   | 4096-bit HBM2
Memory Size                | Up to 12 GB    | Up to 24 GB     | 16 GB
L2 Cache Size              | 1536 KB        | 3072 KB         | 4096 KB
Register File Size / SM    | 256 KB         | 256 KB          | 256 KB
Register File Size / GPU   | 3840 KB        | 6144 KB         | 14336 KB
TDP                        | 235 Watts      | 250 Watts       | 300 Watts
Transistors                | 7.1 billion    | 8 billion       | 15.3 billion
GPU Die Size               | 551 mm²        | 601 mm²         | 610 mm²
Manufacturing Process      | 28-nm          | 28-nm           | 16-nm
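The FP32 figures in the table can be roughly reproduced from core count and boost clock. The sketch below assumes each CUDA core retires one fused multiply-add per clock, counted as 2 floating point operations, which is the common convention for quoting peak throughput; the function name is ours. The results land within a few hundredths of a TFLOPS of the listed values, so small differences likely come down to rounding or the exact clock used.

```python
def peak_fp32_tflops(cuda_cores, boost_clock_mhz):
    """Theoretical peak FP32 throughput in TFLOPS.

    Assumes 2 FLOPs per core per clock (one fused multiply-add).
    """
    return 2 * cuda_cores * boost_clock_mhz / 1e6

# (FP32 CUDA cores, boost clock in MHz), taken from the table above
tesla = {
    "Tesla K40": (2880, 875),
    "Tesla M40": (3072, 1114),
    "Tesla P100": (3584, 1480),
}

for name, (cores, clock) in tesla.items():
    print(f"{name}: {peak_fp32_tflops(cores, clock):.2f} TFLOPS")

# Boost clock uplift from Maxwell (M40) to Pascal (P100), in percent
clock_gain = (1480 / 1114 - 1) * 100
print(f"Boost clock increase: {clock_gain:.0f}%")
```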

This should come as a relief to those who have been wondering if Nvidia's GP104 based graphics cards, the GTX 1080 and GTX 1070, will offer a reasonable speed-up over the GTX 980 Ti and GTX 980. The leaked die shot of GP104 revealed that the chip in question measures only roughly 300mm², half the size of the 3840 CUDA core GP100 GPU.

The GP100 GPU features 6 GPCs, or Graphics Processing Clusters. Each contains 10 Pascal SMs, or Streaming Multiprocessors, and each SM contains 64 Pascal CUDA cores, which means that each GPC houses 640 Pascal CUDA cores. Since the GP104 GPU is almost exactly half the size of GP100, if Nvidia maintains the same 10 SM per GPC design, the GTX 1080 should feature 3 GPCs and 1920 CUDA cores. That rises to 2048 CUDA cores if Nvidia decides to tweak the design and opt for a 4 GPC layout with 8 SMs per GPC.
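The core counts above are simple multiplication, sketched here for the three layouts in question. The GP100 layout comes from Nvidia's reveal, while both GP104 layouts are our speculation based on die size; the function and layout names are ours.

```python
CORES_PER_SM = 64  # Pascal CUDA cores per SM, per the GP100 figures

def cuda_cores(gpcs, sms_per_gpc, cores_per_sm=CORES_PER_SM):
    """Total CUDA cores for a given GPC / SM layout."""
    return gpcs * sms_per_gpc * cores_per_sm

layouts = {
    "GP100 (6 GPCs x 10 SMs)": (6, 10),
    "GP104, speculated (3 GPCs x 10 SMs)": (3, 10),
    "GP104, speculated (4 GPCs x 8 SMs)": (4, 8),
}

for name, (gpcs, sms) in layouts.items():
    print(f"{name}: {cuda_cores(gpcs, sms)} CUDA cores")
```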

GPU                                            | Kepler GK110 | Maxwell GM200 | Pascal GP100 | Volta GV100
Compute Capability                             | 3.5          | 5.3           | 6.0          | 7.0
Threads / Warp                                 | 32           | 32            | 32           | 32
Max Warps / Multiprocessor                     | 64           | 64            | 64           | 64
Max Threads / Multiprocessor                   | 2048         | 2048          | 2048         | 2048
Max Thread Blocks / Multiprocessor             | 16           | 32            | 32           | 32
Max 32-bit Registers / SM                      | 65536        | 65536         | 65536        | 65536
Max Registers / Block                          | 65536        | 32768         | 65536        | 65536
Max Registers / Thread                         | 255          | 255           | 255          | 255
Max Thread Block Size                          | 1024         | 1024          | 1024         | 1024
CUDA Cores / SM                                | 192          | 128           | 64           | 64
Shared Memory Size / SM Configurations (bytes) | 16K/32K/48K  | 96K           | 64K          | 96K

In either case, with Pascal's significant architectural improvements and very high frequency increase in mind, a 1920-2048 CUDA core GTX 1080 graphics card should end up being faster than a GTX 980 Ti. The performance delta should be reminiscent of what the GTX 980 brought to the table when it launched against the GTX 780 Ti: a 15-20% performance increase at a lower price point. That being said, the GTX 1070 should be the star of the lineup, offering GTX 980 Ti comparable performance at a much more affordable price point, again similar to what the 970 delivered compared to the 780 Ti.

AMD's Polaris GCN 4.0 architecture is purported to deliver similar gains over the company's current GCN iteration, and we'll be detailing those in an upcoming architectural deep dive piece as well. Polaris 10 and Polaris 11 based cards, the R9 490, 480 and 470 series, are also pegged for a June launch against Pascal. So we can't wait to see both Pascal and Polaris in action this summer.

 
