AMD Vega GPU Pictured Up Close, Biggest FinFET GPU by RTG, Over 500mm2 – Features Two HBM2 Stacks, Carries 8 GB VRAM, 512 GB/s Bandwidth

Hassan Mujtaba
AMD Vega GPU Radeon Vega

AMD has officially unveiled their Vega GPU to the masses at CES 2017. Along with the architectural preview, AMD also provided a glimpse at the Vega GPU die which will power the next-generation Radeon RX 500 series graphics cards.

AMD Vega GPU Pictured Up Close - 8 GB VRAM, 512 GB/s Bandwidth From Two HBM2 Stacks

The AMD Vega GPU was actually showcased to selected press at the AMD Tech Summit. The new graphics chip will be AMD's first consumer aimed GPU to utilize the new HBM2 standard. This will allow them to double the capacity per die and clock per pin.

Related Story It’s Time To Bid Farewell To AMD RDNA 2 “Radeon RX 6000” GPUs, Inventory Hits Rock Bottom

The first generation HBM graphics cards such as the Radeon R9 Fury X was limited to just 4 GB of VRAM and had a bandwidth of 512 GB/s. It had 4-layers per stack (256 MB per layer) and that will continue with the latest Vega GPUs since AMD will have to maximize value on these cards for the gaming audience. In the case of 4-layers, we will be looking at higher densities per layer. The pin speed also increases with HBM2. The new memory standard can clock up to 2 Gb/s compared to 1 Gb/s on HBM1.

The increased clock speed would allow the same memory bandwidth as four HBM1 stacks on just two HBM2 stacks. The increased density also allows AMD to cut down the costs in designing larger interposers. HBM2 itself takes more space compared to HBM1 with a die size around 92mm2 while HBM1 was just 35mm2 in size.

A Close-up of AMD's Biggest FinFET GPU To Date - The Mighty Vega, Over 500mm2

AMD Vega GPU is compared to other chips from NVIDIA and AMD. (Image Credits: Videocardz)

Close-ups of the GPU posted by TechReport, Computerbase and 4Gamer reveal that the GPU is the biggest FinFET design by AMD. Videocardz also made a comparison that showcases the huge chip against other GPUs such as the NVIDIA GP102, GP102 and AMD's Fiji GPU. The chip is somewhere in between 530-560mm2 so this is really a monstrous chip design and should pack a lot of power.

amd-vega-gpu-2
amd-vega-gpu_1

AMD GCN GPU Architectures

WccftechHD 7000 SeriesRadeon 200 SeriesRadeon Fury SeriesRadeon RX 400 SeriesTBA
Flagship GPUTahitiHawaiiFijiPolaris 10Vega #TBA
GCN IteraionGCN 1GCN 2GCN 3GCN 4GCN NCU
Launch20112013201520162017
Die Size352mm2438mm2596mm2232mm2530-560mm2
Memory ArchitectureGDDR5GDDR5HBM1GDDR5HBM2

Highlights of the Vega GPU architecture's advancements include:

  • The world's most advanced GPU memory architecture: The Vega architecture enables a new memory hierarchy for GPUs. This radical new approach comes in the form of a new high-bandwidth cache and its controller. The cache features leading-edge HBM2 technology which is capable of transferring terabytes of data every second, doubling the bandwidth-per-pin over the previous generation HBM technology. HBM2 also enables much greater capacity at less than half the footprint of GDDR5 memory. Vega architecture is optimized for streaming very large datasets and can work with a variety of memory types with up to 512TB of virtual address space.
  • Next-generation geometry pipeline: Today's games and professional applications make use of incredibly complex geometry enabled by the extraordinary increase in the resolutions of data acquisition devices. The hundreds of millions of polygons in any given frame have meshes so dense that there are often many polygons being rendered per pixel. Vega's next-generation geometry pipeline enables the programmer to extract incredible efficiency in processing this complex geometry, while also delivering more than 200% of the throughput-per-clock over previous Radeon architectures. It also features improved load-balancing with an intelligent workload distributor to deliver consistent performance.
  • Next-generation compute engine: At the core of the Vega architecture is a new, next-generation compute engine built on flexible compute units that can natively process 8-bit, 16-bit, 32-bit or 64-bit operations in each clock cycle. These compute units are optimized to attain significantly higher frequencies than previous generations and their support of variable datatypes makes the architecture highly versatile across workloads.
  • Advanced pixel engine: The new Vega pixel engine employs a Draw Stream Binning Rasterizer, designed to improve performance and power efficiency. It allows for "fetch once, shade once" of pixels through the use of a smart on-chip bin cache and early culling of pixels invisible in a final scene. Vega's pixel engine is now a client of the onboard L2 cache, enabling considerable overhead reduction for graphics workloads which perform frequent read-after-write operations.

AMD Radeon Vega Official Architecture Slides:

slides-2
vega-final-presentation-03
vega-final-presentation-04
vega-final-presentation-05
vega-final-presentation-06
vega-final-presentation-07
slides-8
slides-11
vega-final-presentation-12
slides-13
slides-14
vega-final-presentation-15
slides-16
vega-final-presentation-17
vega-final-presentation-18
slides-19
vega-final-presentation-20
slides-21
vega-final-presentation-22
vega-final-presentation-23
slides-24
vega-final-presentation-25
slides-27
slides-28
vega-final-presentation-29
vega-final-presentation-30
vega-final-presentation-31
slides-32
vega-final-presentation-34
vega-final-presentation-35
slides-36
vega-final-presentation-36
slides-37

AMD Vega Lineup

Graphics CardRadeon R9 Fury XRadeon RX 480Radeon RX Vega Frontier EditionRadeon Vega ProRadeon RX Vega (Gaming)Radeon RX Vega Pro Duo
GPUFiji XTPolaris 10Vega 10Vega 10Vega 102x Vega 10
Process Node28nm14nm FinFETFinFETFinFETFinFETFinFET
Stream Processors40962304409635844096 (?)Up to 8192
Performance8.6 TFLOPS
8.6 (FP16) TFLOPS
5.8 TFLOPS
5.8 (FP16) TFLOPS
~13 TFLOLPS
~25 (FP16) TFLOPS
11 TFLOLPS
22 (FP16) TFLOPS
>13 TFLOLPS
>25 (FP16) TFLOPS
TBA
TBA
Memory4GB HBM8GB GDDR516GB HBM2TBATBATBA
Memory Bus4096-bit256-bit2048-bit2048-bit2048-bit4096-bit
Bandwidth512GB/s256GB/S480GB/S400GB/STBATBA
TDP275W150WTBATBATBATBA
Launch20152016June 2017June 2017July 2017TBA

AMD Radeon Vega Early Performance Expectations From DOOM

According to the report from TechReport who got to thoroughly test the Vega sample with 8 GB VRAM in DOOM, this specific sample is somewhere in between the GTX 1070 and GTX 1080 levels of performance. When set to Nightmare settings, the game reported max frame times of 24.8ms. We also recorded some gameplay of DOOM running on the Vega sample while we were at AMD's CES event.

Though that performance might not sound so impressive, it's worth noting that all of the demo system's vents (including the graphics card's exhaust) were taped up, and it's quite likely the chip was sweating to death in its own waste heat. By my rough estimate, that puts the early Vega card inside somewhere between the performance of a GTX 1070 and a GTX 1080 in Doom. via TechReport

The AMD Vega GPU is an exciting product and AMD has shared some key details of their next generation GPU with the audience. The launch of Vega GPUs is slated for the first half of 2017 and there couldn't be a better time to introduce Vega since AMD is also launching their next-generation AM4 platform for Ryzen processors in the upcoming months. For more details on the Vega GPU architecture, please visit our detailed post here.

Share this story

Deal of the Day

Comments