Nvidia Turing architecture, GeForce RTX 20 series expalined

With the introduction of the GeForce RTX 20 series, Nvidia introduces a new graphics processor architecture called Turing. It brings important improvements over Pascal (GeForce GTX 1000 series) but is still not completely known. Nvidia is developing Volta, which was launched last year to address the gaming market.

Before seeing its big progress, Turing physically stands out from Pascal with much bigger GPUs. At this point, Nvidia announces an area of 754 mm² for the most powerful version, a good advance over the GP102 of the GTX 1080 Ti and its 471 mm². This GPU, named TU102, is the heart of the showcase of this new generation for the GeForce RTX 2080 Ti.

The expansion makes it possible to accommodate many more transistors, which can reach up to 18.6 billion.

Nvidia Turing Architecture

All this is used to verify the distribution of calculations in the center of GPUs. We have retouched at the level of caches, calculations of integers and floats or the arrival of units that are intended for very specific tasks.

Turing, RT Core, Tensor Core and Parallelism

Nvidia continues to organize its architecture according to the SM (Streaming Multiprocessor) concept. It collects the Cuda Core calculation units (integers and floats) and now has 8 Tensor Core and one RT Core. The tensor core offers very specific calculations around AI and Deep Learning, while the RT core is designed for calculations around Ray Tracing.

With Turing, Cuda Core units are now able to process integers and real numbers simultaneously. According to Nvidia, this increases the variable flow rate by 36%. The big news is the RT Core, RT for Ray-Tracing Core. It was on the front page of Nvidia’s presentation at the launch of the GeForce RTX 2080 and 2080 Ti.

The aim is to improve the quality of rendering by introducing a so-called hybrid method. It is based on the existing system, the so-called screening, and at the same time offers real-time ray tracing.

Ray tracing or ray throw offers a much more realistic representation. It affects the quality of lighting, shadows or reflections. This hardware wiring allows the GeForce RTX 2080 Ti to be 10 times more efficient than its big brother, the GeForce GTX 1080 Ti when performing this type of calculation.

Nvidia plays a card for the future because this hybrid rendering does not come from its family, but that is from those of the DirectX RT (DXRT) API from Microsoft. Consequently, any compatible card that supports this API is also compatible. On the other hand, the advantage is the presence of a hardware acceleration, which should ensure a first-class performance in competition. It cannot be ruled out that AMD will make this choice at the launch of its next generation of Radeon.

However, Ray Tracing is not yet widespread. The API will be available when the Windows 10 October 2018 update is released, while games must integrate this technology. To return to the tensor core mentioned above, Nvidia wanted to exploit it through a new anti-aliasing filter, DLSS.

This is the contraction of Deep Learning Super Sampling. The company proposes to the developers to use a neural network to analyze the representation of their titles in order to develop an algorithm tailored to their game.

It is of course implemented by the tensor core. More than ten titles support this DLSS. With this new generation of GeForce, Nvidia supports DisplayPort 1.4a with DSC 1.2. This protocol allows you to operate a display in 8K resolution at 60 Hz with HDR. Note that the latter is managed natively. The video driver supports up to two 8K screens at 60 Hz. In addition, there is an HDMI 2.0b port with HDCP 2.2 protection support.

The graphics card has a USB-C port to enjoy VirtualLink. The aim is to simplify the operation of a virtual headset by limiting the wiring. We are entitled to four DisplayPort HBR3 lines (video streams) and 2 USB 3.1 Gen 2 channels (data). The SLI technology evolves through the use of an already known NVlink. The goal is to increase the bandwidth. Turing has an improved video encoding/decoding machine on board compared to Pascal’s.

Here is a summary of the characteristics of these GeForce RTX 20 series.

	GeForce RTX 2070	GeForce RTX 2080	GeForce RTX 2080 Ti
Architecture	Turing – 12 nm FFN	Turing – 12 nm FFN	Turing – 12 nm FFN
GPU	TU106	TU104	TU102
Number of transistors	10.8 billion	13.6 billion	18.6 billion
Surface of the die	445 mm²	545 mm²	754 mm²
SM blocks	36	46	68
CUDA Core	2304	2944	4352
Tensor Core	288	368	544
RT Core	36	46	68
ROPs	64	64	88
TMU	144	184	272
Basic frequency	1410	1515	1350
Boost Frequency	1620/1710 (FE)	1710/1800	1545/1635
GDDR6 memory	8GB	8GB	11 GB
Memory frequency	14 Gbps
Memory bus	256 bits	256 bits	352 bits

Nvidia Turing Architecture

Turing, RT Core, Tensor Core and Parallelism

Related posts: