#### BattleScar

##### Member

- Jul 29, 2016

- 1,131

- 2,360

- 485

Not correct.

## AMD Radeon RX 6900 XT Review - The Biggest Big Navi

AMD's Radeon RX 6900 XT offers convincing 4K gaming performance, yet stays below 300 W. Thanks to this impressive efficiency, the card is almost whisper-quiet, quieter than any RTX 3090 we've ever tested. In our review, we not only benchmark the RX 6900 XT on Intel, but also on Zen 3, with fast...www.techpowerup.com

RTX Ampere was designed to have excess TFLOPS relative rasterization hardware for other workloads such as

1. Mesh shader (compute)

2. Direct Storage GpGPU decompression (compute)

3. DirectML (compute)

For each Turing SM vs Ampere SM difference, each Turing Integer CUDA core was turned into integer/floating-point CUDA core in Ampere.

Turing SM

64 Integer CUDA cores

64 floating-point CUDA cores

Ampere SM

64 Integer/ floating-point CUDA cores

64 floating-point CUDA cores

Try again.

Direct Storage and DirectML are GPU agnostic technologies designed by Microsoft. If anything it will have been developed to extensively be supported by Microsoft's Xbox - a console that uses RDNA2 - not Ampere.

And I know exactly how Ampere differs from Turing with respect to floating point.

BTW, did you know that before Turing, CUDA cores could do either Integer math OR floating point math? Amazing isn't it.

Which means that GA102 does indeed have 10752 cuda cores - that's literally how Nvidia defines GA102 - that's what their whitepapers say, so that's the truth. And if a CUDA Core can do either INT or FP calcs, its still a Vector ALU. No matter what fancy branding Nvidia gives it.

So the end result is that we have two GPUs

3090 - 82SM's - 10496 CUDA cores (Vector ALUs) - 100% performance (relative)

6900XT - 80CU - 5120 Stream processors (Vector ALUs) - 94% performance (relative)

Where is all that extra compute power going?