AMD Trinity APU architecture

The upcoming Trinity processors bring quite a few enhancements to the table, foremost of which is a revamped CPU part that ditches the old Phenom II-derived processor cores in favor of updated Piledriver architecture CPU modules. According to the leaked slides, the move to the Piledriver x86 cores promises higher IPC (instructions per clock), reduced leakage, reduced CAC, and increased clockspeeds, with the improved performance per clock being the most important change. The Trinity APUs are set to replace the A8 (performance series) Llano APUs with quad core Trinity processors that use two Piledriver modules, each with 2MB of shared L2 cache, for 4MB of L2 cache in total. Like Llano, Trinity does not employ any L3 cache shared between the CPU and GPU cores. Interestingly, that may mean that higher clocked RAM can improve performance on Trinity just as it did with Llano. If true, that would make Trinity’s improved DDR3 support, up to DDR3-2133, all the better.

On the GPU side of things, Trinity moves to a “Northern Islands” VLIW4 architecture with up to 384 stream processing units. Although the GPU area is physically smaller, it is said to be more efficient than the GPU cores in Llano APUs. The new GPU core is DirectX 11 and OpenCL 1.1 compliant. It also includes an updated hardware tessellation engine and a hardware video encoding unit (AMD Accelerated Video Converter).
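To put the memory speed point in rough numbers, here is a minimal sketch of peak theoretical DDR3 bandwidth at several speed grades. The dual-channel layout and the 64-bit (8-byte) channel width are assumptions not stated in the article, and the function name is purely illustrative. Real-world throughput will be lower than these peak figures.

```python
# Peak theoretical DRAM bandwidth for a DDR3 configuration.
# Assumptions (not from the article): two memory channels,
# each 64 bits (8 bytes) wide.

def peak_bandwidth_gbs(transfer_rate_mts, channels=2, bus_bytes=8):
    """Return peak bandwidth in GB/s (1 GB = 10**9 bytes)."""
    return transfer_rate_mts * 1e6 * bus_bytes * channels / 1e9

if __name__ == "__main__":
    for speed in (1333, 1600, 1866, 2133):
        print(f"DDR3-{speed}: {peak_bandwidth_gbs(speed):.1f} GB/s")
```

Because the CPU and GPU share the same memory and there is no L3 cache, every step up in transfer rate raises the ceiling for both.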

Trinity also ratchets up the automatic overclocking with Turbo Core 3 support, which can boost the CPU clockspeed up to 19% or the GPU clockspeed up to 20% above stock clocks. Even better, the APU is able to allocate power to either the GPU or the CPU depending on which needs the boost and how much TDP headroom the chip has for a given task. For example, AMD shows that the A10-4600M APU can downclock the GPU from the default clockspeed of 685MHz to 496MHz, allowing the x86 Piledriver cores to boost by up to 900MHz to a clockspeed of 3.2GHz. Alternatively, when the GPU is needed, it can run at 685MHz while the CPU sits at 2.3GHz. AMD is likely not able to push the GPU much further, as any additional GPU increase would require a much larger reduction in CPU speed. At that point the system would be starved of CPU power and the GPU would likely become bottlenecked anyway.

The Trinity APUs continue to be based on GlobalFoundries’ 32nm SOI manufacturing process, but this time the chips are slightly larger, with a die size of 246mm^2. Although the die is larger than Llano's, it actually has fewer transistors: 1.303 billion versus the 1.45 billion in Llano. That may seem like a step in the wrong direction, but the new CPU modules and GPU cores are said to be much more efficient than those in Llano, so Trinity should come out on top despite the lower transistor count. The Trinity APUs will also feature an expanded instruction set that includes AVX, AVX1.1, FMA3, AES, and F16C, which should help the CPU in certain tasks.
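As a rough illustration of how software might check for these extensions, the sketch below parses a `flags` line in the format Linux exposes through /proc/cpuinfo. The flag names (`avx`, `fma`, `aes`, `f16c`) are the ones the Linux kernel reports; the helper name and the sample line are hypothetical, and AVX1.1 is left out because no separate flag is assumed for it here.

```python
# Map the extensions named in the article to Linux /proc/cpuinfo flag names.
# AVX1.1 is omitted: this sketch does not assume a dedicated flag for it.
FEATURE_FLAGS = {"AVX": "avx", "FMA3": "fma", "AES": "aes", "F16C": "f16c"}

def supported_features(flags_line):
    """Return {feature: bool} for a 'flags : ...' line from /proc/cpuinfo."""
    # Drop the "flags :" prefix if present, then split into individual flags.
    flags = set(flags_line.split(":", 1)[-1].split())
    return {name: flag in flags for name, flag in FEATURE_FLAGS.items()}

if __name__ == "__main__":
    # Hypothetical, shortened sample line; a real one is much longer.
    sample = "flags : fpu sse sse2 aes avx fma f16c"
    for name, present in supported_features(sample).items():
        print(f"{name}: {'yes' if present else 'no'}")
```

On a real Linux system, the same function could be fed the first line of /proc/cpuinfo that starts with `flags`.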


The Trinity APU upgrades its CPU to Piledriver, the second-generation Bulldozer core, and its GPU to a second-generation DX11 core. The goal is higher performance at lower power consumption, and judging by earlier tests both goals have largely been achieved.

Piledriver improvements

The primary change is the CPU core. The first-generation APU, Llano, used a modified K10 core, while Trinity inherits the modular design of the Bulldozer architecture. The quad-core part has two modules, each with 2MB of shared L2 cache, and Trinity still has no L3 cache.

The instruction set adds support for AVX, AVX 1.1, and AES and, of course, AMD's FMA3 instructions.

Compared with first-generation Bulldozer, Piledriver improves IPC (instructions per cycle) and lowers leakage current, and frequencies have also increased. Desktop-class Llano topped out at 2.9GHz, while Trinity ranges from 2.0GHz to 3.8GHz; the low end of that range is mainly for mobile models, and the fastest desktop-class CPU reaches 3.8GHz. GPU frequencies follow a similar pattern: a minimum of 424MHz on mobile models, a typical 685MHz, and up to 800MHz.

In summary, AMD says system performance improves by 26% on desktop-class Trinity, and productivity performance improves by 29% on mobile models. (A nice but vague statement.)

Piledriver architecture improvements

Turbo Core 3.0

Turbo Core dynamic acceleration technology has also been upgraded to its third generation. AMD does not describe it in much detail beyond more advanced power management and the fact that both the CPU and the GPU can be boosted.

Take the A10-4600M as an example. Its base CPU frequency is 2.3GHz and its default GPU frequency is 685MHz. In a GPU-heavy test such as 3DMark 11, the GPU runs at its full 685MHz and the CPU frequency is not raised. In the PCMark test, the GPU does not face much load, so its frequency drops to 496MHz and the CPU can rise to 2.7GHz. When running single-threaded CINEBench, the GPU stays at 496MHz but the CPU can boost up to 3.2GHz.
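The trade-off described above can be tabulated with a short sketch. The frequencies are the ones quoted for the A10-4600M in this article; the table layout, constant names, and helper function are illustrative assumptions, not anything AMD has published.

```python
# Base clocks for the A10-4600M as quoted in the article (MHz).
BASE_CPU_MHZ = 2300
BASE_GPU_MHZ = 685

# (workload, CPU MHz, GPU MHz) operating points described in the article.
OPERATING_POINTS = [
    ("3DMark 11", 2300, 685),
    ("PCMark", 2700, 496),
    ("CINEBench (single-threaded)", 3200, 496),
]

def percent_change(value, base):
    """Relative change of value versus base, in percent."""
    return (value - base) / base * 100.0

if __name__ == "__main__":
    for name, cpu_mhz, gpu_mhz in OPERATING_POINTS:
        cpu_delta = percent_change(cpu_mhz, BASE_CPU_MHZ)
        gpu_delta = percent_change(gpu_mhz, BASE_GPU_MHZ)
        print(f"{name}: CPU {cpu_mhz}MHz ({cpu_delta:+.1f}%), "
              f"GPU {gpu_mhz}MHz ({gpu_delta:+.1f}%)")
```

Laying the three cases side by side shows that the CPU only gains headroom in the workloads where the GPU gives up clockspeed.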

GPU core — VLIW4

The second improvement in the Trinity APU is the GPU. Llano's graphics core used the VLIW5 design of the HD 6000 series, while Trinity upgrades to the more efficient VLIW4 design shared with the HD 6900 series. With six SIMD units, the stream processor count drops to 384, which on paper is lower than Llano's 400, but the greater efficiency improves Trinity's 3D performance. AMD has previously claimed an increase of 50%.
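The stream processor totals can be reproduced with simple arithmetic. This sketch assumes 16 VLIW processors per SIMD unit and five SIMD units in Llano; neither figure is given in the article, and the function name is illustrative.

```python
# Assumption (not from the article): each SIMD unit holds 16 VLIW processors.
VLIW_PROCESSORS_PER_SIMD = 16

def stream_processors(simd_units, alus_per_vliw):
    """Total stream processors = SIMD units x VLIW processors x ALUs each."""
    return simd_units * VLIW_PROCESSORS_PER_SIMD * alus_per_vliw

# Trinity: six SIMD units (from the article), VLIW4 means 4 ALUs per processor.
trinity_sp = stream_processors(simd_units=6, alus_per_vliw=4)

# Llano: five SIMD units (assumed), VLIW5 means 5 ALUs per processor.
llano_sp = stream_processors(simd_units=5, alus_per_vliw=5)

if __name__ == "__main__":
    print(f"Trinity: {trinity_sp} stream processors")
    print(f"Llano:   {llano_sp} stream processors")
```

Under these assumptions Trinity has more SIMD units than Llano but fewer ALUs in each VLIW processor, which is how the total ends up slightly lower.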

The Trinity APU is still built on GF's 32nm SOI process. However, the process should be considerably more mature by now, unlike last year, when Llano had such a rough time. The die area has grown to 246mm^2 from Llano's 228mm^2, while the transistor count has fallen to 1.303 billion from Llano's 1.45 billion. In other words, the transistor count went down while the die area went up.
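To make the comparison concrete, a small sketch can compute transistor density from the die areas and transistor counts quoted above. The function name is illustrative, and the result is only an average across the whole die, not a statement about any particular block.

```python
def density_mtr_per_mm2(transistors_billion, area_mm2):
    """Average transistor density in millions of transistors per mm^2."""
    return transistors_billion * 1000 / area_mm2

# Figures as quoted in the article.
trinity_density = density_mtr_per_mm2(1.303, 246)
llano_density = density_mtr_per_mm2(1.45, 228)

if __name__ == "__main__":
    print(f"Trinity: {trinity_density:.2f} M transistors/mm^2")
    print(f"Llano:   {llano_density:.2f} M transistors/mm^2")
    print(f"Ratio:   {trinity_density / llano_density:.2f}")
```

On the same 32nm process, a lower average density points to a different mix of logic and cache or a more relaxed layout rather than a process change.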

This appears to be a negative, but Trinity's power consumption is indeed lower. Standby power is 1.08W, and mobile models come in at TDPs of 17W, 25W, and 35W, while desktop-level APU TDPs remain unchanged at 65W and 100W. These are only design reference values; actual power consumption will have to be confirmed in real-world testing.

Several benchmark results leaked earlier give people confidence in the Trinity APU. Thanks to the improved CPU architecture, higher frequencies, the upgraded GPU, and lower power consumption, Trinity can replace the CPU plus discrete graphics combination in the low-end market. That is not good news for AMD's own low-end graphics cards, but it is still better than having that market eroded by Intel's Ivy Bridge.