![]() |
![]() |
#1 |
"Composite as Heck"
Oct 2017
73510 Posts |
![]()
Zen 3 is the 7nm+ generation of AMD chiplets that follows the current 7nm Zen 2 generation. It's early to start discussing Zen 3 as not even all Zen 2 SKU's have been released yet but I'm going to be compiling data about Zen 3 anyway so it might as well be public. Anything written as absolute is confirmed (I'll try my best anyway, hopefully with references), the rest is speculation based on logic and a big dollop of wishful thinking.
Last fiddled with by M344587487 on 2019-10-05 at 20:16 Reason: Conference video removed from youtube |
![]() |
![]() |
![]() |
#2 |
"Sam Laur"
Dec 2018
Turku, Finland
2·3·5·11 Posts |
![]()
AMD confirmed in a recent presentation that the CCX will indeed be 8 cores as well, bringing the whole chiplet together.
https://wccftech.com/amd-zen-3-epyc-...-cpu-detailed/ |
![]() |
![]() |
![]() |
#3 |
"Composite as Heck"
Oct 2017
3·5·72 Posts |
![]()
That's the presentation that I and all the news sites got their information, there's no confirmation that a CCX will be 8 cores. I watched it before it was removed from youtube and it was presented in a confusing manner (with slides detailing different generations intermingled), if anyone reports 8 core CCX confirmed they are mistaken and most (all?) are reporting it as speculation.
Just the cache has been confirmed as unified. This has benefits on its own:
|
![]() |
![]() |
![]() |
#4 |
"Sam Laur"
Dec 2018
Turku, Finland
2·3·5·11 Posts |
![]()
This slide has it...
edit: now that I think about it, perhaps not, it just says that the L3 cache is unified, but would that make sense without making the CCX 8 cores as well? Last fiddled with by nomead on 2019-10-06 at 10:50 |
![]() |
![]() |
![]() |
#5 |
"Composite as Heck"
Oct 2017
3·5·72 Posts |
![]()
For the reasons I outlined above I think so. 8 core CCX's will likely come at some point but I'm skeptical it's going to be so soon. Changing topology seems like a big change and everything points to Zen 3 being a more incremental step. Place your bets now.
|
![]() |
![]() |
![]() |
#6 |
Jan 2015
25310 Posts |
![]()
For the Epyc 7003 release, they should (had better) get the memory controller off of 14nm. The uniform access to all the memory on the socket is nice, especially for big VMs, but the 150ns access latency is a weakness.
Other than that... I can't think of what the next bottlneck is going to be, besides the obvious cores per ccx. Edit: Clock speed is a weakness. I'm eying the 7543 32c SKU for the clock speed, and the 64c whatever it is for density/cost. They announced a 7H12 special SKU that has 64c @ 2.7GHz, but, like the intel AP series, requires 300w+ and water cooling. Last fiddled with by aurashift on 2019-10-08 at 18:08 |
![]() |
![]() |
![]() |
#7 |
Feb 2016
UK
39910 Posts |
![]()
How did I miss this thread until now... my biggest wish item improvement from Zen 2 would be unified L3 cache, and if there is nothing new to get in the way, it would make these CPUs even better.
As always, I don't know how the P95-like code works, but I assume core to core communication isn't required, and it is more a shared data set that is needed. So if that thinking is correct, then a split CCX with unified L3 would still be of great benefit. Right now for maximum throughput, I have to keep tasks on one CCX as there is quite a drop in throughput once you cross CCX. AVX-512 isn't so exciting if they implement something comparable to one unit Intel CPUs. No throughput benefit to AVX2/FMA. If they do VNNI, there might still be something in it for low precision uses but I don't believe that is useful to us. |
![]() |
![]() |
![]() |
#8 |
Just call me Henry
"David"
Sep 2007
Cambridge (GMT/BST)
579510 Posts |
![]()
We want them to include AVX-512 as soon as possible as it shows progress towards what we actually want. AMD virtually always implement SIMD extensions that are slow initially but catch up after a generation or two. AVX512 has so many extra bits that there will quite possibly be some useful bits even at the same throughput as FMA3. I believe that the doubling in the number of registers should help if nothing else.
|
![]() |
![]() |
![]() |
#9 |
Feb 2016
UK
39910 Posts |
![]()
Thinking more, I'm wondering how much benefit there could be from AVX-512 in Zen. Reason for saying this is that I feel they're power and thermal limited already even at lower core counts. We got the expected FP performance increase over previous generations in Zen 2, and some improvement from 7nm, but they still use a lot more energy running this type of code than otherwise.
I think we're at the point where power efficiency is more important than more raw performance. |
![]() |
![]() |
![]() |
#10 | ||||
"Composite as Heck"
Oct 2017
3×5×72 Posts |
![]() Quote:
An interesting question is what intel will do. They likely have to go for an MCM approach in the future (at least for servers) to scale as well as AMD have, they'll have a similar choice to make when it comes to memory access. Their 56 core vapourware is in essence two 28C 8180's smushed together with the same NUMA as a two processor server just in one socket (2x6 channel not 1x12 channel, separate cache), it's not representative of their intentions. iGPU's are another element to consider for both companies, we've yet to see how an MCM iGPU performs (unless you count the NUCs, I don't know if you can) and if intel intends to switch to MCM for consumers they'll have that influencing the design too. Quote:
Quote:
Quote:
|
||||
![]() |
![]() |
![]() |
#11 |
Sep 2016
331 Posts |
![]()
There's enough stuff in AVX512 not related to 512-bit that AMD can't sit on it forever. So they'll either need to adopt AVX512 as is, or build a competing version (highly unlikely).
When I spoke to David Suggs (lead chip architect for AMD) at Hot Chips back in August, I asked him if there's anything in AVX512 that's fundamentally difficult (under the pretense that AMD isn't already working on it). I specifically asked about things that (I believe) would complicate a design that doesn't natively support it: - Mask registers. - 4-input dependencies - Decoding He said none. No difficulties. It's just "work to be done" - without implying that they're already working on it or not. So my guess is that AMD won't add a competing ISA that's 256-bit. But will wait on AVX512 for as long as economically possible - which could be quite a few generations from now given that AMD now has Intel on its heels. Last fiddled with by Mysticial on 2019-10-09 at 21:48 |
![]() |
![]() |