![]() |
![]() |
#1 |
Sep 2014
3 Posts |
![]()
I am looking for ways to get maximum DRAM bandwidth/traffic. I tried to customize the torture test to run a larger FFT (8192-16384) and bumped up the 'memory to use' to about 80% of my RAM size but not had much success.
Any suggestions how I could increase the bandwidth ? |
![]() |
![]() |
![]() |
#2 |
"Kieren"
Jul 2011
In My Own Galaxy!
100111101011102 Posts |
![]()
Use quad channel DDR4.
I'm not meaning to be flippant, because it is true. I also know you were asking about tweaking what you have, not doing a total system change (unless you already have a beast that has such capability.) Last fiddled with by kladner on 2015-06-04 at 20:20 |
![]() |
![]() |
![]() |
#3 |
"/X\(‘-‘)/X\"
Jan 2013
309510 Posts |
![]()
And it appears there's no gain in bandwith with DDR3 over 2400 MHz in Haswell. Bandwidth actually drops: http://www.corsair.com/en/blog/2014/...ddr4_synthetic
I wasn't expecting that. |
![]() |
![]() |
![]() |
#4 | |
"Curtis"
Feb 2005
Riverside, CA
160216 Posts |
![]() Quote:
Please rephrase your question without using the word "bandwidth". |
|
![]() |
![]() |
![]() |
#5 | |
Sep 2014
3 Posts |
![]() Quote:
Using smaller FFT's would mean the data fits in the cache and a low DRAM access. I am looking for the opposite scenario - high DRAM access. I tried to increase the FFT size to miss in the cache and hit DRAM, which did help. I would like to find out if there is a "sweet spot" which would give me the highest DRAM read/write access. Last fiddled with by smartypants on 2015-06-04 at 21:45 |
|
![]() |
![]() |
![]() |
#6 | |
If I May
"Chris Halsall"
Sep 2002
Barbados
2B4216 Posts |
![]() Quote:
Oh... Imagine that. This has MemTest already been done. Last fiddled with by chalsall on 2015-06-04 at 22:19 |
|
![]() |
![]() |
![]() |
#7 |
P90 years forever!
Aug 2002
Yeehaw, FL
41×199 Posts |
![]()
The large FFT torture test (choose custom and then in-place) will max out your memory bandwidth if you run it on all cores.
There are benchmark programs that measure memory bandwidth (i think one is called AIDA64 is one). These max out memory bandwidth but do not check results along the way. Memtest will also max out mem bandwidth if you run multiple threads. No single-threaded program can max out memory bandwidth. Last I looked memtest on multiple threads was broken. |
![]() |
![]() |
![]() |
#8 | |
"Kieren"
Jul 2011
In My Own Galaxy!
2·3·1,693 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
#9 | |
May 2013
East. Always East.
11·157 Posts |
![]() Quote:
The higher the FFT, the more data needs to travel in and out of RAM. Lower FFT's don't reduce memory traffic; they require less. In this sense, small FFT's are "better" because they use up less of your bandwidth. Higher FFT's need more and if your system doesn't have enough, then the CPU has to constantly wait for your memory to spit out the data so you're losing efficiency there. We say that the system is "bottlenecked" by memory. Imagine if the CPU is a big factory with a bunch of trucks coming in delivering materials and leaving carrying toys. By increasing the power of your factory, you make more toys which means you have to have more trucks coming in. Eventually though, the road isn't big enough for all the trucks and your factory is constantly waiting for trucks to bring more stuff. What you're asking us is how to maximize the amount of trucks on your road. This isn't necessarily "better". If you want to stress test your memory and memory controller then run the stress test George suggested. |
|
![]() |
![]() |
![]() |
#10 |
"/X\(‘-‘)/X\"
Jan 2013
5·619 Posts |
![]() |
![]() |
![]() |
![]() |
#11 |
"Kieren"
Jul 2011
In My Own Galaxy!
2·3·1,693 Posts |
![]()
Oops. I skipped over the link. There are some interesting things there. It looks like that up to a point Ivy-E almost holds its own, but the higher clocks of DDR4 leave it in the dust (if you can afford 3 G or 3.2 GHz parts)
|
![]() |
![]() |
![]() |
Thread Tools | |
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Memory Bandwidth | Fred | Hardware | 12 | 2016-02-01 18:29 |
Optimal LL configuration | aurashift | Hardware | 11 | 2015-09-22 14:09 |
High Bandwidth Memory | tha | GPU Computing | 4 | 2015-07-31 00:21 |
P-1 memory bandwidth | TheMawn | Hardware | 1 | 2013-06-15 23:15 |
Parallel memory bandwidth | fivemack | Factoring | 14 | 2008-06-11 20:43 |