![]() |
![]() |
#1 |
P90 years forever!
Aug 2002
Yeehaw, FL
5·23·71 Posts |
![]()
I'm working on the code that selects which FFT implementation to run for a given FFT size. Kinda stuck.
V25 uses a hardwired selection process based on the L2+L3 cache size. Preliminary benchmarks on my Core 2 and Core i7 boxes indicate that this isn't optimal. I could add hardwired choices for Core i7 too, but as different cache configurations come out and next generation chips are released, even more hardwired choices might need to be added. So maybe a more dynamic system is needed, but that runs into other difficulties such as other programs might stop and start during a mini-benchmark to determine the best FFT implementation. Worse yet (and this applies to a hardwired choice), the best single-worker FFT implementation might not be the same as the best multi-worker FFT implementation. At this point, I'm leaning to an expanded hardwired approach. Although I'm willing to entertain interesting ideas. |
![]() |
![]() |
![]() |
#2 |
P90 years forever!
Aug 2002
Yeehaw, FL
11111111001012 Posts |
![]()
I could use a few more v26 benchmarks. My Core 2 machine is a Mac which does not give the most accurate timings. So a few Core 2 and i7 benchmarks would be useful. I'd also be curious as to AMD K10 timings (though I haven't worked on any AMD optimizations yet).
A benchmark will take a few hours. Please have little to nothing else running. Download (but don't overwrite your existing prime95!): ftp://mersenne.org/gimps/test_v26_32.zip ftp://mersenne.org/gimps/test_v26_64.zip Install v26 prime95 in a new directory. Tell prime95 you are a stress tester so that it does not contact the server. Add this to prime.txt: StressTester=1 MinBenchFFT=4 MaxBenchFFT=32768 OnlyBench5678=0 BenchAllComplex=1 AllBench=1 NumCPUs=1 Then run Options/Benchmark. Post your results.txt file in this thread. P.S. I really wouldn't recommend using this version for production work, though it will probably work. Last fiddled with by Prime95 on 2010-06-24 at 00:48 |
![]() |
![]() |
![]() |
#3 |
P90 years forever!
Aug 2002
Yeehaw, FL
5×23×71 Posts |
![]()
If you want to do some QA, then do this:
Add this to prime.txt: [qa] MAX_B=5 MIN_N=100000 MAX_N=10000000 MAX_K_BITS=20 MAX_C_BITS_FOR_SMALL_K=2 MAX_C_BITS_FOR_LARGE_K=2 Then do Advanced/Time. Enter 9920 as the exponent. Report any occurrences of the word "mismatch" in results.txt. For extra credit, report any instances where the round off error exceeds 0.3. |
![]() |
![]() |
![]() |
#4 |
Feb 2009
3·13 Posts |
![]()
I can't seem to get the 64-bit version to download. Does not give an error, just sits there saying "starting" but never starts. 32-bit version downloads right away.
[EDIT] 32-bit doesn't want to download either now. Last fiddled with by enderak on 2010-06-22 at 18:57 |
![]() |
![]() |
![]() |
#5 |
Just call me Henry
"David"
Sep 2007
Liverpool (GMT/BST)
603210 Posts |
![]()
How about before it starts testing each size for the first time it runs a benchmark on the available FFT algorithms that look likely to be useful.
|
![]() |
![]() |
![]() |
#6 |
Oct 2008
California
22·59 Posts |
![]()
Do you need any i5 benchmarks / would they help?
|
![]() |
![]() |
![]() |
#7 |
P90 years forever!
Aug 2002
Yeehaw, FL
1FE516 Posts |
![]() |
![]() |
![]() |
![]() |
#8 |
P90 years forever!
Aug 2002
Yeehaw, FL
11111111001012 Posts |
![]() |
![]() |
![]() |
![]() |
#9 |
"Oliver"
Mar 2005
Germany
100010110102 Posts |
![]()
Hello George,
why not having a basic hardwired table for those users which just want to run Prime95 and an option for advanced users to run some benchmarks to select "optimal" parameters for their machine? Oliver P.S. I can't test Windows binaries ![]() |
![]() |
![]() |
![]() |
#10 |
"Nathan"
Jul 2008
Maryland, USA
5·223 Posts |
![]()
Here is the results.txt file from 4.5 hours of QA on Betsy, a 3.06 GHz PIV. There are NO mismatches that I can see, nor did MaxErr ever get above 0.3 (in fact I think the highest value was around 0.17-0.18).
I am going to run further QA on this system, as well as on the Core2 systems I have borged. But it looks good on the P4, initially! |
![]() |
![]() |
![]() |
#11 |
May 2010
32·7 Posts |
![]()
6 hours of running QA on my overclocked i7-920 came up with no mismatches and a MaxErr of 0.28125 (close to your 0.3 threshold). It would have been a longer test, but I got MS updated with a reboot at 3 am.
One thing I noticed while running QA is that the total cpu loading was only in the 25-50% (mostly at 27%) which is far lower than I'm used to seeing with Prime95. System Spec: Core i7 920 @ 3.73 (overclocked). 6 GB DDR3 1424 MHz RAM in triple channel. No Hyperthreading qa results.zip had to zip it - txt file is 475KB Last fiddled with by Rhyled on 2010-06-23 at 15:14 |
![]() |
![]() |
![]() |
Thread Tools | |
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Some ideas regarding NFS... | paul0 | Factoring | 3 | 2015-03-14 19:55 |
two ideas for NPLB | TimSorbet | No Prime Left Behind | 16 | 2008-03-01 23:32 |
GROUP IDEAS | TTn | 15k Search | 15 | 2003-09-23 16:28 |
Domain name ideas... | Xyzzy | Lounge | 17 | 2003-03-24 16:20 |
Couple of ideas/things to do | Stormblade | Lounge | 12 | 2002-08-20 02:21 |