mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware

Reply
 
Thread Tools
Old 2007-10-17, 10:53   #298
Dresdenboy
 
Dresdenboy's Avatar
 
Apr 2003
Berlin, Germany

36110 Posts
Default

Here is a first Barcelona result:

Code:
Quad-Core AMD Opteron(tm) Processor 2347
CPU speed: 1909.87 MHz, 8 cores
CPU features: RDTSC, CMOV, Prefetch, 3DNow!, MMX, SSE, SSE2
L1 cache size: 64 KB
L2 cache size: 512 KB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
L1 TLBS: 48
L2 TLBS: 512
Prime95 32-bit version 25.5, RdtscTiming=1
Best time for 768K FFT length: 27.935 ms.
Best time for 896K FFT length: 33.206 ms.
Best time for 1024K FFT length: 37.916 ms.
Best time for 1280K FFT length: 46.703 ms.
Best time for 1536K FFT length: 58.223 ms.
Best time for 1792K FFT length: 69.253 ms.
Best time for 2048K FFT length: 78.543 ms.
Best time for 2560K FFT length: 103.744 ms.
Best time for 3072K FFT length: 126.878 ms.
Best time for 3584K FFT length: 148.956 ms.
Best time for 4096K FFT length: 170.245 ms.
Best time for 5120K FFT length: 219.297 ms.
Best time for 6144K FFT length: 267.964 ms.
Best time for 7168K FFT length: 324.853 ms.
Best time for 8192K FFT length: 370.556 ms.
Timing FFTs using 2 threads.
Best time for 768K FFT length: 16.163 ms.
Best time for 896K FFT length: 23.021 ms.
Best time for 1024K FFT length: 26.109 ms.
Best time for 1280K FFT length: 31.032 ms.
Best time for 1536K FFT length: 37.103 ms.
Best time for 1792K FFT length: 43.269 ms.
Best time for 2048K FFT length: 49.260 ms.
Best time for 2560K FFT length: 66.193 ms.
Best time for 3072K FFT length: 79.358 ms.
Best time for 3584K FFT length: 92.148 ms.
Best time for 4096K FFT length: 105.078 ms.
Best time for 5120K FFT length: 130.060 ms.
Best time for 6144K FFT length: 161.614 ms.
Best time for 7168K FFT length: 195.442 ms.
Best time for 8192K FFT length: 221.939 ms.
Timing FFTs using 3 threads.
Best time for 768K FFT length: 13.809 ms.
Best time for 896K FFT length: 21.811 ms.
Best time for 1024K FFT length: 24.318 ms.
Best time for 1280K FFT length: 28.503 ms.
Best time for 1536K FFT length: 33.089 ms.
Best time for 1792K FFT length: 37.643 ms.
Best time for 2048K FFT length: 42.105 ms.
Best time for 2560K FFT length: 59.491 ms.
Best time for 3072K FFT length: 68.819 ms.
Best time for 3584K FFT length: 78.486 ms.
Best time for 4096K FFT length: 87.776 ms.
Best time for 5120K FFT length: 96.075 ms.
Best time for 6144K FFT length: 115.049 ms.
Best time for 7168K FFT length: 134.383 ms.
Best time for 8192K FFT length: 152.169 ms.
Timing FFTs using 4 threads.
Best time for 768K FFT length: 12.659 ms.
Best time for 896K FFT length: 20.397 ms.
Best time for 1024K FFT length: 22.734 ms.
Best time for 1280K FFT length: 26.702 ms.
Best time for 1536K FFT length: 30.707 ms.
Best time for 1792K FFT length: 34.792 ms.
Best time for 2048K FFT length: 38.922 ms.
Best time for 2560K FFT length: 55.094 ms.
Best time for 3072K FFT length: 63.479 ms.
Best time for 3584K FFT length: 72.257 ms.
Best time for 4096K FFT length: 80.775 ms.
Best time for 5120K FFT length: 87.718 ms.
Best time for 6144K FFT length: 104.854 ms.
Best time for 7168K FFT length: 121.390 ms.
Best time for 8192K FFT length: 136.810 ms.
Timing FFTs using 5 threads.
[Wed Oct 17 02:55:20 2007]
Best time for 768K FFT length: 12.210 ms.
Best time for 896K FFT length: 20.615 ms.
Best time for 1024K FFT length: 22.624 ms.
Best time for 1280K FFT length: 26.663 ms.
Best time for 1536K FFT length: 31.017 ms.
Best time for 1792K FFT length: 34.802 ms.
Best time for 2048K FFT length: 38.889 ms.
Best time for 2560K FFT length: 55.187 ms.
Best time for 3072K FFT length: 64.065 ms.
Best time for 3584K FFT length: 72.070 ms.
Best time for 4096K FFT length: 79.979 ms.
Best time for 5120K FFT length: 87.083 ms.
Best time for 6144K FFT length: 103.475 ms.
Best time for 7168K FFT length: 120.259 ms.
Best time for 8192K FFT length: 135.915 ms.
Timing FFTs using 6 threads.
Best time for 768K FFT length: 11.827 ms.
Best time for 896K FFT length: 20.350 ms.
Best time for 1024K FFT length: 22.470 ms.
Best time for 1280K FFT length: 26.510 ms.
Best time for 1536K FFT length: 30.388 ms.
Best time for 1792K FFT length: 34.276 ms.
Best time for 2048K FFT length: 38.393 ms.
Best time for 2560K FFT length: 55.001 ms.
Best time for 3072K FFT length: 62.357 ms.
Best time for 3584K FFT length: 70.534 ms.
Best time for 4096K FFT length: 78.571 ms.
Best time for 5120K FFT length: 85.249 ms.
Best time for 6144K FFT length: 101.560 ms.
Best time for 7168K FFT length: 117.729 ms.
Best time for 8192K FFT length: 132.915 ms.
Timing FFTs using 7 threads.
Best time for 768K FFT length: 11.377 ms.
Best time for 896K FFT length: 20.030 ms.
Best time for 1024K FFT length: 22.347 ms.
Best time for 1280K FFT length: 26.278 ms.
Best time for 1536K FFT length: 30.107 ms.
Best time for 1792K FFT length: 33.871 ms.
Best time for 2048K FFT length: 37.850 ms.
Best time for 2560K FFT length: 53.885 ms.
Best time for 3072K FFT length: 61.956 ms.
Best time for 3584K FFT length: 70.095 ms.
Best time for 4096K FFT length: 77.744 ms.
Best time for 5120K FFT length: 84.498 ms.
Best time for 6144K FFT length: 100.272 ms.
Best time for 7168K FFT length: 116.577 ms.
Best time for 8192K FFT length: 131.545 ms.
Timing FFTs using 8 threads.
Best time for 768K FFT length: 11.269 ms.
Best time for 896K FFT length: 19.927 ms.
Best time for 1024K FFT length: 22.092 ms.
Best time for 1280K FFT length: 25.985 ms.
Best time for 1536K FFT length: 29.836 ms.
Best time for 1792K FFT length: 33.735 ms.
Best time for 2048K FFT length: 37.738 ms.
Best time for 2560K FFT length: 53.270 ms.
Best time for 3072K FFT length: 61.192 ms.
Best time for 3584K FFT length: 68.896 ms.
Best time for 4096K FFT length: 77.015 ms.
Best time for 5120K FFT length: 83.931 ms.
Best time for 6144K FFT length: 99.895 ms.
Best time for 7168K FFT length: 115.865 ms.
Best time for 8192K FFT length: 130.881 ms.
Best time for 58 bit trial factors: 6.102 ms.
Best time for 59 bit trial factors: 6.116 ms.
Best time for 60 bit trial factors: 6.104 ms.
Best time for 61 bit trial factors: 6.127 ms.
Best time for 62 bit trial factors: 11.402 ms.
Best time for 63 bit trial factors: 11.376 ms.
Best time for 64 bit trial factors: 11.220 ms.
Best time for 65 bit trial factors: 11.166 ms.
Best time for 66 bit trial factors: 11.154 ms.
Best time for 67 bit trial factors: 11.135 ms.
Many thanks to "linhvndiy" @ XtremeSystems. Original posting is here:
http://www.xtremesystems.org/forums/...postcount=1102

The 2 CPUs run at 1.9 GHz on a Supermicro board with memory controller at 1.6 GHz. Since there were 2 CPUs, memory latency is higher than in an 1S system. The single threaded times seem to be a bit high, although clearly lower than my X2@2 GHz. OTOH the multithreaded tests shows a different behaviour than on C2Q, very likely because of the different memory subsystem and the shared L3 cache.
Dresdenboy is offline   Reply With Quote
Old 2007-10-17, 19:59   #299
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

11×101 Posts
Default

Quote:
Originally Posted by Dresdenboy View Post
The 2 CPUs run at 1.9 GHz on a Supermicro board with memory controller at 1.6 GHz. Since there were 2 CPUs, memory latency is higher than in an 1S system. The single threaded times seem to be a bit high, although clearly lower than my X2@2 GHz. OTOH the multithreaded tests shows a different behaviour than on C2Q, very likely because of the different memory subsystem and the shared L3 cache.
Yepp, single-thread performance is too low...
I get ~32ms/iteration (1024k FFT) with one instance (24.14) and ~34ms/iteration when running 8 instances (24.14) on a Barcy 2347.
TheJudger is offline   Reply With Quote
Old 2007-10-18, 07:47   #300
Dresdenboy
 
Dresdenboy's Avatar
 
Apr 2003
Berlin, Germany

192 Posts
Default

Quote:
Originally Posted by TheJudger View Post
Yepp, single-thread performance is too low...
I get ~32ms/iteration (1024k FFT) with one instance (24.14) and ~34ms/iteration when running 8 instances (24.14) on a Barcy 2347.
It seems, either the other two 2347s are somewhat inefficient by bad configuration (BIOS programs the MSRs with less efficient values) or the higher mem latency thanks to the 2S system really cost that much.

Your ~32 ms/iteration (although with an older version) brings the per clock throughput much closer to C2Q and lets the 1.9 GHz Barcelona iterate as fast as a 3.15 GHz K8 using 1 thread.

Can you please post a full benchmark run of 25.5a?

BTW, someone on XS posted an even faster K8 result:
Code:
Here are my Results with a Athlon 64 X2 6400 @ 3500 MHz and CellShock DDR2 1000 @ 4-4-4-12

[Wed Oct 17 18:32:13 2007]
Compare your results to other computers at http://www.mersenne.org/bench.htm
AMD Athlon(tm) 64 X2 Dual Core Processor 6400+
CPU speed: 3499.87 MHz, 2 cores
CPU features: RDTSC, CMOV, Prefetch, 3DNow!, MMX, SSE, SSE2
L1 cache size: 64 KB
L2 cache size: 1024 KB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
L1 TLBS: 32
L2 TLBS: 512
Prime95 32-bit version 25.5, RdtscTiming=1
Best time for 768K FFT length: 21.030 ms.
Best time for 896K FFT length: 25.262 ms.
Best time for 1024K FFT length: 27.967 ms.
Best time for 1280K FFT length: 35.763 ms.
Best time for 1536K FFT length: 43.718 ms.
Best time for 1792K FFT length: 52.707 ms.
Best time for 2048K FFT length: 58.854 ms.
Best time for 2560K FFT length: 77.593 ms.
Best time for 3072K FFT length: 94.625 ms.
Best time for 3584K FFT length: 114.018 ms.
Best time for 4096K FFT length: 127.410 ms.
Best time for 5120K FFT length: 165.333 ms.
Best time for 6144K FFT length: 201.588 ms.
Best time for 7168K FFT length: 244.321 ms.
Best time for 8192K FFT length: 278.076 ms.
Timing FFTs using 2 threads.
Best time for 768K FFT length: 12.993 ms.
Best time for 896K FFT length: 15.601 ms.
Best time for 1024K FFT length: 17.517 ms.
Best time for 1280K FFT length: 23.488 ms.
Best time for 1536K FFT length: 28.120 ms.
Best time for 1792K FFT length: 33.641 ms.
Best time for 2048K FFT length: 37.514 ms.
Best time for 2560K FFT length: 50.270 ms.
Best time for 3072K FFT length: 60.128 ms.
Best time for 3584K FFT length: 71.829 ms.
Best time for 4096K FFT length: 79.863 ms.
Best time for 5120K FFT length: 89.756 ms.
Best time for 6144K FFT length: 111.599 ms.
Best time for 7168K FFT length: 139.406 ms.
Best time for 8192K FFT length: 167.504 ms.
Best time for 58 bit trial factors: 3.403 ms.
Best time for 59 bit trial factors: 3.436 ms.
Best time for 60 bit trial factors: 3.420 ms.
Best time for 61 bit trial factors: 3.436 ms.
Best time for 62 bit trial factors: 6.236 ms.
Best time for 63 bit trial factors: 6.233 ms.
Best time for 64 bit trial factors: 7.895 ms.
Best time for 65 bit trial factors: 7.840 ms.
Best time for 66 bit trial factors: 7.858 ms.
Best time for 67 bit trial factors: 7.845 ms.
Dresdenboy is offline   Reply With Quote
Old 2007-10-18, 19:07   #301
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

11·101 Posts
Default

Quote:
Originally Posted by Dresdenboy View Post
Can you please post a full benchmark run of 25.5a?
if I remember this at the right time ;)
The scaling over the number of threads in 25.x wasn't good.
TheJudger is offline   Reply With Quote
Old 2007-10-25, 05:56   #302
Kevin
 
Kevin's Avatar
 
Aug 2002
Ann Arbor, MI

6618 Posts
Default

Q6600 overclocked from 2.4 to a little under 3.0 . Memory is 2 gigs of corsair xms PC8500 running at 1000mhz.

First without other instances running.

Intel(R) Core(TM)2 Quad CPU Q6600 @ 2.40GHz
CPU speed: 2996.86 MHz
CPU features: RDTSC, CMOV, Prefetch, MMX, SSE, SSE2
L1 cache size: 32 KB
L2 cache size: unknown
L1 cache line size: 64 bytes
L2 cache line size: unknown
Prime95 32-bit version 24.14, RdtscTiming=1
Best time for 512K FFT length: 8.485 ms.
Best time for 640K FFT length: 11.557 ms.
Best time for 768K FFT length: 14.257 ms.
Best time for 896K FFT length: 17.042 ms.
Best time for 1024K FFT length: 18.841 ms.
Best time for 1280K FFT length: 24.027 ms.
Best time for 1536K FFT length: 29.308 ms.
Best time for 1792K FFT length: 34.820 ms.
Best time for 2048K FFT length: 38.678 ms.
Best time for 2560K FFT length: 51.084 ms.
Best time for 3072K FFT length: 62.210 ms.
Best time for 3584K FFT length: 75.166 ms.
Best time for 4096K FFT length: 84.369 ms.
Best time for 58 bit trial factors: 3.720 ms.
Best time for 59 bit trial factors: 3.756 ms.
Best time for 60 bit trial factors: 3.741 ms.
Best time for 61 bit trial factors: 3.731 ms.
Best time for 62 bit trial factors: 5.950 ms.
Best time for 63 bit trial factors: 5.972 ms.
Best time for 64 bit trial factors: 5.474 ms.
Best time for 65 bit trial factors: 5.436 ms.
Best time for 66 bit trial factors: 5.438 ms.
Best time for 67 bit trial factors: 5.416 ms.

And then with the three other cores running.

Intel(R) Core(TM)2 Quad CPU Q6600 @ 2.40GHz
CPU speed: 2996.52 MHz
CPU features: RDTSC, CMOV, Prefetch, MMX, SSE, SSE2
L1 cache size: 32 KB
L2 cache size: unknown
L1 cache line size: 64 bytes
L2 cache line size: unknown
Prime95 32-bit version 24.14, RdtscTiming=1
Best time for 512K FFT length: 11.077 ms.
Best time for 640K FFT length: 15.283 ms.
Best time for 768K FFT length: 18.713 ms.
Best time for 896K FFT length: 22.324 ms.
Best time for 1024K FFT length: 25.093 ms.
Best time for 1280K FFT length: 31.123 ms.
Best time for 1536K FFT length: 37.983 ms.
Best time for 1792K FFT length: 45.044 ms.
Best time for 2048K FFT length: 50.218 ms.
Best time for 2560K FFT length: 67.627 ms.
Best time for 3072K FFT length: 82.085 ms.
Best time for 3584K FFT length: 98.875 ms.
Best time for 4096K FFT length: 110.707 ms.
Best time for 58 bit trial factors: 3.836 ms.
Best time for 59 bit trial factors: 3.842 ms.
Best time for 60 bit trial factors: 3.842 ms.
Best time for 61 bit trial factors: 3.805 ms.
Best time for 62 bit trial factors: 6.016 ms.
Best time for 63 bit trial factors: 6.016 ms.
Best time for 64 bit trial factors: 5.522 ms.
Best time for 65 bit trial factors: 5.503 ms.
Best time for 66 bit trial factors: 5.482 ms.
Best time for 67 bit trial factors: 5.443 ms.
Kevin is offline   Reply With Quote
Old 2007-12-05, 23:52   #303
db597
 
db597's Avatar
 
Jan 2003

7·29 Posts
Default

Anyone have benchmarks for the AMD Phenom? Very curious to see how running 4 copies of Prime95 affects the native quad core. Intel's "fake" quad core only hits 2.5-3x, depending on the memory used (bottleneck). On paper the Phenom should scale better.
db597 is offline   Reply With Quote
Old 2008-01-06, 16:13   #304
marc81
 

13×97 Posts
Default

Code:
AMD Athlon(tm) 64 Processor 3500+
CPU speed: 2202.56 MHz
CPU features: RDTSC, CMOV, Prefetch, 3DNow!, MMX, SSE, SSE2
L1 cache size: 64 KB
L2 cache size: 512 KB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
L1 TLBS: 32
L2 TLBS: 512
Prime95 32-bit version 24.14, RdtscTiming=1
Best time for 512K FFT length: 23.178 ms.
Best time for 640K FFT length: 30.124 ms.
Best time for 768K FFT length: 36.717 ms.
Best time for 896K FFT length: 43.781 ms.
Best time for 1024K FFT length: 49.206 ms.
Best time for 1280K FFT length: 61.889 ms.
Best time for 1536K FFT length: 75.457 ms.
Best time for 1792K FFT length: 91.583 ms.
Best time for 2048K FFT length: 102.957 ms.
Best time for 2560K FFT length: 141.389 ms.
Best time for 3072K FFT length: 173.051 ms.
Best time for 3584K FFT length: 208.017 ms.
  Reply With Quote
Old 2008-04-09, 19:31   #305
Nelson
 
Nelson's Avatar
 
Apr 2008
Regensburg..^~^..Plzeƈ

5·17 Posts
Default My secondary heater

Have here Pentium 4 3.0 GHz (Prescott) Hyper-threading
Mainboard ASUS P4C800 deluxe
Memory Corsair XMS3200C2 2X512 using Dual Channel
1 Hitachi 160 GB hard drive and
1 Seagate 1.5GB hard drive used for swapfile
CPU Cooler Cooler Master Hyper6+
@3345.62MHz

Following Most recent benchmark copied from result test of Prime95 v. 25.6

Intel(R) Pentium(R) 4 CPU 3.00GHz
CPU speed: 2998.64 MHz, with hyperthreading {3345.62}
CPU features: RDTSC, CMOV, Prefetch, MMX, SSE, SSE2
L1 cache size: 16 KB
L2 cache size: 1024 KB
L1 cache line size: 64 bytes
L2 cache line size: 128 bytes
TLBS: 64
Prime95 32-bit version 25.6, RdtscTiming=1
Best time for 768K FFT length: 25.895 ms.
Best time for 896K FFT length: 31.571 ms.
Best time for 1024K FFT length: 35.646 ms.
Best time for 1280K FFT length: 44.122 ms.
Best time for 1536K FFT length: 53.281 ms.
Best time for 1792K FFT length: 64.420 ms.
Best time for 2048K FFT length: 71.589 ms.
Best time for 2560K FFT length: 94.074 ms.
Best time for 3072K FFT length: 114.453 ms.
Best time for 3584K FFT length: 137.688 ms.
Best time for 4096K FFT length: 153.834 ms.
Best time for 5120K FFT length: 195.064 ms.
Best time for 6144K FFT length: 245.903 ms.
Best time for 7168K FFT length: 296.296 ms.
Best time for 8192K FFT length: 325.289 ms.
Timing FFTs using 2 threads on 1 physical CPUs.
Best time for 768K FFT length: 24.290 ms.
Best time for 896K FFT length: 29.437 ms.
Best time for 1024K FFT length: 32.209 ms.
Best time for 1280K FFT length: 41.926 ms.
Best time for 1536K FFT length: 50.414 ms.
Best time for 1792K FFT length: 61.121 ms.
Best time for 2048K FFT length: 66.994 ms.
Best time for 2560K FFT length: 88.222 ms.
Best time for 3072K FFT length: 107.420 ms.
Best time for 3584K FFT length: 131.018 ms.
Best time for 4096K FFT length: 144.655 ms.
Best time for 5120K FFT length: 189.701 ms.
Best time for 6144K FFT length: 230.644 ms.
Best time for 7168K FFT length: 279.896 ms.
Best time for 8192K FFT length: 308.665 ms.
Best time for 58 bit trial factors: 9.297 ms.
Best time for 59 bit trial factors: 9.279 ms.
Best time for 60 bit trial factors: 9.431 ms.
Best time for 61 bit trial factors: 9.313 ms.
Best time for 62 bit trial factors: 13.129 ms.
Best time for 63 bit trial factors: 13.079 ms.
Best time for 64 bit trial factors: 15.500 ms.
Best time for 65 bit trial factors: 15.125 ms.
Best time for 66 bit trial factors: 15.723 ms.
Best time for 67 bit trial factors: 15.332 ms.

I think it is interesting to note that Hyper threading seems to have a greater effect when the FFT sizes go outside the L2 Cache Range. Over the range of a 40G LLT the time gained amounts to about 1 hour.

So, that about does it!

nelson
Nelson is offline   Reply With Quote
Old 2008-04-11, 10:21   #306
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

22×839 Posts
Default

Quote:
Originally Posted by db597 View Post
Anyone have benchmarks for the AMD Phenom? Very curious to see how running 4 copies of Prime95 affects the native quad core. Intel's "fake" quad core only hits 2.5-3x, depending on the memory used (bottleneck). On paper the Phenom should scale better.
I guess I could've posted this a while back, sorry:
Code:
AMD Phenom(tm) 9500 Quad-Core Processor
CPU speed: 2199.69 MHz, 4 cores
CPU features: RDTSC, CMOV, Prefetch, 3DNow!, MMX, SSE, SSE2
L1 cache size: 64 KB
L2 cache size: 512 KB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
L1 TLBS: 48
L2 TLBS: 512
Prime95 32-bit version 25.5, RdtscTiming=1
Best time for 768K FFT length: 41.804 ms.
Best time for 896K FFT length: 49.864 ms.
Best time for 1024K FFT length: 55.613 ms.
Best time for 1280K FFT length: 69.719 ms.
Best time for 1536K FFT length: 85.844 ms.
Best time for 1792K FFT length: 102.716 ms.
Best time for 2048K FFT length: 116.113 ms.
Best time for 2560K FFT length: 156.770 ms.
Best time for 3072K FFT length: 194.179 ms.
Best time for 3584K FFT length: 231.726 ms.
Best time for 4096K FFT length: 261.339 ms.
Best time for 5120K FFT length: 335.159 ms.
Best time for 6144K FFT length: 409.656 ms.
Best time for 7168K FFT length: 498.341 ms.
Best time for 8192K FFT length: 569.639 ms.
Timing FFTs using 2 threads.
Best time for 768K FFT length: 27.307 ms.
Best time for 896K FFT length: 35.729 ms.
Best time for 1024K FFT length: 39.905 ms.
Best time for 1280K FFT length: 48.903 ms.
Best time for 1536K FFT length: 59.772 ms.
Best time for 1792K FFT length: 69.921 ms.
Best time for 2048K FFT length: 79.809 ms.
Best time for 2560K FFT length: 105.899 ms.
Best time for 3072K FFT length: 128.341 ms.
Best time for 3584K FFT length: 150.262 ms.
Best time for 4096K FFT length: 171.352 ms.
Best time for 5120K FFT length: 214.281 ms.
Best time for 6144K FFT length: 265.299 ms.
Best time for 7168K FFT length: 327.061 ms.
Best time for 8192K FFT length: 361.771 ms.
Timing FFTs using 3 threads.
Best time for 768K FFT length: 22.397 ms.
Best time for 896K FFT length: 33.972 ms.
Best time for 1024K FFT length: 37.596 ms.
Best time for 1280K FFT length: 44.736 ms.
Best time for 1536K FFT length: 56.757 ms.
Best time for 1792K FFT length: 65.936 ms.
Best time for 2048K FFT length: 66.658 ms.
Best time for 2560K FFT length: 94.593 ms.
Best time for 3072K FFT length: 109.093 ms.
Best time for 3584K FFT length: 125.415 ms.
Best time for 4096K FFT length: 140.467 ms.
Best time for 5120K FFT length: 158.095 ms.
Best time for 6144K FFT length: 194.164 ms.
Best time for 7168K FFT length: 235.368 ms.
Best time for 8192K FFT length: 281.969 ms.
Timing FFTs using 4 threads.
Best time for 768K FFT length: 20.817 ms.
Best time for 896K FFT length: 31.743 ms.
Best time for 1024K FFT length: 35.078 ms.
Best time for 1280K FFT length: 41.724 ms.
Best time for 1536K FFT length: 48.570 ms.
Best time for 1792K FFT length: 55.979 ms.
Best time for 2048K FFT length: 62.253 ms.
Best time for 2560K FFT length: 86.482 ms.
Best time for 3072K FFT length: 100.766 ms.
Best time for 3584K FFT length: 116.563 ms.
Best time for 4096K FFT length: 135.655 ms.
Best time for 5120K FFT length: 150.350 ms.
Best time for 6144K FFT length: 183.207 ms.
Best time for 7168K FFT length: 219.970 ms.
Best time for 8192K FFT length: 238.696 ms.
Best time for 58 bit trial factors: 10.626 ms.
Best time for 59 bit trial factors: 10.611 ms.
Best time for 60 bit trial factors: 10.601 ms.
Best time for 61 bit trial factors: 10.617 ms.
Best time for 62 bit trial factors: 19.791 ms.
Best time for 63 bit trial factors: 19.774 ms.
Best time for 64 bit trial factors: 19.485 ms.
Best time for 65 bit trial factors: 19.407 ms.
Best time for 66 bit trial factors: 19.349 ms.
Best time for 67 bit trial factors: 19.287 ms.
The numbers in the attachment are for a Phenom 9500 @ 2200MHz and a Core2 Q6600 @ 3492MHz, but the benchmark numbers for both have been normalized to 1000MHz equivalent for comparison. According to these numbers, the Phenom scales approximately equally well at all FFT sizes, whereas the Core2 doesn't scale well at all at small FFT sizes (in fact, 3 and 4 threads perform worse than 2 threads). You can see the Core2 walks all over the Phenom (88% faster at 768K, around 150% faster at 3072K and 8192K). The Core2 scales very well to 2 cores (about 90% efficient), but Core2 with 3-4 cores and Phenom with 2-4 cores are around 50-65% efficient.
On the TF side, the Core2 again dominates, showing numbers 117% to 159% faster than the Phenom.

Admittedly these are numbers from a TLB-bug-affected Phenom9500, so numbers may look different with the new B3 versions just appearing now. When I see a system with a 9x50 Phenom I'll see if I can grab some benchmark numbers.

Phenom9500 was running Prime95 25.5 on Windows XP Pro 32-bit.
Core2 Q6600 was running Prime95 25.6 on Windows Vista Premium 32-bit.
Attached Thumbnails
Click image for larger version

Name:	P9500vQ6600.gif
Views:	149
Size:	22.5 KB
ID:	2391  
James Heinrich is offline   Reply With Quote
Old 2008-05-02, 23:33   #307
WOB1010
 

2×34×5×11 Posts
Smile xeon E3110

Click image for larger version

Name:	bench.JPG
Views:	177
Size:	121.1 KB
ID:	2442

My new build.
  Reply With Quote
Old 2008-05-06, 17:51   #308
lycorn
 
lycorn's Avatar
 
Sep 2002
Oeiras, Portugal

2·36 Posts
Default

Couple of questions:

1. What frequency were you running the CPU at?
2. DDR2 or DDR3?
3. Mobo used?

The figures look nice...
lycorn is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Perpetual "interesting video" thread... Xyzzy Lounge 39 2021-03-12 14:19
LLR benchmark thread Oddball Riesel Prime Search 5 2010-08-02 00:11
Perpetual I'm pi**ed off thread rogue Soap Box 19 2009-10-28 19:17
Perpetual autostereogram thread... Xyzzy Lounge 10 2006-09-28 00:36
Perpetual ECM factoring challenge thread... Xyzzy Factoring 65 2005-09-05 08:16

All times are UTC. The time now is 02:43.

Sun May 9 02:43:31 UTC 2021 up 30 days, 21:24, 0 users, load averages: 1.54, 1.58, 1.55

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.