mersenneforum.org

Early Beta of version 24.11 (https://www.mersenneforum.org/showthread.php?t=3934)

TheJudger 2005-04-01 21:27

[QUOTE=Prime95]I'm sure the binutils guys will make objcopy work eventually if they haven't done so already. The source is available if someone wants to try a 64-bit linux port.
[/QUOTE]
So if binutils provides support for MASM -> ELF64, then you would give it a try? :)

[QUOTE=Prime95]My next task is more optimizations, especially making use of the extra SSE2 registers in 64-bit mode. Don't expect much - a few percent on AMD64, perhaps a little more on the P4.[/QUOTE]

Optimizations for LL tests and/or factoring?

Peter Nelson 2005-04-01 23:24

From what I have read, I believe the latest version of objcopy in binutils DOES have the ability to convert to ELF64 format.

Someone could try to do this from the source provided above.

Obviously this would not include the security module for primenet comms, but if successful, it would not be too much work for George to make an official build using the same process.

Prime95 2005-04-02 01:31

[QUOTE=Peter Nelson]
d) please would it be possible to include a short test of trial factoring speed in the benchmark (of the release version) because this would be very useful to know and compare the benefit of your optimisations.[/QUOTE]

Done. Look for it the next time I upload an executable.

rainchill 2005-04-02 01:34

Hi, I have an HP laptop with an Athlon 64 3000+ and 1 GB of RAM. Here are my benchmark results with 23.8:
Prime95 version 23.8, RdtscTiming=1
Best time for 384K FFT length: 26.055 ms.
Best time for 448K FFT length: 31.393 ms.
Best time for 512K FFT length: 35.744 ms.
Best time for 640K FFT length: 45.166 ms.
Best time for 768K FFT length: 54.970 ms.
Best time for 896K FFT length: 66.214 ms.
Best time for 1024K FFT length: 74.506 ms.
Best time for 1280K FFT length: 100.525 ms.
Best time for 1536K FFT length: 122.588 ms.
Best time for 1792K FFT length: 147.037 ms.
Best time for 2048K FFT length: 165.960 ms.

And here are my results with 24.11
Prime95 version 24.11, RdtscTiming=1
Best time for 512K FFT length: 32.064 ms.
Best time for 640K FFT length: 38.795 ms.
Best time for 768K FFT length: 46.873 ms.
Best time for 896K FFT length: 56.537 ms.
Best time for 1024K FFT length: 63.192 ms.
Best time for 1280K FFT length: 82.394 ms.
Best time for 1536K FFT length: 102.145 ms.
Best time for 1792K FFT length: 122.718 ms.
Best time for 2048K FFT length: 136.854 ms.

So it seems like a good little speed bump.
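
For anyone who wants to put a number on that speed bump, here is a minimal Python sketch that compares the two runs above for the FFT lengths that appear in both (384K and 448K were only timed with 23.8). The names `v23_8`, `v24_11` and `percent_less_time` are made up for this example; the timings are copied from the two benchmark listings.

```python
# Best times in ms per FFT length (in K), copied from the two runs above.
# Only the lengths present in both runs are listed.
v23_8 = {
    512: 35.744, 640: 45.166, 768: 54.970, 896: 66.214, 1024: 74.506,
    1280: 100.525, 1536: 122.588, 1792: 147.037, 2048: 165.960,
}
v24_11 = {
    512: 32.064, 640: 38.795, 768: 46.873, 896: 56.537, 1024: 63.192,
    1280: 82.394, 1536: 102.145, 1792: 122.718, 2048: 136.854,
}


def percent_less_time(old_ms, new_ms):
    """Reduction in time per iteration, as a percentage of the old time."""
    return (old_ms - new_ms) / old_ms * 100.0


for fft_k in sorted(v24_11):
    gain = percent_less_time(v23_8[fft_k], v24_11[fft_k])
    print(f"{fft_k:5d}K: {v23_8[fft_k]:8.3f} ms -> {v24_11[fft_k]:8.3f} ms "
          f"({gain:.1f}% less time)")
```

On these figures 24.11 needs roughly 10% (512K) to 18% (1280K) less time per iteration than 23.8.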

Prime95 2005-04-02 19:23

New 24.11 executables are available. From whatsnew.txt:

5) Added timeouts to PrimeNet communications in hopes of avoiding rare hangs
when contacting the PrimeNet server.
6) Fixed rare bug where P-1's GCD could miss a factor.
7) Added trial factoring to the benchmark.
8) Fixed bug in ECM when using zero-padded FFTs.

TheJudger 2005-04-02 22:59

9) crashes immediately on CPUs supporting sse2 when "advance/time" exponent is 78000000 :(

Happens with the Windows client as well as the Linux client...
Timing exponents up to 79.xxxM works fine in older versions.

With "CPUSupportsSSE2=0" in local.ini, timing exponents up to 79.xxxM works fine.

[code]
Your choice: 8

Exponent to time (10000000): 78000000
Number of Iterations (10):

Accept the answers above? (Y):
Floating point exception
[/code]

Xyzzy 2005-04-04 00:40

[QUOTE=TheJudger]9) crashes immediately on CPUs supporting sse2 when "advance/time" exponent is 78000000 :([/QUOTE]
[url]http://www.mersenneforum.org/showpost.php?p=52142&postcount=4[/url]

TheJudger 2005-04-04 08:51

xyzzy: I know... but older versions (23.5, 23.9) automatically switch to x87 code when leaving the SSE2 range... 24.11 just crashes...

Dresdenboy 2005-04-04 20:15

I posted many Prime95 results (incl. 64-bit TF numbers vs. 32-bit) in the [URL=http://mersenneforum.org/showthread.php?p=52693#post52693]benchmark thread[/URL].

Dresdenboy 2005-04-05 12:01

Trial factoring speedup in 64-bit mode:
[code] 24.11     24.11
32 bit    64 bit
  (ms)      (ms)   Speedup
 6,010     3,786     58,7%
 6,032     3,782     59,5%
 6,003     4,032     48,9%
 6,029     4,140     45,6%
10,961     4,843    126,3%
10,963     5,643     94,3%
13,932     6,816    104,4%
13,836     8,007     72,8%
13,849     7,951     74,2%
13,831     7,934     74,3%[/code]
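
The Speedup column appears to be computed as (32-bit time / 64-bit time) - 1, with decimal commas in the timings. That is an assumption rather than something stated in the post, but the short Python sketch below reproduces all ten posted percentages under it. `parse_ms` and `speedup_percent` are hypothetical helper names.

```python
# (32-bit ms, 64-bit ms, posted speedup %) copied from the table above,
# keeping the decimal commas exactly as posted.
rows = [
    ("6,010", "3,786", "58,7"), ("6,032", "3,782", "59,5"),
    ("6,003", "4,032", "48,9"), ("6,029", "4,140", "45,6"),
    ("10,961", "4,843", "126,3"), ("10,963", "5,643", "94,3"),
    ("13,932", "6,816", "104,4"), ("13,836", "8,007", "72,8"),
    ("13,849", "7,951", "74,2"), ("13,831", "7,934", "74,3"),
]


def parse_ms(text):
    """Read a number written with a decimal comma (assumed convention)."""
    return float(text.replace(",", "."))


def speedup_percent(t32_ms, t64_ms):
    """Assumed formula: how much more work 64-bit does in the same time."""
    return (t32_ms / t64_ms - 1.0) * 100.0


for t32, t64, posted in rows:
    computed = round(speedup_percent(parse_ms(t32), parse_ms(t64)), 1)
    print(f"{t32:>6} ms vs {t64:>5} ms: computed {computed:5.1f}%, "
          f"posted {parse_ms(posted):5.1f}%")
```

Read this way, the 64-bit build gets through roughly 1.5x to 2.3x as much trial factoring per unit of time as the 32-bit build in these runs.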

ET_ 2005-04-05 18:17

[QUOTE=Dresdenboy]Trial factoring speedup in 64-bit mode:
[code] 24.11     24.11
32 bit    64 bit
  (ms)      (ms)   Speedup
 6,010     3,786     58,7%
 6,032     3,782     59,5%
 6,003     4,032     48,9%
 6,029     4,140     45,6%
10,961     4,843    126,3%
10,963     5,643     94,3%
13,932     6,816    104,4%
13,836     8,007     72,8%
13,849     7,951     74,2%
13,831     7,934     74,3%[/code][/QUOTE]

cOOOOl! :showoff:

Luigi

