mersenneforum.org  

Go Back   mersenneforum.org > Factoring Projects > Msieve

Reply
 
Thread Tools
Old 2020-08-18, 13:15   #23
bsquared
 
bsquared's Avatar
 
"Ben"
Feb 2007

2×17×97 Posts
Default

Quote:
Originally Posted by frmky View Post
I know this is late, but if you still have this data set up, try
mpirun -np 2 msieve -nc2 1,2 -v -t 20
After 1% elasped, the ETA is:

Code:
-np 2 1x2 -t 20: 3 hrs 9 min
-np 4 1x4 -t 10: 2 hrs 48 min
-np 5 1x5 -t 8: 3 hrs 49 min
-np 8 1x8 -t 5: 2 hrs 50 min
The 1x5 time is not surprising as one of the processes is split across sockets. Of the others that split evenly, more processes with fewer threads each appear to be a bit better.
bsquared is offline   Reply With Quote
Old 2020-08-30, 09:03   #24
Xyzzy
 
Xyzzy's Avatar
 
"Mike"
Aug 2002

22×3×643 Posts
Default

Here are binaries for 64-bit Linux with various "VBITS" flags set.

Attached Files
File Type: gz msieve-1.54-VBITS64.x86_64.gz (324.4 KB, 15 views)
File Type: gz msieve-1.54-VBITS128.x86_64.gz (326.1 KB, 23 views)
File Type: gz msieve-1.54-VBITS256.x86_64.gz (334.2 KB, 26 views)
Xyzzy is offline   Reply With Quote
Old 2020-10-30, 10:53   #25
Xyzzy
 
Xyzzy's Avatar
 
"Mike"
Aug 2002

11110001001002 Posts
Default

CPU = i5-10600K
RAM = 2×8GB DDR4-3200
CMD = ./msieve -v -nc -t 6
LA = 21988s


Attached Files
File Type: gz msieve.log.gz (2.8 KB, 0 views)
Xyzzy is offline   Reply With Quote
Old 2020-10-30, 11:08   #26
Xyzzy
 
Xyzzy's Avatar
 
"Mike"
Aug 2002

1E2416 Posts
Default

Given that the 1920X and 3950X are pretty serious CPUs, does the result for the i5 seem abnormally fast?

CPU = 1920X
RAM = 4×16GB DDR4-2666
CMD = ./msieve -v -nc -t 24
LA = 7h 58m 53s

CPU = 3950X
RAM = 2×8GB DDR4-3666
CMD = ./msieve -v -nc -t 16
LA = 7h 33m 00s

CPU = i5-10600K
RAM = 2×8GB DDR4-3200
CMD = ./msieve -v -nc -t 6
LA = 6h 06m 28s
Xyzzy is offline   Reply With Quote
Old 2020-10-30, 12:38   #27
Gimarel
 
Apr 2010

9016 Posts
Default

Quote:
Originally Posted by Xyzzy View Post
CPU = 3950X
RAM = 2×8GB DDR4-3666
CMD = ./msieve -v -nc -t 16
LA = 7h 33m 00s
My timings for a AMD Ryzen 9 3950X, 2x32GB DDR4-3600:

-nc1: ~0h 43m 18s
-nc2: ~0h 5m 15s until the multithreaded LA starts


Timings for the multithreaded part:

-nc2: estimated 3h 24m msieve compiled with gcc-9.3
-nc2: estimated 3h 25m msieve compiled with gcc-10.0
-nc2: estimated 3h 22m msieve compiled with clang-9
-nc2: estimated 3h 24m msieve compiled with clang-10

Fastest total without -nc3: ~4h 21m

All runs with VBITS=256 and 32 threads. All other versions were slower.
I tried the objects for each compiler twice, to ensure that the clang-9 one is indeed the fastest.

Last fiddled with by Gimarel on 2020-10-30 at 12:39
Gimarel is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
PFGW benchmarking carpetpool Hardware 4 2019-09-30 20:06
Looking for benchmarking help with a Phenom or PhenomII X6 mrolle Software 25 2012-03-14 14:15
GMP 5.0.1 vs GMP 4.1.4 benchmarking unconnected GMP-ECM 5 2011-04-03 16:16
Benchmarking dual-CPU machines garo Software 2 2010-09-27 20:33
Benchmarking challenge! Xyzzy Software 17 2003-08-26 15:43

All times are UTC. The time now is 17:33.

Fri Oct 30 17:33:46 UTC 2020 up 50 days, 14:44, 2 users, load averages: 1.63, 1.84, 2.03

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2020, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.