20120624, 21:45  #1 
Jun 2012
Boulder, CO
111111000_{2} Posts 
Running msieve polynomial selection in parallel?
Hi,
I'd like to maximize what I get out of factmsieve.py (i.e. Gilchrist's guide at http://gilchrist.ca/jeff/factoring/n...ers_guide.html) by running msieve's polynomial selection on several machines (510) in paralell. Currently it runs "msieve np", e.g. Code:
msieve s <savefile> l <logfile> i <input> np Code:
np [X,Y] perform only NFS polynomial selection; if specified, only cover leading coefficients in the range from X to Y inclusive Thanks! 
20120624, 22:05  #2 
(loop (#_fork))
Feb 2006
Cambridge, England
2×7×461 Posts 
I've mostly done parallel polynomial selection on a single large SMP machine, so I can use 'make'; the difficulty is that the runs for small K take longer than for large K, with the runtime being roughly proportional to K^(0.4) and really quite variable, so blocking it all in advance is a bit tricky and unlikely to give great results.
One option I've used is simply to start at K=10^6, where ranges of 1000 take consistently about an hour on a C163. So msieve v np 1000000,1024000 msieve v np 1024000,1048000 msieve v np 1048000,1072000 on three separate machines (or three separate directories on a machine with three cores). You can apparently run several msieve jobs in the same directory but I wouldn't  one directory per job makes life easier. 
20120625, 00:35  #3 
Jun 2012
Boulder, CO
111111000_{2} Posts 
Hi fivemack,
Thanks for your reply. As I'm fairly new to this, out of curiosity  how did you arrive at your base value (10^6) and increment (24000)? Would the same values still be useful for 5, 10, or 20 machines, say? Also, perhaps I'm asking the wrong question. Essentially I'm just trying to parallelize the work of factmsieve (msieve + ggnfs) across several machines. Would I get more benefit out of running msieve on a single machine, perhaps with a smallish time limit, and then the sieving step in parallel? 
20120625, 06:15  #4 
(loop (#_fork))
Feb 2006
Cambridge, England
2·7·461 Posts 
You're definitely asking the right question.
I'm picking 1000000 as the base because that's well into the flat portion of the curve, and 24000 as the increment because that's about 24 hours of work on a C163 on the machine I have access to. You might well want to get the scripts to run 10000001001000, time it, and pick the increment appropriately  but remember that the intrinsic timing variance is very large. I tend to run polynomial selection for somewhere between 10% and 20% of the time I expect the sieving to take (e.g. 24 hours on 24 threads for a C16x which will take about 250 hours on 24 threads to sieve; 24 hours on 4 threads for a C150 which will take about 18 hours on 48 threads to sieve). But the precise amount of selection you do makes very little difference. 
20120625, 16:42  #5 
Sep 2009
2481_{10} Posts 
You could use factMsieve.pl from http://mersenneforum.org/showthread.php?t=15662 post 13. It's designed to do polynomial selection on several systems, then automatically select the best poly and start sieving. Read the whole thread for instructions etc.
Chris 
20120701, 06:38  #6  
Jun 2012
Boulder, CO
111111000_{2} Posts 
Quote:
So, for example, suppose I run polynomial selection on a few systems and end up with .fb output files like: Code:
N 5140013334704851669242405887740004085957839657722764142543276828381162999697124563886598292766328463730755687076646610242046892464460226559 SKEW 589153.91 R0 348443457246036719266568520 R1 1280647128257263 A0 12834409323851043625932493277191207 A1 180101857702596183420795022293 A2 385348143299525459124615 A3 563481396530542305 A4 1408784852902 A5 1000692 Thanks again! 

20120701, 08:05  #7 
Sep 2009
3^{2}×109 Posts 
You'll usually want to choose the polynomial whose Murphy "E" score is the highest
Didn't factMsieve.pl / msieve produce *.poly files that contain a comment indicating this e value ? Anyhow, one of the ways to obtain that score from a .fb file is Code:
~/msieve/msieve i <name>.ini nf <name>.fb nc v Code:
$ ~/msieve/msieve i 6_447_minus1.ini nf 6_447_minus1.fb nc v Msieve v. 1.51 (SVN 719M) Sun Jul 1 10:00:37 2012 random seeds: b32131b6 5d85b626 factoring 616206951833849099509404360836151448327434546464783674219667801836184526666386100726932863976883607096993792653424911139417437242190871728765383549637572368393220053069548102628443735964624521073 (195 digits) searching for 15digit factors commencing number field sieve (195digit input) R0: 808281277464764060643139600456536293376 R1: 1 A0: 36 A1: 0 A2: 0 A3: 6 A4: 0 A5: 0 A6: 1 skew 1.82, size 1.891e11, alpha 0.773, combined = 1.017e12 rroots = 0 commencing relation filtering with target density 70.00 estimated available RAM is 3870.3 MB error: cannot open 'msieve.dat' 
20120701, 16:23  #8  
Sep 2009
3×827 Posts 
Quote:
For really large jobs it's worth doing some test sieving for the poly's with the top few scores to make sure you have the best poly, the score isn't always spot on. But just using the one with the best score works for most projects. Chris 

20120713, 16:04  #9 
Nov 2010
2×5^{2} Posts 
I've made a simple patch for the msieve to be able to run parallel polynomial selection over MPI. Both GPU and CPU modes are supported (not simulteniously but it shouldn't be too hard to add), up to 2 GPUs per node.
There are some troubles with early termination (like by CtrlC) though: MPI daemons I tried wouldn't give the program enough time to properly respond to the termination signal. This basically leave you with the raw .p file as a result and you should pick the best candidate from it some other way. The patch is just a diff in the gnfs/poly directory. I hope someone finds it useful. 
20191116, 19:45  #10 
Nov 2019
1 Posts 
Hi, all!
Please, can help me with stage completing. i running np1 stage in parallel for C173 i have several files msieve.dat.m About 5Gb each file. How can i good poly from this file? Can msieve do this job, or some external tool. thanks! 
Thread Tools  
Similar Threads  
Thread  Thread Starter  Forum  Replies  Last Post 
Msieve parallel poly selection with several GPU cards  jacky  Msieve  8  20170929 13:05 
msieve parallel poly selection with MPI  drone84  Msieve  4  20170628 09:18 
reduce number of coefficient for polynomial selection with msieve on GPU  aein  Factoring  3  20170225 16:42 
msieve 1.52 with GPU polynomial selection  cgy606  Msieve  16  20161006 14:16 
Polynomial selection  CRGreathouse  Factoring  2  20090525 07:55 