20051008, 10:43  #1 
Aug 2005
69469, Germany
3×5 Posts 
Suggestion: Alternate exponent distribution
Hello folks,
today i've written a really bad test at the university and because my knowledge in some cases showed some "holes", I used the time to think about the current expodistribution of GIMPS. Currently a exponent is given out to LLing and some time after (months? years?) the exponent get's a doublechecking... Now the way I thought of is a bit different... What about giving one exponent to two computers at the same time? The one who reports at first has done the first time and the second reporter did the doublechekcing in nearly the same time. With this way of distribution the checking would be slower but more acurate. 
20051008, 14:05  #2 
Aug 2002
43×199 Posts 
I'd like to see an option for one LL and then one DC, or maybe a ratio... (I'm talking about on the same box!)
I like to have a box verified periodically just to be safe... I can do it manually, but it is a bit of work... http://www.teamprimerib.com/rr1/bin/...er.php?u=Xyzzy See how some of my boxes returned bad work? Had I not checked them, they'd probably still be returning bad work... Or maybe the default should be a few DC, until the box returns a correct result? I suppose that would require some serverside programming, though... Above all, I fear wasted cycles... 
20051009, 00:38  #3 
Aug 2002
2^{6}·5 Posts 
Sending out an exponent twice at the same time makes it easier to cheat the system.
By "timeshifting" the double check, it makes it highly unlikely two people can collude to fake results. 
20051009, 02:29  #4  
"Richard B. Woods"
Aug 2002
Wisconsin USA
2^{2}×3×641 Posts 
Quote:
(For those of you thinking, "What about improvements in program speed over time?"  That doesn't really matter if the improvement is in LL speed, because that applies to both first and DC runs. At any given time, GIMPS has a potential contribution of xxx CPUyears per day and the project's overall progress doesn't depend on how that's divided between firsttime and DC. And don't quibble about differences in rate of adoption of new software versions between firsttime and DC systems! That's negligible.) As for accuracy: no, again the assignment algorithm makes no significant difference. Remember when considering whether a change in assignment method helps or hurts overall GIMPS throughput: If we shift the balance of computing power from one type of work to another, it slows down progress on the former by the same amount it speeds up the latter.    Besides, having a lag between firsttime and DC makes it easier for "slow" systems to contribute. (Not because of computer considerations, but because of psychological factors) Last fiddled with by cheesehead on 20051009 at 02:36 

20051025, 08:53  #5 
Aug 2005
69469, Germany
F_{16} Posts 
The point I think about is that with the current system, the fast machines do the first LL and the slow machines do the doublechecking. Because of that the first time LLs are faster and ckeck more numbers than the doublecheckers. so there ist already a huge gap between both checks. And this gap will grow further...

20051025, 10:15  #6 
Aug 2002
Termonfeckin, IE
2768_{10} Posts 
The gap had been growing for some time but it has finally started to narrow. The vast majority of Prime95/mprime users leave their exponent selection to default. Hence the preset limits chosen by George overwhelmingly decide what machine gets what. These limits have been revised for v24. So as more people download v24 more middling machines (900MHz1200MHz) get doublechecks instead of LL tests. Moreover, Team_Prime_Rib which is ranked second in the project has recently moved a lot of their machines to Factoring instead of LL testing so this has also slowed the progress of LL testing somewhat.
Looking back at old summary files I see that at one time over 72000 firsttime exponents were assigned and the number of doublechecks assigned at that time was a shade over 12000. These numbers have steadily improved and now the number of firsttime LL tests assigned is around 57000 while DCs are at about 19000. Hence the gap is now narrowing. 
20051025, 11:01  #7 
Jul 2004
Potsdam, Germany
831_{10} Posts 
For GIMPS, I'd say that a quick doublecheck is not really needed. Sure, it's good to find a prime, but there are no real differences between now and later.
For SoB, it would be a different story, of course, because a found prime eliminates all further tests for the corresponding k. While we're at it, the creators of SoB (or were it some active forum members? I don't remember...) approximated that with their current error rate, the probabilitiesoptimal situation would be when doublechecks are at half the n value as firsttime tests. The higher the error rate, the more time should be spend on doublechecking, of course... 
20051025, 12:43  #8  
"Richard B. Woods"
Aug 2002
Wisconsin USA
1E0C_{16} Posts 
Quote:
Can you describe any actual harm to the project that's caused by the size of that gap? If not, why do you think it would cause any harm in the future? Quote:
Again, one needs to analyze the situation carefully. Just knowing that fast machines do one type of work and slow machines do another does not give us enough information to decide anything about the changes, size, or even the very existence of any gap between the ranges of exponents the two classes of machines are working on. More important, for instance, are a) the ratio of total CPUyears per day of work accomplished by the two classes of computers, and b) the ratio of CPUyears required per exponent in the two ranges. Other factors include the rates at which machines enter and leave the fast and slow categories, the boundary between "fast" and "slow", the extent to which the ranges of assignments in progress overlap between the fast and slow classes, and the distribution curves of assignments in progress [though that's really a subheading under b) above]. Quote:
GIMPS can take care of itself. There are several adjustments that the folks in charge of the project can make. In fact, each of the four factors I listed in my "Other factors include ..." sentence above is partially or wholly adjustable by project administrators and participants. Last fiddled with by cheesehead on 20051025 at 12:47 

20051025, 15:34  #9  
Aug 2002
Termonfeckin, IE
2^{4}×173 Posts 
Quote:
That said as a 4 year GIMPS participant I would totally agree that things tend to level out in the longrun. Firsttimers in GIMPS have been around twice as large as doublechecks for all this time and George tweaks the limits as necessary once every year or two. PS: As a private aside I am up to $265M. I am catching up to you in HSX!! 

20051026, 14:12  #10 
"GIMFS"
Sep 2002
Oeiras, Portugal
3043_{8} Posts 
I agree that the really important point is to get work done, be it LL or DC. In the end, both are needed to move forward.
But I can´t help insisting again on the benefits of having DCs assigned to new machines until they check in a good result:  It would uncover defective machines sooner.  It would increase the motivation of many folks, as the first results would take less time to appear. My 2 cents... Last fiddled with by lycorn on 20051026 at 14:14 
20051026, 17:52  #11  
"Richard B. Woods"
Aug 2002
Wisconsin USA
7692_{10} Posts 
Quote:
For example, I almost wrote that using DCs to screen new machines can help progress by minimizing the fraction of assignments' work that will turn out bad. But then I realized that the "good" machines will just keep on churning out good results at the same pace either way, even in triplechecking the "bad" results (because eventually we need two matching "good" results no matter how many "bad" results there are for any given exponent!). Now, if DC screening allows some "bad" machines to be fixed (e.g., upgrading cooling or memory) sooner than they would have been if they'd been assigned firsttime LLs, then it's possible that that might allow them to join the ranks of "good" machines sooner and then start "good" contributions sooner. But that's an indirect speedup, not a direct one. 

Thread Tools  
Similar Threads  
Thread  Thread Starter  Forum  Replies  Last Post 
How to read the "Exponent Status Distribution"  3mg4  Information & Answers  23  20200724 13:59 
Milestones vs. Exponent Status Distribution  heliosh  Information & Answers  6  20200720 19:27 
Mersenne Prime Exponent Distribution  PawnProver44  Miscellaneous Math  26  20160318 08:48 
Primenet exponent status distribution archived data  James Heinrich  Data  2  20120201 21:14 
suggestion: "check exponent status" page  ixfd64  Lounge  3  20040527 00:51 