mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > PrimeNet > GPU to 72

Reply
 
Thread Tools
Old 2013-10-24, 21:27   #1
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

3×977 Posts
Default Stuck assignment

So in the assignments page, I have:

Manual 50000387 LL TF 69 72 2013-10-23 15:06 1 16.73

But if you look at mersenne.org for 50000387 it shows complete to 72.
Mark Rose is offline   Reply With Quote
Old 2013-10-24, 21:46   #2
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

7×1,361 Posts
Default

Yeah... Not quite sure how that happened.

I'm guessing that since you tend to ask for the lowest TF level, you've managed to discover a "temporal window" bug on GPU72, in-between the time a low LL is completed and it's release back to the system.

Please feel to throw that candidate back. I'll drill down when I have some time (sorry -- currently dealing with several screaming children...).
chalsall is offline   Reply With Quote
Old 2013-10-24, 21:47   #3
blahpy
 
blahpy's Avatar
 
Jun 2013

107 Posts
Default

I'm thinking that what happened is you reserved 69 to 72, but then you or MISFIT or whatever submitted it as 69 to 70, 70 to 71, 71 to 72. Since these are technically different assignments to just 69 to 72, it wouldn't have taken it off the list.

edit: That, or what chalsall said.

Last fiddled with by blahpy on 2013-10-24 at 21:49
blahpy is offline   Reply With Quote
Old 2013-10-25, 01:11   #4
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

3·977 Posts
Default

Quote:
Originally Posted by chalsall View Post
Yeah... Not quite sure how that happened.

I'm guessing that since you tend to ask for the lowest TF level, you've managed to discover a "temporal window" bug on GPU72, in-between the time a low LL is completed and it's release back to the system.

Please feel to throw that candidate back. I'll drill down when I have some time (sorry -- currently dealing with several screaming children...).
I'll hold onto it until it expires so no one else wastes time factoring it. Hopefully you can look into it by then.

I ask for up to 72 via mfloop.py. But if I see anything lower, like I did earlier today, I'll manually grab it all and stick it on the GTX 760.

Quote:
Originally Posted by blahpy View Post
I'm thinking that what happened is you reserved 69 to 72, but then you or MISFIT or whatever submitted it as 69 to 70, 70 to 71, 71 to 72. Since these are technically different assignments to just 69 to 72, it wouldn't have taken it off the list.

edit: That, or what chalsall said.
It's possible mfloop.py did that. It submits 70->72 jobs all the time without issue, but those are in the 64-69M range. I'll manually merge the result lines and see what happens... and nope, didn't clear it.
Mark Rose is offline   Reply With Quote
Old 2013-11-04, 16:19   #5
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

55638 Posts
Default

I've got the same thing now with 50000243.

Seems to be a problem with the 50M range.
Mark Rose is offline   Reply With Quote
Old 2013-11-04, 16:40   #6
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

253716 Posts
Default

Quote:
Originally Posted by Mark Rose View Post
Seems to be a problem with the 50M range.
Arg!!! Sorry.

I don't understand how you're being assigned LLTF in the 50M range. It must be a race condition, or some other Stupid Programmer Error on my part.

I've "told" Spidy to watch that range more closely; perhaps that will help.

Could you please tell me how you're asking for these assignments? Are you using the GPU72 manual assignment page, MISFIT, or some other automatic assignment methodology? This will help me drill down on this issue.

What is a bit strange is you're the only one who is experiencing this. But then, "strange" is where problems are found, and discoveries are made.
chalsall is offline   Reply With Quote
Old 2013-11-04, 17:01   #7
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

100101001101112 Posts
Default

Quote:
Originally Posted by chalsall View Post
Could you please tell me how you're asking for these assignments?
OK, I went through some of the system's logs, and see that you're using (either directly or through a spider) the manual assignment page.

I've added a quick hack to ensure that nothing below 60M is assigned for LLTFing, which should prevent this issue. Although I would really like to understand how it occurred in the first place -- I'm suspecting a race condition in between when a LL is completed by a GPU72 worker and the candidate is released back to Primenet.

I will drill down further into the logs when I have some more time; currently I don't.
chalsall is offline   Reply With Quote
Old 2013-11-04, 17:33   #8
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

55638 Posts
Default

In both cases, the assignment fetching was using with mfloop.py from https://github.com/teknohog/primetools, which was called from cron like this:

5 * * * * /home/lol/primetools/mfloop.py -e 72 -u shifted -p lolno -U shifted -P nuh-uh -n 4 -t 0 -w /home/lol/mfaktc
Mark Rose is offline   Reply With Quote
Old 2013-11-04, 17:35   #9
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

55638 Posts
Default

Is there anything special that happens at 5 minutes past the hour? I offset my cron calls to be nicer on the server, assuming most people would pick 0.
Mark Rose is offline   Reply With Quote
Old 2013-11-04, 17:51   #10
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

7·1,361 Posts
Default

Quote:
Originally Posted by Mark Rose View Post
In both cases, the assignment fetching was using with mfloop.py...
Thank you for that information. Useful. The script (which I haven't studied) appears to fetch only one assignment at a time; thus more regularly than MISFIT. This supports my theory that what we're seeing here is a race condition.

To be clear, this was my error. teknohog has brought a tool to Linux users which I had promised, but wasn't able to deliver because of other pressing matters. I thank him for his work and contribution.

Quote:
Originally Posted by Mark Rose View Post
Is there anything special that happens at 5 minutes past the hour?
Only that GPU72 doesn't "talk" to Primenet between 55 minutes after the hour and 10 minutes after the hour. Primenet is busy during that time.

Quote:
Originally Posted by Mark Rose View Post
I offset my cron calls to be nicer on the server, assuming most people would pick 0.
It would actually be better if you set your cron job to be at some time between 30 to 45 minutes after the hour.
chalsall is offline   Reply With Quote
Old 2013-11-04, 18:59   #11
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

3·977 Posts
Default

Quote:
Originally Posted by chalsall View Post
Thank you for that information. Useful. The script (which I haven't studied) appears to fetch only one assignment at a time; thus more regularly than MISFIT. This supports my theory that what we're seeing here is a race condition.

To be clear, this was my error. teknohog has brought a tool to Linux users which I had promised, but wasn't able to deliver because of other pressing matters. I thank him for his work and contribution.
It does fetch multiple assignments in a single execution. I don't know if it makes multiple API calls; I haven't checked either as it "just works". If you're seeing multiple requests from me, it may be because I'm running multiple copies, one for each card I have. I also run mfloop.py hourly because I like to see my stats updated more often.

Quote:
Only that GPU72 doesn't "talk" to Primenet between 55 minutes after the hour and 10 minutes after the hour. Primenet is busy during that time.

It would actually be better if you set your cron job to be at some time between 30 to 45 minutes after the hour.
Done :)
Mark Rose is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
I am new Running Factmsieve i am Stuck please help zukhruf Msieve 2 2015-11-25 12:58
Stuck Completed Assignments NickOfTime GPU to 72 8 2014-11-25 19:14
Stuck curtisc exponents? NBtarheel_33 PrimeNet 70 2012-08-07 11:06
Processor stuck to heatsink patrik Hardware 10 2005-04-27 22:40
Torture test stuck stephenX Hardware 10 2005-01-20 15:58

All times are UTC. The time now is 05:40.

Sun Apr 18 05:40:52 UTC 2021 up 10 days, 21 mins, 0 users, load averages: 0.95, 1.22, 1.44

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.