mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   PrimeNet (https://www.mersenneforum.org/forumdisplay.php?f=11)
-   -   OFFICIAL "SERVER PROBLEMS" THREAD (https://www.mersenneforum.org/showthread.php?t=5758)

Prime95 2021-02-15 10:19

[QUOTE=slandrum;571637]What about the off-by-5 in the 54M to 55M range? There are 5 more needing DC than are assigned DC currently, and none are available for DC.[/QUOTE]

Those 5 were all unaccounted for -- neither assigned or available. They are now available.

Viliam Furik 2021-02-15 11:34

[QUOTE=retina;571634]Yes, this.

And expire them as usual. No special treatment.[/QUOTE]

Expire, yes. But keep the RES64 for DC.

chalsall 2021-02-15 15:02

[QUOTE=Prime95;571641]Those 5 were all unaccounted for -- neither assigned or available. They are now available.[/QUOTE]

And, another oddity...[CODE]101000000 54316 | 35141 14659 4471 [COLOR="Red"]45[/COLOR] | 135 6 [COLOR="Red"]39 6[/COLOR] | [COLOR="red"]1[/COLOR] 4323 |[/CODE]

kriesel 2021-02-15 15:43

unable to get p-1 manual assignments (unwanted double checks substituted)
 
Are others seeing this issue?

~0730 UTC 2021-02-15, 32 of 32 attempts at manual P-1 assignments (an attempt at a batch of 31 followed by a retest requesting 1), respond like this:
[CODE]
Not enough available memory for P-1 factoring assignments.

DoubleCheck=(AID),58660957,74,1[/CODE]Problem persists now ~1530 UTC 2021-02-15 (a single 11-assignment request)
Can not get any new P-1 assignments for gpuowl v6.11 on gpus.
It takes around 30 per Radeon VII per day at the P-1 for first-test wavefront.

PM to James Heinrich ~0730 UTC but it is outside his realm.
PM to Prime95 ~1531 UTC
I can temporarily busy the gpus going empty, with PRP, or P-1/PRP for high exponents, but that does not help clear the P-1 for others doing wavefront PRP who perhaps have too little ram for an efficient P-1.

(edit:) FWIW, PrimeNet API/prime95 does not appear to be affected, as of 2021-02-15 1130 UTC.
(edit 2:) Assigning LL DC 1:1 in place of requested P-1 is overkill, comprising several times the GhzD per assignment at gpuowl default P-1 bounds. (LL DC 56M ~ 120. GhzD, 102M P-1 at 1M, 30M ~15. GhzD, each; ~8:1 ratio over requested work.)

Zenzoma 2021-02-15 19:09

Happened to me too. I got 13 DC LL's instead of PM1's a few hours ago.

Each manual assignment had this text:

"Not enough available memory for P-1 factoring assignments."

Prime95 2021-02-15 21:03

[QUOTE=MattL;571669]Happened to me too. I got 13 DC LL's instead of PM1's a few hours ago.[/QUOTE]

Please try again.

I increased the CPU power required to get first-time tests. While I was at it I increased the amount of RAM a client must have to get P-1 assignments. In the process, I messed up manual assignments.

Uncwilly 2021-02-15 22:03

[QUOTE=Prime95;571686]While I was at it I increased the amount of RAM a client must have to get P-1 assignments.[/QUOTE]Can you start enforcing expiration on P-1?
There are a bunch here with no progress and are a month old:
[url]https://www.mersenne.org/assignments/?exp_lo=101078797&exp_hi=103000000&execm=1&exdchk=1&exfirst=1&extf=1&excert=1[/url]

slandrum 2021-02-16 05:23

Off by one in the 101M to 102M range
 
2 Attachment(s)
Snapshot at the hour, from the milestones page it shows 33 needing to be cleared, with 2 available. From the work distribution map it shows 32 w/o LL (not 33), but also shows 28 assigned to PRP/LL, 3 assigned to P-1, and 2 available for LL/PRP assignment (which does add to 33).

ATH 2021-03-14 13:29

1 Attachment(s)
Server seems to be down for the last hour or so. Manual connection and connection from Prime95 fails.

EDIT: Server is up again now at: 2:40pm UTC

James Heinrich 2021-03-18 23:15

I seem to be getting a lot of "Error getting CERT starting value" errors:[quote][Mar 18 18:14] Restarting worker to do priority work.
[Mar 18 18:14] Setting affinity to run helper thread 1 on CPU core #2
[Mar 18 18:14] Setting affinity to run helper thread 2 on CPU core #3
[Mar 18 18:14] Setting affinity to run helper thread 3 on CPU core #4
[Mar 18 18:14] Setting affinity to run helper thread 4 on CPU core #5
[Mar 18 18:14] Setting affinity to run helper thread 5 on CPU core #6
[Mar 18 18:14] Starting certification of M108722261 using AVX FFT length 6M, Pass1=384, Pass2=16K, clm=4, 6 threads
[Mar 18 18:14] Error getting CERT starting value.
[Mar 18 18:14] Will retry certification later.
[Mar 18 18:14] Aborting processing of this work unit -- will try again later.[/quote][code][Thu Mar 18 02:58:14 2021]
Error getting CERT starting value.
Error getting CERT starting value.
[Thu Mar 18 03:37:01 2021]
Error getting CERT starting value.
Error getting CERT starting value.
[Thu Mar 18 04:37:02 2021]
Error getting CERT starting value.
Error getting CERT starting value.
[Thu Mar 18 05:37:01 2021]
Error getting CERT starting value.
Error getting CERT starting value.
Abandoning certification of M102864757.
Error getting CERT starting value.
[Thu Mar 18 06:37:05 2021]
Error getting CERT starting value.
[Thu Mar 18 07:37:05 2021]
Error getting CERT starting value.
[Thu Mar 18 08:37:01 2021]
Error getting CERT starting value.
[Thu Mar 18 09:19:28 2021]
Error getting CERT starting value.
[Thu Mar 18 13:56:32 2021]
Error getting CERT starting value.
[Thu Mar 18 15:38:34 2021]
Error getting CERT starting value.
[Thu Mar 18 16:14:52 2021]
Error getting CERT starting value.
[Thu Mar 18 17:14:51 2021]
Error getting CERT starting value.
[Thu Mar 18 18:14:51 2021]
Error getting CERT starting value.
[Thu Mar 18 18:53:37 2021]
UID: JamesHeinrich/3930K, M103151507 completed P-1, B1=836000, B2=43086000, Wi4: 6E32A7D2, AID: 873B35F5D7F1ACC173BECBEB1825AAF5
[Thu Mar 18 19:14:49 2021]
Error getting CERT starting value.[/code]

Prime95 2021-03-19 01:42

Odd, curtisc got reassigned the CERT on 102864757 and completed it OK.


All times are UTC. The time now is 23:20.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.