mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware

Reply
 
Thread Tools
Old 2021-05-29, 22:22   #1
MarkVanCoutren
 
May 2021

2 Posts
Question Help: Hardware errors / 1 Gerbicz/double check error

I'm new to prime95 and GIMPS and I've been getting this message with my Intel Core i5-9600K 3.7 GHz 6-Core Processor running Windows 10

Iteration: 8280000 / 108671053 [7.61%], ms/iter: 10.106, ETA: 11d 17:48
Hardware errors have occurred during the test!
1 Gerbicz/double-check error.
Confidence in final result is excellent

I've been getting this same sequence every iteration and I think I got it on my last number as well. I've tried stopping it for a few hours to let it cool but it keeps giving me the error.
CPU temp is 51/52 C and the cores are around 61 C each (from Open Hardware Monitor)
I haven't tried to overclock this at all. I just left it running nonstop for a few days. Have I broken my computer?

Last fiddled with by MarkVanCoutren on 2021-05-29 at 22:37 Reason: adding more information
MarkVanCoutren is offline   Reply With Quote
Old 2021-05-29, 23:05   #2
moebius
 
moebius's Avatar
 
Jul 2009
Germany

2×313 Posts
Default

Probably your (PRP) result will be right, so let it run till end.

Some memory modules can't be get stable, or the processor core temperatures are to high.
Example for worst case error ratio:
https://mersenneforum.org/showpost.p...&postcount=159
moebius is offline   Reply With Quote
Old 2021-05-29, 23:06   #3
tuckerkao
 
"Tucker Kao"
Jan 2020
Head Base M168202123

7608 Posts
Default

The situation should be okay, it just indicates that there has been 1 Grebicz error check happened. I met this situation before and my final result was still accurate after the PRP certification from another user.

Once an error has occurred, it'll show on every message. Just let the machine finish the PRP testing. Unless you get 2 error checks for the same block, it should be fine in the end.
tuckerkao is online now   Reply With Quote
Old 2021-05-29, 23:07   #4
VBCurtis
 
VBCurtis's Avatar
 
"Curtis"
Feb 2005
Riverside, CA

23·54 Posts
Default

You haven't broken it. You may have uncovered that it doesn't handle full-power heat generation anymore (say, from dust accumulation), or you may have uncovered a bad memory stick.

If it's a desktop, open it up and blow out the dust. Check to make sure all fans still turn when the machine is powered on. You might also choose to run Prime95 on fewer cores so that it generates less heat- if single-threaded operation still produces these errors, then it is more likely you have a failing memory stick and less likely dust / heat management is the culprit.

You might look into memtest86, or another memory-testing program, to try to narrow down what might be causing the hardware errors.

EDIT: In your post in another thread, you mentioned overclocking. Getting an error like this means you went too far, and need to back off the overclock for stability.

Last fiddled with by VBCurtis on 2021-05-29 at 23:09
VBCurtis is offline   Reply With Quote
Old 2021-05-29, 23:07   #5
Aramis Wyler
 
Aramis Wyler's Avatar
 
"Bill Staffen"
Jan 2013
Pittsburgh, PA, USA

1A816 Posts
Default

As long as it says confidence is excellent you'll be ok. I get some numbers where I have that every time, and other numbers where I don't see it at all. Must be edge cases.
Aramis Wyler is offline   Reply With Quote
Old 2021-05-30, 10:08   #6
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
"name field"
Jun 2011
Thailand

9,787 Posts
Default

If some error happens, like bad GC check or a too-high rounding, etc, then P95 tries to re-do the iteration using a different (slower) method. Sometimes, like for some rounding errors, borderline FFT sizes, as mentioned above, there is no problem, and the slower method will get the same result. Then it will say that the "result is reproducible", so there was not a hardware error. Sometimes the slower calculation gets a different result, and in that case it will redo the GC, or resume from an earlier checkpoint, depending on the situation. If that's the case, your result is still OK, there is no error, and the confidence in the final result being correct is very high. However, the app will let you know that some error happened, so you can take measures in the future (like, dusting, reducing clocks, re-seat the CPU - not reset, re-seat means taking the CPU out, clean, apply new paste, etc, whatever, up to you or your IT guys).

If you see 1 error, 1 error, 1 error, 1 error, 1 error, at every iteration/checkpoint/printing on the screen, there is no problem. This are NOT new errors, it is the same error that happened in the past, the system lets you know, so you can decide. Errors happen to all of us, now and then. I see one or two monthly, or every two mounts, when I overclock. They are harmless.

If you see 1 error, 2 errors, 3 errors, 5 errors, 77 errors, at every iteration/checkpoint/printing on the screen, or if you see 1 error at every test, or often (the counter is reset with the new assignment), then you are in deep shh.. you need to take action. I mean, bad system will continue to produce errors. Then, dusting, reduce clocks, re-seat, whatever the other guys said.

Last fiddled with by LaurV on 2021-05-30 at 10:10
LaurV is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Are the recent advancements for GIMPS (VDF, Gerbicz check, Jacobi check) worth publishing? henryzz Math 37 2020-08-24 23:56
Gerbicz/double-check errors DJN PrimeNet 4 2020-02-20 20:01
Error in stats (Top LL Double-Check Producer) moebius PrimeNet 5 2010-11-09 23:19
First check and double check llrnet servers. opyrt Prime Sierpinski Project 3 2009-01-02 01:50
Events that cause errors - how to check. rx7350 Data 2 2006-03-26 17:51

All times are UTC. The time now is 09:02.


Sat Oct 23 09:02:37 UTC 2021 up 92 days, 3:31, 0 users, load averages: 1.42, 1.31, 1.18

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.