mersenneforum.org  

Old 2020-03-08, 01:08   #12
Prime95
P90 years forever!
 

Quote:
Originally Posted by preda
What may have happened in your case is that you added one GPU in the "forbidden slot" that triggers ROCm. In that situation, all the instances on all GPUs become 100%-cpu threads.
Sandy Bridge lives again. I revived this antique computer and moved the GPU to it for the time being. I can't see spending $100-150 on a Haswell motherboard when Haswell gives 1/10th the performance of a Radeon VII. I can move the memory to another Haswell box which might give mprime a small boost.

I'll eventually start disassembling one of the dream machines and move its GPU and the RMA'd GPU to these mini-ITX motherboards.
Old 2020-03-08, 13:03   #13
preda
 
"Mihai Preda"

I just spent a huge amount of time diagnosing why my system with 3x GPUs started freezing after a reboot when it had been working fine before. It seemed one GPU was causing the problem, so I started (in turn) switching GPUs around, switching the PCIe slots they're connected to, etc., to see which element the problem stays attached to.

And, very surprisingly, it seems the problem was attached to the power cable (the cable connecting the PSU to the GPU)! Yet the GPU was starting up fine, just dying a few seconds after starting gpuowl on it, with very exotic errors. Anyway, I'm happy it seems fixed now.
Old 2020-03-08, 19:12   #14
dcheuk
 

Quote:
Originally Posted by preda
I just spent a huge amount of time diagnosing why my system with 3x GPUs started freezing after a reboot when it had been working fine before. It seemed one GPU was causing the problem, so I started (in turn) switching GPUs around, switching the PCIe slots they're connected to, etc., to see which element the problem stays attached to.

And, very surprisingly, it seems the problem was attached to the power cable (the cable connecting the PSU to the GPU)! Yet the GPU was starting up fine, just dying a few seconds after starting gpuowl on it, with very exotic errors. Anyway, I'm happy it seems fixed now.
Lol

Had this problem on a Mac Pro a couple of years ago, almost pulled my hair out figuring out what was wrong.
Old 2020-03-09, 15:04   #15
kriesel
 
"TF79LL86GIMPS96gpu17"

Quote:
Originally Posted by preda
I just spent a huge amount of time diagnosing why my system with 3x GPUs started freezing after a reboot ..., it seems the problem was attached to the power cable (the cable connecting the PSU to the GPU)! Yet the GPU was starting up fine, just dying a few seconds after starting gpuowl on it with very exotic errors
...
anyway I'm happy it seems fixed now.
I've seen it take 45 minutes or more for issues to develop.

I just redid the cabling a bit on my mini-miner recently, which currently has 5 NVIDIA GPUs of 3 different models on it. So far it seems that arranging for no more than 2 connections to a GPU or PCIe extender card per power cable, regardless of connector count on the cable, is working better. V=IR
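
As a rough illustration of the V=IR point: the voltage lost across a cable grows in proportion to the current it carries, so every extra connection fed from the same cable adds to the drop. The sketch below only does that arithmetic. The cable resistance and per-connection current are made-up round numbers, not measurements from any rig in this thread, and the function names are hypothetical.

```python
def cable_voltage_drop(current_amps: float, resistance_ohms: float) -> float:
    """Voltage lost across a cable carrying current_amps (Ohm's law, V = I * R)."""
    return current_amps * resistance_ohms


# Assumed figures, for illustration only (not taken from the thread).
CABLE_RESISTANCE_OHMS = 0.02  # hypothetical resistance of one PSU cable run
AMPS_PER_CONNECTION = 6.0     # hypothetical 12 V current drawn per connection


def drop_for_connections(n_connections: int) -> float:
    """Voltage drop when n_connections share a single power cable."""
    total_current = n_connections * AMPS_PER_CONNECTION
    return cable_voltage_drop(total_current, CABLE_RESISTANCE_OHMS)


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(f"{n} connection(s) on one cable: {drop_for_connections(n):.2f} V drop")
```

With these assumed values the drop scales linearly with the number of connections on the cable, which is consistent with spreading the load over more cables helping.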
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.