mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   Hardware (https://www.mersenneforum.org/forumdisplay.php?f=9)
-   -   Hardware repair odyssey (https://www.mersenneforum.org/showthread.php?t=25342)

Prime95 2020-03-08 01:08

[QUOTE=preda;539069]What may have happened in your case is that you added one GPU in the "forbidden slot" that triggers ROCm. In that situation, all the instances on all GPUs become 100%-cpu threads.[/QUOTE]

Sandy Bridge lives again. I revived this antique computer and moved the GPU to it for the time being. I can't see spending $100-150 on a Haswell motherboard when Haswell gives 1/10th the performance of a Radeon VII. I can move the memory to another Haswell box which might give mprime a small boost.

I'll eventually start disassembling one of the dream machines and move the GPU and the RMA'd GPU to these mini-itx motherboards.

preda 2020-03-08 13:03

I just spent a huge amout of time diagnosing why my system with 3x GPUs started freezing after a reboot when it has been working fine before. It seemed one GPU was causing the problem, so I started (in turn) switching GPUs around, switching the PCIe slots they're connected to, etc to see to what element the problem stays attached to.

And very surprisingly, it seems the problem was attached to the power cable..! (the cable connecting the PSU to the GPU). Yet the GPU was starting up fine, just dying a few seconds after starting gpuowl on it with very exotic errors..; anyway I'm happy it seems fixed now.

dcheuk 2020-03-08 19:12

[QUOTE=preda;539159]I just spent a huge amout of time diagnosing why my system with 3x GPUs started freezing after a reboot when it has been working fine before. It seemed one GPU was causing the problem, so I started (in turn) switching GPUs around, switching the PCIe slots they're connected to, etc to see to what element the problem stays attached to.

And very surprisingly, it seems the problem was attached to the power cable..! (the cable connecting the PSU to the GPU). Yet the GPU was starting up fine, just dying a few seconds after starting gpuowl on it with very exotic errors..; anyway I'm happy it seems fixed now.[/QUOTE]

Lol :smile:

Had this problem on a Mac Pro couple years ago, almost pulled my hair out figuring out what was wrong.

kriesel 2020-03-09 15:04

[QUOTE=preda;539159]I just spent a huge amout of time diagnosing why my system with 3x GPUs started freezing after a reboot ..., it seems the problem was attached to the power cable..! (the cable connecting the PSU to the GPU). Yet the GPU was starting up fine, just dying a few seconds after starting gpuowl on it with very exotic errors
...
anyway I'm happy it seems fixed now.[/QUOTE]I've seen it take 45 minutes or more for issues to develop.

I just redid the cabling a bit on my mini-miner recently, which currently has 5 NIVIDIA gpus of 3 different models on it. So far it seems that arranging for no more than 2 connections to a gpu or pcie-extender card per power cable regardless of connector count on the cable is working better. V=IR


All times are UTC. The time now is 15:09.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2020, Jelsoft Enterprises Ltd.