mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2020-05-14, 19:37   #265
ewmayer
2ω=0
 
ewmayer's Avatar
 
Sep 2002
República de California

3·53·73 Posts
Default

Quote:
Originally Posted by xx005fs View Post
Actually not bad when considering brand new, but the pro cards can't be overclocked so it will for sure be memory bound. However, RVII looks to be a much better deal when buying used.
And if the Pro finds a market, we can hope that the older R7s will start coming onto the used-gear market in decent volume, thus pushing prices for those down a bit. Or am I being overoptimistic?
ewmayer is offline   Reply With Quote
Old 2020-05-14, 19:55   #266
xx005fs
 
"Eric"
Jan 2018
USA

110101002 Posts
Default

Quote:
Originally Posted by ewmayer View Post
And if the Pro finds a market, we can hope that the older R7s will start coming onto the used-gear market in decent volume, thus pushing prices for those down a bit. Or am I being overoptimistic?

That's certainly possible, but I don't think many workstation users will use Radeon Pros over Quadros. However, I am sure that the release of new gaming cards will make the Radeon VII gamers dump them for the new rx 6000 or rtx 3000 series



On a side note, with Nvidia's announcement of Ampere GA100 today, if they by chance release a card similar to the Titan V called the Titan A or something that has all of the capability of the server GA100 (maybe with 1 less stack of HBM2e), then it will be an absolute beast in PRP, but I certainly don't expect the price to drop.
xx005fs is offline   Reply With Quote
Old 2020-06-18, 22:47   #267
Viliam Furik
 
"Viliam Furík"
Jul 2018
Martin, Slovakia

2·193 Posts
Default Radeon VII throwing errors more than I would like

My settings are:
Voltage: 975 mV
Power limit: -15%
Core clock: 1450 MHz
Memory: 1100 MHz

I am applying them with the MSI Afterburner software.

Temperatures are:
GPU temp: about 75 °C
Hot spot temp: about 95 °C

As I mentioned in the DC and TC thread, I have done two LLDC tests, neither of them matched. TC is in progress. I have done third LLDC, on a known residue (50000059), this time it matched.

Can somebody give me advice on how to change the underclock to not give errors, and maybe also how to keep it cooler at the same time?

BTW both not-matched tests had one Jacobi error in them. The matching one also had one Jacobi error, but that was obviously fixed. I can now only hope that this was the same scenario as in the first two tests.

I will give a status update when the TCs are finished.
Viliam Furik is offline   Reply With Quote
Old 2020-06-18, 23:13   #268
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

2×11×61 Posts
Default

Quote:
Originally Posted by Viliam Furik View Post
My settings are:
Voltage: 975 mV
Power limit: -15%
Core clock: 1450 MHz
Memory: 1100 MHz

I am applying them with the MSI Afterburner software.

Temperatures are:
GPU temp: about 75 °C
Hot spot temp: about 95 °C

As I mentioned in the DC and TC thread, I have done two LLDC tests, neither of them matched. TC is in progress. I have done third LLDC, on a known residue (50000059), this time it matched.

Can somebody give me advice on how to change the underclock to not give errors, and maybe also how to keep it cooler at the same time?

BTW both not-matched tests had one Jacobi error in them. The matching one also had one Jacobi error, but that was obviously fixed. I can now only hope that this was the same scenario as in the first two tests.

I will give a status update when the TCs are finished.
Please do PRP testing while you tune the GPU setup. When it's rock-solid (less than one PRP error per week) you can switch to LL.

It's possible that the MEM overclock is too high. Or that the voltage too low. To keep it cool, lower the sclk (core clock).
preda is offline   Reply With Quote
Old 2020-06-19, 00:02   #269
ewmayer
2ω=0
 
ewmayer's Avatar
 
Sep 2002
República de California

3×53×73 Posts
Default

Viliam, your core-clock of 1450MHz is roughly midway between what an R7 runs at under Linux/ROCm at sclk =3 (1373MHz) and 4 (1547MHz). I've had no problems running any of my 4 R7s at those clocks and mclk = 1150MHz, but I could see one with a slightly-lesser-grade chunk of Silicon causing problems at your settings. Echoing Mihai, I suggest:

1. Reduce your core-clock to, say, 1350MHz;
2. What is your system-default mclk frequency? I'm guessing it's < 1100MHz, suggest you revert to it and...
3. Do several PRPs - preferably PRP-DCs so you can be 100% confident in an "error free" run really having been so - and use the much-better error-checking of those to tune your hardware.
ewmayer is offline   Reply With Quote
Old 2020-06-19, 00:13   #270
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

162718 Posts
Default

Quote:
Originally Posted by Viliam Furik View Post
Can somebody give me advice on how to change the underclock to not give errors, and maybe also how to keep it cooler at the same time?
Not specific advice to your question:

Consider a Linux setup. AMD's linux driver and opencl compiler are *way* better (faster, more reliable) in the Linux environment.
Prime95 is offline   Reply With Quote
Old 2020-06-19, 04:51   #271
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

32×19×43 Posts
Default

For sale (not from me): A used Radeon VII, $550

https://hardforum.com/threads/fs-rad...-16gb.1998015/
Prime95 is offline   Reply With Quote
Old 2020-06-19, 11:33   #272
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

3×1,637 Posts
Default

Thanks for taking reliability seriously. (Some of the following may be quite obvious to you. If so, consider it might not be to someone after you, and help them.)

PRP with the highly reliable GEC to detect errors is the way to go to find settings for which your gpu is stable. Establish stability at stock memory clock rate first. If you don't have that, you probably won't at anything faster. If not too much time has passed, and it's not stable at 1000Mhz, you should be able to exchange it for one that is, or get a refund and buy one from somewhere else.
Reducing gpu clock rate reduces power and indirectly reduces temperature. Cooler electronics live longer and can be more reliable.
Lifting the high end of the fan curve may help temperatures with less impact on throughput.
On Windows the driver install package often has a utility for doing all that. Save the tuned profile of settings for reuse. On linux Ernst's thread on setting up gpuowl for linux in the gpuowl subforum may help.

After you find and save and set at startup, settings that provide reliability, then you may be able to run LL or P-1 reliably, and verify that by doing some LL DC that matches and P-1 on test exponents with known factors.
Electronics change with age. I suggest retesting for reliability and reevaluating annually.

Good luck.

Last fiddled with by kriesel on 2020-06-19 at 11:36
kriesel is offline   Reply With Quote
Old 2020-07-25, 16:31   #273
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

3·1,637 Posts
Default B&H Photo listing Radeon Pro VII for preorder

Expected availability end of August. https://www.bhphotovideo.com/c/produ...vii_16gb.html/
Note "noncancelable and nonreturnable".
kriesel is offline   Reply With Quote
Old 2020-07-30, 21:24   #274
ewmayer
2ω=0
 
ewmayer's Avatar
 
Sep 2002
República de California

265278 Posts
Default

Quote:
Originally Posted by kriesel View Post
Expected availability end of August. https://www.bhphotovideo.com/c/produ...vii_16gb.html/
Note "noncancelable and nonreturnable".
End of august, $1899, with those terms? No thanks. Amazon just undercut that by $400, free-ship, "Arrives: Aug 10 - 12; Fastest delivery: Aug 7 - 11. Only 9 left in stock - order soon."

They also have this, which I'd never seen before:

AMD Radeon Vega Frontier Edition Liquid Retail

Is this basically an R7 w/liquid-cooling? Because it's priced $890 new, ~$800 from the cheapest reseller, both lower than used R7s have been going for of late. Above is w/free-shipping, and free returns.

Last fiddled with by ewmayer on 2020-07-30 at 21:25
ewmayer is offline   Reply With Quote
Old 2020-07-30, 22:05   #275
Runtime Error
 
Sep 2017
USA

3·7·11 Posts
Default

Quote:
Originally Posted by ewmayer View Post
End of august, $1899, with those terms? No thanks. Amazon just undercut that by $400, free-ship, "Arrives: Aug 10 - 12; Fastest delivery: Aug 7 - 11. Only 9 left in stock - order soon."
That's for the Radeon Pro WX 9100, not the Radeon Pro VII. Maybe you copy/pasted a different link?

Pro WX 9100: https://www.amd.com/en/products/prof...-liquid-cooled
Pro VII: https://www.amd.com/en/products/prof...radeon-pro-vii


Quote:
Originally Posted by ewmayer View Post
They also have this, which I'd never seen before:

AMD Radeon Vega Frontier Edition Liquid Retail

Is this basically an R7 w/liquid-cooling? Because it's priced $890 new, ~$800 from the cheapest reseller, both lower than used R7s have been going for of late. Above is w/free-shipping, and free returns.
Not an R7. The specs say it has 819 GFLOPs double precision, while R7s have 3.46 TFLOPs double precision:

VF: https://www.amd.com/en/products/prof...ion-air-cooled
R7: https://www.amd.com/en/products/graphics/amd-radeon-vii

Edit: Scroll to the bottom of these pages for specs.

Last fiddled with by Runtime Error on 2020-07-30 at 22:11 Reason: url formatting
Runtime Error is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Vega 20 announced with 7.64 TFlops of FP64 M344587487 GPU Computing 4 2018-11-08 16:56
GTX 1180 Mars Volta consumer card specs leaked tServo GPU Computing 20 2018-06-24 08:04
RX Vega performance xx005fs GPU Computing 5 2018-01-17 00:22
Radeon Pro Duo 0PolarBearsHere GPU Computing 0 2016-03-15 01:32
AMD Radeon R9 295X2 firejuggler GPU Computing 33 2014-09-03 21:42

All times are UTC. The time now is 05:22.

Tue Mar 2 05:22:58 UTC 2021 up 89 days, 1:34, 0 users, load averages: 2.40, 2.84, 2.63

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.