#2553 | |
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
132·29 Posts |
Quote:
It probably also ought to indicate which version of gpuowl was used for that timing. Finally, please sort by model.
#2554 |
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
132×29 Posts |
Windows AMD Adrenalin driver difference, Windows 10 Pro x64, XFX Radeon VII and XFX 5700XT
Code:
Radeon VII (power limited to ~1670 MHz GPU clock for temperature control):
Mersenne exponent | FFT length (M words) | Gpuowl version | us/it PRP, 20.4.2 | us/it PRP, 20.10.1 | Delta, 20.4.2 to 20.10.1 (us/it) | Delta (%) |
642589933 | 36M 4K:9:512 | v6.11-364-g36f4e2a | 6864 | 6944 | +80 | +1.17 |
843112609 | 48M 4K:12:512 | v7.0-35-gf06bc5b | 10063 | 10433 | +370 | +3.68 |
5700XT (free-running, not power limited):
852348659 | 48M 4K:12:512 | v6.11-364-g36f4e2a | 21829 | 21319 | -510 | -2.34 |

I was contemplating the driver update anyway, since with the April driver, running the 5700XT caused enough driver and system instability to deter using it. I've previously seen a speed penalty as high as 5% for a newer driver major version on older AMD GPUs. The % deltas are given with excess digits to avoid adding rounding error and are maybe significant to a full decimal digit. Early indications after ~12 hours are that stability is better with 20.10.1; no issues yet.
Last fiddled with by kriesel on 2020-10-30 at 15:57
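For anyone who wants to repeat the comparison with their own timings, the delta columns can be recomputed as below. This is only a minimal sketch: the dictionary labels and the `driver_delta` helper are names made up for illustration, and the timing values are the ones posted above.

```python
# (us/it with driver 20.4.2, us/it with driver 20.10.1), copied from the post above.
# The dictionary labels are illustrative only.
timings = {
    "Radeon VII, 642589933": (6864, 6944),
    "Radeon VII, 843112609": (10063, 10433),
    "5700XT, 852348659": (21829, 21319),
}

def driver_delta(old_us, new_us):
    """Return (absolute delta in us/it, relative delta in %) of new vs. old."""
    delta = new_us - old_us
    return delta, 100.0 * delta / old_us

for label, (old_us, new_us) in timings.items():
    delta, pct = driver_delta(old_us, new_us)
    # Positive values mean the newer driver is slower (more microseconds per iteration).
    print(f"{label}: {delta:+d} us/it, {pct:+.2f} %")
```

The percentage is taken relative to the older driver's timing, which matches the posted +1.17, +3.68 and -2.34 figures.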
#2555 |
Jul 2009
Germany
547 Posts |
I still use Adrenalin 19.11.3, Win64 10 Pro 1909, and v6.11-364-g36f4e2a with an RX Vega 64.
107868373 FFT: 6M 1K:12:256 1775 us/it PRP
Why update if everything is stable?
#2556 |
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
132×29 Posts |
Back at gpuowl V1.9, a driver update was necessary to get V2.0 to work at all, and it cost 5.1% in performance on the RX480 & RX550 in V1.9 (driver v19.x vs. v18.y), as I recall.
#2557 |
Jul 2009
Germany
547 Posts |
Does anyone have a Radeon RX 590 or below to compare? This card should perform reasonably well for a consumer card, as it can do 0.445 TFLOPS FP64 and 7.119 TFLOPS FP32.
Last fiddled with by moebius on 2020-10-30 at 21:56 |
#2558 | |
"Ethan O'Connor"
Oct 2002
GIMPS since Jan 1996
22×23 Posts |
Quote:
1) GPUOpen OpenCL SDK: https://github.com/GPUOpen-Libraries...L-SDK/releases (3.0 tested)
2) Intel OpenCL SDK: https://software.intel.com/content/w...pencl-sdk.html (2020 Update 3 tested)
3) nvidia OpenCL from the Cuda Toolkit: https://developer.nvidia.com/cuda-do...et_arch=x86_64 (11.1 Update 1 tested)

I ran prp 1000003 with each build on a 1080ti as a quick check and the residue looked fine. I'm attaching binaries in case anyone wants to test these more thoroughly or on different hardware.
#2559 |
Jul 2009
Germany
547 Posts |
Please read this thread regarding v7.1
https://mersenneforum.org/showthread.php?t=26152 |
#2560 |
Jul 2009
Germany
547 Posts |
gpuowl-win.exe -iters 200000 -prp 77936867
2020-11-01 01:30:36 gpuowl v6.11-364-g36f4e2a
2020-11-01 01:30:36 Note: not found 'config.txt'
2020-11-01 01:30:36 config: -iters 200000 -prp 77936867
2020-11-01 01:30:36 device 0, unique id ''
2020-11-01 01:30:36 GeForce RTX 3080-0 77936867 FFT: 4M 1K:8:256 (18.58 bpw)
2020-11-01 01:30:36 GeForce RTX 3080-0 Expected maximum carry32: 583B0000
2020-11-01 01:30:36 GeForce RTX 3080-0 OpenCL args "-DEXP=77936867u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=8u -DPM1=0 -DMM_CHAIN=1u -DMM2_CHAIN=2u -DMAX_ACCURACY=1 -DWEIGHT_STEP_MINUS_1=0xa.c42d0d7cec038p-5 -DIWEIGHT_STEP_MINUS_1=-0x8.0e50c8817ddf8p-5 -cl-unsafe-math-optimizations -cl-std=CL2.0 -cl-finite-math-only "
2020-11-01 01:30:36 GeForce RTX 3080-0
2020-11-01 01:30:36 GeForce RTX 3080-0 OpenCL compilation in 0.01 s
2020-11-01 01:30:37 GeForce RTX 3080-0 77936867 OK 0 loaded: blockSize 400, 0000000000000003
2020-11-01 01:30:37 GeForce RTX 3080-0 validating proof residues for power 8
2020-11-01 01:30:37 GeForce RTX 3080-0 Proof using power 8
2020-11-01 01:30:40 GeForce RTX 3080-0 77936867 OK 800 0.00%; 1948 us/it; ETA 1d 18:11; 1579c241dc63eca6 (check 0.84s)
2020-11-01 01:37:16 GeForce RTX 3080-0 Stopping, please wait..
2020-11-01 01:37:17 GeForce RTX 3080-0 77936867 OK 200000 0.26%; 1991 us/it; ETA 1d 18:59; f0b04b45b0855bd2 (check 0.86s)
2020-11-01 01:37:17 GeForce RTX 3080-0 Exiting because "stop requested"
2020-11-01 01:37:17 GeForce RTX 3080-0 Bye
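As a rough cross-check of the numbers in that log, the sketch below pulls the exponent, iteration count and us/it out of the final progress line and recomputes the remaining time and the bits per word. The regular expression is fitted to this one log excerpt and is not a gpuowl specification; `remaining_seconds` and `fft_words` are hypothetical helper names. The relation FFT words = width x middle x height x 2 is an assumption inferred from the sizes quoted in this thread (1K:8:256 with 4M, 4K:9:512 with 36M), not taken from gpuowl documentation.

```python
import re

# Final progress line copied from the log above.
LINE = ("2020-11-01 01:37:17 GeForce RTX 3080-0 77936867 OK 200000 0.26%; "
        "1991 us/it; ETA 1d 18:59; f0b04b45b0855bd2 (check 0.86s)")

# Assumed pattern: "<exponent> OK <iterations done> <percent>%; <us/it> us/it; ETA".
PROGRESS = re.compile(r"(\d+) OK\s+(\d+)\s+[\d.]+%; (\d+) us/it; ETA")

def remaining_seconds(line):
    """Estimate remaining run time as (exponent - done) * us/it, or None if no match."""
    m = PROGRESS.search(line)
    if m is None:
        return None
    exponent, done, us_per_it = (int(g) for g in m.groups())
    return (exponent - done) * us_per_it / 1e6

def fft_words(spec):
    """Turn e.g. '1K:8:256' into a word count, assuming width * middle * height * 2."""
    width, middle, height = spec.split(":")
    width = int(width[:-1]) * 1024 if width.endswith("K") else int(width)
    return width * int(middle) * int(height) * 2

secs = int(remaining_seconds(LINE))
days, rest = divmod(secs, 86400)
hours, rest = divmod(rest, 3600)
print(f"ETA {days}d {hours:02d}:{rest // 60:02d}")           # ETA 1d 18:59
print(f"{77936867 / fft_words('1K:8:256'):.2f} bpw")        # 18.58 bpw
```

For the last progress line this reproduces the logged ETA and the 18.58 bpw figure. The ETA shown for the very first progress line differs by about a minute from this simple estimate, so treat it as an approximation of whatever gpuowl computes internally.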
#2561 |
"Oliver"
Mar 2005
Germany
111110 Posts |
some Quick&Dirty benchmarks:
#2562 |
"Eric"
Jan 2018
USA
22×53 Posts |
Thank you for the benchmark numbers. Very impressive performance from the A100; it scales almost 1:1 with Volta when comparing their memory bandwidth. I can't imagine the performance if the memory is overclocked.
OTOH, the 3090 is honestly a big disappointment: it's slower than a tuned Vega 64 (which draws a lot less power) and not much faster than the Turing RTX 8000. Looking forward to the performance of the 6900xt for sure, but I highly doubt it'll best the Radeon VII.
Last fiddled with by xx005fs on 2020-11-01 at 17:43
#2563 | |
Jun 2003
23·607 Posts |
Quote:
I'm hoping that it will achieve 90%+ of the R VII's performance. Of course, at $999, it is still too expensive, but the 6800 & 6800XT might be good value. All pure speculation currently, obviously.
Last fiddled with by axn on 2020-11-01 at 18:22