mersenneforum.org  

2021-03-08, 21:58   #3466
lolapus
Jun 2020
2³ Posts

Quote:
Originally Posted by Dylan14
The pkgbuild for mfaktc has been updated. A new CUDA version, 11.2, came out, which requires a new driver that has not been fully released (it is in beta). Hence the dependencies have been updated.
For anyone running the beta driver (presently 460.27.04), here is a mfaktc executable with CUDA 11.2 support on Linux.

I know I asked this a while ago, but would you be able to compile mfaktc for CUDA 11.2 on Windows so I can use my 3090 to do TF instead of PRP? I'm using my own desktop PC, so installing another OS isn't an option for me.

2021-03-10, 13:36   #3467
lolapus
Jun 2020
2³ Posts

OK, after a few hours of tinkering with CUDA 11.2 and Visual Studio 2012, I managed to get a 64-bit CUDA 11.2 build of mfaktc. I'm not sure if I'm supposed to include a DLL for CUDA 11.2 with the build, so let me know if I should. Attached is the build with the readme, changelog, copying, ini, and worktodo files. Hopefully this helps out a few people with RTX 3000-series cards on Windows.
Attached Files
File Type: zip mfaktc-0.2.1.win.cuda11.2.zip (233.7 KB, 30 views)

2021-03-10, 17:33   #3468
kriesel
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
11736₈ Posts

Quote:
Originally Posted by lolapus
OK, after a few hours of tinkering with CUDA 11.2 and Visual Studio 2012, I managed to get a 64-bit CUDA 11.2 build of mfaktc.

Is that v0.21 with the GPUSieveSize max extended to 2047M?

2021-03-10, 23:04   #3469
lolapus
Jun 2020
2³ Posts

Quote:
Originally Posted by kriesel
Is that v0.21 with the GPUSieveSize max extended to 2047M?

When I run mfaktc it reports a GPUSieveSize of 64Mi bits, but I think you can change that in the ini file. I didn't change anything in the source except the locations of the CUDA and Visual Studio build tools, so I guess the answer is no.
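
For anyone who wants to script that ini change, here is a minimal Python sketch. It assumes mfaktc.ini is a flat list of Key=Value lines with # comments and that the key is spelled GPUSieveSize, as in mfaktc's startup output. The helper set_ini_value, the sample text, and the value 128 are hypothetical illustrations, not part of mfaktc, and the largest value a given build accepts is not something this sketch knows.

```python
def set_ini_value(ini_text: str, key: str, value: str) -> str:
    """Return ini_text with key=value set, replacing an existing line or appending one."""
    lines = ini_text.splitlines()
    replaced = False
    for i, line in enumerate(lines):
        stripped = line.strip()
        # Assumption: blank lines and lines starting with '#' carry no settings.
        if not stripped or stripped.startswith("#"):
            continue
        name, sep, _ = stripped.partition("=")
        if sep and name.strip() == key:
            lines[i] = f"{key}={value}"
            replaced = True
    if not replaced:
        # Key was not present, so add it at the end.
        lines.append(f"{key}={value}")
    return "\n".join(lines) + "\n"


# Hypothetical excerpt, not the real shipped mfaktc.ini.
sample = "# hypothetical excerpt\nGPUSievePrimes=82486\nGPUSieveSize=64\n"
updated = set_ini_value(sample, "GPUSieveSize", "128")
print(updated)
```

The function works on text rather than a file path so it can be tried on a copy before touching a real ini file.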

2021-03-11, 01:12   #3470
lolapus
Jun 2020
8₁₀ Posts

I recompiled mfaktc for CUDA 11.2 on Windows with the GPUSieveSize maximum increased to 2047, and the value in the ini file is now set to 2047. I did see significant performance improvements and may try increasing the value even further in the future.
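
A quick arithmetic check on what these GPUSieveSize values mean in memory terms, and one plausible reason the extended limit is 2047 rather than 2048. That the sieve size in bits is held in a signed 32-bit integer is an assumption made for this sketch; nothing in this thread confirms it.

```python
MI = 1 << 20            # 1 Mi = 2**20
INT32_MAX = 2**31 - 1   # largest value of a signed 32-bit integer


def sieve_bits(size_mi: int) -> int:
    """Number of sieve bits for a GPUSieveSize given in Mi bits."""
    return size_mi * MI


def fits_int32(size_mi: int) -> bool:
    """True if the bit count fits in a signed 32-bit integer (assumed storage type)."""
    return sieve_bits(size_mi) <= INT32_MAX


for size in (64, 2047, 2048):
    bits = sieve_bits(size)
    mib = bits / 8 / MI  # bits -> bytes -> MiB
    print(f"GPUSieveSize={size}: {bits} bits, {mib} MiB, fits in int32: {fits_int32(size)}")
```

Under that assumption, going beyond 2047 would need wider integer types in the source and not just a larger limit.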
Attached Files
File Type: zip mfaktc-0.2.1.win.cuda11.2.zip (233.9 KB, 35 views)

2021-03-11, 02:40   #3471
joejoefla
May 2019
2×3 Posts

I tried to run the program on my machine. I have NVIDIA driver 461.72 and installed the cudart64_110.dll file from the CUDA toolkit. I'm getting this error:

Quote:
mfaktc v0.21 (64bit built)

Compiletime options
THREADS_PER_BLOCK 256
SIEVE_SIZE_LIMIT 32kiB
SIEVE_SIZE 193154bits
SIEVE_SPLIT 250
MORE_CLASSES enabled

Runtime options
SievePrimes 25000
SievePrimesAdjust 1
SievePrimesMin 5000
SievePrimesMax 100000
NumStreams 3
CPUStreams 3
GridSize 3
GPU Sieving enabled
GPUSievePrimes 82486
GPUSieveSize 2047Mi bits
GPUSieveProcessSize 16Ki bits
Checkpoints enabled
CheckpointDelay 30s
WorkFileAddDelay 600s
Stages enabled
StopAfterFactor bitlevel
PrintMode full
V5UserID (none)
ComputerID (none)
AllowSleep no
TimeStampInResults no

CUDA version info
binary compiled for CUDA 11.20
CUDA runtime version 11.20
CUDA driver version 11.20

CUDA device info
name GeForce GTX 1080
compute capability 6.1
max threads per block 1024
max shared memory per MP 98304 byte
number of multiprocessors 20
clock rate (CUDA cores) 1733MHz
memory clock rate: 5005MHz
memory bus width: 256 bit

Automatic parameters
threads per grid 655360
GPUSievePrimes (adjusted) 82486
GPUsieve minimum exponent 1055144

running a simple selftest...
ERROR: cudaGetLastError() returned 209: no kernel image is available for execution on the device

Last fiddled with by joejoefla on 2021-03-11 at 02:41

2021-03-11, 03:02   #3472
James Heinrich
"James Heinrich"
May 2004
ex-Northern Ontario
2²·839 Posts

Quote:
Originally Posted by joejoefla
CUDA device info
name GeForce GTX 1080
compute capability 6.1

I think the above build is targeted at RTX 30x0 hardware only. I don't think you'd see any improvement over the previous 2047-enabled builds.
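
For context on the error in the quoted log: "no kernel image is available for execution on the device" is what the CUDA runtime reports when the binary contains no code the GPU can run. A rough sketch of the compatibility rule follows. Compiled (SASS) code for compute capability X.Y runs on devices with the same major version X and a minor version of at least Y, while embedded PTX for X.Y can be JIT-compiled on any device of capability X.Y or higher. The target list is an assumption about what an RTX 30x0-only build might contain (8.6 is the RTX 3080/3090 compute capability); the actual contents of the attached build are not stated in this thread, and can_run is a hypothetical helper.

```python
def can_run(device_cc, sass_targets, ptx_targets=()):
    """Return True if a binary with the given targets can execute on device_cc.

    device_cc and every target are (major, minor) compute capability tuples.
    """
    dev_major, dev_minor = device_cc
    # SASS (machine code) is only compatible within the same major version,
    # and only on devices whose minor version is at least the target's.
    for major, minor in sass_targets:
        if major == dev_major and minor <= dev_minor:
            return True
    # PTX is forward compatible: the driver can JIT-compile it for any
    # device whose compute capability is at least the PTX version.
    for cc in ptx_targets:
        if tuple(cc) <= (dev_major, dev_minor):
            return True
    return False


GTX_1080 = (6, 1)   # from the quoted log: compute capability 6.1
RTX_3080 = (8, 6)

ampere_only = [(8, 6)]  # assumption: build contains only sm_86 code

print(can_run(GTX_1080, ampere_only))  # False
print(can_run(RTX_3080, ampere_only))  # True
```

A build meant to cover both cards would need an additional target that a 6.1 device can use, either SASS for 6.0/6.1 or PTX at 6.1 or lower.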

2021-03-11, 03:54   #3473
joejoefla
May 2019
2·3 Posts

Quote:
Originally Posted by James Heinrich
I think the above build is targeted at RTX 30x0 hardware only. I don't think you'd see any improvement over the previous 2047-enabled builds.

Sorry, my bad. I'll switch back.

2021-03-11, 16:51   #3474
ZacHFX
Mar 2017
Halifax, NS
10₈ Posts

Quote:
Originally Posted by lolapus
I recompiled mfaktc for CUDA 11.2 on Windows with the GPUSieveSize maximum increased to 2047, and the value in the ini file is now set to 2047. I did see significant performance improvements and may try increasing the value even further in the future.

Just got this up and running on my 3080. Sent benchmarks to James. Let me know if there's something specific I can do to assist.

2021-03-11, 17:01   #3475
James Heinrich
"James Heinrich"
May 2004
ex-Northern Ontario
2²×839 Posts

Quote:
Originally Posted by ZacHFX
Just got this up and running on my 3080. Sent benchmarks to James. Let me know if there's something specific I can do to assist.

Thanks! 6 months after launch you're the first person to submit a 3080 benchmark...

2021-03-11, 17:07   #3476
ZacHFX
Mar 2017
Halifax, NS
2³ Posts

Quote:
Originally Posted by James Heinrich
Thanks! 6 months after launch you're the first person to submit a 3080 benchmark...

Mining ether is more profitable than factoring, sadly.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.