mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2020-10-20, 12:34   #3378
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

2·37·43 Posts
Default

Tangential to this subject, I just mention for completeness: the credit given by PrimeNet and the credit displayed on mersenne.ca will likely differ when a factor is found (and bit level not completed), since primenet assumes the factor was found with prime95 (or something using an equivalent number of classes), mersenne.ca assumes the factor was found with mfaktc. On average it all balances, but specific factor "credit" may differ slightly.
James Heinrich is offline   Reply With Quote
Old 2020-10-20, 12:39   #3379
axn
 
axn's Avatar
 
Jun 2003

112528 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
since primenet assumes the factor was found with prime95
Can't it be made to assume otherwise?
axn is online now   Reply With Quote
Old 2020-10-20, 12:41   #3380
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

2·37·43 Posts
Default

Quote:
Originally Posted by axn View Post
Can't it be made to assume otherwise?
It likely can, but I'll need to tread carefully with that code. I'll see how possible it is (especially since my previous fix for full-bitrange factors doesn't appear to have worked as intended).

Last fiddled with by James Heinrich on 2020-10-20 at 12:41
James Heinrich is offline   Reply With Quote
Old 2020-10-21, 06:52   #3381
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
Jun 2011
Thailand

893210 Posts
Default

Quote:
Originally Posted by kriesel View Post
edit: But interestingly,<link> shows full credit
[offtopic]
link is pointless, that's your private page nobody can see, therefore please provide us your password so we can have a look..
[/offtopic]
[edit: that was a joke, don't PM me your password ]

Last fiddled with by LaurV on 2020-10-21 at 06:53 Reason: spacing
LaurV is offline   Reply With Quote
Old 2020-10-21, 07:59   #3382
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

10010011001112 Posts
Default

Compare other recent 74-75 bit factor-found credits given, to the most recent one, and 75-76 to the proper 67.1 GhzD credit for completing the bit level. Note that had they been performed with StopAfterFactor=2 (finish the class), reduced credit would have been appropriate, and the listings here would include asterisks. These TF were all done with StopAfterFactor=1 (finish the bit level), so no asterisks.
The difference can be over 90% credit loss.
The difference can be rather significant on 80-86 bit final level on large exponents.
Attached Thumbnails
Click image for larger version

Name:	tfcredit.png
Views:	26
Size:	209.1 KB
ID:	23579   Click image for larger version

Name:	tfcredit84.png
Views:	23
Size:	220.4 KB
ID:	23582  

Last fiddled with by kriesel on 2020-10-21 at 08:21
kriesel is online now   Reply With Quote
Old 2020-10-21, 08:26   #3383
Neutron3529
 
Neutron3529's Avatar
 
Dec 2018
China

1010002 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
A NF result is probably what I'm looking for. Ideally I'd want to know the clockspeed the GPU was running at during the run as well, but a completed NF run is a good start.
https://www.mersenne.ca/mfaktc.php#benchmark
I bought a RTX 3090.
Here's my results
results.txt
(only the last result is uploaded)

I got ~5500 GHz-d/day rather than ~5200
The exactly GPU I bought is `GeForce RTX 3090 VENTUS 3X 24G OC`, which could easily reach fan speed 67% and temperature 75C (with P2 348W / 350W)
a normal gpu should be ~70C. Thus I do not recomment buying that GPU even it is faster.
will test gpuowl after current progress finished.
Neutron3529 is offline   Reply With Quote
Old 2020-10-21, 10:01   #3384
Neutron3529
 
Neutron3529's Avatar
 
Dec 2018
China

508 Posts
Default

Quote:
Originally Posted by moebius View Post
Please make a short gpuowl benchmark with the exponent 77936867, so that we can directly compare the values ​​of the graphics cards, thank you.
https://mersenneforum.org/showthread.php?p=558317#post558317
you're welcome.

I post the first 300k iters.
A strange thing is that my GPU does not reach 350W power limit(but reach 1965Mhz which is ~200 Mhz higher than it is in mfaktc.)
Neutron3529 is offline   Reply With Quote
Old 2020-10-21, 12:07   #3385
Viliam Furik
 
Jul 2018
Martin, Slovakia

FF16 Posts
Default

Quote:
Originally Posted by Neutron3529 View Post
I bought a RTX 3090.
Here's my results
Attachment 23580
(only the last result is uploaded)

I got ~5500 GHz-d/day rather than ~5200
The exactly GPU I bought is `GeForce RTX 3090 VENTUS 3X 24G OC`, which could easily reach fan speed 67% and temperature 75C (with P2 348W / 350W)
a normal gpu should be ~70C. Thus I do not recomment buying that GPU even it is faster.
will test gpuowl after current progress finished.
Why so poor result? I get about 4900 GHzD/D with the same work. It should be at least 10000 GHzD/D for the 3090, no? It has more than double the FP32 throughput.

It is most probably one of these two reasons:
1. The shared INT32 and FP32 cores don't play nicely with mfaktc - either incompatible code or the cores not fulfilling their promise
2. Memory bottleneck

Either way, I am not satisfied with the result.
Viliam Furik is offline   Reply With Quote
Old 2020-10-21, 13:27   #3386
Neutron3529
 
Neutron3529's Avatar
 
Dec 2018
China

23×5 Posts
Default

Quote:
Originally Posted by Viliam Furik View Post
Why so poor result?
I tried some machine learning program(mxnet), find it is no different to switch the cuda architecture from sm_80 to sm_86, which should have a 2x boost.


maybe the current cuda implementation does not really works for sm_86, maybe cuda 11.2 would help.
Neutron3529 is offline   Reply With Quote
Old 2020-10-21, 14:32   #3387
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

2×37×43 Posts
Default

Quote:
Originally Posted by Neutron3529 View Post
I bought a RTX 3090. I got ~5500 GHz-d/day
I'd be curious to know what kind of combined throughput you get when running two instances of mfaktc simultaneously?
James Heinrich is offline   Reply With Quote
Old 2020-10-21, 15:04   #3388
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

1100011011102 Posts
Default

Quote:
Originally Posted by kriesel View Post
These TF were all done with StopAfterFactor=1 (finish the bit level)
I've looked at the code again and clearly I'm missing something because I think it should be working as I intended (but clearly it isn't). It's also difficult to test because that section of code will only get processed when a new factor is submitted (my logic works fine in my test environment, but something different is happening on the server). I have added a couple of debug lines that might help me track down the problem, if you see them next time you (collective "you", anyone reading this) submit a factor please email me either a copy-paste or screenshot of the output.
James Heinrich is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
mfakto: an OpenCL program for Mersenne prefactoring Bdot GPU Computing 1657 2020-10-27 01:23
The P-1 factoring CUDA program firejuggler GPU Computing 752 2020-09-08 16:15
"CUDA runtime version 0.0" when running mfaktc.exe froderik GPU Computing 4 2016-10-30 15:29
World's second-dumbest CUDA program fivemack Programming 112 2015-02-12 22:51
World's dumbest CUDA program? xilman Programming 1 2009-11-16 10:26

All times are UTC. The time now is 03:52.

Fri Nov 27 03:52:59 UTC 2020 up 78 days, 1:03, 4 users, load averages: 1.54, 1.31, 1.42

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2020, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.