mersenneforum.org  

Old 2020-10-02, 01:00   #45
firejuggler
 
Apr 2010
Over the rainbow

4653₈ Posts

I have another question.
How will it work with cofactor PRP?
Would the line

'PRP=N/A,1,2,1841759,-1,"14734073,1694418281,5788466077109264959217"'
work with a low B1 of 10000?
Old 2020-10-02, 03:55   #46
kriesel
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

1001001111110₂ Posts

Quote:
Originally Posted by firejuggler View Post
I have another question.
How will it work with cofactor PRP?
Would the line

'PRP=N/A,1,2,1841759,-1,"14734073,1694418281,5788466077109264959217"'
work with a low B1 of 10000?
PRP cofactor tests are supported in prime95/mprime but not gpuowl.
Old 2020-10-02, 14:32   #47
storm5510
Random Account
 
Aug 2009
U.S.A.

3×563 Posts

Quote:
Originally Posted by kriesel View Post
PRP cofactor tests are supported in prime95/mprime but not gpuowl.
Perhaps they should be.
Old 2020-10-02, 16:22   #48
VBCurtis
 
"Curtis"
Feb 2005
Riverside, CA

2·5·449 Posts

Quote:
Originally Posted by storm5510 View Post
Perhaps they should be.
Why? Why spend the development effort for such a tiny and unimportant sub-task within this project?
Old 2020-10-02, 17:34   #49
storm5510
Random Account
 
Aug 2009
U.S.A.

3·563 Posts

Quote:
Originally Posted by VBCurtis View Post
Why? Why spend the development effort for such a tiny and unimportant sub-task within this project?
I run CF's because few others want to. On my fastest hardware, a single CF takes around 4 hours to complete with Prime95. Some of us do not have the option of having a 12 core, or higher, CPU. I considered building such a machine, but I opted for an RTX 2080 instead since the vast majority of what I do uses a GPU.

If CF's are "such a tiny and unimportant" task, then why have them at all?
Old 2020-10-02, 18:14   #50
kriesel
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

11176₈ Posts

Mihai gets to set Mihai's priorities. (Similar applies for George or anyone else.)

We can ask, beg, help, test, etc, but none of us have any claim on any of his time.
PRP Proof capability was correctly top priority because of the huge payoff.
Now it's on to reducing P-1 net cost.
LL proof per R Gerbicz' proposed method might be good someday, such as for proving that LL tests were correct for the next Mp found, or for the previous ones.

Among the other things gpuowl does not currently support that may interest some potential users are:
TF
ECM
shifts
PRP-CF
prp certs accepted by PrimeNet
Longer fft lengths
Proth numbers
Fermat numbers (Ernst may tackle this someday)
Wagstaff numbers
General repdigit numbers
APIs other than OpenCL
OpenCL 1.1/1.2 on old gpus
OS other than Windows or Linux
P-1 bounds extension
Substantial P-1 error detection/correction
UI or control flow changes (continue on to next worktodo item instead of quit on error)

v7.0 is dropping LL support. Some will want it back, preferably with Jacobi and shift.

Opinions will vary on which of the above to put toward the lower and least priority.
Those who write the code get to decide.

Last fiddled with by kriesel on 2020-10-02 at 18:32
Old 2020-10-02, 18:39   #51
kriesel
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

2×3²×263 Posts

Quote:
Originally Posted by storm5510 View Post
I run CF's because few others want to. On my fastest hardware, a single CF takes around 4 hours to complete with Prime95. Some of us do not have the option of having a 12 core, or higher, CPU. I considered building such a machine, but I opted for an RTX 2080 instead since the vast majority of what I do uses a GPU.

If CF's are "such a tiny and unimportant" task, then why have them at all?
To help the project of finding primes, run certs, run P-1, or run TF; those will help. PRP-CF does not. Four hours is not a long run. I have runs that take days, weeks, or months. Run some LLDC to help reduce the years of backlog.

Mainly though, run whatever makes you smile.
Old 2020-10-02, 21:56   #52
Uncwilly
6809 > 6502
 
"""""""""""""""""""
Aug 2003
101×103 Posts

2×5×7×127 Posts

Quote:
Originally Posted by storm5510 View Post
I run CF's because few others want to. On my fastest hardware, a single CF takes around 4 hours to complete with Prime95. Some of us do not have the option of having a 12 core, or higher, CPU.
I have a 4 core laptop that takes more time than that for a current range CF (it has a small L2 and runs at 1.6GHz). I do it, because it brings me joy. The machine also runs certs. I have Borged a couple of 4 core machines and have them running 2 DCs on 2 cores. I have one 6 core that is running DCs on 3 cores and P-1 on the other 3. I like seeing results roll in. Do what makes you happy.

CF does not help find the next prime, but it does help those who are trying to study the factorization of Mersenne numbers.
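
For readers new to the term, here is a minimal sketch of what a cofactor test checks: divide 2^p - 1 by its known factors and run a probable-prime test on what remains. It is only an illustration, assuming a plain Fermat test to base 3; the function name cofactor_is_prp is made up for this example, and prime95's actual PRP-CF residue types and error checking are not reproduced here.

```python
from math import prod


def cofactor_is_prp(p, known_factors, base=3):
    """Fermat probable-prime test on (2**p - 1) / product(known_factors).

    Hypothetical helper for illustration only; not prime95 or gpuowl code.
    """
    mersenne = (1 << p) - 1
    divisor = prod(known_factors)
    if mersenne % divisor != 0:
        raise ValueError("known_factors do not all divide 2**p - 1")
    cofactor = mersenne // divisor
    if cofactor == 1:
        raise ValueError("number is fully factored; no cofactor left")
    # A prime N coprime to the base satisfies base**(N-1) == 1 (mod N).
    # Passing only makes N a *probable* prime; failing proves N composite.
    return pow(base, cofactor - 1, cofactor) == 1


# 2**11 - 1 = 23 * 89: after removing 23 the cofactor 89 is prime.
print(cofactor_is_prp(11, [23]))
# 2**29 - 1 = 233 * 1103 * 2089: after removing 233 the cofactor is composite.
print(cofactor_is_prp(29, [233]))
```

For an exponent the size of the one asked about earlier in the thread, the cofactor has well over a million bits, which is why a single test takes hours on a CPU.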
Old 2020-10-03, 00:27   #53
storm5510
Random Account
 
Aug 2009
U.S.A.

3·563 Posts

Quote:
Originally Posted by kriesel View Post
To help the project of finding primes, run certs, run P-1, or run TF; those will help. PRP-CF does not. Four hours is not a long run. I have runs that take days, weeks, or months. Run some LLDC to help reduce the years of backlog.

Mainly though, run whatever makes you smile.
This would be wavefront TF's to 76 bits making them ready for P-1. GPUto72. What makes me smile is an even mix of what I like and something productive. LL-DC is a little bug that likes to crawl around in my ear as a reminder that it is there. Perhaps, actually doing something as opposed to thinking about how to do it, might satisfy the little bug regardless of how slow and laborious it may be.
Old 2020-10-06, 00:45   #54
preda
 
"Mihai Preda"
Apr 2015

10100110000₂ Posts

I'm back to internet access. I'll try to get 7.0 useful/ready for testing.
Old 2020-10-07, 08:11   #55
preda
 
"Mihai Preda"
Apr 2015

2460₈ Posts
Memlock

We know that for PRP, it's beneficial to run two GpuOwl processes per GPU, and this fits nicely with the very low memory requirement of the PRP.

But now in the merged PRP + P-1, there are two steps (P1, P2) that require a lot of memory. What to do? The simple solutions are: run only one process and give it the full memory of the GPU (-maxAlloc), or run two processes and give each 50% of RAM. Both solutions are somewhat suboptimal.

I attempted a different solution, let's call it "Memlock". Each process knows on which device it runs (that small number in -device 0). We can use -maxAlloc to allow each process to use (almost) the full RAM of one GPU. (E.g. for a 16GB GPU that is *not* running the monitor, I would use -maxAlloc 15G; if running the monitor, -maxAlloc 14G.) This is used in conjunction with -pool <dir>, which indicates a directory shared by all GpuOwl processes. Each process, when starting a "big memory" region (P1, P2), will attempt to acquire a memory lock on the device by creating a lock directory in the pool directory (e.g. /pool/memlock-1), and will wait if another process is already in a big memory region.

On normal exit the process will properly release the lock, but on crash the lock may need to be removed manually -- just delete that memlock-N directory (it's an empty directory BTW).
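
The locking scheme described above can be sketched roughly as follows. This is not GpuOwl's code (GpuOwl is written in C++); it is a Python illustration that assumes the lock is taken by creating the memlock-N directory, relying on directory creation being atomic on a local file system. The names memlock, pool_dir and poll_seconds are invented for the example.

```python
import os
import time
from contextlib import contextmanager


@contextmanager
def memlock(pool_dir, device, poll_seconds=1.0):
    """Hold a per-device lock for the duration of a big-memory stage.

    Illustrative sketch only. The lock is an empty directory named
    memlock-<device> inside the shared pool directory.
    """
    lock_path = os.path.join(pool_dir, f"memlock-{device}")
    while True:
        try:
            # mkdir fails if the directory already exists, so at most one
            # process can succeed; this is what makes it usable as a lock.
            os.mkdir(lock_path)
            break
        except FileExistsError:
            # Another process is in a big-memory region: wait and retry.
            time.sleep(poll_seconds)
    try:
        yield lock_path
    finally:
        # Normal exit (or a Python exception) releases the lock. A hard
        # crash or kill skips this, leaving the directory to be removed
        # by hand, as noted above.
        os.rmdir(lock_path)
```

A process would wrap its P1 or P2 stage in something like "with memlock(pool, 1):" and run its low-memory PRP work outside the block. On network file systems the atomicity assumption may not hold.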

Last fiddled with by preda on 2020-10-07 at 08:14

All times are UTC.


This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.