mersenneforum.org  

Go Back   mersenneforum.org > Search Forums

Showing results 1 to 25 of 1000
Search took 0.14 seconds.
Search: Posts Made By: preda
Forum: GPU Computing 2021-04-15, 17:48
Replies: 1
Views: 163
Posted By preda
From what I understand, OpenCL 3.0 is closer to...

From what I understand, OpenCL 3.0 is closer to OpenCL 1.x than to OpenCL 2.0. I.e. 3.0 is not "more" than 2.0, but instead it reduces the mandatory feature-set to the level of 1.x and offers...
Forum: GpuOwl 2021-04-03, 17:12
Replies: 16
Views: 810
Posted By preda
Nice, I like it! The owl has a foxy look :)

Nice, I like it! The owl has a foxy look :)
Forum: Information & Answers 2021-03-31, 11:55
Replies: 7
Views: 338
Posted By preda
I guess it's because of the "chmod 777...

I guess it's because of the "chmod 777 expand.py". Do you need that? (expand.py already has rights 775)
Forum: Hardware 2021-03-29, 07:43
Replies: 16
Views: 738
Posted By preda
The need for the general-MUL vs. MUL-3 only...

The need for the general-MUL vs. MUL-3 only appears when changing the "L" step dinamically during a test. This is something GpuOwl does not support (and thus gets away with using MUL-3), but prime95...
Forum: GpuOwl 2021-03-28, 18:58
Replies: 82
Views: 10,355
Posted By preda
The multiplication time was excessive before the...

The multiplication time was excessive before the restart. One possible cause would be the GPU RAM becoming over-allocated for some reason, which would slow everything down a lot.

If you catch it...
Forum: Hardware 2021-03-21, 20:02
Replies: 16
Views: 738
Posted By preda
A small advantage of "b" being fixed is that, in...

A small advantage of "b" being fixed is that, in the GEC verification, we have a multiplication by "3" (the PRP base). When "b" is variable, this multiplication must be changed to a general...
Forum: GpuOwl 2021-03-12, 20:31
Replies: 16
Views: 810
Posted By preda
From the project settings on github: ...

From the project settings on github:


Although I'm not exactly sure what "Social preview" entails, it certainly requires a huge image.
Forum: GpuOwl 2021-03-12, 08:56
Replies: 82
Views: 10,355
Posted By preda
I recently got a Radeon VII with Samsung memory...

I recently got a Radeon VII with Samsung memory (as as RMA replacement). Even without any RAM overclock, and without any undervolt, that memory consistently generates errors. This is in contrast with...
Forum: GpuOwl 2021-03-12, 08:52
Replies: 16
Views: 810
Posted By preda
Nice, I like it. I looked about including it on...

Nice, I like it. I looked about including it on the project page on github, but they recommend a much larger image e.g. https://github.com/preda/gpuowl/settings/og-template
Forum: GpuOwl 2021-03-10, 19:43
Replies: 82
Views: 10,355
Posted By preda
As a practical approach, I would suggest reducing...

As a practical approach, I would suggest reducing the P1 bounds on the GPUs with these errors (e.g. at the wavefront, B1=2M and B2=40M). Would be interesting to know if these GPUs, booted under ROCm,...
Forum: GpuOwl 2021-03-10, 17:28
Replies: 82
Views: 10,355
Posted By preda
I don't know what is causing the GPU->Host read...

I don't know what is causing the GPU->Host read errors. What I'm most concerned with is whether GpuOwl is functioning correctly or not. I.e. whether you have some indication or suspicion that the...
Forum: mersenne.ca 2021-03-02, 19:04
Replies: 597
Sticky: mersenne.ca
Views: 67,686
Posted By preda
The most recent GpuOwl's P-1 calculator is here: ...

The most recent GpuOwl's P-1 calculator is here:

https://github.com/preda/gpuowl/blob/master/pm1/pm1.cpp

I dropped the equivalent python version because it was becoming laborious to maintain...
Forum: Software 2021-02-20, 08:48
Replies: 39
Views: 6,396
Posted By preda
3xSP sum()

Unfortunately the sum() I have up to now is a beast: 54 ADDs.


This seems a rather very expensive sum()..

To see some corner-cases that sum() must handle, here is one example: given "x", we'd...
Forum: GPU Computing 2021-02-20, 08:42
Replies: 10
Views: 1,148
Posted By preda
How user-unfriendly is that! having the guts to...

How user-unfriendly is that! having the guts to say it out loud: we (Nvidia) get to decide what you use your GPU for. I must ask, what about watching porn on Nvidia GPUs, is that allowed by the...
Forum: Software 2021-02-12, 21:02
Replies: 39
Views: 6,396
Posted By preda
Figure 10 seems to indicate: c0,e0 =...

Figure 10 seems to indicate:

c0,e0 = twoSum(a0, b0)

d1,e11 = twoSum(a1, b1)
c1,e12 = twoSum(d1, e0)

c2 = a2 + b2 + e11 + e12

which looks pretty good (i.e. simpler than I was expecting)
Forum: Software 2021-02-12, 19:45
Replies: 39
Views: 6,396
Posted By preda
Funnily, I'm struggling even with implementing a...

Funnily, I'm struggling even with implementing a 3xSP ADD :).

Here is a good paper with the solution for quad-double:
https://web.mit.edu/tabbott/Public/quaddouble-debian/qd-2.3.4-old/docs/qd.pdf...
Forum: Software 2021-02-12, 08:42
Replies: 39
Views: 6,396
Posted By preda
SP plan

I've been thinking some more about a practical SP FFT implementation on GPUs, and here are some problems/ideas:

1. FFT twiddles, i.e. the trigonometric constants (sin+cos) used in the FFT.
...
Forum: GpuOwl 2021-02-06, 20:04
Replies: 48
Views: 6,974
Posted By preda
For what it's worth, on R7 I'm personally running...

For what it's worth, on R7 I'm personally running with B1=9M, B2=180M for 102M-103M exponents (factored to 76bits).
Forum: GpuOwl 2021-02-04, 06:16
Replies: 48
Views: 6,974
Posted By preda
The program also allows to specify a fixed B1 or...

The program also allows to specify a fixed B1 or B2, and in that situation displays options for the other bound. Examples below with fixed B1=1M, or fixed B2=50M (note, the values below are good with...
Forum: GpuOwl 2021-02-03, 21:40
Replies: 48
Views: 6,974
Posted By preda
GpuOwl updated P-1 calculator

Hi, recently I revisited the P-1 calculator that's included with GpuOwl's source code https://github.com/preda/gpuowl/blob/master/pm1/pm1.cpp

The calculator is a small stanalone C++ program; to...
Forum: GPU Computing 2021-01-17, 16:42
Replies: 21
Views: 1,699
Posted By preda
I would recommend to get a 850W or at least 750W...

I would recommend to get a 850W or at least 750W PSU, Gold 80+, and modular or semi-modular. Maybe read some reviews of the model before buying. The reason is: you have some power headroom (to 850W),...
Forum: Hardware 2021-01-05, 10:12
Replies: 128
Views: 12,134
Posted By preda
The cache (L1/L2/L3) is used transparently for...

The cache (L1/L2/L3) is used transparently for the *global* memory operations. It is managed automatically by the cache control (probably a variant of LRU), not explicitly by the software. So yes,...
Forum: GpuOwl 2020-12-13, 04:31
Replies: 199
Views: 17,959
Posted By preda
I'm personally not using 6.x myself, thus I don't...

I'm personally not using 6.x myself, thus I don't have a lot of motivation to improve it. From my POV, 7.x is now better in a couple of ways than 6.x, and I prefer to focus my (limited) resources on...
Forum: GpuOwl 2020-12-06, 22:58
Replies: 2,695
Views: 243,160
Posted By preda
GpuOwl uses very very little PCIe bandwidth. I...

GpuOwl uses very very little PCIe bandwidth. I regularly run it over PCIe x1 Gen1 without significant slowdown.
Forum: Hardware 2020-12-06, 22:54
Replies: 12
Views: 1,504
Posted By preda
Hardware donation and possible meet-ups

I may have some extra hardware located in Sydney (Australia), I'm considering donating it based on GIMPS participation. Is somebody [else] from the forum living in Sydney?

Now that I think of it,...
Showing results 1 to 25 of 1000

 
All times are UTC. The time now is 20:01.

Sat Apr 17 20:01:53 UTC 2021 up 9 days, 14:42, 0 users, load averages: 1.59, 1.96, 1.87

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.