mersenneforum.org  

Go Back   mersenneforum.org > Factoring Projects > GMP-ECM

Reply
 
Thread Tools
Old 2022-03-17, 13:15   #518
EdH
 
EdH's Avatar
 
"Ed Hall"
Dec 2009
Adirondack Mtns

10101100100112 Posts
Default

Quote:
Originally Posted by SethTro View Post
I'm happy to look at this as a bug. I vaguely remember that I wasn't sure if I should always stop or only if the cofactor is composite.
Thanks, but I've moved to using a separate machine with ecm.py and it is working as needed.

I'd leave as is unless others are interested. For me it is no longer an issue.

Again, thank you for considering it.
EdH is offline   Reply With Quote
Old 2022-05-24, 17:34   #519
storm5510
Random Account
 
storm5510's Avatar
 
Aug 2009
Not U. + S.A.

24×173 Posts
Default

There seems to be an oddity about using a GPU in GMP-ECM.

The --help switch shows the maximum size of a B1 value is 2^1018. Using Windows Calculator to calculate log(2) * 1018 says this is a 308 digit value.

If I run this:

Code:
echo "2^14447-1" | gpuecm -gpu -maxmem 2048 5e6 10e6
I get this message:

Quote:
GMP-ECM 7.0.4-dev [configured with MPIR 2.7.2, --enable-gpu, --enable-openmp] [ECM]
Input number is 2^14447-1 (4349 digits)
GPU: Error, input number should be strictly lower than 2^1018
please report internal errors at <ecm-discuss@lists.gforge.inria.fr>.
B1 in my example is 5e6 or 5,000,000. This is obviously smaller than a 308 digit number.

I am missing something here and I cannot determine what it is.
storm5510 is offline   Reply With Quote
Old 2022-05-24, 17:43   #520
paulunderwood
 
paulunderwood's Avatar
 
Sep 2002
Database er0rr

2·5·467 Posts
Default

Quote:
Originally Posted by storm5510 View Post
There seems to be an oddity about using a GPU in GMP-ECM.

The --help switch shows the maximum size of a B1 value is 2^1018. Using Windows Calculator to calculate log(2) * 1018 says this is a 308 digit value.

If I run this:

Code:
echo "2^14447-1" | gpuecm -gpu -maxmem 2048 5e6 10e6
I get this message:

B1 in my example is 5e6 or 5,000,000. This is obviously smaller than a 308 digit number.

I am missing something here and I cannot determine what it is.
You are missing something. The inputted number is too big. You specified 2^14447-1. The max is 2^1018.

Last fiddled with by paulunderwood on 2022-05-24 at 17:43
paulunderwood is offline   Reply With Quote
Old 2022-05-24, 17:52   #521
SethTro
 
SethTro's Avatar
 
"Seth"
Apr 2019

17·29 Posts
Default

If you are willing to compile gmp-ecm from source you can test larger input numbers (it's also faster).

See https://www.mersenneforum.org/showthread.php?t=27103
SethTro is offline   Reply With Quote
Old 2022-05-25, 13:52   #522
storm5510
Random Account
 
storm5510's Avatar
 
Aug 2009
Not U. + S.A.

24·173 Posts
Default

Quote:
Originally Posted by paulunderwood View Post
You are missing something. The inputted number is too big. You specified 2^14447-1. The max is 2^1018.
Understood. IMHO, it is not good for much of anything at this size.
storm5510 is offline   Reply With Quote
Old 2023-02-20, 18:54   #523
kruoli
 
kruoli's Avatar
 
"Oliver"
Sep 2017
Porta Westfalica, DE

61816 Posts
Default

Is there any known progress in implementing stage 1 in (at least somewhat optimised) OpenCL up to 512 or maybe 1024 bits?
kruoli is offline   Reply With Quote
Old 2023-02-21, 06:49   #524
SethTro
 
SethTro's Avatar
 
"Seth"
Apr 2019

1111011012 Posts
Default

Quote:
Originally Posted by kruoli View Post
Is there any known progress in implementing stage 1 in (at least somewhat optimised) OpenCL up to 512 or maybe 1024 bits?
I don't know of anyone working on this. I did some quick google searches and didn't find any OpenCl arbitrary precision libraries. The old ECM gpu code was fairly straightforward grade school multiplication algorithm (IIRC) that could be adapted quickly for OpenCl, but if you/someone wanted competitive speeds from OpenCl more code would be needed for montgomery multiplication, various size kernels, etc.
SethTro is offline   Reply With Quote
Old 2023-02-21, 20:07   #525
storm5510
Random Account
 
storm5510's Avatar
 
Aug 2009
Not U. + S.A.

24×173 Posts
Default

Quote:
Originally Posted by SethTro View Post
I don't know of anyone working on this. I did some quick google searches and didn't find any OpenCl arbitrary precision libraries. The old ECM gpu code was fairly straightforward grade school multiplication algorithm (IIRC) that could be adapted quickly for OpenCl, but if you/someone wanted competitive speeds from OpenCl more code would be needed for montgomery multiplication, various size kernels, etc.
AFAIK, nothing meaningful has been done with GMP-ECM for a long time. I stopped using it several years ago. It was not worth the time required to run anything.

OpenCL appears not to work a GPU very hard. I have used gpuOwl a lot, so I know. It does a good job, but there is a limit to its GPU utilization. Around 60% seems to be its limit.
storm5510 is offline   Reply With Quote
Old 2023-02-21, 20:26   #526
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

59×131 Posts
Default

Quote:
Originally Posted by storm5510 View Post
OpenCL appears not to work a GPU very hard. I have used gpuOwl a lot, so I know. It does a good job, but there is a limit to its GPU utilization. Around 60% seems to be its limit.
That statement is not consistent with my experience. I routinely see 99-100% GPU load indicated in GPU-Z for Gpuowl running on Radeon VII, RX480, or RX550.

Last fiddled with by kriesel on 2023-02-21 at 20:32
kriesel is offline   Reply With Quote
Old 2023-02-22, 01:12   #527
storm5510
Random Account
 
storm5510's Avatar
 
Aug 2009
Not U. + S.A.

24·173 Posts
Default

Quote:
Originally Posted by kriesel View Post
That statement is not consistent with my experience. I routinely see 99-100% GPU load indicated in GPU-Z for Gpuowl running on Radeon VII, RX480, or RX550.
I have no experience with AMD GPU's. All of mine, past and present, are Nvidia based. Something tells me that AMD variations are much better suited to run OpenCL than Nvidia models are. CUDA is much better for Nvidia, it seems. mfaktc cranks out over 3,000 Ghz-d/day on my ever-older RTX-2080. I am satisfied with that.
storm5510 is offline   Reply With Quote
Old 2023-02-22, 01:27   #528
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

1E3116 Posts
Default

NVIDIA quadro k620, gpuowl v6.11-382, gpu-Z v2.52.0, indicated gpu load 100%.
Most recent NVIDIA consumer GPUs have poor DP performance (PRP, LL, P-1) relative to SP performance (TF), while Teslas and AMD typically have decent DP performance ratios. I almost never use RTX2080 or GTX 1650 for anything other than TF because of the 32:1 SP/DP ratio.

Last fiddled with by kriesel on 2023-02-22 at 01:29
kriesel is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Running CUDA on non-Nvidia GPUs Rodrigo GPU Computing 3 2016-05-17 05:43
Error in GMP-ECM 6.4.3 and latest svn ATH GMP-ECM 10 2012-07-29 17:15
latest SVN 1677 ATH GMP-ECM 7 2012-01-07 18:34
Has anyone seen my latest treatise? davieddy Lounge 0 2011-01-21 19:29
Latest version? [CZ]Pegas Software 3 2002-08-23 17:05

All times are UTC. The time now is 12:05.


Thu Jun 8 12:05:51 UTC 2023 up 294 days, 9:34, 0 users, load averages: 0.65, 0.95, 0.95

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔