mersenneforum.org  

Old 2022-08-17, 18:06   #111
EdH
 
"Ed Hall"
Dec 2009
Adirondack Mtns

2·3·5·163 Posts

Quote:
Originally Posted by Plutie View Post
the easiest way would probably be running "msieve -nc2 -g 0" - if it outputs a line showing the VBITS value you compiled msieve with, then it's compiled properly.
Thanks, but to do that, I think -np1 would work better since I wouldn't need to create as many other files first. But I'd still need to look for a value (such as "using GPU") in the log. I was looking for a simple value check or existence check for a file, perhaps a .ptx.
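A minimal sketch of the log check mentioned above, assuming the run writes to a log file whose path you pass in. The function name `log_shows_gpu` is made up for illustration, and it only greps for the "using GPU" text quoted in the post:

```shell
# Succeeds (exit status 0) if the given msieve log contains a "using GPU" line.
# The log path is an assumption: pass whatever file your run actually writes.
log_shows_gpu() {
    grep -q "using GPU" "$1"
}

# Example (the file name msieve.log is assumed, not taken from the thread):
#   log_shows_gpu msieve.log && echo "GPU was used"
```

This still needs a run to have produced a log first, which is why a plain file-existence check may be the simpler test.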
Old 2022-08-17, 18:30   #112
Plutie
 
"Evan"
Dec 2020
Montreal

2²×3×7 Posts

ah, in that case - you can look for the lanczos_kernel.ptx file (or stage1_core.ptx)
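The file check suggested here could be sketched as follows. The two file names come from the post; the function name `has_gpu_kernels` and the directory it searches are assumptions, since the thread does not say where the build places the .ptx files:

```shell
# Succeeds if either GPU kernel file named in the thread exists in directory $1
# (default: current directory). The location of the .ptx files is an assumption;
# point it at wherever your build puts them.
has_gpu_kernels() {
    dir=${1:-.}
    [ -f "$dir/lanczos_kernel.ptx" ] || [ -f "$dir/stage1_core.ptx" ]
}

# Example:
#   has_gpu_kernels ./msieve_nfsathome && echo "GPU kernels present"
```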

Last fiddled with by Plutie on 2022-08-17 at 18:31 Reason: oops
Old 2022-08-17, 18:35   #113
EdH
 
"Ed Hall"
Dec 2009
Adirondack Mtns

2·3·5·163 Posts

Quote:
Originally Posted by Plutie View Post
ah, in that case - you can look for the lanczos_kernel.ptx file (or stage1_core.ptx)
Thanks! I'll work with that.
Old 2022-08-18, 13:14   #114
EdH
 
"Ed Hall"
Dec 2009
Adirondack Mtns

1001100011010₂ Posts

I'm too excited to keep this to myself. I finally have sufficient cooling for my M40 GPU and am running a c173 that is in LA on both the 40-thread machine and the GPU machine.

This is the 40-thread (40GB) machine at start of LA:
Code:
Wed Aug 17 22:53:22 2022  linear algebra at 0.0%, ETA 66h33m
and, current state (08:13):
Code:
linear algebra completed 2537146 of 16995095 dimensions (14.9%, ETA 53h31m)
Here is the GPU machine at start of LA:
Code:
Wed Aug 17 23:11:13 2022  linear algebra at 0.0%, ETA 24h39m
and, current state (08:13):
Code:
linear algebra completed 6241861 of 16995095 dimensions (36.7%, ETA 15h35m)
Here's a little extra from the GPU machine log:
Code:
Wed Aug 17 22:59:01 2022  using VBITS=256
Wed Aug 17 22:59:01 2022  skipping matrix build
Wed Aug 17 22:59:04 2022  matrix starts at (0, 0)
Wed Aug 17 22:59:07 2022  matrix is 16994916 x 16995095 (5214.6 MB) with weight 1611774956 (94.84/col)
Wed Aug 17 22:59:07 2022  sparse part has weight 1163046519 (68.43/col)
Wed Aug 17 22:59:07 2022  saving the first 240 matrix rows for later
Wed Aug 17 22:59:11 2022  matrix includes 256 packed rows
Wed Aug 17 22:59:16 2022  matrix is 16994676 x 16995095 (4829.9 MB) with weight 1060776224 (62.42/col)
Wed Aug 17 22:59:16 2022  sparse part has weight 994218947 (58.50/col)
Wed Aug 17 22:59:16 2022  using GPU 0 (Tesla M40 24GB)
Wed Aug 17 22:59:16 2022  selected card has CUDA arch 5.2
Wed Aug 17 23:10:30 2022  commencing Lanczos iteration
Wed Aug 17 23:10:31 2022  memory use: 11864.2 MB
The GPU is showing "12701MiB / 22945MiB" for its memory use, so I should be able to do some even larger numbers.
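As a rough sketch of the arithmetic behind the comparison above, the two starting ETAs can be converted to minutes and divided. The helper names `eta_to_minutes` and `eta_ratio` are made up for illustration, and the result compares msieve's initial estimates, not measured run times:

```shell
# Convert an msieve-style ETA such as "66h33m" to whole minutes.
# awk splits on "h" and "m", so "66h33m" yields fields 66 and 33.
eta_to_minutes() {
    printf '%s\n' "$1" | awk -F'[hm]' '{ print $1 * 60 + $2 }'
}

# Print how many times longer the first ETA is than the second, to two decimals.
eta_ratio() {
    LC_ALL=C awk -v a="$(eta_to_minutes "$1")" -v b="$(eta_to_minutes "$2")" \
        'BEGIN { printf "%.2f\n", a / b }'
}

# Example with the starting ETAs from the two logs:
#   eta_ratio 66h33m 24h39m
```

With the starting ETAs quoted above (3993 minutes against 1479 minutes), `eta_ratio 66h33m 24h39m` prints 2.70, so the GPU run was estimated to finish LA about 2.7 times faster than the 40-thread machine.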
Old 2022-08-19, 03:12   #115
LaurV
Romulan Interpreter
 
"name field"
Jun 2011
Thailand

2×3×5×337 Posts

Sorry, I didn't follow this thread very closely.

Are you saying that you do NFS completely on GPU? I mean, I knew poly could be done, and now I am reading about LA? How about sieving?
If so, where can I grab the exe and the "for dummies" tutorial?
Windows/Linux available? I may give it a try locally (where I run a few quite powerful AMD and Nvidia cards) or on Colab (where I have occasional access to P100, V100 and, if lucky, A100).
Old 2022-08-19, 03:56   #116
Plutie
 
"Evan"
Dec 2020
Montreal

2²·3·7 Posts

Quote:
Originally Posted by LaurV View Post
Sorry, I didn't follow this thread very closely.

Are you saying that you do NFS completely on GPU? I mean, I knew poly could be done, and now I am reading about LA? How about sieving?
If so, where can I grab the exe and the "for dummies" tutorial?
Windows/Linux available? I may give it a try locally (where I run a few quite powerful AMD and Nvidia cards) or on Colab (where I have occasional access to P100, V100 and, if lucky, A100).
currently, polyselect and LA can be done on GPU - sieving and filtering are still on CPU.

here's a quick guide for linux specifically, but I don't think the process will be too different on windows.

Quote:
find the compute capability of your GPU - can be found here.

compilation example here is for a GTX 1060 (CC 6.1)
Code:
git clone https://github.com/gchilders/msieve_nfsathome -b msieve-lacuda-nfsathome
cd msieve_nfsathome
make all CUDA=61 VBITS=256
once compiled, you can run both polyselect and LA just as you would with normal msieve, just add "-g (gpu_num)" to the command. you can lower the VBITS value to fit larger matrices onto GPU during LA, but at a performance penalty.
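The step from compute capability to the `CUDA=` value can be sketched like this, assuming the value is simply the capability with the dot removed, as in the GTX 1060 example (6.1 becomes 61). The function names are hypothetical, and `build_command` only prints the make line rather than running it:

```shell
# Turn a compute capability such as "6.1" into the value used for CUDA= ("61").
# Assumption: the make variable is just the capability without the dot.
cc_to_cuda() {
    printf '%s\n' "$1" | tr -d '.'
}

# Print (do not run) the make command from the guide.
# $1 = compute capability, $2 = VBITS (defaults to 256, as in the guide).
build_command() {
    printf 'make all CUDA=%s VBITS=%s\n' "$(cc_to_cuda "$1")" "${2:-256}"
}

# Example:
#   build_command 6.1
```

Here `build_command 6.1` prints `make all CUDA=61 VBITS=256`. By the same assumption, a card reporting CUDA arch 5.2 would use `CUDA=52`. After building, a GPU linear algebra run would add the GPU flag to the usual command, for example `msieve -nc2 -g 0`, using the options mentioned earlier in the thread.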
Old 2022-08-29, 01:57   #117
RichD
 
Sep 2008
Kansas

3×5²×7² Posts

I forgot to add -g 0 to the command line and it seemed to default to device 0. I did specify use_managed=1 so maybe that was enough to invoke the GPU. Then again, I may be using an earlier release.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.
