mersenneforum.org  

Go Back   mersenneforum.org > Factoring Projects > Msieve

Reply
 
Thread Tools
Old 2022-05-09, 22:08   #100
frmky
 
frmky's Avatar
 
Jul 2003
So Cal

23×5×61 Posts
Default

If nvidia-smi doesn't see it, then msieve won't. Perhaps you need to reinstall the CUDA driver with the M40 installed?
frmky is offline   Reply With Quote
Old 2022-05-10, 01:22   #101
EdH
 
EdH's Avatar
 
"Ed Hall"
Dec 2009
Adirondack Mtns

111558 Posts
Default

Quote:
Originally Posted by frmky View Post
If nvidia-smi doesn't see it, then msieve won't. Perhaps you need to reinstall the CUDA driver with the M40 installed?
Reinstalled driver and CUDA in different variations and no joy. The computer says it's there, but CUDA says it isn't. I put the K20Xm back in and it sees it every time. Both are PCIEx16 v3.0.

Giving up for now. . .

ETA: Msieve compiled with 5.2, but couldn't find the cqard, as expected.

Thanks for the help.

Last fiddled with by EdH on 2022-05-10 at 01:23
EdH is offline   Reply With Quote
Old 2022-05-10, 17:52   #102
EdH
 
EdH's Avatar
 
"Ed Hall"
Dec 2009
Adirondack Mtns

53·89 Posts
Default

I guess I have found my answer for the M40:
Code:
[ 1562.849818] NVRM: This PCI I/O region assigned to your NVIDIA device is invalid:
               NVRM: BAR1 is 0M @ 0x0 (PCI:0000:01:00.0)
[ 1562.849819] NVRM: The system BIOS may have misconfigured your GPU.
[ 1562.849824] nvidia: probe of 0000:01:00.0 failed with error -1
[ 1562.849839] NVRM: The NVIDIA probe routine failed for 1 device(s).
[ 1562.849840] NVRM: None of the NVIDIA devices were initialized.
And, no newer BIOS updates addressing any PCI issues.
EdH is offline   Reply With Quote
Old 2022-05-10, 18:13   #103
frmky
 
frmky's Avatar
 
Jul 2003
So Cal

23·5·61 Posts
Default

That's a BIOS issue. Google says to look for options deep in the BIOS menus like PCI Express 64-bit BAR Support, large BARs, or above 4G decoding.
frmky is offline   Reply With Quote
Old 2022-05-10, 20:45   #104
EdH
 
EdH's Avatar
 
"Ed Hall"
Dec 2009
Adirondack Mtns

53×89 Posts
Default

Quote:
Originally Posted by frmky View Post
That's a BIOS issue. Google says to look for options deep in the BIOS menus like PCI Express 64-bit BAR Support, large BARs, or above 4G decoding.
Thank you for all the help with everythig. I do appreciate it, but I'm going to leave it sit for now. I did search the BIOS and all I found were two things: a Robust Graphics Booster with Auto/Fast/Turbo setting, for which there is a red message (for all three settings), "Warning: VGA Graphics card is not guaranteed to operate normally," and a PCIE frequency adjustment with a warning about setting it above 100MHz. The messages are displayed for the K20Xm as well. I guess I should consider myself lucky that one works.

Thank you, again, for all your help.
EdH is offline   Reply With Quote
Old 2022-07-15, 14:38   #105
EdH
 
EdH's Avatar
 
"Ed Hall"
Dec 2009
Adirondack Mtns

111558 Posts
Default

A small follow-up:

I now have the Tesla M40 24GB running and am quite pleased. But, there is room for improvement. It is throttling due to insufficient cooling. It gets to 87C and cuts its processing. I have a push fan and a pull fan, but the throughput is just not there. I will have to pursue an alternate method. Would hate to wait until winter to get the full capability.
EdH is offline   Reply With Quote
Old 2022-07-16, 22:51   #106
RichD
 
RichD's Avatar
 
Sep 2008
Kansas

3×1,213 Posts
Default

Quote:
Originally Posted by frmky View Post
Yep. With the managed memory option, the program stores portions of the sparse matrix blocks in main memory if necessary and moves them to the GPU when they are needed in each iteration. This significantly increases traffic on the PCIe bus. The GPU spends much more time waiting for data, but it can still be faster than running on the CPU.
I am thinking of tackling a much larger job where the matrix might be 5-6 times the GPU memory I have on a GTX 1660 (6GB) card. I know it helps on smaller jobs where the memory requirements are less than 2X. Would it better to utilize the GPU or just go for it and report my results here? (Using use_managed=1)
RichD is offline   Reply With Quote
Old 2022-07-16, 22:58   #107
frmky
 
frmky's Avatar
 
Jul 2003
So Cal

23×5×61 Posts
Default

There's a good chance that won't work. The vectors are always kept on the card and may take most of the GPU memory, leaving little for the matrix blocks and spmv scratch space. Nothing beats experiment, though, so give it a try and see what happens.

Last fiddled with by frmky on 2022-07-16 at 22:59
frmky is offline   Reply With Quote
Old 2022-07-31, 01:54   #108
RichD
 
RichD's Avatar
 
Sep 2008
Kansas

1110001101112 Posts
Default

Quote:
Originally Posted by frmky View Post
There's a good chance that won't work. The vectors are always kept on the card and may take most of the GPU memory, leaving little for the matrix blocks and spmv scratch space. Nothing beats experiment, though, so give it a try and see what happens.
Attempting a ridiculous LA with the matrix needing more than five times the GPU memory, even trying with use-managed=1, was a no-go as expected.
Code:
matrix is 33782739 x 33783144 (13141.4 MB) with weight 3041417453 (90.03/col)
sparse part has weight 2904400096 (85.97/col)
using GPU 0 (NVIDIA GeForce GTX 1660)
selected card has CUDA arch 7.5
Nonzeros per block: 1750000000
Storing matrix in managed memory
converting matrix to CSR and copying it onto the GPU
 Killed
Maybe a 2-3 times the size needed won't so obnoxious.

Last fiddled with by RichD on 2022-07-31 at 01:58 Reason: forgot "=1"
RichD is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Resume linear algebra Timic Msieve 35 2020-10-05 23:08
use msieve linear algebra after CADO-NFS filtering aein Msieve 2 2017-10-05 01:52
Has anyone tried linear algebra on a Threadripper yet? fivemack Hardware 3 2017-10-03 03:11
Linear algebra at 600% CRGreathouse Msieve 8 2009-08-05 07:25
Linear algebra proof Damian Math 8 2007-02-12 22:25

All times are UTC. The time now is 16:51.


Mon Aug 15 16:51:31 UTC 2022 up 39 days, 11:38, 1 user, load averages: 1.23, 1.24, 1.24

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2022, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔