mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2012-12-28, 04:06   #12
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3×29×83 Posts
Default

Quote:
Originally Posted by chalsall View Post
Actually, that's not really bizzare.

The nVidia drivers are proprietary code. So you are suppose to download them yourself each and every time. Then run the script to recompile the driver against your current kernel.

Welcome to freedom... Even though you payed for all the hardware, you still have to jump through hoops to run said hardware using free software....
Hey, speak for yourself. Using the repository mentioned above, I've not had one ounce of problems with the driver (Ubuntu 11.04). apt downloads the updates whenever they're available (which admittedly is never now that 11.04 is unsupported) and they work just fine, reboot or not.

However, before I found the repository, their installation scripts never worked on Ubuntu, and I was much more frustrated then.
Dubslow is offline   Reply With Quote
Old 2012-12-28, 04:15   #13
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

996010 Posts
Default

Quote:
Originally Posted by Dubslow View Post
Hey, speak for yourself. Using the repository mentioned above, I've not had one ounce of problems with the driver (Ubuntu 11.04). apt downloads the updates whenever they're available (which admittedly is never now that 11.04 is unsupported) and they work just fine, reboot or not.
But... Technically you're breaking the contract.

Every time the kernel gets updated you are supposed to recompile the nVidia drivers yourself.

It's stupid, I agree. It's also the law (which is also stupid).
chalsall is online now   Reply With Quote
Old 2012-12-28, 05:00   #14
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3×29×83 Posts
Default

Quote:
Originally Posted by chalsall View Post
But... Technically you're breaking the contract.

Every time the kernel gets updated you are supposed to recompile the nVidia drivers yourself.

It's stupid, I agree. It's also the law (which is also stupid).
Really? What a silly stipulation. (I assume by "recompile" you mean reinstall with the shell script they provide?)

I guess that anyone using that repository is breaking the contract then.

Also note that Canonical also supplies the drivers in a "partner" repository, and since those are from nVidia directly, that must have a different license.

*shrug*
Dubslow is offline   Reply With Quote
Old 2012-12-28, 23:52   #15
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

23×3×5×83 Posts
Default

Quote:
Originally Posted by Dubslow View Post
Really? What a silly stipulation. (I assume by "recompile" you mean reinstall with the shell script they provide?)
Yes. You "sh NVIDIA-Linux-x86_64-310.19.run", for example, as root, and trust (hope? (pray?)) they don't do anything nasty. Or they've been hacked. Interestingly, their binary script files they provide are never signed with a key...

Quote:
Originally Posted by Dubslow View Post
I guess that anyone using that repository is breaking the contract then.
Probably. Please see as to the current restrictions.

It *is* possible that the rpm/dep file downloads the current nVidia driver and runs it. But I doubt it. And for some reason the repo maintainers are always behind the ball on kernel upgrades and nVidia driver releases...

Quote:
Originally Posted by Dubslow View Post
Also note that Canonical also supplies the drivers in a "partner" repository, and since those are from nVidia directly, that must have a different license.

*shrug*
But as Richard Stallman pointed out recently, Ubuntu isn't Free. Even though it's free.

It's complicated....
chalsall is online now   Reply With Quote
Old 2012-12-29, 01:19   #16
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

11·101 Posts
Default

Hi!

Quote:
Originally Posted by Graff View Post
Just tried another reboot. This time ran the nvidia-smi -a command with sudo.
Normal output resulted! Was able to get mfaktc running.
Will now try this on the other machine.

Same thing, no joy until I ran sudo nvidia-smi -a.

Gareth
Quote:
Originally Posted by Dubslow View Post
Huh, yes that is really bizzare behavior. Another thing to try is `sudo apt-get upgrade`, though I'm not sure that would help.

I have no idea why the drivers seem to disappear, or why `sudo nvidia-smi -a` would fix it (but not without the sudo).
I've an idea why 'sudo nvidia-smi -a' fixes the problem.
I guess you have either no X running or X is configured not to use the nvidia driver. X is priviliged to load kernel modules, a normal user not. When you run 'nvidia-smi -a' some software notices that it should load the nvidia kernel module. When running 'nvidia-smi -a' as normal user you're not allowed to... when you do the same as priviliged user (e.g. root) it is possible to load the module. Same for mfaktc: running as user: no automagic module loading... but when you start as root it would...

Solutions (pick one of them):
  • configure udev
  • load X (with nvidia modules)
  • write a small startup script which loads the modules

I would choose the first option, this is tested on Ubuntu 12.04. Create those two files as root:
/etc/udev/rules.d/86-nvidia.rules:
Code:
SUBSYSTEM=="module", KERNEL=="nvidia", RUN+="/lib/udev/nvidia.sh"
/lib/udev/nvidia.sh:
Code:
#!/bin/bash

mknod -m 666 /dev/nvidiactl c 195 255
#chown root:root /dev/nvidiactl

for DEV in {0..7} # one for each GPU
do
  mknod -m 666 /dev/nvidia${DEV} c 195 ${DEV}
#  chown root:root /dev/nvidia${DEV}
done
chmod +x /lib/udev/nvidia.sh


Oliver
TheJudger is offline   Reply With Quote
Old 2013-03-14, 11:57   #17
Graff
 
Graff's Avatar
 
Jul 2006
USA (UT-5) via UK (UT)

22×59 Posts
Default

Quote:
Originally Posted by henryzz View Post
Just the last one should do.
The first command was adding a repository. The second was downloading the lists from that repository.
After replacing the power supply in one of my machines, along with
a replacement GPU, mfaktc ran quite happily on both GPUs. One
process ran out of work, so I repopulated worktodo.txt and tried to
rerun mfaktc.exe. Got the following error:

NVIDIA: API mismatch: the NVIDIA kernel module has version 304.64,
but this NVIDIA driver component has version 304.84. Please make
sure that the kernel module and all NVIDIA driver components
have the same version.
Failed to initialize NVML: Unknown Error

(The other GPU is continuing to run and is not going to run out of work
any time soon.)

I tried running "sudo nvidia-smi -a" and got:

NVIDIA: API mismatch: the NVIDIA kernel module has version 304.64,
but this NVIDIA driver component has version 304.84. Please make
sure that the kernel module and all NVIDIA driver components
have the same version.
Failed to initialize NVML: Unknown Error

Presumably some update applied in the <12 hours that the machine
has been running broke something in the NVIDIA set up. (The machine
had been off-line for a couple of weeks, so there were a lot of
updates to apply.)

Any thoughts?

Gareth
Graff is offline   Reply With Quote
Old 2013-03-14, 23:44   #18
Graff
 
Graff's Avatar
 
Jul 2006
USA (UT-5) via UK (UT)

22×59 Posts
Default

Quote:
Originally Posted by Graff View Post
Any thoughts?

Gareth
Fixed it. A simple reboot fixed the problem. The last patch install
was apparently the problem and it did not say that a reboot was
necessary.

Gareth
Graff is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
What happened to moregimps.it? It seems to have disappeared... Stargate38 Operation Billion Digits 2 2017-05-08 18:53
NVIDIA TITAN 320.59 driver with cuda 5.5 Manpowre GPU Computing 43 2013-08-22 12:28
Ubuntu 10.10 Maverick and CudaLucas 1.2 - Fatal kernel update moebius Linux 1 2012-02-20 03:17
A new driver? (or type of driver?) 10metreh Aliquot Sequences 3 2010-02-15 15:57
smilies disappeared from post composition page! mdettweiler Forum Feedback 3 2008-02-21 09:20

All times are UTC. The time now is 23:16.


Sat Oct 23 23:16:15 UTC 2021 up 92 days, 17:45, 0 users, load averages: 1.29, 1.25, 1.17

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.