#45 |
"Yury Vorobyov"
Jul 2013
Chelyabinsk
19 Posts |
It does not see any of my devices...
I should at least get the CPU as a device, and my Nvidia card ships libOpenCL, but none of that is working.
Code:
start M6478992 fft length = 393216
Error: Context::Context() failed.
Error code : CL_DEVICE_NOT_FOUND
Location : MatrixMulDouble.hpp:354
OPENCL_V< CLFFT_INVALID_CONTEXT > (3191): Getting device array size ( ::clGetContextInfo() )
OPENCL_V< CLFFT_INVALID_CONTEXT > (115): SetEnvelope failed
Failed to create c2c plan. status=-34; failed step=1
Segmentation fault (core dumped)
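A possible first check for an error like CL_DEVICE_NOT_FOUND on Linux is whether any OpenCL vendor driver is registered at all. The sketch below assumes the usual ICD loader layout, where each driver is listed in a small `.icd` file under `/etc/OpenCL/vendors`; the function name `list_opencl_icds` is made up for illustration, and an empty result is only a hint, not a diagnosis of this particular crash.

```python
import glob
import os


def list_opencl_icds(vendor_dir="/etc/OpenCL/vendors"):
    """Return (icd file name, library named inside it) pairs.

    Assumption: the system uses the standard ICD loader directory.
    A missing or empty directory simply yields an empty list.
    """
    entries = []
    for path in sorted(glob.glob(os.path.join(vendor_dir, "*.icd"))):
        with open(path) as fh:
            library = fh.read().strip()
        entries.append((os.path.basename(path), library))
    return entries


# Print whatever is registered on this machine (may be nothing).
for name, library in list_opencl_icds():
    print(f"{name}: {library}")
```

If the list is empty, no OpenCL platform is registered and the program has nothing to pick a device from. If a driver is listed, the next thing to look at is which device type the program asks for.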
#46 |
Jul 2009
Tokyo
610₁₀ Posts |
#47 |
Romulan Interpreter
"name field"
Jun 2011
Thailand
3×23×149 Posts |
@sanaris:
grrrr... if you have a CUDA-enabled Nvidia card, then forget about doing LL with OpenCL on that card and use CUDALucas. It is a mature product, works very nicely, and objectively, CUDA is 3 times faster than OpenCL on Nvidia cards (they "emulate" OpenCL somehow; it is not as natively supported as the CUDA stuff).
Last fiddled with by LaurV on 2013-08-02 at 06:26 Reason: @sanaris (crosspost)
#48 | |
"Mr. Meeseeks"
Jan 2012
California, USA
880₁₆ Posts |
Quote:
Also, on another note, I think I found my problem...
#49 |
Jul 2009
Tokyo
1142₈ Posts |
Hi.
New Version. |
#50 |
"Mr. Meeseeks"
Jan 2012
California, USA
2⁷×17 Posts |
Hmm. Compiling works now, although I get an error on launch: "Error with clAmdFftSetPlanPrecision". I tried removing that call just out of curiosity and it works... actually it doesn't, but the GPU is doing something... Or maybe not.
Code:
~/0.44 $ ./a.exe 1398269
start M1398269 fft length = 73728
Platform :Advanced Micro Devices, Inc.
Device 0 : Capeverde
Device 1 : BeaverCreek
Build Options are : -D KHR_DP_EXTENSION
Iteration 10000 M( 1398269 )C, 0x0000000000000000, n = 73728, CUDALucas v1.66 err = 0.0003744 (0:07 real, 0.7430 ms/iter, ETA 17:05)
Iteration 20000 M( 1398269 )C, 0x0000000000000000, n = 73728, CUDALucas v1.66 err = 0.0003744 (0:08 real, 0.7640 ms/iter, ETA 17:26)
Iteration 30000 M( 1398269 )C, 0x0000000000000000, n = 73728, CUDALucas v1.66 err = 0.0003744 (0:08 real, 0.7800 ms/iter, ETA 17:40)
Iteration 40000 M( 1398269 )C, 0x0000000000000000, n = 73728, CUDALucas v1.66 err = 0.0003744 (0:08 real, 0.8620 ms/iter, ETA 19:23)
Iteration 50000 M( 1398269 )C, 0x0000000000000000, n = 73728, CUDALucas v1.66 err = 0.0003744 (0:09 real, 0.8570 ms/iter, ETA 19:08)
Iteration 60000 M( 1398269 )C, 0x0000000000000000, n = 73728, CUDALucas v1.66 err = 0.0003744 (0:08 real, 0.8390 ms/iter, ETA 18:35)
Iteration 70000 M( 1398269 )C, 0x0000000000000000, n = 73728, CUDALucas v1.66 err = 0.0003744 (0:08 real, 0.7270 ms/iter, ETA 15:59)
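The log above shows the residue 0x0000000000000000 and the same err value at every checkpoint, which fits the suspicion that nothing real is being computed: in an LL run, a zero residue is followed by -2 and then 2 forever, so it cannot show up at several checkpoints in a row. For comparison, here is a minimal sketch of the Lucas-Lehmer iteration with plain Python big integers. It is not the FFT-based method the program uses, and the names `lucas_lehmer` and `interim_residues` are made up for illustration.

```python
def interim_residues(p, every):
    """Yield (iteration, s) every `every` iterations of the LL sequence.

    s starts at 4 and each step computes s = s*s - 2 modulo 2**p - 1.
    """
    m = (1 << p) - 1
    s = 4
    for i in range(1, p - 1):  # p - 2 iterations in total
        s = (s * s - 2) % m
        if i % every == 0:
            yield i, s


def lucas_lehmer(p):
    """Return True if 2**p - 1 is prime, for an odd prime exponent p."""
    m = (1 << p) - 1
    s = 4
    for _ in range(p - 2):
        s = (s * s - 2) % m
    return s == 0


# Show the low 64 bits of a few interim residues for a small exponent.
for i, s in interim_residues(127, 25):
    print(f"Iteration {i} M( 127 ) 0x{s & 0xFFFFFFFFFFFFFFFF:016x}")
print("M127 prime:", lucas_lehmer(127))
```

Big-integer arithmetic like this is only practical for small exponents, but it gives reference residues to compare against when a GPU build prints suspicious output.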
#51 |
Jul 2009
Tokyo
2×5×61 Posts |
#52 |
"Mr. Meeseeks"
Jan 2012
California, USA
880₁₆ Posts |
#53 |
Jul 2009
Tokyo
2×5×61 Posts |
#54 |
Romulan Interpreter
"name field"
Jun 2011
Thailand
3·23·149 Posts |
He says that the FFT length selected for that exponent is WAY too big. You need either a smaller FFT length or a higher exponent.
BTW, is someone going to build a Windows version of this?
Last fiddled with by LaurV on 2013-08-07 at 10:53 Reason: /s/to/too
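One rough way to see how an exponent and an FFT length relate is the average number of bits each FFT word has to carry, which is simply exponent / FFT length. The sketch below computes that ratio for the two runs quoted earlier in this thread. The helper name `bits_per_word` is made up for illustration, and no cutoff for "too big" is given here because the thread does not state one.

```python
def bits_per_word(exponent, fft_length):
    """Average number of bits of the exponent stored per FFT word."""
    return exponent / fft_length


# (exponent, fft length) pairs taken from the logs posted in this thread.
runs = [(6478992, 393216), (1398269, 73728)]

for exponent, fft_length in runs:
    ratio = bits_per_word(exponent, fft_length)
    print(f"M{exponent}: fft length = {fft_length}, {ratio:.2f} bits per word")
```

A smaller ratio means a longer FFT than the exponent strictly needs, so each iteration does more work than necessary; a larger ratio pushes more bits into each word and raises the round-off error.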
#55 |
"Mr. Meeseeks"
Jan 2012
California, USA
2⁷×17 Posts |
I have a few "experimental" Windows exes here. Unfortunately, they don't quite work yet...
@msft: strange, setting the precision with clAmdFftSetPlanPrecision to CLFFT_SINGLE works, but with CLFFT_DOUBLE it crashes with "Error with clAmdFftSetPlanPrecision"... Hmm