mersenneforum.org  

Old 2021-03-31, 15:29   #375
Dylan14
 
 
"Dylan"
Mar 2017

243₁₆ Posts

Attached is a CUDA 11.2 binary of mmff compiled on an Arch Linux system, using the cleaned-up source posted by Fan Ming here. It should work on any Linux system with the latest Nvidia drivers.

This one does seem to work with MM107 and MM127, using the examples posted by Fan Ming in posts 363 and 364 with the flag -v 3.

MM127:

Code:
mmff v0.28 (64bit built)

Compiletime options
  THREADS_PER_BLOCK         256
  MORE_CLASSES              enabled

Runtime options
  GPU Sieving               enabled
  GPUSievePrimes            500000
  GPUSieveSize              32M bits
  GPUSieveProcessSize       16K bits
  WorkFile                  worktodo.txt
  Checkpoints               enabled
  CheckpointDelay           120s
  StopAfterFactor           disabled
  PrintMode                 full
  V5UserID                  (none)
  ComputerID                (none)
WARNING, no GPUProgressHeader specified in mmff.ini, using default
  GPUProgressHeader         "    class |  raw cand. |    time |    ETA |  raw rate | SievePrimes"
WARNING, no GPUProgressFormat specified in mmff.ini, using default
  GPUProgressFormat            "%C/4620 |    %n | %ts | %e | %rM/s |     %s"
  TimeStampInResults        no

CUDA version info
  binary compiled for CUDA  11.20
  CUDA runtime version      11.20
  CUDA driver version       11.20

CUDA device info
  name                      GeForce GTX 1660 Ti
  compute capability        7.5
  maximum threads per block 1024
  number of mutliprocessors 24 (unknown number of shader cores)
  clock rate                1590MHz

got assignment: MM127, k range 562949953421312 to 1125899906842623 (178-bit factors)
Starting trial factoring of MM127 in k range: 562949953421312 to 1125899906842623 (178-bit factors)
 k_min = 562949953421312
 k_max = 1125899906842623
Using GPU kernel "mfaktc_barrett183_M127gs"
Verifying (2^(2^127)) % 191561944857917697129840166812120120096271125295021529 = 158757927754760480688654173499199469295287057656270356
Verifying (2^(2^127)) % 191614694258348779445950559282708892489982390750176689 = 33662559093375555778002927546058307399215184129861713
Verifying (2^(2^127)) % 191667446858012590842648103696906711574948849832403649 = 58322051460264670631592692291826830098619657940589851
Verifying (2^(2^127)) % 191720197063361195168223175182300315372061913389308569 = 161044083194471348086645110435896183576905312890737395
Verifying (2^(2^127)) % 191772947100494614230101526639209315731354679296043129 = 62574822488929725322867766087082605619226720248185889
Verifying (2^(2^127)) % 191825698154779667550033876773029890149243076788387249 = 178143699218529778276359107673564439346000111582072990
Verifying (2^(2^127)) % 191878449278237320417654597759685254765861316305100489 = 167970576150375862734607277121394055064519674969245756
Verifying (2^(2^127)) % 191931200154874561262841813657816481627920801325769369 = 68989798066545733249246754983396020869954214592518662
Verifying (2^(2^127)) % 191983951926039282622453643539197609014463925252484369 = 97958328059656999804568015951825574551513310180672117
Verifying (2^(2^127)) % 192036702536990857023110525934395209885947426134099129 = 57502989442863596951607305398169962920918382075438075
received signal "SIGINT"
mmff will exit once the current class is finished.
press ^C again to exit immediately
mmff will exit NOW!
MM107:

Code:
mmff v0.28 (64bit built)

Compiletime options
  THREADS_PER_BLOCK         256
  MORE_CLASSES              enabled

Runtime options
  GPU Sieving               enabled
  GPUSievePrimes            500000
  GPUSieveSize              32M bits
  GPUSieveProcessSize       16K bits
  WorkFile                  worktodo.txt
  Checkpoints               enabled
  CheckpointDelay           120s
  StopAfterFactor           disabled
  PrintMode                 full
  V5UserID                  (none)
  ComputerID                (none)
WARNING, no GPUProgressHeader specified in mmff.ini, using default
  GPUProgressHeader         "    class |  raw cand. |    time |    ETA |  raw rate | SievePrimes"
WARNING, no GPUProgressFormat specified in mmff.ini, using default
  GPUProgressFormat            "%C/4620 |    %n | %ts | %e | %rM/s |     %s"
  TimeStampInResults        no

CUDA version info
  binary compiled for CUDA  11.20
  CUDA runtime version      11.20
  CUDA driver version       11.20

CUDA device info
  name                      GeForce GTX 1660 Ti
  compute capability        7.5
  maximum threads per block 1024
  number of mutliprocessors 24 (unknown number of shader cores)
  clock rate                1590MHz

WARNING: ignoring line 1 in "worktodo.txt"! Reason: doesn't begin with Factor=
got assignment: MM107, k range 41400000000000 to 41500000000000 (154-bit factors)
Starting trial factoring of MM107 in k range: 41400G to 41500G (154-bit factors)
 k_min = 41400000000000
 k_max = 41500000000000
Using GPU kernel "mfaktc_barrett160_M107gs"
Verifying (2^(2^107)) % 13435068670193779240929580104031093912799413681 = 11943755078920637255837466212346786801214623286
    class |  raw cand. |    time |    ETA |  raw rate | SievePrimes
   0/4620 |     21.66M |  0.031s |   n.a. | 698.70M/s |      500277
Verifying (2^(2^107)) % 13435068674693228987403666670879552138089175391 = 10351997845221972775324276802600874943890505684
   5/4620 |     21.66M |  0.031s |   n.a. | 698.70M/s |      500277
...
I have also attached the full logs from both runs. I'm not sure why the executables built with CUDA 10.1 fail.
Attached Files
File Type: zip mmff_cuda_11-2.zip (3.73 MB, 62 views)
File Type: txt test.txt (182.9 KB, 54 views)
File Type: txt test2.txt (2.9 KB, 90 views)
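
For anyone wondering what the "Verifying" lines in the logs above report: each one is the residue of 2^(2^p) modulo a candidate factor f. Below is a minimal Python sketch of that arithmetic, assuming plain repeated squaring on arbitrary-precision integers (mmff itself runs Barrett-style GPU kernels, going by the kernel names in the log). The function names are hypothetical and not part of mmff.

```python
def double_mersenne_residue(p: int, f: int) -> int:
    """Return 2^(2^p) mod f by squaring 2 exactly p times."""
    r = 2 % f
    for _ in range(p):
        r = (r * r) % f  # each squaring doubles the exponent
    return r


def divides_double_mersenne(p: int, f: int) -> bool:
    """True if odd f divides MM_p = 2^(2^p - 1) - 1.

    For odd f, 2^(2^p - 1) == 1 (mod f) is equivalent to
    2^(2^p) == 2 (mod f), so the residue is compared with 2.
    """
    return f % 2 == 1 and double_mersenne_residue(p, f) == 2 % f


if __name__ == "__main__":
    # MM3 = 2^7 - 1 = 127 trivially divides itself.
    print(divides_double_mersenne(3, 127))
    # First MM127 candidate from the log above; prints its residue.
    f = 191561944857917697129840166812120120096271125295021529
    print(double_mersenne_residue(127, f))
```

A residue of 2 would mean that f is a factor. None of the residues shown in the logs equals 2, which is consistent with no factor turning up in the part of the ranges shown.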
Old 2021-03-31, 16:10   #376
kriesel
 
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

1473₁₆ Posts

Quote:
Originally Posted by Dylan14 View Post
...got assignment: MM127, k range 562949953421312 to 1125899906842623 (178-bit factors)
Starting trial factoring of MM127 in k range: 562949953421312 to 1125899906842623 (178-bit factors)
k_min = 562949953421312
k_max = 1125899906842623
Using GPU kernel "mfaktc_barrett183_M127gs"

...
Cool. Did the posted Arch build include the expanded 2047M GpuSieveSize? In your test run you might want to aim higher, at k ranges and kernels likely to be run in the future, since MM127 TF to 185 bits was completed months ago.
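
To see how a k range maps to the factor sizes quoted in these logs: every factor of MM_p has the form f = 2*k*M_p + 1, where M_p = 2^p - 1, so the bit length follows directly from k. A small Python sketch of that mapping, with hypothetical helper names that are not part of mmff:

```python
def candidate(p: int, k: int) -> int:
    """Candidate factor of MM_p for multiplier k: f = 2*k*M_p + 1."""
    m_p = (1 << p) - 1  # the Mersenne prime M_p = 2^p - 1
    return 2 * k * m_p + 1


def factor_bits(p: int, k: int) -> int:
    """Bit length of the candidate factor for multiplier k."""
    return candidate(p, k).bit_length()


if __name__ == "__main__":
    samples = [
        (127, 2**50 - 1),            # k_max of the MM127 test run
        (127, 140000000000000000),   # start of the 185-bit range
        (127, 145000000000000000),   # end of the 186-bit range
        (107, 41400000000000),       # k_min of the MM107 test run
    ]
    for p, k in samples:
        print(f"MM{p}, k={k}: {factor_bits(p, k)}-bit candidate")
```

For the four k values listed this gives 178, 185 and 186 bits on MM127 and 154 bits on MM107, matching the labels in the logs.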
Quote:
Originally Posted by kriesel View Post
[Fri Sep 04 18:24:46 2020]
UID: kriesel/emu/gtx1650, no factor for MM127 in k range: 140000000000000000 to 144115188075855871 (185-bit factors) [mmff 0.28 mfaktc_barrett185_M127gs]

145P ETA <7 days
Quote:
Originally Posted by kriesel View Post
[Thu Sep 10 22:15:10 2020]
UID: kriesel/emu/gtx1650, no factor for MM127 in k range: 144115188075855872 to 145000000000000000 (186-bit factors) [mmff 0.28 mfaktc_barrett188_M127gs]
Info header was
Code:
mmff v0.28 (64bit built)

Compiletime options
  THREADS_PER_BLOCK         256
  MORE_CLASSES              enabled

Runtime options
  GPU Sieving               enabled
WARNING: Cannot read GPUSievePrimes from mmff.ini, using default value (82486)
  GPUSievePrimes            depends on worktodo entry
  GPUSieveSize              2047M bits
  GPUSieveProcessSize       16K bits
  WorkFile                  worktodo.txt
  Checkpoints               enabled
  CheckpointDelay           300s
  StopAfterFactor           disabled
  PrintMode                 full
  V5UserID                  kriesel
  ComputerID                emu/gtx1650
  TimeStampInResults        yes

CUDA version info
  binary compiled for CUDA  10.10
  CUDA runtime version      10.10
  CUDA driver version       10.20

CUDA device info
  name                      GeForce GTX 1650
  compute capability        7.5
  maximum threads per block 1024
  number of mutliprocessors 14 (unknown number of shader cores)
  clock rate                1710MHz
Edut: maibe ficks tha mipselling tu.

Last fiddled with by kriesel on 2021-03-31 at 17:05
Old 2021-03-31, 16:40   #377
Dylan14
 
 
"Dylan"
Mar 2017

243₁₆ Posts

Quote:
Originally Posted by kriesel View Post
Cool. Did the posted Arch build include the expanded 2047M GpuSieveSize?
No, it is limited to 128M bits. I could easily fix that and post an updated build.
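
For a sense of scale, here is a rough Python sketch converting the GPUSieveSize values mentioned in this thread into memory sizes. It assumes that "M" in the log output means 2^20 bits, which is not confirmed anywhere in the thread; the helper name is made up for illustration.

```python
MBIT = 1 << 20  # assumption: "M" in "32M bits" means 2^20 bits


def sieve_mib(size_mbits: int) -> float:
    """Size in MiB of a sieve bitmap holding size_mbits * 2^20 bits."""
    return size_mbits * MBIT / 8 / (1 << 20)


if __name__ == "__main__":
    for size in (32, 128, 2047):
        print(f"GPUSieveSize {size}M bits -> {sieve_mib(size)} MiB")
```

Under that assumption, 2047M is the largest whole-M value whose bit count still fits in a signed 32-bit integer, which may be why the expanded builds stop there. That is a guess, not something stated in this thread.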
Old 2021-03-31, 20:51   #378
ET_
Banned
 
 
"Luigi"
Aug 2002
Team Italia

2·29·83 Posts

Quote:
Originally Posted by Dylan14 View Post
No, it is limited to 128M bits. I could easily fix that and post an updated build.
Would you mind sharing the updated source code with the FermatSearch community (or at least with me)? I have my code happily running with Ubuntu and the 11.1 drivers, but no PrimeGaps speedup ...
Old 2021-05-09, 18:51   #379
kriesel
 
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

3×5×349 Posts

Please update http://www.doublemersennes.org/download.php for the newer binaries posted recently in this thread.
If I read it correctly, this thread has CUDA10.1 and 11.2, while doublemersennes has only up to CUDA8 and no enlarged GPUSieveSize.

Last fiddled with by kriesel on 2021-05-09 at 18:56
Old 2021-05-10, 16:58   #380
ET_
Banned
 
ET_'s Avatar
 
"Luigi"
Aug 2002
Team Italia

2·29·83 Posts

Quote:
Originally Posted by kriesel View Post
Please update http://www.doublemersennes.org/download.php for the newer binaries posted recently in this thread.
If I read it correctly, this thread has CUDA10.1 and 11.2, while doublemersennes has only up to CUDA8 and no enlarged GPUSieveSize.
I will.
There are very few participants in this subproject, and no one has complained (loudly) until now...

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.