mersenneforum.org  

Go Back   mersenneforum.org > Prime Search Projects > Sierpinski/Riesel Base 5

Reply
 
Thread Tools
Old 2006-12-31, 00:01   #78
Joe O
 
Joe O's Avatar
 
Aug 2002

3×52×7 Posts
Default

Quote:
Originally Posted by jasong View Post
Sorry, I should have clarified. It was a bad link. File not found. And I'm looking for the Linux 64-bit AMD one. I THINK SSE2, but I'm not sure.
AFAIK there was never a 64 bit version published.

By the way, the link works for me.http://81.233.34.242/proth_sieve.php

proth_sieve v0.42 windows regular

proth_sieve v0.42 windows cmov

proth_sieve v0.42 windows SSE2

proth_sieve v0.42 linux regular

proth_sieve v0.42 linux cmov

proth_sieve v0.42 linux SSE2

proth_sieve v0.42 linux regular statically linked

proth_sieve v0.42 linux cmov statically linked

proth_sieve v0.42 linux SSE2 statically linked

proth_sieve v0.42 bsd regular

proth_sieve v0.42 bsd cmov

proth_sieve v0.42 bsd regular statically linked

proth_sieve v0.42 bsd cmov statically linked


Older versions...
Joe O is offline   Reply With Quote
Old 2006-12-31, 00:06   #79
tnerual
 
tnerual's Avatar
 
Oct 2006

7×37 Posts
Default

Quote:
Originally Posted by Joe O View Post
AFAIK there was never a 64 bit version published.

By the way, the link works for me.http://81.233.34.242/proth_sieve.php
but the link for linux app don't work.

Jasong: download the "prothsieve 0.42 version, all versions" it's the only way to get the linux apps.

Last fiddled with by tnerual on 2006-12-31 at 00:07
tnerual is offline   Reply With Quote
Old 2006-12-31, 01:57   #80
jasong
 
jasong's Avatar
 
"Jason Goatcher"
Mar 2005

3×7×167 Posts
Default

Quote:
Originally Posted by tnerual View Post
but the link for linux app don't work.

Jasong: download the "prothsieve 0.42 version, all versions" it's the only way to get the linux apps.
The answer is late coming(my answer, this one), but thanks. :)
jasong is offline   Reply With Quote
Old 2006-12-31, 02:23   #81
geoff
 
geoff's Avatar
 
Mar 2003
New Zealand

22058 Posts
Default

Quote:
Originally Posted by jasong View Post
p=1000000032774601, 441084 p/sec, 0 factors, 0.66% done, ETA 30 Dec 00:11
Thanks for that, could you also also test the 32-bit i686 binary on the same machine and same range for comparison?
geoff is offline   Reply With Quote
Old 2006-12-31, 21:36   #82
jasong
 
jasong's Avatar
 
"Jason Goatcher"
Mar 2005

3×7×167 Posts
Default

For base-2, sieving at 1,000,000 billion on Linux, sr2sieve 32-bit 398,800 p/sec.

Question: Would you be willing to work on it for the Riesel Sieve project as well. It may or may not benefit from the same optimizations.

Last fiddled with by jasong on 2006-12-31 at 21:39
jasong is offline   Reply With Quote
Old 2007-01-02, 23:57   #83
geoff
 
geoff's Avatar
 
Mar 2003
New Zealand

13×89 Posts
Default

Quote:
Originally Posted by jasong View Post
For base-2, sieving at 1,000,000 billion on Linux, sr2sieve 32-bit 398,800 p/sec.
That is interesting, the difference between 32 and 64 bit is not as great as it was for the base 5 sieve.

Quote:
Question: Would you be willing to work on it for the Riesel Sieve project as well. It may or may not benefit from the same optimizations.
It already does, just run `sr2sieve -r' to sieve from riesel.dat, or `sr2sieve -rs' to sieve riesel.dat and SoB.dat together :-) [probably not a good idea unless both have the same n range].
geoff is offline   Reply With Quote
Old 2007-01-10, 00:37   #84
geoff
 
geoff's Avatar
 
Mar 2003
New Zealand

13×89 Posts
Default

In sr2sieve 1.4.18 I have incorporated rogue's improved ppc64 mulmod, and I have made corresponding changes to the inlined functions. I don't know how this affects the relative speed of inline vs non-inline, or the different inline variants.

In Makefile, there is a line that adds -DUSE_INLINE_MULMOD=<X> to the CPPFLAGS. The options are:
X=0: Use the externally linked code.
X=1,2,3: Use inline variant 1, 2 or 3. 1 is default. I think variant 2 should be a little faster in principle, provided the compiler does the right thing. Variant 3 might help the compiler do the right thing.
geoff is offline   Reply With Quote
Old 2007-01-10, 18:51   #85
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

2×3×41 Posts
Thumbs up Looking good...

geoff & rogue, you guys rock. I haven't completed testing with all variants yet, but glancing at the results I've got about a 15% improvement with SoB.dat. I'm also testing with riesel.dat. FYI, here is the matrix of the tests I've got:

Platforms:
  • 2.0 GHz PowerMac, Gentoo
    • gcc-3.4.6
    • gcc-4.0.0
    • gcc-4.0.3
    • gcc-4.1.1
  • 2.5 GHz 970MP, RedHat AS4
    • gcc-3.4.3
    • gcc-4.0.0
    • gcc-4.1.1

On each platform/compiler combination, I'm testing the following:
  • HASH_MAX_DENSITY
    • 0.32
    • 0.65
  • USE_INLINE_MULMOD
    • 0
    • 1
    • 2
    • 3
I'll post full results tonight.
BlisteringSheep is offline   Reply With Quote
Old 2007-01-11, 05:01   #86
geoff
 
geoff's Avatar
 
Mar 2003
New Zealand

100100001012 Posts
Default

Quote:
Originally Posted by BlisteringSheep View Post
On each platform/compiler combination, I'm testing the following:
  • HASH_MAX_DENSITY
I forgot to mention that in version 1.4.18 the hashtable size is automatically doubled if the L1 cache is large enough (as determined by L1_CACHE_SHIFT), so it should be OK to leave HASH_MAX_DENSITY at 0.65.
geoff is offline   Reply With Quote
Old 2007-01-12, 10:34   #87
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

2×3×41 Posts
Default 1.4.18 results

Finally, just a summary over all of the combinations listed above:
  • 2.0 GHz G5 (Apple PowerMac)
    • gcc 4.1.1, DENSITY 0.32, INLINE 2, SoB.dat, 303294 p/sec
  • 2.2 GHz 970FX (IBM JS20)
    • gcc 4.1.1, DENSITY 0.32, INLINE 2, SoB.dat, 205017 p/sec
    • gcc 4.1.1, DENSITY 0.65, INLINE 2, riesel.dat, 204986 p/sec
  • 2.5 GHz 970MP (IBM JS21)
    • gcc 4.1.1, DENSITY 0.32, INLINE 2, SoB.dat, 380233 p/sec
    • gcc 4.1.1, DENSITY 0.32, INLINE 2, riesel.dat, 234616 p/sec
The JS20 numbers with the SoB.dat don't look right. I will re-run that set. I haven't run with riesel.dat on my PowerMac yet, but will soon.
BlisteringSheep is offline   Reply With Quote
Old 2007-01-13, 05:25   #88
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

24610 Posts
Default

Updated with riesel.dat for the G5 and corrected SoB.dat for the JS20.
  • 2.0 GHz G5 (Apple PowerMac)
    • gcc 4.1.1, DENSITY 0.32, INLINE 2, riesel.dat, 187283 p/sec
  • 2.2 GHz 970FX (IBM JS20)
    • gcc 4.1.1, DENSITY 0.65, INLINE 2, SoB.dat, 331802 p/sec

I've attached a zip file of the sr2work.txt and checkpoint.txt along with the data collected.
Attached Files
File Type: zip results-1.4.18.zip (41.9 KB, 112 views)
BlisteringSheep is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
srsieve/sr2sieve enhancements rogue Software 304 2021-11-06 13:51
32-bit of sr1sieve and sr2sieve for Win pepi37 Software 5 2013-08-09 22:31
sr2sieve question SaneMur Information & Answers 2 2011-08-21 22:04
sr2sieve client mgpower0 Prime Sierpinski Project 54 2008-07-15 16:50
How to use sr2sieve nuggetprime Riesel Prime Search 40 2007-12-03 06:01

All times are UTC. The time now is 16:16.


Fri Jul 1 16:16:23 UTC 2022 up 78 days, 14:17, 0 users, load averages: 0.90, 1.46, 1.49

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2022, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔