![]() |
![]() |
#78 | |
Aug 2002
3×52×7 Posts |
![]() Quote:
By the way, the link works for me.http://81.233.34.242/proth_sieve.php proth_sieve v0.42 windows regular proth_sieve v0.42 windows cmov proth_sieve v0.42 windows SSE2 proth_sieve v0.42 linux regular proth_sieve v0.42 linux cmov proth_sieve v0.42 linux SSE2 proth_sieve v0.42 linux regular statically linked proth_sieve v0.42 linux cmov statically linked proth_sieve v0.42 linux SSE2 statically linked proth_sieve v0.42 bsd regular proth_sieve v0.42 bsd cmov proth_sieve v0.42 bsd regular statically linked proth_sieve v0.42 bsd cmov statically linked Older versions... |
|
![]() |
![]() |
![]() |
#79 | |
Oct 2006
7×37 Posts |
![]() Quote:
Jasong: download the "prothsieve 0.42 version, all versions" it's the only way to get the linux apps. Last fiddled with by tnerual on 2006-12-31 at 00:07 |
|
![]() |
![]() |
![]() |
#80 |
"Jason Goatcher"
Mar 2005
3×7×167 Posts |
![]() |
![]() |
![]() |
![]() |
#81 |
Mar 2003
New Zealand
22058 Posts |
![]() |
![]() |
![]() |
![]() |
#82 |
"Jason Goatcher"
Mar 2005
3×7×167 Posts |
![]()
For base-2, sieving at 1,000,000 billion on Linux, sr2sieve 32-bit 398,800 p/sec.
Question: Would you be willing to work on it for the Riesel Sieve project as well. It may or may not benefit from the same optimizations. Last fiddled with by jasong on 2006-12-31 at 21:39 |
![]() |
![]() |
![]() |
#83 | ||
Mar 2003
New Zealand
13×89 Posts |
![]() Quote:
Quote:
|
||
![]() |
![]() |
![]() |
#84 |
Mar 2003
New Zealand
13×89 Posts |
![]()
In sr2sieve 1.4.18 I have incorporated rogue's improved ppc64 mulmod, and I have made corresponding changes to the inlined functions. I don't know how this affects the relative speed of inline vs non-inline, or the different inline variants.
In Makefile, there is a line that adds -DUSE_INLINE_MULMOD=<X> to the CPPFLAGS. The options are: X=0: Use the externally linked code. X=1,2,3: Use inline variant 1, 2 or 3. 1 is default. I think variant 2 should be a little faster in principle, provided the compiler does the right thing. Variant 3 might help the compiler do the right thing. |
![]() |
![]() |
![]() |
#85 |
Oct 2006
On a Suzuki Boulevard C90
2×3×41 Posts |
![]()
geoff & rogue, you guys rock. I haven't completed testing with all variants yet, but glancing at the results I've got about a 15% improvement with SoB.dat. I'm also testing with riesel.dat. FYI, here is the matrix of the tests I've got:
Platforms:
On each platform/compiler combination, I'm testing the following:
![]() |
![]() |
![]() |
![]() |
#86 |
Mar 2003
New Zealand
100100001012 Posts |
![]()
I forgot to mention that in version 1.4.18 the hashtable size is automatically doubled if the L1 cache is large enough (as determined by L1_CACHE_SHIFT), so it should be OK to leave HASH_MAX_DENSITY at 0.65.
|
![]() |
![]() |
![]() |
#87 |
Oct 2006
On a Suzuki Boulevard C90
2×3×41 Posts |
![]()
Finally, just a summary over all of the combinations listed above:
![]() |
![]() |
![]() |
![]() |
#88 |
Oct 2006
On a Suzuki Boulevard C90
24610 Posts |
![]()
Updated with riesel.dat for the G5 and corrected SoB.dat for the JS20.
I've attached a zip file of the sr2work.txt and checkpoint.txt along with the data collected. |
![]() |
![]() |
![]() |
Thread Tools | |
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
srsieve/sr2sieve enhancements | rogue | Software | 304 | 2021-11-06 13:51 |
32-bit of sr1sieve and sr2sieve for Win | pepi37 | Software | 5 | 2013-08-09 22:31 |
sr2sieve question | SaneMur | Information & Answers | 2 | 2011-08-21 22:04 |
sr2sieve client | mgpower0 | Prime Sierpinski Project | 54 | 2008-07-15 16:50 |
How to use sr2sieve | nuggetprime | Riesel Prime Search | 40 | 2007-12-03 06:01 |