Geoff,
The 1.0.6 SSE2 code increased by 2x the output of my new work machine, a dual core P4 3.0GHz. Went from 107 kp/s to 222 kp/s...thank you.
I noticed that on my AMD the highest performance was achieved by 1.0.4 SSE2 code with caches -l16 -L256. Playing with the flag cache on 1.0.6 code I never achieved the performance of the 1.0.4 SSE2 code.
Later today I will check the ability to hide the client.
Cheers,
Carlos
|