2022-03-28, 01:01   #65
Magellan3s
 
Mar 2022
Earth

5×23 Posts

Quote:
Originally Posted by kruoli
Wonderful, thank you! Would you mind running some single-threaded (1 worker, 1 thread, no HT benchmarking) tests, starting from 1K?
Part 1
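For anyone who wants to compare configurations without reading every row, here is a minimal sketch that reduces the benchmark lines in the log that follows to the fastest Pass1/Pass2/clm combination per FFT length. The names `LINE_RE`, `best_per_fft` and `sample` are made up for this illustration, and the regular expression assumes the line layout seen in this post (`FFTlen=...K ... Pass1=..., Pass2=..., clm=... Throughput: ... iter/sec`); other mprime versions may print something different.

```python
import re

# Pattern for one benchmark line, based only on the layout seen in this post.
LINE_RE = re.compile(
    r"FFTlen=(?P<fft>\d+)K.*?Pass1=(?P<p1>\d+), Pass2=(?P<p2>\d+), clm=(?P<clm>\d+)"
    r".*?Throughput: (?P<tput>[\d.]+) iter/sec"
)


def best_per_fft(log_text):
    """Map FFT length (in K) to (throughput, Pass1, Pass2, clm) of the fastest row."""
    best = {}
    for line in log_text.splitlines():
        m = LINE_RE.search(line)
        if m is None:
            continue  # topology lines, timestamps, version banner
        fft = int(m["fft"])
        row = (float(m["tput"]), int(m["p1"]), int(m["p2"]), int(m["clm"]))
        if fft not in best or row[0] > best[fft][0]:
            best[fft] = row
    return best


# Three rows and one timestamp copied from the log in this post.
sample = """\
FFTlen=1024K all-complex, Type=3, Arch=8, Pass1=128, Pass2=8192, clm=4 (1 core, 1 worker):  0.32 ms.  Throughput: 3151.93 iter/sec.
FFTlen=1024K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=1024, clm=2 (1 core, 1 worker):  0.39 ms.  Throughput: 2534.63 iter/sec.
[Sun Mar 27 19:05:54 2022]
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=128, Pass2=9216, clm=4 (1 core, 1 worker):  0.35 ms.  Throughput: 2818.22 iter/sec.
"""

for fft, (tput, p1, p2, clm) in sorted(best_per_fft(sample).items()):
    print(f"{fft}K: Pass1={p1}, Pass2={p2}, clm={clm} -> {tput} iter/sec")
```

To run it on the full log, save the pasted output to a text file and pass the file contents to `best_per_fft` instead of `sample`.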
Code:
 Machine#0 (total=65555544KB, DMIProductName="System Product Name", DMIProductVersion="System Version", DMIBoardVendor="ASUSTeK COMPUTER INC.", DMIBoardName="ProArt Z690-CREATOR WIFI", DMIBoardVersion="Rev 1.xx", DMIBoardAssetTag="Default string", DMIChassisVendor="Default string", DMIChassisType=3, DMIChassisVersion="Default string", DMIChassisAssetTag="Default string", DMIBIOSVendor="American Megatrends Inc.", DMIBIOSVersion=0811, DMIBIOSDate=12/15/2021, DMISysVendor=ASUS, Backend=Linux, LinuxCgroup=/, OSName=Linux, OSRelease=5.13.0-37-generic, OSVersion="#42-Ubuntu SMP Tue Mar 15 14:34:06 UTC 2022", HostName=Magellan, Architecture=x86_64, hwlocVersion=2.4.1, ProcessName=mprime)
  Package#0 (total=65555544KB, CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=151, CPUModel="12th Gen Intel(R) Core(TM) i9-12900K", CPUStepping=2)
    L3 (size=30720KB, linesize=64, ways=12, Inclusive=0)
      L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
        L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
          Core#0 (cpuset: 0x00000003)
            PU#0 (cpuset: 0x00000001)
            PU#1 (cpuset: 0x00000002)
      L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
        L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
          Core#4 (cpuset: 0x0000000c)
            PU#2 (cpuset: 0x00000004)
            PU#3 (cpuset: 0x00000008)
      L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
        L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
          Core#8 (cpuset: 0x00000030)
            PU#4 (cpuset: 0x00000010)
            PU#5 (cpuset: 0x00000020)
      L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
        L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
          Core#12 (cpuset: 0x000000c0)
            PU#6 (cpuset: 0x00000040)
            PU#7 (cpuset: 0x00000080)
      L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
        L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
          Core#16 (cpuset: 0x00000300)
            PU#8 (cpuset: 0x00000100)
            PU#9 (cpuset: 0x00000200)
      L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
        L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
          Core#20 (cpuset: 0x00000c00)
            PU#10 (cpuset: 0x00000400)
            PU#11 (cpuset: 0x00000800)
      L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
        L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
          Core#24 (cpuset: 0x00003000)
            PU#12 (cpuset: 0x00001000)
            PU#13 (cpuset: 0x00002000)
      L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
        L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
          Core#28 (cpuset: 0x0000c000)
            PU#14 (cpuset: 0x00004000)
            PU#15 (cpuset: 0x00008000)
Prime95 64-bit version 30.7, RdtscTiming=1
FFTlen=1024K all-complex, Type=3, Arch=8, Pass1=128, Pass2=8192, clm=4 (1 core, 1 worker):  0.32 ms.  Throughput: 3151.93 iter/sec.
FFTlen=1024K all-complex, Type=3, Arch=8, Pass1=128, Pass2=8192, clm=2 (1 core, 1 worker):  0.32 ms.  Throughput: 3087.37 iter/sec.
FFTlen=1024K all-complex, Type=3, Arch=8, Pass1=128, Pass2=8192, clm=1 (1 core, 1 worker):  0.37 ms.  Throughput: 2671.78 iter/sec.
FFTlen=1024K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=1024, clm=2 (1 core, 1 worker):  0.39 ms.  Throughput: 2534.63 iter/sec.
FFTlen=1024K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=1024, clm=1 (1 core, 1 worker):  0.38 ms.  Throughput: 2609.84 iter/sec.
FFTlen=1024K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=512, clm=1 (1 core, 1 worker):  0.45 ms.  Throughput: 2222.52 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=128, Pass2=9216, clm=4 (1 core, 1 worker):  0.35 ms.  Throughput: 2818.22 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=128, Pass2=9216, clm=2 (1 core, 1 worker):  0.37 ms.  Throughput: 2700.36 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=128, Pass2=9216, clm=1 (1 core, 1 worker):  0.40 ms.  Throughput: 2474.46 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=192, Pass2=6144, clm=4 (1 core, 1 worker):  0.37 ms.  Throughput: 2696.36 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=192, Pass2=6144, clm=2 (1 core, 1 worker):  0.38 ms.  Throughput: 2662.90 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=192, Pass2=6144, clm=1 (1 core, 1 worker):  0.38 ms.  Throughput: 2649.28 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=1024, clm=2 (1 core, 1 worker):  0.44 ms.  Throughput: 2281.93 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=1024, clm=1 (1 core, 1 worker):  0.44 ms.  Throughput: 2272.86 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=768, clm=2 (1 core, 1 worker):  0.43 ms.  Throughput: 2323.97 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=768, clm=1 (1 core, 1 worker):  0.42 ms.  Throughput: 2381.80 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=512, clm=1 (1 core, 1 worker):  0.51 ms.  Throughput: 1947.46 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=384, clm=1 (1 core, 1 worker):  0.58 ms.  Throughput: 1738.65 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=192, Pass2=6400, clm=4 (1 core, 1 worker):  0.38 ms.  Throughput: 2598.30 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=192, Pass2=6400, clm=2 (1 core, 1 worker):  0.39 ms.  Throughput: 2549.45 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=192, Pass2=6400, clm=1 (1 core, 1 worker):  0.41 ms.  Throughput: 2448.53 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=640, Pass2=1920, clm=4 (1 core, 1 worker):  0.46 ms.  Throughput: 2196.62 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=640, Pass2=1920, clm=2 (1 core, 1 worker):  0.45 ms.  Throughput: 2232.31 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=640, Pass2=1920, clm=1 (1 core, 1 worker):  0.44 ms.  Throughput: 2265.45 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=768, Pass2=1600, clm=4 (1 core, 1 worker):  0.47 ms.  Throughput: 2141.49 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=768, Pass2=1600, clm=2 (1 core, 1 worker):  0.47 ms.  Throughput: 2125.35 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=768, Pass2=1600, clm=1 (1 core, 1 worker):  0.46 ms.  Throughput: 2179.10 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=640, clm=2 (1 core, 1 worker):  0.49 ms.  Throughput: 2061.25 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=640, clm=1 (1 core, 1 worker):  0.48 ms.  Throughput: 2102.26 iter/sec.
FFTlen=1280K all-complex, Type=3, Arch=8, Pass1=128, Pass2=10240, clm=4 (1 core, 1 worker):  0.39 ms.  Throughput: 2537.50 iter/sec.
FFTlen=1280K all-complex, Type=3, Arch=8, Pass1=128, Pass2=10240, clm=2 (1 core, 1 worker):  0.41 ms.  Throughput: 2457.71 iter/sec.
FFTlen=1280K all-complex, Type=3, Arch=8, Pass1=128, Pass2=10240, clm=1 (1 core, 1 worker):  0.45 ms.  Throughput: 2218.23 iter/sec.
FFTlen=1280K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=1024, clm=2 (1 core, 1 worker):  0.48 ms.  Throughput: 2075.28 iter/sec.
FFTlen=1280K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=1024, clm=1 (1 core, 1 worker):  0.49 ms.  Throughput: 2044.03 iter/sec.
FFTlen=1280K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=640, clm=1 (1 core, 1 worker):  0.49 ms.  Throughput: 2024.64 iter/sec.
FFTlen=1344K all-complex, Type=3, Arch=8, Pass1=192, Pass2=7168, clm=4 (1 core, 1 worker):  0.43 ms.  Throughput: 2329.84 iter/sec.
FFTlen=1344K all-complex, Type=3, Arch=8, Pass1=192, Pass2=7168, clm=2 (1 core, 1 worker):  0.43 ms.  Throughput: 2322.22 iter/sec.
FFTlen=1344K all-complex, Type=3, Arch=8, Pass1=192, Pass2=7168, clm=1 (1 core, 1 worker):  0.46 ms.  Throughput: 2189.27 iter/sec.
FFTlen=1344K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=1024, clm=2 (1 core, 1 worker):  0.53 ms.  Throughput: 1899.91 iter/sec.
FFTlen=1344K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=1024, clm=1 (1 core, 1 worker):  0.52 ms.  Throughput: 1925.47 iter/sec.
FFTlen=1344K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=448, clm=1 (1 core, 1 worker):  0.63 ms.  Throughput: 1599.61 iter/sec.
FFTlen=1400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2240, clm=4 (1 core, 1 worker):  0.52 ms.  Throughput: 1908.48 iter/sec.
FFTlen=1400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2240, clm=2 (1 core, 1 worker):  0.52 ms.  Throughput: 1921.31 iter/sec.
FFTlen=1400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2240, clm=1 (1 core, 1 worker):  0.52 ms.  Throughput: 1925.29 iter/sec.
FFTlen=1400K all-complex, Type=3, Arch=8, Pass1=896, Pass2=1600, clm=4 (1 core, 1 worker):  0.57 ms.  Throughput: 1756.84 iter/sec.
FFTlen=1400K all-complex, Type=3, Arch=8, Pass1=896, Pass2=1600, clm=2 (1 core, 1 worker):  0.54 ms.  Throughput: 1842.64 iter/sec.
FFTlen=1400K all-complex, Type=3, Arch=8, Pass1=896, Pass2=1600, clm=1 (1 core, 1 worker):  0.54 ms.  Throughput: 1841.53 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=192, Pass2=7680, clm=4 (1 core, 1 worker):  0.45 ms.  Throughput: 2215.36 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=192, Pass2=7680, clm=2 (1 core, 1 worker):  0.46 ms.  Throughput: 2173.71 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=192, Pass2=7680, clm=1 (1 core, 1 worker):  0.48 ms.  Throughput: 2071.58 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2304, clm=4 (1 core, 1 worker):  0.52 ms.  Throughput: 1919.41 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2304, clm=2 (1 core, 1 worker):  0.52 ms.  Throughput: 1912.21 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2304, clm=1 (1 core, 1 worker):  0.52 ms.  Throughput: 1929.25 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=768, Pass2=1920, clm=4 (1 core, 1 worker):  0.55 ms.  Throughput: 1818.95 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=768, Pass2=1920, clm=2 (1 core, 1 worker):  0.54 ms.  Throughput: 1840.91 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=768, Pass2=1920, clm=1 (1 core, 1 worker):  0.54 ms.  Throughput: 1868.83 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=768, clm=2 (1 core, 1 worker):  0.52 ms.  Throughput: 1909.66 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=768, clm=1 (1 core, 1 worker):  0.51 ms.  Throughput: 1945.57 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=640, clm=1 (1 core, 1 worker):  0.57 ms.  Throughput: 1766.68 iter/sec.
[Sun Mar 27 19:05:54 2022]
FFTlen=1500K all-complex, Type=3, Arch=8, Pass1=960, Pass2=1600, clm=4 (1 core, 1 worker):  0.62 ms.  Throughput: 1603.04 iter/sec.
FFTlen=1500K all-complex, Type=3, Arch=8, Pass1=960, Pass2=1600, clm=2 (1 core, 1 worker):  0.58 ms.  Throughput: 1718.72 iter/sec.
FFTlen=1500K all-complex, Type=3, Arch=8, Pass1=960, Pass2=1600, clm=1 (1 core, 1 worker):  0.58 ms.  Throughput: 1722.72 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=128, Pass2=12288, clm=4 (1 core, 1 worker):  0.47 ms.  Throughput: 2127.05 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=128, Pass2=12288, clm=2 (1 core, 1 worker):  0.48 ms.  Throughput: 2091.64 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=128, Pass2=12288, clm=1 (1 core, 1 worker):  0.52 ms.  Throughput: 1922.43 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=192, Pass2=8192, clm=4 (1 core, 1 worker):  0.48 ms.  Throughput: 2087.15 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=192, Pass2=8192, clm=2 (1 core, 1 worker):  0.50 ms.  Throughput: 2000.05 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=192, Pass2=8192, clm=1 (1 core, 1 worker):  0.52 ms.  Throughput: 1906.82 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=1024, clm=2 (1 core, 1 worker):  0.61 ms.  Throughput: 1644.07 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=1024, clm=1 (1 core, 1 worker):  0.58 ms.  Throughput: 1724.51 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=768, clm=1 (1 core, 1 worker):  0.55 ms.  Throughput: 1809.02 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=512, clm=1 (1 core, 1 worker):  0.70 ms.  Throughput: 1419.95 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=128, Pass2=12800, clm=4 (1 core, 1 worker):  0.54 ms.  Throughput: 1836.55 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=128, Pass2=12800, clm=2 (1 core, 1 worker):  0.56 ms.  Throughput: 1795.41 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=128, Pass2=12800, clm=1 (1 core, 1 worker):  0.60 ms.  Throughput: 1658.66 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2560, clm=4 (1 core, 1 worker):  0.59 ms.  Throughput: 1704.47 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2560, clm=2 (1 core, 1 worker):  0.57 ms.  Throughput: 1765.59 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2560, clm=1 (1 core, 1 worker):  0.57 ms.  Throughput: 1763.10 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=1600, clm=2 (1 core, 1 worker):  0.63 ms.  Throughput: 1592.88 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=1600, clm=1 (1 core, 1 worker):  0.62 ms.  Throughput: 1612.98 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2688, clm=4 (1 core, 1 worker):  0.63 ms.  Throughput: 1587.84 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2688, clm=2 (1 core, 1 worker):  0.60 ms.  Throughput: 1665.82 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2688, clm=1 (1 core, 1 worker):  0.60 ms.  Throughput: 1661.03 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2240, clm=4 (1 core, 1 worker):  0.64 ms.  Throughput: 1553.56 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2240, clm=2 (1 core, 1 worker):  0.63 ms.  Throughput: 1599.48 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2240, clm=1 (1 core, 1 worker):  0.61 ms.  Throughput: 1636.97 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=896, Pass2=1920, clm=4 (1 core, 1 worker):  0.65 ms.  Throughput: 1549.72 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=896, Pass2=1920, clm=2 (1 core, 1 worker):  0.63 ms.  Throughput: 1587.33 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=896, Pass2=1920, clm=1 (1 core, 1 worker):  0.63 ms.  Throughput: 1596.50 iter/sec.
FFTlen=1728K all-complex, Type=3, Arch=8, Pass1=192, Pass2=9216, clm=4 (1 core, 1 worker):  0.53 ms.  Throughput: 1878.73 iter/sec.
FFTlen=1728K all-complex, Type=3, Arch=8, Pass1=192, Pass2=9216, clm=2 (1 core, 1 worker):  0.54 ms.  Throughput: 1866.64 iter/sec.
FFTlen=1728K all-complex, Type=3, Arch=8, Pass1=192, Pass2=9216, clm=1 (1 core, 1 worker):  0.57 ms.  Throughput: 1764.81 iter/sec.
FFTlen=1728K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2304, clm=4 (1 core, 1 worker):  0.65 ms.  Throughput: 1546.72 iter/sec.
FFTlen=1728K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2304, clm=2 (1 core, 1 worker):  0.64 ms.  Throughput: 1563.21 iter/sec.
FFTlen=1728K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2304, clm=1 (1 core, 1 worker):  0.61 ms.  Throughput: 1647.41 iter/sec.
FFTlen=1728K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=768, clm=1 (1 core, 1 worker):  0.61 ms.  Throughput: 1636.96 iter/sec.
FFTlen=1800K all-complex, Type=3, Arch=8, Pass1=960, Pass2=1920, clm=4 (1 core, 1 worker):  0.68 ms.  Throughput: 1462.01 iter/sec.
FFTlen=1800K all-complex, Type=3, Arch=8, Pass1=960, Pass2=1920, clm=2 (1 core, 1 worker):  0.68 ms.  Throughput: 1472.04 iter/sec.
FFTlen=1800K all-complex, Type=3, Arch=8, Pass1=960, Pass2=1920, clm=1 (1 core, 1 worker):  0.66 ms.  Throughput: 1510.16 iter/sec.
FFTlen=1800K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=1600, clm=2 (1 core, 1 worker):  0.68 ms.  Throughput: 1475.99 iter/sec.
FFTlen=1800K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=1600, clm=1 (1 core, 1 worker):  0.69 ms.  Throughput: 1444.21 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=128, Pass2=15360, clm=4 (1 core, 1 worker):  0.65 ms.  Throughput: 1531.00 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=128, Pass2=15360, clm=2 (1 core, 1 worker):  0.66 ms.  Throughput: 1503.96 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=128, Pass2=15360, clm=1 (1 core, 1 worker):  0.72 ms.  Throughput: 1380.27 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=192, Pass2=10240, clm=4 (1 core, 1 worker):  0.61 ms.  Throughput: 1647.53 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=192, Pass2=10240, clm=2 (1 core, 1 worker):  0.59 ms.  Throughput: 1685.91 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=192, Pass2=10240, clm=1 (1 core, 1 worker):  0.63 ms.  Throughput: 1577.13 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3072, clm=4 (1 core, 1 worker):  0.67 ms.  Throughput: 1490.67 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3072, clm=2 (1 core, 1 worker):  0.64 ms.  Throughput: 1553.91 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3072, clm=1 (1 core, 1 worker):  0.63 ms.  Throughput: 1575.76 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2560, clm=4 (1 core, 1 worker):  0.70 ms.  Throughput: 1419.14 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2560, clm=2 (1 core, 1 worker):  0.70 ms.  Throughput: 1426.63 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2560, clm=1 (1 core, 1 worker):  0.68 ms.  Throughput: 1469.57 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=1920, clm=2 (1 core, 1 worker):  0.71 ms.  Throughput: 1406.58 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=1920, clm=1 (1 core, 1 worker):  0.70 ms.  Throughput: 1432.09 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=1024, clm=2 (1 core, 1 worker):  0.77 ms.  Throughput: 1304.55 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=1024, clm=1 (1 core, 1 worker):  0.73 ms.  Throughput: 1371.78 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=640, clm=1 (1 core, 1 worker):  0.77 ms.  Throughput: 1295.06 iter/sec.
[Sun Mar 27 19:10:55 2022]
FFTlen=1960K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3136, clm=4 (1 core, 1 worker):  0.72 ms.  Throughput: 1384.18 iter/sec.
FFTlen=1960K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3136, clm=2 (1 core, 1 worker):  0.71 ms.  Throughput: 1416.93 iter/sec.
FFTlen=1960K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3136, clm=1 (1 core, 1 worker):  0.72 ms.  Throughput: 1390.54 iter/sec.
FFTlen=1960K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2240, clm=4 (1 core, 1 worker):  0.77 ms.  Throughput: 1304.58 iter/sec.
FFTlen=1960K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2240, clm=2 (1 core, 1 worker):  0.73 ms.  Throughput: 1366.18 iter/sec.
FFTlen=1960K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2240, clm=1 (1 core, 1 worker):  0.74 ms.  Throughput: 1343.55 iter/sec.
FFTlen=2000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3200, clm=4 (1 core, 1 worker):  0.72 ms.  Throughput: 1391.21 iter/sec.
FFTlen=2000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3200, clm=2 (1 core, 1 worker):  0.72 ms.  Throughput: 1389.51 iter/sec.
FFTlen=2000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3200, clm=1 (1 core, 1 worker):  0.72 ms.  Throughput: 1391.36 iter/sec.
FFTlen=2000K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=1600, clm=2 (1 core, 1 worker):  0.76 ms.  Throughput: 1321.36 iter/sec.
FFTlen=2000K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=1600, clm=1 (1 core, 1 worker):  0.77 ms.  Throughput: 1293.76 iter/sec.
FFTlen=2016K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2688, clm=4 (1 core, 1 worker):  0.75 ms.  Throughput: 1341.15 iter/sec.
FFTlen=2016K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2688, clm=2 (1 core, 1 worker):  0.73 ms.  Throughput: 1369.98 iter/sec.
FFTlen=2016K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2688, clm=1 (1 core, 1 worker):  0.72 ms.  Throughput: 1380.61 iter/sec.
FFTlen=2016K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2304, clm=4 (1 core, 1 worker):  0.76 ms.  Throughput: 1318.20 iter/sec.
FFTlen=2016K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2304, clm=2 (1 core, 1 worker):  0.75 ms.  Throughput: 1341.57 iter/sec.
FFTlen=2016K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2304, clm=1 (1 core, 1 worker):  0.74 ms.  Throughput: 1356.18 iter/sec.
FFTlen=2048K all-complex, Type=3, Arch=8, Pass1=128, Pass2=16384, clm=4 (1 core, 1 worker):  0.62 ms.  Throughput: 1624.29 iter/sec.
FFTlen=2048K all-complex, Type=3, Arch=8, Pass1=128, Pass2=16384, clm=2 (1 core, 1 worker):  0.64 ms.  Throughput: 1569.61 iter/sec.
FFTlen=2048K all-complex, Type=3, Arch=8, Pass1=128, Pass2=16384, clm=1 (1 core, 1 worker):  0.69 ms.  Throughput: 1442.22 iter/sec.
FFTlen=2048K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=1024, clm=1 (1 core, 1 worker):  0.75 ms.  Throughput: 1335.28 iter/sec.
FFTlen=2100K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2240, clm=4 (1 core, 1 worker):  0.83 ms.  Throughput: 1203.76 iter/sec.
FFTlen=2100K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2240, clm=2 (1 core, 1 worker):  0.78 ms.  Throughput: 1282.97 iter/sec.
FFTlen=2100K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2240, clm=1 (1 core, 1 worker):  0.80 ms.  Throughput: 1253.12 iter/sec.
FFTlen=2100K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=1600, clm=2 (1 core, 1 worker):  0.82 ms.  Throughput: 1216.13 iter/sec.
FFTlen=2100K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=1600, clm=1 (1 core, 1 worker):  0.83 ms.  Throughput: 1210.09 iter/sec.
FFTlen=2160K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2304, clm=4 (1 core, 1 worker):  0.84 ms.  Throughput: 1196.10 iter/sec.
FFTlen=2160K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2304, clm=2 (1 core, 1 worker):  0.80 ms.  Throughput: 1251.21 iter/sec.
FFTlen=2160K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2304, clm=1 (1 core, 1 worker):  0.80 ms.  Throughput: 1244.86 iter/sec.
FFTlen=2160K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=1920, clm=2 (1 core, 1 worker):  0.80 ms.  Throughput: 1243.83 iter/sec.
FFTlen=2160K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=1920, clm=1 (1 core, 1 worker):  0.79 ms.  Throughput: 1271.00 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=128, Pass2=17920, clm=4 (1 core, 1 worker):  0.78 ms.  Throughput: 1289.62 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=128, Pass2=17920, clm=2 (1 core, 1 worker):  0.78 ms.  Throughput: 1274.68 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=128, Pass2=17920, clm=1 (1 core, 1 worker):  0.85 ms.  Throughput: 1174.81 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3584, clm=4 (1 core, 1 worker):  0.78 ms.  Throughput: 1279.14 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3584, clm=2 (1 core, 1 worker):  0.75 ms.  Throughput: 1329.32 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3584, clm=1 (1 core, 1 worker):  0.76 ms.  Throughput: 1312.34 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2560, clm=4 (1 core, 1 worker):  0.81 ms.  Throughput: 1241.90 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2560, clm=2 (1 core, 1 worker):  0.80 ms.  Throughput: 1248.73 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2560, clm=1 (1 core, 1 worker):  0.80 ms.  Throughput: 1243.22 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2240, clm=2 (1 core, 1 worker):  0.82 ms.  Throughput: 1213.41 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2240, clm=1 (1 core, 1 worker):  0.81 ms.  Throughput: 1229.12 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=128, Pass2=18432, clm=4 (1 core, 1 worker):  0.78 ms.  Throughput: 1277.86 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=128, Pass2=18432, clm=2 (1 core, 1 worker):  0.80 ms.  Throughput: 1249.71 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=128, Pass2=18432, clm=1 (1 core, 1 worker):  0.86 ms.  Throughput: 1165.50 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=192, Pass2=12288, clm=4 (1 core, 1 worker):  0.70 ms.  Throughput: 1434.76 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=192, Pass2=12288, clm=2 (1 core, 1 worker):  0.73 ms.  Throughput: 1372.91 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=192, Pass2=12288, clm=1 (1 core, 1 worker):  0.73 ms.  Throughput: 1374.91 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3072, clm=4 (1 core, 1 worker):  0.80 ms.  Throughput: 1248.85 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3072, clm=2 (1 core, 1 worker):  0.77 ms.  Throughput: 1292.23 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3072, clm=1 (1 core, 1 worker):  0.75 ms.  Throughput: 1327.86 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2304, clm=2 (1 core, 1 worker):  0.82 ms.  Throughput: 1214.71 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2304, clm=1 (1 core, 1 worker):  0.82 ms.  Throughput: 1216.94 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=1024, clm=1 (1 core, 1 worker):  0.86 ms.  Throughput: 1161.03 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=768, clm=1 (1 core, 1 worker):  0.82 ms.  Throughput: 1221.39 iter/sec.
FFTlen=2352K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3136, clm=4 (1 core, 1 worker):  0.86 ms.  Throughput: 1159.94 iter/sec.
FFTlen=2352K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3136, clm=2 (1 core, 1 worker):  0.87 ms.  Throughput: 1150.88 iter/sec.
FFTlen=2352K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3136, clm=1 (1 core, 1 worker):  0.83 ms.  Throughput: 1201.72 iter/sec.
FFTlen=2352K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2688, clm=4 (1 core, 1 worker):  0.86 ms.  Throughput: 1168.02 iter/sec.
[Sun Mar 27 19:15:57 2022]
FFTlen=2352K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2688, clm=2 (1 core, 1 worker):  0.87 ms.  Throughput: 1151.76 iter/sec.
FFTlen=2352K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2688, clm=1 (1 core, 1 worker):  0.88 ms.  Throughput: 1135.27 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=192, Pass2=12800, clm=4 (1 core, 1 worker):  0.82 ms.  Throughput: 1217.39 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=192, Pass2=12800, clm=2 (1 core, 1 worker):  0.83 ms.  Throughput: 1206.57 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=192, Pass2=12800, clm=1 (1 core, 1 worker):  0.83 ms.  Throughput: 1202.11 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3840, clm=4 (1 core, 1 worker):  0.79 ms.  Throughput: 1257.90 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3840, clm=2 (1 core, 1 worker):  0.79 ms.  Throughput: 1272.42 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3840, clm=1 (1 core, 1 worker):  0.80 ms.  Throughput: 1244.00 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3200, clm=4 (1 core, 1 worker):  0.90 ms.  Throughput: 1114.88 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3200, clm=2 (1 core, 1 worker):  0.87 ms.  Throughput: 1153.50 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3200, clm=1 (1 core, 1 worker):  0.85 ms.  Throughput: 1183.31 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2560, clm=4 (1 core, 1 worker):  0.92 ms.  Throughput: 1084.00 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2560, clm=2 (1 core, 1 worker):  0.87 ms.  Throughput: 1146.49 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2560, clm=1 (1 core, 1 worker):  0.87 ms.  Throughput: 1148.81 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=1920, clm=2 (1 core, 1 worker):  0.89 ms.  Throughput: 1122.56 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=1920, clm=1 (1 core, 1 worker):  0.91 ms.  Throughput: 1095.05 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=1600, clm=2 (1 core, 1 worker):  0.94 ms.  Throughput: 1065.88 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=1600, clm=1 (1 core, 1 worker):  0.92 ms.  Throughput: 1090.95 iter/sec.
FFTlen=2520K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2688, clm=4 (1 core, 1 worker):  0.96 ms.  Throughput: 1039.11 iter/sec.
FFTlen=2520K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2688, clm=2 (1 core, 1 worker):  0.94 ms.  Throughput: 1069.18 iter/sec.
FFTlen=2520K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2688, clm=1 (1 core, 1 worker):  0.90 ms.  Throughput: 1107.23 iter/sec.
FFTlen=2520K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2240, clm=2 (1 core, 1 worker):  0.91 ms.  Throughput: 1095.19 iter/sec.
FFTlen=2520K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2240, clm=1 (1 core, 1 worker):  0.91 ms.  Throughput: 1101.32 iter/sec.
FFTlen=2520K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=1920, clm=2 (1 core, 1 worker):  0.96 ms.  Throughput: 1037.17 iter/sec.
FFTlen=2520K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=1920, clm=1 (1 core, 1 worker):  0.94 ms.  Throughput: 1060.17 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=128, Pass2=20480, clm=4 (1 core, 1 worker):  0.86 ms.  Throughput: 1160.90 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=128, Pass2=20480, clm=2 (1 core, 1 worker):  0.90 ms.  Throughput: 1111.83 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=128, Pass2=20480, clm=1 (1 core, 1 worker):  0.95 ms.  Throughput: 1048.01 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4096, clm=4 (1 core, 1 worker):  0.87 ms.  Throughput: 1150.13 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4096, clm=2 (1 core, 1 worker):  0.83 ms.  Throughput: 1204.47 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4096, clm=1 (1 core, 1 worker):  0.86 ms.  Throughput: 1166.05 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2560, clm=2 (1 core, 1 worker):  0.90 ms.  Throughput: 1108.28 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2560, clm=1 (1 core, 1 worker):  0.89 ms.  Throughput: 1122.17 iter/sec.
FFTlen=2592K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2304, clm=2 (1 core, 1 worker):  0.94 ms.  Throughput: 1063.08 iter/sec.
FFTlen=2592K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2304, clm=1 (1 core, 1 worker):  0.92 ms.  Throughput: 1088.26 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=128, Pass2=21504, clm=4 (1 core, 1 worker):  0.91 ms.  Throughput: 1101.60 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=128, Pass2=21504, clm=2 (1 core, 1 worker):  0.94 ms.  Throughput: 1066.55 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=128, Pass2=21504, clm=1 (1 core, 1 worker):  1.02 ms.  Throughput: 979.08 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3584, clm=4 (1 core, 1 worker):  0.94 ms.  Throughput: 1066.03 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3584, clm=2 (1 core, 1 worker):  0.90 ms.  Throughput: 1106.35 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3584, clm=1 (1 core, 1 worker):  0.92 ms.  Throughput: 1082.23 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3072, clm=4 (1 core, 1 worker):  0.92 ms.  Throughput: 1088.25 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3072, clm=2 (1 core, 1 worker):  0.91 ms.  Throughput: 1098.26 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3072, clm=1 (1 core, 1 worker):  0.93 ms.  Throughput: 1076.99 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2688, clm=2 (1 core, 1 worker):  0.98 ms.  Throughput: 1023.93 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2688, clm=1 (1 core, 1 worker):  0.95 ms.  Throughput: 1048.73 iter/sec.
FFTlen=2744K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3136, clm=4 (1 core, 1 worker):  1.02 ms.  Throughput: 976.94 iter/sec.
FFTlen=2744K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3136, clm=2 (1 core, 1 worker):  1.01 ms.  Throughput: 986.58 iter/sec.
FFTlen=2744K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3136, clm=1 (1 core, 1 worker):  1.02 ms.  Throughput: 976.67 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4480, clm=4 (1 core, 1 worker):  1.00 ms.  Throughput: 1000.86 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4480, clm=2 (1 core, 1 worker):  0.99 ms.  Throughput: 1014.66 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4480, clm=1 (1 core, 1 worker):  0.99 ms.  Throughput: 1007.22 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3200, clm=4 (1 core, 1 worker):  1.03 ms.  Throughput: 966.31 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3200, clm=2 (1 core, 1 worker):  1.01 ms.  Throughput: 992.26 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3200, clm=1 (1 core, 1 worker):  1.03 ms.  Throughput: 970.36 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2240, clm=2 (1 core, 1 worker):  1.02 ms.  Throughput: 982.69 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2240, clm=1 (1 core, 1 worker):  1.06 ms.  Throughput: 943.26 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=192, Pass2=15360, clm=4 (1 core, 1 worker):  0.99 ms.  Throughput: 1010.13 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=192, Pass2=15360, clm=2 (1 core, 1 worker):  1.01 ms.  Throughput: 994.99 iter/sec.
[Sun Mar 27 19:21:00 2022]
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=192, Pass2=15360, clm=1 (1 core, 1 worker):  1.01 ms.  Throughput: 994.10 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4608, clm=4 (1 core, 1 worker):  0.99 ms.  Throughput: 1014.18 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4608, clm=2 (1 core, 1 worker):  0.98 ms.  Throughput: 1020.38 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4608, clm=1 (1 core, 1 worker):  0.98 ms.  Throughput: 1023.48 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3840, clm=4 (1 core, 1 worker):  0.97 ms.  Throughput: 1031.53 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3840, clm=2 (1 core, 1 worker):  0.98 ms.  Throughput: 1018.25 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3840, clm=1 (1 core, 1 worker):  0.97 ms.  Throughput: 1032.05 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3072, clm=4 (1 core, 1 worker):  1.03 ms.  Throughput: 973.05 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3072, clm=2 (1 core, 1 worker):  1.00 ms.  Throughput: 1001.04 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3072, clm=1 (1 core, 1 worker):  1.00 ms.  Throughput: 1004.08 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2560, clm=2 (1 core, 1 worker):  1.03 ms.  Throughput: 972.71 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2560, clm=1 (1 core, 1 worker):  1.01 ms.  Throughput: 988.57 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2304, clm=2 (1 core, 1 worker):  1.06 ms.  Throughput: 944.73 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2304, clm=1 (1 core, 1 worker):  1.04 ms.  Throughput: 959.89 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=1920, clm=2 (1 core, 1 worker):  1.11 ms.  Throughput: 903.03 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=1920, clm=1 (1 core, 1 worker):  1.07 ms.  Throughput: 930.50 iter/sec.
FFTlen=2940K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3136, clm=4 (1 core, 1 worker):  1.10 ms.  Throughput: 911.80 iter/sec.
FFTlen=2940K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3136, clm=2 (1 core, 1 worker):  1.06 ms.  Throughput: 939.18 iter/sec.
FFTlen=2940K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3136, clm=1 (1 core, 1 worker):  1.08 ms.  Throughput: 925.43 iter/sec.
FFTlen=2940K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2240, clm=2 (1 core, 1 worker):  1.11 ms.  Throughput: 902.62 iter/sec.
FFTlen=2940K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2240, clm=1 (1 core, 1 worker):  1.11 ms.  Throughput: 897.28 iter/sec.
FFTlen=3000K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3200, clm=4 (1 core, 1 worker):  1.09 ms.  Throughput: 917.64 iter/sec.
FFTlen=3000K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3200, clm=2 (1 core, 1 worker):  1.09 ms.  Throughput: 918.39 iter/sec.
FFTlen=3000K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3200, clm=1 (1 core, 1 worker):  1.10 ms.  Throughput: 913.14 iter/sec.
FFTlen=3000K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=1600, clm=2 (1 core, 1 worker):  1.22 ms.  Throughput: 823.02 iter/sec.
FFTlen=3000K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=1600, clm=1 (1 core, 1 worker):  1.13 ms.  Throughput: 883.72 iter/sec.
FFTlen=3024K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2688, clm=2 (1 core, 1 worker):  1.09 ms.  Throughput: 918.52 iter/sec.
FFTlen=3024K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2688, clm=1 (1 core, 1 worker):  1.12 ms.  Throughput: 895.72 iter/sec.
FFTlen=3024K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2304, clm=2 (1 core, 1 worker):  1.12 ms.  Throughput: 893.46 iter/sec.
FFTlen=3024K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2304, clm=1 (1 core, 1 worker):  1.13 ms.  Throughput: 883.67 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=128, Pass2=24576, clm=4 (1 core, 1 worker):  1.03 ms.  Throughput: 968.16 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=128, Pass2=24576, clm=2 (1 core, 1 worker):  1.09 ms.  Throughput: 919.66 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=128, Pass2=24576, clm=1 (1 core, 1 worker):  1.16 ms.  Throughput: 858.44 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=192, Pass2=16384, clm=4 (1 core, 1 worker):  0.98 ms.  Throughput: 1016.52 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=192, Pass2=16384, clm=2 (1 core, 1 worker):  0.97 ms.  Throughput: 1036.16 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=192, Pass2=16384, clm=1 (1 core, 1 worker):  1.00 ms.  Throughput: 999.56 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4096, clm=4 (1 core, 1 worker):  1.06 ms.  Throughput: 940.77 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4096, clm=2 (1 core, 1 worker):  1.03 ms.  Throughput: 971.78 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4096, clm=1 (1 core, 1 worker):  1.02 ms.  Throughput: 980.02 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3072, clm=2 (1 core, 1 worker):  1.04 ms.  Throughput: 964.86 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3072, clm=1 (1 core, 1 worker):  1.04 ms.  Throughput: 962.62 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=1024, clm=1 (1 core, 1 worker):  1.19 ms.  Throughput: 843.82 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=128, Pass2=25088, clm=4 (1 core, 1 worker):  1.09 ms.  Throughput: 914.89 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=128, Pass2=25088, clm=2 (1 core, 1 worker):  1.12 ms.  Throughput: 894.21 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=128, Pass2=25088, clm=1 (1 core, 1 worker):  1.19 ms.  Throughput: 840.65 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3584, clm=4 (1 core, 1 worker):  1.13 ms.  Throughput: 886.27 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3584, clm=2 (1 core, 1 worker):  1.09 ms.  Throughput: 914.10 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3584, clm=1 (1 core, 1 worker):  1.10 ms.  Throughput: 912.60 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3136, clm=2 (1 core, 1 worker):  1.15 ms.  Throughput: 868.07 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3136, clm=1 (1 core, 1 worker):  1.12 ms.  Throughput: 892.16 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=640, Pass2=5120, clm=4 (1 core, 1 worker):  1.07 ms.  Throughput: 934.44 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=640, Pass2=5120, clm=2 (1 core, 1 worker):  1.09 ms.  Throughput: 915.35 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=640, Pass2=5120, clm=1 (1 core, 1 worker):  1.09 ms.  Throughput: 916.03 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3200, clm=2 (1 core, 1 worker):  1.16 ms.  Throughput: 864.52 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3200, clm=1 (1 core, 1 worker):  1.15 ms.  Throughput: 872.21 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2560, clm=2 (1 core, 1 worker):  1.16 ms.  Throughput: 859.35 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2560, clm=1 (1 core, 1 worker):  1.15 ms.  Throughput: 872.60 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=1600, clm=1 (1 core, 1 worker):  1.19 ms.  Throughput: 838.23 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=192, Pass2=17920, clm=4 (1 core, 1 worker):  1.18 ms.  Throughput: 848.14 iter/sec.
[Sun Mar 27 19:26:05 2022]
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=192, Pass2=17920, clm=2 (1 core, 1 worker):  1.19 ms.  Throughput: 843.30 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=192, Pass2=17920, clm=1 (1 core, 1 worker):  1.21 ms.  Throughput: 825.37 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=640, Pass2=5376, clm=4 (1 core, 1 worker):  1.15 ms.  Throughput: 871.80 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=640, Pass2=5376, clm=2 (1 core, 1 worker):  1.13 ms.  Throughput: 881.51 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=640, Pass2=5376, clm=1 (1 core, 1 worker):  1.16 ms.  Throughput: 860.93 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4480, clm=4 (1 core, 1 worker):  1.20 ms.  Throughput: 835.16 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4480, clm=2 (1 core, 1 worker):  1.19 ms.  Throughput: 837.77 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4480, clm=1 (1 core, 1 worker):  1.18 ms.  Throughput: 850.72 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3840, clm=4 (1 core, 1 worker):  1.16 ms.  Throughput: 865.73 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3840, clm=2 (1 core, 1 worker):  1.15 ms.  Throughput: 868.80 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3840, clm=1 (1 core, 1 worker):  1.18 ms.  Throughput: 846.59 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3584, clm=4 (1 core, 1 worker):  1.22 ms.  Throughput: 819.21 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3584, clm=2 (1 core, 1 worker):  1.17 ms.  Throughput: 854.76 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3584, clm=1 (1 core, 1 worker):  1.18 ms.  Throughput: 848.26 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2688, clm=2 (1 core, 1 worker):  1.19 ms.  Throughput: 837.40 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2688, clm=1 (1 core, 1 worker):  1.23 ms.  Throughput: 811.18 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2560, clm=2 (1 core, 1 worker):  1.24 ms.  Throughput: 805.15 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2560, clm=1 (1 core, 1 worker):  1.22 ms.  Throughput: 820.19 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2240, clm=2 (1 core, 1 worker):  1.25 ms.  Throughput: 799.50 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2240, clm=1 (1 core, 1 worker):  1.22 ms.  Throughput: 819.88 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=192, Pass2=18432, clm=4 (1 core, 1 worker):  1.18 ms.  Throughput: 844.28 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=192, Pass2=18432, clm=2 (1 core, 1 worker):  1.19 ms.  Throughput: 838.91 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=192, Pass2=18432, clm=1 (1 core, 1 worker):  1.23 ms.  Throughput: 816.09 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4608, clm=4 (1 core, 1 worker):  1.18 ms.  Throughput: 847.55 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4608, clm=2 (1 core, 1 worker):  1.17 ms.  Throughput: 855.66 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4608, clm=1 (1 core, 1 worker):  1.18 ms.  Throughput: 847.56 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3072, clm=2 (1 core, 1 worker):  1.19 ms.  Throughput: 841.62 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3072, clm=1 (1 core, 1 worker):  1.18 ms.  Throughput: 848.84 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2304, clm=2 (1 core, 1 worker):  1.28 ms.  Throughput: 780.70 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2304, clm=1 (1 core, 1 worker):  1.25 ms.  Throughput: 803.03 iter/sec.
FFTlen=3528K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3136, clm=2 (1 core, 1 worker):  1.29 ms.  Throughput: 776.93 iter/sec.
FFTlen=3528K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3136, clm=1 (1 core, 1 worker):  1.29 ms.  Throughput: 778.19 iter/sec.
FFTlen=3528K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2688, clm=2 (1 core, 1 worker):  1.32 ms.  Throughput: 759.09 iter/sec.
FFTlen=3528K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2688, clm=1 (1 core, 1 worker):  1.29 ms.  Throughput: 773.00 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=128, Pass2=28672, clm=4 (1 core, 1 worker):  1.25 ms.  Throughput: 802.15 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=128, Pass2=28672, clm=2 (1 core, 1 worker):  1.29 ms.  Throughput: 772.34 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=128, Pass2=28672, clm=1 (1 core, 1 worker):  1.34 ms.  Throughput: 745.92 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4096, clm=4 (1 core, 1 worker):  1.24 ms.  Throughput: 807.41 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4096, clm=2 (1 core, 1 worker):  1.24 ms.  Throughput: 808.15 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4096, clm=1 (1 core, 1 worker):  1.25 ms.  Throughput: 797.79 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3584, clm=2 (1 core, 1 worker):  1.25 ms.  Throughput: 800.30 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3584, clm=1 (1 core, 1 worker):  1.22 ms.  Throughput: 822.68 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3840, clm=4 (1 core, 1 worker):  1.29 ms.  Throughput: 776.08 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3840, clm=2 (1 core, 1 worker):  1.25 ms.  Throughput: 799.45 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3840, clm=1 (1 core, 1 worker):  1.27 ms.  Throughput: 789.32 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3200, clm=2 (1 core, 1 worker):  1.31 ms.  Throughput: 765.44 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3200, clm=1 (1 core, 1 worker):  1.28 ms.  Throughput: 780.14 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=1920, clm=2 (1 core, 1 worker):  1.40 ms.  Throughput: 713.39 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=1920, clm=1 (1 core, 1 worker):  1.35 ms.  Throughput: 741.33 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=1600, clm=1 (1 core, 1 worker):  1.39 ms.  Throughput: 718.64 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=128, Pass2=30720, clm=4 (1 core, 1 worker):  1.35 ms.  Throughput: 739.52 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=128, Pass2=30720, clm=2 (1 core, 1 worker):  1.36 ms.  Throughput: 737.32 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=128, Pass2=30720, clm=1 (1 core, 1 worker):  1.47 ms.  Throughput: 682.10 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=192, Pass2=20480, clm=4 (1 core, 1 worker):  1.35 ms.  Throughput: 741.79 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=192, Pass2=20480, clm=2 (1 core, 1 worker):  1.36 ms.  Throughput: 735.06 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=192, Pass2=20480, clm=1 (1 core, 1 worker):  1.40 ms.  Throughput: 715.40 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=640, Pass2=6144, clm=4 (1 core, 1 worker):  1.33 ms.  Throughput: 749.36 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=640, Pass2=6144, clm=2 (1 core, 1 worker):  1.27 ms.  Throughput: 784.78 iter/sec.
[Sun Mar 27 19:31:06 2022]
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=640, Pass2=6144, clm=1 (1 core, 1 worker):  1.32 ms.  Throughput: 757.66 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=768, Pass2=5120, clm=4 (1 core, 1 worker):  1.35 ms.  Throughput: 742.37 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=768, Pass2=5120, clm=2 (1 core, 1 worker):  1.33 ms.  Throughput: 753.72 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=768, Pass2=5120, clm=1 (1 core, 1 worker):  1.30 ms.  Throughput: 767.07 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4096, clm=4 (1 core, 1 worker):  1.38 ms.  Throughput: 722.89 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4096, clm=2 (1 core, 1 worker):  1.34 ms.  Throughput: 746.06 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4096, clm=1 (1 core, 1 worker):  1.35 ms.  Throughput: 738.36 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3840, clm=2 (1 core, 1 worker):  1.33 ms.  Throughput: 751.59 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3840, clm=1 (1 core, 1 worker):  1.30 ms.  Throughput: 766.46 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3072, clm=2 (1 core, 1 worker):  1.32 ms.  Throughput: 755.91 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3072, clm=1 (1 core, 1 worker):  1.36 ms.  Throughput: 735.65 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2560, clm=2 (1 core, 1 worker):  1.42 ms.  Throughput: 704.79 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2560, clm=1 (1 core, 1 worker):  1.39 ms.  Throughput: 721.32 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=1920, clm=1 (1 core, 1 worker):  1.39 ms.  Throughput: 718.03 iter/sec.
FFTlen=3920K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4480, clm=4 (1 core, 1 worker):  1.44 ms.  Throughput: 692.31 iter/sec.
FFTlen=3920K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4480, clm=2 (1 core, 1 worker):  1.45 ms.  Throughput: 689.97 iter/sec.
FFTlen=3920K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4480, clm=1 (1 core, 1 worker):  1.46 ms.  Throughput: 683.77 iter/sec.
FFTlen=3920K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3136, clm=2 (1 core, 1 worker):  1.44 ms.  Throughput: 692.57 iter/sec.
FFTlen=3920K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3136, clm=1 (1 core, 1 worker):  1.46 ms.  Throughput: 686.07 iter/sec.
FFTlen=4000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=6400, clm=4 (1 core, 1 worker):  1.40 ms.  Throughput: 716.06 iter/sec.
FFTlen=4000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=6400, clm=2 (1 core, 1 worker):  1.39 ms.  Throughput: 720.40 iter/sec.
FFTlen=4000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=6400, clm=1 (1 core, 1 worker):  1.40 ms.  Throughput: 716.77 iter/sec.
FFTlen=4000K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3200, clm=2 (1 core, 1 worker):  1.45 ms.  Throughput: 691.63 iter/sec.
FFTlen=4000K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3200, clm=1 (1 core, 1 worker):  1.49 ms.  Throughput: 670.92 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=192, Pass2=21504, clm=4 (1 core, 1 worker):  1.44 ms.  Throughput: 696.09 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=192, Pass2=21504, clm=2 (1 core, 1 worker):  1.45 ms.  Throughput: 688.63 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=192, Pass2=21504, clm=1 (1 core, 1 worker):  1.48 ms.  Throughput: 677.13 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=768, Pass2=5376, clm=4 (1 core, 1 worker):  1.42 ms.  Throughput: 705.45 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=768, Pass2=5376, clm=2 (1 core, 1 worker):  1.42 ms.  Throughput: 705.94 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=768, Pass2=5376, clm=1 (1 core, 1 worker):  1.41 ms.  Throughput: 707.64 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4608, clm=4 (1 core, 1 worker):  1.43 ms.  Throughput: 701.30 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4608, clm=2 (1 core, 1 worker):  1.43 ms.  Throughput: 699.97 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4608, clm=1 (1 core, 1 worker):  1.42 ms.  Throughput: 704.28 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3584, clm=2 (1 core, 1 worker):  1.42 ms.  Throughput: 702.09 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3584, clm=1 (1 core, 1 worker):  1.42 ms.  Throughput: 705.31 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3072, clm=2 (1 core, 1 worker):  1.43 ms.  Throughput: 700.91 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3072, clm=1 (1 core, 1 worker):  1.45 ms.  Throughput: 689.72 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2688, clm=2 (1 core, 1 worker):  1.53 ms.  Throughput: 652.45 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2688, clm=1 (1 core, 1 worker):  1.50 ms.  Throughput: 667.34 iter/sec.
FFTlen=4096K all-complex, Type=3, Arch=8, Pass1=128, Pass2=32768, clm=4 (1 core, 1 worker):  1.46 ms.  Throughput: 686.80 iter/sec.
FFTlen=4096K all-complex, Type=3, Arch=8, Pass1=128, Pass2=32768, clm=2 (1 core, 1 worker):  1.49 ms.  Throughput: 670.92 iter/sec.
FFTlen=4096K all-complex, Type=3, Arch=8, Pass1=128, Pass2=32768, clm=1 (1 core, 1 worker):  1.56 ms.  Throughput: 640.98 iter/sec.
FFTlen=4096K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=4096, clm=2 (1 core, 1 worker):  1.39 ms.  Throughput: 718.48 iter/sec.
FFTlen=4096K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=4096, clm=1 (1 core, 1 worker):  1.38 ms.  Throughput: 722.90 iter/sec.
FFTlen=4116K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3136, clm=2 (1 core, 1 worker):  1.60 ms.  Throughput: 623.26 iter/sec.
FFTlen=4116K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3136, clm=1 (1 core, 1 worker):  1.59 ms.  Throughput: 628.46 iter/sec.
FFTlen=4200K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4480, clm=4 (1 core, 1 worker):  1.61 ms.  Throughput: 622.67 iter/sec.
FFTlen=4200K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4480, clm=2 (1 core, 1 worker):  1.55 ms.  Throughput: 646.72 iter/sec.
FFTlen=4200K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4480, clm=1 (1 core, 1 worker):  1.55 ms.  Throughput: 644.29 iter/sec.
FFTlen=4200K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3200, clm=2 (1 core, 1 worker):  1.57 ms.  Throughput: 636.15 iter/sec.
FFTlen=4200K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3200, clm=1 (1 core, 1 worker):  1.59 ms.  Throughput: 630.80 iter/sec.
FFTlen=4200K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=2240, clm=2 (1 core, 1 worker):  1.68 ms.  Throughput: 595.58 iter/sec.
FFTlen=4200K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=2240, clm=1 (1 core, 1 worker):  1.61 ms.  Throughput: 619.52 iter/sec.
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4608, clm=4 (1 core, 1 worker):  1.59 ms.  Throughput: 627.00 iter/sec.
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4608, clm=2 (1 core, 1 worker):  1.53 ms.  Throughput: 653.04 iter/sec.
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4608, clm=1 (1 core, 1 worker):  1.53 ms.  Throughput: 651.97 iter/sec.
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3840, clm=2 (1 core, 1 worker):  1.53 ms.  Throughput: 652.06 iter/sec.
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3840, clm=1 (1 core, 1 worker):  1.51 ms.  Throughput: 662.05 iter/sec.
[Sun Mar 27 19:36:07 2022]
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=2304, clm=2 (1 core, 1 worker):  1.69 ms.  Throughput: 591.65 iter/sec.
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=2304, clm=1 (1 core, 1 worker):  1.61 ms.  Throughput: 623.04 iter/sec.
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=1920, clm=1 (1 core, 1 worker):  1.64 ms.  Throughput: 608.34 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=640, Pass2=7168, clm=4 (1 core, 1 worker):  1.56 ms.  Throughput: 641.42 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=640, Pass2=7168, clm=2 (1 core, 1 worker):  1.54 ms.  Throughput: 650.74 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=640, Pass2=7168, clm=1 (1 core, 1 worker):  1.53 ms.  Throughput: 651.92 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=896, Pass2=5120, clm=4 (1 core, 1 worker):  1.64 ms.  Throughput: 611.59 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=896, Pass2=5120, clm=2 (1 core, 1 worker):  1.57 ms.  Throughput: 635.23 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=896, Pass2=5120, clm=1 (1 core, 1 worker):  1.65 ms.  Throughput: 607.70 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=4480, clm=2 (1 core, 1 worker):  1.60 ms.  Throughput: 623.07 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=4480, clm=1 (1 core, 1 worker):  1.66 ms.  Throughput: 604.17 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3584, clm=2 (1 core, 1 worker):  1.63 ms.  Throughput: 613.34 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3584, clm=1 (1 core, 1 worker):  1.66 ms.  Throughput: 602.35 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=2240, clm=1 (1 core, 1 worker):  1.64 ms.  Throughput: 610.04 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=192, Pass2=24576, clm=4 (1 core, 1 worker):  1.68 ms.  Throughput: 593.77 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=192, Pass2=24576, clm=2 (1 core, 1 worker):  1.68 ms.  Throughput: 596.64 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=192, Pass2=24576, clm=1 (1 core, 1 worker):  1.75 ms.  Throughput: 570.28 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=768, Pass2=6144, clm=4 (1 core, 1 worker):  1.64 ms.  Throughput: 608.17 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=768, Pass2=6144, clm=2 (1 core, 1 worker):  1.59 ms.  Throughput: 629.71 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=768, Pass2=6144, clm=1 (1 core, 1 worker):  1.61 ms.  Throughput: 621.90 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=4608, clm=2 (1 core, 1 worker):  1.59 ms.  Throughput: 627.67 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=4608, clm=1 (1 core, 1 worker):  1.63 ms.  Throughput: 614.96 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=4096, clm=2 (1 core, 1 worker):  1.62 ms.  Throughput: 615.75 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=4096, clm=1 (1 core, 1 worker):  1.63 ms.  Throughput: 614.93 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3072, clm=2 (1 core, 1 worker):  1.66 ms.  Throughput: 603.38 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3072, clm=1 (1 core, 1 worker):  1.63 ms.  Throughput: 612.19 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=2304, clm=1 (1 core, 1 worker):  1.69 ms.  Throughput: 590.89 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=192, Pass2=25088, clm=4 (1 core, 1 worker):  1.76 ms.  Throughput: 567.53 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=192, Pass2=25088, clm=2 (1 core, 1 worker):  1.78 ms.  Throughput: 561.34 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=192, Pass2=25088, clm=1 (1 core, 1 worker):  1.76 ms.  Throughput: 566.65 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=896, Pass2=5376, clm=4 (1 core, 1 worker):  1.74 ms.  Throughput: 574.28 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=896, Pass2=5376, clm=2 (1 core, 1 worker):  1.74 ms.  Throughput: 573.58 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=896, Pass2=5376, clm=1 (1 core, 1 worker):  1.74 ms.  Throughput: 573.94 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3584, clm=2 (1 core, 1 worker):  1.78 ms.  Throughput: 560.59 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3584, clm=1 (1 core, 1 worker):  1.73 ms.  Throughput: 579.57 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3136, clm=2 (1 core, 1 worker):  1.81 ms.  Throughput: 551.31 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3136, clm=1 (1 core, 1 worker):  1.83 ms.  Throughput: 546.85 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=640, Pass2=7680, clm=4 (1 core, 1 worker):  1.69 ms.  Throughput: 590.67 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=640, Pass2=7680, clm=2 (1 core, 1 worker):  1.66 ms.  Throughput: 602.16 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=640, Pass2=7680, clm=1 (1 core, 1 worker):  1.71 ms.  Throughput: 586.20 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=768, Pass2=6400, clm=4 (1 core, 1 worker):  1.77 ms.  Throughput: 566.36 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=768, Pass2=6400, clm=2 (1 core, 1 worker):  1.72 ms.  Throughput: 581.13 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=768, Pass2=6400, clm=1 (1 core, 1 worker):  1.69 ms.  Throughput: 590.64 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=960, Pass2=5120, clm=4 (1 core, 1 worker):  1.80 ms.  Throughput: 554.67 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=960, Pass2=5120, clm=2 (1 core, 1 worker):  1.75 ms.  Throughput: 572.72 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=960, Pass2=5120, clm=1 (1 core, 1 worker):  1.73 ms.  Throughput: 579.23 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3840, clm=2 (1 core, 1 worker):  1.74 ms.  Throughput: 574.98 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3840, clm=1 (1 core, 1 worker):  1.77 ms.  Throughput: 565.66 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3200, clm=2 (1 core, 1 worker):  1.84 ms.  Throughput: 544.18 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3200, clm=1 (1 core, 1 worker):  1.79 ms.  Throughput: 559.18 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=2560, clm=2 (1 core, 1 worker):  1.86 ms.  Throughput: 537.00 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=2560, clm=1 (1 core, 1 worker):  1.80 ms.  Throughput: 556.08 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=1600, clm=1 (1 core, 1 worker):  1.97 ms.  Throughput: 508.33 iter/sec.

Last fiddled with by Uncwilly on 2022-03-29 at 02:36 Reason: changed quote to code