2020-08-15, 07:23   #21
frmky
 
 
Jul 2003
So Cal


Here's a benchmark using compute nodes that each have one Xeon E5-2650 v4 (Broadwell) CPU with 12 cores and 24 threads.

1 node 7h 40m
2 nodes 2h 45m
4 nodes 1h 35m
8 nodes 1h 10m

I'm not sure why the time for one node is so high compared to the others. Perhaps the smaller per-node matrices in the multi-node runs fit into the cache?
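
To put numbers on how far the one-node run is out of line, here is a rough sketch that turns the timings above into speedup and parallel efficiency. The helper name `scaling` and the choice to also compare against the 2-node run are my own assumptions for illustration; nothing here comes from the benchmarked program itself.

```python
# Wall-clock timings copied from the post, converted to minutes.
timings_min = {1: 7 * 60 + 40, 2: 2 * 60 + 45, 4: 1 * 60 + 35, 8: 1 * 60 + 10}


def scaling(timings, base_nodes):
    """Return {nodes: (speedup, efficiency)} relative to the run on base_nodes.

    Hypothetical helper: speedup is base_time / time, and efficiency is
    speedup divided by the factor by which the node count grew.
    """
    base_time = timings[base_nodes]
    result = {}
    for nodes, minutes in sorted(timings.items()):
        if nodes < base_nodes:
            continue  # only compare against runs at least as large as the baseline
        speedup = base_time / minutes
        efficiency = speedup / (nodes / base_nodes)
        result[nodes] = (speedup, efficiency)
    return result


if __name__ == "__main__":
    for base in (1, 2):
        print(f"baseline: {base} node(s)")
        for nodes, (s, e) in scaling(timings_min, base).items():
            print(f"  {nodes} nodes: speedup {s:.2f}x, efficiency {e:.0%}")
```

Against the 1-node baseline, the efficiency is above 100% at 2 and 4 nodes (roughly 139% and 121%), i.e. superlinear scaling. Against the 2-node baseline it falls to roughly 87% at 4 nodes and 59% at 8 nodes, which looks like ordinary scaling loss. That is at least consistent with the one-node time being the outlier, though it doesn't say whether cache is the cause.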