Node: 4 socket Intel ® Xeon® Gold 6148 Scalable Processors（20Core,150W,2.4GHz）with DRAM 256G
Cluster: 14 nodes
Linpack info: Ns 168000 NBs 384 Ps 1 Qs 1
run with : parallel_studio/mkl/benchmarks/mp_linpack/xhpl_intel64_static
The Max peak for each node is around 4Tflops, compared with the theoretical peak is 6.144Tflops.
The efficiency is 65%, is it low?