Link Copied
Doesn't the formula also hold if hyperthreading is enabled? My understanding is that even with hyperthreading enabled, each core can still execute at most 4 flops per tick, so the formula for peak DP performance in FLOPS is still the same. Hyperthreading only improves performance when there are less than 4 instructions ready for a given thread in a clock cycle, right? Or am I misunderstanding hyperthreading (and if this is the case, is there an additional x2 for hyperthreading, or is the factor something else?)
Thanks.
That was easy to find:
http://software.intel.com/en-us/articles/intel-linpack-benchmark-download-license-agreement/
For more complete information about compiler optimizations, see our Optimization Notice.