I started using VTune recently. For my application, the Intel Tuning Assistant gives me the following data:
Streaming SIMD Extensions (SSE) Input Assists Performance Impact: 119.67
1st Level Cache Load Miss Performance Impact: 9.12
Clockticks per Instructions Retired (CPI): 3.39
Trace Cache (TC) Miss Performance Impact: 2.03
(1) How can the SSE Input Assists Performance Impact have such a large value? From the definition of this parameter, I was expecting a number below 5. The value for 1st Level Cache Load Miss Performance Impact also seems too high. Should I trust these numbers?
(2) I'm using the Intel C++ compiler in MS Visual Studio 2000. I have already set "Floating Point Precision Improvement" to "None". Is this sufficient to enable the FTZ (flush-to-zero) and DAZ (denormals-are-zero) modes? How can I further reduce the SSE Input Assists Performance Impact?
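For reference, this is a minimal sketch of how I understand FTZ and DAZ could be set explicitly from code instead of relying on the compiler option. It sets the two control bits in the MXCSR register through the `_mm_getcsr` / `_mm_setcsr` intrinsics. The helper name `enable_ftz_daz` and the constant names are my own, not part of any Intel API, and the sketch assumes a CPU that supports the DAZ bit.

```cpp
#include <xmmintrin.h>  // _mm_getcsr, _mm_setcsr

// MXCSR control bits as documented in Intel's architecture manuals.
const unsigned int kMxcsrFtz = 0x8000u;  // bit 15: flush-to-zero (denormal results become 0)
const unsigned int kMxcsrDaz = 0x0040u;  // bit 6: denormals-are-zero (denormal inputs read as 0)

// Hypothetical helper: turns on FTZ and DAZ for the calling thread and
// returns the MXCSR value read back afterwards.
// Assumption: the CPU supports DAZ; on very early SSE processors
// setting that bit raises a fault.
inline unsigned int enable_ftz_daz() {
    unsigned int csr = _mm_getcsr();   // read the current control/status word
    csr |= kMxcsrFtz | kMxcsrDaz;      // set both bits, leave the others untouched
    _mm_setcsr(csr);                   // write it back
    return _mm_getcsr();
}
```

Since MXCSR is per-thread state, a call like this would presumably be needed at the start of every thread that runs the SSE code, and only affects SSE arithmetic, not x87.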