If this is the wrong forum, I apologize - it's the closest match I could find for my question.
I'm trying to find out how many clock cycles are required for various double-precision operations, both in their simple forms and in their SSE and (if applicable) AVX forms. For example, I'd like to understand the relative costs of double-precision comparisons, multiplications, and divisions on Intel's recent processors (Core 2 Duo up through the i7).
Can anyone point me in the right direction?
Thanks very much,