I am writing a code which involves calculating the dot product of two vectors. Therefore, I tried to use the _mm_dp_ps inttrinsic available in sse4, but got an illegal instruction at runtime. There were no compilation errors.
The compiler I am using is icc and the processor is a xeon 5150, which according to processor specifications does support sse4, but a /proc/cpuinfo (in linux) on the same shows support only until sse2.
Can some one tell me why am I facing an illegal instruction and if at sse4 needs to be enabled explicitly, how do I do it?