- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I am writing a code which involves calculating the dot product of two vectors. Therefore, I tried to use the _mm_dp_ps inttrinsic available in sse4, but got an illegal instruction at runtime. There were no compilation errors.
The compiler I am using is icc and the processor is a xeon 5150, which according to processor specifications does support sse4, but a /proc/cpuinfo (in linux) on the same shows support only until sse2.
Can some one tell me why am I facing an illegal instruction and if at sse4 needs to be enabled explicitly, how do I do it?
Link Copied
3 Replies
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
There are several version of SSE4 - SSE4.1, SSE4.2.
My processors also "supports SSE4", but up to SSE4.1. I get similar errors when trying to use SSE4.2 instructions.
My processors also "supports SSE4", but up to SSE4.1. I get similar errors when trying to use SSE4.2 instructions.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
_mm_dp_ps requires SSE4.1
Xeon 5150 supports only SSE3
Xeon 5150 supports only SSE3
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
thanks dmitriy. I saw a processor comparision page which said that xeon 5150 actually supported sse4. but the intel specification page for xeon 5100 series states support only upto sse3. thanks bro.

Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page