Hi all,
although the compiler (10.1.012) docs mention it multiple times, the functions
F64vec2 rcp(F64vec2)
F64vec2 rcp_nr(F64vec2)
do not seem to exist - a search in dvec.h confirms this. They are present, however, for F32vec4. Is this a bug or intended?
Thanks for any insight,
Georg.
1 Reply
The rcpps instruction operates on four packed single-precision values in parallel, and SSE2 has no packed double-precision counterpart, so your finding doesn't surprise me. It may therefore be a documentation bug, which you could submit on premier.intel.com. However, the documentation of such functions is intentionally incomplete; too many changes occur between compiler versions for them to be recommended for application code. In view of the improved performance of IEEE divide on Penryn and later CPUs, I have been trying to avoid rcpps.
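For illustration, here is a minimal sketch of the two options using plain SSE/SSE2 intrinsics rather than the dvec.h classes. The helper names `rcp_pd_exact` and `rcp_ps_nr` are hypothetical (they are not part of dvec.h or any Intel header), and the Newton-Raphson step is only my assumption of what an `rcp_nr`-style function would do, not a copy of Intel's implementation.

```cpp
#include <xmmintrin.h>  // SSE:  __m128,  _mm_rcp_ps, _mm_mul_ps, _mm_sub_ps
#include <emmintrin.h>  // SSE2: __m128d, _mm_div_pd, _mm_set1_pd

// Hypothetical helper: reciprocal of two packed doubles.
// There is no rcppd instruction in SSE2, so this uses a full IEEE divide (divpd).
static inline __m128d rcp_pd_exact(__m128d x) {
    return _mm_div_pd(_mm_set1_pd(1.0), x);
}

// Hypothetical helper: reciprocal of four packed floats.
// rcpps gives an approximation y0 (about 12 bits); one Newton-Raphson step
// y1 = y0 * (2 - x * y0) roughly doubles the number of correct bits.
static inline __m128 rcp_ps_nr(__m128 x) {
    __m128 y0 = _mm_rcp_ps(x);
    __m128 t  = _mm_sub_ps(_mm_set1_ps(2.0f), _mm_mul_ps(x, y0));
    return _mm_mul_ps(y0, t);
}
```

The double-precision version is correctly rounded because it is an ordinary divide; the single-precision version is still an approximation after refinement, so it is only worth considering where the divide is a measured bottleneck.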