- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hey guys,
simple question. Does the PHI has FMA3?
if yes:
do i need to be aware of the FMA3 destructive operand form using FMA with Intel Intrinsics like:
Thanks,
Patrick
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Yes, FMA3 is supported. From what I see in the Intel® Xeon Phi™ Coprocessor Instruction set reference (available on the Intel® Xeon Phi™ Coprocessor site) the intrinsic uses the underlying VFMADD132PS instruction with an MVEX opcode prefix containing NDS indicating a non-destructive source register specifier, which is the first source operand in the 3-operand instruction syntax as per the discussion of the MVEX Prefix discussion.
I will double check with Developers.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Development clarified this is not limited to the VFMADD132PS instruction. They noted:
"The compiler will choose one of the forms of FMA instructions (VFMADD132PS, VFMADD213PS, VFMADD231PS), depending on the context, to generate optimal code.
So, from the user's perspective, the statement ‘x=_mm512_fmadd_ps(a, b, c)’ means just ‘x=a*b+c’, no matter which operand is destructive in the underlying instruction."
Hope that helps.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Thanks for your excellent answer :)

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page