Software Tuning, Performance Optimization & Platform Monitoring
Discussion regarding monitoring and software tuning methodologies, Performance Monitoring Unit (PMU) of Intel microprocessors, and platform updating.

Performance of Intel Assembly vs. Intel Intrinsics

Uday_Krishna__G_
942 Views

Dear All,

I am trying to optimize my codec application on an Intel platform. Before I start, are there any performance comparison numbers for Intel assembly vs. Intel intrinsics? I am in a dilemma about which of the two approaches to choose.

3 Replies
Bernard
Valued Contributor I

If the intrinsic compiles directly to a single machine instruction, you will probably not see any performance difference between inline assembly and the compiler intrinsic.
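To make that concrete, here is a minimal sketch that writes the same packed-float addition both ways. It assumes an x86-64 target and GCC or Clang (the inline assembly uses their extended `asm` syntax, not the MSVC one). The helper names `add4_intrinsic`, `add4_asm` and `add_arrays` are made up for illustration. With optimization enabled, both helpers would normally come down to one ADDPS instruction, which you can check by compiling with `-O2 -S` and reading the assembly output.

```cpp
#include <cassert>
#include <cstddef>
#include <immintrin.h>  // SSE intrinsics (x86 only)

// Intrinsic form: adds four packed floats. Without AVX flags this
// normally compiles to a single ADDPS instruction.
inline __m128 add4_intrinsic(__m128 a, __m128 b) {
    return _mm_add_ps(a, b);
}

// Inline-assembly form (GCC/Clang extended asm, AT&T operand order):
// "addps %1, %0" computes a = a + b in an XMM register.
inline __m128 add4_asm(__m128 a, __m128 b) {
    __asm__("addps %1, %0" : "+x"(a) : "x"(b));
    return a;
}

// Hypothetical helper: element-wise sum of two float arrays.
// n must be a multiple of 4; unaligned loads/stores keep the sketch simple.
void add_arrays(const float* x, const float* y, float* out,
                std::size_t n, bool use_asm) {
    assert(n % 4 == 0);
    for (std::size_t i = 0; i < n; i += 4) {
        __m128 a = _mm_loadu_ps(x + i);
        __m128 b = _mm_loadu_ps(y + i);
        __m128 r = use_asm ? add4_asm(a, b) : add4_intrinsic(a, b);
        _mm_storeu_ps(out + i, r);
    }
}
```

Calling `add_arrays` with `use_asm` set to `true` or `false` should give identical results. Any timing difference on a real codec kernel would have to be measured on the target machine.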

TimP
Honored Contributor III

The compiler has more latitude to optimize intrinsics. For example, Intel C++ will choose more efficient equivalent instructions, or switch from SSE to AVX-128, when the architecture flag permits it.
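As a rough illustration of that latitude, the sketch below uses only 128-bit SSE intrinsics, and the source does not change between builds. It assumes GCC or Clang rather than Intel C++, and `scale_add4` is a hypothetical helper. Built with `-O2 -msse2`, those compilers typically emit legacy-encoded MULPS/ADDPS. Built with `-O2 -mavx`, they typically emit the VEX-encoded 128-bit forms (VMULPS/VADDPS) instead. The exact instruction selection is up to the compiler, so treat this as something to verify in the generated assembly.

```cpp
#include <cassert>
#include <immintrin.h>  // SSE intrinsics (x86 only)

// Hypothetical helper: out[i] = a[i] * s + b[i] for four floats.
// Only 128-bit intrinsics are used; the instruction encoding the
// compiler picks depends on the architecture flags, not on this source.
void scale_add4(const float* a, const float* b, float s, float* out) {
    assert(a != nullptr && b != nullptr && out != nullptr);
    __m128 va = _mm_loadu_ps(a);      // load 4 floats (unaligned is fine)
    __m128 vb = _mm_loadu_ps(b);
    __m128 vs = _mm_set1_ps(s);       // broadcast the scalar to all 4 lanes
    __m128 r  = _mm_add_ps(_mm_mul_ps(va, vs), vb);
    _mm_storeu_ps(out, r);
}
```

An inline-assembly version would pin the encoding in the source text, so moving from SSE to AVX would mean rewriting the assembly by hand.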

Bernard
Valued Contributor I

Bear in mind that inline assembly code will not be optimized by the compiler; at least, this is the case with the VS C++ compiler.
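Here is a small sketch of what "not optimized" can mean in practice. It uses GCC/Clang extended `asm` on x86-64 as an assumption, because the VS C++ `__asm` block is only available for 32-bit x86 targets. The function names are invented for the example. In the intrinsic version the operands are constants the compiler can see, so at `-O2` it can usually fold the whole computation into the constant 3.0f. In the assembly version the instruction is opaque to the optimizer: it has to load both operands into registers and execute ADDPS as written.

```cpp
#include <cassert>
#include <immintrin.h>  // SSE intrinsics (x86 only)

// Intrinsic version: the compiler understands _mm_add_ps and can
// typically evaluate 1.0f + 2.0f at compile time.
float sum_const_intrinsic() {
    __m128 r = _mm_add_ps(_mm_set1_ps(1.0f), _mm_set1_ps(2.0f));
    return _mm_cvtss_f32(r);  // extract lane 0
}

// Inline-assembly version: the asm statement is a black box, so the
// ADDPS instruction is emitted and executed at run time.
float sum_const_asm() {
    __m128 a = _mm_set1_ps(1.0f);
    __m128 b = _mm_set1_ps(2.0f);
    __asm__("addps %1, %0" : "+x"(a) : "x"(b));
    return _mm_cvtss_f32(a);
}

// Small self-check: both versions must agree on the value.
void check_equivalence() {
    assert(sum_const_intrinsic() == sum_const_asm());
}
```

Both functions return the same value. The difference only shows up in the generated code, which is why hand-written assembly has to be scheduled and simplified by the programmer.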
