Software Tuning, Performance Optimization & Platform Monitoring
Discussion around monitoring and software tuning methodologies, Performance Monitoring Unit (PMU) of Intel microprocessors, and platform monitoring

Performance of Intel Assembly vs. Intel Intrinsics

Uday_Krishna__G_
177 Views

Dear All,

I am trying to optimize my codec application on an Intel platform. Before I start, are there any performance comparison numbers for Intel assembly vs. Intel intrinsics? I am in a dilemma over which of the two approaches to choose.

3 Replies
Bernard
Black Belt

If the specific intrinsic is compiled directly into a single machine instruction, you will probably not see any performance difference between inline assembly and the compiler intrinsic.
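
To make the comparison concrete, here is a minimal sketch, not a benchmark. It assumes a GCC- or Clang-compatible compiler on x86 with SSE enabled; the function names add4_intrinsic and add4_inline_asm and the DEMO_* macros are made up for illustration. Both versions add four floats: one through the _mm_add_ps intrinsic, which typically compiles to a single ADDPS, and one through a hand-written ADDPS in GNU extended inline assembly. On other targets the code falls back to a scalar loop so that it still builds.

```c
#include <assert.h>

#if defined(__SSE__) || defined(_M_X64)
#include <xmmintrin.h>   /* SSE intrinsics: _mm_loadu_ps, _mm_add_ps, _mm_storeu_ps */
#define DEMO_HAVE_SSE 1
#else
#define DEMO_HAVE_SSE 0
#endif

/* GNU extended asm is only assumed for GCC-compatible compilers. */
#if DEMO_HAVE_SSE && defined(__GNUC__)
#define DEMO_HAVE_GNU_ASM 1
#else
#define DEMO_HAVE_GNU_ASM 0
#endif

/* Hypothetical helper: out[0..3] = a[0..3] + b[0..3] via the intrinsic. */
void add4_intrinsic(const float *a, const float *b, float *out)
{
    assert(a && b && out);
#if DEMO_HAVE_SSE
    __m128 va = _mm_loadu_ps(a);            /* unaligned load of 4 floats */
    __m128 vb = _mm_loadu_ps(b);
    _mm_storeu_ps(out, _mm_add_ps(va, vb)); /* usually one ADDPS */
#else
    for (int i = 0; i < 4; ++i)             /* scalar fallback */
        out[i] = a[i] + b[i];
#endif
}

/* Hypothetical helper: same result, with ADDPS written by hand. */
void add4_inline_asm(const float *a, const float *b, float *out)
{
    assert(a && b && out);
#if DEMO_HAVE_GNU_ASM
    __m128 va = _mm_loadu_ps(a);
    __m128 vb = _mm_loadu_ps(b);
    /* AT&T operand order: addps src, dst. "x" asks for an XMM register. */
    __asm__("addps %1, %0" : "+x"(va) : "x"(vb));
    _mm_storeu_ps(out, va);
#else
    for (int i = 0; i < 4; ++i)             /* scalar fallback */
        out[i] = a[i] + b[i];
#endif
}
```

Both functions should produce identical results, and for a one-to-one mapping like this the generated code is usually very similar. Comparing the compiler's assembly listing for the two (for example with -S on GCC or Clang) is a quick way to check this for your own kernels.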

TimP
Black Belt

The compiler has more latitude to optimize intrinsics. For example, Intel C++ will choose more effective equivalent instructions, or switch from SSE to AVX-128, when the architecture flag permits.
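
As a rough illustration of that latitude, the sketch below writes a small kernel with 128-bit SSE intrinsics only. The function name scale_add and its parameters are hypothetical and not taken from any codec. The assumption is that the same source can be rebuilt with a different architecture flag (for example -msse2 versus -mavx on GCC or Clang) and that the compiler is then free to pick the instruction encoding, such as the VEX-encoded AVX-128 forms, and to schedule and allocate registers around the calls. The exact output depends on the compiler and its version.

```c
#include <assert.h>
#include <stddef.h>

#if defined(__SSE__) || defined(_M_X64)
#include <xmmintrin.h>   /* _mm_set1_ps, _mm_loadu_ps, _mm_mul_ps, _mm_add_ps */
#define DEMO_USE_SSE 1
#else
#define DEMO_USE_SSE 0
#endif

/* Hypothetical kernel: out[i] = a[i] * scale + b[i] for i in [0, n). */
void scale_add(const float *a, const float *b, float *out,
               size_t n, float scale)
{
    assert(a && b && out);
    size_t i = 0;
#if DEMO_USE_SSE
    const __m128 vs = _mm_set1_ps(scale);   /* broadcast scale to 4 lanes */
    for (; i + 4 <= n; i += 4) {
        __m128 va = _mm_loadu_ps(a + i);
        __m128 vb = _mm_loadu_ps(b + i);
        /* The compiler chooses the encoding for these two operations
           according to the architecture flag in effect. */
        _mm_storeu_ps(out + i, _mm_add_ps(_mm_mul_ps(va, vs), vb));
    }
#endif
    for (; i < n; ++i)                      /* scalar tail (or full fallback) */
        out[i] = a[i] * scale + b[i];
}
```

A hand-written assembly version of the same loop would have to be rewritten by hand for each target encoding, whereas here only the compiler flag changes.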

Bernard
Black Belt

Bear in mind that inline assembly code will not be optimized by the compiler; at least, this is the case with the VS C++ compiler.
