Software Tuning, Performance Optimization & Platform Monitoring
Discussion regarding monitoring and software tuning methodologies, Performance Monitoring Unit (PMU) of Intel microprocessors, and platform updating.
Avisos
FPGA community forums and blogs on community.intel.com are migrating to the new Altera Community and are read-only. For urgent support needs during this transition, please visit the FPGA Design Resources page or contact an Altera Authorized Distributor.

Preformance between Intel Assembly vs Intel Intrinsics

Uday_Krishna__G_
957 Vistas

Deal All,

Am trying to optimize my codec application on Intel Platform. before kick start Is the any numbers for the performance comparison numbers between Intel Assembly vs Intel Intrinsics? Am in delima to choose between the two approaches.

0 kudos
3 Respuestas
Bernard
Colaborador Valioso I
957 Vistas

If the specific intrinsic is directly compiled into single machine code instruction you will probably not see any difference in the performance when comparing  inline assembly vs. compiler intrinsic.

TimP
Colaborador Distinguido III
957 Vistas

The compiler has more latitude to optimize intrinsics.  For example, Intel C++ will choose more effective equivalent instructions, or switch from SSE to AVX-128, when permitted according to the architecture flag.

Bernard
Colaborador Valioso I
957 Vistas

Bear in mind that inline assembly code will not be optimized by the compiler at least this is the case with VS C++ compiler.

Responder