- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Deal All,
Am trying to optimize my codec application on Intel Platform. before kick start Is the any numbers for the performance comparison numbers between Intel Assembly vs Intel Intrinsics? Am in delima to choose between the two approaches.
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
If the specific intrinsic is directly compiled into single machine code instruction you will probably not see any difference in the performance when comparing inline assembly vs. compiler intrinsic.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
The compiler has more latitude to optimize intrinsics. For example, Intel C++ will choose more effective equivalent instructions, or switch from SSE to AVX-128, when permitted according to the architecture flag.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Bear in mind that inline assembly code will not be optimized by the compiler at least this is the case with VS C++ compiler.
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page