- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
AVX2 appears to only offer _mm256_cmpeq_epi32 and _mm256_cmpgt_epi32. What's the most efficient way to implement _mm256_cmplt_epi32 given the available AVX2 functions?
- Tags:
- Intel® Advanced Vector Extensions (Intel® AVX)
- Intel® Streaming SIMD Extensions
- Parallel Computing
Link Copied
1 Reply
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Jeff D. wrote:
AVX2 appears to only offer _mm256_cmpeq_epi32 and _mm256_cmpgt_epi32. What's the most efficient way to implement _mm256_cmplt_epi32 given the available AVX2 functions?
simply use cmpgt with swapped operands, you can also introduce your own cmplt with code such as :
__m256i _mm256_cmplt_epi32 (__m256i a, __m256i b) {return _mm256_cmpgt_epi32(b,a);}
Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page