Intel® C++ Compiler
Support and discussions for creating C++ code that runs on platforms based on Intel® processors.

[BUG][AVX] Invalid code generation using _mm256_unpacklo_pd and _mm256_permute2f128_pd

joel_f_

As hinted here: https://github.com/MetaScale/nt2/commit/f398ddb886cd4c9526276431dcadbeb066c9fd00

Code using _mm256_unpacklo_pd/_mm256_unpackhi_pd in conjunction with _mm256_permute2f128_pd is compiled to incorrect code. Moreover, the exact same code pattern, used elsewhere with a different constant mask for the permute, works correctly. The workaround using volatile of course underperforms.

The same code compiles and generates correct code with gcc and clang.
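For readers who do not want to dig through the linked commit, here is a rough sketch of the kind of pattern involved: a 4x4 double transpose built from unpacklo/unpackhi followed by permute2f128 with constant masks. This is not the NT2 code and is not confirmed to trigger the bug. The names Mat4, transpose_reference and transpose_shuffle are made up for illustration, and the masks 0x20 and 0x31 are an assumption about a typical use. Comparing the shuffle result against a plain scalar transpose is one way to check what a given compiler generates.

```cpp
#include <array>
#include <cassert>   // only needed for the usage check shown after the block
#if defined(__AVX__)
#include <immintrin.h>
#endif

// Hypothetical helper type: a row-major 4x4 matrix of doubles.
using Mat4 = std::array<std::array<double, 4>, 4>;

// Plain scalar transpose, used as the trusted reference.
inline Mat4 transpose_reference(const Mat4& m) {
    Mat4 t{};
    for (int i = 0; i < 4; ++i)
        for (int j = 0; j < 4; ++j)
            t[j][i] = m[i][j];
    return t;
}

// Transpose via unpacklo/unpackhi + permute2f128 (illustrative sketch).
// Without AVX enabled at compile time this falls back to the reference,
// so the comparison is only meaningful when built with AVX (e.g. -mavx).
inline Mat4 transpose_shuffle(const Mat4& m) {
#if defined(__AVX__)
    __m256d r0 = _mm256_loadu_pd(m[0].data());
    __m256d r1 = _mm256_loadu_pd(m[1].data());
    __m256d r2 = _mm256_loadu_pd(m[2].data());
    __m256d r3 = _mm256_loadu_pd(m[3].data());

    // Interleave within each 128-bit lane.
    __m256d t0 = _mm256_unpacklo_pd(r0, r1); // r0[0] r1[0] r0[2] r1[2]
    __m256d t1 = _mm256_unpackhi_pd(r0, r1); // r0[1] r1[1] r0[3] r1[3]
    __m256d t2 = _mm256_unpacklo_pd(r2, r3); // r2[0] r3[0] r2[2] r3[2]
    __m256d t3 = _mm256_unpackhi_pd(r2, r3); // r2[1] r3[1] r2[3] r3[3]

    // 0x20: low lane of first operand, low lane of second operand.
    // 0x31: high lane of first operand, high lane of second operand.
    __m256d c0 = _mm256_permute2f128_pd(t0, t2, 0x20);
    __m256d c1 = _mm256_permute2f128_pd(t1, t3, 0x20);
    __m256d c2 = _mm256_permute2f128_pd(t0, t2, 0x31);
    __m256d c3 = _mm256_permute2f128_pd(t1, t3, 0x31);

    Mat4 out{};
    _mm256_storeu_pd(out[0].data(), c0);
    _mm256_storeu_pd(out[1].data(), c1);
    _mm256_storeu_pd(out[2].data(), c2);
    _mm256_storeu_pd(out[3].data(), c3);
    return out;
#else
    return transpose_reference(m);
#endif
}
```

A check such as `assert(transpose_shuffle(m) == transpose_reference(m));` on a matrix with distinct entries should pass on a compiler that generates correct code for this pattern.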
