Intel® ISA Extensions
Use hardware-based isolation and memory encryption to provide more code protection in your solutions.

why is ‘_mm512d load/store’ intrinsic changed to vmovups not vmovupd?

Yeongha_L_
Beginner
650 Views

 

in my application, speed is very important. so I use intel advisor on my application, then I find that there are some type conversions.

I think it is weird, because there are some float type but I always use double type. therefore I have a test, then find that _mm512d load/store intrinsics are changed vmovup’s’z. I think, it have to changed vmovup’d’z.

why it is happened? and is type conversion important to speed? I use very many load/store instruction.

0 Kudos
1 Solution
McCalpinJohn
Honored Contributor III
650 Views

The VMOVUP* don't perform any type conversions -- they just move the bits between memory and registers.

I think that the compiler prefers to use VMOVUPS because the "float" data type is the default for the instruction.  Changing the type requires additional prefix bytes, which can slow down instruction decode and decrease the effectiveness of the L1 Instruction Cache, without providing any performance benefits.

View solution in original post

0 Kudos
1 Reply
McCalpinJohn
Honored Contributor III
651 Views

The VMOVUP* don't perform any type conversions -- they just move the bits between memory and registers.

I think that the compiler prefers to use VMOVUPS because the "float" data type is the default for the instruction.  Changing the type requires additional prefix bytes, which can slow down instruction decode and decrease the effectiveness of the L1 Instruction Cache, without providing any performance benefits.

0 Kudos
Reply