Intel® ISA Extensions
Use hardware-based isolation and memory encryption to provide more code protection in your solutions.
1135 ディスカッション

why is ‘_mm512d load/store’ intrinsic changed to vmovups not vmovupd?

Yeongha_L_
ビギナー
1,330件の閲覧回数

 

in my application, speed is very important. so I use intel advisor on my application, then I find that there are some type conversions.

I think it is weird, because there are some float type but I always use double type. therefore I have a test, then find that _mm512d load/store intrinsics are changed vmovup’s’z. I think, it have to changed vmovup’d’z.

why it is happened? and is type conversion important to speed? I use very many load/store instruction.

0 件の賞賛
1 解決策
McCalpinJohn
名誉コントリビューター III
1,330件の閲覧回数

The VMOVUP* don't perform any type conversions -- they just move the bits between memory and registers.

I think that the compiler prefers to use VMOVUPS because the "float" data type is the default for the instruction.  Changing the type requires additional prefix bytes, which can slow down instruction decode and decrease the effectiveness of the L1 Instruction Cache, without providing any performance benefits.

元の投稿で解決策を見る

1 返信
McCalpinJohn
名誉コントリビューター III
1,331件の閲覧回数

The VMOVUP* don't perform any type conversions -- they just move the bits between memory and registers.

I think that the compiler prefers to use VMOVUPS because the "float" data type is the default for the instruction.  Changing the type requires additional prefix bytes, which can slow down instruction decode and decrease the effectiveness of the L1 Instruction Cache, without providing any performance benefits.

返信