- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
in my application, speed is very important. so I use intel advisor on my application, then I find that there are some type conversions.
I think it is weird, because there are some float type but I always use double type. therefore I have a test, then find that _mm512d load/store intrinsics are changed vmovup’s’z. I think, it have to changed vmovup’d’z.
why it is happened? and is type conversion important to speed? I use very many load/store instruction.
- Tags:
- Intel® Advanced Vector Extensions (Intel® AVX)
- Intel® Streaming SIMD Extensions
- Parallel Computing
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
The VMOVUP* don't perform any type conversions -- they just move the bits between memory and registers.
I think that the compiler prefers to use VMOVUPS because the "float" data type is the default for the instruction. Changing the type requires additional prefix bytes, which can slow down instruction decode and decrease the effectiveness of the L1 Instruction Cache, without providing any performance benefits.
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
The VMOVUP* don't perform any type conversions -- they just move the bits between memory and registers.
I think that the compiler prefers to use VMOVUPS because the "float" data type is the default for the instruction. Changing the type requires additional prefix bytes, which can slow down instruction decode and decrease the effectiveness of the L1 Instruction Cache, without providing any performance benefits.

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page