Hi,
I found the latency and throughput values for 32-bit POPCNT operations in the latest Optimization Reference Manual, but the counterpart for 64-bit operations is not there.
Does anyone know where I can find this?
Regards,
Matthieu
1 Solution
Hello Matthieu - POPCNT has the same 3-cycle latency / 1-cycle throughput for both 32-bit and 64-bit forms.
Regards,
-Max
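For readers who want to see the two operand widths side by side, here is a minimal sketch. It assumes GCC or Clang, whose `__builtin_popcount` and `__builtin_popcountll` builtins are typically compiled to the 32-bit and 64-bit POPCNT forms when the instruction is enabled (for example with `-mpopcnt`); without that flag the compiler falls back to a software routine. The function names `popcount32`, `popcount64` and `popcount_words` are made up for this illustration.

```c
#include <stddef.h>
#include <stdint.h>

/* Count set bits in a 32-bit value.
   Assumption: GCC/Clang builtin; emits the 32-bit POPCNT form
   only if the target allows it (e.g. -mpopcnt). */
unsigned popcount32(uint32_t x) {
    return (unsigned)__builtin_popcount(x);
}

/* Count set bits in a 64-bit value (64-bit POPCNT form under the
   same assumption). */
unsigned popcount64(uint64_t x) {
    return (unsigned)__builtin_popcountll(x);
}

/* Sum the set bits of an array of 64-bit words. Each iteration's
   popcount is independent of the previous one, so only the final
   additions form a dependency chain. */
uint64_t popcount_words(const uint64_t *words, size_t n) {
    uint64_t total = 0;
    for (size_t i = 0; i < n; ++i) {
        total += popcount64(words[i]);
    }
    return total;
}
```

For example, `popcount32(0xF0F0F0F0u)` returns 16 and `popcount64(UINT64_MAX)` returns 64. Since both forms are reported above to have the same latency and throughput, processing 64-bit words covers twice as many bits per instruction as processing 32-bit words.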
4 Replies
Hi Matthieu,
POPCNT is an SSE4 instruction. You might try looking in the Intel SSE4 Programming Reference here:
http://software.intel.com/en-us/articles/45nm-next-generation-intel-coret-2-processor-family-penryn-and-intel-streaming-simd-extensions-4-intel-sse4/
AVX reference can be found at http://software.intel.com/en-us/avx/
I hope that helps.
==
Aubrey W.
Intel Software Network Support
Hi,
This was not the kind of information I was looking for. I have used this instruction for a while now, so I know that it comes with SSE4.2 on Intel or SSE4a on AMD. In fact, one just has to check the POPCNT flag in CPUID...
What I am looking for is the performance of this instruction. As I said in my first post, the "Intel 64 and IA-32 Architectures Optimization Reference Manual" (Order Number: 248966-020) lists throughput and latency values for almost all SIMD extensions in Appendix C.
However, I can't find these values for the 64-bit form of POPCNT.
Regards,
Matthieu
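As an illustration of the CPUID check mentioned in the post above, here is a minimal sketch, assuming GCC or Clang on x86 and their `<cpuid.h>` helper `__get_cpuid`. The POPCNT feature flag is bit 23 of ECX for CPUID leaf 1. The function names `ecx_has_popcnt` and `cpu_has_popcnt` are hypothetical, and on non-x86 targets the sketch simply reports that the instruction is unavailable.

```c
#include <stdbool.h>

#if defined(__x86_64__) || defined(__i386__)
#include <cpuid.h>   /* GCC/Clang helper header, x86 only */
#endif

/* Test the POPCNT feature flag in an ECX value taken from
   CPUID leaf 1 (bit 23). */
bool ecx_has_popcnt(unsigned int ecx) {
    return ((ecx >> 23) & 1u) != 0;
}

/* Query the running CPU. Returns false when CPUID leaf 1 is not
   available or when not compiled for x86. */
bool cpu_has_popcnt(void) {
#if defined(__x86_64__) || defined(__i386__)
    unsigned int eax = 0, ebx = 0, ecx = 0, edx = 0;
    if (!__get_cpuid(1, &eax, &ebx, &ecx, &edx)) {
        return false;
    }
    return ecx_has_popcnt(ecx);
#else
    return false;
#endif
}
```

A program would typically call `cpu_has_popcnt()` once at startup and pick either a POPCNT-based routine or a software fallback accordingly.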
Hi Max,
Thanks for this answer.
Is there any way to submit comments or to request that additional information like this be added to the reference manual?
Regards,
Matthieu