Software Tuning, Performance Optimization & Platform Monitoring
Discussion around monitoring and software tuning methodologies, Performance Monitoring Unit (PMU) of Intel microprocessors, and platform monitoring
Announcements
The Intel sign-in experience is changing in February to support enhanced security controls. If you sign in, click here for more information.

Request for feedback on VP2INTERSECT emulation

guillermozo
Beginner
178 Views

I wanted to ask for feedback on an emulation of the VP2INTERSECT instructions:

https://arxiv.org/abs/2112.06342

The emulation is faster than the native instructions when only one of the output masks is returned. I consider the following three applications of VP2INTERSECT instructions: 

  1. computing the intersection (common elements) of two arrays of integers (whether sorted or unsorted), 
  2. computing the size of the intersection of two arrays of integers, 
  3. removing common elements from two arrays of integers. 

Only 3. requires both output masks, while 1. and 2. only need one.

Since the name of the instructions is VP2INTERSECT, I presume that the main application is 1. (possibly 2.), in which case a fast emulation could be useful. 

 

But I may be wrong, so would like to ask if the two cases above (computing the intersection, or the size of the intersection of two arrays of integers) are the intended (or expected most frequent) use cases for these instructions?. 

Thank you. 

 

0 Kudos
0 Replies
Reply