New Contributor I

Composite ippi functions

When I need a sequence of ippi functions for a composite operation, I could, of course, call those ippi functions in sequence. However, for large images that would make rather poor use of the CPU caches (L1, L2, ...), because each function passes over the whole image before the next one starts.

Another solution is to create a composite "ippi" function that, for each row, calls the equivalent series of ipps functions. An added advantage is that this can be threaded easily, by giving each thread its own set of rows.
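To make the row-wise idea concrete, here is a minimal sketch in C. The function composite_scale_offset and the helpers row_mul_c and row_add_c are hypothetical names of my own; the helpers are plain-C stand-ins for 1D signal primitives, and with IPP one would presumably call in-place ipps functions such as ippsMulC_32f_I and ippsAddC_32f_I in their place. The row_begin/row_end arguments are an assumption about how the rows could be divided among threads.

```c
#include <assert.h>
#include <stddef.h>

/* Plain-C stand-ins for 1D (signal) primitives. With IPP, the in-place
   functions ippsMulC_32f_I / ippsAddC_32f_I would presumably go here. */
static void row_mul_c(float val, float *row, int len)
{
    for (int i = 0; i < len; ++i) row[i] *= val;
}

static void row_add_c(float val, float *row, int len)
{
    for (int i = 0; i < len; ++i) row[i] += val;
}

/* Hypothetical composite "image" function: pixel = pixel * scale + offset,
   in place, for rows [row_begin, row_end). step_bytes is the distance in
   bytes between the starts of consecutive rows (may include padding).
   Both operations run on a row while that row is still in cache. */
void composite_scale_offset(float *img, int step_bytes, int width, int height,
                            float scale, float offset,
                            int row_begin, int row_end)
{
    assert(row_begin >= 0 && row_begin <= row_end && row_end <= height);
    for (int y = row_begin; y < row_end; ++y) {
        float *row = (float *)((unsigned char *)img + (size_t)y * (size_t)step_bytes);
        row_mul_c(scale, row, width);
        row_add_c(offset, row, width);
    }
}
```

Each thread would call composite_scale_offset with its own disjoint row range; since the rows do not overlap, no synchronization is needed inside the function.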

If the RowBytes (StepSize) of the image(s) equal(s) the PixelSize multiplied by the ImageWidth, we can work per chunk rather than per row. Let's assume that sysctlbyname with "hw.l1dcachesize" returns the L1 cache per core. The chunk size can be chosen to be that size for in-place operations or half that size for operations from a source to a target.
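As a sketch of the chunk-size heuristic described above (my own rule of thumb, not an Intel recommendation), the following C code queries the L1 data cache size and derives a chunk length. The function names l1d_cache_bytes, rows_are_contiguous and chunk_elems are hypothetical, and the fallback value passed by the caller is an assumption for platforms where sysctlbyname is not available.

```c
#include <assert.h>
#include <stddef.h>
#ifdef __APPLE__
#include <sys/types.h>
#include <sys/sysctl.h>
#endif

/* Returns the L1 data cache size in bytes as reported by the OS, or
   `fallback` if it cannot be queried (assumption: caller picks a sane
   default such as 32 KiB). */
size_t l1d_cache_bytes(size_t fallback)
{
#ifdef __APPLE__
    size_t value = 0;            /* zero-initialized in case fewer bytes are written */
    size_t len = sizeof value;
    if (sysctlbyname("hw.l1dcachesize", &value, &len, NULL, 0) == 0 && value > 0)
        return value;
#endif
    return fallback;
}

/* True if the image has no row padding, so it can be treated as one
   long 1D buffer and split into chunks instead of rows. */
int rows_are_contiguous(int step_bytes, int width, int pixel_bytes)
{
    return step_bytes == width * pixel_bytes;
}

/* Chunk length in elements: the whole L1 budget for in-place operations,
   half of it when a separate source and target are both touched. */
size_t chunk_elems(size_t l1_bytes, size_t elem_bytes, int in_place)
{
    assert(elem_bytes > 0);
    size_t budget = in_place ? l1_bytes : l1_bytes / 2;
    size_t n = budget / elem_bytes;
    return n > 0 ? n : 1;
}
```

For example, with a 32 KiB L1 data cache and 4-byte pixels, this gives chunks of 8192 pixels in place, or 4096 pixels when going from a source to a target.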

Is that optimal? Or what does Intel recommend?

Regards,

Adriaan van Os

2 Replies
Employee

Hello Adriaan van Os,

Thanks for your proposal.

Regarding cache usage, does the page linked below answer your question?

https://software.intel.com/en-us/ipp-dev-guide-cache-optimizations

And, if possible, could you create a ticket for the feature request or enhancement in our online service center at https://supporttickets.intel.com/?lang=en-US

Best Regards,

Ruqiu

New Contributor I

I am not aware that my question implies a feature request or proposal. And I did read the page you refer to, of course.

Regards,

Adriaan van Os
