vector array in mkl

saar_w_ · ‎06-28-2016

Hi does anyone know what is the best mapping in MKL for ippmMul_vaca_64f.

Thanks,

Saar.

Ying_H_Intel · ‎06-28-2016

Hi Saar,

I guess, you have known that ippm was deprecated in latest IPP version, and the replacement may be in MKL BLAS function or VSL functions.

https://software.intel.com/en-us/articles/the-alternatives-for-intel-ipp-legacy-small-matrices-domain

No sure what is your array layout, next operation, the vector size, machine type etc. You may search in MKL reference machine, there are a few of function be able to do scalar * vector

for example, cblas_dscal()

The ?scal routines perform a vector operation defined as x = a*x
where: a is a scalar, x is an n-element vector

The ?axpy routines perform a vector-vector operation defined as
y := a*x + y

VSL functions: vdLinearFrac( n, a, b, scalea, shifta, scaleb, shiftb, y ) , y=(scalea·a+shifta)/(scaleb·b+shiftb), i=1,2 … n

or you create a matrix * vector according your vector array etc. Which usually have better performance.

Best Regards,
Ying

saar_w_ · ‎06-28-2016

Hi Ying and thank you for your quick replay ,

I am familiar with cblas functions but my problem is very specific for batch operation

the method ippmMul_vaca_64f is multiuplyng array of vectors with array of scalar ,

just like calling mutipale times to ?axpy. the difference is that it multiply all the vectors multi threaded and at once. how can I get the same result in MKL?

Ying H. (Intel) wrote:

Hi Saar,

I guess, you have known that ippm was deprecated in latest IPP version, and the replacement may be in MKL BLAS function or VSL functions.

https://software.intel.com/en-us/articles/the-alternatives-for-intel-ipp...

No sure what is your array layout, next operation, the vector size, machine type etc. You may search in MKL reference machine, there are a few of function be able to do scalar * vector

for example, cblas_dscal()

The ?scal routines perform a vector operation defined as x = a*x
where: a is a scalar, x is an n-element vector

The ?axpy routines perform a vector-vector operation defined as
y := a*x + y

VSL functions: vdLinearFrac( n, a, b, scalea, shifta, scaleb, shiftb, y ) , y=(scalea·a+shifta)/(scaleb·b+shiftb), i=1,2 … n

or you create a matrix * vector according your vector array etc. Which usually have better performance.

Best Regards,
Ying

Ying_H_Intel · ‎07-01-2016

Hi Saar,

Right, current MKL only provide dgemm's batch function, no others, Do you have following operation after you get aX1, bX2, cX3, ... (a,b,c are constant number, XI is vector, how was the length and number of your vector array ?). If no, it seems you have to call one blas function several times. or you may add OpenMP progam to parallel the calls.

Best Regards,
Ying