you can use a workaround which is more flexible:
IPPAPI(IppStatus, ippiScaleC_16u8u_C1R , ( const Ipp16u* pSrc, int srcStep, Ipp64f mVal, Ipp64f aVal, Ipp8u* pDst, int dstStep, IppiSize roiSize, IppHintAlgorithm hint ))
This function scales pixel values of the source image ROI and converts them to the destination data type according to the following formula:
dst = saturate_to_dstType(src * mVal + aVal)
Thanks for the suggestion. ippiScaleC_16u8u_C1R does work. The results are not exactly the same because it uses floats, which probably also makes it slower. I ended up writing the functions with intrinsics.