- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
I want to measure the performance difference between SSE2 and AVX implementations for a neural network application. I am using Intel MKL to perform the BLAS calculations so that I have the most optimized implementation.
Is there anyway to instruct the MKL to release only SSE2 code even if machine supports AVX2. I know AVX2 will mostly perform better than SSE2. But I want to quantify that difference.
Regards,
Animesh Jain
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Mkl conditional numerical reproducibility offers options which should take care of this. It has been discussed several times on mkl forum.
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Mkl conditional numerical reproducibility offers options which should take care of this. It has been discussed several times on mkl forum.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Thank you. I am able to dispatch code variants with different SIMD extensions by using the following environment variables - MKL_CBWR, MKL_NUM_THREADS.

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page