Community
cancel
Showing results for 
Search instead for 
Did you mean: 
Employee
93 Views

Intel MKL 10.2 Update 3 is now available

Intel MKL 10.2 Update 3 is now available. It includes the following features:

  • Performance improvements
    • BLAS: Several Level 1 & 2 BLAS functions newly threaded; Improved scaling for DGEMM for skinny matrices
    • LAPACK: Improved scalability for LAPACK functions: ?POTRF, ?GEBRD, ?SYTRD, ?HETRD, and ?STEDC
    • FFTs: Extended threading to small-size multi-dimensional transforms and other cases
    • VML: Further optimizations: v(s,d)Asin, v(s,d)Acos, v(s,d)Ln, v(s,d)Log10, vsLog1p, v(s/d)Hypot
    • VSL: Improved performance for viRngPoisson and viRngPoissonV random number generators
  • Usability/Interface improvements
    • Improved example programs for uBLAS, Java, FFTW3, LAPACK95, and BLAS95
    • New 64-bit integer (ILP64) fftw_mpi interfaces for cluster FFTs
  • Bug fixes

A stand-alone package containing the Intel Optimized LINPACK Benchmark can be found online.

CODE TIPS

Users with current licenses may login at the Intel Registration Center to download.

0 Kudos
3 Replies
Moderator
93 Views


as an additional info, please see the KB - Intel MKL 10.2 fixes List ...
You can find there the list of all more significant issues fixed in Update3
--Gennady
0 Kudos
Beginner
93 Views

Can someone please provide more details about the following Update 3 bug fix:

DPD200084747 DFT not free internal memory in MKL 10.2.1

Thanks,
Ozzer


Quoting - Todd Rosenquist (Intel)
Intel MKL 10.2 Update 3 is now available. It includes the following features:

  • Performance improvements
    • BLAS: Several Level 1 & 2 BLAS functions newly threaded; Improved scaling for DGEMM for skinny matrices
    • LAPACK: Improved scalability for LAPACK functions: ?POTRF, ?GEBRD, ?SYTRD, ?HETRD, and ?STEDC
    • FFTs: Extended threading to small-size multi-dimensional transforms and other cases
    • VML: Further optimizations: v(s,d)Asin, v(s,d)Acos, v(s,d)Ln, v(s,d)Log10, vsLog1p, v(s/d)Hypot
    • VSL: Improved performance for viRngPoisson and viRngPoissonV random number generators
  • Usability/Interface improvements
    • Improved example programs for uBLAS, Java, FFTW3, LAPACK95, and BLAS95
    • New 64-bit integer (ILP64) fftw_mpi interfaces for cluster FFTs
  • Bug fixes

A stand-alone package containing the Intel Optimized LINPACK Benchmark can be found online.

CODE TIPS

Users with current licenses may login at the Intel Registration Center to download.


0 Kudos
Moderator
93 Views

Ozzer, please see the short description of DPD200084747:
The problem was in visible when
DFT routines ( DFTIComputeForward and DFTIComputeBackward)
were called repetitively during iterations.

The memory usage was growing even MKL_DISABLE_FAST_MM was set as 1
and also when after each iteration mkl_free_buffers() has been called.
This didnt have any effect.
--Gennady
0 Kudos