<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Announcing Intel MKL 2017 release in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Announcing-Intel-MKL-2017-release/m-p/1066946#M21962</link>
    <description>&lt;P&gt;&lt;B&gt;Check out the new and the latest Intel® Math Kernel Library (Intel&lt;/B&gt;&lt;B&gt;® MKL)&amp;nbsp;2017 release!&amp;nbsp;&lt;/B&gt;&lt;/P&gt;

&lt;P&gt;What's new in Intel MKL 2017:&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Introduced optimizations for the Intel® Xeon Phi™ processor x200 (formerly Knights Landing ) self-boot platform for Windows* OS&lt;/LI&gt;
	&lt;LI&gt;Enabled Automatic Offload (AO) and Compiler Assisted Offload (CAO) modes for the second generation of Intel Xeon Phi coprocessor on Linux* OS&lt;/LI&gt;
	&lt;LI&gt;Introduced Deep Neural Networks (DNN) primitives including convolution, normalization, activation, and pooling functions intended to accelerate convolutional neural networks (CNNs) and deep neural networks on Intel® Architecture.
		&lt;UL&gt;
			&lt;LI&gt;Optimized for Intel® Xeon® processor E5-xxxx v3 (formerly Haswell), Intel Xeon processor E5-xxxx v4 (formerlty Broadwell), and Intel Xeon Phi processor x200 self-boot platform.&lt;/LI&gt;
			&lt;LI&gt;Introduced inner product primitive to support fully connected layers.&lt;/LI&gt;
			&lt;LI&gt;Introduced batch normalization, sum, split, and concat primitives to provide full support for GoogLeNet and ResidualNet topologies.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;BLAS:
		&lt;UL&gt;
			&lt;LI&gt;Introduced new packed matrix multiplication interfaces (?gemm_alloc, ?gemm_pack&amp;nbsp;,?gemm_compute, ?gemm_free)&amp;nbsp;for single and double precisions.&lt;/LI&gt;
			&lt;LI&gt;Improved performance over standard S/DGEMM on Intel Xeon processor E5-xxxx v3 and later processors.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;Sparse BLAS:
		&lt;UL&gt;
			&lt;LI&gt;Improved performance of parallel BSRMV functionality for processor supporting Intel® Advanced Vector Extensions 2 (Intel® AVX2) instruction set.&lt;/LI&gt;
			&lt;LI&gt;Improved performance of sparse matrix functionality on the Intel Xeon Phi processor x200.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;Intel MKL PARDISO:
		&lt;UL&gt;
			&lt;LI&gt;Improved performance of parallel solving step for matrices with fewer than 300000 elements.&lt;/LI&gt;
			&lt;LI&gt;Added support for mkl_progress in Parallel Direct Sparse Solver for Clusters.&lt;/LI&gt;
			&lt;LI&gt;Added fully distributed reordering step to Parallel Direct Sparse Solver for Clusters.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;Fourier Transforms:
		&lt;UL&gt;
			&lt;LI&gt;Improved performance of batched 1D FFT with large batch size on processor supporting Intel® Advanced Vector Extensions (Intel® AVX), Intel AVX2, Intel® Advanced Vector Extensions 512 (Intel® AVX512) and IntelAVX512_MIC instruction sets&lt;/LI&gt;
			&lt;LI&gt;Improved performance for small size batched 2D FFT on&amp;nbsp;the&amp;nbsp;Intel Xeon Phi processor x200 self-boot platform,&amp;nbsp;Intel Xeon processor E5-xxxx v3, and Intel Xeon processor E5-xxxx v4.&lt;/LI&gt;
			&lt;LI&gt;Improved performance for 3D FFT on&amp;nbsp;the&amp;nbsp;Intel Xeon Phi processor x200 self-boot platform.&amp;nbsp;&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;LAPACK
		&lt;UL&gt;
			&lt;LI&gt;Included the latest LAPACK v3.6 enhancements. New features introduced are:
				&lt;UL&gt;
					&lt;LI&gt;SVD by Jacobi ([CZ]GESVJ) and preconditioned Jacobi ([CZ]GEJSV)&lt;/LI&gt;
					&lt;LI&gt;SVD via EVD allowing computation of a subset of singular values and vectors (?GESVDX)&lt;/LI&gt;
					&lt;LI&gt;In BLAS level 3, generalized Schur (?GGES3), generalized EVD (?GGEV3), generalized SVD (?GGSVD3), and reduction to generalized upper Hessenberg form (?GGHD3)&lt;/LI&gt;
					&lt;LI&gt;Multiplication of a general matrix by a unitary or orthogonal matrix that possesses a 2x2 block structure ([DS]ORM22/[CZ]UNM22)&lt;/LI&gt;
				&lt;/UL&gt;
			&lt;/LI&gt;
			&lt;LI&gt;Improved performance for large size QR(?GEQRF) on processors supporting theIntel AVX2 instruction set.&lt;/LI&gt;
			&lt;LI&gt;Improved LU factorization, solve, and inverse (?GETR?) performance for very small sizes (&amp;lt;16).&lt;/LI&gt;
			&lt;LI&gt;Improved General Eigensolver (?GEEV and ?GEEVD) performance for the case when eigenvectors are needed.&lt;/LI&gt;
			&lt;LI&gt;Improved?GETRF, ?POTRF and ?GEQRF, linear solver (?GETRS) and SMP LINPACK performance on the Intel Xeon Phi processor x200 self-boot platform.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;ScaLAPACK
		&lt;UL&gt;
			&lt;LI&gt;Improved performance for hybrid (MPI + OpenMP*) mode of ScaLAPACK and PBLAS.&lt;/LI&gt;
			&lt;LI&gt;Improved performance of P?GEMM and P?TRSM resulted in better scalability of Qbox First-Principles Molecular Dynamics code.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;Data Fitting:
		&lt;UL&gt;
			&lt;LI&gt;Introduced two new storage formats for interpolation results (DF_MATRIX_STORAGE_SITES_FUNCS_DERS and DF_MATRIX_STORAGE_SITES_DERS_FUNCS).&lt;/LI&gt;
			&lt;LI&gt;Added Hyman monotonic cubic spline.&lt;/LI&gt;
			&lt;LI&gt;Improved performance of Data Fititng functionality on the Intel Xeon Phi processor x200.&lt;/LI&gt;
			&lt;LI&gt;Modified callback APIs to allow users to pass information about integration limits.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;Vector Mathematics:
		&lt;UL&gt;
			&lt;LI&gt;Introduced optimizations for&amp;nbsp;the&amp;nbsp;Intel Xeon Phi processor x200.&lt;/LI&gt;
			&lt;LI&gt;Improved performance for Intel Xeon processor E5-xxxx v3 and Intel Xeon processor E5-xxxx v4.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;Vector Statistics:
		&lt;UL&gt;
			&lt;LI&gt;Introduced additional optimization of SkipAhead method for MT19937 and SFMT19937.&lt;/LI&gt;
			&lt;LI&gt;Improved performance of Vector Statistic functionality including Random Number Generators and Summary Statistic on the Intel Xeon Phi processor x200.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;&lt;B&gt;Checkout Online&amp;nbsp;&lt;A href="https://software.intel.com/en-us/articles/intel-math-kernel-library-intel-mkl-2017-release-notes"&gt;Release notes&lt;/A&gt;&amp;nbsp;for more information&lt;/B&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 07 Sep 2016 08:22:49 GMT</pubDate>
    <dc:creator>Gennady_F_Intel</dc:creator>
    <dc:date>2016-09-07T08:22:49Z</dc:date>
    <item>
      <title>Announcing Intel MKL 2017 release</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Announcing-Intel-MKL-2017-release/m-p/1066946#M21962</link>
      <description>&lt;P&gt;&lt;B&gt;Check out the new and the latest Intel® Math Kernel Library (Intel&lt;/B&gt;&lt;B&gt;® MKL)&amp;nbsp;2017 release!&amp;nbsp;&lt;/B&gt;&lt;/P&gt;

&lt;P&gt;What's new in Intel MKL 2017:&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Introduced optimizations for the Intel® Xeon Phi™ processor x200 (formerly Knights Landing ) self-boot platform for Windows* OS&lt;/LI&gt;
	&lt;LI&gt;Enabled Automatic Offload (AO) and Compiler Assisted Offload (CAO) modes for the second generation of Intel Xeon Phi coprocessor on Linux* OS&lt;/LI&gt;
	&lt;LI&gt;Introduced Deep Neural Networks (DNN) primitives including convolution, normalization, activation, and pooling functions intended to accelerate convolutional neural networks (CNNs) and deep neural networks on Intel® Architecture.
		&lt;UL&gt;
			&lt;LI&gt;Optimized for Intel® Xeon® processor E5-xxxx v3 (formerly Haswell), Intel Xeon processor E5-xxxx v4 (formerlty Broadwell), and Intel Xeon Phi processor x200 self-boot platform.&lt;/LI&gt;
			&lt;LI&gt;Introduced inner product primitive to support fully connected layers.&lt;/LI&gt;
			&lt;LI&gt;Introduced batch normalization, sum, split, and concat primitives to provide full support for GoogLeNet and ResidualNet topologies.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;BLAS:
		&lt;UL&gt;
			&lt;LI&gt;Introduced new packed matrix multiplication interfaces (?gemm_alloc, ?gemm_pack&amp;nbsp;,?gemm_compute, ?gemm_free)&amp;nbsp;for single and double precisions.&lt;/LI&gt;
			&lt;LI&gt;Improved performance over standard S/DGEMM on Intel Xeon processor E5-xxxx v3 and later processors.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;Sparse BLAS:
		&lt;UL&gt;
			&lt;LI&gt;Improved performance of parallel BSRMV functionality for processor supporting Intel® Advanced Vector Extensions 2 (Intel® AVX2) instruction set.&lt;/LI&gt;
			&lt;LI&gt;Improved performance of sparse matrix functionality on the Intel Xeon Phi processor x200.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;Intel MKL PARDISO:
		&lt;UL&gt;
			&lt;LI&gt;Improved performance of parallel solving step for matrices with fewer than 300000 elements.&lt;/LI&gt;
			&lt;LI&gt;Added support for mkl_progress in Parallel Direct Sparse Solver for Clusters.&lt;/LI&gt;
			&lt;LI&gt;Added fully distributed reordering step to Parallel Direct Sparse Solver for Clusters.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;Fourier Transforms:
		&lt;UL&gt;
			&lt;LI&gt;Improved performance of batched 1D FFT with large batch size on processor supporting Intel® Advanced Vector Extensions (Intel® AVX), Intel AVX2, Intel® Advanced Vector Extensions 512 (Intel® AVX512) and IntelAVX512_MIC instruction sets&lt;/LI&gt;
			&lt;LI&gt;Improved performance for small size batched 2D FFT on&amp;nbsp;the&amp;nbsp;Intel Xeon Phi processor x200 self-boot platform,&amp;nbsp;Intel Xeon processor E5-xxxx v3, and Intel Xeon processor E5-xxxx v4.&lt;/LI&gt;
			&lt;LI&gt;Improved performance for 3D FFT on&amp;nbsp;the&amp;nbsp;Intel Xeon Phi processor x200 self-boot platform.&amp;nbsp;&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;LAPACK
		&lt;UL&gt;
			&lt;LI&gt;Included the latest LAPACK v3.6 enhancements. New features introduced are:
				&lt;UL&gt;
					&lt;LI&gt;SVD by Jacobi ([CZ]GESVJ) and preconditioned Jacobi ([CZ]GEJSV)&lt;/LI&gt;
					&lt;LI&gt;SVD via EVD allowing computation of a subset of singular values and vectors (?GESVDX)&lt;/LI&gt;
					&lt;LI&gt;In BLAS level 3, generalized Schur (?GGES3), generalized EVD (?GGEV3), generalized SVD (?GGSVD3), and reduction to generalized upper Hessenberg form (?GGHD3)&lt;/LI&gt;
					&lt;LI&gt;Multiplication of a general matrix by a unitary or orthogonal matrix that possesses a 2x2 block structure ([DS]ORM22/[CZ]UNM22)&lt;/LI&gt;
				&lt;/UL&gt;
			&lt;/LI&gt;
			&lt;LI&gt;Improved performance for large size QR(?GEQRF) on processors supporting theIntel AVX2 instruction set.&lt;/LI&gt;
			&lt;LI&gt;Improved LU factorization, solve, and inverse (?GETR?) performance for very small sizes (&amp;lt;16).&lt;/LI&gt;
			&lt;LI&gt;Improved General Eigensolver (?GEEV and ?GEEVD) performance for the case when eigenvectors are needed.&lt;/LI&gt;
			&lt;LI&gt;Improved?GETRF, ?POTRF and ?GEQRF, linear solver (?GETRS) and SMP LINPACK performance on the Intel Xeon Phi processor x200 self-boot platform.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;ScaLAPACK
		&lt;UL&gt;
			&lt;LI&gt;Improved performance for hybrid (MPI + OpenMP*) mode of ScaLAPACK and PBLAS.&lt;/LI&gt;
			&lt;LI&gt;Improved performance of P?GEMM and P?TRSM resulted in better scalability of Qbox First-Principles Molecular Dynamics code.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;Data Fitting:
		&lt;UL&gt;
			&lt;LI&gt;Introduced two new storage formats for interpolation results (DF_MATRIX_STORAGE_SITES_FUNCS_DERS and DF_MATRIX_STORAGE_SITES_DERS_FUNCS).&lt;/LI&gt;
			&lt;LI&gt;Added Hyman monotonic cubic spline.&lt;/LI&gt;
			&lt;LI&gt;Improved performance of Data Fititng functionality on the Intel Xeon Phi processor x200.&lt;/LI&gt;
			&lt;LI&gt;Modified callback APIs to allow users to pass information about integration limits.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;Vector Mathematics:
		&lt;UL&gt;
			&lt;LI&gt;Introduced optimizations for&amp;nbsp;the&amp;nbsp;Intel Xeon Phi processor x200.&lt;/LI&gt;
			&lt;LI&gt;Improved performance for Intel Xeon processor E5-xxxx v3 and Intel Xeon processor E5-xxxx v4.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;Vector Statistics:
		&lt;UL&gt;
			&lt;LI&gt;Introduced additional optimization of SkipAhead method for MT19937 and SFMT19937.&lt;/LI&gt;
			&lt;LI&gt;Improved performance of Vector Statistic functionality including Random Number Generators and Summary Statistic on the Intel Xeon Phi processor x200.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;&lt;B&gt;Checkout Online&amp;nbsp;&lt;A href="https://software.intel.com/en-us/articles/intel-math-kernel-library-intel-mkl-2017-release-notes"&gt;Release notes&lt;/A&gt;&amp;nbsp;for more information&lt;/B&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 07 Sep 2016 08:22:49 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Announcing-Intel-MKL-2017-release/m-p/1066946#M21962</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2016-09-07T08:22:49Z</dc:date>
    </item>
  </channel>
</rss>

