## Information on the results of parallelizing algorithms for fast matrix multiplication using dgemm Intel MKL

Parallelizing fast matrix multiplication algorithms is a non-trivial task because of the large number of quadratic operations: the allocation of additional memory has to be minimized without sacrificing multiplication speed. My recent results in this area, for three square 16000 * 16000 matrices held in memory and processed according to the formula C = C + A * B: 129 seconds for my implementation against 186 seconds for Intel MKL dgemm (Windows XP x64, i7 860 processor, 8 gigabytes of 1333 MHz memory). The benefit of parallelization starts to show for matrices no smaller than 1500 * 1500. Intel MKL dgemm is used as the base multiplication routine at the leaves of the recursion tree.

I have also created a fast multiplication algorithm that allocates no additional memory. Its gain is more modest: about 8/7 of the speed of Intel MKL dgemm on large matrices. There is also a positive effect when one of the matrices is non-square. I use this to speed up many problems in linear algebra, from solving systems of linear equations to singular value analysis.
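For readers unfamiliar with the general idea, here is a minimal sketch of a recursive fast multiplication that computes C = C + A * B and hands the small blocks at the leaves of the recursion tree to a base routine. The post does not say which fast algorithm is used, so Strassen's seven-product scheme here is an assumption, not the author's method. The function names and the `leaf_size` parameter are hypothetical, and `leaf_multiply_add` is a plain-Python stand-in for a dgemm call. This version allocates temporaries freely and is not parallel.

```python
def leaf_multiply_add(C, A, B):
    """Stand-in for a dgemm call at a leaf: C += A * B on nested lists."""
    for i in range(len(A)):
        Ai, Ci = A[i], C[i]
        for p in range(len(B)):
            a, Bp = Ai[p], B[p]
            for j in range(len(Bp)):
                Ci[j] += a * Bp[j]


def add(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


def sub(X, Y):
    return [[x - y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


def split(M):
    """Split a square matrix of even order into four quadrants."""
    h = len(M) // 2
    return ([r[:h] for r in M[:h]], [r[h:] for r in M[:h]],
            [r[:h] for r in M[h:]], [r[h:] for r in M[h:]])


def strassen_product(A, B, leaf_size):
    """Return A * B for square matrices (assumed scheme: Strassen)."""
    n = len(A)
    if n <= leaf_size or n % 2:
        # Leaf of the recursion tree: fall back to the base routine.
        P = [[0.0] * n for _ in range(n)]
        leaf_multiply_add(P, A, B)
        return P
    A11, A12, A21, A22 = split(A)
    B11, B12, B21, B22 = split(B)
    # Seven half-size products instead of the classical eight.
    M1 = strassen_product(add(A11, A22), add(B11, B22), leaf_size)
    M2 = strassen_product(add(A21, A22), B11, leaf_size)
    M3 = strassen_product(A11, sub(B12, B22), leaf_size)
    M4 = strassen_product(A22, sub(B21, B11), leaf_size)
    M5 = strassen_product(add(A11, A12), B22, leaf_size)
    M6 = strassen_product(sub(A21, A11), add(B11, B12), leaf_size)
    M7 = strassen_product(sub(A12, A22), add(B21, B22), leaf_size)
    C11 = add(sub(add(M1, M4), M5), M7)
    C12 = add(M3, M5)
    C21 = add(M2, M4)
    C22 = add(add(sub(M1, M2), M3), M6)
    top = [r1 + r2 for r1, r2 in zip(C11, C12)]
    bottom = [r1 + r2 for r1, r2 in zip(C21, C22)]
    return top + bottom


def multiply_add(C, A, B, leaf_size=2):
    """Update C in place: C = C + A * B."""
    P = strassen_product(A, B, leaf_size)
    for Ci, Pi in zip(C, P):
        for j, p in enumerate(Pi):
            Ci[j] += p
```

In this sketch the quadratic work is the block additions and subtractions around the seven products, and every one of them creates a temporary matrix, which is the memory cost that has to be kept down in a practical implementation. The seven products at each level do not depend on one another, so they are a natural unit for distributing across threads.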

yuriisig

Beginner

07-01-2010 12:26 AM