- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
hello,
i would like to perform the operation A*D*B, where D is diagonal, and A,B are appropriate dimensions using BLAS. there are no routines specific to diagonal matrices, so i was able to make a one-line modification to the DGEMM code available on netlib to implement this. it runs much slower than the BLAS in MKL (of course). is the source code available to make such a modification?
i would like to perform the operation A*D*B, where D is diagonal, and A,B are appropriate dimensions using BLAS. there are no routines specific to diagonal matrices, so i was able to make a one-line modification to the DGEMM code available on netlib to implement this. it runs much slower than the BLAS in MKL (of course). is the source code available to make such a modification?
Link Copied
1 Reply
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
If you compiled netlib DGEMM with a vectorizing compiler, it ought to perform well on problems which don't need cache blocking. By removing the if tests which skip loops involving 0. operations, you would give ifort -O3 more freedom to optimize.
Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page