- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Someonetold me he ovserved a 30% DGEMMperformance difference with MKL on the SAME core 2 duomachine with 32-bit and 64-bit OSes installed. I am not sure if he used the latest version of MKL.I am really curious to know if it is true since we are buying some 64-bit mahcines and we need tochoose between64-bitand 32-bit Linux. We don't plan to get more than 3G memory.My understanding is the difference hereis mainly those additionalxmm and general pupose registers. But can they really help in a computation-intensive kernelsuch asDGEMM which already gave close to peak performance on IA32 machines?How aboutother kernels such as FFT and level-2 blas?Let's assume large array sizes.
I only have access to P4 and Itanium 2 right now so I can' conduct any experiments. Anyone has such an experience?
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Many variables would impact this. On some of my tests on Core 2 Duo, the different thresholds for invoking threading within MKL brought about large performance differences between MKL 8.1 and 9.0, and between 32- and 64-bit. MKL functions which aren't threaded evidently aren't subject to abrupt changes in performance due to thread number selection.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page