- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Can anyone advise the compiler settings to get the maximum raw speed out of matrix multiplicationcalculations using parallel processing while maintaining precise accuracy for REAL*8 calculations? Alldebugging and other development phase features can be turned off.
I am comparing the Intel XE Windows compiler with Lahey Fortran and speed is my main criterion. Does anyone have any comments concerning comparisons between the two compilers?
I am comparing the Intel XE Windows compiler with Lahey Fortran and speed is my main criterion. Does anyone have any comments concerning comparisons between the two compilers?
Link Copied
1 Reply
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
The simplest suggestion is to call the Intel Math Kernel Library for your matrix multiply (DGEMM or similar). You will automatically get the best parallel performance. The Fortran compiler can recognize some forms of matrix-multiply and call MKL on your behalf, if you compile with auto-parallelization and the highest optimization level, but that will also allow some optimizations that may not be as precise as you want.
The /fast option is shorthand for a set of options that usually improve performance, but note that this implies /QxHost which assumes you are running the program on the same CPU type you compile on.
The /fast option is shorthand for a set of options that usually improve performance, but note that this implies /QxHost which assumes you are running the program on the same CPU type you compile on.

Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page