Intel® Fortran Compiler
Build applications that can scale for the future with optimized code designed for Intel® Xeon® and compatible processors.
Announcements
FPGA community forums and blogs on community.intel.com are migrating to the new Altera Community and are read-only. For urgent support needs during this transition, please visit the FPGA Design Resources page or contact an Altera Authorized Distributor.
29285 Discussions

Cache blocking techniques for element-wise math in large arrays - Fortran

art-croucher
Beginner
371 Views
I have a CFD code that does a lot of element-wise (A(i,j)+B(i,j)) math with large arrays. roughly 500x500 R*8s, with most sections of the code using a half dozen of these (1.5MB) arrays at a time.
To clean up the code, and in a naive hope that IFC would figure out the best way to manage the work, we vectorized most of the code.
VTune shows a lot of time wasted with various stores.
Is there a simple technique or a library that can block these operations to be efficient on a Xeon? Hopefully without rewriting all the code!
0 Kudos
1 Reply
Steven_L_Intel1
Employee
371 Views
You may want to ask this in the Integrated Performance Primitives and Math Kernel Library boards. Have you tried compiling with /opt:3?
0 Kudos
Reply