Hi all, I’ve been using the Intel Fortran Compiler for a few projects and want to get better performance out of my scientific code on newer Intel CPUs. What optimization flags, compiler settings, or profiling tools are you using to squeeze out extra performance? Any tips on auto-vectorization or cache-friendly coding patterns would be really helpful!
Thanks in advance!
Attached is a program that compares calculating Pi using Serial, Parallel OpenMP, and Parallel Do Concurrent. The best approach I have found so far is to use these settings:
Optimization: /O3
Intel Processor-Specific Optimization: /QxCORE-AVX2
Add Processor-Optimized Code Path: /QaxCORE-AVX512
Maximize AVX512 usage if it is available: /Qopt-zmm-usage:high
Enable the Fortran preprocessor and OpenMP: /fpp /Qopenmp
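The attachment is not reproduced in this thread, so here is only a minimal sketch of the kind of comparison described, not the attached program. It assumes a midpoint-rule integration of 4/(1+x^2) over [0,1]. The file name `pi_compare.f90`, the interval count `n`, the `ifx` driver name in the header comment, and the choice to fill an array inside `do concurrent` and then call `sum` are all assumptions made for illustration.

```fortran
! Hypothetical file name: pi_compare.f90 (illustrative only).
! One possible Windows build line using the settings listed above
! (the ifx driver name is an assumption):
!   ifx /O3 /QxCORE-AVX2 /QaxCORE-AVX512 /Qopt-zmm-usage:high /fpp /Qopenmp pi_compare.f90
program pi_compare
  use, intrinsic :: iso_fortran_env, only: dp => real64, int64
  implicit none
  integer, parameter  :: n = 10000000        ! number of midpoint intervals (arbitrary)
  real(dp), parameter :: h = 1.0_dp / n      ! interval width
  real(dp), allocatable :: term(:)
  real(dp) :: x, s
  integer :: i
  integer(int64) :: t0, t1, rate

  call system_clock(count_rate=rate)

  ! 1) Serial loop
  call system_clock(t0)
  s = 0.0_dp
  do i = 1, n
    x = h * (i - 0.5_dp)
    s = s + 4.0_dp / (1.0_dp + x*x)
  end do
  call system_clock(t1)
  print '(a,f18.15,a,f8.4,a)', 'serial         ', h*s, '  ', real(t1 - t0, dp)/rate, ' s'

  ! 2) OpenMP: each thread keeps a private x and a private partial sum
  call system_clock(t0)
  s = 0.0_dp
  !$omp parallel do private(x) reduction(+:s)
  do i = 1, n
    x = h * (i - 0.5_dp)
    s = s + 4.0_dp / (1.0_dp + x*x)
  end do
  !$omp end parallel do
  call system_clock(t1)
  print '(a,f18.15,a,f8.4,a)', 'openmp         ', h*s, '  ', real(t1 - t0, dp)/rate, ' s'

  ! 3) do concurrent: iterations are independent because each one writes
  !    only its own array element; the reduction is done afterwards by sum
  allocate(term(n))
  call system_clock(t0)
  do concurrent (i = 1:n)
    term(i) = 4.0_dp / (1.0_dp + (h*(i - 0.5_dp))**2)
  end do
  s = sum(term)
  call system_clock(t1)
  print '(a,f18.15,a,f8.4,a)', 'do concurrent  ', h*s, '  ', real(t1 - t0, dp)/rate, ' s'
end program pi_compare
```

The array in the third variant costs extra memory traffic compared with the two scalar reductions, so it is not a like-for-like timing. Whether the `do concurrent` loop actually runs in parallel depends on the compiler version and flags.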
I find that results vary depending on the compiler and the CPU.
I too am interested in what others are doing to make their code faster.