<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic The question looks more in Intel® Moderncode for Parallel Architectures</title>
    <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Optimization-of-fortran-code-using-parallel-studio-xe/m-p/1103277#M7350</link>
    <description>&lt;P&gt;The question looks better suited to the Intel Linux Fortran forum. But it's difficult to see what you're trying to accomplish with this mix of flags. Why not use -msse4.2 throughout? There's no sse4.3 option for Intel compilers; if the compiler doesn't reject it, we can't guess what will happen.&lt;/P&gt;</description>
    <pubDate>Tue, 21 Jun 2016 14:08:02 GMT</pubDate>
    <dc:creator>TimP</dc:creator>
    <dc:date>2016-06-21T14:08:02Z</dc:date>
    <item>
      <title>Optimization of fortran code using parallel studio xe</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Optimization-of-fortran-code-using-parallel-studio-xe/m-p/1103276#M7349</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;

&lt;P&gt;I have spent the past couple of weeks trying to find the best compilation flags for my CFD code, but I think I may be doing something wrong, because the code behaves strangely.&lt;/P&gt;

&lt;P&gt;My code is written in Fortran (77/95) and uses OpenMPI for parallelisation. According to Intel's product page for my CPU (http://ark.intel.com/products/52576/Intel-Xeon-Processor-X5690-12M-Cache-3_46-GHz-6_40-GTs-Intel-QPI), SSE4.2 is supported and can be used for optimization, so I compiled OpenMPI with the following command:&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;./configure --prefix=/opt/OpenMpi_intel_Opt_static/ CC=icc CXX=icpc FC=ifort CFLAGS="-msse4.2 -axsse4.2" CXXFLAGS="-msse4.2 -axsse4.2" FFLAGS="-msse4.2 -axsse4.2" FCFLAGS="-msse4.2 -axsse4.2" LDFLAGS="-msse4.2 -axsse4.2" --with-platform=optimized --disable-shared --enable-static&lt;/PRE&gt;

&lt;P&gt;This is based on the Intel article (https://software.intel.com/en-us/articles/performance-tools-for-software-developers-building-open-mpi-with-the-intel-compilers).&lt;/P&gt;

&lt;P&gt;My code is then compiled with the following commands:&lt;/P&gt;

&lt;PRE class="brush:;"&gt;mpif90 -c -axsse4.3 -O3    files.f

mpif90 -o prog all.o -axsse4.2 -O3 &lt;/PRE&gt;

&lt;P&gt;Depending on the size of my arrays (the number of array entries equals my domain size), I get either a good result or just "NaN". Whenever I get "NaN", the following remark is shown during compilation:&lt;/P&gt;

&lt;PRE class="brush:;"&gt;MAIN__ has been targeted for automatic cpu dispatch
....&lt;/PRE&gt;

&lt;P&gt;If I remove the "-axsse4.2" flag, the code works fine but takes longer to run!&lt;/P&gt;

&lt;P&gt;Is there another way to optimize my code or change the compilation flags in order to decrease the runtime?&lt;/P&gt;

&lt;P&gt;THX&lt;/P&gt;

&lt;P&gt;EDIT: mistyped -axsse4.2&lt;/P&gt;</description>
      <pubDate>Tue, 21 Jun 2016 12:38:03 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Optimization-of-fortran-code-using-parallel-studio-xe/m-p/1103276#M7349</guid>
      <dc:creator>Christoph_I_</dc:creator>
      <dc:date>2016-06-21T12:38:03Z</dc:date>
    </item>
    <item>
      <title>The question looks more</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Optimization-of-fortran-code-using-parallel-studio-xe/m-p/1103277#M7350</link>
      <description>&lt;P&gt;The question looks better suited to the Intel Linux Fortran forum. But it's difficult to see what you're trying to accomplish with this mix of flags. Why not use -msse4.2 throughout? There's no sse4.3 option for Intel compilers; if the compiler doesn't reject it, we can't guess what will happen.&lt;/P&gt;</description>
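The suggestion to use -msse4.2 throughout could be sketched roughly as follows. This is only a minimal illustration, not a tested build recipe: the variable names ARCH_FLAGS and OPT_FLAGS are assumptions made up for this example, the file names are the ones from the original post, and the commands are only printed rather than executed so the flag set can be checked before running anything.

```shell
# Assumed, illustrative variable names: one architecture flag and one
# optimization level, defined once and reused for every step.
ARCH_FLAGS="-msse4.2"
OPT_FLAGS="-O3"

# Build the compile and link command lines from the same flag set,
# so the two steps cannot drift apart (e.g. 4.3 in one, 4.2 in the other).
compile_cmd="mpif90 -c $ARCH_FLAGS $OPT_FLAGS files.f"
link_cmd="mpif90 -o prog all.o $ARCH_FLAGS $OPT_FLAGS"

# Print the commands for inspection instead of running them.
echo "$compile_cmd"
echo "$link_cmd"
```

Keeping the flags in one place makes a typo in a single step much less likely; the same ARCH_FLAGS value could presumably also be passed to the OpenMPI configure line in place of the mixed -msse4.2 -axsse4.2 settings.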
      <pubDate>Tue, 21 Jun 2016 14:08:02 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Optimization-of-fortran-code-using-parallel-studio-xe/m-p/1103277#M7350</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2016-06-21T14:08:02Z</dc:date>
    </item>
  </channel>
</rss>

