<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Simple matrix vector OpenMP problem in Software Archive</title>
    <link>https://community.intel.com/t5/Software-Archive/Simple-matrix-vector-OpenMP-problem/m-p/744398#M1640</link>
    <description>I selected Release Win32 and that solved the problem. Many thanks!&lt;BR /&gt;&lt;BR /&gt;Now (for N=1000) it runs 4 times faster with 4 threads. However, I think the code is incorrect from the semantics point of view (indexes should not be shared).&lt;BR /&gt;Why does it speed up so poorly with the Debug target selected?</description>
    <pubDate>Sat, 22 May 2010 11:00:51 GMT</pubDate>
    <dc:creator>Mikhail_Posypkin</dc:creator>
    <dc:date>2010-05-22T11:00:51Z</dc:date>
    <item>
      <title>Simple matrix vector OpenMP problem</title>
      <link>https://community.intel.com/t5/Software-Archive/Simple-matrix-vector-OpenMP-problem/m-p/744395#M1637</link>
      <description>Consider the following very simple OpenMP program:&lt;BR /&gt;&lt;BR /&gt;#define n 5000&lt;BR /&gt;double a[n][n], b[n], c[n];&lt;BR /&gt;&lt;BR /&gt;
int _tmain (int argc, char *argv[])&lt;BR /&gt;{&lt;BR /&gt; omp_set_num_threads (1);&lt;BR /&gt; double start = omp_get_wtime ();&lt;BR /&gt;&lt;BR /&gt;
#pragma omp parallel for&lt;BR /&gt; for (int i = 0; i &amp;lt; n; i++)&lt;BR /&gt; {&lt;BR /&gt; c[i] = 0;&lt;BR /&gt; for (int k = 0; k &amp;lt; n; k++)&lt;BR /&gt; {&lt;BR /&gt; c[i] += a[i][k] * b[k];&lt;BR /&gt; }&lt;BR /&gt;&lt;BR /&gt; }&lt;BR /&gt;
 printf ("time = %lf (%d)\n", omp_get_wtime () - start, omp_get_num_threads());&lt;BR /&gt;}&lt;BR /&gt;&lt;BR /&gt;
On my Intel Core 2 Quad it runs in 0.17 s with 1 thread, 0.18 s with 2 threads, and 0.18 s with 4 threads.&lt;BR /&gt;So, no speedup.&lt;BR /&gt;&lt;BR /&gt;
Intel Parallel Amplifier shows bad utilization, but doesn't show the reason. How can I discover the reason with the Intel tools integrated in MS Visual Studio?</description>
      <pubDate>Fri, 21 May 2010 16:33:43 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Simple-matrix-vector-OpenMP-problem/m-p/744395#M1637</guid>
      <dc:creator>Mikhail_Posypkin</dc:creator>
      <dc:date>2010-05-21T16:33:43Z</dc:date>
    </item>
    <item>
      <title>Simple matrix vector OpenMP problem</title>
      <link>https://community.intel.com/t5/Software-Archive/Simple-matrix-vector-OpenMP-problem/m-p/744396#M1638</link>
      <description>Please check: are you building the Debug version or the Release version?</description>
      <pubDate>Sat, 22 May 2010 01:34:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Simple-matrix-vector-OpenMP-problem/m-p/744396#M1638</guid>
      <dc:creator>afd_lml</dc:creator>
      <dc:date>2010-05-22T01:34:06Z</dc:date>
    </item>
    <item>
      <title>Simple matrix vector OpenMP problem</title>
      <link>https://community.intel.com/t5/Software-Archive/Simple-matrix-vector-OpenMP-problem/m-p/744397#M1639</link>
      <description>Debug target is selected.</description>
      <pubDate>Sat, 22 May 2010 08:17:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Simple-matrix-vector-OpenMP-problem/m-p/744397#M1639</guid>
      <dc:creator>Mikhail_Posypkin</dc:creator>
      <dc:date>2010-05-22T08:17:15Z</dc:date>
    </item>
    <item>
      <title>Simple matrix vector OpenMP problem</title>
      <link>https://community.intel.com/t5/Software-Archive/Simple-matrix-vector-OpenMP-problem/m-p/744398#M1640</link>
      <description>I selected Release Win32 and that solved the problem. Many thanks!&lt;BR /&gt;&lt;BR /&gt;Now (for N=1000) it runs 4 times faster with 4 threads. However, I think the code is incorrect from the semantics point of view (indexes should not be shared).&lt;BR /&gt;Why does it speed up so poorly with the Debug target selected?</description>
      <pubDate>Sat, 22 May 2010 11:00:51 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Simple-matrix-vector-OpenMP-problem/m-p/744398#M1640</guid>
      <dc:creator>Mikhail_Posypkin</dc:creator>
      <dc:date>2010-05-22T11:00:51Z</dc:date>
    </item>
    <item>
      <title>Simple matrix vector OpenMP problem</title>
      <link>https://community.intel.com/t5/Software-Archive/Simple-matrix-vector-OpenMP-problem/m-p/744399#M1641</link>
      <description>For the inner loop, auto-vectorization from /arch:SSE2|SSE3|SSE4 or /QxSSE2... will help performance a lot, but it's disabled in "Debug". &lt;BR /&gt;&lt;BR /&gt;You can use /Qvec-report[1|2|3|4|5] to show the details. &lt;BR /&gt;For OpenMP, use /Qopenmp-report[1|2].&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BLOCKQUOTE&gt;&lt;P&gt;&amp;gt;&amp;gt;icl /O2 /Qopenmp /Ob2 /Qvec-report3 /Qopenmp-report:2 u.cpp&lt;BR /&gt;Intel C++ Compiler for applications running on IA-32, Version 12.0.0.024 Beta Build 20100415&lt;BR /&gt;Copyright (C) 1985-2010 Intel Corporation. All rights reserved.&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN style="font-family: Courier New;"&gt;u.cpp&lt;BR /&gt;C:\temp\u.cpp(12): (col. 1) remark: OpenMP DEFINED LOOP WAS PARALLELIZED.&lt;BR /&gt;C:\temp\u.cpp(13): (col. 3) remark: loop was not vectorized: not inner loop.&lt;BR /&gt;C:\temp\u.cpp(16): (col. 7) remark: LOOP WAS VECTORIZED.&lt;BR /&gt;Microsoft  Incremental Linker Version 9.00.30729.01&lt;BR /&gt;Copyright (C) Microsoft Corporation. All rights reserved.&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;BR /&gt;You can add /Qvec-report3 /Qopenmp-report2 to the project property C/C++-&amp;gt;Advanced-&amp;gt;Additional option. &lt;BR /&gt;&lt;BR /&gt;Jennifer &lt;/P&gt;</description>
      <pubDate>Tue, 25 May 2010 17:47:12 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Simple-matrix-vector-OpenMP-problem/m-p/744399#M1641</guid>
      <dc:creator>JenniferJ</dc:creator>
      <dc:date>2010-05-25T17:47:12Z</dc:date>
    </item>
  </channel>
</rss>

