<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic oom may have killed your job: in Intel® Fortran Compiler</title>
    <link>https://community.intel.com/t5/Intel-Fortran-Compiler/Why-my-fortran-jobs-got-killed/m-p/1050717#M115124</link>
    <description>&lt;P&gt;oom may have killed your job:&lt;/P&gt;

&lt;P&gt;&lt;A href="https://www.kernel.org/doc/gorman/html/understand/understand016.html"&gt;https://www.kernel.org/doc/gorman/html/understand/understand016.html&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Jim Dempsey&lt;/P&gt;</description>
    <pubDate>Sat, 14 Mar 2015 16:23:17 GMT</pubDate>
    <dc:creator>jimdempseyatthecove</dc:creator>
    <dc:date>2015-03-14T16:23:17Z</dc:date>
    <item>
      <title>Why my fortran jobs got killed?</title>
      <link>https://community.intel.com/t5/Intel-Fortran-Compiler/Why-my-fortran-jobs-got-killed/m-p/1050716#M115123</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;I have encountered a very strange problem while running my fortran code on a &lt;/SPAN&gt;&lt;SPAN style="font-size: 13.0080003738403px; line-height: 19.5120010375977px;"&gt;ubuntu&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;system. I was using the &lt;STRONG&gt;PARDISO&lt;/STRONG&gt; to solve a very '&lt;STRONG&gt;huge&lt;/STRONG&gt;' system using the &lt;STRONG&gt;OOC&lt;/STRONG&gt; mode. For a first case, according to iparm(17), it required 680G harddisk storage. And the problem was solved without any issue. For the second case, the matrix size is even bigger, almost twice of the previous one, but the harddisk usage is not increased much, &lt;/SPAN&gt;&lt;SPAN style="font-size: 13.0080003738403px; line-height: 19.5120010375977px;"&gt;around 700G according to iparm(17).&lt;/SPAN&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;&amp;nbsp;And this is the only difference between these two problems. But the job of the second problem was killed after phase 22. I have used phase 11, 22 and 33 in my code. For both cases, I have set the ulimit to unlimited and the KMP_STACKSIZE to 5G. So, what should be the problem? Why the second job was killed by the system. Any suggestion will be much appreciated.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 13.0080003738403px; line-height: 19.5120010375977px;"&gt;By the way, the machine I was using has 250G RAM, but for large problems I still needs to use the OOC mode.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 13 Mar 2015 15:52:30 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Fortran-Compiler/Why-my-fortran-jobs-got-killed/m-p/1050716#M115123</guid>
      <dc:creator>Gators_vs__Sundevil</dc:creator>
      <dc:date>2015-03-13T15:52:30Z</dc:date>
    </item>
    <item>
      <title>oom may have killed your job:</title>
      <link>https://community.intel.com/t5/Intel-Fortran-Compiler/Why-my-fortran-jobs-got-killed/m-p/1050717#M115124</link>
      <description>&lt;P&gt;oom may have killed your job:&lt;/P&gt;

&lt;P&gt;&lt;A href="https://www.kernel.org/doc/gorman/html/understand/understand016.html"&gt;https://www.kernel.org/doc/gorman/html/understand/understand016.html&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Jim Dempsey&lt;/P&gt;</description>
      <pubDate>Sat, 14 Mar 2015 16:23:17 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Fortran-Compiler/Why-my-fortran-jobs-got-killed/m-p/1050717#M115124</guid>
      <dc:creator>jimdempseyatthecove</dc:creator>
      <dc:date>2015-03-14T16:23:17Z</dc:date>
    </item>
    <item>
      <title>Quote:jimdempseyatthecove</title>
      <link>https://community.intel.com/t5/Intel-Fortran-Compiler/Why-my-fortran-jobs-got-killed/m-p/1050718#M115125</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;jimdempseyatthecove wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;oom may have killed your job:&lt;/P&gt;

&lt;P&gt;&lt;A href="https://www.kernel.org/doc/gorman/html/understand/understand016.html" rel="nofollow"&gt;https://www.kernel.org/doc/gorman/html/understand/understand016.html&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Jim Dempsey&lt;/P&gt;

&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;Hi, Jim,&lt;/P&gt;

&lt;P&gt;Thanks for your information. The total&amp;nbsp;&lt;SPAN style="font-size: 13.0080003738403px; line-height: 19.5120010375977px;"&gt;available&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;harddisk storage is 2T and the RAM is 250G. As mentioned, I was using the OOC mode. Thus, there should not consume much RAM in my cases. All the required space to store the LU results is on the harddisk.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;Anyway, if it was oom killed the process, how can i avoid this happen again, i.e., is there any way to set oom not to kill the process? &amp;nbsp;Much appreciated.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 14 Mar 2015 16:53:50 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Fortran-Compiler/Why-my-fortran-jobs-got-killed/m-p/1050718#M115125</guid>
      <dc:creator>Gators_vs__Sundevil</dc:creator>
      <dc:date>2015-03-14T16:53:50Z</dc:date>
    </item>
    <item>
      <title>For questions like this,</title>
      <link>https://community.intel.com/t5/Intel-Fortran-Compiler/Why-my-fortran-jobs-got-killed/m-p/1050719#M115126</link>
      <description>&lt;P&gt;For questions like this, Google is your friend. Search for "disable oom".&lt;/P&gt;

&lt;P&gt;Possibly: &lt;A href="http://thetechnick.blogspot.com/2010/12/steps-to-disable-oom-on-linux.html"&gt;http://thetechnick.blogspot.com/2010/12/steps-to-disable-oom-on-linux.html&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;That is for system wide disable. You might want to do some googling on your own to see how you can specifically do this for a given application. Note, the references I found, indicate you can do this for a specific process ID as opposed to via path to process. So your&amp;nbsp;app would have to get its PID and then write the appropriate flag value.&lt;/P&gt;

&lt;P&gt;Jim Dempsey&lt;/P&gt;</description>
      <pubDate>Sun, 15 Mar 2015 14:53:56 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Fortran-Compiler/Why-my-fortran-jobs-got-killed/m-p/1050719#M115126</guid>
      <dc:creator>jimdempseyatthecove</dc:creator>
      <dc:date>2015-03-15T14:53:56Z</dc:date>
    </item>
    <item>
      <title>Quote:jimdempseyatthecove</title>
      <link>https://community.intel.com/t5/Intel-Fortran-Compiler/Why-my-fortran-jobs-got-killed/m-p/1050720#M115127</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;jimdempseyatthecove wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;For questions like this, Google is your friend. Search for "disable oom".&lt;/P&gt;

&lt;P&gt;Possibly: &lt;A href="http://thetechnick.blogspot.com/2010/12/steps-to-disable-oom-on-linux.html" rel="nofollow"&gt;http://thetechnick.blogspot.com/2010/12/steps-to-disable-oom-on-linux.html&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;That is for system wide disable. You might want to do some googling on your own to see how you can specifically do this for a given application. Note, the references I found, indicate you can do this for a specific process ID as opposed to via path to process. So your&amp;nbsp;app would have to get its PID and then write the appropriate flag value.&lt;/P&gt;

&lt;P&gt;Jim Dempsey&lt;/P&gt;

&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;Thanks, Jim. I found out a way how to solve the problem. I reduced the max amount of RAM can be used by OOC in the config file (MKL_PARDISO_OOC_MAX_CORE_SIZE,&amp;nbsp;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;MKL_PARDISO_OOC_MAX_SWAP_SIZE). Previously, these two numbers were almost the same as the available RAM for the system. I think this may be the reason why the OOM decided to kill the job.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 16 Mar 2015 18:06:11 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Fortran-Compiler/Why-my-fortran-jobs-got-killed/m-p/1050720#M115127</guid>
      <dc:creator>Gators_vs__Sundevil</dc:creator>
      <dc:date>2015-03-16T18:06:11Z</dc:date>
    </item>
  </channel>
</rss>

