<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic intel compiler in tesla machine in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838859#M6277</link>
    <description>As you must be aware, Fortran compilers for Tesla run on a host machine, and support off-loading of cuda library code to run on Tesla under syntax resembling OpenMP. There is no compiler which installs on Tesla, nor has any decision been made about an Intel compiler supporting Tesla. Intel Fortran could be installed on a host machine for Tesla, but would not utilize Tesla unless you made yourself an interface to cuda host tools.</description>
    <pubDate>Thu, 15 Apr 2010 13:34:37 GMT</pubDate>
    <dc:creator>TimP</dc:creator>
    <dc:date>2010-04-15T13:34:37Z</dc:date>
    <item>
      <title>intel compiler in tesla machine</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838857#M6275</link>
      <description>Hello sir.&lt;BR /&gt;I wanr to ask whether the intel fortran compiler for linux&lt;BR /&gt;can be installed on the nvidia tesla machine</description>
      <pubDate>Thu, 15 Apr 2010 07:06:42 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838857#M6275</guid>
      <dc:creator>ahmediiit</dc:creator>
      <dc:date>2010-04-15T07:06:42Z</dc:date>
    </item>
    <item>
      <title>intel compiler in tesla machine</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838858#M6276</link>
      <description>Hello,&lt;DIV&gt;Please look &lt;A href="http://software.intel.com/en-us/articles/intel-fortran-compiler-111-release-notes/"&gt;here&lt;/A&gt; a t the Intel Fortran Compiler Release Notes to find out the appropriate System Requirements.&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Thu, 15 Apr 2010 13:08:58 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838858#M6276</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2010-04-15T13:08:58Z</dc:date>
    </item>
    <item>
      <title>intel compiler in tesla machine</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838859#M6277</link>
      <description>As you must be aware, Fortran compilers for Tesla run on a host machine, and support off-loading of cuda library code to run on Tesla under syntax resembling OpenMP. There is no compiler which installs on Tesla, nor has any decision been made about an Intel compiler supporting Tesla. Intel Fortran could be installed on a host machine for Tesla, but would not utilize Tesla unless you made yourself an interface to cuda host tools.</description>
      <pubDate>Thu, 15 Apr 2010 13:34:37 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838859#M6277</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2010-04-15T13:34:37Z</dc:date>
    </item>
    <item>
      <title>intel compiler in tesla machine</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838860#M6278</link>
      <description>Hello sir,&lt;BR /&gt;&lt;BR /&gt;this means that if i install intel fortran compiler on the host machine it will not utilise &lt;BR /&gt;the multi cores of tesla.Is there any tool to make it compatible with cuda to use &lt;BR /&gt;mkl pardiso on the tesla?&lt;BR /&gt;&lt;BR /&gt;presently i am using the IVF Compiler with mkl for solving linear equation(pardiso).&lt;BR /&gt;&lt;B&gt;My system is intel xeon processor (e5520) with 8 cores.&lt;/B&gt;&lt;BR /&gt;I need to solve large sparsematrice around 50,00000 size matrice for many iteration.&lt;BR /&gt;&lt;BR /&gt;the system is taking lot of time.&lt;BR /&gt;please give some suggestion how to increase the speed. or changing the processor.&lt;BR /&gt;any processor where pardiso isefficient?&lt;BR /&gt;Is there any other solver faster than pardiso?&lt;BR /&gt;or can we attach one more processor to the present system?&lt;BR /&gt;does pardiso works on the cluster?&lt;BR /&gt;</description>
      <pubDate>Mon, 19 Apr 2010 03:16:14 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838860#M6278</guid>
      <dc:creator>ahmediiit</dc:creator>
      <dc:date>2010-04-19T03:16:14Z</dc:date>
    </item>
    <item>
      <title>intel compiler in tesla machine</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838861#M6279</link>
      <description>Hello Ahmed,&lt;DIV&gt;quote:""I need to solve large sparsematrice around &lt;B&gt;50,00000&lt;/B&gt; size matrice for many iteration."&lt;/DIV&gt;&lt;DIV&gt;Do you mean the input matrices size is 5 000 000?&lt;/DIV&gt;&lt;DIV&gt;What mode ( in-core, out-of-core, hybryd) of PARDISO are you using?&lt;/DIV&gt;&lt;DIV&gt;What MKL version?&lt;/DIV&gt;&lt;DIV&gt;--Gennady&lt;/DIV&gt;</description>
      <pubDate>Mon, 19 Apr 2010 05:21:27 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838861#M6279</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2010-04-19T05:21:27Z</dc:date>
    </item>
    <item>
      <title>intel compiler in tesla machine</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838862#M6280</link>
      <description>hello sir&lt;BR /&gt;&lt;BR /&gt;50,00000 is thesizeof matrix not nonzero elements.&lt;BR /&gt;nonzero is around 1500,00000.&lt;BR /&gt;mkl version 10.2.3.029.&lt;BR /&gt;going for incore&lt;BR /&gt;RAM is around 24GB&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 19 Apr 2010 06:56:08 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838862#M6280</guid>
      <dc:creator>ahmediiit</dc:creator>
      <dc:date>2010-04-19T06:56:08Z</dc:date>
    </item>
    <item>
      <title>intel compiler in tesla machine</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838863#M6281</link>
      <description>Yes, this is very good size :).&lt;DIV&gt;Are you sure you don't swapping the calculation? because of the input task size will requires ~2 Gb of RAM at least ( nnz * sizeof(double) + ja * sizeof(int) ~ 2 GB). Then, at the factorization stage may requires more then 10 times memory versus the original and therefore in this case you will have swap.&lt;/DIV&gt;&lt;DIV&gt;Can you check it with&lt;STRONG&gt;iparm&lt;SAMP class="codeph"&gt;(18) -&lt;SPAN style="font-weight: normal;"&gt;the solver will report the numbers of non-zero elements on the factors.&lt;/SPAN&gt;&lt;/SAMP&gt;&lt;/STRONG&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;STRONG&gt;&lt;SAMP class="codeph"&gt;&lt;SPAN style="font-weight: normal;"&gt;--Gennady&lt;/SPAN&gt;&lt;/SAMP&gt;&lt;/STRONG&gt;&lt;/DIV&gt;</description>
      <pubDate>Mon, 19 Apr 2010 13:02:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838863#M6281</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2010-04-19T13:02:34Z</dc:date>
    </item>
    <item>
      <title>intel compiler in tesla machine</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838864#M6282</link>
      <description>Hello sir,&lt;BR /&gt;i am sending the output for size 12lac size ( 1200000) MATRIx&lt;BR /&gt;for 50 lac there is problem with my code,we will see it later,&lt;BR /&gt;now please tell me how to make this still faster.&lt;BR /&gt;&lt;BR /&gt;in the start of the program the size of the pf usage is around 3.5 GB&lt;BR /&gt;and when it enters the pardiso subroutine it increases to 22.5 GB&lt;BR /&gt;&lt;BR /&gt; nonzero= 15346680&lt;BR /&gt;solution start&lt;BR /&gt;&lt;BR /&gt;== PARDISO is running in In-Core mode, because iparam(60)=0 ===&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;=============== PARDISO: solving a symmetric indef. system ===========&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;ummary PARDISO: ( reorder to reorder )&lt;BR /&gt;===============&lt;BR /&gt;&lt;BR /&gt;imes:&lt;BR /&gt;=====&lt;BR /&gt; Time fulladj: 0.098160 s&lt;BR /&gt; Time reorder: 7.023050 s&lt;BR /&gt; Time symbfct: 8.933176 s&lt;BR /&gt; Time parlist: 0.275664 s&lt;BR /&gt; Time malloc : 0.825073 s&lt;BR /&gt; Time total : 19.331269 s total - sum: 2.176147 s&lt;BR /&gt;&lt;BR /&gt;tatistics:&lt;BR /&gt;==========&lt;BR /&gt;&amp;lt; Parallel Direct Factorization with #processors: &amp;gt; 8&lt;BR /&gt;&amp;lt; Numerical Factorization with BLAS3 and O(n) synchronization &amp;gt;&lt;BR /&gt;&lt;BR /&gt;&amp;lt; Linear system Ax = b&amp;gt; &lt;TRANSPOSE&gt;&lt;BR /&gt; #equations: 1200000&lt;BR /&gt; #non-zeros in A: 15346680&lt;BR /&gt; non-zeros in A (): 0.001066&lt;BR /&gt;&lt;BR /&gt; #right-hand sides: 1&lt;BR /&gt;&lt;BR /&gt;&amp;lt; Factors L and U &amp;gt;&lt;BR /&gt; #columns for each panel: 96&lt;BR /&gt; #independent subgraphs: 0&lt;BR /&gt; #supernodes: 483385&lt;BR /&gt; size of largest supernode: 12302&lt;BR /&gt; number of nonzeros in L 2403511054&lt;BR /&gt; number of nonzeros in U 1&lt;BR /&gt; number of nonzeros in L+U 2403511055&lt;BR /&gt;Percentage of computed non-zeros for LL^T factorization&lt;BR /&gt;0 %&lt;BR /&gt;1 %&lt;BR /&gt;2 %&lt;BR /&gt;3 %&lt;BR /&gt;4 %&lt;BR /&gt;5 %&lt;BR /&gt;6 %&lt;BR /&gt;7 %&lt;BR /&gt;8 %&lt;BR /&gt;9 %&lt;BR /&gt;10 %&lt;BR /&gt;11 %&lt;BR /&gt;12 %&lt;BR /&gt;13 %&lt;BR /&gt;14 %&lt;BR /&gt;15 %&lt;BR /&gt;16 %&lt;BR /&gt;17 %&lt;BR /&gt;18 %&lt;BR /&gt;19 %&lt;BR /&gt;20 %&lt;BR /&gt;21 %&lt;BR /&gt;22 %&lt;BR /&gt;23 %&lt;BR /&gt;24 %&lt;BR /&gt;25 %&lt;BR /&gt;26 %&lt;BR /&gt;27 %&lt;BR /&gt;28 %&lt;BR /&gt;29 %&lt;BR /&gt;30 %&lt;BR /&gt;31 %&lt;BR /&gt;32 %&lt;BR /&gt;33 %&lt;BR /&gt;34 %&lt;BR /&gt;35 %&lt;BR /&gt;36 %&lt;BR /&gt;37 %&lt;BR /&gt;38 %&lt;BR /&gt;39 %&lt;BR /&gt;40 %&lt;BR /&gt;41 %&lt;BR /&gt;42 %&lt;BR /&gt;43 %&lt;BR /&gt;44 %&lt;BR /&gt;45 %&lt;BR /&gt;46 %&lt;BR /&gt;47 %&lt;BR /&gt;48 %&lt;BR /&gt;49 %&lt;BR /&gt;50 %&lt;BR /&gt;51 %&lt;BR /&gt;52 %&lt;BR /&gt;53 %&lt;BR /&gt;54 %&lt;BR /&gt;55 %&lt;BR /&gt;56 %&lt;BR /&gt;57 %&lt;BR /&gt;58 %&lt;BR /&gt;59 %&lt;BR /&gt;60 %&lt;BR /&gt;61 %&lt;BR /&gt;62 %&lt;BR /&gt;63 %&lt;BR /&gt;64 %&lt;BR /&gt;65 %&lt;BR /&gt;66 %&lt;BR /&gt;67 %&lt;BR /&gt;68 %&lt;BR /&gt;69 %&lt;BR /&gt;70 %&lt;BR /&gt;71 %&lt;BR /&gt;72 %&lt;BR /&gt;73 %&lt;BR /&gt;74 %&lt;BR /&gt;75 %&lt;BR /&gt;76 %&lt;BR /&gt;77 %&lt;BR /&gt;78 %&lt;BR /&gt;79 %&lt;BR /&gt;80 %&lt;BR /&gt;81 %&lt;BR /&gt;82 %&lt;BR /&gt;83 %&lt;BR /&gt;84 %&lt;BR /&gt;85 %&lt;BR /&gt;86 %&lt;BR /&gt;87 %&lt;BR /&gt;88 %&lt;BR /&gt;89 %&lt;BR /&gt;90 %&lt;BR /&gt;91 %&lt;BR /&gt;92 %&lt;BR /&gt;93 %&lt;BR /&gt;94 %&lt;BR /&gt;95 %&lt;BR /&gt;96 %&lt;BR /&gt;97 %&lt;BR /&gt;98 %&lt;BR /&gt;99 %&lt;BR /&gt;100 %&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;=============== PARDISO: solving a symmetric indef. system ===========&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;ummary PARDISO: ( factorize to factorize )&lt;BR /&gt;===============&lt;BR /&gt;&lt;BR /&gt;imes:&lt;BR /&gt;=====&lt;BR /&gt; Time A to LU: 0.000000 s&lt;BR /&gt; Time numfct : 776.535926 s&lt;BR /&gt; Time malloc : 0.048327 s&lt;BR /&gt; Time total : 776.586925 s total - sum: 0.002672 s&lt;BR /&gt;&lt;BR /&gt;tatistics:&lt;BR /&gt;==========&lt;BR /&gt;&amp;lt; Parallel Direct Factorization with #processors: &amp;gt; 8&lt;BR /&gt;&amp;lt; Numerical Factorization with BLAS3 and O(n) synchronization &amp;gt;&lt;BR /&gt;&lt;BR /&gt;&amp;lt; Linear system Ax = b&amp;gt; &lt;TRANSPOSE&gt;&lt;BR /&gt; #equations: 1200000&lt;BR /&gt; #non-zeros in A: 15346680&lt;BR /&gt; non-zeros in A (): 0.001066&lt;BR /&gt;&lt;BR /&gt; #right-hand sides: 1&lt;BR /&gt;&lt;BR /&gt;&amp;lt; Factors L and U &amp;gt;&lt;BR /&gt; #columns for each panel: 96&lt;BR /&gt; #independent subgraphs: 0&lt;BR /&gt; #supernodes: 483385&lt;BR /&gt; size of largest supernode: 12302&lt;BR /&gt; number of nonzeros in L 2403511054&lt;BR /&gt; number of nonzeros in U 1&lt;BR /&gt; number of nonzeros in L+U 2403511055&lt;BR /&gt; gflop for the numerical factorization: 22853.815471&lt;BR /&gt;&lt;BR /&gt; gflop/s for the numerical factorization: 29.430468&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;=============== PARDISO: solving a symmetric indef. system ===========&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;ummary PARDISO: ( solve to solve )&lt;BR /&gt;===============&lt;BR /&gt;&lt;BR /&gt;imes:&lt;BR /&gt;=====&lt;BR /&gt; Time solve : 9.663265 s&lt;BR /&gt; Time total : 29.857200 s total - sum: 20.193935 s&lt;BR /&gt;&lt;BR /&gt;tatistics:&lt;BR /&gt;==========&lt;BR /&gt;&amp;lt; Parallel Direct Factorization with #processors: &amp;gt; 8&lt;BR /&gt;&amp;lt; Numerical Factorization with BLAS3 and O(n) synchronization &amp;gt;&lt;BR /&gt;&lt;BR /&gt;&amp;lt; Linear system Ax = b&amp;gt; &lt;TRANSPOSE&gt;&lt;BR /&gt; #equations: 1200000&lt;BR /&gt; #non-zeros in A: 15346680&lt;BR /&gt; non-zeros in A (): 0.001066&lt;BR /&gt;&lt;BR /&gt; #right-hand sides: 1&lt;BR /&gt;&lt;BR /&gt;&amp;lt; Factors L and U &amp;gt;&lt;BR /&gt; #columns for each panel: 96&lt;BR /&gt; #independent subgraphs: 0&lt;BR /&gt; #supernodes: 483385&lt;BR /&gt; size of largest supernode: 12302&lt;BR /&gt; number of nonzeros in L 2403511054&lt;BR /&gt; number of nonzeros in U 1&lt;BR /&gt; number of nonzeros in L+U 2403511055&lt;BR /&gt; gflop for the numerical factorization: 22853.815471&lt;BR /&gt;&lt;BR /&gt; gflop/s for the numerical factorization: 29.430468&lt;BR /&gt;&lt;BR /&gt;solution end&lt;BR /&gt;&lt;BR /&gt;&lt;/TRANSPOSE&gt;&lt;/TRANSPOSE&gt;&lt;/TRANSPOSE&gt;</description>
      <pubDate>Tue, 20 Apr 2010 03:40:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838864#M6282</guid>
      <dc:creator>ahmediiit</dc:creator>
      <dc:date>2010-04-20T03:40:32Z</dc:date>
    </item>
    <item>
      <title>intel compiler in tesla machine</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838865#M6283</link>
      <description>Could you please check the scalability of the solution by linking your application with the serial libraries.&lt;DIV&gt;--Gennady&lt;/DIV&gt;</description>
      <pubDate>Tue, 20 Apr 2010 06:06:31 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838865#M6283</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2010-04-20T06:06:31Z</dc:date>
    </item>
    <item>
      <title>intel compiler in tesla machine</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838866#M6284</link>
      <description>Hello sir&lt;BR /&gt;how to link with serial libraries&lt;BR /&gt;</description>
      <pubDate>Tue, 20 Apr 2010 06:49:07 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838866#M6284</guid>
      <dc:creator>ahmediiit</dc:creator>
      <dc:date>2010-04-20T06:49:07Z</dc:date>
    </item>
    <item>
      <title>intel compiler in tesla machine</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838867#M6285</link>
      <description>please use &lt;A href="http://software.intel.com/en-us/articles/intel-mkl-link-line-advisor/"&gt;the Linker Adviser&lt;/A&gt; to have the appropriate linking line.&lt;BR /&gt;</description>
      <pubDate>Tue, 20 Apr 2010 07:24:46 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/intel-compiler-in-tesla-machine/m-p/838867#M6285</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2010-04-20T07:24:46Z</dc:date>
    </item>
  </channel>
</rss>

