<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Matrix Q of QR decomposition in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Matrix-Q-of-QR-decomposition/m-p/794485#M2541</link>
    <description>Hello,&lt;BR /&gt;I need to obtain matrix Q of QR decomposition, so I've been 
using function *geqrf followed by *orgqr and it performs well. The 
problem is when I use this functions with threaded mkl where i've got 
good speed up with *geqrf, but no speed up with *orgqr. I've seen user's
 manual and it seems that *orgqr is not a threaded function. I have also tested function 'dormqr()' being matrix C an identity matrix and
 as expected, I have same results as using 'dorgqr()' function. As we 
can see in the manual, 'dormqr()' is a threaded function, so I've got 
matrix Q faster than with 'dorgqr()', but for my surprise when I run 
both functions in sequential MKL 'dormqr()' is near 3 times faster than 
'dorgqr()'. How is it posible if both functions make the same and 
'dormqr()' also make a matrix-matrix multiplication?&lt;BR /&gt;&lt;BR /&gt;thanks!! :)&lt;BR /&gt;Jorge&lt;BR /&gt;</description>
    <pubDate>Thu, 24 Feb 2011 15:48:58 GMT</pubDate>
    <dc:creator>Jorge_Lorente</dc:creator>
    <dc:date>2011-02-24T15:48:58Z</dc:date>
    <item>
      <title>Matrix Q of QR decomposition</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Matrix-Q-of-QR-decomposition/m-p/794485#M2541</link>
      <description>Hello,&lt;BR /&gt;I need to obtain matrix Q of QR decomposition, so I've been 
using function *geqrf followed by *orgqr and it performs well. The 
problem is when I use this functions with threaded mkl where i've got 
good speed up with *geqrf, but no speed up with *orgqr. I've seen user's
 manual and it seems that *orgqr is not a threaded function. I have also tested function 'dormqr()' being matrix C an identity matrix and
 as expected, I have same results as using 'dorgqr()' function. As we 
can see in the manual, 'dormqr()' is a threaded function, so I've got 
matrix Q faster than with 'dorgqr()', but for my surprise when I run 
both functions in sequential MKL 'dormqr()' is near 3 times faster than 
'dorgqr()'. How is it posible if both functions make the same and 
'dormqr()' also make a matrix-matrix multiplication?&lt;BR /&gt;&lt;BR /&gt;thanks!! :)&lt;BR /&gt;Jorge&lt;BR /&gt;</description>
      <pubDate>Thu, 24 Feb 2011 15:48:58 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Matrix-Q-of-QR-decomposition/m-p/794485#M2541</guid>
      <dc:creator>Jorge_Lorente</dc:creator>
      <dc:date>2011-02-24T15:48:58Z</dc:date>
    </item>
    <item>
      <title>Matrix Q of QR decomposition</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Matrix-Q-of-QR-decomposition/m-p/794486#M2542</link>
      <description>Hi Jorge,&lt;DIV&gt;1. Yes you are right. The allfunctions ?(or/un)gqr are not threaded. We will do that in one of the next updates.&lt;/DIV&gt;&lt;DIV&gt;2. regardingdormqr : what size of the tasks?&lt;/DIV&gt;&lt;DIV&gt;--Gennady&lt;/DIV&gt;</description>
      <pubDate>Fri, 25 Feb 2011 08:34:55 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Matrix-Q-of-QR-decomposition/m-p/794486#M2542</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2011-02-25T08:34:55Z</dc:date>
    </item>
    <item>
      <title>Matrix Q of QR decomposition</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Matrix-Q-of-QR-decomposition/m-p/794487#M2543</link>
      <description>I use dorgqr and dormqr with same matrix dimensions (2250x2249) of matrix Q. Is it important for their performance??&lt;BR /&gt;Thanks Gennady</description>
      <pubDate>Wed, 02 Mar 2011 15:50:12 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Matrix-Q-of-QR-decomposition/m-p/794487#M2543</guid>
      <dc:creator>Jorge_Lorente</dc:creator>
      <dc:date>2011-03-02T15:50:12Z</dc:date>
    </item>
    <item>
      <title>Matrix Q of QR decomposition</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Matrix-Q-of-QR-decomposition/m-p/794488#M2544</link>
      <description>&lt;DIV id="_mcePaste"&gt;For the dormqr these sizes are enough to see the performance benefitsof using threading.&lt;/DIV&gt;</description>
      <pubDate>Fri, 11 Mar 2011 17:50:10 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Matrix-Q-of-QR-decomposition/m-p/794488#M2544</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2011-03-11T17:50:10Z</dc:date>
    </item>
    <item>
      <title>Matrix Q of QR decomposition</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Matrix-Q-of-QR-decomposition/m-p/794489#M2545</link>
      <description>I know, but my question is why dormqr is faster (near 3 times) than dorgqr if I'm executing both in sequential MKL??</description>
      <pubDate>Mon, 14 Mar 2011 08:48:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Matrix-Q-of-QR-decomposition/m-p/794489#M2545</guid>
      <dc:creator>Jorge_Lorente</dc:creator>
      <dc:date>2011-03-14T08:48:34Z</dc:date>
    </item>
  </channel>
</rss>

