<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic H.264 performence issues in IPP 7.0.2 in Intel® Integrated Performance Primitives</title>
    <link>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785611#M1971</link>
    <description>We found the same problem when running H264 decoder with simple player from IPP7.0 sample code.&lt;BR /&gt;&lt;BR /&gt;In our westmere system with Linux, with threadNum &amp;gt;1, the total CPU usage is much higher compared to single thread H.264 decoding. We also found that with threadNum &amp;gt;1, systime is very high, while for single thread H.264 decoding, system time is almost zero.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;IPP7.0 with 1 thread &lt;BR /&gt; PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND&lt;BR /&gt;55225 root 20 0 147m 60m 3376 S 54 0.5 0:30.57 simple_player7V&lt;BR /&gt;&lt;BR /&gt;IPP7.0 with 3 threads &lt;BR /&gt; PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND&lt;BR /&gt;55362 root 20 0 242m 81m 4736 R 69 0.7 0:52.33 simple_player&lt;BR /&gt;55363 root 20 0 242m 81m 4736 R 69 0.7 0:52.62 simple_player&lt;BR /&gt;55364 root 20 0 242m 81m 4736 S 3 0.7 0:02.25 simple_player&lt;BR /&gt;&lt;BR /&gt;There are 6 processors: Here is the info for process 0.&lt;BR /&gt;cat /proc/cpuinfo&lt;BR /&gt;processor : 0&lt;BR /&gt;vendor_id : GenuineIntel&lt;BR /&gt;cpu family : 6&lt;BR /&gt;model : 44&lt;BR /&gt;model name : Intel Xeon CPU X5660 @ 2.80GHz&lt;BR /&gt;stepping : 2&lt;BR /&gt;cpu MHz : 2793.182&lt;BR /&gt;cache size : 12288 KB&lt;BR /&gt;physical id : 0&lt;BR /&gt;siblings : 6&lt;BR /&gt;core id : 0&lt;BR /&gt;cpu cores : 6&lt;BR /&gt;apicid : 0&lt;BR /&gt;initial apicid : 0&lt;BR /&gt;fpu : yes&lt;BR /&gt;fpu_exception : yes&lt;BR /&gt;cpuid level : 11&lt;BR /&gt;wp : yes&lt;BR /&gt;flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 cx16 xtpr pdcm dca sse4_1 sse4_2 popcnt aes lahf_lm ida arat epb dts tpr_shadow vnmi flexpriority ept vpid&lt;BR /&gt;bogomips : 5586.36&lt;BR /&gt;clflush size : 64&lt;BR /&gt;cache_alignment : 64&lt;BR /&gt;address sizes : 40 bits physical, 48 bits virtual&lt;BR /&gt;power management:</description>
    <pubDate>Thu, 04 Aug 2011 15:53:42 GMT</pubDate>
    <dc:creator>johnscreek</dc:creator>
    <dc:date>2011-08-04T15:53:42Z</dc:date>
    <item>
      <title>H.264 performence issues in IPP 7.0.2</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785603#M1963</link>
      <description>&lt;P&gt;Hello,&lt;BR /&gt;&lt;BR /&gt;after changing from IPP version 6.1 to 7.0.2 we encountered a performance issue with our video decoder implementation.&lt;/P&gt;&lt;P&gt;We implemented the H.264 decoder functionality as a DLL using static linking. Therefore we changed the project settings to use the new renamed libraries without multithreading.&lt;/P&gt;&lt;P&gt;The decoding performance of this new DLL is about 4 times slower than the performance of the old DLL which uses IPP 6.1. This performance issue however only occurs when decoding h.264 video data. Our implementation of the MPEG-4 decoder shows the same performance with both IPP versions. Ippinit is called on startup.&lt;BR /&gt;&lt;BR /&gt;Do you have any idea what the cause of this problem might be?&lt;BR /&gt;&lt;BR /&gt;Philip&lt;/P&gt;</description>
      <pubDate>Thu, 10 Mar 2011 14:35:05 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785603#M1963</guid>
      <dc:creator>PhilipH</dc:creator>
      <dc:date>2011-03-10T14:35:05Z</dc:date>
    </item>
    <item>
      <title>H.264 performence issues in IPP 7.0.2</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785604#M1964</link>
      <description>How many threads are used in decoding? Are all the CPUs busy during decompression? Can you print out the results of ippiGetLibVersion()?&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 10 Mar 2011 16:39:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785604#M1964</guid>
      <dc:creator>shyaki</dc:creator>
      <dc:date>2011-03-10T16:39:32Z</dc:date>
    </item>
    <item>
      <title>H.264 performence issues in IPP 7.0.2</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785605#M1965</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;we compiled the ipps in both cases without openmp (using static linking with serial parameter).&lt;/P&gt;&lt;P&gt;We tried also the version 7.0.1. There is the same performance issue.&lt;/P&gt;&lt;P&gt;Here is the result of ippiGetLibVersion:&lt;/P&gt;&lt;P&gt;Intel Integrated Performance Primitives &lt;/P&gt;&lt;P&gt;version: 7.0 build 205.40, [7.0.1037.205] &lt;/P&gt;&lt;P&gt;name: ippiw7_l.lib &lt;/P&gt;&lt;P&gt;date: Jan 5 2011 &lt;/P&gt;&lt;P&gt;we are feeding the decoder directly with raw h.264 data without rendering the frames. The cpu usage of a core i7 PC while running our testapplication is about 3 percent with the old DLL and about 15% with the new one.&lt;/P&gt;&lt;P&gt;Philip&lt;/P&gt;</description>
      <pubDate>Thu, 10 Mar 2011 17:40:09 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785605#M1965</guid>
      <dc:creator>PhilipH</dc:creator>
      <dc:date>2011-03-10T17:40:09Z</dc:date>
    </item>
    <item>
      <title>H.264 performence issues in IPP 7.0.2</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785606#M1966</link>
      <description>Hi again,&lt;BR /&gt;&lt;BR /&gt;we tested the h.264 decoder again by using the simpleplayer application compiled with version 6.1.1 and 7.0.2. we compared the cpu usage of both players using the attached h.264 file. The result was the same performance difference as in our application.&lt;BR /&gt;&lt;BR /&gt;Can you reproduce this?&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 10 Mar 2011 17:53:26 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785606#M1966</guid>
      <dc:creator>PhilipH</dc:creator>
      <dc:date>2011-03-10T17:53:26Z</dc:date>
    </item>
    <item>
      <title>H.264 performence issues in IPP 7.0.2</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785607#M1967</link>
      <description>&lt;DIV&gt;Not, I cannot to reproduce, but I checked on different CPU type [&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;Core2 Duo]&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;the Resulst which I got:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV style="font-family: Verdana, Arial, Helvetica, sans-serif;" id="_mcePaste"&gt;simple_player.exe rec.264&lt;/DIV&gt;&lt;DIV style="font-family: Verdana, Arial, Helvetica, sans-serif;" id="_mcePaste"&gt;&lt;/DIV&gt;&lt;DIV style="font-family: Verdana, Arial, Helvetica, sans-serif;" id="_mcePaste"&gt;Video Render :     NULL&lt;/DIV&gt;&lt;DIV style="font-family: Verdana, Arial, Helvetica, sans-serif;" id="_mcePaste"&gt;-RenderFormat:     YV12&lt;/DIV&gt;&lt;DIV style="font-family: Verdana, Arial, Helvetica, sans-serif;" id="_mcePaste"&gt;&lt;/DIV&gt;&lt;DIV style="font-family: Verdana, Arial, Helvetica, sans-serif;" id="_mcePaste"&gt;Stream Type :     H264PV&lt;/DIV&gt;&lt;DIV style="font-family: Verdana, Arial, Helvetica, sans-serif;" id="_mcePaste"&gt;Video Info  :&lt;/DIV&gt;&lt;DIV style="font-family: Verdana, Arial, Helvetica, sans-serif;" id="_mcePaste"&gt;-Video Type :     H264&lt;/DIV&gt;&lt;DIV style="font-family: Verdana, Arial, Helvetica, sans-serif;" id="_mcePaste"&gt;-Resolution :     1280x960&lt;/DIV&gt;&lt;DIV style="font-family: Verdana, Arial, Helvetica, sans-serif;" id="_mcePaste"&gt;-Frame Rate :     15.00&lt;/DIV&gt;&lt;DIV style="font-family: Verdana, Arial, Helvetica, sans-serif;" id="_mcePaste"&gt;=== ipp 6.1 Update5:&lt;/DIV&gt;&lt;DIV style="font-family: Verdana, Arial, Helvetica, sans-serif;" id="_mcePaste"&gt;DecRate:&lt;B&gt;71.19 fps&lt;/B&gt;(Dec 11.05ms/f + Conv 3.00ms/f = 14.05ms/f) RndrRate 15.01fps Audio Dec 0.00chnls&lt;/DIV&gt;&lt;DIV style="font-family: Verdana, Arial, Helvetica, sans-serif;" id="_mcePaste"&gt;DecRate:&lt;B&gt;79.02 fps(&lt;/B&gt;Dec 10.97ms/f + Conv 1.68ms/f = 12.66ms/f) RndrRate 14.98fps Audio Dec 0.00chnls&lt;/DIV&gt;&lt;DIV id="_mcePaste"&gt;DecRate:&lt;B&gt;79.10 fps&lt;/B&gt;(Dec 11.04ms/f + Conv 1.60ms/f = 12.64ms/f) RndrRate 15.00fps Audio Dec 0.00chnls&lt;/DIV&gt;&lt;DIV id="_mcePaste"&gt;DecRate:&lt;B&gt;80.50 fps&lt;/B&gt;(Dec 10.88ms/f + Conv 1.55ms/f = 12.42ms/f) RndrRate 15.00fps Audio Dec 0.00chnls&lt;/DIV&gt;&lt;DIV style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;and the same with IPP 7.0.2 ( bundled with Comper XE-2011)&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;DecRate:&lt;B&gt;113.93 fps&lt;/B&gt;(Dec 5.18ms/f + Conv 3.60ms/f = 8.78ms/f) RndrRate 15.07fps Audio Dec 0.00chnls&lt;/DIV&gt;&lt;DIV&gt;DecRate:&lt;B&gt;358.07 fps&lt;/B&gt;(Dec 1.10ms/f + Conv 1.70ms/f = 2.79ms/f) RndrRate 15.00fps Audio Dec 0.00chnls&lt;/DIV&gt;&lt;DIV&gt;DecRate:&lt;B&gt;411.53 fps&lt;/B&gt;(Dec 0.83ms/f + Conv 1.60ms/f = 2.43ms/f) RndrRate 15.00fps Audio Dec 0.&lt;/DIV&gt;&lt;DIV&gt;DecRate:&lt;B&gt;418.73 fps(&lt;/B&gt;Dec 0.77ms/f + Conv 1.62ms/f = 2.39ms/f) RndrRate 15.00fps Audio Dec 0.00chnls&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;see the decoding rate for 7.0.2 much higher vs 6.1.5&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;it make sense to check the behaivior on diffferent CPU type like you use.&lt;/DIV&gt;</description>
      <pubDate>Fri, 11 Mar 2011 16:43:47 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785607#M1967</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2011-03-11T16:43:47Z</dc:date>
    </item>
    <item>
      <title>H.264 performence issues in IPP 7.0.2</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785608#M1968</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;these are the results of our test on a core i5 PC. It looks similar to your results, but our CPU usage asdisplayedin the Taskmanager isin test #2 (IPP 7.0.2)much higher than in test #1 (IPP 6.1.1). &lt;BR /&gt;&lt;BR /&gt;&lt;STRONG&gt;simple_player_611_cl8.exe&lt;/STRONG&gt; rec.264&lt;/P&gt;&lt;P&gt;Video Render : NULL&lt;/P&gt;&lt;P&gt;-RenderFormat: YV12&lt;/P&gt;&lt;P&gt;Stream Type : H264PV&lt;/P&gt;&lt;P&gt;Video Info :&lt;/P&gt;&lt;P&gt;-Video Type : H264&lt;/P&gt;&lt;P&gt;-Resolution : 1280x960&lt;/P&gt;&lt;P&gt;-Frame Rate : 15.00&lt;/P&gt;&lt;P&gt;DecRate:135.06 fps(Dec 6.80ms/f + Conv 0.60ms/f = 7.40ms/f) RndrRate 14.95fps Audio Dec 0.00chnls DecRate:182.83 fps(Dec 4.97ms/f + Conv 0.50ms/f = 5.47ms/f) RndrRate 15.00fps Audio Dec 0.00chnls &lt;BR /&gt;DecRate:186.76 fps(Dec 4.86ms/f + Conv 0.49ms/f = 5.35ms/f) RndrRate 15.00fps Audio Dec 0.00chnls &lt;BR /&gt;DecRate:189.24 fps(Dec 4.80ms/f + Conv 0.49ms/f = 5.28ms/f) RndrRate 15.00fps Audio Dec 0.00chnls &lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;simple_player_702_cl8.exe rec.264&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Video Render : NULL&lt;/P&gt;&lt;P&gt;-RenderFormat: YV12&lt;/P&gt;&lt;P&gt;Stream Type : H264PV&lt;/P&gt;&lt;P&gt;Video Info :&lt;/P&gt;&lt;P&gt;-Video Type : H264&lt;/P&gt;&lt;P&gt;-Resolution : 1280x960&lt;/P&gt;&lt;P&gt;-Frame Rate : 15.00&lt;/P&gt;&lt;P&gt;DecRate:406.46 fps(Dec 1.81ms/f + Conv 0.65ms/f = 2.46ms/f) RndrRate 14.98fps Audio Dec 0.00chnls DecRate:1118.73 fps(Dec 0.47ms/f + Conv 0.43ms/f = 0.89ms/f) RndrRate 15.00fps Audio Dec 0.00chnls &lt;BR /&gt;DecRate:1262.16 fps(Dec 0.38ms/f + Conv 0.41ms/f = 0.79ms/f) RndrRate 15.00fps Audio Dec 0.00chnls &lt;BR /&gt;DecRate:1326.14 fps(Dec 0.35ms/f + Conv 0.41ms/f = 0.75ms/f) RndrRate 15.00fps Audio Dec 0.00chnls &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 14 Mar 2011 09:29:10 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785608#M1968</guid>
      <dc:creator>PhilipH</dc:creator>
      <dc:date>2011-03-14T09:29:10Z</dc:date>
    </item>
    <item>
      <title>H.264 performence issues in IPP 7.0.2</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785609#M1969</link>
      <description>&lt;P&gt;Hello,&lt;BR /&gt;&lt;BR /&gt;after further investigation we found out that the issue of the high cpu usage is caused by the "numThreads" parameter of the H.264 video decoder. Using the simple_player or our custom implementation with the "numThreads" parameter set to 1 instead of 0 the cpu usage is much lower.&lt;/P&gt;&lt;P&gt;(The IPP 6.1 and IPP 7.0.2 solutions are both compiled without openmp and using static linkage as mentioned before.)&lt;/P&gt;&lt;P&gt;Summary:&lt;/P&gt;&lt;P&gt;simple_player with IPP 6.1: &lt;BR /&gt;- Cpu usage with numThreads set to 1 is about the same as with numThreads set to 0.&lt;/P&gt;&lt;P&gt;- The decoding rates with numThreads set to 0 are higher&lt;/P&gt;&lt;P&gt;simple_player with IPP 7.0.2: &lt;BR /&gt;- Cpu usage with numThreads set to 1 ismuch lower than with numThreads set to 0. (with numThreads = 1 it is the same usage as IPP 6.1)&lt;/P&gt;&lt;P&gt;-The decoding rates with numThreads set to 0 are higher&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 21 Mar 2011 12:00:41 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785609#M1969</guid>
      <dc:creator>PhilipH</dc:creator>
      <dc:date>2011-03-21T12:00:41Z</dc:date>
    </item>
    <item>
      <title>H.264 performence issues in IPP 7.0.2</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785610#M1970</link>
      <description>Hello Philip,&lt;BR /&gt;&lt;BR /&gt;I am not sure I fully understand your conclusions:&lt;BR /&gt;&lt;BR /&gt;Is it merely a question of whether or not threading is in use? Or do you see additional performance differences between 6.1 and 7.0 beyond that? &lt;BR /&gt;&lt;BR /&gt;So let me write my understanding of your conclusions:&lt;BR /&gt;&lt;BR /&gt;&lt;UL&gt;&lt;LI&gt;6.1 using numThreads&amp;gt;1 is equal in perf. to 7.0 using the same number of threads &amp;gt; 1.&lt;/LI&gt;&lt;LI&gt;6.1 using numThreads=0 means automatic choice of number of threads equal to number of HW threads. This is equal in performance to 7.0 using numThreads=0 with the same automatic choice.&lt;/LI&gt;&lt;LI&gt;6.1 using numThreads=1 is equal in perf. to 7.0 using numThreads=0 (and thus equal to 6.1 with numThreads=0). This means that there is a bug in 6.1 that incorrectly translates numThreads=1 to use an automatic choice of number of threads (i.e. equal to specifying numThreads=0) instead of using a single thread. This should be easy to verify in the TaskManager and looking in the code (frankly, I have not).&lt;/LI&gt;&lt;/UL&gt;Is this the correct understanding?&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;- Jay</description>
      <pubDate>Thu, 31 Mar 2011 12:05:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785610#M1970</guid>
      <dc:creator>IDZ_A_Intel</dc:creator>
      <dc:date>2011-03-31T12:05:00Z</dc:date>
    </item>
    <item>
      <title>H.264 performence issues in IPP 7.0.2</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785611#M1971</link>
      <description>We found the same problem when running H264 decoder with simple player from IPP7.0 sample code.&lt;BR /&gt;&lt;BR /&gt;In our westmere system with Linux, with threadNum &amp;gt;1, the total CPU usage is much higher compared to single thread H.264 decoding. We also found that with threadNum &amp;gt;1, systime is very high, while for single thread H.264 decoding, system time is almost zero.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;IPP7.0 with 1 thread &lt;BR /&gt; PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND&lt;BR /&gt;55225 root 20 0 147m 60m 3376 S 54 0.5 0:30.57 simple_player7V&lt;BR /&gt;&lt;BR /&gt;IPP7.0 with 3 threads &lt;BR /&gt; PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND&lt;BR /&gt;55362 root 20 0 242m 81m 4736 R 69 0.7 0:52.33 simple_player&lt;BR /&gt;55363 root 20 0 242m 81m 4736 R 69 0.7 0:52.62 simple_player&lt;BR /&gt;55364 root 20 0 242m 81m 4736 S 3 0.7 0:02.25 simple_player&lt;BR /&gt;&lt;BR /&gt;There are 6 processors: Here is the info for process 0.&lt;BR /&gt;cat /proc/cpuinfo&lt;BR /&gt;processor : 0&lt;BR /&gt;vendor_id : GenuineIntel&lt;BR /&gt;cpu family : 6&lt;BR /&gt;model : 44&lt;BR /&gt;model name : Intel Xeon CPU X5660 @ 2.80GHz&lt;BR /&gt;stepping : 2&lt;BR /&gt;cpu MHz : 2793.182&lt;BR /&gt;cache size : 12288 KB&lt;BR /&gt;physical id : 0&lt;BR /&gt;siblings : 6&lt;BR /&gt;core id : 0&lt;BR /&gt;cpu cores : 6&lt;BR /&gt;apicid : 0&lt;BR /&gt;initial apicid : 0&lt;BR /&gt;fpu : yes&lt;BR /&gt;fpu_exception : yes&lt;BR /&gt;cpuid level : 11&lt;BR /&gt;wp : yes&lt;BR /&gt;flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 cx16 xtpr pdcm dca sse4_1 sse4_2 popcnt aes lahf_lm ida arat epb dts tpr_shadow vnmi flexpriority ept vpid&lt;BR /&gt;bogomips : 5586.36&lt;BR /&gt;clflush size : 64&lt;BR /&gt;cache_alignment : 64&lt;BR /&gt;address sizes : 40 bits physical, 48 bits virtual&lt;BR /&gt;power management:</description>
      <pubDate>Thu, 04 Aug 2011 15:53:42 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785611#M1971</guid>
      <dc:creator>johnscreek</dc:creator>
      <dc:date>2011-08-04T15:53:42Z</dc:date>
    </item>
    <item>
      <title>H.264 performence issues in IPP 7.0.2</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785612#M1972</link>
      <description>&lt;BR /&gt;A good way to demonstrate this problem is by looking at the output of the unix time command (the timing numbers coming from simple_player do not appear to be correct). For IPP6 the results look as expected for either 1 or 3 threads. For IPP7 results only make sense for 1 thread.&lt;BR /&gt;&lt;BR /&gt;This is on a Westmere based platform with 64bit OpenSuse Linux distribution. Both versions of simple_player are build with gcc 4.5.1.&lt;BR /&gt;&lt;BR /&gt;The command used in all cases below is as follows:&lt;BR /&gt;&lt;BR /&gt;&lt;B&gt;time simple_player -s -tN -fyuv_420 -vnul -anul /tmp/file.mp4&lt;/B&gt;&lt;BR /&gt;&lt;BR /&gt;where N is either 1 or 3.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;OL&gt;&lt;LI&gt;Decoding with N=1 &lt;/LI&gt;&lt;/OL&gt;&lt;BLOCKQUOTE&gt;&lt;UL&gt;&lt;LI&gt;IPP6 results are as follows&lt;/LI&gt;&lt;/UL&gt;&lt;/BLOCKQUOTE&gt;DecRate:52.11 fps(Dec 18.38ms/f + Conv 0.81ms/f = 19.19ms/f) RndrRate 29.96fps Audio Dec 0.00chnls &lt;BR /&gt;DecRate:59.19 fps(Dec 16.31ms/f + Conv 0.59ms/f = 16.89ms/f) RndrRate 29.98fps Audio Dec 0.00chnls &lt;BR /&gt;DecRate:57.95 fps(Dec 16.68ms/f + Conv 0.57ms/f = 17.26ms/f) RndrRate 29.97fps Audio Dec 0.00chnls &lt;BR /&gt;&lt;BR /&gt;real 0m9.958s&lt;BR /&gt;user 0m5.004s&lt;BR /&gt;sys 0m0.071s&lt;BR /&gt;&lt;BR /&gt;&lt;BLOCKQUOTE&gt;&lt;UL&gt;&lt;LI&gt;IPP7 results are as follows&lt;/LI&gt;&lt;/UL&gt;&lt;/BLOCKQUOTE&gt;DecRate:54.49 fps(Dec 17.45ms/f + Conv 0.90ms/f = 18.35ms/f) RndrRate 30.09fps Audio Dec 0.00chnls &lt;BR /&gt;DecRate:67.65 fps(Dec 14.09ms/f + Conv 0.69ms/f = 14.78ms/f) RndrRate 29.98fps Audio Dec 0.00chnls &lt;BR /&gt;DecRate:66.61 fps(Dec 14.34ms/f + Conv 0.67ms/f = 15.01ms/f) RndrRate 29.97fps Audio Dec 0.00chnls &lt;BR /&gt;&lt;BR /&gt;real 0m9.958s&lt;BR /&gt;user 0m4.310s&lt;BR /&gt;sys 0m0.058s&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;2. Decoding with N=3&lt;BR /&gt;&lt;BR /&gt;&lt;BLOCKQUOTE&gt;&lt;UL&gt;&lt;LI&gt;IPP6 results are as follows&lt;/LI&gt;&lt;/UL&gt;&lt;/BLOCKQUOTE&gt;DecRate:178.02 fps(Dec 4.69ms/f + Conv 0.93ms/f = 5.62ms/f) RndrRate 30.01fps Audio Dec 0.00chnls &lt;BR /&gt;DecRate:253.50 fps(Dec 3.24ms/f + Conv 0.71ms/f = 3.94ms/f) RndrRate 29.97fps Audio Dec 0.00chnls &lt;BR /&gt;DecRate:256.97 fps(Dec 3.20ms/f + Conv 0.69ms/f = 3.89ms/f) RndrRate 29.97fps Audio Dec 0.00chnls &lt;BR /&gt;&lt;BR /&gt;real 0m9.959s&lt;BR /&gt;user 0m5.435s&lt;BR /&gt;sys 0m0.083s&lt;BR /&gt;&lt;BR /&gt;&lt;BLOCKQUOTE&gt;&lt;UL&gt;&lt;LI&gt;IPP7 results are as follows&lt;/LI&gt;&lt;/UL&gt;&lt;/BLOCKQUOTE&gt;DecRate:224.16 fps(Dec 3.48ms/f + Conv 0.98ms/f = 4.46ms/f) RndrRate 30.07fps Audio Dec 0.00chnls &lt;BR /&gt;DecRate:680.72 fps(Dec 0.69ms/f + Conv 0.77ms/f = 1.47ms/f) RndrRate 29.98fps Audio Dec 0.00chnls &lt;BR /&gt;DecRate:821.99 fps(Dec 0.46ms/f + Conv 0.76ms/f = 1.22ms/f) RndrRate 29.97fps Audio Dec 0.00chnls &lt;BR /&gt;&lt;BR /&gt;real 0m9.953s&lt;BR /&gt;user 0m6.239s&lt;BR /&gt;sys 0m6.640s&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;In all cases the real time (i.e. "wall clock time") is the same because the rendering rate of 30 fps gates decoding process.&lt;BR /&gt;&lt;BR /&gt;For IPP6 total CPU usage only increases slightly for 3 threads (5.004+.071 vs 5.435+.083) . . . consistent with small overhread for task distribution to the 3 cores.&lt;BR /&gt;&lt;BR /&gt;However, IPP7 does not scale properly. The total CPU usage for IPP7 with 3 threads is ~3X the CPU usage with one thread (i.e. (6.329+6.640 for 3 thread) vs (4.310+0.058 for 1 thread)).&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 04 Aug 2011 16:46:33 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785612#M1972</guid>
      <dc:creator>Joe_Monaco</dc:creator>
      <dc:date>2011-08-04T16:46:33Z</dc:date>
    </item>
    <item>
      <title>H.264 performence issues in IPP 7.0.2</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785613#M1973</link>
      <description>Dear All, &lt;BR /&gt;&lt;BR /&gt;I heard from IPP developer team. The problem should be changed in IPP 7.1 beta. Please check the IPP 7.1 beta &lt;A href="http://software.intel.com/en-us/forums/showthread.php?t=106105&amp;amp;o=a&amp;amp;s=lr"&gt;http://software.intel.com/en-us/forums/showthread.php?t=106105&amp;amp;o=a&amp;amp;s=lr&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;and let us know if any problem. &lt;BR /&gt;&lt;BR /&gt;Best Regards,&lt;BR /&gt;Ying</description>
      <pubDate>Tue, 24 Jul 2012 01:47:44 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/H-264-performence-issues-in-IPP-7-0-2/m-p/785613#M1973</guid>
      <dc:creator>Ying_H_Intel</dc:creator>
      <dc:date>2012-07-24T01:47:44Z</dc:date>
    </item>
  </channel>
</rss>

