<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic jpeg 2000 performance in Intel® Integrated Performance Primitives</title>
    <link>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820446#M4678</link>
    <description>Do you have a document with official benchmarks for JPEG and JPEG2000 codecs?&lt;BR /&gt;What maximum throughput could be achieved with IPP-7.0 JPEG/JPEG2000 codecs?&lt;BR /&gt;This is important question and we can't find the answer. Please adivse.&lt;BR /&gt;As we know, latest white paper concerning IPP performance benchmarks dated 2003.</description>
    <pubDate>Thu, 29 Mar 2012 15:39:36 GMT</pubDate>
    <dc:creator>fvipp</dc:creator>
    <dc:date>2012-03-29T15:39:36Z</dc:date>
    <item>
      <title>jpeg 2000 performance</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820444#M4676</link>
      <description>Hello,&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;Our company is starting project related to decomressing jpeg2000. Most images have resolution is 4096x4096 in jp2. Most important requirement is decomression speed. I tested sample images with Intel IPP 7.0 &amp;amp; Kakadu on my Q6600 and got huge difference in speed:&lt;/DIV&gt;&lt;DIV&gt;Intel IPP 7.0.6 uic_transcoder_con - 0.770 sec/image&lt;/DIV&gt;&lt;DIV&gt;Kakadu 6.3 - 0.170 sec/image&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;Also if I specify -n (number of threads), it increases decompress time: -n 2 gives 1.280 sec/image, -n 4 gives 1.580 sec/image&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;I suspect I did something wrong.&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;What is wrong with my tests ? Can I get Intel IPP speed to be on pair with Kakadu ? If yes - it would be preferrable for our company.&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;Thank you, Sergey.&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;ps: Actually I found reason:ippiDecodeCBProgrSetPassCounter_JPEG2K function is not multithreaded and if it would be probably decoding time could be improved down to 0.220 - 0.230 sec/image which is on pair with kakadu. So it would be good to reimplement this function in IPP...&lt;/DIV&gt;</description>
      <pubDate>Mon, 30 Jan 2012 01:48:31 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820444#M4676</guid>
      <dc:creator>ksubox</dc:creator>
      <dc:date>2012-01-30T01:48:31Z</dc:date>
    </item>
    <item>
      <title>jpeg 2000 performance</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820445#M4677</link>
      <description>Hello,&lt;BR /&gt;&lt;BR /&gt;First, please let me apologize for the delay in this response.&lt;BR /&gt;&lt;BR /&gt;If you're still interested we may be able to helpimprove your performance with multiple threads. However, bottom line is that we're not going tobe able to push our free sample to this level of performance with different parameters, compile options, or simple changes to the code.&lt;BR /&gt;&lt;BR /&gt;Thank you very much for your feedback on this issue, including your analysis on ippiDecodeCBProgrSetPassCounter_JPEG2K.This looks like a great place to start for improving decode performance.Ihave fed thisrecommendation to product planningso it can be considered for future releases.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Best Regards,&lt;BR /&gt;&lt;BR /&gt;Jeff&lt;BR /&gt;</description>
      <pubDate>Thu, 15 Mar 2012 21:10:22 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820445#M4677</guid>
      <dc:creator>Jeffrey_M_Intel1</dc:creator>
      <dc:date>2012-03-15T21:10:22Z</dc:date>
    </item>
    <item>
      <title>jpeg 2000 performance</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820446#M4678</link>
      <description>Do you have a document with official benchmarks for JPEG and JPEG2000 codecs?&lt;BR /&gt;What maximum throughput could be achieved with IPP-7.0 JPEG/JPEG2000 codecs?&lt;BR /&gt;This is important question and we can't find the answer. Please adivse.&lt;BR /&gt;As we know, latest white paper concerning IPP performance benchmarks dated 2003.</description>
      <pubDate>Thu, 29 Mar 2012 15:39:36 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820446#M4678</guid>
      <dc:creator>fvipp</dc:creator>
      <dc:date>2012-03-29T15:39:36Z</dc:date>
    </item>
    <item>
      <title>jpeg 2000 performance</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820447#M4679</link>
      <description>We appreciate the importance of this question. Unfortunatelywe don't currently have the kind of performance comparisons you're looking for. What we can show, if you look at our performance details summary&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://software.intel.com/en-us/articles/intel-ipp/#details"&gt;&lt;/A&gt;&lt;A href="http://software.intel.com/en-us/articles/intel-ipp/#details" target="_blank"&gt;http://software.intel.com/en-us/articles/intel-ipp/#details&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;is that JPEG2000 performance improved betweenthe 6.1 and7.0 version of IPP.&lt;BR /&gt;&lt;BR /&gt;Yourrequirements are always important.Since we don't have this data ready for publication, perhaps thebest way toassist withyour decision could be to let us knowwhere your expectations have been set by alternative solutions. Wemay be able toprovide assistance with setting up some quick tests to determine ifUICis close to those requirements. In any casewe can pass thisdataonto project planning to be considered for future releases.&lt;BR /&gt;&lt;BR /&gt;Would this help? &lt;BR /&gt;&lt;BR /&gt;Best regards,&lt;BR /&gt;&lt;BR /&gt;Jeff&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt; &lt;BR /&gt;</description>
      <pubDate>Fri, 30 Mar 2012 07:32:38 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820447#M4679</guid>
      <dc:creator>Jeffrey_M_Intel1</dc:creator>
      <dc:date>2012-03-30T07:32:38Z</dc:date>
    </item>
    <item>
      <title>jpeg 2000 performance</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820448#M4680</link>
      <description>&lt;DL&gt;&lt;DT&gt;&lt;P lang="en-US"&gt;Thanks for your reply. Unfortunately your link doesn't
	help. We don't need to know relative speed-up. We need to know real
	performance at standard conditions. This is example for jpeg: Baseline jpeg,
	standard images from Kodak set (grayscale or color), quality
	settings 50%, static tables for quantization and Huffman, good
	PC with Core i7, your recommended parameters for multithreading. We
	need results for compression ratio, compression time and
	decompression time (without time for image loading from HDD).&lt;/P&gt;&lt;/DT&gt;&lt;/DL&gt;
&lt;P align="JUSTIFY" lang="en-US"&gt;
It should be like this: uic_transcoder_con.exe -o test.jpg
-i lenna.bmp -j b -q 50 -t 1 -n 8 
&lt;/P&gt;&lt;P align="JUSTIFY" lang="en-US"&gt;with details how to get maximum
performance at coding and decoding. And we need a table with achieved
results. Please have a look at these examples:&lt;/P&gt;&lt;P align="JUSTIFY" lang="en-US"&gt;
&lt;SPAN style="color: #000080;"&gt;&lt;SPAN style="text-decoration: underline;"&gt;&lt;A href="http://www.accusoft.com/picphotofeatures.htm#comparison_table"&gt;http://www.accusoft.com/picphotofeatures.htm#comparison_table&lt;/A&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P align="JUSTIFY" lang="en-US"&gt;&lt;A target="_blank" title="Vision Experts" href="http://visionexperts.co.uk/news/?id=25"&gt;http://visionexperts.co.uk/news/?id=25&lt;/A&gt; &lt;/P&gt;</description>
      <pubDate>Fri, 30 Mar 2012 16:11:07 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820448#M4680</guid>
      <dc:creator>fvipp</dc:creator>
      <dc:date>2012-03-30T16:11:07Z</dc:date>
    </item>
    <item>
      <title>jpeg 2000 performance</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820449#M4681</link>
      <description>&lt;P&gt;Hi Sergey,&lt;BR /&gt;&lt;BR /&gt;We don't have this data now. However, your suggestion makes a lot of sense. I've escalated your feedback to our developers and project planners. No guarantees though. I can't promise if or when we will be able to produce this kind of report. A lot of effort and review needs to go into official performance comparisons, and this needs to be prioritized against other tasks.&lt;/P&gt;&lt;P&gt;Until then, my ability to provide this data is limited. In the interest of being helpful, since you mentioned that 4096x4096 decode performance is your most important requirement, here is a snapshot of what I'm seeing on my machine. Please note that these results have not been reviewed, and are not authorititative benchmark results. There are a lot of factors affecting performance, so your results may be different. The intent is simply to give you some indication of the results we're getting here.&lt;/P&gt;&lt;P&gt;Test platform: Intel Core i7-2600K CPU @3.40 GHz, Windows 7, IPP+samples 7.0.6 (Using pre-compiled executables for easy reproduceability. If you have not already, you may want to give these a try.)&lt;/P&gt;&lt;P&gt;library: ippje9-7.0.dll&lt;/P&gt;&lt;P&gt;Input image: 4096x4096 resize of standard lenna, sRGB colorspace, 50% quality, jp2 format.&lt;/P&gt;&lt;P&gt;./uic_transcoder_con.exe -i c:/videos/lenna4096x4096_2.jp2 -o test.bmp -n 1 -t 1 -m 20 : decode time: 548.71 msec&lt;/P&gt;&lt;P&gt;./uic_transcoder_con.exe -i c:/videos/lenna4096x4096_2.jp2 -o test.bmp -n 2 -t 1 -m 20 : decode time: 436.84 msec&lt;/P&gt;&lt;P&gt;./uic_transcoder_con.exe -i c:/videos/lenna4096x4096_2.jp2 -o test.bmp -n 4 -t 1 -m 20 : decode time: 381.77 msec&lt;/P&gt;&lt;P&gt;./uic_transcoder_con.exe -i c:/videos/lenna4096x4096_2.jp2 -o test.bmp -n 8 -t 1 -m 20 : decode time: 381.97 msec&lt;BR /&gt;&lt;BR /&gt;Sorry we can't get everything today, but hopefully this is at least getting closer to the data you need.&lt;BR /&gt;&lt;BR /&gt;Best regards,&lt;BR /&gt;&lt;BR /&gt;Jeff&lt;/P&gt;</description>
      <pubDate>Sun, 01 Apr 2012 23:07:39 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820449#M4681</guid>
      <dc:creator>Jeffrey_M_Intel1</dc:creator>
      <dc:date>2012-04-01T23:07:39Z</dc:date>
    </item>
    <item>
      <title>jpeg 2000 performance</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820450#M4682</link>
      <description>&lt;P&gt;Thanks a
lot for your reply. That's exactly what we are looking for. Actually we are
more interested in Baseline JPEG compression for greyscale images, so let's try
to get more understanding in the matter.&lt;/P&gt;&lt;P&gt;We can get
greyscale test image "big_building.bmp" from here:&lt;/P&gt;

&lt;P&gt;&lt;A href="http://www.imagecompression.info/test_images/"&gt;http://www.imagecompression.info/test_images/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Then we try
to compress it to JPEG on Core i7 920 with the following command lines&lt;/P&gt;

&lt;P&gt;&lt;/P&gt;

&lt;P&gt;1)
uic_transcoder_con.exe -i big_building.bmp -o test.jpg -n 8 -q 50 -t 1 -m 1&lt;/P&gt;&lt;P&gt;Intel Integrated Performance Primitives&lt;/P&gt;

&lt;P&gt; version: 7.0 build
205.85, [7.0.1058.205]&lt;/P&gt;

&lt;P&gt; name: ippjy8-7.0.dll+&lt;/P&gt;

&lt;P&gt; date: Nov 27 2011&lt;/P&gt;

&lt;P&gt;image: big_building.bmp, 7216x5408x1, 8-bits unsigned,
color: Grayscale, sampling: 444&lt;/P&gt;

&lt;P&gt;decode time: 19.25 msec&lt;/P&gt;

&lt;P&gt;encode time: 112.50 msec&lt;/P&gt;

&lt;P&gt;&lt;/P&gt;

&lt;P&gt;2)
uic_transcoder_con.exe -i big_building.bmp -o test.jpg -n 8 -q 50 -t 1 -m 20&lt;/P&gt;

&lt;P&gt;We do
compression bmp-to-jpeg with parameter -m 20. It's almost the same, we just ask
to repeat compression 20 times.&lt;/P&gt;&lt;P&gt;Intel Integrated Performance Primitives&lt;/P&gt;

&lt;P&gt; version: 7.0 build
205.85, [7.0.1058.205]&lt;/P&gt;

&lt;P&gt; name: ippjy8-7.0.dll+&lt;/P&gt;

&lt;P&gt; date: Nov 27 2011&lt;/P&gt;

&lt;P&gt;image: big_building.bmp, 7216x5408x1, 8-bits unsigned,
color: Grayscale, sampling: 444&lt;/P&gt;

&lt;P&gt;decode time: 8.78 msec&lt;/P&gt;

&lt;P&gt;encode time: 50.33 msec&lt;/P&gt;&lt;P&gt;We have the
same image and same settings, with the only difference in the repeat count. What is the main
reason for such a variation in performance (a factor of 2)? Can your
software decode 37 MB image in 8.78 msec? Does your software do decoding at the
same time as encoding?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Now we can
do the same thing for decoding (we decode the image which we got in the
previous test):&lt;/P&gt;&lt;P&gt;3)
uic_transcoder_con.exe -i test.jpg -o test.bmp -n 8 -t 1 -m 1&lt;/P&gt;

&lt;P&gt;Intel
Integrated Performance Primitives&lt;/P&gt;

&lt;P&gt; version: 7.0 build 205.85, [7.0.1058.205]&lt;/P&gt;

&lt;P&gt; name:
ippjy8-7.0.dll+&lt;/P&gt;

&lt;P&gt; date:
Nov 27 2011&lt;/P&gt;

&lt;P&gt;image:
test.jpg, 7216x5408x1, 8-bits unsigned, color: Grayscale, sampling: 444&lt;/P&gt;

&lt;P&gt;decode
time: 66.13 msec&lt;/P&gt;

&lt;P&gt;encode
time: 14.28 msec&lt;/P&gt;

&lt;P&gt;&lt;/P&gt;

&lt;P&gt;Then we try
to see what we get with -m 20:&lt;/P&gt;

&lt;P&gt;4)
uic_transcoder_con.exe -i test.jpg -o test.bmp -n 8 -t 1 -m 20&lt;/P&gt;

&lt;P&gt;Intel
Integrated Performance Primitives&lt;/P&gt;

&lt;P&gt; version: 7.0 build 205.85, [7.0.1058.205]&lt;/P&gt;

&lt;P&gt; name:
ippjy8-7.0.dll+&lt;/P&gt;

&lt;P&gt; date:
Nov 27 2011&lt;/P&gt;

&lt;P&gt;image:
test.jpg, 7216x5408x1, 8-bits unsigned, color: Grayscale, sampling: 444&lt;/P&gt;

&lt;P&gt;decode
time: 37.63 msec&lt;/P&gt;

&lt;P&gt;encode
time: 9.00 msec&lt;/P&gt;



&lt;P&gt;&lt;/P&gt;&lt;P&gt;Decoding
time varies by a factor of 2. The meaning of encode time here is not clear
either.&lt;/P&gt;

&lt;P&gt;What is the
accuracy of time measurements? Could you explain the above results?&lt;/P&gt;</description>
      <pubDate>Mon, 02 Apr 2012 10:26:01 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820450#M4682</guid>
      <dc:creator>fvipp</dc:creator>
      <dc:date>2012-04-02T10:26:01Z</dc:date>
    </item>
    <item>
      <title>jpeg 2000 performance</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820451#M4683</link>
      <description>Hi,&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;-m options defines number of loops (the same operations of encoding and decoding) to do.&lt;/DIV&gt;&lt;DIV&gt;Usually in performance measurements it is helpful to do the same thing several times and to divide overall time by the number of iterations. You'll have better accuracy.&lt;/DIV&gt;&lt;DIV&gt;But, there's a side effect related to the cache "temperature". When you repeat execution of the same algorithm, it can happen that the data you try to read is already in the CPU cache line(s). So, you spend less time waiting for instruction(s) to complete, then if required data must be loaded from virtual memory (RAM) to cache first and only then is consumed by CPU instruction.&lt;/DIV&gt;&lt;DIV&gt;Regarding JPEG decoding (your last example), encode time is just the time to repack JPEG-decoded image into BMP container.&lt;/DIV&gt;&lt;DIV&gt;Then, I see "-n" option, which means multi-threaded execution. It also speeds up the execution.&lt;/DIV&gt;&lt;DIV&gt;Regards,&lt;/DIV&gt;&lt;DIV&gt;Sergey&lt;/DIV&gt;</description>
      <pubDate>Tue, 03 Apr 2012 10:59:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820451#M4683</guid>
      <dc:creator>Sergey_K_Intel</dc:creator>
      <dc:date>2012-04-03T10:59:21Z</dc:date>
    </item>
    <item>
      <title>jpeg 2000 performance</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820452#M4684</link>
      <description>&lt;DIV style="text-align: justify;"&gt;&lt;BLOCKQUOTE&gt;&lt;DIV style="text-align: left;"&gt;-m options defines number of loops (the same operations of encoding
	and decoding) to do. 
	&lt;/DIV&gt;&lt;DIV style="text-align: left;"&gt;Usually in performance measurements it is helpful to do the same
	thing several times and to divide overall time by the number of
	iterations. You'll have better accuracy. 
	&lt;/DIV&gt;&lt;/BLOCKQUOTE&gt;&lt;/DIV&gt;&lt;P lang="en-US"&gt;No, we haven't got better accuracy. Actually
	we've got significant "speed up" (from 112 ms to 50 ms for jpeg
	encoding) just with -m option. We think that it's not better
	accuracy. This is lack of accuracy.&lt;/P&gt;&lt;DL&gt;&lt;DT&gt;&lt;P lang="en-US"&gt;Let's talk about real accuracy rather than
	better accuracy. It's not clear how you measure execution time
	and this is very important. If you show two digits after point 
	do they mean anything? If you do the same encoding several times
	(without -m option), what error in terms of MSE will you get? How we
	can find out average time for encoding or decoding?&lt;/P&gt;&lt;/DT&gt;&lt;BLOCKQUOTE&gt;&lt;DT&gt;But, there's a side effect related to the cache "temperature".
	When you repeat execution of the same algorithm, it can happen that
	the data you try to read is already in the CPU cache line(s). So,
	you spend less time waiting for instruction(s) to complete, then if
	required data must be loaded from virtual memory (RAM) to cache
	first and only then is consumed by CPU instruction.&lt;/DT&gt;&lt;/BLOCKQUOTE&gt;&lt;DT&gt;Thanks, we know about hot cache. As far as
	concerns your approach with -m option, it looks really strange
	due to hot cache. It's far from being real time measurements and one
	could say that this is not fair. We want to estimate encoding
	performance and we see something strange. Please advise. As we
	understand, option -m is not worth using because of hot cache. We
	think that we'd better use big images to increase accuracy of time
	measurements.&lt;/DT&gt;&lt;DT&gt;
	&lt;BR /&gt;
	&lt;/DT&gt;&lt;BLOCKQUOTE&gt;&lt;DT&gt;Regarding JPEG decoding (your last example), encode time is just
	the time to repack JPEG-decoded image into BMP container.&lt;/DT&gt;&lt;/BLOCKQUOTE&gt;&lt;DT&gt;&lt;P lang="en-US"&gt;This is also very strange. We see the phrase encode time,
	though it means time to repack JPEG-decoded image into BMP container.&lt;/P&gt;&lt;/DT&gt;&lt;BLOCKQUOTE&gt;&lt;DT&gt;Then, I see "-n" option, which means multi-threaded
	execution. It also speeds up the execution.&lt;/DT&gt;&lt;/BLOCKQUOTE&gt;&lt;DT&gt;&lt;P lang="en-US"&gt;
	Thanks, we know that. We are trying to find out how Intel recommend
	to do jpeg encoding and decoding in the fastest way. What would you
	recommend?&lt;/P&gt;&lt;/DT&gt;&lt;/DL&gt;</description>
      <pubDate>Tue, 03 Apr 2012 14:02:02 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820452#M4684</guid>
      <dc:creator>fvipp</dc:creator>
      <dc:date>2012-04-03T14:02:02Z</dc:date>
    </item>
    <item>
      <title>jpeg 2000 performance</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820453#M4685</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;DIV&gt;No, we haven't got better accuracy. Actually we've got significant "speed up" (from 112 ms to 50 ms for jpeg encoding) just with -m option. We think that it's not better accuracy. This is lack of accuracy.&lt;/DIV&gt;&lt;/BLOCKQUOTE&gt;OK. Let's consider this option to get more stable results. If you run "-m 1" several times, you will probably see quite a big fluctuation of results. "-m &lt;N&gt;" makes timing more predictable.&lt;BLOCKQUOTE&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;It's not clear how you measure execution time and this is very important. If you show two digits after point  do they mean anything?&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/BLOCKQUOTE&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;This is why we distribute samples in source code form. Two digits might be helpful if overall time is several msecs.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BLOCKQUOTE&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;How we can find out average time for encoding or decoding?&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/BLOCKQUOTE&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;Take several input images, encode them with "-m 1" and find average.&lt;BR /&gt;&lt;/SPAN&gt;&lt;BLOCKQUOTE&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;This is also very strange. We see the phrase encode time, though it means time to repack JPEG-decoded image into BMP container.&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/BLOCKQUOTE&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;In terms of UIC BMP is also a "codec", so it has Encode method, which in reality is repacking of raw image array to BMP format (no any encoding performed). Nevertheless, since time marks are around EncodeImage function, the results go to "Encode time" section. The same situation will be with other codecs-containers like TIFF and PNM. There is also no encoding.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BLOCKQUOTE&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;As we understand, option -m is not worth using because of hot cache. We think that we'd better use big images to increase accuracy of time measurements&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/BLOCKQUOTE&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;Good way too&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BLOCKQUOTE&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;We are trying to find out how Intel recommend to do jpeg encoding and decoding in the fastest way. What would you recommend?&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/BLOCKQUOTE&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;General recomendations are to use appropriate library (with binary code according to used CPU). Multi-threading could also help if the image is not too short and the number of hardware cores is not too big (is this case the overhead on multi-thread support is significant).&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;Regards,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;Sergey&lt;/SPAN&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/N&gt;</description>
      <pubDate>Wed, 04 Apr 2012 15:34:23 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820453#M4685</guid>
      <dc:creator>Sergey_K_Intel</dc:creator>
      <dc:date>2012-04-04T15:34:23Z</dc:date>
    </item>
    <item>
      <title>jpeg 2000 performance</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820454#M4686</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;DIV&gt;No, we haven't got better accuracy. Actually we've got 
significant "speed up" (from 112 ms to 50 ms for jpeg encoding) just 
with -m option. We think that it's not better accuracy. This is lack 
of accuracy.&lt;BR /&gt;&lt;BR /&gt;OK. Let's consider this option to get 
more stable results. If you run "-m 1" several times, you will probably 
see quite a big fluctuation of results. "-m &lt;N&gt;" makes timing more
 predictable.&lt;/N&gt;&lt;/DIV&gt;&lt;/BLOCKQUOTE&gt;Unfortunately we can't explain the difference between
112 ms and 50 ms as you do. So we have to consider "-m &lt;N&gt;" option as unfair trick with cache.&lt;BR /&gt;&lt;BR /&gt;&lt;BLOCKQUOTE&gt;&lt;DIV&gt;It's
 not clear how you measure execution time and this is very important. If
 you show two digits after point  do they mean anything?&lt;/DIV&gt;&lt;/BLOCKQUOTE&gt;&lt;BLOCKQUOTE&gt;&lt;DIV&gt;This is why we distribute samples in source code form. Two digits might be helpful if overall time is several msecs.&lt;/DIV&gt;&lt;/BLOCKQUOTE&gt;&lt;P lang="en-US"&gt;&lt;SPAN style="font-family: Verdana,Arial,Helvetica,sans-serif;"&gt;It
could be a good idea to round output and to show only necessary
digits. If two digits after point are right, then your accuracy in time measurements is equal to 0.01 ms. It's difficult to believe.&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;DIV&gt;This
 is also very strange. We see the phrase encode time, though it means 
time to repack JPEG-decoded image into BMP container.&lt;/DIV&gt;&lt;/BLOCKQUOTE&gt;&lt;BLOCKQUOTE&gt;&lt;DIV&gt;In
 terms of UIC BMP is also a "codec", so it has Encode method, which in 
reality is repacking of raw image array to BMP format (no any encoding 
performed). Nevertheless, since time marks are around EncodeImage 
function, the results go to "Encode time" section. The same situation 
will be with other codecs-containers like TIFF and PNM. There is also no
 encoding.&lt;/DIV&gt;&lt;/BLOCKQUOTE&gt;&lt;P lang="en-US"&gt;&lt;SPAN style="font-family: Verdana,Arial,Helvetica,sans-serif;"&gt;If
you indicate in command line bmp file with option -i (input) and jpeg
file with option -o (output), it means that there will be conversion
from bmp to jpeg. This is sufficient indication that process should
be called encoding. There is no any decoding here. In this case it could be better not to show timing for decoding at all. &lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;DIV&gt;We are trying to find out how Intel recommend to do jpeg encoding and decoding in the fastest way. What would you recommend?&lt;/DIV&gt;&lt;/BLOCKQUOTE&gt;&lt;BLOCKQUOTE&gt;&lt;DIV&gt;General
 recomendations are to use appropriate library (with binary code 
according to used CPU). Multi-threading could also help if the image is 
not too short and the number of hardware cores is not too big (is this 
case the overhead on multi-thread support is significant).&lt;/DIV&gt;&lt;/BLOCKQUOTE&gt;&lt;P style="padding-left: 20pt;" lang="en-US"&gt;What would you recommend for Core i7 920 and Win-7
(32/64)?&lt;BR /&gt;We do the following for jpeg encoding with 50% quality: uic_transcoder_con.exe -i lenna.bmp -o test.jpg -j b -q 50 -t 1 -n 8&lt;/P&gt;&lt;/N&gt;</description>
      <pubDate>Wed, 04 Apr 2012 17:23:14 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/jpeg-2000-performance/m-p/820454#M4686</guid>
      <dc:creator>fvipp</dc:creator>
      <dc:date>2012-04-04T17:23:14Z</dc:date>
    </item>
  </channel>
</rss>

