<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic   in Software Tuning, Performance Optimization &amp; Platform Monitoring</title>
    <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968118#M2743</link>
    <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;I would suggest you to do profiling with the VTune and post the result.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 02 Apr 2014 17:17:40 GMT</pubDate>
    <dc:creator>Bernard</dc:creator>
    <dc:date>2014-04-02T17:17:40Z</dc:date>
    <item>
      <title>Varying CPU usage despite the same test pattern</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968107#M2732</link>
      <description>&lt;P&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;Hello all,&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;We've got very strange behavior when testing IP packet forwarding performance &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;on Sandy Bridge platform (Supermicro X9DRH with the latest BIOS) on Linux Kernel. This is two &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;socket E5-2690 CPU system. Using different PC we're generating DDoS-like traffic &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;with rate of about 4.5 million packets per second. Traffic is receiving by two &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;Intel 82599 NICs and forwarding using the second port of one of this NICs. All &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;load is evenly distributed among two nodes, so each of 32 CPUs SI usage is &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;virtually equal.&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;Now the strangest part. Few moments after pktgen start on traffic generator PC, &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;average CPU usage on SB system goes to 30-35%. No packet drops, &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;no rx_missed_errors, no rx_no_dma_resources. Very nice. But CPU usage starts to &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;decreasing gradually. After about 10 seconds we see ~15% average among all &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;CPUs. Still no packet drops, the same RX rate as in the beginning, RX packet &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;count is equal to TX packet count. After some time we see that average usage &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;start to go up. Peaked at initial 30-35% it goes down to 15% again. This pattern &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;is repeated every 80 seconds. Interval is very stable. It is undoubtedly bind &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;to the test start time, because if we start test, then interrupt it after 10 &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;seconds and start it again we see the same 30% CPU peak in a few moments. Then &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;all timings will be the same.&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;During the high load time we see this in "perf top -e cache-misses":&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 14017.00 24.9% __netdev_alloc_skb &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; [kernel.kallsyms]&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;5172.00 &amp;nbsp;9.2% _raw_spin_lock &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; [kernel.kallsyms]&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;4722.00 &amp;nbsp;8.4% build_skb &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;[kernel.kallsyms]&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;3603.00 &amp;nbsp;6.4% fib_table_lookup &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; [kernel.kallsyms]&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;During the "15% load time" top is different:&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 11090.00 20.9% build_skb &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;[kernel.kallsyms]&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;4879.00 &amp;nbsp;9.2% fib_table_lookup &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; [kernel.kallsyms]&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;4756.00 &amp;nbsp;9.0% ipt_do_table &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; /lib/modules/3.12.15-BUIL&lt;/SPAN&gt;&lt;WBR style="display: inline-block; color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;D-g2e94e30-dirty/kernel/n&lt;/SPAN&gt;&lt;WBR style="display: inline-block; color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;et/ipv4/netfilter/ip_tabl&lt;/SPAN&gt;&lt;WBR style="display: inline-block; color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;es.ko&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;3042.00 &amp;nbsp;5.7% nf_iterate &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; [kernel.kallsyms]&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;And __netdev_alloc_skb is at the end of list:&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 911.00 &amp;nbsp;0.5% __netdev_alloc_skb &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; [kernel.kallsyms]&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;Some info from "perf stat -a sleep 2":&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;15% CPU case:&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="wmi-callto" style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;28640006291&lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt; cycles &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;# &amp;nbsp; &amp;nbsp;0.447 GHz &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; [83.23%]&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="wmi-callto" style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;38764605205&lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt; instructions &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;# &amp;nbsp; &amp;nbsp;1.35 &amp;nbsp;insns per cycle&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;30% CPU case:&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="wmi-callto" style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;56225552442&lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt; cycles &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;# &amp;nbsp; &amp;nbsp;0.877 GHz &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; [83.23%]&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="wmi-callto" style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;39718182298&lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt; instructions &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;# &amp;nbsp; &amp;nbsp;0.71 &amp;nbsp;insns per cycle&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;Cycles go up, but instructions remain the same.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;CPUs never go above C1 state, all cores speed from /proc/cpuinfo is constant at &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;2899.942 MHz. ASPM is disabled.&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;All non-essential userspace apps was explicitly killed for test time, there &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;was no active cron jobs too. So we should assume no interference with &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;userspace.&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;Kernel version is 3.12.15 (ixgbe 3.21.2), but we have the same behavior with &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;ancient 2.6.35 (ixgbe 3.10.16). Although on 2.6.35 we sometimes get 160-170 &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;seconds interval and different symbols at the "perf top" output (especially cheap &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;local_bh_enable() which is completely blows my mind).&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;So now I think that the problem has nothing with software, but with some part or hardware. Does anybody have some thoughts about the reasons of this kind of behavior? &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;Sandy Bridge CPU has many uncore and offcore events, which I can sample, maybe some of them &lt;/SPAN&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;can shed some light on such behavior?&lt;/SPAN&gt;&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;BR style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;" /&gt;
	&lt;SPAN style="color: rgb(0, 0, 0); font-family: Arial, sans-serif; font-size: 15px; line-height: 21.50400161743164px; white-space: pre-wrap;"&gt;Thank you!&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 01 Apr 2014 18:22:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968107#M2732</guid>
      <dc:creator>Oleg_A_</dc:creator>
      <dc:date>2014-04-01T18:22:21Z</dc:date>
    </item>
    <item>
      <title> </title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968108#M2733</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;As I was able to understand your measurement has sin wave like pattern despite the same load being generated?One question arises here does every packet has the same TCP payload content?&lt;/P&gt;</description>
      <pubDate>Wed, 02 Apr 2014 07:21:22 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968108#M2733</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2014-04-02T07:21:22Z</dc:date>
    </item>
    <item>
      <title>Each packet is Linux pktgen</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968109#M2734</link>
      <description>&lt;P&gt;Each packet is Linux pktgen generated UDP packet with length of 64 byte. All packets send from random source IP address to random destination. This traffic is just go through Sandy Bridge box without any content analysis.&lt;/P&gt;

&lt;P&gt;&lt;A href="http://www.wireshark.org/docs/dfref/p/pktgen.html" target="_blank"&gt;http://www.wireshark.org/docs/dfref/p/pktgen.html&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Here is the graph for CPU load. Just don't understand what is the cause of this spikes.&lt;/P&gt;</description>
      <pubDate>Wed, 02 Apr 2014 08:39:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968109#M2734</guid>
      <dc:creator>Oleg_A_</dc:creator>
      <dc:date>2014-04-02T08:39:00Z</dc:date>
    </item>
    <item>
      <title>Second try.</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968110#M2735</link>
      <description>&lt;P&gt;Second try.&lt;/P&gt;</description>
      <pubDate>Wed, 02 Apr 2014 08:40:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968110#M2735</guid>
      <dc:creator>Oleg_A_</dc:creator>
      <dc:date>2014-04-02T08:40:21Z</dc:date>
    </item>
    <item>
      <title>Maybe you are seeing</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968111#M2736</link>
      <description>&lt;P&gt;Maybe you are seeing accumulated&amp;nbsp;interrupt processing time(I mean Interrupt Service Routine) which contribute to those spikes and also some part of the CPU time is spend trying to allocate memory buffers by using *__alloc_skb() function.I can also see that this function tries to allocate memory on some NUMA node.By further theorizing those spikes are related to buffer allocations when incoming packets are buffered and sent to the CPU.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 02 Apr 2014 09:06:23 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968111#M2736</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2014-04-02T09:06:23Z</dc:date>
    </item>
    <item>
      <title>iliyapolak, thank you for</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968112#M2737</link>
      <description>iliyapolak, thank you for reply! Yes, all CPU time during this test is from kernel softirq processing (bottom half of interrupt handler). What I trying to understand is why I got such spikes in constant traffic flow conditions. Interrupt rate according to vmstat is constant during the test time. Packet jitter, delays and packet rate is also not changing. This spikes is relatively long - about 20 seconds if I count from its beginning to the end and very constant in its timing (you can see this from graph in attachment).</description>
      <pubDate>Wed, 02 Apr 2014 12:02:17 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968112#M2737</guid>
      <dc:creator>Oleg_A_</dc:creator>
      <dc:date>2014-04-02T12:02:17Z</dc:date>
    </item>
    <item>
      <title> </title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968113#M2738</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;Can you obtain the source of those interrupts?I can further theorize that you are seeing some kind of interrupt coalescing.Moreover the periodicity of those spikes can indicate aferomentioned coalescing.I suppose that NIC will not interrupt CPU per single packet base.NIC will simply buffer probably in&amp;nbsp;internal on chip memory buffers incoming&amp;nbsp; Ethernet frames do some processing on them(extracting higher level protocols etc...) and when the buffer(s) will be full it will signal the CPU by firing up the interrupt.&lt;/P&gt;</description>
      <pubDate>Wed, 02 Apr 2014 13:56:19 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968113#M2738</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2014-04-02T13:56:19Z</dc:date>
    </item>
    <item>
      <title>&gt;&gt;&gt;This spikes is relatively</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968114#M2739</link>
      <description>&lt;P&gt;&amp;gt;&amp;gt;&amp;gt;This spikes is relatively long - about 20 seconds if I count from its beginning to the end and very constant in its timing (you can see this from graph in attachment).&amp;gt;&amp;gt;&amp;gt;&lt;/P&gt;

&lt;P&gt;Yes I see it.I think that you are seeing superposition? (I do not know if this is a proper word)&amp;nbsp; of many short time interrupt signals.&lt;/P&gt;</description>
      <pubDate>Wed, 02 Apr 2014 13:59:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968114#M2739</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2014-04-02T13:59:21Z</dc:date>
    </item>
    <item>
      <title>iliyapolak, yes, Intel 82599</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968115#M2740</link>
      <description>iliyapolak, yes, Intel 82599-based NICs support interrupt coalescing. This feature is on in my case using this command:

ethtool -C eth0 rx-usecs 488

This means that about 2,000 interrupts will be generated by one interrupt vector (each NIC had 16 vectors). But this value is constant and not changing with time.</description>
      <pubDate>Wed, 02 Apr 2014 15:31:42 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968115#M2740</guid>
      <dc:creator>Oleg_A_</dc:creator>
      <dc:date>2014-04-02T15:31:42Z</dc:date>
    </item>
    <item>
      <title>Number of packets could be</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968116#M2741</link>
      <description>&lt;P&gt;Number of packets could be changing over the time hence the count of interrupts also.Beside the NIC interrupts you could have also I/O interrupts.By looking at the screenshot can you provide the exact breakdown of the CPU load?&lt;/P&gt;</description>
      <pubDate>Wed, 02 Apr 2014 16:54:02 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968116#M2741</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2014-04-02T16:54:02Z</dc:date>
    </item>
    <item>
      <title>iliyapolak, I can post</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968117#M2742</link>
      <description>iliyapolak, I can post screenshort tomorrow. But number of packets doesn't changing during the single test. This is the controlled test with fixed pps rate at generator side.</description>
      <pubDate>Wed, 02 Apr 2014 17:00:37 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968117#M2742</guid>
      <dc:creator>Oleg_A_</dc:creator>
      <dc:date>2014-04-02T17:00:37Z</dc:date>
    </item>
    <item>
      <title> </title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968118#M2743</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;I would suggest you to do profiling with the VTune and post the result.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 02 Apr 2014 17:17:40 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968118#M2743</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2014-04-02T17:17:40Z</dc:date>
    </item>
    <item>
      <title>You told that 82599 will not</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968119#M2744</link>
      <description>&lt;P&gt;You told that 82599 will not send interrupt to the CPU for each packet instead it will send when its bucket will be full with the packets. I can see this may degrade the performance of a NIC. please suggest any reference guide for interrupt &lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;coalescing&lt;/SPAN&gt;.&lt;/P&gt;

&lt;P&gt;Thanks,&lt;/P&gt;

&lt;P&gt;Himanshu&lt;/P&gt;</description>
      <pubDate>Tue, 24 Mar 2015 07:19:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968119#M2744</guid>
      <dc:creator>Himanshu_T_</dc:creator>
      <dc:date>2015-03-24T07:19:34Z</dc:date>
    </item>
    <item>
      <title>Quote:Himanshu T. wrote:</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968120#M2745</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;Himanshu T. wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;You told that 82599 will not send interrupt to the CPU for each packet instead it will send when its bucket will be full with the packets. I can see this may degrade the performance of a NIC. please suggest any reference guide for interrupt coalescing.&lt;/P&gt;

&lt;P&gt;Thanks,&lt;/P&gt;

&lt;P&gt;Himanshu&lt;/P&gt;

&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;Do you mean debugging guide?&lt;/P&gt;</description>
      <pubDate>Tue, 24 Mar 2015 13:59:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968120#M2745</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2015-03-24T13:59:06Z</dc:date>
    </item>
    <item>
      <title>@Himanshu,</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968121#M2746</link>
      <description>&lt;P&gt;@Himanshu,&lt;/P&gt;

&lt;P&gt;Are you using Windows or Linux?&lt;/P&gt;</description>
      <pubDate>Tue, 24 Mar 2015 14:00:05 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Varying-CPU-usage-despite-the-same-test-pattern/m-p/968121#M2746</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2015-03-24T14:00:05Z</dc:date>
    </item>
  </channel>
</rss>

