<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Questions regarding measuring MMIO event using PCM in Software Tuning, Performance Optimization &amp; Platform Monitoring</title>
    <link>https://community.intel.com/t5/Software-Tuning-Performance/Questions-regarding-measuring-MMIO-event-using-PCM/m-p/1654232#M8494</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hello, I measured the data when using RDMA NIC to send data. I used RDMA NIC to send 100 Million 8B data in total. Normally, MMIO should be used, so I think the MMIO-related counter values that ​​I measured are abnormal (i.e. 0).&lt;BR /&gt;Or, is there any way to know whether MMIO is used?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Thank you in advance for your great help!&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Sincerely,&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Qiangsheng Su&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Sat, 04 Jan 2025 11:55:47 GMT</pubDate>
    <dc:creator>qiangsheng_su</dc:creator>
    <dc:date>2025-01-04T11:55:47Z</dc:date>
    <item>
      <title>Questions regarding measuring MMIO event using PCM</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Questions-regarding-measuring-MMIO-event-using-PCM/m-p/1653345#M8489</link>
      <description>&lt;P class=""&gt;&lt;SPAN class=""&gt;Hi All,&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;I've been trying to measure PCIe performance on&amp;nbsp;Intel(R) Xeon(R) Silver 4314 on Linux. I summarize my questions and post my log below. Any help is welcomed. I always run PCM as root.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;&lt;SPAN&gt;I found some error messages when measuring, and the counters related to MMIO had no values.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Here is my log:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;===== Processor information =====&lt;BR /&gt;Linux arch_perfmon flag : yes&lt;BR /&gt;Hybrid processor : no&lt;BR /&gt;IBRS and IBPB supported : yes&lt;BR /&gt;STIBP supported : yes&lt;BR /&gt;Spec arch caps supported : yes&lt;BR /&gt;Max CPUID level : 27&lt;BR /&gt;CPU family : 6&lt;BR /&gt;CPU model number : 106&lt;BR /&gt;Number of physical cores: 32&lt;BR /&gt;Number of logical cores: 64&lt;BR /&gt;Number of online logical cores: 64&lt;BR /&gt;Threads (logical cores) per physical core: 2&lt;BR /&gt;Num sockets: 2&lt;BR /&gt;Physical cores per socket: 16&lt;BR /&gt;Last level cache slices per socket: 16&lt;BR /&gt;Core PMU (perfmon) version: 5&lt;BR /&gt;Number of core PMU generic (programmable) counters: 8&lt;BR /&gt;Width of generic (programmable) counters: 48 bits&lt;BR /&gt;Number of core PMU fixed counters: 4&lt;BR /&gt;Width of fixed counters: 48 bits&lt;BR /&gt;Nominal core frequency: 2400000000 Hz&lt;BR /&gt;IBRS enabled in the kernel : yes&lt;BR /&gt;STIBP enabled in the kernel : no&lt;BR /&gt;The processor is not susceptible to Rogue Data Cache Load: yes&lt;BR /&gt;The processor supports enhanced IBRS : yes&lt;BR /&gt;Package thermal spec power: 135 Watt; Package minimum power: 72 Watt; Package maximum power: 557 Watt;&lt;/P&gt;&lt;P&gt;ERROR: UPI LL monitoring device (0:7e:3:1) is missing. The UPI statistics will be incomplete or missing.&lt;BR /&gt;Socket 0: 4 memory controllers detected with total number of 8 channels. 2 UPI ports detected. 4 M2M (mesh to memory)/B2CMI blocks detected. 0 HBM M2M blocks detected. 0 EDC/HBM channels detected. 0 Home Agents detected. 3 M3UPI/B2UPI blocks detected.&lt;BR /&gt;ERROR: UPI LL monitoring device (0:fe:3:1) is missing. The UPI statistics will be incomplete or missing.&lt;BR /&gt;Socket 1: 4 memory controllers detected with total number of 8 channels. 2 UPI ports detected. 4 M2M (mesh to memory)/B2CMI blocks detected. 0 HBM M2M blocks detected. 0 EDC/HBM channels detected. 0 Home Agents detected. 3 M3UPI/B2UPI blocks detected.&lt;BR /&gt;Socket 0: 1 PCU units detected. 6 IIO units detected. 6 IRP units detected. 16 CHA/CBO units detected. 0 MDF units detected. 1 UBOX units detected. 0 CXL units detected. 0 PCIE_GEN5x16 units detected. 0 PCIE_GEN5x8 units detected.&lt;BR /&gt;Socket 1: 1 PCU units detected. 6 IIO units detected. 6 IRP units detected. 16 CHA/CBO units detected. 0 MDF units detected. 1 UBOX units detected. 0 CXL units detected. 0 PCIE_GEN5x16 units detected. 0 PCIE_GEN5x8 units detected.&lt;BR /&gt;Initializing RMIDs&lt;/P&gt;&lt;P&gt;Update every 1 seconds&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;=====print counters=====&lt;/P&gt;&lt;P&gt;Skt,PCIRdCur,ItoM,ItoMCacheNear,UCRdF,WiL,PCIe Rd (B),PCIe Wr (B)&lt;BR /&gt;0,76,30,14,0,16,4864,2816(Total)&lt;BR /&gt;0,16,30,6,0,16,1024,2304(Miss)&lt;BR /&gt;0,60,0,8,0,0,3840,512(Hit)&lt;BR /&gt;1,36440,221510,1463586,&lt;STRONG&gt;0,0&lt;/STRONG&gt;,2332160,107846144(Total)&lt;BR /&gt;1,34032,219764,1200274,0,0,2178048,90882432(Miss)&lt;BR /&gt;1,2408,1746,263312,0,0,154112,16963712(Hit)&lt;BR /&gt;&lt;BR /&gt;Skt,PCIRdCur,ItoM,ItoMCacheNear,UCRdF,WiL,PCIe Rd (B),PCIe Wr (B)&lt;BR /&gt;0,78,42,0,0,2,4992,2688(Total)&lt;BR /&gt;0,20,42,0,0,2,1280,2688(Miss)&lt;BR /&gt;0,58,0,0,0,0,3712,0(Hit)&lt;BR /&gt;1,288594,2532052,1410020,&lt;STRONG&gt;0,20&lt;/STRONG&gt;,18470016,252292608(Total)&lt;BR /&gt;1,288588,2532052,1197416,&lt;STRONG&gt;0,20&lt;/STRONG&gt;,18469632,238685952(Miss)&lt;BR /&gt;1,6,0,212604,0,0,384,13606656(Hit)&lt;BR /&gt;&lt;BR /&gt;Skt,PCIRdCur,ItoM,ItoMCacheNear,UCRdF,WiL,PCIe Rd (B),PCIe Wr (B)&lt;BR /&gt;0,1104,38,200,0,14,70656,15232(Total)&lt;BR /&gt;0,52,38,70,0,14,3328,6912(Miss)&lt;BR /&gt;0,1052,0,130,0,0,67328,8320(Hit)&lt;BR /&gt;1,290306,2546964,1738480,0,80,18579584,274268416(Total)&lt;BR /&gt;1,290306,2546964,1392086,0,80,18579584,252099200(Miss)&lt;BR /&gt;1,0,0,346394,0,0,0,22169216(Hit)&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;As you can see, I use the pcie device on socket 1. There is correct PCIRd counter value and ItoM counter value. However, the counters of MMIO events(i.e. WiL) are extremely low(i.e. 0 and 20.&lt;/P&gt;&lt;P class=""&gt;&lt;STRONG&gt;&lt;SPAN class=""&gt;How can I use the PCM tool to measure counters of MMIO event in this type of machines?&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Thank you in advance for your great help!&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Sincerely,&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Qiangsheng Su&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 31 Dec 2024 06:45:23 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Questions-regarding-measuring-MMIO-event-using-PCM/m-p/1653345#M8489</guid>
      <dc:creator>qiangsheng_su</dc:creator>
      <dc:date>2024-12-31T06:45:23Z</dc:date>
    </item>
    <item>
      <title>Re: Questions regarding measuring MMIO event using PCM</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Questions-regarding-measuring-MMIO-event-using-PCM/m-p/1653732#M8491</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;the software (e.g. drivers) should minimize the number of&amp;nbsp;&lt;SPAN&gt;MMIO operations (i.e. WiL) because they are expensive. For example they are used for updating Tx/Rx tail pointers and network software should use larger buffers to make those expensive updates rare.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Roman&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 02 Jan 2025 12:08:11 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Questions-regarding-measuring-MMIO-event-using-PCM/m-p/1653732#M8491</guid>
      <dc:creator>Roman_D_Intel</dc:creator>
      <dc:date>2025-01-02T12:08:11Z</dc:date>
    </item>
    <item>
      <title>Re: Questions regarding measuring MMIO event using PCM</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Questions-regarding-measuring-MMIO-event-using-PCM/m-p/1654232#M8494</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hello, I measured the data when using RDMA NIC to send data. I used RDMA NIC to send 100 Million 8B data in total. Normally, MMIO should be used, so I think the MMIO-related counter values that ​​I measured are abnormal (i.e. 0).&lt;BR /&gt;Or, is there any way to know whether MMIO is used?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Thank you in advance for your great help!&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Sincerely,&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Qiangsheng Su&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 04 Jan 2025 11:55:47 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Questions-regarding-measuring-MMIO-event-using-PCM/m-p/1654232#M8494</guid>
      <dc:creator>qiangsheng_su</dc:creator>
      <dc:date>2025-01-04T11:55:47Z</dc:date>
    </item>
  </channel>
</rss>

