I ran the latest version of Platform Profiler and uploaded the result file to the Platform Profiler web page (localhost:6543).
In the Memory view, I can see the 'DRAM Write Queue Latency' graph under 'Socket 0 - Latency'.
The graph shows the DRAM write queue latency over elapsed time, but the data is too coarse to examine tail latency. I think the sampling rate of the measurement is too low.
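To make the concern concrete, here is a minimal sketch using synthetic numbers (not data from Platform Profiler). It assumes the graph plots one averaged value per sampling interval, which is my guess about how the view works. The helper names `interval_means` and `percentile` are hypothetical and only for illustration.

```python
def interval_means(samples, window):
    """Average consecutive, non-overlapping windows of samples."""
    means = []
    for start in range(0, len(samples), window):
        chunk = samples[start:start + window]
        means.append(sum(chunk) / len(chunk))
    return means


def percentile(samples, pct):
    """Nearest-rank percentile; pct is an integer from 1 to 100."""
    ordered = sorted(samples)
    # Integer ceiling of len * pct / 100, avoiding floating-point rounding.
    rank = max(1, -(-len(ordered) * pct // 100))
    return ordered[rank - 1]


# Synthetic per-write latencies in ns (assumed values, not measurements):
# 2 slow writes (5000 ns) in every 100, all others take 100 ns.
latencies = [5000 if i % 100 in (49, 99) else 100 for i in range(1000)]

# What a coarse, averaged graph would show: one point per 100 writes.
means = interval_means(latencies, window=100)

print(max(means))                 # highest point on the averaged graph: 198.0
print(percentile(latencies, 99))  # p99 of the individual writes: 5000
```

In this toy case the averaged series never rises above 198 ns, while the 99th percentile of the individual writes is 5000 ns. That gap is the information I cannot recover from the sampled graph.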
So, how can I get complete latency information for memory accesses? Is there any way to measure it using VTune Profiler, Platform Profiler, or other tools?
I am especially interested in the write tail latency of Optane DCPMM. I found the 'Persistent Memory Write Queue Latency' graph in the Platform Profiler result, but Platform Profiler only shows me sampled data. I want all of the data measured by the hardware performance counters.
This picture is an example graph shown by Platform Profiler; the sampling rate of the graph is too low.