I have a pair of microbenchmark binaries, A and B, built from the same source tree, using different versions of the same compiler.
When I run these binaries under 'perf stat -e branches,branch-misses', I observe the following:
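For reference, the invocations look like the following (binary names and the repeat count are placeholders; `-r` averages over multiple runs to reduce noise):

```shell
# Hypothetical binary names; adjust to your actual benchmark executables.
perf stat -e branches,branch-misses -r 10 ./bench_a
perf stat -e branches,branch-misses -r 10 ./bench_b
```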
On Skylake, the branch mis-prediction rate of binary A is twice as high as binary B.
On Broadwell, the same discrepancy exists, but the ratio is reversed: the mis-prediction rate of binary B is twice as high as that of binary A.
The total number of retired branches is the same in all four cases.
What are the possible causes of the effect I am seeing?
Can you suggest a method for investigating this further?