- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I am trying to profile a process on linux running on a single CPU on a Broadwell (model name : Intel(R) Xeon(R) CPU D-1540 @ 2.00GHz) and getting a CPI rate of 2.828 with default Vtune config. With more samples (reducing the sample interval), I see it bumps up to 3.148. While I understand one or two delay functions in "Bottom-up" chart that are affecting the CPI rate, what I do not understand is "vmlinux" showing a CPI rate of 1.588. According to system config, CPU that I am running is dedicated to the process and any actions of linux kernel should be performed on a different CPU. Does CPI 1.588 above say this is not happening ? Any help is greatly appreciated. Any other suggestions/comments based on the results and Vtune-config pasted below?
Vtune config on General Exploration:
1. Attach to a process with ssh session.
2. Automatically stop after 60 secs.
3. Analyse child processes.
4. Under 1 minute of duration estimate
5. Collection data: 0
6. Slow frames: 40, Fast frames: 100 (default values)
7. CPU mask: 11
RESULTS:
With defaults:
Elapsed Time:    60.059s
	    Clockticks:    151,600,000
	    Instructions Retired:    53,600,000
	    CPI Rate:    2.828
	    MUX Reliability:    0.948
	    Front-End Bound:    0.191
	        Front-End Latency:    0.106
	            ICache Misses:    0.026
	            ITLB Overhead:    0.009
	            Branch Resteers:    0.047
	            DSB Switches:    0.000
	            Length Changing Prefixes:    0.000
	            MS Switches:    0.106
	        Front-End Bandwidth:    0.086
	            Front-End Bandwidth DSB:    0.026
	            Front-End Bandwidth MITE:    0.237
	            Front-End Bandwidth LSD:    0.000
	    Bad Speculation:    0.046
	    Back-End Bound:    0.584
	        Memory Bound:    0.217
	            L1 Bound:    0.237
	            L2 Bound:    0.000
	            L3 Bound:    0.000
	            DRAM Bound:    0.211
	            Store Bound:    0.000
	        Core Bound:    0.367
	            Divider:    0.000
	            Port Utilization:    0.923
	                Cycles of 0 Ports Utilized:    0.633
	                Cycles of 1 Port Utilized:    0.290
	                Cycles of 2 Ports Utilized:    0.053
	                Cycles of 3+ Ports Utilized:    0.079
	    Retiring:    0.178
	        General Retirement:    0.113
	        Microcode Sequencer:    0.065
	        Assists:    0.000
	    Total Thread Count:    5
	    Paused Time:    0s
With Sampling Interval event-config=CPU_CLK_UNHALTED.
Elapsed Time:    60.001s
	    Clockticks:    1,454,600,000
	    Instructions Retired:    462,000,000
	    CPI Rate:    3.148
	    MUX Reliability:    0.984
	    Front-End Bound:    0.067
	        Front-End Latency:    0.063
	            ICache Misses:    0.019
	            ITLB Overhead:    0.003
	            Branch Resteers:    0.025
	            DSB Switches:    0.000
	            Length Changing Prefixes:    0.000
	            MS Switches:    0.121
	        Front-End Bandwidth:    0.004
	            Front-End Bandwidth DSB:    0.000
	            Front-End Bandwidth MITE:    0.179
	            Front-End Bandwidth LSD:    0.000
	    Bad Speculation:    0.009
	    Back-End Bound:    0.769
	        Memory Bound:    0.365
	            L1 Bound:    0.294
	            L2 Bound:    0.000
	            L3 Bound:    0.234
	            DRAM Bound:    0.000
	            Store Bound:    0.000
	        Core Bound:    0.404
	            Divider:    0.000
	            Port Utilization:    0.660
	                Cycles of 0 Ports Utilized:    0.415
	                Cycles of 1 Port Utilized:    0.242
	                Cycles of 2 Ports Utilized:    0.110
	                Cycles of 3+ Ports Utilized:    0.049
	    Retiring:    0.155
	        General Retirement:    0.056
	        Microcode Sequencer:    0.099
	        Assists:    0.000
	    Total Thread Count:    5
	    Paused Time:    0s
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hello Prasanth,
When a process does say system call it will be done in the context of this process. Most likely this is what happening in your case.
 
					
				
				
			
		
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page