I may need to know more about what you wish to accomplish. However, within Intel VTune Amplifier XE and VTune you can sort by the Function column, or change the grouping to Function / Call Stack and then sort the first column. Either approach orders the results by function, which lets you focus on a specific function.
The User APIs may also be useful, for example the Frame API or the Collection Control API.
Collection Control API:
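To illustrate how the Collection Control API could be used to limit collection to one function, here is a minimal sketch in C. It assumes the `__itt_pause()` and `__itt_resume()` calls declared in `ittnotify.h`, which ships with the product. The `USE_ITT` macro, the `COLLECT_PAUSE` / `COLLECT_RESUME` wrappers, and the functions `setup_phase`, `hot_function`, and `run_workload` are hypothetical names made up for this example. Without `-DUSE_ITT` the wrappers compile to no-ops, so the code builds even on a machine where the tool is not installed.

```c
/* Sketch only: define USE_ITT and link against the ittnotify library to get
   real collection control; otherwise the wrappers below do nothing. */
#ifdef USE_ITT
#include <ittnotify.h>
#define COLLECT_PAUSE()  __itt_pause()
#define COLLECT_RESUME() __itt_resume()
#else
#define COLLECT_PAUSE()  ((void)0)
#define COLLECT_RESUME() ((void)0)
#endif

/* Hypothetical setup work that should stay out of the profile. */
static long setup_phase(long n)
{
    long sum = 0;
    for (long i = 0; i < n; i++)
        sum += i;
    return sum;
}

/* Hypothetical function of interest: the only region we want sampled. */
static long hot_function(long n)
{
    long sum = 0;
    for (long i = 1; i <= n; i++)
        sum += i * i;
    return sum;
}

/* Pause collection around the setup, resume it for the function of
   interest, then pause again so later code is not sampled. */
long run_workload(long n)
{
    COLLECT_PAUSE();
    long base = setup_phase(n);

    COLLECT_RESUME();
    long result = hot_function(n);
    COLLECT_PAUSE();

    return base + result;
}
```

Calling `run_workload()` from your own `main` and starting the analysis in a paused state should, under these assumptions, restrict the collected samples to `hot_function`. Check the User API documentation for the exact build and link settings on your platform.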
Intel VTune Amplifier XE provides low-overhead statistical sampling technology. Additional information about user-mode sampling for analysis types such as Hotspots, Locks and Waits, and Concurrency can be found here: http://software.intel.com/sites/products/documentation/hpc/amplifierxe/en-us/lin/ug_docs/olh/common/...
Currently Intel VTune Amplifier XE does not have the specific functionality that you described. However, the hardware-based sampling has very low overhead, and filtering the data is easy: there are various filtering controls at the bottom of the Bottom-up and Top-down Tree views. Once regions are identified, you can drill into the source to locate the code lines where the time is being spent.
The following page provides a good example of a usage model and outlines how one might go about interpreting / exploring hotspot analysis to increase performance: http://software.intel.com/sites/products/documentation/hpc/amplifierxe/en-us/lin/ug_docs/index.htm#o...
As previously mentioned, Amplifier XE has sorting and filtering functionality in the GUI to control what you are viewing. There are also pause / resume controls in the GUI to control when data is gathered during the application run.
In addition, there are various project properties that can be used to limit the amount of data collected. For example, you can set data and time limits, set the application duration time estimate (a longer estimated duration increases the sampling interval), choose how child processes are handled, etc.
Have you run Intel VTune Amplifier XE on your application to see if the methodology meets your needs?