This is a general limitation of sampling technology: each sample is attributed to the function it lands in, with no connection to that function's callers or callees, i.e. you get a flat profile.
However, you might want to try the Intel Performance Tuning Utility, which provides Statistical Call Graph technology and may help you find what you need.
Smaller sampling intervals are available only with EBS (event-based sampling), which doesn't provide stacks, though. A more general workaround is to modify the analyzed application so that those small functions are called many times. This statistically increases the probability that samples hit the address range of those functions.
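
To illustrate why calling a small function many times helps, here is a rough back-of-the-envelope sketch. It assumes an idealized sampler that takes one sample per fixed interval, so the expected number of hits is simply the total time spent in the function divided by the interval. The function name and all the numbers are hypothetical, chosen only for illustration, and are not taken from any profiler.

```python
def expected_samples(call_duration_ns, calls, sampling_interval_ns):
    """Expected number of samples landing inside a function.

    Idealized model (an assumption): one sample per fixed interval,
    so hits are proportional to the total time spent in the function.
    """
    total_time_in_function_ns = call_duration_ns * calls
    return total_time_in_function_ns / sampling_interval_ns


# Hypothetical numbers: a 2 microsecond function, 1 ms sampling interval.
CALL_DURATION_NS = 2_000
SAMPLING_INTERVAL_NS = 1_000_000

# Called once, the function is almost never hit by a sample.
once = expected_samples(CALL_DURATION_NS, 1, SAMPLING_INTERVAL_NS)

# Called 100,000 times, it accumulates enough time to collect many samples.
repeated = expected_samples(CALL_DURATION_NS, 100_000, SAMPLING_INTERVAL_NS)

print(once, repeated)
```

Under this model the expected hit count grows linearly with the number of calls, which is why repeating the call in the test scenario makes a short function visible in the flat profile.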