Try Intel's vTune. They have a trial version.
They also have other utility programs.
By examining where your program is executing you may shed some light on the problem. vTune will detect a fair amount of your problems (once you get the hang of using it).
I do not use vTune myself as I have an AMD based system, for that I use CodeAnalyst which has similar functionality.