I am currently using in-place FFTs but in VTune it appears that the in-place functions call the out-of-place functions. Consequently, for large sizes the perftool indicates that using in-place FFTs is more efficient. I am using sizes 512K-4M. How can one be more efficient than the other if one calls the other??