I've profiled a PyTorch code, that is executed on the 16-core Xeon CPU. And now, I'm profiling the same code on the environment consisting of 4-core CPU and NVIDIA Titan XP. I'm using some NVIDIA GPU profiling tools.
In this point, I'm curious about the results when I use VTune amplifier to profile in the second environment I've mentioned. Do you think this will cause error? Or just generate report that is different from CPU-only environment?
VTune should not cause any errors in your second environment. It will still be able to profile the activity on the CPUs and it has some features to profile the PCIe traffic. You will not, have much visibility into what's going on inside the NVIDIA GPU using VTune profiler. However, it sounds like you have some other tools for that already.