With Intel Advisor is possible to do so and is called "Estimated data transfer with reuse", as I attach in the following screen:
In Intel VTune the only way I found is via the "Memory Access" analysis but It express the result as number of loads and stores and probably using hardware counters, so if there are multiple readings from main memory caused by huge data structures, they will be taken into account and does not returns the number of bytes.
Is there a way to perform a similar analysis with Intel VTune? Thanks