Intel® ISA Extensions
Use hardware-based isolation and memory encryption to provide more code protection in your solutions.
1099 Discussions

How to extract DWORD from upper half of 256-bit register?

levicki
Valued Contributor I
5,486 Views

Congratulations to Intel CPU instruction set engineers for managing to make YET ANOTHER non-orthogonal instruction set extension -- why PEXTRD/PINSRD (among many others) were not promoted to 256 bits in AVX2?

Any ideas/tricks to work around this engineering "oversight"?

0 Kudos
63 Replies
Bernard
Valued Contributor I
254 Views

Prefetching distance can be directly related to the data needed for the computation.The problem is to find how far ahead  prefetch the data.Prefetching too far can saturate the bus and as you pointed it out can cause issue with another thread is competing for L1 data cache.For particle diffusion program prefetching distance of one particle object at least  could be sufficient(of course the issue of memory data layout of such objects should be taken into account ).

0 Kudos
Reply