- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Congratulations to Intel CPU instruction set engineers for managing to make YET ANOTHER non-orthogonal instruction set extension -- why PEXTRD/PINSRD (among many others) were not promoted to 256 bits in AVX2?
Any ideas/tricks to work around this engineering "oversight"?
Link Copied
- « Previous
- Next »
63 Replies
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Prefetching distance can be directly related to the data needed for the computation.The problem is to find how far ahead prefetch the data.Prefetching too far can saturate the bus and as you pointed it out can cause issue with another thread is competing for L1 data cache.For particle diffusion program prefetching distance of one particle object at least could be sufficient(of course the issue of memory data layout of such objects should be taken into account ).
Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- « Previous
- Next »