- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
I'm a newbie try to optimize my computational program. When I using streaming store Intrinsics replace regular store Intrinsics, I get the runtime error.
My application is a finite-difference program. This is the problematic codes.
_mm256_stream_pd(
&A[idx], _mm256_add_pd(
_mm256_mul_pd(C0, _mm256_load_pd(&A[idx])),
_mm256_sub_pd(_mm256_mul_pd(C1, _mm256_sub_pd(Bz0, Bzy)),
_mm256_mul_pd(C2, _mm256_sub_pd(By0, Byz)))));
Can't I use non-temporal Intrinsics in this case? I'm not very clear about the restrictions.
Thanks
Link Copied
0 Replies

Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page