- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Just wanted to add two feature requests:
1. FP64 support has already been mentioned and would be the most important request for me.
2. Memory migration APIs from OpenCL 1.2 would be very nice to have
1. FP64 support has already been mentioned and would be the most important request for me.
2. Memory migration APIs from OpenCL 1.2 would be very nice to have
Link Copied
3 Replies
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Could you explain a bit about why you need the memory migration APIs?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Could you explain a bit about why you need the memory migration APIs?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Actually, disregard the memory migration API request. After looking at Intel's OpenCL optimization guide, I am now looking at avoiding copies altogether by allocating properly aligned memory :)
To explain why I initially requested memory migration APIs: Consider a GPU device. Without memory migration APIs, some drivers don't provide hard guarantees about where exactly a memory object "lives", and only that it will be made available when you launch a kernel. The driver is otherwise free to swap stuff in/out of GPU memory. This causes my application to do somewhat unpredicatable memory copies between CPU and GPU on other platforms. However, on Ivy Bridge, it looks like I can avoid memory transfers altogether if the alignment conditions are met.
To explain why I initially requested memory migration APIs: Consider a GPU device. Without memory migration APIs, some drivers don't provide hard guarantees about where exactly a memory object "lives", and only that it will be made available when you launch a kernel. The driver is otherwise free to swap stuff in/out of GPU memory. This causes my application to do somewhat unpredicatable memory copies between CPU and GPU on other platforms. However, on Ivy Bridge, it looks like I can avoid memory transfers altogether if the alignment conditions are met.
Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page