- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I have a host with three phi card,and a big matrix(it is so large that it cannot be directly copoed to phi card) need be divided three part then offload to phi card,doing some processing,then each part of big matrix need transpose back to host.
how could I implement this using c++?
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
When you say transpose, do you really mean transpose, as in to transpose a matrix, or do you mean transfer, as in let me transfer (copy) the matrix to the coprocessor? If you meant transfer, you might want to check out the use of array sections in offload transfers. The article https://software.intel.com/en-us/articles/xeon-phi-coprocessor-data-transfer-non-contiguous-array-elements-using-language-extensions shows how to do this, even if you have a non-unit stride.

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page