- Als neu kennzeichnen
- Lesezeichen
- Abonnieren
- Stummschalten
- RSS-Feed abonnieren
- Kennzeichnen
- Anstößigen Inhalt melden
Just wanted to add two feature requests:
1. FP64 support has already been mentioned and would be the most important request for me.
2. Memory migration APIs from OpenCL 1.2 would be very nice to have
1. FP64 support has already been mentioned and would be the most important request for me.
2. Memory migration APIs from OpenCL 1.2 would be very nice to have
Link kopiert
3 Antworten
- Als neu kennzeichnen
- Lesezeichen
- Abonnieren
- Stummschalten
- RSS-Feed abonnieren
- Kennzeichnen
- Anstößigen Inhalt melden
Could you explain a bit about why you need the memory migration APIs?
- Als neu kennzeichnen
- Lesezeichen
- Abonnieren
- Stummschalten
- RSS-Feed abonnieren
- Kennzeichnen
- Anstößigen Inhalt melden
Could you explain a bit about why you need the memory migration APIs?
- Als neu kennzeichnen
- Lesezeichen
- Abonnieren
- Stummschalten
- RSS-Feed abonnieren
- Kennzeichnen
- Anstößigen Inhalt melden
Actually, disregard the memory migration API request. After looking at Intel's OpenCL optimization guide, I am now looking at avoiding copies altogether by allocating properly aligned memory :)
To explain why I initially requested memory migration APIs: Consider a GPU device. Without memory migration APIs, some drivers don't provide hard guarantees about where exactly a memory object "lives", and only that it will be made available when you launch a kernel. The driver is otherwise free to swap stuff in/out of GPU memory. This causes my application to do somewhat unpredicatable memory copies between CPU and GPU on other platforms. However, on Ivy Bridge, it looks like I can avoid memory transfers altogether if the alignment conditions are met.
To explain why I initially requested memory migration APIs: Consider a GPU device. Without memory migration APIs, some drivers don't provide hard guarantees about where exactly a memory object "lives", and only that it will be made available when you launch a kernel. The driver is otherwise free to swap stuff in/out of GPU memory. This causes my application to do somewhat unpredicatable memory copies between CPU and GPU on other platforms. However, on Ivy Bridge, it looks like I can avoid memory transfers altogether if the alignment conditions are met.
Antworten
Themen-Optionen
- RSS-Feed abonnieren
- Thema als neu kennzeichnen
- Thema als gelesen kennzeichnen
- Diesen Thema für aktuellen Benutzer floaten
- Lesezeichen
- Abonnieren
- Drucker-Anzeigeseite