- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hello,
May be its not a right question to ask here but anyway i am posting.
I have the same OpenCL kernel and compilation time of the kernel take almost 10 times more time in a GPU compared to intel CPU.
Nvidia GPU0.19405 seconds!
Intel CPU 0.0290 seconds!
Does somebody have any insight.
is it may be because GPU compilation usually takes more time compared to CPUs...
Thanks
Link Copied
2 Replies
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Are you sure the Nvidia compile did a full rebuild? I've found that it will cache previous build(s) with the same hash of the source file.
For example, if you undo some changes, or modify the file to the same as a previous build, a full rebuild will not be triggered.
One of my kernels takes more than 10 seconds to compile by Nvidia, but with Intel it is less than two seconds. (clBuildProgram) My results seem to vary though with Nvidia, so I am considering precompiling the .cl source to .ptx using clCreateProgramWithBinary rather than clCreateProgramWithSource.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
NVidia definately caches builds and this has been a discussion on this forum in the past. There seem to vary based on the content of the loops, for example Loop unrolling on the GPU expecially in AMD's compiler can take extreammly long when compiling or the GPU.
Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page