Hi,
Would it be possible to share reproducible kernels that we can use to analyze the performance regressions? What workgroup sizes are you using (is it 1?)
We will also take a look at the compilation failure that you encountered. Is placing the enum enough to reproduce the failure?
Thanks,
Boaz
__kernel void meanZeroCol(__global T* baseData, int numRows, int numCols)
{
    unsigned int colIndex = get_global_id(0);
    // Compute current average value for this column
    T colAdj = 0;
    for (int rowNum = 0; rowNum
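The kernel listing above is cut off partway through its loop, so its exact body is not recoverable from the post. For anyone trying to reproduce the regression, a host-side reference can help confirm that a device run still produces correct results. The following is only a sketch under stated assumptions: the name `mean_zero_col_ref` is hypothetical, `float` stands in for the kernel's `T`, the data is assumed to be stored row-major, and the operation is assumed (from the kernel name and its comment) to be subtracting each column's mean from that column.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical CPU reference, NOT the original kernel body.
 * Assumption: row-major storage, so element (row, col) lives at
 * base[row * num_cols + col]. Assumption: T is float. */
static void mean_zero_col_ref(float *base, int num_rows, int num_cols)
{
    assert(base != NULL && num_rows > 0 && num_cols > 0);

    for (int col = 0; col < num_cols; ++col) {
        /* Accumulate in double to limit rounding error in the sum. */
        double sum = 0.0;
        for (int row = 0; row < num_rows; ++row)
            sum += base[(size_t)row * num_cols + col];

        float mean = (float)(sum / num_rows);

        /* Subtract the column mean so the column averages to zero. */
        for (int row = 0; row < num_rows; ++row)
            base[(size_t)row * num_cols + col] -= mean;
    }
}
```

Comparing this output against the buffer read back from the device, within a small tolerance, is one way to check that a workgroup-size change did not alter the results.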
Boaz,
Some more positive information to report....
1) It turns out the only compiler error we ultimately encountered was the one noted above, which we easily worked around.
2) The number of kernels running slowly turned out to be relatively few, and we successfully tweaked the workgroup settings in most or possibly all cases to recover good performance. I believe our issues are similar to the performance regressions noted earlier in the forum, which you have apparently also recognized.
3) Thus only one serious issue remains for us: the incompatibility with Windows Server 2008, which you also note will be fixed in a future release.
Thank you,
David
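The thread does not say which workgroup settings were changed. As a general illustration: when an explicit local work size is passed to `clEnqueueNDRangeKernel` under OpenCL 1.x, the global work size must be a multiple of it. A common approach is to pad the global size on the host and have the kernel skip the extra work-items with a bounds check. The helper name `round_up_global` below is hypothetical, and this is a minimal sketch rather than what was actually done here.

```c
#include <assert.h>
#include <stddef.h>

/* Round the number of work-items up to the next multiple of the
 * chosen local (workgroup) size. Hypothetical helper for illustration. */
static size_t round_up_global(size_t work_items, size_t local_size)
{
    assert(local_size > 0);
    size_t rem = work_items % local_size;
    return rem == 0 ? work_items : work_items + (local_size - rem);
}
```

For example, 1000 columns with a local size of 64 would be launched with a global size of 1024, and the kernel would return early for any `get_global_id(0)` that is not less than `numCols`.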
