Deep Analysis returns the following error and shows an entirely blank "Execution Duration" tab:
The remaining tabs are populated.
I'm launching 2688 global work-items with a local size of 224, which works out to one SIMD8 per hardware thread on an HD 6000.
This is on a NUC5i5RYH HD6000 IGP on Win7/x64.
Both 32-bit and 64-bit Code Builder throw the same error.
Allan,
Sorry for getting back to you so late: I was teaching OpenCL classes for the last couple of days.
What kind of kernel are you analyzing? Could you share the code and the steps to reproduce it?
Unfortunately I can't share the code. I was hoping your team could just look at the error and zero in on where it's failing.
The kernel uses 60+ registers per work-item, and the launch is 12 workgroups of 224 work-items (2688 in total).
Haswell has 70 hardware threads per subslice, which is less than 72.
The HD 6000, by contrast, has 56 per subslice, with 3 subslices per slice and 2 slices. Perhaps that's hitting a hardcoded limit.
The Deep Analysis "EU Statistics" panel shows a single invocation per hardware thread across all possible 7 x 8 x 3 x 2 threads (but mentions "Core" -- see below) so the kernel is executing as I expected.
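For anyone checking the numbers, the thread arithmetic above can be written out as a short Python sketch. The variable names are only illustrative, and the topology figures are the ones quoted in this post rather than values queried from the driver.

```python
# Hardware-thread arithmetic for the launch described in this thread.
# Topology figures are the ones quoted in the post for the HD 6000;
# they are assumptions here, not values read back from the device.
THREADS_PER_EU = 7        # hardware threads per EU
EUS_PER_SUBSLICE = 8
SUBSLICES_PER_SLICE = 3
SLICES = 2
SIMD_WIDTH = 8            # work-items per hardware thread (SIMD8)

GLOBAL_SIZE = 2688        # total work-items
LOCAL_SIZE = 224          # work-items per workgroup

threads_per_subslice = THREADS_PER_EU * EUS_PER_SUBSLICE
total_threads = threads_per_subslice * SUBSLICES_PER_SLICE * SLICES

threads_needed = GLOBAL_SIZE // SIMD_WIDTH
workgroups = GLOBAL_SIZE // LOCAL_SIZE
threads_per_workgroup = LOCAL_SIZE // SIMD_WIDTH

print("threads per subslice: ", threads_per_subslice)
print("total hardware threads:", total_threads)
print("threads needed:        ", threads_needed)
print("workgroups:            ", workgroups)
print("threads per workgroup: ", threads_per_workgroup)
```

Under those assumptions the launch needs exactly as many SIMD8 threads as the device exposes, which matches the single invocation per hardware thread shown in the panel.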
---
Side note: the labelling scheme used in the EU Statistics page is confusing: "Core W, Slice X, Half-Slice Y, EU Z".
If the Gen8 Compute Architecture Guide is relevant, I would expect to see "Slice X, Sub-Slice Y, EU Z".
But I'm sure your team has its reasons for labelling it that way. :)
Allan,
I just communicated this issue to the development team. I am behind the technology curve: my boss is in the process of ordering an HD 6000, so I should be able to reproduce issues like this in the very near future.
Just to clarify: Deep Analysis runs fine on Haswell, but bombs on Broadwell?
Sigh, I have an entirely different Deep Analysis error on Haswell (the dreaded -34) which I wasn't even going to mention. This is with the latest Code Builder. The regular "Analyze" functionality works fine though. Also, Deep Analysis ran fine on the previous release of Code Builder.
So, to answer your question: with the latest Code Builder, I've only seen Deep Analysis partially work on Broadwell. All panels except the "Execution Duration" panel are populated.
Get an HD 6000 NUC. It rocks. :)
