Intel® DevCloud
Help for those needing help starting or connecting to the Intel® DevCloud

VTune result broken on DevCloud

leevan
Employee
573 Views

Dear DevCloud team,

Did you notice that GPU VTune results are broken on DevCloud BETA 07? The compute task information is gone.

Best regards

7 Replies
JananiC_Intel
Moderator
562 Views

Hi,

Thanks for reaching out.

Please provide the details below so that we can try to reproduce this on our side.

1) Your application details.

2) Commands used.

3) How the results are being viewed (CLI or GUI).



leevan
Employee
538 Views

1) Your application details.

A code fragment is shown below.

 

void __attribute__((noinline)) MatrixMulOpenMpGpuOffloading()
{
    // a is filled with ones
    for (int i = 0; i < M; i++)
        for (int j = 0; j < N; j++)
            a[i][j] = 1.0;

    // each column of b is the sequence 1,2,...,N
    for (int i = 0; i < N; i++)
        for (int j = 0; j < P; j++)
            b[i][j] = i + 1.;

    // c is initialized to zero
    for (int i = 0; i < M; i++)
        for (int j = 0; j < P; j++)
            c[i][j] = 0;

    auto t1 = chrono::steady_clock::now();

    // Repeat the offloaded multiplication 1000 times. The repetition counter
    // has its own name so the inner loops cannot overwrite it.
    for (int rep = 0; rep < 1000; rep++)
    {
        // Parallelize on target device; the pragma must be followed directly by the loop
        #pragma omp target teams distribute parallel for simd map(to: a, b) map(tofrom: c) //thread_limit(128)
        for (int i = 0; i < M; i++) {
            // Each pass adds the sum 1+2+...+N to every element of the product
            for (int j = 0; j < P; j++) {
                for (int k = 0; k < N; k++) {
                    c[i][j] += a[i][k] * b[k][j];
                }
            }
        }
    }
    auto t2 = chrono::steady_clock::now();

    // Microseconds divided by 1000 gives milliseconds
    cout << "matmul default takes :" << chrono::duration_cast<chrono::microseconds>(t2 - t1).count() / 1000 << "ms" << std::endl;
}

 

2) Commands used.

vtune -collect gpu-hotspots

 

3) How the results are being viewed (CLI or GUI)

VTune GUI

ArunJ_Intel
Moderator
487 Views

Hi Leevan,

 


We are working on this internally and will get back to you soon. Thanks for your patience.


JananiC_Intel
Moderator
474 Views

Hi,


We are forwarding the case to a subject matter expert (SME).



Adweidh_Intel
Moderator
436 Views

Hi Leevan,

 

Thanks for your patience. We tried running samples on DevCloud and were able to get the compute task information. The issue appears to be related to your code, as the information was not visible when we ran it. Could you please try some other samples?
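
For reference, a minimal self-contained sample for this kind of check might look like the sketch below. It is not one of the samples that were actually run. The matrix sizes, the float element type, and the function names are assumptions made for illustration, since the original post does not give them. It performs a single offloaded matrix multiplication and verifies the result on the host, so a missing compute task in VTune can be told apart from a kernel that never ran correctly.

```cpp
#include <cassert>
#include <cstdio>

// Hypothetical sizes; the original post does not state M, N, or P.
constexpr int M = 64, N = 32, P = 48;

static float a[M][N], b[N][P], c[M][P];

// Multiplies a (all ones) by b (each column is 1..N) on the target device.
// If the compiler has no OpenMP offload support, the pragma is ignored and
// the loop simply runs on the host.
void MatMulOffload()
{
    for (int i = 0; i < M; i++)
        for (int j = 0; j < N; j++)
            a[i][j] = 1.0f;
    for (int i = 0; i < N; i++)
        for (int j = 0; j < P; j++)
            b[i][j] = i + 1.0f;
    for (int i = 0; i < M; i++)
        for (int j = 0; j < P; j++)
            c[i][j] = 0.0f;

    // The pragma is followed directly by the loop, and all loop indices are
    // declared inside the region so each thread gets its own copy.
    #pragma omp target teams distribute parallel for map(to: a, b) map(tofrom: c)
    for (int i = 0; i < M; i++) {
        for (int j = 0; j < P; j++) {
            float sum = 0.0f;
            for (int k = 0; k < N; k++)
                sum += a[i][k] * b[k][j];
            c[i][j] = sum;
        }
    }
}

// Returns true when every element of c equals 1 + 2 + ... + N.
bool ResultIsCorrect()
{
    const float expected = N * (N + 1) / 2.0f;
    for (int i = 0; i < M; i++)
        for (int j = 0; j < P; j++)
            if (c[i][j] != expected)
                return false;
    return true;
}

// Runs the kernel once and checks the result on the host.
void RunAndCheck()
{
    MatMulOffload();
    assert(ResultIsCorrect());
    std::puts("matmul result verified");
}
```

Calling RunAndCheck() from main() and profiling that binary gives a small, known-good kernel to compare against. With the Intel oneAPI DPC++/C++ compiler, OpenMP offload builds typically use icpx with -fiopenmp -fopenmp-targets=spir64; treat the exact options as an assumption to confirm against the compiler version installed on DevCloud.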

 

Adweidh_Intel
Moderator
396 Views

Hi Leevan,


We would be glad to hear an update from you.


Thanks,

Adweidh


Adweidh_Intel
Moderator
375 Views

Hi Leevan,


 

We are assuming that the solution provided helped and will no longer be monitoring this issue. Please raise a new thread if you have any further issues.


 

Thanks,

Adweidh.

