Intel® DevCloud
Help for those needing help starting or connecting to the Intel® DevCloud

VTune result broken on DevCloud

leevan
Employee

Dear DevCloud team,

Did you notice that the GPU VTune result is broken on DevCloud BETA 07? The computing task information is gone.

Best regards

JananiC_Intel
Moderator

Hi,

Thanks for reaching out.

Please provide the following details so that we can try to reproduce this on our side.

1) Your application details.

2) The commands used.

3) How the results are being viewed (CLI or GUI).



leevan
Employee

1) Your application details.

A code fragment is shown below.

 

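void __attribute__((noinline)) MatrixMulOpenMpGpuOffloading()
{
    // a is filled with ones (note: this is not an identity matrix)
    for (int i = 0; i < M; i++)
        for (int j = 0; j < N; j++)
            a[i][j] = 1.0;

    // each column of b is the sequence 1,2,...,N
    for (int i = 0; i < N; i++)
        for (int j = 0; j < P; j++)
            b[i][j] = i + 1.;

    // c is initialized to zero
    for (int i = 0; i < M; i++)
        for (int j = 0; j < P; j++)
            c[i][j] = 0;

    auto t1 = std::chrono::steady_clock::now();

    // Repeat the offloaded multiplication 1000 times; c keeps accumulating.
    // The repeat counter has its own name so it no longer shadows the row index.
    for (int iter = 0; iter < 1000; iter++)
    {
        // Parallelize on target device. The pragma must be followed directly by
        // the for loop, and j, k are declared in the loops so each thread has its own.
        #pragma omp target teams distribute parallel for simd map(to: a, b) map(tofrom: c) //thread_limit(128)
        for (int i = 0; i < M; i++) {
            // After one pass, each element of the product is the sum 1+2+...+N
            for (int j = 0; j < P; j++) {
                for (int k = 0; k < N; k++) {
                    c[i][j] += a[i][k] * b[k][j];
                }
            }
        }
    }
    auto t2 = std::chrono::steady_clock::now();

    // microseconds divided by 1000 gives milliseconds
    std::cout << "matmul default takes: " << std::chrono::duration_cast<std::chrono::microseconds>(t2 - t1).count() / 1000 << " ms" << std::endl;
}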

 

2) Commands used.

vtune -collect gpu-hotspots

 

3) How the results are being viewed (CLI or GUI)

VTune GUI

ArunJ_Intel
Moderator

Hi Leevan,

 


We are working on this internally and will get back to you soon. Thanks for your patience.


JananiC_Intel
Moderator

Hi,


We are forwarding the case to a subject matter expert (SME).



Adweidh_Intel
Moderator

Hi Leevan,

 

Thanks for your patience. We tried running samples on DevCloud and were able to get the computing task information. The issue appears to be related to your code, since the information was not visible when we ran it. Could you please try some other samples?
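If it helps to narrow this down, one thing worth ruling out is that the target region silently falls back to the host, in which case there would be no GPU computing tasks to report. This is an assumption on our part, not something confirmed in this thread. Below is a minimal C++ sketch of such a check. The helper names TargetRegionRunsOnDevice and SmallOffloadMatMul are hypothetical and are not taken from any Intel sample. The code is written so that it still builds and runs on the host when OpenMP is disabled.

```cpp
#include <cassert>
#include <vector>
#ifdef _OPENMP
#include <omp.h>
#endif

// Hypothetical helper: returns true only if an OpenMP target region really
// executed on a device. Returns false on host fallback or if OpenMP is off.
bool TargetRegionRunsOnDevice()
{
    int on_host = 1;
#ifdef _OPENMP
    #pragma omp target map(from: on_host)
    {
        on_host = omp_is_initial_device();
    }
#endif
    return on_host == 0;
}

// Hypothetical helper: multiplies an all-ones n x n matrix by a matrix whose
// columns are 1,2,...,n and returns element (0,0) of the product, which
// should equal n*(n+1)/2. Loop variables are local, so there is no sharing.
double SmallOffloadMatMul(int n)
{
    assert(n > 0);
    std::vector<double> a(n * n, 1.0), b(n * n), c(n * n, 0.0);
    for (int i = 0; i < n; i++)
        for (int j = 0; j < n; j++)
            b[i * n + j] = i + 1.0;

    double *pa = a.data();
    double *pb = b.data();
    double *pc = c.data();

    // The pragma is followed directly by the loop it applies to.
    #pragma omp target teams distribute parallel for \
        map(to: pa[0:n*n], pb[0:n*n]) map(tofrom: pc[0:n*n])
    for (int i = 0; i < n; i++) {
        for (int j = 0; j < n; j++) {
            double sum = 0.0;
            for (int k = 0; k < n; k++)
                sum += pa[i * n + k] * pb[k * n + j];
            pc[i * n + j] = sum;
        }
    }
    return c[0];
}
```

Calling TargetRegionRunsOnDevice() from main before profiling shows whether offload is active in that build and environment. If it returns false, the compiler options and the selected device are the first things to check. SmallOffloadMatMul(n) gives a small kernel with a known result that can be profiled in place of the larger loop.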

 

Adweidh_Intel
Moderator

Hi Leevan,


We would be glad to hear an update from you.


Thanks,

Adweidh


Adweidh_Intel
Moderator

Hi Leevan,


 

We assume that the solution provided helped, so we will no longer monitor this thread. Please raise a new thread if you have any further issues.


 

Thanks,

Adweidh.

