Turn on suggestions

Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

- Intel Community
- Software
- Software Development SDKs and Libraries
- Intel® oneAPI Math Kernel Library
- Unexpected NaN with clbas_dgemm

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page

Matt_S_

Beginner

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

06-23-2015
07:23 AM

125 Views

Unexpected NaN with clbas_dgemm

I am calculating a correlation matrix and am getting a weird issue. I have a 15x6 matrix R getting read into a subroutine correlation.c which calculates a 6x6 correlation matrix cor based off of the covariance matrix of R. My code looks like this

#include <stdio.h> #include <stdlib.h> #include <string.h> #include <math.h> #include <mkl.h> #include "params.h" void show_matrix(REAL *A, int n_dim, int m_dim); REAL *correlation(REAL *A, int n_dim, int m_dim) { int diag; REAL *cvm = (REAL*)_mm_malloc(sizeof(REAL) * n_dim * n_dim, 64); REAL *cor = (REAL*)_mm_malloc(sizeof(REAL) * n_dim * n_dim, 64); REAL *colavg = (REAL*)_mm_malloc(sizeof(REAL) * n_dim, 64); if (cvm == NULL || colavg == NULL) { printf("\nERROR: Cannot allocate memory for matrices in CORRELATION. Aborting...\n\n"); exit(EXIT_FAILURE); } else { memset(cvm,0.0,sizeof(cvm)); memset(cor,0.0,sizeof(cor)); memset(colavg,0.0,sizeof(colavg)); } /* Calculate column averages */ for (int i = 0; i < n_dim; i++) { for (int j = 0; j < m_dim; j++) { colavg+= A[j * n_dim + i]; } colavg/= m_dim; } /* Calculate covariance matrix */ for (int i = 0; i < m_dim; i++) { int indx = i * n_dim; for (int j = 0; j < n_dim; j++) { A[indx + j] -= colavg; } } printf("A in\n"); show_matrix(A, m_dim, n_dim); cblas_dgemm(CblasRowMajor, CblasTrans, CblasNoTrans, n_dim, n_dim, m_dim, 1.0, A, n_dim, A, n_dim, 1.0, cvm, n_dim); printf("cvm out\n"); show_matrix(cvm, n_dim, n_dim); for (int i = 0; i < n_dim; i++) { int indx = i * n_dim; for (int j = 0; j < n_dim; j++) { cvm[indx + j] /= (m_dim - 1); } } printf("cvm final\n"); show_matrix(cvm, n_dim, n_dim); /* Calculate correlation matrix */ diag = n_dim + 1; for (int i = 0; i < n_dim; i++) { int indx = i * n_dim; int idiag = i * diag; for (int j = 0; j <= i; j++) { cor[indx + j] = cvm[indx + j] / sqrt(cvm[idiag] * cvm[j * diag]); } } printf("cor\n"); show_matrix(cor, n_dim, n_dim); return cor;

The "A in" debug print gives my expected input of

A in 1.53400 1.53400 -1.53400 -1.53400 0.48900 -0.31900 -0.88700 -0.48900 0.88700 -0.88700 -0.15700 0.67400 -0.48900 0.67400 -0.48900 1.15000 1.53400 -0.48900 0.88700 0.00000 -0.67400 0.31900 -0.00000 -1.53400 1.15000 -0.31900 0.48900 0.67400 0.15700 1.15000 0.15700 -1.53400 -0.88700 -0.67400 -0.31900 0.15700 -1.15000 -0.67400 -0.15700 0.15700 -1.53400 -0.15700 0.00000 -0.88700 0.15700 -0.31900 -0.67400 0.88700 0.31900 -0.15700 0.67400 0.88700 0.67400 1.53400 -0.31900 0.15700 -0.31900 -1.15000 1.15000 -0.88700 -1.53400 0.88700 1.15000 1.53400 -0.48900 -1.15000 -0.15700 -1.15000 1.53400 -0.15700 -1.15000 -0.67400 0.48900 0.48900 -1.15000 0.48900 -0.88700 -0.00000 0.67400 0.31900 0.31900 0.00000 0.88700 0.31900 -0.67400 1.15000 0.00000 -0.48900 0.31900 0.48900

However the "cvm out" debug print gives something a little bit goofy

cvm out 10.56446 1.02348 -4.93052 -2.46705 2.76175 1.84661 1.02348 10.56446 -3.30580 0.74990 -nan -2.39809 -4.93052 -3.30580 10.56446 3.56732 -2.08079 2.00943 -2.46705 0.74990 3.56732 10.56446 -0.43575 -0.31520 2.76175 5.11138 -2.08079 -0.43575 10.56446 0.55179 1.84661 -2.39809 2.00943 -0.31520 0.55179 10.56446

Why is that NaN there?! The symmetric value across the diagonal is correct! Very weird!

I am currently working around it by working only on the lower half, which works due to the matrix being SPD, but I'm very curious about this rogue NaN.

Any ideas?

Link Copied

5 Replies

Anthony_H_Intel1

Employee

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

06-24-2015
09:34 AM

125 Views

Matt,

Using the same "A in" matrix and cblas_dgemm() call I was not able to reproduce the NaN. Note that I calculated "cvm" correctly only when "A" and "cvm" were of type double*. Can you provide any more information on the REAL type you are using? You might want try the same dgemm call with "A" and "cvm" as double* to see if it’s a type issue and if the NaN persists. Other than that, it would be helpful to know a little more about your environment: what MKL version are you using, are you using threading, what is your system/OS ... etc.

Anthony

Matt_S_

Beginner

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

06-24-2015
09:46 AM

125 Views

Anthony H. (Intel) wrote:

Matt,

Using the same "A in" matrix and cblas_dgemm() call I was not able to reproduce the NaN. Note that I calculated "cvm" correctly only when "A" and "cvm" were of type double*. Can you provide any more information on the REAL type you are using? You might want try the same dgemm call with "A" and "cvm" as double* to see if it’s a type issue and if the NaN persists. Other than that, it would be helpful to know a little more about your environment: what MKL version are you using, are you using threading, what is your system/OS ... etc.

Anthony

Anthony,

I have REAL defined in params.h (source below)

#ifndef _params_h #define _params_h #define REAL double /* precision of data */ #define DBG 0 /* 1 prints out results */ #define READIN 0 /* 1 reads in white paper data, 0 randomizes data */ #define NVAR 128 /* ncols */ #define NITER 25000 /* nrows */ #define RANDMIN -2 #define RANDMAX 2 #endif

And the matrix A is originally allocated by (in main)

REAL *r_data; ... r_data = (REAL *)mkl_malloc(m_dim * n_dim * sizeof(REAL), 64); ... memset(r_data,0.0,sizeof(r_data)); ... /* setting of r_data values happens in here */ t_data = correlation(r_data, n_dim, m_dim);

and inside of correlation.c is where I get that NaN.

Anthony_H_Intel1

Employee

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

06-24-2015
03:14 PM

125 Views

Hey Matt,

We tracked down the issue: the third parameter of memset() needs to specify the size of the matrix in addition to sizeof() so it should be

memset(cvm, 0.0, sizeof(cvm) * n_dim * n_dim);

Anthony

mecej4

Black Belt

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

06-24-2015
03:34 PM

125 Views

Anthony H. (Intel) wrote:

We tracked down the issue: the third parameter of memset() needs to specify the size of the matrix in addition to sizeof() so it should be

memset(cvm, 0.0, sizeof(cvm) * n_dim * n_dim);

That is still in error, and would cause you problems if you target x32 and x64; sizeof(cvm) and sizeof(*cvm) are not necessarily the same, depending on what REAL is defined to be; note that the count of bytes to set to zero should be computed from the size of each array element and the number of elements in the array, and the size of a pointer does not enter into this calculation.

Matt_S_

Beginner

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

06-25-2015
06:04 AM

125 Views

All,

That cleared it up, however this makes memset SLOW! Doing this is much faster and actually vectorizes according to the compiler (-qopt-report)

for (int i = 0; i < m_dim; i++) { int indx = i * n_dim; for (int j = 0; j < n_dim; j++) { matrix[indx + j] = 0.0; } }

Topic Options

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page

For more complete information about compiler optimizations, see our Optimization Notice.