Community
cancel
Showing results for 
Search instead for 
Did you mean: 
Highlighted
Beginner
34 Views

"error ADX is not available" when using ICC 17.0.4 on a Skylake machine

Jump to solution

Hi Everyone,

I'm trying to compile a program that uses ADX. ICC version is:

$ icc --version
icc (ICC) 17.0.4 20170411

The machine is a 6th gen Core i5, so ADX is available. ADX first appeared in 5th gen Core processors.

I'm using `-march=native` which should activate AVX2, BMI2 and other goodies like ADX on this particular machine. When I attempt to compile:

$ icc -Wall -O3 -march=native test.cxx -o test.exe
test.cxx(14): error: #error directive: ADX is not available
  # error ADX is not available
    ^
...

The error is triggered by the following in test.cxx:

#if !defined(__ADX__)
# error ADX is not available
#endif

Adding `-mcpu=corei5` does not help, and adding `-madx` results in an error.

I can't find local man pages. Searching online for the options pretty much sucks. I've found the ICC(1) man page at https://software.intel.com/sites/default/files/m/d/4/1/d/8/icc.txt, but it stops at Pentium 4's. Other than that, I'm getting useless hits for "intel icc enable adx".

How do I compile a program that uses ADX under ICC?

Thank you in advance.

-----

Here are the preprocessor macros provided by ICC.

skylake:$ icc -march=core-avx2 -dM -E - </dev/null | sort
#define __amd64 1
#define __amd64__ 1
#define __ATOMIC_ACQ_REL 4
#define __ATOMIC_ACQUIRE 2
#define __ATOMIC_CONSUME 1
#define __ATOMIC_HLE_ACQUIRE 65536
#define __ATOMIC_HLE_RELEASE 131072
#define __ATOMIC_RELAXED 0
#define __ATOMIC_RELEASE 3
#define __ATOMIC_SEQ_CST 5
#define __AVX__ 1
#define __AVX2__ 1
#define __AVX_I__ 1
#define __BIGGEST_ALIGNMENT__ 64
#define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__
#define __CHAR16_TYPE__ unsigned short
#define __CHAR32_TYPE__ unsigned int
#define __CHAR_BIT__ 8
#define __cilk 200
#define __DATE__ "Aug 21 2017"
#define __DBL_DECIMAL_DIG__ 17
#define __DBL_DENORM_MIN__ 4.9406564584124654e-324
#define __DBL_DIG__ 15
#define __DBL_EPSILON__ 2.2204460492503131e-16
#define __DBL_HAS_DENORM__ 1
#define __DBL_HAS_INFINITY__ 1
#define __DBL_HAS_QUIET_NAN__ 1
#define __DBL_MANT_DIG__ 53
#define __DBL_MAX_10_EXP__ 308
#define __DBL_MAX__ 1.7976931348623157e+308
#define __DBL_MAX_EXP__ 1024
#define __DBL_MIN_10_EXP__ -307
#define __DBL_MIN__ 2.2250738585072014e-308
#define __DBL_MIN_EXP__ -1021
#define __DEC128_EPSILON__ 1E-33DL
#define __DEC128_MANT_DIG__ 34
#define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL
#define __DEC128_MAX_EXP__ 6145
#define __DEC128_MIN__ 1E-6143DL
#define __DEC128_MIN_EXP__ -6142
#define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL
#define __DEC32_EPSILON__ 1E-6DF
#define __DEC32_MANT_DIG__ 7
#define __DEC32_MAX__ 9.999999E96DF
#define __DEC32_MAX_EXP__ 97
#define __DEC32_MIN__ 1E-95DF
#define __DEC32_MIN_EXP__ -94
#define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF
#define __DEC64_EPSILON__ 1E-15DD
#define __DEC64_MANT_DIG__ 16
#define __DEC64_MAX__ 9.999999999999999E384DD
#define __DEC64_MAX_EXP__ 385
#define __DEC64_MIN__ 1E-383DD
#define __DEC64_MIN_EXP__ -382
#define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD
#define __DEC_EVAL_METHOD__ 2
#define __DECIMAL_BID_FORMAT__ 1
#define __DECIMAL_DIG__ 21
#define __EDG__ 1
#define __EDG_PTRDIFF_TYPE__ long
#define __EDG_SIZE_TYPE__ unsigned long
#define __EDG_VERSION__ 411
#define __ELF__ 1
#define __FINITE_MATH_ONLY__ 0
#define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__
#define __FLT_DECIMAL_DIG__ 9
#define __FLT_DENORM_MIN__ 1.40129846e-45F
#define __FLT_DIG__ 6
#define __FLT_EPSILON__ 1.19209290e-7F
#define __FLT_HAS_DENORM__ 1
#define __FLT_HAS_INFINITY__ 1
#define __FLT_HAS_QUIET_NAN__ 1
#define __FLT_MANT_DIG__ 24
#define __FLT_MAX_10_EXP__ 38
#define __FLT_MAX__ 3.40282347e+38F
#define __FLT_MAX_EXP__ 128
#define __FLT_MIN_10_EXP__ -37
#define __FLT_MIN__ 1.17549435e-38F
#define __FLT_MIN_EXP__ -125
#define __FLT_RADIX__ 2
#define __FMA__ 1
#define __GCC_ATOMIC_BOOL_LOCK_FREE 2
#define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2
#define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2
#define __GCC_ATOMIC_CHAR_LOCK_FREE 2
#define __GCC_ATOMIC_INT_LOCK_FREE 2
#define __GCC_ATOMIC_LLONG_LOCK_FREE 2
#define __GCC_ATOMIC_LONG_LOCK_FREE 2
#define __GCC_ATOMIC_POINTER_LOCK_FREE 2
#define __GCC_ATOMIC_SHORT_LOCK_FREE 2
#define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1
#define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2
#define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1
#define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1
#define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1
#define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1
#define _GLIBCXX_NO_BUILTIN_HAS_UNIQ_OBJ_REP 1
#define _GLIBCXX_NO_BUILTIN_LAUNDER 1
#define __GNUC__ 7
#define __GNUC_GNU_INLINE__ 1
#define __GNUC_MINOR__ 0
#define __GNUC_PATCHLEVEL__ 0
#define __GNUC_STDC_INLINE__ 1
#define __gnu_linux__ 1
#define __GXX_ABI_VERSION 1010
#define __ICC 1700
#define __INT16_MAX__ 32767
#define __INT16_TYPE__ short
#define __INT32_MAX__ 2147483647
#define __INT32_TYPE__ int
#define __INT64_MAX__ 9223372036854775807L
#define __INT64_TYPE__ long
#define __INT8_MAX__ 127
#define __INT8_TYPE__ signed char
#define __INTEL_COMPILER 1700
#define __INTEL_COMPILER_BUILD_DATE 20170411
#define __INTEL_COMPILER_UPDATE 4
#define __INTEL_OFFLOAD 1
#define __INTEL_RTTI__ 1
#define __INT_FAST16_MAX__ 9223372036854775807L
#define __INT_FAST16_TYPE__ long
#define __INT_FAST32_MAX__ 9223372036854775807L
#define __INT_FAST32_TYPE__ long
#define __INT_FAST64_MAX__ 9223372036854775807L
#define __INT_FAST64_TYPE__ long
#define __INT_FAST8_MAX__ 127
#define __INT_FAST8_TYPE__ char
#define __INT_LEAST16_MAX__ 32767
#define __INT_LEAST16_TYPE__ short
#define __INT_LEAST32_MAX__ 2147483647
#define __INT_LEAST32_TYPE__ int
#define __INT_LEAST64_MAX__ 9223372036854775807L
#define __INT_LEAST64_TYPE__ long
#define __INT_LEAST8_MAX__ 127
#define __INT_LEAST8_TYPE__ char
#define __INT_MAX__ 2147483647
#define __INTMAX_MAX__ 0x7fffffffffffffff
#define __INTMAX_TYPE__ long int
#define __INTPTR_MAX__ 9223372036854775807L
#define __INTPTR_TYPE__ long
#define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L
#define __LDBL_DIG__ 18
#define __LDBL_EPSILON__ 1.08420217248550443401e-19L
#define __LDBL_HAS_DENORM__ 1
#define __LDBL_HAS_INFINITY__ 1
#define __LDBL_HAS_QUIET_NAN__ 1
#define __LDBL_MANT_DIG__ 64
#define __LDBL_MAX_10_EXP__ 4932
#define __LDBL_MAX__ 1.18973149535723176502e+4932L
#define __LDBL_MAX_EXP__ 16384
#define __LDBL_MIN_10_EXP__ -4931
#define __LDBL_MIN__ 3.36210314311209350626e-4932L
#define __LDBL_MIN_EXP__ -16381
#define __linux 1
#define __linux__ 1
#define linux 1
#define __LONG_DOUBLE_SIZE__ 80
#define __LONG_LONG_MAX__ 0x7fffffffffffffff
#define __LONG_MAX__ 9223372036854775807L
#define __LP64__ 1
#define _LP64 1
#define __MMX__ 1
#define __NO_MATH_INLINES 1
#define __NO_STRING_INLINES 1
#define __OPTIMIZE__ 1
#define __ORDER_BIG_ENDIAN__ 4321
#define __ORDER_LITTLE_ENDIAN__ 1234
#define __ORDER_PDP_ENDIAN__ 3412
#define __pentium4 1
#define __pentium4__ 1
#define __PRAGMA_REDEFINE_EXTNAME 1
#define __PTRDIFF_MAX__ 9223372036854775807L
#define __PTRDIFF_TYPE__ long
#define __QMSPP_ 1
#define __REGISTER_PREFIX__
#define __SCHAR_MAX__ 127
#define __SHRT_MAX__ 32767
#define __SIG_ATOMIC_MAX__ 2147483647
#define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1)
#define __SIG_ATOMIC_TYPE__ int
#define __SIGNED_CHARS__ 1
#define __SIZE_MAX__ 18446744073709551615UL
#define __SIZEOF_DOUBLE__ 8
#define __SIZEOF_FLOAT__ 4
#define __SIZEOF_INT128__ 16
#define __SIZEOF_INT__ 4
#define __SIZEOF_LONG__ 8
#define __SIZEOF_LONG_DOUBLE__ 16
#define __SIZEOF_LONG_LONG__ 8
#define __SIZEOF_POINTER__ 8
#define __SIZEOF_PTRDIFF_T__ 8
#define __SIZEOF_SHORT__ 2
#define __SIZEOF_SIZE_T__ 8
#define __SIZEOF_WCHAR_T__ 4
#define __SIZEOF_WINT_T__ 4
#define __SIZE_TYPE__ unsigned long
#define __SSE__ 1
#define __SSE2__ 1
#define __SSE2_MATH__ 1
#define __SSE3__ 1
#define __SSE4_1__ 1
#define __SSE4_2__ 1
#define __SSE_MATH__ 1
#define __SSSE3__ 1
#define __STDC__ 1
#define __STDC_DEC_FP__ 200704L
#define __STDC_HOSTED__ 1
#define __STDC_NO_ATOMICS__ 1
#define __STDC_UTF_16__ 1
#define __STDC_UTF_32__ 1
#define __STDC_VERSION__ 201112L
#define __TIME__ "09:48:47"
#define __tune_pentium4__ 1
#define __UINT16_MAX__ 65535
#define __UINT16_TYPE__ unsigned short
#define __UINT32_MAX__ 4294967295U
#define __UINT32_TYPE__ unsigned int
#define __UINT64_MAX__ 18446744073709551615UL
#define __UINT64_TYPE__ unsigned long
#define __UINT8_MAX__ 255
#define __UINT8_TYPE__ unsigned char
#define __UINT_FAST16_MAX__ 18446744073709551615UL
#define __UINT_FAST16_TYPE__ unsigned long
#define __UINT_FAST32_MAX__ 18446744073709551615UL
#define __UINT_FAST32_TYPE__ unsigned long
#define __UINT_FAST64_MAX__ 18446744073709551615UL
#define __UINT_FAST64_TYPE__ unsigned long
#define __UINT_FAST8_MAX__ 255
#define __UINT_FAST8_TYPE__ unsigned char
#define __UINT_LEAST16_MAX__ 65535
#define __UINT_LEAST16_TYPE__ unsigned short
#define __UINT_LEAST32_MAX__ 4294967295U
#define __UINT_LEAST32_TYPE__ unsigned int
#define __UINT_LEAST64_MAX__ 18446744073709551615UL
#define __UINT_LEAST64_TYPE__ unsigned long
#define __UINT_LEAST8_MAX__ 255
#define __UINT_LEAST8_TYPE__ unsigned char
#define __UINTMAX_MAX__ 0xffffffffffffffff
#define __UINTMAX_TYPE__ long unsigned int
#define __UINTPTR_MAX__ 18446744073709551615UL
#define __UINTPTR_TYPE__ unsigned long
#define __unix 1
#define __unix__ 1
#define unix 1
#define __USER_LABEL_PREFIX__
#define __VERSION__ "Intel(R) C++ gcc 7.0 mode"
#define __WCHAR_MAX__ 2147483647
#define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1)
#define __WCHAR_TYPE__ int
#define __WINT_MAX__ 4294967295U
#define __WINT_MIN__ 0U
#define __WINT_TYPE__ unsigned int
#define __x86_64 1
#define __x86_64__ 1
skylake:$

 

0 Kudos

Accepted Solutions
Highlighted
34 Views

In the gcc compiler, you may need to add the "-madx" option which also enables the __ADX__ target specific macro. But that's not necessary on the Intel compiler and icc doesn't activate the target macro's in the same way that gcc does.

gcc -c -madx -dM -E test.c | grep ADX
#define __ADX__ 1

icc -c -madx -dM -E test.c | grep ADX
icc: command line warning #10159: invalid argument for option '-m'

See, without the -madx argument, gcc doesn't create the macro:

  gcc -c -dM -E test.c | grep ADX

I wrote a small test case to confirm that the adx instructions are being generated

 icc -c -S test.c 
 grep adcx test.s
        adcx      %rdx, %rsi                                    #5.9
 cat test.c

unsigned char fu(
  unsigned char c_in, unsigned __int64 a, unsigned __int64 b, unsigned __int64 * out)
{
 return _addcarryx_u64 (c_in, a, b, out);
}

View solution in original post

0 Kudos
4 Replies
Highlighted
Moderator
34 Views

 

Can you use intrinsics? _addcarryx_u32/addcarryx_u64

Regards,

Viet Hoang

0 Kudos
Highlighted
35 Views

In the gcc compiler, you may need to add the "-madx" option which also enables the __ADX__ target specific macro. But that's not necessary on the Intel compiler and icc doesn't activate the target macro's in the same way that gcc does.

gcc -c -madx -dM -E test.c | grep ADX
#define __ADX__ 1

icc -c -madx -dM -E test.c | grep ADX
icc: command line warning #10159: invalid argument for option '-m'

See, without the -madx argument, gcc doesn't create the macro:

  gcc -c -dM -E test.c | grep ADX

I wrote a small test case to confirm that the adx instructions are being generated

 icc -c -S test.c 
 grep adcx test.s
        adcx      %rdx, %rsi                                    #5.9
 cat test.c

unsigned char fu(
  unsigned char c_in, unsigned __int64 a, unsigned __int64 b, unsigned __int64 * out)
{
 return _addcarryx_u64 (c_in, a, b, out);
}

View solution in original post

0 Kudos
Highlighted
Beginner
34 Views

Thanks Viet Hoang and Melanie Blower. Please forgive my ignorance...

When using ICC, how do we know when a user wants a particular target feature? In the case of ADX, is it lumped into AVX2 (such that __AVX2__ signals it)?

If not, what happens when I want to build for a minimum target like AVX? How do I know _not_ to enable AVX2 and BMI2, and ADX in this case?

And I guess the other question is, why doesn't ICC define __ADX__ when in GCC mode? Its claiming to be GCC with __GNUC__ and friends.

0 Kudos
Highlighted
34 Views

I've created CMPLRS-6758 in our internal bugs database to request for the compiler to recognize -madx, and add the target macro too. Thanks for pointing this out. 

The compiler does accept many of gcc's -m options and the target macro symbols defined vary depending on that option, for example I saw these differences,

using icc -c
< -D__i686__
< -D__pentiumpro
using -c -mavx
> -D__SSE3__
> -D__SSSE3__
> -D__SSE4_1__
> -D__SSE4_2__
> -D__AVX__

I think if you actually have a call to a builtin intrinsic, the compiler doesn't check if it's supported (w.r.t. the -m option that you chose) it just assumes you know what you're doing.

 

 

0 Kudos