<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Running int8 model on Intel-Optimized-Tensorflow in Intel® Optimized AI Frameworks</title>
    <link>https://community.intel.com/t5/Intel-Optimized-AI-Frameworks/Running-int8-model-on-Intel-Optimized-Tensorflow/m-p/1172609#M99</link>
    <description>&lt;P&gt;I read the article&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.intel.ai/accelerating-tensorflow-inference-with-intel-deep-learning-boost-on-2nd-gen-intel-xeon-scalable-processors/#gs.rzwuy9" target="_blank"&gt;https://www.intel.ai/accelerating-tensorflow-inference-with-intel-deep-learning-boost-on-2nd-gen-intel-xeon-scalable-processors/#gs.rzwuy9&lt;/A&gt;&lt;/P&gt;&lt;P&gt;It mentioned that the 2nd generation instructions such as AVX512_VNNI are optimized for Neural Network&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;&lt;P&gt;I ran one of INT8 models in IntelAI&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/IntelAI/models/tree/master/benchmarks" target="_blank"&gt;https://github.com/IntelAI/models/tree/master/benchmarks&lt;/A&gt;&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;&lt;P&gt;Here is my environment&lt;/P&gt;&lt;P&gt;- Docker:&amp;nbsp;docker.io/intelaipg/intel-optimized-tensorflow:latest&lt;/P&gt;&lt;P&gt;- CPU info&lt;/P&gt;&lt;P&gt;Architecture:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;x86_64&lt;/P&gt;&lt;P&gt;CPU op-mode(s):&amp;nbsp;&amp;nbsp;&amp;nbsp;32-bit, 64-bit&lt;/P&gt;&lt;P&gt;Byte Order:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;Little Endian&lt;/P&gt;&lt;P&gt;CPU(s):&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;96&lt;/P&gt;&lt;P&gt;On-line CPU(s) list: 0-95&lt;/P&gt;&lt;P&gt;Thread(s) per core:&amp;nbsp;2&lt;/P&gt;&lt;P&gt;Core(s) per socket:&amp;nbsp;24&lt;/P&gt;&lt;P&gt;Socket(s):&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;2&lt;/P&gt;&lt;P&gt;NUMA node(s):&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;2&lt;/P&gt;&lt;P&gt;Vendor ID:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;GenuineIntel&lt;/P&gt;&lt;P&gt;CPU family:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;6&lt;/P&gt;&lt;P&gt;Model:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;85&lt;/P&gt;&lt;P&gt;Model name:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz&lt;/P&gt;&lt;P&gt;Stepping:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;7&lt;/P&gt;&lt;P&gt;CPU MHz:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;1838.080&lt;/P&gt;&lt;P&gt;BogoMIPS:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;5000.00&lt;/P&gt;&lt;P&gt;Hypervisor vendor:&amp;nbsp;&amp;nbsp;KVM&lt;/P&gt;&lt;P&gt;Virtualization type: full&lt;/P&gt;&lt;P&gt;L1d cache:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;32K&lt;/P&gt;&lt;P&gt;L1i cache:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;32K&lt;/P&gt;&lt;P&gt;L2 cache:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;1024K&lt;/P&gt;&lt;P&gt;L3 cache:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;36608K&lt;/P&gt;&lt;P&gt;NUMA node0 CPU(s):&amp;nbsp;&amp;nbsp;0-23,48-71&lt;/P&gt;&lt;P&gt;NUMA node1 CPU(s):&amp;nbsp;&amp;nbsp;24-47,72-95&lt;/P&gt;&lt;P&gt;Flags:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;&lt;P&gt;I expect to run the Neural Network by the 2 gen instructions (AVX512_VNNI)&lt;/P&gt;&lt;P&gt;but it shows that the following optimized instructions are used:&lt;/P&gt;&lt;P&gt;AVX512F, AVX2, FMA&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;&lt;P&gt;Is the docker image the optimized version to run Neural Network?&lt;/P&gt;&lt;P&gt;​How can I get the information whether AVX512_VNNI is used or not?&lt;/P&gt;&lt;P&gt;How can I compile the code provided by IntelAI by the 2 gen Intel instructions?&lt;/P&gt;&lt;P&gt;Which docker image can I use to run the program?&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;&lt;P&gt;Thanks in advance&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;</description>
    <pubDate>Thu, 16 Jan 2020 03:24:11 GMT</pubDate>
    <dc:creator>CHung</dc:creator>
    <dc:date>2020-01-16T03:24:11Z</dc:date>
    <item>
      <title>Running int8 model on Intel-Optimized-Tensorflow</title>
      <link>https://community.intel.com/t5/Intel-Optimized-AI-Frameworks/Running-int8-model-on-Intel-Optimized-Tensorflow/m-p/1172609#M99</link>
      <description>&lt;P&gt;I read the article&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.intel.ai/accelerating-tensorflow-inference-with-intel-deep-learning-boost-on-2nd-gen-intel-xeon-scalable-processors/#gs.rzwuy9" target="_blank"&gt;https://www.intel.ai/accelerating-tensorflow-inference-with-intel-deep-learning-boost-on-2nd-gen-intel-xeon-scalable-processors/#gs.rzwuy9&lt;/A&gt;&lt;/P&gt;&lt;P&gt;It mentioned that the 2nd generation instructions such as AVX512_VNNI are optimized for Neural Network&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;&lt;P&gt;I ran one of INT8 models in IntelAI&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/IntelAI/models/tree/master/benchmarks" target="_blank"&gt;https://github.com/IntelAI/models/tree/master/benchmarks&lt;/A&gt;&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;&lt;P&gt;Here is my environment&lt;/P&gt;&lt;P&gt;- Docker:&amp;nbsp;docker.io/intelaipg/intel-optimized-tensorflow:latest&lt;/P&gt;&lt;P&gt;- CPU info&lt;/P&gt;&lt;P&gt;Architecture:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;x86_64&lt;/P&gt;&lt;P&gt;CPU op-mode(s):&amp;nbsp;&amp;nbsp;&amp;nbsp;32-bit, 64-bit&lt;/P&gt;&lt;P&gt;Byte Order:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;Little Endian&lt;/P&gt;&lt;P&gt;CPU(s):&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;96&lt;/P&gt;&lt;P&gt;On-line CPU(s) list: 0-95&lt;/P&gt;&lt;P&gt;Thread(s) per core:&amp;nbsp;2&lt;/P&gt;&lt;P&gt;Core(s) per socket:&amp;nbsp;24&lt;/P&gt;&lt;P&gt;Socket(s):&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;2&lt;/P&gt;&lt;P&gt;NUMA node(s):&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;2&lt;/P&gt;&lt;P&gt;Vendor ID:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;GenuineIntel&lt;/P&gt;&lt;P&gt;CPU family:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;6&lt;/P&gt;&lt;P&gt;Model:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;85&lt;/P&gt;&lt;P&gt;Model name:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz&lt;/P&gt;&lt;P&gt;Stepping:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;7&lt;/P&gt;&lt;P&gt;CPU MHz:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;1838.080&lt;/P&gt;&lt;P&gt;BogoMIPS:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;5000.00&lt;/P&gt;&lt;P&gt;Hypervisor vendor:&amp;nbsp;&amp;nbsp;KVM&lt;/P&gt;&lt;P&gt;Virtualization type: full&lt;/P&gt;&lt;P&gt;L1d cache:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;32K&lt;/P&gt;&lt;P&gt;L1i cache:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;32K&lt;/P&gt;&lt;P&gt;L2 cache:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;1024K&lt;/P&gt;&lt;P&gt;L3 cache:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;36608K&lt;/P&gt;&lt;P&gt;NUMA node0 CPU(s):&amp;nbsp;&amp;nbsp;0-23,48-71&lt;/P&gt;&lt;P&gt;NUMA node1 CPU(s):&amp;nbsp;&amp;nbsp;24-47,72-95&lt;/P&gt;&lt;P&gt;Flags:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;&lt;P&gt;I expect to run the Neural Network by the 2 gen instructions (AVX512_VNNI)&lt;/P&gt;&lt;P&gt;but it shows that the following optimized instructions are used:&lt;/P&gt;&lt;P&gt;AVX512F, AVX2, FMA&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;&lt;P&gt;Is the docker image the optimized version to run Neural Network?&lt;/P&gt;&lt;P&gt;​How can I get the information whether AVX512_VNNI is used or not?&lt;/P&gt;&lt;P&gt;How can I compile the code provided by IntelAI by the 2 gen Intel instructions?&lt;/P&gt;&lt;P&gt;Which docker image can I use to run the program?&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;&lt;P&gt;Thanks in advance&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;</description>
      <pubDate>Thu, 16 Jan 2020 03:24:11 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Optimized-AI-Frameworks/Running-int8-model-on-Intel-Optimized-Tensorflow/m-p/1172609#M99</guid>
      <dc:creator>CHung</dc:creator>
      <dc:date>2020-01-16T03:24:11Z</dc:date>
    </item>
    <item>
      <title>Duplicated thread. Please</title>
      <link>https://community.intel.com/t5/Intel-Optimized-AI-Frameworks/Running-int8-model-on-Intel-Optimized-Tensorflow/m-p/1172610#M100</link>
      <description>&lt;P&gt;Duplicated thread. Please refer to&amp;nbsp;https://software.intel.com/en-us/forums/intel-optimized-ai-frameworks/topic/843478#comment-1951298&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jan 2020 05:27:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Optimized-AI-Frameworks/Running-int8-model-on-Intel-Optimized-Tensorflow/m-p/1172610#M100</guid>
      <dc:creator>Jing_Xu</dc:creator>
      <dc:date>2020-01-20T05:27:32Z</dc:date>
    </item>
  </channel>
</rss>

