Does anyone know whether Intel or any other vendor is planning GPU hardware with native support for binary128 (IEEE 754 quadruple precision)?
There is an interesting article published by IEEE describing a university research project that builds essentially what I am looking for. It is titled "High Performance High-Precision Floating-Point Operations on FPGAs using OpenCL".
OpenCL does not currently support binary128, but the Intel FPGA SDK for OpenCL lets an OpenCL kernel call a custom hardware module. That is how the researchers were able to use their custom binary128 hardware from inside OpenCL kernels.
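For anyone not familiar with the format, here is a minimal Python sketch of what a binary128 value looks like at the bit level: 1 sign bit, 15 exponent bits (bias 16383), and 112 fraction bits. The helper name `decode_binary128` is my own, purely for illustration; it has nothing to do with the article's hardware or with any OpenCL API, and it only decodes finite values.

```python
from fractions import Fraction

# IEEE 754 binary128 layout: 1 sign bit, 15 exponent bits, 112 fraction bits.
EXP_BITS = 15
FRAC_BITS = 112
BIAS = (1 << (EXP_BITS - 1)) - 1  # 16383


def decode_binary128(bits: int) -> Fraction:
    """Decode a 128-bit pattern into an exact rational value.

    Hypothetical helper for illustration only; handles finite values.
    """
    if not 0 <= bits < (1 << 128):
        raise ValueError("expected a 128-bit unsigned integer")

    sign = bits >> 127
    exponent = (bits >> FRAC_BITS) & ((1 << EXP_BITS) - 1)
    fraction = bits & ((1 << FRAC_BITS) - 1)

    if exponent == (1 << EXP_BITS) - 1:
        # All-ones exponent encodes infinity or NaN.
        raise ValueError("infinity or NaN has no rational value")

    if exponent == 0:
        # Subnormal: no implicit leading 1, exponent fixed at 1 - BIAS.
        significand = Fraction(fraction, 1 << FRAC_BITS)
        scale = Fraction(2) ** (1 - BIAS)
    else:
        # Normal: implicit leading 1 in front of the fraction bits.
        significand = 1 + Fraction(fraction, 1 << FRAC_BITS)
        scale = Fraction(2) ** (exponent - BIAS)

    magnitude = significand * scale
    return -magnitude if sign else magnitude


if __name__ == "__main__":
    one = 0x3FFF << FRAC_BITS                 # biased exponent 16383, fraction 0
    print(decode_binary128(one))
    print(decode_binary128(one | (1 << 111)))  # top fraction bit set
```

The implicit leading 1 gives 113 bits of significand, compared with 53 for binary64, which is the extra precision I am after in hardware.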