In the document GVCS001-The Compute Architecture of Intel Processor Graphics Gen8.pdf
"Finally, one of the FPUs provides extended math capability to support high-throughput transcendental math functions and double precision 64-bit floating-point."
Does this mean it's possible for Intel GPU OpenCL to one day fully support cl_khr_fp64? :)
Does the FPU also do high-throughput double precision transcendental math functions?
Before someone mentions Xeons and AVX, there is no reason Intel cannot give both options and let the market decide.
Perhaps you could release a pro part (Xeon with IGP) with both FPUs supporting double precision...
Sorry for the delayed reply. I agree with you that fp64 would be very nice to have. At this point I don't have any details, but let me see if I can find any updates.
@Robert Ioffe, thank you. It's great to hear it's being looked at.
@alanmac I think Nvidia target their fp64 performance to match Intel CPU fp64 performance, i.e. so that you're not at a disadvantage vs. the CPU when targeting CUDA/OpenCL for your fp64 application. AMD doesn't do this; they try to give as much as they can without significantly compromising their general graphics (game) performance. My AMD 290X gives me 700 GFLOPS (double precision). Even my old AMD HD 7770 was faster than my Haswell i7's AVX OpenCL (my app uses fp64 rsqrt).
For what it's worth, this is my app: https://sourceforge.net/projects/openclsolarsyst/ It needs a lot of double precision grunt; I need double precision in order to have long-term accuracy (and stability). All of the computation and graphics run completely on the GPU. Data is only copied back to CPU memory when close encounter detection is turned on (so that it can be dumped to a log).
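As a rough illustration of why single precision drifts over long runs, here is a toy sketch in plain C. It is not code from the app above. The function names and the unit-velocity setup are made up for the example: it just adds a fixed time step to a position many times in float and in double, then compares each result with the exact value.

```c
/* Toy drift comparison: accumulate x += dt for `steps` steps (velocity = 1)
 * and return the absolute error against the exact result steps * dt.
 * Both function names are hypothetical, chosen for this illustration. */

static double abs_diff(double a, double b)
{
    double d = a - b;
    return d < 0.0 ? -d : d;
}

/* Single-precision accumulator: once x is large, each added dt is rounded
 * to the nearest representable float, so the error builds up step by step. */
double drift_error_float(long steps, double dt)
{
    float x = 0.0f;
    float step = (float)dt;
    for (long i = 0; i < steps; i++) {
        x += step;
    }
    return abs_diff((double)x, (double)steps * dt);
}

/* Double-precision accumulator: same loop, far smaller rounding per step. */
double drift_error_double(long steps, double dt)
{
    double x = 0.0;
    for (long i = 0; i < steps; i++) {
        x += dt;
    }
    return abs_diff(x, (double)steps * dt);
}
```

Calling both with, say, one million steps of 0.001 should show the float version drifting by a visible amount while the double version stays very close to the exact value. A real N-body integrator has many more error sources than this, so treat it only as a picture of the accumulation effect.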
Something else to consider:
From http://en.libreofficeforum.org/node/9119 in the section "Is there a minimum OpenCL version required by LibreOffice?"
"According to /core/sc/source/core/opencl/opencl_device.cxx the criteria appears based on the presence of these double floating-point (64bit) precision extensions:
cl_khr_fp64, which is OpenCL v1.0 compliant.
"And Precision is nonnegotiable for spreadsheets IEEE 754 required"
I believe that in future AMD/Intel APUs the IGP section will grow in size much faster than the number of CPU cores and/or the AVX vector width.
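The kind of extension check described in the LibreOffice quote above can be sketched like this. This is a minimal sketch, not LibreOffice's actual code. The helper name has_extension is made up, and it assumes the extension list is the usual space-separated string that host code gets from clGetDeviceInfo with CL_DEVICE_EXTENSIONS. The string is passed in as a parameter so the example has no OpenCL dependency.

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Returns true if `name` appears as a whole space-separated token in
 * `extensions`. Assumption: in real host code `extensions` would be the
 * string returned by clGetDeviceInfo(..., CL_DEVICE_EXTENSIONS, ...).
 * has_extension is a hypothetical helper, not part of the OpenCL API. */
bool has_extension(const char *extensions, const char *name)
{
    size_t len = strlen(name);
    if (len == 0) {
        return false;
    }

    const char *p = extensions;
    while ((p = strstr(p, name)) != NULL) {
        /* Accept the match only if it is a complete token, so that a
         * longer name merely containing `name` is not counted. */
        bool starts_token = (p == extensions) || (p[-1] == ' ');
        bool ends_token = (p[len] == '\0') || (p[len] == ' ');
        if (starts_token && ends_token) {
            return true;
        }
        p++; /* keep searching after this partial match */
    }
    return false;
}
```

A device would pass the fp64 criterion if has_extension(ext_string, "cl_khr_fp64") returns true. Matching whole tokens rather than using a bare strstr is a deliberate choice here, since a plain substring search would also accept any longer extension name that happens to start with cl_khr_fp64.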