
cl-fast-relaxed-math and profiling tools

Altera_Forum

Hi, 

I have two questions:

First:

The OpenCL standard provides the -cl-fast-relaxed-math build option, which trades some accuracy for speed.

I tested my OpenCL code with this flag on Intel, NVIDIA, and AMD platforms.

On those platforms it gave a speedup of about 1x.

However, when I pass -cl-fast-relaxed-math to the AOCL compiler while compiling the OpenCL kernel code,

it does not seem to gain any performance. Does AOCL not support this flag yet?

 

Second:

I wrote an OpenCL program that calls clEnqueueNDRangeKernel many times (it enqueues repeatedly in a for loop). The host only makes these API calls and reads/writes buffers. However, between the host calling clEnqueueNDRangeKernel or a buffer read/write and the FPGA receiving the command and starting to execute the kernel code, there is an overhead of 10~100 ms. I cannot find a profiling tool that shows in detail where this time goes. Could anyone help with this problem?
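
To make the overhead question concrete, here is a minimal sketch of how the delay could be split into stages. It assumes the command queue was created with CL_QUEUE_PROFILING_ENABLE and that the four standard timestamps from clGetEventProfilingInfo (in nanoseconds) are reported by the runtime, which I have not confirmed for AOCL 14.1. The names event_times_ns, event_breakdown_ms, and breakdown_event are made up for illustration, and the code does not call OpenCL itself, it only does the arithmetic on timestamps already read back.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical container for the four timestamps (nanoseconds) that
 * clGetEventProfilingInfo returns for one enqueued command. */
typedef struct {
    uint64_t queued;  /* CL_PROFILING_COMMAND_QUEUED */
    uint64_t submit;  /* CL_PROFILING_COMMAND_SUBMIT */
    uint64_t start;   /* CL_PROFILING_COMMAND_START  */
    uint64_t end;     /* CL_PROFILING_COMMAND_END    */
} event_times_ns;

/* Hypothetical per-stage breakdown, in milliseconds. */
typedef struct {
    double queue_ms;   /* enqueued by host -> submitted to device   */
    double launch_ms;  /* submitted -> command starts executing     */
    double exec_ms;    /* command starts -> command finishes        */
} event_breakdown_ms;

static double ns_to_ms(uint64_t ns)
{
    return (double)ns / 1.0e6;
}

/* Fills *out and returns true when the timestamps are in order;
 * returns false (leaving *out untouched) when they are not. */
bool breakdown_event(const event_times_ns *t, event_breakdown_ms *out)
{
    if (t->queued > t->submit || t->submit > t->start || t->start > t->end) {
        return false;
    }
    out->queue_ms  = ns_to_ms(t->submit - t->queued);
    out->launch_ms = ns_to_ms(t->start - t->submit);
    out->exec_ms   = ns_to_ms(t->end - t->start);
    return true;
}
```

If queue_ms or launch_ms dominates for each iteration of the loop, the 10~100 ms would be spent before the kernel starts rather than in the kernel itself.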

 

SDK: 14.1

Platform: DE5

 

Thanks