Intel® High Level Design
Support for Intel® High Level Synthesis Compiler, DSP Builder, OneAPI for Intel® FPGAs, Intel® FPGA SDK for OpenCL™
663 Discussions

parallel prefix sum OpenCL implementation

beiwang2003
Beginner
1,522 Views

I am wondering if anyone knows an efficient parallel prefix sum OpenCL implementation for FPGA. I am currently using the one at CLPP , but it is extremely slow. I guess it makes sense since it was developed earlier for GPU. Anyone knows an open source parallel prefix sum optimized for FPGA? Thanks

0 Kudos
2 Replies
AnilErinch_A_Intel
1,505 Views

Hi ,

If you have the CUDA code for the prefix sum , then you can convert it to DPC++ and then try to compile the DPC++ code for FPGA.

https://software.intel.com/content/www/us/en/develop/documentation/oneapi-programming-guide/top/software-development-process/migrating-code-to-dpc/migrating-from-cuda-to-dpc.html

Thanks and Regards

Anil


0 Kudos
AnilErinch_A_Intel
1,452 Views

Hi ,

Please let us know if the previous suggestions was helpful.

So that we can treat this issue as closed.

Thanks and Regards

Anil


0 Kudos
Reply