home.social

#adaptivecpp — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #adaptivecpp, aggregated by home.social.

  1. My latest paper from IWOCL'25 is now available. Here's one of #AdaptiveCpp's secrets revealed which explains why it often manages to outperform vendor compilers: dl.acm.org/doi/full/10.1145/37

  2. My latest paper from IWOCL'25 is now available. Here's one of #AdaptiveCpp's secrets revealed which explains why it often manages to outperform vendor compilers: dl.acm.org/doi/full/10.1145/37

  3. My latest paper from IWOCL'25 is now available. Here's one of #AdaptiveCpp's secrets revealed which explains why it often manages to outperform vendor compilers: dl.acm.org/doi/full/10.1145/37

  4. My latest paper from IWOCL'25 is now available. Here's one of #AdaptiveCpp's secrets revealed which explains why it often manages to outperform vendor compilers: dl.acm.org/doi/full/10.1145/37

  5. My latest paper from IWOCL'25 is now available. Here's one of #AdaptiveCpp's secrets revealed which explains why it often manages to outperform vendor compilers: dl.acm.org/doi/full/10.1145/37

  6. Managed to get an #Intel Arc A750 #gpu running on #risc_v using #OpenCL, #SYCL, and #AdaptiveCpp. Software PR's submitted for review.

    #hpc #supercomputing

    @risc_v

  7. / 2020 / ++ / Is starting to look pretty nice. I’m not sure if there is a story for mobile devices and older devices. It seems that OpenCL 1.2 isn’t enough for SYCL 2020, but unsure. Also not sure how Vulcan fits in here. It’s also unclear how it scales to smaller kernels and data. The API looks like kernel/buffer/queue. Not sure if you can queue up many kernels or if they can be pipelined.

    How does the new IR that llvm is using fit in here? So many questions.

  8. For a while (and still AFAIK) you needed to build clang from source to get #OpenMP working on MacOS.

    Remember running into that with #AdaptiveCPP+#Spack

  9. #oneAPI / #SYCL 2020 / #DPC++ / #AdaptiveCpp Is starting to look pretty nice. I’m not sure if there is a story for mobile devices and older devices. It seems that OpenCL 1.2 isn’t enough for SYCL 2020, but unsure. Also not sure how Vulcan fits in here. It’s also unclear how it scales to smaller kernels and data. The API looks like kernel/buffer/queue. Not sure if you can queue up many kernels or if they can be pipelined.

    How does the new IR that llvm is using fit in here? So many questions.

  10. #oneAPI / #SYCL 2020 / #DPC++ / #AdaptiveCpp Is starting to look pretty nice. I’m not sure if there is a story for mobile devices and older devices. It seems that OpenCL 1.2 isn’t enough for SYCL 2020, but unsure. Also not sure how Vulcan fits in here. It’s also unclear how it scales to smaller kernels and data. The API looks like kernel/buffer/queue. Not sure if you can queue up many kernels or if they can be pipelined.

    How does the new IR that llvm is using fit in here? So many questions.

  11. #oneAPI / #SYCL 2020 / #DPC++ / #AdaptiveCpp Is starting to look pretty nice. I’m not sure if there is a story for mobile devices and older devices. It seems that OpenCL 1.2 isn’t enough for SYCL 2020, but unsure. Also not sure how Vulcan fits in here. It’s also unclear how it scales to smaller kernels and data. The API looks like kernel/buffer/queue. Not sure if you can queue up many kernels or if they can be pipelined.

    How does the new IR that llvm is using fit in here? So many questions.

  12. #oneAPI / #SYCL 2020 / #DPC++ / #AdaptiveCpp Is starting to look pretty nice. I’m not sure if there is a story for mobile devices and older devices. It seems that OpenCL 1.2 isn’t enough for SYCL 2020, but unsure. Also not sure how Vulcan fits in here. It’s also unclear how it scales to smaller kernels and data. The API looks like kernel/buffer/queue. Not sure if you can queue up many kernels or if they can be pipelined.

    How does the new IR that llvm is using fit in here? So many questions.

  13. Like C++ standard parallelism offload? #AdaptiveCpp substantially outperforms NVIDIA HPC SDK's nvc++ in C++ PSTL offloading for the LULESH app on A100. Perhaps this a good moment to reconsider your use of vendor compilers 😉 (Also, the same AdaptiveCpp binary runs runs on AMD/Intel hw too!)

  14. Like C++ standard parallelism offload? #AdaptiveCpp substantially outperforms NVIDIA HPC SDK's nvc++ in C++ PSTL offloading for the LULESH app on A100. Perhaps this a good moment to reconsider your use of vendor compilers 😉 (Also, the same AdaptiveCpp binary runs runs on AMD/Intel hw too!)

  15. Like C++ standard parallelism offload? #AdaptiveCpp substantially outperforms NVIDIA HPC SDK's nvc++ in C++ PSTL offloading for the LULESH app on A100. Perhaps this a good moment to reconsider your use of vendor compilers 😉 (Also, the same AdaptiveCpp binary runs runs on AMD/Intel hw too!)

  16. Like C++ standard parallelism offload? #AdaptiveCpp substantially outperforms NVIDIA HPC SDK's nvc++ in C++ PSTL offloading for the LULESH app on A100. Perhaps this a good moment to reconsider your use of vendor compilers 😉 (Also, the same AdaptiveCpp binary runs runs on AMD/Intel hw too!)

  17. Like C++ standard parallelism offload? #AdaptiveCpp substantially outperforms NVIDIA HPC SDK's nvc++ in C++ PSTL offloading for the LULESH app on A100. Perhaps this a good moment to reconsider your use of vendor compilers 😉 (Also, the same AdaptiveCpp binary runs runs on AMD/Intel hw too!)

  18. For a while (and still AFAIK) you needed to build clang from source to get #OpenMP working on MacOS.

    Remember running into that with #AdaptiveCPP+#Spack

  19. For a while (and still AFAIK) you needed to build clang from source to get #OpenMP working on MacOS.

    Remember running into that with #AdaptiveCPP+#Spack

  20. For a while (and still AFAIK) you needed to build clang from source to get #OpenMP working on MacOS.

    Remember running into that with #AdaptiveCPP+#Spack

  21. For a while (and still AFAIK) you needed to build clang from source to get #OpenMP working on MacOS.

    Remember running into that with #AdaptiveCPP+#Spack