home.social

#opencl — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #opencl, aggregated by home.social.

  1. #FluidX3D #CFD v3.7 brings faster Q-criterion isosurface rendering with #OpenCL local memory optimization! 🖖🤠
    github.com/ProjectPhysX/FluidX

    Instead of 32 velocities for each #GPU thread, now an 8x8x8 workgroup loads & reuses 11x11x11 velocities in L1$, a 12x VRAM BW reduction.

    Fascinating insight: Which thread loads which cell from VRAM to L1$, and which thread renders which grid cell within the workgroup, can be very different!
    github.com/ProjectPhysX/FluidX

    PS: plugged X-wing Gif in #GitHub preview 🖖😜
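
The "12x" figure in the post above can be checked with simple arithmetic, and the point about load and render mappings being different can be illustrated with a small model. The sketch below is only an assumption about how such a scheme could be laid out: the strided load assignment and the helper names `cells_loaded_by` and `cell_rendered_by` are hypothetical and are not taken from the FluidX3D source.

```python
# Rough check of the numbers in the post, plus one possible thread-to-cell
# mapping. The strided mapping is an illustrative assumption, not FluidX3D code.

WG = 8                  # workgroup edge length: 8x8x8 threads
TILE = 11               # tile edge length held in local memory: 11x11x11 velocities
LOADS_PER_THREAD = 32   # velocities each thread fetched from VRAM before the change

threads = WG ** 3                          # 512 threads per workgroup
naive_loads = threads * LOADS_PER_THREAD   # 16384 VRAM loads without sharing
tile_loads = TILE ** 3                     # 1331 VRAM loads with a shared tile
reduction = naive_loads / tile_loads       # about 12.3


def cells_loaded_by(thread_id):
    """Tile cells that one thread copies from VRAM into local memory.

    Assumed strided assignment: thread t takes linear tile indices
    t, t + 512, t + 1024, ... until the 1331 tile cells are covered.
    """
    cells = []
    for linear in range(thread_id, tile_loads, threads):
        x = linear % TILE
        y = (linear // TILE) % TILE
        z = linear // (TILE * TILE)
        cells.append((x, y, z))
    return cells


def cell_rendered_by(thread_id):
    """Grid cell inside the 8x8x8 workgroup that one thread renders."""
    x = thread_id % WG
    y = (thread_id // WG) % WG
    z = thread_id // (WG * WG)
    return (x, y, z)
```

In this model, thread 0 renders cell (0, 0, 0) but loads tile cells (0, 0, 0), (6, 2, 4) and (1, 5, 8), so the two index mappings are unrelated. In a real kernel a workgroup barrier would have to separate the load phase from the render phase.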

  6. OpenCL 3.1 is here.

    The Khronos Group has moved several capabilities into the core spec, including SPIR-V kernels, subgroups, and integer dot products.

    Also includes improvements to the memory model and synchronization, plus better alignment with Vulkan via device UUID queries.

    Implementations are already underway across major vendors and open source projects.

    - Full Blog: khronos.org/blog/opencl-3.1-is
    - OpenCL specification GitHub
    - Khronos Discord

    #OpenCL #HPC #GPU #Compute #SPIRV
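
Of the features named in the post above, the integer dot product is the easiest to show in a few lines. The sketch below emulates a signed 4x8-bit packed dot product with an accumulator in plain Python. It is a reference model only: the function names are hypothetical, the lane order (lowest byte first) is an assumption, and overflow or saturation behaviour is not modelled.

```python
def unpack_s8x4(word):
    """Split a 32-bit word into four signed 8-bit lanes, lowest byte first."""
    lanes = []
    for i in range(4):
        byte = (word >> (8 * i)) & 0xFF
        lanes.append(byte - 256 if byte >= 128 else byte)
    return lanes


def pack_s8x4(lanes):
    """Pack four signed 8-bit values into one 32-bit word, lowest byte first."""
    word = 0
    for i, value in enumerate(lanes):
        word |= (value & 0xFF) << (8 * i)
    return word


def dot_4x8packed_ss(a, b, acc=0):
    """Sum of lane-wise products of two packed words, added to acc.

    Hypothetical helper name; Python integers do not wrap, so no
    32-bit overflow or saturation is emulated here.
    """
    products = (x * y for x, y in zip(unpack_s8x4(a), unpack_s8x4(b)))
    return acc + sum(products)
```

For example, packing [1, -2, 3, -4] and [5, 6, -7, 8] and calling `dot_4x8packed_ss` on the two words gives 5 - 12 - 21 - 32 = -60.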

  11. Newest #IntelArc #GPU family member is here, the Panther Lake Arc B390... and it... purrs? 🖖 🥺 🐈‍⬛
    My OpenCL-Benchmark on the B390 measures ~7.4 TFlops FP32 and ~120GB/s memory bandwidth. hw-smi also works with the B390.
    #FluidX3D benchmarks here: github.com/ProjectPhysX/FluidX
    And the #OpenCL infos:
    - Arc B390: opencl.gpuinfo.org/displayrepo
    - Core Ultra X7 358H: opencl.gpuinfo.org/displayrepo
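
The two measured figures in the post above can be combined into a compute-to-bandwidth ratio. The sketch below uses a simple roofline-style estimate, which is an interpretation added here and not something the post claims; the constant and function names are hypothetical.

```python
# Figures quoted in the post (approximate, as measured by the author).
PEAK_FP32_FLOPS = 7.4e12      # ~7.4 TFlops FP32
MEM_BANDWIDTH_BYTES = 120e9   # ~120 GB/s


def ridge_point(peak_flops=PEAK_FP32_FLOPS, bandwidth=MEM_BANDWIDTH_BYTES):
    """FLOPs per byte at which compute and memory limits are equal."""
    return peak_flops / bandwidth


def attainable_flops(intensity, peak_flops=PEAK_FP32_FLOPS,
                     bandwidth=MEM_BANDWIDTH_BYTES):
    """Upper bound on throughput for a kernel doing `intensity` FLOPs per byte."""
    return min(peak_flops, intensity * bandwidth)
```

With these numbers the ridge point is roughly 62 FLOPs per byte, so under this model a kernel performing fewer operations per byte moved would be limited by memory bandwidth and not by FP32 throughput.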

  16. The OpenCL Working Group has published the first in a series of cooperative matrix extensions — and your feedback can help shape them before finalization.

    cl_khr_cooperative_matrix brings cooperative matrix load, store, and multiply-add to OpenCL, developed with Arm, Intel, and Qualcomm. A companion OpenCL C language extension is also in RFC.

    Review and comment:
    🔗 Spec draft: github.com/KhronosGroup/OpenCL
    🔗 Clang RFC: discourse.llvm.org/t/rfc-clang
    🔗 Full blog: khronos.org/blog/opencl-cooper

    #OpenCL #SPIRV
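
The three operations named in the post above (load, store, multiply-add on matrix tiles) can be modelled on the host to show how they compose into a matrix multiplication. This is a conceptual sketch in plain Python, assuming row-major buffers and tile sizes that divide the matrix dimensions; the function names are hypothetical and do not reflect the API in the draft extension.

```python
def tile_load(buf, offset, stride, rows, cols):
    """Read a rows x cols tile from a flat row-major buffer."""
    return [[buf[offset + r * stride + c] for c in range(cols)]
            for r in range(rows)]


def tile_store(buf, offset, stride, tile):
    """Write a tile back into a flat row-major buffer."""
    for r, row in enumerate(tile):
        for c, value in enumerate(row):
            buf[offset + r * stride + c] = value


def tile_mul_add(a, b, c):
    """Return a @ b + c for small tiles given as nested lists."""
    inner = len(b)
    return [[c[i][j] + sum(a[i][k] * b[k][j] for k in range(inner))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def matmul_tiled(A, B, M, N, K, T):
    """Compute C = A @ B (A is M x K, B is K x N) one T x T tile at a time."""
    C = [0] * (M * N)
    for i in range(0, M, T):
        for j in range(0, N, T):
            acc = tile_load(C, i * N + j, N, T, T)      # starts as zeros
            for k in range(0, K, T):
                a = tile_load(A, i * K + k, K, T, T)
                b = tile_load(B, k * N + j, N, T, T)
                acc = tile_mul_add(a, b, acc)           # accumulate partial product
            tile_store(C, i * N + j, N, acc)
    return C
```

On a device, each tile would be held collectively by a group of work-items and the multiply-add would map to matrix hardware where available; the loop structure is the part this sketch is meant to convey.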

  21. IWOCL 2026 is next week!

    Join the global OpenCL and SYCL community in Heilbronn, Germany (May 6–8) for the premier forum dedicated to open compute languages and heterogeneous platform programming. The program includes the latest technical talks, Khronos Working Group updates, application case studies, and ample opportunity to connect with peers across industry and academia.

    Registration remains open: www.iwocl.org

    See you there.

    #IWOCL #OpenCL #SYCL #HPC #Heterogeneous #Compute

  26. The countdown is on — IWOCL 2026 is just two weeks away.

    Join the global OpenCL and SYCL community in Heilbronn, Germany (May 6–8) for the premier forum dedicated to open compute languages and heterogeneous platform programming. Expect the latest technical talks, Khronos Working Group updates, and ample opportunity to connect with peers across industry and academia.

    Registration is open: www.iwocl.org

    #IWOCL #OpenCL #SYCL #HPC #Khronos #HeterogeneousComputing

  31. Training an LLM from scratch in C#

    We'll write a small 422 KB model from scratch in C#, save it as GGUF, and run it in LM Studio. A single component is all we need for this: ILGPU, which lets us train the model on OpenCL. More precisely, on AMD integrated graphics.

    habr.com/ru/articles/1017484/

    #opencl #ии_и_машинное_обучение #ai #c# #иимодель #разработка
