home.social

#opencl — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #opencl, aggregated by home.social.

  1. #FluidX3D #CFD v3.7 brings faster Q-criterion isosurface rendering with #OpenCL local memory optimization! 🖖🤠
    github.com/ProjectPhysX/FluidX

    Instead of 32 velocities for each #GPU thread, now an 8x8x8 workgroup loads & reuses 11x11x11 velocities in L1$, a 12x VRAM BW reduction.

    Fascinating insight: Which thread loads which cell from VRAM to L1$, and which thread renders which grid cell within the workgroup, can be very different!
    github.com/ProjectPhysX/FluidX

    PS: plugged X-wing Gif in #GitHub preview 🖖😜
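
A quick sanity check of the figures quoted in this post, as a minimal Python sketch. The workgroup size, tile size, and the old per-thread load count come from the post. The strided thread-to-cell mapping is only an assumption made for illustration and is not taken from the FluidX3D source. It shows one way the "which thread loads which cell" assignment can differ from the "which thread renders which cell" assignment.

```python
# Numbers quoted in the post.
WORKGROUP_EDGE = 8          # 8x8x8 threads per workgroup
TILE_EDGE = 11              # 11x11x11 velocities cached per workgroup
OLD_LOADS_PER_THREAD = 32   # velocities each thread used to load on its own

threads = WORKGROUP_EDGE ** 3                # threads in one workgroup
old_loads = threads * OLD_LOADS_PER_THREAD   # loads per workgroup before
new_loads = TILE_EDGE ** 3                   # loads per workgroup after
reduction = old_loads / new_loads            # VRAM traffic reduction factor


def cells_loaded_by(thread_id):
    """Hypothetical strided mapping (an assumption, not the author's code):
    thread t loads tile cells t, t + threads, t + 2 * threads, ...
    regardless of which grid cell that thread later renders."""
    return list(range(thread_id, new_loads, threads))


max_loads_per_thread = max(len(cells_loaded_by(t)) for t in range(threads))

print(f"old loads per workgroup: {old_loads}")
print(f"new loads per workgroup: {new_loads}")
print(f"reduction factor: {reduction:.1f}x")
print(f"max loads per thread with strided mapping: {max_loads_per_thread}")
```

Under these assumptions the ratio comes out at roughly 12, consistent with the "12x" in the post, and no thread needs to load more than three cells to fill the shared tile.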

  2. OpenCL 3.1 is here.

    The Khronos Group has moved several capabilities into the core spec, including SPIR-V kernels, subgroups, and integer dot products.

    Also includes improvements to the memory model and synchronization, plus better alignment with Vulkan via device UUID queries.

    Implementations are already underway across major vendors and open source projects.

    - Full Blog: khronos.org/blog/opencl-3.1-is
    - OpenCL specification GitHub
    - Khronos Discord

  3. The OpenCL Working Group has published the first in a series of cooperative matrix extensions — and your feedback can help shape them before finalization.

    cl_khr_cooperative_matrix brings cooperative matrix load, store, and multiply-add to OpenCL, developed with Arm, Intel, and Qualcomm. A companion OpenCL C language extension is also in RFC.

    Review and comment:
    🔗 Spec draft: github.com/KhronosGroup/OpenCL
    🔗 Clang RFC: discourse.llvm.org/t/rfc-clang
    🔗 Full blog: khronos.org/blog/opencl-cooper

  4. IWOCL 2026 is next week!

    Join the global OpenCL and SYCL community in Heilbronn, Germany (May 6–8) for the premier forum dedicated to open compute languages and heterogeneous platform programming. The program includes the latest technical talks, Khronos Working Group updates, application case studies, and ample opportunity to connect with peers across industry and academia.

    Registration remains open: www.iwocl.org

    See you there.

  5. The countdown is on — IWOCL 2026 is just two weeks away.

    Join the global OpenCL and SYCL community in Heilbronn, Germany (May 6–8) for the premier forum dedicated to open compute languages and heterogeneous platform programming. Expect the latest technical talks, Khronos Working Group updates, and ample opportunity to connect with peers across industry and academia.

    Registration is open: www.iwocl.org

  6. Keynote at IWOCL 2026: Paulius Velesko presents chipStar — compiling unmodified CUDA/HIP code into OpenCL & SPIR-V fat binaries that run on Intel, AMD, NVIDIA, ARM, and RISC-V hardware. No recompilation needed.

    Join us at IWOCL 2026, May 6–8 in Heilbronn, Germany to hear more.

    View the full program at: iwocl.org/iwocl-2026/conferenc

  7. The IWOCL 2026 program is live!

    The 14th International Workshop on OpenCL and SYCL is coming to Heilbronn, Germany this May 6–8, and the full conference program has just been published at iwocl.org.

    This year's lineup is packed. Whether you're working on heterogeneous HPC, GPU portability, or the future of open compute standards, this is the event for you.

    Explore the program at iwocl.org/iwocl-2026/conferenc

  8. OpenCL, SYCL, and matrices

    In this article I describe my experience developing applications with OpenCL/SYCL. GPU computing is associated mainly with graphics, scientific computing, and, more recently, neural networks. But most often with graphics. Nevertheless, GPUs have properties that make them very convenient for tasks not directly related to any of the above. Chief among these properties is massive parallelism. The shortest possible introduction to heterogeneous programming: there are two roles, the host and the device. The host's job is to create tasks and hand them to the device, while the device's job is to process them and return the result. The host and the device can be the same compute unit (it does not have to be CPU + GPU, since a CPU can hand tasks to itself).

    habr.com/ru/articles/994986/

    #gpgpu #opencl #sycl #gpu_computing
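
The host/device split described in this post can be illustrated with a toy model. This is only an analogy in plain Python, not OpenCL or SYCL code: the thread pool stands in for the "device", and the names `device_kernel` and `host_run` are hypothetical. It mirrors the post's remark that the host and the device can be the same CPU.

```python
from concurrent.futures import ThreadPoolExecutor


def device_kernel(row, vector):
    """Work done on the 'device': one dot product per task."""
    return sum(a * b for a, b in zip(row, vector))


def host_run(matrix, vector, workers=4):
    """The 'host': creates one task per matrix row, hands the tasks to the
    'device' (here a worker pool on the same CPU), and collects the results
    in row order."""
    with ThreadPoolExecutor(max_workers=workers) as device:
        futures = [device.submit(device_kernel, row, vector) for row in matrix]
        return [f.result() for f in futures]


matrix = [[1, 2], [3, 4], [5, 6]]
vector = [10, 1]
result = host_run(matrix, vector)
print(result)
```

Each row of the matrix-vector product is an independent task, which is the kind of work that maps well onto the massive parallelism the post mentions.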

  9. Call for Submissions for IWOCL 2026 has been extended until January 25th!

    Now is the time to submit one of four types of submission:

    + Technical Presentations & Case Studies (500 word abstract)
    + Short Research Papers (3-4 page paper)
    + Full Research Papers (8-12 page paper)
    + Posters (1-2 page abstract)

    Get your submission in now and join us for another inspiring workshop at IWOCL 2026!
    iwocl.org

  10. Khronos Group Welcomes Lisuan Technology as Contributor Member

    Lisuan Technology is a company dedicated to the research and development of graphics rendering GPUs. Founded by leading figures in the GPU industry, it possesses capabilities in large-scale chip R&D, top-level architecture design, software stack design, and mass production. They are interested in #OpenCL, #OpenGL, #OpenGLES, #Vulkan, #WebGL, #SPIR, and #OpenXR.
    lisuantech.com/

  15. We are excited for this year's Supercomputing! Khronos will be well represented at this premier event! Learn more about all the Khronos-related sessions on our website: khronos.org/events/supercomput

  16. I just uploaded the 5000th #OpenCL hardware report to @sascha's gpuinfo.org database! 🖖🥳 And guess what #GPU I reserved the spot for: #Intel Arc B580 #Battlemage 🟦
    opencl.gpuinfo.org/displayrepo
    I have contributed 4.2% (211) of all entries. 🖖🫡

  17. What an honor to start the #IWOCL conference with my keynote talk! Nowhere else do you get to talk to so many #OpenCL and #SYCL experts in one room! I shared some updates on my #FluidX3D #CFD solver, and how I optimized it at the smallest level of a single grid cell to scale it up on the largest #Intel #Xeon6 #HPC systems, which provide more memory capacity than any #GPU server. 🖖😃

  18. Just arrived in wonderful Heidelberg, looking forward to presenting the keynote talk at #IWOCL tomorrow!! See you there! 🖖😁
    iwocl.org/ #OpenCL #SYCL #FluidX3D #GPU #HPC

  19. Managed to get an #Intel Arc A750 #gpu running on #risc_v using #OpenCL, #SYCL, and #AdaptiveCpp. Software PRs submitted for review.

    #hpc #supercomputing

    @risc_v

  20. Hot Aisle's 8x AMD #MI300X server is the fastest computer I've ever tested in #FluidX3D #CFD, achieving a peak #LBM performance of 205 GLUPs/s, and a combined VRAM bandwidth of 23 TB/s. 🖖🤯
    The #RTX 5090 looks like a toy in comparison.

    MI300X beats even Nvidia's GH200 94GB. This marks a very fascinating inflection point in #GPGPU: #CUDA is not the performance leader anymore. 🖖😛
    You need a cross-vendor language like #OpenCL to leverage its power.

    FluidX3D on #GitHub: github.com/ProjectPhysX/FluidX

  21. I'm getting the material ready for my upcoming #GPGPU course that starts in March. Even though I most probably won't get to it, I also checked my trivial #SYCL programs. Apparently the 2025.0 version of the #Intel #OneAPI #DPCPP runtime doesn't like any #OpenCL platform except Intel's own (I have two other platforms that support #SPIRV, so why aren't they showing up? From the documentation I can find online this should be sufficient, but apparently it's not …)

  22. @BenjaminHCCarr another article on #GPU code portability where people put their heads in the sand and pretend very hard that #OpenCL doesn't exist...
    OpenCL already solved #GPGPU cross-compatibility 16 years ago and today is in better shape than ever.

  23. @dexter Here's the AMD w/ CUDA support/library/api/shim/translation/thing that came up the other day:

    > ZLUDA is a drop-in replacement for CUDA on non-NVIDIA GPU. ZLUDA allows to run unmodified CUDA applications using non-NVIDIA GPUs with near-native performance.

    - vosen.github.io/ZLUDA/blog/zlu
    - github.com/vosen/ZLUDA

    #forDiscussion #gpu #ai #cuda #amd #nvidia #zluda #opencl #machineLearning

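  24. ZLUDA: running CUDA programs directly on <del>Intel</del> AMD GPUs

    I previously wrote about "ZLUDA, which runs CUDA programs directly on Intel integrated graphics". Things have since taken a dramatic turn: AMD stepped in to sponsor the project, and it now supports AMD GPUs instead: "AMD Quietly Funded A Drop-In CUDA Implementation Built On ROCm: It's Now Open-Source". The project is on GitHub under vosen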

    blog.gslin.org/archives/2024/0

    #Computer #Murmuring #Software #amd #blender #cuda #driver #intel #nvidia #open #opencl #rocm #source #zluda

  25. #FluidX3D v2.13 is out, providing faster #VTK export with automatic SI unit conversion and a variety of bug fixes!
    Full release notes: github.com/ProjectPhysX/FluidX
    #GPU #CFD #OpenCL #GPGPU #HPC #GitHub

    [OUTDATED: PLEASE SEE https://eepy.moe/notes/9wty6rkb3j1r0b5d]

    New account, new intro post
    (yes this is pretty much copy/pasted with a couple of tweaks)

    Hi! I'm Skye, who used to be @[email protected]

    Some quick facts about me:
    - I'm #transfem
    - I have #autism
    - My sona, also called Skye, is a transfem shark
    - I live in the #UK (aka TERF Island :/ )

    My hobbies:
    - I do a lot of programming in languages such as #c++, #rustlang, #kotlin and #pythonlang
    - I know a decent amount about HPC programming (I've used #cuda, #opencl, #hiplang and #vulkan compute)
    - I really love old computers, esp ones to do with the history of #HPC, such as #SGI machines - sadly I do not own any of these computers as of writing this :(
    - I've participated in the homebrew scenes for the Original Xbox, and the PS3
    - On the note of consoles, I own: an #xboxone S, a #PS3, an #originalxbox, a #Wii, and a #GameCube
    - Other than programming aligned interests, I really like:
      - #trains (expect me to post a lot of train photos/videos when I go on them lol)
      - #cosplay - sadly I've never actually owned a cosplay or made one, however, I've been to quite a few comiccons around the UK, and one day, when I have the money, I'd love to make/buy a cosplay (I might make a cosplan post at somepoint?)
    - I'm also into #mtb, and when I do it I may upload videos of cool shit I do lol

    Shows I've watched and loved (and where my cosplans mostly come from lol - also, these are in no particular order):
    - The Dragon Prince - #thedragonprince
    - She-Ra (2018) - #shera
    - Yakitori - #yakitori
    - The Owl House - #theowlhouse
    - Avatar: The Last Airbender - #atla
    - The Legend of Korra - #tlok
    - Arcane - #arcane
    - Brand New Animal - #bna
    - Voltron - #vld

    Links:
    - My #CodeBerg profile: https://codeberg.org/ExoSkye
    - My #GitHub profile (for old projects): https://github.com/ExoSkye
    - My website: https://rsxgirl.gay/
    - #Discord - ask me personally for my tag if you want it
    - My Email: [email protected]

    Anyway, thanks for checking out my profile!

  27. I should mention that this isn't just a matter of the dominant player intentionally boycotting standards that would make them lose the vendor lock-in advantage (hello #NVIDIA). All major vendors are guilty of this one way or the other. For example, #AMD unjustifiably pulled (or maybe failed to add) #SPIR and #CPU support from their new #OpenCL implementation. #Intel's #oneAPI (even while still leveraging the OpenCL backend) effectively failed on any other OpenCL platform.

  30. #Blender 3.4 HIP Performance With #Radeon RX 7900 Series + RDNA3 #OpenCL Compute Benchmarks

    -- Some initial #Linux Blender HIP + @openclapi benchmarks for the #RX7900 series

    phoronix.com/review/rx7900-ble

    Original tweet : twitter.com/phoronix/status/16