#gpucomputing — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #gpucomputing, aggregated by home.social.
-
SHIFTING CURRENTS IN COMPUTATIONAL LATTICES
New NVIDIA CUDA Toolkit 12.2 features improve Python GPU programming. Learn how this helps developers run complex calculations faster on NVIDIA GPUs.
#NVIDIACUDA, #GPUcomputing, #PythonDev, #TechUpdate, #ParallelProcessing
https://newsletter.tf/nvidia-cuda-toolkit-12-2-python-gpu-computing/
-
SHIFTING CURRENTS IN COMPUTATIONAL LATTICES
New NVIDIA CUDA Toolkit 12.2 features improve Python GPU programming. Learn how this helps developers run complex calculations faster on NVIDIA GPUs.
#NVIDIACUDA, #GPUcomputing, #PythonDev, #TechUpdate, #ParallelProcessing
https://newsletter.tf/nvidia-cuda-toolkit-12-2-python-gpu-computing/
-
NVIDIA's CUDA Toolkit 12.2 is out, offering new tools that make running complex calculations on GPUs much easier for Python developers.
#NVIDIACUDA, #GPUcomputing, #PythonDev, #TechUpdate, #ParallelProcessing
https://newsletter.tf/nvidia-cuda-toolkit-12-2-python-gpu-computing/ -
NVIDIA's CUDA Toolkit 12.2 is out, offering new tools that make running complex calculations on GPUs much easier for Python developers.
#NVIDIACUDA, #GPUcomputing, #PythonDev, #TechUpdate, #ParallelProcessing
https://newsletter.tf/nvidia-cuda-toolkit-12-2-python-gpu-computing/ -
Người dùng đang tìm cách triển khai suy luận cục bộ cho mô hình lớn Qwen2.5-72B trên 2 GPU L40 (48GB VRAM mỗi chiếc) nhưng gặp trở ngại. Khi dùng Huggingface, quá trình bị treo, còn vLLM thì báo lỗi khởi tạo WorkerProc. Anh ấy đang tìm kiếm các gợi ý để giải quyết vấn đề phân chia mô hình và tăng tốc suy luận trên hệ thống đa GPU.
#LLM #AITech #vLLM #Huggingface #LocalInference #GPUComputing #Qwen2_5_72Bhttps://www.reddit.com/r/LocalLLaMA/comments/1q7gr9w/local_inference_with_big_model_shared_
-
New in the #VirtualObservatory: “Order Computational and Storage Resources at FAI” by Fesenkov Astrophysical Institute
https://dachs.fai.kz/soft_order_sims/q/compres/info
#AstronomicalInstrumentation #ComputationalAstronomy #GpuComputing #AutomatedTelescopes -
So sánh chi phí khi fine-tune Llama 3 70B:
- **AWS H100**: $4.50/giờ, setup 45 phút (cài driver + tải dữ liệu)
- **Cụm RTX4090s phân tán**: $2.00/giờ, setup 5 phút
Giả định: Cụm chậm hơn 1.6x do WAN.
📊 Kết quả:
• Chạy một lần dài → AWS nhanh hơn.
• Vòng nghiên cứu (3-4 lần chạy nhỏ) → Cụm RTX4090s rẻ hơn và cạnh tranh về tổng thời gian nhờ giảm chi phí "setup" lặp lại.
#AI #GPUComputing #CostOptimization #Llama3 #TríTuệNhânTạo #MáyTínhGPU #TốiƯuChiPhí -
llama.cpp trên llama-server gặp vấn đề hiệu suất lớn khi dùng eGPU qua Thunderbolt 4. Tốc độ prefill (xử lý prompt) giảm từ ~2500 t/s (1 GPU) xuống ~150 t/s (2 GPU, 1 qua TB4). Có phải độ trễ của TB4 là thủ phạm chính? Liệu Oculink có tốt hơn?
#llama_cpp #llama_server #eGPU #Thunderbolt4 #LLM #AIPerformance #GPUComputing #HiệuSuấtAI #TínhToánGPU #PhầnCứngAI #MôHìnhNgônNgữ
-
The Largest CUDA Update in 20 Years: CUDA 13.1 Reconstructs GPU Programming
https://www.buysellram.com/blog/cuda-13-1-reinvents-gpu-development-the-biggest-leap-in-two-decades/
#CUDA #CudaTile #Nvidia #GPU #GPUPrograming #CUDA131 #HPC #Blackwell #TileProgramming #DeveloperTools #GPUComputing #tech #technews
-
The Largest CUDA Update in 20 Years: CUDA 13.1 Reconstructs GPU Programming
https://www.buysellram.com/blog/cuda-13-1-reinvents-gpu-development-the-biggest-leap-in-two-decades/
#CUDA #CudaTile #Nvidia #GPU #GPUPrograming #CUDA131 #HPC #Blackwell #TileProgramming #DeveloperTools #GPUComputing #tech #technews
-
The Largest CUDA Update in 20 Years: CUDA 13.1 Reconstructs GPU Programming
https://www.buysellram.com/blog/cuda-13-1-reinvents-gpu-development-the-biggest-leap-in-two-decades/
#CUDA #CudaTile #Nvidia #GPU #GPUPrograming #CUDA131 #HPC #Blackwell #TileProgramming #DeveloperTools #GPUComputing #tech #technews
-
The Largest CUDA Update in 20 Years: CUDA 13.1 Reconstructs GPU Programming
https://www.buysellram.com/blog/cuda-13-1-reinvents-gpu-development-the-biggest-leap-in-two-decades/
#CUDA #CudaTile #Nvidia #GPU #GPUPrograming #CUDA131 #HPC #Blackwell #TileProgramming #DeveloperTools #GPUComputing #tech #technews
-
The Largest CUDA Update in 20 Years: CUDA 13.1 Reconstructs GPU Programming
https://www.buysellram.com/blog/cuda-13-1-reinvents-gpu-development-the-biggest-leap-in-two-decades/
#CUDA #CudaTile #Nvidia #GPU #GPUPrograming #CUDA131 #HPC #Blackwell #TileProgramming #DeveloperTools #GPUComputing #tech #technews
-
Simulating a Planet on the GPU: Part 1 (2022)
https://www.patrickcelentano.com/blog/planet-sim-part-1
#HackerNews #Simulating #a #Planet #on #the #GPU #Part #1 #2022 #GPUComputing #PlanetSimulation #GraphicsProgramming
-
Simulating a Planet on the GPU: Part 1 (2022)
https://www.patrickcelentano.com/blog/planet-sim-part-1
#HackerNews #Simulating #a #Planet #on #the #GPU #Part #1 #2022 #GPUComputing #PlanetSimulation #GraphicsProgramming
-
Simulating a Planet on the GPU: Part 1 (2022)
https://www.patrickcelentano.com/blog/planet-sim-part-1
#HackerNews #Simulating #a #Planet #on #the #GPU #Part #1 #2022 #GPUComputing #PlanetSimulation #GraphicsProgramming
-
Simulating a Planet on the GPU: Part 1 (2022)
https://www.patrickcelentano.com/blog/planet-sim-part-1
#HackerNews #Simulating #a #Planet #on #the #GPU #Part #1 #2022 #GPUComputing #PlanetSimulation #GraphicsProgramming
-
Simulating a Planet on the GPU: Part 1 (2022)
https://www.patrickcelentano.com/blog/planet-sim-part-1
#HackerNews #Simulating #a #Planet #on #the #GPU #Part #1 #2022 #GPUComputing #PlanetSimulation #GraphicsProgramming
-
Nebius Group reported a Q3 net loss of $120M amid heavy spending on AI infrastructure, but secured a $3B, five-year deal with Meta to provide cloud and GPU resources for next-gen AI models. The partnership strengthens Nebius’s position in the high-performance AI cloud market and underscores its long-term growth potential despite short-term losses.
#Nebius #Meta #AIInfrastructure #ArtificialIntelligence #CloudComputing #GPUComputing #TECHi
Read Full Article Here :- https://www.techi.com/nebius-reports-q3-loss-meta-ai-deal/
-
🚀 New on the Bioconductor Blog: GPU Support in Bioconductor
📝 Written by Andres Wokaty
Bioconductor is building stronger support for GPU-accelerated package development, enabling faster and more scalable analysis workflows.
Learn how package maintainers can take advantage of this new GPU infrastructure: https://blog.bioconductor.org/posts/2025-10-10-gpus/
-
🧪Curious about high performance across GPUs? Our new paper benchmarks a parallel FSI code on CUDA, SYCL & OpenMP across top systems. See Aristotle Martin present it at #ISC2025 on June 11, 10:45 in Hamburg! #HPC #GPUcomputing #PerformancePortability
-
🧪Curious about high performance across GPUs? Our new paper benchmarks a parallel FSI code on CUDA, SYCL & OpenMP across top systems. See Aristotle Martin present it at #ISC2025 on June 11, 10:45 in Hamburg! #HPC #GPUcomputing #PerformancePortability
-
🚀 So, you think strapping consumer GPUs together is the tech equivalent of duct-taping a rocket? 🤔 GitHub's magical fairy dust promises to turn your GPU potato farm into a supercomputer, but only if you squint hard enough. 🥔✨
https://github.com/Foreseerr/TScale #GPUComputing #TechInnovation #Supercomputing #GitHub #MagicPotatoFarm #HackerNews #ngated -
🚀 So, you think strapping consumer GPUs together is the tech equivalent of duct-taping a rocket? 🤔 GitHub's magical fairy dust promises to turn your GPU potato farm into a supercomputer, but only if you squint hard enough. 🥔✨
https://github.com/Foreseerr/TScale #GPUComputing #TechInnovation #Supercomputing #GitHub #MagicPotatoFarm #HackerNews #ngated -
🚀 So, you think strapping consumer GPUs together is the tech equivalent of duct-taping a rocket? 🤔 GitHub's magical fairy dust promises to turn your GPU potato farm into a supercomputer, but only if you squint hard enough. 🥔✨
https://github.com/Foreseerr/TScale #GPUComputing #TechInnovation #Supercomputing #GitHub #MagicPotatoFarm #HackerNews #ngated -
🚀 So, you think strapping consumer GPUs together is the tech equivalent of duct-taping a rocket? 🤔 GitHub's magical fairy dust promises to turn your GPU potato farm into a supercomputer, but only if you squint hard enough. 🥔✨
https://github.com/Foreseerr/TScale #GPUComputing #TechInnovation #Supercomputing #GitHub #MagicPotatoFarm #HackerNews #ngated -
🚀 Ready to test the limits of performance?
Join the @EPCC Hackathon on AMD GPUs and explore the cutting-edge #MI300A and AMD’s Next Generation #Fortran Compiler with #OpenMP offload!
💻 Bring your code, ideas, and curiosity.
🔧 Optimize, accelerate, and innovate with us.
🏆 Let’s see what you can build!🔗 https://www.archer2.ac.uk/training/courses/250527-amd-hackathon/
-
🚀 Ready to test the limits of performance?
Join the @EPCC Hackathon on AMD GPUs and explore the cutting-edge #MI300A and AMD’s Next Generation #Fortran Compiler with #OpenMP offload!
💻 Bring your code, ideas, and curiosity.
🔧 Optimize, accelerate, and innovate with us.
🏆 Let’s see what you can build!🔗 https://www.archer2.ac.uk/training/courses/250527-amd-hackathon/
-
🚀 Ready to test the limits of performance?
Join the @EPCC Hackathon on AMD GPUs and explore the cutting-edge #MI300A and AMD’s Next Generation #Fortran Compiler with #OpenMP offload!
💻 Bring your code, ideas, and curiosity.
🔧 Optimize, accelerate, and innovate with us.
🏆 Let’s see what you can build!🔗 https://www.archer2.ac.uk/training/courses/250527-amd-hackathon/
-
🚀 Ready to test the limits of performance?
Join the @EPCC Hackathon on AMD GPUs and explore the cutting-edge #MI300A and AMD’s Next Generation #Fortran Compiler with #OpenMP offload!
💻 Bring your code, ideas, and curiosity.
🔧 Optimize, accelerate, and innovate with us.
🏆 Let’s see what you can build!🔗 https://www.archer2.ac.uk/training/courses/250527-amd-hackathon/
-
🚀 Ready to test the limits of performance?
Join the @EPCC Hackathon on AMD GPUs and explore the cutting-edge #MI300A and AMD’s Next Generation #Fortran Compiler with #OpenMP offload!
💻 Bring your code, ideas, and curiosity.
🔧 Optimize, accelerate, and innovate with us.
🏆 Let’s see what you can build!🔗 https://www.archer2.ac.uk/training/courses/250527-amd-hackathon/
-
NVIDIA stellt DGX Spark und DGX Station vor: KI-Supercomputer für den Schreibtisch
NVIDIA hat auf der GTC 2025 zwei neue KI-Supercomputer vorgestellt, die erstmals Data-Center-Leistung auf den Desktop bringen
https://www.apfeltalk.de/magazin/news/nvidia-stellt-dgx-spark-und-dgx-station-vor-ki-supercomputer-fuer-den-schreibtisch/
#KI #News #DataScience #DGXSpark #DGXStation #GPUComputing #GraceBlackwell #HighPerformanceComputing #KIEntwicklung #KISupercomputer #MachineLearning #NVIDIADGX -
NVIDIA stellt DGX Spark und DGX Station vor: KI-Supercomputer für den Schreibtisch
NVIDIA hat auf der GTC 2025 zwei neue KI-Supercomputer vorgestellt, die erstmals Data-Center-Leistung auf den Desktop bringen
https://www.apfeltalk.de/magazin/news/nvidia-stellt-dgx-spark-und-dgx-station-vor-ki-supercomputer-fuer-den-schreibtisch/
#KI #News #DataScience #DGXSpark #DGXStation #GPUComputing #GraceBlackwell #HighPerformanceComputing #KIEntwicklung #KISupercomputer #MachineLearning #NVIDIADGX -
And compression is now super fast!
💻Performance on Mac M1:
✅𝐂𝐨𝐦𝐩𝐫𝐞𝐬𝐬𝐢𝐨𝐧: 7 GB/s
✅𝐃𝐞𝐜𝐨𝐦𝐩𝐫𝐞𝐬𝐬𝐢𝐨𝐧: 8 GB/s
Wait till multithreading happens on GPU and you only decompress on demand#compression
#llms
#GPUComputing
#ai𝐏𝐚𝐩𝐞𝐫: alphaxiv.org/abs/2411.05239
-
In the long run it seems we have to replace #opencl in our scientific software, which used pyopencl for #GPUcomputing on all vendors' cards. Which way should we go?
#SYCL?
We want #FOSS, vendor neutrality, longevity of the software and an easy way to use it from python (ah, and performance, of course) -
Going to Nvidia GTC?
Visit us at booth 1422 to talk about how we support AI/ML from desktop to cloud to edge.
Then join us for drinks and tacos at Continental Bar on March 20 from 7 pm to 10 pm.
-
Nvidia nutzt die Computer Vision and Pattern Recognition Conference zur Veröffentlichung mehrerer Machine-Learning-Projekte. www.heise.de/developer/meldung… #GPUComputing #MachineLearning #Nvidia