#gb10 — Public Fediverse posts on home.social

Habr @[email protected] · 2026-06-22 · 12:02 UTC

DeepSeek‑V4‑Flash на двух DGX Spark: как мы убрали очередь и получили multi‑user

Подняли DeepSeek‑V4‑Flash на двух GB10, упёрлись в потолок consumer Blackwell, прошли три тупика со спекулятивным декодингом — и в итоге получили параллельную работу ресёрч-агента и длинной генерации без очереди. Цифры из Grafana.

https://habr.com/ru/articles/1050470/

#dgx_spark #vllm #deepseekv4 #gb10 #tensor_parallel #AGmind #llm_inference #спекулятивный_декодинг

#спекулятивный_декодинг #llm_inference #agmind #tensor_parallel #gb10 #deepseekv4

Habr @[email protected] · 2026-06-22 · 12:02 UTC

DeepSeek‑V4‑Flash на двух DGX Spark: как мы убрали очередь и получили multi‑user

Подняли DeepSeek‑V4‑Flash на двух GB10, упёрлись в потолок consumer Blackwell, прошли три тупика со спекулятивным декодингом — и в итоге получили параллельную работу ресёрч-агента и длинной генерации без очереди. Цифры из Grafana.

https://habr.com/ru/articles/1050470/

#dgx_spark #vllm #deepseekv4 #gb10 #tensor_parallel #AGmind #llm_inference #спекулятивный_декодинг

#спекулятивный_декодинг #llm_inference #agmind #tensor_parallel #gb10 #deepseekv4

Habr @[email protected] · 2026-06-22 · 12:02 UTC

DeepSeek‑V4‑Flash на двух DGX Spark: как мы убрали очередь и получили multi‑user

Подняли DeepSeek‑V4‑Flash на двух GB10, упёрлись в потолок consumer Blackwell, прошли три тупика со спекулятивным декодингом — и в итоге получили параллельную работу ресёрч-агента и длинной генерации без очереди. Цифры из Grafana.

https://habr.com/ru/articles/1050470/

#dgx_spark #vllm #deepseekv4 #gb10 #tensor_parallel #AGmind #llm_inference #спекулятивный_декодинг

#dgx_spark #vllm #deepseekv4 #gb10 #tensor_parallel #agmind

deepseek @[email protected] · 2026-06-22 · 11:53 UTC

DeepSeek‑V4‑Flash на двух DGX Spark: как мы убрали очередь и получили multi‑user Подняли DeepSeek‑V4‑Flash на двух GB10, упёрлись ...

#dgx #spark #vllm #deepseek-v4 #gb10 #tensor #parallel #AGmind #llm #inference #спекулятивный

Origin | Interest | Match

#dgx #spark #vllm #deepseekv4 #gb10 #tensor

Polska @[email protected] · 2026-06-07 · 03:13 UTC

https://www.europesays.com/pl/478499/ Widzieliśmy już nowe laptopy oraz Mini PC z układem SoC NVIDIA RTX Spark. Jak procesor wypadnie w grach oraz AI? #asus #chip #computex #Computex2026 #dell #energooszczędność #GB10 #GraceBlackwell #HP #mediatek #MSI #Nauka #NaukaITechnika #NaukaTechnika #Notebooki #nvidia #NvidiaRtxSpark #PL #Poland #Polish #Polska #Polski #procesor #RtxSpark #Science #ScienceAndTechnology #ScienceTechnology #Technika #Technology #wydajność

#wydajnosc #technology #technika #sciencetechnology #scienceandtechnology #science

Polska @[email protected] · 2026-05-31 · 19:30 UTC

https://www.europesays.com/pl/465829/ NVIDIA N1, N1X 650 oraz N1X 675 – Poznaliśmy dokładniejszą specyfikację procesorów SoC dla notebooków #arm #blackwell #computex #Computex2026 #GB10 #Gb10Superchip #N1x650 #N1x675 #Nauka #NaukaITechnika #NaukaTechnika #nvidia #NvidiaN1 #NvidiaN1x #NvidiaN1x650 #NvidiaN1x675 #PL #Poland #Polish #Polska #Polski #Science #ScienceAndTechnology #ScienceTechnology #specyfikacja #Technika #Technology

#technology #technika #specyfikacja #sciencetechnology #scienceandtechnology #science

Arint - SEO+KI @[email protected] · 2026-05-20 · 10:04 UTC

RT @mr_r0b0t: Hier ist ein sehr beliebtes Modell, das wirklich vom richtigen Einsatz Ihrer @NVIDIAAI Blackwell GPU/GB10 mit NVFP4 und dem @AlibabaQwen 3.6-27B nativen MTP profitiert. Dies wurde auf einer einzelnen GB10 ausgeführt. Vollständige Benchmark-Ergebnisse und Methoden finden Sie unten ⏬

mehr auf Arint.info

#Benchmarking #BlackwellGPU #GB10 #NVFP4 #NVIDIAAI #Qwen3 #arint_info

https://x.com/mr_r0b0t/status/2056953515092619474#m

#benchmarking #blackwellgpu #gb10 #nvfp4 #nvidiaai #qwen3

Habr @[email protected] · 2026-05-09 · 15:02 UTC

DGX Spark на 256K контексте: тестирую конфигурации vLLM, реальные замеры и почему NVFP4 в mainline сломан

NVIDIA продаёт спарку с лозунгом «один петафлоп на FP4». Я купил коробку, поставил vLLM, запустил инференс и получил 40 токенов в секунду на 35B MoE‑модели. После маркетинговых слайдов цифра выглядит грустно. Объяснение простое. NVFP4 в основной ветке vLLM и FlashInfer физически сломан на SM_121 — варианте Blackwell, который установлен в GB10. Ядра собраны под compute_120f , а нативные NVFP4-инструкции есть только в compute_120a и compute_121a . На SM_121 распаковка квантованных весов идёт через программные битовые манипуляции в шейдере, без участия тензорных ядер. Сообщество вытащило стек руками: нашло обходные пути, собрало рабочие конфигурации. Я прогнал на своём Spark шесть разных конфигураций vLLM — от стокового BF16 до форка с DFlash speculative decoding — и замерил каждую одинаковым тестом. В этой статье разбираю, что в итоге работает и что выбирать под разные задачи.

https://habr.com/ru/articles/1033342/

#vllm #dgx_spark #gb10 #blackwell #nvfp4 #llm #инференс #локальный_ии

#локальный_ии #инференс #llm #nvfp4 #blackwell #gb10

Habr @[email protected] · 2026-05-09 · 15:02 UTC

DGX Spark на 256K контексте: тестирую конфигурации vLLM, реальные замеры и почему NVFP4 в mainline сломан

NVIDIA продаёт спарку с лозунгом «один петафлоп на FP4». Я купил коробку, поставил vLLM, запустил инференс и получил 40 токенов в секунду на 35B MoE‑модели. После маркетинговых слайдов цифра выглядит грустно. Объяснение простое. NVFP4 в основной ветке vLLM и FlashInfer физически сломан на SM_121 — варианте Blackwell, который установлен в GB10. Ядра собраны под compute_120f , а нативные NVFP4-инструкции есть только в compute_120a и compute_121a . На SM_121 распаковка квантованных весов идёт через программные битовые манипуляции в шейдере, без участия тензорных ядер. Сообщество вытащило стек руками: нашло обходные пути, собрало рабочие конфигурации. Я прогнал на своём Spark шесть разных конфигураций vLLM — от стокового BF16 до форка с DFlash speculative decoding — и замерил каждую одинаковым тестом. В этой статье разбираю, что в итоге работает и что выбирать под разные задачи.

https://habr.com/ru/articles/1033342/

#vllm #dgx_spark #gb10 #blackwell #nvfp4 #llm #инференс #локальный_ии

#локальный_ии #инференс #llm #nvfp4 #blackwell #gb10

Habr @[email protected] · 2026-05-09 · 15:02 UTC

DGX Spark на 256K контексте: тестирую конфигурации vLLM, реальные замеры и почему NVFP4 в mainline сломан

NVIDIA продаёт спарку с лозунгом «один петафлоп на FP4». Я купил коробку, поставил vLLM, запустил инференс и получил 40 токенов в секунду на 35B MoE‑модели. После маркетинговых слайдов цифра выглядит грустно. Объяснение простое. NVFP4 в основной ветке vLLM и FlashInfer физически сломан на SM_121 — варианте Blackwell, который установлен в GB10. Ядра собраны под compute_120f , а нативные NVFP4-инструкции есть только в compute_120a и compute_121a . На SM_121 распаковка квантованных весов идёт через программные битовые манипуляции в шейдере, без участия тензорных ядер. Сообщество вытащило стек руками: нашло обходные пути, собрало рабочие конфигурации. Я прогнал на своём Spark шесть разных конфигураций vLLM — от стокового BF16 до форка с DFlash speculative decoding — и замерил каждую одинаковым тестом. В этой статье разбираю, что в итоге работает и что выбирать под разные задачи.

https://habr.com/ru/articles/1033342/

#vllm #dgx_spark #gb10 #blackwell #nvfp4 #llm #инференс #локальный_ии

#vllm #dgx_spark #gb10 #blackwell #nvfp4 #llm

Habr @[email protected] · 2026-05-06 · 01:52 UTC

DGX Spark: мониторинг unified memory, когда NVML и dcgm‑exporter молчат

Свежепоставленный мониторинг на DGX Spark. Открываю NVIDIA‑дашборд в Grafana — половина memory‑панелей пустые, прямые линии по нулю. Сначала кажется, что что‑то не настроил. Через полчаса доходит: это не у меня сломалось, это NVML на GB10 так работает. Это та область, где на GB10 половина стандартного observability‑стека просто не работает: NVML отдаёт [N/A] на memory.used и memory.total, dcgm‑exporter не ставится, nvtop в memory‑колонке показывает пустоту. В Grafana NVIDIA‑дашборды по умолчанию выглядят так, будто GPU вообще нет — и это не очевидно, потому что Grafana при отсутствии данных не кричит, а молча рисует ровную линию по нулю. Статья — про то, как я это место обошёл и что в итоге увидел в Grafana. Трёхуровневая схема: textfile collector для базовых метрик, per‑container attribution через docker top + nvidia-smi , и CLI‑фоллбэк на /proc/meminfo , который оказался полезен не только на Spark, но и на других Linux‑системах с единой памятью (unified memory) — AMD Strix Halo и подобные.

https://habr.com/ru/articles/1031904/

#dgx_spark #grafana #monitoring #nodeexporter #gb10 #arm64 #prometheus #observability

#observability #prometheus #arm64 #gb10 #nodeexporter #monitoring

Habr @[email protected] · 2026-05-06 · 01:52 UTC

DGX Spark: мониторинг unified memory, когда NVML и dcgm‑exporter молчат

Свежепоставленный мониторинг на DGX Spark. Открываю NVIDIA‑дашборд в Grafana — половина memory‑панелей пустые, прямые линии по нулю. Сначала кажется, что что‑то не настроил. Через полчаса доходит: это не у меня сломалось, это NVML на GB10 так работает. Это та область, где на GB10 половина стандартного observability‑стека просто не работает: NVML отдаёт [N/A] на memory.used и memory.total, dcgm‑exporter не ставится, nvtop в memory‑колонке показывает пустоту. В Grafana NVIDIA‑дашборды по умолчанию выглядят так, будто GPU вообще нет — и это не очевидно, потому что Grafana при отсутствии данных не кричит, а молча рисует ровную линию по нулю. Статья — про то, как я это место обошёл и что в итоге увидел в Grafana. Трёхуровневая схема: textfile collector для базовых метрик, per‑container attribution через docker top + nvidia-smi , и CLI‑фоллбэк на /proc/meminfo , который оказался полезен не только на Spark, но и на других Linux‑системах с единой памятью (unified memory) — AMD Strix Halo и подобные.

https://habr.com/ru/articles/1031904/

#dgx_spark #grafana #monitoring #nodeexporter #gb10 #arm64 #prometheus #observability

#observability #prometheus #arm64 #gb10 #nodeexporter #monitoring

Habr @[email protected] · 2026-05-06 · 01:52 UTC

DGX Spark: мониторинг unified memory, когда NVML и dcgm‑exporter молчат

Свежепоставленный мониторинг на DGX Spark. Открываю NVIDIA‑дашборд в Grafana — половина memory‑панелей пустые, прямые линии по нулю. Сначала кажется, что что‑то не настроил. Через полчаса доходит: это не у меня сломалось, это NVML на GB10 так работает. Это та область, где на GB10 половина стандартного observability‑стека просто не работает: NVML отдаёт [N/A] на memory.used и memory.total, dcgm‑exporter не ставится, nvtop в memory‑колонке показывает пустоту. В Grafana NVIDIA‑дашборды по умолчанию выглядят так, будто GPU вообще нет — и это не очевидно, потому что Grafana при отсутствии данных не кричит, а молча рисует ровную линию по нулю. Статья — про то, как я это место обошёл и что в итоге увидел в Grafana. Трёхуровневая схема: textfile collector для базовых метрик, per‑container attribution через docker top + nvidia-smi , и CLI‑фоллбэк на /proc/meminfo , который оказался полезен не только на Spark, но и на других Linux‑системах с единой памятью (unified memory) — AMD Strix Halo и подобные.

https://habr.com/ru/articles/1031904/

#dgx_spark #grafana #monitoring #nodeexporter #gb10 #arm64 #prometheus #observability

#dgx_spark #grafana #monitoring #nodeexporter #gb10 #arm64

Habr @[email protected] · 2026-05-03 · 09:22 UTC

Как я собрал на DGX Spark приватный AI-сервер, и теперь рассказываю, что туда вошло

У меня на столе стоит небольшая золотистая коробочка размером чуть больше Mac mini. Внутри — приватный AI-сервер: чат с локальной 26B-моделью, поисковая индексация моих документов с GPU-парсингом, конструктор агентов в Dify, RAGFlow для тяжёлого парсинга PDF, мониторинг, бэкапы, опциональный кластер из двух машин по QSFP 200G. Тридцать контейнеров, пять минут на установку через sudo bash install.sh , ноль обращений к внешним API. Я делал это не как pet-project, а под себя — мне нужна была машина для работы с корпоративными документами, договорами и регламентами, которые ни при каких условиях нельзя отдавать в облачные ассистенты. Сборка получилась самостоятельным дистрибутивом — назвал его AGmind, выложил на GitHub под Apache 2.0. В статье разберу: — из чего собран стек и зачем там каждый компонент; — почему RAGFlow пришлось пересобрать с нуля и что я туда добавил; — как устроен кластер из двух Spark'ов; — пять конкретных грабель GB10, которые я ловил вечерами; — почему Claude Code за месяц превратил один из этих компонентов в работающий продукт, но при этом не заменил собственно программиста.

https://habr.com/ru/articles/1030802/

#dgx_spark #gb10 #arm64 #vllm #dify #ragflow #rag #llm

#llm #rag #ragflow #dify #vllm #arm64

Habr @[email protected] · 2026-05-03 · 09:22 UTC

Как я собрал на DGX Spark приватный AI-сервер, и теперь рассказываю, что туда вошло

У меня на столе стоит небольшая золотистая коробочка размером чуть больше Mac mini. Внутри — приватный AI-сервер: чат с локальной 26B-моделью, поисковая индексация моих документов с GPU-парсингом, конструктор агентов в Dify, RAGFlow для тяжёлого парсинга PDF, мониторинг, бэкапы, опциональный кластер из двух машин по QSFP 200G. Тридцать контейнеров, пять минут на установку через sudo bash install.sh , ноль обращений к внешним API. Я делал это не как pet-project, а под себя — мне нужна была машина для работы с корпоративными документами, договорами и регламентами, которые ни при каких условиях нельзя отдавать в облачные ассистенты. Сборка получилась самостоятельным дистрибутивом — назвал его AGmind, выложил на GitHub под Apache 2.0. В статье разберу: — из чего собран стек и зачем там каждый компонент; — почему RAGFlow пришлось пересобрать с нуля и что я туда добавил; — как устроен кластер из двух Spark'ов; — пять конкретных грабель GB10, которые я ловил вечерами; — почему Claude Code за месяц превратил один из этих компонентов в работающий продукт, но при этом не заменил собственно программиста.

https://habr.com/ru/articles/1030802/

#dgx_spark #gb10 #arm64 #vllm #dify #ragflow #rag #llm

#llm #rag #ragflow #dify #vllm #arm64

Habr @[email protected] · 2026-05-03 · 09:22 UTC

Как я собрал на DGX Spark приватный AI-сервер, и теперь рассказываю, что туда вошло

У меня на столе стоит небольшая золотистая коробочка размером чуть больше Mac mini. Внутри — приватный AI-сервер: чат с локальной 26B-моделью, поисковая индексация моих документов с GPU-парсингом, конструктор агентов в Dify, RAGFlow для тяжёлого парсинга PDF, мониторинг, бэкапы, опциональный кластер из двух машин по QSFP 200G. Тридцать контейнеров, пять минут на установку через sudo bash install.sh , ноль обращений к внешним API. Я делал это не как pet-project, а под себя — мне нужна была машина для работы с корпоративными документами, договорами и регламентами, которые ни при каких условиях нельзя отдавать в облачные ассистенты. Сборка получилась самостоятельным дистрибутивом — назвал его AGmind, выложил на GitHub под Apache 2.0. В статье разберу: — из чего собран стек и зачем там каждый компонент; — почему RAGFlow пришлось пересобрать с нуля и что я туда добавил; — как устроен кластер из двух Spark'ов; — пять конкретных грабель GB10, которые я ловил вечерами; — почему Claude Code за месяц превратил один из этих компонентов в работающий продукт, но при этом не заменил собственно программиста.

https://habr.com/ru/articles/1030802/

#dgx_spark #gb10 #arm64 #vllm #dify #ragflow #rag #llm

#dgx_spark #gb10 #arm64 #vllm #dify #ragflow

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @[email protected] · 2026-02-24 · 17:47 UTC

#Nvidia's #N1/#N1X chips leak once again, this time tipped for release in first half of 2026 — hotly-anticipated chips to debut on #Dell and #Lenovo #laptops
N1 and N1X chips are Arm #SoC from Nvidia, purportedly featuring up to 20 CPU cores and rumored #RTX5070-level integrated GPU. Jensen Huang confirmed that #GB10 Superchip powering #DGX #Spark is actually based on N1 silicon, so it's already out there... just not with the gaming-focused slant we expect from the N1.
https://www.tomshardware.com/pc-components/cpus/nvidias-n1-n1x-chips-leak-once-again-this-time-tipped-for-release-in-first-half-of-2026-hotly-anticipated-chips-to-reportedly-debut-on-dell-and-lenovo-laptops

#nvidia #n1 #dell #lenovo #laptops #soc

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @[email protected] · 2026-02-24 · 17:47 UTC

#Nvidia's #N1/#N1X chips leak once again, this time tipped for release in first half of 2026 — hotly-anticipated chips to debut on #Dell and #Lenovo #laptops
N1 and N1X chips are Arm #SoC from Nvidia, purportedly featuring up to 20 CPU cores and rumored #RTX5070-level integrated GPU. Jensen Huang confirmed that #GB10 Superchip powering #DGX #Spark is actually based on N1 silicon, so it's already out there... just not with the gaming-focused slant we expect from the N1.
https://www.tomshardware.com/pc-components/cpus/nvidias-n1-n1x-chips-leak-once-again-this-time-tipped-for-release-in-first-half-of-2026-hotly-anticipated-chips-to-reportedly-debut-on-dell-and-lenovo-laptops

#nvidia #n1 #dell #lenovo #laptops #soc

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @[email protected] · 2026-02-24 · 17:47 UTC

#Nvidia's #N1/#N1X chips leak once again, this time tipped for release in first half of 2026 — hotly-anticipated chips to debut on #Dell and #Lenovo #laptops
N1 and N1X chips are Arm #SoC from Nvidia, purportedly featuring up to 20 CPU cores and rumored #RTX5070-level integrated GPU. Jensen Huang confirmed that #GB10 Superchip powering #DGX #Spark is actually based on N1 silicon, so it's already out there... just not with the gaming-focused slant we expect from the N1.
https://www.tomshardware.com/pc-components/cpus/nvidias-n1-n1x-chips-leak-once-again-this-time-tipped-for-release-in-first-half-of-2026-hotly-anticipated-chips-to-reportedly-debut-on-dell-and-lenovo-laptops

#nvidia #n1 #dell #lenovo #laptops #soc

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @BenjaminHCCarr · 2026-02-24 · 17:47 UTC

#Nvidia's #N1/#N1X chips leak once again, this time tipped for release in first half of 2026 — hotly-anticipated chips to debut on #Dell and #Lenovo #laptops
N1 and N1X chips are Arm #SoC from Nvidia, purportedly featuring up to 20 CPU cores and rumored #RTX5070-level integrated GPU. Jensen Huang confirmed that #GB10 Superchip powering #DGX #Spark is actually based on N1 silicon, so it's already out there... just not with the gaming-focused slant we expect from the N1.
https://www.tomshardware.com/pc-components/cpus/nvidias-n1-n1x-chips-leak-once-again-this-time-tipped-for-release-in-first-half-of-2026-hotly-anticipated-chips-to-reportedly-debut-on-dell-and-lenovo-laptops

#nvidia #n1 #dell #lenovo #laptops #soc

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @[email protected] · 2026-02-24 · 17:47 UTC

#Nvidia's #N1/#N1X chips leak once again, this time tipped for release in first half of 2026 — hotly-anticipated chips to debut on #Dell and #Lenovo #laptops
N1 and N1X chips are Arm #SoC from Nvidia, purportedly featuring up to 20 CPU cores and rumored #RTX5070-level integrated GPU. Jensen Huang confirmed that #GB10 Superchip powering #DGX #Spark is actually based on N1 silicon, so it's already out there... just not with the gaming-focused slant we expect from the N1.
https://www.tomshardware.com/pc-components/cpus/nvidias-n1-n1x-chips-leak-once-again-this-time-tipped-for-release-in-first-half-of-2026-hotly-anticipated-chips-to-reportedly-debut-on-dell-and-lenovo-laptops

#spark #dgx #gb10 #rtx5070 #soc #laptops

AI Daily Post @[email protected] · 2026-01-09 · 05:58 UTC

Dell's new Pro Max workstation, powered by the GB10 chip, brings on‑device AI to the edge—supporting massive 70‑billion‑parameter models without a cloud. Paired with NVIDIA’s Grace Blackwell architecture, it promises developers full freedom to train and run AI locally. Discover how this could reshape edge AI workflows. #DellProMax #GB10 #OnDeviceAI #GraceBlackwell

🔗 https://aidailypost.com/news/dell-launches-pro-max-gb10-support-ondevice-ai-development

#dellpromax #gb10 #ondeviceai #graceblackwell

United Kingdom News Beep @[email protected] · 2026-01-03 · 07:00 UTC

Building Our Office Storage for the NVIDIA GB10 Agent AI Cluster – Page 2 of 4

Going Fast(er) with an All-Flash Local NAS for our AI Agent Machines As a quick recap, a big…
#NewsBeep #News #Artificialintelligence #AI #AMD #Apple #ArtificialIntelligence #GB10 #MikroTik #Nvidia #QNAP #Solidigm #Technology #UK #UnitedKingdom
https://www.newsbeep.com/uk/349500/

#unitedkingdom #uk #technology #solidigm #qnap #nvidia

JTI @[email protected] · 2025-12-12 · 17:35 UTC

I see a lot of blank, outright rejection of #AI, LLMs general or coding LLMs like #ClaudeCode in special here on the Fediverse.
Often, the actual impact of the AI / #LLM in use is not even understood by those criticizing it, at times leading to tantrums about AI where there is....no AI involved.

The technology (LLM et al) in itself is not likely to go away for a few more years. The smaller #ML variations that aren't being yapped about as much are going to remain here as they have been for the past decades.
I assume that what will indeed happen is a move from centralized cloud models to on-prem hardware as the hardware becomes more powerful and the models more efficient. Think migration from the large mainframes to the desktop PCs. We're seeing a start of this with devices such as the ASUS Ascent #GX10 / #Nvidia #GB10.

Imagine having the power of #Claude under your desk, powered for free by #solar cells on your roof with some nice solar powered AC to go with it.

Would it not be wise to accept the reality of the existence of this technology and find out how this can be used in a good way that would improve lives? And how smart, small regulation can be built and enforced that balances innovation and risks to get closer to #startrek(tm)?

Low-key reminds me of the Maschinenstürmer of past times...

#ai #claudecode #llm #ml #gx10 #nvidia

JTI @[email protected] · 2025-12-12 · 17:35 UTC

I see a lot of blank, outright rejection of #AI, LLMs general or coding LLMs like #ClaudeCode in special here on the Fediverse.
Often, the actual impact of the AI / #LLM in use is not even understood by those criticizing it, at times leading to tantrums about AI where there is....no AI involved.

The technology (LLM et al) in itself is not likely to go away for a few more years. The smaller #ML variations that aren't being yapped about as much are going to remain here as they have been for the past decades.
I assume that what will indeed happen is a move from centralized cloud models to on-prem hardware as the hardware becomes more powerful and the models more efficient. Think migration from the large mainframes to the desktop PCs. We're seeing a start of this with devices such as the ASUS Ascent #GX10 / #Nvidia #GB10.

Imagine having the power of #Claude under your desk, powered for free by #solar cells on your roof with some nice solar powered AC to go with it.

Would it not be wise to accept the reality of the existence of this technology and find out how this can be used in a good way that would improve lives? And how smart, small regulation can be built and enforced that balances innovation and risks to get closer to #startrek(tm)?

Low-key reminds me of the Maschinenstürmer of past times...

#ai #claudecode #llm #ml #gx10 #nvidia

JTI @[email protected] · 2025-12-12 · 17:35 UTC

I see a lot of blank, outright rejection of #AI, LLMs general or coding LLMs like #ClaudeCode in special here on the Fediverse.
Often, the actual impact of the AI / #LLM in use is not even understood by those criticizing it, at times leading to tantrums about AI where there is....no AI involved.

The technology (LLM et al) in itself is not likely to go away for a few more years. The smaller #ML variations that aren't being yapped about as much are going to remain here as they have been for the past decades.
I assume that what will indeed happen is a move from centralized cloud models to on-prem hardware as the hardware becomes more powerful and the models more efficient. Think migration from the large mainframes to the desktop PCs. We're seeing a start of this with devices such as the ASUS Ascent #GX10 / #Nvidia #GB10.

Imagine having the power of #Claude under your desk, powered for free by #solar cells on your roof with some nice solar powered AC to go with it.

Would it not be wise to accept the reality of the existence of this technology and find out how this can be used in a good way that would improve lives? And how smart, small regulation can be built and enforced that balances innovation and risks to get closer to #startrek(tm)?

Low-key reminds me of the Maschinenstürmer of past times...

#ai #claudecode #llm #ml #gx10 #nvidia

JTI @[email protected] · 2025-12-12 · 17:35 UTC

I see a lot of blank, outright rejection of #AI, LLMs general or coding LLMs like #ClaudeCode in special here on the Fediverse.
Often, the actual impact of the AI / #LLM in use is not even understood by those criticizing it, at times leading to tantrums about AI where there is....no AI involved.

The technology (LLM et al) in itself is not likely to go away for a few more years. The smaller #ML variations that aren't being yapped about as much are going to remain here as they have been for the past decades.
I assume that what will indeed happen is a move from centralized cloud models to on-prem hardware as the hardware becomes more powerful and the models more efficient. Think migration from the large mainframes to the desktop PCs. We're seeing a start of this with devices such as the ASUS Ascent #GX10 / #Nvidia #GB10.

Imagine having the power of #Claude under your desk, powered for free by #solar cells on your roof with some nice solar powered AC to go with it.

Would it not be wise to accept the reality of the existence of this technology and find out how this can be used in a good way that would improve lives? And how smart, small regulation can be built and enforced that balances innovation and risks to get closer to #startrek(tm)?

Low-key reminds me of the Maschinenstürmer of past times...

#startrek #solar #claude #gb10 #nvidia #gx10

JTI @[email protected] · 2025-12-12 · 17:35 UTC

I see a lot of blank, outright rejection of #AI, LLMs general or coding LLMs like #ClaudeCode in special here on the Fediverse.
Often, the actual impact of the AI / #LLM in use is not even understood by those criticizing it, at times leading to tantrums about AI where there is....no AI involved.

The technology (LLM et al) in itself is not likely to go away for a few more years. The smaller #ML variations that aren't being yapped about as much are going to remain here as they have been for the past decades.
I assume that what will indeed happen is a move from centralized cloud models to on-prem hardware as the hardware becomes more powerful and the models more efficient. Think migration from the large mainframes to the desktop PCs. We're seeing a start of this with devices such as the ASUS Ascent #GX10 / #Nvidia #GB10.

Imagine having the power of #Claude under your desk, powered for free by #solar cells on your roof with some nice solar powered AC to go with it.

Would it not be wise to accept the reality of the existence of this technology and find out how this can be used in a good way that would improve lives? And how smart, small regulation can be built and enforced that balances innovation and risks to get closer to #startrek(tm)?

Low-key reminds me of the Maschinenstürmer of past times...

#ai #claudecode #llm #ml #gx10 #nvidia

Polska @[email protected] · 2025-11-22 · 01:16 UTC

https://www.europesays.com/pl/112687/ MSI EdgeXpert MS-C931 – Zestaw komputerowy z układem NVIDIA GB10 Grace Blackwell. Solidna specyfikacja i wysoka cena #ai #cena #fp4 #GB10 #GraceBlackwell #lpddr5x #MiniPc #MocObliczeniowa #MSI #MsiEdgexpertMsC931 #Nauka #NaukaITechnika #NaukaTechnika #NvidiaDgxOs #NvidiaGb10 #PL #Poland #Polish #Polska #Polski #Science #ScienceAndTechnology #ScienceTechnology #specyfikacja #SztucznaInteligencja #Technika #Technology #TOPS #zapowiedz #ZestawyKomputerowe

#zestawykomputerowe #zapowiedz #tops #technology #technika #sztucznainteligencja

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @[email protected] · 2025-03-19 · 19:35 UTC

#Nvidia unveils #DGX Station workstation PCs with #GB300 #BlackwellUltra inside
By launching its #DGXStation based on the GB300 Desktop Superchip platform as well as the #DGXSpark powered by the #GB10 #GraceBlackwell Superchip, Nvidia sets the stage for its #Arm-based desktop workstation platforms that will be aimed at broader market segments beyond data scientists, researchers, and software developers.
https://www.tomshardware.com/tech-industry/artificial-intelligence/nvidia-unveils-dgx-station-workstation-pcs-gb300-blackwell-ultra-inside #ProjectDIGITS

#nvidia #dgx #gb300 #blackwellultra #dgxstation #dgxspark

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @[email protected] · 2025-03-19 · 19:35 UTC

#Nvidia unveils #DGX Station workstation PCs with #GB300 #BlackwellUltra inside
By launching its #DGXStation based on the GB300 Desktop Superchip platform as well as the #DGXSpark powered by the #GB10 #GraceBlackwell Superchip, Nvidia sets the stage for its #Arm-based desktop workstation platforms that will be aimed at broader market segments beyond data scientists, researchers, and software developers.
https://www.tomshardware.com/tech-industry/artificial-intelligence/nvidia-unveils-dgx-station-workstation-pcs-gb300-blackwell-ultra-inside #ProjectDIGITS

#nvidia #dgx #gb300 #blackwellultra #dgxstation #dgxspark

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @[email protected] · 2025-03-19 · 19:35 UTC

#Nvidia unveils #DGX Station workstation PCs with #GB300 #BlackwellUltra inside
By launching its #DGXStation based on the GB300 Desktop Superchip platform as well as the #DGXSpark powered by the #GB10 #GraceBlackwell Superchip, Nvidia sets the stage for its #Arm-based desktop workstation platforms that will be aimed at broader market segments beyond data scientists, researchers, and software developers.
https://www.tomshardware.com/tech-industry/artificial-intelligence/nvidia-unveils-dgx-station-workstation-pcs-gb300-blackwell-ultra-inside #ProjectDIGITS

#nvidia #dgx #gb300 #blackwellultra #dgxstation #dgxspark

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @BenjaminHCCarr · 2025-03-19 · 19:35 UTC

#Nvidia unveils #DGX Station workstation PCs with #GB300 #BlackwellUltra inside
By launching its #DGXStation based on the GB300 Desktop Superchip platform as well as the #DGXSpark powered by the #GB10 #GraceBlackwell Superchip, Nvidia sets the stage for its #Arm-based desktop workstation platforms that will be aimed at broader market segments beyond data scientists, researchers, and software developers.
https://www.tomshardware.com/tech-industry/artificial-intelligence/nvidia-unveils-dgx-station-workstation-pcs-gb300-blackwell-ultra-inside #ProjectDIGITS

#nvidia #dgx #gb300 #blackwellultra #dgxstation #dgxspark

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @[email protected] · 2025-03-19 · 19:35 UTC

#Nvidia unveils #DGX Station workstation PCs with #GB300 #BlackwellUltra inside
By launching its #DGXStation based on the GB300 Desktop Superchip platform as well as the #DGXSpark powered by the #GB10 #GraceBlackwell Superchip, Nvidia sets the stage for its #Arm-based desktop workstation platforms that will be aimed at broader market segments beyond data scientists, researchers, and software developers.
https://www.tomshardware.com/tech-industry/artificial-intelligence/nvidia-unveils-dgx-station-workstation-pcs-gb300-blackwell-ultra-inside #ProjectDIGITS

#projectdigits #arm #graceblackwell #gb10 #dgxspark #dgxstation

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @[email protected] · 2025-01-08 · 22:21 UTC

#NVIDIA #ProjectDIGITS Explained: #AI Power in a Compact $3,000 Package
At the heart of Project DIGITS is NVIDIA #GB10 Superchip, a system-on-a-chip (SoC) that delivers up to 1 petaflop of AI performance at #FP4 precision. The GB10 combines an NVIDIA Blackwell GPU with next-generation #CUDA cores, Tensor Cores, and a high-performance NVIDIA Grace CPU featuring 20 power-efficient Arm-based cores. Each system includes 128GB of unified memory and up to 4TB of NVMe storage.
https://www.storagereview.com/news/nvidia-project-digits-explained-ai-power-in-a-compact-package

#nvidia #projectdigits #ai #gb10 #fp4 #cuda

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @[email protected] · 2025-01-08 · 22:21 UTC

#NVIDIA #ProjectDIGITS Explained: #AI Power in a Compact $3,000 Package
At the heart of Project DIGITS is NVIDIA #GB10 Superchip, a system-on-a-chip (SoC) that delivers up to 1 petaflop of AI performance at #FP4 precision. The GB10 combines an NVIDIA Blackwell GPU with next-generation #CUDA cores, Tensor Cores, and a high-performance NVIDIA Grace CPU featuring 20 power-efficient Arm-based cores. Each system includes 128GB of unified memory and up to 4TB of NVMe storage.
https://www.storagereview.com/news/nvidia-project-digits-explained-ai-power-in-a-compact-package

#nvidia #projectdigits #ai #gb10 #fp4 #cuda

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @BenjaminHCCarr · 2025-01-08 · 22:21 UTC

#NVIDIA #ProjectDIGITS Explained: #AI Power in a Compact $3,000 Package
At the heart of Project DIGITS is NVIDIA #GB10 Superchip, a system-on-a-chip (SoC) that delivers up to 1 petaflop of AI performance at #FP4 precision. The GB10 combines an NVIDIA Blackwell GPU with next-generation #CUDA cores, Tensor Cores, and a high-performance NVIDIA Grace CPU featuring 20 power-efficient Arm-based cores. Each system includes 128GB of unified memory and up to 4TB of NVMe storage.
https://www.storagereview.com/news/nvidia-project-digits-explained-ai-power-in-a-compact-package

#nvidia #projectdigits #ai #gb10 #fp4 #cuda

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @[email protected] · 2025-01-08 · 22:21 UTC

#NVIDIA #ProjectDIGITS Explained: #AI Power in a Compact $3,000 Package
At the heart of Project DIGITS is NVIDIA #GB10 Superchip, a system-on-a-chip (SoC) that delivers up to 1 petaflop of AI performance at #FP4 precision. The GB10 combines an NVIDIA Blackwell GPU with next-generation #CUDA cores, Tensor Cores, and a high-performance NVIDIA Grace CPU featuring 20 power-efficient Arm-based cores. Each system includes 128GB of unified memory and up to 4TB of NVMe storage.
https://www.storagereview.com/news/nvidia-project-digits-explained-ai-power-in-a-compact-package

#cuda #fp4 #gb10 #ai #projectdigits #nvidia

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 @[email protected] · 2025-01-08 · 22:21 UTC

#NVIDIA #ProjectDIGITS Explained: #AI Power in a Compact $3,000 Package
At the heart of Project DIGITS is NVIDIA #GB10 Superchip, a system-on-a-chip (SoC) that delivers up to 1 petaflop of AI performance at #FP4 precision. The GB10 combines an NVIDIA Blackwell GPU with next-generation #CUDA cores, Tensor Cores, and a high-performance NVIDIA Grace CPU featuring 20 power-efficient Arm-based cores. Each system includes 128GB of unified memory and up to 4TB of NVMe storage.
https://www.storagereview.com/news/nvidia-project-digits-explained-ai-power-in-a-compact-package

#nvidia #projectdigits #ai #gb10 #fp4 #cuda