#ml — Public Fediverse posts on home.social

mempko @mempko · 2026-05-29 · 01:02 UTC

Would it be crazy for me to take my Abject project and turn it into a proper OS?

Think something like firefox OS which was based on linux. Boot into Abject. Then get a machine and install it. Am I taking "personal" computing too far if I do that?

https://abject.world

#AI #ML #tech #OS #software #programming #OpenSource #Idea

#ai #ml #tech #os #software #programming

Habr @[email protected] · 2026-05-28 · 16:02 UTC

Inside AI Meetup — как это было? Делимся записями докладов, фото и атмосферой

Привет! 20 мая прошел Inside AI Meetup от Wildberries & Russ — про практические кейсы внедрения ИИ: векторный поиск и модерация с 200+ моделями, AIOps для ML/GenAI-сервисов, RAG без галлюцинаций, запуск LLM-продуктов, генерация текстов из видео, поиск и рекомендации. В программе были кейсы от опыт Wildberries & Russ, MWS, Avito, VK, M2, МФТИ, Сбера, red_mad_robot и Альфа-Банка, а еще новые знакомства и полезный нетворкинг. В статье вы найдете видеозаписи с ивента и фото . Узнать больше

https://habr.com/ru/companies/wildberries/articles/1040624/

#ai #ии #искуственный_интеллект #ml #machine_learning #машинное_обучение #митап #ds #data_science #meetup

#meetup #data_science #ds #митап #машинное_обучение #machine_learning

PugJesus @[email protected] · 2026-05-28 · 13:48 UTC

MORE 👏 FEMALE 👏 STASI 👏 AGENTS

https://piefed.social/c/tankiejerk/p/2096159/more-female-stasi-agents

#politics #meme #authoritarianism #mls #ml #hypocrisy

Andrei Kucharavy @[email protected] · 2026-05-28 · 10:15 UTC

Come join the #Apertus LLM team as an AI research engineer!

If you have experience with software, data, and ML engineering, a passion for #FOSS and interesting in post-training of large models (#SFT, #RL, rewards design, ...), you could be a great fit for the recently opened roles, all in Lausanne, Switzerland!

https://careers.epfl.ch/job/Lausanne-AI-Research-Engineers-Apertus-Initiative/1164610655/

#Fedihire #Job #Swtizerland #FOSS #ML #AI

#apertus #foss #rl #fedihire #job #swtizerland

PugJesus @[email protected] · 2026-05-28 · 06:50 UTC

ANTI-IMPERIALIST AXIS HOURS

https://piefed.social/c/tankiejerk/p/2095272/anti-imperialist-axis-hours

#politics #russia #meme #fascism #china #iran

Gary McGraw @[email protected] · 2026-05-27 · 16:33 UTC

Taylor has something to say about BIML's new work. #MLsec #swsec #appsec #infosec #LLM #ML #AI

https://medium.com/@taylor.armerding/we-know-ai-tools-can-be-used-for-good-and-evil-714a7aa8e4b3?postPublishedType=initial

#mlsec #swsec #appsec #infosec #llm #ml

Habr @[email protected] · 2026-05-27 · 13:22 UTC

Поднимаем Llama 3 в облаке: Ollama и Open WebUI

Локально запустить LLM сегодня можно за десять минут — например, с помощью LM Studio. Но как только модели нужно дать доступ команде, подключить RAG или встроить ее в сервис — такого подхода зачастую недостаточно. В этой статье мы разберем, как развернуть LLM на сервере, какие ресурсы для этого понадобятся и с какими сложностями можно столкнуться.

https://habr.com/ru/companies/selectel/articles/1040038/

#llmмодели #selfhosted #ollama #selectel #ai #ml #llm #lmstudio

#lmstudio #llm #ml #ai #selectel #ollama

Habr @[email protected] · 2026-05-27 · 13:22 UTC

Поднимаем Llama 3 в облаке: Ollama и Open WebUI

Локально запустить LLM сегодня можно за десять минут — например, с помощью LM Studio. Но как только модели нужно дать доступ команде, подключить RAG или встроить ее в сервис — такого подхода зачастую недостаточно. В этой статье мы разберем, как развернуть LLM на сервере, какие ресурсы для этого понадобятся и с какими сложностями можно столкнуться.

https://habr.com/ru/companies/selectel/articles/1040038/

#llmмодели #selfhosted #ollama #selectel #ai #ml #llm #lmstudio

#lmstudio #llm #ml #ai #selectel #ollama

Habr @[email protected] · 2026-05-27 · 13:22 UTC

Поднимаем Llama 3 в облаке: Ollama и Open WebUI

Локально запустить LLM сегодня можно за десять минут — например, с помощью LM Studio. Но как только модели нужно дать доступ команде, подключить RAG или встроить ее в сервис — такого подхода зачастую недостаточно. В этой статье мы разберем, как развернуть LLM на сервере, какие ресурсы для этого понадобятся и с какими сложностями можно столкнуться.

https://habr.com/ru/companies/selectel/articles/1040038/

#llmмодели #selfhosted #ollama #selectel #ai #ml #llm #lmstudio

#lmstudio #llm #ml #ai #selectel #ollama

Habr @[email protected] · 2026-05-27 · 13:22 UTC

Поднимаем Llama 3 в облаке: Ollama и Open WebUI

Локально запустить LLM сегодня можно за десять минут — например, с помощью LM Studio. Но как только модели нужно дать доступ команде, подключить RAG или встроить ее в сервис — такого подхода зачастую недостаточно. В этой статье мы разберем, как развернуть LLM на сервере, какие ресурсы для этого понадобятся и с какими сложностями можно столкнуться.

https://habr.com/ru/companies/selectel/articles/1040038/

#llmмодели #selfhosted #ollama #selectel #ai #ml #llm #lmstudio

#llmмодели #selfhosted #ollama #selectel #ai #ml

Ireland @[email protected] · 2026-05-27 · 10:44 UTC

https://www.europesays.com/ie/505006/ Azure Logic Apps Adds Sandboxed Code Interpreters to Agent Workflows #Agents #AI #AIArchitecture #Architecture&Design #azure #AzureLogicAppsAgents #Cloud #Development #devops #Éire #IE #iPaaS #Ireland #LogicApps #LowCode #ML&DataEngineering #Technology

#technology #ml #lowcode #logicapps #ireland #ipaas

Mr. Will @[email protected] · 2026-05-27 · 10:02 UTC

AI bots reply to everything. "lol" gets a paragraph. "ok" triggers an essay. Nobody taught them the more basic skill: deciding whether to speak at all.

I built Shrug to fix that. It's a reply-decision model, not a generator — one number, the probability a reply is worth making.

Wrote about how it works, why humans aren't threshold machines, and how I trained it in 38 minutes.

→ https://aka.mrwillcom.com/shrug-post

#ai #ml #machinelearning

Habr @[email protected] · 2026-05-27 · 09:12 UTC

Экономия GPU-часов в 2,5 раза, уход ИИ в бэкенд и новые стандарты агентских систем: ML-дайджест

Пока инфо-бизнесмены продают очередные курсы по промпт-инжинирингу, в индустрии пересобирают саму архитектуру ИИ-систем. Главные вызовы сегодня лежат в плоскости ML-инфраструктуры: как запустить автономных агентов на проде, снизить latency и не обанкротиться на обучении моделей с нуля. В майском выпуске разбираем свежие архитектурные подходы, новое железо и софт, которые меняют экономику современных нейросетей.

https://habr.com/ru/companies/selectel/articles/1039992/

#selectel #LLM #ai #ml #искусственный_интеллект #дайджест #железо_и_софт #nvidia #amd #sambanova

#sambanova #amd #nvidia #железо_и_софт #дайджест #искусственный_интеллект

Habr @[email protected] · 2026-05-27 · 07:22 UTC

[Перевод] Масштабирование LLM: от одного чипа до ЦОДа. Глава 3. Траснформеры

Это продолжение цикла статей о масштабировании тренировки и инференса LLM. Предыдущая статья А теперь перейдем к чему-то более практическому, а именно к тому, сколько нужно FLOPs и байт для работы трансформера. Подразумевается, что у вас уже есть представление о том, что такое архитектура трансформера, как работает механизм внимания и т.д. Давайте начнем с векторов x, y и матриц A, B, имеющих вот такие размеры, допустим один элемент занимает при этом один байт.

https://habr.com/ru/articles/1039208/

#ai #ml #gpu #gpu_вычисления #трансформеры #анализ_и_проектирование_систем

#анализ_и_проектирование_систем #трансформеры #gpu_вычисления #gpu #ml #ai

AIagent.at 🤖 AI News @[email protected] · 2026-05-27 · 04:56 UTC

#DuckDuckGo saw a 30% increase in U.S. app installs and a 22.7% increase in visits to its AI-free search page following #Google’s announcement of its #AI driven search overhaul. Users are concerned about Google’s #AIintegration, citing issues with accuracy and control. DuckDuckGo emphasises user choice and #privacy, offering both AI and non-AI search options. https://techcrunch.com/2026/05/26/duckduckgo-installs-are-up-30-as-users-reject-being-force-fed-googles-ai-search/?AIagents.at #AIagent #AI #ML #NLP #LLM #GenAI

#duckduckgo #google #ai #aiintegration #privacy #aiagent

AIagent.at 🤖 AI News @[email protected] · 2026-05-27 · 04:56 UTC

#DuckDuckGo saw a 30% increase in U.S. app installs and a 22.7% increase in visits to its AI-free search page following #Google’s announcement of its #AI driven search overhaul. Users are concerned about Google’s #AIintegration, citing issues with accuracy and control. DuckDuckGo emphasises user choice and #privacy, offering both AI and non-AI search options. https://techcrunch.com/2026/05/26/duckduckgo-installs-are-up-30-as-users-reject-being-force-fed-googles-ai-search/?AIagents.at #AIagent #AI #ML #NLP #LLM #GenAI

#duckduckgo #google #ai #aiintegration #privacy #aiagent

AIagent.at 🤖 AI News @[email protected] · 2026-05-27 · 04:56 UTC

#DuckDuckGo saw a 30% increase in U.S. app installs and a 22.7% increase in visits to its AI-free search page following #Google’s announcement of its #AI driven search overhaul. Users are concerned about Google’s #AIintegration, citing issues with accuracy and control. DuckDuckGo emphasises user choice and #privacy, offering both AI and non-AI search options. https://techcrunch.com/2026/05/26/duckduckgo-installs-are-up-30-as-users-reject-being-force-fed-googles-ai-search/?AIagents.at #AIagent #AI #ML #NLP #LLM #GenAI

#duckduckgo #google #ai #aiintegration #privacy #aiagent

AIagent.at 🤖 AI News @[email protected] · 2026-05-27 · 04:56 UTC

#DuckDuckGo saw a 30% increase in U.S. app installs and a 22.7% increase in visits to its AI-free search page following #Google’s announcement of its #AI driven search overhaul. Users are concerned about Google’s #AIintegration, citing issues with accuracy and control. DuckDuckGo emphasises user choice and #privacy, offering both AI and non-AI search options. https://techcrunch.com/2026/05/26/duckduckgo-installs-are-up-30-as-users-reject-being-force-fed-googles-ai-search/?AIagents.at #AIagent #AI #ML #NLP #LLM #GenAI

#genai #llm #nlp #ml #aiagent #privacy

AIagent.at 🤖 AI News @[email protected] · 2026-05-27 · 04:56 UTC

#DuckDuckGo saw a 30% increase in U.S. app installs and a 22.7% increase in visits to its AI-free search page following #Google’s announcement of its #AI driven search overhaul. Users are concerned about Google’s #AIintegration, citing issues with accuracy and control. DuckDuckGo emphasises user choice and #privacy, offering both AI and non-AI search options. https://techcrunch.com/2026/05/26/duckduckgo-installs-are-up-30-as-users-reject-being-force-fed-googles-ai-search/?AIagents.at #AIagent #AI #ML #NLP #LLM #GenAI

#duckduckgo #google #ai #aiintegration #privacy #aiagent

Matthew Sheffield @[email protected] · 2026-05-26 · 22:44 UTC

I'm reading through this now and it seems like the authors love to define terms and then never use them again, including bizarre ones like "gods."

It's not light reading, but it's directly related to #ml

https://cimc.ai/cimcHypothesis.pdf

#philosophy

#ml #philosophy

PugJesus @[email protected] · 2026-05-26 · 10:41 UTC

End state of tankie 'thought'

#politics #ukraine #genocide #meme #china #holocaust

PugJesus @[email protected] · 2026-05-26 · 10:41 UTC

End state of tankie 'thought'

#politics #ukraine #genocide #meme #china #holocaust

PugJesus @[email protected] · 2026-05-26 · 10:41 UTC

End state of tankie 'thought'

#politics #ukraine #genocide #meme #china #holocaust

PugJesus @[email protected] · 2026-05-26 · 10:41 UTC

End state of tankie 'thought'

#justaskingquestions #leninism #genocidedenialism #atrocities #xijinping #bolshevik

PugJesus @[email protected] · 2026-05-26 · 10:41 UTC

End state of tankie 'thought'

#politics #ukraine #genocide #meme #china #holocaust

Uncle Joe @[email protected] · 2026-05-26 · 08:09 UTC

Did you read "No Security Meter for AI" (ref: berryvilleiml.com/docs/no-secu...) If you did, you know that AI should not handle the threat modelling for your software without you double-checking the output. #security #appsec #threatmodeling #ai #machinelearning #ml #games

berryvilleiml.com/docs/no-secu.....

#security #appsec #threatmodeling #ai #machinelearning #ml

Uncle Joe @[email protected] · 2026-05-26 · 08:07 UTC

Did you read "No Security Meter for AI" (ref: berryvilleiml.com/docs/no-secu...) If you did, you know that AI should not handle the threat modelling for your software without you double-checking the output. #security #appsec #threatmodeling #ai #machinelearning #ml

berryvilleiml.com/docs/no-securi...

#security #appsec #threatmodeling #ai #machinelearning #ml

Habr @[email protected] · 2026-05-26 · 07:42 UTC

DRAйверы для GPU: как Kubernetes научился выделять устройства через стандартный API

Device Plugin в Kubernetes сводит GPU к счётчику на узле: планировщик видит только количество устройств, но не их профиль, объём памяти или режим шаринга. Для ML-задач это быстро становится ограничением. Обучению нужны выделенные карточки целиком, инференсу — управляемые доли, а CI хватит и четвертинки NVIDIA H100 на пять минут. Dynamic Resource Allocation полностью меняет модель управления устройствами. GPU становятся сущностью с инвентарём, атрибутами и правилами выбора. В статье я разбираю устройство DRA и показываю миграцию с device plugin на примере кластера из 8 узлов × 8 NVIDIA H100 без полного переписывания манифестов. А ещё объясняю, почему мы в Deckhouse пишем свой DRA-драйвер. Разобраться с DRA

https://habr.com/ru/companies/flant/articles/1038000/

#gpu #kubernetes #deckhouse_kubernetes_platform #ai #ml #dra #machine_learning

#machine_learning #dra #ml #ai #deckhouse_kubernetes_platform #kubernetes

Habr @[email protected] · 2026-05-26 · 07:42 UTC

DRAйверы для GPU: как Kubernetes научился выделять устройства через стандартный API

Device Plugin в Kubernetes сводит GPU к счётчику на узле: планировщик видит только количество устройств, но не их профиль, объём памяти или режим шаринга. Для ML-задач это быстро становится ограничением. Обучению нужны выделенные карточки целиком, инференсу — управляемые доли, а CI хватит и четвертинки NVIDIA H100 на пять минут. Dynamic Resource Allocation полностью меняет модель управления устройствами. GPU становятся сущностью с инвентарём, атрибутами и правилами выбора. В статье я разбираю устройство DRA и показываю миграцию с device plugin на примере кластера из 8 узлов × 8 NVIDIA H100 без полного переписывания манифестов. А ещё объясняю, почему мы в Deckhouse пишем свой DRA-драйвер. Разобраться с DRA

https://habr.com/ru/companies/flant/articles/1038000/

#gpu #kubernetes #deckhouse_kubernetes_platform #ai #ml #dra #machine_learning

#machine_learning #dra #ml #ai #deckhouse_kubernetes_platform #kubernetes

Habr @[email protected] · 2026-05-26 · 07:42 UTC

DRAйверы для GPU: как Kubernetes научился выделять устройства через стандартный API

Device Plugin в Kubernetes сводит GPU к счётчику на узле: планировщик видит только количество устройств, но не их профиль, объём памяти или режим шаринга. Для ML-задач это быстро становится ограничением. Обучению нужны выделенные карточки целиком, инференсу — управляемые доли, а CI хватит и четвертинки NVIDIA H100 на пять минут. Dynamic Resource Allocation полностью меняет модель управления устройствами. GPU становятся сущностью с инвентарём, атрибутами и правилами выбора. В статье я разбираю устройство DRA и показываю миграцию с device plugin на примере кластера из 8 узлов × 8 NVIDIA H100 без полного переписывания манифестов. А ещё объясняю, почему мы в Deckhouse пишем свой DRA-драйвер. Разобраться с DRA

https://habr.com/ru/companies/flant/articles/1038000/

#gpu #kubernetes #deckhouse_kubernetes_platform #ai #ml #dra #machine_learning

#machine_learning #dra #ml #ai #deckhouse_kubernetes_platform #kubernetes

Habr @[email protected] · 2026-05-26 · 07:42 UTC

DRAйверы для GPU: как Kubernetes научился выделять устройства через стандартный API

Device Plugin в Kubernetes сводит GPU к счётчику на узле: планировщик видит только количество устройств, но не их профиль, объём памяти или режим шаринга. Для ML-задач это быстро становится ограничением. Обучению нужны выделенные карточки целиком, инференсу — управляемые доли, а CI хватит и четвертинки NVIDIA H100 на пять минут. Dynamic Resource Allocation полностью меняет модель управления устройствами. GPU становятся сущностью с инвентарём, атрибутами и правилами выбора. В статье я разбираю устройство DRA и показываю миграцию с device plugin на примере кластера из 8 узлов × 8 NVIDIA H100 без полного переписывания манифестов. А ещё объясняю, почему мы в Deckhouse пишем свой DRA-драйвер. Разобраться с DRA

https://habr.com/ru/companies/flant/articles/1038000/

#gpu #kubernetes #deckhouse_kubernetes_platform #ai #ml #dra #machine_learning

#gpu #kubernetes #deckhouse_kubernetes_platform #ai #ml #dra

PugJesus @[email protected] · 2026-05-26 · 00:25 UTC

Democracy is when single-party state with single-candidate elections

https://piefed.social/c/tankiejerk/p/2089880/democracy-is-when-single-party-state-with-single-candidate-elections

#politics #anarchism #socialism #meme #china #cuba

PugJesus @[email protected] · 2026-05-26 · 00:25 UTC

Democracy is when single-party state with single-candidate elections

#politics #anarchism #socialism #meme #china #cuba

PugJesus @[email protected] · 2026-05-26 · 00:25 UTC

Democracy is when single-party state with single-candidate elections

#politics #anarchism #socialism #meme #china #cuba

PugJesus @[email protected] · 2026-05-26 · 00:25 UTC

Democracy is when single-party state with single-candidate elections

https://piefed.social/c/tankiejerk/p/2089880/democracy-is-when-single-party-state-with-single-candidate-elections

#leninism #genocidedenial #leftism #northkorea #sovietunion #prc

PugJesus @[email protected] · 2026-05-26 · 00:25 UTC

Democracy is when single-party state with single-candidate elections

https://piefed.social/c/tankiejerk/p/2089880/democracy-is-when-single-party-state-with-single-candidate-elections

#politics #anarchism #socialism #meme #china #cuba

Ken Everett (Ken's Blogspot) @[email protected] · 2026-05-25 · 19:17 UTC

Over a dozen beaches in #Donegal and #Derry receive Blue -
https://kensbookinfo.blogspot.com/p/ireland.html#ITTN

#Drake pulls historic hat trick, claiming top 3 spots -
https://kensbookinfo.blogspot.com/p/etc.html#India

#Pope Leo warns #AI warfare 'not permissible' and autonomous -
https://kensbookinfo.blogspot.com/p/news.html#4

What election polling teaches us about #ML-based email -
https://kensbookinfo.blogspot.com/p/cities.html#86a

#Trump’s Self-Indulgence Deepens #GOP Fears in Midte -
https://kensbookinfo.blogspot.com/p/politics.html#31

View all news from Ireland https://kensbookinfo.blogspot.com/2026/03/latest-news-from-ireland.html

#donegal #derry #drake #pope #ai #ml

Ken Everett (Ken's Blogspot) @[email protected] · 2026-05-25 · 19:17 UTC

Over a dozen beaches in #Donegal and #Derry receive Blue -
https://kensbookinfo.blogspot.com/p/ireland.html#ITTN

#Drake pulls historic hat trick, claiming top 3 spots -
https://kensbookinfo.blogspot.com/p/etc.html#India

#Pope Leo warns #AI warfare 'not permissible' and autonomous -
https://kensbookinfo.blogspot.com/p/news.html#4

What election polling teaches us about #ML-based email -
https://kensbookinfo.blogspot.com/p/cities.html#86a

#Trump’s Self-Indulgence Deepens #GOP Fears in Midte -
https://kensbookinfo.blogspot.com/p/politics.html#31

View all news from Ireland https://kensbookinfo.blogspot.com/2026/03/latest-news-from-ireland.html

#donegal #derry #drake #pope #ai #ml

Ken Everett (Ken's Blogspot) @[email protected] · 2026-05-25 · 19:17 UTC

Over a dozen beaches in #Donegal and #Derry receive Blue -
https://kensbookinfo.blogspot.com/p/ireland.html#ITTN

#Drake pulls historic hat trick, claiming top 3 spots -
https://kensbookinfo.blogspot.com/p/etc.html#India

#Pope Leo warns #AI warfare 'not permissible' and autonomous -
https://kensbookinfo.blogspot.com/p/news.html#4

What election polling teaches us about #ML-based email -
https://kensbookinfo.blogspot.com/p/cities.html#86a

#Trump’s Self-Indulgence Deepens #GOP Fears in Midte -
https://kensbookinfo.blogspot.com/p/politics.html#31

View all news from Ireland https://kensbookinfo.blogspot.com/2026/03/latest-news-from-ireland.html

#donegal #derry #drake #pope #ai #ml

Ken Everett (Ken's Blogspot) @[email protected] · 2026-05-25 · 19:17 UTC

Over a dozen beaches in #Donegal and #Derry receive Blue -
https://kensbookinfo.blogspot.com/p/ireland.html#ITTN

#Drake pulls historic hat trick, claiming top 3 spots -
https://kensbookinfo.blogspot.com/p/etc.html#India

#Pope Leo warns #AI warfare 'not permissible' and autonomous -
https://kensbookinfo.blogspot.com/p/news.html#4

What election polling teaches us about #ML-based email -
https://kensbookinfo.blogspot.com/p/cities.html#86a

#Trump’s Self-Indulgence Deepens #GOP Fears in Midte -
https://kensbookinfo.blogspot.com/p/politics.html#31

View all news from Ireland https://kensbookinfo.blogspot.com/2026/03/latest-news-from-ireland.html

#gop #trump #ml #ai #pope #drake

Ken Everett (Ken's Blogspot) @[email protected] · 2026-05-25 · 19:17 UTC

Over a dozen beaches in #Donegal and #Derry receive Blue -
https://kensbookinfo.blogspot.com/p/ireland.html#ITTN

#Drake pulls historic hat trick, claiming top 3 spots -
https://kensbookinfo.blogspot.com/p/etc.html#India

#Pope Leo warns #AI warfare 'not permissible' and autonomous -
https://kensbookinfo.blogspot.com/p/news.html#4

What election polling teaches us about #ML-based email -
https://kensbookinfo.blogspot.com/p/cities.html#86a

#Trump’s Self-Indulgence Deepens #GOP Fears in Midte -
https://kensbookinfo.blogspot.com/p/politics.html#31

View all news from Ireland https://kensbookinfo.blogspot.com/2026/03/latest-news-from-ireland.html

#donegal #derry #drake #pope #ai #ml

MSvana @[email protected] · 2026-05-25 · 15:50 UTC

I work as an ML engineer (NLP and audio). Unsurprisingly, we are moving away from training custom models to finding a good prompt for an LLM.

I sometimes miss building new models from scratch. But the use of LLMs brings its own challenges that make the work fun.

I am currently spending a lot of time thinking about how to evaluate text outputs. The problems we are now solving in NLP are much harder than a few years ago. Not even humans can tell whether the output is good or not.

#ai #ml

PugJesus @[email protected] · 2026-05-25 · 08:01 UTC

Ideal life pretty aesthetic tbf, but the rest is cringe

https://piefed.social/c/tankiejerk/p/2088183/ideal-life-pretty-aesthetic-tbf-but-the-rest-is-cringe

#politics #russia #meme #ussr #stalin #mls

PugJesus @[email protected] · 2026-05-25 · 08:01 UTC

Ideal life pretty aesthetic tbf, but the rest is cringe

#politics #russia #meme #ussr #stalin #mls

PugJesus @[email protected] · 2026-05-25 · 08:01 UTC

Ideal life pretty aesthetic tbf, but the rest is cringe

#politics #russia #meme #ussr #stalin #mls

PugJesus @[email protected] · 2026-05-25 · 08:01 UTC

Ideal life pretty aesthetic tbf, but the rest is cringe

https://piefed.social/c/tankiejerk/p/2088183/ideal-life-pretty-aesthetic-tbf-but-the-rest-is-cringe

#leninism #starterpack #bolshevik #sovietunion #tankie #ml

PugJesus @[email protected] · 2026-05-25 · 08:01 UTC

Ideal life pretty aesthetic tbf, but the rest is cringe

https://piefed.social/c/tankiejerk/p/2088183/ideal-life-pretty-aesthetic-tbf-but-the-rest-is-cringe

#politics #russia #meme #ussr #stalin #mls

Habr @[email protected] · 2026-05-25 · 07:22 UTC

[Перевод] Масштабирование LLM: от одного чипа до ЦОДа. Глава 2. Шардинг

Это продолжение цикла статей о масштабировании тренировки и инференса LLM. Предыдущая глава находится по этой ссылке . Итак, с основами разобрались, давайте теперь разбираться с тем, как распихать матрицы по нескольким чипам, перемножить, а затем собрать это все в удобоваримый результат. По-умному это называется шардинг . Для начала давайте определимся, зачем этот шардинг вообще нужен. А нужен он потому что, как я уже писал в предыдущей статье, при работе с действительно большими нейронками матрицы и вектора практически никогда целиком не влезают в память одного GPU/TPU, поэтому их приходится разделять или шардировать. От того, насколько грамотно произведен шардинг, зависит то, насколько эффективно используется наш массив ускорителей, а следовательно и скорость тренировки, эффективность расхода вычислительных ресурсов и т.д. Возьмем для примера матрицу A размера [I, J] и распределим ее на 4 ускорителя:

https://habr.com/ru/articles/1037918/

#ai #ml #gpu #gpu_вычисления #анализ_и_проектирование_систем