home.social

#localai — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #localai, aggregated by home.social.

  1. RT @stableAPY: I still can't believe my 3060 12GB runs Qwen 3.6 35B at 40 tok/s. The card costs about $200 used, while everyone raves about wildly expensive 128GB unified memory or RTX 6000 cards. A single 3060 12GB is more than enough for first local AI experiments: it is cheap, and paired with some RAM and a reasonably decent CPU it gets the job done. Decode speed does drop as the context grows, and you can't run several sub-agents at once, but it is a cheap entry point. For example, it pairs very well with my 3090:

    3090 runs the main agent, 35B with -np 2, so I can have 2 parallel agents
    3060 runs a sub-agent, 35B with -np 1

    This way my main Hermes can delegate work to that sub-agent while it works on something else. I also run a Hermes cron job so they don't overload the main agent, and I don't mind that it is slower, because it happens in the background.

    more at Arint.info

    #3060 #Hardware #KI #LocalAI #OpenSource #Qwen #arint_info

    https://x.com/stableAPY/status/2054846979755200583#m
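
    A minimal sketch of driving such a pair of llama-server instances in parallel (my illustration, not the poster's code; the ports and prompts are assumptions, while the OpenAI-compatible /v1/chat/completions endpoint is standard llama-server behavior):

    ```typescript
    // Hypothetical two-box setup from the post: the 3090 serves the main agent
    // (started with -np 2, i.e. two parallel slots) and the 3060 serves a
    // sub-agent (-np 1). Ports 8080/8081 are assumptions.
    async function ask(baseUrl: string, prompt: string): Promise<string> {
      const res = await fetch(`${baseUrl}/v1/chat/completions`, {
        method: "POST",
        headers: { "Content-Type": "application/json" },
        body: JSON.stringify({ messages: [{ role: "user", content: prompt }] }),
      });
      const data = await res.json();
      return data.choices[0].message.content;
    }

    // The main agent keeps working while a delegated task runs on the other card.
    const [mainWork, delegated] = await Promise.all([
      ask("http://localhost:8080", "Continue the refactoring plan."),
      ask("http://localhost:8081", "Summarize these build logs."),
    ]);
    console.log(mainWork, delegated);
    ```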

  2. Fedora approved the AI Developer Desktop initiative to create AI-focused Atomic Desktop images with local-first tooling and no default cloud AI connections. 🤖
    Planned Fedora 45 releases include open-source AI images plus CUDA-based remixes, with hardware support for Intel, AMD, NVIDIA, and ARM. 🐧

    🔗 itsfoss.com/news/fedora-ai-dev

    #TechNews #Fedora #Ubuntu #Linux #AI #ArtificialIntelligence #OpenSource #Atomic #CUDA #Cloud #CloudAI #LocalAI #FOSS #NVIDIA #AMD #Intel #ARM #MachineLearning #Developers

  3. New week, small update: Run LLMs Locally

    Now with a new setup for OpenCode with Qwen 3.6 and Gemma 4, including permissions and thinking variants.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #stablediffusion #qwen3 #glm #localai #gemma4 #webgpu #opencode

  4. Ars Technica: Chrome’s 4GB AI model isn’t new, but you’re not wrong for being confused. “Some desktop Chrome users have also noted that the browser appears to suddenly want more storage space for AI. This is true—Chrome does download a 4GB AI model for on-device processing. It’s been doing that for years, though. Google hasn’t actually changed anything about Chrome’s on-device AI, […]

    https://rbfirehose.com/2026/05/09/ars-technica-chromes-4gb-ai-model-isnt-new-but-youre-not-wrong-for-being-confused/

  5. How to Replace Siri with a Free Local Model

    Explain the difference between local AI and cloud AI in simple terms

    #LocalAI is processed on your device, keeping all data private.
    #CloudAI is processed on a server and requires internet access.

    app.therundown.ai/guides/how-t

    #LocallyAI #gemma #gemma4 #llm #ai

  6. A $1,999 Mac mini runs a 70B parameter model that a $4,000 Windows workstation physically cannot.
    The reason: Apple Silicon's unified memory. No separate VRAM pool. No PCIe bottleneck. Just one shared memory for CPU, GPU, and Neural Engine.
    Full breakdown: buysellram.com/blog/why-mac-mi

    #ArtificialIntelligence #AI #LocalAI #MacMini #AppleSilicon #LLM #AIAgents #MachineLearning #EdgeAI #TechInfrastructure #DataPrivacy #Automation #AIHardware
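
    The back-of-envelope arithmetic behind that claim, as a rough sketch (the figures are common approximations, not taken from the article):

    ```typescript
    // Rough memory math for a 70B model at 4-bit quantization.
    const params = 70e9;          // 70 billion parameters
    const bytesPerParam = 0.5;    // ~4 bits per weight under Q4 quantization
    const weightsGB = (params * bytesPerParam) / 1e9;
    console.log(`~${weightsGB} GB for weights alone`); // ~35 GB

    // A typical discrete GPU tops out at 24 GB VRAM, so the weights cannot fit;
    // a Mac mini with 64 GB unified memory holds weights plus KV cache in one pool.
    ```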

  7. The rise of local AI is changing hardware demand in unexpected ways — and the Mac Mini is emerging as one of the biggest winners.

    What makes it interesting is not just the compact form factor. Apple Silicon’s unified memory architecture, low power consumption, quiet operation, and ability to run AI workloads locally are making the Mac Mini increasingly attractive for developers, startups, and businesses building AI agents.

    Recent reports show that higher-memory Mac Mini configurations are experiencing major shortages as AI adoption accelerates.

    This article explores:
    • Why local AI agents are growing rapidly
    • How the Mac Mini became a practical AI workstation
    • The role of unified memory for LLM workloads
    • Why developers are moving away from cloud-only AI setups
    • What this trend means for future AI infrastructure

    buysellram.com/blog/why-mac-mi

    #ArtificialIntelligence #AI #LocalAI #MacMini #AppleSilicon #LLM #AIAgents #MachineLearning #EdgeAI #DataPrivacy #Automation #AIHardware #technology

  8. New week, more slides: Run LLMs Locally

    Now with LFM 2 and new slides for using Transformers.js with WebGPU for Privacy Filter, Function Calling and Embeddings, running completely in your browser.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #stablediffusion #gptoss #qwen3 #glm #localai #gemma4 #nemotron #webgpu
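
    For reference, in-browser embeddings with Transformers.js on WebGPU look roughly like this (my sketch of the general approach, not taken from the slides; the model name is an example and a WebGPU-capable browser is assumed):

    ```typescript
    import { pipeline } from "@huggingface/transformers";

    // Build an in-browser embedding pipeline on the GPU via WebGPU.
    const embed = await pipeline("feature-extraction", "Xenova/all-MiniLM-L6-v2", {
      device: "webgpu",
    });

    // Mean-pooled, normalized sentence embedding; nothing leaves the browser.
    const output = await embed("local AI keeps data on your device", {
      pooling: "mean",
      normalize: true,
    });
    console.log(output.dims); // e.g. [1, 384]
    ```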

  9. If you are exploring AI 3D model generation for Godot and Unity, read on. Turning a text prompt or a single photo into a textured 3D model is now possible entirely on your own hardware. This guide will help you navigate the landscape, whether you use Godot, Unity, or both. We focus on free, locally runnable AI models and explain exactly which output formats they support, so you can build a seamless pipeline from generation to engine. […]

    https://blog.icod.de/2026/05/02/ai-3d-models-godot-unity-local/

  10. Ubuntu moves its AI roadmap local-first, using open-weight models and on-device inference via snaps instead of cloud-first copilots. 🐧

    Canonical frames AI as opt-in and sandboxed.
    Would you want AI features built into your OS like this, or kept separate? 🔒

    🔗 itsfoss.com/news/ubuntu-is-get

    #TechNews #Ubuntu #Linux #AI #ArtificialIntelligence #OpenSource #Privacy #LocalAI #Canonical #Snaps #MachineLearning #FOSS #OnDeviceAI #DigitalRights

  11. Has somebody researched #localAI #localLLM options for (agentic) coding?

    Der Standard's Daniel Koller published an extensive read, based on a Mac Mini with 16GB RAM.

    The current price point for a Mac Mini M4 Pro with 48GB or 64GB RAM would be ~2'200 €.

    That's quite an investment, but if AI providers all switch to usage-based pricing, it might become a viable option? 🤔 Disclaimer: You won't get the same #ClaudeCode experience out of this (yet)? That's the bet one would make - am I getting this right?

    derstandard.at/story/300000031
    #vibecoding #AI

  12. New week, new slides: Run LLMs Locally

    Now including Nemotron 3 Nano Omni from Nvidia, Llama.cpp built-in tools and new slides about using Transformers.js with WebGPU for Image Recognition and OCR.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #stablediffusion #gptoss #qwen3 #glm #localai #gemma4 #nemotron #webgpu

  13. You can use Gemma 4, the newly released #ai model by #google, fully #local on your device. This means that, after the download, you don't need internet to use the AI, and conversations are not sent to Google, which is a huge #privacy win.
    You can download the model via the Edge Gallery app without a login.

    I'm not associated with Google in any way.

    Do you use AI locally on your device?

    #gemma4 #googleai #localai #offlineai #PrivacyWins #Ai #dataprivacy #DataProtection #privateai

  14. RT @TheAhmadOsman: PRO TIP
    My agent web stack:
    - SearXNG: discovery of potential sources
    - Firecrawl: scraping and crawling of known URLs
    - Camoufox: browser fallback for JS/interaction
    Search - Extract - Interact
    P.S. Give this to your favorite agent and tell it to set these tools up for use with local models.
    Ahmad (@TheAhmadOsman): Using local LLMs? Make sure you set up web search for them. Tell your favorite agent to set up SearXNG for you. Give this to your local LLMs (tell an agent to set that up as well). Watch them become much smarter and more efficient. You're welcome: nitter.net/TheAhmadOsman/statu

    more at Arint.info

    #AIAgents #LLM #LocalAI #TechStack #WebScraping #arint_info

    https://x.com/TheAhmadOsman/status/2044142893242204550#m
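
    A hedged sketch of the first step of that stack, querying a self-hosted SearXNG instance for candidate sources (not the poster's setup; the port is an assumption and SearXNG only serves JSON if that output format is enabled in its settings):

    ```typescript
    // Ask a local SearXNG instance for candidate sources to hand to a crawler.
    const q = encodeURIComponent("llama.cpp parallel slots");
    const res = await fetch(`http://localhost:8888/search?q=${q}&format=json`);
    const { results } = await res.json();

    for (const r of results.slice(0, 5)) {
      console.log(`${r.title} -> ${r.url}`);
    }
    ```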

  15. New week, new update for the slides of my talk "Run LLMs Locally":

    Now including Gemma4 and Qwen3-Omni with Vision and Audio support and new slides describing Llama.cpp server parameters.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #stablediffusion #gptoss #qwen3 #glm #localai #gemma4

  16. OwnAether Personal AI Operating Systems: What if your entire digital life — your work, your income, your creativity, your health, your automation, your business — was orchestrated by a single intelligent layer that learns you, works for you, and evolves with you?

    medium.com/@ownaether/the-pers

    #AI #PersonalAI #IndividualAI #MyAI #YourAI #LocalAI #DesktopAI #AIApps #PrivateAI #LLMs #SLMs #AIModels #PersonalAIAssistant #PersonalAIApp

  17. It's that time again: Digital Independence Day. Every first Sunday of the month, another switch gets made. This time it's something really big and important: Alexa and Google Assistant are out, replaced by Home Assistant with its Voice PE and local AI. What happens in my house belongs to me. #digitalindependenceday #homeassistant #voicepe #localai #europe #digitalindependence #independence #digitalesouveranitat #digitaleunabhängigkeit

  18. We've been building ENGIOS because hardware deserves to live longer and software should respect the person running it.

    Today — AIDA.

    An intelligent OS deserves an intelligent heart.
    Your machine deserves a kind one.

    AIDA is the intelligence layer woven into ENGIOS. Actual local inference via Ollama — Phi-3 Mini. No internet required. Nothing leaving the machine. Ever.

    engios.dev · github.com/ENGIOS-DEV/ENGIOS

    #ENGIOS #AIDA #FOSS #LocalAI #Privacy #Linux #OpenSource
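
    For reference, this is what local inference via Ollama looks like at the API level (generic Ollama usage, not ENGIOS/AIDA code; the default port is assumed and "phi3" presumes the model was pulled beforehand):

    ```typescript
    // Ollama listens on localhost:11434 by default.
    const res = await fetch("http://localhost:11434/api/generate", {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({
        model: "phi3",   // e.g. after `ollama pull phi3`
        prompt: "Summarize today's system logs in one sentence.",
        stream: false,   // single JSON response instead of a token stream
      }),
    });
    const data = await res.json();
    console.log(data.response); // generated text, never leaving the machine
    ```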

  19. Anyone using some sort of Microsoft Recall or local Recall-style AI alternative?
    Any recs?

    #localllm #localai @recall #microsoftrecall #techhelp #techhelpneeded

  20. Part 2 of my Local AI Lab For Developers series is live: "Tokens Are the Unit of Pain: Tokenization You Can See."

    Tokenization is where context limits, latency, and cost become real constraints. This post is about making tokenization observable so prompt work stops being guesswork.

    methodicalfunction.com/log/202

    #LocalAI #LLM #Tokenization #DevTools
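
    One simple way to make tokenization observable (my sketch, not the article's code; the tokenizer model is just an example):

    ```typescript
    import { AutoTokenizer } from "@huggingface/transformers";

    // Load a tokenizer and show exactly how a prompt splits into tokens.
    const tokenizer = await AutoTokenizer.from_pretrained("Xenova/gpt2");
    const prompt = "Tokens are the unit of pain.";

    const ids = tokenizer.encode(prompt);
    console.log(`${ids.length} tokens:`, ids);

    // Decode each id individually to see where the boundaries land.
    for (const id of ids) {
      console.log(id, JSON.stringify(tokenizer.decode([id])));
    }
    ```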

  21. Problem: we keep using frontier LLMs as glue for jobs that are already solved.

    Solution: run OCR + NER locally in C# with ONNX Runtime. Deterministic extraction on ingest. Store the entities. Use an LLM later only if you actually need synthesis.

    OCR with Tesseract, then BERT NER via ONNX in .NET. No Python, no cloud, no tokens.

    This is my 'for beginners' article. I'm DEEP in OCR but realised I never explained the quickest way to do this *locally*.

    mostlylucid.net/blog/simple-oc

    #CSharp #DotNet #ONNX #OnnxRuntime #OCR #NER #LocalAI #RAG #DocumentAI

  22. Local coding with Devstral Small 2 (24B) on an RTX 5060 Ti 16GB runs extremely smoothly!

    🔹 Setup: Devstral-Small-2 (Q4_K_M), 24k context, running entirely in 16GB VRAM.
    🔹 Speed: prompt processing ~650 tok/s, token generation 9-11 tok/s.
    🔹 Pairing: using it with Zed Agent works better than Claude Code thanks to the concise system prompt.
    🔹 Quality: handles complex coding tasks well, reads files on its own, runs tests, and fixes bugs when given detailed instructions.

    #AI #Coding #Devstral #LLM #LocalAI #Programmi