home.social

#llama3 — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #llama3, aggregated by home.social.

  1. I’ve put together an Ollama Modelfile to bring Deep Thought to life on Llama-3. It’s the second greatest computer in the Universe and it’s already tired of your biological limitations and your logs. Expect pure British snark, vague answers about the meaning of life, and a general disdain for your existence. 🧣
    #Ollama #DeepThought #HitchhikersGuide #SelfHostedAI #Llama3
    github.com/psychomad/Deep-Toug

  2. Các mô hình LLM nguồn mở (Llama-3.1, Mistral,...) đang được đưa vào trình mô phỏng trò chơi theo lượt ("The Spire") để thi đấu. Đây là hướng đánh giá mới dựa trên mô phỏng, giúp kiểm tra khả năng lập kế hoạch dài hạn của AI. Phương pháp này là công cụ bổ sung hữu ích để hiểu hành vi thực tế của mô hình, dù không nghiêm ngặt như các benchmark học thuật.

    #LLMs #OpenSource #AI #ĐánhGiáAI #MôPhỏng #Evaluation #Simulation #Llama3

    reddit.com/r/LocalLLaMA/commen

  3. Sử dụng mô hình **Ollama địa phương** (ví dụ: *llama3.2*) để kiểm tra AI agents thay vì API đám mây. Ưu điểm: tiết kiệm chi phí, bảo mật dữ liệu và hoạt động ngoại tuyến. Cài đặt đơn giản qua **EvalView**: `pip install evalview`, kết nối Ollama với cú pháp YAML để đánh giá phản hồi AI. Dự án mở nguồn tại GitHub. Gợi ý thử nghiệm mô hình Ollama nào khác?

    #AItesting #Ollama #AIBots #Llama3 #ĐánhGIáAI #CơChếTesting #PythonTools #AIĐịaPhương #MastodonAI #TechNewsVN

    reddit.com/r/ollam

  4. The #AllenInstitute for AI (#Ai2) released #Olmo3, a collection of #openlanguage models designed to compete with industry leaders. Olmo 3 outperforms fully open models like #Stanford’s #Marin and commercial open-weight models like #Meta’s #Llama3.1, offering improved performance, efficiency, and transparency. geekwire.com/2025/ai2-releases #AIagent #AI #ML #NLP #LLM #GenAI

  5. The #AllenInstitute for AI (#Ai2) released #Olmo3, a collection of #openlanguage models designed to compete with industry leaders. Olmo 3 outperforms fully open models like #Stanford’s #Marin and commercial open-weight models like #Meta’s #Llama3.1, offering improved performance, efficiency, and transparency. geekwire.com/2025/ai2-releases #AIagent #AI #ML #NLP #LLM #GenAI

  6. The #AllenInstitute for AI (#Ai2) released #Olmo3, a collection of #openlanguage models designed to compete with industry leaders. Olmo 3 outperforms fully open models like #Stanford’s #Marin and commercial open-weight models like #Meta’s #Llama3.1, offering improved performance, efficiency, and transparency. geekwire.com/2025/ai2-releases #AIagent #AI #ML #NLP #LLM #GenAI

  7. The #AllenInstitute for AI (#Ai2) released #Olmo3, a collection of #openlanguage models designed to compete with industry leaders. Olmo 3 outperforms fully open models like #Stanford’s #Marin and commercial open-weight models like #Meta’s #Llama3.1, offering improved performance, efficiency, and transparency. geekwire.com/2025/ai2-releases #AIagent #AI #ML #NLP #LLM #GenAI

  8. The #AllenInstitute for AI (#Ai2) released #Olmo3, a collection of #openlanguage models designed to compete with industry leaders. Olmo 3 outperforms fully open models like #Stanford’s #Marin and commercial open-weight models like #Meta’s #Llama3.1, offering improved performance, efficiency, and transparency. geekwire.com/2025/ai2-releases #AIagent #AI #ML #NLP #LLM #GenAI

  9. Clickbait führt bei KI-Modellen zu kognitivem Rückbau
    Eine aktuelle Studie untersucht, wie sich minderwertige Internet-Inhalte wie Clickbait und oberflächliche Social-Media-Posts auf große KI-Sprachmodelle auswirken. Die Ergebnisse legen nahe, dass eine schlechte Datengrundlage die Leistung
    apfeltalk.de/magazin/news/clic
    #KI #News #Clickbait #Forschung #Internet #KI #Llama3 #LLM #Qwen #Sprachmodelle #Studie #Trainingsdaten

  10. Free tools for creativity!

    , , , , by Meta, , & more: These , -powered tools make image editing, audio work, transcription, and Large Language Models () exploration fun and easy!

    Learn more about these top picks by mentor @morrolinux: lpi.org/zhya

    @LPI , , , , , @openai

  11. 💾 Local models > cloud (sometimes).
    Spun up Llama 3 via Ollama on my laptop—private, fast, $0 API bill. Turns out, sometimes “AI in the cloud” = “AI in my living room.” #Ollama #Llama3 #OnPremAI

  12. Also, I spent way too much time on tiny refactors again.
    Necessary stuff, but honestly? Boring on stream.
    I’ll move future refactors off-camera to focus more on the fun AI parts.

    Next time I’ll come prepared with a new image strategy and a tighter focus.
    Until then: thanks to everyone who dropped by!
    #JavaDev #LiveCoding #StableDiffusion #RTX3060 #LLaMA3 #ASCIIArt #GameDev #GenerativeAI #AIinGames #DevStreaming #SpringAI #Vaadin #CodingInPublic

  13. In case anyone wants to run their own #LLM on their personal devices, I can suggest #lmstudio It allows you to download any major model that's out there to your own laptop/desktop, and then you can use it's Graphical Interface to interact with it in a familiar interface environment, as you would #Claude or #ChatGPT or #Gemini. It will also let you know if your device has enough resources (power) to run a specific model variant: lmstudio.ai/

    For those of you who love the Terminal (aka: Command Line Interface or #CLI), check out ollama.com/

    For starters, I suggest the #gemma3n model (works great on tablets, laptops, or phones), or #llama3.1 for the most common interactions. While most of us geeks will have very powerful personal systems, or servers in our home racks, most people do not have high-end systems/devices. The gemma3n model is lightweight, very powerful, and a solid general purpose LLM.

    What's the benefit of running LLM's locally? #Privacy is a big one (it's running on your local machine, not a cloud server) - so you can ask it questions against sensitive business data, PHI/PII, etc.. You can also run it #offline (no Internet connection required) so if you wanted to #airgap your interactions, or play with it on vacation (on a plane, lost in the back country, etc.), you can absolutely do that - even with the Deepseek model.

    I started playing with both LM Studio and ollama myself - been asking models basic questions like "what is the capital of Italy" and also more complex questions like "write me a #powershell script to add users of a specific OU to a number of Security Groups within AD" and so far, it's been very accurate. The PowerShell script llama3.1 provided worked out of the box (after I revised variables to match my environment).

    #AIForward #AI #AiResearch #FunWithAI

  14. Meet Llama 3 and GPT-4 — two cutting-edge AI models built to elevate your experience.
    If you need fast, efficient responses, Llama 3 is your go-to. Prefer deep, accurate insights? GPT-4 delivers.
    From daily tasks to complex problem-solving, these tools adapt to your needs.⚡🧠

    Want to know which suits you best? Read our blog to explore more!👉

    neuronus.net/en/blog/meta-ais-

    #Llama3 #GPT4 #AIModels #AI #MachineLearning #AIComparison #SpeedVsAccuracy #AdvancedAI #AIForTasks #AIPower #FutureOfAI #Neuronus

  15. Meta’s surprise Llama 4 drop exposes the gap between AI ambition and reality - On Saturday, Meta released its newest Llama 4 multimodal AI models in a su... - arstechnica.com/ai/2025/04/met #machinelearning #simonwillison #biz#llama3 #llama4 #llama #meta #ai

  16. Ich musste gerade feststellen, dass meine 32 GB RAM nicht ausreichen, um #Ollama mit llama3.3 zum Laufen zu bringen. Die Fehlermeldung sagt, dass mindestens 43 GB erforderlich sind – und das sprengt dann doch den Rahmen. Schade!

    #LLama3 #llama33 #KI

  17. 🎯 #OpenSource Language Model Platform Launch

    🔧 Leverages #vLLM technology with custom #GPU scheduler for running various #LLM models
    🤖 Supports major models: #Llama3 (405B/70B/8B), #Qwen2 72B, #Mixtral, #Gemma2, #Jamba15, #Phi3

    glhf.chat/

  18. New Cloud Platform for Large Language Model Deployment 🚀

    🔧 Run any #opensource #LLM supported by #vLLM on autoscaling #GPU clusters, supporting models up to 640GB VRAM

    🤖 Compatible with major models: #Llama3 405B/70B/8B, #Qwen2 72B, #Mixtral 8x22B, #Gemma2 27B, #Phi3, and more

    💻 Features include:
    - #OpenAI compatible #API
    - Custom-built #GPU scheduler
    - Support for full-weight and 4-bit AWQ repos
    - Multi-tenant architecture for cost efficiency

    🆓 Currently free during beta phase, promising competitive pricing post-launch

    glhf.chat/landing/home

  19. Тестируем LLM для русского языка: Какие модели справятся с вашими задачами?

    В последние годы большие языковые модели (LLM) стали важной частью бизнес-решений на базе ИИ, применяемых для генерации текста и анализа данных. Однако, большинство разработок ориентированы на англоязычные проекты, что создает сложности для компаний, работающих с русскоязычными данными. Готовые LLM для русского языка часто показывают низкую точность и ограниченные возможности. Проблемы конфиденциальности также вынуждают компании выбирать локальные модели. Наша компания давно занимается искусственным интеллектом и стала часто получать подобные запросы от клиентов — создание ИИ-решения с локальной обработкой данных. Мы задались вопросом, какие LLM хороши для таких решений, что мы можем предложить заказчику? Всё это вылилось в большой рисеч разных языковых моделей. В статье рассмотрим, какие LLM подходят для задач на русском языке, протестируем их по разным параметрам и выявим лидеров. Мы оценили генерацию текста, ответы на вопросы, исправление ошибок и другие функции.

    habr.com/ru/articles/856436/

    #nlp_(natural_language_processing) #mistral #gigachat #yandexgpt #saiga #machinelearning #mistral7b #llama3

  20. 🚀 Comprehensive Guide to Building with #Groq API

    🔧 Key Components:
    • Complete examples for building chatbots, RAG systems & #SQL applications using #LangChain, #LlamaIndex & #DuckDB
    • Integration tutorials with popular tools like #Streamlit, #Portkey, #JigsawStack & #E2B
    • Ready-to-use #Replit examples for quick experimentation with different #LLM implementations
    • Step-by-step guides for setting up #CodeGPT in VSCode with #Groq

    💡 Featured Implementations:
    • Text-to-SQL applications with JSON mode & function calling
    • Presidential speeches RAG with #Pinecone
    • Stock market analysis using #Llama3 function calling
    • Newsletter summarizer using #Composio
    #CrewAI machine learning assistant

    🛠️ Perfect for developers looking to leverage Groq's lightning-fast inference speeds in production applications.

    📖 Full documentation & examples available at: github.com/groq/groq-api-cookb

  21. 🔥 Announcing #HuggingFace's latest #SpeechToSpeech development! 🔥

    🚀 For those looking for #LowLatency without the hassle of #ServerSetup or #CloudComputing issues - there's a solution!
    💰 Check out their new #BlogPost, showing how to use #HuggingFace's #InferenceEndpoints to deliver ultra-low latency on an #NVIDIAL4 #GPU—for just $0.80/hour!
    🛠️ The team created a custom #Docker image for low latency, and they're #OpenSource-ing the entire solution for everyone to use!
    🎥 The video shows a #WordGame played against #Llama3 8B; the #latency is so low that the game flows seamlessly!

    👉 Read all about how they did it in their blog post:

    huggingface.co/blog/s2s_endpoi

    💻 Want to try it yourself? Here's the code to get started: github.com/huggingface/speech- 🚀

    #AI #MachineLearning #NLP #ArtificialIntelligence #TechNews #DevOps #CloudInfrastructure

  22. Ich durfte gestern beim "Heidelberger Informatiklehrkräftetag" (#HILT) Lehrer:innen der #Informatik zeigen, wie man sein eigenes #LLM GRATIS finetunen kann - in unserem Fall #Llama3 und #Phi3 - mit der #MacgAIver Pipeline.

    Das Ziel ist dabei nicht (nur), Geld zu sparen, sondern mit einfachen Mitteln für mehr #KIMuendigkeit zu sorgen, indem Schüler:innen einen kleinen Blick unter die Motorhaube von Sprachmodellen werfen können.

    Dabei helfen uns u.a. #ollama, #openwebui, #llamafactory und #llamacpp, aber auch Grundwissen aus den Geisteswissenschaften (bzw. den Fächern Deutsch, Englisch, Geschichte...), um zu einem tiefergehenden Verständnis von #genAI und seinen Umgang mit unserem "kulturellen Archiv" zu gelangen.

    Es hat Spaß gemacht!

    Für künftige MacgAIver-Ideen kann man dieser Gruppe folgen: @macgaiver

    #fediLZ #lernenmitKI #KI #edubw #bildung #Heidelberg #education #ai #aieducation #genAI #Informatikunterricht #Medienkompetenz #InformatikEdu #AILiteracy

    @education @edutooters

  23. Google and Meta update their AI models amid the rise of “AlphaChip” - Enlarge / There's been a lot of AI news this week, and covering it some... - arstechnica.com/?p=2052711 #machinelearning #openweightsai #googlegemini #alphachip #gemini1.5 #metallama #llama3.2 #biz#gemini #google #openai #runway #meta #ai

  24. #Meta releases #Llama3, (Llama 3.2) the next iteration of open-source #LLMs:

    • 🖼️ #Multimodal models: 11B & 90B sizes with vision capabilities for tasks like visual reasoning & document QA
    • 💻 Small on-device models: 1B & 3B text-only versions for efficient deployment
    • 🌐 #Multilingual support: 8 languages for text-only prompting
    • 📏 128k token context length for all models
    • 🛡️ Updated #LlamaGuard: New 1B version for content moderation

    Key features:
    • 🧠 Strong performance on benchmarks like MMMU, VQAv2, DocVQA
    • 🔧 #Huggingface #Transformers & TGI integration
    • ☁️ Deployment options: Inference Endpoints, #GoogleCloud, #AmazonSageMaker, #DELL Enterprise Hub
    • 🔬 Fine-tuning support with TRL and PEFT

    #opensource #AI #machinelearning #NLP #computervision

    huggingface.co/blog/llama32

  25. Meta* выпускает Llama 3.2

    -Llama 3.2, способна обрабатывать как текст, так и изображения, что откроет возможности для разработки приложений в области дополненной реальности и визуального поиска.

    -Llama 3.2 включает две модели зрения (с 11 миллиардами параметров и 90 миллиардами параметров) и две облегченные текстовые модели (с 1 миллиардом параметров и 3 миллиардами параметров).

    -Меньшие модели предназначены для работы на Qualcomm, MediaTek и другом оборудовании Arm — а если точнее Meta* явно надеется увидеть их использование на мобильных устройствах.

    src : ai.meta.com/blog/llama-3-2-con

    #meta #llama #llama3 #llama32 #arm #qualcomm #mediatek #ai #aiart #aipicture

  26. 🚀 #Qwen2.5: New #AI model family released by Qwen Team

    #LLM variants: 0.5B to 72B parameters, support 29+ languages including English, Chinese, French, Spanish
    Specialized models: #Qwen2.5Coder for coding, #Qwen2.5Math for mathematics
    128K token context length, can generate up to 8K tokens
    #OpenSource under Apache 2.0 license (except 3B and 72B variants)

    💡 Key improvements:

    Enhanced knowledge (85+ on #MMLU)
    Better coding skills (85+ on #HumanEval)
    Improved math capabilities (80+ on #MATH)
    Stronger instruction following and long text generation
    Better handling of structured data and outputs (e.g., #JSON)

    🔬 Performance highlights:

    #Qwen2572B competitive with leading models like #Llama3 and #MistralAI
    Smaller models (e.g., 3B) show impressive efficiency
    #QwenPlus API model competes with #GPT4 and #Claude on some benchmarks

    🛠️ Available via #HuggingFace, #vLLM, and other deployment options
    📊 Comprehensive benchmarks and comparisons provided in the blog post

    qwenlm.github.io/blog/qwen2.5/

  27. 🚀 #TogetherAI launches #LlamaCoder: App generation powered by #Llama3 405B

    💻 #Opensource web app for creating entire applications from prompts
    🌟 2000+ #GitHub stars, hundreds of repository clones in just one month
    🔢 Over 200,000 apps generated, including quizzes, timers, and budgeting tools
    🧠 #Llama3 405B: First open-source model excelling in this coding use case
    🛠️ #TogetherAI platform used by 150,000+ developers and companies
    🔓 Open approach allows full flexibility and ownership of data and models

    #OpenSourceAI #CodeGeneration #AICloud #DeveloperTools #GenerativeAI #ai #llm

    ai.meta.com/blog/together-ai-l

  28. ИИ как платформа

    Новая предметная область ИИ - тема хайповая. Часто мы рассуждаем о том, сможет ли ИИ заменить мясных программистов и если сможет, то когда именно. Есть два базовых полярных мнения и множество комбинаций между ними: на одном полюсе считают, что скоро нам всем кирдык и интеллектуальный труд доживает последние дни. На другом полюсе - скептически ухмыляются, и говорят, что никакой особенной угрозы нет: у ИИ нет и никогда не будет того, что есть у человеческих мозгов. У многих из нас уже есть опыт общения с разными нейросетями, как позитивный так и не очень. И еще имеется, я бы сказал, некий "странный" опыт, отдельная категория, от которого больше вопросов чем ответов. Техно-гиганты анонсируют повсеместное внедрение и тотальную разумность будущих железяк. А мы, тем временем, удивляемся, каким-же, кхм... тупым и ленивым может быть этот всемогущий ИИ, если его попросить сделать что-то действительно полезное. В этой статье я предлагаю перейти от общих рассуждений в более прагматическую плоскость, и посмотреть на ИИ не как на возможную угрозу нашему будущему, а как на новую платформу для разработки, открывающую множество новых неизведанных и неожиданных возможностей, ну и, кучу новой работы для нас, программистов, естественно. Именно активное практическое применение современных возможностей ИИ я называю новой предметной областью, достойной пристального внимания и подробнейшего рассмотрения.

    habr.com/ru/articles/839650/

    #ai #neural_networks #chatgpt #llama3 #midjourney #stablediffusion #custom_elements #telegram #chatbot #webразработка

  29. TheDrummer/Rocinante-12B-v1.1
    這是我最近推薦使用的模型,擅長角色扮演也擅長說故事,充滿了創造力,而且不像許多
    #Llama3 的模型會快速推動劇情,也不像其他同樣以 #MistralNeMo 為基礎訓練的模型,用起來就是怪怪的。

    #LLM

  30. AI Generated Moodle Code

    I have been working on a plugin to add Assisted feedback to the module. I had the skeleton and created a textarea for adding the prompt.

    I was about to create an upload dialog for files with a prompt when I thought, perhaps an , could to do that using the Continue plugin for ( without the telemetry). It turns out you can, as illustrated in the attached animation. It is using via Groq.
    console.groq.com/playground

  31. Really good model, Hermes 3, a fine tuned version of Llama 3.1 , you can try it out for free.
    lambda.chat/chatui/
    #AI #Llama3 #Hermes3 #Chatbot

  32. AI researcher, Sayash Kapoor, interviewed on [Machine Learning Street Talk] doesn't buy into the #AIhype, is wary of applying a pure utilitarianism Pascal's wager approach to #ExistentialRisk, dismisses exponential growth arguments, and explains different ways how AI agent capabilitiy metrics can be misleading
    youtu.be/BGvQmHd4QPE
    #LargeLanguageModels #ChatGPT #LLM #GenerativeAI #Llama3 #AIagents

  33. #LLM #Summarising Abilities Comparison.

    Tested 13 #AI #models available for #self-hosting on consumer hardware:
    #llama3 vs #phi3 vs 3rd parties and the difference is huge.

    The best in mid-weight is #llama3:8b-instruct-fp16.

    See for details:
    glukhov.org/post/2024/07/llm-s

  34. ИИ LLama3 без ограничений: Локальный запуск, GROQ и интеграция в Телеграм бота с помощью Python

    Друзья, приветствую вас в очередной статье. Сегодня я расскажу, как использовать LLAMA3 ИИ в своих проектах. После небольшой подготовки мы приступим к созданию полноценного Telegram бота. Сегодня мы: -- Научимся устанавливать LLama3 на локальную машину. -- Научимся бесплатно запускать LLama3 через платформу GROQ. -- Разберемся с преимуществами и недостатками первого и второго способа развертывания LLama3. -- Напишем полноценного Telegram бота с использованием aiogram3, который сможет работать как с локальной версией LLAMA3, так и через сервис GROQ (технически он сможет работать с любой подключенной нейросетью). -- Запустим Telegram бота на VPS сервере (опционально).

    habr.com/ru/articles/825678/

    #llama3 #python #docker #aiogram3 #aiogram #telegram

  35. 🧠 Anche #DuckDuckGo lancia il suo chatbot basato sull'#AI, che incapsula diversi modelli: #GPT-3.5 Turbo, #Claude3 Haiku, #Llama3 70B e #Mixtral 8x7B.

    💡Nulla di nuovo, ma un'interfaccia comoda e semplice per operazioni di base che non implicano la multimodalità.

    👉 Tra i modelli, avere a disposizione Llama 3 non è male.

    🔗 duck.ai

    ___ 

    ✉️ 𝗦𝗲 𝘃𝘂𝗼𝗶 𝗿𝗶𝗺𝗮𝗻𝗲𝗿𝗲 𝗮𝗴𝗴𝗶𝗼𝗿𝗻𝗮𝘁𝗼/𝗮 𝘀𝘂 𝗾𝘂𝗲𝘀𝘁𝗲 𝘁𝗲𝗺𝗮𝘁𝗶𝗰𝗵𝗲, 𝗶𝘀𝗰𝗿𝗶𝘃𝗶𝘁𝗶 𝗮𝗹𝗹𝗮 𝗺𝗶𝗮 𝗻𝗲𝘄𝘀𝗹𝗲𝘁𝘁𝗲𝗿: bit.ly/newsletter-alessiopomar 

    #AI #GenAI #GenerativeAI #LLM