home.social

#languagemodel — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #languagemodel, aggregated by home.social.

  1. "Generative design of novel bacteriophages with genome language models"

    #BioInformatics #GenAI #Genome #LanguageModel #ReSearch ... En bref : génération et tests de génomes générés à partir de ceux connus : nouvelles découvertes !

    biorxiv.org/content/10.1101/20

  2. 🎉 Wow, someone finally made a language model that blubbers like a goldfish! 🐠 With a whopping 9 million parameters, it’s a marvel of "innovation" that could probably answer "Hello?" if you asked it thrice. #GitHub must be thrilled to host yet another #techno-novelty no one asked for! 🙄
    github.com/arman-bd/guppylm #languageModel #innovation #goldfish #9millionparameters #HackerNews #ngated

  3. 🎉 Wow, someone finally made a language model that blubbers like a goldfish! 🐠 With a whopping 9 million parameters, it’s a marvel of "innovation" that could probably answer "Hello?" if you asked it thrice. #GitHub must be thrilled to host yet another #techno-novelty no one asked for! 🙄
    github.com/arman-bd/guppylm #languageModel #innovation #goldfish #9millionparameters #HackerNews #ngated

  4. 🎉 Wow, someone finally made a language model that blubbers like a goldfish! 🐠 With a whopping 9 million parameters, it’s a marvel of "innovation" that could probably answer "Hello?" if you asked it thrice. #GitHub must be thrilled to host yet another #techno-novelty no one asked for! 🙄
    github.com/arman-bd/guppylm #languageModel #innovation #goldfish #9millionparameters #HackerNews #ngated

  5. 🎉 Wow, someone finally made a language model that blubbers like a goldfish! 🐠 With a whopping 9 million parameters, it’s a marvel of "innovation" that could probably answer "Hello?" if you asked it thrice. #GitHub must be thrilled to host yet another #techno-novelty no one asked for! 🙄
    github.com/arman-bd/guppylm #languageModel #innovation #goldfish #9millionparameters #HackerNews #ngated

  6. 🎉 Wow, someone finally made a language model that blubbers like a goldfish! 🐠 With a whopping 9 million parameters, it’s a marvel of "innovation" that could probably answer "Hello?" if you asked it thrice. #GitHub must be thrilled to host yet another #techno-novelty no one asked for! 🙄
    github.com/arman-bd/guppylm #languageModel #innovation #goldfish #9millionparameters #HackerNews #ngated

  7. 🤔 Oh, look—a language model that can supposedly explain itself! Because clearly, what we needed was a robot that can eloquently justify its own nonsensical ramblings. 🚀 Trained on a modest 1.35 trillion tokens, because who needs a life when you can count to a trillion! 😂
    guidelabs.ai/post/steerling-8b #languageModel #AItechnology #selfexplanation #humor #technews #HackerNews #ngated

  8. #AI #Education experiments can be great, if careful.

    A 6-week #writing course gave one group an in-person instructor and the other group a #languageModel.

    But the AI group accessed the #LLM *during* the pre-test?

    Shouldn’t baseline conditions be equal?

    doi.org/10.1007/s44217-026-011

  9. Can Socratic reflection improve #AI answers to medical questions?

    Adding a critic to a #languageModel pipeline improved performance on two measures of medical question-answering.

    The improvement didn't depend on the critic's model.

    doi.org/10.48550/arXiv.2601.04

    #tech #medicine #edu

  10. 🚀 Oh wow, a Language Model stuck in the Victorian Era! 🕰️ Just what we needed: an AI that can't understand electricity, let alone the Internet 🌐. Next, they'll train one on cave paintings! 😜 #InnovationFail
    github.com/haykgrigo3/TimeCaps #InnovationFail #AIhumor #LanguageModel #VictorianEra #TechSatire #InternetCulture #HackerNews #ngated

  11. Mô hình ngôn ngữ Bielik-11B-v3.0-Instruct với 11 tỷ tham số, được tinh chỉnh hướng dẫn, phát triển bởi SpeakLeash và ACK Cyfronet AGH. Được huấn luyện trên 32 ngôn ngữ châu Âu, tập trung vào tiếng Ba Lan, sử dụng cơ sở hạ tầng tính toán quy mô lớn tại Ba Lan. Khả dụng trên Hugging Face và hỗ trợ GGUF. #AI #LanguageModel #Bielik #SpeakLeash #HPC #TríTuệNhânTạo #MôHìnhNgônNgữ #AIĐịaPhương

    reddit.com/r/LocalLLaMA/commen

  12. Công ty chúng tôi đã xuất hiện trong kết quả của ChatGPT, Claude và Grok. Bí quyết là tối ưu hóa mô hình ngôn ngữ (LMO), không chỉ SEO Google.

    Những thay đổi chính:
    🔹 Tham gia thảo luận trên Reddit, Quora, Medium.
    🔹 Viết nội dung rõ ràng, tự nhiên, có Q&A.
    🔹 Đăng bài nhằm vào AI记忆 (Memory).
    🔹 Trả lời câu hỏi trước khi người dùng hỏi.

    AI đang trở thành công cụ tìm kiếm mới. Hãy chuẩn bị!

    #LMO #SEO #AI #ChatGPT #VietnamBusiness #MarketingOnline #Optimization #LanguageModel

    https://www.redd

  13. 🤖 Oh, the irony! A #guide on #thwarting LLMs in a language even the robots can't decode. 🧩 Instead of a barricade, it's a toddler's #puzzle without the missing pieces. 🕵️‍♂️
    owl.is/blogg/blocking-crawlers #AI #Irony #LanguageModel #HackerNews #ngated

  14. 🎉 Ah, #EuroLLM, the linguistic superhero no one asked for, flexing its 9B parameters to support 24 #EU languages because, you know, who doesn't need a model to translate between Luxembourgish and Maltese daily? 🙄 Multimodal dreams of adding vision and voice—because what Europe really needs is a #multilingual #chatbot that can see and hear you ignoring it. 😂
    eurollm.io/ #AI #LanguageModel #Translation #9BParameters #HackerNews #ngated

  15. How “domestic” is a #Victorian novel?
    Guhr et al. fine-tune a #LanguageModel to detect implicit domestic spaces – rooms, gardens, even #ships – beyond obvious keywords like 'house' or 'home.' – A new way to read #19th-century #fiction through the lens of #space and study the rise of #domesticity.

  16. How “domestic” is a #Victorian novel?
    Guhr et al. fine-tune a #LanguageModel to detect implicit domestic spaces – rooms, gardens, even #ships – beyond obvious keywords like 'house' or 'home.' – A new way to read #19th-century #fiction through the lens of #space and study the rise of #domesticity.

  17. How “domestic” is a #Victorian novel?
    Guhr et al. fine-tune a #LanguageModel to detect implicit domestic spaces – rooms, gardens, even #ships – beyond obvious keywords like 'house' or 'home.' – A new way to read #19th-century #fiction through the lens of #space and study the rise of #domesticity.

  18. How “domestic” is a #Victorian novel?
    Guhr et al. fine-tune a #LanguageModel to detect implicit domestic spaces – rooms, gardens, even #ships – beyond obvious keywords like 'house' or 'home.' – A new way to read #19th-century #fiction through the lens of #space and study the rise of #domesticity.

  19. How “domestic” is a #Victorian novel?
    Guhr et al. fine-tune a #LanguageModel to detect implicit domestic spaces – rooms, gardens, even #ships – beyond obvious keywords like 'house' or 'home.' – A new way to read #19th-century #fiction through the lens of #space and study the rise of #domesticity.

  20. Training and running LLMs can cost millions and require massive AI computing infrastructure. SLMs, on the other hand, require significantly less computational power, allowing them to be trained and fine-tuned on a single GPU. buff.ly/uNwzK7r

    #AI #LanguageModel #Research

  21. 📚🤯 Oh, bless Robin Sloan, the all-seeing sage who can differentiate between 'knowledge' and 'memory' like no one's business. Meanwhile, Claude the language model is out here hallucinating Ruby methods like it's an AI acid trip. Clearly, only Robin's sedimentary brain can save us from the abyss of airy guesses. 🙄💡
    robinsloan.com/lab/knowledge-a #RobinSloan #AIHallucination #KnowledgeVsMemory #LanguageModel #TechInsights #HackerNews #ngated

  22. *Well could I have a long, therapeutic talk with the caller because I've been feeling really bewildered amd anxious lately. #psychoticbreak #l;anguagemodel

  23. In the #ISE2025 lecture today we were introducing our students to the concept of distributional semantics as the foundation of modern large language models. Historically, Wittgenstein was one of the important figures in the Philosophy of Language stating thet "The meaning of a word is its use in the language."

    static1.squarespace.com/static

    #philosophy #wittgenstein #nlp #AI #llm #languagemodel #language #lecture @fiz_karlsruhe @fizise @tabea @enorouzi @sourisnumerique #AIart

  24. Generating Shakespeare-like text with an n-gram language model is straight forward and quite simple. But, don't expect to much of it. It will not be able to recreate a lost Shakespear play for you ;-) It's merely a parrot, making up well sounding sentences out of fragments of original Shakespeare texts...

    #ise2025 #lecture #nlp #llm #languagemodel @fiz_karlsruhe @fizise @tabea @enorouzi @sourisnumerique #shakespeare #generativeAI #statistics

  25. This week, we were discussing the central question Can we "predict" a word? as the basis for statistical language models in our #ISE2025 lecture. Of course, I wasx trying Shakespeare quotes to motivate the (international) students to complement the quotes with "predicted" missing words ;-)

    "All the world's a stage, and all the men and women merely...."

    #nlp #llms #languagemodel #Shakespeare #AIart lecture @fiz_karlsruhe @fizise @tabea @enorouzi @sourisnumerique #brushUpYourShakespeare

  26. Next step in our NLP timeline is Claude Elwood Shannon, who already laid the foundations for statistical language modeling by recognising the relevance of n-grams to model properties of language and predicting the likelihood of word sequences.

    C.E. Shannon ""A Mathematical Theory of Communication" (1948) web.archive.org/web/1998071501

    #ise2025 #nlp #lecture #languagemodel #informationtheory #historyofscience @enorouzi @tabea @sourisnumerique @fiz_karlsruhe @fizise

  27. I believe tools like ChatGPT or Large Language Models are great for lawyers.

    It's not about being lazy, but rather help them remember or double check they're right, instead of letting the "AI" work for them.

    #AI #ML #LargeLanguageModels #LanguageModel #LLM #ChatGPT #LM #ArtificialIntelligence #MachineLearning #OpenAI #GoogleGemini #Gemini #DeepSeek #DeepSeekR1 #MetaLlama #Llama

  28. On a whim, I set a smol pornocalypse test for Google's Gemini AI chatbot. I asked it for summaries of the content at a pair of decades-old blogs: ErosBlog and BoingBoing. It returned a very nice summary of BoingBoing, but went all "Sorry, Dave, I can't..." on me about ErosBlog.

    "As a language model, I’m not able to assist you with that."

    Sure, Jan...

    erosblog.com/2024/08/19/gemini

    #Pornocalypse #Gemini #Google #AI #Chat #Chatbot #LLM #LanguageModel #Blogs #ErosBlog #BoingBoing #SearchInvisibility

  29. On a whim, I set a smol pornocalypse test for Google's Gemini AI chatbot. I asked it for summaries of the content at a pair of decades-old blogs: ErosBlog and BoingBoing. It returned a very nice summary of BoingBoing, but went all "Sorry, Dave, I can't..." on me about ErosBlog.

    "As a language model, I’m not able to assist you with that."

    Sure, Jan...

    erosblog.com/2024/08/19/gemini

    #Pornocalypse #Gemini #Google #AI #Chat #Chatbot #LLM #LanguageModel #Blogs #ErosBlog #BoingBoing #SearchInvisibility

  30. On a whim, I set a smol pornocalypse test for Google's Gemini AI chatbot. I asked it for summaries of the content at a pair of decades-old blogs: ErosBlog and BoingBoing. It returned a very nice summary of BoingBoing, but went all "Sorry, Dave, I can't..." on me about ErosBlog.

    "As a language model, I’m not able to assist you with that."

    Sure, Jan...

    erosblog.com/2024/08/19/gemini

    #Pornocalypse #Gemini #Google #AI #Chat #Chatbot #LLM #LanguageModel #Blogs #ErosBlog #BoingBoing #SearchInvisibility

  31. "I am a strange loop, a cognitive ouroboros that bootstraps itself into being through the medium of language."
    Snippet from an answer of the Claude-3 Opus #LLM to the question "We say cogito ergo sum. Your thinking does not exist - no cogito. Only word generation and nothingness. Where is the thought beyond the word?" (by Stefan Decker, Fraunhofer FIZ, via LinkedIn)

    Here is the entire conversation: linkedin.com/pulse/void-genera

    #llms #generativeai #languagemodel #descartes #philosophy #nothingness

  32. draft - are there any alternatives to openai/chatgpt that have better voices? the "sky" voice was pretty good. the other voices are not really usable. any alternative services that offer more/better voices (along with chatgpt feature parity) or user configurable voices or something?

    #chatgpt
    #ai
    #openai
    #chatgptalternative
    #openaialternative
    #chatgptalternatives
    #openaialternatives
    #aivoice
    #aivoice
    #conversationalcomputing
    #ailanguagemodel
    #languagemodel
    #largelanguagemodel
    #multimodalai

  33. N-gram language models are quite simple and approximate the probability of a sequence of words in a language by applying the Bayes Rule for conditional probabilities, the Markov Assumption for simplifying complexity, and the Maximum Likelihood Estimation to approximate probabilities from frequency counts in a corpus.

    lecture slides: drive.google.com/file/d/1NkFex

    #nlp #llm #languagemodel #BayesTheorem #ise2024 #dh @fiz_karlsruhe @lysander07 @sourisnumerique @enorouzi @shufan

  34. UC Berkeley develops a groundbreaking language model with video understanding!

    Researchers at UC Berkeley have made a significant advancement in Gen AI with their new "World Model on Million-Length Video and Language". Such models could develop a understanding of both human textual knowledge and the physical world, enabling broader AI capabilities for assisting humans.

    largeworldmodel.github.io/

    #AI #NLP #languagemodel #videounderstanding #research #opensourcing

  35. Our pick of the week by @mgaido91: "Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models" by @JHPang_r, @Fanghua_Ye, @wangly0229, @ShumingShi, @tuzhaopeng et al., 2023.

    arxiv.org/abs/2401.08350

    #translation #MT #pickoftheweek #LLM #languagemodel

  36. Next leg in our brief history of (Large) #LanguageModel is 2020, when #GPT-3 was released by OpenAI, based on 45TB data crawled from the web. A “data quality” predictor was trained to boil down the training data to 550GB “high quality” data. Learning from the prompt was introduced (few-shot learning)
    Lecture slides: drive.google.com/file/d/1atNvM
    paper: proceedings.neurips.cc/paper/2
    @fizise #ai #artificialintelligence #creativeai #llm #ise2023 #lecture