home.social

#language-modeling — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #language-modeling, aggregated by home.social.

fetched live
  1. Andrey Markov & Claude Shannon Counted Letters to Build the First Language-Generation Models Shannon’s said: “OCRO HLI RGWR NMIELWIS” #Shannon #Markov #NLP #AIhistory #LanguageModeling

    Andrey Markov & Claude Shannon...

  2. 🎯 #OuteTTS introduces a novel approach to text-to-speech synthesis using pure #languagemodeling
    🔧 Built on #LLaMa architecture with just 350M parameters, featuring:

    Zero-shot #voicecloning capability
    Integration with #WavTokenizer (75 tokens/sec)
    Local deployment via #llamacpp
    #GGUF format compatibility

    🔍 Technical Implementation:

    Audio tokenization process
    CTC forced alignment
    Structured prompt system
    Temperature-adjustable outputs

    ⚠️ Current Limitations:

    Limited vocabulary range
    String-only input support
    Best performance with shorter sentences
    Variable temperature sensitivity

    github.com/edwko/OuteTTS
    huggingface.co/OuteAI/OuteTTS-

  3. New #languagemodeling #nlp #ai #paper, led by Angelica Chen! We break the steepest MLM training loss drop into *2* phase changes: first in internal grammatical structure, then external capabilities. Big implications for emergence, simplicity bias, and interpretability! arxiv.org/abs/2309.07311

  4. #LanguageModeling is trending, to a large extent because of #ChatGPT. But did you know language modeling has been with us for more than a century? And that it was born of the collaboration of a poet and a mathematician?

    Our engineer Carsten Schnober tells us more:
    blog.esciencecenter.nl/languag