home.social

#enough2skim — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #enough2skim, aggregated by home.social.

  1. So many warn that evaluating with GPT favors GPT

    (or any LLM evaluating itself).

    Now it has also been shown:

    science, not just educated guesses

    (Fig: T5, GPT, and BART each prefer their own outputs) arxiv.org/abs/2311.09766

    #enough2skim #scientivism #NLP #nlproc #GPT #LLM #eval #data

  2. A new benchmark for data 📚
    Rather than testing whether a model is good,
    it tests whether you can filter data
    Covers 360 languages

    They also share metrics for data redundancy, if those are all you want
    arxiv.org/abs/2311.06440
    github.com/toizzy/
    #data #preprocessing #dedup #enough2skim #NLP #NLProc