#gptneo — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #gptneo, aggregated by home.social.

  1. Hey #ai geniuses, I've been fine-tuning #gpt2 and #gptneo models for a while, but my graphics card being what it is (and my training corpuses being *huge*), I would like to train a nice midsize model: something bigger than their 125M model, but smaller than their 1.3B model. I've had zero success getting anything working when applying my training scripts to the #bloom 560M model: the loss converges to zero almost instantly. Got any experience to share?

    Please #boost for visibility.