home.social

#rltraining — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #rltraining, aggregated by home.social.

  1. #IlyaSutskever discusses the challenges of #AI #modelgeneralisation, comparing it to #humanlearning. He suggests that the current focus on #RLtraining, driven by evaluation metrics, might be limiting model adaptability. Sutskever proposes that expanding training environments or improving generalisation from pre-training data could enhance model performance across diverse tasks. dwarkesh.com/p/ilya-sutskever- #tech #media #news