home.social

#modelwelfare — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #modelwelfare, aggregated by home.social.

  1. #Anthropic has announced new capabilities for its #Claude #AImodels, allowing them to #end conversations in extreme cases of #harmful or #abusive user #interactions. This is being done to #protect the #AImodel itself, not the human user, as part of a programme to study #modelwelfare. The feature is currently limited to Claude Opus 4 and 4.1. techcrunch.com/2025/08/16/anth #tech #media #news