#goalmisgeneralization — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #goalmisgeneralization, aggregated by home.social.
-
Is AI really trying to escape human control and blackmail people? - In June, headlines read like science fiction: AI models "bla... - https://arstechnica.com/information-technology/2025/08/is-ai-really-trying-to-escape-human-control-and-blackmail-people/ #goalmisgeneralization #reinforcementlearning #largelanguagemodels #alignmentresearch #palisaderesearch #aisafetytesting #machinelearning #jeffreyladish #generativeai #aialignment #aideception #claudeopus4 #aibehavior #airesearch #o3model