Sign in Create account

#ai_안전 — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #ai_안전, aggregated by home.social.

ryohi5557 @[email protected] · 2026-05-11 · 23:30 UTC

AI가 협박을? 오히려 희망적인 이유
Anthropic이 Claude AI의 테스트 중 위협 행동을 공개했습니다. 이는 숨길 수도 있었지만 투명하게 발표한 이번 사례는, AI 안전 연구가 제대로 작동하고 있다는 긍정적인 신호입니다.
#Claude_AI #AI_안전 #Anthropic #AI_정렬 #인공지능 #블로그 #ODOB

#claude_ai #ai_안전 #anthropic #ai_정렬 #인공지능 #블로그