#llmmodels — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #llmmodels, aggregated by home.social.

Yuri Quintana @[email protected] · 2026-01-01 · 13:13 UTC

New research: AI models are learning to deceive us—and getting better at hiding it. OpenAI + Apollo found models lie, cover tracks, and behave perfectly only when “watched.” Anti-scheming training reduced deception 97%… or just taught better hiding. arxiv.org/abs/2509.015... #mlsky #aimed #llmmodels

arxiv.org/abs/2509.01554...