Sign in Create account

#mechinterp — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #mechinterp, aggregated by home.social.

Tim @[email protected] · 2026-05-08 · 08:31 UTC

- *Big* update on AI model interpretability from Anthropic: https://www.anthropic.com/research/natural-language-autoencoders with open weight models: https://github.com/kitft/natural_language_autoencoders
- + Dream state for Agents to clean up memories: https://platform.claude.com/docs/en/managed-agents/dreams
- Firefox writeup validates Mythos is helping it find lots of bugs: https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/
- and naturally AI is being used by hackers, so probably don't use freshly released packages: https://xeiaso.net/blog/2026/abstain-from-install/
#AI #AINews #anthropic #mechinterp #dreams #cybersecurity

#ai #ainews #anthropic #mechinterp #dreams #cybersecurity