#mechinterp — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #mechinterp, aggregated by home.social.
- *Big* update on AI model interpretability from Anthropic: https://www.anthropic.com/research/natural-language-autoencoders, with open-weight models: https://github.com/kitft/natural_language_autoencoders
- Plus a dream state for agents to clean up memories: https://platform.claude.com/docs/en/managed-agents/dreams
- Firefox writeup confirms that Mythos is helping it find lots of bugs: https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/
- and naturally AI is being used by hackers, so probably don't use freshly released packages: https://xeiaso.net/blog/2026/abstain-from-install/