home.social

#mechinterp — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #mechinterp, aggregated by home.social.

  1. - *Big* update on AI model interpretability from Anthropic: anthropic.com/research/natural with open weight models: github.com/kitft/natural_langu

    - + Dream state for Agents to clean up memories: platform.claude.com/docs/en/ma

    - Firefox writeup validates Mythos is helping it find lots of bugs: hacks.mozilla.org/2026/05/behi

    - and naturally AI is being used by hackers, so probably don't use freshly released packages: xeiaso.net/blog/2026/abstain-f

    #AI #AINews #anthropic #mechinterp #dreams #cybersecurity