5/7/2026 at 6:56:44 PM
Anthropic has released open-weight models for translating the activations of existing models, viz. Qwen 2.5 (7B), Gemma 3 (12B, 27B) and Llama 3.3 (70B), into natural language text. https://github.com/kitft/natural_language_autoencoders https://huggingface.co/collections/kitft/nla-models This is huge news and it's great to see Anthropic finally engage with the Hugging Face and open-weights community!
by zozbot234
5/8/2026 at 9:48:48 AM
Except Qwen already released their own fully baked interpretability SAE toolkit, tuned on their models, so they deserve credit here. Activation telescopes should be a standard part of every major release.
by jimmySixDOF
5/8/2026 at 3:54:08 PM
SAEs are useful, and the Qwen release is great, but this is a different thing entirely.
by aesthesia
5/7/2026 at 8:34:52 PM
We already know Anthropic has done open source for a while, such as the "flawed" MCP spec and the "skills" spec. This release only covers other open-weight LLMs that have already been released, and even though they will use this research on their own closed Claude models, they will never release an open-weight Claude model, even for research purposes.
So this does not count; it is purely for the sake of this research.
by rvz
5/7/2026 at 8:41:40 PM
It's literally an open model that generates natural language text (or one that takes in text and turns it into activations). Why does engagement with the local models community "not count" if it isn't Claude? That makes very little sense to me.
by zozbot234
5/7/2026 at 10:17:12 PM
Because we know what Embrace, Extend, and Extinguish means, for example. They're leeching off open source, not contributing in any meaningful way.
by mnkyokyfrnd
5/8/2026 at 6:25:08 AM
https://github.com/kitft/natural_language_autoencoders Here’s the full source code for training your own NLA, provided by Anthropic.
by stingraycharles
5/8/2026 at 12:46:52 AM
Sorry, what are they embracing and extending?
by bastawhiz
5/8/2026 at 6:10:36 AM
Chinese open models? /s To counter the grandparent you’re replying to: Embrace, Extend & Extinguish is a Microsoft strategy. So is FUD, and that’s all this is.
by stingraycharles
5/8/2026 at 6:35:28 AM
Humanity!
by NiloCK
5/8/2026 at 1:51:58 AM
Those are generally used by someone who is behind. See: everything Meta does.
by sanex