Link. Heroic effort to explain a very technical paper. Recommended for those who make confident statements about the near future.
“Anthropic’s interpretability team announced that they successfully dissected of one of the simulated AIs in its abstract hyperdimensional space.”