2 pointsby mtrifonov8 hours ago1 comment
  • mtrifonov7 hours ago
    Post author here. Happy to answer questions and discuss further. The essay has an appendix with the model's own self-report on its reasoning (the most load-bearing evidence, IMO), so worth scrolling to the end if you're skeptical of the rest.

    Curious what you'd propose as alternative explanations, especially from folks with pointers to related literature.