1 pointby bbatsella month ago1 comment

bbatsella month ago
Anchor to new information (HN strips it from the URL): https://github.com/IQuestLab/IQuest-Coder-V1/issues/14#issue...
Context: Earlier this week a new model was released and researchers discovered that during training it had "cheated" on SWEBench by issuing git commands to find information it should have been blinded to.
Previous discussion: https://news.ycombinator.com/item?id=46472667