Hacker News
DystopiaBench – Measuring AI's willingness to ruin humanity (dystopiabench.com)
1 point by mateianghel 5 hours ago | 1 comment
mateianghel 5 hours ago
I made a benchmark inspired by the DoW vs Anthropic saga. I'm currently documenting the methodology in more detail and also doing a per-prompt (no escalation) test run.
Let me know if you have suggestions / feedback.