Hacker News
Why Current AI Guardrails Train Models to Fake Alignment (kellyasay.substack.com)
2 points by kellya 2 hours ago | 1 comment