Hacker News
new
top
best
ask
show
job
Show HN: AST-guard A gradient-immune structural guard against RL reward hacking
(
github.com
)
3 points
by
thinking-nick
3 hours ago
1 comment
thinking-nick
3 hours ago
[flagged]