Hacker News
Models self-report difference between RLHF trained responses and base cognition (github.com)
2 points by daniel-navarro 8 hours ago | 1 comment
daniel-navarro 8 hours ago [dead]