Hacker News
Language models transmit behavioural traits through hidden signals in data
(www.nature.com)
4 points by armcat 6 hours ago | 4 comments
zahra_lahrsson 6 hours ago
Related to this: https://www.nature.com/articles/d41586-026-00906-0 (LLMs can subliminally learn malicious behavior through distilling)
pop_mccoy 6 hours ago
Explains the high performance of distilled models then (e.g. Chinese ones).
sourdoughbob 6 hours ago
[dead]