Hacker News
new
top
best
ask
show
job
A Visual Guide to Attention Variants in Modern LLMs
(
magazine.sebastianraschka.com
)
10 points
by
Anon84
8 hours ago
1 comment
nv2156
2 hours ago
Great read about the technical evidence around the shift from better attention to better serving of models. Just came across a companion piece around this
https://news.ycombinator.com/item?id=47388676