• Hacker News
The Bayesian Geometry of Transformer Attention (arxiv.org)
4 points by samwillis a month ago | 1 comment
  • samwillis a month ago
    Higher level overview and links to the other related papers: https://medium.com/@vishalmisra/attention-is-bayesian-infere...