• Hacker News
RIS-Kernel: Running 64k context LLMs on CPU via sparse attention (github.com)
2 points by santosardr 7 hours ago | 1 comment
  • santosardr 7 hours ago
    [flagged]