Hacker News

RIS-Kernel: Running 64k context LLMs on CPU via sparse attention (github.com)

2 points by santosardr 7 hours ago | 1 comment

santosardr 7 hours ago
[flagged]
