Hacker News
new
top
best
ask
show
job
PFlash: 10x prefill speedup over llama.cpp at 128K on a RTX 3090
(
github.com
)
2 points
by
GreenGames
8 hours ago
1 comment
GreenGames
8 hours ago
[dead]