Hacker News
TurboPrefill: 2.7× faster than llama.cpp Pipeline Parallel on Llama-3-70B (github.com)
1 point by trykhlieb 2 hours ago | 1 comment