Hacker News
new
top
best
ask
show
job
Refrag: Rethinking RAG Based Decoding
(
arxiv.org
)
4 points
by
datadrivenangel
2 days ago
1 comment
datadrivenangel
2 days ago
Am I misunderstanding this or is basically just taking RAG results and doing a vector search on the results and only passing some to the context window?
Also, why do these AI papers never get speedup times in human time units?