Hacker News
new
top
best
ask
show
job
Show HN: A GPU/VRAM filter for finding LLMs that will run on your hardware
(
www.whichllmmodel.com
)
2 points
by
mzubairtahir
5 hours ago
3 comments
xlr8_track
2 hours ago
Awesome, how do I contribute to this? A gihub link or smthg?
necovek
5 hours ago
Very broken: "live minimums" do not allow me to remove 512 token limit and put a bigger number easily.
No unified or shared memory scenarios (like Apple's M platform or AMD's integrated GPU platform).
johng
4 hours ago
Was going to mention this. I'm on an M1 Max and wanted to see what the site suggested.
CRSilkworth
4 hours ago
very nice idea. Would be nice if you could also keep desired context as a free parameter and let the models tell you what maximum context you could have.