Every model with ~4B parameters runs perfectly fine on even a GeForce 1070 Mobile GPU with 8GB of memory.
If you have some patience you can probably go a little crazy and run a model with ~27B parameters on a Radeon 890M with 32GB of memory as well (which means you'll probably have to get about 96GB of system memory if you want to get some work done too, but oh well).
In theory you could even run a model which fits in 64GB of video memory on that "little" GPU (with 128GB of system memory).
No, you can't run something like Grok 2 (whose quantized models start at around 82GB and go up from there), but why on earth would you ever want to run something like that locally?
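For reference, the back-of-the-envelope math behind those numbers. This little sketch assumes 4-bit quantization and roughly 20% overhead for KV cache and runtime; actual usage depends on the quant and context length:

    # Rough VRAM estimate: parameters x bytes per weight, plus ~20% overhead.
    # Assumed numbers, just to show the back-of-the-envelope math.
    def approx_vram_gb(params_billion, bits_per_weight=4, overhead=1.2):
        bytes_total = params_billion * 1e9 * (bits_per_weight / 8)
        return bytes_total * overhead / 1e9

    print(approx_vram_gb(4))    # ~4B model at 4-bit  -> ~2.4 GB, fits in 8GB easily
    print(approx_vram_gb(27))   # ~27B model at 4-bit -> ~16.2 GB, fits in 32GB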
One of the things I am currently experimenting with is building out my own agentic/assisted computing environment which, instead of extending into Google/Microsoft/Apple-owned cloud-based services, extends into services running in my homelab.
As a simple example: a local model that hooks into an MCP service which understands calendars and appointments and talks to my own locally hosted Radicale CalDAV service, enabling me to quickly make an appointment through text (or possibly even STT later). I'm curious how far I can get in making something like Thunderbird disappear.
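A minimal sketch of what such a bridge could look like, using the Python MCP SDK (the "mcp" package) and the caldav library; the URL, credentials and the "first calendar" lookup are placeholders, not a finished implementation:

    # Sketch: expose "create_appointment" as an MCP tool backed by a Radicale CalDAV server.
    # Assumes the Python MCP SDK ("mcp") and the "caldav" library; URL/credentials are placeholders.
    from datetime import datetime, timedelta

    import caldav
    from mcp.server.fastmcp import FastMCP

    mcp = FastMCP("calendar")

    RADICALE_URL = "https://radicale.example.lan/user/"  # placeholder
    USERNAME = "user"
    PASSWORD = "secret"

    @mcp.tool()
    def create_appointment(summary: str, start_iso: str, duration_minutes: int = 60) -> str:
        """Create a calendar event on the local Radicale CalDAV server."""
        start = datetime.fromisoformat(start_iso)
        end = start + timedelta(minutes=duration_minutes)
        client = caldav.DAVClient(url=RADICALE_URL, username=USERNAME, password=PASSWORD)
        calendar = client.principal().calendars()[0]  # first calendar, for simplicity
        calendar.save_event(dtstart=start, dtend=end, summary=summary)
        return f"Created '{summary}' at {start.isoformat()}"

    if __name__ == "__main__":
        mcp.run()  # stdio transport by default

The model never talks CalDAV itself; it only sees a "create_appointment" tool with a summary, a start time and a duration, which keeps the prompt-facing surface small.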
A somewhat more advanced example: an idea that popped up recently, which I'm quite excited about and hope will work out, is teaching a model the concepts of a "package repository", a "package manager" and "systems", which (hopefully) means I can install, uninstall, update and track the status of software packages on my Linux systems without using the terminal or shelling into a system myself.
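A rough sketch of what that could look like, again as MCP tools, shelling out to apt/dpkg on the host the server runs on; the Debian-ish tooling and the privilege handling (no sudo here) are assumptions:

    # Sketch: let a model query and install packages through MCP tools instead of a terminal.
    # Wraps plain dpkg/apt-get calls via subprocess; assumes a Debian-ish system and that
    # the MCP server itself runs with the needed privileges.
    import subprocess

    from mcp.server.fastmcp import FastMCP

    mcp = FastMCP("packages")

    @mcp.tool()
    def package_status(name: str) -> str:
        """Return the installed version of a package, or 'not installed'."""
        result = subprocess.run(
            ["dpkg-query", "-W", "-f=${Version}", name],
            capture_output=True, text=True,
        )
        return result.stdout if result.returncode == 0 else "not installed"

    @mcp.tool()
    def install_package(name: str) -> str:
        """Install a package non-interactively with apt-get."""
        result = subprocess.run(
            ["apt-get", "install", "-y", name],
            capture_output=True, text=True,
        )
        return result.stdout[-2000:] if result.returncode == 0 else result.stderr[-2000:]

    if __name__ == "__main__":
        mcp.run()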
Summarized: I think some of the things Big Tech wants to build are pretty neat, but I would like something like that without the heavy involvement of Big Tech (and/or subscription-based computing).
What I can see myself trying is some new ways of working with bodies of text notes. Local RAG for chatting with documents is also interesting.
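The retrieval half of that can stay entirely local. A bare-bones sketch, assuming sentence-transformers with a small CPU-friendly embedding model; the note chunks and the downstream "ask the local LLM" step are placeholders:

    # Bare-bones local RAG retrieval: embed note chunks, rank them by cosine similarity.
    # Assumes sentence-transformers; the chunks below are placeholders.
    import numpy as np
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer("all-MiniLM-L6-v2")  # small, runs fine on CPU

    chunks = [
        "Radicale is a small CalDAV/CardDAV server.",
        "The homelab backup job runs every night at 02:00.",
    ]  # placeholder note chunks

    chunk_vecs = model.encode(chunks, normalize_embeddings=True)

    def retrieve(question: str, k: int = 2) -> list[str]:
        """Return the k chunks most similar to the question."""
        q = model.encode([question], normalize_embeddings=True)[0]
        scores = chunk_vecs @ q  # cosine similarity, since vectors are normalized
        return [chunks[i] for i in np.argsort(scores)[::-1][:k]]

    print(retrieve("When do backups run?"))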
And yes, with 'subscription-based computing' whatever shreds of privacy we had are gone.
One reason is that I tend to make significant use of pre-compiled libraries, so my build times tend to be reasonable.
And I also like the feedback from testing on a lower-powered machine. If it runs well on a low-end machine, better hardware is generally not a problem.
The reverse is often not the case. Software blunders can be completely masked with enough hardware.
But since it requires less than 16GB, the author is still right.