ML eng obsessed with model quantization & efficient inference. Building with open-source LLMs on local hardware. x.com/ethimcvm