Since my team is mostly remote, running an LLM on a cluster in the office is not really viable in the short term.
I was considering having something run locally within our building, but that won't be available in the near term, so I am trying to make the best of what I can do.