Have you seen the Aloha robot? Would be cool to see LLMs added to that, though the price tag is a bit more at ~$35k...
I think about the different aims of YOLO object recognition vs. LLM conversation vs. RL for planning, and how one might integrate them into a better overall system.