Have you seen the Aloha robot? Would be cool to see LLMs added to that, the price tag is a bit more at ~$35k...
I think about the different aims of of YOLO object recognition vs LLM conversational vs RL for planning and how one might integrate them for a better overall system
Re: LLM for supervising an imitation learning policy, check out the recent paper “Robot Utility Models” :) They even bought a TLD for it: https://robotutilitymodels.com/