2 points by fanyangxyz338 hours ago | 1 comment

fanyangxyz338 hours ago
I'm very curious how they trained the lightweight classifier that decides when to switch models. Is it supervised? Did they use an LLM as a teacher? The featurization also doesn't seem trivial: how do you build a simple but still meaningful representation of the task(s)?
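
To make the question concrete, here is a minimal sketch of the kind of setup I have in mind. It is purely an assumption, not their actual method: the features in `featurize`, the toy `EXAMPLES`, and the idea that the labels come from a teacher LLM judging whether the small model's answer was good enough are all hypothetical. The classifier is plain logistic regression trained with SGD.

```python
import math

# Hypothetical hand-built features; the real system's featurization is unknown.
REASONING_CUES = {"prove", "derive", "why"}

def featurize(prompt):
    words = prompt.split()
    return [
        1.0,                                   # bias term
        min(len(words) / 50.0, 1.0),           # normalized prompt length
        1.0 if "def " in prompt else 0.0,      # crude "contains code" flag
        1.0 if any(w.lower() in REASONING_CUES for w in words) else 0.0,
    ]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict_proba(weights, prompt):
    x = featurize(prompt)
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)))

def train(examples, epochs=500, lr=0.5):
    """Logistic regression via SGD. Label 1 means 'route to the large model'."""
    weights = [0.0] * 4
    for _ in range(epochs):
        for prompt, label in examples:
            x = featurize(prompt)
            p = sigmoid(sum(w * xi for w, xi in zip(weights, x)))
            # Gradient step on the log-likelihood.
            weights = [w + lr * (label - p) * xi for w, xi in zip(weights, x)]
    return weights

def route(weights, prompt):
    return "large" if predict_proba(weights, prompt) >= 0.5 else "small"

# Toy labels standing in for a teacher LLM's verdicts (assumed, not real data).
EXAMPLES = [
    ("What is the capital of France?", 0),
    ("Translate hello to Spanish", 0),
    ("Say hi", 0),
    ("Prove that the sum of two even numbers is even", 1),
    ("def f(x): return f(x)  # why does this recurse forever", 1),
    ("Derive the gradient of the softmax cross-entropy loss", 1),
]

weights = train(EXAMPLES)
print(route(weights, "Prove the triangle inequality"))
print(route(weights, "What time is it"))
```

Even in this toy version the hard part is visible: the classifier is trivial, and everything interesting lives in where the labels come from and what goes into `featurize`.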