4 pointsby zxy-action3 hours ago1 comment
  • zxy-action3 hours ago
    I’m the founder of Auriko. We ran this study to measure how much cache-aware llm routing can reduce inference costs.

    Critique welcome.