2 pointsby IngessLabs2 hours ago2 comments
  • IngessLabs2 hours ago
    In the current AI climate, a lot of money and attention goes into bigger models. This is about the less glamorous layer underneath: foundational serving technology that can still be made faster, cheaper, and more predictable with better scheduling, routing, memory layout, and deployment discipline.
  • minjikim89an hour ago
    [flagged]