2 pointsby ameddserM3 hours ago1 comment
  • ameddserM3 hours ago
    AI agents can write code. AI agents can browse the web. AI agents can pass the bar exam. Can they edit a video? We gave the 7 best frontier models 100 real-world post-production tasks, scored by 20 industry experts. Best agent: barely crosses 30% Human experts: 89% Introducing AgenticVBench.