1 pointby jruohonen11 days ago2 comments
  • jruohonen11 days ago
    "Diagnostic categorization reveals three distinct failure modes: pure hallucinations (5.1%), hallucinated identifiers with valid titles (16.4%), and parsing-induced matching failures (78.5%)."
  • 11 days ago
    undefined