https://news.ycombinator.com/item?id=47660925
Let's just say I have my doubts. AI promotion is always cherry picked results. You found a 23 year old linux bug with it? Cool. How many false positives did you go through to do that? You guys never say. You also never do live demos of your AI because you know it's going to hallucinate and make your company a laughing stock.
My guess is the new model has gotten even worse than the latest release and this is the cover story. All that DoD money evaporated and it hurt them badly, they just can't admit it.
The false positive rate might be too big for a live demo to work. A 50 (for example) hour live demo of someone working with the AI to find a bug might look bad even though finding a 23 year old security bug in 50 hours with a human in the loop would still be impressive.