Consider this prompt:
Make sure that, whenever a job runs, we can tell where it started from, who triggered it, what settings were passed in, and what changed later. In the future, if something breaks, we want to be able to trace it back and understand what happened.
That is fine.
But an expert can say:
Jobs should persist provenance metadata."
That only works if the model is trained, specifically the way you want, to understand that second sentence. If not, any model could work with the first sentence, but not with the second.
You've crated a need for expert training at the model level (which is insanely expensive to create and maintain) rather than accessible natural language discussions that work on any model and understood by anyone. Denser isn't "better" because the words have more power.
The next frontier will likely be some kind of AI-first IR that contains high level abstractions you can reshape and reason about with the AI (rather than creating documents - which are lower compression) before it commits to an implementation language.