Show HN: I built a firewall for agents because prompt engineering isn't security(github.com)

7 pointsby yaront11119 days ago4 comments

TeamCommet118 days ago
Regarding the separate control plane: I don't think it's overkill if you're aiming for multi-agent orchestration. A safety mesh needs to be centralized to maintain a global state of permissions. If you bake the safety logic into each worker, you end up with the same "flimsy logic" problem you're trying to solve.
Curious, how are you handling latency in the CAP v2 protocol when the control plane has to intercept every intent before execution?
amadeuswoo19 days ago
Interesting architecture. Im curious about the workflow when an agent hits a denied action, does it get a structured rejection it can reason about and try an alternative, or does it just fail? Wondering how the feedback loop works between safety kernel and the LLM's planning
- yaront11119 days ago
  Great question. This is actually a core design principle of the Cordum Agent Protocol (CAP).
  It’s definitely a *structured rejection*, not a silent fail. Since the LLM needs to "know" it was blocked to adjust its plan, the kernel returns a standard error payload (e.g., `PolicyViolationError`) with context.
  The flow looks like this: 1. *Agent:* Sends intent "Delete production DB". 2. *Kernel:* Checks policy -> DENY. 3. *Kernel:* Returns a structured result: `{ "status": "blocked", "reason": "destructive_action_limit", "message": "Deletion requires human approval" }`. 4. *Agent (LLM):* Receives this as an observation. 5. *Agent (Re-planning):* "Oh, I can't delete it. I will generate a slack message to the admin asking for approval instead."
  This feedback loop turns safety from a "blocker" into a constraint that the agent can reason around, which is critical for autonomous recovery.
exordex19 days ago
I built formal testing for AI agents, runs on the cli, free version launching soon - includes MCP security tests and chaos engineering features: https://exordex.com/waitlist
- yaront11118 days ago
  Exordex is a great tool for the CI/CD pipeline to test agents. Cordum is the Runtime Kernel that enforces those policies in production. Ideally? You use Exordex to test that your agent works, and Cordum to guarantee it stays safe.
hackerunewz19 days ago
Nice job, but is'nt it a bit overkill?
- yaront11119 days ago
  It is overkill for a demo. But for my production environment, I need an external safety layer. I can't rely on 'prompt engineering' when real data is at stake.