Agent Infrastructure Control Plane

Auth, evals, and replay for production agent workflows.

agent_ops combines scoped credentials, release-grade eval suites, and trace-driven replay into one operator-facing system.

Scoped

Short-lived service-agent tokens, connector grants, and approval gates.

Replayable

Every eval run and live workflow becomes a trace you can inspect and rerun.

Release-aware

Gate new workflow versions on pass rate, policy expectations, and regressions.

Policy latency

120ms

Replay status

Ready

Eval gates

3/3

Included pillars

Three products in one app

Agent Auth & Permissions Gateway

Protect connectors, mint safe scopes, and route risky actions to reviewers.

Observability & Replay

Ingest traces, group failures, and replay stored inputs against any workflow version.

Evals-as-a-Service

Build suites, run cases asynchronously, and enforce release gates from one dashboard.

Trust Layer

Centralized approvals

Keep risky writes behind approvers and keep every auth decision inside a durable audit log.

Debugging Layer

Run-by-run timelines

Follow prompts, tool calls, latency, cost, outcomes, and grouped failures without leaving LiveView.

Quality Layer

Workflow CI

Register targets, version them, run suites, and block releases when critical quality metrics degrade.