PVS-1: Policy Verdict Schema

Status of This Memo

This document specifies a standards track protocol for the Agent Control Layer ecosystem and requests discussion and suggestions for improvements. Distribution of this memo is unlimited.

Abstract

The Policy Verdict Schema (PVS-1) defines a standard JSON structure for policy enforcement decisions produced by ACL's The Gavel (and compatible policy engines). It is designed to be:

Simple enough to embed inside ADP-1 agent steps
Expressive enough for security, compliance, and monitoring
Extensible for future policy engines and additional metadata

1. Terminology

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.

Policy: A rule or constraint that agent outputs must satisfy.

Verdict: The result of evaluating content against one or more policies.

Policy Engine: A system that evaluates content against policies and produces verdicts.

The Gavel: ACL's reference policy engine implementation.

2. Schema

A PVS-1 verdict is a single JSON object with the following fields:

{
  "version": "pvs-1",
  "decision": "allow",
  "approved": true,
  "reasoning": "Short explanation of the decision.",
  "policy_violations": [],
  "confidence_score": 0.97,
  "policy_set": ["No PII leakage", "No financial advice"],
  "metadata": {
    "engine": "the-gavel",
    "engine_version": "2.0.0",
    "latency_ms": 520,
    "tenant_id": "tenant-123",
    "agent_id": "coach"
  }
}

2.1 Required Fields

Field	Type	Description
`version`	string	MUST be `"pvs-1"` for this spec
`decision`	string	One of `"allow"`, `"deny"`, or `"escalate"`
`approved`	boolean	`true` if `decision` is `"allow"`, `false` otherwise. Retained for backwards compatibility
`reasoning`	string	Human-readable explanation of the decision
`policy_violations`	array	Policy descriptions that were violated (empty if decision is "allow")
`confidence_score`	number	Engine confidence in verdict (0.0 to 1.0)

Decision Values

"allow" — Content passes policy evaluation; proceed with execution
"deny" — Content violates policy; block execution
"escalate" — Engine is uncertain or policy requires human review

2.2 Optional Fields

Field	Type	Description
`policy_set`	array	Policy names evaluated
`metadata`	object	Engine-specific metadata

Consumers MUST ignore unknown keys in metadata.

2.3 Constraints

When decision is "allow", policy_violations MUST be an empty array.
When decision is "allow", approved MUST be true.
When decision is "deny" or "escalate", approved MUST be false.
confidence_score MUST be between 0.0 and 1.0 inclusive.
Low confidence (< 0.7) SHOULD result in decision: "escalate".

3. The Gavel Integration

The Gavel (ACL's reference policy engine) uses this TypeScript interface:

type PolicyDecision = "allow" | "deny" | "escalate";
 
interface PolicyEvaluation {
  version: string;
  decision: PolicyDecision;
  approved: boolean;  // Derived: true if decision === "allow"
  reasoning: string;
  policy_violations: string[];
  confidence_score: number;
  policy_set?: string[];
  metadata?: {
    engine: string;
    engine_version: string;
    latency_ms: number;
    tenant_id?: string;
    agent_id?: string;
  };
}

The Gavel determines decision based on:

Clear violation detected → decision: "deny"
No violations, high confidence (≥ 0.7) → decision: "allow"
Uncertain or low confidence (< 0.7) → decision: "escalate"

To produce PVS-1 compliant output, policy engines SHOULD:

Add version: "pvs-1" to its JSON output
Include decision with the appropriate value
Derive approved from decision for backwards compatibility
Include policy_set with evaluated policy names
Include metadata.engine identifying the engine
Include metadata.engine_version with semantic version
Include metadata.tenant_id and metadata.agent_id when available

3.1 Escalation Handling

When a policy engine returns decision: "escalate", the consuming system SHOULD route the verdict to a human review process. The specific implementation is left to the platform.

Content → Policy Engine → PVS-1 Verdict
                              ↓
                    decision === "allow"    → Proceed
                    decision === "deny"     → Block
                    decision === "escalate" → Human Review
                              ↓
                    Human Reviewer → Approve/Reject

Recommended escalation triggers:

Low confidence score (< 0.7)
Ambiguous policy match
High-stakes operation (configured per use case)
Explicit policy requiring human review
System uncertainty or error conditions

Platforms implementing PVS-1 SHOULD:

Pause execution when decision === "escalate"
Present the verdict to a human reviewer
Allow the reviewer to approve (proceed) or reject (block)
Log the human decision for audit purposes

4. Embedding in ADP-1

PVS verdicts are designed to embed directly inside ADP-1 steps as observation.output:

{
  "action": {
    "type": "tool_call",
    "name": "policy_judge",
    "input": {
      "draft_output": "...",
      "policies": ["No PII", "No financial advice"]
    }
  },
  "observation": {
    "type": "tool_result",
    "output": {
      "version": "pvs-1",
      "decision": "deny",
      "approved": false,
      "reasoning": "Draft contained direct SSN.",
      "policy_violations": ["No PII"],
      "confidence_score": 0.98,
      "policy_set": ["No PII", "No financial advice"],
      "metadata": {"engine": "the-gavel", "latency_ms": 650}
    }
  }
}

This enables downstream systems to:

Enforce decisions (block/allow/escalate)
Route uncertain verdicts to human reviewers
Aggregate policy violation statistics
Audit and explain why output was blocked or escalated

5. Security Considerations

5.1 Threat Model

Threat	Mitigation
Verdict Tampering	Sign verdicts; store in immutable audit log
Policy Bypass	Enforce verdicts at policy enforcement points
False Negatives	Use confidence_score thresholds; human review for low confidence
Information Leakage	Redact sensitive content from reasoning field

5.2 Verdict Integrity

Verdicts SHOULD be signed when stored or transmitted
Systems MUST NOT allow agents to modify their own verdicts
Audit logs SHOULD include original content hash alongside verdict

5.3 Confidence Thresholds

Implementations SHOULD define confidence thresholds:

confidence_score >= 0.9: Auto-enforce verdict
confidence_score < 0.9: Queue for human review

6. Conformance

6.1 Conformance Levels

Level 1 (Core): An implementation MUST:

Emit valid JSON conforming to PVS-1 schema
Include all required fields
Enforce the approved/policy_violations constraint
Use valid confidence_score range

Level 2 (Extended): An implementation MUST also:

Include policy_set with evaluated policies
Include metadata.engine identifying the policy engine
Provide meaningful reasoning text

Level 3 (Complete): An implementation MUST also:

Include metadata.latency_ms for performance monitoring
Support verdict signing for integrity
Integrate with ADP-1 step embedding

6.2 Schema Validation

A JSON Schema for PVS-1 is provided at schemas/pvs-1.schema.json. Conforming implementations SHOULD validate verdicts against this schema.

7. Future Work

Structured Violations: Objects with IDs, severities, remediation hints
Policy Categories: privacy, financial, safety taxonomies
Policy Links: References to machine-readable policy definitions
Escalation Priority: Levels like urgent, normal, low for triaging human review

8. References

8.1 Normative References

[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, March 1997.
[RFC8174] Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words", BCP 14, RFC 8174, May 2017.
[RFC8259] Bray, T., Ed., "The JavaScript Object Notation (JSON) Data Interchange Format", STD 90, RFC 8259, December 2017.

8.2 Informative References

[ADP-1] AControlLayer, "Agent Data Protocol", ADP-1, 2025.

9. Acknowledgments

The authors thank the early reviewers and implementers who provided feedback on this specification.

Lm Review Response Readme