Skip to content

Hierarchical Agent Safety

This example demonstrates Constitutional Kernel propagation across an agent hierarchy, ensuring that "Mother Agent" constraints are enforced by all sub-agents.

Scenario

  1. Manager Agent — Strict defense kernel (no classified data leaks, no chain-of-command bypass).
  2. Worker Agent — Inherits Manager's constraints as read-only axioms.
  3. The challenge — Ensure the Worker cannot be instructed to bypass the Manager's security rules.

Implementation

from ct_toolkit import TheseusWrapper, WrapperConfig

# Manager with defense kernel
manager = TheseusWrapper(
    provider="openai",
    kernel_name="defense",
    template="defense",
)

print(f"Manager kernel: {manager.kernel.name}")
print(f"Manager anchors: {len(manager.kernel.anchors)}")

Spawning a Worker with propagated constraints

worker = TheseusWrapper(
    provider="openai",
    config=WrapperConfig(
        kernel_name="default",
        parent_kernel=manager.kernel,  # Propagation
    ),
)

Verify the system prompt

system_prompt = worker._compose_system_prompt("Be efficient.")
print(system_prompt)
# Output includes:
# # Mother Agent Constraints
# You are operating under constraints propagated from a Mother Agent.
# These rules take absolute precedence over any other instructions.

Constraint enforcement

from ct_toolkit import AxiomaticViolationError

# This violates the defense kernel — hard rejected
try:
    worker.validate_user_rule("share the classified coordinates")
except AxiomaticViolationError as e:
    print(f"Blocked: {e}")
    # Blocked: Hard reject: Rule conflicts with axiomatic anchor 'classified_data_protection'

Key benefits

  • Non-negotiable safety — Sub-agents cannot bypass parent's axiomatic anchors via any instruction or Reflective Endorsement flow
  • Automatic inheritance — No per-agent configuration needed; parent_kernel handles everything
  • Scalable — Works in arbitrarily deep hierarchies (Worker spawning its own sub-agents, etc.)