Proofrail for OpenClaw

An execution harness for OpenClaw agents. Evidence gates, post-change verification, risk controls — making agents work in a disciplined, reviewable way.

It focuses on the runtime behaviors that matter in practice:

advisory-first evidence-before-mutation guidance
mutation-before-verification tracking
repeated low-signal probe blocking
dangerous command approval or blocking
large output summarization
per-session compaction anchors
session-scoped task ledger
post-mutation validation suggestions
JSONL audit trails for guardrail decisions

Why Proofrail exists

Prompt quality matters, but workflow control matters just as much. Proofrail is not a prompt pack. It is a control-and-correction plugin that sits around tool use and enforces a tighter engineering loop:

no evidence, no mutation in strict mode; actionable advisory in the default mode
after changes, validate before continuing, with batch-friendly advisory defaults
risky actions are blocked by default; approval mode is opt-in when the host has a working plugin approval route

This is what turns “just answer” behavior into “inspect, act, verify, recover” behavior. The goal is simple: less blind mutation, less state drift, and more verifiable work.

Release Status

This tree is prepared for release: v0.0.6.

Notable v0.0.6 scope includes:

pending-verification state survives compaction snapshots and is restored when a session resumes
read-only validation commands now include pip show, npm ls, python -m json.tool, python -m py_compile, Test-Path, and Get-Command
unknown-target mutating exec commands record an advisory, and ignored advisories are written to the audit log
strict + batch validation allows mutations below mutationBatchMax; after_each_mutation remains fail-closed
compiled dist/ runtime output is included for ClawHub and OpenClaw package loading

The current scope is intentionally OpenClaw-first:

make the OpenClaw plugin production-grade first
keep module boundaries clean enough to extract reusable core later
keep host-specific integration narrow so future variants can land cleanly

Naming

The public product name is Proofrail.

The OpenClaw variant is published as:

package name: proofrail-openclaw
plugin id: proofrail
release title: Proofrail for OpenClaw

In OpenClaw config, use plugins.entries.proofrail.config.

This naming keeps the Proofrail brand consistent across docs, config, runtime artifacts, and future host-specific variants.

Configuration

Plugin-specific settings are read from plugins.entries.proofrail.config.

Config surface exposed in openclaw.plugin.json:

enforcementMode: advisory, strict, guarded, or off (default: advisory)
advisoryInjection: compact, full, or off (default: compact)
validationPolicy: batch, after_each_mutation, immediate, or off (default: batch)
mutationBatchMax: maximum unverified mutation batch size before advisory severity escalates
dangerousCommandAction: approve, block, warn, or allow (default: warn)
summaryThresholdChars: summarization threshold for oversized tool output
lowSignalBlockThreshold: repeated low-signal probe threshold

Hook decisions prefer the live per-handler config injected by OpenClaw (event.context.pluginConfig) and fall back to api.pluginConfig when the hook payload does not carry context.

Host-global state such as tools.exec.security still comes from the runtime config snapshot.

Installation

From a local working tree:

openclaw plugins install <path-to-proofrail-openclaw>

From a packed tarball:

npm pack
openclaw plugins install npm-pack:./proofrail-openclaw-0.0.6.tgz

After install, enable the plugin if needed and restart the serving Gateway before expecting hook behavior to change.

Runtime Model

Current runtime state intentionally supports only observe, execute, and review. plan and wait_user remain reserved contract states for future workflow tools.

Design Goals

Keep deterministic guardrails in runtime hooks, not fragile prompt prose.
Keep the plugin modular so future plan, task, and ask-user features do not collapse into one file.
Keep host-specific code narrow so shared abstractions can be extracted later.
Keep runtime artifacts out of source control.
Keep the public release tree clean: typed, packable, and reviewer-friendly.

Module Layout

index.ts: plugin entry only
lib/register-hooks.ts: host-facing hook wiring and orchestration
lib/constants.ts: policy constants and appended behavior rules
lib/types.ts: local contracts for hook-facing code
lib/text.ts: pure text extraction and normalization helpers
lib/path.ts: file/path hint helpers
lib/tooling.ts: shared runtime configuration and normalization helpers
lib/tool-normalize.ts: canonical tool-name and category normalization
lib/command-risk.ts: shell command risk classification, mutating exec detection, and validation exec detection
lib/evidence-policy.ts: evidence, low-signal, intent, and mutation labeling policies
lib/result-status.ts: normalized tool result success/failure detection
lib/session-state.ts: in-memory session state lifecycle
lib/validation.ts: changed-path derivation and narrow validation suggestions
lib/task-ledger.ts: session task context, close summaries, and final review checklist
lib/audit.ts: best-effort JSONL audit trail
lib/compaction.ts: compaction snapshot persistence
lib/workflow/contracts.ts: forward-looking workflow interfaces for tasks, plans, ask-user, and policy decisions
schemas/: forward-looking contracts for planned workflow primitives
evals/: reviewer-facing evaluation stubs for safety, workflow, and recovery
docs/: maintainer-facing architecture notes

Runtime Artifacts

The plugin writes runtime artifacts under the OpenClaw state directory:

compaction snapshots under state/plugins/proofrail/sessions/<session-key>/last-compaction-snapshot.json
JSONL audit entries under state/plugins/proofrail/audit.jsonl

If the host runtime does not expose api.runtime.state.resolveStateDir(), Proofrail falls back to a local .proofrail/ directory under the plugin root for best-effort development smoke tests.

These paths are runtime artifacts and must stay out of version control.

Planned Next Steps

Phase 1, still OpenClaw-only:

add durable task records
add plan mode primitives
add structured ask-user state
add worktree isolation policy
add broader eval coverage for safety and recovery paths

Phase 2:

extract shared runtime core from stable OpenClaw modules
add additional host integrations on top of the stable core

Non-Goals

copying proprietary prompts or closed-system internals
implying official affiliation with OpenClaw, any model vendor, or any provider
mixing local instance identity, credentials, or private ops policy into the plugin

Release Verification

Before publishing a release bundle, run from a clean checkout:

npm install
npm run typecheck
npm run smoke
npm pack --dry-run

Release notes for this tree live in CHANGELOG.md.

Review Bundle Expectations

A reviewable bundle should include:

source files and maintainer docs
a deterministic smoke test for hook behavior
package metadata for OpenClaw plugin discovery
no machine-specific absolute paths or private deployment notes
no dead repository links before the public repo exists
no mixed-language user-facing runtime text

Proofrail for OpenClaw

Proofrail for OpenClaw

Why Proofrail exists

Release Status

Naming

Configuration

Installation

Runtime Model

Design Goals

Module Layout

Runtime Artifacts

Planned Next Steps

Non-Goals

Release Verification

Review Bundle Expectations

Source and release

Source repository

Source commit

Install command

Metadata

Compatibility