2026

Agentic Audit Platform

Durable Execution Environment

Durable AgentsTool CallingHuman-in-the-LoopWorkflow Orchestration

Overview

A reference architecture for AI-native audit execution—a durable substrate where agents ingest documents, reason over structured engagement state, call tools, request missing inputs, and pause/resume across long-running workflows with explicit human control points. Transforms auditing from human-scale document workflow into machine-scale procedural execution.

Challenges

Audits span weeks with complex document dependencies and approval gates
Existing tools lack durable state management for long-running agent workflows
Human oversight required at critical decision points without breaking execution flow
Document ingestion requires structured extraction across varied formats

Approach

Built execution graph architecture with persistent state checkpointing
Implemented tool schema with Zod for structured document extraction
Created human approval gates that pause/resume without losing context
Designed MCP server integration for external system orchestration

Durable Agent State Management

typescript

// Checkpoint-based execution with human approval gates
interface AuditState {
  engagementId: string
  documents: ProcessedDocument[]
  findings: Finding[]
  pendingApprovals: ApprovalRequest[]
  checkpointId: string
}

async function executeAuditStep(state: AuditState) {
  const checkpoint = await loadCheckpoint(state.checkpointId)
  
  // Resume from last known state
  const agent = createAgent({
    model: "claude-sonnet-4-20250514",
    tools: [extractDocument, analyzeFinding, requestApproval],
    state: checkpoint
  })

  const result = await agent.run({
    onApprovalRequired: async (request) => {
      // Persist state and pause for human review
      await saveCheckpoint({ ...state, pendingApprovals: [request] })
      return { status: "paused", resumeToken: request.id }
    }
  })

  return result
}

Tech Stack

TypeScriptVercel AI SDKSupabaseZodMCP ServersClaude Agent SDK

Outcomes

Reduced manual document processing time by 80%
Achieved reliable pause/resume across multi-day workflows
Enabled non-technical auditors to interact with agent via natural language

VLM Training Pipeline XTRAP AI Safety Research