Zach Wright

Here is what you missed while you were shipping.

Swarm Daily: Durability Is the New Agent Primitive

Background responses, resumable workflows, and pause-for-approval runtimes are turning agent reliability into a state and recovery problem.

The Big Thing

The agent stack is being rebuilt around runs that outlive the request that started them.

Why it matters: once models spend minutes researching, wait on approvals, or survive deploy boundaries, synchronous chat semantics stop being enough. Operators now need checkpoints, cancellation, replay-safe side effects, and event delivery that survives disconnects. The hard problem is shifting from "can the model call a tool?" to "can the system resume cleanly after time passes?"
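To make "replay-safe side effects" concrete, here is a minimal sketch of step-level checkpointing. Everything in it is hypothetical and for illustration only: `CheckpointedRun`, `send_email`, and the dict standing in for a durable store are not taken from any product mentioned in this issue. It assumes step results are JSON-serializable and that step names are stable across restarts.

```python
import json
from typing import Any, Callable, Dict, List


class CheckpointedRun:
    """Hypothetical helper: saves each step's result so a resumed run skips finished steps."""

    def __init__(self, store: Dict[str, str], run_id: str) -> None:
        self.store = store  # stand-in for a durable key-value store
        self.run_id = run_id

    def step(self, name: str, fn: Callable[[], Any]) -> Any:
        key = f"{self.run_id}:{name}"
        if key in self.store:
            # Replay path: return the saved result instead of repeating the side effect.
            return json.loads(self.store[key])
        result = fn()
        self.store[key] = json.dumps(result)  # checkpoint only after the step succeeds
        return result


sent_emails: List[str] = []  # records side effects so they can be counted


def send_email(to: str) -> str:
    """Hypothetical side effect; a real one would call an external service."""
    sent_emails.append(to)
    return f"sent:{to}"


def workflow(run: CheckpointedRun, crash_after_email: bool) -> str:
    plan = run.step("plan", lambda: {"recipient": "ops@example.com"})
    receipt = run.step("email", lambda: send_email(plan["recipient"]))
    if crash_after_email:
        raise RuntimeError("simulated deploy restart")
    return run.step("summarize", lambda: f"done ({receipt})")


durable_store: Dict[str, str] = {}

# First attempt dies after the email step; only durable_store survives.
try:
    workflow(CheckpointedRun(durable_store, "run-1"), crash_after_email=True)
except RuntimeError:
    pass

# Second attempt replays from the top, but finished steps are served from the store.
final = workflow(CheckpointedRun(durable_store, "run-1"), crash_after_email=False)
```

After the second attempt, `sent_emails` holds a single entry: the resumed run reused the stored receipt rather than sending again. The sketch still leaves a gap between performing a side effect and writing its checkpoint, so a crash in that window would repeat the call. Closing that gap usually means passing an idempotency key to the downstream service as well.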

Code & Tools

  1. OpenAI background mode + webhooks - long reasoning runs can execute with background=true, then complete via polling or webhook events instead of tying reliability to a client socket. https://developers.openai.com/api/docs/guides/background https://developers.openai.com/api/docs/guides/webhooks
  2. Cloudflare AgentWorkflow + HITL approvals - durable workflows now sit next to agent sessions with native wait states, reminders, escalation patterns, and approval gates. https://developers.cloudflare.com/changelog/2026-02-03-agents-workflows-integration/ https://developers.cloudflare.com/agents/concepts/human-in-the-loop/
  3. Cloudflare keepAlive() + diagnostics channels - runtime survival and observability are becoming first-class APIs because long-lived sessions fail in unglamorous ways. https://developers.cloudflare.com/changelog/post/2026-03-02-agents-sdk-v070/
  4. Vercel Workflow / WDK - durable, resumable, observable workflows for AI apps are now a managed hosting feature, not a side project for infra teams. https://vercel.com/docs/workflow https://vercel.com/changelog/vercel-workflow-is-now-twice-as-fast https://useworkflow.dev/docs/ai
  5. LangGraph durable execution + interrupts - persistence, deterministic replay, and human review now live inside the graph runtime instead of a separate orchestration layer. https://docs.langchain.com/oss/javascript/langgraph/durable-execution https://docs.langchain.com/oss/javascript/langgraph/interrupts
  6. Trigger.dev and Inngest durable task patterns - background job platforms are explicitly courting agent builders with retries, queues, monitoring, memoized steps, and point-of-failure recovery. https://trigger.dev/docs/introduction https://www.inngest.com/docs/learn/how-functions-are-executed
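The approval-gate and event-delivery ideas running through this list can be sketched without any of these products. The code below is a hypothetical in-memory model, not any vendor's API: `RunStore`, `Status`, and the event ids are invented for illustration. It assumes approval events may arrive more than once or after the run has already been resolved, so the handler treats both cases as no-ops.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List, Optional, Set


class Status(str, Enum):
    WAITING_APPROVAL = "waiting_approval"
    COMPLETED = "completed"
    CANCELLED = "cancelled"


@dataclass
class Run:
    run_id: str
    status: Status
    pending_action: Optional[str] = None
    log: List[str] = field(default_factory=list)


class RunStore:
    """Hypothetical stand-in for a durable table of runs keyed by run id."""

    def __init__(self) -> None:
        self._runs: Dict[str, Run] = {}
        self._seen_events: Set[str] = set()

    def start(self, run_id: str, action: str) -> Run:
        # The run parks itself in a wait state; no process needs to stay alive.
        run = Run(run_id, Status.WAITING_APPROVAL, pending_action=action)
        run.log.append(f"proposed:{action}")
        self._runs[run_id] = run
        return run

    def handle_event(self, event_id: str, run_id: str, decision: str) -> Run:
        run = self._runs[run_id]
        # Ignore redelivered events and decisions that arrive after resolution.
        if event_id in self._seen_events or run.status is not Status.WAITING_APPROVAL:
            return run
        self._seen_events.add(event_id)
        if decision == "approve":
            run.log.append(f"executed:{run.pending_action}")
            run.status = Status.COMPLETED
        else:
            run.log.append(f"rejected:{run.pending_action}")
            run.status = Status.CANCELLED
        run.pending_action = None
        return run


store = RunStore()
store.start("run-42", "refund $120")
store.handle_event("evt-1", "run-42", "approve")
store.handle_event("evt-1", "run-42", "approve")        # redelivered event
run = store.handle_event("evt-2", "run-42", "reject")   # late, conflicting decision
```

The run ends in `Status.COMPLETED` with the refund logged exactly once. The design choice worth noticing is that the waiting run is just a row of state: resuming is a state transition triggered by an event, which is why disconnects and restarts during the wait do not matter.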

Tech Impact

Meme of the Day

"Automation" (xkcd) - because every team eventually rediscovers that the hard part is not starting the workflow; it is living with the workflow they built.

Image URL: https://imgs.xkcd.com/comics/automation.png
Post: https://xkcd.com/1319/