Submit work from anywhere — Discord, Slack, REST, GitHub, MCP, or a YAML file. TaskSmith queues it, spawns Claude Code headless, validates output, retries intelligently, tracks costs, and pings you when it's done.
tasksmith setup → tasksmith run
Claude Code is brilliant at writing code. TaskSmith handles the rest: queueing, retrying, validating, learning, and notifying.
Submit work from Discord, Slack, REST API, GitHub webhooks, MCP clients, or YAML files. TaskSmith queues it, runs Claude Code headless, and handles the full lifecycle. No terminal babysitting.
Define a validation command. TaskSmith retries until tests pass. The circuit breaker detects stuck loops, infrastructure failures, contradictions, cost ceilings, and timeouts — ejecting doomed tasks before they burn your budget.
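As a rough illustration of the retry-plus-circuit-breaker idea, here is a minimal sketch. Every name in it (`runWithBreaker`, `BreakerLimits`, `Attempt`) is hypothetical rather than TaskSmith's actual API, and it covers only two of the listed trip conditions: a stuck loop (the same failure repeating) and a cost ceiling.

```typescript
// Hypothetical types and names; TaskSmith's real internals may differ.
type Attempt = { passed: boolean; failure?: string; costUsd: number };
type Outcome = {
  status: "passed" | "ejected" | "exhausted";
  reason?: string;
  attempts: number;
};

interface BreakerLimits {
  maxAttempts: number;         // hard cap on iterations
  maxCostUsd: number;          // cost ceiling for the whole task
  maxRepeatedFailures: number; // identical failures in a row that count as a stuck loop
}

function runWithBreaker(
  runOnce: (attempt: number) => Attempt,
  limits: BreakerLimits,
): Outcome {
  let spent = 0;
  let lastFailure: string | undefined;
  let repeats = 0;

  for (let attempt = 1; attempt <= limits.maxAttempts; attempt++) {
    const result = runOnce(attempt);
    spent += result.costUsd;
    if (result.passed) return { status: "passed", attempts: attempt };

    // Consecutive identical failure messages are treated as a stuck loop.
    repeats = result.failure === lastFailure ? repeats + 1 : 1;
    lastFailure = result.failure;

    if (repeats >= limits.maxRepeatedFailures) {
      return { status: "ejected", reason: "stuck loop", attempts: attempt };
    }
    if (spent >= limits.maxCostUsd) {
      return { status: "ejected", reason: "cost ceiling", attempts: attempt };
    }
  }
  return { status: "exhausted", attempts: limits.maxAttempts };
}
```

In this sketch `runOnce` stands in for "invoke Claude Code, then run the validation command"; a real engine would also need the timeout, infrastructure-failure, and contradiction checks mentioned above.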
SOUL.md, USER.md, conventions, memory, and project context assembled into every invocation. 7 bundled skills (ralph-loop, bug-hunt, code-review, research, doc-gen, heartbeat, project-init) in Claude Code's native SKILL.md format.
Three-tier memory — hot (MEMORY.md, every prompt), warm (JSONL, searchable), cold (compressed archives). Per-task JSONL event logs. Semantic search via Ollama, OpenAI, or Gemini. tasksmith insights for automated pattern detection.
tasksmith costs — per-model/project spend breakdown, time-series rollups, budget alerts, and weighted-moving-average forecasting. Daily, weekly, and monthly limits with configurable warning thresholds.
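For readers unfamiliar with weighted-moving-average forecasting, the sketch below shows one common variant. It assumes linear weights (the newest day counts most) and a 7-day window; the function name `forecastNextDay` and both of those choices are illustrative assumptions, not a description of how tasksmith costs actually computes its forecast.

```typescript
// Forecast the next day's spend as a linearly weighted moving average:
// within the window, the oldest day gets weight 1 and the newest gets weight n.
// The weighting scheme and default window are assumptions for illustration.
function forecastNextDay(dailySpendUsd: number[], window = 7): number {
  const size = Math.max(1, Math.floor(window));
  const recent = dailySpendUsd.slice(-size);
  if (recent.length === 0) return 0;

  let weighted = 0;
  let totalWeight = 0;
  recent.forEach((spend, i) => {
    const weight = i + 1;
    weighted += spend * weight;
    totalWeight += weight;
  });
  return weighted / totalWeight;
}
```

Multiplying the result by the days left in a budget period gives a crude projection to compare against a daily, weekly, or monthly limit.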
7 inbound sources (file drop, CLI, REST API, Discord bot, GitHub webhooks, Slack Events, MCP). 5 outbound providers (Discord, Slack, ntfy.sh, email, webhooks). Tasks flow in from anywhere. Results land where you need them.
Chain tasks with depends_on in a directed acyclic graph. Downstream tasks wait. Failure propagates. Cycle detection built in. tasksmith dag shows status.
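One standard way to order a depends_on graph and catch cycles is Kahn's algorithm, sketched below. Only the `depends_on` field name comes from the text above; the `TaskSpec` shape and the `executionOrder` function are hypothetical, and TaskSmith may well use a different approach.

```typescript
// Hypothetical task shape; only `depends_on` is taken from the text above.
interface TaskSpec {
  id: string;
  depends_on?: string[];
}

// Returns task ids in an order where every dependency comes first,
// or throws if the graph has a cycle or names an unknown dependency.
function executionOrder(tasks: TaskSpec[]): string[] {
  const remaining = new Map<string, number>();    // id -> unmet dependency count
  const dependents = new Map<string, string[]>(); // id -> tasks waiting on it
  for (const t of tasks) {
    remaining.set(t.id, 0);
    dependents.set(t.id, []);
  }
  for (const t of tasks) {
    for (const dep of t.depends_on ?? []) {
      const waiting = dependents.get(dep);
      if (!waiting) throw new Error(`unknown dependency: ${dep}`);
      waiting.push(t.id);
      remaining.set(t.id, (remaining.get(t.id) ?? 0) + 1);
    }
  }

  const ready = tasks.filter((t) => remaining.get(t.id) === 0).map((t) => t.id);
  const order: string[] = [];
  while (ready.length > 0) {
    const id = ready.shift()!;
    order.push(id);
    for (const next of dependents.get(id) ?? []) {
      const left = (remaining.get(next) ?? 0) - 1;
      remaining.set(next, left);
      if (left === 0) ready.push(next);
    }
  }

  // Any task never reached still has an unmet dependency, which implies a cycle.
  if (order.length !== tasks.length) throw new Error("dependency cycle detected");
  return order;
}
```

A scheduler built on this would start tasks as they enter `ready`, and on failure mark everything reachable through `dependents` as failed too.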
13 MCP tools and 4 resources. Any MCP client can submit tasks and search memory. tasksmith cc-install registers as a Claude Code MCP server in one command. Agents helping agents.
Human-in-the-loop for high-risk tasks — rule-based matching, timeout auto-reject. REST API auth with bearer tokens and rate limiting. Two-tier input sanitization. Discord guild/channel scoping.
Set model: auto and let TaskSmith pick. Template-based defaults, prompt complexity heuristics, automatic escalation to a stronger model on failure. tasksmith insights compares actual spend with what an all-Opus run would have cost.
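To make "complexity heuristics plus escalation" concrete, here is a deliberately simple sketch. The tier list, the 2,000-character threshold, the keyword pattern, and the `pickModel` function are all assumptions made for illustration; the real selection logic is not described on this page and is likely richer (for example, template-based defaults are not modeled here).

```typescript
// Hypothetical tiers, cheapest first. Names are illustrative shorthand.
const TIERS = ["haiku", "sonnet", "opus"] as const;
type Tier = (typeof TIERS)[number];

function pickModel(prompt: string, failedAttempts: number): Tier {
  // Assumed heuristic: long prompts or "big change" keywords start one tier up.
  const looksComplex =
    prompt.length > 2000 || /\b(refactor|migrate|architecture)\b/i.test(prompt);
  const startIndex = looksComplex ? 1 : 0;

  // Each failed attempt escalates one tier, capped at the strongest model.
  const index = Math.min(startIndex + failedAttempts, TIERS.length - 1);
  return TIERS[index];
}
```

The cost saving comes from the common case: most simple tasks finish on the cheapest tier and never trigger escalation.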
Per-task JSONL event logs survive engine restarts. Iteration checkpointing resumes from last completed iteration — zero wasted tokens. Orphaned tasks auto-recovered on startup.
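A sketch of how resuming from a JSONL event log could work: replay the log, find the highest completed iteration, and start from the next one. The event shape (`type`, `iteration`) and the `nextIteration` function are hypothetical; the actual log schema is not documented here.

```typescript
// Hypothetical event shape; the real log schema may differ.
type TaskEvent = {
  type: "iteration_started" | "iteration_completed";
  iteration: number;
};

function parseEvent(line: string): TaskEvent | undefined {
  try {
    return JSON.parse(line) as TaskEvent;
  } catch {
    return undefined; // e.g. a final line truncated by a crash
  }
}

// Returns the iteration to run next, given the raw contents of a task's log.
function nextIteration(jsonl: string): number {
  let lastCompleted = 0;
  for (const line of jsonl.split("\n")) {
    if (line.trim() === "") continue;
    const event = parseEvent(line);
    if (event?.type === "iteration_completed" && event.iteration > lastCompleted) {
      lastCompleted = event.iteration;
    }
  }
  return lastCompleted + 1;
}
```

An iteration that was started but never completed is simply rerun, so only the interrupted iteration's tokens are spent again.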
From submission to passing tests. Unattended.
YAML file, Discord, Slack, GitHub webhook, REST API, MCP, or CLI
Trust-level validation, smart model selection, approval gate check
Claude Code runs headless with skills injected and full project context
Run tests. Classify failures. Circuit breaker ejects stuck loops. Retry until green.
Archive to memory. Log costs. Push results to Discord, Slack, or your phone.
Official plugins ship with the CLI. Enable any with one line in config. Lazy-loaded — disabled plugins cost nothing.
Auto-create issues on failure. Submit tasks from GitHub issues with tasksmith submit --from-github-issue. Webhook intake with HMAC-SHA256 verification.
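For context, verifying a GitHub webhook with HMAC-SHA256 generally looks like the sketch below. GitHub sends the signature in the X-Hub-Signature-256 header as `sha256=<hex digest>`, computed over the raw request body. The `verifySignature` function is an illustrative helper built on Node's crypto module, not TaskSmith's own code.

```typescript
import { createHmac, timingSafeEqual } from "node:crypto";

// Checks a GitHub-style "sha256=<hex>" signature against the raw request body.
// `secret` is the shared webhook secret configured on both sides.
function verifySignature(
  secret: string,
  rawBody: string,
  signatureHeader: string,
): boolean {
  const expected =
    "sha256=" + createHmac("sha256", secret).update(rawBody, "utf8").digest("hex");
  const a = Buffer.from(expected, "utf8");
  const b = Buffer.from(signatureHeader, "utf8");
  // timingSafeEqual throws on length mismatch, so compare lengths first.
  return a.length === b.length && timingSafeEqual(a, b);
}
```

The body must be the exact bytes received, before any JSON parsing, and the constant-time comparison avoids leaking how many leading characters matched.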
Success rates, model comparison, failure patterns, cost outliers, trends. tasksmith metrics and tasksmith insights for full analytics in your terminal.
Vector-based search over task history via Ollama (local), OpenAI, or Gemini embeddings. tasksmith semantic --query "auth refactor"
Deploy to Cloudflare Pages on task success. Rollback, cache purge, deployment history. tasksmith plugin run cloudflare
tasksmith plugin create my-thing
npm is the plugin manager. Publish to @tasksmith-dev/* or tasksmith-plugin-*.
TaskSmith dogfoods its own development. These numbers are real.
Software architect from Birmingham, AL. Built TaskSmith because Claude Code is powerful but stateless: every session starts from scratch with no queue, no retry, no notifications. OpenClaw's 430k lines were too many. TaskSmith is under 10,000 lines of core TypeScript plus 8 bundled plugins, and it has been dogfooding its own development since v0.5.0.