GitPedia

Umadev

A project-director agent that drives your Claude Code / Codex / OpenCode to plan, build, review, govern & ship software like a real dev team — runnable code + a delivery proof. Brings no model of its own.项目总监 Agent:驱动你已登录的 Claude Code / Codex / OpenCode,像真实开发团队那样规划、编码、评审、治理、交付,交还可运行代码。

From umacloud·Updated June 28, 2026·View on GitHub·

**umadev** is A project-director agent that drives your Claude Code / Codex / OpenCode to plan, build, review, govern & ship software like a real dev team — runnable code + a delivery proof. Brings no model of its own.项目总监 Agent:驱动你已登录的 Claude Code / Codex / OpenCode,像真实开发团队那样规划、编码、评审、治理、交付,交还可运行代码。 The project is written primarily in Rust, distributed under the MIT License license, first published in 2026. Key topics include: agents, ai, ai-agents, ai-coding, claude-code.

Latest release: v1.0.15
June 28, 2026View Changelog →

umadev

<div align="center"> <img src="docs/assets/umadev-logo-en.png" alt="umadev" width="760">

A project-director agent for the coding CLI you already use.

License: MIT
Rust
Spec
Version

English | 简体中文 | 繁體中文

</div>

umadev is a single command-line tool that drives an AI coding CLI you already have — Claude Code, Codex, or OpenCode — as one continuous session. It owns no model of its own: the model your base is connected to is the brain.

You describe what you want in plain language, and umadev drives the base to build it: it plans the work, writes the code, convenes a review team — product, architecture, design, frontend, backend, QA, security — runs quality and governance checks, and hands back runnable code plus a delivery proof. It sizes itself to the task: a small edit stays a small edit; a full project gets the full pipeline.

The central idea is that umadev works like a real software delivery team. It leads a roster of roles — product manager, architect, UI/UX designer, frontend, backend, QA, security, DevOps, and the director — that plan, build, review, and sign off the way a real team does. The director doesn't type code directly; the base (the AI coding CLI) is the engineer that does the work. The roles coordinate only through shared artifact files and structured verdicts — they never chat to each other.

It's a single Rust binary. npm is just the delivery shell.


Table of Contents


Install

bash
npm install -g umadev

The npm package is a distribution shim. The actual program is a Rust binary. Prebuilt binaries ship for macOS (Apple Silicon and Intel), Linux (x86_64 and ARM64), and Windows (x86_64).

The install also pulls a small local embedding model (multilingual-e5-small, f16, ~224 MB) as an optional dependency and wires it up automatically — it powers the offline vector search with no API key and no runtime network, no manual download step. If your registry or network skips the optional download, umadev still works: retrieval falls back to BM25-only, and re-running npm install -g umadev restores the vector channel.

Build from source:

bash
git clone https://github.com/umacloud/umadev.git cd umadev && cargo build --release --features vector-local ./target/release/umadev --version

Building from source? The embedding model is not in the repo (it's too large for git, ~224 MB). A plain cargo build --release gives a BM25-only binary; the local vector channel needs the --features vector-local flag and the model on disk. The prebuilt binaries and npm i bundle both automatically — for a source build, download multilingual-e5-small into ~/.umadev/embed-model/ once:

bash
mkdir -p ~/.umadev/embed-model && cd ~/.umadev/embed-model for f in config.json tokenizer.json model.safetensors; do curl -fsSL "https://huggingface.co/intfloat/multilingual-e5-small/resolve/main/$f" -o "$f" done

umadev auto-discovers it there (or point UMADEV_EMBED_MODEL_DIR at any directory with those three files). Without the model, umadev still works — retrieval just falls back to BM25-only.

You also need one AI coding CLI installed and logged in — that's the brain umadev drives:

BaseInstallLog in
Claude Codenpm i -g @anthropic-ai/claude-codeclaude auth login
Codexnpm i -g @openai/codexcodex login
OpenCodesee opencode.aiopencode auth login

umadev injects nothing into the base. Whatever your base is configured with — an official login or your own third-party / local-model routing — is exactly what runs.


Quickstart

bash
umadev # launch the chat UI; first run lets you pick a base

Tell it what you want built:

text
> add CSV export to the reports page > build me a todo app with a Postgres backend > /goal ship a working SaaS landing page # keep working until the goal is met

Or run a build non-interactively:

bash
umadev run "add CSV export to the reports page" --backend claude-code

umadev sizes the work to the request — you don't pick a mode. A build typed in chat earns the same planning, team review, governance, and delivery proof as a /run — there is no separate chat path and build path. All builds run on an isolated umadev/<slug> branch; your working branch is never touched, and umadev never merges or pushes on its own.

A full worked example

Suppose you run:

bash
umadev init umadev

Then type:

text
Build a course-booking mini app. Users can browse courses, pick a time, book, cancel. Admins can manage courses and bookings.

umadev will:

  1. Route the request. The base's own model judges this: a full build, not a quick edit. You see an intent card — "full build, entering the delivery flow" — before any code is written.
  2. Clarify. Fill in sensible defaults for target platform, payment scope, and admin complexity. Auto mode self-resolves; manual mode lets you confirm each.
  3. Research. When the base has web access, search competing booking apps — features, pricing, design trends, real user reviews — and merge that with built-in knowledge about booking systems, admin CRUD, permissions, and form validation. Everything lands in output/<slug>-research.md.
  4. Draft three core documents. PRD (roles, scope, EARS acceptance criteria), architecture (data model, APIs, auth, deployment), UI/UX (design direction, tokens, typography, component states, icon library). Pause for your review.
  5. Build an execution plan. A dependency plan (plan.json) rendered as a live checklist. Each task links back to its FR id.
  6. Implement the frontend. The director schedules each task, the base writes the code, each file is governance-checked as it lands.
  7. Pause for preview. You see the running app before backend work starts.
  8. Implement the backend and integration.
  9. Run the full quality gate. Build, test, lint, contract check, runtime probe, governance scan.
  10. Produce the delivery pack. Scorecard, proof pack, compliance mapping — on disk and ready to hand to a teammate, client, or reviewer.

This is not "AI chatted and said done." It leaves real artifacts on disk.


Project Origin

umadev evolved from the original shangyankeji/super-dev project.

Early super-dev was closer to an AI coding governance tool. It focused on what AI-generated code must not contain: emoji icons, hardcoded colors, unsafe patterns.

umadev has grown that into a full project-director agent:

  • From single-point governance to whole-pipeline governance. It no longer only checks code; every phase from requirement to delivery is brought under the process and its gates.
  • From loose scripts to a spec-driven system. The source of truth is UMADEV_HOST_SPEC_V1, 25 clauses, ~112 governance rules.
  • Rewritten in Rust. One binary, fast startup, low dependency surface, cross-platform distribution.
  • From blocking bad output to driving the base through the process. Claude Code / Codex / OpenCode are the brain and hands; umadev is the director and governance rail wrapped around them.

super-dev asked: "how do we stop AI from writing bad code?" umadev asks: "how do we make AI deliver a complete, shippable, auditable project — the way a real software team would?"


What Problem It Solves

Most people hit the same problems when they use AI coding tools at scale:

  • The AI starts coding immediately, without a PRD, architecture, or acceptance criteria.
  • The frontend is built, but backend API paths do not match.
  • The UI looks generic: random colors, random fonts, template-like composition.
  • The AI leaves placeholders, fake data, and TODOs — and still says "done."
  • After one requirement change, context drifts and earlier decisions are forgotten.
  • Code exists, but there is no quality report or delivery evidence.
  • Your team has standards and internal knowledge, but you copy them manually into prompts.

umadev turns those problems into a structured workflow:

mermaid
flowchart LR A["One requirement"] --> B["Route & plan"] B --> C["Research"] C --> D["PRD / Architecture / UI/UX"] D --> E["Execution plan"] E --> F["Frontend"] F --> G["Preview"] G --> H["Backend"] H --> I["Quality gate"] I --> J["Delivery proof"]

Features

  • Drives a base you already use. Claude Code, Codex, or OpenCode, run as one continuous session so the base keeps its context across a whole build instead of starting cold each step. No API key of its own.
  • Works like a real dev team. The director convenes a roster of nine named roles — product manager, architect, UI/UX designer, frontend engineer, backend engineer, QA, security, DevOps, and the director itself — that plan, implement, review, and sign off the way a real team does.
  • Plans the work and shows it. A build becomes a dependency plan (.umadev/plan.json) rendered as a live checklist you can steer with /plan. Steps are driven step by step; the director owns the plan, not the base.
  • A build in chat is a real build. A build typed in the chat UI earns the same planning, team scheduling, governance, and delivery proof as umadev run. There is no weaker chat path.
  • Ships a delivery proof. PRD, architecture, and UI/UX docs, a scorecard, and a proof pack — scaled to the task, so a one-page change doesn't get an enterprise dossier.
  • Carries engineering standards into the base — with a fully-local dual-channel RAG. 418 curated knowledge files (commercial-grade engineering standards, design rules) plus a map of your existing code are compiled into the binary and retrieved on every working turn by a two-channel hybrid engine: pure-Rust BM25 + a local vector model (multilingual-e5-small, f16, via candle) fused with RRF, HyDE query expansion on top. No API key, no network, millisecond recall over your own standards and business docs. Zero config.
  • Self-evolving memory — it learns from each run. Mistakes the base hits are recorded with a frequency signal to a local store; a genuine recurrence triggers a higher-level corrective reflection. Both are recalled into later prompts, so the same pitfall isn't repeated — umadev gets better on your codebase the more you use it.
  • A real terminal UI. Markdown, syntax-highlighted code, live diff cards as files change (word-level highlighting), folding tool rows, a build-completion card with a clickable preview URL, and slash commands throughout.
  • Governance you can audit. A trust dial (plan / guarded / auto), irreversible actions always confirm on every tier, an MCP server (umadev mcp serve) that exposes the governor to other tools, and compliance mapping (SOC 2 / ISO 27001 / EU AI Act).
  • Goal-until-met builds. /goal <objective> drives the base to keep working until the objective is met — native /goal on all three bases; UMADEV_NO_GOAL_MODE=1 opts out.

How It Works

A turn flows through up to five layers. Every consult of the base fails open to a safe default, so a bug or an unreachable base never blocks you.

mermaid
flowchart TB User["User — requirement / review / preview / deploy"] --> UI["umadev TUI / CLI<br/>chat + slash commands"] UI --> Director["umadev-agent<br/>Router · Plan · Schedule · Verify · Finalize"] Director --> Spec["umadev-spec<br/>UMADEV_HOST_SPEC_V1 · 25 clauses"] Director --> Knowledge["umadev-knowledge<br/>418 files · BM25 + vector · repo-map"] Director --> Governance["umadev-governance<br/>~112 rules · audit · compliance"] Director --> Contract["umadev-contract<br/>OpenAPI · frontend↔backend path check"] Director --> Runtime["umadev-runtime<br/>Runtime trait"] Runtime --> Host["umadev-host<br/>claude-code · codex · opencode<br/>(subprocess drivers)"] Runtime --> Offline["Offline templates<br/>CI / no-base fallback"] Host --> Workspace["Workspace<br/>source · output · release"] Offline --> Workspace Governance --> Evidence[".umadev/audit/<br/>tool-calls.jsonl · verify.jsonl"] Contract --> Quality["Quality gate"] Evidence --> Quality Quality --> Release["proof-pack-*.zip<br/>scorecard-*.html"]

The five layers in plain English:

  1. Route. The base's own model judges what your message needs — a small edit, a debug, or a full build. The intent is surfaced immediately ("small change, on it" vs. "full build, entering the delivery flow") so you can correct it before work starts. The base model is authoritative; if it is unreachable, the turn falls to the lightest path, never a keyword guess.
  2. Plan. A real build is broken into a dependency plan that umadev owns and renders as a live checklist. The plan is stored in .umadev/plan.json and is steerable with /plan.
  3. Schedule. umadev walks the plan step by step. Writing roles (frontend / backend engineer) drive the main session serially — single-writer. Reviewing roles (product manager, architect, designer, QA, security, frontend, backend, DevOps) each get their own read-only forked session and review in parallel, returning structured verdicts. Roles communicate only through shared artifact files and their verdicts.
  4. Verify and self-correct. Each step is checked against its acceptance criteria on a deterministic floor — coverage, contract, build/test, hard gates — not by the model self-assessing "good enough." Blocking findings come back as one fix directive with evidence attached. The loop is bounded by a gap counter and stall counter; it ends cleanly when done or genuinely stuck.
  5. Finalize. Once the floor is clean, umadev produces the delivery artifacts and proof pack. The run's episodes feed the lessons store for future runs.

Underneath every path, a curated system prompt — identity, engineering and design standards, the relevant slice of the knowledge base, recalled pitfalls from past runs, and an outline of your existing code — is injected into the base on every working turn.

The firmware is pre-loaded once at launch, so the first reply doesn't pay the 30–60s per-message cold start that re-priming from scratch would cost.


The Review Team

umadev convenes a nine-seat review roster. Each role is a real job function, not a label.

mermaid
flowchart TB Director["Director<br/>owns plan · drives main session · aggregates verdicts"] subgraph Writing ["Writing roles — main session, serial (single-writer)"] FE["Frontend engineer"] BE["Backend engineer"] end subgraph Reviewing ["Reviewing roles — read-only fork, parallel"] PM["Product manager<br/>scope · acceptance criteria"] Arch["Architect<br/>data model · APIs · scalability"] Design["UI/UX designer<br/>tokens · typography · component states"] QA["QA engineer<br/>test coverage · edge cases"] Sec["Security<br/>SAST · secrets · auth"] FER["Frontend reviewer<br/>contract · accessibility · slop"] DevOps["DevOps<br/>Dockerfile · CI · deploy"] end Director --> Writing Director --> Reviewing Writing --> Artifacts["Shared artifact files<br/>output/*.md · source tree"] Reviewing --> Artifacts Artifacts --> Verdicts["RoleVerdict { accepts, blocking, advisory, evidence }"] Verdicts --> Director

How the team works:

  • Writing roles drive the main session serially. Only one writer touches source at a time.
  • Reviewing roles each run on their own read-only forked session, in parallel, and return a RoleVerdictaccepts, blocking findings with evidence, and advisory notes.
  • The director aggregates deterministically. The deterministic floor governs loop control; critic opinions are advisory only. Blocking findings are folded into one rework directive injected back into the main session, along with the evidence.
  • Roles never chat to each other. The shared blackboard is the output artifact files and the source tree. The only communication channel between roles is the verdict.
  • The team scales with the task. A bugfix convenes no team. A greenfield build convenes the full roster. Complexity determines the seats.

Why You Can Trust the Output

Trust in the output comes from four things that happen on every build:

1. The deterministic floor runs regardless of what the model thinks

Build, lint, typecheck, and test results are checked directly. The acceptance floor checks spec coverage and contract alignment — the model doesn't self-report "it passed."

2. The frontend↔backend contract is verified mechanically

umadev-contract parses the architecture doc's API table into a typed spec, renders openapi.json to .umadev/contracts/, extracts every fetch/axios call in the frontend source, and cross-validates the paths. A mismatch is a blocking finding.

3. Every important action leaves evidence

Tool calls, verification runs, and critic verdicts are written to .umadev/audit/ as JSONL. The proof pack includes the evidence chain.

4. Governance runs on every file write

~112 rules check for emoji-as-icons, hardcoded colors, leaked secrets, AI-slop UI patterns, and unsafe code constructs. They run as a pre-write hook into Claude Code, as a pre-commit hook in git, and as part of the quality gate. All rules are configurable in .umadev/rules.toml and are fail-open — a bug in the governor never blocks your work.


Runtime Modes

Drive a local AI coding CLI (the product)

Backend IDBinaryHow umadev calls it
claude-codeclaudeclaude --print --output-format text
codexcodexcodex exec --sandbox workspace-write
opencodeopencodeopencode run

The base brings its own model — umadev has none

umadev connects to no model API and stores no credentials of its own. The base uses its own configured model — your logged-in subscription, or whatever third-party / local model you routed through the base. By default umadev passes no --model flag; the base runs on its own configuration. Use /model <id> to override a session, or change the model in the base's own config.

umadev reads and surfaces the base's current model and reasoning effort in /status — it reads ~/.claude/settings.json for Claude Code, ~/.codex/config.toml for Codex, and opencode.json for OpenCode — but never overrides those values unless you explicitly ask it to.

Wider model coverage means routing the base to a third-party or local model. That is the base's job. umadev does not add new base drivers for that.

Offline templates (internal fallback, not a product mode)

Offline mode (/offline) makes no model calls and no network requests. It is an internal deterministic fallback for demos, smoke tests, and CI — not a mode you choose for real work. The first-run picker lists only the three bases. The output is templates with TODO placeholders, not real code.


The Full Delivery Flow

For a heavyweight greenfield build, the director can expand the plan into the full nine-phase chain. This is the most complete delivery path — not a funnel every message is forced through.

mermaid
flowchart LR CL["clarify"] --> R["research"] R --> D["docs<br/>PRD · Architecture · UI/UX"] D --> G1["docs_confirm<br/>gate"] G1 --> S["spec<br/>plan · tasks"] S --> F["frontend"] F --> G2["preview_confirm<br/>gate"] G2 --> B["backend"] B --> Q["quality gate"] Q --> L["delivery"]

Small tasks have a lightweight path — the router routes to the right depth, and the plan expands to fit. Declare the task type with /kind (fullstack / frontend-only / backend-only / bugfix / refactor) and the plan trims accordingly.

Phase outputs

PhaseWhat it produces
clarifyoutput/<slug>-clarify.md, output/<slug>-clarify-answers.md
researchoutput/<slug>-research.md — web research + knowledge base hits merged
docsoutput/<slug>-prd.md, output/<slug>-architecture.md, output/<slug>-uiux.md
docs_confirmGate — pause for review before any code is written
specoutput/<slug>-execution-plan.md, .umadev/plan.json, .umadev/changes/<id>/tasks.md
frontendSource code + output/<slug>-frontend-notes.md
preview_confirmGate — running app in the browser before backend work begins
backendSource code + output/<slug>-backend-notes.md
qualityoutput/<slug>-quality-gate.json, output/<slug>-quality-gate.md, runtime-proof.json
deliveryoutput/<slug>-delivery-notes.md, release/proof-pack-*.zip, release/scorecard-*.html

Quality Gate

The quality gate is a pre-delivery review that runs independently of the model.

It checks:

  • PRD goal, scope, and acceptance criteria completeness.
  • Architecture APIs, data model, error handling, and authentication.
  • UI/UX tokens, typography, icon library, component states, and dark mode.
  • Frontend API calls cross-validated against the backend contract.
  • Emoji icons, hardcoded colors, and generic AI-template UI patterns.
  • Build, test, lint, and typecheck results.
  • Dockerfile, CI config, migrations, and .env.example.
  • Leaked API keys, passwords, and connection strings.
  • Audit logs and compliance mapping.

The runtime probe (umadev verify --runtime) boots the app and hits its routes, writing runtime-proof.json — evidence that the app actually starts and responds.

Outputs:

text
output/<slug>-quality-gate.json output/<slug>-quality-gate.md runtime-proof.json

Default threshold:

toml
[quality] threshold = 90 skip_checks = []

Governance

umadev started as a governance tool and that remains a core capability.

The spec layer has 25 clauses. The implementation includes ~112 governance checks across UI quality, security, frontend architecture, backend engineering, and language-specific hazards. Every check is configurable in .umadev/rules.toml — each rule can be disabled, path-excluded, or tuned. They exist to backstop the base's output, not to make the final engineering call for you.

Governance entry points:

mermaid
flowchart LR A["Claude Code hook<br/>pre-write"] --> G["umadev-governance"] B["umadev ci<br/>pre-commit / CI"] --> G C["MCP server<br/>umadev mcp serve"] --> G D["Quality gate<br/>pre-delivery scan"] --> G G --> E["Decision: pass · block · advisory"] E --> F[".umadev/audit/<br/>tool-calls.jsonl"]

Every governance function is fail-open: an error path returns pass, never a block. A bug in the governor never stops your work.

Project policy:

toml
[disabled] clauses = [] [exclusions] paths = ["src/legacy/**", "**/*.test.ts"] [extra] blocked_domains = ["internal-bad-proxy.corp"]

The compliance mapping (umadev report) maps the evidence chain to SOC 2 / ISO 27001 / EU AI Act controls.


Knowledge Base

umadev ships with 418 curated markdown knowledge files bundled directly into the binary and auto-extracted to ~/.umadev/knowledge on first launch. They are not generic documentation — they are commercial-grade engineering standards formatted for injection into an AI coding CLI.

The corpus covers: product design, PRD methodology, system architecture, frontend engineering, backend engineering, database design, security, testing, CI/CD, operations, mobile, desktop, mini programs, HarmonyOS, cross-platform development, domain verticals, UI/UX, design systems, and engineering playbooks.

On every working turn, umadev retrieves the most relevant chunks for the current requirement and phase, prepends them into the base's system prompt, and also injects a dependency-ranked outline of your existing code (the repo map). The result is that every prompt the base sees arrives pre-loaded with the applicable standard — not just a generic instruction.

Retrieval flow:

mermaid
flowchart LR A["Requirement + phase"] --> B["Tokenizer<br/>CJK bigram + English"] B --> C["BM25 lexical<br/>(pure Rust)"] B --> D["Vector embeddings<br/>(local candle, f16, offline)"] A --> H["HyDE query expansion"] H --> C H --> D C --> E["RRF fusion"] D --> E E --> F["Top-k chunks"] F --> G["Injected into base system prompt<br/>alongside repo-map slice + recalled pitfalls"]

Two-channel hybrid retrieval, fully local. Lexical BM25 (pure Rust, CJK-aware) and dense vector search run as two channels fused with Reciprocal Rank Fusion, with a HyDE-style query expansion widening recall first. The vector channel runs a small bilingual model (multilingual-e5-small, f16) locally via candle — bundled with the install, no API key, no network, millisecond recall over your own project standards and business docs. It degrades to BM25-only if the model is ever absent (fail-open). No cloud embedding service is required or used.

It learns from each run. Mistakes the base hits during a build are recorded with a frequency signal to a local store; on a genuine recurrence umadev asks the base for a higher-level corrective strategy (a reflection). Both are recalled into later prompts, so the same pitfall isn't repeated — the longer you use it on a codebase, the less it stumbles on the same thing twice.

Add your own knowledge:

bash
umadev knowledge-manage add ./team-docs --name team-docs umadev knowledge-manage search "payment webhook idempotency"

Deliverables

After a full run:

text
your-project/ output/ app-clarify.md app-research.md app-prd.md app-architecture.md app-uiux.md app-execution-plan.md app-frontend-notes.md app-backend-notes.md app-quality-gate.json app-quality-gate.md app-compliance-mapping.json app-delivery-notes.md .umadev/ plan.json workflow-state.json contracts/ openapi.json openapi.yaml audit/ tool-calls.jsonl frontend-api-calls.jsonl verify.jsonl release/ proof-pack-app-20260620090000.zip proof-pack-app-20260620090000.manifest.txt scorecard-app-20260620090000.html runtime-proof.json

The proof pack and scorecard are what you hand to a teammate, client, or reviewer. Everything else is intermediate work.


Commands

umadev has two entry points that mirror each other:

  • TUI slash commands — type inside the umadev chat UI (recommended for daily use).
  • Terminal CLI subcommands — for scripts and CI, no TUI needed.

Typing / in the TUI opens a command palette — Tab to autocomplete, / to cycle. /help (or F1) lists every command and keybinding.

TUI slash commands

Pick the base and model

CommandWhat it does
/claude · /codex · /opencodeSwitch the base being driven (saved to ~/.umadev/config.toml)
/offlineSwitch to deterministic offline templates (demo / CI, no network)
/statusActive base, its current model, and reasoning effort (read from the base's own config)
/model <id>Override the model for this session only (default: umadev passes none, base uses its own)
/kind <type>Set task type (fullstack / frontend-only / backend-only / bugfix / refactor) to trim the plan

Drive the flow

CommandWhat it does
just typeRoutes to the right path; a build typed here gets the same systems as /run
/run <requirement>Start a full build explicitly
/goal <objective>Keep the base working until the objective is met (native on all three bases; UMADEV_NO_GOAL_MODE=1 opts out)
/quick <task>Force the light path for a trivial one-off change
/plan [skip|add|veto|up|down <id>]View or steer the live dependency plan
/continue (or c at a gate)Approve the current gate and advance
/revise <feedback>Stay at the gate, redo the current phase with feedback
/manual · /autoPer-gate confirmation vs. fully automatic (shift+Tab also toggles)
/redoRe-run the previous phase block
/abort · /stopAbort the current run (on-disk state kept, resumable later)

Preview and delivery

CommandWhat it does
/previewStart the frontend dev server and open the browser
/stop-previewStop the preview server
/deployPreview the deploy command (look, don't run)
/deploy confirmRun the deploy

Checkpoints and rewind (shadow git — never touches your .git)

CommandWhat it does
/checkpoint [label]Snapshot the workspace files
/rewind [id]List / roll back to a file checkpoint

Inspect artifacts and state

CommandWhat it does
/specPrint the full UMADEV_HOST_SPEC_V1 spec
/diff [name]Show an artifact (prd · architecture · uiux · …)
/verifyWorkspace conformance report and evidence chain
/doctorSelf-test (binary / manifest / probes)
/statusCurrent phase, gate, and run state
/historyFull conversation history
/usageToken and usage statistics
/knowledgeKnowledge-base hits for this run
/skill · /mcpInstalled Skills / MCP servers
/configEffective configuration

Design and project

CommandWhat it does
/design <direction>Lock the design-system direction (modern-minimal · editorial-clean · …)
/template <name>Pick a scaffold template
/name <name>Set the project slug
/initWrite the umadev.yaml manifest

General

CommandWhat it does
/help (or F1)Help overlay with all keybindings
/clearClear the chat
/exportExport the current session
/quit (or Esc)Exit (workflow state is saved, resumable)

Terminal CLI subcommands

Workspace lifecycle

CommandWhat it does
umadev initScaffold the workspace (umadev.yaml + design system / knowledge seeds)
umadevLaunch the chat TUI
umadev doctorSelf-test
umadev verifyWorkspace conformance and evidence chain; --runtime boots the app and hits its routes
umadev reportCompliance mapping (SOC 2 / ISO 27001 / EU AI Act)
umadev historyList rollback snapshots
umadev rollback latestRoll back to a snapshot
umadev updateUpgrade umadev via npm
umadev uninstallFull uninstall: removes ~/.umadev, governance hooks, and the binary (--base <id> for hook-only)

Non-interactive run (scripts / CI)

CommandWhat it does
umadev run "<requirement>" --backend <id>Run a pipeline, pausing at the docs_confirm gate
umadev continue [--backend <id>]Approve the current gate
umadev revise "<feedback>"Stay at the gate, record a revision, rerun the block
umadev spec [--clauses]Print the spec (--clauses for the clause table)

Governance / CI

CommandWhat it does
umadev ci [--changed-only] [--report-only]Run governance over every source file (CI mode)
umadev install --base <claude-code|pre-commit|…>Install the pre-write governance hook

Platform extensions

CommandWhat it does
umadev mcp serveRun as an MCP server — exposes govern_file / govern_command to Claude Desktop, Cursor, Continue, and others
umadev mcp-manage <install|list|remove>Manage the base CLI's MCP servers
umadev skill <install|list|remove>Manage Skills (knowledge + rules + prompt packs)
umadev knowledge-manage <add|list|search|remove>Manage custom knowledge-base docs

Help

CommandWhat it does
umadev examplesCommand cheat-sheet
umadev guide60-second walkthrough

Common environment variables

VariableWhat it doesDefault
UMADEV_CLAUDE_BIN / UMADEV_CODEX_BINPath to the claude / codex binaryclaude / codex
UMADEV_WORKER_TIMEOUTPer-call worker timeout in seconds300
UMADEV_VERIFY_TIMEOUT_SECSVerify-loop per-call timeout in seconds120
UMADEV_MODEL_PLAN / UMADEV_MODEL_BUILDPer-phase model tiers (same as /model plan|build)
UMADEV_NO_GOAL_MODEDisable /goal mode if set to 1
OPENAI_EMBED_KEYEnable remote vector embeddings (else bundled local model + BM25)
XDG_CONFIG_HOMEBase directory for config.toml$HOME

Configuration

User config:

toml
# ~/.umadev/config.toml backend = "claude-code" lang = "en" # model is empty by default — the base uses its own configured model. # Set it (or use /model <id>) only to override a session. # model = "opus"

Project config:

toml
# .umadevrc [quality] threshold = 90 skip_checks = [] [pipeline] skip_phases = [] max_review_rounds = 3 auto_approve_gates = true [knowledge] enabled = true engine = "hybrid" top_k = 6

Trust mode controls autonomy: umadev run --mode plan|guarded|auto (default guarded). Irreversible actions — git merge, reset, deletes, deploys, network pushes — always confirm on every tier.


Rust Architecture

umadev is a ten-crate Rust workspace.

mermaid
flowchart TB Binary["umadev binary<br/>CLI · TUI · CI · hook · MCP · doctor"] Binary --> Agent["umadev-agent<br/>Router · Plan · Director · Critics · Trust · Lessons"] Binary --> TUI["umadev-tui<br/>ratatui · markdown · live diffs · build cards"] Binary --> I18N["umadev-i18n<br/>zh-CN · zh-TW · en"] Agent --> Spec["umadev-spec<br/>UMADEV_HOST_SPEC_V1 · 25 clauses"] Agent --> Governance["umadev-governance<br/>~112 rules · audit · compliance"] Agent --> Knowledge["umadev-knowledge<br/>418 files · BM25 · vector · repo-map"] Agent --> Contract["umadev-contract<br/>OpenAPI · path reconciliation"] Agent --> Runtime["umadev-runtime<br/>Runtime trait · OfflineRuntime"] Runtime --> Host["umadev-host<br/>claude-code · codex · opencode drivers"]
CrateRole
umadevBinary entry point. CLI verbs, TUI launch, CI runner, pre-write hook, MCP/Skill/Knowledge management, doctor.
umadev-specUMADEV_HOST_SPEC_V1 as Rust data — clauses, phases, gates, runtime kinds. Normative prose mirror: spec/.
umadev-governanceFail-open enforcement kernel. ~112 checks across UI quality, security, frontend, backend, and language hazards. Policy, audit (JSONL), compliance mapping (SOC 2 / ISO 27001 / EU AI Act).
umadev-agentThe director engine. Router (typed RoutePlan, base-model intent triage), plan state (owned Plan/PlanStep DAG → .umadev/plan.json), firmware builder (compose_firmware — identity, standards, knowledge chunks, pitfall recall, repo-map slice), director loop (step drive, verify, bounded self-correct, finalize), critics (nine role seats, parallel forked sessions, structured RoleVerdict), trust tiers, lessons store (frequency-signal pitfall recall, HyDE query expansion, BM25+vector RRF fusion).
umadev-runtimeRuntime trait + OfflineRuntime + RuntimeKind. The three host drivers implement this trait; umadev owns no model endpoint.
umadev-hostHostDriver trait for the three bases: claude, codex, opencode. Each implements umadev_runtime::Runtime. BACKEND_IDS is the authoritative ID list, locked by tests.
umadev-knowledgeStructured retrieval over the bundled knowledge corpus. Markdown-aware chunker, pure-Rust BM25 + CJK-bigram tokenizer, a local vector channel (multilingual-e5-small, f16, via candle — bundled, offline; a remote endpoint via OPENAI_EMBED_KEY is an optional override), HyDE query expansion, RRF fusion, mtime-cached BM25 index. Also repomap: per-language regex symbol scan, degree-centrality ranked, intent-personalized, token-budgeted, mtime-cached.
umadev-contractMachine-verifiable frontend↔backend API contract. Parses the architecture doc's API table into a typed ApiSpec, renders openapi.{json,yaml}, extracts frontend fetch/axios calls, cross-validates. Self-contained OpenAPI subset.
umadev-tuiratatui terminal app over the engine event stream. Markdown rendering, syntax-highlighted code, live word-level diff cards, folding tool rows, build-completion card with clickable preview URL.
umadev-i18nTrilingual (zh-CN / zh-TW / en) string catalogs and system-locale detection for all user-facing text.

Development

Requirements:

  • Rust 1.87+
  • Cargo
  • Node.js 18+ only if testing the npm distribution shim
bash
cargo build --workspace cargo test --workspace cargo clippy --workspace --all-targets -- -D warnings cargo fmt --all

Clippy runs at pedantic level workspace-wide; new code must pass with -D warnings.

Recommended reading order:

  1. spec/UMADEV_HOST_SPEC_V1.md — the normative spec
  2. crates/umadev-spec/src/lib.rs — clauses as Rust data
  3. crates/umadev-agent/src/router.rs — routing logic
  4. crates/umadev-governance/src/rules.rs — the governance rules
  5. crates/umadev/src/main.rs — binary entry point

Other authoritative references:


License

MIT.

Contributors

Showing top 1 contributor by commit count.

View all contributors on GitHub →

This article is auto-generated from umacloud/umadev via the GitHub API.Last fetched: 6/28/2026