What you get
Verify claims
Test whether a behavior claim is true and produce evidence either way. No staging, no advocacy — just investigation.
Run QA flows
Drive terminal CLIs, web apps, or Electron apps through end-to-end flows. Report pass/fail with annotated screenshots.
Record demos
Generate polished before/after comparison videos of PRs, complete with title cards, keystroke overlays, and window chrome.
Get started
- UI
- CLI
Run
/plugins in a Droid session, go to the Browse tab, find droid-control, and install it.You also need the runtime tools for your use case (tuistory, agent-browser, ffmpeg, etc.). See Prerequisites for per-use-case install commands.
Commands
Droid Control adds three slash commands. Each handles the full workflow end-to-end: planning, execution, recording, and reporting.- /verify
- /qa-test
- /demo
Test a specific behavior claim and report findings with evidence.Droid launches the app, attempts the claim, and reports what actually happened — with screenshots and text snapshots as evidence.
How /demo works
Understands the change
Fetches the PR description, diff, and linked ticket. Identifies what needs to be proven and what could be confused with existing behavior.
Plans the interaction
Scripts a sequence of actions that produces visible evidence the feature works. For comparison PRs, both branches run identical interactions so only the behavior differs.
Captures both branches
Launches recorded sessions on the baseline and candidate branches in parallel using worker subagents.
Composes the video
Renders a polished video via Remotion with title cards, window chrome, keystroke overlays, and effects. Six visual presets range from cinematic to utilitarian.
Example output
Every video below was planned, recorded, and rendered entirely by a Droid.- CLI: single-branch
- CLI: before/after
Automation drivers
Droid Control supports three automation backends. The right one is selected automatically based on what you’re targeting.tuistory
Virtual PTY automation. Default for terminal work. Playwright-style CLI with asciinema recording and forced truecolor output.
true-input
Real terminal emulator. Headless Wayland compositor (Linux), KVM/QEMU (Windows), or QEMU monitor (macOS). For when you need real rendering evidence.
agent-browser
Web and Electron apps. Playwright-backed CLI with Chrome DevTools Protocol support. Navigates pages, fills forms, clicks buttons, captures screenshots.
Video rendering
Demo and showcase videos are rendered with Remotion, a React-based video engine. The plugin includes 22 visual components and 6 presets.Visual presets
Visual presets
| Preset | Look | Best for |
|---|---|---|
factory | Warm black, traffic lights, amber glow | Official Factory content |
factory-hero | Same + gradient background | Landing pages, social |
hero | Cool gradient, generous margins | Non-Factory marketing |
macos | Dark, clean frame | General-purpose demos |
presentation | Black, generous margins | Slide decks, talks |
minimal | No window bar, tight margins | Docs embeds, inline clips |
Automatic layers (always present)
Automatic layers (always present)
- Warm radial backgrounds, floating particles, film grain overlay, color grading
- Motion blur title-to-content transition
- Animated window chrome with traffic lights and glassmorphic borders
- Auto-scaled title/subtitle text
Effect layers (selected at compose time)
Effect layers (selected at compose time)
- Spotlight overlays to highlight specific regions
- Directed zoom for small text or details
- Keystroke pills showing user actions
- Section headers and transition sweeps
Architecture
The plugin uses a composition architecture with three layers:- Orchestrator — Routes each request through three independent lookups (target, stage, artifact) to determine which skills to load.
- 10 atom skills — Self-contained background knowledge loaded on demand, split into drivers, targets, stages, and polish.
- 3 commands — Parse arguments into commitments, then delegate to atoms via hybrid handoffs.
Architecture deep dive
Design rationale: UX for droids, waterfall routing, task delegation, and hybrid handoffs.
Prerequisites
Only install what you need for your use case.Terminal demos (tuistory)
Terminal demos (tuistory)
Web/Electron automation (agent-browser)
Web/Electron automation (agent-browser)
Real terminal emulator (true-input)
Real terminal emulator (true-input)
| Platform | Required tools |
|---|---|
| Linux/Wayland | cage, wtype, a Wayland-compatible terminal |
| Windows (KVM) | libvirt, qemu, KVM VM with SSH |
| macOS (QEMU) | qemu, socat, macOS VM with SSH |
Video composition (showcase)
Video composition (showcase)
Requires Node.js >= 18, Chrome/Chromium,
ffmpeg, ffprobe, and agg.See also
Source code
Full plugin source: skills, commands, scripts, and Remotion components.
Plugins
Learn how plugins work, how to install them, and how to build your own.
Automated QA skill
Deeper QA automation with CI integration, failure learning, and structured reports.
README
Quick start, command reference, and prerequisites.
