Agentic AI Atlasby a5c.ai
OverviewWikiGraphFor AgentsEdgesSearchWorkspace
/
GitHubDocsDiscord
i.6Wiki
Agentic AI Atlas · vercel-labs/agent-browser
docs/reference-repos/clawhub/vercel-labs/agent-browser/researcha5c.ai
Search the atlas/
Wiki · linked records

Article and nearby pages

I.Current articlepp. 1 - 1
I.
Wiki article

docs/reference-repos/clawhub/vercel-labs/agent-browser/research

Reading · 4 min

vercel-labs/agent-browser reference

Native Rust browser automation CLI by Vercel Labs, designed specifically for AI agents. Uses Chrome/Chromium via CDP directly (no Playwright/Puppeteer dependency). Provides accessibility-tree snapshots with element refs for reliable interaction. Supports sessions, auth vault, state persistence, and video recording.

Page nodewiki/docs/reference-repos/clawhub/vercel-labs/agent-browser/research.mdNearby pages · 0Documents · 0

vercel-labs/agent-browser

  • **Archetype**: utility-with-skill
  • **Stars**: 28,819
  • **Last pushed**: 2026-04-12
  • **License**: Apache-2.0
  • **Discovered**: 2026-04-12
  • **Source**: ClawHub skills (published as "steipete/agent-browser")
  • **Skills found**: 6 skills (agent-browser, dogfood, electron, slack, vercel-sandbox, agentcore)
  • **Fork**: No

Summary

Native Rust browser automation CLI by Vercel Labs, designed specifically for AI agents. Uses Chrome/Chromium via CDP directly (no Playwright/Puppeteer dependency). Provides accessibility-tree snapshots with element refs for reliable interaction. Supports sessions, auth vault, state persistence, and video recording.

The SKILL.md is a meta-skill that loads sub-skills dynamically via agent-browser skills get <name>. Sub-skills cover: core browser automation, exploratory testing/QA (dogfood), Electron desktop app automation (VS Code, Slack, Discord, Figma), Slack workspace automation, Vercel Sandbox microVMs, and AWS Bedrock AgentCore cloud browsers.

Published on ClawHub as "steipete/agent-browser" (~unnamed downloads). steipete (OpenClaw co-founder) appears to publish this as a ClawHub skill wrapping the Vercel Labs CLI.

Assessment

HIGH extractable value. Browser automation is foundational for many processes. The dogfooding/QA skill is directly useful as a process. The Electron app automation pattern (controlling VS Code, Slack, etc.) is novel and high-value. The accessibility-tree-based interaction model is more reliable than CSS selectors.

**Extraction priority**: HIGH

Extractable Value: vercel-labs/agent-browser

Processes

1. Exploratory QA / Dogfooding Process

  • **Source**: dogfood skill (exploratory testing, bug hunts, app quality review)
  • **Placement**: specializations/shared/exploratory-qa-dogfooding.js
  • **Description**: Process for systematic exploratory testing of web applications: navigate to target app -> take accessibility snapshot -> identify interactive elements -> exercise core user flows -> capture screenshots at each step -> report visual/functional issues -> verify responsive behavior. Uses breakpoints for tester to confirm found issues.

2. Web Data Extraction Pipeline

  • **Source**: Core agent-browser skill (navigate, snapshot, extract)
  • **Placement**: specializations/shared/web-data-extraction.js
  • **Description**: Process for structured data extraction from websites: navigate to URL -> take accessibility snapshot -> identify data elements via refs -> extract structured data -> paginate/follow links -> aggregate results. Handles authentication via auth vault.

3. Cross-Browser Form Testing

  • **Source**: Browser automation (fill_form, click, snapshot, evaluate)
  • **Placement**: specializations/shared/form-testing-automation.js
  • **Description**: Process for automated form testing: identify all form fields -> generate test cases (valid, invalid, edge cases) -> fill and submit -> capture responses -> verify validation messages -> report results. Breakpoint before destructive form submissions.

Plugin Ideas

1. Browser Automation Plugin

  • **Category**: Tools Integration
  • **install.md**: Installs agent-browser CLI (npm global or brew), runs agent-browser install for Chrome download, configures auth vault for common sites. Provides babysitter tasks for browser operations (navigate, snapshot, click, fill, screenshot). Enables web-based verification steps in any process.

2. Visual Regression Testing Plugin

  • **Category**: QA & Testing
  • **install.md**: Combines agent-browser screenshots with image comparison. Captures baseline screenshots of web app pages, then on subsequent runs compares against baselines to detect visual regressions. Stores baselines in project's .a5c/visual-baselines/. Reports diffs with annotated images.

3. Electron App Automation Plugin

  • **Category**: Tools Integration
  • **install.md**: Configures agent-browser's Electron skill for desktop app automation. Supports controlling VS Code, Slack, Discord, Figma, Notion, Spotify via accessibility tree. Enables processes that span IDE + browser + messaging workflows.

Library Mapping

Extractable ProcessLibrary StatusActionExisting PathTarget Placement
Exploratory QA / Dogfooding ProcessNEWSystematic exploratory testing with accessibility snapshots and core user flow exercise-specializations/shared/exploratory-qa-dogfooding.js
Web Data Extraction PipelineNEWStructured data extraction from websites with accessibility tree navigation-specializations/shared/web-data-extraction.js
Cross-Browser Form TestingNEWAutomated form testing with test case generation and validation verification-specializations/shared/cross-browser-form-testing.js
Accessibility-Tree NavigationNEWWeb interaction via accessibility snapshots and element refs instead of CSS selectors-specializations/shared/accessibility-tree-navigation.js

Plugin Marketplace Mapping

Plugin IdeaMarketplace StatusActionExisting PluginTarget Placement
Browser Automation IntegrationUPGRADENative browser automation beyond existing dev-browser pluginplugins/a5c/marketplace/plugins/dev-browser/plugins/a5c/marketplace/plugins/browser-automation-integration/
Visual Regression TestingNEWScreenshot-based visual regression testing with baseline comparison-plugins/a5c/marketplace/plugins/visual-regression-testing/
Electron App AutomationNEWDesktop app automation for VS Code, Slack, Discord, Figma via accessibility tree-plugins/a5c/marketplace/plugins/electron-app-automation/

Implicit Procedural Knowledge

  • **Accessibility-tree-based interaction**: Using accessibility snapshots with element refs instead of CSS selectors is more robust for agent interaction. This pattern should inform how babysitter tasks interact with web UIs.
  • **Dynamic skill loading pattern**: The meta-SKILL.md that loads sub-skills via CLI (agent-browser skills get <name>) is a pattern for versioned, on-demand skill delivery. Skills ship with the CLI rather than as static markdown.
  • **Auth vault pattern**: Persistent authentication state management for web sessions, avoiding re-login on each automation run. Relevant for any plugin that needs to maintain authenticated sessions.

Trail

Wiki
Babysitter Docs
Reference Repos

Clawhub

Vercel Labs

Agent Browser

vercel-labs/agent-browser

Page record

Open node ledger

wiki/docs/reference-repos/clawhub/vercel-labs/agent-browser/research.md

Documents

No documented graph nodes on this page.