browser-use

Browser-controlling agent. Fills forms, navigates pages, scrapes data autonomously.

Upstream: browser-use/web-ui

Resources

vCPUs: 2
Memory: 4096 MB
Volume: 10 GB
Workspace UI: Web (iframe)

Create one

gentity compute create --agent browser-use --model claude-sonnet-4-5

First run

The workspace URL is the browser-use web UI on port 7788. Inside the container is a full Chromium + Playwright stack plus a supervisord process tree. Click in the UI to give the agent a goal; it drives a headless browser to complete it.

The image also exposes VNC and a Chrome debugger internally, but Gentity routes only the web UI externally; the others stay accessible only over Fly's private network.

Models

Provider    Name               Label
anthropic   claude-sonnet-4-5  Claude Sonnet 4.5 (recommended)
openai      gpt-4o             GPT-4o
anthropic   claude-haiku-4-5   Claude Haiku 4.5 (fast)
google      gemini-3-pro       Gemini 3 Pro
deepseek    deepseek-chat      DeepSeek Chat
openrouter  openrouter/auto    OpenRouter (100+ models)
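
To launch with a model other than the recommended one, the create command shown earlier can presumably take any value from the Name column. The sketch below assumes exactly that (only claude-sonnet-4-5 is shown with --model in this page), and the helper name build_create_cmd is hypothetical. It checks the name against the table and prints the command rather than running it, so a typo is caught before anything is created.

```shell
# Model names copied from the Models table above.
# Assumption: --model accepts the value in the Name column.
MODELS="claude-sonnet-4-5 gpt-4o claude-haiku-4-5 gemini-3-pro deepseek-chat openrouter/auto"

# Hypothetical helper: print the create command for a model,
# or fail with an error if the model is not in the table.
build_create_cmd() {
  local model="${1:-claude-sonnet-4-5}"
  local m
  for m in $MODELS; do
    if [ "$m" = "$model" ]; then
      echo "gentity compute create --agent browser-use --model $model"
      return 0
    fi
  done
  echo "unknown model: $model" >&2
  return 1
}

# Print (not run) the command for the fast Haiku model.
build_create_cmd claude-haiku-4-5
```

Once the printed command looks right, paste it into a shell to run it.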

Notes / known limits

This is the heaviest agent in the catalog (4 GB RAM, 2 vCPUs, 10 GB volume). First boot pulls Chromium, so spin-up takes ~20 s rather than the usual 8 s.
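
Because first boot is slower, a script that opens the workspace right after creation may want to wait for the web UI to answer. This is a minimal sketch, not a Gentity feature: wait_for_ui, WORKSPACE_URL, and PROBE are hypothetical names, the URL is a placeholder, and it assumes the web UI returns a successful HTTP status once it is up. The curl flags used (-f, -s, -S, -o) are standard.

```shell
# Hypothetical placeholder: substitute the workspace URL Gentity reports.
WORKSPACE_URL="${WORKSPACE_URL:-https://example.invalid}"
# Probe command; overridable so the loop can be exercised without network access.
PROBE="${PROBE:-curl -fsS -o /dev/null}"

# Poll the URL once per second until the probe succeeds or the timeout elapses.
wait_for_ui() {
  local url="$1"
  local timeout="${2:-60}"
  local waited=0
  while [ "$waited" -lt "$timeout" ]; do
    if $PROBE "$url"; then
      echo "ready after ${waited}s"
      return 0
    fi
    sleep 1
    waited=$((waited + 1))
  done
  echo "not ready after ${timeout}s" >&2
  return 1
}

# Example call (commented out so that sourcing this file has no side effects):
# wait_for_ui "$WORKSPACE_URL" 60
```

A 60-second timeout leaves headroom over the ~20 s first boot; later boots should return much sooner.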