Tasks
A Task is a single automation job. You provide a natural-language prompt describing the goal, a starting URL, and optionally a JSON schema for structured output. Skyvern navigates the browser and returns the result.prompt (what to do) and url (where to start). Beyond those, data_extraction_schema defines the shape of structured output, max_steps caps the number of AI decision cycles (which controls cost), and webhook_url lets you receive async notifications when the task completes. See Task Parameters for the full list.
Tasks are best for one-off automations, quick data extraction, and prototyping. When a single task isn’t enough, you have two options for multi-step work:
Browser Automation (Code)
Write multi-step automations in Python or TypeScript with full Playwright control. Version, test, and deploy automation code like any other software.
Workflows (Visual)
Build multi-step automations visually in the Cloud UI with drag-and-drop blocks. No code required. Share templates across your team.
Browser Automation
Browser Automation is the code-first way to build multi-step automations. The Skyvern SDK connects to a cloud Chromium instance over CDP, layers Playwright on top, and injects AI into every page interaction. There are three layers that nest inside each other: Browser, Page, and Agent.Browser
Every code-based automation starts by launching a browser. A Browser is a cloud Chromium instance with a Playwright context. Cookies, storage, and auth state persist across every page you open inside it.Page
A Page wraps a Playwright page with AI. Standard Playwright calls likegoto, click("#id"), and fill work as-is. AI actions work without selectors: Skyvern screenshots the page and decides what to interact with from your prompt.
Agent
An Agent runs complete multi-step AI tasks inside a page you already have open. It reuses the current page with its cookies, login state, and navigation history. You control when to hand off to the agent and when to take back control.Full method reference: Page, Agent, Browser. Developer guide: Multi-Step Automations.
Workflows
A Workflow is a reusable automation template built in the Cloud UI workflow editor. You drag and drop blocks onto a canvas, wire them together, and save. Workflows can be versioned, shared across your team, scheduled on a cron, and run repeatedly with different parameters. Each block can reference outputs from previous blocks using Jinja templating:{{search_query}} for parameters, {{extract_block.product_name}} for upstream block outputs.
Workflows are the right choice when you want no-code automation, team-shared templates, scheduled recurring jobs, or visual drag-and-drop logic.
Blocks
Blocks are the building units of workflows. Each block performs one operation. Navigation and interaction: Navigate (AI-guided navigation toward a goal), Action (click, type, select, upload), Go to URL (direct navigation), Login (authenticate with stored credentials), Wait (pause for a duration), and Human Interaction (pause for manual intervention). Data and files: Extract (pull structured data into JSON), File Download, File Upload, File Parser (PDFs, CSVs, Excel), and PDF Parser (specialized text extraction). Logic and control flow: Conditional (if/else branching), For Loop (repeat over a list), Validation (assert conditions, halt on failure), and Code (custom Python/Playwright scripts). Communication: HTTP Request (external API calls), Text Prompt (text-only LLM prompt, no browser), and Send Email. For detailed block configuration, see Block Types and Configuration.Runs
Every time you execute a task or kick off a workflow, Skyvern creates a Run to track progress and store outputs. A run moves through a lifecycle:tsk_ prefix, workflow runs get wr_. The response includes the run_id, status, output (matching your extraction schema), recording_url, screenshot_urls, downloaded_files, failure_reason (if something went wrong), and step_count.
Schedules
A Schedule runs a workflow automatically on a recurring basis. You define a cron expression and timezone, and Skyvern triggers the workflow at each interval.cron_expression is a standard 5-field cron (minimum 5-minute interval), timezone is an IANA identifier like America/New_York, and parameters are passed to each scheduled run. You can pause a schedule by setting enabled to false.
Credentials
Credentials provide secure storage for authentication data. Skyvern encrypts credentials at rest and in transit, injects them directly into the browser, and never sends them to the LLM. Supported credential types are usernames and passwords, TOTP codes (authenticator apps), and credit cards. You can store them in Skyvern’s native encrypted storage, or sync from Bitwarden, 1Password, Azure Key Vault, or a custom HTTP vault. See Credentials for setup instructions.Browser Sessions and Profiles
Skyvern offers two ways to manage browser state across runs. Browser Sessions are live browser instances that maintain state across multiple operations. Cookies, storage, and page context persist for the duration of the session (up to 24 hours). They’re useful for chaining operations in real-time or allowing human intervention between steps.Artifacts
Every run generates artifacts for observability, debugging, and audit trails: end-to-end video recordings, screenshots captured after each action, downloaded files, JSON-structured logs at step/task/workflow levels, and HAR files for network debugging.Engines
Skyvern supports multiple AI engines.skyvern-2.0 is the latest and default for the Cloud UI (the SDK defaults to skyvern-1.0). Other options include openai-cua (OpenAI Computer Use Agent), anthropic-cua (Anthropic Computer Use Agent), and ui-tars.
Quick Reference
| I want to… | Use |
|---|---|
| Run a one-off automation | Task |
| Build multi-step automation in code | Browser Automation |
| Build multi-step automation visually | Workflow |
| Trigger a UI-built workflow from code | Run from Code |
| Keep a browser open between operations | Browser Session |
| Skip login on repeated runs | Browser Profile |
| Store secrets securely | Credentials |
| Debug a failed run | Artifacts |
Choose your path
Use the dashboard
Run tasks, build workflows visually, and monitor runs. No code required.
Build with the API
Integrate Skyvern into your product with Python, TypeScript, or REST.
AI Agents Quickstart
Give Claude Code, Cursor, or Windsurf browser automation via MCP.
Self-host Skyvern
Deploy on your own infrastructure with your own LLM keys.

