Capability · Framework — agents

LaVague

LaVague pioneered the large-action-model pattern: a world model plans the next step, a retriever finds the most relevant DOM chunk, and an action engine generates Selenium or Playwright code that actually runs in the browser. It's popular in agent research and for building bespoke vertical browser agents.

Framework facts

Category: agents
Language: Python
License: Apache-2.0
Repository: https://github.com/lavague-ai/LaVague

Install

pip install lavague

Quickstart

from lavague.core import ActionEngine, WorldModel
from lavague.core.agents import WebAgent
from lavague.drivers.selenium import SeleniumDriver

driver = SeleniumDriver(headless=False)
agent = WebAgent(WorldModel(), ActionEngine(driver))
agent.get('https://huggingface.co/docs')
agent.run('Find the Transformers quickstart page.')

Alternatives

browser-use — lighter Playwright-first library
Skyvern — self-hosted platform with vision
Stagehand — TypeScript DOM-first agent
AgentQL — query-language over Playwright

Frequently asked questions

What makes LaVague different?

It explicitly separates planning (world model) from grounding (action engine) and emits inspectable Selenium/Playwright code rather than a black-box trace. That makes it easier to audit and replay.

Does LaVague need vision models?

No. The default flow is text-only, using DOM retrieval. Vision is optional and can be layered in for harder sites.

Sources

LaVague — GitHub — accessed 2026-04-20
LaVague — docs — accessed 2026-04-20