Hi HN! I'm the dev behind Agent Browser.
I was building AI agents that browse the web and kept hitting the same wall: sending screenshots or accessibility snapshots to the model burns tokens and is slow. So I tried rendering pages as ASCII wireframes instead.
Every interactive element gets a numbered label. The agent sees a compact text representation and says "click [12]" or "type [5] hello". A typical page costs 3x to 10x less tokens depending on the content.
You can use it with the Vercel AI SDK or as an MCP server. To add it to Cursor or Claude Desktop:
{
"mcpServers": {
"agent-browser": {
"command": "npx",
"args": ["-y", "@agent-browser-io/browser", "mcp"]
}
}
}Still experimental, would love your feedback. Happy to answer questions!