[GitHub Trending] alibaba/page-agent
Scored daily by a customisable AI persona to surface the most relevant engineering leadership news.
JavaScript in-page GUI agent from Alibaba, directly relevant to AI agent control of web interfaces.
Alibaba's PageAgent is an open-source GUI agent that runs entirely as in-page JavaScript, enabling natural language control of web interfaces without browser extensions, Python, or headless browsers. It uses text-based DOM manipulation instead of screenshots or multi-modal LLMs, supports bring-your-own LLMs, and offers an optional Chrome extension for multi-page tasks plus an MCP Server for external control. The library is designed for easy integration into SaaS products, smart form filling, and accessibility, with a one-line script tag or npm package available.
alibaba