alibaba/page-agent
An in-page JavaScript GUI agent that enables natural language control over web interfaces, streamlining complex workflows and enhancing web accessibility.
Core Features
Quick Start
npm install page-agent
Detailed Introduction
Page Agent is a lightweight, in-page JavaScript library that adds natural language control to web applications. It runs as a GUI agent directly within the browser, so users or other AI systems can operate a web interface through plain text commands. Because it works on a text representation of the DOM, it avoids the complexity and extra permissions often associated with screenshot-based or multi-modal approaches. This makes it suited to building AI copilots, automating repetitive form filling in enterprise systems, and improving web accessibility, while letting developers bring their preferred LLMs.
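To make the text-based approach concrete, here is a minimal sketch of the general idea, not Page Agent's actual API or implementation. It assumes a simplified element model and a made-up command format (`click N`, `fill N "text"`); every name in it (`PageElement`, `describePage`, `parseAction`, `applyAction`) is hypothetical and chosen for illustration only. The page is rendered as an indexed text list that could be sent to an LLM, and the model's text reply is parsed and applied back to the elements.

```typescript
// Hypothetical, simplified stand-in for an interactive DOM element.
// These names are illustrative and do not come from page-agent.
interface PageElement {
  tag: "input" | "button";
  label: string;
  value?: string;
  clicked?: boolean;
}

// Assumed command vocabulary for this sketch.
type Action =
  | { kind: "click"; index: number }
  | { kind: "fill"; index: number; text: string };

// Render the elements as an indexed text list that an LLM could read.
function describePage(elements: PageElement[]): string {
  return elements
    .map((el, i) => `[${i}] <${el.tag}> ${el.label}`)
    .join("\n");
}

// Parse a model reply such as `click 1` or `fill 0 "Ada"`.
// Returns null when the reply does not match the assumed format.
function parseAction(reply: string): Action | null {
  const text = reply.trim();
  const click = /^click (\d+)$/.exec(text);
  if (click) {
    return { kind: "click", index: Number(click[1]) };
  }
  const fill = /^fill (\d+) "(.*)"$/.exec(text);
  if (fill) {
    return { kind: "fill", index: Number(fill[1]), text: fill[2] ?? "" };
  }
  return null;
}

// Apply an action to the element list. Returns false when the index is
// out of range or the action does not suit the element type.
function applyAction(elements: PageElement[], action: Action): boolean {
  const target = elements[action.index];
  if (!target) return false;
  if (action.kind === "click") {
    target.clicked = true;
    return true;
  }
  if (target.tag !== "input") return false;
  target.value = action.text;
  return true;
}

// Usage: describe a small form, then apply two replies a model might give.
const demoForm: PageElement[] = [
  { tag: "input", label: "Name" },
  { tag: "button", label: "Submit" },
];
const prompt = describePage(demoForm);
for (const reply of ['fill 0 "Ada"', "click 1"]) {
  const action = parseAction(reply);
  if (action) applyAction(demoForm, action);
}
```

In a real page, the element list would be derived from the live DOM and the reply would come from whichever LLM the developer configures. The sketch only shows why a text-only loop needs no screenshots: both the page description and the resulting action are plain strings.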