page-agent
JavaScript in-page GUI agent. Control web interfaces with natural language.
Discover how the IIFE demo build enables quick Page Agent integration without npm. Auto-initialize Page Agent instantly in your browser for seamless testing.
How to Migrate from browser-use to PageAgent for Client-Side AutomationEasily migrate from browser use to PageAgent for client-side automation. Update imports, instantiate PageAgent with your config, and leverage the familiar API.
How the extract_content Tool Handles Dynamic Content and Infinite Scroll PagesLearn how the extract_content tool efficiently handles dynamic content and infinite scroll pages by rebuilding a flat DOM snapshot on agent interaction commands for precise data capture.
How to Debug Agent Execution Using Activity Events and Logging in Page AgentDebug Page Agent execution by using activity events and console logs. Trace state transitions from observation to tool execution for efficient debugging.
How the switch_tab Tool Manages Cross-Tab Navigation and ContextDiscover how the switch_tab tool manages cross-tab navigation and context by delegating to TabsController. Learn how it validates tab IDs, persists state, and routes operations effectively.
How to Configure `enableMask` to Control SimulatorMask Visibility in Page AgentLearn how to configure enableMask to control SimulatorMask visibility in alibaba/page-agent. Set enableMask to true or use showMask hideMask functions for dynamic control.
How the i18n System Supports Multiple Languages in the Page Agent Panel UIDiscover how the alibaba/page-agent i18n system enables instant language switching in the Panel UI with a type-safe layer for nested translations and parameter interpolation.
How to Implement a `drag_element` Tool for Complex UI Interactions in Page AgentLearn to implement the drag_element tool in alibaba/page-agent for complex UI interactions. Synthesize mouse events and expose as a tool following Zod schema patterns. Enhance your page automation capabilities.
How the Page-Agent Monorepo Structure Organizes Inter-Package DependenciesDiscover how the Page-Agent monorepo uses npm workspaces to organize inter-package dependencies ensuring topological order and seamless local imports.
How to Create a Headless PageAgentCore Instance Without the UI PanelLearn how to create a headless PageAgentCore instance bypassing UI dependencies. Import `@page-agent/core`, instantiate PageController with enableMask false, and pass to PageAgentCore.
How PageAgent Manages Memory and History Events Across Execution StepsDiscover how PageAgent manages memory and history events across execution steps. Learn about its persistent history array and real time synchronization for LLM context and UI.
How to Implement the ask_user Tool with Custom Callbacks for Human-in-the-Loop WorkflowsLearn how to implement the ask_user tool with custom callbacks for human-in-the-loop workflows. PageAgentCore pauses execution for human input using async callbacks. Customize UI implementations easily.
Have a question about this repo?
These articles cover the highlights, but your codebase questions are specific. Give your agent direct access to the source. Share this with your agent to get started:
curl -s "https://instagit.com/install.md" Maintain an open-source project? Get it listed too →