page-agent

JavaScript in-page GUI agent. Control web interfaces with natural language.

24 articles 2.2k View on GitHub ↗
24 articles
How the IIFE Demo Build Enables Quick Page Agent Integration Without npm

Discover how the IIFE demo build enables quick Page Agent integration without npm. Auto-initialize Page Agent instantly in your browser for seamless testing.

how-to-guide
Mar 9, 2026
How to Migrate from browser-use to PageAgent for Client-Side Automation

Easily migrate from browser use to PageAgent for client-side automation. Update imports, instantiate PageAgent with your config, and leverage the familiar API.

migration-guide
Mar 9, 2026
How the extract_content Tool Handles Dynamic Content and Infinite Scroll Pages

Learn how the extract_content tool efficiently handles dynamic content and infinite scroll pages by rebuilding a flat DOM snapshot on agent interaction commands for precise data capture.

how-to-guide
Mar 9, 2026
How to Debug Agent Execution Using Activity Events and Logging in Page Agent

Debug Page Agent execution by using activity events and console logs. Trace state transitions from observation to tool execution for efficient debugging.

how-to-guide
Mar 9, 2026
How the switch_tab Tool Manages Cross-Tab Navigation and Context

Discover how the switch_tab tool manages cross-tab navigation and context by delegating to TabsController. Learn how it validates tab IDs, persists state, and routes operations effectively.

internals
Mar 9, 2026
How to Configure `enableMask` to Control SimulatorMask Visibility in Page Agent

Learn how to configure enableMask to control SimulatorMask visibility in alibaba/page-agent. Set enableMask to true or use showMask hideMask functions for dynamic control.

how-to-guide
Mar 9, 2026
How the i18n System Supports Multiple Languages in the Page Agent Panel UI

Discover how the alibaba/page-agent i18n system enables instant language switching in the Panel UI with a type-safe layer for nested translations and parameter interpolation.

internals
Mar 9, 2026
How to Implement a `drag_element` Tool for Complex UI Interactions in Page Agent

Learn to implement the drag_element tool in alibaba/page-agent for complex UI interactions. Synthesize mouse events and expose as a tool following Zod schema patterns. Enhance your page automation capabilities.

how-to-guide
Mar 9, 2026
How the Page-Agent Monorepo Structure Organizes Inter-Package Dependencies

Discover how the Page-Agent monorepo uses npm workspaces to organize inter-package dependencies ensuring topological order and seamless local imports.

internals
Mar 9, 2026
How to Create a Headless PageAgentCore Instance Without the UI Panel

Learn how to create a headless PageAgentCore instance bypassing UI dependencies. Import `@page-agent/core`, instantiate PageController with enableMask false, and pass to PageAgentCore.

how-to-guide
Mar 9, 2026
How PageAgent Manages Memory and History Events Across Execution Steps

Discover how PageAgent manages memory and history events across execution steps. Learn about its persistent history array and real time synchronization for LLM context and UI.

internals
Mar 9, 2026
How to Implement the ask_user Tool with Custom Callbacks for Human-in-the-Loop Workflows

Learn how to implement the ask_user tool with custom callbacks for human-in-the-loop workflows. PageAgentCore pauses execution for human input using async callbacks. Customize UI implementations easily.

how-to-guide
Mar 9, 2026

Have a question about this repo?

These articles cover the highlights, but your codebase questions are specific. Give your agent direct access to the source. Share this with your agent to get started:

Share the following with your agent to get started:
curl -s "https://instagit.com/install.md"

Works with
Claude Codex Cursor VS Code OpenClaw Any MCP Client

Maintain an open-source project? Get it listed too →