Web Navigator：自动化网页研究与浏览 - Openclaw Skills-脚本在线

AI智能体脚本智能办公脚本自动化游戏脚本浏览器自动化脚本服务器脚本

Web Navigator：自动化网页研究与浏览 - Openclaw Skills

作者：互联网

2026-03-30

AI教程

什么是 Web Navigator？

Web Navigator 是一款专为 AI 智能体设计的强大扩展，旨在提供对实时网页数据和交互式浏览器会话的无缝访问。作为 Openclaw Skills 库的一部分，它允许开发人员构建能够超越其训练数据、获取当前信息、与 Web 应用程序交互并从复杂的 DOM 结构中提取结构化数据的智能体。

该技能对于需要高保真网页交互的工作流特别有效，例如导航分页结果、处理重 JavaScript 界面或执行多步搜索序列。通过集成 Web Navigator，用户可以将他们的代码智能体转变为全面的研究和自动化中心。

下载入口:https://github.com/openclaw/skills/tree/main/skills/mematron/web-navigator

安装与下载

1. ClawHub CLI

从源直接安装技能的最快方式。

npx clawhub@latest install web-navigator

2. 手动安装

将技能文件夹复制到以下位置之一

全局模式 ~/.openclaw/skills/ 工作区 /skills/

优先级：工作区 > 本地 > 内置

3. 提示词安装

将此提示词复制到 OpenClaw 即可自动安装。

请帮我使用 Clawhub 安装 web-navigator。如果尚未安装 Clawhub，请先安装（npm i -g clawhub）。

Web Navigator 应用场景

执行实时市场调研和竞争对手价格监控。
自动从动态 Web 门户提取技术文档。
在开发期间验证实时 API 端点或 Web 服务的可用性。
针对时事进行深度研究以生成内容。

Web Navigator 工作原理

AI 智能体识别出对外部数据的需求并触发 Web Navigator 技能。
初始化浏览器实例以导航到特定 URL 或执行搜索引擎查询。
该技能分析页面内容，识别相关的 HTML 元素和导航路径。
执行滚动、点击或表单提交等操作以获取所需信息。
抓取并清理结构化内容，将其返回到智能体上下文进行分析或存储。

Web Navigator 配置指南

要将此功能集成到您的环境中，请使用以下命令：

# 通过 CLI 安装技能
openclaw add web-navigator

# 确保存在浏览器自动化的依赖项
npm install playwright
npx playwright install chromium

Web Navigator 数据架构与分类体系

Web Navigator 技能使用以下模式组织交互数据和结果：

属性	描述
`session_id`	当前浏览实例的唯一标识符。
`target_url`	用于数据提取的主要访问地址。
`extraction_map`	将 CSS 选择器映射到所需数据字段的 JSON 对象。
`navigation_history`	会话期间访问的所有内部链接的日志。
`result_payload`	从网页提取的最终结构化数据。

name: web-navigator
description: [TODO: Complete and informative explanation of what the skill does and when to use it. Include WHEN to use this skill - specific scenarios, file types, or tasks that trigger it.]

Web Navigator

Overview

[TODO: 1-2 sentences explaining what this skill enables]

Structuring This Skill

[TODO: Choose the structure that best fits this skill's purpose. Common patterns:

1. Workflow-Based (best for sequential processes)

Works well when there are clear step-by-step procedures
Example: DOCX skill with "Workflow Decision Tree" -> "Reading" -> "Creating" -> "Editing"
Structure: ## Overview -> ## Workflow Decision Tree -> ## Step 1 -> ## Step 2...

2. Task-Based (best for tool collections)

Works well when the skill offers different operations/capabilities
Example: PDF skill with "Quick Start" -> "Merge PDFs" -> "Split PDFs" -> "Extract Text"
Structure: ## Overview -> ## Quick Start -> ## Task Category 1 -> ## Task Category 2...

3. Reference/Guidelines (best for standards or specifications)

Works well for brand guidelines, coding standards, or requirements
Example: Brand styling with "Brand Guidelines" -> "Colors" -> "Typography" -> "Features"
Structure: ## Overview -> ## Guidelines -> ## Specifications -> ## Usage...

4. Capabilities-Based (best for integrated systems)

Works well when the skill provides multiple interrelated features
Example: Product Management with "Core Capabilities" -> numbered capability list
Structure: ## Overview -> ## Core Capabilities -> ### 1. Feature -> ### 2. Feature...

Patterns can be mixed and matched as needed. Most skills combine patterns (e.g., start with task-based, add workflow for complex operations).

Delete this entire "Structuring This Skill" section when done - it's just guidance.]

[TODO: Replace with the first main section based on chosen structure]

[TODO: Add content here. See examples in existing skills:

Code samples for technical skills
Decision trees for complex workflows
Concrete examples with realistic user requests
References to scripts/templates/references as needed]

Resources (optional)

Create only the resource directories this skill actually needs. Delete this section if no resources are required.

scripts/

Executable code (Python/Bash/etc.) that can be run directly to perform specific operations.

Examples from other skills:

PDF skill: fill_fillable_fields.py, extract_form_field_info.py - utilities for PDF manipulation
DOCX skill: document.py, utilities.py - Python modules for document processing

Appropriate for: Python scripts, shell scripts, or any executable code that performs automation, data processing, or specific operations.

Note: Scripts may be executed without loading into context, but can still be read by Codex for patching or environment adjustments.

references/

Documentation and reference material intended to be loaded into context to inform Codex's process and thinking.

Examples from other skills:

Product management: communication.md, context_building.md - detailed workflow guides
BigQuery: API reference documentation and query examples
Finance: Schema documentation, company policies

Appropriate for: In-depth documentation, API references, database schemas, comprehensive guides, or any detailed information that Codex should reference while working.

assets/

Files not intended to be loaded into context, but rather used within the output Codex produces.

Examples from other skills:

Brand styling: PowerPoint template files (.pptx), logo files
Frontend builder: HTML/React boilerplate project directories
Typography: Font files (.ttf, .woff2)

Appropriate for: Templates, boilerplate code, document templates, images, icons, fonts, or any files meant to be copied or used in the final output.

Not every skill requires all three types of resources.

上一篇：DeepSeek OCR：AI驱动的图像文本识别 - Openclaw Skills 下一篇：IntercomSwap：P2P BTC 与 USDT 跨链兑换 - Openclaw Skills