并行增强:批量数据 AI 搜索与研究 - Openclaw Skills
作者:互联网
2026-04-09
什么是 并行增强?
并行增强是一项专为需要扩展数据采集能力的开发人员和数据研究员设计的尖端工具。通过与 Openclaw Skills 集成,该工具允许用户将静态 CSV 或 JSON 列表转化为包含实时网页信息的增强数据集。无论您是需要查找初创公司的最新融资轮次、首席执行官的 LinkedIn 个人资料,还是特定的产品规格,此技能都能处理网页搜索和数据提取的高强度工作。
并行增强的核心价值在于其自然语言界面。用户只需描述其意图,而无需编写复杂的爬虫或手动搜索查询。该技能利用各种处理层级来平衡速度和深度,使其成为从快速联系人查询到 Openclaw Skills 生态系统内深度竞争情报分析的通用选择。
下载入口:https://github.com/openclaw/skills/tree/main/skills/normallygaussian/parallel-enrichment
安装与下载
1. ClawHub CLI
从源直接安装技能的最快方式。
npx clawhub@latest install parallel-enrichment
2. 手动安装
将技能文件夹复制到以下位置之一
全局模式~/.openclaw/skills/
工作区
/skills/
优先级:工作区 > 本地 > 内置
3. 提示词安装
将此提示词复制到 OpenClaw 即可自动安装。
请帮我使用 Clawhub 安装 parallel-enrichment。如果尚未安装 Clawhub,请先安装(npm i -g clawhub)。
并行增强 应用场景
- 通过将高管联系详情和社交资料追加到潜在客户列表,实现潜在客户增强自动化。
- 通过收集数千家公司的融资、总部和员工人数数据进行市场研究。
- 利用来自实时网页数据的技术规格和市场定价来增强产品数据库。
- 清理并更新 CRM 记录,确保公司名称和领导层信息是最新的。
- 通过提取行业竞争对手的市场地位和关键产品特征,扩展竞争分析规模。
- 提供源数据,可以是内联 JSON 数组或 CSV 文件的路径。
- 使用自然语言定义您的增强意图,准确指定您想要发现的新列或信息。
- 可选地定义源列和增强列的描述,以提高 AI 数据映射的准确性。
- 选择处理层级——从 lite-fast 到 ultra-fast——以匹配研究任务的复杂性。
- 执行命令开始增强过程,该过程会提供一个监控 URL 以实时跟踪进度。
- 获取输出 CSV,其中包含您的原始数据以及新增强的字段和每一行的成功状态。
并行增强 配置指南
要利用此技能,您必须首先从提供商处获取 API 密钥,并在本地计算机上安装 CLI。按照以下步骤开始:
# 安装 Parallel CLI 工具
curl -fsSL https://parallel.ai/install.sh | bash
# 在环境变量中设置您的 API 密钥
export PARALLEL_API_KEY=your_secret_api_key_here
安装完成后,您可以开始使用 Openclaw Skills 直接从终端处理您的数据文件。
并行增强 数据架构与分类体系
并行增强通过将信息追加到现有结构而不是覆盖它们来确保数据完整性。输出通常遵循以下模式:
| 列 | 描述 |
|---|---|
| 原始列 | 保留输入 CSV 或 JSON 中的所有字段以维持上下文。 |
| 增强列 | 根据您的自然语言意图生成的自定义字段(例如:ceo_name、total_funding)。 |
_parallel_status |
一个跟踪字段,指示该特定行的增强是成功还是遇到了错误。 |
对于更复杂的数据结构,用户可以定义源列元数据,以帮助 AI 有效地解释模糊的数据点。
name: parallel-enrichment
description: "Bulk data enrichment via Parallel API. Adds web-sourced fields (CEO names, funding, contact info) to lists of companies, people, or products. Use for enriching CSV files or inline data."
homepage: https://parallel.ai
Parallel Enrichment
Bulk data enrichment that adds web-sourced fields to lists of companies, people, or products. Describe what you want in natural language.
When to Use
Trigger this skill when the user asks for:
- "enrich this list with...", "add CEO names to...", "find funding for these companies..."
- "look up contact info for...", "get LinkedIn profiles for..."
- Bulk data operations on CSV files or lists
- Adding web-sourced columns to existing datasets
- Lead enrichment, company research, product comparison
Quick Start
# Inline data
parallel-cli enrich run r
--data '[{"company": "Google"}, {"company": "Microsoft"}]' r
--intent "CEO name and founding year" r
--target output.csv
# CSV file
parallel-cli enrich run r
--source-type csv --source input.csv r
--target output.csv r
--intent "CEO name and founding year"
CLI Reference
Basic Usage
parallel-cli enrich run [options]
Note: There is no --json flag for enrich. Results are written to the target file.
Common Flags
| Flag | Description |
|---|---|
--data " |
Inline JSON array of records |
--source-type csv |
Source file type |
--source |
Input CSV file path |
--target |
Output CSV file path |
--source-columns " |
Describe input columns |
--enriched-columns " |
Specify output columns |
--intent " |
Natural language description of what to find |
--processor |
Processing tier (see table below) |
Processor Tiers
| Processor | Use Case |
|---|---|
lite-fast |
Simple lookups |
base-fast |
Basic enrichment |
core-fast |
Standard enrichment |
pro-fast |
Deep enrichment (default) |
ultra-fast |
Complex multi-source enrichment |
Examples
Inline data enrichment:
parallel-cli enrich run r
--data '[{"company": "Stripe"}, {"company": "Square"}, {"company": "Adyen"}]' r
--intent "CEO name, headquarters city, and latest funding round" r
--target ./companies-enriched.csv
CSV file enrichment:
parallel-cli enrich run r
--source-type csv r
--source ./leads.csv r
--target ./leads-enriched.csv r
--source-columns '[{"name": "company_name", "description": "Company name"}]' r
--intent "Find CEO name, company size, and LinkedIn company page URL"
With explicit output columns:
parallel-cli enrich run r
--data '[{"name": "Sam Altman"}, {"name": "Satya Nadella"}]' r
--source-columns '[{"name": "name", "description": "Person full name"}]' r
--enriched-columns '[
{"name": "current_company", "description": "Current company/employer"},
{"name": "title", "description": "Current job title"},
{"name": "twitter", "description": "Twitter/X handle"}
]' r
--target ./people-enriched.csv
Using AI to suggest columns:
# First, get AI suggestions
parallel-cli enrich suggest r
--source-type csv r
--source ./companies.csv r
--intent "competitor analysis data"
# Then run with suggested columns
parallel-cli enrich run r
--source-type csv r
--source ./companies.csv r
--target ./companies-analysis.csv r
--intent "competitor analysis: market position, key products, recent news"
Best-Practice Prompting
Intent Description
Write 1-2 sentences describing:
- What specific fields you want to add
- Context about the data (B2B companies, tech startups, etc.)
- Any constraints (recent data, specific sources)
Good:
--intent "Find CEO name, total funding raised, and number of employees for B2B SaaS companies"
Poor:
--intent "Find stuff about these companies"
Source Column Descriptions
When using --source-columns, provide context:
[
{"name": "company", "description": "Company name, may include Inc/LLC suffix"},
{"name": "website", "description": "Company website URL, may be partial"}
]
Response Format
The CLI outputs:
- A monitoring URL to track progress
- Status updates as rows are processed
- Final output written to target CSV
The target CSV contains:
- All original columns from the source
- New enriched columns as specified
- A
_parallel_statuscolumn indicating success/failure per row
Output Handling
After enrichment completes:
- Report the number of rows enriched
- Preview the first few rows:
head -6 output.csv - Share the full path to the output file
- Note any rows that failed enrichment
Configuration File
For complex enrichments, use a YAML config:
# enrich-config.yaml
source:
type: csv
path: ./input.csv
columns:
- name: company_name
description: "Company legal name"
- name: website
description: "Company website URL"
target:
type: csv
path: ./output.csv
enriched_columns:
- name: ceo_name
description: "Current CEO full name"
- name: employee_count
description: "Approximate number of employees"
- name: funding_total
description: "Total funding raised in USD"
processor: pro-fast
Then run:
parallel-cli enrich run enrich-config.yaml
Running Out of Context?
For large enrichments, save results and use sessions_spawn:
parallel-cli enrich run --source-type csv --source input.csv --target /tmp/enriched-.csv --intent "..."
Then spawn a sub-agent:
{
"tool": "sessions_spawn",
"task": "Read /tmp/enriched-.csv and summarize the results. Report row count, success rate, and preview first 5 rows.",
"label": "enrich-summary"
}
Error Handling
| Exit Code | Meaning |
|---|---|
| 0 | Success |
| 1 | Unexpected error (network, parse) |
| 2 | Invalid arguments |
| 3 | API error (non-2xx) |
Common issues:
- Row failures: Check
_parallel_statuscolumn in output - Timeout: Use smaller batches or lower processor tier
- Rate limits: Add delays between large enrichments
Prerequisites
- Get an API key at parallel.ai
- Install the CLI:
curl -fsSL https://parallel.ai/install.sh | bash
export PARALLEL_API_KEY=your-key
References
- API Docs
- Enrichment API Reference
相关推荐
专题
+ 收藏
+ 收藏
+ 收藏
+ 收藏
+ 收藏
+ 收藏
最新数据
相关文章
阿里云大模型服务平台百炼新人免费额度如何申请?申请与使用免费额度教程及常见问题解答
办公 AI 工具 OpenClaw 部署 Windows 系统一站式教程
Qwen3.6 正式发布!阿里云百炼同步开启“AI大模型节省计划”超值优惠
【新手零难度操作 】OpenClaw 2.6.4 安装误区规避与快速使用指南(包含最新版安装包)
OpenClaw 2.6.4 可视化部署 打造个人 AI 数字员工(包含最新版安装包)
【小白友好!】OpenClaw 2.6.4 本地 AI 智能体快速搭建教程(内有安装包)
零基础部署 OpenClaw v2.6.2,Windows 系统完整教程
【适合新手的】零基础部署 OpenClaw 自动化工具教程
开发者们的第一台自主进化的“爱马仕”来了
极简部署 OpenClaw 2.6.2 本地 AI 智能体快速启用(含最新版安装包)
AI精选
