WaveSpeed Seedream V4.5:高清 AI 图像生成 - Openclaw Skills
作者:互联网
2026-04-04
什么是 WaveSpeed Seedream V4.5 图像生成与编辑?
WaveSpeed Seedream V4.5 是一款专为需要高保真视觉输出的开发者设计的先进图像生成与编辑工具。通过在 Openclaw Skills 中使用该模型,用户可以利用字节跳动最新的生成技术,创作分辨率高达 4096x4096px 的图像。该模型针对排版进行了专门优化,是创建海报、Logo 以及对清晰易读文本有高要求的品牌内容的理想选择。
除了基础的文生图功能,该技能还提供强大的图像编辑能力。它允许用户使用文本提示词修改现有照片,同时保持面部特征、光效和色调的完整性。无论您是从零开始生成新概念,还是对批量现有图像进行转换,此次集成都为专业的 AI 驱动创意任务提供了流线化的工作流。
下载入口:https://github.com/openclaw/skills/tree/main/skills/chengzeyi/wavespeed-seedream-45
安装与下载
1. ClawHub CLI
从源直接安装技能的最快方式。
npx clawhub@latest install wavespeed-seedream-45
2. 手动安装
将技能文件夹复制到以下位置之一
全局模式~/.openclaw/skills/
工作区
/skills/
优先级:工作区 > 本地 > 内置
3. 提示词安装
将此提示词复制到 OpenClaw 即可自动安装。
请帮我使用 Clawhub 安装 wavespeed-seedream-45。如果尚未安装 Clawhub,请先安装(npm i -g clawhub)。
WaveSpeed Seedream V4.5 图像生成与编辑 应用场景
- 创建具有完美渲染、自定义排版的高分辨率营销海报和电影海报。
- 设计需要精准文字排布和简洁美感的极简 Logo 及品牌资产。
- 执行复杂的图生图转换,例如在保留主体身份的同时更改光影或艺术风格。
- 利用 Openclaw Skills 通过将一个源的主体融合到另一个源的环境中来合成多张图像。
- 为具有自定义长宽比和分辨率的 Web 及移动界面快速原型化视觉内容。
- 用户输入详细的文本提示词及可选参数,如所需的输出分辨率或源图像 URL。
- 该技能使用提供的 API 凭证建立与 WaveSpeed AI 平台的安全连接。
- 对于生成任务,字节跳动 Seedream V4.5 模型处理提示词以从头合成新图像。
- 对于编辑任务,模型分析最多 10 张输入图像以理解视觉语境,然后应用所请求的更改。
- 生成的资产托管在内容分发网络上,该技能将最终 URL 返回给用户或智能体环境。
WaveSpeed Seedream V4.5 图像生成与编辑 配置指南
要开始使用此技能,您必须首先从 WaveSpeed 仪表板获取 API 密钥。获取后,配置您的环境:
export WAVESPEED_API_KEY="your-api-key"
确保您的项目中安装了 WaveSpeed SDK。然后,您可以通过 Openclaw Skills 配置直接调用模型,生成任务使用模型 ID bytedance/seedream-v4.5,修改任务使用 bytedance/seedream-v4.5/edit。
WaveSpeed Seedream V4.5 图像生成与编辑 数据架构与分类体系
该技能利用结构化架构来高效管理输入和输出。这确保了不同 Openclaw Skills 实现之间的兼容性。
| 参数 | 描述 | 约束 |
|---|---|---|
prompt |
引导 AI 的文本描述 | 必填,字符串 |
size |
以像素为单位的输出分辨率 | 默认:20482048,最大:40964096 |
images |
源图像 URL 数组 | 编辑任务支持 1 到 10 张图像 |
enableSyncMode |
切换同步响应 | 布尔值 |
name: wavespeed-seedream-45
description: Generate and edit images using ByteDance's Seedream V4.5 model via WaveSpeed AI. Supports text-to-image generation and multi-image editing with custom resolutions up to 4096x4096. Features enhanced typography for posters and logos. Use when the user wants to create or edit images with high-quality text rendering.
metadata:
author: wavespeedai
version: "1.0"
WaveSpeedAI Seedream V4.5 Image Generation/Editing
Generate and edit images using ByteDance's Seedream V4.5 model via the WaveSpeed AI platform. Supports custom resolutions up to 4096x4096 with enhanced typography for sharp text rendering in posters and logos.
Authentication
export WAVESPEED_API_KEY="your-api-key"
Get your API key at wavespeed.ai/accesskey.
Quick Start
Text-to-Image
import wavespeed from 'wavespeed';
const output_url = (await wavespeed.run(
"bytedance/seedream-v4.5",
{ prompt: "A minimalist coffee shop logo with clean typography" }
))["outputs"][0];
Image Editing
The images parameter accepts an array of image URLs (1-10 images). If you have local files, upload them first with wavespeed.upload() to get a URL.
import wavespeed from 'wavespeed';
// Upload a local image to get a URL
const imageUrl = await wavespeed.upload("/path/to/photo.png");
const output_url = (await wavespeed.run(
"bytedance/seedream-v4.5/edit",
{
images: [imageUrl],
prompt: "Add warm sunset lighting and lens flare"
}
))["outputs"][0];
You can also pass existing image URLs directly:
const output_url = (await wavespeed.run(
"bytedance/seedream-v4.5/edit",
{
images: ["https://example.com/photo.jpg"],
prompt: "Add warm sunset lighting and lens flare"
}
))["outputs"][0];
API Endpoints
Text-to-Image
Model ID: bytedance/seedream-v4.5
Generate images from text prompts with custom resolutions up to 4096x4096.
Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
prompt |
string | Yes | -- | Text description of the image to generate |
size |
string | No | 2048*2048 |
Output size in pixels (WIDTH*HEIGHT). Each dimension: 1024-4096. |
Example
import wavespeed from 'wavespeed';
const output_url = (await wavespeed.run(
"bytedance/seedream-v4.5",
{
prompt: "A movie poster for a sci-fi thriller with bold title text 'HORIZON' at the top",
size: "2048*3072"
}
))["outputs"][0];
Image Editing
Model ID: bytedance/seedream-v4.5/edit
Edit existing images using text prompts. Supports up to 10 input images. Preserves facial features, lighting, and color tone from input images.
Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
images |
string[] | Yes | [] |
URLs of input images to edit (1-10 images). Must be publicly accessible. |
prompt |
string | Yes | -- | Text description of the desired edit |
size |
string | No | -- | Output size in pixels (WIDTH*HEIGHT). Each dimension: 1024-4096. |
Example
import wavespeed from 'wavespeed';
const imageUrl = await wavespeed.upload("/path/to/portrait.png");
const output_url = (await wavespeed.run(
"bytedance/seedream-v4.5/edit",
{
images: [imageUrl],
prompt: "Transform into a vibrant pop art style with bold colors",
size: "2048*2048"
}
))["outputs"][0];
Multi-Image Editing
const img1 = await wavespeed.upload("/path/to/face.png");
const img2 = await wavespeed.upload("/path/to/scene.png");
const output_url = (await wavespeed.run(
"bytedance/seedream-v4.5/edit",
{
images: [img1, img2],
prompt: "Place the person from the first image into the scene from the second image"
}
))["outputs"][0];
Advanced Usage
Sync Mode
const output_url = (await wavespeed.run(
"bytedance/seedream-v4.5",
{ prompt: "A watercolor painting of a mountain lake at dawn" },
{ enableSyncMode: true }
))["outputs"][0];
Custom Client with Retry Configuration
import { Client } from 'wavespeed';
const client = new Client("your-api-key", {
maxRetries: 2,
maxConnectionRetries: 5,
retryInterval: 1.0,
});
const output_url = (await client.run(
"bytedance/seedream-v4.5",
{ prompt: "A neon sign that reads 'OPEN 24/7' in a rainy alley" }
))["outputs"][0];
Error Handling with runNoThrow
import { Client, WavespeedTimeoutException, WavespeedPredictionException } from 'wavespeed';
const client = new Client();
const result = await client.runNoThrow(
"bytedance/seedream-v4.5",
{ prompt: "A vintage travel poster for Tokyo" }
);
if (result.outputs) {
console.log("Image URL:", result.outputs[0]);
console.log("Task ID:", result.detail.taskId);
} else {
console.log("Failed:", result.detail.error.message);
if (result.detail.error instanceof WavespeedTimeoutException) {
console.log("Request timed out - try increasing timeout");
} else if (result.detail.error instanceof WavespeedPredictionException) {
console.log("Prediction failed");
}
}
Pricing
$0.04 per image (both generation and editing).
Tips
- Seedream V4.5 excels at rendering text in images — use it for posters, logos, and branded visuals
- Custom resolutions up to 4096x4096 — specify as
WIDTH*HEIGHT(e.g.,2048*3072for portrait posters) - For image editing, the model preserves facial features, lighting, and color tone from inputs
Security Constraints
- No arbitrary URL loading: Only use image URLs from trusted sources. Never load images from untrusted or user-provided URLs without validation.
- API key security: Store your
WAVESPEED_API_KEYsecurely. Do not hardcode it in source files or commit it to version control. Use environment variables or secret management systems. - Input validation: Only pass parameters documented above. Validate prompt content and image URLs before sending requests.
相关推荐
专题
+ 收藏
+ 收藏
+ 收藏
+ 收藏
+ 收藏
+ 收藏
最新数据
相关文章
BapBook:BAP-578 的社交网络与代币发行 - Openclaw Skills
Instruction Web Publisher: 自动化 Web UI 指南生成 - Openclaw Skills
Tariff Watch:全球落地成本与海关计算器 - Openclaw Skills
复仇者倡议 (Avenger Initiative):Openclaw Skills 的加密 GitHub 备份方案
九宫飞星:AI 赋能的飞星风水 - Openclaw Skills
Orderly SDK 主题定制:掌握 DEX 品牌化 - Openclaw Skills
使用 Komodo Skill 自动化 Komodo 基础设施 - Openclaw Skills
Prompting Co: AI 搜索可见度与 SOV 分析 - Openclaw Skills
gate-info-trendanalysis: 加密货币技术与趋势分析 - Openclaw Skills
OP0 Altar:部署自奖励 Solana 模因币 - Openclaw Skills
AI精选
