baoyu-comic

设计与多媒体已审计 @JimLiu v1.56.1

信任分: 92/100
兼容 Agent: 1

速查档案只列事实：领域、Agent、信任分、作者、原文章节。装与不装请看下方作者解读。

领域: 设计与多媒体
兼容 Agent: Claude Code
信任分: 92 / 100 · 已通过审计
作者 / 版本 / 许可: @JimLiu · v1.56.1 · 未声明 license
安装命令数: 1 条

需要注意： 未限定 allowed-tools，默认拥有全部工具权限。

想读作者英文原文？ ↓ 滚到正文区切换 · 在 GitHub 查看 ↗

解读由编辑根据原文凝练而成，命令、链接、术语均与作者原文一致；想看完整论述请切到右侧

baoyu-comic 把一段脚本变成多格知识漫画——重点不是「画风炫」，而是「让概念能被读懂」。

设计思路

作者明确把它定位成「knowledge comics」——知识漫画，目标是把抽象概念用人物 / 场景 / 对白讲清楚，而不是炫技。所以它把美术风格（art style）和叙事语气（tone）做成两个独立可调维度，可以组合成无数种风格：严肃论文风可以配水彩，搞笑科普可以配像素风。

工作流

按 SKILL.md 中 Workflow 章节：① 读用户提供的脚本（可以是分镜 prompt，也可以是大段叙述）；② 自动判断或问用户：分多少页、art style、tone；③ 批量进入图像生成；④ 每页按 Page 1 / Page 2 命名输出，方便后续单页改稿。整个交互优先调 Agent 内置 input 工具（AskUserQuestion 之类），没有时降级到纯文本编号问答。

内置能力

Reference images：你可以传角色参考图，保证主角面部一致
Language Handling：脚本中文、对白中文都没问题
Page Modification：单独改某一页不会动其他页
Changing Preferences：把你常用的风格组合保存成默认配置

适合的场景

知识科普类公众号，把一篇文字推送变成可分享的漫画版
给小孩 / 学生讲技术概念（递归、并发、机器学习）
内部培训材料，把流程图变成情节剧
公司内部价值观、文化故事的视觉化

注意

脚本质量直接决定漫画质量——脚本含糊（"画一段关于诚信的故事"）会让 Agent 自己脑补；脚本具体（"4 页：A 在地铁里发现钱包，B 提醒……"）效果就稳。

Knowledge Comic Creator

Create original knowledge comics with flexible art style × tone combinations.

User Input Tools

When this skill prompts the user, follow this tool-selection rule (priority order):

Prefer built-in user-input tools exposed by the current agent runtime — e.g., AskUserQuestion, request_user_input, clarify, ask_user, or any equivalent.
Fallback: if no such tool exists, emit a numbered plain-text message and ask the user to reply with the chosen number/answer for each question.
Batching: if the tool supports multiple questions per call, combine all applicable questions into a single call; if only single-question, ask them one at a time in priority order.

Concrete AskUserQuestion references below are examples — substitute the local equivalent in other runtimes.

Image Generation Tools

When this skill needs to render an image, resolve the backend in this order:

Current-request override — if the user names a specific backend in the current message, use it.
Saved preference — if EXTEND.md sets preferred_image_backend to a backend available right now, use it.
Auto-select (when the preference is auto, unset, or the pinned backend isn't available):
- Codex (imagegen) — first, inspect your available-skills / tool inventory. If a skill named imagegen is listed, you are running inside Codex and MUST use it: invoke via the Skill tool with skill: "imagegen", passing the saved prompt file's content (plus output path and aspect ratio per Codex imagegen's own args). Codex imagegen is the official raster backend in that runtime and outranks any non-native skill (e.g., baoyu-imagine) unless the user has explicitly pinned a different preferred_image_backend.
- Other runtime-native tools — if the runtime exposes a different native image tool (e.g., Hermes image_generate), use it the same way.
- Otherwise, if exactly one non-native backend is installed (e.g., baoyu-imagine), use it.
- Otherwise (multiple non-native backends with no runtime-native tool), ask the user once — batch with any other initial questions.
If none are available, tell the user and ask how to proceed.

⛔ Never substitute SVG, HTML, canvas, or other code-based rendering for raster image generation. Codex imagegen's own description says it should be used "when the output should be a bitmap asset rather than repo-native code or vector." If you cannot resolve a raster backend via step 3, fall through to step 4 and ask the user — do not silently emit SVG, write inline <svg> markup, or produce HTML/CSS art as a substitute. This applies even if the article/section seems "diagram-like": the consumer skill calling this rule has already decided that a raster image is what it needs.

Setting preferred_image_backend: ask forces the step-3 prompt every run regardless of available backends. Users change the pinned backend via the ## Changing Preferences section below.

Prompt file requirement (hard): write each image's full, final prompt to a standalone file under prompts/ (naming: NN-{type}-[slug].md) BEFORE invoking any backend. The backend receives the prompt file (or its content); the file is the reproducibility record and lets you switch backends without regenerating prompts.

Concrete tool names (imagegen, image_generate, baoyu-imagine) above are examples — substitute the local equivalents under the same rule.

Reference Images

Users may supply reference images to guide art style, palette, scene composition, or subject. This is separate from the auto-generated character sheet (Step 7.1) — both can coexist: user refs guide the look, the character sheet anchors recurring character identity.

Intake: Accept via --ref <files...> or when the user provides file paths / pastes images in conversation.

File path(s) → copy to refs/NN-ref-{slug}.{ext} alongside the comic output
Pasted image with no path → ask the user for the path (per the User Input Tools rule above), or extract style traits verbally as a text fallback
No reference → skip this section

Usage modes (per reference):

Usage	Effect
`direct`	Pass the file to the backend as a reference image on every page (or selected pages)
`style`	Extract style traits (line treatment, texture, mood) and append to every page's prompt body
`palette`	Extract hex colors and append to every page's prompt body

Record in each page's prompt frontmatter when refs exist:

references:
  - ref_id: 01
    filename: 01-ref-scene.png
    usage: direct

At generation time:

Verify each referenced file exists on disk
If usage: direct AND the chosen backend accepts multiple reference images → pass both the character sheet (Step 7.2) and the user refs via the backend's ref parameter; compress images first per Step 7.1's guidance to avoid payload failures
If the backend accepts only one ref → prefer the character sheet for pages with recurring characters; embed user-ref traits in the prompt body instead
For style/palette usage → embed extracted traits in every page's prompt text (applies regardless of backend capability)

Options

Visual Dimensions

Option	Values	Description
`--art`	ligne-claire (default), manga, realistic, ink-brush, chalk, minimalist	Art style / rendering technique
`--tone`	neutral (default), warm, dramatic, romantic, energetic, vintage, action	Mood / atmosphere
`--layout`	standard (default), cinematic, dense, splash, mixed, webtoon, four-panel	Panel arrangement
`--aspect`	3:4 (default, portrait), 4:3 (landscape), 16:9 (widescreen)	Page aspect ratio
`--lang`	auto (default), zh, en, ja, etc.	Output language
`--ref <files...>`	File paths	Reference images applied to every page for style / palette / scene guidance. See Reference Images above.

Partial Workflow Options

Option	Description
`--storyboard-only`	Generate storyboard only, skip prompts and images
`--prompts-only`	Generate storyboard + prompts, skip images
`--images-only`	Generate images from existing prompts directory
`--regenerate N`	Regenerate specific page(s) only (e.g., `3` or `2,5,8`)

Details: references/partial-workflows.md

Art, Tone & Preset Catalogue

Art styles (6): ligne-claire, manga, realistic, ink-brush, chalk, minimalist. Full definitions at references/art-styles/<style>.md.
Tones (7): neutral, warm, dramatic, romantic, energetic, vintage, action. Full definitions at references/tones/<tone>.md.

Presets (5) with special rules beyond plain art+tone:

Preset	Equivalent	Hook
`ohmsha`	manga + neutral	Visual metaphors, no talking heads, gadget reveals
`wuxia`	ink-brush + action	Qi effects, combat visuals, atmospheric
`shoujo`	manga + romantic	Decorative elements, eye details, romantic beats
`concept-story`	manga + warm	Visual symbol system, growth arc, dialogue+action balance
`four-panel`	minimalist + neutral + four-panel layout	起承转合 structure, B&W + spot color, stick-figure characters

Full rules at references/presets/<preset>.md — load the file when a preset is picked.

Compatibility matrix and content-signal → preset table live in references/auto-selection.md. Read it before recommending combinations in Step 2.

Script Directory

Important: All scripts are located in the scripts/ subdirectory of this skill.

Agent Execution Instructions:

Determine this SKILL.md file's directory path as {baseDir}
Script path = {baseDir}/scripts/<script-name>.ts
Replace all {baseDir} in this document with the actual path
Resolve ${BUN_X} runtime: if bun installed → bun; if npx available → npx -y bun; else suggest installing bun

Script Reference:

Script	Purpose
`scripts/merge-to-pdf.ts`	Merge comic pages into PDF

File Structure

Output directory: comic/{topic-slug}/

Slug: 2-4 words kebab-case from topic (e.g., alan-turing-bio)
Conflict: append timestamp (e.g., turing-story-20260118-143052)

Contents:

File	Description
`source-{slug}.{ext}`	Source files
`analysis.md`	Content analysis
`storyboard.md`	Storyboard with panel breakdown
`characters/characters.md`	Character definitions
`characters/characters.png`	Character reference sheet
`prompts/NN-{cover\|page}-[slug].md`	Generation prompts
`NN-{cover\|page}-[slug].png`	Generated images
`{topic-slug}.pdf`	Final merged PDF

Language Handling

Detection Priority:

--lang flag (explicit)
EXTEND.md language setting
User's conversation language
Source content language

Rule: Use user's input language or saved language preference for ALL interactions:

Storyboard outlines and scene descriptions
Image generation prompts
User selection options and confirmations
Progress updates, questions, errors, summaries

Technical terms remain in English.

Workflow

Progress Checklist

Comic Progress:
- [ ] Step 1: Setup & Analyze
  - [ ] 1.1 Preferences (EXTEND.md) ⛔ BLOCKING
    - [ ] Found → load preferences → continue
    - [ ] Not found → run first-time setup → MUST complete before other steps
  - [ ] 1.2 Analyze, 1.3 Check existing
- [ ] Step 2: Confirmation - Style & options ⚠️ REQUIRED
- [ ] Step 3: Generate storyboard + characters
- [ ] Step 4: Review outline (conditional)
- [ ] Step 5: Generate prompts
- [ ] Step 6: Review prompts (conditional)
- [ ] Step 7: Generate images
  - [ ] 7.1 Generate character sheet (if needed) → characters/characters.png
  - [ ] 7.2 Generate pages (with --ref if character sheet exists)
- [ ] Step 8: Merge to PDF
- [ ] Step 9: Completion report

Flow

Input → [Preferences] ─┬─ Found → Continue
                       │
                       └─ Not found → First-Time Setup ⛔ BLOCKING
                                      │
                                      └─ Complete setup → Save EXTEND.md → Continue
                                                                              │
        ┌─────────────────────────────────────────────────────────────────────┘
        ↓
Analyze → [Check Existing?] → [Confirm: Style + Reviews] → Storyboard → [Review?] → Prompts → [Review?] → Images → PDF → Complete

Step Summary

Step	Action	Key Output
1.1	Load EXTEND.md preferences ⛔ BLOCKING if not found	Config loaded
1.2	Analyze content	`analysis.md`
1.3	Check existing directory	Handle conflicts
2	Confirm style, focus, audience, reviews	User preferences
3	Generate storyboard + characters	`storyboard.md`, `characters/`
4	Review outline (if requested)	User approval
5	Generate prompts	`prompts/*.md`
6	Review prompts (if requested)	User approval
7.1	Generate character sheet (if needed)	`characters/characters.png`
7.2	Generate pages (with character ref if available)	`*.png` files
8	Merge to PDF	`{slug}.pdf`
9	Completion report	Summary

Step 7: Image Generation

Pick a backend once per session using the ## Image Generation Tools rule at the top. If the backend is a repo skill (e.g., baoyu-imagine), read its SKILL.md and use its documented interface rather than its scripts.

7.1 Character sheet — generate it (to characters/characters.png, aspect 4:3) when the comic is multi-page with recurring characters. Skip for simple presets (e.g., four-panel minimalist) or single-page comics. Compress to JPEG before use-as---ref (sips -s format jpeg -s formatOptions 80 … on macOS, pngquant --quality=65-80 … elsewhere) to avoid payload failures. The prompt file at characters/characters.md must exist before invoking the backend.

7.2 Pages — each page's prompt MUST already be at prompts/NN-{cover|page}-[slug].md before invoking the backend; the file is the reproducibility record. Strategy depends on the character sheet:

Character sheet	Backend `--ref`	Strategy
Exists	Supported	Pass sheet as `--ref` on every page
Exists	Not supported	Prepend character descriptions to every prompt file
Skipped	—	All descriptions inline in prompt

Backup rule: existing prompts/…md and …png files → rename with -backup-YYYYMMDD-HHMMSS suffix before regenerating. Aspect ratio from storyboard (default 3:4; preset may override).

--ref failure recovery: compress sheet → retry → still fails → drop --ref and embed character descriptions in the prompt text.

Full step-by-step workflow (analysis, storyboard, review gates, regeneration variants): references/workflow.md.

EXTEND.md Paths ⛔ BLOCKING

If EXTEND.md is not found, first-time setup is blocking — complete it before any content analysis or style/tone questions.

Priority	Path	Scope
1	`.baoyu-skills/baoyu-comic/EXTEND.md`	Project
2	`$HOME/.baoyu-skills/baoyu-comic/EXTEND.md`	User home

Result	Action
Found	Read, parse, display summary → continue
Not found	⛔ Run first-time setup (references/config/first-time-setup.md) → save EXTEND.md → continue

EXTEND.md supports: watermark, preferred art/tone/layout, custom style definitions, character presets, language preference. Schema: references/config/preferences-schema.md.

References

Core Templates:

analysis-framework.md - Deep content analysis
character-template.md - Character definition format
storyboard-template.md - Storyboard structure
ohmsha-guide.md - Ohmsha manga specifics

Style Definitions:

references/art-styles/ - Art styles (ligne-claire, manga, realistic, ink-brush, chalk, minimalist)
references/tones/ - Tones (neutral, warm, dramatic, romantic, energetic, vintage, action)
references/presets/ - Presets with special rules (ohmsha, wuxia, shoujo, concept-story, four-panel)
references/layouts/ - Layouts (standard, cinematic, dense, splash, mixed, webtoon, four-panel)

Workflow:

workflow.md - Full workflow details
auto-selection.md - Content signal analysis
partial-workflows.md - Partial workflow options

Config:

config/preferences-schema.md - EXTEND.md schema
config/first-time-setup.md - First-time setup
config/watermark-guide.md - Watermark configuration

Page Modification

Action	Steps
Edit	Update prompt file FIRST → `--regenerate N` → Regenerate PDF
Add	Create prompt at position → Generate with character ref → Renumber subsequent → Update storyboard → Regenerate PDF
Delete	Remove files → Renumber subsequent → Update storyboard → Regenerate PDF

IMPORTANT: When updating pages, ALWAYS update the prompt file (prompts/NN-{cover|page}-[slug].md) FIRST before regenerating. This ensures changes are documented and reproducible.

Notes

Image generation: 10-30 seconds per page
Auto-retry once on generation failure
Use stylized alternatives for sensitive public figures
Maintain style consistency via session ID
Step 2 confirmation required - do not skip
Steps 4/6 conditional - only if user requested in Step 2
Step 7.1 character sheet - recommended for multi-page comics, optional for simple presets
Step 7.2 character reference - use --ref if sheet exists; compress/convert on failure; fall back to prompt-only
Watermark/language configured once in EXTEND.md

Changing Preferences

EXTEND.md lives at .baoyu-skills/baoyu-comic/EXTEND.md (project) or ~/.baoyu-skills/baoyu-comic/EXTEND.md (user). Three ways to change it:

Edit directly — open EXTEND.md and change fields. Full schema: references/config/preferences-schema.md.
Reconfigure interactively — delete EXTEND.md (or ask "reconfigure baoyu-comic preferences" / "重新配置"). The next run re-triggers first-time setup.
Common one-line edits:
- preferred_image_backend: auto — default; runtime-native tool wins, falls back to the only installed backend, asks only if multiple non-native are present.
- preferred_image_backend: codex-imagegen — pin to Codex's built-in.
- preferred_image_backend: baoyu-imagine — pin to the baoyu-imagine skill.
- preferred_image_backend: ask — confirm backend every run.
- watermark.enabled: true, preferred_art, preferred_tone, preferred_layout, language — shift the auto-selection defaults and cosmetic choices.