feat: initialize OmniClaw skills registry

2026-04-24 01:12:20 -07:00
commit 789bb38e69
16 changed files with 1713 additions and 0 deletions
--- a/skills/sub2api-gpt-image-2/SKILL.md
+++ b/skills/sub2api-gpt-image-2/SKILL.md
@@ -0,0 +1,85 @@
+---
+name: sub2api-gpt-image-2
+description: Use when Codex or another agent needs to call, document, troubleshoot, or generate code for GPT Image 2 through sub2api/OpenAI-compatible image APIs, including /v1/images/generations, /v1/images/edits, Responses image_generation tools, 4K image settings, streaming partial images, base64 decoding, and production error handling.
+---
+
+# Sub2API GPT Image 2
+
+## Workflow
+
+1. Treat `gpt-image-2` as an image generation/editing model, not a Codex or agent main model.
+2. Use `/v1/images/generations` for direct text-to-image and `/v1/images/edits` for reference-image or mask edits.
+3. Use `/v1/responses` with a text/agent model such as `gpt-5.5` plus an `image_generation` tool when an agent should reason before generating images.
+4. Never print, commit, or hard-code real API keys. Read credentials from `API_KEY` and `BASE_URL`.
+5. For detailed parameter rules, read `references/api.md`.
+6. For quick non-streaming generation tests, run `scripts/generate_image.py`.
+
+## Defaults
+
+Use these defaults unless the task says otherwise:
+
+```text
+BASE_URL=https://claude.omniclaw.store/v1
+model=gpt-image-2
+size=1024x1024
+quality=medium
+output_format=png
+n=1
+timeout=300 for 4K, 120 otherwise
+```
+
+Use `3840x2160` only when explicitly requested or when a final 4K asset is needed. For prompt iteration, generate smaller images first.
+
+## Direct Generation
+
+Run:
+
+```bash
+export API_KEY="sk-..."
+export BASE_URL="https://claude.omniclaw.store/v1"
+python3 ~/.codex/skills/sub2api-gpt-image-2/scripts/generate_image.py \
+  --prompt "A premium product poster for an AI service" \
+  --size 1536x1024 \
+  --quality medium \
+  --output image.png
+```
+
+Use `--dry-run` to inspect payloads without calling the API.
+
+## Editing
+
+Use multipart form requests with `image[]=@file.png`; add `mask=@mask.png` only for local masked edits. For `gpt-image-2`, omit `input_fidelity` because it always processes image inputs at high fidelity.
+
+## Responses Tool Pattern
+
+For agentic workflows:
+
+```json
+{
+  "model": "gpt-5.5",
+  "input": "Generate a clean product poster for an AI proxy service.",
+  "tools": [
+    {
+      "type": "image_generation",
+      "size": "1536x1024",
+      "quality": "medium",
+      "output_format": "png"
+    }
+  ]
+}
+```
+
+Do not set the Responses `model` to `gpt-image-2`.
+
+## Troubleshooting
+
+- `401`: wrong or disabled sub2api key.
+- `400`: invalid params such as unsupported transparent background or invalid size.
+- `429`: upstream account usage limit; prefer plus/team/pro accounts.
+- `502`: upstream did not return image output, network failed, or content was refused; inspect sub2api logs and simplify the prompt.
+- Long latency: 4K/high can take minutes; use `jpeg`, `quality=low|medium`, or smaller sizes for drafts.
+
+## Safety
+
+Reject or rewrite sexualized young-looking subjects, minors, non-consensual sexual content, explicit nudity, and other disallowed content before calling the API.
+
--- a/skills/sub2api-gpt-image-2/agents/openai.yaml
+++ b/skills/sub2api-gpt-image-2/agents/openai.yaml
@@ -0,0 +1,5 @@
+interface:
+  display_name: "Sub2API GPT Image 2"
+  short_description: "Generate and debug GPT Image 2 calls."
+  default_prompt: "Use sub2api-gpt-image-2 to generate, edit, stream, or troubleshoot GPT Image 2 API requests."
+
--- a/skills/sub2api-gpt-image-2/references/api.md
+++ b/skills/sub2api-gpt-image-2/references/api.md
@@ -0,0 +1,106 @@
+# GPT Image 2 API Reference For sub2api
+
+Source links:
+
+- https://developers.openai.com/api/docs/models/gpt-image-2
+- https://developers.openai.com/api/docs/guides/image-generation
+- https://developers.openai.com/api/reference/resources/images
+- Repository docs: `apis/sub2api/gpt-image-2.zh.md` and `apis/sub2api/gpt-image-2.en.md`
+
+## Endpoints
+
+```text
+POST /v1/images/generations
+POST /v1/images/edits
+POST /v1/responses  # image_generation tool with a text/agent model
+```
+
+`/v1/models` may only list Codex/text models in this deployment. That does not mean `/v1/images/generations` cannot accept `gpt-image-2`.
+
+## Generation Parameters
+
+| Parameter | Values | Notes |
+|---|---|---|
+| `model` | `gpt-image-2`, `gpt-image-2-2026-04-21` | Required for Images API. |
+| `prompt` | string | Required. Be concrete about subject, composition, style, lighting, and safety constraints. |
+| `n` | `1-10` | Prefer `1` for production retries and billing attribution. |
+| `size` | `auto` or valid `WxH` | Common: `1024x1024`, `1536x1024`, `1024x1536`, `3840x2160`. |
+| `quality` | `low`, `medium`, `high`, `auto` | Use `low` for drafts, `medium` for normal, `high` for final. |
+| `output_format` | `png`, `jpeg`, `webp` | `jpeg` is usually faster than `png`. |
+| `output_compression` | `0-100` | Applies to `jpeg` and `webp`. |
+| `background` | `auto`, `opaque` | `gpt-image-2` does not support `transparent`. |
+| `moderation` | `auto`, `low` | Changes filter strictness but does not bypass policy. |
+| `stream` | boolean | Enables SSE image events. |
+| `partial_images` | `0-3` | Streaming only; each partial image adds output token cost. |
+| `user` | string | End-user identifier for abuse monitoring. |
+
+## Size Constraints
+
+- Maximum edge length: `3840px`.
+- Both edges must be multiples of `16px`.
+- Long edge to short edge ratio must be at most `3:1`.
+- Total pixels must be between `655360` and `8294400`.
+- Outputs larger than `2560x1440` should be treated as high-latency experimental outputs.
+
+## Edit Notes
+
+- Use multipart form data.
+- Send reference images as `image[]`.
+- Optional `mask` must match input image format and dimensions, be under 50MB, and include an alpha channel.
+- Omit `input_fidelity` for `gpt-image-2`.
+
+## Response Shape
+
+```json
+{
+  "data": [
+    {
+      "b64_json": "...",
+      "revised_prompt": "..."
+    }
+  ],
+  "model": "gpt-image-2",
+  "size": "1024x1024",
+  "quality": "medium",
+  "output_format": "png",
+  "usage": {
+    "input_tokens": 43,
+    "output_tokens": 196,
+    "total_tokens": 239
+  }
+}
+```
+
+Always decode `data[0].b64_json` unless streaming.
+
+## Production Dispatch
+
+- Prefer plus/team/pro upstream OpenAI OAuth accounts for image workloads.
+- Set client timeout to `120s` for normal images and `300s` for 4K.
+- Retry only transient network/5xx failures with low retry counts.
+- Do not retry policy or validation errors without changing the prompt or params.
+- Log `usage`, latency, size, quality, output format, account ID, API key ID, and request ID.
+
+## Valid curl Template
+
+```bash
+curl -sS "$BASE_URL/images/generations" \
+  -H "Authorization: Bearer $API_KEY" \
+  -H "Content-Type: application/json" \
+  --max-time 300 \
+  -d '{
+    "model": "gpt-image-2",
+    "prompt": "A premium product poster for an AI service",
+    "size": "1536x1024",
+    "quality": "medium",
+    "output_format": "png",
+    "n": 1
+  }' > image.json
+```
+
+Decode:
+
+```bash
+jq -r '.data[0].b64_json' image.json | base64 --decode > image.png
+```
+
--- a/skills/sub2api-gpt-image-2/scripts/generate_image.py
+++ b/skills/sub2api-gpt-image-2/scripts/generate_image.py
@@ -0,0 +1,123 @@
+#!/usr/bin/env python3
+import argparse
+import base64
+import json
+import os
+import sys
+import time
+import urllib.error
+import urllib.request
+
+
+def parse_args() -> argparse.Namespace:
+    parser = argparse.ArgumentParser(
+        description="Generate an image with gpt-image-2 through an OpenAI-compatible /v1/images/generations endpoint."
+    )
+    parser.add_argument("--base-url", default=os.environ.get("BASE_URL", "https://claude.omniclaw.store/v1"))
+    parser.add_argument("--api-key-env", default="API_KEY")
+    parser.add_argument("--prompt", required=True)
+    parser.add_argument("--output", default="image.png")
+    parser.add_argument("--model", default="gpt-image-2")
+    parser.add_argument("--size", default="1024x1024")
+    parser.add_argument("--quality", default="medium", choices=["low", "medium", "high", "auto"])
+    parser.add_argument("--format", default="png", choices=["png", "jpeg", "webp"])
+    parser.add_argument("--compression", type=int, default=None)
+    parser.add_argument("--background", default=None, choices=["auto", "opaque"])
+    parser.add_argument("--moderation", default=None, choices=["auto", "low"])
+    parser.add_argument("--n", type=int, default=1)
+    parser.add_argument("--timeout", type=float, default=300.0)
+    parser.add_argument("--dry-run", action="store_true")
+    return parser.parse_args()
+
+
+def build_payload(args: argparse.Namespace) -> dict:
+    payload = {
+        "model": args.model,
+        "prompt": args.prompt,
+        "size": args.size,
+        "quality": args.quality,
+        "output_format": args.format,
+        "n": args.n,
+    }
+    if args.compression is not None:
+        payload["output_compression"] = args.compression
+    if args.background:
+        payload["background"] = args.background
+    if args.moderation:
+        payload["moderation"] = args.moderation
+    return payload
+
+
+def main() -> int:
+    args = parse_args()
+    payload = build_payload(args)
+    url = args.base_url.rstrip("/") + "/images/generations"
+
+    if args.dry_run:
+        print(json.dumps({"url": url, "payload": payload}, ensure_ascii=False, indent=2))
+        return 0
+
+    api_key = os.environ.get(args.api_key_env)
+    if not api_key:
+        print(f"Missing API key env: {args.api_key_env}", file=sys.stderr)
+        return 2
+
+    body = json.dumps(payload, ensure_ascii=False).encode("utf-8")
+    req = urllib.request.Request(
+        url,
+        data=body,
+        method="POST",
+        headers={
+            "Authorization": f"Bearer {api_key}",
+            "Content-Type": "application/json",
+        },
+    )
+
+    started = time.monotonic()
+    try:
+        with urllib.request.urlopen(req, timeout=args.timeout) as resp:
+            status = resp.status
+            raw = resp.read()
+    except urllib.error.HTTPError as exc:
+        raw = exc.read()
+        print(f"HTTP {exc.code}", file=sys.stderr)
+        print(raw.decode("utf-8", "replace")[:4000], file=sys.stderr)
+        return 1
+    except Exception as exc:
+        print(f"request failed: {type(exc).__name__}: {exc}", file=sys.stderr)
+        return 1
+
+    elapsed = time.monotonic() - started
+    data = json.loads(raw.decode("utf-8"))
+    if "error" in data:
+        print(json.dumps(data["error"], ensure_ascii=False), file=sys.stderr)
+        return 1
+
+    items = data.get("data") or []
+    if not items or "b64_json" not in items[0]:
+        print("No b64_json image returned", file=sys.stderr)
+        print(json.dumps(data, ensure_ascii=False)[:4000], file=sys.stderr)
+        return 1
+
+    image = base64.b64decode(items[0]["b64_json"])
+    with open(args.output, "wb") as f:
+        f.write(image)
+
+    summary = {
+        "status": status,
+        "elapsed_seconds": round(elapsed, 3),
+        "output": args.output,
+        "bytes": len(image),
+        "model": data.get("model"),
+        "size": data.get("size"),
+        "quality": data.get("quality"),
+        "output_format": data.get("output_format"),
+        "usage": data.get("usage"),
+    }
+    print(json.dumps(summary, ensure_ascii=False, indent=2))
+    return 0
+
+
+if __name__ == "__main__":
+    raise SystemExit(main())
+
--- a/skills/sub2api-gpt-image-2/skill.registry.json
+++ b/skills/sub2api-gpt-image-2/skill.registry.json
@@ -0,0 +1,11 @@
+{
+  "name": "sub2api-gpt-image-2",
+  "version": "0.1.0",
+  "title": "Sub2API GPT Image 2",
+  "description": "Generate, edit, stream, and troubleshoot GPT Image 2 calls through sub2api/OpenAI-compatible image APIs.",
+  "tags": ["sub2api", "openai", "gpt-image-2", "images", "responses", "codex"],
+  "agents": ["codex", "openai-compatible-agent"],
+  "api_docs": ["sub2api/gpt-image-2.zh", "sub2api/gpt-image-2.en"],
+  "updated_at": "2026-04-24"
+}
+