Generation Engines · v0.2.0 · 2026-06-03

Pick your engines.
Wire them in minutes.

GenCreator OS isn't built on one model. It's a curated menu — premium hero stills, cinematic video, cheap batch, an alt-image lane — each routed through the same taste gate. This is the registry: what every engine is best for, what it costs, and the exact command to install it.

Why a registry, not a winner

The engine is replaceable. The taste isn't.

Every model on this page will be obsolete within a year. Betting the system on one of them is how you rebuild it every time the frontier moves. So we don't. We keep a menu — and the part that's actually ours is the curation: which engine for which job, the taste lanes every output is held against, and the gate that refuses the generic.

Pick the tiers you need. One premium-hero engine, one cinematic-video engine, one cheap batch lane covers most of a creator's year. The rest are there when a specific job calls for them.

The FrankX default stack

Three engines cover most of the work.

If you install nothing else, install these. Each owns one tier so they compose instead of competing.

Premium hero

Grok Imagine

Native image + video inside the Grok harness — already in your SuperGrok plan.

Premium hero

Nano Banana Pro (Antigravity)

Gemini 3 Pro Image built into Antigravity — free, in-editor, and Search-grounded.

Alt image

gpt-image-2 (Codex)

OpenAI image gen built into Codex CLI via the $imagegen skill — on your ChatGPT plan.

Premium hero

InfoGenius / Nano Banana

Gemini-3 premium hero stills with Guardian AI routing — the in-house engine.

Cinematic video

Higgsfield

One OAuth endpoint fronting 30+ frontier models — Sora, Veo, Kling, Soul.

Batch

fal.ai

Pay-per-use aggregator — the economical lane for batch video and bulk B-roll.

Every engine, by tier

What it's for. What it costs. How to install it.

Status reflects this repo's actual wiring — never an aspiration. The registry lives at lib/gen/backends.ts.
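
One way to picture a registry entry, as a minimal sketch. The contents of lib/gen/backends.ts are not shown on this page, so every type and field name below is hypothetical. Only the values (tiers, modalities, transports, statuses, env var names) are taken from the cards that follow.

```typescript
// Hypothetical shape for entries in lib/gen/backends.ts.
// Field names are illustrative; values mirror the cards on this page.
type Tier = "premium-hero" | "cinematic-video" | "batch" | "alt-image" | "frontier-video";
type Modality = "image" | "video" | "both";
type Transport = "library" | "mcp" | "api";
type Status = "installed" | "available" | "pending-auth" | "recommended";

interface Backend {
  id: string;
  name: string;
  tier: Tier;
  modality: Modality;
  transport: Transport;
  status: Status;
  authEnvVar?: string; // present only when the engine needs an API key
}

// A subset of the engines listed below, not the full registry.
const BACKENDS: Backend[] = [
  { id: "grok-imagine", name: "Grok Imagine", tier: "premium-hero", modality: "both", transport: "library", status: "installed" },
  { id: "nano-banana-pro", name: "Nano Banana Pro (Antigravity)", tier: "premium-hero", modality: "image", transport: "library", status: "installed" },
  { id: "infogenius", name: "InfoGenius / Nano Banana", tier: "premium-hero", modality: "image", transport: "mcp", status: "available", authEnvVar: "GOOGLE_GENERATIVE_AI_API_KEY" },
  { id: "higgsfield", name: "Higgsfield", tier: "cinematic-video", modality: "video", transport: "mcp", status: "pending-auth" },
  { id: "fal", name: "fal.ai", tier: "batch", modality: "both", transport: "api", status: "recommended", authEnvVar: "FAL_KEY" },
];

// Engines in a tier whose status is exactly "installed".
function installedInTier(tier: Tier): Backend[] {
  return BACKENDS.filter((b) => b.tier === tier && b.status === "installed");
}

console.log(installedInTier("premium-hero").map((b) => b.name));
```

With these sample entries, the premium-hero lookup returns Grok Imagine and Nano Banana Pro (Antigravity). fal.ai is left out of the batch lookup because its status is Recommended, not Installed.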

Premium hero

Quality over volume — top-of-page stills, covers, infographics.

Grok Imagine

both · library

Installed

Native image + video inside the Grok harness — already in your SuperGrok plan.

Grok Imagine (Aurora) · Grok Imagine Video
Best for
  • Cinematic + photographic heroes generated from inside a Grok session
  • High volume on a flat monthly fee (no per-image bill)
  • Fast iteration when you are already working in Grok
Not when
Text-heavy/technical diagrams where labels must be legible — Grok Imagine fabricates text. Use Nano Banana Pro (Search-grounded) for those.
Auth
Logged-in Grok CLI/app. NB: the CLI may sit on a base-tier token showing "free credits %" — re-login with the main SuperGrok account for full quota.
Cost
✓ No extra API cost (included in SuperGrok)
Included in
SuperGrok ($30/mo) — ~30–100 quality images/day on rolling windows

Rolling windows throttle bursts (a burst of about 50, then a 2–6h wait). Video costs ~5–10× an image in quota.

When paid is worth it: For programmatic batches beyond the daily quota, the Grok Imagine API (~$0.02/image) supplements it.
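
The quota arithmetic above can be sketched as two small helpers. This is a rough estimate only, assuming the ~$0.02/image API price and the 5–10× video multiplier quoted on this card. The function names and the example quota of 100 images/day are illustrative.

```typescript
// Assumed figure from this card: roughly $0.02 per image on the Grok Imagine API.
const API_PRICE_PER_IMAGE_USD = 0.02;

// Quota consumed by a mixed job, measured in image-equivalents.
// videoMultiplier is somewhere between 5 and 10 per the note above.
function quotaUnits(images: number, videos: number, videoMultiplier: number): number {
  return images + videos * videoMultiplier;
}

// Cost of sending whatever exceeds the included daily quota to the paid API,
// rounded to whole cents.
function overflowCostUsd(batchImages: number, dailyQuota: number): number {
  const overflow = Math.max(0, batchImages - dailyQuota);
  return Math.round(overflow * API_PRICE_PER_IMAGE_USD * 100) / 100;
}

console.log(quotaUnits(20, 4, 10)); // 20 images + 4 videos at 10x → 60
console.log(overflowCostUsd(500, 100)); // 400 images over quota → 8
```

So a 500-image batch against a 100/day quota would put about $8 on the API, if the quoted price holds.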

Install

  1. Already native in the Grok harness

    Generate from a Grok session directly. Capacity guide: ~/.grok/docs/visual-generation-capacity.md.

Nano Banana Pro (Antigravity)

image · library

Installed

Gemini 3 Pro Image built into Antigravity — free, in-editor, and Search-grounded.

Nano Banana Pro (Gemini 3 Pro Image) · Nano Banana 2 (Gemini 3.1 Flash Image)
Best for
  • Text-heavy + technical heroes, infographics, diagrams — Search grounding keeps labels legible & accurate
  • Real-time-data visuals (current pricing, dates, charts) via Google Search
  • No-extra-cost generation from inside Antigravity / a Gemini session
Not when
You are working in a non-Gemini harness with no key — Claude reaches the same model via the infogenius script path instead.
Auth
Logged-in Antigravity / Gemini account (no separate API key inside Antigravity)
Cost
✓ No extra API cost: free in the Antigravity preview / included in Google AI subscription
Included in
Antigravity free public preview (NB2 free to all); Google AI Pro/Ultra for higher limits

When paid is worth it: Programmatic batches outside Antigravity → the Gemini API (per-image pricing) via the infogenius script.

Install

  1. Native in Antigravity

    Ask the Antigravity agent to generate — it invokes Nano Banana Pro in-conversation. NB2 is the free default.

InfoGenius / Nano Banana

image · mcp

Available

Gemini-3 premium hero stills with Guardian AI routing — the in-house engine.

Gemini 3 Pro Image · Nano Banana 2 · Nano Banana Pro
Best for
  • Hero imagery at the top of a page
  • Book + album covers that live for years
  • Research-grounded infographics (the InfoGenius pipeline)
Not when
Bulk B-roll or 20-attempts-in-10-minutes ad variants — that volume burns time at hero quality.
Auth
Google Generative AI API key (GOOGLE_GENERATIVE_AI_API_KEY)
Cost
Gemini API pricing (~pennies per image)

The durable path (scripts/lib/nb-image.mjs) needs no server and finds the key automatically.

Install

  1. Option A: durable library (no server, recommended)

    node scripts/nb-generate.mjs "<prompt>" --size 2K

    Already in-repo. Reads GOOGLE_GENERATIVE_AI_API_KEY from Arcanea/.env.local automatically. Most reliable across reinstalls.

  2. Option B: the arcanea-infogenius MCP (when the binary is present)

    claude mcp add infogenius node "<path>/arcanea-infogenius/mcp-server/dist/index.js"

    arcanea-infogenius-mcp v2.0.0 (Gemini 3 Pro + Guardian routing). Note: the binary is not in the current Arcanea tree — rebuild it there before registering, and pass GOOGLE_GENERATIVE_AI_API_KEY via env expansion, never a key pasted into a committed .mcp.json. The Option A library needs none of this.
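
How Option A "finds the key automatically" is not spelled out here. A minimal sketch of one plausible approach: check the live environment first, then fall back to the text of .env.local. The precedence order, the function names, and the parsing rules are all assumptions, not the behavior of nb-image.mjs.

```typescript
// Parse KEY=value lines from dotenv-style text. Blank lines and # comments are skipped.
function parseEnvText(text: string): Record<string, string> {
  const out: Record<string, string> = {};
  for (const raw of text.split(/\r?\n/)) {
    const line = raw.trim();
    if (line === "" || line.startsWith("#")) continue;
    const eq = line.indexOf("=");
    if (eq <= 0) continue; // no key before "=", or no "=" at all
    const key = line.slice(0, eq).trim();
    let value = line.slice(eq + 1).trim();
    // Strip one pair of matching surrounding quotes, if present.
    const first = value.charAt(0);
    if (value.length >= 2 && (first === '"' || first === "'") && value.endsWith(first)) {
      value = value.slice(1, -1);
    }
    out[key] = value;
  }
  return out;
}

// Assumed precedence: live environment first, then the .env.local contents.
function resolveKey(
  name: string,
  env: Record<string, string | undefined>,
  envFileText: string,
): string {
  const fromEnv = env[name];
  if (fromEnv) return fromEnv;
  const fromFile = parseEnvText(envFileText)[name];
  if (fromFile) return fromFile;
  throw new Error(`${name} not found in environment or .env.local`);
}
```

In a Node script you would likely pass process.env and the file contents read from Arcanea/.env.local. The key stays out of committed config either way.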

Cinematic video

Interactive frontier video + character consistency from the harness.

Higgsfield

video · mcp

Pending auth

One OAuth endpoint fronting 30+ frontier models — Sora, Veo, Kling, Soul.

Sora 2 · Veo 3.1 · Kling 3.0 · Soul 2.0 · Seedream 4.0 · MiniMax Hailuo 02
Best for
  • Interactive cinematic video from inside the harness
  • Character-consistent B-roll via Soul ID training
  • Reaching Sora / Veo / Kling without juggling three API keys
Not when
High-volume programmatic batch — the subscription-credit model gets expensive, and MCP calls burn credits even in "unlimited" mode.
Auth
OAuth through your Higgsfield account (no API key to manage)
Cost
$15 Starter · $39–49 Plus · $99–129 Ultra · $89/seat Business

MCP generations always consume credits, even when the web UI shows "unlimited".

Install

  1. Register the MCP server

    claude mcp add --transport http higgsfield https://mcp.higgsfield.ai/mcp

    Already registered at project scope in this repo.

  2. Authenticate

    /mcp

    Approve the project-scope server on next launch, then complete the OAuth browser login. Requires a paid Higgsfield plan — MCP calls fail on credits otherwise.

Batch

Cheap, high-volume — catalog runs and bulk B-roll.

fal.ai

both · api

Recommended

Pay-per-use aggregator — the economical lane for batch video and bulk B-roll.

FLUX · Kling · MiniMax Hailuo · Wan · Veo (via fal) · Seedance
Best for
  • Lyric-video batches across the 12k catalog
  • Bulk B-roll where per-generation cost matters
  • Programmatic generation from scripts (one API, many models)
Not when
A single premium hero still — use InfoGenius. Interactive cinematic exploration — use Higgsfield.
Auth
fal.ai API key (FAL_KEY)
Cost
30–80% cheaper than subscription credits on video; ~$0.025–0.05/image

Install

  1. Get a key

    Create FAL_KEY at fal.ai/dashboard/keys and add it to .env.local.

  2. Call via the AI SDK or a thin script wrapper

    pnpm add @fal-ai/client

    Wrap in scripts/lib/fal-image.mjs mirroring nb-image.mjs so the gen router treats it as one more backend.
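
A sketch of what such a wrapper could look like. The interface of nb-image.mjs is not shown on this page, so the GenBackend contract below is hypothetical. The subscribe function is injected rather than imported so the block runs without network access. The response shape (data.images[].url) is an assumption to verify against the fal docs for the model you pick.

```typescript
// Hypothetical backend contract; the real nb-image.mjs interface is not shown here.
interface GenRequest { prompt: string; model: string; }
interface GenResult { backend: string; urls: string[]; }
interface GenBackend { id: string; generate(req: GenRequest): Promise<GenResult>; }

// Minimal view of a fal-style subscribe call: endpoint id in, image URLs out.
// Assumed shape; check it against the client version you install.
type Subscribe = (
  endpoint: string,
  options: { input: { prompt: string } },
) => Promise<{ data: { images?: { url: string }[] } }>;

// Wrap a subscribe function so the router can treat fal.ai as one more backend.
function makeFalBackend(subscribe: Subscribe): GenBackend {
  return {
    id: "fal",
    async generate(req: GenRequest): Promise<GenResult> {
      const result = await subscribe(req.model, { input: { prompt: req.prompt } });
      const urls = (result.data.images ?? []).map((img) => img.url);
      if (urls.length === 0) {
        throw new Error(`fal returned no images for ${req.model}`);
      }
      return { backend: "fal", urls };
    },
  };
}
```

In a real script you would presumably pass the subscribe method from @fal-ai/client, with FAL_KEY loaded from .env.local, and adapt the types as needed. Injecting it also makes the wrapper easy to test with a fake.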

Alt image

A second aesthetic read when the default feels off-lane.

gpt-image-2 (Codex)

image · library

Installed

OpenAI image gen built into Codex CLI via the $imagegen skill — on your ChatGPT plan.

gpt-image-2
Best for
  • Quick visuals while already coding in Codex
  • Strong text-in-image rendering on a sub you already pay for
Not when
Large batches — image turns burn the Codex limit ~3–5× faster. Switch to a per-call API for volume.
Auth
Codex signed in with a ChatGPT plan. Setting OPENAI_API_KEY switches it to paid API pricing.
Cost
✓ No extra API cost: included in the ChatGPT plan; ~3–5× faster limit burn per image turn
Included in
ChatGPT (Free/Go/Plus/Pro/Business) — counts toward Codex usage limits

When paid is worth it: Batches of >250 images, or to avoid eating Codex coding quota → set OPENAI_API_KEY for gpt-image-2 API pricing.

Install

  1. Native in Codex CLI

    $imagegen

    Built in since Apr 2026. Uses the ChatGPT plan by default; API key only for paid/volume.

GPT Image 2

image · api

Available

OpenAI image lane — a second opinion via the key you already have.

gpt-image-2 · gpt-image-1
Best for
  • A different aesthetic read when InfoGenius output feels off-lane
  • Strong text-in-image rendering
  • Teams already standardized on OpenRouter
Not when
Your default hero work — InfoGenius is the in-house premium tier and stays the default.
Auth
OpenRouter key (OPENROUTER_API_KEY — already on this machine)
Cost
OpenRouter per-image pricing

Install

  1. Route through OpenRouter

    OPENROUTER_API_KEY + OPENROUTER_BASE_URL are already set (machine-global LLM policy). Add a scripts/lib/gpt-image.mjs wrapper that posts to the images endpoint.

Frontier video (direct)

The raw models, direct — cheapest per call, most plumbing.

Sora · Veo · Kling (direct)

video · api

Available

The raw frontier video models, direct — cheapest per call, most plumbing.

Sora 2 (OpenAI) · Veo 3.1 (Gemini API) · Kling 3.0 (Kling API)
Best for
  • Squeezing per-call cost on one specific model at high volume
  • A production pipeline already standardized on one provider
Not when
You want one connector + Soul character consistency — Higgsfield fronts all three for far less setup.
Auth
Per-provider API keys (OpenAI / Google / Kling)
Cost
Lowest per-generation, but three separate integrations + no Soul consistency

Install

  1. Only if you outgrow Higgsfield on one model

    Wire the single provider SDK you actually need. Most GenCreator users should reach these through Higgsfield, not directly.
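
Read together, the "Best for" and "Not when" notes on the cards above amount to routing rules. A minimal sketch of that reading follows. The job traits, engine ids, and the batch threshold of 20 are illustrative assumptions, not the repo's actual router.

```typescript
// Illustrative job description; these field names are not from the repo.
interface Job {
  kind: "image" | "video";
  count: number;       // how many generations the job needs
  textHeavy?: boolean; // labels or diagrams that must stay legible
}

type EngineId = "nano-banana-pro" | "infogenius" | "higgsfield" | "fal";

// Assumed cutoff for "batch"; the page gives no exact number.
const BATCH_THRESHOLD = 20;

function pickEngine(job: Job): EngineId {
  // High volume goes to the pay-per-use lane, image or video.
  if (job.count >= BATCH_THRESHOLD) return "fal";
  // Low-volume video is the interactive cinematic case.
  if (job.kind === "video") return "higgsfield";
  // Text-heavy stills want Search grounding for legible labels.
  if (job.textHeavy) return "nano-banana-pro";
  // Everything else is a premium hero still.
  return "infogenius";
}

console.log(pickEngine({ kind: "image", count: 1 }));
console.log(pickEngine({ kind: "video", count: 200 }));
```

The order of the checks is the design choice. Volume is tested first because the cards warn that subscription credits and hero-quality engines both get expensive at batch scale.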

Got your stack wired?

The engines make the pixels. The taste lanes and the gate make them yours — six premium aesthetics, each enforced before anything publishes.

v0.2.0 · registry at lib/gen/backends.ts · shipped 2026-06-03