AI Image Generation

Skill Verified Active

Generate AI images with GPT-Image-2, FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: GPT-Image-2, FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image, gpt image, openai image, chatgpt image

Purpose

Generate a wide variety of AI images for artistic, professional, or marketing purposes using a diverse set of advanced AI models through a single, convenient CLI interface.

Features

Generate images with 50+ AI models
Supports text-to-image, image-to-image, inpainting, LoRA, and editing
Offers upscaling and text rendering capabilities
Integrates with the inference.sh CLI (`belt`)

Use Cases

Creating AI art and illustrations
Generating product mockups and concept art
Producing social media graphics and marketing visuals
Experimenting with different AI image generation models

Non-Goals

Directly embedding or managing AI models
Providing a graphical user interface for image generation
Replacing the inference.sh CLI itself

Workflow

Login to the inference.sh CLI (`belt login`).
Select an AI model and specify generation parameters (e.g., prompt, aspect ratio).
Execute the image generation command using `belt app run`.
Review the generated image output.

Prerequisites

inference.sh CLI (`belt`) installed and logged in

Documentation

info:Configuration & parameter referenceWhile the documentation lists models and examples, specific CLI parameters beyond the basic input structure are not explicitly documented with defaults, requiring users to refer to external inference.sh documentation.

Execution

info:ValidationInput validation is handled by the `belt` CLI and the underlying inference.sh service, which is not explicitly detailed within the skill's markdown.

Code Execution

info:Error HandlingError handling is primarily managed by the `belt` CLI and the inference.sh service. The skill's markdown does not detail specific error reporting or recovery steps.

Errors

info:Actionable error messagesError messages would primarily come from the `belt` CLI or the inference.sh service. While the skill itself doesn't detail specific error paths, the underlying CLI is expected to provide them.

Practical Utility

info:Edge casesWhile the skill lists many models and capabilities, specific edge cases, failure modes, or limitations of the inference.sh CLI within this skill's context are not explicitly documented.

Installation

npx skills add inferen-sh/skills

Runs the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.

Quality Score

Verified

97 /100

Analyzed about 15 hours ago

Trust Signals

Last commitabout 20 hours ago

GitHub owner inferen-sh

Stars433

LicenseMIT

Status

View Source

Similar Extensions

Openclaw

100

Generate images and videos from text with multi-provider routing — supports GPT Image 2.0 (near-perfect text rendering), Nanobanana 2, Seedream 5.0, Midjourney V8.1 (unified photorealistic + anime), Flux 2 Klein (cheap drafts), Seedance 2.0 / Happyhorse 1.0 / Veo 3.1 video, and local ComfyUI workflows. Includes 1,446 curated prompts and style-aware prompt enhancement. Use when users want to create images/videos, design assets, animate photos, enhance prompts, or manage AI art workflows. NOT for: generic chat, code generation, document writing, video editing of existing footage, audio/TTS, or any task unrelated to AI image/video creation.

Skill

jau123

Qwen Image 2 Pro

Generate images with Alibaba Qwen-Image-2.0-Pro via inference.sh CLI. Professional text rendering, fine-grained realism, enhanced semantic adherence. Ideal for posters, banners, and text-heavy designs. Triggers: qwen image pro, qwen-image-pro, qwen 2 pro, alibaba image pro, dashscope pro, professional text rendering

Skill

inferen-sh

Image Upscaling

Upscale and enhance images with Real-ESRGAN, Thera, Topaz, FLUX Upscaler via inference.sh CLI. Models: Real-ESRGAN, Thera (any size), FLUX Dev Upscaler, Topaz Image Upscaler. Use for: enhance low-res images, upscale AI art, restore old photos, increase resolution. Triggers: upscale image, image upscaler, enhance image, increase resolution, real esrgan, ai upscale, super resolution, image enhancement, upscaling, enlarge image, higher resolution, 4k upscale, hd upscale

Skill

inferen-sh

Trader Regime

100

Detect current market regime using npx neural-trader — bull/bear/ranging/volatile classification with recommended strategy

Skill

ruvnet

Setup

100

Use first for install/update routing — sends setup, doctor, or MCP requests to the correct OMC setup flow

Skill

Yeachan-Heo

Project Session Manager

100

Worktree-first dev environment manager for issues, PRs, and features with optional tmux sessions

Skill

Yeachan-Heo