Skip to main content

AI Image Generation

Skill Verified Active

Generate AI images with GPT-Image-2, FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: GPT-Image-2, FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image, gpt image, openai image, chatgpt image

Purpose

Generate a wide variety of AI images for artistic, professional, or marketing purposes using a diverse set of advanced AI models through a single, convenient CLI interface.

Features

  • Generate images with 50+ AI models
  • Supports text-to-image, image-to-image, inpainting, LoRA, and editing
  • Offers upscaling and text rendering capabilities
  • Integrates with the inference.sh CLI (`belt`)

Use Cases

  • Creating AI art and illustrations
  • Generating product mockups and concept art
  • Producing social media graphics and marketing visuals
  • Experimenting with different AI image generation models

Non-Goals

  • Directly embedding or managing AI models
  • Providing a graphical user interface for image generation
  • Replacing the inference.sh CLI itself

Workflow

  1. Login to the inference.sh CLI (`belt login`).
  2. Select an AI model and specify generation parameters (e.g., prompt, aspect ratio).
  3. Execute the image generation command using `belt app run`.
  4. Review the generated image output.

Prerequisites

  • inference.sh CLI (`belt`) installed and logged in

Documentation

  • info:Configuration & parameter referenceWhile the documentation lists models and examples, specific CLI parameters beyond the basic input structure are not explicitly documented with defaults, requiring users to refer to external inference.sh documentation.

Execution

  • info:ValidationInput validation is handled by the `belt` CLI and the underlying inference.sh service, which is not explicitly detailed within the skill's markdown.

Code Execution

  • info:Error HandlingError handling is primarily managed by the `belt` CLI and the inference.sh service. The skill's markdown does not detail specific error reporting or recovery steps.

Errors

  • info:Actionable error messagesError messages would primarily come from the `belt` CLI or the inference.sh service. While the skill itself doesn't detail specific error paths, the underlying CLI is expected to provide them.

Practical Utility

  • info:Edge casesWhile the skill lists many models and capabilities, specific edge cases, failure modes, or limitations of the inference.sh CLI within this skill's context are not explicitly documented.

Installation

npx skills add inferen-sh/skills

Runs the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.

Quality Score

Verified
97 /100
Analyzed about 15 hours ago

Trust Signals

Last commitabout 20 hours ago
Stars433
LicenseMIT
Status
View Source

Similar Extensions

Openclaw

100

Generate images and videos from text with multi-provider routing — supports GPT Image 2.0 (near-perfect text rendering), Nanobanana 2, Seedream 5.0, Midjourney V8.1 (unified photorealistic + anime), Flux 2 Klein (cheap drafts), Seedance 2.0 / Happyhorse 1.0 / Veo 3.1 video, and local ComfyUI workflows. Includes 1,446 curated prompts and style-aware prompt enhancement. Use when users want to create images/videos, design assets, animate photos, enhance prompts, or manage AI art workflows. NOT for: generic chat, code generation, document writing, video editing of existing footage, audio/TTS, or any task unrelated to AI image/video creation.

Skill
jau123

Qwen Image 2 Pro

99

Generate images with Alibaba Qwen-Image-2.0-Pro via inference.sh CLI. Professional text rendering, fine-grained realism, enhanced semantic adherence. Ideal for posters, banners, and text-heavy designs. Triggers: qwen image pro, qwen-image-pro, qwen 2 pro, alibaba image pro, dashscope pro, professional text rendering

Skill
inferen-sh

Image Upscaling

95

Upscale and enhance images with Real-ESRGAN, Thera, Topaz, FLUX Upscaler via inference.sh CLI. Models: Real-ESRGAN, Thera (any size), FLUX Dev Upscaler, Topaz Image Upscaler. Use for: enhance low-res images, upscale AI art, restore old photos, increase resolution. Triggers: upscale image, image upscaler, enhance image, increase resolution, real esrgan, ai upscale, super resolution, image enhancement, upscaling, enlarge image, higher resolution, 4k upscale, hd upscale

Skill
inferen-sh

Trader Regime

100

Detect current market regime using npx neural-trader — bull/bear/ranging/volatile classification with recommended strategy

Skill
ruvnet

Setup

100

Use first for install/update routing — sends setup, doctor, or MCP requests to the correct OMC setup flow

Skill
Yeachan-Heo

Project Session Manager

100

Worktree-first dev environment manager for issues, PRs, and features with optional tmux sessions

Skill
Yeachan-Heo

© 2025 SkillRepo · Find the right skill, skip the noise.