chat/completions endpoint by selecting a base chat model and an image generation model using a simple suffix syntax.
How it works
- Base model: Your chat model (e.g.,
claude-sonnet-4-20250514) - Image model: Your image generator (e.g.,
flux-pro) - Suffix syntax:
baseModel@imageModel
claude-sonnet-4-20250514@flux-pro
When you use a suffixed model with chat/completions, the assistant can respond with content that includes generated image(s) inline with its text, keeping the flow immersive for roleplay and storytelling scenarios.
Example request
- The base chat model drives conversation quality and reasoning.
- The image model controls visual style and fidelity.
- The response may include image URLs or Markdown image formatting inside the assistant message content.
SillyTavern example
Here’s how this looks in SillyTavern when using a suffixed model:
Tips
- Be descriptive: Specify subject, style, lighting, composition, and mood.
- Stay consistent: Reuse key style terms across turns for continuity.
- Guide placement: Ask the assistant where to insert images within the story.
