Capability
Text Rendering in GPT Image 2: The Jump in Real Examples
Twelve prompts covering menus, signage, UI labels, comic captions, and dense paragraphs, rendered on 1.5 and on the 2.0 preview, side by side.
gptimage2api editorial, 3 min read
A working theory of why text is hard for diffusion models, what the leaked GPT Image 2 benchmarks actually mean, and how to prompt for text today on GPT Image 1.5 so your pipeline survives the upgrade.
In-image text rendering is the feature that sold GPT Image 1.5. It is also the feature that produces the most Slack screenshots captioned "why is it doing this." Even on the current flagship at $0.005 to $0.20 per image, expect a 5 to 10 percent failure rate on text-heavy prompts.
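To see what that failure rate does to a budget, here is a rough sketch of the arithmetic. It assumes every attempt fails independently with the same probability and that you regenerate until one image passes review; neither assumption comes from a published spec, and the function names are hypothetical helpers, not part of any SDK.

```python
def expected_attempts(failure_rate: float) -> float:
    """Mean number of generations needed to get one acceptable image.

    Assumes independent attempts with a constant failure probability,
    so the attempt count follows a geometric distribution.
    """
    if not 0.0 <= failure_rate < 1.0:
        raise ValueError("failure_rate must be in [0, 1)")
    return 1.0 / (1.0 - failure_rate)


def expected_cost_per_good_image(price_per_image: float, failure_rate: float) -> float:
    """Expected spend per acceptable image when retrying until success."""
    return price_per_image * expected_attempts(failure_rate)


def chance_of_success_within(attempts: int, failure_rate: float) -> float:
    """Probability that at least one of `attempts` generations is acceptable."""
    return 1.0 - failure_rate ** attempts


if __name__ == "__main__":
    # Price and failure-rate endpoints are the ranges quoted in the text above.
    for price in (0.005, 0.20):
        for rate in (0.05, 0.10):
            cost = expected_cost_per_good_image(price, rate)
            print(f"price=${price:.3f} failure={rate:.0%} -> ${cost:.4f} per usable image")
```

Under these assumptions the retry overhead is modest on average, roughly 5 to 11 percent on top of list price. What hurts more in practice is the tail: a batch job needs a review step that catches the failures in the first place.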