Benchmark Replication: CVTG-2K-Style Cases + Downloadable Prompts
2026/01/02

Recreate the key “text-in-image” tests (CVTG-2K style) with prompts you can copy, run, and compare across models.

What you're trying to measure

You're not measuring "beauty." You're measuring:

  • exact word correctness
  • layout stability across multiple text regions
  • long-form text consistency

The GLM-Image repo reports a CVTG-2K Word Accuracy of 0.9116, along with LongText-Bench metrics, which places it among the stronger models for text rendering. (GitHub)
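
To make "exact word correctness" concrete, here is a minimal sketch of a word-level score, assuming you already have OCR text for each generated image. It is a simplified bag-of-words comparison, not the official CVTG-2K Word Accuracy implementation, and `tokenize` and `word_accuracy` are hypothetical helper names chosen for this example.

```python
import string
from collections import Counter


def tokenize(text, strict=False):
    """Split text into words.

    strict=True keeps case and punctuation (useful for dialog prompts).
    strict=False lowercases and strips leading/trailing ASCII punctuation.
    """
    words = text.split()
    if strict:
        return words
    stripped = (w.strip(string.punctuation).lower() for w in words)
    return [w for w in stripped if w]


def word_accuracy(expected, recognized, strict=False):
    """Fraction of expected words that also appear in the recognized text.

    Words are matched as a multiset, so a word expected twice must be
    recognized twice to count fully. Word order is ignored.
    """
    want = Counter(tokenize(expected, strict))
    got = Counter(tokenize(recognized, strict))
    total = sum(want.values())
    if total == 0:
        return 1.0  # nothing was expected, so nothing can be wrong
    matched = sum((want & got).values())
    return matched / total


# One misspelled word out of three.
print(word_accuracy("NEW SEASON ARRIVALS", "NEW SEASON ARIVALS"))

# A dropped question mark only counts as an error in strict mode.
print(word_accuracy("Where are we going?", "Where are we going", strict=True))  # 0.75
print(word_accuracy("Where are we going?", "Where are we going"))  # 1.0
```

Strict mode is the better fit for the dialog prompts, where punctuation is part of the test. Lenient mode suits headlines where only spelling matters.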

The test suite (12 prompts)

Copy these prompts and run them across models. Four are shown here, one for each category (ads, menus, long text, dialog).

A) Multi-region ad (3 regions)

Ad layout with three text areas. Top headline: "NEW SEASON ARRIVALS". Center badge: "UP TO 40% OFF". Bottom CTA: "SHOP NOW". Clean kerning, aligned baselines, no typos.

B) Price grid (menus)

Two-column menu with right-aligned prices: "Latte — $4.25", "Mocha — $4.75", "Tea — $3.00", "Croissant — $3.50". No extra items.

C) Long paragraph (hard mode)

A poster with a text block that must be readable: "This weekend only: free shipping on all orders over $50. Limited quantities available. Terms apply." Ensure every word is correct and not distorted.

D) Dialog bubbles

Comic panel with two speech bubbles. Bubble 1: "Where are we going?" Bubble 2: "Downtown, five minutes." Keep punctuation correct.

(…you can extend this set to 30–50 items and make a downloadable prompt pack on your site.)
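
One way to package the prompts for download is a small JSON file that stores each prompt alongside the exact strings it asks for. The sketch below assumes the target text is always wrapped in straight double quotes, as in the prompts above. The field names (`id`, `category`, `prompt`, `expected_text`) and the helper `build_pack` are illustrative choices, not an established format.

```python
import json
import re

# (category, prompt) pairs; two of the prompts above are used as samples.
PROMPTS = [
    ("ads", 'Ad layout with three text areas. Top headline: "NEW SEASON ARRIVALS". '
            'Center badge: "UP TO 40% OFF". Bottom CTA: "SHOP NOW". '
            'Clean kerning, aligned baselines, no typos.'),
    ("dialog", 'Comic panel with two speech bubbles. Bubble 1: "Where are we going?" '
               'Bubble 2: "Downtown, five minutes." Keep punctuation correct.'),
]


def build_pack(prompts):
    """Turn (category, prompt) pairs into records with extracted target strings."""
    pack = []
    for index, (category, prompt) in enumerate(prompts, start=1):
        pack.append({
            "id": f"{category}-{index:02d}",
            "category": category,
            "prompt": prompt,
            # Every double-quoted span is treated as text the image must contain.
            "expected_text": re.findall(r'"([^"]*)"', prompt),
        })
    return pack


pack = build_pack(PROMPTS)
pack_json = json.dumps(pack, indent=2, ensure_ascii=False)
print(pack_json)
```

Writing `pack_json` to a file such as `prompt_pack.json` gives readers a single download. Keeping `expected_text` next to each prompt means a scoring script does not need to parse the prompt again.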

How to publish results (SEO-friendly)

  • One page per benchmark category (Ads / Menus / LongText / Dialog)
  • Each page: prompt, parameters, output, error analysis, and comparison charts
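
The per-page structure above could be backed by one record per generated image, grouped by category and model to feed the comparison charts. This is a sketch under assumptions: the `Result` fields and `summarize` helper are hypothetical, the model names are placeholders, and the scores are made-up sample values, not measurements.

```python
from collections import defaultdict
from dataclasses import dataclass, field
from statistics import mean


@dataclass
class Result:
    category: str          # e.g. "ads", "menus", "longtext", "dialog"
    model: str             # placeholder model label
    prompt: str
    parameters: dict       # whatever generation settings were used
    output_path: str       # where the generated image was saved
    score: float           # word-level accuracy for this image, 0.0 to 1.0
    errors: list = field(default_factory=list)  # notes for the error analysis


def summarize(results):
    """Mean score per (category, model) pair, ready for a comparison chart."""
    buckets = defaultdict(list)
    for r in results:
        buckets[(r.category, r.model)].append(r.score)
    return {key: mean(scores) for key, scores in buckets.items()}


# Made-up sample values, only to show the shape of the data.
results = [
    Result("ads", "model-a", "prompt text", {"seed": 1}, "out/a1.png", 1.0),
    Result("ads", "model-a", "prompt text", {"seed": 2}, "out/a2.png", 0.5,
           errors=["'ARRIVALS' rendered as 'ARIVALS'"]),
    Result("ads", "model-b", "prompt text", {"seed": 1}, "out/b1.png", 1.0),
]

for (category, model), avg in sorted(summarize(results).items()):
    print(f"{category:8s} {model:8s} {avg:.2f}")
```

Running each prompt with several seeds and averaging, as in the sample above, keeps one lucky or unlucky generation from deciding the comparison.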

Author

GLMImage.blog

Categories

  • Benchmarking & Testing
  • GLM-Image