Logoglmimage.blog
HomeBlogGuidesPrompts
Benchmark Replication: CVTG-2K-Style Cases + Downloadable Prompts
2026/01/17

Benchmark Replication: CVTG-2K-Style Cases + Downloadable Prompts

Recreate the key “text-in-image” tests (CVTG-2K style) with prompts you can copy, run, and compare across models.

What you're trying to measure

You're not measuring "beauty." You're measuring:

  • exact word correctness
  • layout stability across multiple text regions
  • long-form text consistency

The GLM-Image repo reports CVTG-2K Word Accuracy 0.9116 and LongText-Bench metrics, positioning it strongly on text rendering. (GitHub)

The test suite (12 prompts)

Copy these prompts and run them across models.

A) Multi-region ad (3 regions)

Ad layout with three text areas. Top headline: "NEW SEASON ARRIVALS". Center badge: "UP TO 40% OFF". Bottom CTA: "SHOP NOW". Clean kerning, aligned baselines, no typos.

B) Price grid (menus)

Two-column menu with right-aligned prices: "Latte — $4.25", "Mocha — $4.75", "Tea — $3.00", "Croissant — $3.50". No extra items.

C) Long paragraph (hard mode)

A poster with a text block that must be readable: "This weekend only: free shipping on all orders over $50. Limited quantities available. Terms apply." Ensure every word is correct and not distorted.

D) Dialog bubbles

Comic panel with two speech bubbles. Bubble 1: "Where are we going?" Bubble 2: "Downtown, five minutes." Keep punctuation correct.

(…you can extend this set to 30–50 items and make a downloadable prompt pack on your site.)

How to publish results (SEO-friendly)

  • One page per benchmark category (Ads / Menus / LongText / Dialog)
  • Each page: Prompt, parameters, output, error analysis, comparison charts
All Posts

Author

avatar for GLMImage.blog
GLMImage.blog

Categories

  • Benchmarking & Testing
  • GLM-Image
What you're trying to measureThe test suite (12 prompts)A) Multi-region ad (3 regions)B) Price grid (menus)C) Long paragraph (hard mode)D) Dialog bubblesHow to publish results (SEO-friendly)

More Posts

fal.ai Hosted GLM-Image: Production Integration Checklist

fal.ai Hosted GLM-Image: Production Integration Checklist

Deploy GLM-Image without managing GPUs—fal.ai API examples, latency considerations, and a production checklist.

avatar for GLMImage.blog
GLMImage.blog
2026/01/16
ComfyUI Status Tracker: When Native Support Lands

ComfyUI Status Tracker: When Native Support Lands

Track GLM-Image support in ComfyUI—where to watch, what “native support” means, and stopgap workflows until it lands.

avatar for GLMImage.blog
GLMImage.blog
2026/01/16
The AR + Diffusion Hybrid Explained (With Diagrams)

The AR + Diffusion Hybrid Explained (With Diagrams)

GLM-Image uses autoregressive planning for layout + diffusion decoding for pixel fidelity. Here's the intuition, diagrams, and what it means for text rendering.

avatar for GLMImage.blog
GLMImage.blog
2026/01/15
Logoglmimage.blog

The unofficial community resource for GLM-Image.

Resources

  • Guides
  • Prompts
  • Blog
  • Feedback

Legal

  • Cookie Policy
  • Privacy Policy
  • Terms of Service

© 2026. The unofficial community resource for GLM-Image.

GitHubGitHub