Home Blog Guides Prompts

Benchmark Replication: CVTG-2K-Style Cases + Downloadable Prompts

2026/01/02

Benchmark Replication: CVTG-2K-Style Cases + Downloadable Prompts

Recreate the key “text-in-image” tests (CVTG-2K style) with prompts you can copy, run, and compare across models.

What you're trying to measure

You're not measuring "beauty." You're measuring:

exact word correctness
layout stability across multiple text regions
long-form text consistency

The GLM-Image repo reports CVTG-2K Word Accuracy 0.9116 and LongText-Bench metrics, positioning it strongly on text rendering. (GitHub)

The test suite (12 prompts)

Copy these prompts and run them across models.

A) Multi-region ad (3 regions)

Ad layout with three text areas. Top headline: "NEW SEASON ARRIVALS". Center badge: "UP TO 40% OFF". Bottom CTA: "SHOP NOW". Clean kerning, aligned baselines, no typos.

B) Price grid (menus)

Two-column menu with right-aligned prices: "Latte — $4.25", "Mocha — $4.75", "Tea — $3.00", "Croissant — $3.50". No extra items.

C) Long paragraph (hard mode)

A poster with a text block that must be readable: "This weekend only: free shipping on all orders over $50. Limited quantities available. Terms apply." Ensure every word is correct and not distorted.

D) Dialog bubbles

Comic panel with two speech bubbles. Bubble 1: "Where are we going?" Bubble 2: "Downtown, five minutes." Keep punctuation correct.

(…you can extend this set to 30–50 items and make a downloadable prompt pack on your site.)

How to publish results (SEO-friendly)

One page per benchmark category (Ads / Menus / LongText / Dialog)
Each page: Prompt, parameters, output, error analysis, comparison charts

Author

GLMImage.blog

Categories

Benchmarking & Testing
GLM-Image

What you're trying to measure The test suite (12 prompts)A) Multi-region ad (3 regions)B) Price grid (menus)C) Long paragraph (hard mode)D) Dialog bubbles How to publish results (SEO-friendly)

More Posts

GLM-Image vs SDXL: Why Text Rendering is the New Frontier

GLM-Image vs SDXL: Why Text Rendering is the New Frontier

A side-by-side comparison of text fidelity in complex layout generation. See why GLM-Image's AR stage outperforms traditional diffusion-only models.

GLM-Image Layout Keywords Cheatsheet: Master Spatial Control in Prompts

GLM-Image Layout Keywords Cheatsheet: Master Spatial Control in Prompts

Complete guide to layout keywords for GLM-Image: left, center, right, grid, multi-region layouts. 10+ copy-paste templates for headers, heroes, bodies, CTAs, and footers.

Mastering the AR Stage: 5 Tips for Complex Poster Layouts

Mastering the AR Stage: 5 Tips for Complex Poster Layouts

How to use spatial prompts to guide the GLM-Image autoregressive planner for professional grade posters.

glmimage.blog

The definitive guide and resource for GLM-Image. Master Zhipu AI's image generation with expert prompts, technical guides, and creative workflows.

Resources

Guides
Prompts
Blog
Feedback

Legal

Cookie Policy
Privacy Policy
Terms of Service

© 2026 • glmimage.blog All rights reserved.