Z.AI (Zhipu) GLM Pricing 2026: API Costs, Plans & Review

Complete guide to Zhipu AI GLM pricing in 2026. Compare GLM-5, GLM-4.7 FlashX, coding subscription plans, CNY pricing, and see if it fits your needs.

Z.AI (Zhipu) GLM Pricing Overview

Zhipu AI offers two pricing models: API pay-as-you-go in USD for global users and CNY-denominated subscription plans for Chinese developers.

API Pricing (Pay-as-you-go)

ModelInput ($/1M tokens)Output ($/1M tokens)Cache Read ($/1M)Context
GLM-5.1$0.828$3.310$0.179200K
GLM-5$0.552$2.483$0.138200K
GLM-5 (CNY)¥3.00¥12.00128K
GLM-4.7 FlashX$0.50$3.00$0.10200K
GLM-4.7 FlashFreeFree128K

Coding Subscription Plans (CNY)

PlanPrice/moRate LimitModels
Coding Lite¥49 ($6.75)80 prompts/5hGLM-4.7/4.6
Coding Pro¥149 ($20.55)400 prompts/5hGLM-5/4.7/4.6
Coding Max¥469 ($64.70)1,600 prompts/5hAll models

What You Get

API access:

  • 200K context on most models
  • OpenAI-compatible API
  • Multimodal on GLM-5 (text, vision, audio)
  • Free GLM-4.7 Flash tier available

Coding subscriptions:

  • Access to GLM-5/4.7 through coding agents
  • Compatible with Cline, Claude Code, OpenCode
  • Counter-based (prompts/5h), not token-based

What Users Are Saying

"GLM-5 is a solid reasoning model — 92.7% on AIME 2026 is impressive. But the API reliability has been hit or miss; we get about 60% success rate during peak hours." — u/ai_researcher, Reddit r/LocalLLaMA

"The Coding Pro plan at ¥149/mo is decent value, but the per-5-hour rate limits are annoying. MiniMax's rolling 5h window is much more developer-friendly." — u/dev_cn, V2EX

"GLM-4.7 Flash being free is great for prototyping. We use it for dev/staging and switch to GLM-5 for production." — u/ml_engineer, Reddit r/LocalLLaMA

Pros & Cons

Pros:

  • Strong reasoning benchmarks (92.7% AIME 2026)
  • Free Flash tier for prototyping
  • CNY subscription plans for Chinese developers
  • MIT-licensed open-weight models

Cons:

  • Reliability issues during peak hours (reported ~60% success rate)
  • Coding plans use counter-based limits (prompts/5h), not flexible
  • Lower global brand recognition
  • Higher priced than MiniMax for comparable quality

Who Should Choose Z.AI (Zhipu)

  • You need strong reasoning — GLM-5 leads in math and science benchmarks
  • You're in China — CNY subscription plans are convenient
  • You want free prototyping — GLM-4.7 Flash is genuinely free
  • You need MIT-licensed open weights — self-host if API reliability is a concern

How Zhipu Compares to Alternatives

Compared toZhipu AdvantageAlternative Advantage
MiniMaxBetter reasoning (GLM-5)Cheaper, more reliable API, better coding
DeepSeek V4Stronger reasoning benchmarksCheaper, more reliable, open source
OpenAI GPT-5Cheaper, open weightsBetter quality, reliable API

Verdict

Z.AI (Zhipu) GLM-5 is a strong reasoning model that excels in math and science. Its open-weight license and free Flash tier are genuine advantages. However, reliability concerns and less developer-friendly rate limits make it a secondary choice compared to MiniMax or DeepSeek for most production workloads. Best suited for Chinese developers who need CNY billing and strong reasoning capabilities.

Share this guide

Related Guides