LogoTop AI Hubs

THUDM: GLM Z1 Rumination 32B

Other
Text
Paid

THUDM: GLM Z1 Rumination 32B is a 32B-parameter deep reasoning model from the GLM-4-Z1 series, optimized for complex, open-ended tasks requiring prolonged deliberation. It builds upon glm-4-32b-0414 with additional reinforcement learning phases and multi-stage alignment strategies, introducing “rumination” capabilities designed to emulate extended cognitive processing. This includes iterative reasoning, multi-hop analysis, and tool-augmented workflows such as search, retrieval, and citation-aware synthesis. The model excels in research-style writing, comparative analysis, and intricate question answering. It supports function calling for search and navigation primitives (`search`, `click`, `open`, `finish`), enabling use in agent-style pipelines. Rumination behavior is governed by multi-turn loops with rule-based reward shaping and delayed decision mechanisms, benchmarked against Deep Research frameworks such as OpenAI’s internal alignment stacks. This variant is suitable for scenarios requiring depth over speed.

Parameters

32B

Context Window

32,000

tokens

Input Price

$0.24

per 1M tokens

Output Price

$0.24

per 1M tokens

Capabilities

Model capabilities and supported modalities

Performance

Reasoning

Excellent reasoning capabilities with strong logical analysis

Math

-

Coding

-

Knowledge

-

Modalities

Input Modalities

text

Output Modalities

text

LLM Price Calculator

Calculate the cost of using this model

$0.000360
$0.000720
Input Cost:$0.000360
Output Cost:$0.000720
Total Cost:$0.001080
Estimated usage: 4,500 tokens

Monthly Cost Estimator

Based on different usage levels

Light Usage
$0.0048
~10 requests
Moderate Usage
$0.0480
~100 requests
Heavy Usage
$0.4800
~1000 requests
Enterprise
$4.8000
~10,000 requests
Note: Estimates based on current token count settings per request.
Last Updated: 2025/05/06