GLM 4.7 Flash
GLM 4.7 Flash is available through Ollama for local agent workflows, with support for text input. As the strongest model in the 30B class, GLM-4.7-Flash offers a new option for lightweight deployment that balances performance and efficiency.
What is GLM 4.7 Flash?
GLM 4.7 Flash is a local model entry from Ollama that Agent Mag tracks for install commands, available tags, modalities, and agent workflow fit. Builders can install it with the Agent Mag CLI and run it through Ollama on their own machine.
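Assuming a standard Ollama setup, installing and running the model from the terminal looks like this (the tag names match the table below; the default tag maps to the q4_K_M quantization):

```shell
# Pull the default build (~19GB download)
ollama pull glm-4.7-flash

# Start an interactive session
ollama run glm-4.7-flash

# Or pull a specific quantization tag, e.g. the higher-precision q8_0 build
ollama pull glm-4.7-flash:q8_0
```

Which tag to pick depends on available memory: the q4_K_M build fits in roughly 19GB, while bf16 needs around 60GB.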
| Tag | Download size | Context window | Input |
|---|---|---|---|
| glm-4.7-flash:latest | 19GB | 198K | text |
| glm-4.7-flash:q4_K_M | 19GB | 198K | text |
| glm-4.7-flash:q8_0 | 32GB | 198K | text |
| glm-4.7-flash:bf16 | 60GB | 198K | text |
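Once a tag is pulled, the model can be called programmatically through Ollama's local HTTP API (by default `ollama serve` listens on port 11434). A minimal sketch, assuming the q4_K_M tag from the table above; the prompt is illustrative:

```python
import json

# Ollama's default local endpoint for one-shot generation
OLLAMA_URL = "http://localhost:11434/api/generate"

def build_request(tag: str, prompt: str) -> dict:
    """Build a generate-request payload for a given quantization tag."""
    return {
        "model": tag,       # e.g. "glm-4.7-flash:q8_0" from the table above
        "prompt": prompt,
        "stream": False,    # return the full response as one JSON object
    }

payload = build_request("glm-4.7-flash:q4_K_M", "Summarize this repository.")
print(json.dumps(payload))
# Send it with e.g. requests.post(OLLAMA_URL, json=payload)
# once `ollama serve` is running locally.
```

Setting `"stream": False` trades incremental tokens for a single JSON response, which is simpler to handle in batch-style agent workflows.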
Related local models
GLM-5.1 is our next-generation flagship model for agentic engineering, with significantly stronger coding capabilities than its predecessor. It achieves state-of-the-art performance on SWE-Bench Pro and leads GLM-5 by a wide margin.
GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture.
A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.
Related content
Compare pricing, local installs, context windows, and modality filters across the full model catalog.
Find frameworks, SDKs, and infrastructure tools that pair with this model in production workflows.
See Agent Mag coverage of model benchmarks, agent frameworks, and deployment patterns.