Ling-2.6-flash (free)
inclusionAI · Ling · Free · Released 2026-04-21


262K context · Free · 104B total, 7.4B active

Ling-2.6-flash is an instant instruct model developed by inclusionAI, featuring 104 billion total parameters and 7.4 billion active parameters. It is optimized for real-world agents requiring fast responses, strong execution, and high token efficiency. The model delivers performance comparable to state-of-the-art models at a similar scale while significantly reducing token usage, making it suitable for coding, document processing, and lightweight agent workflows.

What is Ling-2.6-flash (free)?

Ling-2.6-flash (free) is an AI model from inclusionAI that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare Ling-2.6-flash (free) against other models for agent workflows and production deployments.


Architecture & Specifications
Parameters
104B total, 7.4B active
Tokenizer
Other
Released
2026-04-21
Modalities
Input
text
Output
text
Supported Parameters
frequency_penalty, max_tokens, presence_penalty, repetition_penalty, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_k, top_p
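The supported parameters above follow the OpenAI-compatible chat completions schema used by OpenRouter. A minimal sketch of a request payload using those knobs, assuming a hypothetical model ID (the actual ID is not shown on this page) and illustrative parameter values:

```python
# Sketch of an OpenAI-compatible chat completions payload built from the
# parameters this model card lists as supported. MODEL_ID is a hypothetical
# placeholder; check the provider for the real identifier.
MODEL_ID = "inclusionai/ling-2.6-flash:free"  # hypothetical

def build_payload(prompt: str) -> dict:
    """Assemble a request body using the listed sampling and control knobs."""
    return {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 512,           # cap completion length
        "temperature": 0.7,          # sampling temperature
        "top_p": 0.95,               # nucleus sampling
        "top_k": 40,                 # top-k sampling
        "frequency_penalty": 0.0,
        "presence_penalty": 0.0,
        "repetition_penalty": 1.05,
        "seed": 42,                  # best-effort reproducibility
        "stop": ["\n\n"],            # stop sequences
        "response_format": {"type": "json_object"},  # structured output
    }

payload = build_payload("Summarize this document as JSON.")
```

In practice this payload would be POSTed to an OpenAI-compatible `/chat/completions` endpoint with an `Authorization: Bearer <key>` header.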
Strengths
  • Fast response times
  • Strong execution capabilities
  • High token efficiency
  • Performance comparable to state-of-the-art models
  • Optimized for lightweight agent workflows
Limitations
  • Limited information on training data
  • No mention of architecture specifics
  • Benchmarks indicate weaknesses in research-level physics reasoning (CritPt: 0.0%)
  • Lower performance in economically valuable tasks (GDPval-AA: 14.2%)
  • Moderate accuracy on the AA-Omniscience knowledge benchmark (Accuracy: 15.4%)
Recommended Use Cases
Coding tasks
Document processing
Lightweight agent workflows
Scientific computing with Python
Conversational AI in dual-control scenarios
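Since the card lists `tools` and `tool_choice` among the supported parameters, a lightweight agent workflow can be sketched as a tool definition plus a local dispatch step. The tool name, schema, and document store below are illustrative assumptions, not taken from the model card:

```python
import json

# Illustrative tool definition in the OpenAI-compatible function-calling
# format, as it would be passed in the "tools" request parameter.
TOOLS = [{
    "type": "function",
    "function": {
        "name": "read_document",
        "description": "Return the text of a named document.",
        "parameters": {
            "type": "object",
            "properties": {"name": {"type": "string"}},
            "required": ["name"],
        },
    },
}]

# Toy document store standing in for a real data source.
DOCS = {"notes.txt": "Ling-2.6-flash: 104B total, 7.4B active parameters."}

def dispatch(tool_call: dict) -> str:
    """Route a model-emitted tool call to a local implementation."""
    if tool_call["name"] == "read_document":
        args = json.loads(tool_call["arguments"])
        return DOCS.get(args["name"], "")
    raise ValueError(f"unknown tool: {tool_call['name']}")

# Simulated tool call, shaped like the function-call object a model
# response would contain.
result = dispatch({"name": "read_document",
                   "arguments": json.dumps({"name": "notes.txt"})})
```

The result string would be appended to the conversation as a `tool`-role message before the next model turn, closing the agent loop.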

Data enriched Apr 24, 2026. Pricing from OpenRouter API.