ZAYA1-8B

Efficient small language model for mathematical and coding problems.

About

ZAYA1-8B is an open-source small MoE model by Zyphra that pushes the frontier of intelligence density through its novel architecture and unique test-time compute inference. ZAYA1-8B excels at challenging mathematical and coding tasks that require deep reasoning expertise.

At under 1 billion active parameters, ZAYA1-8B performs strongly on reasoning, mathematics and coding benchmarks, matching or exceeding the performance of models many times its size such as Mistral-Small-4-119B, and remaining competitive with substantially larger first-generation frontier reasoning models such as DeepSeek-R1-0528, Gemini-2.5-Pro and Claude 4.5 Sonnet. With our novel Markovian-RSA test-time compute methodology, we achieve significant additional performance gains — exceeding GPT-5-High on HMMT'25 (89.6 vs 88.3) and closing in on frontier open-weight models such as DeepSeek-V3.2 on mathematics benchmarks.

Performance

ZAYA1-8B also performs competitively against recent SOTA OS models in the same weight class and against many substantially larger Western OS models across a wide range of evaluations such as mathematics (AIME and HMMT), coding (LCB), reasoning and knowledge retrieval (GPQA-Diamond) and instruction following (IFEval and IFBench). 

ZAYA1-8B vs leading open-weights models on a variety of evals.

API Usage
from openai import OpenAI

client = OpenAI(
    api_key="<YOUR_ZYPHRA_API_KEY>",
    base_url="https://api.zyphracloud.com/api/v1",
)

response = client.chat.completions.create(
    model="zyphra/ZAYA1-8B",
    messages=[{"role": "user", "content": "Hello!"}],
)

print(response.choices[0].message.content)

Provider

Provider

Provider

Release Date

Release Date

Release Date

Model License

Model License

Model License

Model Architecture

Model Architecture

Model Architecture

Mixture-of-experts

Mixture-of-experts

Mixture-of-experts

Total parameters

Total parameters

Total parameters

Activated parameters

Activated parameters

Activated parameters

Context length

Context length

Context length

Input modality

Input modality

Input modality

Output modality

Output modality

Output modality

Input price

Input price

Input price

Cached input price

Cached input price

Cached input price

Output price

Output price

Output price

Zyphra

Zyphra

Zyphra

Apache 2.0

Apache 2.0

Apache 2.0

Transformer

Transformer

Transformer

Yes

Yes

Yes

8B

8B

8B

760M

760M

760M

128K

128K

128K

Text

Text

Text

Text

Text

Text

$0.00

$0.00

$0.00

$0.00

$0.00

$0.00

$0.00

$0.00

$0.00

© 2026 Zyphra Technologies Inc. All rights reserved.

© 2026 Zyphra Technologies Inc. All rights reserved.