qwen3.6-35b-a3b-gguf-iq4xs
The default LLM: reasoning, tools, and vision behind an OpenAI-compatible endpoint.
No GPU. No setup. Cancel anytime.
Qwen3.6 35B-A3B is the default model of Sogni Intelligence: a mixture-of-experts LLM with reasoning, native tool calling, and vision, served through an OpenAI-compatible endpoint — most SDKs work by swapping the base URL.
It can also drive every creative model in this catalog: enable sogni_tools and the model gains hosted image, video, and music generation as tool calls billed to the same account.
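As a rough illustration of what native tool calling looks like from the client side, the sketch below uses the standard OpenAI chat-completions `tools` format. It assumes the Sogni endpoint accepts that format unchanged. The `get_gpu_count` tool, the `handlers` map, the `runToolCalls` helper, and the mocked assistant message are all hypothetical. The `sogni_tools` switch itself is not shown, because the request shape for enabling it is not given here.

```javascript
// Tool definition in the OpenAI chat-completions `tools` format.
// `get_gpu_count` is a made-up example tool, not a Sogni-hosted tool.
const tools = [
  {
    type: 'function',
    function: {
      name: 'get_gpu_count',
      description: 'Return the number of GPUs available in a region.',
      parameters: {
        type: 'object',
        properties: { region: { type: 'string' } },
        required: ['region'],
      },
    },
  },
];

// Local implementations, keyed by tool name (illustrative data only).
const handlers = {
  get_gpu_count: ({ region }) => ({ region, gpus: region === 'eu' ? 12 : 0 }),
};

// Turn the assistant's tool_calls into `role: 'tool'` messages to send back.
function runToolCalls(assistantMessage, toolHandlers) {
  return (assistantMessage.tool_calls ?? []).map((call) => {
    const handler = toolHandlers[call.function.name];
    const result = handler
      ? handler(JSON.parse(call.function.arguments))
      : { error: `Unknown tool: ${call.function.name}` };
    return { role: 'tool', tool_call_id: call.id, content: JSON.stringify(result) };
  });
}

// Mocked assistant message, shaped like an OpenAI-format tool-call response.
const mockAssistantMessage = {
  role: 'assistant',
  content: null,
  tool_calls: [
    {
      id: 'call_1',
      type: 'function',
      function: { name: 'get_gpu_count', arguments: '{"region":"eu"}' },
    },
  ],
};

const toolMessages = runToolCalls(mockAssistantMessage, handlers);
console.log(toolMessages);
```

In a real exchange, the `tools` array would be sent alongside `model` and `messages`, and the returned tool messages would be appended to the conversation before the next request.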
On the Sogni API you pay per request in Spark (1 Spark = $0.005). Prefer a flat rate? The Sogni app runs Qwen3.6 35B-A3B and 100+ other models credit-free under fair use.
| Usage | Spark | USD |
|---|---|---|
| 1M input tokens | 60 Spark | $0.30 |
| 1M output tokens | 180 Spark | $0.90 |
| 10K in + 2K out (typical request) | 0.96 Spark | $0.0048 |
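The "typical request" row follows directly from the two per-million rates and the Spark-to-USD conversion. A minimal sketch of that arithmetic, where `estimateCost` is a hypothetical helper and not part of any Sogni SDK:

```javascript
// Rates copied from the pricing table above.
const SPARK_PER_MILLION_INPUT = 60;
const SPARK_PER_MILLION_OUTPUT = 180;
const USD_PER_SPARK = 0.005;

// Hypothetical helper: estimate the cost of one request from its token counts.
function estimateCost(inputTokens, outputTokens) {
  const spark =
    (inputTokens * SPARK_PER_MILLION_INPUT +
      outputTokens * SPARK_PER_MILLION_OUTPUT) /
    1_000_000;
  return { spark, usd: spark * USD_PER_SPARK };
}

// 10K input tokens + 2K output tokens, as in the table's last row.
const typical = estimateCost(10_000, 2_000);
console.log(typical.spark.toFixed(2), typical.usd.toFixed(4)); // 0.96 0.0048
```

Actual billing may round or meter differently; this only reproduces the published figures.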
One Sogni API key reaches every model on the Supernet — call Qwen3.6 35B-A3B by id.
    // Send one chat completion request to the Sogni API.
    const res = await fetch('https://api.sogni.ai/v1/chat/completions', {
      method: 'POST',
      headers: {
        'Content-Type': 'application/json',
        Authorization: `Bearer ${process.env.SOGNI_API_KEY}`,
      },
      body: JSON.stringify({
        model: 'qwen3.6-35b-a3b-gguf-iq4xs',
        messages: [{ role: 'user', content: 'Write a haiku about decentralized GPUs.' }],
      }),
    });
    // Fail loudly on HTTP errors instead of reading an error body as a completion.
    if (!res.ok) throw new Error(`Request failed with status ${res.status}`);
    const { choices } = await res.json();
    console.log(choices[0].message.content);

The same request with curl:

    curl https://api.sogni.ai/v1/chat/completions \
      -H "Content-Type: application/json" \
      -H "Authorization: Bearer $SOGNI_API_KEY" \
      -d '{
        "model": "qwen3.6-35b-a3b-gguf-iq4xs",
        "messages": [{ "role": "user", "content": "Write a haiku about decentralized GPUs." }]
      }'

The endpoint is OpenAI-compatible: point any OpenAI client at https://api.sogni.ai/v1. Full reference at docs.sogni.ai.
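Because only the base URL and the API key change between OpenAI-compatible providers, the request can be built by a small provider-agnostic helper. The sketch below is one way to do that. `buildChatRequest` and `chat` are hypothetical names, and the `'missing-key'` fallback is a placeholder so the example runs without credentials.

```javascript
// Hypothetical helper: builds fetch() arguments for any OpenAI-compatible
// base URL. Only baseUrl and apiKey differ from provider to provider.
function buildChatRequest({ baseUrl, apiKey, model, messages }) {
  return {
    // Strip trailing slashes so both ".../v1" and ".../v1/" work.
    url: `${baseUrl.replace(/\/+$/, '')}/chat/completions`,
    init: {
      method: 'POST',
      headers: {
        'Content-Type': 'application/json',
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({ model, messages }),
    },
  };
}

// Sends the request and returns the first choice's text (not called here).
async function chat(options) {
  const { url, init } = buildChatRequest(options);
  const res = await fetch(url, init);
  if (!res.ok) {
    throw new Error(`Request failed: ${res.status} ${await res.text()}`);
  }
  const { choices } = await res.json();
  return choices[0].message.content;
}

const request = buildChatRequest({
  baseUrl: 'https://api.sogni.ai/v1',
  apiKey: process.env.SOGNI_API_KEY ?? 'missing-key', // placeholder fallback
  model: 'qwen3.6-35b-a3b-gguf-iq4xs',
  messages: [{ role: 'user', content: 'Write a haiku about decentralized GPUs.' }],
});
console.log(request.url);
```

Calling `chat(...)` with the same options performs the request. Keeping the request construction separate makes it easy to inspect or unit-test without network access.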
Most platforms ration credits or rent a cloud GPU by the hour. Sogni gives you a flat monthly price for credit-free fair-use generation, powered by a creator-owned GPU network — or per-render Spark pricing if you'd rather pay as you go via the API.
One flat price in the app. Generate under fair use without a per-image meter.
Image, video, music, and language models in one workspace and one API key.
Prefer pay-as-you-go? Call Qwen3.6 35B-A3B by id and pay in Spark.
Runs on a decentralized GPU network where workers share subscription revenue.
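On the Sogni API (1 Spark = $0.005), Qwen3.6 35B-A3B costs 60 Spark ($0.30) per 1M input tokens and 180 Spark ($0.90) per 1M output tokens, so a typical request of 10K input and 2K output tokens comes to about 0.96 Spark ($0.0048). Alternatively, a flat monthly Sogni plan covers credit-free use under fair use.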
Use it in Sogni Chat, or call the OpenAI-compatible API with model id qwen3.6-35b-a3b-gguf-iq4xs.
You do not need a GPU. Qwen3.6 35B-A3B runs on the Sogni Supernet, a decentralized network of creator GPUs, with no local install or graphics card required.
Create in the app, or build with the API. Your call.