feat: add ollama/qwen3.5-9b-q4_k_m model params by zunami · Pull Request #37 · mnfst/modelparams.dev

zunami · 2026-05-27T06:34:36Z

Adds parameter definitions for qwen3.5:9b-q4_K_M running via Ollama.

Intentionally omits thinking.type — Ollama reports capabilities: ["thinking"]
via /api/show, but does not accept the Anthropic-style thinking API parameter.
Sending it causes "Failed to reach upstream provider" errors.

Ollama handles thinking internally via prompt prefix (/no_think), not via
API parameters.

vercel · 2026-05-27T06:34:41Z

@zunami is attempting to deploy a commit to the Manifest Team on Vercel.

A member of the Team first needs to authorize it.

guillaumegay13 · 2026-06-29T09:39:07Z

Thanks for the contribution, @zunami — and sorry for the slow look! 🙏 A few things to sort before this can go in:

1. Provider should be the maker, not the runtime. The catalog is keyed by who makes the model, not how it's served — so Ollama-served models live under their maker. Qwen is Alibaba, and we already track it at models/alibaba/ (qwen3.5.yaml, qwen3.5-flash.yaml, …). A new top-level ollama provider would be the first gateway/runtime in the tree, which we've been avoiding (same reason OpenRouter/OpenCode-Go aren't providers here).

2. Param surface. The file mixes OpenAI-compatible params (max_tokens, top_p) with Ollama-native ones (num_ctx, top_k). Worth picking one surface — and note Ollama's own max-output option is num_predict, not max_tokens. Also missing the schema header the other files use:

# yaml-language-server: $schema=https://modelparams.dev/api/v1/schema.json

3. CI is red + branch is stale. It needs a rebase on main, and the generated package files have to be regenerated and committed (npm run codegen --workspace=modelparams) or the "generated files in sync" build fails.

One open question worth your thoughts: since alibaba/qwen3.5 already exists, is a separate entry for the local quantized :9b-q4_K_M tag adding configurable params that the existing entry doesn't cover? If the knobs are the same, it might be redundant; if the quant exposes genuinely different Ollama options, that's the case to make. Happy to help reshape it whichever way you'd like to take it. 🙂

Create qwen3.5-9b-q4_k_m.yaml

c976c7a

github-actions Bot added the model Add a model that's missing label May 27, 2026

zunami mentioned this pull request May 27, 2026

Bug: Ollama models with thinking capability receive Anthropic-style thinking parameter — causes upstream failure mnfst/manifest#2035

Open

Merge branch 'main' into main

863fffe

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add ollama/qwen3.5-9b-q4_k_m model params#37

feat: add ollama/qwen3.5-9b-q4_k_m model params#37
zunami wants to merge 2 commits into
mnfst:mainfrom
zunami:main

zunami commented May 27, 2026

Uh oh!

vercel Bot commented May 27, 2026

Uh oh!

guillaumegay13 commented Jun 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

zunami commented May 27, 2026

Uh oh!

vercel Bot commented May 27, 2026

Uh oh!

guillaumegay13 commented Jun 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants