Update assistants to Fable by hanna-paasivirta · Pull Request #525 · OpenFn/apollo

hanna-paasivirta · 2026-06-11T12:03:57Z

Short Description

Moves the main chat services (job_chat, workflow_chat, global_chat planner + doc_agent_chat prototype) from Sonnet to Claude Fable 5. Keeps RAG helpers, vocab_mapper, and the test judge on Sonnet.

This will increase our model costs, possibly by 5x. It is difficult to evaluate without running a lot of tests because thinking behaviour (i.e. output token volume) will vary between models and calls.

I haven't tested the effects of this update extensively. I skimmed over acceptance tests for the three chat services and tested locally with Lightning to see everything still works and doesn't seem slower. We'll be ready to rollback if there's complaints.

Fixes #524

Implementation details

models.py: add "claude-fable" alias and CLAUDE_FABLE constant for claude-fable-5
Point service configs at claude-fable: rag.yaml (model), gen_project_config.yaml, doc_agent_chat/config.yaml, global_chat/config.yaml (planner). Code fallback defaults updated to match.
Keep on Sonnet: job_chat RAG calls (llm_search_decision, llm_retrieval), vocab_mapper.
Triple max_tokens on Fable routes to absorb the new tokenizer (~30% more tokens): 16384 → 49152 (job_chat, workflow_chat, doc_agent_chat), 8192 → 24576 (planner).
Pass an explicit per-request timeout=httpx.Timeout(600.0, connect=5.0) on the four non-streaming messages.create calls (job_chat, workflow_chat, doc_agent_chat, planner). Required: the SDK rejects non-streaming requests with max_tokens > ~21k unless a timeout is given. Values match the SDK default, so no behaviour change.
Planner effort switched from high to medium.
Remove dead temperature config: unread keys in doc_agent_chat, workflow_chat, and planner configs, plus the planner's unused self.temperature. Live temperature=0 settings (RAG, vocab_mapper, router) are unchanged and stay on Sonnet/Haiku, which accept it.

AI Usage

Please disclose how you've used AI in this work (it's cool, we just want to know!):

You can read more details in our Responsible AI Policy

hanna-paasivirta · 2026-06-11T13:16:35Z

@josephjclark Does billing need to be adjusted for users?

josephjclark

Running the acceptance tests and it's certainly working!

Very hard to get a handle on better or worse. I think I'll release and smoke test staging.

We'll have to look back in a couple of weeks and assess whether the extra cost is worthwhile

hanna-paasivirta added 3 commits June 11, 2026 20:12

adjust tokens temp effort

1612d12

move judges to fable

16cdb65

fix timeout and default model name

3245978

hanna-paasivirta marked this pull request as ready for review June 11, 2026 13:16

hanna-paasivirta requested a review from josephjclark June 11, 2026 13:16

josephjclark approved these changes Jun 11, 2026

View reviewed changes

version: 1.3.1

5f7cbad

josephjclark merged commit e7f0dea into main Jun 11, 2026
2 checks passed

josephjclark deleted the model-update-fable branch June 11, 2026 14:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update assistants to Fable#525

Update assistants to Fable#525
josephjclark merged 4 commits into
mainfrom
model-update-fable

hanna-paasivirta commented Jun 11, 2026 •

edited

Loading

Uh oh!

hanna-paasivirta commented Jun 11, 2026 •

edited

Loading

Uh oh!

josephjclark left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

hanna-paasivirta commented Jun 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Short Description

Implementation details

AI Usage

Uh oh!

hanna-paasivirta commented Jun 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

josephjclark left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

hanna-paasivirta commented Jun 11, 2026 •

edited

Loading

hanna-paasivirta commented Jun 11, 2026 •

edited

Loading