Skip to content

feat: add MiniMax as LLM provider with M3 default#1568

Open
octo-patch wants to merge 3 commits into
microsoft:mainfrom
octo-patch:feature/add-minimax-provider
Open

feat: add MiniMax as LLM provider with M3 default#1568
octo-patch wants to merge 3 commits into
microsoft:mainfrom
octo-patch:feature/add-minimax-provider

Conversation

@octo-patch

@octo-patch octo-patch commented Mar 15, 2026

Copy link
Copy Markdown

Summary

Add MiniMax as a new LLM provider for JARVIS/HuggingGPT, alongside the existing OpenAI and Azure OpenAI options. MiniMax provides an OpenAI-compatible chat completions API with models offering up to 512K context length on the M3 flagship.

Changes

  • Provider detection (awesome_chat.py): Added MiniMax to the provider priority chain (local > azure > minimax > openai) with proper API endpoint construction, key resolution (config file or MINIMAX_API_KEY env var), and temperature clamping
  • Token configuration (get_token_ids.py): Registered MiniMax-M3 (512K context, default), MiniMax-M2.7, and MiniMax-M2.7-highspeed (204K context) with cl100k_base encoding. Older MiniMax-M2.5* models have been removed.
  • Ready-to-use config (config.minimax.yaml): Pre-configured YAML with MiniMax-M3 as the default model
  • Documentation (README.md): Added MiniMax as a supported LLM provider with setup instructions, marking M3 as the default

Default Model

MiniMax-M3 — Latest flagship model with 512K context window and enhanced reasoning / coding / image-input capabilities.

Available MiniMax Models

Model ID Context Notes
MiniMax-M3 512K Default. Flagship model, supports image input, 128K max output.
MiniMax-M2.7 204K Previous flagship.
MiniMax-M2.7-highspeed 204K Faster variant of M2.7.

Testing

  • Verified Python syntax and model registration
  • Confirmed MiniMax-M3 is the default model in config.minimax.yaml
  • All remaining models preserved as alternatives

Add MiniMax (MiniMax-M2.5, MiniMax-M2.5-highspeed) as a new LLM provider
option alongside OpenAI and Azure OpenAI. MiniMax offers an
OpenAI-compatible API with up to 204K context length.

Changes:
- Add MiniMax provider detection and API endpoint construction in
  awesome_chat.py with priority: local > azure > minimax > openai
- Handle MiniMax temperature constraint (must be > 0) by adjusting
  zero values to 0.01
- Add MiniMax model encodings and context lengths in get_token_ids.py
- Create config.minimax.yaml template for MiniMax configuration
- Update README.md with MiniMax setup instructions
@octo-patch

Copy link
Copy Markdown
Author

@microsoft-github-policy-service agree

- Add MiniMax-M2.7 and MiniMax-M2.7-highspeed to model list
- Set MiniMax-M2.7 as default model in config and docs
- Keep all previous models (M2.5, M2.5-highspeed) as alternatives
- Update token encodings and max context length for new models
@octo-patch octo-patch changed the title feat: add MiniMax as LLM provider feat: add MiniMax as LLM provider with M2.7 default Mar 18, 2026
@mrjatuporn2528-ctrl

Copy link
Copy Markdown

#2028

- Add MiniMax-M3 as the new default model (512K context window)
- Keep MiniMax-M2.7 and MiniMax-M2.7-highspeed as alternatives
- Remove older MiniMax-M2.5 and MiniMax-M2.5-highspeed models
- Update README.md and config.minimax.yaml to reflect new default
@octo-patch octo-patch changed the title feat: add MiniMax as LLM provider with M2.7 default feat: add MiniMax as LLM provider with M3 default Jun 2, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants