GET /v1/models should list cached models from MACAFM_MLX_MODEL_CACHE #64

## Summary

`GET /v1/models` currently only returns the single model loaded via `-m`. It should also list models available in the `MACAFM_MLX_MODEL_CACHE` directory so clients can discover what's available without loading each model first.

## Motivation

When integrating AFM with benchmark tools (e.g. ToolCall-15), clients need to discover available models programmatically. Currently, the only way to know what models are cached locally is to scan the filesystem directly.

## Proposed behavior

- Scan `MACAFM_MLX_MODEL_CACHE` for `models--<org>--<name>/` subdirectories
- Return them in the `/v1/models` response alongside the currently loaded model
- Distinguish the loaded model from models that are only available on disk, e.g. with a per-entry field such as `loaded: true` vs `loaded: false`
- The loaded model should still appear first in the list
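
The behavior above could be sketched roughly as follows. This is an illustration in Python, not AFM's actual implementation: `list_cached_models` is a hypothetical helper, the mapping from `models--<org>--<name>` to an `<org>/<name>` model id is an assumption, and the `object`/`data` envelope assumes the response keeps an OpenAI-style list shape. Only the `loaded` field and the directory pattern come from this issue.

```python
from pathlib import Path


def list_cached_models(cache_dir, loaded_model=None):
    """Build a /v1/models-style payload (hypothetical helper, not AFM code).

    cache_dir    -- the directory MACAFM_MLX_MODEL_CACHE points to
    loaded_model -- id of the model loaded via -m, or None
    """
    entries = []
    seen = set()

    # The loaded model always comes first in the list.
    if loaded_model:
        entries.append({"id": loaded_model, "object": "model", "loaded": True})
        seen.add(loaded_model)

    root = Path(cache_dir)
    if root.is_dir():
        for child in sorted(root.iterdir()):
            if not child.is_dir():
                continue
            # Expect exactly: models--<org>--<name>
            parts = child.name.split("--", 2)
            if len(parts) != 3 or parts[0] != "models" or not parts[1] or not parts[2]:
                continue
            # Assumption: the model id is "<org>/<name>".
            model_id = f"{parts[1]}/{parts[2]}"
            if model_id in seen:
                continue  # do not list the loaded model twice
            seen.add(model_id)
            entries.append({"id": model_id, "object": "model", "loaded": False})

    return {"object": "list", "data": entries}
```

A server could call this with `os.environ.get("MACAFM_MLX_MODEL_CACHE")` and the id passed to `-m`. A missing or unreadable cache directory simply yields the loaded model alone, which matches today's behavior.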

## Context

Discovered while integrating AFM as a provider in ToolCall-15, a tool-calling benchmark dashboard. Model discovery would enable auto-populating the model selection UI.
