AI Video Editor Pipeline with Vision LLM Models
-
Updated
Apr 11, 2026 - Python
AI Video Editor Pipeline with Vision LLM Models
A lightweight Model Context Protocol (MCP) server for retrieving local and remote images for LLM vision models, featuring metadata extraction and configurable availability retries.
Claude Code skill — 给无视觉能力的 LLM(DeepSeek/o1/o3)外挂看图能力。Vision proxy for text-only LLMs, supports OpenAI/Anthropic/DashScope/Qwen3-VL.
Alt'Ollama: The app for generating alt-text for image/s using LLMs with image processing support.
Add a description, image, and links to the llm-vision topic page so that developers can more easily learn about it.
To associate your repository with the llm-vision topic, visit your repo's landing page and select "manage topics."