odysseus

Author	SHA1	Message	Date
Lucas Daniel	578f56ab92	fix(vision): recognize Gemma 4 and Phi-4 as vision-capable models (#1704) Gemma 4 and Phi-4 multimodal are natively vision-capable but their Ollama tags ("gemma4:12b", "phi-4", "phi4") did not match any keyword in _VISION_MODEL_KEYWORDS. The image was silently routed to the VL fallback path instead of being passed directly to the model; users saw the model respond to a placeholder like "[VL model unavailable - image not analyzed]" rather than the actual image. Adds "gemma-4"/"gemma4" and "phi-4"/"phi4" to the keyword list, following the existing err-toward-True policy (#124): a text-only variant being treated as vision is the safer failure than dropping a real image. Fixes #1274 (partial: covers the Gemma 4 + Phi-4 case; the OpenRouter/free vision fallback path is a separate issue).	2026-06-03 13:36:50 +09:00
lekt8	583df3dd6a	Recognize gemma3/llama4/mistral-small3.1+/multimodal as vision models (#1430) is_vision_model() classified several genuinely multimodal families as text-only because their names contain neither "vision" nor "vl": Gemma 3 (4b+), Llama 4, Mistral Small 3.1/3.2, and *-multimodal models (e.g. phi-4-multimodal). For those the attached image was stripped before the request, so the model never saw it, hence the "can't read the image" report (issue #1274), common with Ollama tags like gemma3:4b. Add those keywords (plus a generic "multimodal"). Per the file's err-toward-True policy (#124), a rare text-only tag treated as vision is the safer failure than dropping a real image. Guard tests confirm the text-only siblings (gemma2, plain gemma, mistral-small, phi-3) are not over-matched. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-03 04:17:40 +09:00
RosenTomov	a493fb49b0	Use LM Studio-reported vision capability for image passthrough (#1130) Read a model's capabilities.vision flag from LM Studio's native /api/v1/models so vision finetunes whose names lack a vision keyword still receive images, falling back to the name heuristic when the endpoint doesn't report it. The probe is short-TTL cached and restricted to local/LAN hosts, so remote/cloud endpoints are never contacted.	2026-06-02 23:01:04 +09:00
Robin Fröhlich	3c6ae3713e	Models: add Z.AI coding endpoint and GLM vision detection	2026-06-02 20:59:17 +09:00
Håkon Julius Størholt	91d3511580	Recognize local vision models so their images aren't dropped (#185) An image attachment only got through if the model name was on a short built-in list. Anything else was treated as text-only and the image was quietly dropped, so the model never saw it. That left out a lot of the smaller vision models you can run locally (moondream was the one I hit). Pulled the check into is_vision_model() in chat_helpers, broadened it to cover those, and added a test. Models that already worked are unaffected. Fixes #124.	2026-06-01 13:09:21 +09:00
pewdiepie-archdaemon	e5c99a5eee	Odysseus v1.0	2026-05-31 23:58:26 +09:00

6 Commits
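
The commits above describe a name-based vision check that errs toward True, with an optional capability flag reported by a local server taking precedence. A minimal sketch of that idea follows, under stated assumptions: the keyword tuple is illustrative and is not the actual contents of _VISION_MODEL_KEYWORDS, the `reported` parameter and the `is_local_host` helper are hypothetical names, and letting a reported value (True or False) override the name heuristic is one possible reading of "falling back to the name heuristic when the endpoint doesn't report it". No network probe or TTL cache is shown.

```python
import ipaddress

# Illustrative keyword list only; the real _VISION_MODEL_KEYWORDS may differ.
VISION_KEYWORDS = (
    "vision", "vl", "multimodal", "moondream",
    "gemma3", "gemma-3", "gemma4", "gemma-4",
    "llama4", "llama-4",
    "mistral-small3.1", "mistral-small3.2",
    "phi-4", "phi4",
)


def is_local_host(host):
    """Hypothetical helper: True for localhost, loopback, or private (LAN) IPs.

    Hostnames other than "localhost" are not resolved in this sketch, so
    they are treated as remote and would never be probed.
    """
    if host == "localhost":
        return True
    try:
        addr = ipaddress.ip_address(host)
    except ValueError:
        return False  # not an IP literal; assume remote
    return addr.is_loopback or addr.is_private


def is_vision_model(name, reported=None):
    """Decide whether images should be passed through to the model.

    `reported` stands in for a capability flag from a local server; when it
    is None, fall back to a case-insensitive substring match on the name.
    """
    if reported is not None:
        return reported
    lowered = name.lower()
    return any(keyword in lowered for keyword in VISION_KEYWORDS)
```

Substring matching is deliberately loose: a text-only tag that happens to match is treated as vision-capable, which the commit messages describe as the safer failure compared with dropping a real image.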