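Increase the timeout for the local vision model and stop caching the placeholder results produced by transient VL failures, so a failed request is retried instead of being served from the cache.

Closes #202.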
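
For reviewers, the caching behavior described above can be sketched roughly as follows. This is a minimal illustration, not the code in this change: `describe_with_cache`, `FAILURE_PLACEHOLDER`, `VL_TIMEOUT_SECONDS`, and the 120-second value are all hypothetical names and numbers chosen for the example, and the set of exceptions treated as transient is an assumption.

```python
from typing import Callable, Dict

# Hypothetical placeholder returned when the VL call fails (assumption).
FAILURE_PLACEHOLDER = "[image description unavailable]"
# Hypothetical timeout; the actual value used by this change is not stated here.
VL_TIMEOUT_SECONDS = 120.0


def describe_with_cache(
    key: str,
    call_model: Callable[[str, float], str],
    cache: Dict[str, str],
    timeout: float = VL_TIMEOUT_SECONDS,
) -> str:
    """Return a description for `key`, caching only successful model results."""
    if key in cache:
        return cache[key]
    try:
        result = call_model(key, timeout)
    except (TimeoutError, ConnectionError):
        # Transient failure: hand back the placeholder without storing it,
        # so the next request for the same key calls the model again.
        return FAILURE_PLACEHOLDER
    cache[key] = result
    return result
```

With this shape, a timeout on the first request leaves the cache untouched, and a later request for the same key reaches the model again instead of returning the stale placeholder.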