ComfyUI nodes for Batch Image Captioning + various Vision-Language (VL) models, including InternVL3.5, Xiaomi MiMo-VL, LiquidAI LFM2-VL, Kwai Keye-VL, AIDC-AI Ovis2.5 and Ovis-U1. Models: Ovis2.5-2B, Ovis2.5-9B, Keye-VL-8B-Preview, MiMo-VL-7B-RL-GGUF, LFM2-VL-450M, LFM2-VL-1.6B, Ovis-U1-3B, Ovis2.5-2B, Ovis2.5-9B, InternVL3_5-1B/2B/4B/8B/14B/38B
-
Updated
Aug 29, 2025 - Python