vision-llm

Here are 6 public repositories matching this topic...

aidalinfo / extract-kit

Powerful PDF data extraction library powered by AI vision models. Transform PDFs into structured, validated data using TypeScript, Zod, and AI providers like Scaleway and Ollama.

pdf document-processing ai-sdk pdf-extraction vision-llm

Updated Sep 14, 2025
TypeScript

vdamov / D2R-AI-Item-Tracker

Star

AI-powered OCR for Diablo II: Resurrected - batch-extract item tooltips from screenshots using Vision LLMs (OpenAI, Groq, OpenRouter, LM Studio/Ollama). No Tesseract or EasyOCR needed.

Updated Sep 3, 2025
Python

NS027 / medical_chatbot_project_genAI

Star

Multimodal AI-powered medical assistant with LLMs, speech, and image understanding.

chatbot llama whisper peft multimodal huggingface healthcare-ai generative-ai qwen vision-llm

Updated Apr 18, 2025
Jupyter Notebook

HelloJahid / CarDVLM

Star

Car Damage Assessment using Vision LLM

llm vision-language-model vision-llm

Updated Sep 8, 2025
Python

vishvaRam / Fine-Tuning-Qwen2.5-Vision

Star

This repository focuses on customizing the Qwen2.5-Vision model for specific tasks. It provides step-by-step guidance, scripts, and best practices for fine-tuning the model on custom datasets. Ideal for developers and researchers, it ensures optimal performance and accuracy tailored to unique use cases.

transformer fine-tuning vision-transformer llm vision-language-model llm-training qwen qwen2-5 vision-llm

Updated Apr 22, 2025
Jupyter Notebook

memamara / D2R-AI-Item-Tracker

Star

🖼️ Extract Diablo II: Resurrected item tooltips from screenshots in batches, using AI for accurate categorization and searchable databases.

python ocr ai computer-vision gaming pytorch openai diablo2 item-tracking item-tracker loot-tracker diablo2resurrected openrouter ollama lmstudio loot-tracking vision-llm tooltip-ocr

Updated Sep 18, 2025
Python

Improve this page

Add a description, image, and links to the vision-llm topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vision-llm topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vision-llm

Here are 6 public repositories matching this topic...

aidalinfo / extract-kit

vdamov / D2R-AI-Item-Tracker

NS027 / medical_chatbot_project_genAI

HelloJahid / CarDVLM

vishvaRam / Fine-Tuning-Qwen2.5-Vision

memamara / D2R-AI-Item-Tracker

Improve this page

Add this topic to your repo