[NeurIPS 2025] Official Implementation of "ReinFlow: Fine-tuning Flow Policy with Online RL". (Flow x Reinforcement Learning)
-
Updated
Sep 26, 2025 - Python
[NeurIPS 2025] Official Implementation of "ReinFlow: Fine-tuning Flow Policy with Online RL". (Flow x Reinforcement Learning)
building a simple VLM. Implementing LlaMA-SmolLM2 from scratch + SigLip2 Vision Model. KV-Caching is supported and implemented from scratch as well
building a simple VLM. Implementing LlaMA-SmolLM2 from scratch + SigLip2 Vision Model. KV-Caching is supported and implemented from scratch as well
A Python script to analyze images generated using a LoRA (Low-Rank Adaptation) model applied at various strength levels. This tool helps determine an optimal strength for a given LoRA by evaluating image quality and similarity to control images.
Fine-tuned 3B parameters PaliGemma2 vision model on Valorant object detection improving IoU scores across all classes. Project is developed for research experimentation.
Fine-tuning DINO object detection model on a COCO-annotated pedestrian dataset from IIT Delhi. Includes data prep, training, evaluation, and visualization scripts.
This repository includes of a Multi-Tag (acronyms are Multi-Task and Multi-Output as well) Image Classification on Fashion Products Images dataset on Kaggle using EfficientNetB0 with high accuracies
Add a description, image, and links to the finetuning-vision-models topic page so that developers can more easily learn about it.
To associate your repository with the finetuning-vision-models topic, visit your repo's landing page and select "manage topics."