lmms-finetune

lmms-finetune

A unified code repository for fine-tuning large multimodal models

  • Provide a unified fine-tuning framework to simplify the integration and fine-tuning process
  • Support multiple fine-tuning strategies such as full fine-tuning, lora, q-lora, etc
  • Maintain the simplicity of the code repository for easy understanding and modification
  • Supports multiple types of LMMs, including single image models, multi image/interleaved image models, and video models
  • Provide detailed documentation and examples to help users quickly get started
  • Flexible code repository, supporting customization and quick experimentation

Product Details

LMMS finetune is a unified code repository designed to simplify the fine-tuning process of large multimodal models (LMMs). It provides a structured framework that allows users to easily integrate the latest LMMs and make fine-tuning, supporting full fine-tuning and strategies such as Lora. The code library design is simple and lightweight, easy to understand and modify, and supports multiple models including LLaVA-1.5, Phi-3-Vision, Qwen VL Chat, LLaVA NeXT Interleaf, and LLaVA NeXT Video.