Timeline
Research paper proposing the VLM2Rec framework to fix modality collapse in multimodal recommendation systems posted to arXiv
Research paper published introducing the VLM2Rec framework to fix modality collapse in VLMs for sequential recommendation
Technical guide published on Medium for efficient fine-tuning of VLMs using LoRA and quantization
Research paper 'VLM4Rec: Multimodal Semantic Representation for Recommendation with Large Vision-Language Models' posted to arXiv
Research reveals VLMs struggle with fine-grained visual classification despite excelling at complex reasoning
New research published on arXiv shows that VLMs' spatial reasoning collapses when visual elements lack text labels, exposing fundamental limitations
Researchers develop novel fine-tuning technique that improves how medical VLMs understand negation in clinical reports