April 09, 2024
2024
New preprint out on arXiv Can Feedback Enhance Semantic Grounding in Large Vision-Language Models?!