CHIRP: A Fine-Grained Benchmark for Open-Ended Response Evaluation in Vision-Language Models
Created by
Haebom
Authors
Alexis Roger, Prateek Humane, Daniel Z. Kaplan, Kshitij Gupta, Qi Sun, George Adamopoulos, Jonathan Siu Chi Lim, Quentin Anthony, Edwin Fennell, Irina Rish