A hierarchical taxonomy of benchmarks for multimodal model evaluation
