Skip to content

Latest commit

 

History

History
20 lines (13 loc) · 985 Bytes

README.md

File metadata and controls

20 lines (13 loc) · 985 Bytes

Animal-Bench

Code release for "AnimalBench: Benchmarking Multimodal Video Models for Animal-centric Video Understanding"

image

Previous benchmarks (left) relied on limited agent and the scenarios of editing-based benchmarks are unrealistic. Our proposed Animal-Bench (right) includes diverse animal agents, various realistic scenarios, and encompasses 13 different tasks.

Task Demonstration

image

Effectiveness evaluation results: image

Robustness evaluation results: image

We will release our data and code SOON!