Evaluating Design Video Generation: Metrics for Compositional Fidelity

· ArXiv · AI/CL/LG ·

The paper proposes automated metrics for evaluating compositional fidelity in design-focused video generation models.

Categories: Research

Excerpt

Generative video models are increasingly used for design animation tasks, yet no standardized evaluation framework exists for this domain. Unlike natural video generation, design animation imposes structured constraints: specific components must animate with prescribed motion types, directions, speeds, and timing, while non-animated regions must remain stable and the layout structure must be preserved. This paper presents a fully automated evaluation framework organized along four dimensions: layout fidelity, motion correctness, temporal quality, and content fidelity. The framework removes the reliance on subjective human evaluation and establishes a common basis for benchmarking progress in the field.
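To make the constraint that non-animated regions must remain stable more concrete, here is a minimal illustrative sketch of one way such a check could be scored. It is not the paper's metric: the function name `static_region_stability`, the mask convention, and the use of mean absolute difference against the first frame are all assumptions made for illustration.

```python
import numpy as np


def static_region_stability(frames: np.ndarray, animated_mask: np.ndarray) -> float:
    """Hypothetical stability score for regions that should not animate.

    frames:        array of shape (T, H, W) or (T, H, W, C), values in [0, 1].
    animated_mask: boolean array of shape (H, W), True where motion is expected.

    Returns a value in [0, 1]; 1.0 means every pixel outside the mask is
    identical to the first frame in all later frames.
    """
    frames = np.asarray(frames, dtype=np.float64)
    static = ~np.asarray(animated_mask, dtype=bool)

    # With fewer than two frames or no static pixels there is nothing to compare.
    if frames.shape[0] < 2 or not static.any():
        return 1.0

    # Absolute change of every later frame relative to the first frame.
    diffs = np.abs(frames[1:] - frames[0])

    # Keep only pixels outside the animated mask (assumed choice: plain mean).
    static_diffs = diffs[:, static]
    return float(1.0 - static_diffs.mean())


# Toy clip: a bright pixel moves along the top row, which is marked as animated.
clip = np.zeros((3, 4, 4))
mask = np.zeros((4, 4), dtype=bool)
mask[0, :] = True
clip[1, 0, 1] = 1.0
clip[2, 0, 2] = 1.0
clean_score = static_region_stability(clip, mask)

# Same clip, but one pixel outside the mask changes in the last frame.
leaky = clip.copy()
leaky[2, 3, 3] = 1.0
leaky_score = static_region_stability(leaky, mask)
```

In this toy case the clean clip scores 1.0 because all motion stays inside the mask, while the leaky clip scores lower. A real framework would likely need a perceptual or structure-aware distance rather than raw pixel differences; the plain mean is used here only to keep the sketch short.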