Temporal-aware T2V generation evaluation Q&A Verifier
10+
Expert in detailed, accurate 3D image captioning
4+