VIDEO DOI: https://doi.org/10.48448/5yp8-6262

findings / work in progress

ACL 2022

May 23, 2022

Dublin, Ireland

Understanding Multimodal Procedural Knowledge by Sequencing Multimodal Instructional Manuals

Please log in to leave a comment

Downloads

SlidesPaperTranscript English (automatic)

Next from ACL 2022

VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena
findings / work in progress

VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena

ACL 2022

+3Iacer CalixtoLetitia Parcalabescu
Letitia Parcalabescu and 5 other authors

23 May 2022

Similar lecture

Bridging the Gap between Recognition-level Pre-training and Commonsensical Vision-language Tasks
workshop paper

Bridging the Gap between Recognition-level Pre-training and Commonsensical Vision-language Tasks

ACL 2022

+2Yueen MaShih-Fu ChangYue Wan
Yue Wan and 4 other authors

27 May 2022