UNDERLINE DOI: https://doi.org/10.48448/j08d-fv18
technical paper
Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers
Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.

