UNDERLINE DOI: https://doi.org/10.48448/hss6-1x42
workshop paper
Video Language Co-Attention with Multimodal Fast-Learning Feature Fusion for VideoQA
