UNDERLINE DOI: https://doi.org/10.48448/tkfa-wk43
poster
STOA-VLP: Spatial-Temporal Modeling of Object and Action for Video-Language Pre-training
Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.