Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
audio video pytorch transformer temporal-action-proposals i3d video-features dense-video-captioning multimodal-fusion activitynet-captions bmvc bmt bmvc20 bi-modal-transformer proposal-generator bi-modal-encoder
Updated Apr 8, 2023 - Jupyter Notebook