Ptt 大爆卦 | Hero - Go to https://arxiv.org/abs/2005.00200

HERO: Hierarchical Encoder for Video+Language Omni ...

HERO encodes multimodal inputs in a hierarchical structure, where local context of a video frame is captured by a Cross-modal Transformer via ...
