As there is no prior work on weakly supervised learning from adverbs, ... all baselines for video-to-adverb retrieval with a performance of 0.719 mAP.
確定! 回上一頁