3D convolution: embed the temporal dimension into the CNN ... Objects2action: Classifying and localizing actions without any video example (arXiv).