We propose a method that can generate an unambiguous description (known as a referring expression) of a specific object or region in an image, and which can ...
確定! 回上一頁