As an encoder, CNN extracts image features and provides them to the decoder to generate captions. Figure 3 shows a generalized CNN architecture ...
確定! 回上一頁