Use extract_text method found in pdfminer.high_level to extract text from the PDF file. Tokenize the text file using NLTK.tokenize ...
確定! 回上一頁