Given that you also consider other libraries, I suggest using poppler-util's pdftohtml to convert the pdf to xml:
確定! 回上一頁