@juxtacognition I have used #RStats "#tabulizer" to extract tables from a #PDF (https://forum.knime.com/t/automate-pdf-reader-and-convert-data-to-excel-table-with-correct-column-mappings/26384/10?u=mlauber71) and "#pdftools" to extract text (https://forum.knime.com/t/unstructured-text-mining-from-pdf/48625/4?u=mlauber71). Maybe you can adapt this. Then there is a #KNIME node that uses "PDFBox" or another parser (https://kni.me/w/kjy6Q-3szxcH6716) - but I have not used it myself
#RStats #tabulizer #pdf #pdftools #KNIME
#tabulizer provides #rstats bindings to the Tabula java library, which can be used to computationaly extract tables from PDF documents.