Sources
Hansen, Pomp et al. – Matthias Hansen, Andre Pomp et al.: Data-Driven Recognition and Extraction of PDF Document Elements (https://www.mdpi.com/2227-7080/7/3/65)
Milosevic et al. – Nikola Milosevic et al.: A framework for information extraction from tables in biomedical literature (https://link.springer.com/article/10.1007/s10032-019-00317-0)
tesseract-ocr – https://github.com/tesseract-ocr/docs/blob/master/das_tutorial2016/7Building%20a%20Multi-Lingual%20OCR%20Engine.pdf