kotaemon/knowledgehub/loaders/utils
Tuan Anh Nguyen Dang (Tadashi_Cin) 4704e2c11a Add new OCRReader with PDF+OCR text merging (#66)
This change speeds up OCR extraction by allowing bypassing OCR for texts that are irrelevant (not in table).

---------

Co-authored-by: Nguyen Trung Duc (john) <trungduc1992@gmail.com>
2023-11-13 17:43:02 +07:00
..
__init__.py [AUR-432] Add layout-aware table parsing PDF reader (#27) 2023-09-26 15:52:44 +07:00
box.py Add new OCRReader with PDF+OCR text merging (#66) 2023-11-13 17:43:02 +07:00
pdf_ocr.py Add new OCRReader with PDF+OCR text merging (#66) 2023-11-13 17:43:02 +07:00
table.py Add new OCRReader with PDF+OCR text merging (#66) 2023-11-13 17:43:02 +07:00