pillow
pypdf2
pdf2image
pytesseract
multiprocess
python-Levenshtein
tika
