nltk
numpy
PyPDF2
sklearn
