PyPDF2
python-docx
scikit-learn
pyspellchecker
stanza
nltk
rapidfuzz
