txt/
pdf/
md/
**/__pycache__/
**/*.egg-info/
dist/
clean_md.py
corpus.md