Metadata-Version: 2.2
Name: xh_pdf_parser
Version: 1.2.2.1
Summary: A practical tool for converting PDF to Markdown
Home-page: https://github.com/opendatalab/MinerU
Requires-Python: >=3.9
Description-Content-Type: text/markdown
Requires-Dist: boto3>=1.28.43
Requires-Dist: Brotli>=1.1.0
Requires-Dist: click>=8.1.7
Requires-Dist: fast-langdetect<0.3.0,>=0.2.3
Requires-Dist: loguru>=0.6.0
Requires-Dist: numpy<2.0.0,>=1.21.6
Requires-Dist: pydantic>=2.7.2
Requires-Dist: PyMuPDF<=1.24.14,>=1.24.9
Requires-Dist: scikit-learn>=1.0.2
Requires-Dist: transformers
Requires-Dist: pdfminer.six==20231228
Requires-Dist: omegaconf>=2.3.0
Requires-Dist: matplotlib>=3.8.4
Requires-Dist: iopath>=0.1.9
Requires-Dist: timm==0.9.16
Requires-Dist: opencv-python>=4.6.0
Requires-Dist: fairscale>=0.4.13
Requires-Dist: ftfy>=6.2.0
Requires-Dist: albumentations>=1.4.4
Requires-Dist: wand>=0.6.13
Requires-Dist: webdataset>=0.2.86
Requires-Dist: rapidfuzz>=3.8.1
Requires-Dist: termcolor>=2.4.0
Requires-Dist: pandas>=2.2.2
Requires-Dist: evaluate>=0.4.1
Requires-Dist: rich>=13.7.1
Requires-Dist: jupyterlab>=4.1.6
Requires-Dist: tabulate>=0.9.0
Requires-Dist: nltk>=3.8.1
Requires-Dist: streamlit>=1.33.0
Requires-Dist: pypdfium2>=4.29.0
Requires-Dist: pdf2image>=1.17.0
Requires-Dist: streamlit_drawable_canvas>=0.9.3
Requires-Dist: torch<=2.3.1,>=2.2.2
Requires-Dist: torchvision<=0.18.1,>=0.17.2
Requires-Dist: ultralytics>=8.3.48
Requires-Dist: paddleocr==2.7.3
Requires-Dist: struct-eqtable==0.3.2
Requires-Dist: einops
Requires-Dist: accelerate
Requires-Dist: doclayout_yolo==0.0.2b1
Requires-Dist: rapidocr-paddle<2.0.0,>=1.4.5
Requires-Dist: rapidocr_onnxruntime<2.0.0,>=1.4.4
Requires-Dist: rapid_table<2.0.0,>=1.0.3
Requires-Dist: PyYAML
Requires-Dist: openai
Requires-Dist: detectron2
Requires-Dist: paddlepaddle==3.0.0
Requires-Dist: paddlepaddle-gpu==2.6.0
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: requires-dist
Dynamic: requires-python
Dynamic: summary

整合Mineru和UniMERNet

处理包的冲突问题，方便安装使用
```python
pip install xh-pdf-parser --extra-index-url https://wheels.myhloli.com
```

download the model

https://github.com/opendatalab/MinerU/blob/master/docs/how_to_download_models_en.md

## Changelog
2025.3.28

    主要处理UniMERNet的transformers的包冲突问题

[MinerU](https://github.com/opendatalab/MinerU) 
