Metadata-Version: 2.2
Name: xh_pdf_parser
Version: 1.3.1.0
Summary: A practical tool for converting PDF to Markdown
Home-page: https://github.com/opendatalab/MinerU
Requires-Python: >=3.9
Description-Content-Type: text/markdown
Requires-Dist: boto3>=1.28.43
Requires-Dist: Brotli>=1.1.0
Requires-Dist: click>=8.1.7
Requires-Dist: fast-langdetect<0.3.0,>=0.2.3
Requires-Dist: loguru>=0.6.0
Requires-Dist: numpy>=1.21.6
Requires-Dist: pydantic<2.11,>=2.7.2
Requires-Dist: PyMuPDF<1.25.0,>=1.24.9
Requires-Dist: scikit-learn>=1.0.2
Requires-Dist: torch!=2.5.0,!=2.5.1,<=2.6.0,>=2.2.2
Requires-Dist: torchvision
Requires-Dist: transformers!=4.51.0,<5.0.0,>=4.49.0
Requires-Dist: pdfminer.six==20231228
Requires-Dist: tqdm>=4.67.1
Requires-Dist: matplotlib>=3.10
Requires-Dist: ultralytics>=8.3.48
Requires-Dist: doclayout_yolo==0.0.2b1
Requires-Dist: dill<1,>=0.3.9
Requires-Dist: rapid_table<2.0.0,>=1.0.5
Requires-Dist: PyYAML<7,>=6.0.2
Requires-Dist: ftfy<7,>=6.3.1
Requires-Dist: openai<2,>=1.70.0
Requires-Dist: shapely<3,>=2.0.7
Requires-Dist: pyclipper<2,>=1.3.0
Requires-Dist: omegaconf<3,>=2.3.0
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: requires-dist
Dynamic: requires-python
Dynamic: summary

整合Mineru和UniMERNet

处理包的冲突问题，方便安装使用
```python
pip install xh_pdf_parser --extra-index-url https://wheels.myhloli.com
```

download the model

https://github.com/opendatalab/MinerU/blob/master/docs/how_to_download_models_en.md

## Changelog
2025.3.28

    主要处理UniMERNet的transformers的包冲突问题

[MinerU](https://github.com/opendatalab/MinerU) 
