Metadata-Version: 2.4
Name: document-reader
Version: 0.0.6
Summary: A Python document reader for extracting fields, based on regular expressions
Author-email: Jonatan Rodrigues da Silva <jonatanjrss@gmail.com>
License: MIT
Project-URL: Homepage, https://github.com/Jonatanjrss/document-reader
Project-URL: Issues, https://github.com/Jonatanjrss/document-reader/issues
Keywords: python,document,reader,regex
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: POSIX :: Linux
Requires-Python: >=3.12
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: loguru==0.7.3
Requires-Dist: pdftotext==3.0.0
Dynamic: license-file

# document-reader

A Python document reader that extracts fields using regular expressions.

## Installation

```bash
pip install document-reader
```

## Usage

```python
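from document_reader import Document, Field

# Point the reader at the PDF to process.
doc = Document("pdf_file.pdf")

# Register one Field per value to extract: a name, a regex, and the page to search.
doc.register_fields(
    Field(name="contract", regex=r"\d+/.*?/\d+", page=0),
    Field(name="nup", regex=r"\d{5}\.\d{6}/\d{4}-\d{2}", page=1),
)

# Read the document and print the result.
data = doc.open()
print(data)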
```
