Metadata-Version: 2.4
Name: cvfile-cv-detector
Version: 0.1.0
Summary: Tiny reference sniffer for the .cv open file format. Detects .cv wrapping inside any application/pdf and unwraps the canonical Markdown payload.
Project-URL: Homepage, https://cvfile.org
Project-URL: Source, https://github.com/cvfile/cv/tree/main/tools/cv-detector/python
Project-URL: Issues, https://github.com/cvfile/cv/issues
Author-email: "cvfile.org maintainers" <contact@cvfile.org>
License: Apache-2.0
Keywords: crawler,cv,open-format,pdf,rag,resume,sniffer
Classifier: Development Status :: 4 - Beta
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Programming Language :: Python :: 3
Classifier: Topic :: Text Processing
Requires-Python: >=3.10
Requires-Dist: pypdf<6,>=4
Description-Content-Type: text/markdown

# cvfile-cv-detector (Python)

```bash
pip install cvfile-cv-detector
```

```python
from cvfile_cv_detector import detect, unwrap

with open("resume.pdf", "rb") as f:
    data = f.read()

det = detect(data)
if det.is_cv_file:
    payload = unwrap(data)
    if payload:
        markdown = payload.bytes_.decode("utf-8")
        print(f"got {payload.name} ({payload.mime_type}, {len(markdown)} chars)")
```

`detect()` is zero-dependency (pure regex over the PDF bytes). `unwrap()`
uses `pypdf` to read the PDF Associated Files (`/AF`) array.

See `../README.md` for the cross-language story and rationale.
