Olga
Document extraction · Rust + Python
15–40× faster
Four formats. One engine.
PDF · DOCX · XLSX · HTML
Spatial fidelity · Strictly typed · No LLM in the loop
github.com/Hugues-DTANKOUO/olga