Olga
Document extraction · Rust + Python
15–40×faster
Four formats. One engine.PDF · DOCX · XLSX · HTML
Spatial fidelity·Strictly-typed·No LLM in the loop
github.com/Hugues-DTANKOUO/olga