A one-stop, open-source, high-quality data extraction tool that supports converting Office to Markdown and JSON. support pdf, image, word, ppt, excel.