Metadata-Version: 2.4
Name: arkeo
Version: 0.2.4
Summary: markdown archiver betasaurus
Home-page: https://github.com/arkeosaurus/arkeo
Author: arkeosaurus
Author-email: arkeosaurus@users.noreply.github.com
Keywords: document processing,archiving,indexing,markdown,corpus
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Topic :: Text Processing
Classifier: Topic :: Internet :: WWW/HTTP :: Indexing/Search
Requires-Python: >=3.8
Description-Content-Type: text/plain
Requires-Dist: beautifulsoup4
Requires-Dist: flashtext
Requires-Dist: genson
Requires-Dist: google-auth
Requires-Dist: google-api-python-client
Requires-Dist: html2text
Requires-Dist: jsonpath-ng
Requires-Dist: jsonschema
Requires-Dist: lxml_html_clean
Requires-Dist: markitdown
Requires-Dist: newspaper3k
Requires-Dist: nltk
Requires-Dist: pandas
Requires-Dist: PyYAML
Requires-Dist: requests
Requires-Dist: tldextract
Requires-Dist: tqdm
Requires-Dist: xmltodict
Dynamic: author
Dynamic: author-email
Dynamic: classifier
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: keywords
Dynamic: requires-dist
Dynamic: requires-python
Dynamic: summary

Ideally, this inhales a lot of media and regurgitates markdown for a curated research repository.
