Metadata-Version: 2.4
Name: ie_datasets
Version: 0.0.3
Summary: Load fully-typed information extraction data in a single line.
Project-URL: Homepage, https://github.com/adanomad/ie-datasets
Project-URL: Issues, https://github.com/adanomad/ie-datasets/issues
Author-email: Justin Xu <xu.justin.j@gmail.com>
License-Expression: MIT
License-File: LICENSE
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Requires-Python: >=3.9
Requires-Dist: annotated-types>=0.7.0
Requires-Dist: pandas>=2.2.3
Requires-Dist: platformdirs>=4.3.6
Requires-Dist: pyarrow>=19.0.1
Requires-Dist: pydantic>=2.10.3
Description-Content-Type: text/markdown

# Information Extraction Datasets

This package takes care of all of the tedium when loading various information extraction datasets, providing the data in fully validated and typed Pydantic objects.

## Datasets

### [ChemProt](./src/ie_datasets/datasets/chemprot/README.md)

<details>
  <summary>Example</summary>

  ```py
  from ie_datasets import ChemProt
  ChemProt.load_units("train")
  ChemProt.load_units("validation")
  ChemProt.load_units("test")
  ```
</details>

### [SciERC](./src/ie_datasets/datasets/scierc/README.md)

<details>
  <summary>Example</summary>

  ```py
  from ie_datasets import SciERC
  SciERC.load_units("train")
  SciERC.load_units("dev")
  SciERC.load_units("test")
  ```
</details>

### [WikiEvents](./src/ie_datasets/datasets/wikievents/README.md)

<details>
  <summary>Example</summary>

  ```py
  from ie_datasets import WikiEvents
  WikiEvents.load_ontology()
  WikiEvents.load_units("train")
  WikiEvents.load_units("dev")
  WikiEvents.load_units("test")
  ```
</details>
