Metadata-Version: 2.4
Name: biocroissant
Version: 0.1.0
Summary: Metadata format and tooling for biomedical and health ML datasets (BioCroissant).
License-File: LICENSE.md
Requires-Python: >=3.10
Provides-Extra: omop
Requires-Dist: pandas; extra == 'omop'
Description-Content-Type: text/markdown

# BioCroissant

BioCroissant is a metadata format and tooling for biomedical and health ML datasets, extending [Croissant](https://github.com/mlcommons/croissant).

This repository is the **project skeleton** for community feedback. Implementation (parser, converters, validation) will be added in a later phase, including a possible port from the [biocroissant-to-omop](https://github.com/renato-umeton/biocroissant-to-omop) prototype.

## Contents

- **[docs/specification/](docs/specification/)** – BioCroissant specification (potentially)
- **[examples/](examples/)** – Example metadata and data (planned)
- **[schema/](schema/)** – JSON-LD context, JSON Schema, value domains (potentially) — alternative: schema profile(s)
- **[src/biocroissant/](src/biocroissant/)** – Python package (skeleton)
- **[CONTRIBUTING.md](CONTRIBUTING.md)** – How to contribute
