Metadata-Version: 2.4
Name: mex-artificial
Version: 1.4.1
Summary: Create artificial data for the MEx project.
Project-URL: Repository, https://github.com/robert-koch-institut/mex-artificial
Author-email: MEx Team <mex@rki.de>
License-Expression: MIT
License-File: AUTHORS
License-File: LICENSE
Requires-Python: <3.15,>=3.11
Requires-Dist: annotated-types>=0.7
Requires-Dist: faker<41,>=40
Requires-Dist: mex-common<2,>=1.16
Requires-Dist: pydantic>=2
Requires-Dist: rich>=14
Requires-Dist: typer>=0.23
Description-Content-Type: text/markdown

# MEx artificial

Create artificial data for the MEx project.

[![cookiecutter](https://github.com/robert-koch-institut/mex-artificial/actions/workflows/cookiecutter.yml/badge.svg)](https://github.com/robert-koch-institut/mex-template)
[![cve-scan](https://github.com/robert-koch-institut/mex-artificial/actions/workflows/cve-scan.yml/badge.svg)](https://github.com/robert-koch-institut/mex-artificial/actions/workflows/cve-scan.yml)
[![documentation](https://github.com/robert-koch-institut/mex-artificial/actions/workflows/documentation.yml/badge.svg)](https://robert-koch-institut.github.io/mex-artificial)
[![linting](https://github.com/robert-koch-institut/mex-artificial/actions/workflows/linting.yml/badge.svg)](https://github.com/robert-koch-institut/mex-artificial/actions/workflows/linting.yml)
[![opencode](https://github.com/robert-koch-institut/mex-artificial/actions/workflows/opencode.yml/badge.svg)](https://gitlab.opencode.de/robert-koch-institut/mex/mex-artificial)
[![testing](https://github.com/robert-koch-institut/mex-artificial/actions/workflows/testing.yml/badge.svg)](https://github.com/robert-koch-institut/mex-artificial/actions/workflows/testing.yml)

## Project

The Metadata Exchange (MEx) project is committed to improve the retrieval of RKI
research data and projects. How? By focusing on metadata: instead of providing the
actual research data directly, the MEx metadata catalog captures descriptive information
about research data and activities. On this basis, we want to make the data FAIR[^1] so
that it can be shared with others.

Via MEx, metadata will be made findable, accessible and shareable, as well as available
for further research. The goal is to get an overview of what research data is available,
understand its context, and know what needs to be considered for subsequent use.

RKI cooperated with D4L data4life gGmbH for a pilot phase where the vision of a
FAIR metadata catalog was explored and concepts and prototypes were developed.
The partnership has ended with the successful conclusion of the pilot phase.

After an internal launch, the metadata will also be made publicly available and thus be
available to external researchers as well as the interested (professional) public to
find research data from the RKI.

For further details, please consult our
[project page](https://www.rki.de/DE/Aktuelles/Publikationen/Forschungsdaten/MEx/metadata-exchange-plattform-mex-node.html).

[^1]: FAIR is referencing the so-called
[FAIR data principles](https://www.go-fair.org/fair-principles/) – guidelines to make
data Findable, Accessible, Interoperable and Reusable.

**Contact** \
For more information, please feel free to email us at [mex@rki.de](mailto:mex@rki.de).

### Publisher

**Robert Koch-Institut** \
Nordufer 20 \
13353 Berlin \
Germany

## Package

Create artificial extracted items, transform them into merged items and write the
results into a configured sink.

## License

This package is licensed under the [MIT license](/LICENSE). All other software
components of the MEx project are open-sourced under the same license as well.

## Development

### Installation

- install python on your system
- on unix, run `make install`
- on windows, run `.\mex.bat install`

### Linting and testing

- run all linters with `make lint` or `.\mex.bat lint`
- run unit and integration tests with `make test` or `.\mex.bat test`
- run just the unit tests with `make unit` or `.\mex.bat unit`

### Updating dependencies

- update boilerplate files with `cruft update`
- update global requirements in `requirements.txt` manually
- update git hooks with `pre-commit autoupdate`
- update package dependencies using `uv sync --upgrade`
- update github actions in `.github/workflows/*.yml` manually

### Creating release

- run `mex release RULE` to release a new version where RULE determines which part of
  the version to update and is one of `major`, `minor`, `patch`.

### Container workflow

- build image with `make image`
- run local version using docker `make run`

### Pre-built workflow

- you can run the latest artificial data generator without building it locally
- just pull it from the container registry and configure using cli arguments
- `docker run -it -u $(id -u):$(id -g) -v $(pwd):/out
   ghcr.io/robert-koch-institut/mex-artificial:latest --count=1000 --chattiness=10`
- using `-u $(id -u):$(id -g)` to run the process using your local user
- using `-v $(pwd):/out` to specify an output directory for the resulting `ndjson`
- `--count` controls the number of items to generate
- `--chattiness` controls the number of words in textual fields

## Commands

- run `uv run artificial --help` to print instructions
