Metadata-Version: 2.1
Name: mbdb-parsing
Version: 0.1.0
Summary: Extract metadata from raw datafiles instruments within the MBDB
Author-email: Emil Dandanell Agerschou <agerschou@ibt.cas.cz>
Project-URL: Homepage, https://github.com/Molecular-Biophysics-Database/mbdb-parsing
Project-URL: Issues, https://github.com/Molecular-Biophysics-Database/mbdb-parsing/issues
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Development Status :: 4 - Beta
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: pandas
Requires-Dist: openpyxl
Requires-Dist: invenio-records-resources
Requires-Dist: oarepo-runtime
Requires-Dist: invenio-search
Requires-Dist: opensearch-dsl
Requires-Dist: invenio-drafts-resources
Provides-Extra: dev
Requires-Dist: black ; extra == 'dev'
Requires-Dist: pytest ; extra == 'dev'

# mbdb-parsing

## Description

Set of processors for InvenioRDM/OArepo style repository that allows for
extraction and conversion of metadata from raw measurements files. See below
for the list of supported file types.

### MST
 - .moc
 - .moc2
 - .xlsx

## Requirements

 * Python >=3.12

## Installation

```bash
pip install mbdb-parsing
```
Or add mbdb-parsing to the dependencies inside the app's pyproject.toml:

```toml
[project]
dependencies = [
    "mbdb-parsing",
]
```

## Configuration

The processors should be placed inside FileServiceConfig(s) objects:

```python
from invenio_records_resources.services import FileServiceConfig
from oarepo_runtime.services.config.service import PermissionsPresetsConfigMixin

from mbdb-parsing.mst import MocProcessor, XlxsProcessor


class MstFilesServiceConfig(PermissionsPresetsConfigMixin, FileServiceConfig):
    file_processors = [
        MocProcessor(),
        XlxsProcessor(),
    ]
    components = [
        *PermissionsPresetsConfigMixin.components,
        *FileServiceConfig.components,
    ]
```
