Metadata-Version: 2.4
Name: fairds
Version: 0.1.3
Summary: Python client for the FAIR Data Station API
Project-URL: Repository, https://gitlab.com/m-unlock/fairds-py
Author-email: Brett Metcalfe <brett.metcalfe@wur.nl>, Jasper Koehorst <jasper.koehorst@wur.nl>
License: Apache-2.0
License-File: LICENSE
Requires-Python: >=3.10
Requires-Dist: openpyxl>=3.1.5
Requires-Dist: pandas>=1.5
Requires-Dist: requests>=2.28
Description-Content-Type: text/markdown

# fairds
## FAIR Data Station
To improve the quality of reported data and to maximise its potential for reuse either by the original data generator or by others, data must have metadata, or 'data about data'. Furthermore, the set of metadata included or associated with any dataset must be sufficiently detailed as to allow for unambiguous interpretation of the associated data. In addition, good data management should allow for machine-actionabile metadata to be findable, accessible, interoperable, and directly reusable (FAIR guiding principles; [Wilkinson et al., 2016](https://doi.org/10.1038/sdata.2016.18)) with minimal human intervention. 

What constitutes 'good' metadata can be relatively subjective hence why researchers have developed various Minimum Information Models (MIMs), or checklists for a variety of datasets and data types. MIMs consist of two parts: firstly, for each assay and associated data type there is a community accepted checklist of reporting requirements. Secondly, an obligatory data format is used for reporting essential metadata to ensure machine-actionability. To ease the use of MIMs and various ontologies the FAIR Data Station, a metadata ingestion platform, has been developed ([Nijsse et al., 2023](https://doi.org/10.1093/gigascience/giad014)) that helps to guide researchers towards good data management using a multi-step process:

- Users select appropriate metadata standards via the FAIR-DS [web application](https://fairds.fairbydesign.nl/)
- Users generate a spreadsheet template via the FAIR-DS [web application](https://fairds.fairbydesign.nl/)
- Users record metadata using the spreadsheet template using Microsoft Excel
- Users then validate the metadata content according to template requirements generating an Resource Description Framework (RDF) file

FAIR Data Station uses a hierarchical metadata schema based upon an extended form of the ISA (Investigation, Study, Assay) Standard. ISA has been modified by replacing 'source material' with 'Observation Unit' from the Minimum Information About a Plant Phenotyping Experiment (MIAPPE) and implementing 'Sample' from the Just Enough Results Model (JERM). At present, Various packages are already implemented in FAIR-DS predominately based on the Minimum Information about any (x) Sequence (MIxS) standard from the Genomic Standards Consortium. Although it is possible to add new packages, e.g., the Minimum information model for fermentation experiments (MIFE; [Georgakilas et al., 2026](https://doi.org/10.1093/gigascience/giag038)) to meet the needs of different communities of practice.


## FAIR-by-design

The aim of FAIR Data Station is to ensure a FAIR-by-design approach to scientific research as opposed to the current model of FAIR for publication. What this entails is to facilitate researchers and data generators ability to make their data FAIR from the earliest moment as well as ensuring that they make use of appropriate metadata standards with minimal hassle. 


## fairds Python client

The python package `fairds` is a Python client for the [FAIR Data Station](https://fairds.fairbydesign.nl/) API. `fairds` is intended to allow users to ensure that the metadata and data validate metadata during the data analysis stage ensuring the quality of (a) dataset(s) prior to data analysis. 


## Install

The python package `fairds` can be installed from the Python Package Index (pypi) with the following command in the terminal: 

``` python

pip install fairds

```

It has the following requirements:


```
"pandas>=1.5",
"requests>=2.28",

```

## Example(s)

```
#--------------
# LOAD Package(s)
import os
from fairds import fairds


#-------------
# initialise class
fair_data = fairds.fairds()

#-------------
# SET spreadsheet file name
filename = os.path.normpath("examples\example_files\ValidationDemo_pass.xlsx")

#-------------
# GET results
fair_data.load(filename=filename, sheet_name = 'Sample - human gut')

#--------------
# To show the sheetnames uncomment the following
#fair_data.show_sheetnames()

#--------------
# To show the columns uncomment the following
#print(fair_data.df.columns)

#---------------
# Print out the sample identifiers
print(fair_data.df['sample identifier'])

```

A number of examples for loading, printing, fetching terms and packages, and validating can be found in the example folder.


## Contributing
To contribute to `fairds` or to raise an issue please go to: https://gitlab.com/m-unlock/fairds-py 


## Licence
`fairds` is licenced under an Apache License 2.0


## Authors and acknowledgment
The authors disclosed receipt of the following financial support for the research, authorship, and publication of this (research) software: European Union's Horizon 2020 research and innovation programme projects ‘RI services to promote deep digitalization of Industrial Biotechnology - towards smart biomanufacturing’, BIOINDUSTRY 4.0 (grant agreement n° [101094287](https://doi.org/10.3030/101094287)); and Wageningen University & Research ([WUR](https://www.wur.nl/)) Data Competency Center (WDCC) Open Science grant. FAIR Data Station was developed as part of the UNLOCK initiative (NWO: 184.035.007). The Dutch national funding agency NWO and Wageningen University and Research are thanked for their financial contribution to the [UNLOCK](https://m-unlock.com/) initiative. 


## References

Georgakilas, G.K., Metcalfe, B., Bize, A., et al. MIFE and MIFD: Minimum information for fermentation experiments and devices, GigaScience, 15, giag038 (2026). https://doi.org/10.1093/gigascience/giag038

Nijsse, B., Schaap, P.J., Koehorst, J.J. FAIR data station for lightweight metadata management and validation of omics studies, GigaScience, 12, giad014 (2023). https://doi.org/10.1093/gigascience/giad014

Wilkinson, M., Dumontier, M., Aalbersberg, I. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data 3, 160018 (2016). https://doi.org/10.1038/sdata.2016.18



