Metadata-Version: 2.4
Name: factgenie
Version: 1.2.1
Summary: Lightweight self-hosted span annotation tool
Home-page: https://github.com/ufal/factgenie
Author: Zdenek Kasner, Ondrej Platek, Patricia Schmidtova, Dave Howcroft, Ondrej Dusek
Author-email: kasner@ufal.mff.cuni.cz
License: MIT
Classifier: Development Status :: 3 - Alpha
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Intended Audience :: Science/Research
Classifier: Operating System :: POSIX :: Linux
Classifier: License :: OSI Approved :: MIT License
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Typing :: Typed
Requires-Python: >=3.9
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: coloredlogs>=15.0.1
Requires-Dist: datasets>=2.9.0
Requires-Dist: Flask-SSE>=1.0.0
Requires-Dist: Flask>=2.2.2
Requires-Dist: flask_apscheduler>=1.12.4
Requires-Dist: json2table>=1.1.5
Requires-Dist: lxml>=5.3.0
Requires-Dist: Markdown>=3.7.0
Requires-Dist: natsort>=8.4.0
Requires-Dist: python-slugify>=8.0.4
Requires-Dist: PyYAML>=6.0.2
Requires-Dist: requests>=2.32.3
Requires-Dist: scipy>=1.14.1
Requires-Dist: tinyhtml>=1.2.0
Requires-Dist: litellm>=1.70.0
Requires-Dist: pydantic>=2.10.6
Requires-Dist: google-api-python-client>=2.16.0
Requires-Dist: google-cloud-aiplatform>=1.38
Requires-Dist: pygamma-agreement==0.5.9
Requires-Dist: matplotlib>=3.10.0
Requires-Dist: tabulate>=0.9.0
Requires-Dist: tenacity>=9.0.0
Requires-Dist: llm-sandbox>=0.3.12
Requires-Dist: fast-edit-distance>=1.2.2
Provides-Extra: dev
Requires-Dist: wheel>=0.44.0; extra == "dev"
Requires-Dist: ipdb; extra == "dev"
Requires-Dist: black==24.10.0; extra == "dev"
Requires-Dist: isort==5.13.2; extra == "dev"
Provides-Extra: test
Requires-Dist: pytest>=8.3.3; extra == "test"
Provides-Extra: deploy
Requires-Dist: gunicorn>=23.0.0; extra == "deploy"
Dynamic: author
Dynamic: author-email
Dynamic: classifier
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: license
Dynamic: license-file
Dynamic: provides-extra
Dynamic: requires-dist
Dynamic: requires-python
Dynamic: summary

<div align="center">
<img src="factgenie/static/img/favicon.png" width=80px" alt="logo" />

<h1> factgenie </h1>

<!-- ![Github downloads](https://img.shields.io/github/downloads/kasnerz/factgenie/total) -->

![Pytest](https://github.com/ufal/factgenie/actions/workflows/py311_tests.yml/badge.svg)
![PyPI](https://img.shields.io/pypi/v/factgenie)
[![slack](https://img.shields.io/badge/slack-factgenie-04AD90.svg?logo=slack)](https://join.slack.com/t/factgenie/shared_invite/zt-2u180yy81-3zCR7mt8EOy55cxA5zhKyQ)
[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
![Github stars](https://img.shields.io/github/stars/kasnerz/factgenie?style=social)
<!-- [![arXiv](https://img.shields.io/badge/arXiv-2407.17863-b31b1b.svg)](https://arxiv.org/abs/2407.17863) -->
<!-- ![PyPI downloads](https://img.shields.io/pypi/dm/factgenie) -->

Annotate LLM outputs with a lightweight, self-hosted web application 🌈

![factgenie](https://github.com/user-attachments/assets/1d074588-ada1-4974-a42a-0d2195c65283)

</div>

## 📢  News
- **2025-03-06** - Release 1.1.0 is out and comes with many significant improvements! See the 👉️ [changelog](https://github.com/ufal/factgenie/releases/tag/v1.1.0).
- **2024-11-13** - We released version 1.0.1: our first official release! 🎉

## 👉️ How can factgenie help you?
Outputs from large language models (LLMs) may contain errors: semantic, factual, and lexical. 

With factgenie, you can have the error spans annotated:
- From LLMs through an API.
- From humans through a crowdsourcing service.

Factgenie can provide you:
1. **A user-friendly website** for collecting annotations from human crowdworkers.
2. **API calls** for collecting equivalent annotations from LLM-based evaluators.
3. **A visualization interface** for visualizing the data and inspecting the annotated outputs.

---
*What does factgenie **not help with** is collecting the data (we assume that you already have these), starting the crowdsourcing campaign (for that, you need to use a service such as [Prolific.com](https://prolific.com)) or running the LLM evaluators (for that, you need a local framework such as [Ollama](https://ollama.com) or a proprietary API).*

## 🏃 Quickstart
Make sure you have Python >=3.9 installed.

If you want to quickly try out factgenie, you can install the package from PyPI:
```bash
pip install factgenie
```

However, the recommended approach for using factgenie is using an editable package:
```bash
git clone https://github.com/ufal/factgenie.git
cd factgenie
pip install -e .[dev,deploy]
```
This approach will allow you to manually modify configuration files and write your own data classes.

After installing factgenie, use the following command to run the server on your local computer:
```bash
factgenie run --host=127.0.0.1 --port 8890
```
More information on how to set up factgenie is on [Github wiki](../../wiki/Setup).

## 💡 Usage guide


See the following **wiki pages** that that will guide you through various use-cases of factgenie:

| Topic                                                               | Description                                        |
| ------------------------------------------------------------------- | -------------------------------------------------- |
| 🔧 [Setup](../../wiki/Setup)                                         | How to install factgenie.                          |
| 🗂️ [Data Management](../../wiki/Data-Management)                     | How to manage datasets and model outputs.          |
| 🤖 [LLM Annotations](../../wiki/LLM-Annotations)                     | How to annotate outputs using LLMs.                |
| 👥 [Crowdsourcing Annotations](../../wiki/Crowdsourcing-Annotations) | How to annotate outputs using human crowdworkers.  |
| ✍️  [Generating Outputs](../../wiki/Generating-Outputs)              | How to generate outputs using LLMs.                |
| 📊 [Analyzing Annotations](../../wiki/Analyzing-Annotations)         | How to obtain statistics on collected annotations. |
| 💻 [Command Line Interface](../../wiki/CLI)                          | How to use factgenie command line interface.       |
| 🌱 [Contributing](../../wiki/Contributing)                           | How to contribute to factgenie.                    |

## 🔥 Tutorials
We also provide step-by-step walkthroughs showing how to employ factgenie on [the dataset from the Shared Task in Evaluating Semantic Accuracy](https://github.com/ehudreiter/accuracySharedTask):

| Tutorial                                                                                                                    | Description                                                                                      |
| --------------------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------ |
| [🏀 #1: Importing a custom dataset](../../wiki/Tutorials#-tutorial-1-importing-a-custom-dataset)                             | Loading the basketball statistics and model-generated basketball reports into the web interface. |
| [💬 #2: Generating outputs](../../wiki/Tutorials#-tutorial-2-generating-outputs)                                             | Using Llama 3.1 with Ollama for generating basketball reports.                                   |
| [📊 #3: Customizing data visualization](../../wiki/Tutorials#-tutorial-3-customizing-data-visualization)                     | Manually creating a custom dataset class for better data visualization.                          |
| [🤖 #4: Annotating outputs with an LLM](../../wiki/Tutorials#-tutorial-4-annotating-outputs-with-an-llm)                     | Using GPT-4o for annotating errors in the basketball reports.                                    |
| [👨‍💼 #5: Annotating outputs with human annotators](../../wiki/Tutorials#-tutorial-5-annotating-outputs-with-human-annotators) | Using human annotators for annotating errors in the basketball reports.                          |

## 🔊 Join us on Slack

If you want to get a quick feedback or actively participate in development of factgenie, join our public **Slack workspace**:

<a href="https://join.slack.com/t/factgenie/shared_invite/zt-2u180yy81-3zCR7mt8EOy55cxA5zhKyQ"><img width="150px" alt="Join us on Slack" src="./img/slack.png"></a>

## 📸 Try a public preview
We used factgenie for [our related research project](https://d2t-llm.github.io/). We host the outputs from the project using a public instance of factgenie.

> [!IMPORTANT]
> Note that this preview is very limited: it enables only data viewing, not any data collection or management.

**👉️ You can access the preview [here](https://quest.ms.mff.cuni.cz/nlg/d2t-llm/)**.

## 💬 Cite us

[Our paper](https://aclanthology.org/2024.inlg-demos.5/) was published at INLG 2024 System Demonstrations!

You can also find the paper on [arXiv](https://arxiv.org/abs/2407.17863).

For citing us, please use the following BibTeX entry:
```bibtex
@inproceedings{kasner2024factgenie,
    title = "factgenie: A Framework for Span-based Evaluation of Generated Texts",
    author = "Kasner, Zden{\v{e}}k  and
      Platek, Ondrej  and
      Schmidtova, Patricia  and
      Balloccu, Simone  and
      Dusek, Ondrej",
    editor = "Mahamood, Saad  and
      Minh, Nguyen Le  and
      Ippolito, Daphne",
    booktitle = "Proceedings of the 17th International Natural Language Generation Conference: System Demonstrations",
    year = "2024",
    address = "Tokyo, Japan",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.inlg-demos.5",
    pages = "13--15",
}
```

## Acknowledgements
The authors acknowledge the support of the National Recovery Plan funded project MPO 60273/24/21300/21000 CEDMO 2.0 NPO.

This work was co-funded by the European Union (ERC, NG-NLG, 101039303).

<img src="img/LOGO_ERC-FLAG_FP.png" alt="erc-logo" height="150"/> 
