Metadata-Version: 2.1
Name: jaeger-bio
Version: 1.26.4
Summary: A quick and precise pipeline for detecting phages in sequence assemblies.
Author-Email: Yasas Wijesekara <yasas.wijesekara@uni-greifswald.de>
License: MIT
Classifier: License :: OSI Approved :: MIT License
Classifier: Natural Language :: English
Classifier: Operating System :: Unix
Classifier: Programming Language :: Python :: 3
Classifier: Topic :: Scientific/Engineering :: Bio-Informatics
Project-URL: Homepage, https://github.com/Yasas1994/Jaeger
Requires-Python: <3.14,>=3.11
Requires-Dist: h5py>=3.0
Requires-Dist: joblib>=1.0
Requires-Dist: keras>=3.12.0
Requires-Dist: kneed>=0.8.5
Requires-Dist: matplotlib>=3.7
Requires-Dist: numpy>=1.24
Requires-Dist: pandas>=2.2.3
Requires-Dist: polars>=1.2.6
Requires-Dist: parasail==1.3.4
Requires-Dist: progressbar2>=4.4.2
Requires-Dist: psutil>=7.0.0
Requires-Dist: click>=8.1
Requires-Dist: pycirclize
Requires-Dist: requests>=2.0
Requires-Dist: rich>=13.0
Requires-Dist: scipy>=1.10
Requires-Dist: pyfastx>=2
Requires-Dist: ruptures>=1.1.9
Requires-Dist: scikit-learn>=1.6.0
Requires-Dist: seaborn>=0.13.0
Requires-Dist: pyyaml>=6.0.2
Requires-Dist: jinja2>=3.1
Requires-Dist: pydustmasker>=1.0.3
Provides-Extra: darwin-arm
Requires-Dist: tensorflow<2.22,>=2.21; extra == "darwin-arm"
Requires-Dist: tensorflow-metal; extra == "darwin-arm"
Provides-Extra: gpu
Requires-Dist: tensorflow[and-cuda]<2.22,>=2.21; extra == "gpu"
Provides-Extra: cpu
Requires-Dist: tensorflow<2.22,>=2.21; extra == "cpu"
Provides-Extra: onnx
Requires-Dist: onnxruntime-gpu>=1.26; extra == "onnx"
Requires-Dist: tf2onnx>=1.17; extra == "onnx"
Requires-Dist: sympy>=1.13; extra == "onnx"
Requires-Dist: onnx>=1.16; extra == "onnx"
Provides-Extra: test
Requires-Dist: pytest>=8.0; extra == "test"
Requires-Dist: pytest-mock>=3.14; extra == "test"
Provides-Extra: taxonomy
Requires-Dist: taxopy; extra == "taxonomy"
Requires-Dist: faiss-cpu; extra == "taxonomy"
Description-Content-Type: text/markdown

```
                  .
               ,'/ \`.
              |\/___\/|
              \'\   /`/          ██╗ █████╗ ███████╗ ██████╗ ███████╗██████╗
               `.\ /,'           ██║██╔══██╗██╔════╝██╔════╝ ██╔════╝██╔══██╗
                  |              ██║███████║█████╗  ██║  ███╗█████╗  ██████╔╝
                  |         ██   ██║██╔══██║██╔══╝  ██║   ██║██╔══╝  ██╔══██╗
                 |=|        ╚█████╔╝██║  ██║███████╗╚██████╔╝███████╗██║  ██║
            /\  ,|=|.  /\    ╚════╝ ╚═╝  ╚═╝╚══════╝ ╚═════╝ ╚══════╝╚═╝  ╚═╝
        ,'`.  \/ |=| \/  ,'`.```

```



Jaeger: an accurate and fast deep-learning tool to detect bacteriophage sequences
===============
![GitHub](https://img.shields.io/github/license/Yasas1994/jaeger) ![GitHub last commit (branch)](https://img.shields.io/github/last-commit/Yasas1994/jaeger/main?color=8a35da) ![Conda](https://img.shields.io/conda/v/bioconda/jaeger-bio) ![Conda](https://img.shields.io/conda/dn/bioconda/jaeger-bio) [![PyPI version](https://img.shields.io/pypi/v/jaeger-bio.svg)](https://pypi.org/project/jaeger-bio/) [![Downloads](https://static.pepy.tech/badge/jaeger-bio)](https://pepy.tech/project/jaeger-bio)



Jaeger is a tool that utilizes homology-free machine learning to identify phage genome sequences that are hidden within metagenomes. It is capable of detecting both phages and prophages within metagenomic assemblies.

> 📚 **For detailed installation instructions, usage guides, and troubleshooting, please visit the [documentation](https://jaeger.readthedocs.io/en/latest/installation.html).**

---
#### Citing Jaeger
---

If you use Jaeger in your work, please consider citing its preprint:

* **[Jaeger: an accurate and fast deep-learning tool to detect bacteriophage sequences](https://www.biorxiv.org/content/early/2024/09/24/2024.09.24.612722)**
  Yasas Wijesekara, Ling-Yi Wu, Rick Beeloo, Piotr Rozwalak, Ernestina Hauptfeld, Swapnil P. Doijad, Bas E. Dutilh, Lars Kaderali
  *bioRxiv*, 2024.09.24.612722



To cite the code itself:

* **Jaeger: an accurate and fast deep-learning tool to detect bacteriophage sequences**
https://doi.org/10.5281/zenodo.20534106

---

- [Installing Jaeger](#installation)
  - [One-liner install script](#option-1--one-liner-install-script-recommended)
  - [Bioconda](#option-2--bioconda)
  - [PyPI](#option-3--installing-from-pypi)
  - [git (main branch)](#option-4--installing-from-git)
  - [containers](#option-5--apptainer-singularity)
- [Downloading models](#downloading-models)

---
#### Installing Jaeger
---

##### option 1 : One-liner install script (recommended)

The easiest way to install Jaeger. This script auto-detects your platform (GPU, CPU, or Apple Silicon) and sets up the environment for you.

```bash
curl -sSL https://raw.githubusercontent.com/MGXlab/Jaeger/main/install.sh | bash
```

##### option 2 : bioconda

The performance of the Jaeger workflow can be significantly increased by utilizing GPUs. To enable GPU support, the CUDA Toolkit and cuDNN library must be accessible to conda.

````bash
# create conda environment and install jaeger
mamba create -n jaeger -c bioconda jaeger-bio==1.26

# activate environment
conda activate jaeger
````
Test the installation with test data
```bash
jaeger health
```


##### option 3 : Installing from PyPI (recommended)

```bash
# create a conda environment and activate
mamba create -n jaeger -c nvidia -c conda-forge cuda-nvcc "python>=3.11,<3.14" pip
conda activate jaeger

# OR create a virtual environment using venv
python3 -m venv jaeger
source jaeger/bin/activate

# to install jaeger with GPU support
pip install jaeger-bio[gpu]

# to install without GPU support
pip install jaeger-bio[cpu]

# to install on a Mac(arm)
pip install jaeger-bio[darwin-arm]

# test the installation
jaeger health
```

##### option 4 : Installing from git (main branch)

```bash
# create a conda environment and activate
mamba create -n jaeger -c nvidia -c conda-forge cuda-nvcc "python>=3.11,<3.14" pip
conda activate jaeger

# OR create a virtual environment using venv
python3 -m venv jaeger
source jaeger/bin/activate

# install jaeger

# to install with GPU support
pip install --no-cache-dir "jaeger-bio[gpu] @ git+https://github.com/MGXlab/Jaeger@main"

# to install without GPU support
pip3 install --root-user-action=ignore --no-cache-dir "jaeger-bio[cpu] @ git+https://github.com/MGXlab/Jaeger@main"

# to install on a Mac(arm)
pip3 install --root-user-action=ignore --no-cache-dir "jaeger-bio[darwin-arm] @ git+https://github.com/MGXlab/Jaeger@main"

# test the installation
jaeger health

```

##### option 5 : Apptainer (Singularity)
If you're using Apptainer on a cluster, it's recommended to build the container on your local machine and then transfer it to the cluster.
```bash
# get the container def
wget -O jaeger_singularity.def https://raw.githubusercontent.com/Yasas1994/Jaeger/main/singularity/jaeger_singularity.def
# get the configuration file
wget -O config.json https://raw.githubusercontent.com/Yasas1994/Jaeger/main/src/jaeger/data/config.json

# to build the container
apptainer build jaeger.sif singularity/jaeger_singularity.def

# test container
apptainer run --nv jaeger.sif jaeger --help

# test the installation
apptainer run --nv jaeger.sif jaeger health

# list jaeger models available for download
apptainer run --nv jaeger.sif jaeger download --list
# download jaeger models
apptainer run --nv jaeger.sif jaeger download --model_name jaeger_38341_1.4M_fragment --path /path/to/save/model --config /path/to/config.json

# run jaeger
apptainer run --nv jaeger.sif jaeger predict --model jaeger_38341_1.4M_fragment --config /path/to/config.json -i /path/to/input.fasta -o /path/to/save/results

```


---
#### Downloading models
---
Starting from version 1.26.0, users will need to download the new models separately after installing Jaeger. The bundled `default` model is deprecated and uses the legacy prediction workflow; modern SavedModels (e.g., `jaeger_38341_1.4M_fragment`) are recommended.

Use the --list flag to print out all models available for download
```bash
jaeger download --list
```
Then to download the model and add it to the model path run
```bash
jaeger download --path /path/to/store/models --model_name jaeger_38341_1.4M
```
If you decide to change the model path later, or if you have a directory with newly trained/tuned models
register the path
```bash
jaeger register-models --path /new/model/path
```

---
#### Visualizing predictions
---

You can use [phage_contig_annotator](https://github.com/Yasas1994/phage_contig_annotator) to annotate and visualize Jaeger predictions.

---
#### Acknowledgements
---

This work was supported by the European Union's Horizon 2020 research and innovation program, under the Marie Skłodowska-Curie Actions Innovative Training Networks grant agreement no. 955974 ([VIROINF](https://viroinf.eu/)), the European Research Council (ERC) Consolidator grant 865694

<img src=https://github.com/Yasas1994/Jaeger/assets/34155351/0cad76c6-6e4d-4b89-8e73-257137cf73a8 width="210" height="84">  &nbsp;&nbsp;&nbsp;  <img src=https://github.com/Yasas1994/Jaeger/assets/34155351/fef3bc35-8a8c-44c9-85ca-35ab0c68130e width="100" height="100">  &nbsp;&nbsp;&nbsp;   <img src=https://github.com/Yasas1994/Jaeger/assets/34155351/f15ab9b6-cade-4315-941c-e897f753dad9 width="150" height="100">

The ascii art logo is from  <font size="3"> https://ascii.co.uk/ </font>
