Metadata-Version: 2.4
Name: imajin
Version: 1.3.3
Summary: Search tool for Japanese text in EPUB and Mokuro files
Project-URL: Repository, https://github.com/YonKuma/imajin.py
Author-email: YonKuma <kumarei@gmail.com>
License-Expression: CC-PDDC
License-File: LICENSE.txt
Keywords: ebook,example-sentences,japanese,manga,text-search
Classifier: Environment :: Console
Classifier: Intended Audience :: End Users/Desktop
Classifier: Natural Language :: Japanese
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Utilities
Requires-Python: >=3.9
Requires-Dist: beautifulsoup4
Requires-Dist: lxml
Requires-Dist: mecab-python3
Requires-Dist: unidic-lite
Description-Content-Type: text/markdown

# imajin.py

**imajin.py** is a search tool for `.epub` and `.mokuro` files, designed to help you find example sentences in your Japanese books and manga.

---

## Index

- [Features](#features)
- [Installation](#installation)  
  - [Install with pip (preferred method)](#install-with-pip-preferred-method)  
  - [Download and install script](#download-and-install-script)  
  - [Dependencies](#dependencies)
- [Usage](#usage)  
  - [Positional Arguments](#positional-arguments)  
  - [Options](#options)
- [Examples](#examples)  
  - [Installed](#installed)  
  - [As a script](#as-a-script)
- [Saving Results](#saving-results)
- [Notes](#notes)
- [License](#license)

---

## Features

- Search across unencrypted `.epub` (ebooks) and `.mokuro` (manga) files
- Supports fuzzy matching for Japanese conjugations (optional)
- Supports searching individual words or phrases
- Structured output: text, markdown, or JSON
- Recursively search directories of books and manga
- Clean highlighted snippets showing surrounding context

---

## Installation

### Install with pip (preferred method)

Recommended because it will be easier to update going forward, especially if you prefer to have the script installed.

1. Install Python 3.9+ if not allready installed
2. Install imajin using pip:

    ```bash
    pip install imajin
    ```

    You should be all set up to use `imajin` directly from the command line. 

### Download and install script

Not recommended, because it's harder to update.

1. Install Python 3.9+ if not already installed.
2. Download the source folder, either by downloading the Source code from the [Latest Release](https://github.com/YonKuma/imajin.py/releases/latest) or using `git clone https://github.com/YonKuma/imajin.py.git`
3. Install dependencies:

   ```bash
   pip install -r requirements.txt
   ```

   You can now use `imajin.py` as a python script. For example, if you're in the source directory, you can use `python imajin.py -r 蹴散らす ~/Documents/Manga`, but you'll need to use a qualified path to the files you want to search
4. (Optional) You can install imajin by copying the script to somewhere in your PATH and making it executable

    ```bash
    sudo cp imajin.py /usr/local/bin/imajin
    sudo chmod +x /usr/local/bin/imajin
    ```

### Dependencies

Dependencies:
- [beautifulsoup4](https://pypi.org/project/beautifulsoup4/)
- [lxml](https://pypi.org/project/lxml/)
- [mecab-python3](https://github.com/SamuraiT/mecab-python3) (optional; fuzzy matching will be disabled if MeCab is not available)
- unidic-lite (optional; used for fuzzy matching. Other dictionaries should work but are harder to install)

---

## Usage

Installed:

```bash
imajin [options] <search_word> <file_or_directory>
```

As a script:

```bash
python imajin.py [options] <search_word> <file_or_directory>
```

### Positional Arguments
| Argument | Description |
|:---------|:------------|
| `<search_word>` | The word or phrase you want to find. |
| `<file_or_directory>` | A single `.epub` or `.mokuro` file, or a directory containing them. |

### Options
| Option | Description |
|:-------|:------------|
| `--no-fuzzy` | Disable fuzzy matching (only exact matches). |
| `-r`, `--recursive` | Recursively search subdirectories if a directory is specified. |
| `--format {text,markdown,md,json}` | Choose output format (default: `text`). |
| `-h`, `--help` | Show help message and exit. |

---

## Examples

### Installed

Search for the word "慌ただしい" inside your book collection:

```bash
imajin 慌ただしい ./books/
```

Find exact matches only, searching all subdirectories:

```bash
imajin --no-fuzzy -r 慌ただしい ./novel-library/
```

Get markdown-formatted results:

```bash
imajin 慌ただしい ./books/ --format markdown
```

Save the results in a JSON file for further processing:

```bash
imajin 慌ただしい ./manga-collection/ --format json > results.json
```

### As a script

Search for the word "慌ただしい" inside your book collection:

```bash
python imajin.py 慌ただしい ./books/
```

Find exact matches only, searching all subdirectories:

```bash
python imajin.py --no-fuzzy -r 慌ただしい ./novel-library/
```

Get markdown-formatted results:

```bash
python imajin.py 慌ただしい ./books/ --format markdown
```

Save the results in a JSON file for further processing:

```bash
python imajin.py 慌ただしい ./manga-collection/ --format json > results.json
```

---

## Saving Results

To save your search results to a file, redirect the output:

```bash
imajin 慌ただしい ./books/ --format md > examples.md
```

This method works for all output formats (text, markdown/md, or JSON).

---

## Notes

- If MeCab is not installed, fuzzy matching will be automatically disabled.

---

## License

This project is released under the [CC0 1.0 Universal Public Domain Dedication](LICENSE.txt).
