Metadata-Version: 2.4
Name: opencypher-compliance
Version: 1.0.0
Summary: Measure graph database compliance against the openCypher 9 specification
Author-email: OpenCypher Compliance Contributors <jalakoo@gmail.com>
License-Expression: MIT
Project-URL: Homepage, https://github.com/jalakoo/opencypher-compliance
Project-URL: Repository, https://github.com/jalakoo/opencypher-compliance
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Topic :: Database
Classifier: Topic :: Software Development :: Testing
Requires-Python: >=3.10
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: typer>=0.9
Requires-Dist: pyyaml>=6.0
Requires-Dist: rich>=13.0
Provides-Extra: bolt
Requires-Dist: neo4j>=5.0; extra == "bolt"
Provides-Extra: falkordb
Requires-Dist: falkordb>=1.0; extra == "falkordb"
Provides-Extra: all
Requires-Dist: neo4j>=5.0; extra == "all"
Requires-Dist: falkordb>=1.0; extra == "all"
Provides-Extra: dev
Requires-Dist: pytest>=7.0; extra == "dev"
Requires-Dist: pytest-cov>=4.0; extra == "dev"
Dynamic: license-file

# opencypher-compliance

Measure graph database compliance against the **openCypher 9** specification.

![openCypher Compliance Report](https://github.com/jalakoo/opencypher-compliance/blob/main/assets/report_2026_03_17.png)


## Table of Contents

- [Installation](#installation)
- [Quickstart](#quickstart)
- [Configuration](#configuration)
- [Usage](#usage)
- [Adapters](#adapters)
- [HTML Report](#html-report)
- [Output Format](#output-format)
- [Development](#development)
- [License](#license)

## Installation

Install the core package with the adapter extra for your database:

```bash
pip install "opencypher-compliance[bolt]"         # Neo4j, Memgraph, ArcadeDB
pip install "opencypher-compliance[falkordb]"     # FalkorDB
pip install "opencypher-compliance[all]"          # All adapters
```

> **Note:** The quotes around the package name are required in zsh (the default shell on macOS). Bash does not require them, but they work in both.

**From source:**

```bash
git clone https://github.com/<org>/opencypher-compliance.git
cd opencypher-compliance
pip install -e ".[all]"
```

The adapter extras install the required third-party driver for each database (e.g. `bolt` installs the `neo4j` driver, `falkordb` installs the `falkordb` driver). The core package includes the adapter code but not the drivers — you'll get a `ModuleNotFoundError` at runtime if the driver isn't installed.

## Quickstart

```bash
# 1) Start from the sample config (edit values as needed)
cp sample.config.yaml config.yaml

# 2) Test a single database
opencypher-compliance run -a bolt --host localhost --port 7687 -u neo4j -p test

# 3) Test multiple databases from one config file
opencypher-compliance run -c config.yaml

# 4) Run and open an HTML comparison report in one step
opencypher-compliance run -c config.yaml -r
```

## Configuration

### Docker setup

This project will include ready‑to‑use Docker commands / `docker-compose` snippets for common databases (Neo4j, Memgraph, FalkorDB, ArcadeDB).

**Examples:**
- **ArcadeDB**: Container with Bolt plugin and default db setup on port `7001`
```
docker run --rm \
  -p 2480:2480 -p 2424:2424 -p 7001:7687 \
  -e "JAVA_OPTS=-Darcadedb.server.rootPassword=playwithdata -Darcadedb.server.defaultDatabases=myDB[root] -Darcadedb.server.plugins=Bolt:com.arcadedb.bolt.BoltProtocolPlugin" \
  arcadedata/arcadedb:latest
```
- **FalkorDB**: Redis‑compatible container with the FalkorDB module load on port `7002`, also runs default browser UI on port 7011
```
docker run -p 7001:6379 -p 7011:3000 -it --rm falkordb/falkordb:latest
```

- **Memgraph**: Memgraph platform image with Bolt exposed on port `7003`
```
docker run -d \
  --name memgraph-mage \
  --network memgraph-net \
  -p 7003:7687 \
  -p 7444:7444 \
  memgraph/memgraph-mage:latest
```
- **Neo4j**: Single instance container with Bolt exposed on `7004`
```
docker run \
  --name neo4j-docker \
  -p 7474:7474 -p 7473:7473 -p 7004:7687 \
  -e NEO4J_AUTH=neo4j/abcd1234 \
  -e NEO4J_ACCEPT_LICENSE_AGREEMENT=yes \
  neo4j:5 
```

These examples will live in a dedicated `docker/` directory and be referenced here once finalized. For now, use your own Docker setup to expose the correct host/port for each database and point `config.yaml` at those values.

### Config file (config.yaml)

Config files use a `databases` list — even for a single database:

```yaml
databases:
  - name: "ArcadeDB"
    adapter: "bolt"
    host: "localhost"
    port: 7001
    auth: { username: "root", password: "playwithdata" }

  - name: "FalkorDB"
    adapter: "falkordb"
    host: "localhost"
    port: 7002

  - name: "Memgraph"
    adapter: "bolt"
    host: "localhost"
    port: 7003

  - name: "Neo4j"
    adapter: "bolt"
    host: "localhost"
    port: 7004
    auth:
      username: "neo4j"
      password: "password"
    options:
      database_name: "neo4j"
    query_timeout: 10
    pre_run: "MATCH (n) DETACH DELETE n"
    post_run: "MATCH (n) DETACH DELETE n"

```

Each entry supports: `adapter` (required), `name`, `host`, `port`, `auth`, `options`, `query_timeout`, `pre_run`, `post_run`.

All databases are tested sequentially. Each run is independent — if one database is unreachable, the others still run.

### Environment variables

```bash
export CYPHER_COMPLIANCE_ADAPTER="bolt"
export CYPHER_COMPLIANCE_HOST="localhost"
export CYPHER_COMPLIANCE_PORT="7687"
export CYPHER_COMPLIANCE_USERNAME="neo4j"
export CYPHER_COMPLIANCE_PASSWORD="secret"
```

Precedence: CLI flags > environment variables > config.yaml

## Usage

### Run compliance tests

```bash
# Single database with config file
opencypher-compliance run -c config.yaml

# Single database with inline flags (no config file needed)
opencypher-compliance run -a bolt --host localhost --port 7687 -u neo4j -p password

# Multiple databases from one config
opencypher-compliance run -c config.yaml

# Run and open an HTML report in the browser
opencypher-compliance run -c config.yaml -r

# Filter by type
opencypher-compliance run -c config.yaml --types function --format json

# Verbose output
opencypher-compliance run -c config.yaml --verbose

# Pipe JSON to jq (single database only)
opencypher-compliance run -c config.yaml --quiet | jq '.results[] | select(.result=="fail")'

# Stop on first failure
opencypher-compliance run -c config.yaml --fail-fast
```

### List tests

```bash
opencypher-compliance list
opencypher-compliance list --types operator
```

### Validate catalog

```bash
opencypher-compliance validate
opencypher-compliance validate --catalog-dir ./my-custom-catalog/
```

### Generate HTML report

After running tests, generate a self-contained HTML report that compares results side by side:

```bash
# Run all databases, then generate report
opencypher-compliance run -c config.yaml
opencypher-compliance report

# Custom input/output paths, don't auto-open
opencypher-compliance report -i ./my-results -o ./reports --no-open

# Custom filename
opencypher-compliance report --filename neo4j-vs-falkordb.html
```

The report is a single HTML file with no external dependencies. It includes:
- **Dashboard cards** — per-database pass rate, version info, and bar charts
- **Type breakdown table** — pass counts by element type across databases, with expandable failure details
- **Comparison table** — every test element as a row, every database as a column, with pass/fail/error badges
- **Filters** — text search, type dropdown, and status dropdown
- **Sorting** — click column headers to sort
- **Error details** — hover over failed badges to see the error message

### Python API

```python
from opencypher_compliance import run_compliance

results = run_compliance(
    config={"database": {"adapter": "bolt", "host": "localhost", "port": 7687,
                         "auth": {"username": "neo4j", "password": "test"}}},
)
print(f"Pass rate: {results['metadata']['pass_rate']}")
```

## Adapters

| Adapter | Database(s) | Install extra | Protocol |
|---------|------------|---------------|----------|
| `bolt` | Neo4j, Memgraph, ArcadeDB | `[bolt]` | Bolt |
| `falkordb` | FalkorDB | `[falkordb]` | Redis |

Each adapter is a thin wrapper (~20 lines) around the database's official Python package.

## Output Format

### JSON

```json
{
  "metadata": {
    "database": "Neo4j Community 5.x",
    "adapter": "bolt",
    "total_tests": 133,
    "passed": 129,
    "failed": 4,
    "pass_rate": "96.99%"
  },
  "summary_by_type": {
    "clause": {"total": 18, "passed": 17, "failed": 1}
  },
  "results": [
    {"element": "MATCH", "type": "clause", "result": "pass", "duration_ms": 12}
  ]
}
```

### CSV

```csv
element,type,result,error,duration_ms
MATCH,clause,pass,,12
MANDATORY MATCH,clause,fail,"SyntaxError: MANDATORY is not supported",3
```

## Development

```bash
# Install in dev mode
pip install -e ".[dev]"

# Run tests
pytest -v

# Run specific test file
pytest tests/test_comparator.py -v
```

### Test catalog

The built-in catalog contains ~133 tests covering:
- 18 clauses (MATCH, CREATE, DELETE, MERGE, etc.)
- 25 operators (math, comparison, boolean, string, list)
- 76 functions (scalar, aggregate, list, math, string, temporal)
- 4 expressions (CASE, list comprehension, map projection)
- 10 data types (Boolean, Integer, Float, String, List, Map, etc.)

### Custom catalogs

Place YAML files in a directory and use `--catalog-dir`:

```bash
opencypher-compliance run -c config.yaml --catalog-dir ./my-catalog/
```

Each YAML file should follow the test definition schema:

```yaml
tests:
  - name: "My Test"
    type: "clause"
    query: "RETURN 1 AS x"
    expected_columns: ["x"]
    expected_rows: [[1]]
```

## License

This project is open source and available under the **MIT License**. See the `LICENSE` file at the repository root for full terms.
