Metadata-Version: 2.4
Name: aragora
Version: 2.7.4
Summary: Control plane for multi-agent vetted decisionmaking across org knowledge and channels
Author: Agora Contributors
License-Expression: MIT
Project-URL: Homepage, https://aragora.ai
Project-URL: Documentation, https://github.com/an0mium/aragora#readme
Project-URL: Repository, https://github.com/an0mium/aragora
Project-URL: Issues, https://github.com/an0mium/aragora/issues
Keywords: ai,multi-agent,control-plane,deliberation,llm,claude,gpt,consensus,enterprise,decision-assurance
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Typing :: Typed
Requires-Python: >=3.10
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: aiohttp<4.0,>=3.13.3
Requires-Dist: websockets<15.1,>=13.0
Requires-Dist: pyyaml<7.0,>=6.0
Requires-Dist: pydantic<3.0,>=2.0
Requires-Dist: pydantic-settings<3.0,>=2.0
Requires-Dist: bcrypt<6.0,>=4.0
Requires-Dist: cryptography<48.0,>=46.0
Requires-Dist: markupsafe<4.0,>=2.1.0
Requires-Dist: defusedxml<1.0,>=0.7
Requires-Dist: pyotp<3.0,>=2.9
Requires-Dist: jinja2<4.0,>=3.1.6
Requires-Dist: urllib3<3.0,>=2.6.3
Requires-Dist: httpx<1.0,>=0.27
Requires-Dist: numpy<3.0,>=2.0
Requires-Dist: watchfiles<2.0,>=0.21
Requires-Dist: boto3<2.0,>=1.34
Requires-Dist: PyJWT<3.0,>=2.8
Requires-Dist: fastapi<1.0,>=0.109.0
Requires-Dist: uvicorn[standard]<1.0,>=0.27.0
Requires-Dist: aragora-debate<1.0,>=0.2.0
Provides-Extra: dev
Requires-Dist: pytest<10.0,>=7.0; extra == "dev"
Requires-Dist: pytest-asyncio<2.0,>=0.21; extra == "dev"
Requires-Dist: pytest-benchmark<6.0,>=4.0; extra == "dev"
Requires-Dist: pytest-cov<8.0,>=4.0; extra == "dev"
Requires-Dist: pytest-timeout<3.0,>=2.0; extra == "dev"
Requires-Dist: pytest-xdist<4.0,>=3.5; extra == "dev"
Requires-Dist: pytest-rerunfailures<15.0,>=14.0; extra == "dev"
Requires-Dist: pytest-randomly<5.0,>=3.15; extra == "dev"
Requires-Dist: black<27.0,>=23.0; extra == "dev"
Requires-Dist: ruff<1.0,>=0.1; extra == "dev"
Requires-Dist: bandit<2.0,>=1.7; extra == "dev"
Requires-Dist: mypy<2.0,>=1.8; extra == "dev"
Requires-Dist: mutmut<4.0,>=3.0; extra == "dev"
Requires-Dist: pre-commit<5.0,>=3.6; extra == "dev"
Requires-Dist: datamodel-code-generator<1.0,>=0.25; extra == "dev"
Requires-Dist: async-timeout<6.0,>=4.0; extra == "dev"
Requires-Dist: python3-saml<2.0,>=1.15; extra == "dev"
Requires-Dist: tiktoken<1.0,>=0.5; extra == "dev"
Provides-Extra: test
Requires-Dist: pytest<10.0,>=7.0; extra == "test"
Requires-Dist: pytest-asyncio<2.0,>=0.21; extra == "test"
Requires-Dist: pytest-benchmark<6.0,>=4.0; extra == "test"
Requires-Dist: pytest-cov<8.0,>=4.0; extra == "test"
Requires-Dist: pytest-timeout<3.0,>=2.0; extra == "test"
Requires-Dist: pytest-xdist<4.0,>=3.5; extra == "test"
Requires-Dist: pytest-rerunfailures<15.0,>=14.0; extra == "test"
Requires-Dist: pytest-randomly<5.0,>=3.15; extra == "test"
Requires-Dist: black<27.0,>=23.0; extra == "test"
Requires-Dist: ruff<1.0,>=0.1; extra == "test"
Requires-Dist: bandit<2.0,>=1.7; extra == "test"
Requires-Dist: mypy<2.0,>=1.8; extra == "test"
Requires-Dist: mutmut<4.0,>=3.0; extra == "test"
Requires-Dist: pre-commit<5.0,>=3.6; extra == "test"
Requires-Dist: datamodel-code-generator<1.0,>=0.25; extra == "test"
Requires-Dist: async-timeout<6.0,>=4.0; extra == "test"
Requires-Dist: aiosqlite<1.0,>=0.19; extra == "test"
Requires-Dist: supabase<3.0,>=2.0; extra == "test"
Requires-Dist: redis<8.0,>=5.0.0; extra == "test"
Requires-Dist: asyncpg<1.0,>=0.29.0; extra == "test"
Requires-Dist: yt-dlp<2027.0,>=2024.1; extra == "test"
Requires-Dist: tiktoken<1.0,>=0.5; extra == "test"
Requires-Dist: openai<3.0,>=2.0; extra == "test"
Requires-Dist: twilio<10.0,>=8.0; extra == "test"
Requires-Dist: mcp<2.0,>=1.0; extra == "test"
Requires-Dist: langchain<2.0,>=0.1; extra == "test"
Requires-Dist: weaviate-client<5.0,>=4.0; extra == "test"
Requires-Dist: z3-solver<5.0,>=4.12; extra == "test"
Requires-Dist: pytest-benchmark<5.0,>=4.0; extra == "test"
Requires-Dist: boto3<2.0,>=1.34; extra == "test"
Requires-Dist: weasyprint<70.0,>=68.0; extra == "test"
Requires-Dist: reportlab<5.0,>=3.6; extra == "test"
Requires-Dist: scikit-learn<2.0,>=1.5.0; extra == "test"
Requires-Dist: sentence-transformers<6.0,>=3.0.0; extra == "test"
Requires-Dist: pydub<1.0,>=0.25.0; extra == "test"
Requires-Dist: duckduckgo-search<9.0,>=6.0; extra == "test"
Provides-Extra: monitoring
Requires-Dist: prometheus-client<1.0,>=0.19; extra == "monitoring"
Requires-Dist: sentry-sdk<3.0,>=2.0; extra == "monitoring"
Provides-Extra: blockchain
Requires-Dist: web3<8.0,>=7.0; extra == "blockchain"
Requires-Dist: eth-account<1.0,>=0.13; extra == "blockchain"
Provides-Extra: observability
Requires-Dist: opentelemetry-api<2.0,>=1.20.0; extra == "observability"
Requires-Dist: opentelemetry-sdk<2.0,>=1.20.0; extra == "observability"
Requires-Dist: opentelemetry-exporter-otlp<2.0,>=1.20.0; extra == "observability"
Requires-Dist: opentelemetry-instrumentation-logging<1.0,>=0.41b0; extra == "observability"
Requires-Dist: prometheus-client<1.0,>=0.19; extra == "observability"
Provides-Extra: redis
Requires-Dist: redis<8.0,>=5.0.0; extra == "redis"
Provides-Extra: persistence
Requires-Dist: supabase<3.0,>=2.0; extra == "persistence"
Requires-Dist: sqlalchemy<3.0,>=2.0.40; extra == "persistence"
Provides-Extra: postgres
Requires-Dist: asyncpg<1.0,>=0.29.0; extra == "postgres"
Requires-Dist: alembic<2.0,>=1.13.0; extra == "postgres"
Requires-Dist: nest_asyncio<2.0,>=1.5; extra == "postgres"
Provides-Extra: documents
Requires-Dist: pypdf<7.0,>=6.6; extra == "documents"
Requires-Dist: pdfplumber<1.0,>=0.10; extra == "documents"
Requires-Dist: python-docx<2.0,>=0.8; extra == "documents"
Requires-Dist: openpyxl<4.0,>=3.1; extra == "documents"
Requires-Dist: python-pptx<1.0,>=0.6; extra == "documents"
Requires-Dist: ebooklib<1.0,>=0.18; extra == "documents"
Requires-Dist: beautifulsoup4<5.0,>=4.12; extra == "documents"
Requires-Dist: mobi<1.0,>=0.3; extra == "documents"
Requires-Dist: weasyprint<70.0,>=68.0; extra == "documents"
Provides-Extra: broadcast
Requires-Dist: edge-tts<8.0,>=6.1.0; extra == "broadcast"
Requires-Dist: pydub<1.0,>=0.25.0; extra == "broadcast"
Requires-Dist: pyttsx3<3.0,>=2.90; (platform_system == "Darwin" or platform_system == "Windows") and extra == "broadcast"
Provides-Extra: broadcast-elevenlabs
Requires-Dist: edge-tts<8.0,>=6.1.0; extra == "broadcast-elevenlabs"
Requires-Dist: pydub<1.0,>=0.25.0; extra == "broadcast-elevenlabs"
Requires-Dist: pyttsx3<3.0,>=2.90; (platform_system == "Darwin" or platform_system == "Windows") and extra == "broadcast-elevenlabs"
Requires-Dist: elevenlabs<3.0,>=1.0; extra == "broadcast-elevenlabs"
Provides-Extra: broadcast-polly
Requires-Dist: edge-tts<8.0,>=6.1.0; extra == "broadcast-polly"
Requires-Dist: pydub<1.0,>=0.25.0; extra == "broadcast-polly"
Requires-Dist: pyttsx3<3.0,>=2.90; (platform_system == "Darwin" or platform_system == "Windows") and extra == "broadcast-polly"
Requires-Dist: boto3<2.0,>=1.34; extra == "broadcast-polly"
Provides-Extra: broadcast-premium
Requires-Dist: edge-tts<8.0,>=6.1.0; extra == "broadcast-premium"
Requires-Dist: pydub<1.0,>=0.25.0; extra == "broadcast-premium"
Requires-Dist: pyttsx3<3.0,>=2.90; (platform_system == "Darwin" or platform_system == "Windows") and extra == "broadcast-premium"
Requires-Dist: elevenlabs<3.0,>=1.0; extra == "broadcast-premium"
Requires-Dist: boto3<2.0,>=1.34; extra == "broadcast-premium"
Provides-Extra: research
Requires-Dist: duckduckgo-search<9.0,>=6.0; extra == "research"
Requires-Dist: beautifulsoup4<5.0,>=4.12; extra == "research"
Provides-Extra: gauntlet
Provides-Extra: ml
Requires-Dist: scipy<2.0,>=1.14.0; extra == "ml"
Requires-Dist: scikit-learn<2.0,>=1.5.0; extra == "ml"
Requires-Dist: sentence-transformers<6.0,>=3.0.0; extra == "ml"
Provides-Extra: rlm
Provides-Extra: control-plane
Requires-Dist: redis<8.0,>=5.0.0; extra == "control-plane"
Requires-Dist: networkx<4.0,>=3.0; extra == "control-plane"
Provides-Extra: code-intel
Requires-Dist: tree-sitter<1.0,>=0.23.0; extra == "code-intel"
Requires-Dist: tree-sitter-python<1.0,>=0.23.0; extra == "code-intel"
Requires-Dist: tree-sitter-javascript<1.0,>=0.23.0; extra == "code-intel"
Requires-Dist: tree-sitter-typescript<1.0,>=0.23.0; extra == "code-intel"
Requires-Dist: tree-sitter-go<1.0,>=0.23.0; extra == "code-intel"
Requires-Dist: tree-sitter-rust<1.0,>=0.23.0; extra == "code-intel"
Requires-Dist: tree-sitter-java<1.0,>=0.23.0; extra == "code-intel"
Requires-Dist: networkx<4.0,>=3.0; extra == "code-intel"
Provides-Extra: browser
Requires-Dist: playwright<2.0,>=1.40; extra == "browser"
Provides-Extra: gateway
Provides-Extra: enterprise
Requires-Dist: python3-saml<2.0,>=1.15; extra == "enterprise"
Requires-Dist: prometheus-client<1.0,>=0.19; extra == "enterprise"
Requires-Dist: sentry-sdk<3.0,>=2.0; extra == "enterprise"
Requires-Dist: redis<8.0,>=5.0.0; extra == "enterprise"
Provides-Extra: connectors
Requires-Dist: aiokafka<1.0,>=0.9; extra == "connectors"
Requires-Dist: aio-pika<10.0,>=9.0; extra == "connectors"
Requires-Dist: twilio<10.0,>=8.0; extra == "connectors"
Requires-Dist: langchain<2.0,>=0.1; extra == "connectors"
Requires-Dist: playwright<2.0,>=1.40; extra == "connectors"
Provides-Extra: experimental
Requires-Dist: playwright<2.0,>=1.40; extra == "experimental"
Requires-Dist: networkx<4.0,>=3.0; extra == "experimental"
Provides-Extra: all
Requires-Dist: pytest<10.0,>=7.0; extra == "all"
Requires-Dist: pytest-asyncio<2.0,>=0.21; extra == "all"
Requires-Dist: pytest-benchmark<6.0,>=4.0; extra == "all"
Requires-Dist: pytest-cov<8.0,>=4.0; extra == "all"
Requires-Dist: pytest-timeout<3.0,>=2.0; extra == "all"
Requires-Dist: pytest-xdist<4.0,>=3.5; extra == "all"
Requires-Dist: pytest-rerunfailures<15.0,>=14.0; extra == "all"
Requires-Dist: pytest-randomly<5.0,>=3.15; extra == "all"
Requires-Dist: black<27.0,>=23.0; extra == "all"
Requires-Dist: ruff<1.0,>=0.1; extra == "all"
Requires-Dist: bandit<2.0,>=1.7; extra == "all"
Requires-Dist: mypy<2.0,>=1.8; extra == "all"
Requires-Dist: mutmut<4.0,>=3.0; extra == "all"
Requires-Dist: pre-commit<5.0,>=3.6; extra == "all"
Requires-Dist: datamodel-code-generator<1.0,>=0.25; extra == "all"
Requires-Dist: async-timeout<6.0,>=4.0; extra == "all"
Requires-Dist: aiosqlite<1.0,>=0.19; extra == "all"
Requires-Dist: supabase<3.0,>=2.0; extra == "all"
Requires-Dist: redis<8.0,>=5.0.0; extra == "all"
Requires-Dist: asyncpg<1.0,>=0.29.0; extra == "all"
Requires-Dist: yt-dlp<2027.0,>=2024.1; extra == "all"
Requires-Dist: tiktoken<1.0,>=0.5; extra == "all"
Requires-Dist: openai<3.0,>=2.0; extra == "all"
Requires-Dist: twilio<10.0,>=8.0; extra == "all"
Requires-Dist: mcp<2.0,>=1.0; extra == "all"
Requires-Dist: langchain<2.0,>=0.1; extra == "all"
Requires-Dist: weaviate-client<5.0,>=4.0; extra == "all"
Requires-Dist: z3-solver<5.0,>=4.12; extra == "all"
Requires-Dist: boto3<2.0,>=1.34; extra == "all"
Requires-Dist: weasyprint<70.0,>=68.0; extra == "all"
Requires-Dist: reportlab<5.0,>=3.6; extra == "all"
Requires-Dist: scikit-learn<2.0,>=1.5.0; extra == "all"
Requires-Dist: sentence-transformers<6.0,>=3.0.0; extra == "all"
Requires-Dist: pydub<1.0,>=0.25.0; extra == "all"
Requires-Dist: sqlalchemy<3.0,>=2.0.40; extra == "all"
Requires-Dist: pypdf<7.0,>=6.6; extra == "all"
Requires-Dist: pdfplumber<1.0,>=0.10; extra == "all"
Requires-Dist: python-docx<2.0,>=0.8; extra == "all"
Requires-Dist: openpyxl<4.0,>=3.1; extra == "all"
Requires-Dist: python-pptx<1.0,>=0.6; extra == "all"
Requires-Dist: ebooklib<1.0,>=0.18; extra == "all"
Requires-Dist: beautifulsoup4<5.0,>=4.12; extra == "all"
Requires-Dist: mobi<1.0,>=0.3; extra == "all"
Requires-Dist: edge-tts<8.0,>=6.1.0; extra == "all"
Requires-Dist: pyttsx3<3.0,>=2.90; (platform_system == "Darwin" or platform_system == "Windows") and extra == "all"
Requires-Dist: elevenlabs<3.0,>=1.0; extra == "all"
Requires-Dist: duckduckgo-search<9.0,>=6.0; extra == "all"
Requires-Dist: prometheus-client<1.0,>=0.19; extra == "all"
Requires-Dist: sentry-sdk<3.0,>=2.0; extra == "all"
Requires-Dist: scipy<2.0,>=1.14.0; extra == "all"
Requires-Dist: sentence-transformers<6.0,>=3.0.0; extra == "all"
Requires-Dist: networkx<4.0,>=3.0; extra == "all"
Requires-Dist: alembic<2.0,>=1.13.0; extra == "all"
Requires-Dist: nest_asyncio<2.0,>=1.5; extra == "all"
Requires-Dist: opentelemetry-api<2.0,>=1.20.0; extra == "all"
Requires-Dist: opentelemetry-sdk<2.0,>=1.20.0; extra == "all"
Requires-Dist: opentelemetry-exporter-otlp<2.0,>=1.20.0; extra == "all"
Requires-Dist: opentelemetry-instrumentation-logging<1.0,>=0.41b0; extra == "all"
Requires-Dist: tree-sitter<1.0,>=0.23.0; extra == "all"
Requires-Dist: tree-sitter-python<1.0,>=0.23.0; extra == "all"
Requires-Dist: tree-sitter-javascript<1.0,>=0.23.0; extra == "all"
Requires-Dist: tree-sitter-typescript<1.0,>=0.23.0; extra == "all"
Requires-Dist: tree-sitter-go<1.0,>=0.23.0; extra == "all"
Requires-Dist: tree-sitter-rust<1.0,>=0.23.0; extra == "all"
Requires-Dist: tree-sitter-java<1.0,>=0.23.0; extra == "all"
Requires-Dist: playwright<2.0,>=1.40; extra == "all"
Requires-Dist: aiokafka<1.0,>=0.9; extra == "all"
Requires-Dist: aio-pika<10.0,>=9.0; extra == "all"
Requires-Dist: python3-saml<2.0,>=1.15; extra == "all"
Dynamic: license-file

# Aragora

Aragora orchestrates 42 AI agents to adversarially vet decisions through structured debate, delivering audit-ready decision receipts. Built for enterprises where AI decisions carry real consequences.

### The Decision Integrity Platform

[![PyPI](https://img.shields.io/pypi/v/aragora)](https://pypi.org/project/aragora/)
[![Tests](https://github.com/an0mium/aragora/actions/workflows/test.yml/badge.svg)](https://github.com/an0mium/aragora/actions/workflows/test.yml)
[![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/downloads/)
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)

**New here?** Start with the [Getting Started Guide](docs/START_HERE.md) -- you'll have a working demo in 30 seconds.

**Individual LLMs are unreliable. Their personas shift with context, their confidence doesn't correlate with accuracy, and they say what you want to hear. For consequential decisions, you need infrastructure that treats this as a feature to be engineered around, not a problem to be ignored.**

Aragora orchestrates 42 agent types in structured adversarial debates -- forcing models to challenge each other's reasoning, surface blind spots, and produce decisions with complete audit trails showing where they agreed, where they disagreed, and why.

## Try It Now

```bash
pip install aragora

# Zero-config demo — runs a full adversarial debate, no API keys needed
aragora demo

# Or run the guided quickstart (opens receipt in your browser)
aragora quickstart --demo
```

**Or run with Docker (includes dashboard UI):**

```bash
docker compose -f docker-compose.quickstart.yml up
# Open http://localhost:3000
```

<details>
<summary>What you'll see (click to expand)</summary>

```
================================================================
  ARAGORA DEMO -- Adversarial Decision Stress-Test
================================================================

  Topic:  Should we adopt microservices?
  Agents: Analyst, Critic, Synthesizer, Devil's Advocate
  Rounds: 2

  --- Round 1 --------------------------------------------------

  [ANALYST] (supportive)
    This is a sound strategy. The evidence points toward
    significant gains in maintainability and team productivity.

  [CRITIC] (critical)
    The claimed benefits are overstated. Most organizations
    underestimated the operational burden by 3-5x. I recommend
    a modular monolith as the safer path.

  [SYNTHESIZER] (balanced)
    The tradeoffs here are real. On one hand, the current
    architecture limits independent scaling. On the other,
    the migration carries execution risk.

  --- Decision Receipt -----------------------------------------

  Verdict:    CONDITIONAL APPROVAL
  Confidence: 72%
  Consensus:  Partial (3 of 4 agents)
  Dissent:    Devil's Advocate flagged migration risk
```

</details>

```bash
# Review your current changes against main
git diff main | aragora review --demo

# Or review a GitHub PR
aragora review --pr https://github.com/org/repo/pull/123 --demo
```

```bash
# Stress-test a specification
aragora gauntlet spec.md --profile thorough --output receipt.html

# Run a multi-agent debate
aragora ask "Design a rate limiter for 1M req/sec" --agents anthropic-api,openai-api,gemini

# Start the API server
aragora serve
```

### Add to Your CI Pipeline (1 minute)

```yaml
# .github/workflows/aragora-review.yml
name: Aragora Review
on:
  pull_request:
    types: [opened, synchronize]
jobs:
  review:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: an0mium/aragora@main
        with:
          anthropic-api-key: ${{ secrets.ANTHROPIC_API_KEY }}
          openai-api-key: ${{ secrets.OPENAI_API_KEY }}
```

Or generate it automatically: `aragora init --ci github`

---

## Five Pillars

Aragora is built on five architectural commitments designed for a world where individual AI agents cannot be trusted with consequential decisions alone.

### 1. SMB-Ready, Enterprise-Grade

Aragora is useful to a 5-person startup on day one and scales to regulated enterprise without rearchitecting. Enterprise features -- OIDC/SAML SSO, MFA, AES-256-GCM encryption, multi-tenant isolation, RBAC with 7 roles and 360+ permissions, SOC 2 / GDPR / HIPAA compliance frameworks -- are built in, not bolted on. Security hardening (rate limiting, SSRF protection, path traversal guards, input validation, audit trails) is the default, not a premium tier.

### 2. Leading-Edge Memory and Context

Single agents lose context. Aragora's 4-tier Continuum Memory (fast / medium / slow / glacial) and Knowledge Mound with 33 registered adapters give every debate access to institutional history, cross-session learning, and evidence provenance. The RLM (Recursive Language Models) system compresses and structures context to reduce prompt bloat, enabling debates that sustain coherence across long multi-round sessions and large document sets where individual models would degrade.

### 3. Extensible and Modular

Connectors for Slack, Teams, Discord, Telegram, WhatsApp, email, voice, Kafka, RabbitMQ, GitHub, Jira, Salesforce, healthcare HL7/FHIR, and dozens more. SDKs in Python and TypeScript (140 namespaces in the TypeScript SDK). 2,000+ API operations across 1,800+ paths and 190+ WebSocket event types. OpenClaw integration for portable agent governance. A workflow engine with DAG execution and 50+ templates. A marketplace for agent personas, debate templates, and workflow patterns. Aragora adapts to your stack -- not the other way around.

### 4. Multi-Agent Robustness

Individual LLMs exhibit persona instability -- their outputs shift based on framing, context, and even prompt ordering. Aragora treats this as a feature: by running Claude, GPT, Gemini, Grok, Mistral, DeepSeek, Qwen, Kimi, and local models in structured Propose / Critique / Revise debates, the system surfaces disagreements that reveal genuine uncertainty. ELO rankings track agent performance. Calibration scoring (Brier scores) measures prediction accuracy. The Trickster detects hollow consensus where models agree without genuine reasoning. The result: when models with different training data independently converge on an answer, that convergence is meaningful -- and when they disagree, the dissent trail tells you exactly where human judgment is needed.

### 5. Self-Healing and Self-Extending

The Nomic Loop is Aragora's autonomous self-improvement system: agents debate improvements to the codebase, design solutions, implement code, run tests, and verify changes -- with human approval gates and automatic rollback on failure. This is how Aragora grew from a debate engine to 3,200+ modules. Red-team mode stress-tests the platform's own specs. The Gauntlet runs adversarial attacks against proposed changes. The system hardens itself.

---

## Why Aragora?

A single LLM will confidently give you a wrong answer and you won't know it. Research shows that LLM personas are context-dependent, fragile under adversarial pressure, and prone to sycophantic agreement with whoever is asking. [Stanford's taxonomy of LLM reasoning failures](https://arxiv.org/abs/2602.06176) documents systematic breakdowns in formal logic, unfaithful chain-of-thought, and robustness failures under minor prompt variations -- exactly the failure modes that structured adversarial debate is designed to surface. When the decision matters -- hiring, architecture, compliance, strategy -- one model's opinion is insufficient.

Aragora treats each model as an **unreliable witness** and uses structured debate protocols to extract signal from their disagreements:

| What you get | How it works |
|---|---|
| **Adversarial Validation** | Models with different training data and blind spots challenge each other's reasoning |
| **Decision Receipts** | Cryptographic audit trails with evidence chains, dissent tracking, and confidence calibration |
| **Gauntlet Mode** | Red-team stress-tests for specs, policies, and architectures using adversarial personas |
| **Calibrated Trust** | ELO rankings and Brier scores track which models are actually reliable on which domains |
| **Institutional Memory** | Decisions persist across sessions with 4-tier memory and Knowledge Mound (33 adapters) |
| **Channel Delivery** | Results route to Slack, Teams, Discord, Telegram, WhatsApp, email, or voice |

---

## Quick Start

### 1. Install and Try It (30 seconds)

```bash
pip install aragora

# Run a zero-config demo debate — opens receipt in your browser
aragora quickstart --demo

# Or review your uncommitted changes — no API keys needed in demo mode
git diff main | aragora review --demo
```

See [docs/QUICKSTART_DEVELOPER.md](docs/QUICKSTART_DEVELOPER.md) for the full developer quickstart.

### 2. Run Debates and Start the Server

```bash
# Set at least one API key
export ANTHROPIC_API_KEY=your-key  # or OPENAI_API_KEY, GEMINI_API_KEY, XAI_API_KEY

# Run a multi-agent debate
aragora ask "Should we adopt microservices?" --agents anthropic-api,openai-api --rounds 3

# Start the API server
aragora serve
```

See [docs/guides/GETTING_STARTED.md](docs/guides/GETTING_STARTED.md) for the complete 5-minute setup.

### 3. Deploy with Docker

```bash
# Clone and deploy
git clone https://github.com/an0mium/aragora && cd aragora

# Production deployment (secrets from AWS Secrets Manager)
cd deploy/liftmode && ./setup.sh

# Or run directly with Docker Compose
docker compose -f deploy/docker-compose.yml up -d
```

See [docs/DEPLOYMENT.md](docs/DEPLOYMENT.md) for full deployment options (Docker, Kubernetes, offline mode).

### 4. Develop with the SDK

| Package | Install | Purpose | PyPI |
|---|---|---|---|
| `aragora` | `pip install aragora` | Full platform (server, CLI, debate engine) | [v2.6.3](https://pypi.org/project/aragora/) |
| `aragora-debate` | `pip install aragora-debate` | Standalone debate engine (no server needed) | [v0.2.0](https://pypi.org/project/aragora-debate/) |
| `aragora-sdk` | `pip install aragora-sdk` | Python client SDK for connecting to aragora | [v2.6.3](https://pypi.org/project/aragora-sdk/) |
| `@aragora/sdk` | `npm install @aragora/sdk` | TypeScript/Node.js client SDK | — |

---

## Core Workflows

### 1. Gauntlet Mode -- Adversarial Stress Testing

Stress-test specs, architectures, and policies before they ship:

```bash
aragora gauntlet spec.md --input-type spec --profile quick
aragora gauntlet policy.yaml --input-type policy --persona gdpr
aragora gauntlet architecture.md --profile thorough --output report.html
```

| Attack Type | What It Tests |
|---|---|
| **Red Team** | Security holes, injection points, auth bypasses |
| **Devil's Advocate** | Logic flaws, hidden assumptions, edge cases |
| **Scaling Critic** | Performance bottlenecks, SPOF, thundering herd |
| **Compliance** | GDPR, HIPAA, SOC 2, AI Act violations |

Decision receipts provide cryptographic audit trails for every finding.

### 2. AI Code Review

Get **multi-model consensus** on your pull requests:

```bash
git diff main | aragora review
aragora review https://github.com/owner/repo/pull/123
aragora review --demo  # try without API keys
```

When 3+ independent models with different training data agree on an issue, that convergence is meaningful. Split opinions show where human judgment is needed -- the disagreement is the signal.

### 3. Structured Debates

The debate protocol follows thesis > antithesis > synthesis:

1. **Propose** -- Agents generate initial responses from different perspectives
2. **Critique** -- Agents challenge each other's proposals with severity scores
3. **Revise** -- Proposers incorporate valid critiques
4. **Synthesize** -- Judge combines best elements into a final answer

Configurable consensus: majority, unanimous, judge-based, or none.

---

## Architecture

```
aragora/
├── debate/         # Core debate engine (210+ modules)
│   ├── orchestrator.py   # Arena -- main debate loop
│   ├── consensus.py      # Consensus detection and proofs
│   ├── convergence.py    # Semantic similarity detection
│   └── phases/           # Propose, critique, revise, vote, judge
├── agents/         # 42 registered agent types (CLI, direct API, OpenRouter, local)
│   ├── api_agents/       # Anthropic, OpenAI, Gemini, Grok, Mistral, OpenRouter
│   ├── cli_agents.py     # Claude Code, Codex, Gemini CLI, Grok CLI
│   └── fallback.py       # OpenRouter fallback on quota errors
├── gauntlet/       # Adversarial stress testing
├── knowledge/      # Knowledge Mound with 33 registered adapters
├── memory/         # 4-tier memory (fast/medium/slow/glacial)
├── server/         # 2,000+ API operations, 190+ WebSocket event types
├── pipeline/       # Decision-to-PR generation
├── genesis/        # Fractal debates, agent evolution
├── sandbox/        # Docker-based safe execution
├── rbac/           # Role-based access control (7 roles, 360+ permissions)
├── compliance/     # SOC 2, GDPR, HIPAA frameworks
└── workflow/       # DAG-based automation engine
```

**Scale:** 3,200+ Python modules | 140,000+ tests

### Performance and Costs

| Metric | Typical Value |
|---|---|
| Debate latency (3 agents, 2 rounds) | 30-90 seconds |
| Token usage per debate | 8,000-25,000 tokens |
| Estimated cost per debate | $0.05-$0.30 (depends on models) |
| Concurrent debates supported | 50+ (configurable) |
| API response time (cached) | < 200ms |
| Memory tier lookup (fast tier) | < 10ms |

Costs vary by model mix. Claude Haiku + GPT-4o-mini debates cost ~$0.05; Claude Opus + GPT-4 debates cost ~$0.30. Use `aragora decide --dry-run` to preview costs before execution.

### How Aragora Compares

| Capability | Aragora | LangGraph | CrewAI | AutoGen |
|---|---|---|---|---|
| Adversarial debate protocol | Built-in (propose/critique/revise) | Manual | No | No |
| Decision receipts with audit trail | Cryptographic, SHA-256 | No | No | No |
| Agent calibration (ELO + Brier) | Built-in | No | No | No |
| Multi-model consensus | 42 agent types, 10+ providers | Single-provider | Single-provider | Multi-provider |
| Gauntlet stress testing | Built-in CLI | No | No | No |
| Enterprise security (SSO, RBAC, encryption) | Production-ready | No | No | No |
| Self-improvement (Nomic Loop) | Autonomous with safety gates | No | No | No |
| Knowledge persistence (33 adapters) | 4-tier memory + Knowledge Mound | Custom | Custom | Custom |
| Channel delivery (Slack, Teams, etc.) | 8 channels built-in | No | No | No |

---

## Programmatic Usage

```python
from aragora import Arena, Environment, DebateProtocol
from aragora.agents import create_agent

agents = [
    create_agent("anthropic-api", name="claude", role="proposer"),
    create_agent("openai-api", name="gpt", role="critic"),
    create_agent("gemini", name="gemini", role="synthesizer"),
]

env = Environment(task="Design a distributed cache with LRU eviction")
protocol = DebateProtocol(rounds=3, consensus="majority")
arena = Arena(env, agents, protocol)
result = await arena.run()

print(result.final_answer)
print(f"Consensus: {result.consensus_reached} ({result.confidence:.0%})")
```

### Python SDK

```python
from aragora.client import AragoraClient

client = AragoraClient(base_url="http://localhost:8080")
debate = client.debates.run(task="Should we adopt microservices?")
receipt = await client.gauntlet.run_and_wait(input_content="spec.md")
```

See [docs/SDK_GUIDE.md](docs/SDK_GUIDE.md) for the full API.

---

## Channels and Integrations

Aragora delivers debate results to wherever your team works:

| Channel | Status |
|---|---|
| Slack | Bot + OAuth |
| Microsoft Teams | Bot + OAuth |
| Discord | Interactions API |
| Telegram | Bot API |
| WhatsApp | Business API |
| Email | SMTP + Gmail + Outlook |
| Voice | TTS integration |
| Webhooks | Custom delivery |

Results automatically route to the originating channel via bidirectional chat routing.

See [docs/integrations/INTEGRATIONS.md](docs/integrations/INTEGRATIONS.md) for setup.

---

## Enterprise Features

| Category | Capabilities |
|---|---|
| **Authentication** | OIDC/SAML SSO, MFA (TOTP/HOTP), API key management, SCIM 2.0 |
| **Multi-Tenancy** | Tenant isolation, resource quotas, usage metering |
| **Security** | AES-256-GCM encryption, rate limiting, SSRF protection, key rotation |
| **Compliance** | SOC 2 controls, GDPR support, HIPAA, audit trails |
| **Observability** | Prometheus metrics, Grafana dashboards, OpenTelemetry tracing |
| **RBAC** | 7 roles, 360+ permissions, decorator-based enforcement |
| **Backup** | Incremental backups, retention policies, disaster recovery |
| **Control Plane** | Agent registry, task scheduler, health monitoring, policy governance |

See [docs/enterprise/ENTERPRISE_FEATURES.md](docs/enterprise/ENTERPRISE_FEATURES.md) for details.

---

## Self-Improvement (Nomic Loop)

Aragora includes an autonomous self-improvement system where agents debate and implement improvements to the codebase itself. **Experimental** -- always run in a sandbox with human review.

```bash
python scripts/run_nomic_with_stream.py run --cycles 3
python scripts/self_develop.py --goal "Improve test coverage" --require-approval
```

Safety: automatic backups, protected file checksums, rollback on failure, human approval gates.

---

## Deployment

| Goal | Command | Requirements |
|------|---------|-------------|
| **Try it** | `docker compose -f docker-compose.quickstart.yml up` | Docker only |
| **Self-hosted** | `cd deploy/self-hosted && docker compose up -d` | Docker + API key |
| **Local dev** | `aragora serve --api-port 8080 --ws-port 8765` | Python + API key |

See [deploy/README.md](deploy/README.md) for the full deployment guide.

**API:** REST at `/api/v2/*` | WebSocket at `/ws` | OpenAPI at `/api/openapi`

---

## Security

- Ed25519 signature verification for webhooks (Discord, Slack)
- Rate limiting (IP, token, and endpoint-based)
- Input validation and content-length enforcement
- CORS allowlists, security headers, error message sanitization
- Path traversal protection, upload validation with magic byte checking
- WebSocket message limits (64KB), debate timeouts, backpressure control

See [docs/enterprise/SECURITY.md](docs/enterprise/SECURITY.md) and [docs/enterprise/COMPLIANCE.md](docs/enterprise/COMPLIANCE.md).

---

## Documentation

| Need | Where |
|---|---|
| Developer quickstart | [QUICKSTART_DEVELOPER.md](docs/QUICKSTART_DEVELOPER.md) |
| First-time setup | [GETTING_STARTED.md](docs/guides/GETTING_STARTED.md) |
| API reference | [API_REFERENCE.md](docs/api/API_REFERENCE.md) |
| SDK guide | [SDK_GUIDE.md](docs/SDK_GUIDE.md) |
| Enterprise features | [ENTERPRISE_FEATURES.md](docs/enterprise/ENTERPRISE_FEATURES.md) |
| Gauntlet guide | [GAUNTLET.md](docs/debate/GAUNTLET.md) |
| Agent catalog | [AGENTS.md](docs/debate/AGENTS.md) |
| Feature discovery | [FEATURE_DISCOVERY.md](docs/FEATURE_DISCOVERY.md) |
| Extended README | [EXTENDED_README.md](docs/EXTENDED_README.md) |
| Full index | [INDEX.md](docs/reference/INDEX.md) |

---

## Inspiration and Citations

Aragora synthesizes ideas from these open-source projects:

- **[Stanford Generative Agents](https://github.com/joonspk-research/generative_agents)** -- Memory + reflection architecture
- **[ChatArena](https://github.com/chatarena/chatarena)** -- Multi-agent interaction environments
- **[LLM Multi-Agent Debate](https://github.com/composable-models/llm_multiagent_debate)** -- ICML 2024 consensus mechanisms
- **[ai-counsel](https://github.com/AI-Counsel/ai-counsel)** -- Semantic convergence detection (MIT)
- **[DebateLLM](https://github.com/Tsinghua-MARS-Lab/DebateLLM)** -- Agreement intensity modulation (Apache 2.0)
- **[claude-flow](https://github.com/ruvnet/claude-flow)** -- Adaptive topology switching (MIT)
- **[LLM Reasoning Failures](https://arxiv.org/abs/2602.06176)** -- Stanford taxonomy of systematic reasoning breakdowns (Song et al. 2026)

See the full attribution table in [docs/reference/CREDITS.md](docs/reference/CREDITS.md).

---

## Contributing

Contributions welcome. Areas of interest:

- Additional agent backends
- Debate visualization
- Benchmark datasets for agent evaluation
- Lean 4 theorem proving integration

## License

MIT

---

*The name "aragora" evokes the Greek agora -- the public assembly where citizens debated and reached collective decisions through reasoned discourse.*
