Metadata-Version: 2.4
Name: faultline
Version: 0.1.0
Summary: Map features in any codebase from git history. No Jira required.
Project-URL: Homepage, https://github.com/pkuzina/faultline
Project-URL: Repository, https://github.com/pkuzina/faultline
Project-URL: Issues, https://github.com/pkuzina/faultline/issues
Author: FeatureMap Team
License-Expression: MIT
License-File: LICENSE
Keywords: bug-fix-ratio,code-analysis,engineering-metrics,features,git,health-score,technical-debt
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Topic :: Software Development :: Quality Assurance
Classifier: Typing :: Typed
Requires-Python: >=3.11
Requires-Dist: anthropic>=0.25.0
Requires-Dist: gitpython>=3.1.40
Requires-Dist: pydantic>=2.5.0
Requires-Dist: rich>=13.7.0
Requires-Dist: typer>=0.12.0
Provides-Extra: ollama
Requires-Dist: ollama>=0.4.0; extra == 'ollama'
Description-Content-Type: text/markdown

# faultline

> Turn your git history into a feature risk map — no Jira required.

**faultline** analyzes your git commit history to automatically detect features and modules in your codebase, then shows which ones are accumulating the most bug fixes — your technical debt hotspots.

No integrations. No configuration. Just point it at any git repo.

---

## Why faultline?

Engineering managers need to know *where* the technical debt is before they can act on it. Most tools require you to tag tickets in Jira or Linear. faultline reads the truth directly from your git history.

```
✗ payments    — health: 23   38 bug fixes / 112 commits (33.9%)
!  auth        — health: 54   12 bug fixes / 48 commits  (25.0%)
✓  dashboard   — health: 91    2 bug fixes / 67 commits   (3.0%)
```

---

## Installation

### From source

```bash
git clone https://github.com/pkuzina/faultline
cd faultline
pip install -e .
```

### With Ollama support

```bash
pip install -e '.[ollama]'
```

---

## Quick Start

```bash
# Analyze the current directory
faultline analyze .

# Analyze a specific repo
faultline analyze ./path/to/repo

# Focus on a source subdirectory (recommended for frontend/monorepo projects)
faultline analyze . --src src/

# Last 90 days, show top 5 risk zones
faultline analyze . --days 90 --top 5

# Just print, don't save
faultline analyze . --no-save

# AI-powered semantic feature detection
faultline analyze . --llm --src src/

# AI feature detection with flow breakdown
faultline analyze . --llm --flows --src src/
```

---

## AI-Powered Feature Detection

By default faultline groups files by directory structure (fast, no API needed). With `--llm` enabled, Claude or a local Ollama model reads the full file tree and returns a **semantic feature map** — grouping files by business domain, not folder names.

```
Without --llm:  "components", "views", "hooks"        ← technical layers
With --llm:     "user-auth", "payments", "dashboard"  ← business features
```

### Anthropic Claude (cloud)

```bash
# Pass your API key directly
faultline analyze . --llm --api-key sk-ant-...

# Or use an environment variable
export ANTHROPIC_API_KEY=sk-ant-...
faultline analyze . --llm

# Recommended for large projects — focus on source files only
faultline analyze . --llm --src src/
```

Get your API key at [console.anthropic.com](https://console.anthropic.com) → API Keys.
Uses **Claude Haiku** for cost-efficient analysis (~$0.001 per repo).

### Ollama (local, free, private)

Run analysis entirely on your machine — no API key, no data leaves your computer.
Recommended for private repositories.

```bash
# 1. Install Ollama
brew install ollama        # macOS
# or: curl -fsSL https://ollama.com/install.sh | sh

# 2. Pull the recommended model (one time, ~4.7 GB)
ollama pull llama3.1:8b

# 3. Start the server
ollama serve

# 4. Install the ollama package
pip install -e '.[ollama]'

# 5. Run
faultline analyze . --llm --provider ollama --src src/
```

### Recommended models

| Model | Size | Semantic quality | Notes |
|-------|------|-----------------|-------|
| `llama3.1:8b` | 4.7 GB | ★★★★★ | **Default — best balance of quality and size** |
| `mistral-nemo:12b` | 7.1 GB | ★★★★★ | Best overall quality, higher RAM requirement |
| `qwen2.5:7b` | 4.7 GB | ★★★★ | Good alternative |
| `llama3.2:3b` | 2.0 GB | ★★★ | Use when RAM is limited |

```bash
# Best quality (requires ~8 GB RAM)
ollama pull mistral-nemo:12b
faultline analyze . --llm --provider ollama --model mistral-nemo:12b

# Lightweight option
ollama pull llama3.2:3b
faultline analyze . --llm --provider ollama --model llama3.2:3b
```

The API key is validated **before** the analysis starts — no waiting to discover a bad key.
Falls back silently to heuristic mode if the LLM is unavailable.

---

## Flow Detection

With `--flows`, faultline goes one level deeper — breaking each feature into **user-facing flows**: named end-to-end sequences of actions (e.g. `checkout-flow`, `refund-flow`).

Requires `--llm`. Works with both Anthropic and Ollama.

```bash
faultline analyze . --llm --flows --src src/
faultline analyze . --llm --provider ollama --flows --src src/
```

Output shows two tables: features overview, then features with nested flows:

```
                         Features by Risk
╭──────────────┬────────┬─────────┬───────────┬────────╮
│ Feature      │ Health │ Commits │ Bug Fixes │  Bug % │
├──────────────┼────────┼─────────┼───────────┼────────┤
│ payments     │   23   │     112 │        38 │ 33.9%  │
│ auth         │   54   │      48 │        12 │ 25.0%  │
╰──────────────┴────────┴─────────┴───────────┴────────╯

                    Feature & Flow Map by Risk
╭──────┬────────────────────────┬────────┬─────────┬───────────╮
│      │ Feature / Flow         │ Health │ Commits │ Bug Fixes │
├──────┼────────────────────────┼────────┼─────────┼───────────┤
│  ✗   │ payments               │   23   │     112 │        38 │
│      │   ├─ checkout-flow     │   18   │      67 │        28 │
│      │   └─ refund-flow       │   41   │      45 │        10 │
│  !   │ auth                   │   54   │      48 │        12 │
│      │   └─ login-flow        │   54   │      48 │        12 │
╰──────┴────────────────────────┴────────┴─────────┴───────────╯
```

---

## Focusing on a Subdirectory

Use `--src` to restrict analysis to a specific folder. Everything outside that path is ignored.

```bash
# Sources in src/
faultline analyze . --src src/

# Sources in app/
faultline analyze . --src app/

# Monorepo — analyze one package
faultline analyze . --src packages/api/src/
```

Automatically excluded regardless of location:
- Dependencies: `node_modules/`, `vendor/`, `.venv/`
- Build output: `dist/`, `build/`, `.next/`, `coverage/`
- Tooling: `.github/`, `.husky/`, `.storybook/`, `.circleci/`
- Binary and generated files: `.map`, `.lock`, `.woff`, `.ttf`

---

## Output

### Terminal

```
╭──────────────────────────────────────────────╮
│             FeatureMap Analysis               │
│                                               │
│ Repository:           /path/to/repo           │
│ Analyzed:             last 365 days           │
│ Total commits:        847                     │
│ Features found:       12                      │
│ Bug fix commits:      143                     │
│ Average health score: 61.3/100                │
╰──────────────────────────────────────────────╯
```

### JSON

Results are saved to `.faultline/feature-map.json` by default:

```json
{
  "repo_path": "/path/to/repo",
  "analyzed_at": "2026-02-22T10:00:00Z",
  "total_commits": 847,
  "date_range_days": 365,
  "features": [
    {
      "name": "payments",
      "description": "Handles Stripe payment processing and subscription billing.",
      "health_score": 23.0,
      "bug_fix_ratio": 0.339,
      "bug_fixes": 38,
      "total_commits": 112,
      "authors": ["alice", "bob"],
      "paths": ["src/payments/stripe.py", "src/payments/webhooks.py"],
      "flows": [
        {
          "name": "checkout-flow",
          "health_score": 18.0,
          "bug_fixes": 28,
          "total_commits": 67
        }
      ]
    }
  ]
}
```

---

## CLI Reference

### `faultline analyze [REPO_PATH]`

| Flag | Default | Description |
|------|---------|-------------|
| `--src` | — | Subdirectory to focus on, e.g. `src/` or `app/` |
| `--days` | `365` | Days of git history to analyze |
| `--max-commits` | `5000` | Maximum commits to read |
| `--top` | `3` | Number of top risk zones to highlight |
| `--output` | `.faultline/feature-map.json` | Output file path |
| `--no-save` | — | Skip saving JSON output |
| `--llm` | `false` | Use AI for semantic feature detection |
| `--flows` | `false` | Detect user-facing flows within features (requires `--llm`) |
| `--provider` | `anthropic` | LLM provider: `anthropic` or `ollama` |
| `--model` | — | Model override (default: `claude-haiku-4-5` / `qwen2.5-coder:7b`) |
| `--api-key` | env | Anthropic API key (`ANTHROPIC_API_KEY` env var) |
| `--ollama-url` | `http://localhost:11434` | Custom Ollama server URL |

---

## How It Works

1. **Reads git history** — up to `--max-commits` commits within the requested date range
2. **Collects tracked files** — respects `--src` filter, skips build output and tooling
3. **Maps files to features** — two modes:
   - **Heuristic** (default): groups by directory structure (`src/payments/` → `payments`)
   - **LLM** (`--llm`): sends the file tree to Claude or Ollama, gets back a semantic `{feature: [files]}` mapping by business domain
4. **Scans commit history** — for each feature, counts total commits and bug fix commits
5. **Calculates health scores** — `100 - (bug_fix_ratio × 200)`, weighted by commit activity
6. **Detects flows** (`--flows`): analyzes files within each feature and maps them to named user-facing journeys

---

## Health Score

| Score | Status | Meaning |
|-------|--------|---------|
| 75–100 | ✓ Green | Healthy — low bug fix ratio |
| 50–74 | ! Yellow | Watch — moderate technical debt |
| 0–49 | ✗ Red | Critical — high bug fix ratio |

---

## License

[Apache 2.0](LICENSE)
