Metadata-Version: 2.4
Name: efterlev
Version: 0.1.22
Summary: Repo-native, agent-first compliance scanner for FedRAMP and DoD Impact Levels
Project-URL: Homepage, https://efterlev.com
Project-URL: Documentation, https://efterlev.com
Project-URL: Repository, https://github.com/efterlev/efterlev
Project-URL: Issues, https://github.com/efterlev/efterlev/issues
Author: Efterlev contributors
License-Expression: Apache-2.0
License-File: LICENSE
Keywords: cmmc,compliance,dod,fedramp,iac,oscal,security,terraform
Classifier: Development Status :: 2 - Pre-Alpha
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Information Technology
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Security
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.12
Requires-Dist: anthropic<1,>=0.40
Requires-Dist: compliance-trestle<5,>=4.0.2
Requires-Dist: jinja2<4,>=3.1
Requires-Dist: jsonschema<5,>=4.20
Requires-Dist: mcp<2,>=1.0
Requires-Dist: pathspec<2,>=0.12
Requires-Dist: platformdirs<5,>=4.0
Requires-Dist: pydantic<3,>=2.6
Requires-Dist: python-hcl2<6,>=4.3
Requires-Dist: python-multipart<1,>=0.0.27
Requires-Dist: pyyaml<7,>=6.0.3
Requires-Dist: typer<1,>=0.12
Provides-Extra: bedrock
Requires-Dist: boto3<2,>=1.35; extra == 'bedrock'
Provides-Extra: dev
Requires-Dist: boto3<2,>=1.35; extra == 'dev'
Requires-Dist: mypy<2,>=1.11; extra == 'dev'
Requires-Dist: pytest<10,>=9.0.3; extra == 'dev'
Requires-Dist: ruff<1,>=0.15.12; extra == 'dev'
Requires-Dist: types-pyyaml<7,>=6.0; extra == 'dev'
Provides-Extra: docs
Requires-Dist: mkdocs-material<10,>=9.7.6; extra == 'docs'
Requires-Dist: mkdocs<2,>=1.6.1; extra == 'docs'
Requires-Dist: pymdown-extensions<11,>=10.7; extra == 'docs'
Description-Content-Type: text/markdown

# Efterlev

**Compliance scanning for SaaS teams pursuing FedRAMP 20x — that lives in your repo, not a SaaS dashboard.**

Efterlev reads your Terraform, classifies it against the 60 thematic Key Security Indicators, drafts FRMR-compatible attestations grounded in cited evidence, and proposes code-level remediations. Locally. No procurement cycle. No vendor account. Apache 2.0.

```bash
pipx install efterlev
efterlev quickstart                        # one-command demo against a bundled fixture
                                           # (~3 min on Sonnet, ~$0.30; runs init+scan
                                           #  only without ANTHROPIC_API_KEY set)
```

Or against your own repo:

```bash
pipx install efterlev
cd path/to/your-repo                       # repo root, NOT a Terraform subdir
efterlev init --target . --force
export ANTHROPIC_API_KEY=sk-ant-...
efterlev report run                        # init → scan → gap → document → poam
```

Pronounced "EF-ter-lev." From Swedish *efterlevnad* (compliance).

### Or have an AI assistant do it for you

Open [Claude Code](https://claude.com/claude-code), Cursor, Codex, Kiro, or any AI assistant with shell access, and paste the canonical prompt at [`docs/ai-quickstart-prompt.md`](./docs/ai-quickstart-prompt.md). The assistant will confirm the repo root, ask which backend you want, install Efterlev, run the full pipeline, and brief you on the top 3 KSIs to focus on. Cost: ~$0.30–1 on Sonnet 4.6 (the recommended default), ~5–10 minutes wall.

---

## Why this exists

A 100-person SaaS company just got told by its biggest prospect: *"we'll buy, but only if you're FedRAMP Moderate."*

The team googles it. Consulting engagements start at $250K. SaaS compliance platforms cover SOC 2 beautifully and treat FedRAMP as a footnote. Enterprise GRC tooling is priced for the wrong scale. A NIST document family runs to thousands of pages.

What they actually need is something that reads their Terraform and tells them, in their own language, what's wrong and how to fix it. Something a single engineer can install on a Tuesday and show results at Wednesday's standup. Output concrete enough that their 3PAO can use it; honest enough that the 3PAO won't throw it out.

Efterlev is that tool.

It targets **FedRAMP 20x** — the new authorization track that replaces narrative-heavy System Security Plans with measurable outcomes called **Key Security Indicators**. KSIs are concrete things ("encrypt network traffic," "enforce phishing-resistant MFA") that can be assessed against actual evidence rather than long descriptions of intent. Most new SaaS authorizations starting in 2026 will target this track. Efterlev's primary internal abstraction is the KSI; **FRMR** (the machine-readable format FedRAMP 20x is standardizing on) is the primary output.

---

## What it does

- **Scans** your Terraform — both raw `.tf` files and `terraform show -json` plan output — for evidence of 60 thematic KSIs, backed by underlying NIST 800-53 Rev 5 controls
- **Classifies** each KSI as implemented, partial, not_implemented, not_applicable, or `evidence_layer_inapplicable` (the honest answer for procedural KSIs no scanner can see)
- **Drafts** FRMR-compatible attestation JSON grounded in cited evidence — every assertion cites its source file (and HCL line numbers when scanning `.tf` directly; plan-JSON mode resolves modules at the cost of file-level-only citations)
- **Proposes** code-level remediation diffs you can review, edit, or apply
- **Generates** a reviewer-ready POA&M markdown for every open KSI, with out-of-boundary scope filtering
- **Traces** every claim back to the file (and HCL line range, in `.tf` mode) that produced it via `efterlev provenance show <id>` — accepts truncated SHA prefixes
- **Watches**: `efterlev report run --watch` re-runs the full pipeline on every save (debounced 2s)
- **Captures token telemetry** so you can audit per-run LLM cost without consulting CloudWatch

Everything runs locally. The only outbound network call is to your configured LLM endpoint — direct Anthropic API by default, or **AWS Bedrock** (`[bedrock]` extra) for FedRAMP-authorized GovCloud deployments. Scanner output is fully deterministic and offline.

## What it doesn't do

- It does not produce an Authorization to Operate. Humans and 3PAOs do that.
- It does not certify compliance. It produces drafts that accelerate the human review cycle.
- It does not guarantee LLM-generated narratives are correct. Every claim carries `requires_review: Literal[True]` at the type level — not a flag, not a string.
- It does not cover SOC 2, ISO 27001, HIPAA, or GDPR. Other tools serve those well.
- It does not scan live cloud infrastructure (yet — v1.5+).
- It does not replace AWS Config / Security Hub for runtime evaluation. Efterlev is the pre-deploy IaC layer; AWS-native is the runtime evidence layer. See [docs/aws-coexistence.md](./docs/aws-coexistence.md).

For the honest full accounting, see [LIMITATIONS.md](./LIMITATIONS.md).

---

## How to run it

```bash
efterlev init --target . --force               # creates .efterlev/ workspace
efterlev boundary set \                        # declare authorization scope
  --include 'infra/terraform/**' \
  --include '.github/workflows/**'
efterlev doctor                                # pre-flight check (Python, FRMR cache,
                                               #  API key shape, Bedrock creds, LLM ping)
efterlev scan                                  # raw .tf files
# OR for module-composed codebases (the dominant pattern):
terraform init && terraform plan -out plan.bin && terraform show -json plan.bin > plan.json
efterlev scan --plan plan.json                 # ~60% more evidence on real codebases

efterlev agent gap                             # KSI-by-KSI classification (Opus 4.7)
efterlev agent document                        # FRMR JSON + HTML attestations (Sonnet 4.6)
efterlev agent remediate --ksi KSI-SVC-SNT     # Terraform diff that closes the gap (Opus 4.7)
efterlev poam                                  # POA&M markdown for every open KSI
efterlev provenance show <prefix>              # walk any claim back to source (8-char prefix OK)
efterlev provenance verify                     # tamper-evidence sweep
```

Or just:

```bash
efterlev report run                            # full pipeline: init → scan → gap → document → poam
efterlev report run --watch                    # re-run on every file change (2s debounce)
```

Wire it into CI: drop-in GitHub Action at `.github/workflows/pr-compliance-scan.yml` posts a sticky markdown PR comment with findings + detector coverage. See [docs/ci-integration.md](./docs/ci-integration.md).

---

## How it's built

Three layers, each with a clear job:

- **Detectors** — small, deterministic Python folders. One detector = one folder = one compliance pattern. No AI. The detector library is the community-contributable surface.
- **Primitives** — typed functions wrapping the things agents need ("scan this directory," "validate this output," "load that catalog"). MCP-exposed.
- **Agents** — focused reasoning loops backed by Claude. Each has its system prompt in a plain `.md` file you can read and audit. AI is used for the parts where reasoning matters; never for the parts where determinism does.

This split — **deterministic for evidence, AI for reasoning, different model weights for different cognitive loads** — is the most important design decision in the project. It's what lets us tell auditors and 3PAOs the truth: scanner findings are verifiable facts about your code; AI claims are drafts you can audit but should not blindly trust.

**Hallucination defenses are structural, not advisory.** Every AI-generated claim links to evidence records via content-addressed IDs. Prompts wrap evidence in `<evidence_NONCE>` XML fences with a per-run nonce; a post-generation validator rejects any output citing IDs the model didn't actually see. The provenance store rejects any claim whose `derived_from` cites IDs that don't resolve. The DRAFT marker is `Literal[True]` at the type level — there's no flag to clear it.

**Secrets never leave the machine unredacted.** Every LLM prompt is unconditionally scrubbed for 7 secret families (AWS keys, GCP keys, GitHub tokens, Slack tokens, Stripe keys, PEM private keys, JWTs). The scrubber has no opt-out path. Each redaction writes an audit line to `.efterlev/redactions/<scan_id>.jsonl` (mode `0o600`); review with `efterlev redaction review`.

**LLM calls degrade predictably.** Transient errors retry with exponential backoff + full jitter (3 attempts). On primary-model exhaustion, falls back once from Opus to Sonnet before surfacing a failure. Non-retryable errors (auth, invalid request) fail immediately. Each call's token usage is captured on the resulting Claim record and written to `.efterlev/receipts.log` for offline cost auditing.

For deeper architectural detail, see [docs/architecture.md](./docs/architecture.md). For the design history including reversals and tradeoffs, see [DECISIONS.md](./DECISIONS.md).

---

## Coverage at v0.1.22

- **45 detectors** — 38 KSI-mapped + 7 supplementary 800-53-only (where FRMR 0.9.43-beta doesn't yet map the underlying control)
- **31 of 60 thematic KSIs** covered, across **8 of 11 themes** (CNA, CMT, IAM, MLA, PIY, RPL, SCR, SVC). The remaining three themes (AFR, CED, INR) are entirely procedural — covered by customer-authored Evidence Manifests rather than detector evidence.
- **Detector sources:** 41 Terraform + 4 GitHub workflows
- **Three agents:** Gap (Opus 4.7), Documentation (Sonnet 4.6), Remediation (Opus 4.7)
- **Two LLM backends:** Anthropic API (default) + AWS Bedrock (`[bedrock]` extra, GovCloud-deployable)
- **1131 tests passing;** mypy strict + ruff check + ruff format clean across 178 source files; full E2E pipeline smoke (real Anthropic API call against a synthetic fixture) runs as a required check on every PR

**Coverage relative to FedRAMP 20x Phase 2's 70% automated-validation threshold:** the threshold applies to the customer's whole authorization package, not to any single tool. Efterlev covers 31 KSIs at the IaC layer pre-deploy; AWS-native services (Config, Security Hub, CloudTrail, Inspector, GuardDuty) cover roughly 14 KSIs at the runtime layer. Honest union: ~33 of 63 KSIs (~52%) — distinct layers, not double-counted. Reaching 70% takes both. See [docs/aws-coexistence.md](./docs/aws-coexistence.md) for the strategic mapping and [docs/csx-mapping.md](./docs/csx-mapping.md) for how the outputs map to CSX-SUM / MAS / ORD.

---

## Where Efterlev fits

Sits **alongside AWS Config / Security Hub / CloudTrail**, not in place of them:

| | Efterlev | AWS-native |
|---|---|---|
| **When** | Pre-deploy, on every commit or save | Post-deploy, on a 3-day cadence |
| **Reads** | Terraform `.tf` + `.github/workflows/*.yml` + `.efterlev/manifests/*.yml` | Live AWS API state, runtime events |
| **Output** | Per-KSI attestation JSON + POA&M markdown + remediation diffs | Config evaluations, Security Hub findings, CloudTrail logs |
| **Cost** | Free (Apache 2.0); ~$0.30–2 per run on the LLM endpoint you configure | AWS spend |

A FedRAMP 20x customer pursuing the 70% automated threshold typically wires both, plus procedural Evidence Manifests under `.efterlev/manifests/*.yml` for the AFR / CED / INR themes detectors can't see.

---

## Run it from another AI session (MCP)

```bash
efterlev mcp serve
```

Exposes every CLI verb as an MCP tool over stdio. Point Claude Code (or any MCP client) at it and drive scans, agent calls, and provenance walks from another AI session. Our own agents use the same MCP interface — that's how we know it works. If you want to build a compliance workflow Efterlev doesn't ship, write your own agent against the MCP surface; you don't need to fork the codebase.

---

## Documentation

Full docs site: **[efterlev.com](https://efterlev.com)** — quickstart, concepts, tutorials (CI integration, GovCloud deployment, writing detectors, customizing agent prompts), CLI reference, and comparisons against Paramify, Comp AI, Vanta/Drata, and traditional consulting.

In this repo:

- [`docs/ai-quickstart-prompt.md`](./docs/ai-quickstart-prompt.md) — canonical prompt for AI-assistant-driven runs
- [`docs/quickstart.md`](./docs/quickstart.md) — five-minute path from zero to your first scan
- [`docs/architecture.md`](./docs/architecture.md) — three-layer architecture in depth
- [`docs/aws-coexistence.md`](./docs/aws-coexistence.md) — how Efterlev fits next to AWS-native services
- [`docs/csx-mapping.md`](./docs/csx-mapping.md) — outputs mapped to CSX-SUM / MAS / ORD
- [`docs/ci-integration.md`](./docs/ci-integration.md) — drop-in GitHub Action for PR compliance scans
- [`docs/deploy-govcloud-ec2.md`](./docs/deploy-govcloud-ec2.md) — running inside an AWS GovCloud boundary
- [`docs/icp.md`](./docs/icp.md) — Ideal Customer Profile; the lens behind every product decision
- [`docs/followups.md`](./docs/followups.md) — tracked work, deferred items, what's shipped vs pending
- [`docs/v0.2-eval-harness-plan.md`](./docs/v0.2-eval-harness-plan.md) — design sketch for the v0.2 agent-quality harness
- [`CHANGELOG.md`](./CHANGELOG.md) — release-by-release record (24 patch versions, v0.1.22 current)
- [`DECISIONS.md`](./DECISIONS.md) — append-only design-decision log

---

## Contributing

We want contributors. The detector library is designed to make the common contribution — "here's a new KSI indicator I can evidence from Terraform" — a self-contained folder that doesn't touch the rest of the codebase.

[`CONTRIBUTING.md`](./CONTRIBUTING.md) has the five-minute path from `git clone` to running tests, the hour path from idea to open PR, and the per-fix regression-test discipline every patch ships under. Community conduct: [Contributor Covenant 2.1](./CODE_OF_CONDUCT.md). Good first issues are labeled `good first issue` on GitHub. The most valuable contributions right now are new detectors covering KSIs on the roadmap.

---

## Status, governance, license

**Status:** v0.1.22 is current. Twenty-four patch releases since v0.1.0 (2026-04-29). v0.1.5–v0.1.10 closed real-CI bugs surfaced by deep-dive shakedowns; v0.1.11 closed five rendering / display / persistence findings from the first external 3PAO blinded review; v0.1.12–v0.1.15 closed eight findings from rolling deterministic post-release triages; v0.1.16 codified the triage methodology as automated infrastructure (`scripts/triage.sh <tag>` posts a verifiable artifact-quality report as the GitHub Release notes body); v0.1.17 closed two cosign-tooling findings from v0.1.16's first auto-triage; v0.1.18–v0.1.21 shipped the Tier 1 UX sequence (quickstart → cost summary → --dry-run → Evidence Manifest starter pack); **v0.1.22 closes a silent-failure gap in `release-smoke`** that had left the cross-platform install matrix red across v0.1.20 + v0.1.21 (wrong SQLite table name + an assertion that required a `reports/` dir the workflow never created), and adds T7 to `scripts/triage.sh` so matrix conclusion lands on the GitHub Release page on every future release. Full E2E pipeline smoke runs on every PR; per-fix regression-test discipline is enforced for bug-fix patches. See [`CHANGELOG.md`](./CHANGELOG.md) for per-release notes. Verify a published artifact with `bash scripts/verify-release.sh v0.1.22` (PEP 740 PyPI attestations + cosign keyless OIDC + SLSA provenance on `ghcr.io/efterlev/efterlev`).

**Governance:** Benevolent-dictator model today (`@lhassa8`), transitioning to a technical steering committee at 10 sustained-activity contributors. Full model in [GOVERNANCE.md](./GOVERNANCE.md). Architectural decisions: [DECISIONS.md](./DECISIONS.md). The project may eventually be donated to a neutral foundation (OpenSSF / Linux Foundation / CNCF) if contributor diversity warrants — that decision is not made and not time-boxed.

**License:** Apache 2.0. See [LICENSE](./LICENSE).

**Security:** Coordinated disclosure process in [SECURITY.md](./SECURITY.md). Threat model for Efterlev itself: [THREAT_MODEL.md](./THREAT_MODEL.md). The pre-launch security review (signed by the maintainer) is at [docs/security-review-2026-04.md](./docs/security-review-2026-04.md).

---

## Credits

Efterlev was bootstrapped in a 4-day hackathon using [Claude Code](https://claude.com/claude-code). The architecture commits to keeping Claude Code (and other MCP-capable agents) as first-class integration partners — that's what "agent-first" means here, structurally, not as marketing.

Built on [compliance-trestle](https://github.com/IBM/compliance-trestle) for OSCAL catalog loading, on the [FedRAMP Machine-Readable (FRMR) catalog](https://github.com/FedRAMP/docs), and on the [NIST SP 800-53 Rev 5 catalog](https://github.com/usnistgov/oscal-content). Those projects make this one possible.
