Metadata-Version: 2.4
Name: gnosis-indus
Version: 0.1.0
Summary: Staged anchor application and evidence repo for the Gnosis Indus Atlas lane.
License-Expression: Apache-2.0
Project-URL: Homepage, https://zer0pa.ai/gnosis/Gnosis-Indus-Valley/
Project-URL: Source, https://github.com/Zer0pa/Indus-Valley
Project-URL: Issue Tracker, https://github.com/Zer0pa/Indus-Valley/issues
Classifier: Development Status :: 2 - Pre-Alpha
Classifier: Intended Audience :: Science/Research
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.11
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Text Processing :: Linguistic
Requires-Python: >=3.11
Description-Content-Type: text/markdown
License-File: LICENSE
Provides-Extra: numerics
Requires-Dist: numpy>=1.24; extra == "numerics"
Provides-Extra: test
Requires-Dist: pytest>=8; extra == "test"
Dynamic: license-file

# Gnosis Indus Atlas

> Anchor application and evidence repo for the Indus lane. Useful now, improving without overclaim.

## What This Is

Indus-Valley is a non-decipherment Gnosis lane: conditional k=70 catalogue, search-without-decode runtime, and rights gates explicit.

Gnosis applied-research repo for the Indus-script lane. It packages
the carried-forward evidence chain around the Indus morphological
catalogue (admitted at k=70 with a stability caveat), the Phase 5
falsification work (linguistic structure confirmed, substrate
identification not), the Paper 1 / Paper 2 verdict stack, and a
search-without-decode application surface implemented in this repo as
the first runtime slice. It is not a decipherment repo, not a generic
search product, and not the portfolio's lead thesis.

**Strongest current evidence surface:** Phase 4 admits a conditional
412-sign / 70-cluster catalogue over 179 inscriptions with NMI 0.5793
against ICIT Sets and sigma 5.65. Track C then demonstrates
search-without-decode over cluster sequences with 5.89x catalogue
compression and 0.0451 ms max query latency. Proof paths:
`authority/review_pack/phase4_governing_verdict.md`,
`authority/review_pack/indus_catalogue_summary.md`, and
`authority/review_pack/search_demo_summary.md`.

**Headline metric:** `pytest -q → 14 passed` (≈0.03 s local, ≈0.3 s
on RunPod from fresh clone). The clean-room search-without-decode
runtime (`src/gnosis_indus/search_surface/`) reproduces 6
authority-doc query records on the demo fixture
(`artifacts/phase4/indus_catalogue_demo_fixture.json`), with
`sequence_search` median latency well under the authority-doc 100 ms
gate (`authority/review_pack/search_demo_summary.md`). Per-phase
verifications: `.gpd/phases/0{1,2,3}-*/VERIFICATION.md` — all PASS,
10/10 confidence.

**Honest blocker:** Image-bearing sign rights remain `BLOCKED_RIGHTS`
per `DATA_POLICY.md`; the full k=70 catalogue (412 signs / 70 clusters
/ 179 inscriptions) stays `FETCH_EXTERNAL`. The bundled fixture is
small and authority-anchored to
`authority/review_pack/search_demo_summary.md`.

## Method Mechanics

| Field | Value |
| --- | --- |
| Architecture | GNOSIS_INDUS_SEARCH_WITHOUT_DECODE |
| Method | Conditional k=70 catalogue plus clean-room cluster-sequence search |
| Runtime | `src/gnosis_indus/search_surface/` |
| Data Boundary | sign images `BLOCKED_RIGHTS`; full k=70 catalogue `FETCH_EXTERNAL` |
| Cultural Boundary | traditional-knowledge acknowledgement retained below |
| Non-Claim | no decipherment and no proven substrate identification |

## Key Metrics

| Metric | Value | Baseline |
| --- | --- | --- |
| Conditional catalogue | 412 signs / 70 clusters / 179 inscriptions | NMI 0.5793 against ICIT Sets; sigma 5.65 |
| Search compression | 5.89x catalogue compression | Track C search-without-decode |
| Query latency | 0.0451 ms max | 100 ms authority-doc gate |
| Pytest surface | 14 passed | clean Python 3.11 replay |

> Source: `authority/review_pack/phase4_governing_verdict.md`, `authority/review_pack/indus_catalogue_summary.md`, `authority/review_pack/search_demo_summary.md`, and `.gpd/phases/03-truth-preserving-packaging/VERIFICATION.md`.

## Repo Identity

| Field | Value |
| --- | --- |
| Identifier | Indus-Valley |
| Repository | https://github.com/Zer0pa/Indus-Valley |
| Portfolio | Gnosis |
| Visibility | PUBLIC |
| Default Branch | main |
| Authority Source | `authority/review_pack/` + `.gpd/phases/` |
| License | Apache-2.0 code; CC-BY-4.0 docs; data per `DATA_POLICY.md` |

## Readiness

| Field | Value |
| --- | --- |
| Evidence posture | staged runtime scaffold; not a portfolio verdict |
| Posture | `rights_gated_data_classes_image_blocked_text_fetch_external` |
| Checks | 14 pytest checks and Phase 03 verification pass |
| Authority | `DATA_POLICY.md`; `authority/review_pack/search_demo_summary.md` |

### Honest Blocker

Image rights and full-catalogue redistribution gates remain open. Sign images stay `BLOCKED_RIGHTS`, and the full k=70 catalogue stays `FETCH_EXTERNAL`.

## What We Prove

- The lane is migrated out of the original monorepo as a standalone,
  truthful scaffold without losing the Phase 4 stability caveat or the
  Phase 5 non-decipherment posture.
- A clean-room search-without-decode runtime
  (`src/gnosis_indus/search_surface/`) anchored to
  `authority/review_pack/search_demo_summary.md` reproduces six
  authority-doc query records on a small authority-anchored demo
  fixture, with `sequence_search` median latency well under the
  authority-doc 100 ms gate.
- A clean-machine replay path exists: any Python 3.11 host can
  reproduce the test surface via `pip install -e ".[test]" && pytest`.
- Off-repo custody (private HF dataset + model repos) is provisioned
  for future heavy-artifact promotion under `DATA_POLICY.md`
  classification, with the register documented in
  `HF_CUSTODY_REGISTER.md`.

## What We Don't Claim

- We do not claim decipherment of the Indus script.
- We do not claim proven substrate identification.
- We do not claim all rights gates are cleared. Image rights for
  sign-bearing releases remain open; this is an open lab, not a
  finished product.
- We do not claim unrestricted public redistribution rights for any
  image-bearing or rights-gated corpus referenced in the original
  monorepo work; sign images stay `BLOCKED_RIGHTS` in
  `DATA_POLICY.md`.
- We do not claim the bundled fixture is the real full catalogue. The
  full k=70 catalogue (412 signs, 70 clusters, 179 inscriptions) stays
  `FETCH_EXTERNAL` per `DATA_POLICY.md`.

## Verification Status

| Code | Check | Verdict |
| --- | --- | --- |
| V_01 | `pytest -q` on Python 3.11: 14 passed | PASS |
| V_02 | `python -m compileall src` | PASS |
| V_03 | Per-phase verification reports all PASS | PASS |
| V_04 | Operational endpoint leak scan: 0 matches | PASS |
| V_05 | Image rights and full catalogue redistribution | BLOCKED |

## Proof Anchors

| Path | State |
| --- | --- |
| `authority/review_pack/phase4_governing_verdict.md` | VERIFIED |
| `authority/review_pack/indus_catalogue_summary.md` | VERIFIED |
| `authority/review_pack/phase5_governing_verdict.md` | VERIFIED |
| `authority/review_pack/search_demo_summary.md` | VERIFIED |
| `artifacts/phase4/indus_catalogue_demo_fixture.json` | VERIFIED |
| `.gpd/phases/03-truth-preserving-packaging/VERIFICATION.md` | VERIFIED |

## Repo Shape

| Field | Value |
| --- | --- |
| Proof Anchors | 6 display anchors |
| Portfolio | Gnosis |
| Runtime | `src/gnosis_indus/search_surface/` |
| Authority | `authority/review_pack/`; `.gpd/phases/` |
| Data Boundary | `DATA_POLICY.md` |
| Support Sections | Licensing; Traditional-Knowledge Acknowledgment; Quick Start; Current Gaps; Upcoming Workstreams |

```
.
├── NOTICE                              # Apache-2.0 code and CC-BY-4.0 docs posture
├── README.md                           # this file
├── CHANGELOG                           # release history
├── CITATION.cff                        # citation metadata
├── CODE_OF_CONDUCT.md                  # community conduct standards
├── PRD_GNOSIS_INDUS_ANCHOR_APPLICATION.md # sovereign brief
├── AGENTS.md                           # autonomous-agent rules
├── AUTHORITY_SNAPSHOT.md               # carried-forward truth
├── AUDITOR_PLAYBOOK.md                 # outsider-audit fast path (pre-Phase-02 vintage; refresh pending)
├── PUBLIC_AUDIT_LIMITS.md              # what audit can and cannot conclude
├── HF_CUSTODY_REGISTER.md             # off-repo storage truth
├── SOURCE_BOUNDARY.md                  # source families included / deferred / excluded
├── DATA_POLICY.md                      # data classes and rights posture
├── ROADMAP.md                          # milestone tracker
├── _internal/                          # scaffolding and internal orchestration docs
│   ├── MIGRATION_PLAN.md               # extraction roadmap and waves (internal)
│   ├── STATUS_REPORT_2026-04-24.md     # orchestrator execution narrative (internal)
│   ├── AUTONOMOUS_EXECUTION_POLICY.md
│   ├── GPD_BOOTSTRAP_GUIDE.md
│   ├── STARTUP_PROMPT.md
│   ├── UNIVERSAL_STARTUP_PROMPT.md
│   ├── WORKSTREAM_GPD_INIT_CHECKLIST.md
│   └── TEMPLATE_USAGE.md
├── docs/
│   ├── ARCHITECTURE.md
│   ├── LEGAL_BOUNDARIES.md
│   ├── FAQ.md
│   ├── SUPPORT.md
│   └── family/INDUS_EXPORT_CONTRACT.md
├── authority/                          # exact-source-only verdict copies
│   ├── review_pack/                    # Phase 4/5 governing verdicts and PRDs
│   └── papers/                         # Paper 1/2 governing verdicts
├── src/gnosis_indus/
│   └── search_surface/                 # Phase 02 first runtime slice (clean-room)
├── artifacts/phase4/
│   └── indus_catalogue_demo_fixture.json  # authority-anchored demo fixture
├── tests/
│   └── test_search_surface.py          # 14 tests reproducing authority queries
├── .gpd/                               # GPD control plane (PROJECT, STATE, ROADMAP, REQUIREMENTS, DECISIONS, CONVENTIONS, phase plans + verifications)
└── .github/
    ├── ISSUE_TEMPLATE/
    ├── PULL_REQUEST_TEMPLATE.md
    └── workflows/ci.yml                   # boring CI: install + pytest
```

## Licensing

This repository is part of the Zer0pa Gnosis Portfolio.

**Code** in this repository is licensed under the Apache License 2.0. See
`LICENSE` for the full text. SPDX identifier: `Apache-2.0`.

**Documentation, reports, and written materials** are licensed under Creative
Commons Attribution 4.0 International. SPDX identifier: `CC-BY-4.0`. Canonical
terms: <https://creativecommons.org/licenses/by/4.0/>.

**Data and fixtures** are handled per dataset and artifact family. See
`DATA_POLICY.md` for this repository's data boundary. The code license does not
license raw corpora, image-bearing cultural-heritage assets, private HF
artifacts, model weights, endpoint logs, or operational transcripts.

**Trademarks** - "Gnosis", "Zer0pa Gnosis", and distinctive sub-marks are
trademarks of Zer0pa. Apache-2.0 and CC-BY-4.0 do not grant trademark rights.
See `TRADEMARKS.md`.

Public visibility is a separate repository-setting action. The license files in
this repo define the intended open-source/open-documentation terms for released
Gnosis code and written materials; they do not publish rights-gated data.

## Traditional-Knowledge Acknowledgment

This repository works with material originating from the Indus Valley
Civilization, including the undeciphered Indus script. Zer0pa claims no
proprietary right over the Indus script, its imagery, or the archaeological
record.

Zer0pa's work is the computational analysis, lattice methods, and falsification
discipline applied to that material. This repository does not claim a definitive
decipherment of any Indus sign or sequence.

Good-faith inquiries from identified communities, governmental antiquity
authorities, or institutional bodies may be sent to architects@zer0pa.ai.

## Quick Start

Reproduce the Phase 02 stronger smoke path on any clean Python 3.11
host:

```bash
git clone https://github.com/Zer0pa/Indus-Valley.git gnosis-indus
cd gnosis-indus
python3.11 -m venv .venv && source .venv/bin/activate
pip install -e ".[test,numerics]"
pytest -q
```

Expected: `14 passed`. The pytest suite reproduces the authority-doc
query records from `authority/review_pack/search_demo_summary.md`
against the bundled `artifacts/phase4/indus_catalogue_demo_fixture.json`.
The fixture is small and authority-anchored; the real full catalogue
stays `FETCH_EXTERNAL` per `DATA_POLICY.md`. The Phase 4 stability
caveat (k=70 conditional) remains visible in the package and fixture
surfaces.

## Current Gaps

- **Image-rights and provenance** for any public image-bearing release
  remain unresolved; sign images stay `BLOCKED_RIGHTS` in
  `DATA_POLICY.md`.
- **License text has landed** as Apache-2.0 for code and CC-BY-4.0 for docs.
  Public visibility remains separate and is blocked on rights, provenance, and
  release wording review.
- **Phase 02 landed only the search-without-decode slice.** The Phase
  4 catalogue and Phase 5 falsification slices are sequenced as later
  extraction waves (`MIGRATION_PLAN.md`, `SOURCE_BOUNDARY.md`).
- **Full k=70 catalogue** (412 signs, 70 clusters, 179 inscriptions)
  is not vendored; the bundled demo fixture reproduces only what the
  authority doc enumerates.
- **`AUDITOR_PLAYBOOK.md` is pre-Phase-02 vintage.** Functionally
  superseded by the README Quick Start and the Phase 03 verification
  reports (`.gpd/phases/03-truth-preserving-packaging/VERIFICATION.md`),
  but the playbook itself awaits a maintenance refresh.

## Upcoming Workstreams

### Active Engineering

- **AUDITOR_PLAYBOOK.md maintenance refresh** — currently self-flagged
  "pre-Phase-02 vintage"; should be updated to reference the Phase 02 / 03
  verification reports and the Quick Start replay path. (`AUDITOR_PLAYBOOK.md`)
- **Phase 4 catalogue extraction wave** — extract first clean runtime slice
  into `src/gnosis_indus/catalogue/` per `_internal/MIGRATION_PLAN.md` wave 2.
- **Phase 5 falsification extraction wave** — extract first clean runtime slice
  into `src/gnosis_indus/falsification/` per `_internal/MIGRATION_PLAN.md` wave 3.

### Operations / External Dependency

- **Image-rights and provenance review** for any image-bearing public release;
  sign images remain `BLOCKED_RIGHTS` in `DATA_POLICY.md` until a rights review
  is completed by the relevant authority.
- **Full k=70 catalogue redistribution rights review** — the 412-sign / 70-cluster
  / 179-inscription catalogue stays `FETCH_EXTERNAL` until cleared.

### Research-Deferred — Investigation Underway

_(none currently; Phase 5 posture — linguistic structure confirmed, substrate
identification not — is stable and not under active re-investigation.)_

### Zero-Base Scientific Thinking — GPD Research and Planning Pending

_(none currently.)_
