Creative Commons Attribution 4.0 International License
======================================================

The CASES CORPUS, GROUND TRUTH, TAGS, EVALUATION RESULTS, REPORTS, and
PROTOCOL LOCKS in this repository are licensed under the Creative
Commons Attribution 4.0 International License (CC-BY-4.0).

Scope (what CC-BY-4.0 covers in this repo):
  cases/v1_3/**           — v1.3 case corpus (raw.log, case.json,
                            ground_truth.json, tags.json,
                            privacy_audit.json)
  cases/v2/**             — v2 case corpus (same per-case shape)
  results/**              — manifest .jsonl, per-case JSON,
                            eval_diagnosis_*.json, baseline outputs
  reports/**              — methodology + findings reports
                            (e10_v2_generalization_partial.md and
                            the e1..e9 historical reports)
  protocols/**            — protocol locks (.lock.json)
  docs/**                 — corpus matrix, intake queue, model cards

Scope (what is NOT covered by CC-BY-4.0):
  All SOURCE CODE in this repository — tools/, examples/, schemas/,
  configs/, prompts/, and supporting scripts — is licensed under
  Apache-2.0 (see LICENSE).

License text
------------

By exercising the Licensed Rights (defined below), You accept and
agree to be bound by the terms and conditions of this Creative
Commons Attribution 4.0 International Public License ("Public
License").

Section 1 — Definitions.

  a. Adapted Material means material subject to Copyright and Similar
     Rights that is derived from or based upon the Licensed Material
     and in which the Licensed Material is translated, altered,
     arranged, transformed, or otherwise modified in a manner
     requiring permission under the Copyright and Similar Rights held
     by the Licensor.
  b. Licensed Material means the artistic or literary work, database,
     or other material to which the Licensor applied this Public
     License.
  c. Licensed Rights means the rights granted to You subject to the
     terms and conditions of this Public License, which are limited
     to all Copyright and Similar Rights that apply to Your use of
     the Licensed Material and that the Licensor has authority to
     license.
  d. Licensor means the individual(s) or entity(ies) granting rights
     under this Public License.
  e. You means the individual or entity exercising the Licensed
     Rights under this Public License.

Section 2 — Scope.

  a. License grant. Subject to the terms and conditions of this
     Public License, the Licensor hereby grants You a worldwide,
     royalty-free, non-sublicensable, non-exclusive, irrevocable
     license to exercise the Licensed Rights in the Licensed Material
     to:
       1. reproduce and Share the Licensed Material, in whole or in
          part; and
       2. produce, reproduce, and Share Adapted Material.

Section 3 — License Conditions.

  Your exercise of the Licensed Rights is expressly made subject to
  the following conditions.

  a. Attribution. If You Share the Licensed Material (including in
     modified form), You must:
       1. retain the following if it is supplied by the Licensor:
          identification of the creator(s) of the Licensed Material
          and any others designated to receive attribution, in any
          reasonable manner requested;
          - Creator: Bowen Qin (National University of Singapore)
          - Project: LogDx
          - Year: 2026
       2. indicate if You modified the Licensed Material and retain
          an indication of any previous modifications; and
       3. indicate the Licensed Material is licensed under this
          Public License, and include the text of, or the URI or
          hyperlink to, this Public License.

     Citing the project via CITATION.cff (or the BibTeX entry below)
     satisfies the attribution requirement.

  BibTeX (also in CITATION.cff):

      @misc{qin2026logdx,
        title  = {{LogDx-CI}: Benchmarking CI Log Reduction Tools
                 for LLM Root-Cause Diagnosis},
        author = {Qin, Bowen},
        year   = {2026},
        howpublished = {\url{https://github.com/eyuansu62/LogDx}},
        note   = {v1.0 release; cases corpus at
                 \url{https://huggingface.co/datasets/eyuansu71/logdx-ci}},
      }

Section 4 — Disclaimer of Warranties and Limitation of Liability.

  a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
     EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
     AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
     ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
     IMPLIED, STATUTORY, OR OTHER.

  b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
     TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
     NEGLIGENCE) FOR ANY DIRECT, SPECIAL, INDIRECT, INCIDENTAL,
     CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES, COSTS,
     EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR USE
     OF THE LICENSED MATERIAL.

Full legal text
---------------

For the complete legal text of the Creative Commons Attribution 4.0
International Public License, see:

    https://creativecommons.org/licenses/by/4.0/legalcode

A human-readable summary is at:

    https://creativecommons.org/licenses/by/4.0/

Privacy note
------------

The raw CI logs (`cases/*/raw.log`) are drawn from publicly visible
GitHub Actions runs. Each log was passed through
`tools/audit_context_privacy.py` (200k-line cap, fail-closed on
truncation or long-line splits) before commit, and runtime in-place
redactions are documented in the per-case `privacy_audit.json` /
`tags.json#repo_visibility`. No `allow_secret_values_in_results`
flag is set anywhere in the diagnoser configs; see §5 caveats in
`reports/e10_v2_generalization_partial.md` for the residual risks
and the audit pipeline.
