CIE-Unified

git clone 

CIE-Unified / tasks
codex@macbookpro  ·  2026-03-31

2026-03-31_task04_branch_a_validation_reporting.md

  1# Task 04: Branch A Validation and Reporting
  2
  3## Title
  4
  5Task 04: unified validation + reporting against locked spec
  6
  7## Direct Prompt
  8
  9Continue Branch A from commit `164c06af63812c69cab32d8f8a6c770b96f38ef6`, keep the implementation independent from other branches, add a small unified validation/reporting entrypoint against the locked docs, generate machine-readable and human-readable reports, and record execution results in-repo.
 10
 11## Suggested Branch Name
 12
 13`branch-a/task04-validation-reporting`
 14
 15## Goal
 16
 17Implement the smallest explicit Branch A validation/reporting layer that checks the locked runtime surface, summarizes smoke/dynamics/sedimentation readiness in one schema, and documents Branch A as ready for review/comparison.
 18
 19## Background
 20
 21This round continues to follow the locked conceptual and engineering constraints in:
 22
 23- `/Users/george/code/CIE-Unified/README.md`
 24- `/Users/george/code/CIE-Unified/LOCKED_IMPLEMENTATION_SPEC.md`
 25
 26Those documents remain locked and define the Branch A requirements for graph-native runtime state, the unified interface, output-to-input feedback, homing, decay, degraded output, sedimentation observability, and comparable reporting.
 27
 28## Involved Repo
 29
 30- `/Users/george/code/CIE-Unified`
 31
 32## Scope
 33
 34- update the Branch A plan for Task 04 completion
 35- create this Task 04 prompt document in-repo
 36- add a stdlib-only `cie.validation` entrypoint for compact controlled scenarios
 37- validate smoke, dynamics, sedimentation, and locked snapshot requirements in a consistent schema
 38- generate a machine-readable JSON report and a human-readable Markdown report
 39- add tests for the validation entrypoint and report schema
 40- append the Task 04 continuation note to the Task 03 record
 41- leave Branch A documented as ready for review/comparison rather than adding new runtime subsystems
 42
 43## Allowed Modifications
 44
 45- `/Users/george/code/CIE-Unified/plans/2026-03-31_branch_a_plan.md`
 46- `/Users/george/code/CIE-Unified/tasks/2026-03-31_task03_branch_a_sedimentation.md`
 47- `/Users/george/code/CIE-Unified/tasks/2026-03-31_task04_branch_a_validation_reporting.md`
 48- `/Users/george/code/CIE-Unified/reports/2026-03-31_task04_branch_a_validation.md`
 49- `/Users/george/code/CIE-Unified/reports/2026-03-31_task04_branch_a_validation.json`
 50- `/Users/george/code/CIE-Unified/cie/__init__.py`
 51- `/Users/george/code/CIE-Unified/cie/runtime.py`
 52- `/Users/george/code/CIE-Unified/cie/validation.py`
 53- `/Users/george/code/CIE-Unified/tests/__init__.py`
 54- `/Users/george/code/CIE-Unified/tests/test_smoke.py`
 55- `/Users/george/code/CIE-Unified/tests/test_dynamics.py`
 56- `/Users/george/code/CIE-Unified/tests/test_sedimentation.py`
 57- `/Users/george/code/CIE-Unified/tests/test_validation.py`
 58
 59## Avoid Modifying
 60
 61- `/Users/george/code/CIE-Unified/README.md`
 62- `/Users/george/code/CIE-Unified/LOCKED_IMPLEMENTATION_SPEC.md`
 63
 64## Must Complete
 65
 66- mark Task 03 complete and Task 04 complete in the Branch A plan
 67- create this Task 04 prompt document in-repo
 68- implement a unified validation/reporting entrypoint for Branch A
 69- validate smoke + dynamics + sedimentation expectations in one consistent report schema
 70- check locked `snapshot_state()` fields and required runtime behaviors
 71- generate `/Users/george/code/CIE-Unified/reports/2026-03-31_task04_branch_a_validation.json`
 72- generate `/Users/george/code/CIE-Unified/reports/2026-03-31_task04_branch_a_validation.md`
 73- run the recommended validation commands
 74- record execution details and remaining review-stage limitations here
 75
 76## Acceptance Criteria
 77
 781. there is a unified validation/reporting entrypoint for Branch A
 792. validation covers smoke + dynamics + sedimentation expectations in a consistent schema
 803. validation checks locked `snapshot_state` fields and required runtime behaviors
 814. a machine-readable JSON report is generated
 825. a human-readable Markdown report is generated
 836. tests pass
 847. Branch A is documented as ready for review/comparison
 85
 86## Evaluation Requirements
 87
 88- use only the Python standard library in new runtime/validation code
 89- keep Branch A graph-native with `(phi, mu, J)` as the canonical state
 90- keep state minimal, parameters few, and observability explicit
 91- avoid `exact_text_map`
 92- avoid MoE-style substitution
 93- avoid latent-vector ontology as the real runtime state
 94- prefer an explicit readable harness over a larger framework
 95
 96## Recommended Validation Commands
 97
 98- `python3 -m unittest discover -s tests -v`
 99- `python3 -m cie.validation`
100
101## Delivery Requirements
102
103- commit on `branch-a/task04-validation-reporting`
104- push the branch to `origin`
105- keep the implementation independent from `branch-b` and other later branches
106- include execution record details for branch/base/backup/files/validation/results/limitations
107- leave Branch A ready for review/comparison rather than feature-expanded
108
109## Execution Record
110
111- actual branch name: `branch-a/task04-validation-reporting`
112- base commit: `164c06af63812c69cab32d8f8a6c770b96f38ef6`
113- backup path used for dirty-worktree handling: `none`
114- files changed:
115  - `/Users/george/code/CIE-Unified/plans/2026-03-31_branch_a_plan.md`
116  - `/Users/george/code/CIE-Unified/tasks/2026-03-31_task03_branch_a_sedimentation.md`
117  - `/Users/george/code/CIE-Unified/tasks/2026-03-31_task04_branch_a_validation_reporting.md`
118  - `/Users/george/code/CIE-Unified/cie/validation.py`
119  - `/Users/george/code/CIE-Unified/tests/test_validation.py`
120  - `/Users/george/code/CIE-Unified/reports/2026-03-31_task04_branch_a_validation.json`
121  - `/Users/george/code/CIE-Unified/reports/2026-03-31_task04_branch_a_validation.md`
122- validation commands:
123  - `python3 -m unittest discover -s tests -v`
124  - `python3 -m cie.validation`
125- concise test summary: `Ran 18 tests; all passed.`
126- concise report summary: `Validation report status PASS; interface, smoke, dynamics, sedimentation, and snapshot sections all passed; Branch A is ready for review/comparison.`
127- remaining known limitations for review stage:
128  - the validation harness is compact and scenario-based rather than a benchmark or long-run stability suite
129  - checks target the locked observable runtime surface, not richer semantic task-performance evaluation
130  - homing and sedimentation remain explicit but heuristic, which is acceptable for this comparison-ready stage