Test Strategy

The tests for bijux-canon-reason are the executable proof of its package contract.

This page shows the broad proof shape of the package so that readers do not treat the test tree as a bag of unrelated checks. It explains why these tests exist, not just where they live.

Treat the quality pages for bijux-canon-reason as the proof frame around the package. They should show how trust is earned and where skepticism still belongs.

Visual Summary

flowchart TB
    page["Test Strategy<br/>clarifies: see proof | see limitations | judge done-ness"]
    classDef page fill:#dbeafe,stroke:#1d4ed8,color:#1e3a8a,stroke-width:2px;
    classDef positive fill:#dcfce7,stroke:#16a34a,color:#14532d;
    classDef caution fill:#fee2e2,stroke:#dc2626,color:#7f1d1d;
    classDef anchor fill:#ede9fe,stroke:#7c3aed,color:#4c1d95;
    classDef action fill:#fef3c7,stroke:#d97706,color:#7c2d12;
    proof1["tests/unit for planning, reasoning, execution, verification, and interfaces"]
    proof1 --> page
    proof2["tests/e2e for API, CLI, replay gates, retrieval reasoning, and smoke coverage"]
    proof2 --> page
    proof3["tests/perf for retrieval benchmark coverage"]
    proof3 --> page
    risk1["CHANGELOG.md"]
    risk1 -.keeps trust honest.-> page
    risk2["pyproject.toml"]
    risk2 -.keeps trust honest.-> page
    risk3["README.md"]
    risk3 -.keeps trust honest.-> page
    bar1["proof before confidence"]
    page --> bar1
    bar2["done means defended behavior"]
    page --> bar2
    bar3["package trust after change"]
    page --> bar3
    class page page;
    class proof1,proof2,proof3 positive;
    class risk1,risk2,risk3 caution;
    class bar1,bar2,bar3 action;

Test Areas

tests/unit for planning, reasoning, execution, verification, and interfaces
tests/e2e for API, CLI, replay gates, retrieval reasoning, and smoke coverage
tests/perf for retrieval benchmark coverage
tests/docs for documentation-linked safeguards
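
The areas above can be written down as data so that a script or reviewer can run them one at a time. The following is a minimal sketch, not the package's actual tooling: it assumes the suite is run with pytest (this page does not confirm the runner), and the names `TEST_AREAS` and `pytest_command` are hypothetical.

```python
import sys

# Test areas exactly as listed on this page, with what each one protects.
TEST_AREAS = {
    "tests/unit": "planning, reasoning, execution, verification, and interfaces",
    "tests/e2e": "API, CLI, replay gates, retrieval reasoning, and smoke coverage",
    "tests/perf": "retrieval benchmark coverage",
    "tests/docs": "documentation-linked safeguards",
}


def pytest_command(area: str) -> list[str]:
    """Build an argv list that would run one test area.

    Assumption: pytest is the test runner for this package.
    """
    if area not in TEST_AREAS:
        raise KeyError(f"unknown test area: {area}")
    return [sys.executable, "-m", "pytest", area]


if __name__ == "__main__":
    # Print one command per area together with the behavior it defends.
    for area, protects in TEST_AREAS.items():
        print(" ".join(pytest_command(area)), "#", protects)
```

Keeping the description next to each path makes it harder to run an area without remembering which part of the contract it is supposed to defend.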

Concrete Anchors

tests/unit for planning, reasoning, execution, verification, and interfaces
tests/e2e for API, CLI, replay gates, retrieval reasoning, and smoke coverage
README.md

Use This Page When

you are reviewing tests, invariants, limitations, or ongoing risks
you need evidence that the documented contract is actually defended
you are deciding whether a change is truly done rather than merely implemented

Decision Rule

Use Test Strategy to decide whether bijux-canon-reason has actually earned trust after a change. If one narrow green check hides a wider contract, risk, or validation gap, the work is not done yet.
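
One way to make this rule concrete is to refuse a "done" verdict unless every test area has both run and passed. The sketch below is illustrative only: treating all four areas as required is an assumption, and `AreaResult`, `REQUIRED_AREAS`, `unmet_areas`, and `is_done` are hypothetical names, not part of bijux-canon-reason.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AreaResult:
    """Outcome of one test area for a given change (hypothetical shape)."""
    area: str
    ran: bool
    passed: bool


# Assumption: every area listed on this page must be green before "done".
REQUIRED_AREAS = ("tests/unit", "tests/e2e", "tests/perf", "tests/docs")


def unmet_areas(results: list[AreaResult]) -> list[str]:
    """Return required areas that are missing, skipped, or failing."""
    by_area = {result.area: result for result in results}
    unmet = []
    for area in REQUIRED_AREAS:
        result = by_area.get(area)
        if result is None or not result.ran or not result.passed:
            unmet.append(area)
    return unmet


def is_done(results: list[AreaResult]) -> bool:
    """A change is done only when no required area is left undefended."""
    return not unmet_areas(results)


if __name__ == "__main__":
    # A single narrow green check does not clear the bar.
    narrow = [AreaResult("tests/unit", ran=True, passed=True)]
    print(is_done(narrow), unmet_areas(narrow))
```

An area that was never run counts the same as one that failed, which is the point of the rule: an absent check cannot be read as evidence.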

What This Page Answers

what currently proves the bijux-canon-reason contract instead of merely describing it
which risks, limits, and assumptions still need explicit skepticism
what a reviewer should be able to say before accepting a change as done

Reviewer Lens

compare the documented proof story with the actual test layout and release posture
look for limitations or risks that should have moved with recent behavior changes
verify that the claimed done-ness standard still reflects real validation practice

Honesty Boundary

This page explains how bijux-canon-reason is supposed to earn trust, but it does not claim that prose alone is enough. If the listed tests, checks, and review practice stop backing the story, the story has to change.

Next Checks

move to foundation when the risk appears to be boundary confusion rather than missing tests
move to architecture when the proof gap points to structural drift
move to interfaces or operations when the proof question is really about a contract or workflow

Purpose

This page explains the broad testing shape of the package.

Stability

Keep it aligned with the real test directories and the behaviors they protect.
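
A documentation-linked safeguard for this alignment could be as small as checking that each directory named on this page still exists. The following is a sketch under assumptions: the directory list is copied from the Test Areas section, and `DOCUMENTED_AREAS` and `missing_test_areas` are hypothetical names rather than existing helpers in the package.

```python
from pathlib import Path

# Directories this page claims exist; copied from the Test Areas section.
DOCUMENTED_AREAS = ("tests/unit", "tests/e2e", "tests/perf", "tests/docs")


def missing_test_areas(package_root, areas=DOCUMENTED_AREAS):
    """Return documented test directories that are absent under package_root."""
    root = Path(package_root)
    return [area for area in areas if not (root / area).is_dir()]


if __name__ == "__main__":
    # Run from the package root; a non-empty list means the page has drifted.
    missing = missing_test_areas(".")
    if missing:
        print("documented but missing:", ", ".join(missing))
    else:
        print("all documented test areas are present")
```

This only detects directories that disappeared or were renamed. It does not show that the tests inside them still protect the behaviors described here, so it complements review rather than replacing it.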