Failure Recovery¶
When a rebuild fails, recover by boundary rather than rerunning the whole repository blindly.
Broad reruns often hide the real problem. A useful recovery path narrows the failing boundary first, then expands only when the cause is understood.
First Questions¶
- did command parsing fail before any tracked state changed?
- did collection or normalization fail inside one source family?
- did publication fail while writing
docs/report/outputs? - did a truth or contract check block an overclaim?
Recovery Route¶
- inspect the narrowest command you ran
- inspect the governed root it should have touched:
data/ordocs/report/ - rerun the narrowest relevant tests
- only rerun
make app-stateonce the broken boundary is understood
That sequence matters. If you skip straight to a full rebuild, you can make the diff larger while learning less.
High-Risk Cases¶
- partial aDNA refreshes that appear to improve counts without improving sample-owned evidence
- docs rewrites that narrow
01,02, or03while keeping pages green - report output changes that make atlas posture sound stronger than the support reviews justify
These are high-risk not because they always fail loudly, but because they can leave the repository looking healthier than it really is.