Source Baseline Guide¶
Guide Maps¶
graph LR
family["Reproducible Research"]
program["Deep Dive DVC"]
guide["Capstone docs"]
section["SOURCE_BASELINE_GUIDE"]
page["Source Baseline Guide"]
proof["Release route"]
family --> program --> guide --> section --> page
page -.checks against.-> proof
flowchart LR
question["Need a clean learner or review bundle"] --> clean["Remove local-only state"]
clean --> check["Prove the source baseline is clean"]
check --> bundle["Build the tracked-source archive"]
bundle --> review["Inspect the archive instead of trusting the working tree"]
Use this guide when you need a source artifact that reflects the tracked capstone rather than whatever local state happens to be lying around in the working tree.
What this guide is protecting¶
This capstone deliberately creates local state while you work:
data/derived/,metrics/,models/,publish/, andstate/are generated outputs.dvc/cacheand.dvc-remote/are local storage surfaces, not learner-facing source.pytest_cache/,__pycache__/, and editable-install metadata are runtime residue
Those surfaces are useful while developing and verifying the capstone. They are not part of the clean source baseline another learner should download and inspect first.
Source baseline workflow¶
Run these commands from the capstone directory:
The intent of each step is different:
make source-baseline-cleanremoves local-only state that should never shipmake source-baseline-checkproves the tree is free of the known contamination pathsmake source-bundlewrites a tracked-source archive built fromgit ls-files, so the output depends on tracked repository state instead of local junk
What the source bundle includes¶
The source bundle includes tracked capstone files such as:
- capstone docs and review guides
dvc.yaml,dvc.lock,params.yaml, and tracked DVC metadata- source code under
src/ - tests and helper scripts
- committed source data and other tracked repository inputs
What the source bundle excludes¶
The source bundle excludes:
- generated pipeline outputs
- local caches and remotes
- bytecode, pytest state, and editable-install metadata
- any other untracked or ignored working-tree files
Best companion pages¶
README.mdRELEASE_REVIEW_GUIDE.mdRECOVERY_GUIDE.mdcourse-book/capstone/index.md