Repository Layer Guide

Reference Position

flowchart TD
  family["Reproducible Research"] --> program["Deep Dive Snakemake"]
  program --> reference["Repository Layer Guide"]
  reference --> review["Design or review decision"]
  review --> capstone["Capstone proof surface"]

flowchart TD
  trigger["Hit a naming, boundary, or trade-off question"] --> lookup["Use this page as a glossary, map, rubric, or atlas"]
  lookup --> compare["Compare the current code or workflow against the boundary"]
  compare --> decision["Turn the comparison into a keep, change, or reject call"]

Read the first diagram as a lookup map: this page is part of the review shelf, not a first-read narrative. Read the second diagram as the reference rhythm: arrive with a concrete ambiguity, compare the current work against the boundary on the page, then turn that comparison into a decision.

Deep Dive Snakemake uses several repository layers on purpose. This page explains what each layer owns so the capstone stays legible as the learner moves past the top-level workflow entrypoint.

Use it when `workflow/`, `src/`, `profiles/`, and `config/` feel like parallel folders instead of an intentional architecture.

Reading Order

Read the repository layers in this order:

1. `capstone/Snakefile`
2. `capstone/workflow/rules/`
3. `capstone/workflow/modules/`
4. `capstone/workflow/scripts/`
5. `capstone/src/capstone/`
6. `capstone/profiles/`
7. `capstone/config/`

That order moves from orchestration entrypoint, to rule families, to modular grouping, to workflow-adjacent helpers, to reusable implementation code, to operating policy, and finally to declared configuration.
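The reading order above can also be written down as data, which makes it easy to check a checkout against the guide. The following is a minimal sketch, not part of the capstone itself: the helper names `reading_plan` and `missing_layers` are hypothetical, and the only assumption about the repository is that the seven paths listed above sit under a single capstone root.

```python
from pathlib import Path

# Reading order from this guide, paired with the role each layer plays.
READING_ORDER = [
    ("Snakefile", "orchestration entrypoint"),
    ("workflow/rules", "rule families"),
    ("workflow/modules", "modular grouping"),
    ("workflow/scripts", "workflow-adjacent helpers"),
    ("src/capstone", "reusable implementation code"),
    ("profiles", "operating policy"),
    ("config", "declared configuration"),
]


def reading_plan(capstone_root):
    """Return (path, role, exists) tuples in the guide's reading order."""
    root = Path(capstone_root)
    return [(rel, role, (root / rel).exists()) for rel, role in READING_ORDER]


def missing_layers(capstone_root):
    """List layers named in the guide that are absent from the checkout."""
    return [rel for rel, _role, exists in reading_plan(capstone_root) if not exists]
```

Calling `missing_layers("capstone")` from the repository root would list any layer the guide names but the checkout lacks, which is a quick way to notice that a layer has been renamed or folded into another.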


Layer Responsibilities

| Path | Responsibility |
| --- | --- |
| `Snakefile` | repository entrypoint and visible workflow assembly |
| `workflow/rules/` | rule families and declared file contracts |
| `workflow/modules/` | reusable workflow bundles that keep repository growth legible |
| `workflow/scripts/` | workflow-adjacent helpers that belong with orchestration rather than the Python package |
| `src/capstone/` | reusable implementation code for processing steps |
| `profiles/` | execution policy for local, CI, and scheduler-backed runs |
| `config/` | declared config inputs and schema validation boundaries |
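During review, the table above is often used in reverse: given a changed file, which layer owns it? A small sketch of that lookup follows. The function name `owning_layer` and the example file names in the usage note are hypothetical; only the layer paths and their responsibilities come from the table.

```python
from pathlib import PurePosixPath

# Layer paths and responsibilities, copied from the table in this guide.
LAYER_RESPONSIBILITIES = {
    "Snakefile": "repository entrypoint and visible workflow assembly",
    "workflow/rules": "rule families and declared file contracts",
    "workflow/modules": "reusable workflow bundles",
    "workflow/scripts": "workflow-adjacent helpers",
    "src/capstone": "reusable implementation code for processing steps",
    "profiles": "execution policy for local, CI, and scheduler-backed runs",
    "config": "declared config inputs and schema validation boundaries",
}


def owning_layer(relative_path):
    """Return the layer whose path is the longest prefix of relative_path.

    Returns None when the file sits outside every layer in the table.
    """
    parts = PurePosixPath(relative_path).parts
    best, best_len = None, 0
    for layer in LAYER_RESPONSIBILITIES:
        layer_parts = PurePosixPath(layer).parts
        # Compare whole path components, so "configs/x" does not match "config".
        if parts[: len(layer_parts)] == layer_parts and len(layer_parts) > best_len:
            best, best_len = layer, len(layer_parts)
    return best
```

For example, `owning_layer("workflow/rules/qc.smk")` resolves to `workflow/rules`, while a file such as `README.md` resolves to `None`, which is itself a useful signal that the file is outside the layered architecture.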


What Each Layer Must Not Do

| Path | Boundary to protect |
| --- | --- |
| `Snakefile` | should not become the only place where workflow truth can be located |
| `workflow/rules/` | should not hide implementation code that belongs in scripts or packages |
| `workflow/modules/` | should not bury the visible rule graph under indirection |
| `workflow/scripts/` | should not become a second undocumented application package |
| `src/capstone/` | should not silently mutate workflow meaning outside declared rule or config surfaces |
| `profiles/` | should not change the workflow's analytical meaning |
| `config/` | should not become an unvalidated dumping ground for hidden behavior |
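Two of these boundaries, the ones for `config/` and `profiles/`, lend themselves to a mechanical check. The sketch below is illustrative only and assumes both files have already been loaded into plain dictionaries. The key names (`samples`, `reference`, `min_quality`, `cores`) and the helper names are hypothetical, and a real project would derive the declared keys from its schema rather than hard-coding them.

```python
# Hypothetical declared config keys, standing in for what a schema would list.
DECLARED_CONFIG_KEYS = {"samples", "reference", "min_quality"}


def undeclared_config_keys(config, declared=DECLARED_CONFIG_KEYS):
    """Keys present in the config but absent from the declared set.

    A non-empty result suggests config is drifting toward an
    unvalidated dumping ground.
    """
    return sorted(set(config) - set(declared))


def profile_overrides(profile, declared=DECLARED_CONFIG_KEYS):
    """Profile keys that collide with declared analytical config keys.

    A non-empty result suggests a profile is changing analytical
    meaning rather than execution policy.
    """
    return sorted(set(profile) & set(declared))
```

With these helpers, `undeclared_config_keys({"samples": "s.tsv", "debug_hack": True})` flags `debug_hack`, and `profile_overrides({"cores": 4, "min_quality": 10})` flags `min_quality`. Either result is a prompt for a keep, change, or reject call rather than an automatic failure.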


Best Companion Pages

Use these pages with this guide:
