Skip to content

Glossary

Page Maps

graph LR
  family["Reproducible Research"]
  program["Deep Dive DVC"]
  section["Reproducibility Failures Real Teams"]
  page["Glossary"]
  capstone["Capstone evidence"]

  family --> program --> section --> page
  page -.applies in.-> capstone
flowchart LR
  orient["Orient on the page map"] --> read["Read the main claim and examples"]
  read --> inspect["Inspect the related code, proof, or capstone surface"]
  inspect --> verify["Run or review the verification path"]
  verify --> apply["Apply the idea back to the module and capstone"]

This glossary keeps the language of Module 01 stable.

The goal is practical clarity: when you use the same words for the same failure patterns, the rest of Deep Dive DVC becomes easier to reason about.

Terms

Term Meaning in this module
repeatability The ability to rerun a workflow in roughly the same local setup and get the same or similar result.
reproducibility The ability for another person to recover the same result later from the recorded artifacts, rules, and declared inputs rather than from private memory.
hidden state Any influential condition, input, or side effect that shapes the result without being made explicit enough for others to inspect and recover.
undeclared input A file, parameter, environment assumption, or manual step that affects a result but is not represented clearly in the workflow story.
local success A result that seems fine for one person on one machine right now, without proving that the team can recover it later.
source-control boundary The set of things Git is naturally strong at preserving, especially textual source history and deliberate edits.
result story The full explanation of how a result came to exist, including data, parameters, environment, workflow steps, and trusted outputs.
social memory Important workflow knowledge that lives mainly in people's recollection rather than in durable artifacts.
mechanical reproducibility The part of reproducibility concerned with explicit state, recorded relationships, and recoverable artifacts.
tool boundary The line between what DVC is meant to own and what still belongs to environment discipline, governance, scientific judgment, or release review.
trusted output A file, report, or metric the team actually treats as authoritative in decisions, reviews, or releases.
workflow inventory A structured description of a workflow's real inputs, assumptions, trusted outputs, and weak points before redesign begins.

How to use these terms

If a discussion in Module 01 starts getting vague, ask which term has become unclear:

  • are we describing repeatability or reproducibility?
  • is this input explicit, or is it hidden state?
  • is Git preserving this well, or is it outside the source-control boundary?
  • is this something DVC should own, or does it belong to a different tool boundary?

Those questions usually turn a fuzzy complaint into a useful diagnosis.