Module Glossary¶
Page Maps¶
graph LR
family["Reproducible Research"]
program["Deep Dive DVC"]
section["Metrics Parameters Comparable Meaning"]
page["Module Glossary"]
capstone["Capstone evidence"]
family --> program --> section --> page
page -.applies in.-> capstone
flowchart LR
orient["Orient on the page map"] --> read["Read the main claim and examples"]
read --> inspect["Inspect the related code, proof, or capstone surface"]
inspect --> verify["Run or review the verification path"]
verify --> apply["Apply the idea back to the module and capstone"]
This glossary belongs to Module 05: Metrics, Parameters, and Comparable Meaning in Deep Dive DVC.
Use it to keep the module language stable while you move between the core lessons, the worked example, the exercises, and capstone review.
How to use this glossary¶
Read the directory index first. Return here when a metric review, parameter diff, plot, or release discussion starts to feel vague.
The goal is not extra theory. The goal is shared language for deciding whether two results are meaningfully comparable.
Terms in this directory¶
| Term | Meaning in this directory |
|---|---|
| metric | A recorded value that claims something about workflow behavior on a defined population under defined controls. |
| metric claim | The human interpretation attached to a metric value, including population, definition, and decision context. |
| comparison surface | The set of metrics, parameters, data identity, and review evidence needed to compare two runs fairly. |
| parameter control | A value that influences result behavior or interpretation and should be visible in review. |
| control surface | The parameter values that shape training, evaluation, publishing, or release decisions. |
| metric schema | The structure, names, units, nesting, and missing-value rules of a metric file. |
| schema drift | A change in metric file structure or meaning that can invalidate direct comparison. |
| additive metric change | A new metric or field added while existing metric meanings remain stable. |
| meaning-changing metric change | A metric change that alters definition, population, unit, threshold policy, or review interpretation. |
| population | The examples, records, incidents, or slice being measured by a metric or plot. |
| unit of analysis | The level at which a metric is computed, such as per incident, per customer, per day, or per alert. |
| fixed threshold | A threshold chosen before evaluation and kept stable for comparison. |
| threshold search | A procedure that chooses a threshold during or after evaluation, changing what fixed-threshold metrics can mean. |
dvc metrics diff |
A DVC command that reports changed metric values without judging semantic validity. |
dvc params diff |
A DVC command that reports changed parameter values that may alter metric interpretation. |
| baseline | The prior state chosen as the comparison reference for a review. |
| release metric | A metric promoted into a release boundary as evidence for downstream readers. |
| plot evidence | A visual artifact used to support a bounded review claim. |
| rendering noise | Plot change caused by unstable rendering, ordering, timestamps, or presentation behavior rather than meaningful data movement. |
| aggregation rule | The rule that combines records into a displayed or reported value, such as binning or averaging. |
| release note | A written review statement that names what changed, what stayed comparable, and what conclusion is justified. |
| semantic validity | The condition where a metric comparison actually supports the human claim being made from it. |
Stable review questions¶
Use these questions when the module feels abstract:
- What does this metric claim?
- Which population and unit of analysis does it describe?
- Which parameter values control the comparison?
- Did the metric schema stay stable?
- Does the diff answer the review question being asked?
- Did a plot change because of evidence or rendering noise?
- Is this number ready for release evidence, or only exploratory review?
- What conclusion can be written without overstating the metric movement?