Observability And Diagnostics
Observability for DAG focuses on making run outcomes explainable from command output and persisted evidence.
Visual Summary
flowchart LR
command[command output] --> run_state[run state model]
run_state --> artifacts[artifact evidence]
artifacts --> explain[replay and diff explain views]
explain --> remediation[targeted remediation]
Primary Diagnostic Surfaces
- `status` and `inspect` command outputs
- run manifests, outputs index, and node traces
- replay proof and replay explain artifacts
- diff explain summaries grouped by mismatch class
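As an illustration of what "grouped by mismatch class" can look like, here is a minimal sketch. The `MismatchClass` variants, the `Mismatch` record, and both functions are hypothetical names chosen for this example; they are not taken from the bijux-dag crates, and the real diff explain output may be shaped differently.

```rust
use std::collections::BTreeMap;

/// Hypothetical mismatch classes, used only for illustration.
#[derive(Debug, Clone, Copy, PartialEq, Eq, PartialOrd, Ord)]
enum MismatchClass {
    MissingOutput,
    ContentChanged,
    MetadataChanged,
}

/// Hypothetical record for one difference found between two runs.
#[derive(Debug, Clone)]
struct Mismatch {
    class: MismatchClass,
    node: String,
}

/// Group mismatches by class so each cluster can be reviewed as a unit.
/// A BTreeMap keeps the classes in a stable (declaration) order.
fn group_by_class(mismatches: &[Mismatch]) -> BTreeMap<MismatchClass, Vec<&Mismatch>> {
    let mut groups: BTreeMap<MismatchClass, Vec<&Mismatch>> = BTreeMap::new();
    for m in mismatches {
        groups.entry(m.class).or_default().push(m);
    }
    groups
}

/// Render one summary line per class, listing the affected nodes.
fn summarize(groups: &BTreeMap<MismatchClass, Vec<&Mismatch>>) -> Vec<String> {
    groups
        .iter()
        .map(|(class, items)| {
            let nodes: Vec<&str> = items.iter().map(|m| m.node.as_str()).collect();
            format!("{:?}: {} mismatch(es) on [{}]", class, items.len(), nodes.join(", "))
        })
        .collect()
}
```

Grouping before rendering keeps the summary short even when one root cause produces many individual mismatches.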
Diagnostic Workflow
- confirm run state and completion flags
- inspect missing or degraded artifact groups
- run replay explain for fidelity and environment gaps
- run semantic diff explain for drift clusters
- map findings to runtime/input/environment boundaries
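The steps above can be sketched as a small triage function. Everything here is an assumption made for illustration: the `Evidence` struct stands in for whatever the first four steps collect, and the mapping from each kind of evidence to a boundary (for example, drift clusters to the input boundary) is one plausible choice, not the project's documented behavior.

```rust
/// Boundaries named in the last workflow step.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Boundary {
    Runtime,
    Input,
    Environment,
}

/// Hypothetical evidence collected by the first four workflow steps.
#[derive(Debug, Default)]
struct Evidence {
    run_completed: bool,
    missing_artifact_groups: Vec<String>,
    environment_gaps: Vec<String>, // assumed to come from replay explain
    drift_clusters: Vec<String>,   // assumed to come from diff explain
}

/// One diagnostic finding, attributed to a single boundary.
#[derive(Debug, PartialEq, Eq)]
struct Finding {
    boundary: Boundary,
    note: String,
}

/// Walk the evidence in workflow order and attribute each item to a boundary.
/// The attribution rules are illustrative assumptions.
fn triage(evidence: &Evidence) -> Vec<Finding> {
    let mut findings = Vec::new();
    if !evidence.run_completed {
        findings.push(Finding {
            boundary: Boundary::Runtime,
            note: "run did not complete".to_string(),
        });
    }
    for group in &evidence.missing_artifact_groups {
        findings.push(Finding {
            boundary: Boundary::Runtime,
            note: format!("missing or degraded artifact group: {}", group),
        });
    }
    for gap in &evidence.environment_gaps {
        findings.push(Finding {
            boundary: Boundary::Environment,
            note: format!("replay environment gap: {}", gap),
        });
    }
    for cluster in &evidence.drift_clusters {
        findings.push(Finding {
            boundary: Boundary::Input,
            note: format!("drift cluster: {}", cluster),
        });
    }
    findings
}
```

Keeping the findings in workflow order means the cheapest checks (run state, artifact presence) surface before the more expensive replay and diff results.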
Code Anchors
- `crates/bijux-dag-app/src/routes/status_routes.rs`
- `crates/bijux-dag-app/src/routes/inspect_routes.rs`
- `crates/bijux-dag-runtime/src/replay/`
- `crates/bijux-dag-artifacts/src/integrity/`
Anti-Patterns
- treating all failures as transient retries
- mutating artifacts before capturing diagnostics
- promoting runs without explain evidence when drift exists
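Each anti-pattern above can be turned into an explicit guard. The sketch below is hypothetical throughout: `FailureKind`, `RunReview`, `Decision`, and the three functions are invented for this example and do not describe an existing API in the bijux-dag crates.

```rust
/// Hypothetical failure classification, used only for illustration.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum FailureKind {
    Transient,
    Deterministic,
}

/// Hypothetical review state for a run under investigation.
#[derive(Debug, Clone, Copy)]
struct RunReview {
    diagnostics_captured: bool,
    drift_detected: bool,
    explain_evidence_present: bool,
}

/// Outcome of a guard: allow the action or block it with a reason.
#[derive(Debug, PartialEq, Eq)]
enum Decision {
    Allow,
    Block(&'static str),
}

/// Retry only failures classified as transient, and only within a budget.
fn should_retry(kind: FailureKind, attempts: u32, max_attempts: u32) -> bool {
    kind == FailureKind::Transient && attempts < max_attempts
}

/// Refuse to mutate artifacts until diagnostics have been captured.
fn may_mutate_artifacts(review: &RunReview) -> Decision {
    if review.diagnostics_captured {
        Decision::Allow
    } else {
        Decision::Block("capture diagnostics before mutating artifacts")
    }
}

/// Refuse promotion when drift exists but no explain evidence is attached.
fn may_promote(review: &RunReview) -> Decision {
    if review.drift_detected && !review.explain_evidence_present {
        Decision::Block("drift exists without explain evidence")
    } else {
        Decision::Allow
    }
}
```

Returning a reason with each blocked decision keeps the guard's output explainable, in line with the goal of the rest of this page.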