Refactor: Repositories, Codecs, and Schema Evolution¶

Concept Position¶

flowchart TD
  family["Python Programming"] --> program["Python Object-Oriented Programming"]
  program --> module["Module 06: Persistence, Serialization, and Schema Evolution"]
  module --> concept["Refactor: Repositories, Codecs, and Schema Evolution"]
  concept --> capstone["Capstone pressure point"]

flowchart TD
  problem["Start with the design or failure question"] --> example["Study the worked example and trade-offs"]
  example --> boundary["Name the boundary this page is trying to protect"]
  boundary --> proof["Carry that question into code review or the capstone"]

Read the first diagram as a placement map: this page is one concept inside its parent module, not a detached essay, and the capstone is the pressure test for whether the idea holds. Read the second diagram as the working rhythm for the page: name the problem, study the example, identify the boundary, then carry one review question forward.

Goal¶

Extend the monitoring-system capstone from an in-memory-only persistence story to a storage-aware design without letting persistence details invade the aggregate.

Refactor Outline¶

Define a repository contract in aggregate language.
Introduce explicit record mappers and a boundary codec for serialized policy state.
Add a version field to the serialized representation and an upcaster for one older shape.
Add optimistic concurrency tokens to persisted aggregates.
Keep publish-after-save behavior coordinated through the unit of work instead of the aggregate.

What to Watch For¶

Domain objects should still reject invalid state after rehydration.
Storage record types should stay separate from core domain types.
Conflict detection should surface as an explicit application error, not silent overwrite.
Compatibility rules should live in codec and migration code, not in random domain methods.

Suggested Verification¶

round-trip an aggregate through the codec and repository
load an older payload through the upcaster path
demonstrate a stale-write conflict with two loads of the same aggregate
verify that queued publications stay aligned with commit boundaries

Review Questions¶

Which part of the design now owns storage-shape knowledge?
Did any domain method start accepting raw persisted payloads or row-like data?
Can an unsupported schema version fail fast with a readable error?
Is there any last-write-wins behavior that remains accidental?