Flagship Benchmark Assets¶
The flagship benchmark packages are no longer treated like disposable test fixtures. They live under one product-owned root:
packages/bijux-proteomics-core/benchmark-assets/flagship-public-packages/
The punishing companion surface now lives beside them under:
packages/bijux-proteomics-core/benchmark-assets/flagship-challenge-corpora/
This handbook explains what is copied, what is generated, what public sources justify the copied files, how to rebuild the package metadata, and where the repository keeps the current freshness and obsolescence pressure visible.
The direct audit and lineage follow-up pages are:
- Benchmark Asset Audit
- Benchmark Freshness Review
- Benchmark Licensing and Redistribution
- Benchmark Incompleteness Ledger
- Benchmark Flagship Status
- DDA Benchmark Lineage
- DIA Benchmark Lineage
- LFQ Benchmark Lineage
- Multiplex Benchmark Lineage
- PTM Benchmark Lineage
- Targeted Benchmark Lineage
What Exists Per Package¶
Each flagship package root now carries:
- copied evidence snapshots or follow-up packets that the benchmark actually reviews
source_locator_manifest.jsoncitation_manifest.jsongenerated_boundary.jsonrebuild_instructions.mdpackage_manifest.jsonartifact_inventory.jsonquality_sheet.jsonlifecycle.json
The DDA package also carries:
scientific_invariants.jsonwarning_demonstrations.json
Shared Asset Governance Files¶
The root-level support files are:
packages/bijux-proteomics-core/benchmark-assets/flagship-public-packages/asset_root_contract.jsonpackages/bijux-proteomics-core/benchmark-assets/flagship-public-packages/freshness_report.jsonpackages/bijux-proteomics-core/benchmark-assets/flagship-public-packages/obsolescence_audit.json
Use them for different questions:
asset_root_contract.json: what each package root must contain and why it is allowed to existfreshness_report.json: whether the copied snapshots are still present and whether the public source pages are still reachableobsolescence_audit.json: whether the package is still scientifically too weak to count as a stable end state
How To Rebuild¶
When copied evidence, package metadata, or root-level reports need to be refreshed, run:
That command rewrites:
- the shared asset-root contract
- the freshness report
- the obsolescence audit
- each package root's source locator manifest
- each package root's citation manifest
- each package root's generated-boundary manifest
- each package root's rebuild instructions
- each package root's package manifest, artifact inventory, quality sheet, and lifecycle record
Current Scientific Limits¶
These package roots are stronger than the old fixture-only posture, but they do not yet close the scientific gap across all workflow families.
dda: outsider-auditable, but still built around imported-result snapshots instead of live in-repo search rerunsdia: publicly inspectable, but still library-conditioned and import-backedlfq: runtime-backed, but still blocked on stronger comparator and generalization proofmultiplex: runtime-backed, but still thin on lab consequence and broader authorityptm: runtime-backed, but still blocked on stronger comparator and PTM-family breadthtargeted: public and consequence-bearing, but still import-only and not yet calibration-strong enough for decision-grade trust
Those limits are now pressure-tested explicitly through the blinded holdout and perturbation roots in the Flagship Challenge Corpus Catalog.
First Proof Check¶
packages/bijux-proteomics-core/src/bijux_proteomics/benchmarks/flagship_asset_roots.pypackages/bijux-proteomics-core/src/bijux_proteomics/benchmarks/flagship_asset_maintenance.pypackages/bijux-proteomics-core/tests/benchmarks/test_flagship_asset_root_surface.pypackages/bijux-proteomics-core/benchmark-assets/flagship-public-packages
Challenge Pressure¶
Open Flagship Challenge Corpus Catalog when you need the frozen holdouts and adversarial perturbations that challenge these public package claims directly.