Workflow Recommendation Confidence¶
This route answers the question after grounding: how strong may the current recommendation sound once withheld evidence, comparator loss, and regret review are taken seriously.
bijux-proteomics-intelligence owns this because recommendation confidence is
not a summary score. It is the pressure surface that stops attractive workflow
stories from sounding more certain than the shipped challenge artifacts earn.
What Ships¶
The current confidence bundle is built around four public artifact families:
artifacts/intelligence/benchmark-decisions/counterfactual_recommendations.jsonartifacts/intelligence/benchmark-decisions/workflow_overconfidence_audit.jsonartifacts/intelligence/benchmark-decisions/workflow_underconfidence_audit.jsonartifacts/intelligence/benchmark-decisions/recommendation_regret_ledger.json
Together they show how fragile the current public recommendation sentence still is under evidence loss and hindsight review.
What The Current Bundle Says¶
- all five flagship public families currently collapse toward
do_not_recommendwhen comparator evidence is removed - all five flagship public families currently collapse toward
do_not_recommendwhen literature support is removed - all five flagship public families currently collapse toward
do_not_recommendwhen lab burden is doubled from the current shipped posture - targeted currently carries the strongest overconfidence score at
0.67 - none of the current flagship families yet shows a hindsight-backed underconfidence event strong enough to justify broader public wording
How To Read Recommendation Confidence¶
- read the counterfactual report first when the question is whether the current call survives one missing evidence axis
- read the overconfidence audit when the sentence sounds cleaner than the challenge surfaces justify
- read the regret ledger when the question is which kind of mistake maintainers are still most likely to make if hidden evidence is revealed later
What This Means For Public Language¶
The current recommendation layer is materially more serious than it was at
v0.3.7, but it still does not earn decision-grade authority.
Why:
- the public sentence still breaks too easily under evidence removal
- some families still carry visible overconfidence pressure
- downstream assay burden still narrows otherwise attractive recommendation stories
- LFQ, PTM, and targeted still stop at bounded recommendation posture
That is why the stronger current product still needs bounded recommendation language.
Family-Level Reading¶
| family | confidence reading today | why the sentence still stays narrow |
|---|---|---|
dda |
stronger than before, but still fragile under evidence removal | comparator and lab-burden loss still collapse the recommendation |
dia |
bounded outsider-facing recommendation support | evidence and consequence loss still narrow the call |
lfq |
real review-grade posture | missingness and transfer pressure still make the stronger sentence too expensive |
ptm |
stronger analytical posture than before | consequence confidence still lags localization strength |
targeted |
strongest current overconfidence pressure | calibration and interference risk still make broader certainty unsafe |
Best Next Routes¶
- Open Workflow Claim Grounding when the question is whether the sentence is supported at all.
- Open Lab Consequence when the question is whether downstream burden still narrows the apparently reasonable recommendation.
- Open Workflow Consequence Maps when the question is which downstream boundary currently caps the strongest honest public sentence.
- Open What Changed The Recommendation when you need the exact evidence-removal, burden, or observed-outcome driver.
- Open Decision Support when the full combined route matters more than the recommendation layer by itself.
Boundary¶
This page owns recommendation-pressure and confidence limits. It should not pretend to replace evidence grounding or lab consequence just because those later surfaces consume the recommendation output.