What the model is
The AIOOJ Litigation Outcome Model is a structured probability engine built specifically for Harrison v Aegon / Transamerica Entities in the NSW Supreme Court Commercial List. It answers a single practical question: given everything known about the facts, the law, the defendant's conduct, and the judicial environment — what is the realistic probability distribution of outcomes, and what does that distribution mean for litigation strategy, settlement positioning, and counsel briefing?
The model is not a legal opinion. It does not predict what a court will decide. It translates legal analysis into structured probability estimates and produces a distribution of possible outcomes ranging from nil recovery to full recovery of the $100,743,642 claim. The output is a planning tool, not a guarantee.
The model has been developed through nine versions incorporating successive layers of empirical data, benchmark calibration, and architectural improvements. Version 9.1 is the current production version.
Three-stream architecture
The model's most important structural feature is that the plaintiff's claim operates across three independent recovery streams. Total defeat — a nil outcome — requires the defendant to defeat all three simultaneously. The plaintiff needs only one to succeed.
The combined failure probability — all three streams failing simultaneously at base values — is less than 0.007%. This is not optimism. It is the mathematically correct result of three high-probability independent streams operating under OR-gate logic.
Probability inputs P1–P9
Nine probability variables drive the model. Each represents a discrete legal issue assigned a Low / Base / High value. The Base value is the central estimate derived from case law analysis, the empirical dataset, and the specific facts of this proceeding.
| Input | Legal issue | Base | Range | Primary data anchor |
|---|---|---|---|---|
| P1 / L1 | Korea rate reduction success | 90% | 85–95% | Contract clarity; abandoned explanations; admitted errors |
| P2 / L2 | Deed / summary judgment (ultimate) | 89% | 82–94% | Guarantee real anchors 1.00 mean; s.54 acknowledgment at 95% certainty |
| P3 / L3 | EA breach methodology | 82% | 75–88% | Admitted errors; $3.319m corrective payment; retracted denials |
| P4 / L4 | s.55 postponement success | 80% | 70–86% | Active obstruction + illegal NDA; dataset calibration 0.80 |
| P5 / L5 | Demand accrual / s.63 rebuttal | 85% | 80–90% | Wardley Australia; three-limb s.63 rebuttal; December 2025 demand |
| P6 / L6 | Consulting Agreement stream | 81% | 72–85% | Contract/Construction real anchors 1.00 mean; dataset uplift 0.77→0.81 |
| P7 / L7 | Component B generalisation | 71% | 62–75% | Inferential; weakest link; conservative; widest range |
| P8 / L8 | Route C / fresh fraud | 78% | 72–84% | Void NDA conditions; consciousness of guilt; s.249K Crimes Act |
| P9 | Conduct / indemnity costs factor | 78% | 70–88% | Combined conduct pattern; 65–80% research range; costs overlay only |
P9 is isolated from the core merits simulation. It operates exclusively on the settlement band calculations and costs-adjusted recovery figures. This prevents double-counting — the conduct facts are already embedded in P4 and the environmental scoring.
Win function logic
The seven outcome paths are derived from the nine inputs through defined Boolean logic. The overall win function combines all three streams under an OR gate — any one succeeding produces a material recovery.
Note that L8 (Route C) is correctly wired as an independent OR gate in the Contract stream's limitation layer alongside L4 and L5. This was corrected in v9.1 — the original v9 Shapley sheet showed L8 at zero marginal contribution because it was absent from the win function gates despite being scored in the simulation inputs.
Data foundations
The model draws on two categories of data: case-specific data derived from the facts, law, and conduct of this proceeding; and environmental data reflecting the external context. Understanding the difference is essential to reading the outputs correctly.
18 cases from the HCA, NSWCA, and NSWSC were manually scored for relevance across five dimensions: claim construction (CDG), accounting/withholding (AWV), contract interpretation (CI), limitation/accrual (LAC), and early disposition (SJE). Key authorities include:
- Jones v Dunkel (1959) 101 CLR 298 — adverse inference from document withholding. Directly applicable: defendant withheld premium data through negotiations and the formal Clause 6.7 demand period.
- Chaplin v Hicks [1911] 2 KB 786 — a defendant cannot defeat a damages claim by relying on uncertainty of quantum where that uncertainty was caused by the defendant's own conduct.
- Wardley Australia Ltd v Western Australia (1992) 175 CLR 514 — demand-based accrual supporting the s.63 rebuttal.
- Mount Bruce Mining v Wright Prospecting [2015] HCA 37 and Electricity Generation Corp v Woodside Energy [2014] HCA 7 — commercial construction authorities supporting P3 and P6.
- Price v Spoor [2021] HCA 20 — limitation defence may be affected by agreement; limitation bars remedy rather than jurisdiction. Primary real anchor for P4.
- RHG Mortgage Ltd v Ianni [2015] NSWCA 56 — Jones v Dunkel inference applied to document withholding; confirmed inference can be outcome-determinative.
14 verified real anchor cases from official HCA/NSWCA sources are used in calibration, each weighted at 1.0. 86 structured comparator rows (synthetic range-fills) are retained for reference only and carry zero weight in the v9 calibration engine. The Bootstrap_Calibration sheet derives 95% confidence intervals using only the 14 real anchors — this produces deliberately wider intervals that honestly reflect the limited real-case sample available.
NSW Supreme Court Provisional Statistics 2024 inform time-to-trial assumptions. FCA empirical data (2023–24 and 2024–25 annual reports; 5-year longitudinal study across 19,483 cases) anchors the 1.5-year base and the three-branch disposition model: Summary Judgment 30% / 7 months; Negotiated 55% / 9 months; Full Trial 15% / 22 months.
Structural Advantage Index & Environmental Factors Index
Two scored indices apply uplifts to the base probability inputs. Both use a compression formula to prevent artificial inflation:
Hard caps prevent any input exceeding its judicial caution ceiling regardless of uplift stacking: P2 Deed caps at 0.92; P4 s.55 caps at 0.82; P3 EA caps at 0.88. P9 (costs factor) receives SAI uplift only — EFI uplift is excluded to prevent double-counting with the conduct facts already embedded in EFI scoring.
Anti-overfitting controls
A model with multiple positive reinforcing factors carries a serious risk of counting the same advantage multiple times. Eight deliberate score reductions were applied during development. Near-certainty of overall recovery persisted through all testing — confirming it is structural, not inflated.
- SAI transmission weight halved: from ×0.002 (early version) to ×0.001. Early version added +8.4% per input from SAI alone.
- EFI transmission weight halved again: from ×0.001 to ×0.0005. Environmental factors are supporting not primary.
- Compression formula applied: SAI 91.25 compresses to 78.9 (reduces uplift from +4.1% to +2.8%). Reflects judicial caution at high structural scores.
- P2 Deed hard-capped at 0.92. Spencer v Commonwealth SJ caution; corporate succession chain uncertainty.
- P4 s.55 hard-capped at 0.82. Courts apply s.55 cautiously; active obstruction ≠ guaranteed success; Briginshaw standard applies.
- P4 base reduced 0.82→0.80 per 100-case dataset: Limitation/Timing weighted mean = 0.624 across all cases.
- 60/40 calibration blend on every input: 60% case-specific legal analysis + 40% empirical dataset signal.
- EBI double-count prevention: EBI (Stage 1 environmental base rate) documented only — not separately added since it is already contained within EFI.
Reading the outputs correctly
P10 — the absolute worst case input: A tenth analytical input (P10) sits below the main model. It contains six event sliders (E1–E6) representing the individual failure probabilities of the six legal positions that must all fail simultaneously for the worst case to materialise. At defaults the joint failure probability is 0.027% (~1 in 3,700). The 12-year floor quantum is approximately $35m. The P10 stress tab on the heat map shows this worst case applied across all P2/P4 stress combinations simultaneously. The floor of $35m represents the defendant’s minimum rational exposure regardless of how every other argument is decided.
The model produces near-certain overall win probability (~99%+) at base inputs. This is the correct structural result, not an artefact of modelling. Three independent recovery streams each with high individual probability make simultaneous total defeat near-impossible. The genuine uncertainty is concentrated not in whether the plaintiff recovers, but in where in the recovery distribution the outcome lands.
| Output | What it means | How to use it |
|---|---|---|
| Overall win P | Probability of any material recovery above $0.01m | Primary settlement leverage number. The defendant must price this into any offer. |
| Expected recovery (EV) | Probability-weighted average recovery across all outcomes including nil | Anchor for settlement negotiations. A rational defendant will not offer materially less than this without specific contrary arguments. |
| Settlement resistance floor (P10) | Portfolio-weighted beta P10 — level below which only 10% of winning-scenario outcomes fall | The plaintiff's methodologically defensible walk-away threshold. Do not accept below this. |
| Branch-weighted PV | Present value across SJ (30%/7mo), Negotiated (55%/9mo), Trial (15%/22mo) branches | Correct number for settlement timing decisions. Higher than a simple point-estimate PV because the SJ branch pulls the distribution forward. |
| Costs-adjusted exposure | EV plus P9-weighted expected indemnity costs order | The defendant's true total economic exposure including costs risk. |
| P2×P4 heat map — P10 stress tab | EV $m tab: Deed-stream quantum (P2 × [P4×$100.7m + (1−P4)×$20m]) + Korea floor. Genuine gradient ~$51m to ~$92m across grid — P2 drives row variation, P4 drives column variation. P10 stress tab: stream decomposition with worst-case event failures — ~$50m to ~$73m at defaults. | Worst-case positioning — mediation floor anchor |
| P10 absolute worst case | Six event sliders (E1–E6, 10–20% each). Joint failure probability 0.027% (~1 in 3,700 after 50× correlation). 12-year floor quantum ~$35m. Drives P10 stress tab in real time. | Settlement floor — funder briefing |
- The exact probability of any specific outcome — the model produces distributions, not point predictions.
- The response of any individual judge — JAR is a directional adjustment, not a prediction.
- The effect of the Two Sets of Books reserve evidence (TAB_38_3), which is not currently reflected in the inputs.
- The defendant's internal settlement calculus — EFI models structural settlement pressure, not Aegon's board position.
- Post-judgment appellate risk — the model is calibrated for first-instance outcomes only.
Version 9.1 — three analytical fixes
Version 9.1 applied three targeted fixes identified in a post-v9 audit. None alters the core probability inputs, scenario weights, or recovery values.
- L8 Shapley zero-contribution defect corrected. Route C (L8) was scored in the simulation inputs but absent from the Boolean win function OR gates. The Shapley sheet therefore showed L8 contributing zero — a formula defect, not a legal conclusion. L8 is now correctly wired as an independent OR gate in the Contract stream's limitation layer (alongside L4 and L5), consistent with its legal role as an independent limitation extension under Route C / s.249K Crimes Act 1900 (NSW).
- Two-way sensitivity table replaced: P4×P5 → P2×P4. The original P4×P5 table was analytically flat — the OR-gate dominance of P2 at 0.89 suppressed all sensitivity to P4 and P5. The revised P2×P4 table tests simultaneous Deed underperformance and limitation failure — the genuine stress diagonal. Even at maximum joint stress (P2=0.60, P4=0.55), material recovery persists via Korea and demand accrual.
- Settlement resistance floor surfaced on Dashboard; Stress_Floor sheet added. The Quantum_Distribution P10 floor is now a named KPI on the Dashboard. The new Stress_Floor sheet provides Scenario A (stressed inputs, OR gates preserved) and Scenario B (correlated collapse, AND gate — extreme lower bound) adversarial analysis.
Known limitations
The following are acknowledged gaps in the empirical foundation. They are documented here in the interest of full transparency for counsel review.
- NSW Commercial List plaintiff vs defendant success rates at trial — courts do not publish outcome statistics by party type; calibration is inferred from case benchmarking.
- Quantitative Jones v Dunkel application rates — no systematic dataset tracks how frequently the inference proves decisive; calibration relies on doctrinal analysis.
- s.55 success rates specifically — concealment extension cases are not separately categorised in court statistics.
- Insurance industry commission dispute precedent — no reported NSW cases are directly analogous to the commission underpayment structure in this proceeding.
- The copula correlation matrix (Copula_Correlation sheet) provides analytical bounds on the true EV but is not yet implemented as a live Monte Carlo correlation engine — a full Gaussian copula simulation would require Python/R.