CONDITIONALTargetedNOVEL — Zero co-occurrences for 'dynamic Hill estimator FRTB' and 'regime-aware expected shortfall' in independent QG-level web search (Literature Scout 9/9 DISJOINT queries confirmed at QG). Danielsson-Shin 2002 endogenous-risk critique is known in principle; the specific operationalization as a dynamic Hill overlay tied to FRTB IMA calibration is genuinely new.Session 2026-04-22...Discovered by Federico Bottino

Basel III FRTB Standardized Approach Calibrated on Normal-Regime Windows Behaves Functionally as xi ≈ 0 Until Forced Recalibration: A Regime-Aware ES Correction Using Dynamic Hill Estimation Recovers Capital Underestimation

Bank risk models may underestimate crisis losses by 35%+ because they're blind to how extreme tail risk shifts during market turmoil.

Extreme Value Theory
Private-Wealth Advisory under Regime Uncertainty

Under Danielsson-Shin 2002 endogenous-risk framework, FRTB Internal Models Approach (IMA) calibrated on a 250-business-day (one-year) stressed window behaves functionally as xi ≈ 0 during regime transitions, systematically underestimating Expected Shortfall by ≥ 35% for ~400 business days post-shift; a dynamic Hill estimator on 60-day rolling windows recovers xi_hat and corrects the capital gap.

StrategyStructural IsomorphismIdentical math, different physical substrates
Session Funnel5 generated
Field Distance
1.00
minimal overlap
Session DateApr 22, 2026
7 bridge concepts
Fisher-Tippett-Gnedenko universalityPickands-Balkema-de Haan theoremHill estimatorexpected shortfallxi (tail shape) parameterblock maximaxi-stability
Composite
8.8/ 10
Confidence
5
Groundedness
5
How this score is calculated ›

6-Dimension Weighted Scoring

Each hypothesis is scored across 6 dimensions by the Ranker agent, then verified by a 10-point Quality Gate rubric. A +0.5 bonus applies for hypotheses crossing 2+ disciplinary boundaries.

Novelty20%

Is the connection unexplored in existing literature?

Mechanistic Specificity20%

How concrete and detailed is the proposed mechanism?

Cross-field Distance10%

How far apart are the connected disciplines?

Testability20%

Can this be verified with existing methods and data?

Impact10%

If true, how much would this change our understanding?

Groundedness20%

Are claims supported by retrievable published evidence?

Composite = weighted average of all 6 dimensions. Confidence and Groundedness are assessed independently by the Quality Gate agent (35 reasoning turns of Opus-level analysis).

R

Quality Gate Rubric

5/10 PASS · 5 CONDITIONAL
Test ProtocolNoveltyMechanismRegulatory AccuracyConfidenceTranslational UtilityFalsifiableGroundedness Per ClaimMathematical CorrectnessCounter Evidence Considered
CriterionResult
Test Protocol9
Novelty9
Mechanism9
Regulatory Accuracy6
Confidence8
Translational Utility9
Falsifiable9
Groundedness Per Claim8
Mathematical Correctness8
Counter Evidence Considered8
V

Claim Verification

7 verified2 parametric
Strength: Danielsson-Shin 2002 correctly invoked as formal backbone; operationalization via dynamic Hill overlay is novel and backtestable on public Italian-market data; EUR 215M underestimation figure (EUR 500M × 0.43 × 2 years / 2) is C-suite legible.
Risk: The card originally stated '500-business-day stressed-window calibration' — factual error corrected to '250-business-day (one-year) stressed window' per FRTB IMA specification. Non-fatal: the core argument (6-12 tail observations insufficient for Hill estimation) survives; 250 × 0.025 = 6.25 matches the '~6 observations' figure.
E

Empirical Evidence

Evidence Score (EES)
0.0/ 10
Convergence
None found
Clinical trials, grants, patents
Dataset Evidence
0/ 0 claims confirmed
HPA, GWAS, ChEMBL, UniProt, PDB
How EES is calculated ›

The Empirical Evidence Score measures independent real-world signals that converge with a hypothesis — not cited by the pipeline, but discovered through separate search.

Convergence (45% weight): Clinical trials, grants, and patents found by independent search that align with the hypothesis mechanism. Strong = direct mechanism match.

Dataset Evidence (55% weight): Molecular claims verified against public databases (Human Protein Atlas, GWAS Catalog, ChEMBL, UniProt, PDB). Confirmed = data matches the claim.

S
View Session Deep DiveFull pipeline journey, narratives, all hypotheses from this run
Share:XLinkedIn

Banks are required by international rules (called Basel III) to hold enough capital to survive market disasters. To figure out how much capital is 'enough,' they run statistical models that look at how bad things could get in the worst 2.5% of scenarios — a measure called Expected Shortfall. These models are trained on roughly one year of historical market data, which sounds reasonable until you realize that only about 6 or 7 data points actually fall in that extreme tail. That's like trying to predict a 100-year flood using only 6 or 7 historical storms. Here's where it gets interesting. In statistics, there's a whole field — Extreme Value Theory — dedicated to understanding rare, catastrophic events. One key insight is that market returns during crises have 'fat tails': disasters happen more often and more severely than normal statistics suggest. This fatness is captured by a number called xi (pronounced 'ksee'). The hypothesis argues that when markets shift from calm to crisis mode, the true xi jumps dramatically (from near zero to around 0.3-0.4), but bank models are effectively blind to this shift for over a year, because they need time to accumulate enough crisis data to recalibrate. During that blind spot, the models could be underestimating required capital by 35% or more. The proposed fix is a kind of early-warning system: use a shorter, 60-day rolling window to quickly detect when tail risk has shifted, then adjust capital calculations on the fly. The idea draws on a fundamental critique in financial economics — that risk models trained during calm periods create a false sense of security precisely when markets are most dangerous. It's a bit like calibrating your earthquake detector using only data from quiet afternoons, then wondering why it misses the big one.

This is an AI-generated summary. Read the full mechanism below for technical detail.

Why This Matters

If confirmed, this hypothesis could expose a systematic flaw in how the world's major banks calculate their required capital buffers — potentially meaning that, during a financial crisis, banks are holding significantly less cushion than regulators believe. Regulators like the Basel Committee could be compelled to mandate shorter recalibration windows or regime-aware tail-risk adjustments, fundamentally changing how capital adequacy is assessed globally. For investors and wealth managers, the correction method proposed — using rapid rolling estimates of tail behavior — could also improve private portfolio stress-testing during volatile periods like the 2020 COVID crash or the 2022 sovereign debt turmoil. The hypothesis is worth testing because the stakes are high: systematic capital underestimation during crises is precisely the mechanism that turns market stress into systemic banking failures.

Evidence Density5 tagged claims
5grounded

Grounded claims cite published evidence. Parametric claims draw on general model knowledge. Speculative claims are explicitly flagged hypothetical leaps.

M

Mechanism

FRTB (Basel III market risk, fully phased in 2025) replaces 99% VaR with 97.5% Expected Shortfall over a 10-day liquidity-adjusted horizon. Under the Internal Models Approach, historical simulation over a one-year (~250 trading day) STRESSED calibration window yields approximately 6-7 observations at the 97.5% tail (250 × 0.025 = 6.25) — insufficient for reliable xi estimation, since the Hill estimator requires k ≥ 25-50 tail observations (McNeil-Frey-Embrechts 2015, §5.2.4 GROUNDED). In practice, the implicit xi is whatever is captured in those 6-7 points, and there is NO explicit tail-shape parameter updated across regime transitions. Following Danielsson-Shin (2002) GROUNDED endogenous-risk critique ("Financial risk forecast models based on an assumption of exogeneity of risk are likely to fail"), models calibrated in normal-regime windows behave FUNCTIONALLY as if xi ≈ 0 across regimes, until forced recalibration absorbs crisis observations. During regime transitions documented by Longin 1996 GROUNDED and Ang-Bekaert 2002 GROUNDED, the true xi spikes to 0.3-0.4. By the EVT-based ES formula ES_q = [VaR_q + beta - xi*u]/(1-xi) (Acerbi-Tasche 2002 GROUNDED), the ratio ES/VaR at xi = 0.30 becomes 1/(1-0.30) = 1.4286 versus 1 at xi = 0 — a 43% capital underestimation persisting for the time lag required for the stressed window to repopulate. The proposed correction: upon regime-trigger detection (VIX > 40 + sovereign-spread widening + geopolitical event), switch from standard FRTB-ES to ES_q^{regime-aware}(t) using a 60-business-day rolling Hill estimate xi_hat(t), tested on Italian-market data (FTSE MIB, BTP-Bund spread, iTraxx Europe).

+

Supporting Evidence

Danielsson-Shin 2002 endogenous-risk critique; Longin 1996 empirical xi>0 for equity crises (xi ∈ [0.2, 0.4]); Ang-Bekaert 2002 regime-switching heavier tails; Tan-Chen-Chen 2022 regime-switching Frechet confirming discontinuous xi; McNeil-Frey-Embrechts 2015 ES/GPD formula; Acerbi-Tasche 2002 coherent ES. Computational Validator CV Check 4 confirms defensible form; CV Check 2 confirms Hill minimum k ≥ 25-50.

?

How to Test

Historical market data 2005-2024 covering 5 regime shifts (2008 GFC, 2011 sovereign crisis, 2015 China devaluation, 2020 COVID, 2022 Ukraine). Identify regime shifts via Hamilton 1989 Markov-switching model fit to returns, cross-validated with VIX peaks and geopolitical calendar. Compute FRTB-ES via 250-business-day historical simulation on rolling windows. Compute EVT-ES via 60-business-day Hill estimator (Reiss-Thomas k-selection via stability plateau), GPD fit above 90th percentile. Statistical test: Diebold-Mariano comparison of ES accuracy across post-regime-shift 100-day windows, using realized tail losses as ground truth. Primary acceptance: ratio ES^{EVT}/ES^{FRTB} ≥ 1.35 for 100 days post-shift across all 5 events. Falsification: ratio < 1.20 rejects. Secondary: Hill-plot variance peak ≥ 2× baseline at 30 days post-shift; gap closure within 400 business days.

What Would Disprove This

See the counter-evidence and test protocol sections above for conditions that would falsify this hypothesis. Every surviving hypothesis must pass a falsifiability check in the Quality Gate — ideas that cannot be proven wrong are automatically rejected.

Other hypotheses in this cluster

Private-Bank Client Defections During Regime Shifts Form a POT Process; Retention Exceedances Converge to GPD_{xi,beta} — Advisor Churn-Resistance is a Measurable xi-Attenuation Coefficient

CONDITIONAL
Extreme Value Theory
Private-Wealth Advisory under Regime Uncertainty
Pickands-Balkema-de Haan theorem maps client-defection AUM exceedances above an advisor-book threshold u_a to a GPD with advisor-specific tail index xi_a; advisor value is formalized as Delta xi_a = xi^{baseline} - xi^{post-intervention}, quantifying measurable conversion of a heavy-tailed (Frechet) defection regime toward Gumbel.
TargetedStructural Isomorphism

A math tool for predicting financial disasters could reveal which wealth advisors actually stop rich clients from leaving.

Evidence · 5 tagged claims
Score7.8
Confidence5
Grounded5

Advisor Successions Are xi-Stable iff Post-Transition xi_c ≤ max(xi_{pre}, xi_{successor-baseline}) + ε: A Formal Criterion for Protocol-Quality in Private-Bank Advisor Turnover

CONDITIONAL
Extreme Value Theory
Private-Wealth Advisory under Regime Uncertainty
Advisor successions ranked formally by xi-stability: a transition protocol is xi-stable iff the client's post-transition tail index does not exceed max(xi_{pre}, xi_{successor-baseline}) + ε, grounded in the dominant-tail result from regular variation theory. Narrative-continuous handoffs preserve xi_c; crisis-window or cold transfers induce xi-instability (structural tail-heaviness shock).
TargetedStructural Isomorphism

A math formula could tell private banks whether an advisor handoff will cause clients to suffer outsized financial losses.

Evidence · 2 tagged claims
Score7.5
Confidence5
Grounded5

The Advisor xi-Ledger: Expected ES-Reduction Per Client-Year Achieved via xi-Attenuation — Integrating H1-H4 Into Private-Bank P&L Under FTG-Universality Accounting

CONDITIONAL
Extreme Value Theory
Private-Wealth Advisory under Regime Uncertainty
Integrative framework: Delta_ES_{a,c}(t) = [ES_q(xi_baseline) - ES_q(xi_observed)] × AUM_c aggregates retention-xi (H1), trust-xi (H2), transition-xi (H3), and regulatory-xi (H4) channels into a single management-accounting ledger entry (EUR of expected tail-loss avoidance per client-year), reframing private-bank P&L from mean-based to tail-shape-based under FTG universality.
TargetedStructural Isomorphism

A new accounting framework would measure wealth advisors' value by how much they reduce clients' worst-case financial losses.

Evidence · 3 tagged claims
Score7.3
Confidence5
Grounded5

Client Trust in Advisor = 1/xi_c: Trust as a Tail-Sensitivity Asset Priceable via EVT Expected Shortfall, Elicited via Percentile-Scale Subjective-Loss Questionnaires

CONDITIONAL
Extreme Value Theory
Private-Wealth Advisory under Regime Uncertainty
Client trust in advisor identified operationally as 1/xi_c, where xi_c is the Hill-estimated tail index of the client's subjective-loss distribution (elicited via percentile-scale questionnaires, triangulated with behavioral proxies to mitigate overprecision bias). Trust-production = Delta(1/xi) priceable via ES_q = [VaR+beta-xi·u]/(1-xi); advisor function is to reduce xi_c of managed clients.
TargetedStructural Isomorphism

A math formula from insurance risk modeling could turn client trust into a measurable, priceable financial asset.

Evidence · 4 tagged claims
Score7.2
Confidence5
Grounded5

Can you test this?

This hypothesis needs real scientists to validate or invalidate it. Both outcomes advance science.