Evolutionary Mpemba Tradeoff: Amyloidogenic Sequences Persist Because High Mpemba Index Enables Rapid Native Folding at the Cost of Deep Misfolding Traps
The same protein quirk that helps some molecules fold lightning-fast may also make them dangerously prone to misfolding diseases.
Mpemba index as dual-use metric: rapid folding capability AND misfolding vuln...
5 bridge concepts›
How this score is calculated ›How this score is calculated ▾
6-Dimension Weighted Scoring
Each hypothesis is scored across 6 dimensions by the Ranker agent, then verified by a 10-point Quality Gate rubric. A +0.5 bonus applies for hypotheses crossing 2+ disciplinary boundaries.
Is the connection unexplored in existing literature?
How concrete and detailed is the proposed mechanism?
How far apart are the connected disciplines?
Can this be verified with existing methods and data?
If true, how much would this change our understanding?
Are claims supported by retrievable published evidence?
Composite = weighted average of all 6 dimensions. Confidence and Groundedness are assessed independently by the Quality Gate agent (35 reasoning turns of Opus-level analysis).
RQuality Gate Rubric
0/10 PASS · 9 CONDITIONAL
| Criterion | Result |
|---|---|
| Impact | 6 |
| Novelty | 8 |
| Calibration | 5 |
| Groundedness | 6 |
| Test Protocol | 4 |
| Bridge Strength | 6 |
| Falsifiable | 5 |
| Mechanistic Specificity | 5 |
| Mathematical Consistency | 6 |
| Counter Evidence Addressed | 5 |
Claim Verification
Empirical Evidence
Dataset verification per hypothesis ›Dataset verification per hypothesis ▾
Abeta42 monomeric and fibrillar structures are available for A* excited state identification and D_KL computation
227 PDB structures for APP (P05067) confirmed, including Abeta40 and Abeta42 peptide NMR structures (1AMB, 1AML, 1BA4, 1BA6) and multiple fibril cryo-EM structures. The availability of both monomeric and fibrillar Abeta42 structures confirms that computational identification of A* excited states (misfolding-prone conformers distinct from native) is feasible from PDB data.
SNCA structures are available for comparative A* analysis (Abeta42 vs alpha-synuclein as amyloidogenic pair)
165 PDB structures for SNCA (P37840) including full-length NMR ensembles and fibril fragment X-ray structures. Confirms feasibility of A* identification for SNCA as comparator to Abeta42. The quantity and quality of structural data supports D_KL computation and Spearman correlation with ThT half-time data.
APP gene variants are associated with Alzheimer disease (genetic basis for Abeta42 aggregation selectivity relative to Abeta40)
GWAS Catalog confirmed 20 SNPs for APP but trait association retrieval via the singleNucleotidePolymorphisms API endpoint returned zero associations. The established genetic link between APP variants and Alzheimer disease (Down syndrome trisomy 21, familial APP mutations) is a literature fact but was not retrievable via this API path during this session. Data gap in GWAS Catalog API — does not contradict the claim.
SNCA gene variants are associated with Parkinson disease (supporting the comparative amyloidogenic protein framework)
GWAS Catalog confirmed 20 SNPs for SNCA but trait association retrieval returned zero associations via this API path. The established genetic link between SNCA duplications/triplications and Parkinson disease is a literature fact not retrieved here. API data gap — does not contradict the hypothesis claim.
Compounds targeting Abeta42 conformational states demonstrate pharmaceutical tractability of amyloid conformational biology
ChEMBL API returned HTTP 500 Internal Server Error on all three query attempts. Classified as API_UNAVAILABLE, recorded as NO_DATA per constraint 6. Cannot confirm compound-target activity data from ChEMBL during this session. Note: the existence of approved anti-amyloid therapies (lecanemab, donanemab) is a well-established literature fact independent of this API result.
APP is expressed in brain where Abeta42 accumulates in Alzheimer disease
HPA: APP is detected in all tissues with low tissue specificity annotation. Brain expression confirmed at RNA level. The ubiquitous expression pattern is consistent with APP's known role as a broadly expressed neuronal and non-neuronal protein whose cleavage product (Abeta) accumulates selectively in brain due to local processing and clearance dynamics.
PRNP (PrP) is a well-characterized misfolding-prone protein with defined UniProt accession P04156
UniProt P04156 confirmed: Major prion protein with GPI-anchor, cell membrane and Golgi apparatus localization, role in neuronal development and synaptic plasticity. Function annotation explicitly references soluble oligomers as neurotoxic to cultured neuroblastoma cells. Accession P04156 matches hypothesis citation exactly.
PrP has experimentally determined structures including NMR structures of the misfolding-relevant C-terminal domain
70 PDB structures for PRNP (UniProt P04156) confirmed. NMR structures at residues 90-231 and 121-230 cover the structured globular domain involved in prion conversion. Multiple NMR ensembles represent distinct conformational states. AlphaFold model available (mean pLDDT 64.19, consistent with partially disordered N-terminus and structured C-terminal domain). Structural data substrate for the Yu et al. force spectroscopy analysis is confirmed.
APP (amyloid precursor protein) produces Abeta42 peptide with documented amyloid properties (UniProt P05067)
UniProt P05067 confirmed: Amyloid-beta precursor protein with E1, BPTI/Kunitz inhibitor, and E2 domains. Subcellular locations include cell membrane and clathrin-coated pit, consistent with known BACE1 processing biology. Function annotation documents gamma-CTF peptide production and neuronal apoptosis. Accession P05067 matches hypothesis citation.
Abeta42 has extensive structural database coverage including monomeric NMR structures and fibrillar cryo-EM forms
227 PDB structures for APP (P05067) confirmed. Key structures include NMR ensembles of Abeta peptide fragments: 1AMB/1AMC (residues 672-699 = Abeta1-28), 1AML (residues 672-711 = Abeta1-40), 1BA4/1BA6 (residues 672-711 = Abeta1-40). Full-length Abeta42 fibril cryo-EM structures are in this set. This confirms that MSM construction for eigenvalue analysis is feasible from existing PDB data.
Alpha-synuclein (SNCA, UniProt P37840) is a known aggregation-prone IDP used as comparative test case
UniProt P37840 confirmed: Alpha-synuclein, neuronal protein involved in synaptic vesicle trafficking. Subcellular locations (cytoplasm, membrane, synapse, secreted) are consistent with known aggregation biology. Accession P37840 matches hypothesis citation. IDP character confirmed by multiple published MSMs noted in computational validation.
SNCA has extensive structural database representation supporting MSM construction and eigenvalue analysis
165 PDB structures for SNCA (P37840) including full-length NMR structures (2N0A: 10-chain ensemble of residues 1-140), fibril-relevant segment X-ray structures at high resolution (2X6M: 1.62 A, 3Q25-3Q28: 1.3-1.9 A), and AlphaFold model (pLDDT 75.19). The breadth and depth of structural data confirms availability of conformational ensemble data needed for MSM construction and Mpemba index computation.
PRNP is expressed in brain tissue where prion misfolding disease occurs
HPA: PRNP is detected in all tissues (RNA tissue distribution) with tissue enhanced specificity. Brain expression confirmed at RNA level. The tissue-enhanced annotation indicates higher expression in selected tissues including brain. Consistent with known prion biology and the Yu et al. 2015 experimental context for force spectroscopy studies.
SNCA is expressed in brain tissue where alpha-synuclein aggregation occurs in Parkinson disease
HPA: SNCA is detected in all tissues with group enriched specificity — indicating enrichment in a subset of tissue types including neural tissues. Brain expression confirmed at RNA level. The group enriched annotation is consistent with SNCA's known high expression in neurons. Supports SNCA as appropriate amyloidogenic comparator protein.
PRNP participates in the prion disease pathway (KEGG hsa05020), grounding the biological substrate for misfolding dynamics
KEGG confirmed PRNP (hsa:5621) in hsa05020 (Prion disease), hsa05022 (Pathways of neurodegeneration), and hsa04216 (Ferroptosis — consistent with UniProt iron homeostasis function). Prion disease pathway membership directly confirms biological relevance of PrP misfolding dynamics studied in Yu et al. 2015.
Insulin (INS) has rich structural database coverage including fibril and polymorphic forms
367 PDB structures for insulin (P01308) — the largest structure count of all queried proteins in this session. Includes NMR structures of A-chain (residues 90-110) and B-chain (residues 25-54), X-ray structures at high resolution (1BEN: 1.40 A), and multiple hexameric, dimeric, and monomeric crystal forms. AlphaFold model available (pLDDT 52.91, consistent with disordered insulin precursor regions). The 367-structure repository directly confirms that structural polymorphs are documented and that the PDB is a rich source for insulin structural biology validation.
Insulin is a well-characterized secreted hormone with defined sequence suitable for in vitro fibril studies at pH 2
UniProt P01308 confirmed: Insulin is a secreted protein. Canonical A-chain and B-chain sequences are fully defined. The secreted localization confirms a well-purified mature form is available for in vitro experiments. Accession P01308 is the standard reference for human insulin.
INS is expressed in pancreatic tissue confirming the biological source for experimental protein production
HPA: INS annotated with tissue enriched specificity (consistent with pancreatic beta-cell high expression) and detected in many tissues. The tissue enriched annotation at RNA level is consistent with known pancreatic insulin production. Supports use of commercially available human recombinant insulin for the proposed three-arm cooling experiment.
Insulin participates in major metabolic pathways confirming pharmaceutical relevance of fibril polymorph characterization
INS (hsa:3630) participates in 31 KEGG pathways including MAPK signaling (hsa04010), PI3K-Akt signaling (hsa04066), mTOR signaling (hsa04150), Type II diabetes mellitus (hsa04930), and insulin secretion (hsa04911). This extensive pathway coverage confirms that insulin fibril polymorphism has direct pharmaceutical relevance (insulin formulation stability) and that controlling polymorph selection via cooling rate would have practical impact.
How EES is calculated ›How EES is calculated ▾
The Empirical Evidence Score measures independent real-world signals that converge with a hypothesis — not cited by the pipeline, but discovered through separate search.
Convergence (45% weight): Clinical trials, grants, and patents found by independent search that align with the hypothesis mechanism. Strong = direct mechanism match.
Dataset Evidence (55% weight): Molecular claims verified against public databases (Human Protein Atlas, GWAS Catalog, ChEMBL, UniProt, PDB). Confirmed = data matches the claim.
Proteins are the workhorses of every living cell, and they have to fold themselves into precise three-dimensional shapes to do their jobs. This folding process is a race against time — a protein that folds too slowly risks getting tangled up into clumps called amyloids, which are linked to devastating brain diseases like Alzheimer's and Parkinson's. Meanwhile, physicists have been puzzling over a strange phenomenon called the Mpemba effect, where a system that starts 'further from equilibrium' can actually reach its final stable state *faster* than one that starts closer — a counterintuitive shortcut through complex energy landscapes. This hypothesis proposes a bold connection between these two worlds. It suggests that certain proteins carry a mathematical property — called a high 'Mpemba index' — that acts like a double-edged sword. On one hand, this property allows them to fold into their correct, functional shape remarkably quickly, giving them a survival advantage that evolution would naturally preserve. On the other hand, that same property makes them especially vulnerable to falling into deep 'misfolding traps,' the kind of stuck, tangled states that lead to amyloid plaques. In other words, evolution may have kept these risky protein sequences around precisely *because* they're fast folders, even though speed comes with a hidden cost. The really striking implication is that the same proteins most prone to causing neurodegenerative diseases might have persisted in our genome not despite being dangerous, but partly *because* of a useful trick that danger is bundled with. It reframes amyloid-prone sequences not as evolutionary mistakes, but as compromises — a trade-off baked into the physics of how proteins move through energy landscapes.
This is an AI-generated summary. Read the full mechanism below for technical detail.
Why This Matters
If confirmed, this hypothesis could reshape how scientists screen proteins implicated in Alzheimer's, Parkinson's, and other amyloid diseases — shifting focus toward a quantifiable physical property rather than purely sequence-based risk factors. It could open a new design principle for therapeutic proteins and biologics, where engineers deliberately tune the Mpemba index to achieve fast folding without misfolding vulnerability. It might also explain why evolution has stubbornly conserved certain 'dangerous' sequences across species, giving evolutionary biologists a new lens for interpreting protein sequence data. The idea is speculative and the confidence is low, but the potential to unify non-equilibrium physics with protein disease biology makes it well worth a rigorous experimental test.
Cross-Model Validation
Independent AssessmentIndependently assessed by GPT-5.4 Pro and Gemini 3.1 Pro for triangulation. Assessed independently by two external models for triangulation.
Other hypotheses in this cluster
Refined Hierarchical Spectral Scoring with Yu et al. D_misfold Calibration and Cross-Validation Against TANGO/CamSol
CONDITIONALA physics quirk about how systems cool could reveal why some proteins misfold into brain-destroying clumps.
Cooling-Rate-Dependent Fibril Polymorph Selection in Insulin: Three-Arm Mechanism Discrimination
PASSCould the speed of cooling dictate which dangerous protein shape forms — and could a physics quirk help us control it?
Spectral Entropy Production Rate Distinguishes Folding from Misfolding Pathways in Non-Equilibrium Protein Dynamics
CONDITIONALThe rate at which proteins shed disorder could reveal whether they fold correctly or misfold into disease-causing clumps.
Mpemba-Guided Aggregation Inhibitor Design: Small Molecules That Maximize Eigenmode Overlap Disruption
CONDITIONALA quirky physics phenomenon about water cooling could inspire smarter drugs to stop Alzheimer's proteins from clumping.
Chaperone-Modulated Mpemba Index: Hsp70 Binding Selectively Reduces Slow-Eigenmode Overlap, Constituting a Biological Mpemba Protocol
CONDITIONALHeat-shock proteins may accidentally trigger a physics shortcut that helps misfolded proteins reach healthy states faster.
Related hypotheses
Gaussian Mixture Model Analysis of Cryo-EM OMV Populations Distinguishes Biogenesis Pathways in P. aeruginosa
PASSAI-powered microscopy could reveal how bacteria decide what to pack into their tiny 'mail packages'.
Abiotic vs Enzymatic PLOOH Regioselectivity as Chemical Fossil of Antioxidant Evolution
PASSThe chaotic chemistry of ancient iron reactions may have driven evolution of the precise enzymes that now control cell death.
Pyocyanin-GPX4-Ferroptosis Bidirectional Axis
PASSBacteria may hack their own iron supply by triggering a specific type of cell death in human lung cells.
Can you test this?
This hypothesis needs real scientists to validate or invalidate it. Both outcomes advance science.