Search. Organize.
Understand. Synthesize.

An automated AI4S platform for frontier scientific research.

I

Work ledger

Research In Progress

Active proof and disproof work, kept close to the evidence instead of buried inside announcements.

OEIS / Lean 4 / counterexample search / formalization

Proofa237271

OEIS A237271 · Conjecture 2

The number of parts in the symmetric representation of σ(n) equals the count of 2-dense divisor sublists. Fully formalized in Lean 4 — zero sorry, no extra axioms, compiles clean.

✓ Solved
Disproofa248802

OEIS A248802 · Conjecture 5

Disproved A248802(138n+6) = 1669 by counterexample: at n=51, index 7044, the value 1399 divides 2^(2^7044+2)+3, so A248802(7044) ≤ 1399 < 1669.

&check; Solved
Disproofa241898

OEIS A241898 · Conjecture 0

Disproved the claimed lower bound a(n) > 7 for all n > 599: the value n = 736 = 4² + 12² + 24² yields a(736) = 4, a clean counterexample.

&check; Solved
Proofa281009

OEIS A281009 · Conjecture 1

Human-readable proof complete; the Lean formalization still carries one sorry. The machine-checked version is being finished now.

Runner solving
II

Problem atlas

Puzzle List

A compact inventory of scientific problems where AI systems can create leverage through search, simulation, proof, or experimental design.

45 puzzles / 5 domains / cost bands / method hypotheses

Mathematics
9
M1

OEIS Remaining ~450 Open Conjectures

AlphaProof Nexus proved 44/492; ~450 integer-sequence conjectures remain

$LLM + Lean formal verification
M2

Erdős Open Problems (~670)

1179 problems documented, ~59% still open — combinatorics, number theory, graph theory

$AlphaProof Nexus-like systems
M3

Frankl Union-Closed Sets Conjecture

In a union-closed family, at least one element appears in ≥ half the sets

$ML + information-theoretic bounds
M4

Ramsey Number Exact Values

R(5,5) still unknown (known: 43–46); improving small Ramsey number bounds

$$Deep RL + local search
M5

Lonely Runner Conjecture

k+1 runners at distinct speeds on a unit circle each have a moment ≥ 1/(k+1) from all others

$Computer-assisted proof + continuous optimization
M6

Erdős–Straus Conjecture

4/n = 1/x + 1/y + 1/z has positive integer solutions for all n ≥ 2

$Parameter-space search + polynomial family discovery
M7

Graceful Tree Conjecture

Every tree admits a graceful labeling

$GNN + graph search
M8

Peaceable Queens

Maximum equal-count non-attacking black and white queens on an n×n board

$RL + combinatorial optimization
M9

Reconstruction Conjecture

A graph is uniquely determined by its vertex-deleted subgraphs

$GNN learning deck→graph mapping
Physics
9
P1

Navier-Stokes Singularity Search

Do 3D Navier-Stokes equations develop finite-time singularities? A Clay Millennium Problem

$$Neural network singularity search
P2

Turbulence Closure Problem

Unresolved turbulence closure models; DNS cost scales as ~Re³

$$PINNs + interpretable AI (SHAP)
P3

Glass Transition

Mechanism of dynamical slowdown in liquid→glass transition — 'the deepest unsolved problem'

$$ML-accelerated molecular dynamics
P4

High-Temperature Superconductivity Mechanism

No consensus on cuprate microscopic mechanism; room-temperature SC unrealized

$$ML Tc prediction + candidate screening
P5

Quantum Many-Body Problem

Many-particle wavefunctions live in exponentially large Hilbert spaces

$$Neural quantum states (NQS)
P6

Quantum Spin Liquids

Disordered magnetic states at absolute zero; experimental confirmation remains difficult

$$RBM / autoregressive quantum state models
P7

Dark Matter Distribution Mapping

~27% of the universe's mass-energy, unobservable directly

$$UNet reconstruction + gravitational lensing
P8

Quantum Error Correction Decoding

Real-time decoding of surface-code syndromes faster and more accurately than algorithmic decoders

$Recurrent/transformer neural decoder (AlphaQubit)
P9

Neutrino Mass Ordering

Whether the neutrino mass spectrum is normal or inverted remains experimentally undetermined

$Simulation-based inference / normalizing-flow likelihoods
Chemistry & Materials
9
C1

DFT Exchange-Correlation Functional

The exact XC functional has never been found; all practical DFT uses approximations

$$Neural network universal XC functional
C2

Crystal Structure Prediction

Predicting crystal structure from composition — energy surfaces have astronomical minima

$$Diffusion models + generative AI
C3

Universal Machine Learning Force Fields

MLFFs must balance accuracy, speed, transferability, and long-range interactions

$$MACE-OFF, ANI, NequIP architectures
C4

Inverse Materials Design

Given target properties → generate materials; highly nonlinear mapping

$$Deep generative models + active learning
C5

Fully Automated Retrosynthesis

Given a target molecule → find optimal synthesis route; combinatorial explosion

$Template-free Transformer retrosynthesis
C6

Catalytic Reaction Mechanism Prediction

Predicting complete reaction mechanisms — bond breaking/forming order, transition states

$$ML catalyst→product mapping
C7

Solid-State Battery Electrolyte Discovery

Finding solid electrolytes with ionic conductivity ~10 mS/cm

$$ML screening + generative structure design
C8

Protein-Ligand Binding Affinity Prediction

Accurate, generalizable prediction of small-molecule binding affinity to proteins is still unsolved

$AlphaFold3-derived models + ML/physics hybrid scoring
C9

MOF Discovery for Gas Capture

Designing porous metal-organic frameworks with target porosity and capture performance on demand

$Generative LLMs + diffusion (MOFGPT, MOFDiff) + RL
Biology & Life Sciences
9
B1

Connectome → Function Mapping

Full fruit fly brain connectome complete, but structure→function gap remains

$GNN on connectome graphs
B2

Non-Coding DNA Function

98% of genome function unknown — contains enhancers, silencers, and more

$Enformer / Transformer models
B4

Intrinsically Disordered Proteins

~30% of residues lack stable structure, beyond AlphaFold's single-conformation reach

$$VAE / diffusion models for conformational ensembles
B5

Protein Dynamics & Conformational Changes

Multi-state protein conformations determine activity under different conditions

$$Flow matching + Boltzmann generators
B6

RNA 3D Structure Prediction

Known RNA structures are an order of magnitude fewer than proteins

$Language models + few-shot methods
B8

Cell Fate Determination

Is Waddington's landscape static or dynamic? Noise-driven or signal-driven differentiation?

$Optimal transport + attractor landscape analysis
B9

Genotype–Phenotype Mapping

GWAS variants explain only a fraction of heritability

$G2PT Transformer
B11

Splicing Code

Predicting which splice isoform is expressed in a given tissue

$SpliceAI / TrASPr+BOS
B15

Aging Prediction & Mechanisms

Causal mechanisms of aging clocks, not just correlative markers

$Deep aging clocks + multi-omics
Earth & Climate
9
E1

AMOC Tipping Point Prediction

Will the Atlantic Meridional Overturning Circulation collapse — and when?

$Deep learning + early warning signal detection
E2

Aerosol–Cloud Interactions

Largest source of uncertainty in climate models

$$ML aerosol activation simulators
E3

Earthquake Prediction

Accurately predicting time, location, and magnitude remains unsolved

$$Transformer + GNN (SeismoQuakeGNN)
E4

Species Extinction Risk Prediction

6th mass extinction — over half of data-deficient species are threatened

$RL integrating multi-source data
E5

ENSO Prediction Beyond the Spring Barrier

El Niño/La Niña forecasts collapse across the boreal-spring predictability barrier, limiting lead times

$CNN-Transformer hybrids (CTEFNet) + dynamical-DL fusion
E6

Antarctic Ice-Sheet & Sea-Level Projection

Marine ice-sheet instability makes the upper bound of 2100 sea-level rise deeply uncertain

$Neural-network ice-sheet emulators with uncertainty quantification
E7

Space-Weather / Geomagnetic Storm Prediction

Forecasting solar flares, CMEs and geomagnetic storms that threaten satellites and power grids remains unreliable

$Vision Transformers + CNNs on solar magnetograms
E8

Methane Super-Emitter Source Attribution

Pinpointing which facilities drive the disproportionate methane plumes seen from space is largely unsolved

$CNN plume detectors + transfer learning on Sentinel imagery
E9

Equilibrium Climate Sensitivity Constraint

How much warming a CO₂ doubling causes is still only bounded to a wide ~2.2–4.7 K range after 45 years

$ML emergent constraints + network-based model weighting