THEOREM

Theory for more efficient generative models

Preview

Towards a better understanding of generative models in order to guide their adjustment, specialization, and fairness, thanks to a unified mathematical framework based on stochastic optimal control.

Alain Oliviero Durmus, Professor, Ecole polytechnique

Generative models have transformed the generation of images, texts, and scientific structures, but their design often remains empirical and difficult to control.
The THEOREM project proposes to “start from scratch” by developing a unified mathematical framework to explain what makes these models perform well and what causes them to fail (instability, bias, memorization, lack of robustness).
The goal is to translate practical recipes (schedules, time steps, choice of objectives) into design principles, with guarantees of stability and accuracy.
The project also aims to better understand how to adapt a model to a task or domain (specialization), while monitoring reliability and fairness.
Finally, the goal is to move from “artisanal” generative AI to more auditable and accountable generative AI.

Keywords: generative models, diffusion models, normalizing flows, energy-based models, stochastic optimal control, stability, convergence, fine-tuning, specialization, robustness, fairness, uncertainty, bias, AI for science.

Missions

Our researches

Unify generative models in a common language

Propose a theoretical framework that links diffusion and flows via a stochastic optimal control formulation, in order to compare, explain, and combine approaches in a consistent manner.

Transforming heuristics into a method

Derive design rules for key choices (training objectives, schedules, discretization, solvers) and establish practical criteria to guide fine-tuning.

Ensuring and diagnosing reliability

Develop analytical tools and safeguards (stability, errors, sensitivity) to understand when a model goes off track, why, and how to correct it in a robust manner.

Making specialization safer and more effective

Study specialization strategies (adaptation to a domain, a type of data, a physical constraint) while controlling generalization, uncertainty, and the risks of overfitting/memorization.

Putting equity at the heart of generative engineering

Analyze how biases arise and propagate in training and sampling, and propose regularization/diagnostic mechanisms to improve fairness without sacrificing quality.

Consortium

Université Clermont Auvergne, Université Paris Dauphine, Ecole Polytechnique de Paris

Scientific findings

Societal impacts

Skills development

Publication

Autres projets

Géné-Pi

Mathematics of generative models

MacLeOD

Machine learning on geometries and distributions

MadLearning

Deep Learning Mathematics: From Theory to Applications

MAGICALL

Mathematics of generative models: an interdisciplinary analysis of loss function landscapes

PERSNET

PERsistent Structures in Neural NETworks

PRODIGE-AI

PRObability, ranDom matrIx theory, Geometry and gEneralization for generative-AI

TENSOR4ML

TENSOR methods FOR mastering modern Machine Learning

Call for chairs Attractivités

The PEPR IA Research Program is opening its Call for Chairs Attractivité, aimed at junior and senior researchers, with the main criterion being an excellent track record in research in the PEPR IA themes.

NNawaQ

NNawaQ, Neural Network Adequate Hardware Architecture for Quantization (HOLIGRAIL project)

Package Python Keops

Package Python Keops for (very) high-dimensional tensor calculations (PDE-AI project)

MPTorch

MPTorch, a PyTorch-based framework for simulating and emulating custom precision DNN training (HOLIGRAIL project)

CaBRNeT

CaBRNeT, a library for developing and evaluating Case-Based Reasoning Models (SAIF project)

FloPoCo

FloPoCo (Floating-Point Cores), a generator of arithmetic cores and its applications to IA accelerators (HOLIGRAIL project)

SNN Software

SNN Software, Open Source Tools for SNN Design (EMERGENCES project)

SDOT

SDOT, A C++ and Python library for Semi-Discrete Optimal Transport (PDE-AI project)

Lazylinop

Lazylinop (Lazy Linear Operator), a high-level linear operator based on an arbitrary underlying implementation, (SHARP project)

CAISAR

CAISAR, a platform for characterizing artificial intelligence safety and robustness

P16

P16 or to develop, distribute and maintain a set of sovereign libraries for AI

AIDGE

AIDGE, the DEEPGREEN project's open embedded development platform

Jean-Zay

Jean Zay or the national infrastructure for the AI research community

ADAPTING

An approach that goes further than current hardware architectures, with the aim of reaching the next generation of AI applications.

Call of chairs Choose France – CNRS AI Rising Talents (closed call)

Call of chairs Choose France - CNRS AI Rising Talents (closed call)

CEA AI Rising Talents Grant

The CEA AI Rising Talents program offers you a tremendous opportunity to bring your ideas to life and lead your own research project for the benefit of industry and society.