MAGICALL

Mathematics of generative models: an interdisciplinary analysis of loss function landscapes

Preview

MAGICALL is a theoretical research project aimed at gaining an in-depth understanding of modern generative models and variational inference, combining mathematics, statistical physics, and optimization.

Giulio Biroli, professor, Ecole normale supérieure (ENS-PSL) – Laboratoire de Physique de l’ENS (LPENS)

The MAGICALL project focuses on the theoretical foundations of modern generative models, such as diffusion models and variational inference. It aims to analyze loss function landscapes, learning dynamics, and generalization properties in high-dimensional contexts. Using tools from statistical physics, statistics, and optimization, the project aims to provide theoretical frameworks on the efficiency, stability, and reliability of these methods.

Key words: Generative models, Loss landscapes, Diffusion

Missions

Our researches

Understanding the mechanisms of generalization of diffusion models

The project will theoretically analyze the transition between memorization and generalization using controlled probabilistic models (Gaussian mixtures, hierarchical structures), studying the impact of data size, dimension, and learning dynamics.

Analyzing learning dynamics and the geometry of loss landscapes

Tools from statistical physics and optimization will be used to characterize the loss function landscapes of generative models and relate their structure to the convergence, stability, and performance properties of training algorithms.

Characterize the formation of data structures during learning

The project will investigate how latent data structures (modes, hierarchies, relevant subspaces) gradually emerge during the training of generative models, linking these phenomena to optimization time scales and data complexity.

Analyze and develop methods to limit the phenomenon of “mode collapse.”

Strategies based on guided distribution paths, annealing, and over-parameterization will be studied theoretically and numerically in order to identify conditions that guarantee robust exploration of multimodal distributions.

Structure and lead an interdisciplinary community focused on the mathematics of generative AI

The project will organize seminars, PEPR IA working groups, international collaborations, and a summer school to promote exchanges between mathematicians, physicists, and researchers in machine learning.

Consortium

École normale supérieure (ENS-PSL), CNRS

Scientific attempts

Societal impacts

Skills development

Publication

Autres projets

Géné-Pi

Mathematics of generative models

MacLeOD

Machine learning on geometries and distributions

MadLearning

Deep Learning Mathematics: From Theory to Applications

PERSNET

PERsistent Structures in Neural NETworks

PRODIGE-AI

PRObability, ranDom matrIx theory, Geometry and gEneralization for generative-AI

TENSOR4ML

TENSOR methods FOR mastering modern Machine Learning

THEOREM

Theory for more efficient generative models

Call for chairs Attractivités

The PEPR IA Research Program is opening its Call for Chairs Attractivité, aimed at junior and senior researchers, with the main criterion being an excellent track record in research in the PEPR IA themes.

NNawaQ

NNawaQ, Neural Network Adequate Hardware Architecture for Quantization (HOLIGRAIL project)

Package Python Keops

Package Python Keops for (very) high-dimensional tensor calculations (PDE-AI project)

MPTorch

MPTorch, a PyTorch-based framework for simulating and emulating custom precision DNN training (HOLIGRAIL project)

CaBRNeT

CaBRNeT, a library for developing and evaluating Case-Based Reasoning Models (SAIF project)

FloPoCo

FloPoCo (Floating-Point Cores), a generator of arithmetic cores and its applications to IA accelerators (HOLIGRAIL project)

SNN Software

SNN Software, Open Source Tools for SNN Design (EMERGENCES project)

SDOT

SDOT, A C++ and Python library for Semi-Discrete Optimal Transport (PDE-AI project)

Lazylinop

Lazylinop (Lazy Linear Operator), a high-level linear operator based on an arbitrary underlying implementation, (SHARP project)

CAISAR

CAISAR, a platform for characterizing artificial intelligence safety and robustness

P16

P16 or to develop, distribute and maintain a set of sovereign libraries for AI

AIDGE

AIDGE, the DEEPGREEN project's open embedded development platform

Jean-Zay

Jean Zay or the national infrastructure for the AI research community

ADAPTING

An approach that goes further than current hardware architectures, with the aim of reaching the next generation of AI applications.

Call of chairs Choose France – CNRS AI Rising Talents (closed call)

Call of chairs Choose France - CNRS AI Rising Talents (closed call)

CEA AI Rising Talents Grant

The CEA AI Rising Talents program offers you a tremendous opportunity to bring your ideas to life and lead your own research project for the benefit of industry and society.