CosmoFlow: Scale-Aware Representation Learning for Cosmology with Flow Matching

πŸ“… 2025-07-15
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the high dimensionality and multiscale nature of cold dark matter (CDM) simulation field data by proposing the first unsupervised flow-matching generative model tailored for cosmological field data, enabling scale-aware latent representation learning. Methodologically, it employs the flow-matching framework to directly model the field distribution without supervision, learning a compact (32Γ— compression), semantically interpretable low-dimensional latent spaceβ€”where distinct latent channels explicitly encode features at different cosmological scales. Contributions include: (i) the first application of flow matching to cosmological field modeling; (ii) end-to-end disentanglement of multiscale features with physically grounded interpretability; and (iii) support for high-fidelity field reconstruction, physically consistent synthetic data generation, and cosmological parameter inference with sub-percent accuracy.

Technology Category

Application Category

πŸ“ Abstract
Generative machine learning models have been demonstrated to be able to learn low dimensional representations of data that preserve information required for downstream tasks. In this work, we demonstrate that flow matching based generative models can learn compact, semantically rich latent representations of field level cold dark matter (CDM) simulation data without supervision. Our model, CosmoFlow, learns representations 32x smaller than the raw field data, usable for field level reconstruction, synthetic data generation, and parameter inference. Our model also learns interpretable representations, in which different latent channels correspond to features at different cosmological scales.
Problem

Research questions and friction points this paper is trying to address.

Learn compact latent representations of CDM simulation data
Enable field reconstruction and synthetic data generation
Identify interpretable features across cosmological scales
Innovation

Methods, ideas, or system contributions that make the work stand out.

Flow matching for unsupervised latent learning
Compact 32x smaller CDM representations
Interpretable multi-scale cosmological features
πŸ”Ž Similar Papers
No similar papers found.
S
Sidharth Kannan
University of California, Santa Barbara
T
Tian Qiu
University of California, Santa Barbara
Carolina Cuesta-Lazaro
Carolina Cuesta-Lazaro
MIT
CosmologyMachine Learning
Haewon Jeong
Haewon Jeong
UCSB
Information TheoryMachine LearningFault-tolerant Computing