Multimodal Universe: Enabling Large-Scale Machine Learning with 100TBs of Astronomical Scientific Data

“The Multimodal Universe dataset is a large scale collection of multimodal astronomical data, including images, spectra, and light curves, which aims to enable research into foundation models for astrophysics and beyond.”