A Machine Learning Suite to Halo-Galaxy Connection
Abstract
As far as we know, galaxies form inside dark matter halos and elucidating this connection is a key element in theories of galaxy formation and evolution. In this work we propose a suite of machine learning tools to predict baryonic from halo properties in the IllustrisTNG300 magnetohydrodynamical simulation. We apply four methods: extremely randomized trees, K-nearest neighbors, light gradient boosting machine, and neural networks. Moreover, we combine the results of them in a stacked model. In addition, we apply all these methods in an augmented dataset using the synthetic minority over-sampling technique for regression with Gaussian noise, to deal with the problem of imbalanced data sets. Altogether, the ML algorithms are consistent at predicting central galaxy properties from a set of input halo properties such as halo mass, concentration, spin, and halo overdensity. For stellar mass, the Pearson correlation coefficient is 0.98, while for specific star formation rate, color, and size it is between 0.7–0.8. Lastly, the presented analysis adds evidence to previous works indicating that certain galaxy properties cannot be reproduced using halo features alone.
Más información
Título según SCOPUS: | ID SCOPUS_ID:85175996004 Not found in local SCOPUS DB |
Título de la Revista: | UNIVERSE OF DIGITAL SKY SURVEYS: A MEETING TO HONOUR THE 70TH BIRTHDAY OF MASSIMO CAPACCIOLI |
Volumen: | 60 |
Fecha de publicación: | 2023 |
Página de inicio: | 31 |
Página final: | 34 |
DOI: |
10.1007/978-3-031-34167-0_7 |
Notas: | SCOPUS |