TY - JOUR
T1 - Merged or monolithic? Using machine-learning to reconstruct the dynamical history of simulated star clusters
AU - Pasquato, Mario
AU - Chung, Chul
N1 - Publisher Copyright:
© ESO, 2016.
PY - 2016/5/1
Y1 - 2016/5/1
N2 - Context. Machine-learning (ML) solves problems by learning patterns from data with limited or no human guidance. In astronomy, ML is mainly applied to large observational datasets, e.g. for morphological galaxy classification. Aims. We apply ML to gravitational N-body simulations of star clusters that are either formed by merging two progenitors or evolved in isolation, planning to later identify globular clusters (GCs) that may have a history of merging from observational data. Methods. We create mock-observations from simulated GCs, from which we measure a set of parameters (also called features in the machine-learning field). After carrying out dimensionality reduction on the feature space, the resulting datapoints are fed in to various classification algorithms. Using repeated random subsampling validation, we check whether the groups identified by the algorithms correspond to the underlying physical distinction between mergers and monolithically evolved simulations. Results. The three algorithms we considered (C5.0 trees, k-nearest neighbour, and support-vector machines) all achieve a test misclassification rate of about 10% without parameter tuning, with support-vector machines slightly outperforming the others. The first principal component of feature space correlates with cluster concentration. If we exclude it from the regression, the performance of the algorithms is only slightly reduced.
AB - Context. Machine-learning (ML) solves problems by learning patterns from data with limited or no human guidance. In astronomy, ML is mainly applied to large observational datasets, e.g. for morphological galaxy classification. Aims. We apply ML to gravitational N-body simulations of star clusters that are either formed by merging two progenitors or evolved in isolation, planning to later identify globular clusters (GCs) that may have a history of merging from observational data. Methods. We create mock-observations from simulated GCs, from which we measure a set of parameters (also called features in the machine-learning field). After carrying out dimensionality reduction on the feature space, the resulting datapoints are fed in to various classification algorithms. Using repeated random subsampling validation, we check whether the groups identified by the algorithms correspond to the underlying physical distinction between mergers and monolithically evolved simulations. Results. The three algorithms we considered (C5.0 trees, k-nearest neighbour, and support-vector machines) all achieve a test misclassification rate of about 10% without parameter tuning, with support-vector machines slightly outperforming the others. The first principal component of feature space correlates with cluster concentration. If we exclude it from the regression, the performance of the algorithms is only slightly reduced.
KW - Galaxy: evolution
KW - globular clusters: general
KW - methods: numerical
KW - methods: statistical
UR - http://www.scopus.com/inward/record.url?scp=84965163521&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84965163521&partnerID=8YFLogxK
U2 - 10.1051/0004-6361/201425181
DO - 10.1051/0004-6361/201425181
M3 - Article
AN - SCOPUS:84965163521
SN - 0004-6361
VL - 589
JO - Astronomy and Astrophysics
JF - Astronomy and Astrophysics
M1 - A95
ER -