Upcoming radio telescope sky surveys are set to observe millions of early Universe galaxies. However, to handle this massive influx of data, automatic tools are essential. An algorithm developed by a team from the Institute of Astrophysics and Space Sciences (IA) at the Faculty of Sciences, University of Lisbon in Portugal, is designed to process this data and identify galaxies harboring massive black holes at their centers.
As far as the eye can see, galaxies fill the images of the deep Universe. What processes determined their shapes, colors and populations of stars? Astronomers think that primordial black holes were the engines of galaxies’ growth and transformation and can explain the cosmic landscape we see now.
A Breakthrough in Identifying Superluminous Galaxies
In an article recently published in the journal Astronomy & Astrophysics, an international team led by Rodrigo Carvajal, of the Institute of Astrophysics and Space Sciences (IA) and the Faculty of Sciences of the University of Lisbon (Ciências ULisboa), presents a machine learning technique that recognises superluminous galaxies in the early Universe.
These are galaxies thought to be dominated by the activity of a voracious black hole at their core. According to the authors, this should be the first algorithm that predicts when this activity also radiates an intense signal in the radio frequencies. Radio emissions are often distinct from the other light of the galaxy, and sometimes it is difficult to link them. This technique of artificial intelligence will enable astronomers to be more effective in the search for the so-called radio galaxies.
The algorithm, developed with the collaboration of the Closer company, acting in the sector of technological solutions for data science, was trained with images of galaxies obtained in several wavelengths of the electromagnetic spectrum. When tested with other images, it was able to predict four times more radio galaxies than the conventional methods that use explicit instructions. As machine learning develops its own algorithms, trying to understand its success may help clarify the physical phenomena that were happening in these galaxies, 1.5 billion of years after the Big Bang, that is, when the Universe had a tenth of its current age.
The Importance of Further Research and Analysis
“We have to find more active galaxies in the sky, because there are predictions that there should exist much more in the early history of the Universe. With the current observations we don’t have that number,” says Rodrigo Carvajal. According to this researcher, more observations are needed to verify if the current understanding about how active galaxies evolve is correct, or has to be modified.
“It’s also important to analyze the machine learning models themselves and to understand what’s happening inside them,” Carvajal adds. “Which features are the most relevant to the decision? For example, we want to know if the most important feature for the module to have stated that it is an active galaxy is the light the galaxy emits in the infrared, possibly an indication of rapid formation of new stars. With this, we are able to produce a new law to separate between what is a normal galaxy and an active galaxy.”
Investigating the Role of Radio Emissions and Star Formation
The relative weight of the galaxy features on the decision taken by the computer may point to what is at the origin of its intense activity, in particular in the radio band. In a study in preparation, Carvajal is exploring the implications of this apparent dependency between the radio emission and the formation of stars. Israel Matute, of IA and Ciências ULisboa, the second author of the paper, clarifies: “These models are mathematical tools that help us to look into the right direction when the complexity of the data increases. This work might provide insights into the processes that curbed the formation of new stars in the second half of the history of the Universe.”
The galaxies that seem to be lacking in the primordial Universe may be in the large mass of data that modern radio telescopes will produce in the coming years. Future surveys of extensive regions of the sky will reveal billions of galaxies. One example is the Evolutionary Map of the Universe (EMU), that will map the whole southern celestial hemisphere with the ASKAP radio telescope, in Australia. The team led by IA is already working with data from a pilot project of this survey. Once perfectioned, these tools will be crucial for the processing of the astronomical amount of data the future Square Kilometre Array Observatory (SKAO) will produce. Portugal is a member of the consortium of this observatory, which is already under construction.
“In a new age when astronomy will have access to vast amounts of data, it is increasingly more important the development of advanced techniques for their processing and analysis,” says José Afonso, of IA and Ciências ULisboa and co-author of this paper. “At IA we are developing and implementing these techniques, to be able to decipher the origin of galaxies and the supermassive black holes that most of them host.”
The idea for the collaboration between the Closer company and IA was put forward by one of the co-authors, Helena Cruz, who holds a PhD in Physics and is a data scientist at Closer. Her involvement was key to analyze and process the impact of uncertainties and inconsistencies between different data sources – coming from several telescopes and observation programmes – used to train the machine learning algorithm.
“I became aware that Astronomy is a field with great opportunities for the exploration and development of models of machine learning, and it made sense to me to apply my professional skills to this field,” says Helena Cruz. “I shared my interest with Closer and both parties showed immediately their willingness to collaborate, which I see as an extension of my work at the company.”
“Closer thrives from the knowledge of its collaborators, this is its capital,” adds João Pires da Cruz, Closer co-founder, professor and researcher. “The more challenging and sophisticated from a scientific point of view are the projects in which our team members get involved, the greater will be the company’s capital. We will have collaborators able to solve the problems of our clients that are similar to the problem of the signals from distant galaxies.”
Reference: “Selection of powerful radio galaxies with machine learning” by R. Carvajal, I. Matute, J. Afonso, R. P. Norris, K. J. Luken, P. Sánchez-Sáez, P. A. C. Cunha, A. Humphrey, H. Messias, S. Amarantidis, D. Barbosa, H. A. Cruz, H. Miranda, A. Paulino-Afonso and C. Pappalardo, 6 December 2023, Astronomy & Astrophysics.