News

Scientists use machine learning to gain unprecedented view of small molecules

A new tool to identify small molecules offers benefits for diagnostics, drug discovery and fundamental research.
Illustration of small molecules and their connections.
Metabolites are extremely small – the diameter of a human hair is 100,000 nanometers, while that of a glucose molecule is approximately one nanometer. Illustration: Matti Ahlgren/Aalto University.

A new machine learning model will help scientists identify small molecules, with applications in medicine, drug discovery and environmental chemistry. Developed by researchers at Aalto University and the University of Luxembourg, the model was trained with data from dozens of laboratories to become one of the most accurate tools for identifying small molecules.

Thousands of different small molecules, known as metabolites, transport energy and transmit cellular information throughout the human body. Because they are so small, metabolites are difficult to distinguish from each other in a blood sample analysis – but identifying these molecules is important to understand how exercise, nutrition, alcohol use and metabolic disorders affect wellbeing.

Metabolites are normally identified by analysing their mass and retention time with a separation technique called liquid chromatography followed by mass spectrometry. This technique first separates metabolites by running the sample through a column, which results in different flow rates – or retention times – through the measurement device. Mass spectrometry is then used to fine-tune the identification process by sorting metabolites according to their mass. Researchers can also break metabolites into smaller pieces to analyse their composition using a technique called tandem mass spectrometry.

‘Even the best methods can’t identify more than 40% of the molecules in samples without making some additional assumptions about the candidate molecules,’ says Professor Juho Rousu of Aalto University.

Now, Rousu’s group has developed a novel machine learning model to identify small molecules. It was recently published in Nature Machine Intelligence.

‘This new open-source model offers the whole research community an enriched view of small molecules. It will help research into methods to identify metabolic disorders, such as diabetes, or even cancer,’ says Rousu.

The new approach elegantly sidesteps one of the challenges facing conventional methods. Because the retention times of molecules vary from lab to lab, data cannot be compared between labs. Eric Bach, a doctoral student at Aalto, came up with an alternative during his PhD research that solved the problem.

‘Our research shows that while absolute retention times may vary, the retention order is stable across measurements by different labs,’ Bach explains. ‘This allowed us to merge all publicly available data on metabolites for the first time ever and feed it into our machine learning model.’

With the incorporation of data from dozens of laboratories around the globe, the machine learning model is accurate enough to distinguish between mirror image molecules, known as stereochemical variants. So far, identification tools have not been able to tell stereochemical variants apart, and the new capability is expected to open up new avenues in drug design and other fields.

‘The fact that using stereochemistry improved the identification performance is a revelation for all developers of metabolite identification methods’ says Emma Schymanski, associate professor at the Luxembourg Centre for Systems Biomedicine (LCSB) of the University of Luxembourg. ‘This method could also be used to help identify and trace micropollutants in the environment or characterise new metabolites in plant cells.’

Eric Bach

Doctoral Candidate
 Emma Schymanski

Emma Schymanski

Associate Professor
  • Updated:
  • Published:
Share
URL copied!

Read more news

Three people having a discussion at a table with laptops. Text: Visiting Professorships at TU Graz, October 1, 2026 - January 31, 2027.
Cooperation, Research & Art, Studies, University Published:

Apply Now: Unite! Visiting Professorships at TU Graz

TU Graz, Austria, invites experienced postdoctoral researchers to apply for two fully funded visiting professorships. The deadline for expressions of interest is 20 February 2026, and the positions will begin on 1 October 2026.

A modern lobby with a large brown sectional sofa, colourful artwork, and a staircase. A '50' logo is on the back wall.
Press releases Published:

Hanaholmen’s 50th anniversary exhibition lives on online – making the history of Finnish–Swedish cooperation accessible worldwide

MeMo Institute at Aalto University has produced a virtual 3D version of the anniversary exhibition of Hanaholmen.
Aerial view of a tram on a curved track surrounded by trees and buildings in a cityscape on a sunny day.
Awards and Recognition, Cooperation, Research & Art Published:

Environmental Structure of the Year 2025 Award goes to Kalasatama-Pasila tramway

The award is given in recognition of meritorious design and implementation of the built environment. Experts from Aalto University developed sustainability solutions for the project.
A blue figure holds two red, abstract creatures against a yellow background.
Aalto Magazine Published:

Five things everyone should know about creativity

Creativity is not the preserve of artists or a rare innate talent but a human capacity we all share – and one that can be measured, developed, and led for. The two-year Creative Leap project explored how creativity shows up in everyday life and work and how it connects to companies’ financial results. Here are five key takeaways.