Structure-led search tool set to refine how scientists explore metabolomics data

Prefer Chemistry World
in Google search

No comments

A new tool hopes to make public metabolomics data more accessible through structure-based searching. Named StructureMASST , the platform returns every spectrum in its database where a molecule, structure or substructure has been seen before.

Public metabolomics data is growing ever more abundant, with increasing diversity. Methods of searching raw mass spectra have adapted to meet this rise: the introduction of indexing technologies to the Mass spectrometry search tool (MASST) in 2020 condensed a 20–40 minute search of metabolomics data to just seconds across over a billion mass spectra. Yet current methods still struggle with structure- and substructure-based searches where names of molecules or Smiles are used. A single molecule can have multiple different names, for example, making it an arduous task to find all its relating mass spectra.

Diagram illustrating StructureMASST: multiple mass spectra for a molecule are searched simultaneously across public metabolomics datasets to retrieve matches along with associated metadata such as organisms, tissues and experimental conditions

Source: © Yasin El Abiead et al 2026

StructureMASST aggregates all available MS/MS spectra associated with a molecule (or substructure) and searches them simultaneously

A team led by scientists at the University of California, San Diego and the University of California, Riverside, developed StructureMASST in answer to this issue and a second, more important problem. Cross-repository searches can be complicated by datasets that come from a range of instruments and acquisition conditions. Using StructureMASST, scientists can search across several major public metabolomics repositories, scanning multiple MS/MS spectra at once to find the organisms, organs or health conditions associated with a molecule.

The platform builds on pre-existing and well-developed metabolomics data repositories, informatics tools and workflows, including MASST and Pan-ReDU – a community resource that standardises metadata across metabolomics datasets to enable large‑scale comparative analyses. StructureMASST expands on Pan-ReDU’s success, by including data from the NORMAN/DSFP suspect screening repository and meta-visualising results with Sankey plots. The ability to filter over 1.5 million spectra by chemical name or structure and then search across all corresponding MS/MS for those molecules adds to its merit.

The result is a comprehensive dataset for users, with a simple method of obtaining them. The team believes StructureMASST will empower hypothesis generation, improve discovery and reveal new insights into metabolism, exposure and microbial interactions.

References

Yasin El Abiead et al, Nat. Biotechnol., 2026, DOI: 10.1038/s41587-026-03082-8

Frances BriggsFrances is a science writer intern at Chemistry World.View full profile

More Frances Briggs

Topics

Prefer Chemistry World
in Google search

No comments

Structure-led search tool set to refine how scientists explore metabolomics data

References

More Frances Briggs

An update on the Strait of Hormuz & potential explosives at an Antarctic base

Chemistry delivers £60.5 billion to the UK economy and makes local economies more resilient, says RSC report

Explainer: Potential explosives found at Antarctic explorer laboratories

Topics

Related articles

Molecular labels streamline high throughput mass spec analysis by rapidly ranking reactions

Elusive ‘dark matter’ diazo compounds captured from a human lung pathogen

What happens to our bodies after we die?

Mapping metabolite disturbances by drugs

Algorithm predicts bitterness from mass spectra data alone

Ancient microbial natural products reconstructed from Neanderthal dental plaques

No comments yet

Only registered users can comment on this article.

More News

‘Losing an entire generation of scientific talent’: top US universities accepted 15% fewer PhDs

STFC’s £160 million cost-cutting plan prompts concern over future access to key UK research facilities

Thermo Fisher antibody data manipulation is a breach of trust, say researchers

New York state sues chemical companies over PFAS pollution

UK’s defence research site unveils new NMR lab to protect against evolving chemical and biological threats

Chemical treatment slashes emissions of making hydrogen from plastic waste