Exploratory analysis of metabolic changes using mass spectrometry data and graph embeddings

dc.contributor.affiliationPontificia Universidad Católica del Perú. Departamento de Ingeniería
dc.contributor.affiliationPontificia Universidad Católica del Perú. Instituto de Ciencias Ómicas y Biotecnología Aplicada (ICOBA)
dc.contributor.affiliationPontificia Universidad Católica del Perú. Departamento de Ciencias
dc.contributor.authorAlvarez-Mamani, E.
dc.contributor.authorBuettner, F.
dc.contributor.authorBeltrán Castañón, C.A.
dc.contributor.authorIbáñez, A.J.
dc.date.accessioned2026-03-13T17:00:50Z
dc.date.issued2024
dc.description.abstractMass spectrometry (MS)-based metabolomics analysis is a powerful tool, but it comes with its own set of challenges. The MS workflow involves multiple steps before its interpretation in what is denominate data mining. Data mining consists of a two-step process. First, the MS data is ordered, arranged, and presented for filtering before being analyzed. Second, the filtered and reduced data are analyzed using statistics to remove further variability. This holds true particularly for MS-based untargeted metabolomics studies, which focused on understanding fold changes in metabolic networks. Since the task of filtering and identifying changes from a large dataset is challenging, automated techniques for mining untargeted MS-based metabolomic data are needed. The traditional statistics-based approach tends to overfilter raw data, which may result in the removal of relevant data and lead to the identification of fewer metabolomic changes. This limitation of the traditional approach underscores the need for a new method. In this work, we present a novel deep learning approach using node embeddings (powered by GNNs).
dc.description.sponsorshipFunding: E.A.M. doctoral studies are funded by Consejo Nacional de Ciencia, Tecnología e Innovación Tecnológica (CONCYTEC), and Fondo Nacional de Desarrollo Científico, Tecnológico y de Innovación Tecnológica (FONDECYT), under contract No. 174-2020-FONDECYT "Doctoral Programs in Peruvian Universities". A.J.I. thank to The Max Planck Partner Group (Max Planck Institute for Chemical Ecology-Jena), and the Consejo Nacional de Ciencia, Tecnología e Innovación Tecnológica (CONCYTEC-Prociencia funding call E041-2024-01; PE501086715-2024-PROCIENCIA) for their financial support. We also thank Prof. Dr. Marcia González-Teuber (Pontificia Universidad Católica de Chile) for the Aristolochia chilensis' leaf dataset, and PD Dr. Reinhard Dechant (Calico) & Dr. Madina Mansurova (PUCP) for their valuable recommendations and review of this work.
dc.identifier.doihttps://doi.org/10.1038/s41598-024-80955-5
dc.identifier.urihttp://hdl.handle.net/20.500.14657/206766
dc.language.isoeng
dc.publisherNature Research
dc.relation.ispartofurn:issn:2045-2322
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.sourceScientific Reports; Vol. 14, Núm. 1 (2024)
dc.subjectMetabolomics
dc.subjectComputer science
dc.subjectSilhouette
dc.subjectWorkflow
dc.subjectData mining
dc.subjectGraph
dc.subjectExploratory data analysis
dc.subjectArtificial intelligence
dc.subjectBioinformatics
dc.subjectTheoretical computer science
dc.subject.ocdehttps://purl.org/pe-repo/ocde/ford#1.02.01
dc.titleExploratory analysis of metabolic changes using mass spectrometry data and graph embeddings
dc.typehttp://purl.org/coar/resource_type/c_6501
dc.type.otherArtículo
dc.type.versionhttps://vocabularies.coar-repositories.org/version_types/c_970fb48d4fbd8a85/

Files

Collections