Supplementary tables:MetaFetcheR: An R package for complete mapping of small compound data

SND-ID: 2024-341. Version: 1. DOI: https://doi.org/10.57804/7sf1-fw75

Citation

Creator/Principal investigator(s)

Sara A. Yones - Uppsala University, Department of Cell and Molecular Biology

Rajmund Csombordi - Uppsala University, Department of Cell and Molecular Biology

Jan Komorowski - Uppsala University / Polish Academy of Sciences / Washington National Primate Research Center, Department of Cell and Molecular Biology, Computational Biology and Bioinformatics, Swedish Collegium for Advanced Study (SCAS) / Institute of Computer Science, Warsaw orcid

Klev Diamanti - Uppsala University, Department of Cell and Molecular Biology, Computational Biology and Bioinformatics, Department of Immunology, Genetics and Pathology orcid

Research principal

Uppsala University rorId

Description

The dataset includes a PDF file containing the results and an Excel file with the following tables:

Table S1 Results of comparing the performance of MetaFetcheR to MetaboAnalystR using Diamanti et al.
Table S2 Results of comparing the performance of MetaFetcheR to MetaboAnalystR for Priolo et al.
Table S3 Results of comparing the performance of MetaFetcheR to MetaboAnalyst 5.0 webtool using Diamanti et al.
Table S4 Results of comparing the performance of MetaFetcheR to MetaboAnalyst 5.0 webtool for Priolo et al.
Table S5 Data quality test results for running 100 iterations on HMDB database.
Table S6 Data quality test results for running 100 iterations on KEGG database.
Table S7 Data quality test results for running 100 iterations on ChEBI database.
Table S8 Data quality test results for running 100 iterations on PubChem database.

... Show more..
The dataset includes a PDF file containing the results and an Excel file with the following tables:

Table S1 Results of comparing the performance of MetaFetcheR to MetaboAnalystR using Diamanti et al.
Table S2 Results of comparing the performance of MetaFetcheR to MetaboAnalystR for Priolo et al.
Table S3 Results of comparing the performance of MetaFetcheR to MetaboAnalyst 5.0 webtool using Diamanti et al.
Table S4 Results of comparing the performance of MetaFetcheR to MetaboAnalyst 5.0 webtool for Priolo et al.
Table S5 Data quality test results for running 100 iterations on HMDB database.
Table S6 Data quality test results for running 100 iterations on KEGG database.
Table S7 Data quality test results for running 100 iterations on ChEBI database.
Table S8 Data quality test results for running 100 iterations on PubChem database.
Table S9 Data quality test results for running 100 iterations on LIPID MAPS database.
Table S10 The list of metabolites that were not mapped by MetaboAnalystR for Diamanti et al.
Table S11 An example of an input matrix for MetaFetcheR.
Table S12 Results of comparing the performance of MetaFetcheR to MS_targeted using Diamanti et al.
Table S13 Data set from Diamanti et al.
Table S14 Data set from Priolo et al.
Table S15 Results of comparing the performance of MetaFetcheR to CTS using KEGG identifiers available in Diamanti et al.
Table S16 Results of comparing the performance of MetaFetcheR to CTS using LIPID MAPS identifiers available in Diamanti et al.
Table S17 Results of comparing the performance of MetaFetcheR to CTS using KEGG identifiers available in Priolo et al.
Table S18 Results of comparing the performance of MetaFetcheR to CTS using KEGG identifiers available in Priolo et al.
(See the "index" tab in the Excel file for more information)

Small-compound databases contain a large amount of information for metabolites and metabolic pathways. However, the plethora of such databases and the redundancy of their information lead to major issues with analysis and standardization. Lack of preventive establishment of means of data access at the infant stages of a project might lead to mislabelled compounds, reduced statistical power and large delays in delivery of results.

We developed MetaFetcheR, an open-source R package that links metabolite data from several small-compound databases, resolves inconsistencies and covers a variety of use-cases of data fetching. We showed that the performance of MetaFetcheR was superior to existing approaches and databases by benchmarking the performance of the algorithm in three independent case studies based on two published datasets.

The dataset was originally published in DiVA and moved to SND in 2024. Show less..

Data contains personal data

No

Language

Method and outcome
Data collection
Geographic coverage
Administrative information

Identifiers

Topic and keywords

Research area

Computer science (Standard för svensk indelning av forskningsämnen 2011)

Bioinformatics (computational biology) (Standard för svensk indelning av forskningsämnen 2011)

Publications
Published: 2021-10-07
Last updated: 2024-06-24