MatchSpectra function #52

tiwa1125 · 2022-01-05T09:12:04Z

Hi,
I tried to use the MetaboAnnotation to annotate my metabolites, and I used the HMDB to get the match, while when I used the code below, I even did not get any matches, that is weird, as I am sure there are many metabolites could be identified. Do you know what is the problem?

data("hmdb", package = "SpectraTutorials")
mtches <- matchSpectra(pest_ms2, hmdb,
param = CompareSpectraParam(requirePrecursor = TRUE, ppm = 30))
mtches
Object of class MatchedSpectra
Total number of matches: 0
Number of query objects: 5220 (0 matched)
Number of target objects: 458963 (0 matched)

Best
Tingting

jorainer · 2022-01-05T14:14:08Z

I assume the pest_ms2 contains MS2 spectra from the pesticide test data? If that's the case you'll not find matches in HMDB because HMDB provides human metabolite data and does not contain (AFAIK) plant pesticides You should however get some matches if you use MassBank as that resource contains also MS2 spectra from not naturally occuring compounds.

tiwa1125 · 2022-01-05T14:19:56Z

Hi Johannes, Thanks for your reply, I did not use the test data, but my own data, it is a sample from human blood, I run it on bruker qtof with DDA mode, so I assume I should get some matches, since I already know many metabolites are there by matching with the authentic standards. Did I make something wrong?

pest_ms2

MSn data (Spectra) with 5220 spectra in a MsBackendMzR backend: msLevel rtime scanIndex <integer> <numeric> <integer> 1 2 15.388 17 2 2 15.488 18 3 2 15.588 19 4 2 15.688 20 5 2 15.788 21 ... ... ... ... 5216 2 600.354 5753 5217 2 600.454 5754 5218 2 600.554 5755 5219 2 600.654 5756 5220 2 600.764 5757 ... 34 more variables/columns. file(s): TX41855.mzML Processing: Filter: select MS level(s) 2 [Fri Dec 31 17:42:26 2021] Best Tingting From: Johannes Rainer ***@***.***> Sent: Wednesday, January 5, 2022 3:14 PM To: rformassspectrometry/MetaboAnnotation ***@***.***> Cc: Tingting Wang ***@***.***>; Author ***@***.***> Subject: Re: [rformassspectrometry/MetaboAnnotation] MatchSpectra function (Issue #52) I assume the pest_ms2 contains MS2 spectra from the pesticide test data? If that's the case you'll not find matches in HMDB because HMDB provides human metabolite data and does not contain (AFAIK) plant pesticides You should however get some matches if you use MassBank as that resource contains also MS2 spectra from not naturally occuring compounds. — Reply to this email directly, view it on GitHub<#52 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AVISBX4PCUUUX6WTW2FMGH3UURG3VANCNFSM5LJMYIPQ>. Triage notifications on the go with GitHub Mobile for iOS<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>. You are receiving this because you authored the thread.Message ID: ***@***.******@***.***>>

jorainer · 2022-01-06T07:59:13Z

I would maybe suggest to remove some of the low intensity peaks in your MS2 spectra (similar to what I did in the tutorial). Then, importantly, HMDB does not provide precursor m/z values, so you should use requirePrecursor = FALSE. Unfortunately, this will have a huge impact on performance because then you'll compare all your experimental spectra with all database spectra. Maybe it would also be helpful to subset the HMDB data excluding predicted spectra (using hmdb <- hmdb[!hmdb$predicted])?

If you still miss some of the obvious hits, it might be that you have to further increase the ppm or tolerance. Some of the spectra in HMDB seem to be mis-calibrated (I had to use a large tolerance value to find some of our pure standards in HMDB data).

tiwa1125 · 2022-01-19T13:42:03Z

Hi, I finally made it work, while I got some weird results, I think the target compound ID does not fit with the fragments it found, e.g., the mass of compounds that HMDB ID represent are smaller than the precursor ion, how can I know what are the metabolites name of these features directly in the table? Or did I do something wrong? Best Tingting From: Johannes Rainer ***@***.***> Sent: Thursday, January 6, 2022 8:59 AM To: rformassspectrometry/MetaboAnnotation ***@***.***> Cc: Tingting Wang ***@***.***>; Author ***@***.***> Subject: Re: [rformassspectrometry/MetaboAnnotation] MatchSpectra function (Issue #52) I would maybe suggest to remove some of the low intensity peaks in your MS2 spectra (similar to what I did in the tutorial<https://jorainer.github.io/SpectraTutorials/articles/Spectra-matching-with-MetaboAnnotation.html>). Then, importantly, HMDB does not provide precursor m/z values, so you should use requirePrecursor = FALSE. Unfortunately, this will have a huge impact on performance because then you'll compare all your experimental spectra with all database spectra. Maybe it would also be helpful to subset the HMDB data excluding predicted spectra (using hmdb <- hmdb[!hmdb$predicted])? If you still miss some of the obvious hits, it might be that you have to further increase the ppm or tolerance. Some of the spectra in HMDB seem to be mis-calibrated (I had to use a large tolerance value to find some of our pure standards in HMDB data). — Reply to this email directly, view it on GitHub<#52 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AVISBX7JL5FC4K3TLN3I6WTUUVDVZANCNFSM5LJMYIPQ>. Triage notifications on the go with GitHub Mobile for iOS<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>. You are receiving this because you authored the thread.Message ID: ***@***.******@***.***>>

jorainer · 2022-01-31T14:12:04Z

Note that the masses of the compounds are exact masses, not m/z values, thus, they might indeed be smaller than the precursor ion, depending on what ion/adduct was fragmented. Hm, and you're right, the compound name is not available in that data.

I would thus suggest to do something different:

Install the CompoundDb package with BiocManager::install("RforMassSpectrometry/CompoundDb").
Download the CompDb database for HMDB (5.0 from 2021-11-02) from here (i.e. download the .sqlite file to your computer).
Use that database instead of the hmdb variable you use above.

Basically, in your script, replacing

data("hmdb", package = "SpectraTutorials")

with

library(CompoundDb)
cdb <- CompDb(<sqlite file name>)
hmdb <- Spectra(cdb)

(<sqlite file name> would then be the path and filename of the downloaded sqlite file) should allow you to use this resource instead. You could also have a look at this new tutorial for the MetaboAnnotation package for more information.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MatchSpectra function #52

MatchSpectra function #52

tiwa1125 commented Jan 5, 2022 •

edited

Loading

jorainer commented Jan 5, 2022

tiwa1125 commented Jan 5, 2022 via email

jorainer commented Jan 6, 2022

tiwa1125 commented Jan 19, 2022 via email •

edited

Loading

jorainer commented Jan 31, 2022

MatchSpectra function #52

MatchSpectra function #52

Comments

tiwa1125 commented Jan 5, 2022 • edited Loading

jorainer commented Jan 5, 2022

tiwa1125 commented Jan 5, 2022 via email

jorainer commented Jan 6, 2022

tiwa1125 commented Jan 19, 2022 via email • edited Loading

jorainer commented Jan 31, 2022

tiwa1125 commented Jan 5, 2022 •

edited

Loading

tiwa1125 commented Jan 19, 2022 via email •

edited

Loading