A Bayesian approach to the alignment of mass spectra

Xiaoxiao Kong; Cavan Reilly

doi:10.1093/bioinformatics/btp582

A Bayesian approach to the alignment of mass spectra

Xiaoxiao Kong, Cavan Reilly

Biostatistics

Research output: Contribution to journal › Article › peer-review

10 Scopus citations

Abstract

Motivation: The need to align spectra to correct for mass-to-charge experimental variation is a problem that arises in mass spectrometry (MS). Most of the MS-based proteomic data analysis methods involve a two-step approach, identify peaks first and then do the alignment and statistical inference on these identified peaks only. However, the peak identification step relies on prior information on the proteins of interest or a peak detection model, which are subject to error. Also numerous additional features such as peak shape and peak width are lost in simple peak detection, and these are informative for correcting mass variation in the alignment step. Results: Here, we present a novel Bayesian approach to align the complete spectra. The approach is based on a parametric model which assumes that the spectrum and alignment function are Gaussian processes, but the alignment function is monotone. We show how to use the expectation-maximization algorithm to find the posterior mode of the set of alignment functions and the mean spectrum for a patient population. After alignment, we conduct tests while controlling for error attributable to multiple comparisons on the level of the peaks identified from the absolute mean spectra difference of two patient populations.

Original language	English (US)
Article number	btp582
Pages (from-to)	3213-3220
Number of pages	8
Journal	Bioinformatics
Volume	25
Issue number	24
DOIs	https://doi.org/10.1093/bioinformatics/btp582
State	Published - Oct 9 2009

Bibliographical note

Funding Information:
Funding: National Institutes of Health (grant P01-AI074340); University of Minnesota graduate dissertation fellowship.

Access

10.1093/bioinformatics/btp582

OpenUrl availability

Full text

Cite this

@article{c03d3fb2193d4c87a2e918ebcdd3be58,

title = "A Bayesian approach to the alignment of mass spectra",

abstract = "Motivation: The need to align spectra to correct for mass-to-charge experimental variation is a problem that arises in mass spectrometry (MS). Most of the MS-based proteomic data analysis methods involve a two-step approach, identify peaks first and then do the alignment and statistical inference on these identified peaks only. However, the peak identification step relies on prior information on the proteins of interest or a peak detection model, which are subject to error. Also numerous additional features such as peak shape and peak width are lost in simple peak detection, and these are informative for correcting mass variation in the alignment step. Results: Here, we present a novel Bayesian approach to align the complete spectra. The approach is based on a parametric model which assumes that the spectrum and alignment function are Gaussian processes, but the alignment function is monotone. We show how to use the expectation-maximization algorithm to find the posterior mode of the set of alignment functions and the mean spectrum for a patient population. After alignment, we conduct tests while controlling for error attributable to multiple comparisons on the level of the peaks identified from the absolute mean spectra difference of two patient populations.",

author = "Xiaoxiao Kong and Cavan Reilly",

note = "Funding Information: Funding: National Institutes of Health (grant P01-AI074340); University of Minnesota graduate dissertation fellowship.",

year = "2009",

month = oct,

day = "9",

doi = "10.1093/bioinformatics/btp582",

language = "English (US)",

volume = "25",

pages = "3213--3220",

journal = "Bioinformatics",

issn = "1367-4803",

publisher = "Oxford University Press",

number = "24",

}

TY - JOUR

T1 - A Bayesian approach to the alignment of mass spectra

AU - Kong, Xiaoxiao

AU - Reilly, Cavan

N1 - Funding Information: Funding: National Institutes of Health (grant P01-AI074340); University of Minnesota graduate dissertation fellowship.

PY - 2009/10/9

Y1 - 2009/10/9

N2 - Motivation: The need to align spectra to correct for mass-to-charge experimental variation is a problem that arises in mass spectrometry (MS). Most of the MS-based proteomic data analysis methods involve a two-step approach, identify peaks first and then do the alignment and statistical inference on these identified peaks only. However, the peak identification step relies on prior information on the proteins of interest or a peak detection model, which are subject to error. Also numerous additional features such as peak shape and peak width are lost in simple peak detection, and these are informative for correcting mass variation in the alignment step. Results: Here, we present a novel Bayesian approach to align the complete spectra. The approach is based on a parametric model which assumes that the spectrum and alignment function are Gaussian processes, but the alignment function is monotone. We show how to use the expectation-maximization algorithm to find the posterior mode of the set of alignment functions and the mean spectrum for a patient population. After alignment, we conduct tests while controlling for error attributable to multiple comparisons on the level of the peaks identified from the absolute mean spectra difference of two patient populations.

AB - Motivation: The need to align spectra to correct for mass-to-charge experimental variation is a problem that arises in mass spectrometry (MS). Most of the MS-based proteomic data analysis methods involve a two-step approach, identify peaks first and then do the alignment and statistical inference on these identified peaks only. However, the peak identification step relies on prior information on the proteins of interest or a peak detection model, which are subject to error. Also numerous additional features such as peak shape and peak width are lost in simple peak detection, and these are informative for correcting mass variation in the alignment step. Results: Here, we present a novel Bayesian approach to align the complete spectra. The approach is based on a parametric model which assumes that the spectrum and alignment function are Gaussian processes, but the alignment function is monotone. We show how to use the expectation-maximization algorithm to find the posterior mode of the set of alignment functions and the mean spectrum for a patient population. After alignment, we conduct tests while controlling for error attributable to multiple comparisons on the level of the peaks identified from the absolute mean spectra difference of two patient populations.

UR - http://www.scopus.com/inward/record.url?scp=75849128985&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=75849128985&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/btp582

DO - 10.1093/bioinformatics/btp582

M3 - Article

C2 - 19819887

AN - SCOPUS:75849128985

SN - 1367-4803

VL - 25

SP - 3213

EP - 3220

JO - Bioinformatics

JF - Bioinformatics

IS - 24

M1 - btp582

ER -

A Bayesian approach to the alignment of mass spectra

Abstract

Bibliographical note

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this