Open Access. Powered by Scholars. Published by Universities.®

Life Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 19 of 19

Full-Text Articles in Life Sciences

Statistical Methods For Proteomic Biomarker Discovery Based On Feature Extraction Or Functional Modeling Approaches, Jeffrey S. Morris Jan 2012

Statistical Methods For Proteomic Biomarker Discovery Based On Feature Extraction Or Functional Modeling Approaches, Jeffrey S. Morris

Jeffrey S. Morris

In recent years, developments in molecular biotechnology have led to the increased promise of detecting and validating biomarkers, or molecular markers that relate to various biological or medical outcomes. Proteomics, the direct study of proteins in biological samples, plays an important role in the biomarker discovery process. These technologies produce complex, high dimensional functional and image data that present many analytical challenges that must be addressed properly for effective comparative proteomics studies that can yield potential biomarkers. Specific challenges include experimental design, preprocessing, feature extraction, and statistical analysis accounting for the inherent multiple testing issues. This paper reviews various computational ...


Integrative Bayesian Analysis Of High-Dimensional Multi-Platform Genomics Data, Wenting Wang, Veerabhadran Baladandayuthapani, Jeffrey S. Morris, Bradley M. Broom, Ganiraju C. Manyam, Kim-Anh Do Jan 2012

Integrative Bayesian Analysis Of High-Dimensional Multi-Platform Genomics Data, Wenting Wang, Veerabhadran Baladandayuthapani, Jeffrey S. Morris, Bradley M. Broom, Ganiraju C. Manyam, Kim-Anh Do

Jeffrey S. Morris

Motivation: Analyzing data from multi-platform genomics experiments combined with patients’ clinical outcomes helps us understand the complex biological processes that characterize a disease, as well as how these processes relate to the development of the disease. Current integration approaches that treat the data are limited in that they do not consider the fundamental biological relationships that exist among the data from platforms.

Statistical Model: We propose an integrative Bayesian analysis of genomics data (iBAG) framework for identifying important genes/biomarkers that are associated with clinical outcome. This framework uses a hierarchical modeling technique to combine the data obtained from multiple ...


Boesenbergia Rotunda: From Ethnomedicine To Drug Discovery, Norzulaani Khalid Jan 2012

Boesenbergia Rotunda: From Ethnomedicine To Drug Discovery, Norzulaani Khalid

Norzulaani Khalid

Boesenbergia rotunda is a herb from the Boesenbergia genera under the Zingiberaceae family. B. rotunda is widely found in Asian countries where it is commonly used as a food ingredient and in ethnomedicinal preparations. The popularity of its ethnomedicinal usage has drawn the attention of scientists worldwide to further investigate its medicinal properties. Advancement in drug design and discovery research has led to the development of synthetic drugs from B. rotunda metabolites via bioinformatics and medicinal chemistry studies. Furthermore, with the advent of genomics, transcriptomics, proteomics, and metabolomics, new insights on the biosynthetic pathways of B. rotunda metabolites can be ...


Statistical Contributions To Proteomic Research, Jeffrey S. Morris, Keith A. Baggerly, Howard B. Gutstein, Kevin R. Coombes Jan 2010

Statistical Contributions To Proteomic Research, Jeffrey S. Morris, Keith A. Baggerly, Howard B. Gutstein, Kevin R. Coombes

Jeffrey S. Morris

Proteomic profiling has the potential to impact the diagnosis, prognosis, and treatment of various diseases. A number of different proteomic technologies are available that allow us to look at many proteins at once, and all of them yield complex data that raise significant quantitative challenges. Inadequate attention to these quantitative issues can prevent these studies from achieving their desired goals, and can even lead to invalid results. In this chapter, we describe various ways the involvement of statisticians or other quantitative scientists in the study team can contribute to the success of proteomic research, and we outline some of the ...


Informatics And Statistics For Analyzing 2-D Gel Electrophoresis Images, Andrew W. Dowsey, Jeffrey S. Morris, Howard G. Gutstein, Guang Z. Yang Jan 2010

Informatics And Statistics For Analyzing 2-D Gel Electrophoresis Images, Andrew W. Dowsey, Jeffrey S. Morris, Howard G. Gutstein, Guang Z. Yang

Jeffrey S. Morris

Whilst recent progress in ‘shotgun’ peptide separation by integrated liquid chromatography and mass spectrometry (LC/MS) has enabled its use as a sensitive analytical technique, proteome coverage and reproducibility is still limited and obtaining enough replicate runs for biomarker discovery is a challenge. For these reasons, recent research demonstrates the continuing need for protein separation by two-dimensional gel electrophoresis (2-DE). However, with traditional 2-DE informatics, the digitized images are reduced to symbolic data though spot detection and quantification before proteins are compared for differential expression by spot matching. Recently, a more robust and automated paradigm has emerged where gels are ...


Microproteomics: Analysis Of Protein Diversity In Small Samples, Howard B. Gutstein, Jeffrey S. Morris, Suresh P. Annangudi, Jonathan V. Sweedler Feb 2008

Microproteomics: Analysis Of Protein Diversity In Small Samples, Howard B. Gutstein, Jeffrey S. Morris, Suresh P. Annangudi, Jonathan V. Sweedler

Jeffrey S. Morris

Proteomics, the large-scale study of protein expression in organisms, offers the potential to evaluate global changes in protein expression and their post-translational modifications that take place in response to normal or pathological stimuli. One challenge has been the requirement for substantial amounts of tissue in order to perform comprehensive proteomic characterization. In heterogeneous tissues, such as brain, this has limited the application of proteomic methodologies. Efforts to adapt standard methods of tissue sampling, protein extraction, arraying, and identification are reviewed, with an emphasis on those appropriate to smaller samples ranging in size from several microliters down to single cells. The ...


Statistical Issues In Proteomic Research, Jeffrey S. Morris Dec 2007

Statistical Issues In Proteomic Research, Jeffrey S. Morris

Jeffrey S. Morris

No abstract provided.


Pre-Processing Mass Spectrometry Data, Kevin R. Coombes, Keith A. Baggerly, Jeffrey S. Morris Jan 2007

Pre-Processing Mass Spectrometry Data, Kevin R. Coombes, Keith A. Baggerly, Jeffrey S. Morris

Jeffrey S. Morris

No abstract provided.


Laser Capture Sampling And Analytical Issues In Proteomics, Howard Gutstein, Jeffrey S. Morris Jan 2007

Laser Capture Sampling And Analytical Issues In Proteomics, Howard Gutstein, Jeffrey S. Morris

Jeffrey S. Morris

Proteomics holds the promise of evaluating global changes in protein expression and post-translational modificaiton in response to environmental stimuli. However, difficulties in achieving cellular anatomic resolution and extracting specific types of proteins from cells have limited the efficacy of these techniques. Laser capture microdissection has provided a solution to the problem of anatomical resolution in tissues. New extraction methodologies have expanded the range of proteins identified in subsequent analyses. This review will examine the application of laser capture microdissection to proteomic tissue sampling, and subsequent extraction of these samples for differential expression analysis. Statistical and other quantitative issues important for ...


Prepms: Tof Ms Data Graphical Preprocessing Tool, Yuliya V. Karpievitch, Elizabeth G. Hill, Adam J. Smolka, Jeffrey S. Morris, Kevin R. Coombes, Keith A. Baggerly, Jonas S. Almeida Nov 2006

Prepms: Tof Ms Data Graphical Preprocessing Tool, Yuliya V. Karpievitch, Elizabeth G. Hill, Adam J. Smolka, Jeffrey S. Morris, Kevin R. Coombes, Keith A. Baggerly, Jonas S. Almeida

Jeffrey S. Morris

We introduce a simple-to-use graphical tool that enables researchers to easily prepare time-of-flight mass spectrometry data for analysis. For ease of use, the graphical executable provides default parameter settings experimentally determined to work well in most situations. These values can be changed by the user if desired. PrepMS is a stand-alone application made freely available (open source), and is under the General Public License (GPL). Its graphical user interface, default parameter settings, and display plots allow PrepMS to be used effectively for data preprocessing, peak detection, and visual data quality assessment.


An Introduction To High-Throughput Bioinformatics Data, Keith A. Baggerly, Kevin R. Coombes, Jeffrey S. Morris Mar 2006

An Introduction To High-Throughput Bioinformatics Data, Keith A. Baggerly, Kevin R. Coombes, Jeffrey S. Morris

Jeffrey S. Morris

High throughput biological assays supply thousands of measurements per sample, and the sheer amount of related data increases the need for better models to enhance inference. Such models, however, are more effective if they take into account the idiosyncracies associated with the specific methods of measurement: where the numbers come from. We illustrate this point by describing three different measurement platforms: microarrays, serial analysis of gene expression (SAGE), and proteomic mass spectrometry.


Bayesian Mixture Models For Gene Expression And Protein Profiles, Michele Guindani, Kim-Anh Do, Peter Mueller, Jeffrey S. Morris Mar 2006

Bayesian Mixture Models For Gene Expression And Protein Profiles, Michele Guindani, Kim-Anh Do, Peter Mueller, Jeffrey S. Morris

Jeffrey S. Morris

We review the use of semi-parametric mixture models for Bayesian inference in high throughput genomic data. We discuss three specific approaches for microarray data, for protein mass spectrometry experiments, and for SAGE data. For the microarray data and the protein mass spectrometry we assume group comparison experiments, i.e., experiments that seek to identify genes and proteins that are differentially expressed across two biologic conditions of interest. For the SAGE data example we consider inference for a single biologic sample.


Analysis Of Mass Spectrometry Data Using Bayesian Wavelet-Based Functional Mixed Models, Jeffrey S. Morris, Philip J. Brown, Keith A. Baggerly, Kevin R. Coombes Mar 2006

Analysis Of Mass Spectrometry Data Using Bayesian Wavelet-Based Functional Mixed Models, Jeffrey S. Morris, Philip J. Brown, Keith A. Baggerly, Kevin R. Coombes

Jeffrey S. Morris

In this chapter, we demonstrate how to analyze MALDI-TOF/SELDITOF mass spectrometry data using the wavelet-based functional mixed model introduced by Morris and Carroll (2006), which generalizes the linear mixed models to the case of functional data. This approach models each spectrum as a function, and is very general, accommodating a broad class of experimental designs and allowing one to model nonparametric functional effects for various factors, which can be conditions of interest (e.g. cancer/normal) or experimental factors (blocking factors). Inference on these functional effects allows us to identify protein peaks related to various outcomes of interest, including ...


Improved Peak Detection And Quantification Of Mass Spectrometry Data Acquired From Surface-Enhanced Laser Desorption And Ionization By Denoising Spectra With The Undecimated Discrete Wavelet Transform, Kevin R. Coombes, Spiros Tsavachidis, Jeffrey S. Morris, Keith A. Baggerly, Henry M. Kuerer Dec 2005

Improved Peak Detection And Quantification Of Mass Spectrometry Data Acquired From Surface-Enhanced Laser Desorption And Ionization By Denoising Spectra With The Undecimated Discrete Wavelet Transform, Kevin R. Coombes, Spiros Tsavachidis, Jeffrey S. Morris, Keith A. Baggerly, Henry M. Kuerer

Jeffrey S. Morris

Background: Mass spectrometry, especially surface enhanced laser desorption and ionization (SELDI) is increasingly being used to find disease-related proteomic patterns in complex mixtures of proteins derived from tissue samples or from easily obtained biological fluids such as serum, urine, or nipple aspirate fluid. Questions have been raised about the reproducibility and reliability of peak quantifications using this technology. For example, Yasui and colleagues opted to replace continuous measures of the size of a peak by a simple binary indicator of its presence or absence in their analysis of a set of spectra from prostate cancer patients.

Methods: We collected nipple ...


Serum Proteomics Profiling: A Young Technology Begins To Mature, Kevin R. Coombes, Jeffrey S. Morris, Jianhua Hu, Sarah R. Edmondson, Keith A. Baggerly Mar 2005

Serum Proteomics Profiling: A Young Technology Begins To Mature, Kevin R. Coombes, Jeffrey S. Morris, Jianhua Hu, Sarah R. Edmondson, Keith A. Baggerly

Jeffrey S. Morris

No abstract provided.


Signal In Noise: Evaluating Reported Reproducibility Of Serum Proteomic Tests For Ovarian Cancer, Keith A. Baggerly, Jeffrey S. Morris, Sarah R. Edmonson, Kevin R. Coombes Feb 2005

Signal In Noise: Evaluating Reported Reproducibility Of Serum Proteomic Tests For Ovarian Cancer, Keith A. Baggerly, Jeffrey S. Morris, Sarah R. Edmonson, Kevin R. Coombes

Jeffrey S. Morris

Proteomic profi ling of serum initially appeared to be dramatically effective for diagnosis of early-stage ovarian cancer, but these results have proven diffi cult to reproduce. A recent publication reported good classifi cation in one dataset using results from training on a much earlier dataset, but the authors have since reported that they did not perform the analysis as described. We examined the reproducibility of the proteomic patterns across datasets in more detail. Our analysis reveals that the pattern that enabled successful classifi cation is biologically implausible and that the method, properly applied, does not classify the data accurately. We ...


High-Resolution Serum Proteomic Patterns For Ovarian Cancer Detection, Keith A. Baggerly, Sarah R. Edmonson, Jeffrey S. Morris, Kevin R. Coombes Nov 2004

High-Resolution Serum Proteomic Patterns For Ovarian Cancer Detection, Keith A. Baggerly, Sarah R. Edmonson, Jeffrey S. Morris, Kevin R. Coombes

Jeffrey S. Morris

No abstract provided.


Quality Control And Peak Finding For Proteomics Data Collected From Nipple Aspirate Fluid Using Surface Enhanced Laser Desorption And Ionization., Jeffrey S. Morris, Kevin R. Coombes, Herbert A. Fritsche, Charlotte Clarke, Jeng-Neng Chen, Keith A. Baggerly, Lian-Chun Xiao, Mien-Chie Hung, Henry M. Kuerer Oct 2003

Quality Control And Peak Finding For Proteomics Data Collected From Nipple Aspirate Fluid Using Surface Enhanced Laser Desorption And Ionization., Jeffrey S. Morris, Kevin R. Coombes, Herbert A. Fritsche, Charlotte Clarke, Jeng-Neng Chen, Keith A. Baggerly, Lian-Chun Xiao, Mien-Chie Hung, Henry M. Kuerer

Jeffrey S. Morris

Background: Recently, researchers have been using mass spectroscopy to study cancer. For use of proteomics spectra in a clinical setting, stringent quality-control procedures will be needed.

Methods: We pooled samples of nipple aspirate fluid from healthy breasts and breasts with cancer to prepare a control sample. Aliquots of the control sample were used on two spots on each of three IMAC ProteinChip® arrays (Ciphergen Biosystems, Inc.) on 4 successive days to generate 24 SELDI spectra. In 36 subsequent experiments, the control sample was applied to two spots of each ProteinChip array, and the resulting spectra were analyzed to determine how ...


A Comprehensive Approach To The Analysis Of Maldi-Tof Proteomics Spectra From Serum Samples., Keith A. Baggerly, Jeffrey S. Morris, Jing Wang, David Gold, Lian-Chun Xiao, Kevin R. Coombes Jun 2003

A Comprehensive Approach To The Analysis Of Maldi-Tof Proteomics Spectra From Serum Samples., Keith A. Baggerly, Jeffrey S. Morris, Jing Wang, David Gold, Lian-Chun Xiao, Kevin R. Coombes

Jeffrey S. Morris

For our analysis of the data from the First Annual Proteomics Data Mining Conference, we attempted to discriminate between 24 disease spectra (group A) and 17 normal spectra (group B). First, we processed the raw spectra by (i) correcting for additive sinusoidal noise (periodic on the time scale) affecting most spectra, (ii) correcting for the overall baseline level, (iii) normalizing, (iv) recombining fractions, and (v) using variable- width windows for data reduction. Also, we identified a set of polymeric peaks (at multiples of 180.6 Da) that is present in several normal spectra (B1–B8). After data processing, we found ...