Open Access. Powered by Scholars. Published by Universities.®

Life Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 31 - 39 of 39

Full-Text Articles in Life Sciences

Development Of Computations In Bioscience And Bioinformatics And Its Application: Review Of The Symposium Of Computations In Bioinformatics And Bioscience (Scbb06), Youping Deng, Jun Ni, Chaoyang Zhang Dec 2006

Development Of Computations In Bioscience And Bioinformatics And Its Application: Review Of The Symposium Of Computations In Bioinformatics And Bioscience (Scbb06), Youping Deng, Jun Ni, Chaoyang Zhang

Faculty Publications

The first symposium of computations in bioinformatics and bioscience (SCBB06) was held in Hangzhou, China on June 21-22, 2006. Twenty-six peer-reviewed papers were selected for publication in this special issue of BMC Bioinformatics. These papers cover a broad range of topics including bioinformatics theories, algorithms, applications and tool development. The main technical topics contain gene expression analysis, sequence analysis, genome analysis, phylogenetic analysis, gene function prediction, molecular interaction and system biology, genetics and population study, immune strategy, protein structure prediction and proteomics.


Svm Classifier: A Comprehensive Java Interface For Support Vector Machine Classification Of Microarray Data, Mehdi Pirooznia, Youping Deng Dec 2006

Svm Classifier: A Comprehensive Java Interface For Support Vector Machine Classification Of Microarray Data, Mehdi Pirooznia, Youping Deng

Faculty Publications

Motivation

Graphical user interface (GUI) software promotes novelty by allowing users to extend the functionality. SVM Classifier is a cross-platform graphical application that handles very large datasets well. The purpose of this study is to create a GUI application that allows SVM users to perform SVM training, classification and prediction.

Results

The GUI provides user-friendly access to state-of-the-art SVM methods embodied in the LIBSVM implementation of Support Vector Machine. We implemented the java interface using standard swing libraries.

We used a sample data from a breast cancer study for testing classification accuracy. We achieved 100% accuracy in classification among the ...


A Fourier Transformation Based Method To Mine Peptide Space For Antimicrobial Activity, Vijayaraj Nagarajan, Navodit Kaushik, Beddhu Murali, Chaoyang Zhang, Sanyogita Lakhera, Mohamed O. Elasri, Youping Deng Sep 2006

A Fourier Transformation Based Method To Mine Peptide Space For Antimicrobial Activity, Vijayaraj Nagarajan, Navodit Kaushik, Beddhu Murali, Chaoyang Zhang, Sanyogita Lakhera, Mohamed O. Elasri, Youping Deng

Faculty Publications

Background

Naturally occurring antimicrobial peptides are currently being explored as potential candidate peptide drugs. Since antimicrobial peptides are part of the innate immune system of every living organism, it is possible to discover new candidate peptides using the available genomic and proteomic data. High throughput computational techniques could also be used to virtually scan the entire peptide space for discovering out new candidate antimicrobial peptides.

Result

We have identified a unique indexing method based on biologically distinct characteristic features of known antimicrobial peptides. Analysis of the entries in the antimicrobial peptide databases, based on our indexing method, using Fourier transformation ...


Emd: An Ensemble Algorithm For Discovering Regulatory Motifs In Dna Sequences, Jianjun Hu, Y. D. Yang, D. Kihara Jan 2006

Emd: An Ensemble Algorithm For Discovering Regulatory Motifs In Dna Sequences, Jianjun Hu, Y. D. Yang, D. Kihara

Faculty Publications

Background

Understanding gene regulatory networks has become one of the central research problems in bioinformatics. More than thirty algorithms have been proposed to identify DNA regulatory sites during the past thirty years. However, the prediction accuracy of these algorithms is still quite low. Ensemble algorithms have emerged as an effective strategy in bioinformatics for improving the prediction accuracy by exploiting the synergetic prediction capability of multiple algorithms.

Results

We proposed a novel clustering-based ensemble algorithm named EMD for de novo motif discovery by combining multiple predictions from multiple runs of one or more base component algorithms. The ensemble approach is ...


Integrative Missing Value Estimation For Microarray Data, Jianjun Hu, H. Li, M. S. Waterman, X. J. Zhou Jan 2006

Integrative Missing Value Estimation For Microarray Data, Jianjun Hu, H. Li, M. S. Waterman, X. J. Zhou

Faculty Publications

Background

Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in Stanford Microarray Database contain less than eight samples.

Results

We present the integrative Missing Value Estimation method (iMISS) by incorporating information from multiple reference microarray datasets to improve missing value estimation. For each gene with missing data, we derive a consistent neighbor-gene list by taking reference data sets ...


Identification Of New Members Of Hydrophobin Family Using Primary Structure Analysis, Kuan Yang, Youping Deng, Chaoyang Zhang, Mohamed O. Elasri Jan 2006

Identification Of New Members Of Hydrophobin Family Using Primary Structure Analysis, Kuan Yang, Youping Deng, Chaoyang Zhang, Mohamed O. Elasri

Faculty Publications

Background

Hydrophobins are fungal proteins that can turn into amphipathic membranes at hydrophilic/hydrophobic interfaces by self-assembly. The assemblages by Class I hydrophobins are extremely stable and possess the remarkable ability to change the polarity of the surface. One of its most important industrial applications is its usage as paint. Without detailed knowledge of the 3D structure and self-assembly principles of hydrophobins, it is difficult to make significant progress in furthering its research.

Results

In order to provide useful information to hydrophobin researchers, we analyzed primary structure of hydrophobins to gain more insight about these proteins. In this paper, we ...


Parallelization Of Multicategory Support Vector Machines (Pmc- Svm) For Classifying Microarray Data, Chaoyang Zhang, Peng Li, Arun Rajendran, Youping Deng, Dequan Chen Jan 2006

Parallelization Of Multicategory Support Vector Machines (Pmc- Svm) For Classifying Microarray Data, Chaoyang Zhang, Peng Li, Arun Rajendran, Youping Deng, Dequan Chen

Faculty Publications

Background: Multicategory Support Vector Machines (MC-SVM) are powerful classification systems with excellent performance in a variety of data classification problems. Since the process of generating models in traditional multicategory support vector machines for large datasets is very computationally intensive, there is a need to improve the performance using high performance computing techniques.

Results: In this paper, Parallel Multicategory Support Vector Machines (PMC-SVM) have been developed based on the sequential minimum optimization-type decomposition method for support vector machines (SMO-SVM). It was implemented in parallel using MPI and C++ libraries and executed on both shared memory supercomputer and Linux cluster for multicategory ...


Limitations And Potentials Of Current Motif Discovery Algorithms, Jianjun Hu, Bin Li, D. Kihara Jan 2005

Limitations And Potentials Of Current Motif Discovery Algorithms, Jianjun Hu, Bin Li, D. Kihara

Faculty Publications

Computational methods for de novo identification of gene regulation elements, such as transcription factor binding sites, have proved to be useful for deciphering genetic regulatory networks. However, despite the availability of a large number of algorithms, their strengths and weaknesses are not sufficiently understood. Here, we designed a comprehensive set of performance measures and benchmarked five modern sequence-based motif discovery algorithms using large datasets generated from Escherichia coli RegulonDB. Factors that affect the prediction accuracy, scalability and reliability are characterized. It is revealed that the nucleotide and the binding site level accuracy are very low, while the motif level accuracy ...


Incremental Genetic K-Means Algorithm And Its Application In Gene Expression Data Analysis, Yi Lu, Shiyong Lu, Farhad Fotouhi, Youping Deng, Susan J. Brown Oct 2004

Incremental Genetic K-Means Algorithm And Its Application In Gene Expression Data Analysis, Yi Lu, Shiyong Lu, Farhad Fotouhi, Youping Deng, Susan J. Brown

Faculty Publications

Background

In recent years, clustering algorithms have been effectively applied in molecular biology for gene expression data analysis. With the help of clustering algorithms such as K-means, hierarchical clustering, SOM, etc, genes are partitioned into groups based on the similarity between their expression profiles. In this way, functionally related genes are identified. As the amount of laboratory data in molecular biology grows exponentially each year due to advanced technologies such as Microarray, new efficient and effective methods for clustering must be developed to process this growing amount of biological data.

Results

In this paper, we propose a new clustering algorithm ...