Open Access. Powered by Scholars. Published by Universities.®

Life Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

2019

Bioinformatics

Discipline
Institution
Publication
Publication Type

Articles 1 - 25 of 25

Full-Text Articles in Life Sciences

Molecular Phylogeny Implemented In An Introductory Plant Classification Course, Chao Cai, Jo Ann Banks Oct 2019

Molecular Phylogeny Implemented In An Introductory Plant Classification Course, Chao Cai, Jo Ann Banks

Libraries Faculty and Staff Presentations

Plant classification is one of the core components in undergraduate programs related to plant sciences. Traditionally plant classification courses primarily introduce morphology-based taxonomy because of practical needs in the field. However, the publication of new plant classification systems by Angiosperm Phylogeny Group (APG) using molecular phylogeny methods leads to the trends of using molecular evidence (DNA barcode) for plant identification. In our introductory plant classification course, we included a two-week module (lectures and labs) to introduce key concepts and fundamental skills in molecular phylogeny. Week 1 included concepts of evolutionary tree thinking, data mining in NCBI using BLAST search, and ...


9th Annual Postdoctoral Science Symposium, University Of Texas Md Anderson Cancer Center Postdoctoral Association Sep 2019

9th Annual Postdoctoral Science Symposium, University Of Texas Md Anderson Cancer Center Postdoctoral Association

MD Anderson Cancer Center Postdoctoral Association Annual Postdoctoral Science Symposium Abstracts

The mission of the Annual Postdoctoral Science Symposium (APSS) is to provide a platform for talented postdoctoral fellows throughout the Texas Medical Center to present their work to a wider audience. The MD Anderson Postdoctoral Association convened its inaugural Annual Postdoctoral Science Symposium (APSS) on August 4, 2011.

The APSS provides a professional venue for postdoctoral scientists to develop, clarify, and refine their research as a result of formal reviews and critiques of faculty and other postdoctoral scientists. Additionally, attendees discuss current research on a broad range of subjects while promoting academic interactions and enrichment and developing new collaborations.


Formulation Of Hybrid Knowledge-Based/Molecular Mechanics Potentials For Protein Structure Refinement And A Novel Graph Theoretical Protein Structure Comparison And Analysis Technique, Aaron Maus Aug 2019

Formulation Of Hybrid Knowledge-Based/Molecular Mechanics Potentials For Protein Structure Refinement And A Novel Graph Theoretical Protein Structure Comparison And Analysis Technique, Aaron Maus

University of New Orleans Theses and Dissertations

Proteins are the fundamental machinery that enables the functions of life. It is critical to understand them not just for basic biology, but also to enable medical advances. The field of protein structure prediction is concerned with developing computational techniques to predict protein structure and function from a protein’s amino acid sequence, encoded for directly in DNA, alone. Despite much progress since the first computational models in the late 1960’s, techniques for the prediction of protein structure still cannot reliably produce structures of high enough accuracy to enable desired applications such as rational drug design. Protein structure refinement ...


Identification And Characterization Of Preferred Dna-Binding Sites For The Thermus Thermophilus Hb8 Transcriptional Regulator Ttha0973, James Shell Cox, Kristi Moncja, Mykala Mckinnes, Michael W. Van Dyke Jul 2019

Identification And Characterization Of Preferred Dna-Binding Sites For The Thermus Thermophilus Hb8 Transcriptional Regulator Ttha0973, James Shell Cox, Kristi Moncja, Mykala Mckinnes, Michael W. Van Dyke

Faculty Publications

Advances in genomic sequencing have allowed the identification of a multitude of genes encoding putative transcriptional regulatory proteins. Lacking, often, is a fuller understanding of the biological roles played by these proteins, the genes they regulate or regulon. Conventionally this is achieved through a genetic approach involving putative transcription factor gene manipulation and observations of changes in an organism’s transcriptome. However, such an approach is not always feasible or can yield misleading findings. Here, we describe a biochemistry-centric approach, involving identification of preferred DNA-binding sequences for the Thermus thermophilus HB8 transcriptional repressor TTHA0973 using the selection method Restriction Endonuclease ...


Wavelet Analysis Of Short Globular Homologous Proteins In Mesophile And Thermophile Prokaryotes, John B. Linehan Jun 2019

Wavelet Analysis Of Short Globular Homologous Proteins In Mesophile And Thermophile Prokaryotes, John B. Linehan

College of Science and Health Theses and Dissertations

This study looked to identify features related to thermal stability and function in the amino acid chains of short globular proteins from mesophile and thermophile species, within the constraint that the protein fold to perform a speci_c function. To do so 540 homologous pairs of proteins were studied. The amino acid chains were con-verted to hydrophobicity signals by assigning a hydropathy score to each residue in the polypeptide. The hydrophobicity signals were passed through a wavelet packet transform and the resulting spectra analyzed. Bootstrapping was used to gener-ate a control data set to determine if the true ordering of amino ...


Designing Computational Biology Workflows With Perl - Part 1, Esma Yildirim May 2019

Designing Computational Biology Workflows With Perl - Part 1, Esma Yildirim

Open Educational Resources

This material introduces Linux File System structures and demonstrates how to use commands to communicate with the operating system through a Terminal program. Basic program structures and system() function of Perl are discussed. A brief introduction to gene-sequencing terminology and file formats are given.


Designing Computational Biology Workflows With Perl - Part 2, Esma Yildirim May 2019

Designing Computational Biology Workflows With Perl - Part 2, Esma Yildirim

Open Educational Resources

This material introduces the AWS console interface, describes how to create an instance on AWS with the VMI provided and connect to that machine instance using the SSH protocol. Once connected, it requires the students to write a script to automate the tasks to create VCF files from two different sample genomes belonging to E.coli microorganisms by using the FASTA and FASTQ files in the input folder of the virtual machine. The same exercise can be applied if the VMI is installed on a local machine using virtualization software (e.g. Oracle VirtualBox). In this case, the Terminal program ...


Designing Computational Biology Workflows With Perl - Part 1 & 2, Esma Yildirim May 2019

Designing Computational Biology Workflows With Perl - Part 1 & 2, Esma Yildirim

Open Educational Resources

This manual guides the instructor to combine the partial files of the virtual machine image and construct sequencer.ova file. It is accompanied by the partial files of the virtual machine image.


Computational Genomic Models For Spatio-Temporal Investigation Of Early Lung Cancer Pathology, Smruthy Sivakumar May 2019

Computational Genomic Models For Spatio-Temporal Investigation Of Early Lung Cancer Pathology, Smruthy Sivakumar

UT GSBS Dissertations and Theses (Open Access)

Lung cancer, of which non-small cell lung cancer (NSCLC) is the most common form, is the second most prevalent cancer and the leading cause of cancer-related deaths. NSCLCs primarily comprise adenocarcinomas (LUAD) and squamous cell carcinomas (LUSC). Advances in early detection and prevention have been limited by the lack of early-stage biomarkers and targets. A comprehensive molecular characterization of premalignant lesions and tumor-adjacent normal tissue can aid in better understanding NSCLC pathogenesis. However, these investigations are further challenged by limited tissue availability and low cellular fractions of detectable somatic mutations.

Therefore, there is a dearth of knowledge about the pathogenesis ...


Designing Computational Biology Workflows With Perl - Part 2, Esma Yildirim May 2019

Designing Computational Biology Workflows With Perl - Part 2, Esma Yildirim

Open Educational Resources

This material briefly reintroduces the DNA double Helix structure, explains SNP and INDEL mutations in genes and describes FASTA, FASTQ, BAM and VCF file formats. It also explains the index creation, alignment, sorting, marking duplicates and variant calling steps of a simple preprocessing workflow and how to write a Perl script to automate the execution of these steps on a Virtual Machine Image.


Designing Computational Biology Workflows With Perl - Part 1, Esma Yildirim May 2019

Designing Computational Biology Workflows With Perl - Part 1, Esma Yildirim

Open Educational Resources

This material introduces the AWS console interface, describes how to create an instance on AWS with the VMI provided, connect to that machine instance using the SSH protocol. Once connected, it requires the students to write a script to enter the data folder, which includes gene-sequencing input files and print the first five line of each file remotely. The same exercise can be applied if the VMI is installed on a local machine using virtualization software (e.g. Oracle VirtualBox). In this case, the Terminal program of the VMI can be used to do the exercise.


Mrub_3018 Is Orthologous To E. Coli B2759 (Casb), Kyle Parker, Dr. Lori Scott Feb 2019

Mrub_3018 Is Orthologous To E. Coli B2759 (Casb), Kyle Parker, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We studied the biological activity of the Mrub_3018 gene, which we hypothesize is orthologous to E. coli gene B2759. We predicted that Mrub_3018(DNA coordinates 3057916… 3058524) encodes the protein CasB. CasB is a protein in the CRISPR CASCADE that will function as a structural protein. When the rest of the proteins form an “S” formation CasB will connect the front and back of the “S” creating a back bone for the structure. It will help bind ...


Mrub_3019 Casa Gene Is An Ortholog To E. Coli B2760, Kelsey Heiland, Dr. Lori Scott Feb 2019

Mrub_3019 Casa Gene Is An Ortholog To E. Coli B2760, Kelsey Heiland, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This research is part of the Meiothermus ruber genome annotation project which aims to predict gene function with various bioinformatics tools. We investigated the function of Mrub_3019, which encodes the CasA protein involved in the multi-subunit effector complex for the CRISPR-Cas immunity system and predicted it to be an ortholog of E. coli K12 MG1655 b2760 (casA). We predicted that Mrub_3019 encodes the protein CasA, which is involved in PAM recognition of CRISPR interference pathway. Foreign DNA will bind to CasA, which signals Cas3 for helicase-mediated DNA degradation. Our hypothesis is supported by low E-values for pairwise alignment in NCBI ...


Mrub_3015 Is Orthologous To The B2757 Gene Found In Escherichia Coli Coding For Casd, Ramona Collins, Dr. Lori Scott Feb 2019

Mrub_3015 Is Orthologous To The B2757 Gene Found In Escherichia Coli Coding For Casd, Ramona Collins, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We investigated the biological function of the gene Mrub_3015, which we hypothesize is a component of the CRISPR-Cas prokaryotic defense system. We predict that Mrub_3015 (DNA coordinates 3055550...3056245) encodes the the CRISPR-associated protein cas5, which is integral in maintaining the crRNA-DNA structure, keeping the complex from base pairing with the target phage DNA. Our hypothesis is supported by identical hits for Mrub_3015 and b2527 to the KEGG, Pfam, TIGRfam, CDD and PDB databases as well as ...


Genome-Wide Characterization, Molecular Evolution And Mexpression Profiling Of The Metacaspases In Potato (Solanum Tuberosum L.), Nehal Dubey, Maitri Trivedi, Suresh Varsani, Vishal Vyas, Manisha Farsodia, Sunil Kumar Singh Jan 2019

Genome-Wide Characterization, Molecular Evolution And Mexpression Profiling Of The Metacaspases In Potato (Solanum Tuberosum L.), Nehal Dubey, Maitri Trivedi, Suresh Varsani, Vishal Vyas, Manisha Farsodia, Sunil Kumar Singh

Faculty Publications: Department of Entomology

Metacaspases are distant relatives of animal caspases found in plants, protozoa and fungi. Some recent studies have demonstrated that metacaspases are involved in regulating the developmental and environmentally induced programmed cell death in plants. In this study, we identified metacaspase gene family in potato (Solanum tuberosum L.) and analyzed their expression pattern in various developmental tissues and stress responses of plants. There were eight metacaspase genes identified in the Peptidase (Cysteine protease) C14 family and based upon sequence alignment and phylogenetic analysis, a systematic nomenclature of potato metacaspases (SotubMCs) has been proposed. Three of the eight candidate genes showing homology ...


Effects Of Temperature On Crispr/Cas System, Eddie Beckom, Dr. Lori Scott Jan 2019

Effects Of Temperature On Crispr/Cas System, Eddie Beckom, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We investigated the effect of temperature on the complexity of CRISPR/Cas systems in bacterial organisms across temperature classifications. We predict that temperature extremes would result in CRISPR/Cas systems with multiple operons, repeating cas genes, and complex systems. CRISPR/Cas systems can be classified into three types with a number of subtypes based on the CRISPR-associated genes, cas genes, present in a given organism. Our hypothesis is supported by the presence of multiple operons in thermophilic ...


Mrub_3014 Is Orthologous To B2756, Samir Abdelkarim, Dr. Lori Scott Jan 2019

Mrub_3014 Is Orthologous To B2756, Samir Abdelkarim, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We investigated the biological function of the gene Mrub_3014, which we hypothesize is a component of the CRISPR-Cas prokaryotic defense system. We predict that Mrub_3014 (DNA coordinates 3054943..3055575) encodes CRISPR-associated protein Cse3/case which function as an endonuclease. Our hypothesis is supported by identical hits for Mrub_3014 and b2756 to the KEGG, Pfam, TIGRfam, CDD and PDB databases, as well as a low E-value for a pairwise NCBI BLAST comparison. Both protein products are predicted to ...


M. Ruber Mrub_3013 Is Orthologous To E. Coli B2755, Laura Butcher, Dr. Lori Scott Jan 2019

M. Ruber Mrub_3013 Is Orthologous To E. Coli B2755, Laura Butcher, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We investigated the biological function of gene Mrub_3013, which we hypothesize is orthologous to b2755 in E. coli K12 MG1655 (a.k.a. Cas1). We investigated the biological function of a gene with the M. ruber locus tag of Mrub_3013, which we hypothesize is a component of the CRISPR-Cas prokaryotic defense system in M. ruber. We predict that Mrub_3013 (DNA coordinates 3,053,978-3,054,940) encodes the protein Cas1 which as part of the CRISPR-Cas system ...


An Investigation Into The Relationship Between Mrub_3013, Mrub_1477, And Mrub_0224: Are They Paralogs?, Melette Devore, Dr. Lori Scott Jan 2019

An Investigation Into The Relationship Between Mrub_3013, Mrub_1477, And Mrub_0224: Are They Paralogs?, Melette Devore, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We investigated the biological function of mrub_3013 and the nature of its relationship with mrub_1477 and mrub_0224. We hypothesized that mrub_3013 is orthologous to b2755 in E. coli K12 MG1655 (a.k.a. cas1). We predict that mrub_3013 encodes the enzyme Cas1, which is involved in spacer acquisition in the CRISPR-Cas prokaryotic defense system. Our hypothesis is supported by identical hits for b2755, mrub_3013, mrub_1477, and mrub_0224 from the CDD and Pfam databases and highly similar hits ...


Mrub_3020, A Paralog Of Mrub_1489, Is Orthologous To E. Coli Casc (Locus Tag B2761), Alfred Dei-Ampeh, Dr. Lori Scott Jan 2019

Mrub_3020, A Paralog Of Mrub_1489, Is Orthologous To E. Coli Casc (Locus Tag B2761), Alfred Dei-Ampeh, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We investigated the biological functions of two genes: mrub_3020 and mrub_1489. We make two hypotheses in this investigation: a) mrub_3020 is orthologous to the gene b2761 in E. coli K12 MG1655 (a.k.a. casC); b) mrub_1489 is a paralog of mrub_3020. We also predict that the two genes encode unique proteins: mrub_3020 with DNA coordinates 3060491…3063190 encodes a CRISPR – associated helicase (Cas3) that supports the Cascade complex of the CRISPR – Cas adaptive immune system by ...


Evaluating Predixcan’S Ability To Predict Differential Expression Between Alcoholics And Non-Alcoholics, John E. Drake Jr Jan 2019

Evaluating Predixcan’S Ability To Predict Differential Expression Between Alcoholics And Non-Alcoholics, John E. Drake Jr

Theses and Dissertations

PrediXcan is a recent software for the imputation of gene expression from genotype data alone. Using an overlapping set of transcriptome datasets from postmortem brain tissues of donors with alcohol use disorder and neurotypical controls, which were generated by two different platforms (e.g., Arraystar and Affymetrix), and an additional unrelated transcriptome dataset from lung tissue, we sought to evaluate PrediXcan’s ability to impute gene expression and identify differentially expressed genes. From the Arraystar platform, 1.3% of matched genes between the measured and imputed expression had a Pearson correlation ≥ 0.5. Our attempt to replicate this finding using ...


The N-Glycan Structures Of The Antigenic Variants Of Chlorovirus Pbcv-1 Major Capsid Protein Help To Identify The Virus-Encoded Glycosyltransferases, Immacolata Speciale, Garry A. Duncan, Luca Unione, Irina Agarkova, Domenico Garozzo, Jesus Jimenez-Barbero, Sicheng Lin, Todd L. Lowary, Antonio Molinaro, Eric Noel, Maria Elena Laugieri, Michela Tonetti, James L. Van Etten, Cristina De Castro Jan 2019

The N-Glycan Structures Of The Antigenic Variants Of Chlorovirus Pbcv-1 Major Capsid Protein Help To Identify The Virus-Encoded Glycosyltransferases, Immacolata Speciale, Garry A. Duncan, Luca Unione, Irina Agarkova, Domenico Garozzo, Jesus Jimenez-Barbero, Sicheng Lin, Todd L. Lowary, Antonio Molinaro, Eric Noel, Maria Elena Laugieri, Michela Tonetti, James L. Van Etten, Cristina De Castro

James Van Etten Publications

The chlorovirus Paramecium bursaria chlorella virus 1 (PBCV-1) is a large dsDNA virus that infects the microalga Chlorella variabilis NC64A. Unlike most other viruses, PBCV-1 encodes most, if not all, of the machinery required to glycosylate its major capsid protein (MCP). The structures of the four N-linked glycans from the PBCV-1 MCP consist of nonasaccharides, and similar glycans are not found elsewhere in the three domains of life. Here, we identified the roles of three virus-encoded glycosyltransferases (GTs) that have four distinct GT activities in glycan synthesis. Two of the three GTs were previously annotated as GTs but the third ...


Exploring Strategies To Integrate Disparate Bioinformatics Datasets, Charbel Bader Fakhry Jan 2019

Exploring Strategies To Integrate Disparate Bioinformatics Datasets, Charbel Bader Fakhry

Walden Dissertations and Doctoral Studies

Distinct bioinformatics datasets make it challenging for bioinformatics specialists to locate the required datasets and unify their format for result extraction. The purpose of this single case study was to explore strategies to integrate distinct bioinformatics datasets. The technology acceptance model was used as the conceptual framework to understand the perceived usefulness and ease of use of integrating bioinformatics datasets. The population of this study included bioinformatics specialists of a research institution in Lebanon that has strategies to integrate distinct bioinformatics datasets. The data collection process included interviews with 6 bioinformatics specialists and reviewing 27 organizational documents relating to integrating ...


Algorithms For Synteny-Based Phylostratigraphy And Gene Origin Classification, Zebulun Arendsee Jan 2019

Algorithms For Synteny-Based Phylostratigraphy And Gene Origin Classification, Zebulun Arendsee

Graduate Theses and Dissertations

With every newly sequenced species we discover hundreds of novel protein coding genes. Many of these "orphan" genes have been experimentally proven to have dramatic functions in development, sexual dimorphism, pathogen resistance, and social traits like symbiosis. Whereas in the past, researchers viewed genes as the product of continuous variation acting on ancient material, we now know that novel genes may arise de novo from non-genic sequence. Thus evolutionary experimentation is not limited to tweaking existing genes or their regulatory patterns. Any orphan genes that arose in the distant past, should appear today as lineage-specific genes (or gene families). The ...


Saccharomyces Genome Database & Uniprot Bioinformatics Analysis, Ray A. Enke Dec 2018

Saccharomyces Genome Database & Uniprot Bioinformatics Analysis, Ray A. Enke

Ray Enke Ph.D.

This in class activity introduces basic bioinformatics analysis using the Saccharomyces Genome Database (SGD) and the UniProt Database. The yeast URA3 gene is studied in this activity, however, any other yeast gene can be substituted. This activity is designed for novice instructors and students for implementation into core biology lecture or lab courses.