Computational Methods for the Analysis of Genomic Data and Biological Processes

Computational Methods for the Analysis of Genomic Data and Biological Processes
Author: Francisco A. Gómez Vela
Publisher: MDPI
Total Pages: 222
Release: 2021-02-05
Genre: Medical
ISBN: 3039437712

In recent decades, new technologies have made remarkable progress in helping to understand biological systems. Rapid advances in genomic profiling techniques such as microarrays or high-performance sequencing have brought new opportunities and challenges in the fields of computational biology and bioinformatics. Such genetic sequencing techniques allow large amounts of data to be produced, whose analysis and cross-integration could provide a complete view of organisms. As a result, it is necessary to develop new techniques and algorithms that carry out an analysis of these data with reliability and efficiency. This Special Issue collected the latest advances in the field of computational methods for the analysis of gene expression data, and, in particular, the modeling of biological processes. Here we present eleven works selected to be published in this Special Issue due to their interest, quality, and originality.

Computational Methods for the Analysis of Genomic Data and Biological Processes

Computational Methods for the Analysis of Genomic Data and Biological Processes
Author: Francisco A. Gómez Vela
Publisher:
Total Pages: 222
Release: 2021
Genre:
ISBN: 9783039437726

In recent decades, new technologies have made remarkable progress in helping to understand biological systems. Rapid advances in genomic profiling techniques such as microarrays or high-performance sequencing have brought new opportunities and challenges in the fields of computational biology and bioinformatics. Such genetic sequencing techniques allow large amounts of data to be produced, whose analysis and cross-integration could provide a complete view of organisms. As a result, it is necessary to develop new techniques and algorithms that carry out an analysis of these data with reliability and efficiency. This Special Issue collected the latest advances in the field of computational methods for the analysis of gene expression data, and, in particular, the modeling of biological processes. Here we present eleven works selected to be published in this Special Issue due to their interest, quality, and originality.

Computational Genomics with R

Computational Genomics with R
Author: Altuna Akalin
Publisher: CRC Press
Total Pages: 462
Release: 2020-12-16
Genre: Mathematics
ISBN: 1498781861

Computational Genomics with R provides a starting point for beginners in genomic data analysis and also guides more advanced practitioners to sophisticated data analysis techniques in genomics. The book covers topics from R programming, to machine learning and statistics, to the latest genomic data analysis techniques. The text provides accessible information and explanations, always with the genomics context in the background. This also contains practical and well-documented examples in R so readers can analyze their data by simply reusing the code presented. As the field of computational genomics is interdisciplinary, it requires different starting points for people with different backgrounds. For example, a biologist might skip sections on basic genome biology and start with R programming, whereas a computer scientist might want to start with genome biology. After reading: You will have the basics of R and be able to dive right into specialized uses of R for computational genomics such as using Bioconductor packages. You will be familiar with statistics, supervised and unsupervised learning techniques that are important in data modeling, and exploratory analysis of high-dimensional data. You will understand genomic intervals and operations on them that are used for tasks such as aligned read counting and genomic feature annotation. You will know the basics of processing and quality checking high-throughput sequencing data. You will be able to do sequence analysis, such as calculating GC content for parts of a genome or finding transcription factor binding sites. You will know about visualization techniques used in genomics, such as heatmaps, meta-gene plots, and genomic track visualization. You will be familiar with analysis of different high-throughput sequencing data sets, such as RNA-seq, ChIP-seq, and BS-seq. You will know basic techniques for integrating and interpreting multi-omics datasets. Altuna Akalin is a group leader and head of the Bioinformatics and Omics Data Science Platform at the Berlin Institute of Medical Systems Biology, Max Delbrück Center, Berlin. He has been developing computational methods for analyzing and integrating large-scale genomics data sets since 2002. He has published an extensive body of work in this area. The framework for this book grew out of the yearly computational genomics courses he has been organizing and teaching since 2015.

Computational Genome Analysis

Computational Genome Analysis
Author: Richard C. Deonier
Publisher: Springer Science & Business Media
Total Pages: 543
Release: 2005-12-27
Genre: Computers
ISBN: 0387288074

This book presents the foundations of key problems in computational molecular biology and bioinformatics. It focuses on computational and statistical principles applied to genomes, and introduces the mathematics and statistics that are crucial for understanding these applications. The book features a free download of the R software statistics package and the text provides great crossover material that is interesting and accessible to students in biology, mathematics, statistics and computer science. More than 100 illustrations and diagrams reinforce concepts and present key results from the primary literature. Exercises are given at the end of chapters.

Computational Methods for Analysis of Large-Scale Epigenomics Data

Computational Methods for Analysis of Large-Scale Epigenomics Data
Author: Petko Plamenov Fiziev
Publisher:
Total Pages: 248
Release: 2018
Genre:
ISBN:

Reverse-engineering and understanding the regulatory dynamics of genes is key to gaining insights into many biological processes on molecular level. Advances in genomics technologies and decreasing costs of DNA sequencing enabled interrogating relevant properties of the genome, collectively referred to as epigenetics, on very large scale. This work presents results from two collaborative projects with experimental biologists and two new general computational methods for analysis of high-throughput epigenomic data. The first collaborative project is joint work with Dr. Kathrin Plath and members of her lab at UCLA on studying the epigenetics of somatic cell reprogramming in mouse. By generating and analyzing a large compendium of genomics datasets at four distinct stages during reprogramming, we discovered key properties of the regulatory dynamics during this process and proposed new ways to improve its efficiency. The first computational method in this work, ChromTime, presents a novel framework for modeling spatio-temporal dynamics of chromatin marks. ChromTime detects expanding, contracting and steady domains of chromatin marks from time course epigenomics data. Applications of the method to a diverse set of biological systems show that predicted dynamic domains likely mark important regulatory regions as they associate with changes in gene expression and transcription factor binding. Furthermore, ChromTime enables analyses of the directionality of spatio-temporal dynamics of epigenetic domains, which is a previously understudied aspect of chromatin dynamics. Our results uncover associations between the direction of expanding and contracting domains of several chromatin marks and the direction of transcription of nearby genes. The second collaborative project is joint work with cancer researchers, Dr. Lynda Chin and Dr. Kunal Rai and members of their labs at MD Anderson Cancer Center in Houston, TX. Within this project we studied the epigenetics of melanoma cancer progression. Our collaborators generated genome-wide maps for a large number of histone modifications, DNA methylation and gene expression in tumorigenic and non-tumorigenic human melanocytes. By comparing these maps we discovered that loss of acetylation marks at regulatory regions is characteristic of tumorigenic melanocytes and that modulating acetylation levels can impact tumorigenic potential of cells. In addition, we developed a novel nanostring assay for interrogating the chromatin state at a small subset of genomic locations, which can potentially be used for diagnostic or prognostic purposes in future. The second computational method presented in this work, CSDELTA, is designed to detect differential chromatin sites from genome-wide chromatin state maps in groups with multiple samples. Biological relevance of detected differential sites is supported by associations with changes in gene expression and transcription factor binding. Furthermore, CSDELTA models the functional similarity between chromatin states and improves upon the resolution of detection compared to existing methods, which enables more accurate downstream analyses to gain insights into the regulatory dynamics of biological systems.

Computational Genome Analysis

Computational Genome Analysis
Author: Richard C. Deonier
Publisher: Springer Science & Business Media
Total Pages: 542
Release: 2005-12-27
Genre: Computers
ISBN: 0387288074

This book presents the foundations of key problems in computational molecular biology and bioinformatics. It focuses on computational and statistical principles applied to genomes, and introduces the mathematics and statistics that are crucial for understanding these applications. The book features a free download of the R software statistics package and the text provides great crossover material that is interesting and accessible to students in biology, mathematics, statistics and computer science. More than 100 illustrations and diagrams reinforce concepts and present key results from the primary literature. Exercises are given at the end of chapters.

Theoretical and Computational Methods in Genome Research

Theoretical and Computational Methods in Genome Research
Author: Sándor Suhai
Publisher: Springer Science & Business Media
Total Pages: 332
Release: 2012-12-06
Genre: Science
ISBN: 1461559030

The application ofcomputational methods to solve scientific and practical problems in genome research created a new interdisciplinary area that transcends boundaries tradi tionally separating genetics, biology, mathematics, physics, and computer science. Com puters have, of course, been intensively used in the field of life sciences for many years, even before genome research started, to store and analyze DNA or protein sequences; to explore and model the three-dimensional structure, the dynamics, and the function of biopolymers; to compute genetic linkage or evolutionary processes; and more. The rapid development of new molecular and genetic technologies, combined with ambitious goals to explore the structure and function ofgenomes ofhigher organisms, has generated, how ever, not only a huge and exponentially increasing body of data but also a new class of scientific questions. The nature and complexity of these questions will also require, be yond establishing a new kind ofalliance between experimental and theoretical disciplines, the development of new generations both in computer software and hardware technolo gies. New theoretical procedures, combined with powerful computational facilities, will substantially extend the horizon of problems that genome research can attack with suc cess. Many of us still feel that computational models rationalizing experimental findings in genome research fulfill their promises more slowly than desired. There is also an uncer tainty concerning the real position of a "theoretical genome research" in the network of established disciplines integrating their efforts in this field.

A Study of Computational Methods to Analyze Gene Expression Data

A Study of Computational Methods to Analyze Gene Expression Data
Author: Youn Hee Ko
Publisher:
Total Pages:
Release: 2011
Genre:
ISBN:

The recent advent of new technologies has led to huge amounts of genomic data. With these data come new opportunities to understand biological cellular processes underlying hidden regulation mechanisms and to identify disease related biomarkers for informative diagnostics. However, extracting biological insights from the immense amounts of genomic data is a challenging task. Therefore, effective and efficient computational techniques are needed to analyze and interpret genomic data. In this thesis, novel computational methods are proposed to address such challenges: a Bayesian mixture model, an extended Bayesian mixture model, and an Eigen-brain approach. The Bayesian mixture framework involves integration of the Bayesian network and the Gaussian mixture model. Based on the proposed framework and its conjunction with K-means clustering and principal component analysis (PCA), biological insights are derived such as context specific/dependent relationships and nested structures within microarray where biological replicates are encapsulated. The Bayesian mixture framework is then extended to explore posterior distributions of network space by incorporating a Markov chain Monte Carlo (MCMC) model. The extended Bayesian mixture model summarizes the sampled network structures by extracting biologically meaningful features. Finally, an Eigen-brain approach is proposed to analyze in situ hybridization data for the identification of the cell-type specific genes, which can be useful for informative blood diagnostics. Computational results with region-based clustering reveals the critical evidence for the consistency with brain anatomical structure.

Computational and Statistical Approaches to Genomics

Computational and Statistical Approaches to Genomics
Author: Wei Zhang
Publisher: Springer Science & Business Media
Total Pages: 426
Release: 2007-12-26
Genre: Science
ISBN: 0387262881

The second edition of this book adds eight new contributors to reflect a modern cutting edge approach to genomics. It contains the newest research results on genomic analysis and modeling using state-of-the-art methods from engineering, statistics, and genomics. These tools and models are then applied to real biological and clinical problems. The book’s original seventeen chapters are also updated to provide new initiatives and directions.

Computational Methods for Next Generation Sequencing Data Analysis

Computational Methods for Next Generation Sequencing Data Analysis
Author: Ion Mandoiu
Publisher: John Wiley & Sons
Total Pages: 460
Release: 2016-10-03
Genre: Computers
ISBN: 1118169484

Introduces readers to core algorithmic techniques for next-generation sequencing (NGS) data analysis and discusses a wide range of computational techniques and applications This book provides an in-depth survey of some of the recent developments in NGS and discusses mathematical and computational challenges in various application areas of NGS technologies. The 18 chapters featured in this book have been authored by bioinformatics experts and represent the latest work in leading labs actively contributing to the fast-growing field of NGS. The book is divided into four parts: Part I focuses on computing and experimental infrastructure for NGS analysis, including chapters on cloud computing, modular pipelines for metabolic pathway reconstruction, pooling strategies for massive viral sequencing, and high-fidelity sequencing protocols. Part II concentrates on analysis of DNA sequencing data, covering the classic scaffolding problem, detection of genomic variants, including insertions and deletions, and analysis of DNA methylation sequencing data. Part III is devoted to analysis of RNA-seq data. This part discusses algorithms and compares software tools for transcriptome assembly along with methods for detection of alternative splicing and tools for transcriptome quantification and differential expression analysis. Part IV explores computational tools for NGS applications in microbiomics, including a discussion on error correction of NGS reads from viral populations, methods for viral quasispecies reconstruction, and a survey of state-of-the-art methods and future trends in microbiome analysis. Computational Methods for Next Generation Sequencing Data Analysis: Reviews computational techniques such as new combinatorial optimization methods, data structures, high performance computing, machine learning, and inference algorithms Discusses the mathematical and computational challenges in NGS technologies Covers NGS error correction, de novo genome transcriptome assembly, variant detection from NGS reads, and more This text is a reference for biomedical professionals interested in expanding their knowledge of computational techniques for NGS data analysis. The book is also useful for graduate and post-graduate students in bioinformatics.