Estimating Species Trees

Estimating Species Trees
Author: L. Lacey Knowles
Publisher: John Wiley & Sons
Total Pages: 332
Release: 2011-09-20
Genre: Science
ISBN: 1118211405

Recent computational and modeling advances have produced methods for estimating species trees directly, avoiding the problems and limitations of the traditional phylogenetic paradigm where an estimated gene tree is equated with the history of species divergence. The overarching goal of the volume is to increase the visibility and use of these new methods by the entire phylogenetic community by specifically addressing several challenges: (i) firm understanding of the theoretical underpinnings of the methodology, (ii) empirical examples demonstrating the utility of the methodology as well as its limitations, and (iii) attention to technical aspects involved in the actual software implementation of the methodology. As such, this volume will not only be poised to become the quintessential guide to training the next generation of researchers, but it will also be instrumental in ushering in a new phylogenetic paradigm for the 21st century.

Estimating Species Trees

Estimating Species Trees
Author: L. Lacey Knowles
Publisher: John Wiley and Sons
Total Pages: 230
Release: 2011-05-09
Genre: Science
ISBN: 1118126025

Recent computational and modeling advances have produced methods for estimating species trees directly, avoiding the problems and limitations of the traditional phylogenetic paradigm where an estimated gene tree is equated with the history of species divergence. The overarching goal of the volume is to increase the visibility and use of these new methods by the entire phylogenetic community by specifically addressing several challenges: (i) firm understanding of the theoretical underpinnings of the methodology, (ii) empirical examples demonstrating the utility of the methodology as well as its limitations, and (iii) attention to technical aspects involved in the actual software implementation of the methodology. As such, this volume will not only be poised to become the quintessential guide to training the next generation of researchers, but it will also be instrumental in ushering in a new phylogenetic paradigm for the 21st century.

Estimation of Species Tree Using Approximate Bayesian Computation

Estimation of Species Tree Using Approximate Bayesian Computation
Author: Hang Fan
Publisher:
Total Pages: 27
Release: 2010
Genre:
ISBN:

Abstract: Development of methods for estimating species trees from multilocus data is a current challenge in evolutionary biology. We propose a method for estimating the species tree topology and branch lengths using Approximate Bayesian Computation (ABC). The method takes as data a sample of observed gene tree topologies without branch lengths, and then iterates through the following sequence of steps: First, a randomly selected species tree is used to compute the distribution of gene trees topologies. This distribution is then compared to the observed gene topology frequencies, and if the fit between the observed and the predicted distribution is close enough, the proposed species tree is retained. Repeating this many times leads to a collection of retained species trees that are then used to form the estimate of the overall species tree. We test the performance of the method, which we call ST-ABC, using both simulated and empirical data. The simulation study examines both symmetric and asymmetric species trees over a range of branch lengths and sample sizes. The results from the simulation study show that the model performs very well, giving accurate estimates for both the topology and the branch lengths across the conditions studied, and that a sample size of 25 loci appears to be adequate for the method. Further, we apply the method to two empirical cases: a 4-taxon data set for primates and a 7-taxon data set for yeast. In both cases, we find that estimates obtained with ST-ABC agree with previous studies. Thus, our method is able to deal with complex data in a timely and efficient way. In addition, the method does not require sequence data, but rather uses the observed distribution of gene topologies. Therefore, this method provides a nice alternative to other currently available methods for species tree estimation.

Estimating Species Trees from Gene Trees Despite Gene Tree Incongruence Under Realistic Model Conditions

Estimating Species Trees from Gene Trees Despite Gene Tree Incongruence Under Realistic Model Conditions
Author: Md. Shamsuzzoha Bayzid
Publisher:
Total Pages: 732
Release: 2016
Genre:
ISBN:

Species tree estimation is frequently based on phylogenomic approaches that use multiple genes from throughout the genome. With the rapid growth rate of newly sequenced genomes, species tree inference from multiple genes has become one of the basic and popular tasks in comparative and evolutionary biology. However, combining data on multiple genes is not a trivial task since genes evolve through biological processes that include deep coalescence (also known as incomplete lineage sorting (ILS)), duplication and loss, horizontal gene transfer etc., so that the individual gene histories can differ from each other. In this dissertation, we focus on making advances on phylogenomic analyses with particular attention to the gene tree discordance. In addition to gene tree discordance, we consider other challenging conditions that frequently arise in genome scale data. One of these major challenges is incomplete gene trees, meaning that not all gene trees have individuals from all the species. We performed an extensive simulation study under the multi-species coalescent (MSC) model that shows that existing methods have poor accuracy when gene trees are incomplete. We formalized the optimal completion problem, which seeks to add the missing taxa (species) into the gene trees with respect to a species tree such that the distance (in terms of ILS) between the gene tree and the species tree is minimized. We developed an algorithm for solving this problem. We formalized optimization problems in the context of species tree estimation from a set of incomplete gene trees under the multi-species coalescent model, and proposed algorithms for solving these problems. We formulated different mathematical models for “gene loss” based on different reasons for incompleteness. Next, we addressed the Minimize Gene Duplication (MGD) problem, that seeks to find a species tree from a set of gene trees so as to minimize the total number of duplications needed to explain the evolutionary history. We proposed exact and heuristic algorithms to solve this NP-hard problem. Next, we showed in a comprehensive experimental study that existing methods are susceptible to poorly estimated gene trees in the presence of ILS. We proposed a new technique called “binning” that dramatically improves the performance of species tree estimation methods when gene trees are poorly estimated. We developed a novel technique called “naive binning” and subsequently proposed an improved version called “weighted statistical binning” to address the problem of gene tree estimation error. Finally, we addressed the computational challenges to reconstruct highly accurate species tree from large scale genomic data. We developed divide-and-conquer based meta-methods that can make existing methods scalable to very large datasets (in terms of the number of species). Overall, this dissertation contributes to understanding the limitations of the existing methods under realistic model conditions, developing new approaches to handle the challenging issues that frequently arise in phylogenomics, and improving and scaling the existing methods to larger datasets.

An Examination of the Transfer of Errors to Species Tree Estimation Caused by Model Selection in Gene Tree Estimation

An Examination of the Transfer of Errors to Species Tree Estimation Caused by Model Selection in Gene Tree Estimation
Author: Nevada Basdeo
Publisher:
Total Pages: 130
Release: 2017
Genre:
ISBN:

Inferences from phylogenetic trees is useful in forensic science, bioinformatics, identifying pathogens, and other applications. Thus, building accurate trees is important. Research on nucleotides substitution models has shown the models to be robust for estimating gene trees, but the effects on estimating species trees has not been examined. Cumulative errors on gene tree estimation can transfer over to species tree estimation. Even if the errors are small on each estimated gene tree, they can add up and have a significant impact on accuracy of species tree estimation. In part one of this research, simulations were used to explore how wrongly specified models affect species tree estimation. In part two, data from Austrian finches were used to explore the error of estimation in 30 genes. We found that the models we used in the simulations were robust in species tree estimation. In the finch data, 24 of the 30 estimated genes had a significant chi-square, meaning the 24 genes did not fit the data well. Genes with high GC content appear to have large residuals. Almost all of the residuals were positive suggesting that the evolutionary models were underestimating the frequency of most patterns. Having a vast majority of the genes not being correctly modeled, leads to the adage 'garbage in, garbage out, ' in reference to building a species tree. For improvements, models should better address genes with high GC content and address the under-fitting issue. Due to computational constraints, the results of the simulations may have been affected by the sample size of genes. The simulations might need a bigger sample size of genes to detect an error in species tree estimation if a true error existed.

Species Tree Inference

Species Tree Inference
Author: Laura Kubatko
Publisher: Princeton University Press
Total Pages: 352
Release: 2023-03-14
Genre: Science
ISBN: 0691207607

"Inferring evolutionary relationships among a collection of organisms -- that is, their relationship to each other on the tree of life -- remains a central focus of much of evolutionary biology as these relationships provide the background for key hypotheses. For example, support for different hypotheses about early animal evolution are contingent upon the phylogenetic relationships among the earliest animal lineages. Within the last 20 years, the field of phylogenetics has grown rapidly, both in the quantity of data available for inference and in the number of methods available for phylogenetic estimation. The authors' first book, "Estimating Species Trees: Practical and Theoretical Aspects", published in 2010, gave an overview of the state of phylogenetic practice for analyzing data at the time, but much has changed since then. The goal of this book is to serve as an updated reference on current methods within the field. The book is organized in three sections, the first of which provides an overview of the analytical and methodological developments of species tree inference. Section two focuses on empirical inference. Section three explores various applications of species trees in evolutionary biology. The combination of theoretical and empirical approaches is meant to provide readers with a level of knowledge of both the advances and limitations of species-tree inference that can help researchers in applying the methods, while also inspiring future advances among those researchers with an interest in methodological development"--

Measuring Abundance

Measuring Abundance
Author: Graham Upton
Publisher: Pelagic Publishing Ltd
Total Pages: 278
Release: 2020-10-12
Genre: Science
ISBN: 1784272337

Measuring the abundance of individuals and the diversity of species are core components of most ecological research projects and conservation monitoring. This book brings together in one place, for the first time, the methods used to estimate the abundance of individuals in nature. The statistical basis of each method is detailed along with practical considerations for survey design and data collection. Methods are illustrated using data ranging from Alaskan shrubs to Yellowstone grizzly bears, not forgetting Costa Rican ants and Prince Edward Island lobsters. Where necessary, example code for use with the open source software R is supplied. When appropriate, reference is made to other widely used programs. After opening with a brief synopsis of relevant statistical methods, the first section deals with the abundance of stationary items such as trees, shrubs, coral, etc. Following a discussion of the use of quadrats and transects in the contexts of forestry sampling and the assessment of plant cover, there are chapters addressing line-intercept sampling, the use of nearest-neighbour distances, and variable sized plots. The second section deals with individuals that move, such as birds, mammals, reptiles, fish, etc. Approaches discussed include double-observer sampling, removal sampling, capture-recapture methods and distance sampling. The final section deals with the measurement of species richness; species diversity; species-abundance distributions; and other aspects of diversity such as evenness, similarity, turnover and rarity. This is an essential reference for anyone involved in advanced undergraduate or postgraduate ecological research and teaching, or those planning and carrying out data analysis as part of conservation survey and monitoring programmes.

Specific Gravity and Other Properties of Wood and Bark for 156 Tree Species Found in North America

Specific Gravity and Other Properties of Wood and Bark for 156 Tree Species Found in North America
Author: Patrick D. Miles
Publisher:
Total Pages: 40
Release: 2009
Genre: Specific gravity
ISBN:

Much information is available for specific gravity and other properties of wood and bark, but it is widely scattered in the literature. This paper compiles information for estimation of biomass for 156 tree species found in North America for use in national forest inventory applications. We present specific gravities based on average green volume as well as 12 percent moisture content volume for calculation of oven-dry biomass. Additional information is included on bark thickness, bark voids, and bark percentages by species and green and dry weight of wood and bark. --