Resampling Methods for Dependent Data

Resampling Methods for Dependent Data
Author: S. N. Lahiri
Publisher: Springer Science & Business Media
Total Pages: 382
Release: 2013-03-09
Genre: Mathematics
ISBN: 147573803X

By giving a detailed account of bootstrap methods and their properties for dependent data, this book provides illustrative numerical examples throughout. The book fills a gap in the literature covering research on re-sampling methods for dependent data that has witnessed vigorous growth over the last two decades but remains scattered in various statistics and econometrics journals. It can be used as a graduate level text and also as a research monograph for statisticians and econometricians.

Feature Engineering and Selection

Feature Engineering and Selection
Author: Max Kuhn
Publisher: CRC Press
Total Pages: 266
Release: 2019-07-25
Genre: Business & Economics
ISBN: 1351609467

The process of developing predictive models includes many stages. Most resources focus on the modeling algorithms but neglect other critical aspects of the modeling process. This book describes techniques for finding the best representations of predictors for modeling and for nding the best subset of predictors for improving model performance. A variety of example data sets are used to illustrate the techniques along with R programs for reproducing the results.

Climate Time Series Analysis

Climate Time Series Analysis
Author: Manfred Mudelsee
Publisher: Springer Science & Business Media
Total Pages: 497
Release: 2010-08-26
Genre: Science
ISBN: 9048194822

Climate is a paradigm of a complex system. Analysing climate data is an exciting challenge, which is increased by non-normal distributional shape, serial dependence, uneven spacing and timescale uncertainties. This book presents bootstrap resampling as a computing-intensive method able to meet the challenge. It shows the bootstrap to perform reliably in the most important statistical estimation techniques: regression, spectral analysis, extreme values and correlation. This book is written for climatologists and applied statisticians. It explains step by step the bootstrap algorithms (including novel adaptions) and methods for confidence interval construction. It tests the accuracy of the algorithms by means of Monte Carlo experiments. It analyses a large array of climate time series, giving a detailed account on the data and the associated climatological questions. This makes the book self-contained for graduate students and researchers.

Financial Data Resampling for Machine Learning Based Trading

Financial Data Resampling for Machine Learning Based Trading
Author: Tomé Almeida Borges
Publisher: Springer Nature
Total Pages: 93
Release: 2021-02-22
Genre: Mathematics
ISBN: 3030683796

This book presents a system that combines the expertise of four algorithms, namely Gradient Tree Boosting, Logistic Regression, Random Forest and Support Vector Classifier to trade with several cryptocurrencies. A new method for resampling financial data is presented as alternative to the classical time sampled data commonly used in financial market trading. The new resampling method uses a closing value threshold to resample the data creating a signal better suited for financial trading, thus achieving higher returns without increased risk. The performance of the algorithm with the new resampling method and the classical time sampled data are compared and the advantages of using the system developed in this work are highlighted.

Monte Carlo Simulation and Resampling Methods for Social Science

Monte Carlo Simulation and Resampling Methods for Social Science
Author: Thomas M. Carsey
Publisher: SAGE Publications
Total Pages: 304
Release: 2013-08-05
Genre: Social Science
ISBN: 1483324923

Taking the topics of a quantitative methodology course and illustrating them through Monte Carlo simulation, this book examines abstract principles, such as bias, efficiency, and measures of uncertainty in an intuitive, visual way. Instead of thinking in the abstract about what would happen to a particular estimator "in repeated samples," the book uses simulation to actually create those repeated samples and summarize the results. The book includes basic examples appropriate for readers learning the material for the first time, as well as more advanced examples that a researcher might use to evaluate an estimator he or she was using in an actual research project. The book also covers a wide range of topics related to Monte Carlo simulation, such as resampling methods, simulations of substantive theory, simulation of quantities of interest (QI) from model results, and cross-validation. Complete R code from all examples is provided so readers can replicate every analysis presented using R.

Python Data Science Handbook

Python Data Science Handbook
Author: Jake VanderPlas
Publisher: "O'Reilly Media, Inc."
Total Pages: 609
Release: 2016-11-21
Genre: Computers
ISBN: 1491912138

For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources exist for individual pieces of this data science stack, but only with the Python Data Science Handbook do you get them all—IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and other related tools. Working scientists and data crunchers familiar with reading and writing Python code will find this comprehensive desk reference ideal for tackling day-to-day issues: manipulating, transforming, and cleaning data; visualizing different types of data; and using data to build statistical or machine learning models. Quite simply, this is the must-have reference for scientific computing in Python. With this handbook, you’ll learn how to use: IPython and Jupyter: provide computational environments for data scientists using Python NumPy: includes the ndarray for efficient storage and manipulation of dense data arrays in Python Pandas: features the DataFrame for efficient storage and manipulation of labeled/columnar data in Python Matplotlib: includes capabilities for a flexible range of data visualizations in Python Scikit-Learn: for efficient and clean Python implementations of the most important and established machine learning algorithms

Permutation Tests

Permutation Tests
Author: Phillip Good
Publisher: Springer Science & Business Media
Total Pages: 238
Release: 2013-03-09
Genre: Mathematics
ISBN: 1475723466

A step-by-step guide to the application of permutation tests in biology, medicine, science, and engineering. The intuitive and informal style makes this manual ideally suitable for students and researchers approaching these methods for the first time. In particular, it shows how to handle the problems of missing and censored data, nonresponders, after-the-fact covariates, and outliers.

An Introduction to Statistical Learning

An Introduction to Statistical Learning
Author: Gareth James
Publisher: Springer Nature
Total Pages: 617
Release: 2023-08-01
Genre: Mathematics
ISBN: 3031387473

An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fields ranging from biology to finance, marketing, and astrophysics in the past twenty years. This book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classification, resampling methods, shrinkage approaches, tree-based methods, support vector machines, clustering, deep learning, survival analysis, multiple testing, and more. Color graphics and real-world examples are used to illustrate the methods presented. This book is targeted at statisticians and non-statisticians alike, who wish to use cutting-edge statistical learning techniques to analyze their data. Four of the authors co-wrote An Introduction to Statistical Learning, With Applications in R (ISLR), which has become a mainstay of undergraduate and graduate classrooms worldwide, as well as an important reference book for data scientists. One of the keys to its success was that each chapter contains a tutorial on implementing the analyses and methods presented in the R scientific computing environment. However, in recent years Python has become a popular language for data science, and there has been increasing demand for a Python-based alternative to ISLR. Hence, this book (ISLP) covers the same materials as ISLR but with labs implemented in Python. These labs will be useful both for Python novices, as well as experienced users.

Time-Series Forecasting

Time-Series Forecasting
Author: Chris Chatfield
Publisher: CRC Press
Total Pages: 281
Release: 2000-10-25
Genre: Business & Economics
ISBN: 1420036203

From the author of the bestselling "Analysis of Time Series," Time-Series Forecasting offers a comprehensive, up-to-date review of forecasting methods. It provides a summary of time-series modelling procedures, followed by a brief catalogue of many different time-series forecasting methods, ranging from ad-hoc methods through ARIMA and state-space

Nonlinear Time Series Analysis

Nonlinear Time Series Analysis
Author: Ruey S. Tsay
Publisher: John Wiley & Sons
Total Pages: 516
Release: 2018-09-13
Genre: Mathematics
ISBN: 1119264065

A comprehensive resource that draws a balance between theory and applications of nonlinear time series analysis Nonlinear Time Series Analysis offers an important guide to both parametric and nonparametric methods, nonlinear state-space models, and Bayesian as well as classical approaches to nonlinear time series analysis. The authors—noted experts in the field—explore the advantages and limitations of the nonlinear models and methods and review the improvements upon linear time series models. The need for this book is based on the recent developments in nonlinear time series analysis, statistical learning, dynamic systems and advanced computational methods. Parametric and nonparametric methods and nonlinear and non-Gaussian state space models provide a much wider range of tools for time series analysis. In addition, advances in computing and data collection have made available large data sets and high-frequency data. These new data make it not only feasible, but also necessary to take into consideration the nonlinearity embedded in most real-world time series. This vital guide: • Offers research developed by leading scholars of time series analysis • Presents R commands making it possible to reproduce all the analyses included in the text • Contains real-world examples throughout the book • Recommends exercises to test understanding of material presented • Includes an instructor solutions manual and companion website Written for students, researchers, and practitioners who are interested in exploring nonlinearity in time series, Nonlinear Time Series Analysis offers a comprehensive text that explores the advantages and limitations of the nonlinear models and methods and demonstrates the improvements upon linear time series models.