Statistical Inference for Everyone

Statistical Inference for Everyone
Author: Brian Blais
Publisher: Createspace Independent Publishing Platform
Total Pages: 200
Release: 2014-08-27
Genre: Mathematics
ISBN: 9781499715071

Approaching an introductory statistical inference textbook in a novel way, this book is motivated by the perspective of "probability theory as logic". Targeted to the typical "Statistics 101" college student this book covers the topics typically treated in such a course - but from a fresh angle. This book walks through a simple introduction to probability, and then applies those principles to all problems of inference. Topics include hypothesis testing, data visualization, parameter inference, and model comparison. Statistical Inference for Everyone is freely available under the Creative Commons License, and includes a software library in Python for making calculations and visualizations straightforward.

All of Statistics

All of Statistics
Author: Larry Wasserman
Publisher: Springer Science & Business Media
Total Pages: 446
Release: 2013-12-11
Genre: Mathematics
ISBN: 0387217363

Taken literally, the title "All of Statistics" is an exaggeration. But in spirit, the title is apt, as the book does cover a much broader range of topics than a typical introductory book on mathematical statistics. This book is for people who want to learn probability and statistics quickly. It is suitable for graduate or advanced undergraduate students in computer science, mathematics, statistics, and related disciplines. The book includes modern topics like non-parametric curve estimation, bootstrapping, and classification, topics that are usually relegated to follow-up courses. The reader is presumed to know calculus and a little linear algebra. No previous knowledge of probability and statistics is required. Statistics, data mining, and machine learning are all concerned with collecting and analysing data.

Statistical Inference as Severe Testing

Statistical Inference as Severe Testing
Author: Deborah G. Mayo
Publisher: Cambridge University Press
Total Pages: 503
Release: 2018-09-20
Genre: Mathematics
ISBN: 1108563309

Mounting failures of replication in social and biological sciences give a new urgency to critically appraising proposed reforms. This book pulls back the cover on disagreements between experts charged with restoring integrity to science. It denies two pervasive views of the role of probability in inference: to assign degrees of belief, and to control error rates in a long run. If statistical consumers are unaware of assumptions behind rival evidence reforms, they can't scrutinize the consequences that affect them (in personalized medicine, psychology, etc.). The book sets sail with a simple tool: if little has been done to rule out flaws in inferring a claim, then it has not passed a severe test. Many methods advocated by data experts do not stand up to severe scrutiny and are in tension with successful strategies for blocking or accounting for cherry picking and selective reporting. Through a series of excursions and exhibits, the philosophy and history of inductive inference come alive. Philosophical tools are put to work to solve problems about science and pseudoscience, induction and falsification.

The Logical Foundations of Statistical Inference

The Logical Foundations of Statistical Inference
Author: Henry E. Kyburg Jr.
Publisher: Springer Science & Business Media
Total Pages: 440
Release: 2012-12-06
Genre: Philosophy
ISBN: 9401021759

Everyone knows it is easy to lie with statistics. It is important then to be able to tell a statistical lie from a valid statistical inference. It is a relatively widely accepted commonplace that our scientific knowledge is not certain and incorrigible, but merely probable, subject to refinement, modifi cation, and even overthrow. The rankest beginner at a gambling table understands that his decisions must be based on mathematical ex pectations - that is, on utilities weighted by probabilities. It is widely held that the same principles apply almost all the time in the game of life. If we turn to philosophers, or to mathematical statisticians, or to probability theorists for criteria of validity in statistical inference, for the general principles that distinguish well grounded from ill grounded generalizations and laws, or for the interpretation of that probability we must, like the gambler, take as our guide in life, we find disagreement, confusion, and frustration. We might be prepared to find disagreements on a philosophical and theoretical level (although we do not find them in the case of deductive logic) but we do not expect, and we may be surprised to find, that these theoretical disagreements lead to differences in the conclusions that are regarded as 'acceptable' in the practice of science and public affairs, and in the conduct of business.

Naked Statistics: Stripping the Dread from the Data

Naked Statistics: Stripping the Dread from the Data
Author: Charles Wheelan
Publisher: W. W. Norton & Company
Total Pages: 307
Release: 2013-01-07
Genre: Mathematics
ISBN: 0393089827

A New York Times bestseller "Brilliant, funny…the best math teacher you never had." —San Francisco Chronicle Once considered tedious, the field of statistics is rapidly evolving into a discipline Hal Varian, chief economist at Google, has actually called "sexy." From batting averages and political polls to game shows and medical research, the real-world application of statistics continues to grow by leaps and bounds. How can we catch schools that cheat on standardized tests? How does Netflix know which movies you’ll like? What is causing the rising incidence of autism? As best-selling author Charles Wheelan shows us in Naked Statistics, the right data and a few well-chosen statistical tools can help us answer these questions and more. For those who slept through Stats 101, this book is a lifesaver. Wheelan strips away the arcane and technical details and focuses on the underlying intuition that drives statistical analysis. He clarifies key concepts such as inference, correlation, and regression analysis, reveals how biased or careless parties can manipulate or misrepresent data, and shows us how brilliant and creative researchers are exploiting the valuable data from natural experiments to tackle thorny questions. And in Wheelan’s trademark style, there’s not a dull page in sight. You’ll encounter clever Schlitz Beer marketers leveraging basic probability, an International Sausage Festival illuminating the tenets of the central limit theorem, and a head-scratching choice from the famous game show Let’s Make a Deal—and you’ll come away with insights each time. With the wit, accessibility, and sheer fun that turned Naked Economics into a bestseller, Wheelan defies the odds yet again by bringing another essential, formerly unglamorous discipline to life.

Doing Data Science

Doing Data Science
Author: Cathy O'Neil
Publisher: "O'Reilly Media, Inc."
Total Pages: 320
Release: 2013-10-09
Genre: Computers
ISBN: 144936389X

Now that people are aware that data can make the difference in an election or a business model, data science as an occupation is gaining ground. But how can you get started working in a wide-ranging, interdisciplinary field that’s so clouded in hype? This insightful book, based on Columbia University’s Introduction to Data Science class, tells you what you need to know. In many of these chapter-long lectures, data scientists from companies such as Google, Microsoft, and eBay share new algorithms, methods, and models by presenting case studies and the code they use. If you’re familiar with linear algebra, probability, and statistics, and have programming experience, this book is an ideal introduction to data science. Topics include: Statistical inference, exploratory data analysis, and the data science process Algorithms Spam filters, Naive Bayes, and data wrangling Logistic regression Financial modeling Recommendation engines and causality Data visualization Social networks and data journalism Data engineering, MapReduce, Pregel, and Hadoop Doing Data Science is collaboration between course instructor Rachel Schutt, Senior VP of Data Science at News Corp, and data science consultant Cathy O’Neil, a senior data scientist at Johnson Research Labs, who attended and blogged about the course.

Introduction to Probability

Introduction to Probability
Author: Joseph K. Blitzstein
Publisher: CRC Press
Total Pages: 599
Release: 2014-07-24
Genre: Mathematics
ISBN: 1466575573

Developed from celebrated Harvard statistics lectures, Introduction to Probability provides essential language and tools for understanding statistics, randomness, and uncertainty. The book explores a wide variety of applications and examples, ranging from coincidences and paradoxes to Google PageRank and Markov chain Monte Carlo (MCMC). Additional application areas explored include genetics, medicine, computer science, and information theory. The print book version includes a code that provides free access to an eBook version. The authors present the material in an accessible style and motivate concepts using real-world examples. Throughout, they use stories to uncover connections between the fundamental distributions in statistics and conditioning to reduce complicated problems to manageable pieces. The book includes many intuitive explanations, diagrams, and practice problems. Each chapter ends with a section showing how to perform relevant simulations and calculations in R, a free statistical software environment.

Think Stats

Think Stats
Author: Allen B. Downey
Publisher: "O'Reilly Media, Inc."
Total Pages: 137
Release: 2011-07-01
Genre: Computers
ISBN: 1449313108

If you know how to program, you have the skills to turn data into knowledge using the tools of probability and statistics. This concise introduction shows you how to perform statistical analysis computationally, rather than mathematically, with programs written in Python. You'll work with a case study throughout the book to help you learn the entire data analysis process—from collecting data and generating statistics to identifying patterns and testing hypotheses. Along the way, you'll become familiar with distributions, the rules of probability, visualization, and many other tools and concepts. Develop your understanding of probability and statistics by writing and testing code Run experiments to test statistical behavior, such as generating samples from several distributions Use simulations to understand concepts that are hard to grasp mathematically Learn topics not usually covered in an introductory course, such as Bayesian estimation Import data from almost any source using Python, rather than be limited to data that has been cleaned and formatted for statistics tools Use statistical inference to answer questions about real-world data

Statistics As Principled Argument

Statistics As Principled Argument
Author: Robert P. Abelson
Publisher: Psychology Press
Total Pages: 242
Release: 2012-09-10
Genre: Psychology
ISBN: 1135694419

In this illuminating volume, Robert P. Abelson delves into the too-often dismissed problems of interpreting quantitative data and then presenting them in the context of a coherent story about one's research. Unlike too many books on statistics, this is a remarkably engaging read, filled with fascinating real-life (and real-research) examples rather than with recipes for analysis. It will be of true interest and lasting value to beginning graduate students and seasoned researchers alike. The focus of the book is that the purpose of statistics is to organize a useful argument from quantitative evidence, using a form of principled rhetoric. Five criteria, described by the acronym MAGIC (magnitude, articulation, generality, interestingness, and credibility) are proposed as crucial features of a persuasive, principled argument. Particular statistical methods are discussed, with minimum use of formulas and heavy data sets. The ideas throughout the book revolve around elementary probability theory, t tests, and simple issues of research design. It is therefore assumed that the reader has already had some access to elementary statistics. Many examples are included to explain the connection of statistics to substantive claims about real phenomena.