Feature Engineering and Selection

Author: Max Kuhn
Publisher: CRC Press
Total Pages: 266
Release: 2019-07-25
Genre: Business & Economics
ISBN: 1351609467

The process of developing predictive models includes many stages. Most resources focus on the modeling algorithms but neglect other critical aspects of the modeling process. This book describes techniques for finding the best representations of predictors for modeling and for finding the best subset of predictors for improving model performance. A variety of example data sets are used to illustrate the techniques along with R programs for reproducing the results.

Feature Models

Author: Alexander Felfernig, Andreas Falkner, David Benavides
Publisher: Springer Nature
Total Pages: 129
Release: 2024
Genre: Artificial intelligence
ISBN: 3031618742

This open access book provides a basic introduction to feature modelling and analysis as well as to the integration of AI methods with feature modelling. It is intended as an introduction for researchers and practitioners who are new to the field and will also serve as a state-of-the-art reference to this audience. While focusing on the AI perspective, the book covers the topics of feature modelling (including languages and semantics), feature model analysis, and interacting with feature model configurators. These topics are discussed along the AI areas of knowledge representation and reasoning, explainable AI, and machine learning.

Interpretable Machine Learning

Author: Christoph Molnar
Publisher: Lulu.com
Total Pages: 320
Release: 2020
Genre: Artificial intelligence
ISBN: 0244768528

This book is about making machine learning models and their decisions interpretable. After exploring the concepts of interpretability, you will learn about simple, interpretable models such as decision trees, decision rules and linear regression. Later chapters focus on general model-agnostic methods for interpreting black box models, such as feature importance and accumulated local effects, and on explaining individual predictions with Shapley values and LIME. All interpretation methods are explained in depth and discussed critically. How do they work under the hood? What are their strengths and weaknesses? How can their outputs be interpreted? This book will enable you to select and correctly apply the interpretation method that is most suitable for your machine learning project.

The Self-Service Data Roadmap

Author: Sandeep Uttamchandani
Publisher: O'Reilly Media, Inc.
Total Pages: 297
Release: 2020-09-10
Genre: Computers
ISBN: 1492075205

Data-driven insights are a key competitive advantage for any industry today, but deriving insights from raw data can still take days or weeks. Most organizations can’t scale data science teams fast enough to keep up with the growing amounts of data to transform. What’s the answer? Self-service data. With this practical book, data engineers, data scientists, and team managers will learn how to build a self-service data science platform that helps anyone in your organization extract insights from data. Sandeep Uttamchandani provides a scorecard to track and address bottlenecks that slow down time to insight across data discovery, transformation, processing, and production. This book bridges the gap between data scientists bottlenecked by engineering realities and data engineers unclear about ways to make self-service work. Build a self-service portal to support data discovery, quality, lineage, and governance. Select the best approach for each self-service capability using open source cloud technologies. Tailor self-service for the people, processes, and technology maturity of your data platform. Implement capabilities to democratize data and reduce time to insight. Scale your self-service portal to support a large number of users within your organization.

Feature Engineering for Machine Learning

Author: Alice Zheng
Publisher: O'Reilly Media, Inc.
Total Pages: 218
Release: 2018-03-23
Genre: Computers
ISBN: 1491953195

Feature engineering is a crucial step in the machine-learning pipeline, yet this topic is rarely examined on its own. With this practical book, you’ll learn techniques for extracting and transforming features (the numeric representations of raw data) into formats for machine-learning models. Each chapter guides you through a single data problem, such as how to represent text or image data. Together, these examples illustrate the main principles of feature engineering. Rather than simply teach these principles, authors Alice Zheng and Amanda Casari focus on practical application with exercises throughout the book. The closing chapter brings everything together by tackling a real-world, structured dataset with several feature-engineering techniques. Python packages including numpy, Pandas, Scikit-learn, and Matplotlib are used in code examples. You’ll examine: feature engineering for numeric data (filtering, binning, scaling, log transforms, and power transforms); natural text techniques (bag-of-words, n-grams, and phrase detection); frequency-based filtering and feature scaling for eliminating uninformative features; encoding techniques for categorical variables, including feature hashing and bin-counting; model-based feature engineering with principal component analysis; the concept of model stacking, using k-means as a featurization technique; and image feature extraction with manual and deep-learning techniques.

EMPOWERED

Author: Marty Cagan
Publisher: John Wiley & Sons
Total Pages: 435
Release: 2020-12-03
Genre: Business & Economics
ISBN: 1119691257

"Great teams are comprised of ordinary people that are empowered and inspired. They are empowered to solve hard problems in ways their customers love yet work for their business. They are inspired with ideas and techniques for quickly evaluating those ideas to discover solutions that work: they are valuable, usable, feasible and viable. This book is about the idea and reality of "achieving extraordinary results from ordinary people". Empowered is the companion to Inspired. It addresses the other half of the problem of building tech products?how to get the absolute best work from your product teams. However, the book's message applies much more broadly than just to product teams. Inspired was aimed at product managers. Empowered is aimed at all levels of technology-powered organizations: founders and CEO's, leaders of product, technology and design, and the countless product managers, product designers and engineers that comprise the teams. This book will not just inspire companies to empower their employees but will teach them how. This book will help readers achieve the benefits of truly empowered teams"--

Formal Methods and Software Engineering

Author: Zhenhua Duan
Publisher: Springer
Total Pages: 509
Release: 2017-10-13
Genre: Computers
ISBN: 3319686909

This book constitutes the refereed proceedings of the 19th International Conference on Formal Engineering Methods, ICFEM 2017, held in Xi'an, China, in November 2017. The 28 revised full papers presented together with one invited talk and two abstracts of invited talks were carefully reviewed and selected from 80 submissions. The conference focuses on all areas related to formal engineering methods, such as verification and validation, software engineering, formal specification and modeling, software security, and software reliability.

Software Product Lines

Author: Patrick Donohoe
Publisher: Springer Science & Business Media
Total Pages: 527
Release: 2012-12-06
Genre: Computers
ISBN: 1461543398

Software product lines are emerging as a critical new paradigm for software development. Product lines are enabling organizations to achieve impressive time-to-market gains and cost reductions. With the increasing number of product lines and product-line researchers and practitioners, the time is right for a comprehensive examination of the issues surrounding the software product line approach. The Software Engineering Institute at Carnegie Mellon University is proud to sponsor the first conference on this important subject. This book comprises the proceedings of the First Software Product Line Conference (SPLC1), held August 28-31, 2000, in Denver, Colorado, USA. The twenty-seven papers of the conference technical program present research results and experience reports that cover all aspects of software product lines. Topics include business issues, enabling technologies, organizational issues, and life-cycle issues. Emphasis is placed on experiences in the development and fielding of product lines of complex systems, especially those that expose problems in the design, development, or evolution of software product lines. The book will be essential reading for researchers and practitioners alike.

Web Information Systems Engineering

Author: Armin Haller
Publisher: Springer
Total Pages: 363
Release: 2013-05-13
Genre: Computers
ISBN: 3642383335

This book constitutes the revised selected papers of the combined workshops on Web Information Systems Engineering, WISE 2011 and WISE 2012, held in Sydney, Australia, in October 2011 and in Paphos, Cyprus, in November 2012. The seven workshops of WISE 2011-2012 have reported the recent developments and advances in the contemporary topics in the related fields of: Advanced Reasoning Technology for e-Science (ART 2012), Cloud-Enabled Business Process Management (CeBPM 2012), Engineering in the Semantic Enterprise (ESE 2012), Social Web Analysis for Trend Detection (SoWeTrend 2012), Big Data and Cloud (BDC 2012), Personalization in Cloud and Service Computing (PC-S 2011), and User-Focused Service Engineering, Consumption and Aggregation (USECA 2011).

Feature Engineering Bookcamp

Author: Sinan Ozdemir
Publisher: Simon and Schuster
Total Pages: 270
Release: 2022-10-18
Genre: Computers
ISBN: 1638351406

Deliver huge improvements to your machine learning pipelines without spending hours fine-tuning parameters! This book’s practical case studies reveal feature engineering techniques that upgrade your data wrangling and your ML results. In Feature Engineering Bookcamp you will learn how to: identify and implement feature transformations for your data; build powerful machine learning pipelines with unstructured data like text and images; quantify and minimize bias in machine learning pipelines at the data level; use feature stores to build real-time feature engineering pipelines; enhance existing machine learning pipelines by manipulating the input data; and use state-of-the-art deep learning models to extract hidden patterns in data.

Feature Engineering Bookcamp guides you through a collection of projects that give you hands-on practice with core feature engineering techniques. You’ll work with feature engineering practices that reduce the time it takes to process data and deliver real improvements in your model’s performance. This instantly useful book skips the abstract mathematical theory and minutely detailed formulas; instead you’ll learn through interesting code-driven case studies, including tweet classification, COVID detection, recidivism prediction, stock price movement detection, and more.

About the technology: Get better output from machine learning pipelines by improving your training data! Use feature engineering, a machine learning technique for designing relevant input variables based on your existing data, to simplify training and enhance model performance. While fine-tuning hyperparameters or tweaking models may give you a minor performance bump, feature engineering delivers dramatic improvements by transforming your data pipeline.

About the book: Feature Engineering Bookcamp walks you through six hands-on projects where you’ll learn to upgrade your training data using feature engineering. Each chapter explores a new code-driven case study, taken from real-world industries like finance and healthcare. You’ll practice cleaning and transforming data, mitigating bias, and more. The book is full of performance-enhancing tips for all major ML subdomains, from natural language processing to time-series analysis.

What's inside: identify and implement feature transformations; build machine learning pipelines with unstructured data; quantify and minimize bias in ML pipelines; use feature stores to build real-time feature engineering pipelines; enhance existing pipelines by manipulating input data.

About the reader: For experienced machine learning engineers familiar with Python.

About the author: Sinan Ozdemir is the founder and CTO of Shiba, a former lecturer of Data Science at Johns Hopkins University, and the author of multiple textbooks on data science and machine learning.

Table of Contents
1 Introduction to feature engineering
2 The basics of feature engineering
3 Healthcare: Diagnosing COVID-19
4 Bias and fairness: Modeling recidivism
5 Natural language processing: Classifying social media sentiment
6 Computer vision: Object recognition
7 Time series analysis: Day trading with machine learning
8 Feature stores
9 Putting it all together