Apache Spark Machine Learning Blueprints

Apache Spark Machine Learning Blueprints
Author: Alex Liu
Publisher: Packt Publishing Ltd
Total Pages: 252
Release: 2016-05-30
Genre: Computers
ISBN: 1785887785

Develop a range of cutting-edge machine learning projects with Apache Spark using this actionable guide About This Book Customize Apache Spark and R to fit your analytical needs in customer research, fraud detection, risk analytics, and recommendation engine development Develop a set of practical Machine Learning applications that can be implemented in real-life projects A comprehensive, project-based guide to improve and refine your predictive models for practical implementation Who This Book Is For If you are a data scientist, a data analyst, or an R and SPSS user with a good understanding of machine learning concepts, algorithms, and techniques, then this is the book for you. Some basic understanding of Spark and its core elements and application is required. What You Will Learn Set up Apache Spark for machine learning and discover its impressive processing power Combine Spark and R to unlock detailed business insights essential for decision making Build machine learning systems with Spark that can detect fraud and analyze financial risks Build predictive models focusing on customer scoring and service ranking Build a recommendation systems using SPSS on Apache Spark Tackle parallel computing and find out how it can support your machine learning projects Turn open data and communication data into actionable insights by making use of various forms of machine learning In Detail There's a reason why Apache Spark has become one of the most popular tools in Machine Learning – its ability to handle huge datasets at an impressive speed means you can be much more responsive to the data at your disposal. This book shows you Spark at its very best, demonstrating how to connect it with R and unlock maximum value not only from the tool but also from your data. Packed with a range of project "blueprints" that demonstrate some of the most interesting challenges that Spark can help you tackle, you'll find out how to use Spark notebooks and access, clean, and join different datasets before putting your knowledge into practice with some real-world projects, in which you will see how Spark Machine Learning can help you with everything from fraud detection to analyzing customer attrition. You'll also find out how to build a recommendation engine using Spark's parallel computing powers. Style and approach This book offers a step-by-step approach to setting up Apache Spark, and use other analytical tools with it to process Big Data and build machine learning projects.The initial chapters focus more on the theory aspect of machine learning with Spark, while each of the later chapters focuses on building standalone projects using Spark.

Apache Spark Machine Learning Blueprints

Apache Spark Machine Learning Blueprints
Author: Alex Liu
Publisher: Packt Publishing Ltd
Total Pages: 252
Release: 2016-05-30
Genre: Computers
ISBN: 1785887785

Develop a range of cutting-edge machine learning projects with Apache Spark using this actionable guide About This Book Customize Apache Spark and R to fit your analytical needs in customer research, fraud detection, risk analytics, and recommendation engine development Develop a set of practical Machine Learning applications that can be implemented in real-life projects A comprehensive, project-based guide to improve and refine your predictive models for practical implementation Who This Book Is For If you are a data scientist, a data analyst, or an R and SPSS user with a good understanding of machine learning concepts, algorithms, and techniques, then this is the book for you. Some basic understanding of Spark and its core elements and application is required. What You Will Learn Set up Apache Spark for machine learning and discover its impressive processing power Combine Spark and R to unlock detailed business insights essential for decision making Build machine learning systems with Spark that can detect fraud and analyze financial risks Build predictive models focusing on customer scoring and service ranking Build a recommendation systems using SPSS on Apache Spark Tackle parallel computing and find out how it can support your machine learning projects Turn open data and communication data into actionable insights by making use of various forms of machine learning In Detail There's a reason why Apache Spark has become one of the most popular tools in Machine Learning – its ability to handle huge datasets at an impressive speed means you can be much more responsive to the data at your disposal. This book shows you Spark at its very best, demonstrating how to connect it with R and unlock maximum value not only from the tool but also from your data. Packed with a range of project "blueprints" that demonstrate some of the most interesting challenges that Spark can help you tackle, you'll find out how to use Spark notebooks and access, clean, and join different datasets before putting your knowledge into practice with some real-world projects, in which you will see how Spark Machine Learning can help you with everything from fraud detection to analyzing customer attrition. You'll also find out how to build a recommendation engine using Spark's parallel computing powers. Style and approach This book offers a step-by-step approach to setting up Apache Spark, and use other analytical tools with it to process Big Data and build machine learning projects.The initial chapters focus more on the theory aspect of machine learning with Spark, while each of the later chapters focuses on building standalone projects using Spark.

Hadoop Blueprints

Hadoop Blueprints
Author: Anurag Shrivastava
Publisher: Packt Publishing Ltd
Total Pages: 312
Release: 2016-09-30
Genre: Computers
ISBN: 1783980311

Use Hadoop to solve business problems by learning from a rich set of real-life case studies About This Book Solve real-world business problems using Hadoop and other Big Data technologies Build efficient data lakes in Hadoop, and develop systems for various business cases like improving marketing campaigns, fraud detection, and more Power packed with six case studies to get you going with Hadoop for Business Intelligence Who This Book Is For If you are interested in building efficient business solutions using Hadoop, this is the book for you This book assumes that you have basic knowledge of Hadoop, Java, and any scripting language. What You Will Learn Learn about the evolution of Hadoop as the big data platform Understand the basics of Hadoop architecture Build a 360 degree view of your customer using Sqoop and Hive Build and run classification models on Hadoop using BigML Use Spark and Hadoop to build a fraud detection system Develop a churn detection system using Java and MapReduce Build an IoT-based data collection and visualization system Get to grips with building a Hadoop-based Data Lake for large enterprises Learn about the coexistence of NoSQL and In-Memory databases in the Hadoop ecosystem In Detail If you have a basic understanding of Hadoop and want to put your knowledge to use to build fantastic Big Data solutions for business, then this book is for you. Build six real-life, end-to-end solutions using the tools in the Hadoop ecosystem, and take your knowledge of Hadoop to the next level. Start off by understanding various business problems which can be solved using Hadoop. You will also get acquainted with the common architectural patterns which are used to build Hadoop-based solutions. Build a 360-degree view of the customer by working with different types of data, and build an efficient fraud detection system for a financial institution. You will also develop a system in Hadoop to improve the effectiveness of marketing campaigns. Build a churn detection system for a telecom company, develop an Internet of Things (IoT) system to monitor the environment in a factory, and build a data lake – all making use of the concepts and techniques mentioned in this book. The book covers other technologies and frameworks like Apache Spark, Hive, Sqoop, and more, and how they can be used in conjunction with Hadoop. You will be able to try out the solutions explained in the book and use the knowledge gained to extend them further in your own problem space. Style and approach This is an example-driven book where each chapter covers a single business problem and describes its solution by explaining the structure of a dataset and tools required to process it. Every project is demonstrated with a step-by-step approach, and explained in a very easy-to-understand manner.

Transforming Healthcare with Big Data and AI

Transforming Healthcare with Big Data and AI
Author: Mingbo Gong
Publisher: IAP
Total Pages: 185
Release: 2020-04-01
Genre: Computers
ISBN: 1641138998

Healthcare and technology are at a convergence point where significant changes are poised to take place. The vast and complex requirements of medical record keeping, coupled with stringent patient privacy laws, create an incredibly unwieldy maze of health data needs. While the past decade has seen giant leaps in AI, machine learning, wearable technologies, and data mining capacities that have enabled quantities of data to be accumulated, processed, and shared around the globe. Transforming Healthcare with Big Data and AI examines the crossroads of these two fields and looks to the future of leveraging advanced technologies and developing data ecosystems to the healthcare field. This book is the product of the Transforming Healthcare with Data conference, held at the University of Southern California. Many speakers and digital healthcare industry leaders contributed multidisciplinary expertise to chapters in this work. Authors’ backgrounds range from data scientists, healthcare experts, university professors, and digital healthcare entrepreneurs. If you have an understanding of data technologies and are interested in the future of Big Data and A.I. in healthcare, this book will provide a wealth of insights into the new landscape of healthcare.

Unreal Engine 4 Virtual Reality Projects

Unreal Engine 4 Virtual Reality Projects
Author: Kevin Mack
Publisher: Packt Publishing Ltd
Total Pages: 622
Release: 2019-04-30
Genre: Computers
ISBN: 1789133882

The key problem with VR development is understanding how to set up a project and running it on your desktop or mobile VR device. With this book, you will not only learn the specifics of virtual reality development in Unreal but also build immersive and fun VR projects that can be experienced on your VR devices.

Cloud Computing for Machine Learning and Cognitive Applications

Cloud Computing for Machine Learning and Cognitive Applications
Author: Kai Hwang
Publisher: MIT Press
Total Pages: 626
Release: 2017-06-16
Genre: Computers
ISBN: 026203641X

The first textbook to teach students how to build data analytic solutions on large data sets using cloud-based technologies. This is the first textbook to teach students how to build data analytic solutions on large data sets (specifically in Internet of Things applications) using cloud-based technologies for data storage, transmission and mashup, and AI techniques to analyze this data. This textbook is designed to train college students to master modern cloud computing systems in operating principles, architecture design, machine learning algorithms, programming models and software tools for big data mining, analytics, and cognitive applications. The book will be suitable for use in one-semester computer science or electrical engineering courses on cloud computing, machine learning, cloud programming, cognitive computing, or big data science. The book will also be very useful as a reference for professionals who want to work in cloud computing and data science. Cloud and Cognitive Computing begins with two introductory chapters on fundamentals of cloud computing, data science, and adaptive computing that lay the foundation for the rest of the book. Subsequent chapters cover topics including cloud architecture, mashup services, virtual machines, Docker containers, mobile clouds, IoT and AI, inter-cloud mashups, and cloud performance and benchmarks, with a focus on Google's Brain Project, DeepMind, and X-Lab programs, IBKai HwangM SyNapse, Bluemix programs, cognitive initiatives, and neurocomputers. The book then covers machine learning algorithms and cloud programming software tools and application development, applying the tools in machine learning, social media, deep learning, and cognitive applications. All cloud systems are illustrated with big data and cognitive application examples.

Stream Processing with Apache Spark

Stream Processing with Apache Spark
Author: Gerard Maas
Publisher: "O'Reilly Media, Inc."
Total Pages: 396
Release: 2019-06-05
Genre: Computers
ISBN: 1491944196

Before you can build analytics tools to gain quick insights, you first need to know how to process data in real time. With this practical guide, developers familiar with Apache Spark will learn how to put this in-memory framework to use for streaming data. You’ll discover how Spark enables you to write streaming jobs in almost the same way you write batch jobs. Authors Gerard Maas and François Garillot help you explore the theoretical underpinnings of Apache Spark. This comprehensive guide features two sections that compare and contrast the streaming APIs Spark now supports: the original Spark Streaming library and the newer Structured Streaming API. Learn fundamental stream processing concepts and examine different streaming architectures Explore Structured Streaming through practical examples; learn different aspects of stream processing in detail Create and operate streaming jobs and applications with Spark Streaming; integrate Spark Streaming with other Spark APIs Learn advanced Spark Streaming techniques, including approximation algorithms and machine learning algorithms Compare Apache Spark to other stream processing projects, including Apache Storm, Apache Flink, and Apache Kafka Streams

Actionable Insights with Amazon QuickSight

Actionable Insights with Amazon QuickSight
Author: Manos Samatas
Publisher: Packt Publishing Ltd
Total Pages: 242
Release: 2022-01-28
Genre: Computers
ISBN: 1801072000

Build interactive dashboards and storytelling reports at scale with the cloud-native BI tool that integrates embedded analytics and ML-powered insights effortlessly Key FeaturesExplore Amazon QuickSight, manage data sources, and build and share dashboardsLearn best practices from an AWS certified big data solutions architect Manage and monitor dashboards using the QuickSight API and other AWS services such as Amazon CloudTrailBook Description Amazon Quicksight is an exciting new visualization that rivals PowerBI and Tableau, bringing several exciting features to the table – but sadly, there aren't many resources out there that can help you learn the ropes. This book seeks to remedy that with the help of an AWS-certified expert who will help you leverage its full capabilities. After learning QuickSight's fundamental concepts and how to configure data sources, you'll be introduced to the main analysis-building functionality of QuickSight to develop visuals and dashboards, and explore how to develop and share interactive dashboards with parameters and on-screen controls. You'll dive into advanced filtering options with URL actions before learning how to set up alerts and scheduled reports. Next, you'll familiarize yourself with the types of insights before getting to grips with adding ML insights such as forecasting capabilities, analyzing time series data, adding narratives, and outlier detection to your dashboards. You'll also explore patterns to automate operations and look closer into the API actions that allow us to control settings. Finally, you'll learn advanced topics such as embedded dashboards and multitenancy. By the end of this book, you'll be well-versed with QuickSight's BI and analytics functionalities that will help you create BI apps with ML capabilities. What you will learnUnderstand the wider AWS analytics ecosystem and how QuickSight fits within itSet up and configure data sources with Amazon QuickSightInclude custom controls and add interactivity to your BI application using parametersAdd ML insights such as forecasting, anomaly detection, and narrativesExplore patterns to automate operations using QuickSight APIsCreate interactive dashboards and storytelling with Amazon QuickSightDesign an embedded multi-tenant analytics architectureFocus on data permissions and how to manage Amazon QuickSight operationsWho this book is for This book is for business intelligence (BI) developers and data analysts who are looking to create interactive dashboards using data from Lake House on AWS with Amazon QuickSight. It will also be useful for anyone who wants to learn Amazon QuickSight in depth using practical, up-to-date examples. You will need to be familiar with general data visualization concepts before you get started with this book, however, no prior experience with Amazon QuickSight is required.

Genomics in the Azure Cloud

Genomics in the Azure Cloud
Author: Colby T. Ford
Publisher: "O'Reilly Media, Inc."
Total Pages: 330
Release: 2022-11-14
Genre: Computers
ISBN: 1098139011

This practical guide bridges the gap between general cloud computing architecture in Microsoft Azure and scientific computing for bioinformatics and genomics. You'll get a solid understanding of the architecture patterns and services that are offered in Azure and how they might be used in your bioinformatics practice. You'll get code examples that you can reuse for your specific needs. And you'll get plenty of concrete examples to illustrate how a given service is used in a bioinformatics context. You'll also get valuable advice on how to: Use enterprise platform services to easily scale your bioinformatics workloads Organize, query, and analyze genomic data at scale Build a genomics data lake and accompanying data warehouse Use Azure Machine Learning to scale your model training, track model performance, and deploy winning models Orchestrate and automate processing pipelines using Azure Data Factory and Databricks Cloudify your organization's existing bioinformatics pipelines by moving your workflows to Azure high-performance compute services And more