Apache Spark Machine Learning Blueprints

Apache Spark Machine Learning Blueprints
Author: Alex Liu
Publisher: Packt Publishing Ltd
Total Pages: 252
Release: 2016-05-30
Genre: Computers
ISBN: 1785887785

Develop a range of cutting-edge machine learning projects with Apache Spark using this actionable guide.

About This Book: Customize Apache Spark and R to fit your analytical needs in customer research, fraud detection, risk analytics, and recommendation engine development. Develop a set of practical machine learning applications that can be implemented in real-life projects. A comprehensive, project-based guide to improve and refine your predictive models for practical implementation.

Who This Book Is For: If you are a data scientist, a data analyst, or an R and SPSS user with a good understanding of machine learning concepts, algorithms, and techniques, then this is the book for you. Some basic understanding of Spark and its core elements and application is required.

What You Will Learn: Set up Apache Spark for machine learning and discover its impressive processing power. Combine Spark and R to unlock detailed business insights essential for decision making. Build machine learning systems with Spark that can detect fraud and analyze financial risks. Build predictive models focusing on customer scoring and service ranking. Build a recommendation system using SPSS on Apache Spark. Tackle parallel computing and find out how it can support your machine learning projects. Turn open data and communication data into actionable insights by making use of various forms of machine learning.

In Detail: There's a reason why Apache Spark has become one of the most popular tools in machine learning: its ability to handle huge datasets at an impressive speed means you can be much more responsive to the data at your disposal. This book shows you Spark at its very best, demonstrating how to connect it with R and unlock maximum value not only from the tool but also from your data. Packed with a range of project "blueprints" that demonstrate some of the most interesting challenges that Spark can help you tackle, the book shows you how to use Spark notebooks and how to access, clean, and join different datasets before putting your knowledge into practice with some real-world projects, in which you will see how Spark machine learning can help you with everything from fraud detection to analyzing customer attrition. You'll also find out how to build a recommendation engine using Spark's parallel computing powers.

Style and approach: This book offers a step-by-step approach to setting up Apache Spark and using other analytical tools with it to process Big Data and build machine learning projects. The initial chapters focus more on the theory of machine learning with Spark, while each of the later chapters focuses on building a standalone project using Spark.

Hadoop Blueprints

Hadoop Blueprints
Author: Anurag Shrivastava
Publisher: Packt Publishing Ltd
Total Pages: 312
Release: 2016-09-30
Genre: Computers
ISBN: 1783980311

Use Hadoop to solve business problems by learning from a rich set of real-life case studies.

About This Book: Solve real-world business problems using Hadoop and other Big Data technologies. Build efficient data lakes in Hadoop, and develop systems for various business cases like improving marketing campaigns, fraud detection, and more. Power packed with six case studies to get you going with Hadoop for Business Intelligence.

Who This Book Is For: If you are interested in building efficient business solutions using Hadoop, this is the book for you. This book assumes that you have basic knowledge of Hadoop, Java, and any scripting language.

What You Will Learn: Learn about the evolution of Hadoop as the big data platform. Understand the basics of Hadoop architecture. Build a 360-degree view of your customer using Sqoop and Hive. Build and run classification models on Hadoop using BigML. Use Spark and Hadoop to build a fraud detection system. Develop a churn detection system using Java and MapReduce. Build an IoT-based data collection and visualization system. Get to grips with building a Hadoop-based data lake for large enterprises. Learn about the coexistence of NoSQL and in-memory databases in the Hadoop ecosystem.

In Detail: If you have a basic understanding of Hadoop and want to put your knowledge to use to build fantastic Big Data solutions for business, then this book is for you. Build six real-life, end-to-end solutions using the tools in the Hadoop ecosystem, and take your knowledge of Hadoop to the next level. Start off by understanding various business problems that can be solved using Hadoop. You will also get acquainted with the common architectural patterns used to build Hadoop-based solutions. Build a 360-degree view of the customer by working with different types of data, and build an efficient fraud detection system for a financial institution. You will also develop a system in Hadoop to improve the effectiveness of marketing campaigns. Build a churn detection system for a telecom company, develop an Internet of Things (IoT) system to monitor the environment in a factory, and build a data lake, all making use of the concepts and techniques covered in this book. The book also covers other technologies and frameworks, such as Apache Spark, Hive, and Sqoop, and how they can be used in conjunction with Hadoop. You will be able to try out the solutions explained in the book and use the knowledge gained to extend them further in your own problem space.

Style and approach: This is an example-driven book where each chapter covers a single business problem and describes its solution by explaining the structure of a dataset and the tools required to process it. Every project is demonstrated with a step-by-step approach and explained in a very easy-to-understand manner.

Transforming Healthcare with Big Data and AI

Transforming Healthcare with Big Data and AI
Author: Mingbo Gong
Publisher: IAP
Total Pages: 185
Release: 2020-04-01
Genre: Computers
ISBN: 1641138998

Healthcare and technology are at a convergence point where significant changes are poised to take place. The vast and complex requirements of medical record keeping, coupled with stringent patient privacy laws, create an incredibly unwieldy maze of health data needs. Meanwhile, the past decade has seen giant leaps in AI, machine learning, wearable technologies, and data mining capacities that have enabled large quantities of data to be accumulated, processed, and shared around the globe. Transforming Healthcare with Big Data and AI examines the crossroads of these two fields and looks to the future of leveraging advanced technologies and developing data ecosystems in the healthcare field. This book is the product of the Transforming Healthcare with Data conference, held at the University of Southern California. Many speakers and digital healthcare industry leaders contributed multidisciplinary expertise to chapters in this work. The authors include data scientists, healthcare experts, university professors, and digital healthcare entrepreneurs. If you have an understanding of data technologies and are interested in the future of Big Data and AI in healthcare, this book will provide a wealth of insights into the new landscape of healthcare.

Learn Azure Synapse Data Explorer

Learn Azure Synapse Data Explorer
Author: Pericles (Peri) Rocha
Publisher: Packt Publishing Ltd
Total Pages: 346
Release: 2023-02-17
Genre: Computers
ISBN: 1803239611

A hands-on guide to working on use cases that help you ingest, analyze, and serve insightful data from IoT and telemetry data sources using Azure Synapse Data Explorer. Free PDF included with this book.

Key Features: Augment advanced analytics projects with your IoT and application data. Expand your existing Azure Synapse environments with unstructured data. Build industry-level projects on integration, experimentation, and dashboarding with Azure Synapse.

Book Description: Large volumes of data are generated daily from applications, websites, IoT devices, and other free-text, semi-structured data sources. Azure Synapse Data Explorer helps you collect, store, and analyze such data, and work with other analytical engines, such as Apache Spark, to develop advanced data science projects and maximize the value you extract from data. This book offers a comprehensive view of Azure Synapse Data Explorer, exploring not only the core scenarios of Data Explorer but also how it integrates within Azure Synapse. From data ingestion to data visualization and advanced analytics, you'll learn to take an end-to-end approach to maximize the value of unstructured data and drive powerful insights using data science capabilities. With real-world usage scenarios, you'll discover how to identify key projects where Azure Synapse Data Explorer can help you achieve your business goals. Throughout the chapters, you'll also find out how to manage big data as part of a software as a service (SaaS) platform, as well as tune, secure, and serve data to end users. By the end of this book, you'll have mastered the big data life cycle and you'll be able to implement advanced analytical scenarios from raw telemetry and log data.

What you will learn: Integrate Data Explorer pools with all other Azure Synapse services. Create Data Explorer pools with Azure Synapse Studio and Azure Portal. Ingest, analyze, and serve data to users using Azure Synapse pipelines. Integrate Power BI and visualize data with Synapse Studio. Configure Azure Machine Learning integration in Azure Synapse. Manage cost and troubleshoot Data Explorer pools in Synapse Analytics. Secure Synapse workspaces and grant access to Data Explorer pools.

Who this book is for: If you are a data engineer, data analyst, or business analyst working with unstructured data and looking to learn how to maximize the value of such data, this book is for you. If you already have experience working with Azure Synapse and want to incorporate unstructured data into your data science project, you'll also find plenty of useful information in this book. To maximize your learning experience, familiarity with data and performing simple queries using SQL or KQL is recommended. Basic knowledge of Python will help you get more from the examples.

Stream Processing with Apache Spark

Stream Processing with Apache Spark
Author: Gerard Maas
Publisher: O'Reilly Media
Total Pages: 453
Release: 2019-06-05
Genre: Computers
ISBN: 1491944218

Before you can build analytics tools to gain quick insights, you first need to know how to process data in real time. With this practical guide, developers familiar with Apache Spark will learn how to put this in-memory framework to use for streaming data. You’ll discover how Spark enables you to write streaming jobs in almost the same way you write batch jobs. Authors Gerard Maas and François Garillot help you explore the theoretical underpinnings of Apache Spark. This comprehensive guide features two sections that compare and contrast the streaming APIs Spark now supports: the original Spark Streaming library and the newer Structured Streaming API.

Learn fundamental stream processing concepts and examine different streaming architectures. Explore Structured Streaming through practical examples; learn different aspects of stream processing in detail. Create and operate streaming jobs and applications with Spark Streaming; integrate Spark Streaming with other Spark APIs. Learn advanced Spark Streaming techniques, including approximation algorithms and machine learning algorithms. Compare Apache Spark to other stream processing projects, including Apache Storm, Apache Flink, and Apache Kafka Streams.

Automated Machine Learning for Business

Automated Machine Learning for Business
Author: Kai R. Larsen
Publisher: Oxford University Press
Total Pages: 353
Release: 2021
Genre: Business & Economics
ISBN: 0190941650

This book teaches the full process of how to conduct machine learning in an organizational setting. It develops the problem-solving mind-set needed for machine learning and takes the reader through several exercises using an automated machine learning tool. To build experience with machine learning, the book provides access to the industry-leading AutoML tool, DataRobot, and provides several data sets designed to build deep hands-on knowledge of machine learning.

Unreal Engine 4 Virtual Reality Projects

Unreal Engine 4 Virtual Reality Projects
Author: Kevin Mack
Publisher: Packt Publishing Ltd
Total Pages: 622
Release: 2019-04-30
Genre: Computers
ISBN: 1789133882

The key problem with VR development is understanding how to set up a project and run it on your desktop or mobile VR device. With this book, you will not only learn the specifics of virtual reality development in Unreal but also build immersive and fun VR projects that can be experienced on your VR devices.

Cloud Computing for Machine Learning and Cognitive Applications

Cloud Computing for Machine Learning and Cognitive Applications
Author: Kai Hwang
Publisher: MIT Press
Total Pages: 626
Release: 2017-06-16
Genre: Computers
ISBN: 026203641X

The first textbook to teach students how to build data analytic solutions on large data sets using cloud-based technologies. This is the first textbook to teach students how to build data analytic solutions on large data sets (specifically in Internet of Things applications) using cloud-based technologies for data storage, transmission and mashup, and AI techniques to analyze this data. This textbook is designed to train college students to master modern cloud computing systems in operating principles, architecture design, machine learning algorithms, programming models and software tools for big data mining, analytics, and cognitive applications. The book will be suitable for use in one-semester computer science or electrical engineering courses on cloud computing, machine learning, cloud programming, cognitive computing, or big data science. The book will also be very useful as a reference for professionals who want to work in cloud computing and data science. Cloud and Cognitive Computing begins with two introductory chapters on fundamentals of cloud computing, data science, and adaptive computing that lay the foundation for the rest of the book. Subsequent chapters cover topics including cloud architecture, mashup services, virtual machines, Docker containers, mobile clouds, IoT and AI, inter-cloud mashups, and cloud performance and benchmarks, with a focus on Google's Brain Project, DeepMind, and X-Lab programs, IBM SyNapse, Bluemix programs, cognitive initiatives, and neurocomputers. The book then covers machine learning algorithms and cloud programming software tools and application development, applying the tools in machine learning, social media, deep learning, and cognitive applications. All cloud systems are illustrated with big data and cognitive application examples.