AI and Big Data on IBM Power Systems Servers

Author: Scott Vetter
Publisher: IBM Redbooks
Total Pages: 162
Release: 2019-04-10
Genre: Computers
ISBN: 0738457515

As big data becomes more ubiquitous, businesses are wondering how they can best leverage it to gain insight into their most important business questions. Using machine learning (ML) and deep learning (DL) in big data environments can identify historical patterns and build artificial intelligence (AI) models that can help businesses to improve customer experience, add services and offerings, identify new revenue streams or lines of business (LOBs), and optimize business or manufacturing operations. The power of AI for predictive analytics is being harnessed across all industries, so it is important that businesses familiarize themselves with all of the tools and techniques that are available for integration with their data lake environments. In this IBM® Redbooks® publication, we cover the best practices for deploying and integrating some of the best AI solutions on the market, including: IBM Watson Machine Learning Accelerator (see note for product naming), IBM Watson Studio Local, IBM Power Systems™, IBM Spectrum™ Scale, IBM Data Science Experience (IBM DSX), IBM Elastic Storage™ Server, Hortonworks Data Platform (HDP), Hortonworks DataFlow (HDF), and H2O Driverless AI. We map out all the integrations that are possible with our different AI solutions and how they can integrate with your existing or new data lake. We also walk you through some of our client use cases and show you how some of the industry leaders are using Hortonworks, IBM PowerAI, and IBM Watson Studio Local to drive decision making. We also advise you on your deployment options, when to use a GPU, and why you should use the IBM Elastic Storage Server (IBM ESS) to improve storage management. Lastly, we describe how to integrate IBM Watson Machine Learning Accelerator and Hortonworks with or without IBM Watson Studio Local, how to access real-time data, and how to approach security.
Note: IBM Watson Machine Learning Accelerator is the new product name for IBM PowerAI Enterprise.
Note: Hortonworks merged with Cloudera in January 2019. The new company is called Cloudera. References to Hortonworks as a business entity in this publication are now referring to the merged company. Product names beginning with Hortonworks continue to be marketed and sold under their original names.

Networking Design for HPC and AI on IBM Power Systems

Author: Scott Vetter
Publisher: IBM Redbooks
Total Pages: 62
Release: 2018-04-26
Genre: Computers
ISBN: 0738456837

This publication provides information about networking design for IBM® High Performance Computing (HPC) and AI for Power Systems™. This paper will help you understand the basic requirements when designing a solution, the components in an infrastructure for HPC and AI systems, the design of interconnect and data networks with use cases based on real-life scenarios, and the administration and out-of-band management networks. We cover all the necessary requirements, provide a good understanding of the technology, and include examples for small, medium, and large cluster environments. This paper is intended for IT architects, system designers, data center planners, and system administrators who must design or provide a solution for the infrastructure of an HPC cluster.

Red Hat OpenShift V4.3 on IBM Power Systems Reference Guide

Author: Dino Quintero
Publisher: IBM Redbooks
Total Pages: 128
Release: 2020-09-10
Genre: Computers
ISBN: 0738459070

This IBM® Redpaper publication describes how to deploy Red Hat OpenShift V4.3 on IBM Power Systems servers. This book presents reference architectures for deployment, initial sizing guidelines for server, storage, and IBM Cloud® Paks. Moreover, this publication delivers information about initial supported Power System configurations for Red Hat OpenShift V4.3 deployment (bare metal, IBM PowerVM® LE LPARs, and others). This book serves as a guide for how to deploy Red Hat OpenShift V4.3 and provides getting-started guidelines and recommended practices for implementing it on Power Systems and completing it with the supported IBM Cloud Paks. The publication addresses topics for developers, IT architects, IT specialists, sellers, and anyone who wants to implement Red Hat OpenShift V4.3 and IBM Cloud Paks on IBM Power Systems. This book also provides technical content to transfer how-to skills to the support teams, and solution guidance to the sales team. This book complements the documentation that is available at IBM Knowledge Center, and also aligns with the educational offerings that are provided by the IBM Systems Technical Education (SSE).

IBM PowerAI: Deep Learning Unleashed on IBM Power Systems Servers

Author: Dino Quintero
Publisher: IBM Redbooks
Total Pages: 278
Release: 2019-06-05
Genre: Computers
ISBN: 0738442941

This IBM® Redbooks® publication is a guide about the IBM PowerAI Deep Learning solution. This book provides an introduction to artificial intelligence (AI) and deep learning (DL), IBM PowerAI, and components of IBM PowerAI, deploying IBM PowerAI, guidelines for working with data and creating models, an introduction to IBM Spectrum™ Conductor Deep Learning Impact (DLI), and case scenarios. IBM PowerAI started as a package of software distributions of many of the major DL software frameworks for model training, such as TensorFlow, Caffe, Torch, Theano, and the associated libraries, such as CUDA Deep Neural Network (cuDNN). The IBM PowerAI software is optimized for performance by using the IBM Power Systems™ servers that are integrated with NVLink. The AI stack foundation starts with servers with accelerators. Graphics processing unit (GPU) accelerators are well-suited for the compute-intensive nature of DL training, and servers with the highest CPU to GPU bandwidth, such as IBM Power Systems servers, enable the high-performance data transfer that is required for larger and more complex DL models. This publication targets technical readers, including developers, IT specialists, systems architects, brand specialists, sales teams, and anyone looking for a guide about how to understand the IBM PowerAI Deep Learning architecture, framework configuration, application and workload configuration, and user infrastructure.

IBM High-Performance Computing Insights with IBM Power System AC922 Clustered Solution

Author: Dino Quintero
Publisher: IBM Redbooks
Total Pages: 352
Release: 2019-05-02
Genre: Computers
ISBN: 0738457450

This IBM® Redbooks® publication documents and addresses topics to set up a complete infrastructure environment and tune the applications to use an IBM POWER9™ hardware architecture with the technical computing software stack. This publication is driven by a CORAL project solution. It explores, tests, and documents how to implement an IBM High-Performance Computing (HPC) solution on a POWER9 processor-based system by using IBM technical innovations to help solve challenging scientific, technical, and business problems. This book documents the HPC clustering solution with InfiniBand on IBM Power Systems™ AC922 8335-GTH and 8335-GTX servers with NVIDIA Tesla V100 SXM2 graphics processing units (GPUs) with NVLink, software components, and the IBM Spectrum™ Scale parallel file system. This solution includes recommendations about the components that are used to provide a cohesive clustering environment that includes job scheduling, parallel application tools, scalable file systems, administration tools, and a high-speed interconnect. This book is divided into three parts: Part 1 focuses on the planners of the solution, Part 2 focuses on the administrators, and Part 3 focuses on the developers. This book targets technical professionals (consultants, technical support staff, IT architects, and IT specialists) who are responsible for delivering cost-effective HPC solutions that help uncover insights among clients' data so that they can act to optimize business results, product development, and scientific discoveries.

Enterprise Data Warehouse Optimization with Hadoop on IBM Power Systems Servers

Author: Scott Vetter
Publisher: IBM Redbooks
Total Pages: 82
Release: 2018-01-31
Genre: Computers
ISBN: 0738456608

Data warehouses were developed for many good reasons, such as providing quick query and reporting for business operations, and business performance. However, over the years, due to the explosion of applications and data volume, many existing data warehouses have become difficult to manage. Extract, Transform, and Load (ETL) processes are taking longer, missing their allocated batch windows. In addition, data types that are required for business analysis have expanded from structured data to unstructured data. The Apache open source Hadoop platform provides a great alternative for solving these problems. IBM® has committed to open source since the early years of open Linux. IBM and Hortonworks together are committed to Apache open source software more than any other company. IBM Power Systems™ servers are built with open technologies and are designed for mission-critical data applications. Power Systems servers use technology from the OpenPOWER Foundation, an open technology infrastructure that uses the IBM POWER® architecture to help meet the evolving needs of big data applications. The combination of Power Systems with Hortonworks Data Platform (HDP) provides users with a highly efficient platform that provides leadership performance for big data workloads such as Hadoop and Spark. This IBM Redpaper™ publication provides details about Enterprise Data Warehouse (EDW) optimization with Hadoop on Power Systems. Many people know Power Systems from the IBM AIX® platform, but might not be familiar with IBM PowerLinux™, so part of this paper provides a Power Systems overview. A quick introduction to Hadoop is provided for those not familiar with the topic. Details of HDP on Power Reference architecture are included that will help both software architects and infrastructure architects understand the design.
In the optimization chapter, we describe various topics: traditional EDW offload, sizing guidelines, performance tuning, IBM Elastic Storage™ Server (ESS) for data-intensive workload, IBM Big SQL as the common structured query language (SQL) engine for Hadoop platform, and tools that are available on Power Systems that are related to EDW optimization. We also dedicate some pages to the analytics components (IBM Data Science Experience (IBM DSX) and IBM Spectrum™ Conductor for Spark workload) for the Hadoop infrastructure.

Deployment and Usage Guide for Running AI Workloads on Red Hat OpenShift and NVIDIA DGX Systems with IBM Spectrum Scale

Author: Simon Lorenz
Publisher: IBM Redbooks
Total Pages: 80
Release: 2020-11-30
Genre: Computers
ISBN: 0738459097

This IBM® Redpaper publication describes the architecture, installation procedure, and results for running a typical training application that works on an automotive data set in an orchestrated and secured environment that provides horizontal scalability of GPU resources across physical node boundaries for deep neural network (DNN) workloads. This paper is mostly relevant for systems engineers, system administrators, or system architects who are responsible for data center infrastructure management and typical day-to-day operations such as system monitoring, operational control, asset management, and security audits. This paper also describes IBM Spectrum® LSF® as a workload manager and IBM Spectrum Discover as a metadata search engine to find the right data for an inference job and automate the data science workflow. With the help of this solution, the data location, which may be on different storage systems, and time of availability for the AI job can be fully abstracted, which provides valuable information for data scientists.

IBM Power Systems LC921 and LC922: Technical Overview and Introduction

Author: Scott Vetter
Publisher: IBM Redbooks
Total Pages: 68
Release: 2019-12-10
Genre: Computers
ISBN: 0738457140

This IBM® Redpaper™ publication is a comprehensive guide that covers the IBM Power Systems™ LC921 and LC922 (9006-12P and 9006-22P) servers that use the current IBM POWER9™ processor-based technology and support Linux operating systems (OSes). The objective of this paper is to introduce the offerings and their capacities and available features. These new Linux scale-out systems provide differentiated performance, scalability, and low acquisition cost, and include the following features: Superior throughput and performance for high-value Linux workloads. Low acquisition cost through system optimization (industry-standard memory and industry-standard three-year warranty). Rich I/O options in the system unit. There are 12 large form factor (LFF)/small form factor (SFF) bays for 12 SAS/SATA hard disk drives (HDDs) or solid-state drives (SSDs), and four bays that are available for Non-Volatile Memory Express (NVMe) Gen3 adapters. Includes Trusted Platform Module (TPM) 2.0 Nuvoton NPCT650ABAWX through I2C (for secure boot and trusted boot). Integrated MicroSemi PM8069 SAS/SATA 16-port Internal Storage Controller Peripheral Component Interconnect Express (PCIe) 3.0 x8 with RAID 0, 1, 5, and 10 support (no write cache). Integrated Intel XL710 Quad Port 10 GBase-T PCIe 3.0 x8 UIO built-in local area network (LAN) (one shared management port). Dedicated 1 Gb Intelligent Platform Management Interface (IPMI) port. This publication is for professionals who want to acquire a better understanding of IBM Power Systems products. The intended audience includes clients, sales and marketing professionals, technical support professionals, IBM Business Partners, and independent software vendors (ISVs).

Cognitive Computing Featuring the IBM Power System AC922

Author: Scott Vetter
Publisher: IBM Redbooks
Total Pages: 58
Release: 2019-10-17
Genre: Computers
ISBN: 0738458112

This IBM® Redpaper publication describes the advantages of using IBM Power System AC922 for cognitive solutions, and how it can enhance clients' businesses. To optimize the hardware and software, IBM partners with NVIDIA, Mellanox, H2O.ai, SQream, Kinetica, and other prominent companies to design the Power AC922 server, specifically enhanced for the cognitive era. Most of its outstanding hardware features, such as NVIDIA NVLink 2.0 and PCIe 4.0, are described in this publication to illustrate the advantages that clients can realize in comparison with IBM competitors. We also include a brief description of what cognitive computing is, and how to use IBM Watson® Machine Learning cognitive solutions to bring more value to your business ecosystem. Additionally, we include performance charts that show the advantages of using Power AC922 versus x86 competitors. In the last chapter, we describe the most remarkable use cases in which IBM solves real problems using cognitive solutions. This IBM Redpaper publication is aimed at IT technical audiences, especially decision makers who need a full view of the benefits and improvements that an IBM Cognitive Solution can offer. It also provides valuable information to data science professionals, enabling them to plan their modeling needs. Finally, it offers information to the infrastructure support group in charge of maintaining the solution.

IBM Cloud Pak for Data with IBM Spectrum Scale Container Native

Author: Gero Schmidt
Publisher: IBM Redbooks
Total Pages: 120
Release: 2021-12-17
Genre: Computers
ISBN: 0738460095

This IBM® Redpaper® publication describes configuration guidelines and best practices when IBM Spectrum® Scale Container Native Storage Access is used as a storage provider for IBM Cloud® Pak for Data on Red Hat OpenShift Container Platform. It also provides the steps to install IBM Db2® and several assemblies within IBM Cloud Pak® for Data, including Watson Knowledge Catalog, Watson Studio, IBM DataStage®, Db2 Warehouse, Watson Machine Learning, Watson OpenScale, Data Virtualization, Data Management Console, and Apache Spark. This IBM Redpaper publication was written for IT architects, IT specialists, developers, and others who are interested in installing IBM Cloud Pak for Data with IBM Spectrum Scale Container Native.