Cassandra: The Definitive Guide

Cassandra: The Definitive Guide
Author: Jeff Carpenter
Publisher: "O'Reilly Media, Inc."
Total Pages: 369
Release: 2016-06-29
Genre: Computers
ISBN: 1491933631

Imagine what you could do if scalability wasn't a problem. With this hands-on guide, you’ll learn how the Cassandra database management system handles hundreds of terabytes of data while remaining highly available across multiple data centers. This expanded second edition—updated for Cassandra 3.0—provides the technical details and practical examples you need to put this database to work in a production environment. Authors Jeff Carpenter and Eben Hewitt demonstrate the advantages of Cassandra’s non-relational design, with special attention to data modeling. If you’re a developer, DBA, or application architect looking to solve a database scaling issue or future-proof your application, this guide helps you harness Cassandra’s speed and flexibility. Understand Cassandra’s distributed and decentralized structure Use the Cassandra Query Language (CQL) and cqlsh—the CQL shell Create a working data model and compare it with an equivalent relational model Develop sample applications using client drivers for languages including Java, Python, and Node.js Explore cluster topology and learn how nodes exchange data Maintain a high level of performance in your cluster Deploy Cassandra on site, in the Cloud, or with Docker Integrate Cassandra with Spark, Hadoop, Elasticsearch, Solr, and Lucene

The Data Warehouse Toolkit

The Data Warehouse Toolkit
Author: Ralph Kimball
Publisher: John Wiley & Sons
Total Pages: 464
Release: 2011-08-08
Genre: Computers
ISBN: 1118082141

This old edition was published in 2002. The current and final edition of this book is The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling, 3rd Edition which was published in 2013 under ISBN: 9781118530801. The authors begin with fundamental design recommendations and gradually progress step-by-step through increasingly complex scenarios. Clear-cut guidelines for designing dimensional models are illustrated using real-world data warehouse case studies drawn from a variety of business application areas and industries, including: Retail sales and e-commerce Inventory management Procurement Order management Customer relationship management (CRM) Human resources management Accounting Financial services Telecommunications and utilities Education Transportation Health care and insurance By the end of the book, you will have mastered the full range of powerful techniques for designing dimensional databases that are easy to understand and provide fast query response. You will also learn how to create an architected framework that integrates the distributed data warehouse using standardized dimensions and facts.

A Guide to the SQL Standard

A Guide to the SQL Standard
Author: C. J. Date
Publisher: Addison Wesley Publishing Company
Total Pages: 440
Release: 1993
Genre: Computers
ISBN:

A guide for users and designers of database systems. Outlines the inherent problems in the study, design, and implementation, and examines the background issues of priorities, administrative prerequisites, design concepts, database management systems, protocols, security, communication processes, and interactivity. Gives advice on developing corporate databases and management sytems. Non- technical, user-oriented text. No bibliography. Date provides a comprehensive treatment of standard SQL, with many worked examples while discussing some of the implications of the standard. Annotation copyrighted by Book News, Inc., Portland, OR

Semantic Modeling for Data

Semantic Modeling for Data
Author: Panos Alexopoulos
Publisher: "O'Reilly Media, Inc."
Total Pages: 332
Release: 2020-08-19
Genre: Computers
ISBN: 1492054224

What value does semantic data modeling offer? As an information architect or data science professional, let’s say you have an abundance of the right data and the technology to extract business gold—but you still fail. The reason? Bad data semantics. In this practical and comprehensive field guide, author Panos Alexopoulos takes you on an eye-opening journey through semantic data modeling as applied in the real world. You’ll learn how to master this craft to increase the usability and value of your data and applications. You’ll also explore the pitfalls to avoid and dilemmas to overcome for building high-quality and valuable semantic representations of data. Understand the fundamental concepts, phenomena, and processes related to semantic data modeling Examine the quirks and challenges of semantic data modeling and learn how to effectively leverage the available frameworks and tools Avoid mistakes and bad practices that can undermine your efforts to create good data models Learn about model development dilemmas, including representation, expressiveness and content, development, and governance Organize and execute semantic data initiatives in your organization, tackling technical, strategic, and organizational challenges

Spark: The Definitive Guide

Spark: The Definitive Guide
Author: Bill Chambers
Publisher: "O'Reilly Media, Inc."
Total Pages: 594
Release: 2018-02-08
Genre: Computers
ISBN: 1491912294

Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each with unique goals. Youâ??ll explore the basic operations and common functions of Sparkâ??s structured APIs, as well as Structured Streaming, a new high-level API for building end-to-end streaming applications. Developers and system administrators will learn the fundamentals of monitoring, tuning, and debugging Spark, and explore machine learning techniques and scenarios for employing MLlib, Sparkâ??s scalable machine-learning library. Get a gentle overview of big data and Spark Learn about DataFrames, SQL, and Datasetsâ??Sparkâ??s core APIsâ??through worked examples Dive into Sparkâ??s low-level APIs, RDDs, and execution of SQL and DataFrames Understand how Spark runs on a cluster Debug, monitor, and tune Spark clusters and applications Learn the power of Structured Streaming, Sparkâ??s stream-processing engine Learn how you can apply MLlib to a variety of problems, including classification or recommendation

UML Distilled

UML Distilled
Author: Martin Fowler
Publisher: Addison-Wesley Professional
Total Pages: 218
Release: 2018-08-30
Genre: Computers
ISBN: 013486512X

More than 300,000 developers have benefited from past editions of UML Distilled . This third edition is the best resource for quick, no-nonsense insights into understanding and using UML 2.0 and prior versions of the UML. Some readers will want to quickly get up to speed with the UML 2.0 and learn the essentials of the UML. Others will use this book as a handy, quick reference to the most common parts of the UML. The author delivers on both of these promises in a short, concise, and focused presentation. This book describes all the major UML diagram types, what they're used for, and the basic notation involved in creating and deciphering them. These diagrams include class, sequence, object, package, deployment, use case, state machine, activity, communication, composite structure, component, interaction overview, and timing diagrams. The examples are clear and the explanations cut to the fundamental design logic. Includes a quick reference to the most useful parts of the UML notation and a useful summary of diagram types that were added to the UML 2.0. If you are like most developers, you don't have time to keep up with all the new innovations in software engineering. This new edition of Fowler's classic work gets you acquainted with some of the best thinking about efficient object-oriented software design using the UML--in a convenient format that will be essential to anyone who designs software professionally.

Cassandra: The Definitive Guide

Cassandra: The Definitive Guide
Author: Jeff Carpenter
Publisher: "O'Reilly Media, Inc."
Total Pages: 446
Release: 2020-04-06
Genre: Computers
ISBN: 1098115112

Imagine what you could do if scalability wasn't a problem. With this hands-on guide, you’ll learn how the Cassandra database management system handles hundreds of terabytes of data while remaining highly available across multiple data centers. This third edition—updated for Cassandra 4.0—provides the technical details and practical examples you need to put this database to work in a production environment. Authors Jeff Carpenter and Eben Hewitt demonstrate the advantages of Cassandra’s nonrelational design, with special attention to data modeling. If you’re a developer, DBA, or application architect looking to solve a database scaling issue or future-proof your application, this guide helps you harness Cassandra’s speed and flexibility. Understand Cassandra’s distributed and decentralized structure Use the Cassandra Query Language (CQL) and cqlsh—the CQL shell Create a working data model and compare it with an equivalent relational model Develop sample applications using client drivers for languages including Java, Python, and Node.js Explore cluster topology and learn how nodes exchange data

Black & Decker The Complete Guide to Wiring, Updated 7th Edition

Black & Decker The Complete Guide to Wiring, Updated 7th Edition
Author: Editors of Cool Springs Press
Publisher: Cool Springs Press
Total Pages: 339
Release: 2017-12-01
Genre: House & Home
ISBN: 0760359598

BLACK+DECKER’s latest edition of the best-selling home-improvement book in North America for more than a decade is completely updated to conform to 2017–2020 national electrical codes. For more than ten years, BLACK+DECKER The Complete Guide to Wiring has been the best-selling home wiring manual in North America. Now with more than two million copies in print, this is the home reference book more DIYers choose than any other for many reasons. As the most current wiring book on the market, you can be confident that your projects will meet national wiring codes. You’ll also spend more time on your project and less time scratching your head thanks to more than 800 clear color photos and over 40 diagrams that show you exactly what you need to know about home electrical service; all the most common circuits, all the most-needed techniques, all the most essential tools and materials. The information in this book has been created and reviewed by professional electricians under the watchful eye of the experts at BLACK+DECKER. You can find plenty of articles and videos about wiring online or in other publications, but only The Complete Guide to Wiring has passed the rigorous test to make it part of The Best DIY Series from the Brand You Trust.

Presto: The Definitive Guide

Presto: The Definitive Guide
Author: Matt Fuller
Publisher: "O'Reilly Media, Inc."
Total Pages: 352
Release: 2020-04-03
Genre: Computers
ISBN: 1492044229

Perform fast interactive analytics against different data sources using the Presto high-performance, distributed SQL query engine. With this practical guide, you�?�¢??ll learn how to conduct analytics on data where it lives, whether it�?�¢??s Hive, Cassandra, a relational database, or a proprietary data store. Analysts, software engineers, and production engineers will learn how to manage, use, and even develop with Presto. Initially developed by Facebook, open source Presto is now used by Netflix, Airbnb, LinkedIn, Twitter, Uber, and many other companies. Matt Fuller, Manfred Moser, and Martin Traverso show you how a single Presto query can combine data from multiple sources to allow for analytics across your entire organization. Get started: Explore Presto�?�¢??s use cases and learn about tools that will help you connect to Presto and query data Go deeper: Learn Presto�?�¢??s internal workings, including how to connect to and query data sources with support for SQL statements, operators, functions, and more Put Presto in production: Secure Presto, monitor workloads, tune queries, and connect more applications; learn how other organizations apply Presto

Data Mesh

Data Mesh
Author: Zhamak Dehghani
Publisher: "O'Reilly Media, Inc."
Total Pages: 387
Release: 2022-03-08
Genre: Computers
ISBN: 1492092363

Many enterprises are investing in a next-generation data lake, hoping to democratize data at scale to provide business insights and ultimately make automated intelligent decisions. In this practical book, author Zhamak Dehghani reveals that, despite the time, money, and effort poured into them, data warehouses and data lakes fail when applied at the scale and speed of today's organizations. A distributed data mesh is a better choice. Dehghani guides architects, technical leaders, and decision makers on their journey from monolithic big data architecture to a sociotechnical paradigm that draws from modern distributed architecture. A data mesh considers domains as a first-class concern, applies platform thinking to create self-serve data infrastructure, treats data as a product, and introduces a federated and computational model of data governance. This book shows you why and how. Examine the current data landscape from the perspective of business and organizational needs, environmental challenges, and existing architectures Analyze the landscape's underlying characteristics and failure modes Get a complete introduction to data mesh principles and its constituents Learn how to design a data mesh architecture Move beyond a monolithic data lake to a distributed data mesh.