Big Data Integration Theory

Big Data Integration Theory
Author: Zoran Majkić
Publisher: Springer Science & Business Media
Total Pages: 528
Release: 2014-01-23
Genre: Computers
ISBN: 3319041568

This book presents a novel approach to database concepts, describing a categorical logic for database schema mapping based on views, within a framework for database integration/exchange and peer-to-peer. Database mappings, database programming languages, and denotational and operational semantics are discussed in depth. An analysis method is also developed that combines techniques from second order logic, data modeling, co-algebras and functorial categorial semantics. Features: provides an introduction to logics, co-algebras, databases, schema mappings and category theory; describes the core concepts of big data integration theory, with examples; examines the properties of the DB category; defines the categorial RDB machine; presents full operational semantics for database mappings; discusses matching and merging operators for databases, universal algebra considerations and algebraic lattices of the databases; explores the relationship of the database weak monoidal topos w.r.t. intuitionistic logic.

Big Data Integration

Big Data Integration
Author: Xin Luna Dong
Publisher: Morgan & Claypool Publishers
Total Pages: 200
Release: 2015-02-01
Genre: Computers
ISBN: 1627052240

The big data era is upon us: data are being generated, analyzed, and used at an unprecedented scale, and data-driven decision making is sweeping through all aspects of society. Since the value of data explodes when it can be linked and fused with other data, addressing the big data integration (BDI) challenge is critical to realizing the promise of big data. BDI differs from traditional data integration along the dimensions of volume, velocity, variety, and veracity. First, not only can data sources contain a huge volume of data, but also the number of data sources is now in the millions. Second, because of the rate at which newly collected data are made available, many of the data sources are very dynamic, and the number of data sources is also rapidly exploding. Third, data sources are extremely heterogeneous in their structure and content, exhibiting considerable variety even for substantially similar entities. Fourth, the data sources are of widely differing qualities, with significant differences in the coverage, accuracy and timeliness of data provided. This book explores the progress that has been made by the data integration community on the topics of schema alignment, record linkage and data fusion in addressing these novel challenges faced by big data integration. Each of these topics is covered in a systematic way: first starting with a quick tour of the topic in the context of traditional data integration, followed by a detailed, example-driven exposition of recent innovative techniques that have been proposed to address the BDI challenges of volume, velocity, variety, and veracity. Finally, it presents merging topics and opportunities that are specific to BDI, identifying promising directions for the data integration community.

Big Data Integration

Big Data Integration
Author: Xin Luna Dong
Publisher: Springer Nature
Total Pages: 178
Release: 2022-05-31
Genre: Computers
ISBN: 3031018532

The big data era is upon us: data are being generated, analyzed, and used at an unprecedented scale, and data-driven decision making is sweeping through all aspects of society. Since the value of data explodes when it can be linked and fused with other data, addressing the big data integration (BDI) challenge is critical to realizing the promise of big data. BDI differs from traditional data integration along the dimensions of volume, velocity, variety, and veracity. First, not only can data sources contain a huge volume of data, but also the number of data sources is now in the millions. Second, because of the rate at which newly collected data are made available, many of the data sources are very dynamic, and the number of data sources is also rapidly exploding. Third, data sources are extremely heterogeneous in their structure and content, exhibiting considerable variety even for substantially similar entities. Fourth, the data sources are of widely differing qualities, with significant differences in the coverage, accuracy and timeliness of data provided. This book explores the progress that has been made by the data integration community on the topics of schema alignment, record linkage and data fusion in addressing these novel challenges faced by big data integration. Each of these topics is covered in a systematic way: first starting with a quick tour of the topic in the context of traditional data integration, followed by a detailed, example-driven exposition of recent innovative techniques that have been proposed to address the BDI challenges of volume, velocity, variety, and veracity. Finally, it presents merging topics and opportunities that are specific to BDI, identifying promising directions for the data integration community.

Knowledge Graphs and Big Data Processing

Knowledge Graphs and Big Data Processing
Author: Valentina Janev
Publisher: Springer Nature
Total Pages: 212
Release: 2020-07-15
Genre: Computers
ISBN: 3030531996

This open access book is part of the LAMBDA Project (Learning, Applying, Multiplying Big Data Analytics), funded by the European Union, GA No. 809965. Data Analytics involves applying algorithmic processes to derive insights. Nowadays it is used in many industries to allow organizations and companies to make better decisions as well as to verify or disprove existing theories or models. The term data analytics is often used interchangeably with intelligence, statistics, reasoning, data mining, knowledge discovery, and others. The goal of this book is to introduce some of the definitions, methods, tools, frameworks, and solutions for big data processing, starting from the process of information extraction and knowledge representation, via knowledge processing and analytics to visualization, sense-making, and practical applications. Each chapter in this book addresses some pertinent aspect of the data processing chain, with a specific focus on understanding Enterprise Knowledge Graphs, Semantic Big Data Architectures, and Smart Data Analytics solutions. This book is addressed to graduate students from technical disciplines, to professional audiences following continuous education short courses, and to researchers from diverse areas following self-study courses. Basic skills in computer science, mathematics, and statistics are required.

Springer Handbook of Science and Technology Indicators

Springer Handbook of Science and Technology Indicators
Author: Wolfgang Glänzel
Publisher: Springer Nature
Total Pages: 1126
Release: 2019-10-30
Genre: Science
ISBN: 303002511X

This handbook presents the state of the art of quantitative methods and models to understand and assess the science and technology system. Focusing on various aspects of the development and application of indicators derived from data on scholarly publications, patents and electronic communications, the individual chapters, written by leading experts, discuss theoretical and methodological issues, illustrate applications, highlight their policy context and relevance, and point to future research directions. A substantial portion of the book is dedicated to detailed descriptions and analyses of data sources, presenting both traditional and advanced approaches. It addresses the main bibliographic metrics and indexes, such as the journal impact factor and the h-index, as well as altmetric and webometric indicators and science mapping techniques on different levels of aggregation and in the context of their value for the assessment of research performance as well as their impact on research policy and society. It also presents and critically discusses various national research evaluation systems. Complementing the sections reflecting on the science system, the technology section includes multiple chapters that explain different aspects of patent statistics, patent classification and database search methods to retrieve patent-related information. In addition, it examines the relevance of trademarks and standards as additional technological indicators. The Springer Handbook of Science and Technology Indicators is an invaluable resource for practitioners, scientists and policy makers wanting a systematic and thorough analysis of the potential and limitations of the various approaches to assess research and research performance.

Big Data, Databases and "Ownership" Rights in the Cloud

Big Data, Databases and
Author: Marcelo Corrales Compagnucci
Publisher: Springer Nature
Total Pages: 324
Release: 2019-11-02
Genre: Law
ISBN: 9811503494

Two of the most important developments of this new century are the emergence of cloud computing and big data. However, the uncertainties surrounding the failure of cloud service providers to clearly assert ownership rights over data and databases during cloud computing transactions and big data services have been perceived as imposing legal risks and transaction costs. This lack of clear ownership rights is also seen as slowing down the capacity of the Internet market to thrive. Click-through agreements drafted on a take-it-or-leave-it basis govern the current state of the art, and they do not allow much room for negotiation. The novel contribution of this book proffers a new contractual model advocating the extension of the negotiation capabilities of cloud customers, thus enabling an automated and machine-readable framework, orchestrated by a cloud broker. Cloud computing and big data are constantly evolving and transforming into new paradigms where cloud brokers are predicted to play a vital role as innovation intermediaries adding extra value to the entire life cycle. This evolution will alleviate the legal uncertainties in society by means of embedding legal requirements in the user interface and related computer systems or its code. This book situates the theories of law and economics and behavioral law and economics in the context of cloud computing and takes database rights and ownership rights of data as prime examples to represent the problem of collecting, outsourcing, and sharing data and databases on a global scale. It does this by highlighting the legal constraints concerning ownership rights of data and databases and proposes finding a solution outside the boundaries and limitations of the law. By allowing cloud brokers to establish themselves in the market as entities coordinating and actively engaging in the negotiation of service-level agreements (SLAs), individual customers as well as small and medium-sized enterprises could efficiently and effortlessly choose a cloud provider that best suits their needs. This approach, which the author calls “plan-like architectures,” endeavors to create a more trustworthy cloud computing environment and to yield radical new results for the development of the cloud computing and big data markets.

Design, Operation and Evaluation of Mobile Communications

Design, Operation and Evaluation of Mobile Communications
Author: Gavriel Salvendy
Publisher: Springer Nature
Total Pages: 504
Release: 2022-06-16
Genre: Computers
ISBN: 3031050142

This book constitutes the refereed proceedings of the Third International Conference on Design, Operation and Evaluation of Mobile Communications, MOBILE 2022, held as part of the 23rd International Conference, HCI International 2022, which was held virtually in June/July 2022. The total of 1271 papers and 275 posters included in the HCII 2022 proceedings was carefully reviewed and selected from 5487 submissions. The MOBILE 2022 proceedings were organized in the following topical sections: Designing Mobile Interactions and Systems; User Experience and Adoption of Mobile Communications; Mobile Commerce and Advertising; Mobile Interactions with Agents; Emerging Mobile Technologies.

New Horizons for a Data-Driven Economy

New Horizons for a Data-Driven Economy
Author: José María Cavanillas
Publisher: Springer
Total Pages: 312
Release: 2016-04-04
Genre: Computers
ISBN: 3319215698

In this book readers will find technological discussions on the existing and emerging technologies across the different stages of the big data value chain. They will learn about legal aspects of big data, the social impact, and about education needs and requirements. And they will discover the business perspective and how big data technology can be exploited to deliver value within different sectors of the economy. The book is structured in four parts: Part I “The Big Data Opportunity” explores the value potential of big data with a particular focus on the European context. It also describes the legal, business and social dimensions that need to be addressed, and briefly introduces the European Commission’s BIG project. Part II “The Big Data Value Chain” details the complete big data lifecycle from a technical point of view, ranging from data acquisition, analysis, curation and storage, to data usage and exploitation. Next, Part III “Usage and Exploitation of Big Data” illustrates the value creation possibilities of big data applications in various sectors, including industry, healthcare, finance, energy, media and public services. Finally, Part IV “A Roadmap for Big Data Research” identifies and prioritizes the cross-sectorial requirements for big data research, and outlines the most urgent and challenging technological, economic, political and societal issues for big data in Europe. This compendium summarizes more than two years of work performed by a leading group of major European research centers and industries in the context of the BIG project. It brings together research findings, forecasts and estimates related to this challenging technological context that is becoming the major axis of the new digitally transformed business environment.

Big Data and Information Theory

Big Data and Information Theory
Author: Jiuping Xu
Publisher: Routledge
Total Pages: 128
Release: 2022-06-02
Genre: Business & Economics
ISBN: 1000591719

Big Data and Information Theory are a binding force between various areas of knowledge that allow for societal advancement. Rapid development of data analytic and information theory allows companies to store vast amounts of information about production, inventory, service, and consumer activities. More powerful CPUs and cloud computing make it possible to do complex optimization instead of using heuristic algorithms, as well as instant rather than offline decision-making. The era of "big data" challenges includes analysis, capture, curation, search, sharing, storage, transfer, visualization, and privacy violations. Big data calls for better integration of optimization, statistics, and data mining. In response to these challenges this book brings together leading researchers and engineers to exchange and share their experiences and research results about big data and information theory applications in various areas. This book covers a broad range of topics including statistics, data mining, data warehouse implementation, engineering management in large-scale infrastructure systems, data-driven sustainable supply chain network, information technology service offshoring project issues, online rumors governance, preliminary cost estimation, and information system project selection. The chapters in this book were originally published in the journal, International Journal of Management Science and Engineering Management.

New Technology, Big Data and the Law

New Technology, Big Data and the Law
Author: Marcelo Corrales
Publisher: Springer
Total Pages: 341
Release: 2017-09-04
Genre: Law
ISBN: 9811050384

This edited collection brings together a series of interdisciplinary contributions in the field of Information Technology Law. The topics addressed in this book cover a wide range of theoretical and practical legal issues that have been created by cutting-edge Internet technologies, primarily Big Data, the Internet of Things, and Cloud computing. Consideration is also given to more recent technological breakthroughs that are now used to assist, and — at times — substitute for, human work, such as automation, robots, sensors, and algorithms. The chapters presented in this edition address these issues from the perspective of different legal backgrounds. The first part of the book discusses some of the shortcomings that have prompted legislators to carry out reforms with regard to privacy, data protection, and data security. Notably, some of the complexities and salient points with regard to the new European General Data Protection Regulation (EU GDPR) and the new amendments to the Japan’s Personal Information Protection Act (PIPA) have been scrutinized. The second part looks at the vital role of Internet intermediaries (or brokers) for the proper functioning of the globalized electronic market and innovation technologies in general. The third part examines an electronic approach to evidence with an evaluation of how these technologies affect civil and criminal investigations. The authors also explore issues that have emerged in e-commerce, such as Bitcoin and its blockchain network effects. The book aims to explain, systemize and solve some of the lingering legal questions created by the disruptive technological change that characterizes the early twenty-first century.