Digital Libraries: The Era of Big Data and Data Science

Author: Michelangelo Ceci
Publisher: Springer Nature
Total Pages: 189
Release: 2020-01-22
Genre: Computers
ISBN: 3030399052

This book constitutes the thoroughly refereed proceedings of the 16th Italian Research Conference on Digital Libraries, IRCDL 2020, held in Bari, Italy, in January 2020. The 12 full papers and 6 short papers presented were carefully selected from 26 submissions. The papers are organized in topical sections on information retrieval; big data and data science in DL; cultural heritage; open science.

Digital Libraries: The Era of Big Data and Data Science

Author: Michelangelo Ceci
Publisher: Springer
Total Pages: 189
Release: 2020-01-22
Genre: Computers
ISBN: 9783030399047

This book constitutes the thoroughly refereed proceedings of the 16th Italian Research Conference on Digital Libraries, IRCDL 2020, held in Bari, Italy, in January 2020. The 12 full papers and 6 short papers presented were carefully selected from 26 submissions. The papers are organized in topical sections on information retrieval; big data and data science in DL; cultural heritage; open science.

Big Data Shocks

Author: Andrew Weiss
Publisher: Rowman & Littlefield
Total Pages: 219
Release: 2018-03-15
Genre: Language Arts & Disciplines
ISBN: 1538103249

"Big data," as it has become known in business and information technology circles, has the potential to improve our knowledge about human behavior, and to help us gain insight into the ways in which we organize ourselves, our cultures, and our external and internal lives. Libraries stand at the center of the information world, both facilitating and contributing to this flood as well as helping to shape and channel it to specific purposes. But all technologies come with a price. Where the tool can serve a purpose, it can also change the user's behavior to fit the purposes of the tool. Big Data Shocks: An Introduction to Big Data for Librarians and Information Professionals examines the roots of big data, the current climate, and the rising stars in this world. The book explores the issues raised by big data and discusses theoretical as well as practical approaches to managing information whose scope exists beyond the human scale. What’s at stake ultimately is the privacy of the people who support and use our libraries and the temptation for us to examine their behaviors. Such tension lies deep in the heart of our great library institutions. This book addresses these issues and many of the questions that arise from them, including: What is our role as librarians within this new era of big data? What are the impacts of new powerful technologies that track and analyze our behavior? Do data aggregators know more about us and our patrons than we do? How can librarians ethically balance the need to demonstrate learning and knowledge creation with privacy? Do we become less private merely because we use a tool or is it because the tool has changed us? What's in store for us with the internet of things combining with data mining techniques? All of these questions and more are explored in this book.

Big Data

Author: Viktor Mayer-Schönberger
Publisher: Houghton Mifflin Harcourt
Total Pages: 257
Release: 2013
Genre: Business & Economics
ISBN: 0544002695

An exploration of the latest trend in technology and the impact it will have on the economy, science, and society at large.

Data Science for Librarians

Author: Yunfei Du, Hammad Rauf Khan
Publisher: Bloomsbury Publishing USA
Total Pages: 169
Release: 2020-03-26
Genre: Language Arts & Disciplines
ISBN:

This unique textbook intersects traditional library science with data science principles that readers will find useful in implementing or improving data services within their libraries. Data Science for Librarians introduces data science to students and practitioners in library services. Writing for academic, public, and school library managers; library science students; and library and information science educators, authors Yunfei Du and Hammad Rauf Khan provide a thorough overview of conceptual and practical tools for data librarian practice. Partially due to how quickly data science evolves, libraries have yet to recognize core competencies and skills required to perform the job duties of a data librarian. As society transitions from the information age into the era of big data, librarians and information professionals require new knowledge and skills to stay current and take on new job roles, such as data librarianship. Such skills as data curation, research data management, statistical analysis, business analytics, visualization, smart city data, and learning analytics are relevant in library services today and will become increasingly so in the near future. This text serves as a tool for library and information science students and educators working on data science curriculum design.

Big Data Applications for Improving Library Services

Author: Dhamdhere, Sangeeta Namdev
Publisher: IGI Global
Total Pages: 211
Release: 2020-09-25
Genre: Language Arts & Disciplines
ISBN: 1799830519

Today, libraries must provide various web-based services, social media, and internet access to patrons in order to adequately support their information needs. In addition to these services, the maintenance of online literature, databases, data sets, and archives requires librarians to handle huge amounts of data each day. Big data can support quality improvement and problem solving to improve library services and can help librarians provide up-to-date and innovative real-time services to library users. Big Data Applications for Improving Library Services is an essential scholarly publication that examines the implications and applications of big data analytics on services provided by libraries. Highlighting a wide range of topics such as data analytics, mobile technologies, and web-based services, this book is ideal for librarians, knowledge managers, data scientists, data analysts, cataloguers, academicians, IT professionals, researchers, and students.

Data Science and Big Data Analytics

Author: EMC Education Services
Publisher: John Wiley & Sons
Total Pages: 432
Release: 2014-12-19
Genre: Computers
ISBN: 1118876229

Data Science and Big Data Analytics is about harnessing the power of data for new insights. The book covers the breadth of activities, methods, and tools that data scientists use. The content focuses on concepts, principles, and practical applications that are applicable to any industry and technology environment, and the learning is supported and explained with examples that you can replicate using open-source software. This book will help you: become a contributor on a data science team; deploy a structured lifecycle approach to data analytics problems; apply appropriate analytic techniques and tools to analyzing big data; learn how to tell a compelling story with data to drive business action; and prepare for EMC Proven Professional Data Science Certification. Get started discovering, analyzing, visualizing, and presenting data in a meaningful way today!

Big Data, Little Data, No Data

Author: Christine L. Borgman
Publisher: MIT Press
Total Pages: 411
Release: 2017-02-03
Genre: Language Arts & Disciplines
ISBN: 0262529912

An examination of the uses of data within a changing knowledge infrastructure, offering analysis and case studies from the sciences, social sciences, and humanities. “Big Data” is on the covers of Science, Nature, the Economist, and Wired magazines, on the front pages of the Wall Street Journal and the New York Times. But despite the media hyperbole, as Christine Borgman points out in this examination of data and scholarly research, having the right data is usually better than having more data; little data can be just as valuable as big data. In many cases, there are no data—because relevant data don't exist, cannot be found, or are not available. Moreover, data sharing is difficult, incentives to do so are minimal, and data practices vary widely across disciplines. Borgman, an often-cited authority on scholarly communication, argues that data have no value or meaning in isolation; they exist within a knowledge infrastructure—an ecology of people, practices, technologies, institutions, material objects, and relationships. After laying out the premises of her investigation—six “provocations” meant to inspire discussion about the uses of data in scholarship—Borgman offers case studies of data practices in the sciences, the social sciences, and the humanities, and then considers the implications of her findings for scholarly practice and research policy. To manage and exploit data over the long term, Borgman argues, requires massive investment in knowledge infrastructures; at stake is the future of scholarship.

Big Data Science & Analytics

Author: Arshdeep Bahga
Publisher: Vpt
Total Pages: 544
Release: 2016-04-15
Genre:
ISBN: 9780996025539

We are living at the dawn of what has been termed the "Fourth Industrial Revolution," which is marked by the emergence of "cyber-physical systems" where software interfaces seamlessly over networks with physical systems, such as sensors, smartphones, vehicles, power grids or buildings, to create a new world of the Internet of Things (IoT). Data and information are the fuel of this new age, where powerful analytics algorithms burn this fuel to generate decisions that are expected to create a smarter and more efficient world for all of us to live in. This new area of technology has been defined as Big Data Science and Analytics, and the industrial and academic communities are recognizing it as a competitive technology that can generate significant new wealth and opportunity. Big data is defined as collections of datasets whose volume, velocity or variety is so large that it is difficult to store, manage, process and analyze the data using traditional databases and data processing tools. Big data science and analytics deals with the collection, storage, processing and analysis of massive-scale data. Industry surveys, by Gartner and e-Skills, for instance, predict that there will be over 2 million job openings for engineers and scientists trained in the area of data science and analytics alone, and that the job market in this area is growing at a 150 percent year-over-year rate. We have written this textbook, as part of our expanding "A Hands-On Approach"(TM) series, to meet this need at colleges and universities, and also for big data service providers who may be interested in offering a broader perspective of this emerging field to accompany their customer and developer training programs. The typical reader is expected to have completed a couple of courses in programming using traditional high-level languages at the college level, and is either a senior or a beginning graduate student in one of the science, technology, engineering or mathematics (STEM) fields.
An accompanying website for this book contains additional support for instruction and learning (www.big-data-analytics-book.com). The book is organized into three main parts, comprising a total of twelve chapters. Part I provides an introduction to big data, applications of big data, and big data science and analytics patterns and architectures. A novel data science and analytics application system design methodology is proposed and its realization through the use of open-source big data frameworks is described. This methodology describes big data analytics applications as realizations of the proposed Alpha, Beta, Gamma and Delta models, which comprise tools and frameworks for collecting and ingesting data from various sources into the big data analytics infrastructure, distributed filesystems and non-relational (NoSQL) databases for data storage, and processing frameworks for batch and real-time analytics. This new methodology forms the pedagogical foundation of this book. Part II introduces the reader to various tools and frameworks for big data analytics, and the architectural and programming aspects of these frameworks, with examples in Python. We describe Publish-Subscribe messaging frameworks (Kafka & Kinesis), Source-Sink connectors (Flume), Database Connectors (Sqoop), Messaging Queues (RabbitMQ, ZeroMQ, RestMQ, Amazon SQS) and custom REST, WebSocket and MQTT-based connectors. The reader is introduced to data storage, batch and real-time analysis, and interactive querying frameworks including HDFS, Hadoop, MapReduce, YARN, Pig, Oozie, Spark, Solr, HBase, Storm, Spark Streaming, Spark SQL, Hive, Amazon Redshift and Google BigQuery. Also described are serving databases (MySQL, Amazon DynamoDB, Cassandra, MongoDB) and the Django Python web framework. Part III introduces the reader to various machine learning algorithms with examples using the Spark MLlib and H2O frameworks, and visualizations using frameworks such as Lightning, Pygal and Seaborn.

Encyclopedia of Data Science and Machine Learning

Author: Wang, John
Publisher: IGI Global
Total Pages: 3296
Release: 2023-01-20
Genre: Computers
ISBN: 1799892212

Big data and machine learning are driving the Fourth Industrial Revolution. With the age of big data upon us, we risk drowning in a flood of digital data. Big data has now become a critical part of both the business world and daily life, as the synthesis and synergy of machine learning and big data have enormous potential. Big data and machine learning are projected to not only maximize citizen wealth, but also promote societal health. As big data continues to evolve and the demand for professionals in the field increases, access to the most current information about the concepts, issues, trends, and technologies in this interdisciplinary area is needed. The Encyclopedia of Data Science and Machine Learning examines current, state-of-the-art research in the areas of data science, machine learning, data mining, and more. It provides an international forum for experts within these fields to advance the knowledge and practice in all facets of big data and machine learning, emphasizing emerging theories, principles, models, processes, and applications to inspire and circulate innovative findings into research, business, and communities. Covering topics such as benefit management, recommendation system analysis, and global software development, this expansive reference provides a dynamic resource for data scientists, data analysts, computer scientists, technical managers, corporate executives, students and educators of higher education, government officials, researchers, and academicians.