Managing Data From Knowledge Bases: Querying and Extraction

Author: Wei Emma Zhang
Publisher: Springer
Total Pages: 148
Release: 2018-07-31
Genre: Computers
ISBN: 3319949357

In this book, the authors first address the research issues by providing a motivating scenario, followed by the exploration of the principles and techniques of the challenging topics. Then they solve the raised research issues by developing a series of methodologies. More specifically, the authors study query optimization and tackle query performance prediction for knowledge retrieval. They also handle unstructured data processing and data clustering for knowledge extraction. To optimize the queries issued through interfaces against knowledge bases, the authors propose a cache-based optimization layer between consumers and the querying interface to facilitate querying and address the latency issue. The cache depends on a novel learning method that considers the querying patterns in individuals’ historical queries without requiring knowledge of the systems backing the knowledge base. To predict query performance for appropriate query scheduling, the authors examine the queries’ structural and syntactical features and apply multiple widely adopted prediction models. Their feature modelling approach does not require knowledge of either the querying language or the system. To extract knowledge from unstructured Web sources, the authors examine two kinds of Web sources containing unstructured data: the source code from Web repositories and the posts in programming question-answering communities. They use natural language processing techniques to pre-process the source code and obtain the natural language elements. Then they apply traditional knowledge extraction techniques to extract knowledge. For the data from programming question-answering communities, the authors take a step towards building a programming knowledge base by starting with paraphrase identification problems and developing novel features to accurately identify duplicate posts.
For domain-specific knowledge extraction, the authors propose to use a clustering technique to separate knowledge into different groups. They focus on developing a new clustering algorithm that uses manifold constraints in the optimization task and achieves fast and accurate performance. For each model and approach presented in this dissertation, the authors have conducted extensive experiments to evaluate it using either public datasets or synthetic data they generated.

Integration, Provenance, and Temporal Queries for Large-Scale Knowledge Bases

Author: Shi Gao
Publisher:
Total Pages: 111
Release: 2016
Genre:
ISBN:

Knowledge bases that summarize web information in RDF triples deliver many benefits, including support for natural language question answering and powerful structured queries that extract encyclopedic knowledge via SPARQL. Large-scale knowledge bases grow rapidly in terms of scale and significance, and undergo frequent changes in both schema and content. Two critical problems have thus emerged: (i) how to support temporal queries that explore the history of knowledge bases or flash back to the past; (ii) how to integrate knowledge from different sources and improve the quality of the integrated knowledge base while preserving the provenance information. In this dissertation, we propose a framework that supports knowledge integration, temporal query evaluation and user-friendly interfaces for large-scale knowledge bases. Towards this goal, we make the following contributions: (i) We propose SPARQLT, a temporal extension of the structured query language SPARQL based on a point temporal model, which simplifies the expression of temporal joins and eliminates the need for temporal coalescing. This approach makes possible an end-user interface, HKB (Historical Knowledge Browser), where users can browse the evolution history of knowledge bases and express historical queries via simple by-example conditions in the infoboxes of Wikipedia pages. (ii) We have designed and implemented RDF-TX (RDF Temporal eXpress), an efficient system for managing temporal RDF data and evaluating SPARQLT queries. RDF-TX takes advantage of compressed Multiversion B+ trees to achieve fast evaluation of temporal queries. The experimental results demonstrate that our indexing and query optimization techniques deliver superior performance over other systems. (iii) We propose a framework for knowledge extraction and integration. We first introduce IBminer, a novel NLP-based system that derives knowledge bases from free text and preserves the provenance of extracted triples.
IBminer uses a deep NLP-based approach to extract subject-attribute-value triples from free text, and maps the attributes to those introduced in existing knowledge bases. Then we integrate public knowledge bases with the knowledge base generated by IBminer into one of superior quality and coverage, called IKBStore. User-friendly interfaces are provided to manage the knowledge in IKBStore while maintaining provenance information.

Entity-Oriented Search

Author: Krisztian Balog
Publisher: Springer
Total Pages: 358
Release: 2018-10-02
Genre: Computers
ISBN: 3319939351

This open access book covers all facets of entity-oriented search (where “search” can be interpreted in the broadest sense of information access) from a unified point of view, and provides a coherent and comprehensive overview of the state of the art. It represents the first synthesis of research in this broad and rapidly developing area. Selected topics are discussed in depth, the goal being to establish fundamental techniques and methods as a basis for future research and development. Additional topics are treated at a survey level only, containing numerous pointers to the relevant literature. A roadmap for future research, based on open issues and challenges identified along the way, rounds out the book. The book is divided into three main parts, sandwiched between introductory and concluding chapters. The first two chapters introduce readers to the basic concepts, provide an overview of entity-oriented search tasks, and present the various types and sources of data that will be used throughout the book. Part I deals with the core task of entity ranking: given a textual query, possibly enriched with additional elements or structural hints, return a ranked list of entities. This core task is examined in a number of different variants, using both structured and unstructured data collections, and numerous query formulations. In turn, Part II is devoted to the role of entities in bridging unstructured and structured data. Part III explores how entities can enable search engines to understand the concepts, meaning, and intent behind the query that the user enters into the search box, and how they can provide rich and focused responses (as opposed to merely a list of documents), a process known as semantic search. The final chapter concludes the book by discussing the limitations of current approaches, and suggesting directions for future research. Researchers and graduate students are the primary target audience of this book.
A general background in information retrieval is sufficient to follow the material, including an understanding of basic probability and statistics concepts as well as a basic knowledge of machine learning concepts and supervised learning algorithms.

Fuzzy Knowledge Management for the Semantic Web

Author: Zongmin Ma
Publisher: Springer
Total Pages: 282
Release: 2013-09-28
Genre: Technology & Engineering
ISBN: 3642392830

This book explores in depth the fast-growing topic of fuzzy logic technologies and approaches in the Semantic Web. The topics of this book include fuzzy description logics and fuzzy ontologies, queries of fuzzy description logics and fuzzy ontology knowledge bases, extraction of fuzzy description logics and ontologies from fuzzy data models, storage of fuzzy ontology knowledge bases in fuzzy databases, fuzzy Semantic Web ontology mapping, and fuzzy rules and their interchange in the Semantic Web. The book aims to provide a single record of current research in fuzzy knowledge representation and reasoning for the Semantic Web. The objective of the book is to provide state-of-the-art information to researchers, practitioners and graduate students of Web intelligence, and at the same time to serve knowledge and data engineering professionals faced with non-traditional applications that make the application of conventional approaches difficult or impossible.

Search Computing

Author: Stefano Ceri
Publisher: Springer
Total Pages: 265
Release: 2012-11-06
Genre: Computers
ISBN: 3642342132

Search computing, which has evolved from service computing, focuses on building the answers to complex search queries by interacting with a constellation of cooperating search services, using the ranking and joining of results as the dominant factors for service composition. The field is multi-disciplinary in nature and takes advantage of contributions from other research areas such as knowledge representation, human-computer interfaces, psychology, sociology, economics, and legal sciences. This book is the third in the Search Computing series and contains a collection of 16 papers, which in most cases were contributed to several workshops during 2011 organized by members of the Search Computing project in the context of major international conferences: ExploreWeb at ICWE 2011, Very Large Data Search and DBRank at VLDB 2011, DATAVIEW at ECOWS 2011, and OrdRing at ISWC 2011. The papers provide very useful insights on search computing problems and issues. The book has been divided into four parts focussing on: extraction and integration; query and visualization paradigms; exploring linked data; and games, social search and economics.

Research, Practice, and Educational Advancements in Telecommunications and Networking

Author: Bartolacci, Michael
Publisher: IGI Global
Total Pages: 341
Release: 2012-01-31
Genre: Technology & Engineering
ISBN: 1466600519

The study of telecommunications and networking allows us to understand existing modes of communication and information transfer while also developing new methods for managing, modeling, and regulating the exchange of information. Research, Practice, and Educational Advancements in Telecommunications and Networking offers multidisciplinary perspectives on architectures and systems for effective, efficient communication across different types of infrastructures, which include online and wireless networks. Collecting research on mobile ad hoc networks, VoIP, and mobile recommendation systems, this book provides theoretical discussions, as well as practical research on new and emerging developments in telecommunications and networking.

Knowledge Engineering and Knowledge Management

Author: Paolo Ciancarini
Publisher: Springer
Total Pages: 296
Release: 2017-05-17
Genre: Computers
ISBN: 3319586947

This book contains the best selected papers of two Satellite Events held at the 20th International Conference on Knowledge Engineering and Knowledge Management, EKAW 2016, in November 2016 in Bologna, Italy: The Second International Workshop on Educational Knowledge Management, EKM 2016, and the First Workshop: Detection, Representation and Management of Concept Drift in Linked Open Data, Drift-an-LOD 2016. The 6 revised full papers included in this volume were carefully reviewed and selected from the 13 full papers that were accepted for presentation at the conference from the initial 82 submissions. This volume also contains the 37 accepted contributions for the EKAW 2016 tutorials, demo and poster sessions, and the doctoral consortium. The special focus of this year's EKAW was "evolving knowledge", which concerns all aspects of the management and acquisition of knowledge representations of evolving, contextual, and local models. This includes change management, trend detection, model evolution, streaming data and stream reasoning, event processing, time- and space-dependent models, and contextual and local knowledge representations, with a special emphasis on the evolvability and localization of knowledge and the correct usage of these limits.

Intelligent Knowledge-Based Systems

Author: Cornelius T. Leondes
Publisher: Springer Science & Business Media
Total Pages: 2041
Release: 2010-04-28
Genre: Computers
ISBN: 1402078293

This five-volume set clearly manifests the great significance of these key technologies for the new economies of the new millennium. The discussions provide a wealth of practical ideas intended to foster innovation in thought and, consequently, in the further development of technology. Together, they comprise a significant and uniquely comprehensive reference source for research workers, practitioners, computer scientists, academics, students, and others on the international scene for years to come.

Medinfo 2007

Author: Klaus A. Kuhn
Publisher: IOS Press
Total Pages: 1532
Release: 2007
Genre: Electronic books
ISBN: 1586037749

The papers presented are refereed and come from all over the world. They reflect the breadth and depth of the field of biomedical and health informatics, covering topics such as health information systems, knowledge and data management, education, standards, consumer health and human factors, emerging technologies, sustainability, organizational and economic issues, genomics, and image and signal processing. As this volume carries such a wide collection, it will be of great interest to anyone engaged in biomedical and health informatics research and application.

Knowledge Management Systems for Business

Author: Robert J. Thierauf
Publisher: Bloomsbury Publishing USA
Total Pages: 376
Release: 1999-07-30
Genre: Computers
ISBN: 0313003718

Until now, business systems have focused on selected data within a certain context to produce information. A better approach, says Thierauf, is to take information accompanied by experience over time to generate knowledge. He demonstrates that knowledge management systems can be used as a source of power to outmaneuver business competitors. Knowledge discovery tools enable decision makers to extract the patterns, trends, and correlations that underlie the inner (and inter-) workings of a company. His book is the first comprehensive text to define this important new direction in computer technology and will be essential reading for MIS practitioners, systems analysts, and academics researching and teaching the theory and applications of knowledge management systems. Thierauf centers on leveraging a company's knowledge capital. Indeed, knowledge is power—the power to improve customer satisfaction, marketing and production methods, financial operations, and other functions. Thierauf shows how knowledge, when developed and renewed, can be applied to a company's functional areas and provide an important competitive advantage. By utilizing some form of internal and external computer networks and providing some type of knowledge discovery software that encapsulates usable knowledge, Thierauf shows how to create an infrastructure to capture knowledge, store it, improve it, clarify it, and disseminate it throughout the organization, then how to use it regularly. His book demonstrates clearly how knowledge management systems focus on making knowledge available to company employees in the right format, at the right time, and in the right place. The result is inevitably a higher order of intelligence in decision making, more so now than could ever have been possible in even the most recent past.