Cross-Lingual Word Embeddings

Cross-Lingual Word Embeddings
Author: Anders Søgaard
Publisher: Springer Nature
Total Pages: 120
Release: 2022-05-31
Genre: Computers
ISBN: 3031021711

The majority of natural language processing (NLP) is English language processing, and while there is good language technology support for (standard varieties of) English, support for Albanian, Burmese, or Cebuano--and most other languages--remains limited. Being able to bridge this digital divide is important for scientific and democratic reasons but also represents an enormous growth potential. A key challenge for this to happen is learning to align basic meaning-bearing units of different languages. In this book, the authors survey and discuss recent and historical work on supervised and unsupervised learning of such alignments. Specifically, the book focuses on so-called cross-lingual word embeddings. The survey is intended to be systematic, using consistent notation and putting the available methods on comparable form, making it easy to compare wildly different approaches. In so doing, the authors establish previously unreported relations between these methods and are able to present a fast-growing literature in a very compact way. Furthermore, the authors discuss how best to evaluate cross-lingual word embedding methods and survey the resources available for students and researchers interested in this topic.

Adjective Adverb Interfaces in Romance

Adjective Adverb Interfaces in Romance
Author: Martin Hummel
Publisher: John Benjamins Publishing Company
Total Pages: 382
Release: 2017-12-21
Genre: Language Arts & Disciplines
ISBN: 9027264872

Within the current discussion on grammatical interfaces, the word-classes of adjective and adverb are of particular interest because they appear to be separated or joined in manifold ways at the level of word-class or syntax, with morphology playing a prominent role, especially in Romance. The volume provides typological and theoretical insights into the common or different usage of adjectives and adverbs in Romance. Diachronic change is discussed alongside with synchronic variation and the representation in grammar. The discussion turns out to be controversial, calling into question traditional assumptions such as the dogma of the invariability and the categorial status of the adverb.

Dependency Parsing

Dependency Parsing
Author: Sandra Kübler
Publisher: Morgan & Claypool Publishers
Total Pages: 128
Release: 2009
Genre: Computers
ISBN: 1598295969

Dependency-based methods for syntactic parsing have become increasingly popular in natural language processing in recent years. This book gives a thorough introduction to the methods that are most widely used today. After an introduction to dependency grammar and dependency parsing, followed by a formal characterization of the dependency parsing problem, the book surveys the three major classes of parsing models that are in current use: transition-based, graph-based, and grammar-based models. It continues with a chapter on evaluation and one on the comparison of different methods, and it closes with a few words on current trends and future prospects of dependency parsing. The book presupposes a knowledge of basic concepts in linguistics and computer science, as well as some knowledge of parsing methods for constituency-based representations. Table of Contents: Introduction / Dependency Parsing / Transition-Based Parsing / Graph-Based Parsing / Grammar-Based Parsing / Evaluation / Comparison / Final Thoughts

Anaphora Resolution

Anaphora Resolution
Author: Massimo Poesio
Publisher: Springer
Total Pages: 509
Release: 2016-08-04
Genre: Computers
ISBN: 3662479095

This book lays out a path leading from the linguistic and cognitive basics, to classical rule-based and machine learning algorithms, to today’s state-of-the-art approaches, which use advanced empirically grounded techniques, automatic knowledge acquisition, and refined linguistic modeling to make a real difference in real-world applications. Anaphora and coreference resolution both refer to the process of linking textual phrases (and, consequently, the information attached to them) within as well as across sentence boundaries, and to the same discourse referent. The book offers an overview of recent research advances, focusing on practical, operational approaches and their applications. In part I (Background), it provides a general introduction, which succinctly summarizes the linguistic, cognitive, and computational foundations of anaphora processing and the key classical rule- and machine-learning-based anaphora resolution algorithms. Acknowledging the central importance of shared resources, part II (Resources) covers annotated corpora, formal evaluation, preprocessing technology, and off-the-shelf anaphora resolution systems. Part III (Algorithms) provides a thorough description of state-of-the-art anaphora resolution algorithms, covering enhanced machine learning methods as well as techniques for accomplishing important subtasks such as mention detection and acquisition of relevant knowledge. Part IV (Applications) deals with a selection of important anaphora and coreference resolution applications, discussing particular scenarios in diverse domains and distilling a best-practice model for systematically approaching new application cases. In the concluding part V (Outlook), based on a survey conducted among the contributing authors, the prospects of the research field of anaphora processing are discussed, and promising new areas of interdisciplinary cooperation and emerging application scenarios are identified. Given the book’s design, it can be used both as an accompanying text for advanced lectures in computational linguistics, natural language engineering, and computer science, and as a reference work for research and independent study. It addresses an audience that includes academic researchers, university lecturers, postgraduate students, advanced undergraduate students, industrial researchers, and software engineers.