Multiword expressions

Multiword expressions
Author: Manfred Sailer
Publisher: Language Science Press
Total Pages: 376
Release: 2018
Genre: Bilingualism
ISBN: 3961100632

Multiword expressions (MWEs) are a challenge for both the natural language applications and the linguistic theory because they often defy the application of the machinery developed for free combinations where the default is that the meaning of an utterance can be predicted from its structure. There is a rich body of primarily descriptive work on MWEs for many European languages but comparative work is little. The volume brings together MWE experts to explore the benefits of a multilingual perspective on MWEs. The ten contributions in this volume look at MWEs in Bulgarian, English, French, German, Maori, Modern Greek, Romanian, Serbian, and Spanish. They discuss prominent issues in MWE research such as classification of MWEs, their formal grammatical modeling, and the description of individual MWE types from the point of view of different theoretical frameworks, such as Dependency Grammar, Generative Grammar, Head-driven Phrase Structure Grammar, Lexical Functional Grammar, Lexicon Grammar.

Essential Speech and Language Technology for Dutch

Essential Speech and Language Technology for Dutch
Author: Peter Spyns
Publisher: Springer Science & Business Media
Total Pages: 414
Release: 2013-02-26
Genre: Language Arts & Disciplines
ISBN: 3642309100

The book provides an overview of more than a decade of joint R&D efforts in the Low Countries on HLT for Dutch. It not only presents the state of the art of HLT for Dutch in the areas covered, but, even more importantly, a description of the resources (data and tools) for Dutch that have been created are now available for both academia and industry worldwide. The contributions cover many areas of human language technology (for Dutch): corpus collection (including IPR issues) and building (in particular one corpus aiming at a collection of 500M word tokens), lexicology, anaphora resolution, a semantic network, parsing technology, speech recognition, machine translation, text (summaries) generation, web mining, information extraction, and text to speech to name the most important ones. The book also shows how a medium-sized language community (spanning two territories) can create a digital language infrastructure (resources, tools, etc.) as a basis for subsequent R&D. At the same time, it bundles contributions of almost all the HLT research groups in Flanders and the Netherlands, hence offers a view of their recent research activities. Targeted readers are mainly researchers in human language technology, in particular those focusing on Dutch. It concerns researchers active in larger networks such as the CLARIN, META-NET, FLaReNet and participating in conferences such as ACL, EACL, NAACL, COLING, RANLP, CICling, LREC, CLIN and DIR ( both in the Low Countries), InterSpeech, ASRU, ICASSP, ISCA, EUSIPCO, CLEF, TREC, etc. In addition, some chapters are interesting for human language technology policy makers and even for science policy makers in general.

Complex Lexical Units

Complex Lexical Units
Author: Barbara Schlücker
Publisher: Walter de Gruyter GmbH & Co KG
Total Pages: 366
Release: 2019-01-14
Genre: Language Arts & Disciplines
ISBN: 3110632535

Both compounds and multi-word expressions are complex lexical units, made up of at least two constituents. The most basic difference is that the former are morphological objects and the latter result from syntactic processes. However, the exact demarcation between compounds and multi-word expressions differs greatly from language to language and is often a matter of debate in and across languages. Similarly debated is whether and how these two different kinds of units complement or compete with each other. The volume presents an overview of compounds and multi-word expressions in a variety of European languages. Central questions that are discussed for each language concern the formal distinction between compounds and multi-word expressions, their formation and their status in lexicon and grammar. The volume contains chapters on German, English, Dutch, French, Italian, Spanish, Greek, Russian, Polish, Finnish, and Hungarian as well as a contrastive overview with a focus on German. It brings together insights from word-formation theory, phraseology and theory of grammar and aims to contribute to the understanding of the lexicon, both from a language-specific and cross-linguistic perspective.

Multiword expressions in lexical resources

Multiword expressions in lexical resources
Author: Voula Giouli
Publisher: Language Science Press
Total Pages: 372
Release: 2024-06-17
Genre: Language Arts & Disciplines
ISBN: 3961104700

This volume contains chapters that paint the current landscape of the multiword expressions (MWE) representation in lexical resources, in view of their robust identification and computational processing. Both large-size general lexica and smaller MWE-centred ones are included, with special focus on the representation decisions and mechanisms that facilitate their usage in Natural Language Processing tasks. The presentations go beyond the morpho-syntactic description of MWEs, into their semantics. One challenge in representing MWEs in lexical resources is ensuring that the variability along with extra features required by the different types of MWEs can be captured efficiently. In this respect, recommendations for representing MWEs in mono- and multilingual computational lexicons have been proposed; these focus mainly on the syntactic and semantic properties of support verbs and noun compounds and their proper encoding thereof.

The role of constituents in multiword expressions

The role of constituents in multiword expressions
Author: Sabine Schulte im Walde
Publisher: Language Science Press
Total Pages: 238
Release: 2020
Genre: Language Arts & Disciplines
ISBN: 3961101841

Multiword expressions (MWEs), such as noun compounds (e.g. nickname in English, and Ohrwurm in German), complex verbs (e.g. give up in English, and aufgeben in German) and idioms (e.g. break the ice in English, and das Eis brechen in German), may be interpreted literally but often undergo meaning shifts with respect to their constituents. Theoretical, psycholinguistic as well as computational linguistic research remain puzzled by when and how MWEs receive literal vs. meaning-shifted interpretations, what the contributions of the MWE constituents are to the degree of semantic transparency (i.e., meaning compositionality) of the MWE, and how literal vs. meaning-shifted MWEs are processed and computed. This edited volume presents an interdisciplinary selection of seven papers on recent findings across linguistic, psycholinguistic, corpus-based and computational research fields and perspectives, discussing the interaction of constituent properties and MWE meanings, and how MWE constituents contribute to the processing and representation of MWEs. The collection is based on a workshop at the 2017 annual conference of the German Linguistic Society (DGfS) that took place at Saarland University in Saarbrücken, Germany

Analogical classification in formal grammar

Analogical classification in formal grammar
Author: Matías Guzmán Naranjo
Publisher: Language Science Press
Total Pages: 256
Release: 2019
Genre: Language Arts & Disciplines
ISBN: 3961101868

The organization of the lexicon, and especially the relations between groups of lexemes is a strongly debated topic in linguistics. Some authors have insisted on the lack of any structure of the lexicon. In this vein, Di Sciullo & Williams (1987: 3) claim that “[t]he lexicon is like a prison – it contains only the lawless, and the only thing that its inmates have in commonis lawlessness”. In the alternative view, the lexicon is assumed to have a rich structure that captures all regularities and partial regularities that exist between lexical entries.Two very different schools of linguistics have insisted on the organization of the lexicon. On the one hand, for theories like HPSG (Pollard & Sag 1994), but also some versions of construction grammar (Fillmore & Kay 1995), the lexicon is assumed to have a very rich structure which captures common grammatical properties between its members. In this approach, a type hierarchy organizes the lexicon according to common properties between items. For example, Koenig (1999: 4, among others), working from an HPSG perspective, claims that the lexicon “provides a unified model for partial regularties, medium-size generalizations, and truly productive processes”. On the other hand, from the perspective of usage-based linguistics, several authors have drawn attention to the fact that lexemes which share morphological or syntactic properties, tend to be organized in clusters of surface (phonological or semantic) similarity (Bybee & Slobin 1982; Skousen 1989; Eddington 1996). This approach, often called analogical, has developed highly accurate computational and non-computational models that can predict the classes to which lexemes belong. Like the organization of lexemes in type hierarchies, analogical relations between items help speakers to make sense of intricate systems, and reduce apparent complexity (Köpcke & Zubin 1984). Despite this core commonality, and despite the fact that most linguists seem to agree that analogy plays an important role in language, there has been remarkably little work on bringing together these two approaches. Formal grammar traditions have been very successful in capturing grammatical behaviour, but, in the process, have downplayed the role analogy plays in linguistics (Anderson 2015). In this work, I aim to change this state of affairs. First, by providing an explicit formalization of how analogy interacts with grammar, and second, by showing that analogical effects and relations closely mirror the structures in the lexicon. I will show that both formal grammar approaches, and usage-based analogical models, capture mutually compatible relations in the lexicon.

Semantic Relations and the Lexicon

Semantic Relations and the Lexicon
Author: M. Lynne Murphy
Publisher: Cambridge University Press
Total Pages: 306
Release: 2003-10-02
Genre: Language Arts & Disciplines
ISBN: 1139437453

Semantic Relations and the Lexicon explores the many paradigmatic semantic relations between words, such as synonymy, antonymy and hyponymy, and their relevance to the mental organization of our vocabularies. Drawing on a century's research in linguistics, psychology, philosophy, anthropology and computer science, M. Lynne Murphy proposes a pragmatic approach to these relations. Whereas traditional approaches have claimed that paradigmatic relations are part of our lexical knowledge, Dr Murphy argues that they constitute metalinguistic knowledge, which can be derived through a single relational principle, and may also be stored as part of our extra-lexical, conceptual representations of a word. Part I shows how this approach can account for the properties of lexical relations in ways that traditional approaches cannot, and Part II examines particular relations in detail. This book will serve as an informative handbook for all linguists and cognitive scientists interested in the mental representation of vocabulary.

The Lexicon

The Lexicon
Author: Elisabetta Ježek
Publisher: Oxford University Press
Total Pages: 249
Release: 2016-01-29
Genre: Language Arts & Disciplines
ISBN: 0191667110

The Lexicon provides an introduction to the study of words, their main properties, and how we use them to create meaning. It offers a detailed description of the organizing principles of the lexicon, and of the categories used to classify a wide range of lexical phenomena, including polysemy, meaning variation in composition, and the interplay with ontology, syntax, and pragmatics. Elisabetta Ježek uses empirical data from digitalized corpora and speakers' judgements, combined with the formalisms developed in the field of general and theoretical linguistics, to propose representations for each of these phenomena. The key feature of the book is that it merges theoretical accounts with lexicographic approaches and computational insights. Its clear structure and accessible approach make The Lexicon an ideal textbook for all students of linguistics—theoretical, applied, and computational—and a valuable resource for scholars and students of language in the fields of cognitive science and philosophy.

The Oxford Handbook of Computational Linguistics

The Oxford Handbook of Computational Linguistics
Author: Ruslan Mitkov
Publisher: Oxford University Press
Total Pages: 808
Release: 2004
Genre: Computers
ISBN: 019927634X

This handbook of computational linguistics, written for academics, graduate students and researchers, provides a state-of-the-art reference to one of the most active and productive fields in linguistics.

Subsymbolic Natural Language Processing

Subsymbolic Natural Language Processing
Author: Risto Miikkulainen
Publisher: MIT Press
Total Pages: 422
Release: 1993
Genre: Computers
ISBN: 9780262132909

Risto Miikkulainen draws on recent connectionist work in language comprehension tocreate a model that can understand natural language. Using the DISCERN system as an example, hedescribes a general approach to building high-level cognitive models from distributed neuralnetworks and shows how the special properties of such networks are useful in modeling humanperformance. In this approach connectionist networks are not only plausible models of isolatedcognitive phenomena, but also sufficient constituents for complete artificial intelligencesystems.Distributed neural networks have been very successful in modeling isolated cognitivephenomena, but complex high-level behavior has been tractable only with symbolic artificialintelligence techniques. Aiming to bridge this gap, Miikkulainen describes DISCERN, a completenatural language processing system implemented entirely at the subsymbolic level. In DISCERN,distributed neural network models of parsing, generating, reasoning, lexical processing, andepisodic memory are integrated into a single system that learns to read, paraphrase, and answerquestions about stereotypical narratives.Miikkulainen's work, which includes a comprehensive surveyof the connectionist literature related to natural language processing, will prove especiallyvaluable to researchers interested in practical techniques for high-level representation,inferencing, memory modeling, and modular connectionist architectures.Risto Miikkulainen is anAssistant Professor in the Department of Computer Sciences at The University of Texas atAustin.