Linguistic Modeling of Information and Markup Languages

Linguistic Modeling of Information and Markup Languages
Author: Andreas Witt
Publisher: Springer Science & Business Media
Total Pages: 272
Release: 2010-01-09
Genre: Computers
ISBN: 9048133319

This book covers recent developments in the field, from multi-layered mark-up and standards to theoretical formalisms to applications. It presents results from international research in text technology, computational linguistics, hypertext modeling and more.

Linguistic Linked Data

Linguistic Linked Data
Author: Philipp Cimiano
Publisher: Springer Nature
Total Pages: 289
Release: 2020-01-13
Genre: Computers
ISBN: 3030302253

This is the first monograph on the emerging area of linguistic linked data. Presenting a combination of background information on linguistic linked data and concrete implementation advice, it introduces and discusses the main benefits of applying linked data (LD) principles to the representation and publication of linguistic resources, arguing that LD does not look at a single resource in isolation but seeks to create a large network of resources that can be used together and uniformly, and so making more of the single resource. The book describes how the LD principles can be applied to modelling language resources. The first part provides the foundation for understanding the remainder of the book, introducing the data models, ontology and query languages used as the basis of the Semantic Web and LD and offering a more detailed overview of the Linguistic Linked Data Cloud. The second part of the book focuses on modelling language resources using LD principles, describing how to model lexical resources using Ontolex-lemon, the lexicon model for ontologies, and how to annotate and address elements of text represented in RDF. It also demonstrates how to model annotations, and how to capture the metadata of language resources. Further, it includes a chapter on representing linguistic categories. In the third part of the book, the authors describe how language resources can be transformed into LD and how links can be inferred and added to the data to increase connectivity and linking between different datasets. They also discuss using LD resources for natural language processing. The last part describes concrete applications of the technologies: representing and linking multilingual wordnets, applications in digital humanities and the discovery of language resources. Given its scope, the book is relevant for researchers and graduate students interested in topics at the crossroads of natural language processing / computational linguistics and the Semantic Web / linked data. It appeals to Semantic Web experts who are not proficient in applying the Semantic Web and LD principles to linguistic data, as well as to computational linguists who are used to working with lexical and linguistic resources wanting to learn about a new paradigm for modelling, publishing and exploiting linguistic resources.

Modeling, Learning, and Processing of Text-Technological Data Structures

Modeling, Learning, and Processing of Text-Technological Data Structures
Author: Alexander Mehler
Publisher: Springer
Total Pages: 398
Release: 2011-10-14
Genre: Technology & Engineering
ISBN: 3642226132

Researchers in many disciplines have been concerned with modeling textual data in order to account for texts as the primary information unit of written communication. The book “Modelling, Learning and Processing of Text-Technological Data Structures” deals with this challenging information unit. It focuses on theoretical foundations of representing natural language texts as well as on concrete operations of automatic text processing. Following this integrated approach, the present volume includes contributions to a wide range of topics in the context of processing of textual data. This relates to the learning of ontologies from natural language texts, the annotation and automatic parsing of texts as well as the detection and tracking of topics in texts and hypertexts. In this way, the book brings together a wide range of approaches to procedural aspects of text technology as an emerging scientific discipline.

Handbook of Linguistic Annotation

Handbook of Linguistic Annotation
Author: Nancy Ide
Publisher: Springer
Total Pages: 1440
Release: 2017-06-16
Genre: Language Arts & Disciplines
ISBN: 9402408819

This handbook offers a thorough treatment of the science of linguistic annotation. Leaders in the field guide the reader through the process of modeling, creating an annotation language, building a corpus and evaluating it for correctness. Essential reading for both computer scientists and linguistic researchers.Linguistic annotation is an increasingly important activity in the field of computational linguistics because of its critical role in the development of language models for natural language processing applications. Part one of this book covers all phases of the linguistic annotation process, from annotation scheme design and choice of representation format through both the manual and automatic annotation process, evaluation, and iterative improvement of annotation accuracy. The second part of the book includes case studies of annotation projects across the spectrum of linguistic annotation types, including morpho-syntactic tagging, syntactic analyses, a range of semantic analyses (semantic roles, named entities, sentiment and opinion), time and event and spatial analyses, and discourse level analyses including discourse structure, co-reference, etc. Each case study addresses the various phases and processes discussed in the chapters of part one.

Linked Data in Linguistics

Linked Data in Linguistics
Author: Christian Chiarcos
Publisher: Springer Science & Business Media
Total Pages: 220
Release: 2012-02-21
Genre: Computers
ISBN: 3642282490

The explosion of information technology has led to substantial growth of web-accessible linguistic data in terms of quantity, diversity and complexity. These resources become even more useful when interlinked with each other to generate network effects. The general trend of providing data online is thus accompanied by newly developing methodologies to interconnect linguistic data and metadata. This includes linguistic data collections, general-purpose knowledge bases (e.g., the DBpedia, a machine-readable edition of the Wikipedia), and repositories with specific information about languages, linguistic categories and phenomena. The Linked Data paradigm provides a framework for interoperability and access management, and thereby allows to integrate information from such a diverse set of resources. The contributions assembled in this volume illustrate the band-width of applications of the Linked Data paradigm for representative types of language resources. They cover lexical-semantic resources, annotated corpora, typological databases as well as terminology and metadata repositories. The book includes representative applications from diverse fields, ranging from academic linguistics (e.g., typology and corpus linguistics) over applied linguistics (e.g., lexicography and translation studies) to technical applications (in computational linguistics, Natural Language Processing and information technology). This volume accompanies the Workshop on Linked Data in Linguistics 2012 (LDL-2012) in Frankfurt/M., Germany, organized by the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation (OKFN). It assembles contributions of the workshop participants and, beyond this, it summarizes initial steps in the formation of a Linked Open Data cloud of linguistic resources, the Linguistic Linked Open Data cloud (LLOD).

The Shape of Data in Digital Humanities

The Shape of Data in Digital Humanities
Author: Julia Flanders
Publisher: Routledge
Total Pages: 373
Release: 2018-11-02
Genre: Language Arts & Disciplines
ISBN: 1317016149

Data and its technologies now play a large and growing role in humanities research and teaching. This book addresses the needs of humanities scholars who seek deeper expertise in the area of data modeling and representation. The authors, all experts in digital humanities, offer a clear explanation of key technical principles, a grounded discussion of case studies, and an exploration of important theoretical concerns. The book opens with an orientation, giving the reader a history of data modeling in the humanities and a grounding in the technical concepts necessary to understand and engage with the second part of the book. The second part of the book is a wide-ranging exploration of topics central for a deeper understanding of data modeling in digital humanities. Chapters cover data modeling standards and the role they play in shaping digital humanities practice, traditional forms of modeling in the humanities and how they have been transformed by digital approaches, ontologies which seek to anchor meaning in digital humanities resources, and how data models inhabit the other analytical tools used in digital humanities research. It concludes with a glossary chapter that explains specific terms and concepts for data modeling in the digital humanities context. This book is a unique and invaluable resource for teaching and practising data modeling in a digital humanities context.

Development of Linguistic Linked Open Data Resources for Collaborative Data-Intensive Research in the Language Sciences

Development of Linguistic Linked Open Data Resources for Collaborative Data-Intensive Research in the Language Sciences
Author: Antonio Pareja-Lora
Publisher: MIT Press
Total Pages: 273
Release: 2020-01-07
Genre: Language Arts & Disciplines
ISBN: 0262536250

Making diverse data in linguistics and the language sciences open, distributed, and accessible: perspectives from language/language acquistiion researchers and technical LOD (linked open data) researchers. This volume examines the challenges inherent in making diverse data in linguistics and the language sciences open, distributed, integrated, and accessible, thus fostering wide data sharing and collaboration. It is unique in integrating the perspectives of language researchers and technical LOD (linked open data) researchers. Reporting on both active research needs in the field of language acquisition and technical advances in the development of data interoperability, the book demonstrates the advantages of an international infrastructure for scholarship in the field of language sciences. With contributions by researchers who produce complex data content and scholars involved in both the technology and the conceptual foundations of LLOD (linguistics linked open data), the book focuses on the area of language acquisition because it involves complex and diverse data sets, cross-linguistic analyses, and urgent collaborative research. The contributors discuss a variety of research methods, resources, and infrastructures. Contributors Isabelle Barrière, Nan Bernstein Ratner, Steven Bird, Maria Blume, Ted Caldwell, Christian Chiarcos, Cristina Dye, Suzanne Flynn, Claire Foley, Nancy Ide, Carissa Kang, D. Terence Langendoen, Barbara Lust, Brian MacWhinney, Jonathan Masci, Steven Moran, Antonio Pareja-Lora, Jim Reidy, Oya Y. Rieger, Gary F. Simons, Thorsten Trippel, Kara Warburton, Sue Ellen Wright, Claus Zinn

Annotations in Scholarly Editions and Research

Annotations in Scholarly Editions and Research
Author: Julia Nantke
Publisher: Walter de Gruyter GmbH & Co KG
Total Pages: 416
Release: 2020-10-26
Genre: Literary Criticism
ISBN: 3110689170

The term ‘annotation’ is associated in the Humanities and Technical Sciences with different concepts that vary in coverage, application and direction but which also have instructive parallels. This publication mirrors the increasing cooperation that has been taking place between the two disciplines within the scope of the digitalization of the Humanities. It presents the results of an international conference on the concept of annotation that took place at the University of Wuppertal in February 2019. This publication reflects on different practices and associated concepts of annotation in an interdisciplinary perspective, puts them in relation to each other and attempts to systematize their commonalities and divergences. The following dynamic visualizations allow an interactive navigation within the volume based on keywords: Wordcloud ☁ , Matrix ▦ , Edge Bundling ⊛

Challenges of Anglophone Language(s), Literatures and Cultures

Challenges of Anglophone Language(s), Literatures and Cultures
Author: Alena Kačmárová
Publisher: Cambridge Scholars Publishing
Total Pages: 340
Release: 2017-01-06
Genre: Social Science
ISBN: 1443861472

This book explores scholarly challenges within the fields of Anglophone language, literature, and culture. The section focusing on language details issues falling within two areas: namely, language contact and the language-culture relationship, and stylistic and syntactic perspectives on the English language. The literature part investigates twentieth-century American, English, and Australian literature, dealing with both poetry and prose and discussing topics of identity, gender, metafiction, postmodern conditions, and other relevant theoretical issues in contemporary literature. The culture part treats theoretical approaches in cultural studies that are vital in today’s cultural context, especially in Central European universities, the Irish language and culture, and contemporary cultural phenomena inspired by the growing ubiquity of technological intrusions into various fields of cultural production.

The People’s Web Meets NLP

The People’s Web Meets NLP
Author: Iryna Gurevych
Publisher: Springer Science & Business Media
Total Pages: 394
Release: 2013-04-03
Genre: Language Arts & Disciplines
ISBN: 3642350852

Collaboratively Constructed Language Resources (CCLRs) such as Wikipedia, Wiktionary, Linked Open Data, and various resources developed using crowdsourcing techniques such as Games with a Purpose and Mechanical Turk have substantially contributed to the research in natural language processing (NLP). Various NLP tasks utilize such resources to substitute for or supplement conventional lexical semantic resources and linguistically annotated corpora. These resources also provide an extensive body of texts from which valuable knowledge is mined. There are an increasing number of community efforts to link and maintain multiple linguistic resources. This book aims offers comprehensive coverage of CCLR-related topics, including their construction, utilization in NLP tasks, and interlinkage and management. Various Bachelor/Master/Ph.D. programs in natural language processing, computational linguistics, and knowledge discovery can use this book both as the main text and as a supplementary reading. The book also provides a valuable reference guide for researchers and professionals for the above topics.