Linguistic Linked Data

Author: Philipp Cimiano
Publisher: Springer Nature
Total Pages: 289
Release: 2020-01-13
Genre: Computers
ISBN: 3030302253

This is the first monograph on the emerging area of linguistic linked data. Presenting a combination of background information on linguistic linked data and concrete implementation advice, it introduces and discusses the main benefits of applying linked data (LD) principles to the representation and publication of linguistic resources, arguing that LD does not look at a single resource in isolation but seeks to create a large network of resources that can be used together and uniformly, thereby making more of each individual resource. The book describes how the LD principles can be applied to modelling language resources. The first part provides the foundation for understanding the remainder of the book, introducing the data models, ontology and query languages used as the basis of the Semantic Web and LD and offering a more detailed overview of the Linguistic Linked Data Cloud. The second part of the book focuses on modelling language resources using LD principles, describing how to model lexical resources using Ontolex-lemon, the lexicon model for ontologies, and how to annotate and address elements of text represented in RDF. It also demonstrates how to model annotations, and how to capture the metadata of language resources. Further, it includes a chapter on representing linguistic categories. In the third part of the book, the authors describe how language resources can be transformed into LD and how links can be inferred and added to the data to increase connectivity and linking between different datasets. They also discuss using LD resources for natural language processing. The last part describes concrete applications of the technologies: representing and linking multilingual wordnets, applications in digital humanities and the discovery of language resources.
Given its scope, the book is relevant for researchers and graduate students interested in topics at the crossroads of natural language processing / computational linguistics and the Semantic Web / linked data. It appeals to Semantic Web experts who are not proficient in applying the Semantic Web and LD principles to linguistic data, as well as to computational linguists who are used to working with lexical and linguistic resources and want to learn about a new paradigm for modelling, publishing and exploiting linguistic resources.

Linked Data in Linguistics

Author: Christian Chiarcos
Publisher: Springer Science & Business Media
Total Pages: 220
Release: 2012-02-21
Genre: Computers
ISBN: 3642282490

The explosion of information technology has led to substantial growth of web-accessible linguistic data in terms of quantity, diversity and complexity. These resources become even more useful when interlinked with each other to generate network effects. The general trend of providing data online is thus accompanied by newly developing methodologies to interconnect linguistic data and metadata. This includes linguistic data collections, general-purpose knowledge bases (e.g., DBpedia, a machine-readable edition of Wikipedia), and repositories with specific information about languages, linguistic categories and phenomena. The Linked Data paradigm provides a framework for interoperability and access management, and thereby allows information from such a diverse set of resources to be integrated. The contributions assembled in this volume illustrate the breadth of applications of the Linked Data paradigm for representative types of language resources. They cover lexical-semantic resources, annotated corpora, typological databases as well as terminology and metadata repositories. The book includes representative applications from diverse fields, ranging from academic linguistics (e.g., typology and corpus linguistics) through applied linguistics (e.g., lexicography and translation studies) to technical applications (in computational linguistics, Natural Language Processing and information technology). This volume accompanies the Workshop on Linked Data in Linguistics 2012 (LDL-2012) in Frankfurt/M., Germany, organized by the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation (OKFN). It assembles contributions of the workshop participants and, beyond this, it summarizes initial steps in the formation of a Linked Open Data cloud of linguistic resources, the Linguistic Linked Open Data cloud (LLOD).

Analyzing Linguistic Data

Author: R. H. Baayen
Publisher: Cambridge University Press
Total Pages: 40
Release: 2008-03-06
Genre: Language Arts & Disciplines
ISBN: 1139470736

Statistical analysis is a useful skill for linguists and psycholinguists, allowing them to understand the quantitative structure of their data. This textbook provides a straightforward introduction to the statistical analysis of language. Designed for linguists with a non-mathematical background, it clearly introduces the basic principles and methods of statistical analysis, using 'R', the leading computational statistics programme. The reader is guided step-by-step through a range of real data sets, allowing them to analyse acoustic data, construct grammatical trees for a variety of languages, quantify register variation in corpus linguistics, and measure experimental data using state-of-the-art models. The visualization of data plays a key role, both in the initial stages of data exploration and later on when the reader is encouraged to criticize various models. Containing over 40 exercises with model answers, this book will be welcomed by all linguists wishing to learn more about working with and presenting quantitative data.

Linguistic Ethnography

Author: Fiona Copland
Publisher: SAGE
Total Pages: 307
Release: 2015-01-22
Genre: Social Science
ISBN: 147391115X

This is an engaging interdisciplinary guide to the unique role of language within ethnography. The book provides a philosophical overview of the field alongside practical support for designing and developing your own ethnographic research. It demonstrates how to build and develop arguments and engages with practical issues such as ethics, transcription and impact. There are chapter-long case studies based on real research that will explain key themes and help you create and analyse your own linguistic data. Drawing on their experience, the authors outline the practical, epistemological and theoretical decisions that researchers must take when planning and carrying out their studies. Other key features include a clear introduction to discourse analytic traditions; tips on how to produce effective field notes; guidance on how to manage interview and conversational data; advice on writing linguistic ethnographies for different audiences; annotated suggestions for further reading; and a full glossary. This book is a master class in understanding linguistic ethnography; it will be of interest to anyone conducting field research across the social sciences.

Linked Democracy

Author: Marta Poblet
Publisher: Springer
Total Pages: 141
Release: 2019-05-28
Genre: Law
ISBN: 303013363X

This open access book shows the factors linking information flow, social intelligence, rights management and modelling with epistemic democracy, offering licensed linked data along with information about the rights involved. This model of democracy for the web of data brings new challenges for the social organisation of knowledge, collective innovation, and the coordination of actions. Licensed linked data, licensed linguistic linked data, rights expression languages, semantic web regulatory models, electronic institutions, and artificial socio-cognitive systems are examples of regulatory and institutional design (regulations by design). The web has been massively populated with both data and services, and semantically structured data, the linked data cloud, facilitates and fosters human-machine interaction. Linked data aims to create ecosystems that make it possible to browse, discover, exploit and reuse data sets for applications. Rights expression languages semi-automatically regulate the use and reuse of content.

Developing Linguistic Corpora

Author: Martin Wynne
Publisher: Oxbow Books Limited
Total Pages: 100
Release: 2005
Genre: Language Arts & Disciplines
ISBN:

A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.

Linguistic Fieldwork

Author: Jeanette Sakel
Publisher: Cambridge University Press
Total Pages: 193
Release: 2012-02-02
Genre: Language Arts & Disciplines
ISBN: 0521837278

A handy beginner's guide to linguistic fieldwork - from the preparation of the work to the presentation of the results.

Reasoning Web. Reasoning and the Web in the Big Data Era

Author: Manolis Koubarakis
Publisher: Springer
Total Pages: 397
Release: 2014-09-03
Genre: Computers
ISBN: 3319105876

This volume contains the lecture notes of the 10th Reasoning Web Summer School 2014, held in Athens, Greece, in September 2014. The 2014 lecture program of the Reasoning Web introduces students to recent advances in big data aspects of the semantic web and linked data, and to the fundamentals of reasoning techniques that can be used to tackle big data applications.

Linguistics for the Age of AI

Author: Marjorie McShane
Publisher: MIT Press
Total Pages: 449
Release: 2021-03-02
Genre: Computers
ISBN: 0262362600

A human-inspired, linguistically sophisticated model of language understanding for intelligent agent systems. One of the original goals of artificial intelligence research was to endow intelligent agents with human-level natural language capabilities. Recent AI research, however, has focused on applying statistical and machine learning approaches to big data rather than attempting to model what people do and how they do it. In this book, Marjorie McShane and Sergei Nirenburg return to the original goal of recreating human-level intelligence in a machine. They present a human-inspired, linguistically sophisticated model of language understanding for intelligent agent systems that emphasizes meaning--the deep, context-sensitive meaning that a person derives from spoken or written language.

The Atlas of Pidgin and Creole Language Structures

Author: Susanne Maria Michaelis
Publisher: Oxford University Press, USA
Total Pages: 572
Release: 2013-09-05
Genre: Language Arts & Disciplines
ISBN: 0199691398

The Atlas presents commentaries and colour maps showing how 130 linguistic features - phonological, syntactic, morphological, and lexical - are distributed among the world's pidgins and creoles. Designed and written by the world's leading experts, it is a unique resource of outstanding value for linguists of all persuasions throughout the world.