Corpus Linguistics Beyond the Word

Author:
Publisher: BRILL
Total Pages: 287
Release: 2015-07-14
Genre: Language Arts & Disciplines
ISBN: 9401203849

This volume will be of particular interest to readers who want to expand the applications of corpus linguistics techniques through new tools and approaches. The text includes selected papers from the Fifth North American Symposium, hosted by the Linguistics Department at Montclair State University in Montclair, New Jersey, in May 2004. The symposium papers represented several areas of corpus studies, including language development, syntactic analysis, pragmatics and discourse, language change, register variation, corpus creation and annotation, and practical applications of corpus work, primarily in language teaching, but also in medical training and machine translation. A common thread through most of the papers was the use of corpora to study domains longer than the word. Not surprisingly, fully half of the papers deal with the computational tools and linguistic strategies needed to search for and analyze these longer spans of language, while most of the remaining papers examine particular syntactic and rhetorical properties of one or more corpora.

Corpus Linguistics and Statistics with R

Author: Guillaume Desagulier
Publisher: Springer
Total Pages: 359
Release: 2017-11-17
Genre: Computers
ISBN: 3319645722

This textbook examines empirical linguistics from a theoretical linguist’s perspective. It provides both a theoretical discussion of what quantitative corpus linguistics entails and detailed, hands-on, step-by-step instructions to implement the techniques in the field. The statistical methodology and R-based coding from this book teach readers the basic and then more advanced skills to work with large data sets in their linguistics research and studies. Massive data sets are now more than ever the basis for work that ranges from usage-based linguistics to the far reaches of applied linguistics. This book presents much of the methodology in a corpus-based approach. However, the corpus-based methods in this book are also essential components of recent developments in sociolinguistics, historical linguistics, computational linguistics, and psycholinguistics. Material from the book will also be appealing to researchers in digital humanities and the many non-linguistic fields that use textual data analysis and text-based sensorimetrics. Chapters cover topics including corpus processing, frequency data, and clustering methods. Case studies illustrate each chapter with accompanying data sets, R code, and exercises for use by readers. This book may be used in advanced undergraduate courses, graduate courses, and self-study.

The Cambridge Handbook of English Corpus Linguistics

Author: Douglas Biber
Publisher: Cambridge University Press
Total Pages: 757
Release: 2015-06-25
Genre: Language Arts & Disciplines
ISBN: 1316298701

The Cambridge Handbook of English Corpus Linguistics (CHECL) surveys the breadth of corpus-based linguistic research on English, including chapters on collocations, phraseology, grammatical variation, historical change, and the description of registers and dialects. The most innovative aspects of the CHECL are its emphasis on critical discussion, its explicit evaluation of the state of the art in each sub-discipline, and the inclusion of empirical case studies. While each chapter includes a broad survey of previous research, the primary focus is on a detailed description of the most important corpus-based studies in this area, with discussion of what those studies found, and why they are important. Each chapter also includes a critical discussion of the corpus-based methods employed for research in this area, as well as an explicit summary of new findings and discoveries.

Corpus Linguistics and the Web

Author:
Publisher: BRILL
Total Pages: 311
Release: 2015-07-14
Genre: Language Arts & Disciplines
ISBN: 9401203792

Using the web as a corpus is one of the recent challenges for corpus linguistics. This volume presents a current state-of-the-art discussion of the topic. The articles address practical problems, such as suitable linguistic search tools for accessing the web and the question of register variation, and they probe methods for culling data from the web. The book also offers a wide range of case studies, covering morphology, syntax, and lexis, as well as synchronic and diachronic variation in English. These case studies make use of the two approaches to the web in corpus linguistics: web-as-corpus and web-for-corpus-building. The case studies demonstrate that web data can provide useful additional evidence for a broad range of research questions.

Beyond Concordance Lines

Author: Pascual Pérez-Paredes
Publisher: John Benjamins Publishing Company
Total Pages: 267
Release: 2021-12-15
Genre: Language Arts & Disciplines
ISBN: 902725849X

In over 30 years of data-driven learning (DDL) research, there has been a growing sophistication in the ways we collect, analyse, and put corpus data to use. This volume takes a three-fold perspective on DDL. First, it looks at DDL and its role in informing language learning theory and how it might shed light on the language development process; secondly, it addresses how DDL can help us characterise learner language and inform teaching accordingly; and thirdly, it showcases practical applications for the use of DDL in classrooms. The contributors to this volume examine a variety of instructional settings and languages across the world. They reflect on theoretical, methodological and classroom implications using both novel and established language learning theories, natural language processing (NLP), longitudinal research designs, and a variety of language learning targets. The present volume is an invitation from some of the leading researchers in DDL to reflect on the research avenues that will define the field in the coming years.

The Routledge Handbook of Corpus Linguistics

Author: Anne O'Keeffe
Publisher: Routledge
Total Pages: 1263
Release: 2010-04-05
Genre: Education
ISBN: 1135153620

The Routledge Handbook of Corpus Linguistics provides a timely overview of a dynamic and rapidly growing area with a widely applied methodology. Through the electronic analysis of large bodies of text, corpus linguistics demonstrates and supports linguistic statements and assumptions. In recent years it has seen an ever-widening application in a variety of fields: computational linguistics, discourse analysis, forensic linguistics, pragmatics and translation studies. Bringing together experts in the key areas of development and change, the handbook is structured around six themes which take the reader through building and designing a corpus to using a corpus to study literature and translation. A comprehensive introduction covers the historical development of the field and its growing influence and application in other areas. Structured around five headings for ease of reference, each contribution includes further reading sections with three to five key texts highlighted and annotated to facilitate further exploration of the topics. The Routledge Handbook of Corpus Linguistics is the ideal resource for advanced undergraduates and postgraduates.

The Changing Face of Corpus Linguistics

Author: Antoinette Renouf
Publisher: Rodopi
Total Pages: 408
Release: 2016-08
Genre: Computers
ISBN: 940120179X

Preliminary Material /Antoinette Renouf and Andrew Kehoe -- The corpus-user's chorus: (Based on The Major General's Song from Gilbert and Sullivan's The Pirates of Penzance) /Antoinette Renouf and Andrew Kehoe -- Introduction: The changing face of corpus linguistics /Antoinette Renouf and Andrew Kehoe -- Oh Canada! Towards the Corpus of Early Ontario English /Stefan Dollinger -- Favoring Americanisms? vs. before and in Early English in Australia: A corpus-based approach /Clemens Fritz -- Computing the Lexicons of Early Modern English /Ian Lancashire -- EFL dictionaries, grammars and language guides from 1700 to 1850: testing a new corpus on points of spokenness /Manfred Markus -- The Old English Apollonius of Tyre in the light of the Old English Concordancer /Antonio Miranda García, Javier Calle Martín, David Moreno Olalla and Gustavo Muñoz González -- Prediction with SHALL and WILL: a diachronic perspective /Maurizio Gotti -- Circumstantial adverbials in discourse: a synchronic and a diachronic perspective /Anneli Meurman-Solin and Päivi Pahta -- Changes in textual structures of book advertisements in the ZEN Corpus /Caren auf dem Keller -- “Curtains like these are selling right in the city of Chicago for USD 1.50” - The mediopassive in American 20th-century advertising language /Marianne Hundt -- Recent grammatical change in written English 1961-1992: some preliminary findings of a comparison of American with British English /Geoffrey Leech and Nicholas Smith -- Social variation in the use of apology formulae in the British National Corpus /Mats Deutschmann -- How recent is recent? On overcoming interpretational difficulties /Göran Kjellmer -- Looking at looking: Functions and contexts of progressives in spoken English and 'school' English /Ute Römer -- Ditransitives, the Given Before New principle, and textual retrievability: a corpus-based study using ICECUP /Gabriel Ozón -- The Spanish pragmatic marker pues and its English equivalents /Anna-Brita Stenström -- WebCorp: A tool for online linguistic information retrieval and analysis /Barry Morley -- Diachronic linguistic analysis on the web with WebCorp /Andrew Kehoe -- New ways of analysing ESL on the WWW with WebCorp and WebPhraseCount /Josef Schmied -- I'm like, “Hey, it works!”: Using GlossaNet to find attestations of the quotative (be) like in English-language newspapers /Cédrick Fairon and John V. Singler -- Corpus linguistics and English reference grammars /Joybrato Mukherjee -- Tracking ongoing grammatical change and recent diversification in present-day standard English: the complementary role of small and large corpora /Christian Mair -- but it will take time...points of view on a lexical grammar of English /Michaela Mahlberg -- Corpus linguistics, grammar and theory: Report on a panel discussion at the 24th ICAME conference /Jan Aarts.

Developing Linguistic Corpora

Author: Martin Wynne
Publisher: Oxbow Books Limited
Total Pages: 100
Release: 2005
Genre: Language Arts & Disciplines
ISBN:

A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.