Corpus Design and Construction in Minoritised Language Contexts - Cynllunio a Chreu Corpws mewn Cyd-destunau Ieithoedd Lleiafrifoledig

Corpus Design and Construction in Minoritised Language Contexts - Cynllunio a Chreu Corpws mewn Cyd-destunau Ieithoedd Lleiafrifoledig
Author: Dawn Knight
Publisher: Springer Nature
Total Pages: 178
Release: 2021-07-05
Genre: Language Arts & Disciplines
ISBN: 3030724840

This bilingual book provides a detailed overview of the project to construct a National Corpus of Contemporary Welsh (CorCenCC), addressing the conceptual and methodological challenges faced when developing language corpora for minoritised languages. A conceptual framework is presented for the user-driven design that underpinned the CorCenCC project, along with a detailed blueprint that can function as a scaffold for other researchers embarking on projects of this nature. This book will be of value to those working in language teaching, learning and assessment, language policy and planning, translation, corpus linguistics and language technology, and to anyone with an interest in Welsh and other minoritised languages. Mae'r llyfr dwyieithog hwn yn rhoi trosolwg manwl o'r prosiect i greu Corpws Cenedlaethol Cymraeg Cyfoes (CorCenCC), ac yn mynd i'r afael â'r heriau cysyniadol a methodolegol a wynebir wrth ddatblygu corpora iaith ar gyfer ieithoedd lleiafrifoledig. Cyflwynir fframwaith cysyniadol ar gyfer y cynllun wedi'i yrru gan ddefnyddwyr sy'n greiddiol i brosiect CorCenCC, ynghyd â glasbrint manwl a all weithredu fel sgaffald i ymchwilwyr eraill sy'n dechrau ar brosiectau o'r fath. Bydd y llyfr hwn o werth i'r rhai sy'n gweithio ym meysydd addysgu, dysgu ac asesu ieithoedd, polisi iaith a chynllunio ieithyddol, cyfieithu, ieithyddiaeth gorpws a thechnoleg iaith, ac unrhyw un â diddordeb yn y Gymraeg ac ieithoedd lleiafrifoledig eraill.

The Routledge Handbook of Corpus Linguistics

The Routledge Handbook of Corpus Linguistics
Author: Anne O'Keeffe
Publisher: Taylor & Francis
Total Pages: 755
Release: 2022-02-08
Genre: Language Arts & Disciplines
ISBN: 0429634137

The Routledge Handbook of Corpus Linguistics 2e provides an updated overview of a dynamic and rapidly growing area with a widely applied methodology. Over a decade on from the first edition of the Handbook, this collection of 47 chapters from experts in key areas offers a comprehensive introduction to both the development and use of corpora as well as their ever-evolving applications to other areas, such as digital humanities, sociolinguistics, stylistics, translation studies, materials design, language teaching and teacher development, media discourse, discourse analysis, forensic linguistics, second language acquisition and testing. The new edition updates all core chapters and includes new chapters on corpus linguistics and statistics, digital humanities, translation, phonetics and phonology, second language acquisition, social media and theoretical perspectives. Chapters provide annotated further reading lists and step-by-step guides as well as detailed overviews across a wide range of themes. The Handbook also includes a wealth of case studies that draw on some of the many new corpora and corpus tools that have emerged in the last decade. Organised across four themes, moving from the basic start-up topics such as corpus building and design to analysis, application and reflection, this second edition remains a crucial point of reference for advanced undergraduates, postgraduates and scholars in applied linguistics.

Developing Linguistic Corpora

Developing Linguistic Corpora
Author: Martin Wynne
Publisher: Oxbow Books Limited
Total Pages: 100
Release: 2005
Genre: Language Arts & Disciplines
ISBN:

A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.

Corpus and Context

Corpus and Context
Author: Svenja Adolphs
Publisher: John Benjamins Publishing
Total Pages: 176
Release: 2008
Genre: Language Arts & Disciplines
ISBN: 9789027223043

Corpus and Context explores the relationship between corpus linguistics and pragmatics by discussing possible frameworks for analysing utterance function on the basis of spoken corpora. The book articulates the challenges and opportunities associated with a change of focus in corpus research, from lexical to functional units, from concordance lines to extended stretches of discourse, and from the purely textual to multi-modal analysis of spoken corpus data. Drawing on a number of spoken corpora including the five million word Cambridge and Nottingham Corpus of Discourse in English (CANCODE, funded by CUP (c)), a specific speech act function is being explored using different approaches and different levels of analysis. This involves a close analysis of contextual variables in relation to lexico-grammatical and discoursal patterns that emerge from the corpus data, as well as a wider discussion of the role of context in spoken corpus research.

Building and Using the Siarad Corpus

Building and Using the Siarad Corpus
Author: Margaret Deuchar
Publisher:
Total Pages: 0
Release: 2018
Genre: Bilingualism
ISBN: 9789027200112

Introduction -- Building the corpus. Data collection and profile of the speakers in our corpus -- Transcription of the data -- Code-switching vs. borrowing: New implications arising from our data -- Using the corpus. The grammar of code-switching -- Code-switching and independent variables -- Change in Welsh grammar -- Additional research using Siarad -- Conclusion and future directions

Corpus Pragmatics

Corpus Pragmatics
Author: Karin Aijmer
Publisher: Cambridge University Press
Total Pages: 481
Release: 2015
Genre: Language Arts & Disciplines
ISBN: 1107015049

The first handbook to survey and expand the burgeoning field of corpus pragmatics, the intersection of pragmatics and corpus linguistics.

Comparative Stylistics of Welsh and English

Comparative Stylistics of Welsh and English
Author: Steve Morris
Publisher: University of Wales Press
Total Pages: 308
Release: 2018-07-15
Genre: Language Arts & Disciplines
ISBN: 1786832577

An analysis of Welsh stylistics in a corpus of 20th and 21st century texts. A study of the structure of Welsh compared with English via a translation corpus. A study of methods in translation.

Building a National Corpus

Building a National Corpus
Author: Dawn Knight
Publisher: Springer Nature
Total Pages: 192
Release: 2021-10-08
Genre: Language Arts & Disciplines
ISBN: 3030818586

This book aims to provide a micro-level, working model of a methodological approach and practical guidelines for building a corpus, informed by the work on the CorCenCC project (Corpws Cenedlaethol Cymraeg Cyfoes - the National Corpus of Contemporary Welsh). It focuses specifically on the development of detailed design frames for corpora across communicative modes (spoken, written and e-language), and the practical processes involved in the planning, collection, transcription, collation and (re)presentation of language data. The book is designed to be of significant value and relevance to those interested in critically engaging with corpus methodology. Although Welsh is the language under discussion, the processes and approaches discussed in the building of CorCenCC can be applied to a lesser or greater extent to other language contexts. This book provides a working model, and an account of how to build a corpus dataset from which step by step guidelines for creating other linguistic corpora in any language can be easily extrapolated. It will be of value to students and scholars of minority languages and corpus linguistics.

The Language of Business Meetings

The Language of Business Meetings
Author: Michael Handford
Publisher: Cambridge University Press
Total Pages: 289
Release: 2010-08-19
Genre: Foreign Language Study
ISBN: 052111666X

This book presents a corpus-based study of the language used in business meetings.

Multimodality and Active Listenership

Multimodality and Active Listenership
Author: Dawn Knight
Publisher: Bloomsbury Publishing
Total Pages: 273
Release: 2011-10-13
Genre: Language Arts & Disciplines
ISBN: 1441107479

Current corpora are invaluable resources for generating accurate and objective analyses of patterns of language use. However, spoken corpora are effectively mono-modal, presenting data in the same physical medium – text. The reality of a discourse situation is lost in its representation as text. Using multimodal data sets when conducting corpus-based pragmatic analyses is one solution. This book looks at multimodal corpora in some depth, using backchanneling as the conversational feature to be analysed. It provides a bottom-up investigation of the issues and challenges faced at every stage of multimodal corpus construction and analysis, as well as providing an in-depth linguistic analysis of a cross section of multimodal corpus data. The collaborative and co-operative nature of backchannels is highlighted in this book and an adapted pragmatic-functional linguistic coding matrix for the characterisation of backchanneling phenomena is presented. Dawn Knight also looks at possible directions in the construction and use of multimodal corpus linguistics.