Working with German Corpora

Working with German Corpora
Author: Bill Dodd
Publisher: A&C Black
Total Pages: 329
Release: 2006-06-21
Genre: Language Arts & Disciplines
ISBN: 1847143393

The essays in this volume, writen by Germanists from Britain, Ireland, the USA and Australia, illustrate the enormous potential which corpus-based work has for German Studies as a whole and the rich diversity of work currently being undertaken. A detailed introduction explains basic concepts, methods, and applications of corpus-based work.

Cross-Linguistic Corpora for the Study of Translations

Cross-Linguistic Corpora for the Study of Translations
Author: Silvia Hansen-Schirra
Publisher: Walter de Gruyter
Total Pages: 320
Release: 2012-12-06
Genre: Language Arts & Disciplines
ISBN: 3110260328

The book specifies a corpus architecture, including annotation and querying techniques, and its implementation. The corpus architecture is developed for empirical studies of translations, and beyond those for the study of texts which are inter-lingually comparable, particularly texts of similar registers. The compiled corpus, CroCo, is a resource for research and is, with some copyright restrictions, accessible to other research projects. Most of the research was undertaken as part of a DFG-Project into linguistic properties of translations. Fundamentally, this research project was a corpus-based investigation into the language pair English-German. The long-term goal is a contribution to the study of translation as a contact variety, and beyond this to language comparison and language contact more generally with the language pair English - German as our object languages. This goal implies a thorough interest in possible specific properties of translations, and beyond this in an empirical translation theory. The methodology developed is not restricted to the traditional exclusively system-based comparison of earlier days, where real-text excerpts or constructed examples are used as mere illustrations of assumptions and claims, but instead implements an empirical research strategy involving structured data (the sub-corpora and their relationships to each other, annotated and aligned on various theoretically motivated levels of representation), the formation of hypotheses and their operationalizations, statistics on the data, critical examinations of their significance, and interpretation against the background of system-based comparisons and other independent sources of explanation for the phenomena observed. Further applications of the resource developed in computational linguistics are outlined and evaluated.

Working with Spanish Corpora

Working with Spanish Corpora
Author: Giovanni Parodi
Publisher: Bloomsbury Publishing
Total Pages: 280
Release: 2007-09-01
Genre: Language Arts & Disciplines
ISBN: 0826438350

The main focus of this book is the investigation of linguistic variation in Spanish, considering spoken and written, specialised and non-specialised registers from a corpus linguistics approach and employing computational updated tools. The ten chapters represent a range of research on Spanish using a number of different corpora drawn from, amongst others, research articles, student writing, formal conversation and technical reports. A variety of methodologies are brought to bear upon these corpora including multi-dimensional and multi-register analysis, latent semantics and lexical bundles. This in-depth analysis of using Spanish corpora will be of interest to researchers in corpus linguistics or Spanish language.

Teaching and Language Corpora

Teaching and Language Corpora
Author: Anne Wichmann
Publisher: Routledge
Total Pages: 362
Release: 2014-06-11
Genre: Foreign Language Study
ISBN: 1317889584

Corpora are well-established as a resource for language research; they are now also increasingly being used for teaching purposes. This book is the first of its kind to deal explicitly and in a wide-ranging way with the use of corpora in teaching. It contains an extensive collection of articles by corpus linguists and practising teachers, covering not only the use of data to inform and create teaching materials but also the direct exploitation of corpora by students, both in the study of linguistics in general and in the acquisition of proficiency in individual languages, including English, Welsh, German, French and Italian. In addition, the book offers practical information on the sources of corpora and concordances, including those suitable for work on non-roman scripts such as Greek and Cyrillic.

Corpora and Language Education

Corpora and Language Education
Author: Lynne Flowerdew
Publisher: Springer
Total Pages: 365
Release: 2012-01-15
Genre: Computers
ISBN: 1403998930

Corpora and Language Education critically examines key concepts and issues in corpus linguistics, with a particular focus on the expanding interdisciplinary nature of the field and the role that written and spoken corpora now play in the fields of professional communication, teacher education, translation studies, lexicography, literature, critical discourse analysis, and forensic linguistics. The book also presents a series of corpus-based case studies illustrating central themes and best practices in the field.

Corpora in the Foreign Language Classroom

Corpora in the Foreign Language Classroom
Author:
Publisher: BRILL
Total Pages: 376
Release: 2015-07-14
Genre: Language Arts & Disciplines
ISBN: 9401203903

The papers published in this volume were originally presented at the Sixth International Conference on Teaching and Language Corpora (4-7 July 2004 Granada, Spain) and reflect the latest developments that have taken place in the field of the teaching applications of text corpora, with a special emphasis on their use in the foreign language classroom. The book is divided into three main sections. The first section sets the scene for what this collection of essays aims to be. It deals with the issue of what corpus linguistics can do not only for the understanding of the nature of language itself but also for so fundamental and miraculous a matter such as language learning and language acquisition. The second section tackles the issues of corpus design and corpus exploitation and provides the reader with a great variety of evidence in favour of corpora exploitation for the building of a successful teaching environment. The final section deals with practical applications of corpora in the foreign language classroom. Although each of the papers here reports particular experiences in very different teaching and learning contexts, as a whole they show that corpora can be used on the spot in a language teaching context by teachers and learners without extensive training in computational tools, and studies of linguistics features can be tailored to specific pedagogic context and learning requirements. The book represents a solid contribution to linguistic studies and language teaching and it is a good example of the diversity of the scientific lines in which corpus linguistics is involved at the present moment.

Working with Portuguese Corpora

Working with Portuguese Corpora
Author: Tony Berber Sardinha
Publisher: A&C Black
Total Pages: 347
Release: 2014-04-10
Genre: Language Arts & Disciplines
ISBN: 1472570014

Although Portuguese is one of the main world languages and researchers have been working on Portuguese electronic text collections for decades (e.g. Kelly, 1970; Biderman, 1978; Bacelar do Nascimento et al., 1984; see Berber Sardinha, 2005), this is the first volume in English that encapsulates the exciting and cutting-edge corpus linguistic work being done with Portuguese language corpora on different continents. The book includes chapters by leading corpus linguists dealing with Portuguese corpora across the world, and their contributions explore various methods and how they are applicable to a wide range of language issues. The book is divided into six sections, each covering a key issue in Corpus Linguistics: lexis and grammar, lexicography, language teaching and terminology, translation, corpus building and sharing, and parsing and annotation. Together these sections present the reader with a broad picture of the field.

The Routledge Handbook of Second Language Acquisition and Corpora

The Routledge Handbook of Second Language Acquisition and Corpora
Author: Nicole Tracy-Ventura
Publisher: Routledge
Total Pages: 395
Release: 2020-12-17
Genre: Language Arts & Disciplines
ISBN: 0429687559

The Routledge Handbook of Second Language Acquisition and Corpora is a state-of-the-art collection of cutting-edge scholarship at the intersection of second language acquisition and learner corpus research. It draws on data-driven, statistical analysis to outline the background, methods, and outcomes of language learning, with a range of global experts providing detailed guidelines and findings. The volume is organized into five sections: Methodological and theoretical contributions to the study of learner language using corpora – setting the scene Key aspects in corpus design, annotation, and analysis for SLA Corpora in SLA theory and practice SLA constructs and corpora Future directions This is a ground-breaking collection of essays offering incisive and essential reading for anyone with an interest in second language acquisition, learner corpus research, and applied linguistics.

GECCo - German-English Contrasts in Cohesion

GECCo - German-English Contrasts in Cohesion
Author: Kerstin Kunz
Publisher: Walter de Gruyter GmbH & Co KG
Total Pages: 288
Release: 2021-10-25
Genre: Language Arts & Disciplines
ISBN: 3110711079

In contrastive linguistics of English and German, there is a tradition of accounting for contrasts with respect to grammar and, to a lesser extent, for lexis and phonetics. Moving on to discourse and text, there is a sizeable body of literature on cohesive patterns in English and German respectively - but very little in terms of a comparison. The latter, though, is of particular interest for language learners, translators and, of course, linguists and researchers in language technology. This book attempts to close this gap, based on a number of years of corpus-based study into variation and cohesion in the two languages. While there is an overall focus on language contrasts, it also investigates variation between different registers language-internally, and between written and spoken mode in particular. For each of the five major types of cohesion (co-reference, substitution, ellipsis, conjunctive relations and lexical cohesion), overviews are given of contrasts in the system and of contrastive frequencies in texts. Results and methods presented in this book are thus relevant for language teaching, translation, language technology and corpus-based work on English and German generally.

Best Practices for Spoken Corpora in Linguistic Research

Best Practices for Spoken Corpora in Linguistic Research
Author: Şükriye Ruhi
Publisher: Cambridge Scholars Publishing
Total Pages: 285
Release: 2014-08-11
Genre: Language Arts & Disciplines
ISBN: 1443865540

A key concern of researchers involved in the creation and sharing of language resources is to attain maximum usability, reliability and longevity of these resources for present and future researchers in the language sciences. The view developed in this volume is that spoken corpora construction and sharing are major research endeavours that should also be laid open to academic debate in a manner that is more visible than is currently the case in corpus linguistics. The present volume brings together multiple research perspectives to bear on the question of what constitutes best practices for the construction of spoken corpora. The book brings into closer contact scholars whose specializations have often remained in relatively different streams of scientific investigation; that is, scholars whose work falls primarily in conversation analysis, pragmatics and discourse analysis, but who are involved in spoken corpus compilation, on the one hand, and scholars who also specialize in linguistics but who have been intensively involved in developing various infrastructures for spoken corpora, on the other hand. This combination of scholars brings into better relief the concerns of data providers, data curators and data users in linguistic research. This book is thus unique in that it highlights best practices from both the perspective of assembling, annotating and linguistic analysis of spoken corpora, as well as from the perspective of processing, archiving and disseminating spoken language. In doing so, the contributions emphasise not only the considerable promise that the rapid technological changes that society continues to experience in this area offer, but also possible dangers for the unwary.