Seeing through Multilingual Corpora

Author: Stig Johansson
Publisher: John Benjamins Publishing
Total Pages: 380
Release: 2007-02-14
Genre: Language Arts & Disciplines
ISBN: 9027292825

Through electronic corpora we can observe patterns which we were unaware of before or only vaguely glimpsed. The availability of multilingual corpora has led to a renewal of contrastive studies. We gain new insight into similarities and differences between languages, at the same time as the characteristics of each language are brought into relief. The present book focuses on the work in building and using the English-Norwegian Parallel Corpus and the Oslo Multilingual Corpus. Case studies are reported on lexis, grammar, and discourse. A concluding chapter sums up problems and prospects of corpus-based contrastive studies, including applications in lexicography, translator training, and foreign-language teaching. Though the main focus is on English and Norwegian, the approach should be of interest more generally for corpus-based contrastive research and for language studies in general. Seeing through corpora we can see through language.

Seeing Through Multilingual Corpora

Author: Stig Johansson
Publisher: John Benjamins Publishing
Total Pages: 392
Release: 2007-01-01
Genre: Language Arts & Disciplines
ISBN: 9789027223005

Through electronic corpora we can observe patterns which we were unaware of before or only vaguely glimpsed. The availability of multilingual corpora has led to a renewal of contrastive studies. We gain new insight into similarities and differences between languages, at the same time as the characteristics of each language are brought into relief. The present book focuses on the work in building and using the English-Norwegian Parallel Corpus and the Oslo Multilingual Corpus. Case studies are reported on lexis, grammar, and discourse. A concluding chapter sums up problems and prospects of corpus-based contrastive studies, including applications in lexicography, translator training, and foreign-language teaching. Though the main focus is on English and Norwegian, the approach should be of interest more generally for corpus-based contrastive research and for language studies in general. Seeing through corpora we can see through language.

Corpus Linguistics 25 Years on

Author:
Publisher: BRILL
Total Pages: 391
Release: 2015-07-14
Genre: Language Arts & Disciplines
ISBN: 9401204349

This volume offers a state-of-the-art picture of work undertaken in the field of computer-aided corpus linguistics. While the focus is on English, central insights can be generalised to other languages as well. As a work intended to mark the Silver Jubilee of ICAME, the International Computer Archive of Modern and Medieval English, the book combines surveys of the discipline by some of its major pioneers, including founders of ICAME itself, with cutting-edge work by younger scholars. It is divided into three sections: “Overviewing 25 years of corpus linguistic studies”, “Descriptive studies in English syntax and semantics”, and “Second Language Acquisition, parallel corpora and specialist corpora”. The book bears witness to the impressive advances that have characterised the development of corpus linguistics over the past few decades – from terminological issues to practical applications, from theoretical and descriptive research to applied approaches, from monolingual to multilingual and specialist corpora, from corpus design to corpus exploitation tools.

New Trends in Corpora and Language Learning

Author: Ana Frankenberg-Garcia
Publisher: Bloomsbury Publishing
Total Pages: 303
Release: 2011-01-20
Genre: Language Arts & Disciplines
ISBN: 1441112022

This book provides an up-to-date snapshot of recent research and developments in the use of corpora for language learning and teaching. It is divided into three parts. Part I focuses on innovative uses of corpora by language teachers and learners. These cover the world's first corpus-based TV program for the teaching of English conversation, as well as corpus-based approaches to the teaching of EAP, cultural studies and translation. Part II focuses on new corpus-based tools for LSP learning. Part III illustrates research findings from corpora consisting of language learner data and discusses their implications for language teaching and learning. It will appeal to scholars in both language teaching and learning and corpus and computational linguistics.

Advances in Corpus-based Contrastive Linguistics

Author: Karin Aijmer
Publisher: John Benjamins Publishing
Total Pages: 307
Release: 2013-03-13
Genre: Computers
ISBN: 9027272328

Contrastive studies have experienced a dramatic revival in recent decades. By combining the methodological advantages of computer corpus linguistics and the possibility of contrasting texts in two or more languages, the structure and use of languages can be explored with greater accuracy, detail and empirical strength than before. The approach has also proved to have fruitful practical applications in a number of areas such as language teaching, lexicography, translation studies and computer-aided translation. This volume contains twelve studies comparing linguistic phenomena in English and seven other languages. The topics range from comparisons of specific lexical categories and word combinations to syntactic constructions and discourse phenomena such as cohesion and thematic structure. The studies highlight similarities and differences in the use, semantics and functions of the compared items, as well as the emergence of new meanings and language change. The emphasis varies from purely linguistic studies to those focusing on practical applications.

Corpus Linguistics

Author: Tony McEnery
Publisher: Cambridge University Press
Total Pages: 311
Release: 2011-10-06
Genre: Language Arts & Disciplines
ISBN: 1139502441

Corpus linguistics is the study of language data on a large scale: the computer-aided analysis of very extensive collections of transcribed utterances or written texts. This textbook outlines the basic methods of corpus linguistics, explains how the discipline of corpus linguistics developed and surveys the major approaches to the use of corpus data. It uses a broad range of examples to show how corpus data has led to methodological and theoretical innovation in linguistics in general. Clear and detailed explanations lay out the key issues of method and theory in contemporary corpus linguistics. A structured and coherent narrative links the historical development of the field to current topics in 'mainstream' linguistics. Practical tasks and questions for discussion at the end of each chapter encourage students to test their understanding of what they have read, and an extensive glossary provides easy access to definitions of technical terms used in the text.

Multilingual Corpora and Multilingual Corpus Analysis

Author: Thomas Schmidt
Publisher: John Benjamins Publishing
Total Pages: 423
Release: 2012
Genre: Language Arts & Disciplines
ISBN: 9027219346

This volume deals with different aspects of the creation and use of multilingual corpora. The term 'multilingual corpus' is understood in a comprehensive sense, meaning any systematic collection of empirical language data enabling linguists to carry out analyses of multilingual individuals, multilingual societies or multilingual communication. The individual contributions are thus concerned with a variety of spoken and written corpora ranging from learner and attrition corpora, language contact corpora and interpreting corpora to comparable and parallel corpora. The overarching aim of the volume is first to take stock of the variety of existing multilingual corpora, documenting possible corpus designs and uses, second to discuss methodological and technological challenges in the creation and analysis of multilingual corpora, and third to provide examples of linguistic analyses that were carried out on the basis of multilingual corpora.

The Routledge Handbook of Corpus Linguistics

Author: Anne O'Keeffe
Publisher: Routledge
Total Pages: 1263
Release: 2010-04-05
Genre: Education
ISBN: 1135153620

The Routledge Handbook of Corpus Linguistics provides a timely overview of a dynamic and rapidly growing area with a widely applied methodology. Through the electronic analysis of large bodies of text, corpus linguistics demonstrates and supports linguistic statements and assumptions. In recent years it has seen an ever-widening application in a variety of fields: computational linguistics, discourse analysis, forensic linguistics, pragmatics and translation studies. Bringing together experts in the key areas of development and change, the handbook is structured around six themes which take the reader through building and designing a corpus to using a corpus to study literature and translation. A comprehensive introduction covers the historical development of the field and its growing influence and application in other areas. Structured around five headings for ease of reference, each contribution includes further reading sections with three to five key texts highlighted and annotated to facilitate further exploration of the topics. The Routledge Handbook of Corpus Linguistics is the ideal resource for advanced undergraduates and postgraduates.

Developing Linguistic Corpora

Author: Martin Wynne
Publisher: Oxbow Books Limited
Total Pages: 100
Release: 2005
Genre: Language Arts & Disciplines
ISBN:

A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.

Building and Using Comparable Corpora

Author: Serge Sharoff
Publisher: Springer Science & Business Media
Total Pages: 333
Release: 2013-12-13
Genre: Computers
ISBN: 3642201288

The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than are translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel texts in the same domain or genre. Nevertheless, this research direction has not produced a single authoritative source suitable for researchers and students coming to the field. This volume provides a reference source, identifying the state of the art in the field as well as future trends. The book is intended for specialists and students in natural language processing, machine translation and computer-assisted translation.