Corpora and Cross-linguistic Research

Author: Stig Johansson
Publisher: Rodopi
Total Pages: 400
Release: 1998
Genre: Computers
ISBN: 9789042002913

From the contents: On the role of corpora in cross-linguistic research (Stig Johansson). - Classifying translational correspondences (Martha Thunes). - A translational basis for semantics (Helge Dyvik). - Connectors and sentence openings in English and Swedish (Bengt Altenberg). - Thematic structure in translation between English and Norwegian (Hilde Hasselgård). - Epistemic predicates in contrast (Karin Aijmer).

Corpora and Cross-Linguistic Research

Author:
Publisher: BRILL
Total Pages: 390
Release: 2023-11-27
Genre: Foreign Language Study
ISBN: 900465366X

In recent years there has been increasing interest in the development and use of bilingual and multilingual corpora. As Karin Aijmer writes in this book, 'The contrastive or comparative perspective ... makes it possible to dig deeper and to ask new questions about the relationship between languages with the aim of sharpening our conceptions of cross-linguistic correspondences and adding to our knowledge of the languages compared.' The papers in this volume showcase the great variety of purposes to which bilingual and multilingual corpora can be put. They lend themselves not only to descriptive and applied approaches but also to theory-oriented studies. The range of linguistic phenomena covered by the various approaches is very wide; the papers focus on fields of research like syntax, discourse, semantics, information structure, lexis, and translation studies. The range of languages studied comprises English, Norwegian, Swedish, German, Dutch, and Portuguese. In addition to purely linguistic papers, there are contributions on computer programs developed for the compilation and use of bilingual and multilingual corpora.

Cross-Linguistic Corpora for the Study of Translations

Author: Silvia Hansen-Schirra
Publisher: Walter de Gruyter
Total Pages: 320
Release: 2012-12-06
Genre: Language Arts & Disciplines
ISBN: 3110260328

The book specifies a corpus architecture, including annotation and querying techniques, and its implementation. The corpus architecture is developed for empirical studies of translations, and beyond those for the study of texts which are inter-lingually comparable, particularly texts of similar registers. The compiled corpus, CroCo, is a resource for research and is, with some copyright restrictions, accessible to other research projects. Most of the research was undertaken as part of a DFG project on linguistic properties of translations. Fundamentally, this research project was a corpus-based investigation into the language pair English-German. The long-term goal is a contribution to the study of translation as a contact variety, and beyond this to language comparison and language contact more generally, with the language pair English-German as our object languages. This goal implies a thorough interest in possible specific properties of translations, and beyond this in an empirical translation theory. The methodology developed is not restricted to the traditional exclusively system-based comparison of earlier days, where real-text excerpts or constructed examples are used as mere illustrations of assumptions and claims, but instead implements an empirical research strategy involving structured data (the sub-corpora and their relationships to each other, annotated and aligned on various theoretically motivated levels of representation), the formation of hypotheses and their operationalizations, statistics on the data, critical examinations of their significance, and interpretation against the background of system-based comparisons and other independent sources of explanation for the phenomena observed. Further applications of the resource developed in computational linguistics are outlined and evaluated.

Developmental and Crosslinguistic Perspectives in Learner Corpus Research

Author: Yukio Tono
Publisher: John Benjamins Publishing
Total Pages: 369
Release: 2012-03-28
Genre: Language Arts & Disciplines
ISBN: 9027271720

This volume provides a state-of-the-art overview of current research and developments on the use of learner corpora perceived from developmental and crosslinguistic perspectives. The book is divided into two parts. The eleven contributions of Part I investigate the development of English language skills of young learners across seven countries/regions on the basis of a new corpus resource called the International Corpus of Crosslinguistic Interlanguage (ICCI). Part II contains seven papers devoted to other varieties of learner corpora, especially spoken learner corpora and learner corpora of languages other than English. Presenting original research in corpus linguistics, this book will be of interest to researchers and postgraduates in the fields of learner corpus research and second language acquisition and those who wish to apply corpus methodology in teaching and learning.

Corpora in Language Acquisition Research

Author: Heike Behrens
Publisher: John Benjamins Publishing
Total Pages: 280
Release: 2008
Genre: Language Arts & Disciplines
ISBN: 9789027234766

Corpus research forms the backbone of research on children's language development. Leading researchers in the field present a survey of the history of data collection, different types of data, and the treatment of methodological problems. Morphologically and syntactically parsed corpora allow for the concise exploration of formal phenomena, the quick retrieval of errors, and reliability checks. New probabilistic and connectionist computations investigate how children integrate the multiple sources of information available in the input, and new statistical methods compute rates of acquisition as well as error rates dependent on sample size. Sample analyses show how multi-modal corpora are used to investigate the interaction of discourse and linguistic structure, how cross-linguistic generalizations for acquisition can be formulated and tested, and how individual variation can be explored. Finally, ways in which corpus research interacts with computational linguistics and experimental research are presented.

Parallel Corpora for Contrastive and Translation Studies

Author: Irene Doval
Publisher: John Benjamins Publishing Company
Total Pages: 313
Release: 2019-03-20
Genre: Language Arts & Disciplines
ISBN: 9027262845

This volume assesses the state of the art of parallel corpus research as a whole, reporting on advances both in recent developments of parallel corpora (with some particular references to comparable corpora as well) and in ways of exploiting them for a variety of purposes. The first part of the book is devoted to new roles that parallel corpora can and should assume in translation studies and in contrastive linguistics, to the usefulness and usability of parallel corpora, and to advances in parallel corpus alignment, annotation and retrieval. There follows an up-to-date presentation of a number of parallel corpus projects currently being carried out in Europe, some of them multimodal, with certain chapters illustrating case studies developed on the basis of the corpora at hand. In most of these chapters, attention is paid to specific technical issues of corpus building. The third part of the book reflects on specific applications and on the creation of bilingual resources from parallel corpora. This volume will be welcomed by scholars, postgraduate and PhD students in the fields of contrastive linguistics, translation studies, lexicography, language teaching and learning, machine translation, and natural language processing.

Advances in Corpus Linguistics

Author: Karin Aijmer
Publisher: Rodopi
Total Pages: 430
Release: 2004
Genre: Computers
ISBN: 9789042017412

This book provides an up-to-date survey of current issues and approaches in corpus linguistics in the form of twenty-two recent research articles. The articles cover a wide range of topics illustrating the diversity of research that is characteristic of corpus linguistics today. Central themes are the relationship between theory, intuition and corpus data and the role of corpora in linguistic research. The majority of the articles are empirical studies of specific aspects of English, ranging from lexis and grammar to discourse and pragmatics. Other areas explored are language variation, language change and development, language learning, cross-linguistic comparisons of English and other languages, and the development of linguistic software tools. The contributors to the volume include some of the leading figures in the field such as M.A.K. Halliday, John Sinclair, Geoffrey Leech and Michael Hoey. The theoretical and methodological issues addressed in the volume demonstrate clearly the steady advance of an expanding discipline inspired by an empirical, usage-based approach to the study of language. The volume is essential reading for researchers and students interested in the use of computer corpora in linguistic research.

Corpus Use in Cross-linguistic Research

Author: Marlén Izquierdo
Publisher: John Benjamins Publishing Company
Total Pages: 245
Release: 2023-11-02
Genre: Language Arts & Disciplines
ISBN: 9027249318

Cross-linguistic research is a fruitful field of language inquiry that has benefited enormously from the use of corpora. As sources of linguistic data of various kinds and as tools for language processing, corpora have shaped the development of cross-linguistic research, enabling both language description and practical applications. This volume contains twelve studies that emphasize the usefulness and usability of parallel corpora in accurately exploring the structure and use of seven under-researched languages and language varieties. The first part emphasizes the role of corpus-based descriptive analyses at the lexicogrammatical and discursive levels, as a first step on the way towards concrete applications like translation or language teaching. The second part focuses on the role of parallel-corpus-based language processing techniques and applications that facilitate professional communication. This book will be of interest to scholars in contrastive linguistics, translation studies, discourse analysis, language teaching, and natural language processing.

Information Structure in a Cross-Linguistic Perspective

Author:
Publisher: BRILL
Total Pages: 233
Release: 2016-08-29
Genre: Language Arts & Disciplines
ISBN: 9004334254

The present volume draws together contributions from a number of scholars with an interest in empirical, cross-linguistic description. Most of the papers were first presented at the symposium Information Structure in a Cross-linguistic Perspective held in Oslo in November/December 2000. The descriptions are functionally oriented, and their common focus is how information structure – in a broad sense – can be compared across languages. 'Information structure' has been approached in a variety of ways by the authors, so as to give a broad picture of this fundamental principle of text production, involving the way in which a speaker/writer chooses to present a message in terms of given/new information, focus, cohesion, and point of view. Central to much of the research is the problem of establishing criteria for isolating linguistic constraints on language use from cultural-linguistic conventions in text production. The linguistic comparison includes English, German and/or one of the Scandinavian languages, with sidelights to other languages. Most of the papers are text- or corpus-based, and the ongoing work on parallel corpora in Scandinavia is reflected in several contributions.

Spoken Corpora and Linguistic Studies

Author: Tommaso Raso
Publisher: John Benjamins Publishing Company
Total Pages: 508
Release: 2014-11-14
Genre: Language Arts & Disciplines
ISBN: 9027270031

The authors of this book share a common interest in the following topics: the importance of corpora compilation for the empirical study of human language; the importance of pragmatic categories such as emotion, attitude, illocution and information structure in linguistic theory; and a passionate belief in the central role of prosody for the analysis of speech. The book is organized into four distinct sections: spoken corpora compilation; spoken corpora annotation; prosody; and syntax and information structure. Within this structure the authors present innovative methodologies that focus on the compilation of third-generation spoken corpora and on multilevel spoken corpora annotation and its functions. In addition, a debate is initiated about the reference unit in the study of spoken language via information structure. The book is accompanied by a web site with a rich array of audio/video files, which can be found at DOI: 10.1075/scl.61.media