Empirical Methods for Exploiting Parallel Texts

Author: I. Dan Melamed
Publisher: MIT Press
Total Pages: 224
Release: 2001
Genre: Computers
ISBN: 9780262133807

This book lays out the theory and the practical techniques for discovering and applying translational equivalence at the lexical level. Parallel texts (bitexts) are a goldmine of linguistic knowledge, because the translation of a text into another language can be viewed as a detailed annotation of what that text means. Knowledge about translational equivalence, which can be gleaned from bitexts, is of central importance for applications such as manual and machine translation, cross-language information retrieval, and corpus linguistics. The availability of bitexts has increased dramatically since the advent of the Web, making their study an exciting new area of research in natural language processing. The book is a start-to-finish guide to designing and evaluating many translingual applications.

Computational Linguistics and Intelligent Text Processing

Author: Alexander Gelbukh
Publisher: Springer
Total Pages: 664
Release: 2003-08-03
Genre: Language Arts & Disciplines
ISBN: 3540364560

CICLing 2003 (www.CICLing.org) was the 4th annual Conference on Intelligent Text Processing and Computational Linguistics. It was intended to provide a balanced view of the cutting-edge developments in both the theoretical foundations of computational linguistics and the practice of natural language text processing with its numerous applications. A feature of CICLing conferences is their wide scope that covers nearly all areas of computational linguistics and all aspects of natural language processing applications. The conference is a forum for dialogue between the specialists working in these two areas. This year we were honored by the presence of our keynote speakers Eric Brill (Microsoft Research, USA), Aravind Joshi (U. Pennsylvania, USA), Adam Kilgarriff (Brighton U., UK), and Ted Pedersen (U. Minnesota, USA), who delivered excellent extended lectures and organized lively discussions. Of the 92 submissions received, 67 were selected for presentation after careful review (43 as full papers and 24 as short papers), by 150 authors from 23 countries: Spain (23 authors), China (20), USA (16), Mexico (13), Japan (12), UK (11), Czech Republic (8), Korea and Sweden (7 each), Canada and Ireland (5 each), Hungary (4), Brazil (3), Belgium, Germany, Italy, Romania, Russia and Tunisia (2 each), Cuba, Denmark, Finland and France (1 each).

Bibliography of Translation Studies: 2001

Author: Lynne Bowker
Publisher: Routledge
Total Pages: 93
Release: 2017-07-05
Genre: Language Arts & Disciplines
ISBN: 1351573853

A volume of selected, annotated references arranged under specific headings to provide a non-partisan guide to teachers involved in designing courses in translation and/or interpreting.

Routledge Encyclopedia of Translation Technology

Author: Chan Sin-wai
Publisher: Routledge
Total Pages: 958
Release: 2014-11-13
Genre: Foreign Language Study
ISBN: 1317608143

The Routledge Encyclopedia of Translation Technology provides a state-of-the-art survey of the field of computer-assisted translation. It is the first definitive reference to provide a comprehensive overview of the general, regional and topical aspects of this increasingly significant area of study. The Encyclopedia is divided into three parts: Part One presents general issues in translation technology, such as its history and development, translator training and various aspects of machine translation, including a valuable case study of its teaching at a major university; Part Two discusses national and regional developments in translation technology, offering contributions covering the crucial territories of China, Canada, France, Hong Kong, Japan, South Africa, Taiwan, the Netherlands and Belgium, the United Kingdom and the United States; Part Three evaluates specific matters in translation technology, with entries focused on subjects such as alignment, bitext, computational lexicography, corpus, editing, online translation, subtitling and technology and translation management systems. The Routledge Encyclopedia of Translation Technology draws on the expertise of over fifty contributors from around the world and an international panel of consultant editors to provide a selection of articles on the most pertinent topics in the discipline. All the articles are self-contained, extensively cross-referenced, and include useful and up-to-date references and information for further reading. It will be an invaluable reference work for anyone with a professional or academic interest in the subject.

Bitext Alignment

Author: Jörg Tiedemann
Publisher: Morgan & Claypool Publishers
Total Pages: 168
Release: 2011
Genre: Computers
ISBN: 1608455106

This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on various levels of granularity. Bitexts are valuable linguistic resources for many different research fields and practical applications. The predominant application is machine translation, in particular statistical machine translation. However, various other lines of work can also draw on the rich linguistic knowledge implicitly stored in parallel resources. Bitexts have been explored in lexicography, word sense disambiguation, terminology extraction, computer-aided language learning and translation studies, to name just a few. The book covers the essential tasks that have to be carried out when building parallel corpora, from the collection of translated documents up to sub-sentential alignments. In particular, it describes various approaches to document alignment, sentence alignment, word alignment and tree structure alignment. It also includes a list of resources and a comprehensive review of the literature on alignment techniques. Table of Contents: Introduction / Basic Concepts and Terminology / Building Parallel Corpora / Sentence Alignment / Word Alignment / Phrase and Tree Alignment / Concluding Remarks

Machine Learning: ECML 2003

Author: Nada Lavrač
Publisher: Springer
Total Pages: 521
Release: 2003-11-18
Genre: Computers
ISBN: 3540398570

The proceedings of ECML/PKDD2003 are published in two volumes: the Proceedings of the 14th European Conference on Machine Learning (LNAI 2837) and the Proceedings of the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases (LNAI 2838). The two conferences were held on September 22–26, 2003 in Cavtat, a small tourist town in the vicinity of Dubrovnik, Croatia. As machine learning and knowledge discovery are two highly related fields, the co-location of both conferences is beneficial for both research communities. In Cavtat, ECML and PKDD were co-located for the third time in a row, following the successful co-location of the two European conferences in Freiburg (2001) and Helsinki (2002). The co-location of ECML2003 and PKDD2003 resulted in a joint program for the two conferences, including paper presentations, invited talks, tutorials, and workshops. Out of 332 submitted papers, 40 were accepted for publication in the ECML2003 proceedings, and 40 were accepted for publication in the PKDD2003 proceedings. All the submitted papers were reviewed by three referees. In addition to submitted papers, the conference program consisted of four invited talks, four tutorials, seven workshops, two tutorials combined with a workshop, and a discovery challenge.

Semantic Domains in Computational Linguistics

Author: Alfio Gliozzo
Publisher: Springer Science & Business Media
Total Pages: 138
Release: 2009-07-31
Genre: Language Arts & Disciplines
ISBN: 3540681582

Semantic fields are lexically coherent – the words they contain co-occur in texts. In this book the authors introduce and define semantic domains, a computational model for lexical semantics inspired by the theory of semantic fields. Semantic domains allow us to exploit domain features for texts, terms and concepts, and they can significantly boost the performance of natural-language processing systems. Semantic domains can be derived from existing lexical resources or can be acquired from corpora in an unsupervised manner. They also have the property of interlinguality, and they can be used to relate terms in different languages in multilingual application scenarios. The authors give a comprehensive explanation of the computational model, with detailed chapters on semantic domains, domain models, and applications of the technique in text categorization, word sense disambiguation, and cross-language text categorization. This book is suitable for researchers and graduate students in computational linguistics.

Advances in Information Retrieval

Author: Mohand Boughanem
Publisher: Springer Science & Business Media
Total Pages: 841
Release: 2009-03-27
Genre: Computers
ISBN: 3642009573

This book constitutes the refereed proceedings of the 31st annual European Conference on Information Retrieval Research, ECIR 2009, held in Toulouse, France in April 2009. The 42 revised full papers and 18 revised short papers presented together with the abstracts of 3 invited lectures and 25 poster papers were carefully reviewed and selected from 188 submissions. The papers are organized in topical sections on retrieval model, collaborative IR / filtering, learning, multimedia - metadata, expert search - advertising, evaluation, opinion detection, web IR, representation, clustering / categorization as well as distributed IR.

The Oxford Handbook of Computational Linguistics

Author: Ruslan Mitkov
Publisher: Oxford University Press
Total Pages: 808
Release: 2004
Genre: Computers
ISBN: 019927634X

This handbook of computational linguistics, written for academics, graduate students and researchers, provides a state-of-the-art reference to one of the most active and productive fields in linguistics.

A Chronology of Translation in China and the West

Author: Sin-wai Chan
Publisher: Chinese University Press
Total Pages: 596
Release: 2009
Genre: Education
ISBN: 9789629963552

This book is a study of the major events and publications in the world of translation in China and the West from its beginning in the legendary period to 2004, with special references to works published in Chinese and English. It covers a total of 72 countries/places and 1,000 works. All the events and activities in the field have been grouped into 22 areas or categories for easy referencing. This book is a valuable reference tool for all scholars working in the field of translation.