Empirical Methods For Exploiting Parallel Texts
Download Empirical Methods For Exploiting Parallel Texts full books in PDF, epub, and Kindle. Read online free Empirical Methods For Exploiting Parallel Texts ebook anywhere anytime directly on your device. Fast Download speed and no annoying ads. We cannot guarantee that every ebooks is available!
Author | : I. Dan Melamed |
Publisher | : MIT Press |
Total Pages | : 224 |
Release | : 2001 |
Genre | : Computers |
ISBN | : 9780262133807 |
This book lays out the theory and the practical techniques for discovering and applying translational equivalence at the lexical level. Parallel texts (bitexts) are a goldmine of linguistic knowledge, because the translation of a text into another language can be viewed as a detailed annotation of what that text means. Knowledge about translational equivalence, which can be gleaned from bitexts, is of central importance for applications such as manual and machine translation, cross-language information retrieval, and corpus linguistics. The availability of bitexts has increased dramatically since the advent of the Web, making their study an exciting new area of research in natural language processing. This book lays out the theory and the practical techniques for discovering and applying translational equivalence at the lexical level. It is a start-to-finish guide to designing and evaluating many translingual applications.
Author | : Alexander Gelbukh |
Publisher | : Springer |
Total Pages | : 664 |
Release | : 2003-08-03 |
Genre | : Language Arts & Disciplines |
ISBN | : 3540364560 |
CICLing 2003 (www.CICLing.org) was the 4th annual Conference on Intelligent Text Processing and Computational Linguistics. It was intended to provide a balanced view of the cutting-edge developments in both the theoretical foundations of computational linguistics and the practice of natural language text processing with its numerous applications. A feature of CICLing conferences is their wide scope that covers nearly all areas of computational linguistics and all aspects of natural language processing applications. The conference is a forum for dialogue between the specialists working in these two areas. This year we were honored by the presence of our keynote speakers Eric Brill (Microsoft Research, USA), Aravind Joshi (U. Pennsylvania, USA), Adam Kilgarriff (Brighton U., UK), and Ted Pedersen (U. Minnesota, USA), who delivered excellent extended lectures and organized vivid discussions. Of 92 submissions received, after careful reviewing 67 were selected for presentation; 43 as full papers and 24 as short papers, by 150 authors from 23 countries: Spain (23 authors), China (20), USA (16), Mexico (13), Japan (12), UK (11), Czech Republic (8), Korea and Sweden (7 each), Canada and Ireland (5 each), Hungary (4), Brazil (3), Belgium, Germany, Italy, Romania, Russia and Tunisia (2 each), Cuba, Denmark, Finland and France (1 each).
Author | : Lynne Bowker |
Publisher | : Routledge |
Total Pages | : 93 |
Release | : 2017-07-05 |
Genre | : Language Arts & Disciplines |
ISBN | : 1351573853 |
A volume of selected, annotated references arranged under specific headings to provide a non-partisan guide to teachers involved in designing courses in translation and/or interpreting.
Author | : Chan Sin-wai |
Publisher | : Routledge |
Total Pages | : 958 |
Release | : 2014-11-13 |
Genre | : Foreign Language Study |
ISBN | : 1317608143 |
The Routledge Encyclopedia of Translation Technology provides a state-of-the art survey of the field of computer-assisted translation. It is the first definitive reference to provide a comprehensive overview of the general, regional and topical aspects of this increasingly significant area of study. The Encyclopedia is divided into three parts: Part One presents general issues in translation technology, such as its history and development, translator training and various aspects of machine translation, including a valuable case study of its teaching at a major university; Part Two discusses national and regional developments in translation technology, offering contributions covering the crucial territories of China, Canada, France, Hong Kong, Japan, South Africa, Taiwan, the Netherlands and Belgium, the United Kingdom and the United States Part Three evaluates specific matters in translation technology, with entries focused on subjects such as alignment, bitext, computational lexicography, corpus, editing, online translation, subtitling and technology and translation management systems. The Routledge Encyclopedia of Translation Technology draws on the expertise of over fifty contributors from around the world and an international panel of consultant editors to provide a selection of articles on the most pertinent topics in the discipline. All the articles are self-contained, extensively cross-referenced, and include useful and up-to-date references and information for further reading. It will be an invaluable reference work for anyone with a professional or academic interest in the subject.
Author | : Jörg Tiedemann |
Publisher | : Morgan & Claypool Publishers |
Total Pages | : 168 |
Release | : 2011 |
Genre | : Computers |
ISBN | : 1608455106 |
This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on various levels of granularity. Bitexts are valuable linguistic resources for many different research fields and practical applications. The most predominant application is machine translation, in particular, statistical machine translation. However, there are various other threads that can be followed which may be supported by the rich linguistic knowledge implicitly stored in parallel resources. Bitexts have been explored in lexicography, word sense disambiguation, terminology extraction, computer-aided language learning and translation studies to name just a few. The book covers the essential tasks that have to be carried out when building parallel corpora starting from the collection of translated documents up to sub-sentential alignments. In particular, it describes various approaches to document alignment, sentence alignment, word alignment and tree structure alignment. It also includes a list of resources and a comprehensive review of the literature on alignment techniques. Table of Contents: Introduction / Basic Concepts and Terminology / Building Parallel Corpora / Sentence Alignment / Word Alignment / Phrase and Tree Alignment / Concluding Remarks
Author | : Nada Lavrač |
Publisher | : Springer |
Total Pages | : 521 |
Release | : 2003-11-18 |
Genre | : Computers |
ISBN | : 3540398570 |
The proceedings of ECML/PKDD2003 are published in two volumes: the P- ceedings of the 14th European Conference on Machine Learning (LNAI 2837) and the Proceedings of the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases (LNAI 2838). The two conferences were held on September 22–26, 2003 in Cavtat, a small tourist town in the vicinity of Dubrovnik, Croatia. As machine learning and knowledge discovery are two highly related ?elds, theco-locationofbothconferencesisbene?cialforbothresearchcommunities.In Cavtat, ECML and PKDD were co-located for the third time in a row, following the successful co-location of the two European conferences in Freiburg (2001) and Helsinki (2002). The co-location of ECML2003 and PKDD2003 resulted in a joint program for the two conferences, including paper presentations, invited talks, tutorials, and workshops. Out of 332 submitted papers, 40 were accepted for publication in the ECML2003proceedings,and40wereacceptedforpublicationinthePKDD2003 proceedings. All the submitted papers were reviewed by three referees. In ad- tion to submitted papers, the conference program consisted of four invited talks, four tutorials, seven workshops, two tutorials combined with a workshop, and a discovery challenge.
Author | : Alfio Gliozzo |
Publisher | : Springer Science & Business Media |
Total Pages | : 138 |
Release | : 2009-07-31 |
Genre | : Language Arts & Disciplines |
ISBN | : 3540681582 |
Semantic fields are lexically coherent – the words they contain co-occur in texts. In this book the authors introduce and define semantic domains, a computational model for lexical semantics inspired by the theory of semantic fields. Semantic domains allow us to exploit domain features for texts, terms and concepts, and they can significantly boost the performance of natural-language processing systems. Semantic domains can be derived from existing lexical resources or can be acquired from corpora in an unsupervised manner. They also have the property of interlinguality, and they can be used to relate terms in different languages in multilingual application scenarios. The authors give a comprehensive explanation of the computational model, with detailed chapters on semantic domains, domain models, and applications of the technique in text categorization, word sense disambiguation, and cross-language text categorization. This book is suitable for researchers and graduate students in computational linguistics.
Author | : Mohand Boughanem |
Publisher | : Springer Science & Business Media |
Total Pages | : 841 |
Release | : 2009-03-27 |
Genre | : Computers |
ISBN | : 3642009573 |
This book constitutes the refereed proceedings of the 30th annual European Conference on Information Retrieval Research, ECIR 2009, held in Toulouse, France in April 2009. The 42 revised full papers and 18 revised short papers presented together with the abstracts of 3 invited lectures and 25 poster papers were carefully reviewed and selected from 188 submissions. The papers are organized in topical sections on retrieval model, collaborative IR / filtering, learning, multimedia - metadata, expert search - advertising, evaluation, opinion detection, web IR, representation, clustering / categorization as well as distributed IR.
Author | : Ruslan Mitkov |
Publisher | : Oxford University Press |
Total Pages | : 808 |
Release | : 2004 |
Genre | : Computers |
ISBN | : 019927634X |
This handbook of computational linguistics, written for academics, graduate students and researchers, provides a state-of-the-art reference to one of the most active and productive fields in linguistics.
Author | : Sin-wai Chan |
Publisher | : Chinese University Press |
Total Pages | : 596 |
Release | : 2009 |
Genre | : Education |
ISBN | : 9789629963552 |
This book is a study of the major events and publications in the world of translation in China and the West from its beginning in the legendary period to 2004, with special references to works published in Chinese and English. It covers a total of 72 countries/places and 1,000 works. All the events and activities in the field have been grouped into 22 areas or categories for easy referencing. This book is a valuable reference tool for all scholars working in the field of translation.