Comparable Corpora and Computer-assisted Translation

Regular price €172.30
Quantity:
In stock with our UK publisher. 14-28 days
Delivery/Collection within 10-20 working days
14 days return policy Shipping & Delivery
A01=Estelle Maryline Delpech
Author_Estelle Maryline Delpech
belongs
bilingual
cat
Category=CFP
Category=UM
comparable
context
corpora
corpus
eq_bestseller
eq_computing
eq_dictionaries-language-reference
eq_isMigrated=1
eq_isMigrated=2
eq_nobargain
eq_non-fiction
evaluation
field
instance
introduction
issue
ix
leveraging
part
previous
software
translation
translations
translator
usercentered

Product details

  • ISBN 9781848216891
  • Weight: 590g
  • Dimensions: 163 x 241mm
  • Publication Date: 04 Jul 2014
  • Publisher: ISTE Ltd and John Wiley & Sons Inc
  • Publication City/Country: GB
  • Product Form: Hardback
Secure checkout Fast Shipping Easy returns

Computer-assisted translation (CAT) has always used translation memories, which require the translator to have a corpus of previous translations that the CAT software can use to generate bilingual lexicons. This can be problematic when the translator does not have such a corpus, for instance, when the text belongs to an emerging field. To solve this issue, CAT research has looked into the leveraging of comparable corpora, i.e. a set of texts, in two or more languages, which deal with the same topic but are not translations of one another.

This work had two primary objectives. The first is to assess the input of lexicons extracted from comparable corpora in the context of a specialized human translation task. The second objective is to identify bilingual-lexicon-extraction methods which best match the translators' needs, determining the current limits of these techniques and suggesting improvements. The author focuses, in particular, on the identification of fertile translations, the management of multiple morphological structures, and the ranking of candidate translations.

The experiments are carried out on two language pairs (English–French and English–German) and on specialized texts dealing with breast cancer. This research puts significant emphasis on applicability – methodological choices are guided by the needs of the final users. This book is organized in two parts: the first part presents the applicative and scientific context of the research, and the second part is given over to efforts to improve compositional translation.

The research work presented in this book received the PhD Thesis award 2014 from the French association for natural language processing (ATALA).

Estelle Maryline Delpech holds a PhD in Computer Science from the University of Nantes in France, where she specialized in natural language processing and computer-aided translation. She is currently Chief Scientist at Nomao, a web and mobile app search engine company. Her research interests include multilingualism, computational linguistics, information extraction and data integration.

More from this author