Building and Using Comparable Corpora for Multilingual Natural Language Processing | Agenda Bookshop Skip to content
Selected Colleen Hoover Books at €9.99c | In-store & Online
Selected Colleen Hoover Books at €9.99c | In-store & Online
A01=Pierre Zweigenbaum
A01=Reinhard Rapp
A01=Serge Sharoff
Age Group_Uncategorized
Age Group_Uncategorized
Author_Pierre Zweigenbaum
Author_Reinhard Rapp
Author_Serge Sharoff
automatic-update
Category1=Non-Fiction
Category=CFX
Category=UB
Category=UY
Category=UYQ
Category=UYQL
Category=UYQM
COP=Switzerland
Delivery_Pre-order
Language_English
PA=Not yet available
Price_€20 to €50
PS=Active
softlaunch

Building and Using Comparable Corpora for Multilingual Natural Language Processing

This book provides a comprehensive overview of methods to build comparable corpora and of their applications, including machine translation, cross-lingual transfer, and various kinds of multilingual natural language processing. The authors begin with a brief history on the topic followed by a comparison to parallel resources and an explanation of why comparable corpora have become more widely used. In particular, they provide the basis for the multilingual capabilities of pre-trained models, such as BERT or GPT. The book then focuses on building comparable corpora, aligning their sentences to create a database of suitable translations, and using these sentence translations to produce dictionaries and term banks. Then, it is explained how comparable corpora can be used to build machine translation engines and to develop a wide variety of multilingual applications.
See more
Current price €41.79
Original price €43.99
Save 5%
A01=Pierre ZweigenbaumA01=Reinhard RappA01=Serge SharoffAge Group_UncategorizedAuthor_Pierre ZweigenbaumAuthor_Reinhard RappAuthor_Serge Sharoffautomatic-updateCategory1=Non-FictionCategory=CFXCategory=UBCategory=UYCategory=UYQCategory=UYQLCategory=UYQMCOP=SwitzerlandDelivery_Pre-orderLanguage_EnglishPA=Not yet availablePrice_€20 to €50PS=Activesoftlaunch

Will deliver when available. Publication date 14 Sep 2024

Product Details
  • Dimensions: 168 x 240mm
  • Publication Date: 24 Aug 2024
  • Publisher: Springer International Publishing AG
  • Publication City/Country: Switzerland
  • Language: English
  • ISBN13: 9783031313868

About Pierre ZweigenbaumReinhard RappSerge Sharoff

Serge Sharoff Ph.D.  is Professor of Language Technology and Digital Humanities at the Centre for Translation Studies University of Leeds. His research focuses on Natural Language Processing including automated methods for collecting very large corpora from the Web their analysis in terms of domains genres or text quality as well as extraction of lexicons and terminology from corpora. The application domains for this kind of research in the Digital Humanities include text annotation information retrieval machine translation and computer-assisted language learning. His research stresses the inherent multilingualism of NLP which implies that tools and resources can be ported across languages by paying attention to the respective linguistic properties.Pierre Zweigenbaum Ph.D. FACMI FIAHSI is a Senior Researcher at the Interdisciplinary Laboratory for Digital Sciences (LISN Orsay France) a laboratory of the French National Center forScientific Research (CNRS) and Université Paris-Saclay where he has led the ILES Natural Language Processing group. Before CNRS he was a researcher at Paris Public Hospitals in an Inserm team. He also was a part-time professor at the National Institute for Oriental Languages and Civilizations.  His research focus is Natural Language Processing with medicine as a main application domain. He has also designed methods to acquire linguistic knowledge automatically from corpora and thesauri to help extend monolingual and bilingual lexicons and terminologies using parallel and comparable corpora.Reinhard Rapp Ph.D. is Professor of Applied Translation Studies at Magdeburg-Stendal University of Applied Sciences and is also affiliated with the University of Mainz. He has conducted EU-funded research projects at the University of Geneva the University of Tarragona the University of Leeds at Aix-Marseille University at the University of Mainz and at the Athena Research Center in Athens. His main research interests are in computational linguistics translation studies and cognitive science. His publications have dealt with unsupervised language learning from text corpora word sense disambiguation text mining thesaurus construction bilingual dictionary induction from parallel and comparable corpora and with statistical and neural machine translation. 

Customer Reviews

Be the first to write a review
0%
(0)
0%
(0)
0%
(0)
0%
(0)
0%
(0)
We use cookies to ensure that we give you the best experience on our website. If you continue we'll assume that you are understand this. Learn more
Accept