Natural Language Annotation for Machine Learning

Name: Natural Language Annotation for Machine Learning
Brand: O'Reilly Media
SKU: 9781449306663
Price: 40.99 EUR
Availability: InStock

James Pustejovsky

In stock with our UK publisher. 14-28 days

Delivery/Collection within 10-20 working days

14-day return policy

Product details

ISBN 9781449306663
Publication Date: 04 Dec 2012
Publisher: O'Reilly Media
Publication City/Country: US
Product Form: Paperback
Language: English

Create your own natural language training corpus for machine learning. This example-driven book walks you through the annotation cycle, from selecting an annotation task and creating the annotation specification to designing the guidelines, creating a "gold standard" corpus, and then beginning the actual data creation with the annotation process. Systems exist for analyzing existing corpora, but making a new corpus can be extremely complex. To help you build a foundation for your own machine learning goals, this easy-to-use guide includes case studies that demonstrate four different annotation tasks in detail. You'll also learn how to use a lightweight software package for annotating texts and adjudicating the annotations. This book is a perfect companion to O'Reilly's Natural Language Processing with Python, which describes how to use existing corpora with the Natural Language Toolkit.

James Pustejovsky teaches and does research in Artificial Intelligence and Computational Linguistics in the Computer Science Department at Brandeis University. His main areas of interest include lexical meaning, computational semantics, temporal and spatial reasoning, and corpus linguistics. He is active in the development of standards for interoperability between language processing applications, and led the creation of the recently adopted ISO standard for time annotation, ISO-TimeML. He is currently heading the development of a standard for annotating spatial information in language. More information on his publications and research activities can be found on his webpage: pusto.com.

Amber Stubbs is a Ph.D. candidate in Computer Science at Brandeis University in the Laboratory for Linguistics and Computation. Her dissertation focuses on creating an annotation methodology to aid in extracting high-level information from natural language files, particularly biomedical texts. Information about her publications and other projects can be found on her website: http://pages.cs.brandeis.edu/~astubbs/.
