Statistical Methods for Annotation Analysis (Synthesis Lectures on Human Language Technologies)


US$24.14
Price when purchased online
Free shipping Free 30-day returns

Sold and shipped by www.sikiyouth.com

Product details

Management number 231975866
Release Date 2026/06/18
List Price US$24.14
Model Number 231975866

Labelling data is one of the most fundamental activities in science. It has underpinned practice, particularly in medicine, for decades, as well as research in corpus linguistics since at least the development of the Brown corpus. With the shift towards Machine Learning in Artificial Intelligence (AI), the creation of datasets for training and evaluating AI systems, also known in AI as corpora, has become a central activity in the field as well.

Early AI datasets were created on an ad-hoc basis to tackle specific problems. As larger and more reusable datasets were created, requiring greater investment, the need arose for a more systematic approach to dataset creation to ensure higher quality. A range of statistical methods was adopted, often but not exclusively from the medical sciences, to ensure that the labels used were not subjective, or to choose among different labels provided by the coders. A wide variety of such methods is now in regular use. This book is meant to provide a survey of the most widely used of these statistical methods supporting annotation practice.

As far as the authors know, this is the first book attempting to cover the two families of methods in wider use. The first family is concerned with the development of labelling schemes and, in particular, with ensuring that sufficient agreement can be observed among the coders who apply them. The second family includes methods developed to analyze the output of coders once the scheme has been agreed upon, particularly, although not exclusively, to identify the most likely label for an item among those provided by the coders.

The focus of this book is primarily on Natural Language Processing, the area of AI devoted to the development of models of language interpretation and production, but many if not most of the methods discussed here are also applicable to other areas of AI or, indeed, to other areas of Data Science.

ISBN10 1636392555
ISBN13 978-1636392554
Language English
Publisher Morgan & Claypool
Item Weight 1.11 pounds
Print length 217 pages
Publication date January 13, 2022
