10 Annotation quality for different machine learning tasks

 

This chapter covers

  • Adapting annotation quality control methods from labeling to continuous tasks
  • Managing annotation quality for computer vision tasks
  • Managing annotation quality for natural language processing tasks
  • Understanding annotation quality for other tasks

Most machine learning tasks are more complicated than labeling an entire image or document. Imagine that you need to generate subtitles for movies in a creative way. Creating transcriptions of spoken and signed language is a language generation task. If you want to emphasize angry language with bold text, that task is an additional sequence labeling task. If you want to display the transcriptions like the speech bubbles of text in comics, you could use object detection to make sure that the speech bubble comes from the right person, and you could also use semantic segmentation to ensure that the speech bubble is placed over background elements in the scene. You might also want to predict what a given person might rate the film as part of a recommendation system or feed the content into a search engine that can find matches for abstract phrases such as motivational speeches.

10.1 Annotation quality for continuous tasks

 
 
 

10.1.1 Ground truth for continuous tasks

 
 
 

10.1.2 Agreement for continuous tasks

 
 
 

10.1.3 Subjectivity in continuous tasks

 
 
 
 

10.1.4 Aggregating continuous judgments to create training data

 
 
 

10.1.5 Machine learning for aggregating continuous tasks to create training data

 
 

10.2 Annotation quality for object detection

 
 
 

10.2.1 Ground truth for object detection

 
 
 

10.2.2 Agreement for object detection

 
 
 
 

10.2.3 Dimensionality and accuracy in object detection

 

10.2.4 Subjectivity for object detection

 
 
 
sitemap

Unable to load book!

The book could not be loaded.

(try again in a couple of minutes)

manning.com homepage