10 Annotation quality for different machine learning tasks

This chapter covers

Adapting annotation quality control methods from labeling to continuous tasks
Managing annotation quality for computer vision tasks
Managing annotation quality for natural language processing tasks
Understanding annotation quality for other tasks

Most machine learning tasks are more complicated than labeling an entire image or document. Imagine that you need to generate subtitles for movies in a creative way. Creating transcriptions of spoken and signed language is a language generation task. If you want to emphasize angry language with bold text, that task is an additional sequence labeling task. If you want to display the transcriptions like the speech bubbles of text in comics, you could use object detection to make sure that the speech bubble comes from the right person, and you could also use semantic segmentation to ensure that the speech bubble is placed over background elements in the scene. You might also want to predict what a given person might rate the film as part of a recommendation system or feed the content into a search engine that can find matches for abstract phrases such as motivational speeches.

10.1 Annotation quality for continuous tasks

10 Annotation quality for different machine learning tasks

This chapter covers

10.1 Annotation quality for continuous tasks

10.1.1 Ground truth for continuous tasks

10.1.2 Agreement for continuous tasks

10.1.3 Subjectivity in continuous tasks

10.1.4 Aggregating continuous judgments to create training data

10.1.5 Machine learning for aggregating continuous tasks to create training data

10.2 Annotation quality for object detection

10.2.1 Ground truth for object detection

10.2.2 Agreement for object detection

10.2.3 Dimensionality and accuracy in object detection

10.2.4 Subjectivity for object detection

10 Annotation quality for different machine learning tasks

This chapter covers

10.1 Annotation quality for continuous tasks

10.1.1 Ground truth for continuous tasks

10.1.2 Agreement for continuous tasks

10.1.3 Subjectivity in continuous tasks

10.1.4 Aggregating continuous judgments to create training data

10.1.5 Machine learning for aggregating continuous tasks to create training data

10.2 Annotation quality for object detection

10.2.1 Ground truth for object detection

10.2.2 Agreement for object detection

10.2.3 Dimensionality and accuracy in object detection

10.2.4 Subjectivity for object detection

Unable to load book!