12 Combining smaller controls into a meta-control

 

This chapter covers:

  • Creating simple sub-quality controls for complex attributes
  • Combining sub-quality controls in a meta-quality control
  • Treating attributes with varying references as recurring attributes

Anybody who has ever searched for anything on the web knows Wikipedia, the free on-line encyclopedia. Wikipedia is a tremendous example of the power of the Internet. A large group of people, who don’t really know one another, come together on a single website to try to make available all of knowledge of mankind.

Wikipedia is the flagship project of the Wikimedia foundation. The nonprofit organization has a few other interesting projects; Wiktionary (a multilingual dictionary), Wikiquote (repository of quotes), Wikimedia commons (multimedia content), and Wikidata (knowledge base) amongst others.

Out of these Wikimedia projects, the Wikidata project is especially interesting for data people (as the name suggests). It provides a common source of data that can, and is, used by other Wikimedia projects like Wikipedia. The small informational boxes (infoboxes) you can find on many Wikipedia pages showing essential information can use the data from the Wikidata knowledge base, irrespective of the language.

12.1  TL;DR

12.2  Completeness

12.2.1  What do I mean by metadata?

12.2.2  Example dataset

12.2.3  Filling in the metadata

12.2.4  Other resources

12.3  Monitoring completeness

12.3.1  Checking the metadata

12.3.2  Checking data values

12.3.3  Trying our quality controls out

12.3.4  Combining multiple checks into one

12.3.5  Try the meta-quality control

12.4  Summary