Social media are full of Covid-19 graphs, each pointing to an “obvious” conclusion that fits the author’s agenda. Unfortunately, even official sources publish analytics that point to incorrect conclusions. Bad data quality has become a matter of life and death.
We will look at the quality problems in official Covid-19 data presentations. These problems are common to all domains, and the solutions are known but not widely adopted. We will describe tools and patterns that data-mature companies use to assess and improve data quality in similar situations. Mastering data quality and data operations is a prerequisite for building sustainable AI solutions, and we will explain how these patterns fit into machine learning product development.