“We’re all in this together.” – UNDGC

Overview

As the first year of the second decade of the twentieth century is drawing to a close, the whole country’s daily new cases of the coronavirus are as high as it has ever been. From the New York Times post, the number is now over 171,000, the most rapid growth happening all over the world. There are nine states in this country reported more than twice as many new confirmed cases each day as they did two weeks ago.1

This project aims to visualize the spread of COVID-19 within the United States. The animation begins from Jan.25 and updates every twenty days through Nov.18. The visualization treats each state as an individual polygon and colors each of the polygon based on the numbers of cases. As the color blue gets darker, the numbers increase. The detailed data processing procedures are in the Data Description section. This pandemic affects every individual in the country, and thus it is worth paying more attention to analyze the pattern of the confirmed cases. By knowing which states have the most cases, people in that area could be especially watchful in social distancing and considering more when making their travel plans in this upcoming holidays.

Data Description

The data that was used to create the visualization comes from the GitHub repository of the New York Times2. The data file, originally in CSV3 format, contains the cumulative counts of coronavirus cases and deaths for each state. Each row of data contains the information of the date, the state, the state’s FIP codes, and the cumulative number of coronavirus cases and deaths based on the best reporting up to the moment it has published.

For the data processing, a Python conversion script is created to load and read the original data file4. Since the data visualization needs the number of confirmed cases for every 20 days from late January to late November, the python script is intended to create a list that contains all the dates needed and loop through the original csv file to produce 16 separate CSV data files. For each of the newly created files, the data was visualized on QGIS5 by combining it with a cartographic boundary shape file, downloaded from the website of the U.S. Census Bureau6. Each images was then exported to a PNG7 format and converted to an animated GIF8 by using EZGIF9. Finally, the GIF was converted to the MPEG-410 format by using EZGIF[^giftomp4] again.

Author: Xianglu Peng. Last edited: 2020-11-21.


  1. “Covid-19 Live Updates.” The New York Times. https://www.nytimes.com/live/2020/11/23/world/covid-19-coronavirus↩︎

  2. COVID-19-Data. https://github.com/nytimes/covid-19-data↩︎

  3. CSV. Accessed 2020-11-23, Online: https://docs.fileformat.com/spreadsheet/csv/↩︎

  4. Python. Copyright (C) 2001–2020. Python Software Foundation. Accessed 2020-11-22. Online: https://www.python.org/↩︎

  5. QGIS. Software released through CC-BY-SA by the QGIS Development Team. Accessed 2020-11-22. Online: https://www.qgis.org↩︎

  6. U.S. Census Bureau. Accessed 2020-11-23. Online: https://www.census.gov/en.html↩︎

  7. Portable Network Graphics (PNG). Accessed 2020-11-23. Online: http://www.libpng.org/pub/png/↩︎

  8. GIF. Accessed 2020-11-23. Online: https://docs.fileformat.com/image/gif/↩︎

  9. EZGIF Online Tool. Accessed 2020-11-23. Online: https://ezgif.com/maker↩︎

  10. Moving Picture Experts Group Standard 4 (MPEG-4). Accessed 2020-11-23. Online: https://mpeg.chiariglione.org/↩︎