Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Stitching Across Data Seams - WashU

Stitching Across Data Seams - WashU

C07da636fa8428f90daa78b89c90bafb?s=128

Christopher Prener

February 22, 2021
Tweet

Transcript

  1. Stitching Across Data Seams Tracking COVID-19 Disparities in Missouri Christopher

    Prener, Ph.D. Assistant Professor of Sociology, SLU 02.22.2021
  2. Tracking COVID-19 Disparities in Missouri Christopher Prener, Ph.D. Assistant Professor

    of Sociology, SLU 02.22.2021 Stitching Across Data Seams
  3. Acknowledgments Saint Louis University Department of Computer Science Especially Chair

    Michael Goldwasser, PhD and David Ferry, PhD Student Contributors: Alvin Do, Metta Pham, and Eric Quach Washington University Institute for Public Health Center for Health Economics and Policy Especially Karen Joynt Maddox, MD, MPH and Timothy McBride, PhD
  4. AGENDA 1. Preface 2. How can the public find information

    on COVID? 3. What is wrong with this arrangement for COVID data? 4. What does the sewing kit look like for my COVID work? 5. What COIVD disparities exist in Missouri and St. Louis? WASHINGTON UNIVERSITY IN ST. LOUIS | ST. LOUIS, MO | 02.22.2021
  5. ▸ Medical and urban sociologist with an interest in spatial

    and computational methods ▸ Affiliations: • Core Faculty, SLU’s Advanced HEAlth Data Research Institute • SLU’s Institute for Healing Justice and Equity • Northeastern University’s Institute for Health Equity and Social Justice Research 1. PREFACE “HI, I’M CHRIS”
  6. 1. PREFACE INTELLECTUAL TRAJECTORY

  7. How can the public find information on COVID? ?

  8. Everybody's got a dashboard these days…

  9. NEWS SITES

  10. MISSOURI DHSS

  11. ST. LOUIS COUNTY PUBLIC HEALTH

  12. What is wrong with this arrangement for COVID data? ?

  13. DIFFERENT TOOLS Dashboards are being powered by a number of

    different commercial tools, including ESRI, Microsoft, Tableau, and in-house solutions.
  14. DIFFERENT TOOLS Dashboards are being powered by a number of

    different commercial tools, including ESRI, Microsoft, Tableau, and in-house solutions.
  15. DIFFERENT TOOLS Dashboards are being powered by a number of

    different commercial tools, including ESRI, Microsoft, Tableau, and in-house solutions.
  16. DIFFERENT TOOLS Dashboards are being powered by a number of

    different commercial tools, including ESRI, Microsoft, Tableau, and in-house solutions.
  17. DASHBOARDS ≠ OPEN DATA Few dashboards provide easy access to

    underlying data, though it is there if you know where to look. There is also little to no standardization.
  18. DASHBOARDS ≠ OPEN DATA Few dashboards provide easy access to

    underlying data, though it is there if you know where to look. There is also little to no standardization.
  19. 3. DATA SEAMS DATA ARE OFTEN POORLY VISUALIZED Basic rules,

    like using per capita rates, are often ignored.
  20. Similar fabric panels with jagged edges that we need to

    stitch together. Data Seams.
  21. What does the sewing kit look like for my COVID

    work? ?
  22. Navigating these seems requires an array of computational tools for

    scraping and standardizing various jurisdictions’ data.
  23. 4. THE SEWING KIT PULLING THE FABRIC TOGETHER New York

    Times COVID-19 Database (via GitHub) County Public Health Zip Code Data (via scrapers+API calls) Missouri COVID Tracking Database State of Missouri and Illinois (via scrapers) CMS Nursing Home Data (via API) Census Bureau (via API)
  24. 4. THE SEWING KIT PULLING THE FABRIC TOGETHER

  25. THE TWEET THAT LAUNCHED… MANY MORE TWEETS

  26. None
  27. None
  28. None
  29. What COVID disparities exist in Missouri and St. Louis? ?

  30. None
  31. None
  32. FOR THE PROBLEM OF THE TWENTIETH CENTURY IS THE PROBLEM

    OF COLOR LINE W.E.B. Du Bois The Souls of Black Folk (1903) Wikimedia Commons
  33. None
  34. None
  35. None
  36. None
  37. None
  38. None
  39. None
  40. None
  41. 6. FINAL THOUGHTS REFLECTIONS ▸ Research, data, and communication are

    not the first priority: ▸ COVID-19 data sources suffer from a lack of standardization, often cannot be accessed. ▸ COVID-19 “dashboards” are all the rage, but there are frequent issues across jurisdictions with how similar metrics are being communicated. ▸ This has increased demand for my COVID tracking project. ▸ Iterating on analyses is not something we always get to do, but it is tremendously gratifying. ▸ Open data and science are also about community and communication.
  42. Slides available via SpeakerDeck Follow on the web: speakerdeck.com/chrisprener/ stitching-across-data-seams-washu

    Raw data, code available via GitHub github.com/slu-openGIS/ MO_HEALTH_Covid_Tracking chris.prener@slu.edu chris-prener.github.io LEARN MORE THANKS FOR COMING! @chrisprener Visualization code available via GitHub github.com/slu-openGIS/ covid_daily_viz slu-opengis.github.io/ covid_daily_viz/ chrisprener.substack.com