Project Purpose

Process, clean, and host a tidy version of UNAIDS’ annual HIV Country Estimates data so analysts can access and use it easily.

Overview

  • Munged the messy spreadsheet UNAIDS posts on its site into a tidy dataset ready for analysis and visualization
  • Added functions for standard graphical views
  • Created a way for public data to be easily hosted and accessed
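
The tidying step in the first bullet amounts to a wide-to-long reshape. Below is a minimal sketch in Python using only the standard library. It is an illustration, not the project's own R code. The column names (`country`, `indicator`, one column per year), the `...` placeholder for missing cells, and the sample values are all assumptions made up for this example; none of them come from the UNAIDS spreadsheet.

```python
def tidy_wide_rows(rows, id_cols=("country", "indicator")):
    """Reshape wide rows (one column per year) into long rows (one row per year)."""
    tidy = []
    for row in rows:
        ids = {col: row[col] for col in id_cols}
        for col, value in row.items():
            if col in id_cols:
                continue
            # Keep blank or placeholder cells as explicit missing values.
            cleaned = None if value in ("", "...") else float(value)
            tidy.append({**ids, "year": int(col), "value": cleaned})
    return tidy


# Hypothetical wide-format input with illustrative values only.
wide = [
    {"country": "A", "indicator": "new_infections", "2022": "100", "2023": "90"},
    {"country": "B", "indicator": "new_infections", "2022": "...", "2023": "40"},
]

long_rows = tidy_wide_rows(wide)
```

Each output row carries one country, one indicator, one year, and one value, which is the shape that plotting and filtering functions generally expect.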

Outcomes

  • Standardized the output of an untidy dataset
  • Added features to the dataset (e.g. PEPFAR affiliation, goal reach, etc.) so analysts do not have to reinvent the wheel
  • Enabled easy access to the data through an R package

Lessons Learned

  • Leveraged GitHub Releases to host the dataset
  • Designed unit tests
  • Worked with stakeholders to understand data needs and explain limitations
  • Learned the importance of hiding developer-only functions from end users
  • Optimized data storage/access as well as function parameters
  • Improved naming conventions for functions
  • Collaborated with UNAIDS to gain access to their database to establish a query
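
Hosting on GitHub Releases means each dataset version is a file attached to a tagged release, reachable at a predictable download URL. Below is a minimal sketch in Python of how such a URL is formed. It is an illustration, not the package's R code, and the owner, repository, tag, and file names are hypothetical placeholders.

```python
from urllib.parse import quote


def release_asset_url(owner, repo, tag, asset):
    """Build the download URL for a file attached to a tagged GitHub release."""
    # Percent-encode each path segment so unusual characters cannot break the URL.
    parts = [quote(part, safe="") for part in (owner, repo, tag, asset)]
    return "https://github.com/{}/{}/releases/download/{}/{}".format(*parts)


# Hypothetical names, for illustration only.
url = release_asset_url("example-org", "example-repo", "v2024.07", "hiv_estimates.csv")
```

Pinning the tag in the URL lets an analysis reproduce results against a specific data release rather than whatever is newest.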

Figures: UNAIDS HIV estimates untidy spreadsheet; tidied data structure; Kenya Epi Trends plot (1990-2024); various releases of the data hosted on GitHub
