My unofficial undergraduate honours thesis was motivated by the controversy surrounding Hillary Clinton's private email server during the 2016 United States presidential election. After the State Department published the emails, WikiLeaks performed a PDF-to-HTML conversion and made the resulting HTML database public and searchable.
What this database lacked was any way to view a broad summary of the data. Here was a massive database with thousands of emails, and the best one could do was search it by keyword! I scraped the entire database and used the resulting data set to build a public web app (https://shiny.math.uwaterloo.ca/sas/clinton/) that lets users explore the data themselves. This work led to a publication in Significance, which outlined the tool and some interesting findings (see https://rss.onlinelibrary.wiley.com/doi/full/10.1111/j.1740-9713.2018.01...).