URL Collection
-------
Because there is no single source maintaining a list of up-to-date government websites, we combined multiple publicly-available datasets and systematically curated them. For each of the G7 countries, we first crawled the list of domains under the Regional/ Continent Name/ Country Name/ Government category on Alexa.
Because the scope and number of websites vary by country, we improved the data quality of the combination of Alexa and public datasets by heuristically restricting domain names to the corresponding country code’s top-level domains (cc-TLD) or second-level domain (e.g., gov, go), assuming that the majority of the collected government domains in that country adhere to a naming convention. As we noticed that the Alexa datasets are often incomplete and mixed with non-government sites (such as politicians’ websites), we also searched for publicly-available datasets online. We found official lists of government websites for the US, the UK, Canada, and Japan, but not the rest of G7. To compensate biases introduced by different sample sizes and government structures, we then selected the top 100 in each country based on their Alexa ranks. The final lists can be accessed at.