This repository contains datasets collected for our study "Funny Accents: Exploring Genuine Interest in Internationalized Domain Names".
We currently publish the following datasets:
- lgr.tar.gz: Label Generation Rulesets for a large set of TLDs
- titles.tar.gz: titles of the root web pages of the domains in the Tranco top million list of 29 August 2018
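
Both datasets are distributed as gzip-compressed tar archives. The internal layout of the archives is not described here, so the following is only a minimal sketch for inspecting them before extraction, using Python's standard `tarfile` module. The helper name `list_archive_members` and the `limit` parameter are illustrative assumptions, not part of this repository.

```python
import tarfile


def list_archive_members(path, limit=10):
    """Return the names of up to `limit` members of a .tar.gz archive.

    Hypothetical helper: it makes no assumption about how the files
    inside the archive are named or organized.
    """
    names = []
    with tarfile.open(path, mode="r:gz") as archive:
        for member in archive:
            if len(names) >= limit:
                break
            names.append(member.name)
    return names


if __name__ == "__main__":
    # Assumes the archives were downloaded into the current directory.
    for archive_name in ("lgr.tar.gz", "titles.tar.gz"):
        try:
            print(archive_name, list_archive_members(archive_name))
        except FileNotFoundError:
            print(archive_name, "not found in the current directory")
```

Listing a few members first shows how the data is organized without unpacking the whole archive, which may be useful for the larger titles dataset.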
If you are interested in other data from our study, ...