Extensive dataset including 49,438 webpage screenshots together with textual and numerical data. The dataset considers all countries worldwide and a broad range of topics such as art, entertainment, economy, business, education, government, news, media, science, and environment, covering different cultural characteristics and varied design preferences.
The dataset is freely available and can be useful for multiple uses:
- Web design and redesign: identification of metrics and guidelines to support the work of beginners and experts.
- Analysis of aesthetics and quality of Web pages.
- Optimization and improvement of the indexing of Web pages.
- Categorization Web: classifiers and recommenders systems,
Web directories, and crawlers.
- Security: detection of illegitimate Web sites (phishing).
- Accessibility without limitations, regardless of knowledge, skills and technology.
- Programming: automatic code generation.
- Machine Learning and Artificial Intelligence projects.
- Challenges of performance of algorithms and competitions of the best Web sites.