- 2019.11.14-ElDiarioCorpus: Fourth iteration of corpus; more renaming of files, cleaned some bad OCR.
- 2019.10.14-ElDiarioCorpus: Third iteration of corpus; more renaming of files, filling in empty files, cleaned some bad OCR; still requires file renaming for parts that were automated via webscraping.
- 2019.10.09-ElDiarioCorpus: Second iteration of corpus; files renamed for consistency; folders created for document type.
- 2019.10.07-ElDiarioCorpus: First iteration of corpus building
- Images.zip: Images were optionally collected by the class. Includes artwork, photographs, adverts, and other graphics from *El Diario de la Gente.*