Main content
Enron Language Model Personalization Dataset
Date created: 2023-05-31 04:08 AM | Last Updated: 2024-08-07 04:04 PM
Identifier: DOI 10.17605/OSF.IO/45P3J
Category: Data
Description: A dataset for investigating how to adapt a language model to a given user. Data is based on the sent email messages of employees at Enron.
This dataset is designed for comparing different algorithms for adapting a language model to the writing of a particular user. It contains the sent email messages of employees of Enron separated by user and in chronological order.
We based our dataset on the Enron Personalization Validation Set released by Google and used in this CHI 2015 paper by Fowler, et al. on language model personalization. …
Files
Files can now be accessed and managed under the Files tab.
Citation
Recent Activity
Unable to retrieve logs at this time. Please refresh the page or contact support@osf.io if the problem persists.