Papers  /

401.4 Email Preservation at Scale: Preliminary Findings Supporting the Use of Predictive Coding

  1. Joanne Kaczmarek
  2. Brent West

Date created: | Last Updated:


Creating DOI. Please wait...

Create DOI

Category: Communication

Description: Email provides a rich history of an organization yet poses unique challenges to archivists. It is difficult to acquire and process due to sensitive content and diverse topics and formats, which inhibits access and research. Predictive coding alleviates these challenges by using supervised machine learning to: augment appraisal decisions, identify and prioritize sensitive content for review and redaction, and generate descriptive metadata of themes and trends. Following the authors’ previous work which describes the project at its inception, preliminary findings support the use of predictive coding as an effective tool to enable digital preservation at scale. Specific tools, methodologies, and human factors that affect their success are discussed.

License: CC-By Attribution 4.0 International


Loading files...



  • 401. Workflows

    The four papers in Session 401 explore the issues and topics pertaining to the theme of Worflows for digital preservation with examples of good practi...

    Recent Activity

    Loading logs...


Recent Activity

Loading logs...

OSF does not support the use of Internet Explorer. For optimal performance, please switch to another browser.
This website relies on cookies to help provide a better user experience. By clicking Accept or continuing to use the site, you agree. For more information, see our Privacy Policy and information on cookie use.

Start managing your projects on the OSF today.

Free and easy to use, the Open Science Framework supports the entire research lifecycle: planning, execution, reporting, archiving, and discovery.