This repository contains materials for Regression Modeling for Linguistic Data (MIT Press, 2023).
Comments, including reports of typos, are welcome (morgan.sonderegger@gmail.com).
Please see license info (below).
Directories
Data folder: datasets analyzed in the book (CSVs)
Code:
- source: source code for each chapter (Rnw/Rmd files); only needed to compile the book.
- R: code files showing all R commands used in each chapter, including for making plots (R files).
  - The code file for each chapter is just all the R code in each Rnw/Rmd file, extracted.
Output: Now that the print and e-book versions are available, the free PDF version has been taken down.
- The online-only appendix to Chapter 10 is here.
Overview
The book introduces applied regression analysis for language scientists, using R and (mainly) tidyverse functionality. It aims to provide both conceptual understanding and practical skills through extensive examples with different kinds of linguistic data. The primary differences from existing texts on quantitative analysis of linguistic data (e.g., Baayen, Winter, Gries, Levshina, K. Johnson) are that we focus on regression analysis and do not start from scratch. The book assumes familiarity with basic statistical analysis (e.g., t-tests, ANOVAs), R, and math, but it aims to be readable regardless.
Chapters:
- Chapter 1: Preliminaries
- Chapter 2: Statistical Inference I
  - Point and interval estimation
  - Hypothesis testing
- Chapter 3: Statistical Inference II
  - Effect size
  - Power
  - Type I/II and M/S errors
  - Pseudoreplication
- Chapter 4: Linear Regression I
  - Simple linear regression
  - Multiple linear regression
  - Interactions I
  - Reporting
- Chapter 5: Linear Regression II
  - Model assumptions and validation
  - Transformations
  - Collinearity
  - Over and under-fitting
  - Model comparison and variable selection
- Chapter 6: Categorical data analysis and logistic regression
  - Categorical data analysis: background
  - Logistic regression
  - Visualization of model predictions
  - Model validation and reporting
- Chapter 7: Practical regression topics
  - Contrast coding
  - Omnibus and post-hoc tests
  - Interactions II
  - Nonlinear effects
- Chapter 8: Linear mixed-effects models
  - Fixed and random effects
  - Crossed random effects, random slopes
  - Hypothesis testing
  - Random-effect correlations
  - Model predictions
  - Model validation and reporting
- Chapter 9: Mixed-effects models 2: logistic regression
  - Mixed-effects logistic regression
  - Model summarization and validation
  - Nonlinear and factor effects for MEMs
  - Variable importance for MEMs
- Chapter 10: Mixed-effects models 3: practical and advanced topics
  - Model convergence
  - Singular models
  - Model selection
  - Prediction/uncertainty for individual levels
- Online appendix to Chapter 10
License
Draft (PDF): CC BY-NC-ND 4.0 license. You are free to share it, but not to modify it or use it for commercial purposes. This license covers all PDF files and all portions of the Rnw files other than R code. (The PDF of the book is no longer on this site, but the license applies if you previously downloaded it.)
Code: The code portions of the Rnw files, as well as the Rmd and HTML files, are under a CC BY-SA 4.0 license: you are free to share and adapt (with some conditions).
Datasets:
- diatones_rmld.csv: CC BY 4.0 license; you are free to share and adapt (with credit given).
- french_cdi_24.csv: dataset derived from Wordbank; CC BY 4.0 license.
- givenness_rmld.csv: see OSF project.
- neutralization_rmld.csv: see OSF project.
- turkish_if0_rmld.csv: CC BY 4.0 license.
- transitions_rmld.csv: see OSF project.
- vot_rmld.csv: CC BY 4.0 license.