Data and code for the paper "[Regional Personality Assessment through Social Media Language][1]".
The data consists of two files and is available in both CSV and MySQL formats.
- *outcomes_and_controls*: dense table with all non-personality outcomes and controls used throughout the paper
- *big5*: Big 5 language estimates for each county in sparse format
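
Because *big5* is sparse and *outcomes_and_controls* is dense, the two usually need to be reshaped before they can be joined. The following is a minimal sketch of that step in pandas, using small in-memory stand-ins rather than the released files. The column names (`county`, `trait`, `value`, `income`) are assumptions for illustration and may not match the actual headers.

```python
import pandas as pd

# Hypothetical sparse rows: one (county, trait) pair per row.
# Column names are assumptions, not taken from the released files.
sparse = pd.DataFrame({
    "county": ["01001", "01001", "01003"],
    "trait": ["openness", "extraversion", "openness"],
    "value": [0.12, -0.05, 0.30],
})

# Pivot to one row per county and one column per trait.
# (county, trait) pairs absent from the sparse table become NaN.
dense = sparse.pivot(index="county", columns="trait", values="value")

# Hypothetical dense table keyed by the same county identifier.
outcomes = pd.DataFrame({
    "county": ["01001", "01003"],
    "income": [50000, 62000],
})

# Left join keeps every county in the dense table.
merged = outcomes.merge(dense.reset_index(), on="county", how="left")
print(merged)
```

Reading county identifiers as strings (for example `dtype={"county": str}` in `pd.read_csv`) keeps leading zeros intact, which matters if the identifiers are FIPS codes.
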
**Additional Files:**
- *UScounties.shp*: shapefile for calculating spatial regressions and autocorrelations
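
To illustrate what a spatial autocorrelation computes, here is a minimal sketch of Moran's I in NumPy on a toy example. It is not taken from the notebook: the four-region adjacency matrix and the values are made up, and in practice the weights would be derived from county adjacency in the shapefile.

```python
import numpy as np

def morans_i(values, weights):
    """Moran's I for a 1-D array of values and a square spatial weights matrix."""
    values = np.asarray(values, dtype=float)
    weights = np.asarray(weights, dtype=float)
    z = values - values.mean()          # deviations from the mean
    s0 = weights.sum()                  # sum of all weights
    n = values.size
    return (n / s0) * (z @ weights @ z) / (z @ z)

# Toy example (assumption): four regions in a row, each adjacent to its neighbors.
w = np.array([
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
])
x = [1.0, 2.0, 3.0, 4.0]

i_stat = morans_i(x, w)
print(i_stat)
```

Values above the expectation under no spatial autocorrelation, -1/(n-1), indicate that neighboring regions tend to have similar values; values below it indicate that they tend to differ.
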
We have included a Jupyter Python notebook to reproduce the results in Tables 3 and A1: *code/spatial_regressions.ipynb*