Main content

Home

Menu

Loading wiki pages...

View
Wiki Version:
## Summary The Data component includes final, synthetic files used for analysis. This material is also included in the 13beaches-data-README.txt file, saved in this component. ## Additional information Please review the “13 Beach Data Management Plan 151217.pdf” file for a summary of the workflow that created the datasets in this directory. The objective of the work flow was to preserve the maximum number of variables common across the 13 cohorts that might be useful for characterizing exposures or outcomes in the beachgoer population. For each of the final datasets, there is a comma-delimited version (.csv) and a Stata version (.dta) of the data (backwards compatible to v12). There is also a Stata-generated codebook for each file (.txt). The Stata files include a significant amount of metadata embeded in them, which we have tried to display to the extent possible in the codebooks for each dataset. The Stata dataset named “13beaches-epi-varlist.dta / .csv” is a summary list of the 294 variables included in the epidemiology dataset, with indicators for whether the variable was present in the NEEAR epidemiology studies, the Avalon, Doheny, and Malibu (“ADM”) studies, and the Mission Bay (“MB”) study. Across the 13 beaches, it is these three groupings that shared roughly identical instruments, with broad overlap across all beaches. ### Base datasets **13beaches-epi** : cleaned combined survey data **13beaches-wq-samples** : cleaned individual water sample results ### Derived datasets used in the primary analysis **13beaches-wq** : daily average water quality measures **13beaches-analysis** : merged version of 13beaches-wq and 13beaches-epi ### Data management scripts The following Stata scripts create the derived datasets from the base datasets (provided, with log files): [9-avg-wq-data.do](https://github.com/ben-arnold/13beaches/blob/master/src/dm/9-avg-wq-data.do) [10-make-analysis-dataset.do](https://github.com/ben-arnold/13beaches/blob/master/src/dm/10-make-analysis-dataset.do)
OSF does not support the use of Internet Explorer. For optimal performance, please switch to another browser.
Accept
This website relies on cookies to help provide a better user experience. By clicking Accept or continuing to use the site, you agree. For more information, see our Privacy Policy and information on cookie use.
Accept
×

Start managing your projects on the OSF today.

Free and easy to use, the Open Science Framework supports the entire research lifecycle: planning, execution, reporting, archiving, and discovery.