1 Introduction
clusteR is an R package that assists epidemiologists (or data scientists or analysts or…) in local, regional, or state health departments in managing a cluster-sampled cohort survey similar to CDC’s CASPER.
1.1 What clusteR can do
In short, clusteR is built to give epidemiologists a framework to manage and analyze a cluster-sampled cohort survey. clusteR is built to handle most data management tasks so epidemiologists don’t need to rely on a difficult-to- maintain (and nearly impossible to share) set of custom scripts.
clusteR can:
- Given information about your state and county/counties of interest, randomly select U.S. Census blocks for participation and display simple maps.
- Standardize, manage, update, and export a cohort file with key data and status information on your participants.
- Export PDF and CSV lists to contact participants via mail, phone, and email.
- Filter groups by aggregate status of participants, group selected clusters (by proximity), and produce customizable walk lists for door-to-door interviews.
- Produce reports on completion in your cohort.
- Establish a data connection, retrieve data, and standardize it.
- Customizably clean and weight standardized data.
- Produce analytic reports from weighted or unweighted responses.
- Export cohort data, raw or cleaned data, and analytic products.
clusteR cannot:
- Replace a trained epidemiologist.
- Obtain a random sample of participants or addresses in clusters of interest, even when clusteR selects U.S. Census blocks for you.
- Build, maintain, or host a survey platform.
- Build, maintain, or host a dashboard or other web platform.
- Host cohort files or survey data for collaboration.
- Secure cohort files or survey data.