Using the Data
Estimates for counties, places, census tracts, or ZCTAs can be compared on individual measures. However, these comparisons need to be carefully done and interpreted. First, it is not adequate to simply compare point estimates. The confidence intervals should also be considered, and some are very broad. The smaller the areas are, the broader confidence intervals an estimate has. The confidence intervals are related to the population prevalence of the measure, the sample size of the data source (BRFSS), that area’s population, the modeling process, and the method used for constructing the confidence intervals. It may not be appropriate to draw conclusions when comparing estimates that have very broad confidence intervals. Second, when interpreting differences, we recommend considering differences in population characteristics (e.g., age, race/ethnicity) of the geographic units being compared that may also contribute to any differences observed. Finally, age-adjusted estimates that adjust for potential differences in the age-distribution across geographic units can be used for county- and place-level comparisons. As part of the Compare Counties Report, age-adjusted and crude prevalence estimates are included so users can see how both estimates compare.
- Identify the relevant population group for the measure of interest. While most PLACES measures relate to the adult population (e.g., adults aged ≥ 18 years), some are limited to specific populations (e.g., women aged 21–65 years, adults aged ≥ 65 years). This information can be found in the Measure Definition section.
- Obtain the total Census population count/estimate for each area for the relevant population group. County-level total population estimates can be downloaded from: https://www.census.gov/data/tables/time-series/demo/popest/2020s-counties-detail.html. Place, ZCTA, or tract-level census 2020/2010 population counts by age and sex can be downloaded from https://data.census.gov. Search for P12 and expand the view of all 14 products, then select the table ‘2020: DEC Demographic and Housing Characteristics’ or ‘2010: DEC Summary File 1’. Expand the Geographies filter at left and select specific geographic areas.
- An example of all places in Alabama
- An example for all census tracts in Alabama
- https://data.census.gov/table/DECENNIALDHC2020.P12?q=P12&g=040XX00US01$1400000
- https://data.census.gov/table/DECENNIALSF12010.P12?q=P12&g=040XX00US01$1400000
You will get a message indicating the dataset is too large to display, but a download button is provided for you to download the data.
- For each area (e.g., county, census tract), estimate the number of adults reporting a specific measure (N) by multiplying the relevant Census population counts or estimates (Pop) from step 1 times the specific prevalence estimate (p) and dividing by 100. (N = Pop × p / 100)
- Sum the estimated number of adults calculated in step 3 across all the included areas to generate the aggregate estimate for a specific measure. Divide this by the sum of the total population count/estimate to obtain the aggregate prevalence estimate.
There are a number of methods for small area estimation (SAE). The multilevel regression modeling with poststratification framework, which was used in PLACES and the 500 Cities Project, is one methodology that communities might consider when embarking on generating their own small area estimates. Additional information on the methodology is available on this website. Some communities have already generated their own direct survey estimates or small area estimates, and they are encouraged to use their local estimates as their primary data. However, the estimates from PLACES may provide additional insights into the health issues affecting residents for those communities.
The SAE code used in PLACES was developed specifically for the project outcomes, using the entire BRFSS dataset for all states and Washington, DC, and include variables in the model for state and local levels. The use of the code by Washington, DC and other communities may or may not be appropriate without some modification.
In addition, use of the SAE code assumes that the end user has access to geocoded (in the case of PLACES this was the county) survey data. Restricted BRFSS data, which includes substate geographic identifiers (county) are available through the Research Data Centers (RDCs) by way of a formal data hosting agreement on a case-by-case basis for research purposes. Learn more about the proposal process.
Unfortunately, CDC does not currently have the capacity or resources to respond to all individual requests for technical assistance on the modeling process, modifying the code, or running special data analyses. Requests for such assistance will have to be handled on a case-by-case basis and will depend on existing resources and workload.
We cannot include policy or program intervention effects, which would occur locally, in the modeling process. Therefore, the estimates for local areas are the statistically expected prevalence of the risk factor, health outcome, or preventive service use based on the associations that we observe through the overall model. It is possible that a community may have a program intervention that has a substantial effect, such that the resulting prevalence of a health risk factor (for example) is lower or preventive service is higher than what is statistically predicted by our model. In that case, if a community relies solely on the small area estimates, the effect of that local intervention would be underestimated. Thus, without reliable local information about public health programs, model-based local estimates should not be used to evaluate the effect of local public health programs, policies, or interventions.
The data can be used to:
- Identify the health issues facing a local area or neighborhoods.
- Establish key health objectives.
- Develop and implement effective and targeted prevention activities.
Because these are modeled and not direct estimates, the data should not be used for ranking the overall health of any county, place, census tract, or ZCTA. PLACES does not provide a weighted composite score for the included counties, places, census tracts, or ZCTAs; therefore, the data should not be used to rank the overall health of a local area. However, counties, places, census tracts, or ZCTAs can be compared on individual measures.
The current modeling procedure does not support using the estimates to track changes at the local level over time.
Estimates depend on two main components: 1) the survey responses in the given survey year; and 2) the detailed population distribution within the local area. Because we use the 2010 US Census as the poststratification dataset for estimates at place, census tract and ZCTA levels, we cannot incorporate year-to-year population change in the modeled results. So the assumption for any given point-in-time estimate is that the place, census tract and ZCTA population in that year is the same as it was measured in 2010. For county level estimates, we do use census annual population estimates as the poststratification dataset, but the time is not included in the model as a variable, therefore the data could not be used to access trend for county.
If researchers choose to include adjustments for area-level demographic characteristics in an analysis using model-based PLACES estimates, caution should be taken when interpreting results. The PLACES approach incorporates age, race or ethnicity, sex, education, and poverty to generate the model-based estimates. This approach should be considered when planning, conducting, and interpreting any regression analyses using the model-based PLACES estimates. See Amanda Y. Kong and Xingyou Zhang, “The Use of Small Area Estimates in Place-Based Health Research,” American Journal of Public Health 2020;110(6):829–832, for a more detailed discussion.