%0 Report %D 2016 %T NCRN Meeting Spring 2016: Developing job linkages for the Health and Retirement Study %A Mccue, Kristin %A Abowd, John %A Levenstein, Margaret %A Patki, Dhiren %A Rodgers, Ann %A Shapiro, Matthew %A Wasi, Nada %X NCRN Meeting Spring 2016: Developing job linkages for the Health and Retirement Study McCue, Kristin; Abowd, John; Levenstein, Margaret; Patki, Dhiren; Rodgers, Ann; Shapiro, Matthew; Wasi, Nada This paper documents work using probabilistic record linkage to create a crosswalk between jobs reported in the Health and Retirement Study (HRS) and the list of workplaces on Census Bureau’s Business Register. Matching job records provides an opportunity to join variables that occur uniquely in separate datasets, to validate responses, and to develop missing data imputation models. Identifying the respondent’s workplace (“establishment”) is valuable for HRS because it allows researchers to incorporate the effects of particular social, economic, and geospatial work environments in studies of respondent health and retirement behavior. The linkage makes use of name and address standardizing techniques tailored to business data that were recently developed in a collaboration between researchers at Census, Cornell, and the University of Michigan. The matching protocol makes no use of the identity of the HRS respondent and strictly protects the confidentiality of information about the respondent’s employer. The paper first describes the clerical review process used to create a set of human-reviewed candidate pairs, and use of that set to train matching models. It then describes and compares several linking strategies that make use of employer name, address, and phone number. Finally it discusses alternative ways of incorporating information on match uncertainty into estimates based on the linked data, and illustrates their use with a preliminary sample of matched HRS jobs. Presented at the NCRN Meeting Spring 2016 in Washington DC on May 9-10, 2016; see http://www.ncrn.info/event/ncrn-spring-2016-meeting %I University of Michigan %G eng %U http://hdl.handle.net/1813/43895 %9 Preprint %0 Journal Article %J The Stata Journal %D 2015 %T Record Linkage using STATA: Pre-processing, Linking and Reviewing Utilities %A Wasi, Nada %A Flaaen, Aaron %X In this article, we describe Stata utilities that facilitate probabilistic record linkage—the technique typically used for merging two datasets with no common record identifier. While the preprocessing tools are developed specifically for linking two company databases, the other tools can be used for many different types of linkage. Specifically, the stnd_compname and stnd_address commands parse and standardize company names and addresses to improve the match quality when linking. The reclink2 command is a generalized version of Blasnik's reclink (2010, Statistical Software Components S456876, Department of Economics, Boston College) that allows for many-to-one matching. Finally, clrevmatch is an interactive tool that allows the user to review matched results in an efficient and seamless manner. Rather than exporting results to another file format (for example, Excel), inputting clerical reviews, and importing back into Stata, one can use the clrevmatch tool to conduct all of these steps within Stata. This helps improve the speed and flexibility of matching, which often involves multiple runs. %B The Stata Journal %V 15 %P 1-15 %G eng %U http://www.stata-journal.com/article.html?article=dm0082 %N 3 %0 Report %D 2014 %T NCRN Meeting Spring 2014: Summer Working Group for Employer List Linking (SWELL) %A Gathright, Graton %A Kutzbach, Mark %A Mccue, Kristin %A McEntarfer, Erika %A Monti, Holly %A Trageser, Kelly %A Vilhuber, Lars %A Wasi, Nada %A Wignall, Christopher %X NCRN Meeting Spring 2014: Summer Working Group for Employer List Linking (SWELL) Gathright, Graton; Kutzbach, Mark; Mccue, Kristin; McEntarfer, Erika; Monti, Holly; Trageser, Kelly; Vilhuber, Lars; Wasi, Nada; Wignall, Christopher Presentation for NCRN Spring 2014 meeting %I NCRN Coordinating Office %G eng %U http://hdl.handle.net/1813/36396 %9 Preprint