Jerry Reiter - Duke University
Date: 01/28/2021 (Thu)
Time: 3:30pm- 5:00pm
Location: ZOOM: https://duke.zoom.us/j/99953374350
Organizer: Laura Satterfield
Meeting Schedule: (Not currently open for scheduling. Please contact the seminar organizer listed above.)
3:30pm - Seminar Presentation (3:30pm to 5:00pm)
Additional Comments: Data stewards seeking to provide access to large-scale social science and health data face a difficult challenge. They have to share data in ways that protect privacy and confidentiality, are informative for many analyses and purposes, and are relatively straightforward to use by data analysts. I present an integrated system for data access designed to meet these objectives, in which data stewards generate and release synthetic data, that is, data simulated from statistical models, while also providing users access to a verification server that allows them to assess the quality of inferences from the synthetic data. I present an application of the synthetic data plus verification server approach to longitudinal data on employees of the U.S. federal government. I illustrate the integrated use of synthetic data plus verification via analysis of differentials in pay by race and sex.