<p><b>A reference to answer all your statistical confidentiality questions.</b></p><p>This handbook provides technical guidance on statistical disclosure control and on how to approach the problem of balancing the need to provide users with statistical outputs and the need to protect the confidentia
Synthetic Datasets for Statistical Disclosure Control: Theory and Implementation
✍ Scribed by Jörg Drechsler (auth.)
- Publisher: Springer-Verlag New York
- Year: 2011
- Tongue: English
- Leaves: 159
- Series: Lecture Notes in Statistics 201
- Edition: 1
- Category: Library
✦ Synopsis
The aim of this book is to give the reader a detailed introduction to the different approaches to generating multiply imputed synthetic datasets. It describes all approaches that have been developed so far, provides a brief history of synthetic datasets, and gives useful hints on how to deal with real data problems like nonresponse, skip patterns, or logical constraints.
Each chapter is dedicated to one approach, first describing the general concept and then presenting a detailed application to a real dataset, with useful guidelines on how to implement the theory in practice.
The discussed multiple imputation approaches include imputation for nonresponse, generating fully synthetic datasets, generating partially synthetic datasets, generating synthetic datasets when the original data is subject to nonresponse, and a two-stage imputation approach that helps to better address the omnipresent trade-off between analytical validity and the risk of disclosure.
The book concludes with a glimpse into the future of synthetic datasets, discussing the potential benefits and possible obstacles of the approach and ways to address the concerns of data users and their understandable discomfort with using data that doesn’t consist only of the originally collected values.
The book is intended for researchers and practitioners alike. It gives researchers the state of the art in synthetic data, summarized in one book with full references to all relevant papers on the topic. It is also useful for practitioners at statistical agencies who are considering the synthetic data approach for data dissemination and want to become familiar with the topic.
✦ Table of Contents
Front Matter
Introduction
Background on Multiply Imputed Synthetic Datasets
Background on Multiple Imputation
The IAB Establishment Panel
Multiple Imputation for Nonresponse
Fully Synthetic Datasets
Partially Synthetic Datasets
Multiple Imputation for Nonresponse and Statistical Disclosure Control
A Two-Stage Imputation Procedure to Balance the Risk–Utility Trade-Off
Chances and Obstacles for Multiply Imputed Synthetic Datasets
Back Matter
✦ Subjects
Statistics for Social Science, Behavioral Science, Education, Public Policy, and Law; Statistics for Business/Economics/Mathematical Finance/Insurance; Statistics for Life Sciences, Medicine, Health Sciences
SIMILAR VOLUMES
<p>This book on statistical disclosure control presents the theory, applications and software implementation of the traditional approach to (micro)data anonymization, including data perturbation methods, disclosure risk, data utility, information loss and methods for simulating synthetic data. In
<p>The aim of this book is to discuss various aspects associated with disseminating personal or business data collected in censuses or surveys or copied from administrative sources. The problem is to present the data in such a form that they are useful for statistical research and to provide suffici
<p>Statistical disclosure control is the discipline that deals with producing statistical data that are safe enough to be released to external researchers. This book concentrates on the methodology of the area. It deals with both microdata (individual data) and tabular (aggregated) data. The book at
<p>This guide aims to strip away the mystery surrounding statistical process control and to present its concepts and principles in as simple and straightforward a manner as possible. It is directed primarily at American business managers.</p>