There is very little network data missing on respondents to the 1985 General Social Survey. Missing data on relations between discussion patterns pose the greatest problem but the unk.nown relations are strongly associated with relations known to be weak. The association between missing and weak rel
Effects of missing data in social networks
β Scribed by Gueorgi Kossinets
- Publisher
- Elsevier Science
- Year
- 2006
- Tongue
- English
- Weight
- 633 KB
- Volume
- 28
- Category
- Article
- ISSN
- 0378-8733
No coin nor oath required. For personal study only.
β¦ Synopsis
We perform sensitivity analyses to assess the impact of missing data on the structural properties of social networks. The social network is conceived of as being generated by a bipartite graph, in which actors are linked together via multiple interaction contexts or affiliations. We discuss three principal missing data mechanisms: network boundary specification (non-inclusion of actors or affiliations), survey non-response, and censoring by vertex degree (fixed choice design), examining their impact on the scientific collaboration network from the Los Alamos E-print Archive as well as random bipartite graphs. The simulation results show that network boundary specification and fixed choice designs can dramatically alter estimates of network-level statistics. The observed clustering and assortativity coefficients are overestimated via omission of affiliations or fixed choice thereof, and underestimated via actor non-response, which results in inflated measurement error. We also find that social networks with multiple interaction contexts may have certain interesting properties due to the presence of overlapping cliques. In particular, assortativity by degree does not necessarily improve network robustness to random omission of nodes as predicted by current theory.
π SIMILAR VOLUMES
## Abstract Principal components analysis was applied to solar irradiation (energy per unit area integrated over a daily period) in Paraiba State, northβeast Brazil, in order to find homogeneous subregions. The network data set included measurements of 19 solarimetric stations, generated by bimetal
Survey studies of complete social networks often involve non-respondents, whereby certain people within the "boundary" of a network do not complete a sociometric questionnaire-either by their own choice or by the design of the study-yet are still nominated by other respondents as network partners. W