Imputing Missing Social Media Data Stream in Multisensor Studies of Human Behavior
Abstract:
The ubiquitous use of social media enables researchers to obtain self-recorded longitudinal data of individuals in real-time. Because this data can be collected in an inexpensive and unobtrusive way at scale, social media has been adopted as a 'passive sensor' to study human behavior. However, such research is impacted by the lack of homogeneity in the use of social media, and the engineering challenges in obtaining such data. This paper proposes a statistical framework to leverage the potential of social media in sensing studies of human behavior, while navigating the challenges associated with its sparsity. Our framework is situated in a large-scale in-situ study concerning the passive assessment of psychological constructs of 757 information workers wherein of four sensing streams was deployed-bluetooth beacons, wearable, smartphone, and social media. Our framework includes principled feature transformation and machine learning models that pbkp_redict latent social media features from the other passive sensors. We demonstrate the efficacy of this imputation framework via a high correlation of 0.78 between actual and imputed social media features. With the imputed features we test and validate pbkp_redictions on psychological constructs like personality traits and affect. We find that adding the social media data streams, in their imputed form, improves the pbkp_rediction of these measures. We discuss how our framework can be valuable in multimodal sensing studies that aim to gather comprehensive signals about an individual's state or situation.
Año de publicación:
2019
Keywords:
- imputation
- Multisensor
- wellbeing
- Social media
Fuente:
Tipo de documento:
Conference Object
Estado:
Acceso restringido
Áreas de conocimiento:
- Redes sociales
- Análisis de datos
- Análisis de datos
Áreas temáticas:
- Funcionamiento de bibliotecas y archivos
- Psicología diferencial y del desarrollo
- Interacción social