TY - GEN
T1 - Detecting spatial clusters of disease infection risk using sparsely sampled social media mobility patterns
AU - Souza, Roberto C.S.N.P.
AU - Assunção, Renato M.
AU - Neill, Daniel B.
AU - Meira, Wagner
N1 - Funding Information:
The authors would like to thank FAPEMIG, CNPq and CAPES for their financial support. This work was also partially funded by projects InWeb (MCT/CNPq 573871/2008-6), MASWeb (FAPEMIG-PRONEX APQ-01400-14), EUBra-BIGSEA (H2020-EU.2.1.1 690116, Brazil/MCTI/RNP GA-000650/04), INCT-Cyber, (CNPq 465714/2014-5), ATMOSPHERE (H2020 777154 and MCTIC/RNP 51119) and by the Google Research Awards for Latin America program.
Funding Information:
The authors would like to thank FAPEMIG, CNPq and CAPES for their financial support. This work was also partially funded by projects InWeb (MCT/CNPq 573871/2008-6), MASWeb (FAPEMIGPRONEX APQ-01400-14), EUBra-BIGSEA (H2020-EU.2.1.1 690116, Brazil/MCTI/RNP GA-000650/04), INCT-Cyber, (CNPq 465714/2014-5), ATMOSPHERE (H2020 777154 and MCTIC/RNP 51119) and by the Google Research Awards for Latin America program.
Publisher Copyright:
© 2019 Copyright held by the owner/author(s).
PY - 2019/11/5
Y1 - 2019/11/5
N2 - Standard spatial cluster detection methods used in public health surveillance assign each disease case to a single location (typically, the patient's home address), aggregate locations to small areas, and monitor the number of cases in each area over time. However, such methods cannot detect clusters of disease resulting from visits to non-residential locations, such as a park or a university campus. Thus we develop two new spatial scan methods, the unconditional and conditional spatial logistic models, to search for spatial clusters of increased infection risk. We use mobility data from two sets of individuals, disease cases and healthy individuals, where each individual is represented by a sparse sample of geographical locations (e.g., from geo-tagged social media data). The methods account for the multiple, varying number of spatial locations observed per individual, either by non-parametric estimation of the odds of being a case, or by matching case and control individuals with similar numbers of observed locations. Applying our methods to synthetic and real-world scenarios, we demonstrate robust performance on detecting spatial clusters of infection risk from mobility data, outperforming competing baselines.
AB - Standard spatial cluster detection methods used in public health surveillance assign each disease case to a single location (typically, the patient's home address), aggregate locations to small areas, and monitor the number of cases in each area over time. However, such methods cannot detect clusters of disease resulting from visits to non-residential locations, such as a park or a university campus. Thus we develop two new spatial scan methods, the unconditional and conditional spatial logistic models, to search for spatial clusters of increased infection risk. We use mobility data from two sets of individuals, disease cases and healthy individuals, where each individual is represented by a sparse sample of geographical locations (e.g., from geo-tagged social media data). The methods account for the multiple, varying number of spatial locations observed per individual, either by non-parametric estimation of the odds of being a case, or by matching case and control individuals with similar numbers of observed locations. Applying our methods to synthetic and real-world scenarios, we demonstrate robust performance on detecting spatial clusters of infection risk from mobility data, outperforming competing baselines.
KW - Social media data
KW - Spatial cluster detection
KW - Spatial scan statistics
UR - http://www.scopus.com/inward/record.url?scp=85076995033&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85076995033&partnerID=8YFLogxK
U2 - 10.1145/3347146.3359369
DO - 10.1145/3347146.3359369
M3 - Conference contribution
AN - SCOPUS:85076995033
T3 - GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems
SP - 359
EP - 368
BT - 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2019
A2 - Banaei-Kashani, Farnoush
A2 - Trajcevski, Goce
A2 - Guting, Ralf Hartmut
A2 - Kulik, Lars
A2 - Newsam, Shawn
PB - Association for Computing Machinery
T2 - 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2019
Y2 - 5 November 2019 through 8 November 2019
ER -