TY - JOUR
T1 - Improving age measurement in low- and middleincome countries through computer vision
T2 - A test in Senegal
AU - Helleringer, Stéphane
AU - You, Chong
AU - Fleury, Laurence
AU - Douillot, Laetitia
AU - Diouf, Insa
AU - Ndiaye, Cheikh Tidiane
AU - Delaunay, Valerie
AU - Vidal, Rene
N1 - Publisher Copyright:
© 2019 Stéphane Helleringer et al.
PY - 2019
Y1 - 2019
N2 - BACKGROUND Age misreporting is pervasive in most low- and middle-income countries (LMIC). It may bias estimates of key demographic indicators, such as those required to track progress towards sustainable development goals. Existing methods to improve age data are often ineffective, cannot be adopted on a large scale, and/or do not permit estimating age over the entire life course. OBJECTIVE We tested a computer vision approach, which produces an age estimate by analyzing a photograph of an individual's face. METHODS We constituted a small training dataset in a population of Senegal covered by a health and demographic surveillance system (HDSS) since 1962. We collected facial images of 353 women aged 18 and above, whose age could be ascertained precisely using HDSS data. We developed automatic age estimation (AAE) systems through machine learning and cross-validation. RESULTS AAE was highly accurate in distinguishing women of reproductive age from women aged 50 and older (area under the curve > 0.95). It allowed estimating age in completed years, with a level of precision comparable to those obtained in European or East Asian populations with training datasets of similar sizes (mean absolute error = 4.62 years). CONCLUSION Computer vision might help improve age ascertainment in demographic datasets collected in LMICs. Further improving the accuracy of this approach will require constituting larger and more complete training datasets in additional LMIC populations. CONTRIBUTION Our work highlights the potential benefits of widely used computer science tools for improving demographic measurement in LMIC settings with deficient data.
AB - BACKGROUND Age misreporting is pervasive in most low- and middle-income countries (LMIC). It may bias estimates of key demographic indicators, such as those required to track progress towards sustainable development goals. Existing methods to improve age data are often ineffective, cannot be adopted on a large scale, and/or do not permit estimating age over the entire life course. OBJECTIVE We tested a computer vision approach, which produces an age estimate by analyzing a photograph of an individual's face. METHODS We constituted a small training dataset in a population of Senegal covered by a health and demographic surveillance system (HDSS) since 1962. We collected facial images of 353 women aged 18 and above, whose age could be ascertained precisely using HDSS data. We developed automatic age estimation (AAE) systems through machine learning and cross-validation. RESULTS AAE was highly accurate in distinguishing women of reproductive age from women aged 50 and older (area under the curve > 0.95). It allowed estimating age in completed years, with a level of precision comparable to those obtained in European or East Asian populations with training datasets of similar sizes (mean absolute error = 4.62 years). CONCLUSION Computer vision might help improve age ascertainment in demographic datasets collected in LMICs. Further improving the accuracy of this approach will require constituting larger and more complete training datasets in additional LMIC populations. CONTRIBUTION Our work highlights the potential benefits of widely used computer science tools for improving demographic measurement in LMIC settings with deficient data.
UR - http://www.scopus.com/inward/record.url?scp=85065042364&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85065042364&partnerID=8YFLogxK
U2 - 10.4054/DEMRES.2019.40.9
DO - 10.4054/DEMRES.2019.40.9
M3 - Article
AN - SCOPUS:85065042364
SN - 1435-9871
VL - 40
SP - 219
EP - 260
JO - Demographic Research
JF - Demographic Research
ER -