TY - JOUR
T1 - Structured Open Urban Data
T2 - Understanding the Landscape
AU - Barbosa, Luciano
AU - Pham, Kien
AU - Silva, Claudio
AU - Vieira, Marcos R.
AU - Freire, Juliana
N1 - Publisher Copyright:
© Copyright 2014, Mary Ann Liebert, Inc.
PY - 2014/9
Y1 - 2014/9
N2 - A growing number of cities are now making urban data freely available to the public. Besides promoting transparency, these data can have a transformative effect in social science research as well as in how citizens participate in governance. These initiatives, however, are fairly recent and the landscape of open urban data is not well known. In this study, we try to shed some light on this through a detailed study of over 9,000 open data sets from 20 cities in North America. We start by presenting general statistics about the content, size, nature, and popularity of the different data sets, and then examine in more detail structured data sets that contain tabular data. Since a key benefit of having a large number of data sets available is the ability to fuse information, we investigate opportunities for data integration. We also study data quality issues and time-related aspects, namely, recency and change frequency. Our findings are encouraging in that most of the data are structured and published in standard formats that are easy to parse; there is ample opportunity to integrate different data sets; and the volume of data is increasing steadily. But they also uncovered a number of challenges that need to be addressed to enable these data to be fully leveraged. We discuss both our findings and issues involved in using open urban data.
AB - A growing number of cities are now making urban data freely available to the public. Besides promoting transparency, these data can have a transformative effect in social science research as well as in how citizens participate in governance. These initiatives, however, are fairly recent and the landscape of open urban data is not well known. In this study, we try to shed some light on this through a detailed study of over 9,000 open data sets from 20 cities in North America. We start by presenting general statistics about the content, size, nature, and popularity of the different data sets, and then examine in more detail structured data sets that contain tabular data. Since a key benefit of having a large number of data sets available is the ability to fuse information, we investigate opportunities for data integration. We also study data quality issues and time-related aspects, namely, recency and change frequency. Our findings are encouraging in that most of the data are structured and published in standard formats that are easy to parse; there is ample opportunity to integrate different data sets; and the volume of data is increasing steadily. But they also uncovered a number of challenges that need to be addressed to enable these data to be fully leveraged. We discuss both our findings and issues involved in using open urban data.
UR - http://www.scopus.com/inward/record.url?scp=84991810061&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84991810061&partnerID=8YFLogxK
U2 - 10.1089/big.2014.0020
DO - 10.1089/big.2014.0020
M3 - Review article
AN - SCOPUS:84991810061
SN - 2167-6461
VL - 2
SP - 144
EP - 154
JO - Big Data
JF - Big Data
IS - 3
ER -