TY - GEN
T1 - CrowdLabs
T2 - 23rd International Conference on Scientific and Statistical Database Management, SSDBM 2011
AU - Mates, Phillip
AU - Santos, Emanuele
AU - Freire, Juliana
AU - Silva, Cláudio T.
PY - 2011
Y1 - 2011
N2 - Managing and understanding the growing volumes of scientific data is one of the most challenging issues scientists face today. As analyses get more complex and large interdisciplinary groups need to work together, knowledge sharing becomes essential to support effective scientific data exploration. While science portals and visualization Web sites have provided a first step towards this goal, by aggregating data from different sources and providing a set of pre-designed analyses and visualizations, they have important limitations. Often, these sites are built manually and are not flexible enough to support the vast heterogeneity of data sources, analysis techniques, data products, and the needs of different user communities. In this paper we describe CrowdLabs, a system that adopts the model used by social Web sites, allowing users to share not only data but also computational pipelines. The shared repository opens up many new opportunities for knowledge sharing and re-use, exposing scientists to tasks that provide examples of sophisticated uses of algorithms they would not have access to otherwise. CrowdLabs combines a set of usable tools and a scalable infrastructure to provide a rich collaborative environment for scientists, taking into account the requirements of computational scientists, such as accessing high-performance computers and manipulating large amounts of data.
AB - Managing and understanding the growing volumes of scientific data is one of the most challenging issues scientists face today. As analyses get more complex and large interdisciplinary groups need to work together, knowledge sharing becomes essential to support effective scientific data exploration. While science portals and visualization Web sites have provided a first step towards this goal, by aggregating data from different sources and providing a set of pre-designed analyses and visualizations, they have important limitations. Often, these sites are built manually and are not flexible enough to support the vast heterogeneity of data sources, analysis techniques, data products, and the needs of different user communities. In this paper we describe CrowdLabs, a system that adopts the model used by social Web sites, allowing users to share not only data but also computational pipelines. The shared repository opens up many new opportunities for knowledge sharing and re-use, exposing scientists to tasks that provide examples of sophisticated uses of algorithms they would not have access to otherwise. CrowdLabs combines a set of usable tools and a scalable infrastructure to provide a rich collaborative environment for scientists, taking into account the requirements of computational scientists, such as accessing high-performance computers and manipulating large amounts of data.
KW - Computational Sciences
KW - Cyberinfrastructure
KW - Visualization
UR - http://www.scopus.com/inward/record.url?scp=79961200591&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79961200591&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-22351-8_38
DO - 10.1007/978-3-642-22351-8_38
M3 - Conference contribution
AN - SCOPUS:79961200591
SN - 9783642223501
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 555
EP - 564
BT - Scientific and Statistical Database Management - 23rd International Conference, SSDBM 2011, Proceedings
Y2 - 20 July 2011 through 22 July 2011
ER -