TY - GEN
T1 - A dataset of simplified syntax trees for C#
AU - Proksch, Sebastian
AU - Amann, Sven
AU - Nadi, Sarah
AU - Mezini, Mira
N1 - Publisher Copyright:
© 2016 ACM.
PY - 2016/5/14
Y1 - 2016/5/14
N2 - In this paper, we present a curated collection of 2833 C# solutions taken from Github. We encode the data in a new intermediate representation (IR) that facilitates further analysis by restricting the complexity of the syntax tree and by avoiding implicit information. The dataset is intended as a standardized input for research on recommendation systems for software engineering, but is also useful in many other areas that analyze source code.
AB - In this paper, we present a curated collection of 2833 C# solutions taken from Github. We encode the data in a new intermediate representation (IR) that facilitates further analysis by restricting the complexity of the syntax tree and by avoiding implicit information. The dataset is intended as a standardized input for research on recommendation systems for software engineering, but is also useful in many other areas that analyze source code.
UR - http://www.scopus.com/inward/record.url?scp=84974539753&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84974539753&partnerID=8YFLogxK
U2 - 10.1145/2901739.2903507
DO - 10.1145/2901739.2903507
M3 - Conference contribution
AN - SCOPUS:84974539753
T3 - Proceedings - 13th Working Conference on Mining Software Repositories, MSR 2016
SP - 476
EP - 479
BT - Proceedings - 13th Working Conference on Mining Software Repositories, MSR 2016
PB - Association for Computing Machinery, Inc
T2 - 13th Working Conference on Mining Software Repositories, MSR 2016
Y2 - 14 May 2016 through 15 May 2016
ER -