TY - GEN
T1 - What do developers know about machine learning
T2 - 16th IEEE/ACM International Conference on Mining Software Repositories, MSR 2019
AU - Bangash, Abdul Ali
AU - Sahar, Hareem
AU - Chowdhury, Shaiful
AU - Wong, Alexander William
AU - Hindle, Abram
AU - Ali, Karim
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/5
Y1 - 2019/5
N2 - Machine learning, a branch of Artificial Intelligence, is now popular in software engineering community and is successfully used for problems like bug prediction, and software development effort estimation. Developers' understanding of machine learning, however, is not clear, and we require investigation to understand what educators should focus on, and how different online programming discussion communities can be more helpful. We conduct a study on Stack Overflow (SO) machine learning related posts using the SOTorrent dataset. We found that some machine learning topics are significantly more discussed than others, and others need more attention. We also found that topic generation with Latent Dirichlet Allocation (LDA) can suggest more appropriate tags that can make a machine learning post more visible and thus can help in receiving immediate feedback from sites like SO.
AB - Machine learning, a branch of Artificial Intelligence, is now popular in software engineering community and is successfully used for problems like bug prediction, and software development effort estimation. Developers' understanding of machine learning, however, is not clear, and we require investigation to understand what educators should focus on, and how different online programming discussion communities can be more helpful. We conduct a study on Stack Overflow (SO) machine learning related posts using the SOTorrent dataset. We found that some machine learning topics are significantly more discussed than others, and others need more attention. We also found that topic generation with Latent Dirichlet Allocation (LDA) can suggest more appropriate tags that can make a machine learning post more visible and thus can help in receiving immediate feedback from sites like SO.
KW - Machine learning
KW - Stackoverflow
KW - Topic modeling
UR - http://www.scopus.com/inward/record.url?scp=85072306720&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85072306720&partnerID=8YFLogxK
U2 - 10.1109/MSR.2019.00052
DO - 10.1109/MSR.2019.00052
M3 - Conference contribution
AN - SCOPUS:85072306720
T3 - IEEE International Working Conference on Mining Software Repositories
SP - 260
EP - 264
BT - Proceedings - 2019 IEEE/ACM 16th International Conference on Mining Software Repositories, MSR 2019
PB - IEEE Computer Society
Y2 - 26 May 2019 through 27 May 2019
ER -