All student projects

Topic Modeling In Open Source Artifacts

The goal of this project is to investigate the topics discussed by developers in various artifacts generated by open source projects. This includes discussions between developers in different communication channels such as emails, issues in issue trackers, pull requests, and irc (internet relay chat). This also includes information from comments and commit messages.

This topic can be used for multiple projects, each focussing on a different type of artifact. Possible artifacts: emails, issues/pull requests, irc, commit messages, comments

Relevant Literature (Emails)

[1] A. Guzzi, A. Bacchelli, M. Lanza, M. Pinzger, and A. van Deursen, ‘Communication in open source software development mailing lists’, in 2013 10th Working Conference on Mining Software Repositories (MSR), pp. 277–286, 2013. doi: 10.1109/MSR.2013.6624039.

[2] S. Panichella, G. Bavota, M. D. Penta, G. Canfora, and G. Antoniol, ‘How Developers’ Collaborations Identified from Different Sources Tell Us about Code Changes’, in 2014 IEEE International Conference on Software Maintenance and Evolution, pp. 251–260, 2014. doi: 10.1109/ICSME.2014.47.

Relevant Literature (Issues)

[3] S. Panichella, G. Bavota, M. D. Penta, G. Canfora, and G. Antoniol, ‘How Developers’ Collaborations Identified from Different Sources Tell Us about Code Changes’, in 2014 IEEE International Conference on Software Maintenance and Evolution, pp. 251–260, 2014. doi: 10.1109/ICSME.2014.47.

[4] A. Hindle, C. Bird, T. Zimmermann, and N. Nagappan, ‘Do topics make sense to managers and developers?’, Empirical Software Engineering, vol. 20, no. 2, pp. 479–515, 2015. doi: 10.1007/s10664-014-9312-1.

Relevant Literature (Code Comments)

[5] L. Pascarella and A. Bacchelli, “Classifying Code Comments in Java Open-Source Software Systems,” in 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR), Buenos Aires, Argentina, pp. 227-237, 2017. doi: 10.1109/MSR.2017.63.

Relevant Literature (IRC)

[6] E. Shihab, Z. M. Jiang, and A. E. Hassan, ‘Studying the use of developer IRC meetings in open source projects’, in 2009 IEEE International Conference on Software Maintenance, pp. 147–156, 2009. doi: 10.1109/ICSM.2009.5306333.

Relevant Literature (Commit Messages)

[7] A. Hindle, N. A. Ernst, M. W. Godfrey, and J. Mylopoulos, ‘Automated topic naming to support cross-project analysis of software maintenance activities’, in Proceedings of the 8th Working Conference on Mining Software Repositories, in MSR ’11. New York, NY, USA: Association for Computing Machinery, pp. 163–172, 2011. doi: 10.1145/1985441.1985466.

Supervisor(s)

SEARCH Group • University of Groningen • 2023
Some graphics by Font Awesome, Icons8, and Vectors Market.