Identifying sustainability in posts from Q&A platforms like Stack Overflow is no easy task. In this project, your main focus will be to explore a list of posts from Stack Overflow. This initial list has already been pre-filtered to ensure the posts include 1- sustainability-related terms and 2- cloud computing-related terms. Your job is to further analyze and filter these posts to determine whether they are genuinely related to sustainability within the context of cloud computing. Next, you will identify which dimensions of sustainability are discussed in each post. By the end of the activity, you will have an annotated dataset of sustainability-related posts from Stack Overflow. This dataset will serve as the foundation for performing the required data extraction and analysis tasks. To build the dataset, you will work in a group under the supervision of a head researcher. Once the dataset is complete, you will work individually on the provided dataset to perform data extraction and analysis. This individual work will vary for each student, with a unique focus.
[1] Lago P, Kocak SA, Crnkovic I, Penzenstadler B. Framing sustainability as a property of software quality. Communications of the ACM. 2015 Sep 28;58(10):70-8. [2] Ahmadisakha S, Andrikopoulos V. Mining for sustainability in cloud architecture among the discussions of software practitioners: building a dataset. In European Conference on Software Architecture 2024 Sep 1 (pp. 150-166). Cham: Springer Nature Switzerland. [3] Ahmadisakha S, Andrikopoulos V. Architecting for sustainability of and in the cloud: A systematic literature review. Information and Software Technology. 2024 Mar 28:107459. [4] Albonico M, Malavolta I, Pinto G, Guzman E, Chinnappan K, Lago P. Mining energy-related practices in robotics software. In2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR) 2021 May 17 (pp. 483-494). IEEE.