Deciphering the Story of Software Development Through Frequent Pattern Mining

Nicolas Bettenburg bio photo By Nicolas Bettenburg

Disclaimer

This work was conducted at Microsoft Research under the supervision of Andrew Begel.

Summary

Software teams record their work progress in task repositories which often require them to encode their activities in a set of edits to field values in a form-based user interface. When others read the tasks, they must decode the schema used to write the activities down. We interviewed four software teams and found out how they used the task repository fields to record their work activities. However, we also found that they had trouble interpreting task revisions that encoded for multiple activities at the same time. To assist engineers in decoding tasks, we developed a scalable method based on frequent pattern mining to identify patterns of frequently co-edited fields that each represent a conceptual work activity. We applied our method to our two years of our interviewee’s task repositories and were able to abstract 83,000 field changes into just 27 patterns that cover 95% of the task revisions. We used the 27 patterns to render the teams’ tasks in web-based English newsfeeds and evaluated them with the product teams. The team agreed with most of our patterns and English interpretations, but outlined a number of improvements that we will incorporate into future work.

Download the Full Paper

The full paper is available for download, if you want to learn more about .

Citation

If you would like to cite the research in your own work, please use the following citation:

@inproceedings{Bettenburg:2013:DSS:2486788.2486960,
  author = "Bettenburg, Nicolas and Begel, Andrew",
  title = "Deciphering the story of software development through frequent pattern mining",
  booktitle = "Proceedings of the 2013 International Conference on Software Engineering",
  series = "ICSE '13",
  year = "2013",
  isbn = "978-1-4673-3076-3",
  location = "San Francisco, CA, USA",
  pages = "1197--1200",
  numpages = "4",
  url = "http://dl.acm.org/citation.cfm?id=2486788.2486960",
  acmid = "2486960",
  publisher = "IEEE Press",
  address = "Piscataway, NJ, USA"
}

Legal Disclaimer

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author’s copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.