Comparative Study between Two Approaches Using Edit Operations and Code Differences to Detect Past Refactorings

Takayuki OMORI  Katsuhisa MARUYAMA  

Publication
IEICE TRANSACTIONS on Information and Systems   Vol.E101-D   No.3   pp.644-658
Publication Date: 2018/03/01
Online ISSN: 1745-1361
DOI: 10.1587/transinf.2017EDP7160
Type of Manuscript: PAPER
Category: Software Engineering
Keyword: 
software evolution, fine-grained code change, understanding code change, refactoring detection


Summary: 
Understanding which refactoring transformations were performed is in demand in modern software development. Traditionally, researchers have tried to understand code changes using history data derived from version control systems. Those studies point out problems with this traditional approach, such as the entanglement of multiple changes. To alleviate these problems, operation histories recorded in IDE code editors have recently become available as a new source of software evolution data. By replaying such histories, we can investigate past code changes at a fine-grained level. However, prior studies did not provide enough evidence of the effectiveness of this approach for detecting refactoring transformations. This paper describes an experiment in which participants detected refactoring transformations performed by other participants after investigating the code changes with an operation-replay tool and with diff tools. The results show that each approach has its own factors that cause refactoring transformations to be misunderstood or overlooked. In the operation-based approach, two negative factors were observed: divided operations and generated compound operations. In the difference-based approach, all the negative factors resulted from three problems: tangling, shadowing, and out-of-order code changes. This paper also presents seven concrete examples of participants' mistakes in both approaches. These findings give hints for improving existing tools for understanding code changes and detecting refactoring transformations.