A dataset containing software metrics of 1,000 calculation of Breiman's variable importance scores
maven
A data frame with 1,000 rows and 27 variables:
An average physical lines of clone siblings of a clone.
An average comment lines in the methods that contain clone siblings of a clone.
McCabe Cyclomatic complexity of the method that contains the clone.
Number of commits that impact the method containing the clone.
Number of lines added into the method that contains the clone.
Number of source code lines in the method that contains the clone.
Maximum nesting level of control constructs in the method that contains the clone.
Number of commits that introduce new feature and that impact the method containing the clone.
Ratio of CommentLineCount to LineCodeCount.
Ratio of LineCount to CloneLineCount.
Number of tokens in the clone.
Type of clone class to which the clone belongs.
Number of physical lines in the clone.
Number of comment lines in the method that contains the clone.
McCabe Cyclomatic complexity of the method that contains the clone.
Number of distinct developers who modified the method that contains the clone.
Numberical measure of structuredness of the method that contains the clone.
Number of unique methods that call the method containg the clone.
Number of unique methods that are called by the method containing the clone.
Number of commits with a description of fixing bugs and that impact the method containing the clone.
Number of declarative source code lines in the method that contains the clone.
Number of lines in the method that contains the clone.
Number of lines deleted from the method that contains the clone.
Number of commits that introduce new feature and that impact the method containing the clone.
Number of tokens in the clone.
Number of directories that are traversed from the method containing one sibling to the method containing another sibling of the clone.
Number of clone siblings in the clone.