Skip to content

Commit b7b2b1b

Browse files
authored
Merge pull request #52 from VariantSync/develop
Update Main with New Features and Artifact Camera-Ready Changes for ESEC/FSE'22
2 parents 08ec795 + 2157478 commit b7b2b1b

673 files changed

Lines changed: 13455 additions & 8990 deletions

File tree

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

.github/workflows/maven.yml

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,16 +5,18 @@ name: Java CI with Maven
55

66
on:
77
push:
8-
branches: [ "main" ]
8+
branches: [ "main", "develop" ]
99
pull_request:
10-
branches: [ "main" ]
10+
branches: [ "main", "develop" ]
1111

1212
jobs:
1313
build:
1414

1515
runs-on: ubuntu-latest
1616

1717
steps:
18+
- name: Install Graphviz
19+
run: sudo apt-get install -y graphviz
1820
- uses: actions/checkout@v3
1921
- name: Set up JDK 17
2022
uses: actions/setup-java@v3

INSTALL.md

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -66,6 +66,9 @@ To execute the replication you can run the `execute` script corresponding to you
6666
> #### Linux/Mac (bash):
6767
> `./stop-execution.sh`
6868
69+
You might see warnings or errors reported from SLF4J like `Failed to load class "org.slf4j.impl.StaticLoggerBinder"` which you can safely ignore.
70+
Further troubleshooting advice can be found at the bottom of this file.
71+
6972
The results of the verification will be stored in the [results](results) directory.
7073

7174
### Expected Output of the Verification
@@ -149,3 +152,8 @@ The input file must have the same format as the other dataset files (i.e., repos
149152

150153
`Fix:` If there is a _results_ directory, delete it with elevated permission (e.g., `sudo rm -r results`).
151154
Then, create a new _results_ directory without elevated permissions, or execute `git restore .` to restore the deleted directory.
155+
156+
### Failed to load class "org.slf4j.impl.StaticLoggerBinder"
157+
`Problem:` An operation within the initialization phase of the logger library we use (tinylog) failed.
158+
159+
`Fix:` Please ignore this warning. Tinylog will fall back onto a default implementation (`Defaulting to no-operation (NOP) logger implementation`) and logging will work as expected.

README.md

Lines changed: 14 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -1,12 +1,13 @@
1-
# Classifying Edits to Variability in Source Code
1+
<img padding="10" align="right" src="https://www.acm.org/binaries/content/gallery/acm/publications/artifact-review-v1_1-badges/artifacts_evaluated_reusable_v1_1.png" alt="ACM Artifacts Evaluated Reusable" width="114" height="113"/>
22

33
![Maven](https://github.com/VariantSync/DiffDetective/actions/workflows/maven.yml/badge.svg)
44
[![Documentation](https://img.shields.io/badge/Documentation-Read-purple)][documentation]
55
[![Install](https://img.shields.io/badge/Install-Instructions-blue)](INSTALL.md)
66
[![GitHubPages](https://img.shields.io/badge/GitHub%20Pages-online-blue.svg?style=flat)][website]
77
[![License](https://img.shields.io/badge/License-GNU%20LGPLv3-blue)](LICENSE.LGPL3)
88
[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.7110095.svg)](https://doi.org/10.5281/zenodo.7110095)
9-
[![Status](https://img.shields.io/badge/ESEC%2FFSE'22-Badge%20Application-blue)](STATUS.md)
9+
10+
# Classifying Edits to Variability in Source Code
1011

1112
This is the replication package for our paper _Classifying Edits to Variability in Source Code_ accepted at the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2022).
1213

@@ -18,7 +19,7 @@ This replication package consists of four parts:
1819
4. **Dataset Overview**: We provide an overview of the 44 inspected datasets with updated links to their repositories in the file [docs/datasets.md][dataset].
1920

2021
## 1. DiffDetective
21-
DiffDetective is a java library and command-line tool to parse and classify edits to variability in git histories of preprocessor-based software product lines by creating [variation tree diffs][difftree_class] and operating on them.
22+
DiffDetective is a java library and command-line tool to parse and classify edits to variability in git histories of preprocessor-based software product lines by creating [variation diffs][difftree_class] and operating on them.
2223

2324
We offer a [Docker](https://www.docker.com/) setup to easily __replicate__ the validation performed in our paper.
2425
In the following, we provide a quickstart guide for running the replication.
@@ -53,45 +54,47 @@ To execute the replication you can run the `execute` script corresponding to you
5354
> #### Linux/Mac (bash):
5455
> `./stop-execution.sh`
5556
57+
You might see warnings or errors reported from SLF4J like `Failed to load class "org.slf4j.impl.StaticLoggerBinder"` which you can safely ignore.
58+
Further troubleshooting advice can be found at the bottom of the [Install](INSTALL.md) file.
5659

5760
### 1.3 View the results in the [results][resultsdir] directory
5861
All raw results are stored in the [results][resultsdir] directory.
5962
The aggregated results can be found in the following files.
6063
(Note that the links below only have a target _after_ running the replication or verification.)
6164
- [speed statistics][resultsdir_speed_statistics]: contains information about the total runtime, median runtime, mean runtime, and more.
62-
- [classification results][resultsdir_classification_results]: contains information about how often each pattern was found, and more.
65+
- [classification results][resultsdir_classification_results]: contains information about how often each class was found, and more.
6366

6467
Moreover, the results comprise the (LaTeX) tables that are part of our paper and appendix.
6568

6669
### Documentation
6770

6871
DiffDetective is documented with javadoc. The documentation can be accessed on this [website][documentation]. Notable classes of our library are:
69-
- [DiffTree](https://variantsync.github.io/DiffDetective/docs/javadoc/org/variantsync/diffdetective/diff/difftree/DiffTree.html) and [DiffNode](https://variantsync.github.io/DiffDetective/docs/javadoc/org/variantsync/diffdetective/diff/difftree/DiffNode.html) implement variation tree diffs from our paper. A variation tree diff is represented by an instance of the `DiffTree` class. It stores the root node of the diff and offers various methods to parse, traverse, and analyze variation tree diffs. `DiffNode`s represent individual nodes within a variation tree diff.
72+
- [DiffTree](https://variantsync.github.io/DiffDetective/docs/javadoc/org/variantsync/diffdetective/diff/difftree/DiffTree.html) and [DiffNode](https://variantsync.github.io/DiffDetective/docs/javadoc/org/variantsync/diffdetective/diff/difftree/DiffNode.html) implement variation diffs from our paper. A variation diff is represented by an instance of the `DiffTree` class. It stores the root node of the diff and offers various methods to parse, traverse, and analyze variation diffs. `DiffNode`s represent individual nodes within a variation diff.
7073
- [Validation](https://variantsync.github.io/DiffDetective/docs/javadoc/org/variantsync/diffdetective/validation/Validation.html) contains the main method for our validation.
71-
- [ProposedElementaryPatterns](https://variantsync.github.io/DiffDetective/docs/javadoc/org/variantsync/diffdetective/pattern/elementary/proposed/ProposedElementaryPatterns.html) holds the catalog of nine edit patterns we proposed in our paper. It implements the interface [ElementaryPatternCatalogue](https://variantsync.github.io/DiffDetective/docs/javadoc/org/variantsync/diffdetective/pattern/elementary/ElementaryPatternCatalogue.html), which allows to define custom pattern catalogs.
74+
- [ProposedEditClasses](https://variantsync.github.io/DiffDetective/docs/javadoc/org/variantsync/diffdetective/editclass/proposed/ProposedEditClasses.html) holds the catalog of the nine edit classes we proposed in our paper. It implements the interface [EditClassCatalogue](https://variantsync.github.io/DiffDetective/docs/javadoc/org/variantsync/diffdetective/editclass/EditClassCatalogue.html), which allows to define custom edit classifications.
7275
- [BooleanAbstraction](https://variantsync.github.io/DiffDetective/docs/javadoc/org/variantsync/diffdetective/feature/BooleanAbstraction.html) contains data and methods for boolean abstraction of higher-order logic formulas. We use this for macro parsing.
73-
- [GitDiffer](https://variantsync.github.io/DiffDetective/docs/javadoc/org/variantsync/diffdetective/diff/GitDiffer.html) may parse the history of a git repository to variation tree diffs.
76+
- [GitDiffer](https://variantsync.github.io/DiffDetective/docs/javadoc/org/variantsync/diffdetective/diff/GitDiffer.html) may parse the history of a git repository to variation diffs.
7477
- The [datasets](https://variantsync.github.io/DiffDetective/docs/javadoc/org/variantsync/diffdetective/datasets/package-summary.html) package contains various classes for describing and loading datasets.
7578

7679
## 2. Appendix
7780

7881
Our [appendix][appendix] consists of:
7982
1. An extended formalization of our concepts in the [Haskell][haskell] programming language. The corresponding source code is also part of this replication package (see below).
80-
2. The proofs for (a) the completeness of variation tree diffs to represent edits to variation trees, and (b) the completeness and unambiguity of our elementary edit patterns.
81-
3. An inspection of edit patterns from related work to show that existing patterns are either composite patterns built from our elementary patterns or similar to our elementary patterns. The used diffs of these patterns can also be found in [docs/compositepatterns](docs/compositepatterns).
83+
2. The proofs for (a) the completeness of variation diffs to represent edits to variation trees, and (b) the completeness and unambiguity of our edit classes.
84+
3. An inspection of edit patterns from related work to show that existing patterns are either composite patterns built from our edit classes or similar to one of our edit classes. The used diffs of these patterns can also be found in [docs/compositepatterns](docs/compositepatterns).
8285
4. The complete results of our validation for all 44 datasets.
8386

8487
## 3. Haskell Formalization
8588
The extended formalization is a [Haskell][haskell] library in the [`proofs`](proofs) subdirectory.
8689
Since the `proofs` library is its own software project, we provide a separate documentation of requirements and installation instructions within the projects subdirectory.
87-
Instructions for manually installing Stack are given in [proofs/REQUIREMENTS.md](proofs/REQUIREMENTS.md).
90+
Requirements and instructions for setting up the build environment (Stack) are given in [proofs/REQUIREMENTS.md](proofs/REQUIREMENTS.md).
8891
How to build our library and how to run the example is described in the [proofs/INSTALL.md](proofs/INSTALL.md).
8992

9093

9194
## 4. Dataset Overview
9295
### 4.1 Open-Source Repositories
9396
We provide an overview of the used 44 open-source preprocessor-based software product lines in the [docs/datasets.md][dataset] file.
94-
As described in our paper in Section 5.1 this list contains all systems that were studied by Liebig et al., extended by four new subject systems (Busybox, Marlin, LibSSH, Godot).
97+
As described in our paper in Section 5.1, this list contains all systems that were studied by Liebig et al., extended by four new subject systems (Busybox, Marlin, LibSSH, Godot).
9598
We provide updated links for each system's repository.
9699

97100
### 4.2 Forked Repositories for Replication

REQUIREMENTS.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ None
55
## Software Requirements
66

77
We do not require a certain operating system or prepared environment.
8-
We tested our setup on Windows 10, WSL2, and Manjaro.
8+
The setup is tested on Windows 10, WSL2, Manjaro, Ubuntu, and MacOS Monterey.
99

1010
To run DiffDetective, JDK16, and Maven are required.
1111
Dependencies to other packages are documented in the maven build file ([pom.xml](pom.xml)) and are handled automatically by Maven.

STATUS.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@
33
The artifact for the paper _Classifying Edits to Variability in Source Code_ consists of four parts:
44

55
1. **DiffDetective**: For our validation, we built DiffDetective, a java library and command-line tool to classify edits to variability in git histories of preprocessor-based software product lines.
6-
DiffDetective is the main artifact used to replicate the validation for our paper (see Section 5).
6+
DiffDetective is the main artifact used to replicate the validation of our paper (see Section 5).
77
DiffDetective is self-contained in that it does not require or depend on in-depth knowledge on the theoretical foundation of our work.
88
Practitioners and researches are free to ignore the appendix as well as the haskell formalization and may use DiffDetective out-of-the-box.
99
2. **Appendix**: The appendix of our paper is given in PDF format in the file [`appendix.pdf`][ddappendix].
@@ -22,18 +22,18 @@ DiffDetective is designed as a library that offers reusable functionality.
2222
Researchers and practitioners can use our DiffDetective library to build on our theory and results (e.g., for future prototypes to study the evolution of variability in source code).
2323

2424
DiffDetective offers various features, including but not limited to:
25-
parsing variation tree diffs from unix diffs, obtaining variation tree diffs for certain patches and commits, matching elementary edit patterns on variation tree diffs, defining custom classifications, rendering, traversing, and transforming variation tree diffs, various de-/serialization methods, and running analyses for the git histories of C preprocessor-based software product lines. We documented each part of the library and provide a [javadoc website][dddocumentation] within the repository.
25+
parsing variation diffs from unix diffs, obtaining variation diffs for certain patches and commits, classifying edits in variation diffs, defining custom classifications, rendering, traversing, and transforming variation diffs, various de-/serialization methods, and running analyses for the git histories of C preprocessor-based software product lines. We documented each part of the library and provide a [javadoc website][dddocumentation] within the repository.
2626
Moreover, our validation (see _replicability_ above) may also be run on any custom dataset as described in our [README.md][ddreadme].
2727

2828
### **Extended Formal Specification**
2929
The [`proofs`][ddproofs] Haskell project provides an extended formal specification of our theory.
3030
Its main purpose is to document the theory and its extensions to serve as a reference for the proofs in our appendix.
3131
Yet, the project can also be used as a library to reason on variation trees and diffs in Haskell projects.
32-
The library is accompanied by a small demo application that shows an example test case for our proof of completeness by creating a variation tree diff from two variation trees and re-projecting them.
32+
The library is accompanied by a small demo application that shows an example test case for our proof of completeness by creating a variation diff from two variation trees and re-projecting them.
3333
The `proofs` project is described in detail in our appendix.
3434

3535
## Claims
36-
We claim the _Artifacts Available_ badge as we made our artefacts publicly available on [Github][ddgithub] and [Zenodo][ddzenodo] with an [open-source license][ddlicense]. All [44 input datasets][ddforks] are open-source projects and publicly available.
36+
We claim the _Artifacts Available_ badge as we made our artifacts publicly available on [Github][ddgithub] and [Zenodo][ddzenodo] with an [open-source license][ddlicense]. All [44 input datasets][ddforks] are open-source projects and publicly available.
3737

3838
We claim the _Artifacts Evaluated Reusable_ badge as we implemented DiffDetective as a reusable library (see above).
3939
Furthermore, both DiffDetective and our Haskell formalization serve as reference implementations if researchers or practitioners want to reimplement our theory in other programming languages.

0 commit comments

Comments
 (0)