NAME1: André Miguel Ferreira da Cruz, NR1: 201503776, GRADE1: 19, CONTRIBUTION1: 33.(3)%
NAME2: Edgar Filipe Gomes Carneiro, NR2: 201503776, GRADE2: 19, CONTRIBUTION2: 33.(3)%
NAME3: Miguel Sozinho Ramalho, NR3: 201403027, GRADE3: 19, CONTRIBUTION3: 33.(3)%
This Project is a multipurpose pattern matching tool, enabling you to analyse whole projects for specific code patterns (e.g. use of ternary operators instead of if-else constructs). You may use the patterns defined by us or define your own patterns. User defined patterns may describe a code template to be matched, or an operation over the Spoon AST.
Consult the project's user manual for more information.
java -jar pattern_matcher.jar <filename|foldername> [<userSettings.json>] [DEBUG]
filename/foldername
denotes de target java file or project folder;userSettings.json
denotes the program settings (which patterns should be tested, which operations to report on, number of threads to use, etc.) - provided optionally;DEBUG
denotes the debug flag, meaning whether the run should report execution details - provided optionally.
Our tool receives a source code file/folder for analyzing, and, optionally, a settings.json file indicating which patterns to run, which operations to perform on the results, input/output paths, and number of threads to use. After parsing command line arguments, setting up configurations, and creating WorkerFactories for each type of Worker (pattern analyzer) to be run, we execute a ClassScanner for each class set to be analyzed, in parallel.
A ClassScanner extends Spoon's CtScanner, and, thus, iteratively searches the class file's AST in a depth-first manner. For each node traversed, it triggers the appropriate Worker instances (found by a request to the FactoryManager class, and matched by a TypeFilter). Each worker is run in parallel, and Futures of the task's execution are kept in order to later retrieve the execution report. By traversing the AST, we also construct an equally shaped tree of Nodes, to keep tabs on execution reports, and to be able to retrieve these reports from every part of the program (be it a single method, an inner class, or the global file).
Information regarding how to define patterns and how they work can be found in the project's user manual.
We used the jUnit testing framework for quality assurance. We strove for 100% method coverage on all types of Workers (pattern definitions), as well as the pattern matching engine itself.
The testsuite can be run simply by setting up the project in IntelliJ, mark the src/test folder as containing test files, and clicking "Run all tests".
In general, each group member participated in every aspect of the project's development. The group member Edgar Carneiro worked more closely with the code pattern matching engine, integrating it on the existing project architecture, and providing several configuration options. The group members André Cruz and Miguel Ramalho worked more closely with the ClassScanner-WorkerFactory-Worker architecture, and the Report collection innerworks.
- Felxible architecture that can be extended, in any of the following ways:
- Inclusion of more static patterns
- Use of more information from each pattern worker
- Implementation of new operations on the results of a Pattern, i.e. quartiles, skewness, kurtosis, ...
- Off-the-shelf implementation of many patterns, that serve as examples for expansion to other patterns
- Pattern Report is fetcheable from any node in the AST, as it is done recursively
- Ability to use template matching patterns, that is generaly more accessible for users
- Ability yo use Spoon-based pattern matching functionality for users that want more flexibility
- Configurations file is both expandable and shareable, it can also be shared between users so as to guarantee the same execution environment
- Debug mode for simple troubleshooting
- The dynamic patterns have some limitations, namely only supporting patterns with one root element in its definition and forcing the Users to know the type of variables to be matched, as they need to be declared with that type (for more detailed information see project's user manual)
- The use of a parallel Spoon fork implies we are using a stale version (at least until it is merged) and ought to update our dependencies once it integrates Spoon
See the project's kanban board for live updates on the project's completion.
- Configurations from JSON file (a default exists) uses GSON
- Build a Spoon model from a given source file or folder
- Implement concurrent architecture to trigger workers, with threadpool
- Discover all the classes fed into Spoon so as to trigger ClassScanner instances
- Implement WorkerFactory and create one for each type of worker
- Implement Static and Dynamic Worker Factories
- Static workers have a trigger node and are more flexible as they allow the user to play with Spoon (uses reflection to find Workers defined in the configuration - if configuration is
countIfs
, worker should beW_countIfs
) - Dynamic workers access a Patterns file and try to match them
- Static workers have a trigger node and are more flexible as they allow the user to play with Spoon (uses reflection to find Workers defined in the configuration - if configuration is
- Implement a CtElement iterator to go through the AST in DFS order. Accepted PR in spoon.
- Extend CtElement to be iterable in forLoops. Accepted in PR in spoon.
- Recursive DFS collection of Worker Results
- Creation of a horizontal Report from the triggered workers
- Implementation of Streams to merge WorkerReports with custom functions, for instance: sum, count, avg, ... Using guava
- Inclusion of unmerged Spoon PR that implements template matching - public discussion on how to use it in this issue
- Creation of simple Patterns that match variables only
- Addition of template methods in pattern matching
- Implementation of many static Patterns, such as cyclomatic complexity, inner loop max depth, ...
- Creation of tests for static and dynamic patterns
- Implementation of the "any" template matcher to our project
- Insertion of Debug modes
- Code cleanup and configuration standardization
- Project Delivery :)
We needed to iterate through the nodes in the AST in depth-first manner, and this functionality was not available in Spoon. As such, we decided to implement it and submit a pull request, additionally closing the oldest open Spoon issue at the time.
- Spoon#1980: implementation of a CtIterator class, enabling depth-first-search from any node of the AST;
- Spoon#1986: interface enabling iteration over a CtElement's descendants, using range-based for loops.