Supporting concept location through identifier parsing and ontology extraction

SL Abebe, A Alicante, A Corazza, P Tonella - Journal of Systems and …, 2013 - Elsevier
Journal of Systems and Software, 2013Elsevier
Identifier names play a key role in program understanding and in particular in concept
location. Programmers can easily “parse” identifiers and understand the intended meaning.
This, however, is not trivial for tools that try to exploit the information in the identifiers to
support program understanding. To address this problem, we resort to natural language
analyzers, which parse tokenized identifier names and provide the syntactic relationships
(dependencies) among the terms composing the identifiers. Such relationships are then …
Abstract
Identifier names play a key role in program understanding and in particular in concept location. Programmers can easily “parse” identifiers and understand the intended meaning. This, however, is not trivial for tools that try to exploit the information in the identifiers to support program understanding. To address this problem, we resort to natural language analyzers, which parse tokenized identifier names and provide the syntactic relationships (dependencies) among the terms composing the identifiers. Such relationships are then mapped to semantic relationships.
In this study, we have evaluated the use of off-the-shelf and trained natural language analyzers to parse identifier names, extract an ontology and use it to support concept location. In the evaluation, we assessed whether the concepts taken from the ontology can be used to improve the efficiency of queries used in concept location. We have also investigated if the use of different natural language analyzers has an impact on the ontology extracted and the support it provides to concept location. Results show that using the concepts from the ontology significantly improves the efficiency of concept location queries (e.g., in some cases, an improvement of 127% is observed). The results also indicate that the efficiency of concept location queries is not affected by the differences in the ontologies produced by different analyzers.
Elsevier
以上显示的是最相近的搜索结果。 查看全部搜索结果